BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 024317
         (269 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224079468|ref|XP_002305875.1| predicted protein [Populus trichocarpa]
 gi|222848839|gb|EEE86386.1| predicted protein [Populus trichocarpa]
          Length = 241

 Score =  333 bits (855), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 164/245 (66%), Positives = 194/245 (79%), Gaps = 6/245 (2%)

Query: 27  LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV-VKRL 85
           +LEDGY VTTV+DGH+L INPH+V  R  SS+++VLDSSRS FYTL FP+S+ SV VKRL
Sbjct: 1   MLEDGYMVTTVLDGHKLNINPHAVQLR--SSEIVVLDSSRSVFYTLPFPISQASVMVKRL 58

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
           +G+G  GY DGEPG ARF+KPKSFAVD++GN+YVAD+ NH +RKI+N GVT+   G   +
Sbjct: 59  SGEGKTGYIDGEPGLARFNKPKSFAVDLRGNVYVADQQNHAVRKISNSGVTSTIVGNYSQ 118

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
            GR DGP + A+FS+DFE+ FVP ICALLISDHGNQL+RQI+LKPEDC   SQ  SALGA
Sbjct: 119 TGRQDGPGKTATFSSDFEVLFVPQICALLISDHGNQLLRQIDLKPEDCVIGSQ--SALGA 176

Query: 206 VSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPP-LQHDMEALPNQSGETSTDLLLRH 264
           V  WVL   LSCL+ +VIG   RPY+I HTGRL  P  Q DMEALPNQSGE+STD LLR 
Sbjct: 177 VKFWVLGLALSCLLGIVIGIATRPYVIPHTGRLQTPSFQQDMEALPNQSGESSTDELLRR 236

Query: 265 QKRNC 269
           QKRNC
Sbjct: 237 QKRNC 241


>gi|224065156|ref|XP_002301692.1| predicted protein [Populus trichocarpa]
 gi|222843418|gb|EEE80965.1| predicted protein [Populus trichocarpa]
          Length = 264

 Score =  320 bits (821), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 157/246 (63%), Positives = 190/246 (77%), Gaps = 6/246 (2%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV-VKR 84
           ++LEDGY VTTV+DGH+L +NPH+V  R  SSDL+VLDSS+S FYTL FP+S++ V VKR
Sbjct: 23  IMLEDGYMVTTVMDGHKLNVNPHAVQLR--SSDLVVLDSSKSVFYTLPFPISQDGVMVKR 80

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           L+G G +GY DGEPG ARF+KPKSF VD++GN+YVAD+ NH +RKI++ G+TT   G   
Sbjct: 81  LSGSGDKGYIDGEPGLARFNKPKSFTVDLRGNVYVADQLNHAVRKISSSGMTTTIAGNYS 140

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
           + GR DGP + A+FS DFE+ FVP ICALLISDHGNQL+RQ++LK EDC   SQ  SALG
Sbjct: 141 QIGRQDGPGETATFSTDFEVLFVPQICALLISDHGNQLLRQVDLKQEDCIIGSQ--SALG 198

Query: 205 AVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPP-LQHDMEALPNQSGETSTDLLLR 263
           AV  WVL  VLSCL  L  GF  RPY+I HTGR   P  Q DMEALPNQSGE++T+ LLR
Sbjct: 199 AVKFWVLGLVLSCLFGLATGFAIRPYVIPHTGRRQTPSFQQDMEALPNQSGESNTEELLR 258

Query: 264 HQKRNC 269
            QKRNC
Sbjct: 259 RQKRNC 264


>gi|18395326|ref|NP_564209.1| NHL domain-containing protein [Arabidopsis thaliana]
 gi|15724248|gb|AAL06517.1|AF412064_1 At1g23890/T23E23_13 [Arabidopsis thaliana]
 gi|16974539|gb|AAL31179.1| At1g23890/T23E23_13 [Arabidopsis thaliana]
 gi|110742589|dbj|BAE99208.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192326|gb|AEE30447.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 261

 Score =  317 bits (813), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 159/260 (61%), Positives = 202/260 (77%), Gaps = 5/260 (1%)

Query: 7   YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
           +SS+++ L  ++F+LVS  ++LE+GY VTTV+DGH+  +NP+++   PGSS+LIVLDSS 
Sbjct: 5   FSSVSLFLF-VVFNLVSGKIVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63

Query: 67  SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           S FYT SFPLS +SV+ R AGDG  G+ DG+ G++RF KP+ FAVD KGN+YVADKSN  
Sbjct: 64  STFYTTSFPLSVDSVINRFAGDGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKA 123

Query: 127 IRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           IRKI++ G VTTIAGG SK  G  DGPAQNA+FS+DFE+TFVP  C LL+SDHGN++IRQ
Sbjct: 124 IRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQ 183

Query: 186 INLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPL-QH 244
           INLK EDC ++S S   LG  S+W +  VLSC++ + IGF  RPY+IRHTGR +PPL  H
Sbjct: 184 INLKEEDCLENSHSN--LGTYSLWSIGIVLSCILGVAIGFAVRPYVIRHTGRSEPPLVHH 241

Query: 245 DMEALPNQSGETSTDLLLRH 264
           DMEA PNQ G TS+DLLL H
Sbjct: 242 DMEASPNQIGGTSSDLLLLH 261


>gi|147798774|emb|CAN69795.1| hypothetical protein VITISV_022545 [Vitis vinifera]
          Length = 459

 Score =  316 bits (809), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 163/275 (59%), Positives = 211/275 (76%), Gaps = 12/275 (4%)

Query: 1   MASHFVYSSLAITLLSLLFSLV----SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           MA++    SL++ ++  +F++     S+ L+LEDGYTV TV DG++LEINPHS++ R GS
Sbjct: 1   MATYL--QSLSLFIVFTIFTIAAIHGSADLVLEDGYTVRTVFDGNKLEINPHSILPRYGS 58

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           SD I+LDSS+S FYT+  PLS+ES + RL+G    G+SDG+  SA F KP+SFAVD+KGN
Sbjct: 59  SDFIILDSSKSVFYTVXSPLSQESEINRLSGSSA-GFSDGDSASAMFSKPRSFAVDLKGN 117

Query: 117 IYVADKSNHVIRKITNLGVTT-IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           +YVAD+SN VIRKITN GVTT IAGG ++K G+ DGPAQNASFS DFEL FVP  CA+L+
Sbjct: 118 VYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPEKCAVLV 177

Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSV-LSCLVSLVIGFVARPYIIRH 234
           SD G+QL+RQI+LK EDC +S Q  S LG   +WVL+ + +SCLV  ++G ++RPY+I H
Sbjct: 178 SDRGSQLVRQIDLKVEDCRRSPQ--SVLGGAFLWVLLGLGVSCLVGFIVGIISRPYVIPH 235

Query: 235 TGRLDPPL-QHDMEALPNQSGETSTDLLLRHQKRN 268
           TG L P L Q D+EALPN SGETSTD LLRHQK++
Sbjct: 236 TGSLLPTLFQRDVEALPNPSGETSTDALLRHQKQS 270


>gi|255560013|ref|XP_002521025.1| conserved hypothetical protein [Ricinus communis]
 gi|223539862|gb|EEF41442.1| conserved hypothetical protein [Ricinus communis]
          Length = 408

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 154/225 (68%), Positives = 184/225 (81%), Gaps = 4/225 (1%)

Query: 14  LLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS 73
           LL L  + V  GL+LEDGYTVTT+IDGH+LEINPH+V+ RP SSDLI+LDSS S  YT+S
Sbjct: 18  LLHLPSNYVLGGLILEDGYTVTTIIDGHKLEINPHAVLSRPQSSDLILLDSSHSTIYTIS 77

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           FP+S+ESVVKRL+GDGV G SDGEPGSARF+KP+SFAVD KGNIYVAD+ N  IRKITN 
Sbjct: 78  FPISQESVVKRLSGDGVAGLSDGEPGSARFNKPRSFAVDNKGNIYVADRLNGTIRKITNS 137

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           GV+TIAGG SK  GR DGPAQNA+FS+DFE+ FV   CALLISDHGNQL+R++ LKP+DC
Sbjct: 138 GVSTIAGGYSKGFGREDGPAQNATFSSDFEVAFVAEECALLISDHGNQLVRRLPLKPDDC 197

Query: 194 SKSSQSGSALGAVSVWV--LVSVLSCLVSLVIGFVARPYIIRHTG 236
           + +S   SALGAVS WV  L  V+SCL+ + IGFV RP+I+ + G
Sbjct: 198 ATASH--SALGAVSFWVLGLGLVMSCLIGIAIGFVIRPHIVPYEG 240


>gi|297850862|ref|XP_002893312.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297339154|gb|EFH69571.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 398

 Score =  281 bits (720), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 138/229 (60%), Positives = 179/229 (78%), Gaps = 4/229 (1%)

Query: 7   YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
           +SS+++ L  ++F+LVS  ++LEDGY VTTV+DGH+  +NP+++   PGSS+LIVLDSS 
Sbjct: 5   FSSVSLFLF-VVFNLVSGKIVLEDGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63

Query: 67  SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           S FYT SFPLS +SV+ R AGDG  GY DG+ G++RF KP+ FA+D KGN+YVAD+SN  
Sbjct: 64  STFYTTSFPLSFDSVIHRFAGDGTSGYVDGKAGNSRFKKPRGFAIDAKGNVYVADRSNKA 123

Query: 127 IRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           IRKI++ G VTTIAGG SK+ G  DGPAQNA+FS+DFE+TFVP  C LL+SDHGN+++RQ
Sbjct: 124 IRKISSSGYVTTIAGGISKEFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMVRQ 183

Query: 186 INLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRH 234
           INLK EDC +SS S   LGA S+W +   LSC++ + IGF  RPYIIRH
Sbjct: 184 INLKEEDCLESSHSN--LGAYSLWSIGIFLSCILGVAIGFAVRPYIIRH 230


>gi|388505270|gb|AFK40701.1| unknown [Medicago truncatula]
          Length = 384

 Score =  278 bits (711), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 133/206 (64%), Positives = 162/206 (78%), Gaps = 2/206 (0%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD 88
           E+GYT+TT++DGH+L INP S++ RP SSDLIVLDS+ S FYT+  P+S+ESV KR +G+
Sbjct: 31  EEGYTITTILDGHKLHINPFSILQRPISSDLIVLDSTNSTFYTVQLPISQESVFKRFSGN 90

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           G  GY DG+ G ARFDKP+SFAVD +GN+YVAD+ N VIRKI+  GVTTIAGG S+K   
Sbjct: 91  GSPGYEDGDVGLARFDKPRSFAVDFRGNVYVADRVNKVIRKISTNGVTTIAGGSSEKSSI 150

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSV 208
            DGP QNASFSNDFELTF+P +CALL+SDH +QL+ QINLK EDC+  S+  SALGAV  
Sbjct: 151 KDGPVQNASFSNDFELTFIPALCALLVSDHMHQLVHQINLKGEDCTLGSK--SALGAVMT 208

Query: 209 WVLVSVLSCLVSLVIGFVARPYIIRH 234
           W L   LSC++ LVIG V RPYII H
Sbjct: 209 WTLGLGLSCILGLVIGIVIRPYIIPH 234


>gi|42571623|ref|NP_973902.1| NHL domain-containing protein [Arabidopsis thaliana]
 gi|332192325|gb|AEE30446.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 400

 Score =  277 bits (709), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 137/229 (59%), Positives = 178/229 (77%), Gaps = 4/229 (1%)

Query: 7   YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
           +SS+++ L  ++F+LVS  ++LE+GY VTTV+DGH+  +NP+++   PGSS+LIVLDSS 
Sbjct: 5   FSSVSLFLF-VVFNLVSGKIVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63

Query: 67  SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           S FYT SFPLS +SV+ R AGDG  G+ DG+ G++RF KP+ FAVD KGN+YVADKSN  
Sbjct: 64  STFYTTSFPLSVDSVINRFAGDGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKA 123

Query: 127 IRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           IRKI++ G VTTIAGG SK  G  DGPAQNA+FS+DFE+TFVP  C LL+SDHGN++IRQ
Sbjct: 124 IRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQ 183

Query: 186 INLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRH 234
           INLK EDC ++S S   LG  S+W +  VLSC++ + IGF  RPY+IRH
Sbjct: 184 INLKEEDCLENSHSN--LGTYSLWSIGIVLSCILGVAIGFAVRPYVIRH 230


>gi|225424715|ref|XP_002265272.1| PREDICTED: uncharacterized protein LOC100243227 [Vitis vinifera]
          Length = 438

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 139/238 (58%), Positives = 184/238 (77%), Gaps = 11/238 (4%)

Query: 1   MASHFVYSSLAITLLSLLFSLV----SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           MA++    SL++ ++  +F++     S+ L+LEDGYTV TV DG++LEINPHS++ R GS
Sbjct: 1   MATYL--QSLSLFIVFTIFTIAAIHGSADLVLEDGYTVRTVFDGNKLEINPHSILPRYGS 58

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           SD I+LDSS+S FYT+S PLS+ES +KRL+G    G+SDG+  SA F KP+SFAVD+KGN
Sbjct: 59  SDFIILDSSKSVFYTVSSPLSQESEIKRLSGSSA-GFSDGDSASATFSKPRSFAVDLKGN 117

Query: 117 IYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           +YVAD+SN VIRKITN GV TTIAGG ++K G+ DGPAQNASFS DFEL FVP  CA+L+
Sbjct: 118 VYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPEKCAVLV 177

Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSV-LSCLVSLVIGFVARPYII 232
           SD G+QL+RQI+LK EDC +S Q  S LG   +WVL+ + +SCLV  ++G ++RPY+I
Sbjct: 178 SDRGSQLVRQIDLKVEDCRRSPQ--SVLGGAFLWVLLGLGVSCLVGFIVGIISRPYVI 233


>gi|296086532|emb|CBI32121.3| unnamed protein product [Vitis vinifera]
          Length = 369

 Score =  274 bits (700), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 139/238 (58%), Positives = 184/238 (77%), Gaps = 11/238 (4%)

Query: 1   MASHFVYSSLAITLLSLLFSLV----SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           MA++    SL++ ++  +F++     S+ L+LEDGYTV TV DG++LEINPHS++ R GS
Sbjct: 1   MATYL--QSLSLFIVFTIFTIAAIHGSADLVLEDGYTVRTVFDGNKLEINPHSILPRYGS 58

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           SD I+LDSS+S FYT+S PLS+ES +KRL+G    G+SDG+  SA F KP+SFAVD+KGN
Sbjct: 59  SDFIILDSSKSVFYTVSSPLSQESEIKRLSGSSA-GFSDGDSASATFSKPRSFAVDLKGN 117

Query: 117 IYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           +YVAD+SN VIRKITN GV TTIAGG ++K G+ DGPAQNASFS DFEL FVP  CA+L+
Sbjct: 118 VYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPEKCAVLV 177

Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSV-LSCLVSLVIGFVARPYII 232
           SD G+QL+RQI+LK EDC +S Q  S LG   +WVL+ + +SCLV  ++G ++RPY+I
Sbjct: 178 SDRGSQLVRQIDLKVEDCRRSPQ--SVLGGAFLWVLLGLGVSCLVGFIVGIISRPYVI 233


>gi|356526828|ref|XP_003532018.1| PREDICTED: uncharacterized protein LOC100816542 [Glycine max]
          Length = 400

 Score =  253 bits (647), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 134/212 (63%), Positives = 161/212 (75%), Gaps = 3/212 (1%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           ++ E+GYTVTTV DGH+  I P +V+ RP SSDLI+LDS  S FYT  FP++EESV  RL
Sbjct: 24  VITEEGYTVTTVFDGHKPHIFPFTVLQRPFSSDLILLDSVNSTFYTAQFPITEESVFTRL 83

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG-SK 144
           +GDG  GYSDG+ GSARF KP+SFA DM+GN+YVADKSN  IRKI+  GVTTIAGG  S+
Sbjct: 84  SGDGSVGYSDGDVGSARFAKPRSFAFDMRGNVYVADKSNRAIRKISAKGVTTIAGGEFSE 143

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
           K    DGPA NASFSNDF+LTF+P +CALL+SDH ++L+RQINL  EDC+  S+ G  LG
Sbjct: 144 KSRTKDGPALNASFSNDFDLTFIPGLCALLVSDHMHRLVRQINLMEEDCTLGSKPG--LG 201

Query: 205 AVSVWVLVSVLSCLVSLVIGFVARPYIIRHTG 236
           AV  W L   LSCL+ LVIG V RPYII + G
Sbjct: 202 AVMTWTLGLGLSCLLGLVIGIVVRPYIIPNKG 233


>gi|9369410|gb|AAF87158.1|AC002423_23 T23E23.6 [Arabidopsis thaliana]
          Length = 370

 Score =  243 bits (621), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 123/212 (58%), Positives = 161/212 (75%), Gaps = 7/212 (3%)

Query: 7   YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
           +SS+++ L  ++F+LVS  ++LE+GY VTTV+DGH+  +NP+++   PGSS+LIVLDSS 
Sbjct: 5   FSSVSLFLF-VVFNLVSGKIVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63

Query: 67  SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           S FYT SFPLS +SV+ R AGDG  G+ DG+ G++RF KP+ FAVD KGN+YVADKSN  
Sbjct: 64  STFYTTSFPLSVDSVINRFAGDGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKA 123

Query: 127 IRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           IRKI++ G VTTIAGG SK  G  DGPAQNA+FS+DFE+TFVP  C LL+SDHGN++IRQ
Sbjct: 124 IRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQ 183

Query: 186 INLKPEDCSKSSQSG-----SALGAVSVWVLV 212
           INLK EDC ++S S      + L  +  W L+
Sbjct: 184 INLKEEDCLENSHSSIQEEVNHLSFIMTWKLL 215


>gi|148909539|gb|ABR17864.1| unknown [Picea sitchensis]
          Length = 264

 Score =  224 bits (571), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 126/267 (47%), Positives = 176/267 (65%), Gaps = 21/267 (7%)

Query: 14  LLSLLFSLVS---SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFY 70
           LL +LFSL++     L+ E+GY V TV+ G+++++ PHS++  P + D+++LDS  S  +
Sbjct: 8   LLIILFSLITFTEPSLIPEEGYRVETVLYGNKMDVYPHSIL--PFNGDILLLDSVNSTLF 65

Query: 71  TLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            +  PLS++S +K  AG  + V G++DG    A+F+ PKSF +D KGNIYVAD++NH IR
Sbjct: 66  RIGLPLSQDSTIKVFAGSRNTVPGFADGGFLDAQFNHPKSFTLDSKGNIYVADRANHAIR 125

Query: 129 KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           KI+  GV+TIAGG + K G ADGP+Q A FSND++LTF+P ICALL+SD GN++IRQI L
Sbjct: 126 KISKSGVSTIAGGTAGKTGHADGPSQEAMFSNDYDLTFIPSICALLVSDRGNRMIRQIKL 185

Query: 189 KPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSL-----VIGFVARPYIIRHTGRL-DPPL 242
            P DC + S SG+       WV   V++  VS      ++ F+ +PYI  +TG L     
Sbjct: 186 PPGDCVQHSGSGTH------WV-TPVIASGVSFLLGLLLMAFL-QPYIAAYTGGLWSSSS 237

Query: 243 QHDMEALPNQSGETSTDLLLRHQKRNC 269
              ME +PN  GETS D LL H KRNC
Sbjct: 238 NASMERIPNNHGETSADELLCHVKRNC 264


>gi|115476416|ref|NP_001061804.1| Os08g0414600 [Oryza sativa Japonica Group]
 gi|37806208|dbj|BAC99711.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623773|dbj|BAF23718.1| Os08g0414600 [Oryza sativa Japonica Group]
 gi|215715379|dbj|BAG95130.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215715380|dbj|BAG95131.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 275

 Score =  223 bits (569), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 126/253 (49%), Positives = 166/253 (65%), Gaps = 11/253 (4%)

Query: 25  GLLLEDGYTVTTVID----GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
           GL+L+DGYTVTT  D    G     +P++++ RP + DL++LDS+ SA YTL+ PLS  +
Sbjct: 26  GLVLDDGYTVTTAADLNHPGAPAAPHPYALLPRPRAGDLVLLDSAASALYTLALPLSGGA 85

Query: 81  VVKRLAGDGVQGYSDG--EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
             + LAG G         EP  A FD+P+S AVD   N+YVAD+ N  +RKI   G  TT
Sbjct: 86  AARTLAGGGGGPAGFADGEPRDAAFDRPRSLAVDHADNVYVADRMNGAVRKIAPSGFTTT 145

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS 197
           IAGG SK  GR DGPAQNA+FS+DFEL +VP +CALL++D GN+LIRQINLK EDC++ +
Sbjct: 146 IAGGRSKGPGRKDGPAQNATFSSDFELVYVPKMCALLVTDRGNRLIRQINLKQEDCARET 205

Query: 198 QSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPL-QHDMEALPNQSGET 256
           Q G  LG  SV + ++VL  L+  VIGF  R +   H G L  PL Q D +A+ +   + 
Sbjct: 206 QPG--LGTTSVSI-IAVLCALLGSVIGFSIRHFYPAHAGSLHQPLFQEDADAVQDNPEDG 262

Query: 257 STDLLLRHQKRNC 269
            +D LLRHQKR+C
Sbjct: 263 RSDQLLRHQKRSC 275


>gi|226495023|ref|NP_001144050.1| uncharacterized protein LOC100276874 precursor [Zea mays]
 gi|195636116|gb|ACG37526.1| hypothetical protein [Zea mays]
 gi|414870559|tpg|DAA49116.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 272

 Score =  216 bits (549), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 119/248 (47%), Positives = 161/248 (64%), Gaps = 8/248 (3%)

Query: 27  LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +LEDGYTVTT  D + L     +P++V+ R  + DL++LDS+ SA YT+S   S     +
Sbjct: 28  VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
              G    G++DG PG A FD+P+S AVD   N+YVAD+ +  +RK+   G  TTIAGG 
Sbjct: 88  LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S   G  DGPAQNA+FS DF L +VP ICALL++D GN+++RQINLKPEDC+   QSG  
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSG-- 205

Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPL-QHDMEALPNQSGETSTDLL 261
           LG   V   ++VL  L+  +IGF+AR +   + G L  PL Q D EA+P  + E ++D L
Sbjct: 206 LGNTLVS-FIAVLCTLLGSLIGFLARHFYPVNAGSLRQPLFQQDTEAIPEDTEEGNSDQL 264

Query: 262 LRHQKRNC 269
           LRH+KR+C
Sbjct: 265 LRHKKRSC 272


>gi|222640551|gb|EEE68683.1| hypothetical protein OsJ_27311 [Oryza sativa Japonica Group]
          Length = 420

 Score =  195 bits (496), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 110/217 (50%), Positives = 144/217 (66%), Gaps = 10/217 (4%)

Query: 25  GLLLEDGYTVTTVID----GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
           GL+L+DGYTVTT  D    G     +P++++ RP + DL++LDS+ SA YTL+ PLS  +
Sbjct: 26  GLVLDDGYTVTTAADLNHPGAPAAPHPYALLPRPRAGDLVLLDSAASALYTLALPLSGGA 85

Query: 81  VVKRLAGDGVQGYSDG--EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
             + LAG G         EP  A FD+P+S AVD   N+YVAD+ N  +RKI   G  TT
Sbjct: 86  AARTLAGGGGGPAGFADGEPRDAAFDRPRSLAVDHADNVYVADRMNGAVRKIAPSGFTTT 145

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS 197
           IAGG SK  GR DGPAQNA+FS+DFEL +VP +CALL++D GN+LIRQINLK EDC++ +
Sbjct: 146 IAGGRSKGPGRKDGPAQNATFSSDFELVYVPKMCALLVTDRGNRLIRQINLKQEDCARET 205

Query: 198 QSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRH 234
           Q G  LG  SV + ++VL  L+  VIGF  R +   H
Sbjct: 206 QPG--LGTTSVSI-IAVLCALLGSVIGFSIRHFYPAH 239


>gi|414870561|tpg|DAA49118.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 280

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 100/206 (48%), Positives = 134/206 (65%), Gaps = 7/206 (3%)

Query: 27  LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +LEDGYTVTT  D + L     +P++V+ R  + DL++LDS+ SA YT+S   S     +
Sbjct: 28  VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
              G    G++DG PG A FD+P+S AVD   N+YVAD+ +  +RK+   G  TTIAGG 
Sbjct: 88  LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S   G  DGPAQNA+FS DF L +VP ICALL++D GN+++RQINLKPEDC+   QSG  
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGLG 207

Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVAR 228
              VS    ++VL  L+  +IGF+AR
Sbjct: 208 NTLVS---FIAVLCTLLGSLIGFLAR 230


>gi|224028415|gb|ACN33283.1| unknown [Zea mays]
 gi|414870560|tpg|DAA49117.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 418

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 100/206 (48%), Positives = 134/206 (65%), Gaps = 7/206 (3%)

Query: 27  LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +LEDGYTVTT  D + L     +P++V+ R  + DL++LDS+ SA YT+S   S     +
Sbjct: 28  VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
              G    G++DG PG A FD+P+S AVD   N+YVAD+ +  +RK+   G  TTIAGG 
Sbjct: 88  LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S   G  DGPAQNA+FS DF L +VP ICALL++D GN+++RQINLKPEDC+   QSG  
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGLG 207

Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVAR 228
              VS    ++VL  L+  +IGF+AR
Sbjct: 208 NTLVS---FIAVLCTLLGSLIGFLAR 230


>gi|242079119|ref|XP_002444328.1| hypothetical protein SORBIDRAFT_07g020240 [Sorghum bicolor]
 gi|241940678|gb|EES13823.1| hypothetical protein SORBIDRAFT_07g020240 [Sorghum bicolor]
          Length = 416

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 102/209 (48%), Positives = 141/209 (67%), Gaps = 12/209 (5%)

Query: 27  LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +LEDGYTVTT  + + L     +P++++ RP + DL++LDS+ SA YTLS   S     +
Sbjct: 29  VLEDGYTVTTFANFNPLPASGPHPYAILPRPRAGDLLLLDSAGSALYTLSLS-SSPGEPR 87

Query: 84  RLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           RLAG   + G+ DG+   A FD+P+S AVD   N+YVAD+ +  +RK+   G  TT+AGG
Sbjct: 88  RLAGGKRRSGFDDGD---AAFDRPRSVAVDAADNVYVADQRHGAVRKVAPSGYTTTVAGG 144

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
            S   G  DG AQNA+FS DFEL +VP ICALL++D GN+++RQINLKPEDC+   QSG 
Sbjct: 145 LSSGPGHRDGLAQNATFSADFELVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSG- 203

Query: 202 ALGAVSVWVLVSVLSCLVSLVIGFVARPY 230
            LG  SV V +++L  L+ L+IGF+ R +
Sbjct: 204 -LGTTSVSV-IAILCALLGLIIGFLVRHF 230


>gi|414870562|tpg|DAA49119.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 207

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 90/178 (50%), Positives = 119/178 (66%), Gaps = 4/178 (2%)

Query: 27  LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +LEDGYTVTT  D + L     +P++V+ R  + DL++LDS+ SA YT+S   S     +
Sbjct: 28  VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
              G    G++DG PG A FD+P+S AVD   N+YVAD+ +  +RK+   G  TTIAGG 
Sbjct: 88  LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
           S   G  DGPAQNA+FS DF L +VP ICALL++D GN+++RQINLKPEDC+   QSG
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSG 205


>gi|326503362|dbj|BAJ99306.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 452

 Score =  170 bits (430), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 97/212 (45%), Positives = 135/212 (63%), Gaps = 13/212 (6%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +GL+LEDGYTV+TV D +    +P++++ RP + DL++LDS+ S  YTL  P+S ++  +
Sbjct: 54  AGLVLEDGYTVSTVSDLNPSGTHPYALLPRPRAGDLVLLDSAGSTLYTLPLPVSADAGPR 113

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
           RLAG            +     P+S AVD   N+YVAD++N  IRK+   G  TTIAG  
Sbjct: 114 RLAGGAG---------ALGDGHPRSIAVDGADNVYVADRANGSIRKVAPSGYTTTIAGAY 164

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S   G  D PAQNA+FS DFEL +VP ICALL++D GN+LIRQI LKPEDC+  +Q G  
Sbjct: 165 SAGTGHRDEPAQNATFSADFELIYVPQICALLVADRGNRLIRQIKLKPEDCAHENQKG-- 222

Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVARPYIIRH 234
           LG  S  + +++L+ L   +IGF+ R +   H
Sbjct: 223 LGTTSTSI-IAILAALFGSIIGFLVRHFYPFH 253


>gi|115448467|ref|NP_001048013.1| Os02g0730400 [Oryza sativa Japonica Group]
 gi|46390465|dbj|BAD15926.1| putative NHL repeat-containing protein [Oryza sativa Japonica
           Group]
 gi|46390861|dbj|BAD16365.1| putative NHL repeat-containing protein [Oryza sativa Japonica
           Group]
 gi|113537544|dbj|BAF09927.1| Os02g0730400 [Oryza sativa Japonica Group]
 gi|125541002|gb|EAY87397.1| hypothetical protein OsI_08804 [Oryza sativa Indica Group]
 gi|125583566|gb|EAZ24497.1| hypothetical protein OsJ_08258 [Oryza sativa Japonica Group]
 gi|215715228|dbj|BAG94979.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 508

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 99/235 (42%), Positives = 140/235 (59%), Gaps = 17/235 (7%)

Query: 8   SSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
           S+ A +++  L+SL S         S +  E GYTV TV DG +L I P++V    G  +
Sbjct: 43  SNAASSVMKRLWSLKSTTKTGSGGKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQGG-E 101

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG P  AR + PK F VD +GN
Sbjct: 102 LLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGFPGHVDGRPREARMNHPKGFTVDGRGN 161

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
           IYVAD  N  IRKI++ GVTTIAGG S + G  DGP+ +A FS DFE+ ++   C+LL+ 
Sbjct: 162 IYVADAMNMAIRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIGSSCSLLVI 221

Query: 177 DHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSL----VIGFVA 227
           D GNQ IR+I L  +DC    ++G  LG V+V +  +    +++L    V+G V+
Sbjct: 222 DRGNQAIREIQLNFDDCVYQYEAGFPLG-VAVLLAAAFFGYMLALLQRRVLGMVS 275


>gi|115467514|ref|NP_001057356.1| Os06g0269300 [Oryza sativa Japonica Group]
 gi|53793193|dbj|BAD54399.1| NHL repeat-containing protein-like [Oryza sativa Japonica Group]
 gi|113595396|dbj|BAF19270.1| Os06g0269300 [Oryza sativa Japonica Group]
 gi|215768552|dbj|BAH00781.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222635365|gb|EEE65497.1| hypothetical protein OsJ_20925 [Oryza sativa Japonica Group]
          Length = 517

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 98/237 (41%), Positives = 134/237 (56%), Gaps = 18/237 (7%)

Query: 8   SSLAITLLSLLFSLVSSG------------LLLEDGYTVTTVIDGHQLEINPHSVIDRPG 55
           SS A  ++  L+SL S+             +  E GY V TV DG +L I PH V   P 
Sbjct: 45  SSTASAVVKQLWSLKSTATRTGSAAAGRSMVKYEGGYAVETVFDGSKLGIEPHDVEVTP- 103

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDM 113
           S +L+VLDS  S  Y +  PLS  S  K +AG  +G+ G+ DG    A+ + PK F VD 
Sbjct: 104 SGELLVLDSMNSNVYRVQLPLSRYSRPKLVAGSPEGLSGHVDGRLREAKMNHPKGFTVDD 163

Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
           +GNIYVAD  N  IRKI++ GVTTIAGG S + G  DGP+ +A FS DFE+ ++   C+L
Sbjct: 164 RGNIYVADAMNMAIRKISDTGVTTIAGGKSMRGGHLDGPSDDAKFSTDFEIRYISSSCSL 223

Query: 174 LISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWV---LVSVLSCLVSLVIGFVA 227
           L+ D GNQ IR+I L  +DC    ++G  LG   ++       +L+ L   V+G V+
Sbjct: 224 LVIDRGNQAIREIALHNDDCEYQYEAGFPLGIALLFAAGFFGYMLALLQRQVLGMVS 280


>gi|218197942|gb|EEC80369.1| hypothetical protein OsI_22479 [Oryza sativa Indica Group]
          Length = 517

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 98/237 (41%), Positives = 134/237 (56%), Gaps = 18/237 (7%)

Query: 8   SSLAITLLSLLFSLVSSG------------LLLEDGYTVTTVIDGHQLEINPHSVIDRPG 55
           SS A  ++  L+SL S+             +  E GY V TV DG +L I PH V   P 
Sbjct: 45  SSTASAVVKQLWSLKSTATRTGSAAAGRSMVKYEGGYAVETVFDGSKLGIEPHDVEVTP- 103

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDM 113
           S +L+VLDS  S  Y +  PLS  S  K +AG  +G+ G+ DG    A+ + PK F VD 
Sbjct: 104 SGELLVLDSMNSNVYRVQLPLSRYSRPKLVAGSPEGLSGHVDGRLREAKMNHPKGFTVDD 163

Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
           +GNIYVAD  N  IRKI++ GVTTIAGG S + G  DGP+ +A FS DFE+ ++   C+L
Sbjct: 164 RGNIYVADAMNMAIRKISDTGVTTIAGGKSMRGGHLDGPSDDAKFSTDFEIRYISSSCSL 223

Query: 174 LISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWV---LVSVLSCLVSLVIGFVA 227
           L+ D GNQ IR+I L  +DC    ++G  LG   ++       +L+ L   V+G V+
Sbjct: 224 LVIDRGNQAIREIALHNDDCEYQYEAGFPLGIALLFAAGFFGYMLALLQRQVLGMVS 280


>gi|242095458|ref|XP_002438219.1| hypothetical protein SORBIDRAFT_10g009760 [Sorghum bicolor]
 gi|241916442|gb|EER89586.1| hypothetical protein SORBIDRAFT_10g009760 [Sorghum bicolor]
          Length = 507

 Score =  160 bits (405), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 88/178 (49%), Positives = 112/178 (62%), Gaps = 3/178 (1%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GY V TV DG +L I PH+V   P + DL+VLDS  S  Y +  PLS  S  K LAG 
Sbjct: 76  EGGYAVDTVFDGSKLGIEPHAVEITP-AGDLLVLDSINSNIYRVQLPLSPYSRPKLLAGS 134

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
            +G+ G+ DG    AR + P  F VD KGNIYVAD  N  IRKI++ GVTTIAGG S + 
Sbjct: 135 PEGLSGHVDGRLREARMNHPNGFTVDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSIRG 194

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
           G  DGP+ +A FS DFE+ ++   C+LL+ D GNQ IR+I L  +DC+   ++G  LG
Sbjct: 195 GHIDGPSDDAKFSTDFEIQYISSSCSLLVIDRGNQAIREIPLNDDDCAYQYETGFPLG 252


>gi|326512916|dbj|BAK03365.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326529495|dbj|BAK04694.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326529987|dbj|BAK08273.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 503

 Score =  160 bits (405), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 95/234 (40%), Positives = 136/234 (58%), Gaps = 15/234 (6%)

Query: 8   SSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
           S+ A  +L  L+SL S+          +  E GYTV TV DG +L I P++V    G  +
Sbjct: 40  SNAASAVLKRLWSLKSTAKTASGAKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQGG-E 98

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L+V+DS  S  Y ++ PLS  S  K +AG  +G+ G+ DG    A+ + PK F VD +GN
Sbjct: 99  LLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGIPGHIDGRLREAKMNHPKGFTVDGRGN 158

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
           IYVAD  N  +RKI++ GVTTIAGG S + G  DGP+ +A FS DFE+ ++   C+LL+ 
Sbjct: 159 IYVADAMNMAVRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIASSCSLLVI 218

Query: 177 DHGNQLIRQINLKPEDCSKSSQSGSALGA---VSVWVLVSVLSCLVSLVIGFVA 227
           D GNQ IR+I L  +DC    ++G  LG    ++      +L+ L   V+G V+
Sbjct: 219 DRGNQAIREIQLNFDDCVYQYEAGFPLGVALLLAAGFFGYMLALLQRRVLGMVS 272


>gi|413923863|gb|AFW63795.1| hypothetical protein ZEAMMB73_706161 [Zea mays]
          Length = 515

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 94/208 (45%), Positives = 125/208 (60%), Gaps = 12/208 (5%)

Query: 8   SSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
           S+ A  ++  L+SL S         S +  E GYTV TV DG +L I P+SV +   S +
Sbjct: 44  SNAASAVVKRLWSLKSTTKTASGGKSMVKYEGGYTVETVFDGSKLGIEPYSV-EVTQSGE 102

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG+   AR + PK F VD KGN
Sbjct: 103 LLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDKGN 162

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
           IYVAD  N  IRKI++ GVTTIAGG S + G  DGP+  A FS DFE+ ++   C+LL+ 
Sbjct: 163 IYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLVI 222

Query: 177 DHGNQLIRQINLKPEDCSKSSQSGSALG 204
           D GNQ IR+I L  +DC    ++G  LG
Sbjct: 223 DRGNQAIREIQLHFDDCVYQYEAGFPLG 250


>gi|226530993|ref|NP_001143690.1| uncharacterized protein LOC100276420 precursor [Zea mays]
 gi|195624742|gb|ACG34201.1| hypothetical protein [Zea mays]
          Length = 515

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 94/209 (44%), Positives = 125/209 (59%), Gaps = 13/209 (6%)

Query: 8   SSLAITLLSLLFSLVS----------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSS 57
           S+ A  ++  L+SL S          S +  E GYTV TV DG +L I P+SV +   S 
Sbjct: 44  SNAASAVVKRLWSLKSTTKTAASGGKSMVKYEGGYTVETVFDGSKLGIEPYSV-EVTQSG 102

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           +L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG+   AR + PK F VD KG
Sbjct: 103 ELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDKG 162

Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           NIYVAD  N  IRKI++ GVTTIAGG S + G  DGP+  A FS DFE+ ++   C+LL+
Sbjct: 163 NIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLV 222

Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALG 204
            D GNQ IR+I L  +DC    ++G  LG
Sbjct: 223 IDRGNQAIREIQLHFDDCVYQYEAGFPLG 251


>gi|413923861|gb|AFW63793.1| hypothetical protein ZEAMMB73_706161 [Zea mays]
          Length = 516

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 94/209 (44%), Positives = 125/209 (59%), Gaps = 13/209 (6%)

Query: 8   SSLAITLLSLLFSLVS----------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSS 57
           S+ A  ++  L+SL S          S +  E GYTV TV DG +L I P+SV +   S 
Sbjct: 44  SNAASAVVKRLWSLKSTTKTAASGGKSMVKYEGGYTVETVFDGSKLGIEPYSV-EVTQSG 102

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           +L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG+   AR + PK F VD KG
Sbjct: 103 ELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDKG 162

Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           NIYVAD  N  IRKI++ GVTTIAGG S + G  DGP+  A FS DFE+ ++   C+LL+
Sbjct: 163 NIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLV 222

Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALG 204
            D GNQ IR+I L  +DC    ++G  LG
Sbjct: 223 IDRGNQAIREIQLHFDDCVYQYEAGFPLG 251


>gi|255552640|ref|XP_002517363.1| conserved hypothetical protein [Ricinus communis]
 gi|223543374|gb|EEF44905.1| conserved hypothetical protein [Ricinus communis]
          Length = 494

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 94/221 (42%), Positives = 131/221 (59%), Gaps = 14/221 (6%)

Query: 14  LLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDS 64
           L+  L+SL S         S +  E GYTV TV DG +L I PH+V   P S +L+VLDS
Sbjct: 42  LVKWLWSLKSNTKTAASSRSMMKFEGGYTVETVFDGSKLGIEPHTVDVSP-SGELLVLDS 100

Query: 65  SRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK 122
             S  Y +S PLS  S  K +AG  +G  G+ DG    AR + PK   VD +GNIY+AD 
Sbjct: 101 ENSNIYKISTPLSRYSRPKLIAGSPEGYSGHVDGRLREARMNHPKGLTVDNRGNIYIADT 160

Query: 123 SNHVIRKITNLGVTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
            N  IRKI++ GVTTIAGG  ++  G  DGP+++A FSNDF++ ++   C+LL+ D GNQ
Sbjct: 161 MNMAIRKISDGGVTTIAGGKWTRAGGHVDGPSEDAKFSNDFDIAYIGSSCSLLVIDRGNQ 220

Query: 182 LIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV 222
            IR+I L  +DC+        LG ++V V  +    +++L+
Sbjct: 221 AIREIQLNDDDCNNQYDGTFHLG-IAVLVAAAFFGYMLALL 260


>gi|195621162|gb|ACG32411.1| hypothetical protein [Zea mays]
 gi|219884599|gb|ACL52674.1| unknown [Zea mays]
 gi|413944257|gb|AFW76906.1| hypothetical protein ZEAMMB73_849037 [Zea mays]
          Length = 508

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 87/178 (48%), Positives = 112/178 (62%), Gaps = 3/178 (1%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GY V TV DG +L I PH+V   P + +L+VLDS  S  Y +  PLS  S  K LAG 
Sbjct: 73  EGGYAVDTVFDGSKLGIEPHAVEITP-AGNLLVLDSINSNIYRVQLPLSPYSRPKLLAGS 131

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
             G+ G+ DG    AR + PK F VD +GNIYVAD  N  IRKI++ GVTTIAGG S + 
Sbjct: 132 PKGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADGMNMAIRKISDTGVTTIAGGKSIRG 191

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
           G  DGP+ +A FS DFE+ ++   C+LL+ D GNQ IR+I L  +DC+   ++G  LG
Sbjct: 192 GHIDGPSDDAKFSTDFEIKYISSSCSLLVIDRGNQAIREIPLNDDDCAYQYEAGFPLG 249


>gi|293331401|ref|NP_001168100.1| uncharacterized protein LOC100381837 precursor [Zea mays]
 gi|223946007|gb|ACN27087.1| unknown [Zea mays]
 gi|413938717|gb|AFW73268.1| hypothetical protein ZEAMMB73_298159 [Zea mays]
          Length = 510

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 93/208 (44%), Positives = 126/208 (60%), Gaps = 12/208 (5%)

Query: 8   SSLAITLLSLLFSL-----VSSG----LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
           S+ A  ++  L+SL     ++SG    +  E GYTV TV DG +L I P+SV    G  +
Sbjct: 44  SNAASAVVKRLWSLKSTTKIASGGKSMVKYEGGYTVETVFDGSKLGIEPYSVEVTQGG-E 102

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG+   AR + PK F VD +GN
Sbjct: 103 LLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDRGN 162

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
           IYVAD  N  IRKI++ GVTTIAGG S + G  DGP+  A FS DFE+ ++   C+LL+ 
Sbjct: 163 IYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLVI 222

Query: 177 DHGNQLIRQINLKPEDCSKSSQSGSALG 204
           D GNQ IR+I L  +DC    ++G  LG
Sbjct: 223 DRGNQAIREIQLHFDDCVYQYEAGFPLG 250


>gi|357137830|ref|XP_003570502.1| PREDICTED: uncharacterized protein LOC100839308 [Brachypodium
           distachyon]
          Length = 495

 Score =  157 bits (397), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 96/234 (41%), Positives = 135/234 (57%), Gaps = 15/234 (6%)

Query: 8   SSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
           S+ A  +L  L+SL S         S +  E GYTV TV DG +L I P++V    G  +
Sbjct: 39  SNAASAVLKRLWSLKSTTKTATGAKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQGG-E 97

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG    A+ + PK F VD +GN
Sbjct: 98  LLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGYPGHVDGRLREAKMNHPKGFTVDGRGN 157

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
           IYVAD  N  +RKI++ GVTTIAGG S + G  DGP+ +A FS DFE+ ++   C+LL+ 
Sbjct: 158 IYVADAMNMAVRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIASSCSLLVI 217

Query: 177 DHGNQLIRQINLKPEDCSKSSQSGSALG---AVSVWVLVSVLSCLVSLVIGFVA 227
           D GNQ IR+I L  +DC    ++G  LG    ++      +L+ L   V+G V+
Sbjct: 218 DRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLGMVS 271


>gi|242066098|ref|XP_002454338.1| hypothetical protein SORBIDRAFT_04g029020 [Sorghum bicolor]
 gi|241934169|gb|EES07314.1| hypothetical protein SORBIDRAFT_04g029020 [Sorghum bicolor]
          Length = 518

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 93/209 (44%), Positives = 124/209 (59%), Gaps = 13/209 (6%)

Query: 8   SSLAITLLSLLFSLVS----------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSS 57
           S+ A  ++  ++SL S          S +  E GYTV TV DG +L I P+SV    G  
Sbjct: 44  SNAASAVVKRMWSLKSTTKTAASGGKSMVKYEGGYTVETVFDGSKLGIEPYSVEVTQGG- 102

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           +L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG+   AR + PK F VD +G
Sbjct: 103 ELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDRG 162

Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           NIYVAD  N  IRKI++ GVTTIAGG S + G  DGP+  A FS DFE+ +V   C+LL+
Sbjct: 163 NIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYVGSSCSLLV 222

Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALG 204
            D GNQ IR+I L  +DC    ++G  LG
Sbjct: 223 IDRGNQAIREIQLHFDDCVYQYEAGFPLG 251


>gi|413938718|gb|AFW73269.1| hypothetical protein ZEAMMB73_298159 [Zea mays]
          Length = 511

 Score =  157 bits (396), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 93/209 (44%), Positives = 124/209 (59%), Gaps = 13/209 (6%)

Query: 8   SSLAITLLSLLFSLVS----------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSS 57
           S+ A  ++  L+SL S          S +  E GYTV TV DG +L I P+SV    G  
Sbjct: 44  SNAASAVVKRLWSLKSTTKIAASGGKSMVKYEGGYTVETVFDGSKLGIEPYSVEVTQGG- 102

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           +L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG+   AR + PK F VD +G
Sbjct: 103 ELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDRG 162

Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           NIYVAD  N  IRKI++ GVTTIAGG S + G  DGP+  A FS DFE+ ++   C+LL+
Sbjct: 163 NIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLV 222

Query: 176 SDHGNQLIRQINLKPEDCSKSSQSGSALG 204
            D GNQ IR+I L  +DC    ++G  LG
Sbjct: 223 IDRGNQAIREIQLHFDDCVYQYEAGFPLG 251


>gi|357118152|ref|XP_003560822.1| PREDICTED: uncharacterized protein LOC100834977 [Brachypodium
           distachyon]
          Length = 544

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/204 (43%), Positives = 122/204 (59%), Gaps = 6/204 (2%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E  Y V TV DG +L I P++V   P + +L+VLDS  S  Y +  PLS  S  K ++G 
Sbjct: 79  EGWYEVETVFDGSKLGIEPYAVEVTP-AGELLVLDSMNSNIYRVQLPLSRYSRPKLVSGS 137

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
            +G+ G+ DG    AR + PK F VD +GNIYVAD  N  IRKI++ GVTTIAGG S + 
Sbjct: 138 PEGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSVRG 197

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAV 206
           G  DGP+ +A FS DFE+ ++   C+LL+ D GNQ IR+I L+ +DC    ++G  LG  
Sbjct: 198 GHTDGPSDDAKFSTDFEIRYISSSCSLLVIDRGNQAIREIPLQLDDCEYQHEAGFPLGVA 257

Query: 207 SVWV---LVSVLSCLVSLVIGFVA 227
            ++       +L+ L   V G V+
Sbjct: 258 LLFAAGFFGYMLALLQRRVFGMVS 281


>gi|449446849|ref|XP_004141183.1| PREDICTED: uncharacterized protein LOC101216407 [Cucumis sativus]
 gi|449489545|ref|XP_004158343.1| PREDICTED: uncharacterized protein LOC101227198 [Cucumis sativus]
          Length = 417

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 84/216 (38%), Positives = 129/216 (59%), Gaps = 5/216 (2%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           +  E GY V T++DG ++ I P+SV   P S +L++LD+  S  + +S P+S+    K  
Sbjct: 2   IKFERGYVVETLLDGSKMGIEPYSVGVSP-SGELLILDAENSNVHKISMPVSQFCRPKLF 60

Query: 86  AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
           AG  +G  G+ DG+   AR   P+   VD +GNIY+AD  N  IRKI++ GVTTIAGG  
Sbjct: 61  AGSSEGYSGHVDGKLRDARMSHPRGLTVDQRGNIYIADTKNKAIRKISDAGVTTIAGGKW 120

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
           +K G  DGP++ + FSNDF+L +V   C+LL+ D GNQ IR+I L+ EDC++    GS L
Sbjct: 121 RKSGHIDGPSEESKFSNDFDLVYVGSSCSLLVVDRGNQAIREIQLRAEDCTE--YDGSFL 178

Query: 204 GAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLD 239
             +++     +L  +++     V   +  ++  R+D
Sbjct: 179 LGIALLTAAMLLGYMLARFQFRVLATFSSKNDSRVD 214


>gi|225461726|ref|XP_002285515.1| PREDICTED: uncharacterized protein LOC100245328 [Vitis vinifera]
 gi|302142876|emb|CBI20171.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score =  154 bits (388), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 88/204 (43%), Positives = 125/204 (61%), Gaps = 7/204 (3%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  EDGY V TV++G++L + PHS+ +   G  +L  +D+ ++    ++ PLS+ S  + 
Sbjct: 57  LQFEDGYLVETVVEGNELGVVPHSIRVSEDG--ELFAVDAVKNNIVRITPPLSQYSRARL 114

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           +AG   G  G+ DG+P  ARF+ PK   +D KGN+YVAD SN  IRKI + GVTTIAGG 
Sbjct: 115 VAGSFQGHTGHVDGKPSDARFNGPKGVTMDDKGNVYVADTSNLAIRKIGDSGVTTIAGGK 174

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S   G  DGP+++A FS+DF++ +V   C+LL+ D GN  +RQI+L  EDC   + S SA
Sbjct: 175 SNVAGYRDGPSEDAKFSSDFDVVYVRPTCSLLVVDRGNAALRQISLNQEDCDYQNSSISA 234

Query: 203 LGAVSV--WVLVSVLSCLVSLVIG 224
                V   V+V   SCL+    G
Sbjct: 235 TDIFMVIGAVMVGYASCLLQKGFG 258


>gi|224102017|ref|XP_002312513.1| predicted protein [Populus trichocarpa]
 gi|222852333|gb|EEE89880.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score =  153 bits (386), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 141/238 (59%), Gaps = 16/238 (6%)

Query: 6   VYSSLAITLLSLLFSL------VSSG---LLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           ++S++   L+  L+SL      V SG   +  E GYTV TV DG +L I P+SV   P S
Sbjct: 38  LFSNVVSALMKWLWSLKATTKTVISGRPMMKFESGYTVETVFDGSKLGIEPYSVQVLP-S 96

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +L++LDS+ S  Y +S  LS  S  K +AG  +G  G+ DG+   A+ + PK   VD +
Sbjct: 97  GELLILDSANSNIYRISASLSLYSRPKLVAGSHEGYSGHVDGKLREAKMNHPKGLTVDDR 156

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           GNIYVAD  N  IRKI++ GVTTIAGG   +    DG +++A+FSNDF++ ++   C+LL
Sbjct: 157 GNIYVADTMNMAIRKISDAGVTTIAGGKWGRGSHVDGASEDANFSNDFDVVYIGSSCSLL 216

Query: 175 ISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV---IGFVARP 229
           + D GN+ IR+I L  +DC+    SG  LG ++V V       +++L+   +G +  P
Sbjct: 217 VIDRGNRAIREIQLHFDDCAYQYGSGFPLG-IAVLVAAGFFGYMLALLQRRVGMIVSP 273


>gi|224117004|ref|XP_002317450.1| predicted protein [Populus trichocarpa]
 gi|222860515|gb|EEE98062.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  153 bits (386), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 92/235 (39%), Positives = 130/235 (55%), Gaps = 14/235 (5%)

Query: 3   SHFVYS-SLAITLLSLLFSLVSSG-------LLLEDGYTVTTVIDGHQLEINPHSVIDRP 54
           SHF Y+  LA+  L+  F   +         L  EDGY V TV+ G+ + + P+  I   
Sbjct: 4   SHFSYTLVLAVLSLACTFQFQAHAAPPDENVLQFEDGYLVETVVKGNAMGVVPYK-IRLS 62

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVD 112
              +L  +D   S    ++ PLS+ S  + +AG   G  G+ DG+P   RF+ P+   +D
Sbjct: 63  EDGELYAVDEVNSNVVKITPPLSQYSRARLVAGSFQGYTGHIDGKPNEVRFNHPRGLTMD 122

Query: 113 MKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
            KGNIYVAD  NH IRKI + GVTTIAGG S   G  DGP+++A FSNDF++ +V   C+
Sbjct: 123 DKGNIYVADSLNHAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVHSTCS 182

Query: 173 LLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSV---WVLVSVLSCLVSLVIG 224
           LL+ D GN  +RQI+L  EDC   S S +    V +    VL+   +C++ L  G
Sbjct: 183 LLVVDRGNAALRQISLNQEDCDYQSSSFTMTEDVLMVVGAVLIGYATCMLQLGFG 237


>gi|356549639|ref|XP_003543199.1| PREDICTED: uncharacterized protein LOC100797841 [Glycine max]
          Length = 487

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 83/203 (40%), Positives = 122/203 (60%), Gaps = 4/203 (1%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           L  E+GY V TV++G+++ + P+ +       +L  +D+  S    ++ PLS+ S  + +
Sbjct: 57  LQFENGYVVETVVEGNEIGVVPYRIRVSEEDGELFAVDAINSNIVRITPPLSQYSRGRLV 116

Query: 86  AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
           AG   G  G+ DG+P  ARF+ PK   VD KGN+YVAD  N  IRKI + GVTTIAGG S
Sbjct: 117 AGSFQGYTGHVDGKPSDARFNHPKGITVDDKGNVYVADTQNLAIRKIGDAGVTTIAGGKS 176

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
              G  DGP+++A FSNDF++ +V   C+LL+ D GN  +RQI+L  EDC   S S S+ 
Sbjct: 177 NVAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLDQEDCDYQSNSISST 236

Query: 204 GAVSV--WVLVSVLSCLVSLVIG 224
             ++V   V+V   +C++    G
Sbjct: 237 DILTVVGAVIVGYATCMLQQGFG 259


>gi|293332279|ref|NP_001170567.1| uncharacterized protein LOC100384592 precursor [Zea mays]
 gi|238006082|gb|ACR34076.1| unknown [Zea mays]
          Length = 505

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/186 (47%), Positives = 115/186 (61%), Gaps = 7/186 (3%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
           S +  E GY V TV DG +L I PH+  I+R G  DL++LDS  S  Y +  PLS  S  
Sbjct: 71  SMVRYEGGYAVDTVFDGSKLGIEPHAAQINRAG--DLLLLDSINSNIYRVQLPLSPYSRP 128

Query: 83  KRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           K LAG  +G+ G+ DG    AR + PK F VD +GNIYVAD  N  IRKI++ GVTTIAG
Sbjct: 129 KLLAGSPEGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAG 188

Query: 141 GGSKKEGRADGP--AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
           G S + G  D P  + +A FS DFE+ ++   C+LL+ D GNQ IR+I L  +DC+   +
Sbjct: 189 GKSIRGGYIDEPSVSDDAKFSTDFEVQYISSTCSLLVIDRGNQAIREIPLNDDDCAYQYE 248

Query: 199 SGSALG 204
           +G  LG
Sbjct: 249 AGFPLG 254


>gi|413953662|gb|AFW86311.1| hypothetical protein ZEAMMB73_666053 [Zea mays]
          Length = 506

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/186 (47%), Positives = 115/186 (61%), Gaps = 7/186 (3%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
           S +  E GY V TV DG +L I PH+  I+R G  DL++LDS  S  Y +  PLS  S  
Sbjct: 71  SMVRYEGGYAVDTVFDGSKLGIEPHAAQINRAG--DLLLLDSINSNIYRVQLPLSPYSRP 128

Query: 83  KRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           K LAG  +G+ G+ DG    AR + PK F VD +GNIYVAD  N  IRKI++ GVTTIAG
Sbjct: 129 KLLAGSPEGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAG 188

Query: 141 GGSKKEGRADGP--AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
           G S + G  D P  + +A FS DFE+ ++   C+LL+ D GNQ IR+I L  +DC+   +
Sbjct: 189 GKSIRGGYIDEPSVSDDAKFSTDFEVQYISSTCSLLVIDRGNQAIREIPLNDDDCAYQYE 248

Query: 199 SGSALG 204
           +G  LG
Sbjct: 249 AGFPLG 254


>gi|9369390|gb|AAF87138.1|AC002423_3 T23E23.5 [Arabidopsis thaliana]
          Length = 493

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 94/242 (38%), Positives = 136/242 (56%), Gaps = 20/242 (8%)

Query: 1   MASHFVYSSLAITLLSLLFSLVSSG-----------LLLEDGYTVTTVIDGHQLEINPHS 49
           M+ HF++  + I L S   +   S            +  E+GY+V TV+DG +L I P+S
Sbjct: 1   MSRHFLFLGIIILLFSAFVASAPSSTSPATVPTKSMVKFENGYSVETVLDGSKLGIEPYS 60

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPK 107
            I    + +L++LDS  S  Y +S  LS  S  + + G  +G  G+ DG    AR + PK
Sbjct: 61  -IQVLSNGELLILDSQNSNIYQISSSLSLYSRPRLVTGSPEGYPGHVDGRLRDARLNNPK 119

Query: 108 SFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS-KKEGRADGPAQNASFSNDFELTF 166
              VD +GNIYVAD  N+ IRKI+  GVTTIAGG   +  G  DGP+++A FSNDF++ +
Sbjct: 120 GLTVDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVRGGGHVDGPSEDAKFSNDFDVVY 179

Query: 167 VPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAV-----SVWVLVSVLSCLVSL 221
           +   C+LL+ D GNQ IR+I L  +DC+    SG  LG +     SVWV   +L   +S 
Sbjct: 180 LGSSCSLLVIDRGNQAIREIQLHFDDCADQYGSGFPLGNIYIPKFSVWVPSCILQSSLSQ 239

Query: 222 VI 223
           ++
Sbjct: 240 LL 241


>gi|296087695|emb|CBI34951.3| unnamed protein product [Vitis vinifera]
          Length = 811

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 96/234 (41%), Positives = 136/234 (58%), Gaps = 14/234 (5%)

Query: 6   VYSSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           + S++   L+  L+SL S         S +  EDGYTV TV DG +L I P+S ++   S
Sbjct: 34  IVSNVVSALVKWLWSLKSTTNTAISSRSKMNFEDGYTVETVFDGSKLGIEPYS-LEVSTS 92

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +L+VLDS  S  Y +S PLS  S  K +AG  DG  G+ DG+   AR + PK   +D +
Sbjct: 93  GELLVLDSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDER 152

Query: 115 GNIYVADKSNHVIRKITNLGVTTIA-GGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
           GNIY+AD  N  IRKI++ GVTTIA G   +  G  DGP+++A FSNDF++ ++   C+L
Sbjct: 153 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSL 212

Query: 174 LISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVA 227
           L+ D GNQ IR+I L  EDC+        LG ++V V       +++L+   VA
Sbjct: 213 LVIDRGNQAIREIQLHYEDCAYQYNGSFHLG-IAVLVAAGFFGYMLALLQRRVA 265


>gi|224108107|ref|XP_002314724.1| predicted protein [Populus trichocarpa]
 gi|222863764|gb|EEF00895.1| predicted protein [Populus trichocarpa]
          Length = 487

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 91/238 (38%), Positives = 137/238 (57%), Gaps = 16/238 (6%)

Query: 6   VYSSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           V+S++    +  L+S+ S+          +  E GYTV TV DG +L I P+SV   P S
Sbjct: 41  VFSNVVPAFMKWLWSMKSTTKTVISGRPMMKFESGYTVETVFDGSKLGIEPYSVQVLP-S 99

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +L++LDS  S  Y +S  LS  S  K +AG  +G  G+ DG+   A+ + PK   VD +
Sbjct: 100 GELLILDSVNSNIYRMSSSLSLYSRPKLVAGSHEGYSGHVDGKLREAKMNHPKGLTVDDR 159

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           GNIY+AD  N  IRKI++ GVTTIAGG   +    DG +++A FSNDF++ ++   C+LL
Sbjct: 160 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGSHVDGASEDAKFSNDFDVLYIGSSCSLL 219

Query: 175 ISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV---IGFVARP 229
           + D GNQ IR+I L  +DC+    SG  LG ++V +       +++L+   +G +  P
Sbjct: 220 VIDRGNQAIREIQLHFDDCAYQYGSGFPLG-IAVLLAAGFFGYMLALLQRRVGMIVSP 276


>gi|449492692|ref|XP_004159073.1| PREDICTED: uncharacterized protein LOC101226879 [Cucumis sativus]
          Length = 516

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 136/229 (59%), Gaps = 14/229 (6%)

Query: 6   VYSSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           V S++  +L+  L+S+ SS          +  E GY+V TV DG +L I+P+SV   P S
Sbjct: 34  VLSNVVSSLVKKLWSMKSSAKTAVSSRSMMKFESGYSVETVFDGSKLGIDPYSVEMSP-S 92

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +L++LD+  S  + +S PLS  S  K ++G  +G  G+ DG    AR + PK   +D +
Sbjct: 93  GELLILDAENSNIHKISMPLSRFSRPKLVSGSAEGYSGHVDGHSREARMNHPKGLTLDER 152

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
           GNIY+AD  N  IRKI++ GVTTIAGG  ++  G  DGP+++A FSNDF++ +V   C+L
Sbjct: 153 GNIYIADTMNMAIRKISDTGVTTIAGGRWNQGSGHIDGPSEDAKFSNDFDVVYVGSSCSL 212

Query: 174 LISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV 222
           L+ D GN+ IR+I L  +DC+        LG V + V   +   L++L+
Sbjct: 213 LVIDRGNKAIREIELNYDDCNTQYADSLNLGVV-LLVAAGLFGYLLALL 260


>gi|147860395|emb|CAN80443.1| hypothetical protein VITISV_043282 [Vitis vinifera]
          Length = 527

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 97/245 (39%), Positives = 136/245 (55%), Gaps = 24/245 (9%)

Query: 6   VYSSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           + S++   L+  L+SL S         S +  EDGYTV TV DG +L I P+S ++   S
Sbjct: 17  IVSNVVSALVKWLWSLKSTTNTAISSXSKMNFEDGYTVETVFDGSKLGIEPYS-LEVSTS 75

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +L+VLDS  S  Y +S PLS  S  K +AG  DG  G+ DG+   AR + PK   +D +
Sbjct: 76  GELLVLDSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDER 135

Query: 115 GNIYVADKSNHVIRKITNLGVTTIA-GGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
           GNIY+AD  N  IRKI++ GVTTIA G   +  G  DGP+++A FSNDF++ ++   C+L
Sbjct: 136 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSL 195

Query: 174 LISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIR 233
           L+ D GNQ IR+I L  EDC+        LG +           L + VI  +  P++ R
Sbjct: 196 LVIDRGNQAIREIQLHYEDCAYQYNGSFHLGKLR----------LANFVIA-ITXPWLWR 244

Query: 234 HTGRL 238
              R 
Sbjct: 245 FMRRF 249


>gi|225452518|ref|XP_002274991.1| PREDICTED: uncharacterized protein LOC100266244 [Vitis vinifera]
          Length = 677

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 96/234 (41%), Positives = 136/234 (58%), Gaps = 14/234 (5%)

Query: 6   VYSSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           + S++   L+  L+SL S         S +  EDGYTV TV DG +L I P+S ++   S
Sbjct: 34  IVSNVVSALVKWLWSLKSTTNTAISSRSKMNFEDGYTVETVFDGSKLGIEPYS-LEVSTS 92

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +L+VLDS  S  Y +S PLS  S  K +AG  DG  G+ DG+   AR + PK   +D +
Sbjct: 93  GELLVLDSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDER 152

Query: 115 GNIYVADKSNHVIRKITNLGVTTIA-GGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
           GNIY+AD  N  IRKI++ GVTTIA G   +  G  DGP+++A FSNDF++ ++   C+L
Sbjct: 153 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSL 212

Query: 174 LISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVA 227
           L+ D GNQ IR+I L  EDC+        LG ++V V       +++L+   VA
Sbjct: 213 LVIDRGNQAIREIQLHYEDCAYQYNGSFHLG-IAVLVAAGFFGYMLALLQRRVA 265


>gi|356497860|ref|XP_003517774.1| PREDICTED: uncharacterized protein LOC100799483 [Glycine max]
          Length = 508

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 88/202 (43%), Positives = 125/202 (61%), Gaps = 5/202 (2%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E GYTV TV DG +L I P++V   P + +L++LDS+ S  Y +S  LS  S  K
Sbjct: 62  SMMKFESGYTVETVFDGSKLGIEPYAVEVLP-NGELLILDSANSNIYRISSSLSLNSRPK 120

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
            +AG  +G  G+ DG    AR + PK   VD +GNIYVAD  N  IRKI++ GVTTIAGG
Sbjct: 121 LVAGSAEGYSGHVDGRFREARMNHPKGITVDNRGNIYVADIMNMAIRKISDSGVTTIAGG 180

Query: 142 -GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
             S+  G  DGP++ A FSNDF++ +V   C+LL+ D GNQ IR+I L  +DC+   ++G
Sbjct: 181 KWSRGGGHVDGPSEEAKFSNDFDVVYVGSSCSLLVIDRGNQAIREIQLHFDDCAYQYENG 240

Query: 201 SALGAVSVWVLVSVLSCLVSLV 222
             LG ++V +       +++L+
Sbjct: 241 FPLG-IAVLIGAGFFGYMLALL 261


>gi|449456567|ref|XP_004146020.1| PREDICTED: uncharacterized protein LOC101206392 [Cucumis sativus]
          Length = 477

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 85/218 (38%), Positives = 128/218 (58%), Gaps = 16/218 (7%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
           + +  E GY V T+++G+++ + P+ + +   G  +L  +DS  S    +S PLS  S  
Sbjct: 55  NAIQFESGYLVETIVEGNEIGMVPYKIRVSEDG--ELFAVDSVNSNVVKVSPPLSRYSRA 112

Query: 83  KRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           + +AG   G +G+ DG+P  ARF++PK   +D KGN+YVAD  N  IRKI + GVTTIAG
Sbjct: 113 RLVAGSFQGYKGHVDGKPSDARFNQPKGITIDDKGNVYVADTLNLAIRKIVDAGVTTIAG 172

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
           G +   G +DGP + A FSNDF++ +V   C+LL+ D GN  +RQI+L  EDC       
Sbjct: 173 GKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVVDRGNAALRQISLNKEDCD------ 226

Query: 201 SALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRL 238
              G+VS      V   + +L+IG+    Y+++H  RL
Sbjct: 227 YQYGSVST---SDVAMFIGALLIGYFT--YMLQHGFRL 259


>gi|449518065|ref|XP_004166064.1| PREDICTED: uncharacterized protein LOC101228723 [Cucumis sativus]
          Length = 449

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 85/218 (38%), Positives = 128/218 (58%), Gaps = 16/218 (7%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
           + +  E GY V T+++G+++ + P+ + +   G  +L  +DS  S    +S PLS  S  
Sbjct: 27  NAIQFESGYLVETIVEGNEIGMVPYKIRVSEDG--ELFAVDSVNSNVVKVSPPLSRYSRA 84

Query: 83  KRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           + +AG   G +G+ DG+P  ARF++PK   +D KGN+YVAD  N  IRKI + GVTTIAG
Sbjct: 85  RLVAGSFQGYKGHVDGKPSDARFNQPKGITIDDKGNVYVADTLNLAIRKIVDAGVTTIAG 144

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
           G +   G +DGP + A FSNDF++ +V   C+LL+ D GN  +RQI+L  EDC       
Sbjct: 145 GKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVVDRGNAALRQISLNKEDCD------ 198

Query: 201 SALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRL 238
              G+VS      V   + +L+IG+    Y+++H  RL
Sbjct: 199 YQYGSVST---SDVAMFIGALLIGYFT--YMLQHGFRL 231


>gi|449530424|ref|XP_004172195.1| PREDICTED: uncharacterized LOC101210502, partial [Cucumis sativus]
          Length = 437

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 88/235 (37%), Positives = 133/235 (56%), Gaps = 10/235 (4%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  E+GY V TV++G+++ + PH + + + G  +L V+DS  S    ++ PLS+ +  + 
Sbjct: 16  LQFENGYLVGTVVEGNEIGVLPHKIHVSKDG--ELFVVDSVNSNIVKITPPLSKYTRARL 73

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           +AG      G+ DG+P  ARF+ P+   VD KGN+YVAD  N  IRKI + GVTTIAGG 
Sbjct: 74  VAGSFQSHTGHIDGKPNDARFNHPRGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGK 133

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S   G  DGP ++A FSNDF++ +V   C+LL+ D GN  IRQI+L  EDC     S S 
Sbjct: 134 SNVVGYRDGPGEDAKFSNDFDVMYVRSTCSLLVIDRGNAAIRQISLNQEDCEYQDSSIS- 192

Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEALPNQSGETS 257
               +  VL+ + + L       + R +   +  + +PPL+ +    P +   +S
Sbjct: 193 ----NSDVLMIIGAVLAGYATYMIQRGFGTSNVSQTNPPLETEYREKPYKPESSS 243


>gi|449449204|ref|XP_004142355.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101209861 [Cucumis sativus]
          Length = 454

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 84/202 (41%), Positives = 124/202 (61%), Gaps = 5/202 (2%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E GY+V TV DG +L I+P+SV   P S +L++LD+  S  + +S PLS  S  K
Sbjct: 13  SMMKFESGYSVETVFDGSKLGIDPYSVEMSP-SGELLILDAENSNIHKISMPLSRFSRPK 71

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
            ++G  +G  G+ DG    AR + PK   +D +GNIY+AD  N  IRKI++ GVTTIAGG
Sbjct: 72  LVSGSAEGYSGHVDGHSREARMNHPKGLTLDERGNIYIADTMNMAIRKISDTGVTTIAGG 131

Query: 142 -GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
             ++  G  DGP+++A FSNDF++ +V   C+LL+ D GN+ IR+I L  +DC+      
Sbjct: 132 RWNQGSGHIDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIELNYDDCNTQYADS 191

Query: 201 SALGAVSVWVLVSVLSCLVSLV 222
             LG V + V   +   L++L+
Sbjct: 192 LNLGVV-LLVAAGLFGYLLALL 212


>gi|255560011|ref|XP_002521024.1| conserved hypothetical protein [Ricinus communis]
 gi|223539861|gb|EEF41441.1| conserved hypothetical protein [Ricinus communis]
          Length = 500

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 92/234 (39%), Positives = 138/234 (58%), Gaps = 15/234 (6%)

Query: 1   MASHFVYSSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVI 51
           + S F  ++L++  +  L+SL ++          +  E GY V TV DG +L I P+SV 
Sbjct: 39  IVSGFFSNALSV-FMKWLWSLKATSKTAISGRPMMKFEGGYNVETVFDGSKLGIEPYSVE 97

Query: 52  DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSF 109
             P + +L++LDS+ S  Y +S  LS  S  K +AG  DG  G+ DG+   AR + PK  
Sbjct: 98  VLP-NGELLILDSANSNIYRISSSLSLYSRPKLVAGSPDGYSGHVDGKHREARMNHPKGL 156

Query: 110 AVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG-RADGPAQNASFSNDFELTFVP 168
            VD +GNIY+AD  N  IRKI++ GVTTIAGG   + G   DG +++A FSNDF++ ++ 
Sbjct: 157 TVDDRGNIYIADAMNMAIRKISDAGVTTIAGGKWGRGGSHVDGASEDAKFSNDFDVVYIG 216

Query: 169 HICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV 222
             C+LL+ D GN+ IR+I L  +DC+   +SG  LG V+V V       +++L+
Sbjct: 217 SSCSLLVIDRGNRAIREIQLHFDDCAYQYESGFPLG-VAVLVAAGFFGYMLALL 269


>gi|449438911|ref|XP_004137231.1| PREDICTED: uncharacterized protein LOC101210502 [Cucumis sativus]
          Length = 480

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 88/235 (37%), Positives = 133/235 (56%), Gaps = 10/235 (4%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  E+GY V TV++G+++ + PH + + + G  +L V+DS  S    ++ PLS+ +  + 
Sbjct: 59  LQFENGYLVGTVVEGNEIGVLPHKIHVSKDG--ELFVVDSVNSNIVKITPPLSKYTRARL 116

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           +AG      G+ DG+P  ARF+ P+   VD KGN+YVAD  N  IRKI + GVTTIAGG 
Sbjct: 117 VAGSFQSHTGHIDGKPNDARFNHPRGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGK 176

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S   G  DGP ++A FSNDF++ +V   C+LL+ D GN  IRQI+L  EDC     S S 
Sbjct: 177 SNVVGYRDGPGEDAKFSNDFDVMYVRSTCSLLVIDRGNAAIRQISLNQEDCEYQDSSIS- 235

Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEALPNQSGETS 257
               +  VL+ + + L       + R +   +  + +PPL+ +    P +   +S
Sbjct: 236 ----NSDVLMIIGAVLAGYATYMIQRGFGTSNVSQTNPPLETEYREKPYKPESSS 286


>gi|356501873|ref|XP_003519748.1| PREDICTED: uncharacterized protein LOC100797570 [Glycine max]
          Length = 507

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 83/184 (45%), Positives = 117/184 (63%), Gaps = 4/184 (2%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E GYTV TV DG +L I P++V +   + +L++LDS+ S  Y +S  LS  +  K
Sbjct: 62  SMMKFESGYTVETVFDGSKLGIEPYAV-EVLANGELLILDSANSNIYRISSSLSLNTRPK 120

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
            +AG  +G  G+ DG    AR + PK  AVD +GNIY+AD  N  IRKI++ GVTTIAGG
Sbjct: 121 LVAGSAEGYSGHVDGRLREARMNHPKGIAVDNRGNIYIADIMNMAIRKISDSGVTTIAGG 180

Query: 142 -GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
             S+  G  DGP++ A FSNDF++ ++   C+LL+ D GNQ IR+I L  +DC+   ++G
Sbjct: 181 KWSRGGGHIDGPSEEAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHFDDCAYQYENG 240

Query: 201 SALG 204
             LG
Sbjct: 241 LTLG 244


>gi|357140648|ref|XP_003571876.1| PREDICTED: uncharacterized protein LOC100824092 [Brachypodium
           distachyon]
          Length = 491

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 90/240 (37%), Positives = 135/240 (56%), Gaps = 22/240 (9%)

Query: 25  GLLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
            L  E GY V T+++G +L + PH++   P    +L+ +DS+ S    ++ PLSE S  +
Sbjct: 65  ALQFESGYFVETLVEGDKLGVTPHTIRVSPLEGGELLAVDSAHSNIVRITPPLSEYSRAR 124

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
            +AG   G  G+ DG+P  ARF +P   AVD  GN+Y+AD +N  IRKI + GVTTIAGG
Sbjct: 125 LVAGSFQGHAGHIDGKPIDARFKRPTGVAVDDTGNVYIADTANLAIRKIGDSGVTTIAGG 184

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
            S   G  DGP+++A FS DF++ +V  +C+LL+ D GN  +R+I L  EDC+       
Sbjct: 185 KSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKIALPQEDCT------- 237

Query: 202 ALGAVSVWVLVSVLSCLVSLVIGFVARPY---IIRHTGRLDPPLQHDMEALPNQSGETST 258
                  +   ++LS  + LVIG V   Y   +++H     P     +EA  ++  E+ST
Sbjct: 238 -------YQDAALLSSDIILVIGAVVAGYLFSVVQHG--FGPSSSEKIEAPEDEKQESST 288


>gi|356573394|ref|XP_003554846.1| PREDICTED: uncharacterized protein LOC100793916 [Glycine max]
          Length = 528

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 118/199 (59%), Gaps = 5/199 (2%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
            E GY+V T+ DG QL I PHSV   P + + +VLDS  S  Y +S  +S  S  K LAG
Sbjct: 70  FESGYSVETIFDGSQLGIEPHSVKISP-NGEFLVLDSENSNIYKVSGSMSRYSRPKLLAG 128

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
             +G  G+ DG P  AR + PK   VD +GNIY+AD  N  IRKI++ GVTTIAGG    
Sbjct: 129 SAEGNIGHIDGRPREARMNHPKGLTVDDRGNIYIADTLNMAIRKISDEGVTTIAGGKRGY 188

Query: 146 E-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
             G  DGP+++A FSNDF++ +V   C+LL+ D GN  IR+I L  +DC+   +  ++  
Sbjct: 189 AGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVVDRGNHAIREIQLHQDDCTSYDEDDNSFN 248

Query: 205 -AVSVWVLVSVLSCLVSLV 222
             + V V  +    +++L+
Sbjct: 249 LGIVVLVAAAFFGYMLALL 267


>gi|413936635|gb|AFW71186.1| hypothetical protein ZEAMMB73_395034 [Zea mays]
          Length = 501

 Score =  147 bits (371), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 87/214 (40%), Positives = 123/214 (57%), Gaps = 17/214 (7%)

Query: 22  VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEES 80
           V   L  E GY V T+++G +L + PH+V   P    +L+ +DS+ S    ++ PLSE S
Sbjct: 67  VGVALQFESGYFVETLVEGDKLGVTPHTVRVSPVEGGELLAVDSAHSNIVRITPPLSEYS 126

Query: 81  VVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
             + +AG   G  G+ DG+P  ARF +P   AVD  GN+YVAD +N  IRKI   GVTTI
Sbjct: 127 RGRLVAGSFQGHSGHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTI 186

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
           AGG S   G  DGP+++A FS DF++ +V  +C+LL+ D GN  +R+I+L  EDC+    
Sbjct: 187 AGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCT---- 242

Query: 199 SGSALGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
                     +   ++LS  + LVIG V   YI 
Sbjct: 243 ----------YQDSALLSSDLILVIGAVVAGYIF 266


>gi|3176691|gb|AAC18814.1| Contains homology to serine/threonine protein kinase gb|X99618 from
           Mycobacterium tuberculosis. ESTs gb|F14403, gb|F14404,
           and gb|N96730 come from this gene [Arabidopsis thaliana]
          Length = 493

 Score =  147 bits (370), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 85/214 (39%), Positives = 127/214 (59%), Gaps = 10/214 (4%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E+GY+V TV DG +L I P+S+   P + +L++LDS  S  Y +S  LS  S  +
Sbjct: 35  SMVKFENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILDSENSNIYKISSSLSLYSRPR 93

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
            + G  +G  G+ DG    A+ + PK   VD +GNIYVAD  N+ IRKI+  GVTTIAGG
Sbjct: 94  LVTGSPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGG 153

Query: 142 GS-KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
            + +  G  DGP+++A FSNDF++ +V   C+LL+ D GN+ IR+I L  +DC+    SG
Sbjct: 154 KTVRNGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDCAYQYGSG 213

Query: 201 SALGAVS------VWVLVSVLSCLVSLVIGFVAR 228
             LG ++      +W+ V V +     ++  + R
Sbjct: 214 FPLGKLTKVPNFFLWIAVLVAAGFFGYMLALLQR 247


>gi|115445197|ref|NP_001046378.1| Os02g0234200 [Oryza sativa Japonica Group]
 gi|50251216|dbj|BAD27660.1| putative NHL repeat-containing protein [Oryza sativa Japonica
           Group]
 gi|113535909|dbj|BAF08292.1| Os02g0234200 [Oryza sativa Japonica Group]
 gi|125538730|gb|EAY85125.1| hypothetical protein OsI_06475 [Oryza sativa Indica Group]
 gi|222622487|gb|EEE56619.1| hypothetical protein OsJ_06002 [Oryza sativa Japonica Group]
          Length = 493

 Score =  147 bits (370), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 85/210 (40%), Positives = 121/210 (57%), Gaps = 17/210 (8%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  E GY V T+++G +L + PH++   P    +L+ +DS+ S    ++ PLSE S  + 
Sbjct: 69  LQFESGYFVETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYSRGRL 128

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           +AG   G  G+ DG+P  ARF +P   AVD  GN+YVAD +N  IRKI   GVTTIAGG 
Sbjct: 129 VAGSFQGHSGHIDGKPSDARFKRPTGVAVDDTGNVYVADTANLAIRKIGESGVTTIAGGK 188

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S   G  DGP+++A FS DF++ +V  +C+LL+ D GN  +R+I L  EDC+        
Sbjct: 189 SNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKIALPQEDCT-------- 240

Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
                 +   ++LS  + LVIG V   YI 
Sbjct: 241 ------YQDATLLSSDIILVIGAVVAGYIF 264


>gi|226504928|ref|NP_001143593.1| uncharacterized protein LOC100276297 precursor [Zea mays]
 gi|195623016|gb|ACG33338.1| hypothetical protein [Zea mays]
          Length = 501

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 86/214 (40%), Positives = 123/214 (57%), Gaps = 17/214 (7%)

Query: 22  VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEES 80
           V   L  E GY V T+++G +L + PH++   P    +L+ +DS+ S    ++ PLSE S
Sbjct: 67  VGVALQFESGYFVETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYS 126

Query: 81  VVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
             + +AG   G  G+ DG+P  ARF +P   AVD  GN+YVAD +N  IRKI   GVTTI
Sbjct: 127 RGRLVAGSFQGHSGHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTI 186

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
           AGG S   G  DGP+++A FS DF++ +V  +C+LL+ D GN  +R+I+L  EDC+    
Sbjct: 187 AGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCT---- 242

Query: 199 SGSALGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
                     +   ++LS  + LVIG V   YI 
Sbjct: 243 ----------YQDSALLSSDLILVIGAVVAGYIF 266


>gi|30698020|ref|NP_177185.3| NHL domain-containing protein [Arabidopsis thaliana]
 gi|332196920|gb|AEE35041.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 509

 Score =  146 bits (369), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 94/255 (36%), Positives = 144/255 (56%), Gaps = 20/255 (7%)

Query: 13  TLLSLLFSL---------VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLD 63
           +L+  L+SL           S +  E+GY+V TV DG +L I P+S+   P + +L++LD
Sbjct: 42  SLMKWLWSLKTTTKTTIATRSMVKFENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILD 100

Query: 64  SSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
           S  S  Y +S  LS  S  + + G  +G  G+ DG    A+ + PK   VD +GNIYVAD
Sbjct: 101 SENSNIYKISSSLSLYSRPRLVTGSPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVAD 160

Query: 122 KSNHVIRKITNLGVTTIAGGGS-KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN 180
             N+ IRKI+  GVTTIAGG + +  G  DGP+++A FSNDF++ +V   C+LL+ D GN
Sbjct: 161 TVNNAIRKISEGGVTTIAGGKTVRNGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGN 220

Query: 181 QLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV---IGFVARPYIIRHTGR 237
           + IR+I L  +DC+    SG  LG ++V V       +++L+   +G +   +  +    
Sbjct: 221 KAIREIQLHFDDCAYQYGSGFPLG-IAVLVAAGFFGYMLALLQRRVGSIVSSHNDQEMFE 279

Query: 238 LDP---PLQHDMEAL 249
            DP   P++H   +L
Sbjct: 280 ADPDQKPMKHSRPSL 294


>gi|242061092|ref|XP_002451835.1| hypothetical protein SORBIDRAFT_04g008420 [Sorghum bicolor]
 gi|241931666|gb|EES04811.1| hypothetical protein SORBIDRAFT_04g008420 [Sorghum bicolor]
          Length = 495

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 85/210 (40%), Positives = 122/210 (58%), Gaps = 17/210 (8%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  E GY V T+++G +L + PH++   P    +L+ +DS+ S    ++ PLSE S  + 
Sbjct: 70  LQFESGYFVETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYSRGRL 129

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           +AG   G  G+ DG+P  ARF +P   AVD  GN+YVAD +N  IRKI   GVTTIAGG 
Sbjct: 130 VAGSFQGHSGHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTIAGGK 189

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S   G  DGP+++A FS DF++ +V  +C+LL+ D GN  +R+I+L  EDC+        
Sbjct: 190 SNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCT-------- 241

Query: 203 LGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
                 +   ++LS  + LVIG V   YI 
Sbjct: 242 ------YQDSALLSSDLILVIGAVVAGYIF 265


>gi|357452081|ref|XP_003596317.1| hypothetical protein MTR_2g075860 [Medicago truncatula]
 gi|355485365|gb|AES66568.1| hypothetical protein MTR_2g075860 [Medicago truncatula]
          Length = 493

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 83/204 (40%), Positives = 120/204 (58%), Gaps = 6/204 (2%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           L  E+GY V TV++G+++ + P+ +       +L  +D   S    ++ PLS+ S  + +
Sbjct: 63  LQFENGYVVETVVEGNEIGVIPYRIRVSEEDGELFAVDEINSNIVRITPPLSQYSRGRLV 122

Query: 86  AGDGVQGYSD---GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           AG   QGY+D   G+P  ARF+ PK   +D KGN+YVAD  N  IRKI + GVTTIAGG 
Sbjct: 123 AG-SFQGYTDHVDGKPSDARFNHPKGITMDDKGNVYVADTQNLAIRKIGDAGVTTIAGGK 181

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S   G  DGP+++A FSNDF++ +V   C+LL+ D GN  +R+I L  EDC   S S S+
Sbjct: 182 SNVAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRKIILDQEDCDYQSSSISS 241

Query: 203 LGAVSV--WVLVSVLSCLVSLVIG 224
              + V   VLV   +C++    G
Sbjct: 242 TDILIVVGAVLVGYATCMLQQGFG 265


>gi|22329761|ref|NP_173800.2| NHL domain-containing protein [Arabidopsis thaliana]
 gi|18700091|gb|AAL77657.1| At1g23880/T23E23_8 [Arabidopsis thaliana]
 gi|20856013|gb|AAM26643.1| At1g23880/T23E23_8 [Arabidopsis thaliana]
 gi|332192324|gb|AEE30445.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 545

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 84/198 (42%), Positives = 121/198 (61%), Gaps = 5/198 (2%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
            E+GY+V TV+DG +L I P+S I    + +L++LDS  S  Y +S  LS  S  + + G
Sbjct: 128 FENGYSVETVLDGSKLGIEPYS-IQVLSNGELLILDSQNSNIYQISSSLSLYSRPRLVTG 186

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS-K 144
             +G  G+ DG    AR + PK   VD +GNIYVAD  N+ IRKI+  GVTTIAGG   +
Sbjct: 187 SPEGYPGHVDGRLRDARLNNPKGLTVDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVR 246

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
             G  DGP+++A FSNDF++ ++   C+LL+ D GNQ IR+I L  +DC+    SG  LG
Sbjct: 247 GGGHVDGPSEDAKFSNDFDVVYLGSSCSLLVIDRGNQAIREIQLHFDDCADQYGSGFPLG 306

Query: 205 AVSVWVLVSVLSCLVSLV 222
            ++V V       +++L+
Sbjct: 307 -IAVLVAAVFFGYMLALL 323


>gi|297845436|ref|XP_002890599.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336441|gb|EFH66858.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 140/258 (54%), Gaps = 25/258 (9%)

Query: 8   SSLAITLLSLLFSLVS-------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLI 60
           +SL   L SL F   +       S +  E+GY+V TV+DG +L I P+S+   P + +L+
Sbjct: 41  TSLMKWLWSLSFKTTTITAVSTKSMVKFENGYSVETVLDGSKLGIEPYSLQVLP-NGELL 99

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           +LDS  S  Y +S  LS  S  + + G  +G  G+ DG    AR + PK   VD +GNIY
Sbjct: 100 ILDSQNSNIYKISSSLSLYSRPRLITGSPEGYPGHVDGRLRDARLNNPKGLTVDDRGNIY 159

Query: 119 VADKSNHVIRKITNLGVTTIAGGGS-KKEGRADGPAQNASFSNDFELTFVPHICALLISD 177
           VAD  N+ IRKI+  GVTTIAGG      G  DGP+++A FSNDF++ ++   C+LL+ D
Sbjct: 160 VADTVNNAIRKISEAGVTTIAGGKMVHGGGHVDGPSEDAKFSNDFDVVYLGSSCSLLVID 219

Query: 178 HGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVAR---PYIIRH 234
            GNQ IR+I L  +DC+    SG  LG     + V V +     ++  + R     I  H
Sbjct: 220 RGNQAIREIQLHFDDCADQYGSGFPLG-----IAVLVAAIFFGYMLALLQRRLSSIISYH 274

Query: 235 TGRLDPPLQHDMEALPNQ 252
           T       Q   EA+P+Q
Sbjct: 275 TD------QEVFEAVPDQ 286


>gi|30698018|ref|NP_850974.1| NHL domain-containing protein [Arabidopsis thaliana]
 gi|17065224|gb|AAL32766.1| Unknown protein [Arabidopsis thaliana]
 gi|21387163|gb|AAM47985.1| unknown protein [Arabidopsis thaliana]
 gi|332196919|gb|AEE35040.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 447

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 89/231 (38%), Positives = 135/231 (58%), Gaps = 11/231 (4%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
            E+GY+V TV DG +L I P+S+   P + +L++LDS  S  Y +S  LS  S  + + G
Sbjct: 4   FENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILDSENSNIYKISSSLSLYSRPRLVTG 62

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS-K 144
             +G  G+ DG    A+ + PK   VD +GNIYVAD  N+ IRKI+  GVTTIAGG + +
Sbjct: 63  SPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVR 122

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
             G  DGP+++A FSNDF++ +V   C+LL+ D GN+ IR+I L  +DC+    SG  LG
Sbjct: 123 NGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDCAYQYGSGFPLG 182

Query: 205 AVSVWVLVSVLSCLVSLV---IGFVARPYIIRHTGRLDP---PLQHDMEAL 249
            ++V V       +++L+   +G +   +  +     DP   P++H   +L
Sbjct: 183 -IAVLVAAGFFGYMLALLQRRVGSIVSSHNDQEMFEADPDQKPMKHSRPSL 232


>gi|297841763|ref|XP_002888763.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334604|gb|EFH65022.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 503

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 88/222 (39%), Positives = 131/222 (59%), Gaps = 14/222 (6%)

Query: 13  TLLSLLFSL---------VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLD 63
           +L+  L+SL           S +  E+GY+V TV DG +L I P+S+   P + +L++LD
Sbjct: 42  SLMKWLWSLKTTTKTTIATRSMVKFENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILD 100

Query: 64  SSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
           S  S  Y +S  LS  S  + + G  +G  G+ DG    A+ + PK   VD +GNIYVAD
Sbjct: 101 SENSNIYKISSSLSLYSRPRLVTGSPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVAD 160

Query: 122 KSNHVIRKITNLGVTTIAGGGS-KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN 180
             N+ IRKI+  GVTTIAGG + +  G  DGP+++A FSNDF++ +V   C+LL+ D GN
Sbjct: 161 TVNNAIRKISEGGVTTIAGGKTVRNGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGN 220

Query: 181 QLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLV 222
           + IR+I L  +DC+    SG  LG ++V V       +++L+
Sbjct: 221 KAIREIQLHFDDCAYQYGSGFPLG-IAVLVAAGFFGYMLALL 261


>gi|225424717|ref|XP_002265467.1| PREDICTED: uncharacterized protein LOC100263777 [Vitis vinifera]
 gi|296086531|emb|CBI32120.3| unnamed protein product [Vitis vinifera]
          Length = 491

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 87/202 (43%), Positives = 125/202 (61%), Gaps = 5/202 (2%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E GYTV TV DG +L I P+S I+  GS +L++LDS+ S  Y +S  LS+ +  K
Sbjct: 62  SMMKFESGYTVETVFDGSKLGIEPYS-IEVLGSGELLILDSANSNLYKISSSLSQYTRPK 120

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
            + G  +G  G+ DG    AR + PK   VD +GNIYVAD  N  IRKI++ GVTTIAGG
Sbjct: 121 LVTGSAEGYSGHVDGRLREARMNHPKGLTVDDRGNIYVADTLNMAIRKISDAGVTTIAGG 180

Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
              + G   DGP+++A FSNDF++ ++   C+LL+ D GNQ IR+I L  +DC+    SG
Sbjct: 181 KLGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHFDDCAYQYGSG 240

Query: 201 SALGAVSVWVLVSVLSCLVSLV 222
             LG ++V +       +++L+
Sbjct: 241 FPLG-IAVLIAAGFFGYMLALL 261


>gi|11994597|dbj|BAB02652.1| unnamed protein product [Arabidopsis thaliana]
          Length = 511

 Score =  144 bits (362), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 85/208 (40%), Positives = 120/208 (57%), Gaps = 8/208 (3%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           L  E+GY V TV++G+ + + P+  I      +L  +D   S    ++ PLS+ S  + +
Sbjct: 71  LQFENGYLVETVVEGNDIGVVPYK-IRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLV 129

Query: 86  AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
           AG   G  G++DG+P  ARF+ P+   +D KGN+YVAD  N  IRKI + GVTTIAGG S
Sbjct: 130 AGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKS 189

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
              G  DGP+++A FSNDF++ +V   C+LL+ D GN  +RQI+L  EDC     S  +L
Sbjct: 190 NIAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDCDYQDDSSISL 249

Query: 204 GAVSVWVL-----VSVLSCLVSLVIGFV 226
             V + VL     V      + LVIG V
Sbjct: 250 TGVDLRVLTEDHCVLTFGADILLVIGAV 277


>gi|147798773|emb|CAN69794.1| hypothetical protein VITISV_022544 [Vitis vinifera]
          Length = 491

 Score =  144 bits (362), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 87/202 (43%), Positives = 125/202 (61%), Gaps = 5/202 (2%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E GYTV TV DG +L I P+S I+  GS +L++LDS+ S  Y +S  LS+ +  K
Sbjct: 62  SMMKFESGYTVETVFDGSKLGIEPYS-IEVLGSGELLILDSANSNLYKISSSLSQYTRPK 120

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
            + G  +G  G+ DG    AR + PK   VD +GNIYVAD  N  IRKI++ GVTTIAGG
Sbjct: 121 LVTGSAEGYSGHVDGRLREARMNHPKGLTVDDRGNIYVADTLNMAIRKISDAGVTTIAGG 180

Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
              + G   DGP+++A FSNDF++ ++   C+LL+ D GNQ IR+I L  +DC+    SG
Sbjct: 181 KLGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHFDDCAYQYGSG 240

Query: 201 SALGAVSVWVLVSVLSCLVSLV 222
             LG ++V +       +++L+
Sbjct: 241 FPLG-IAVLIAAGFFGYMLALL 261


>gi|224117390|ref|XP_002331700.1| predicted protein [Populus trichocarpa]
 gi|222874306|gb|EEF11437.1| predicted protein [Populus trichocarpa]
          Length = 495

 Score =  143 bits (360), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 78/177 (44%), Positives = 109/177 (61%), Gaps = 7/177 (3%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  EDGY V TV++G+ + + P+ + +   G  +L  +D   S    ++ PLS+ S   R
Sbjct: 61  LQFEDGYLVETVVEGNAMGVVPYKIRVSEDG--ELYAVDEVNSNVVKITPPLSQYSRA-R 117

Query: 85  LAGDGVQGYS---DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
           LA    QGY+   DG+P  ARF+ P+   +D KGNIYVAD  N  IRKI + GVTTIAGG
Sbjct: 118 LAAGSFQGYTGHIDGKPNEARFNHPRGLTMDDKGNIYVADTLNLAIRKIGDAGVTTIAGG 177

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
            S   G  DGP+++A FSNDF++ ++   C+LL+ D GN  +RQI+L  EDC   S+
Sbjct: 178 KSNVAGFRDGPSEDAKFSNDFDVVYLHSTCSLLVVDRGNAALRQISLNQEDCDYQSK 234


>gi|255567088|ref|XP_002524526.1| conserved hypothetical protein [Ricinus communis]
 gi|223536200|gb|EEF37853.1| conserved hypothetical protein [Ricinus communis]
          Length = 484

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 80/204 (39%), Positives = 119/204 (58%), Gaps = 7/204 (3%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  EDGY V TV++G+++ + P+ + +   G  +L  +D   S    ++ PLS+ S  + 
Sbjct: 60  LQFEDGYLVETVVEGNEIGVVPYKIRVSEDG--ELYAVDEVNSNIVKITPPLSQYSRARL 117

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           +AG   G +G+ DG+   ARF+ PK   +D KGN+YVAD  N  IRKI + GVTTIAGG 
Sbjct: 118 VAGSFQGYKGHVDGKSNEARFNHPKGITMDDKGNVYVADTLNLAIRKIGDAGVTTIAGGK 177

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S   G  DGP+++A FS DF++ +V   C+LL+ D GN  +RQI+L  EDC   S S + 
Sbjct: 178 SNTAGYRDGPSEDAKFSTDFDVVYVHSTCSLLVVDRGNVALRQISLNQEDCDYQSSSITV 237

Query: 203 LGAVSV--WVLVSVLSCLVSLVIG 224
              + V   V    ++C++    G
Sbjct: 238 TDLLMVVGAVFTGYVTCMLQQGFG 261


>gi|356565931|ref|XP_003551189.1| PREDICTED: uncharacterized protein LOC100781836 [Glycine max]
          Length = 493

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 82/184 (44%), Positives = 115/184 (62%), Gaps = 4/184 (2%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E GY V TV DG +L I P++V   P + +L++LDS+ S  Y +S  LS  S  K
Sbjct: 66  SMMKFESGYNVETVFDGSKLGIEPYAVEVLP-NGELLILDSANSNIYRISSSLSLYSRPK 124

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
            +AG  +G  G+ DG+   AR + PK   VD +GNIYVAD +N  IRKI++ GVTTIAGG
Sbjct: 125 LVAGSAEGYSGHVDGKLREARMNHPKGITVDDRGNIYVADTTNMAIRKISDSGVTTIAGG 184

Query: 142 -GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
             ++  G  DGP++ A FS+D ++ +V   C+LL+ D GN+ IR+I L  +DC+    SG
Sbjct: 185 KWNRGGGHVDGPSEEAKFSDDLDVVYVGSSCSLLVIDRGNRAIREIQLHFDDCAYQYGSG 244

Query: 201 SALG 204
             LG
Sbjct: 245 FPLG 248


>gi|297834352|ref|XP_002885058.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297330898|gb|EFH61317.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 80/205 (39%), Positives = 121/205 (59%), Gaps = 8/205 (3%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  E+GY V TV++G+++ + P+ + +   G  +L  +D   S    ++ PLS+ S  + 
Sbjct: 67  LQFENGYLVETVVEGNEIGVVPYKIRVSHDG--ELYAVDELNSNIMKITPPLSQYSRGRL 124

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           +AG   G  G++DG+P  ARF+ P+   +D KGN+YV D  N  IRKI + GVTTIAGG 
Sbjct: 125 VAGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVGDTLNLAIRKIGDSGVTTIAGGK 184

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S   G  DGP+++A FSNDF++ +V   C+LL+ D GN  +RQI+L  EDC     S  +
Sbjct: 185 SNIAGYRDGPSEDAKFSNDFDVVYVRSTCSLLVIDRGNAALRQISLSDEDCDYQDDSSIS 244

Query: 203 LGAVSV---WVLVSVLSCLVSLVIG 224
           L  + +    VL+   +CL+    G
Sbjct: 245 LTDILLVIGAVLIGYATCLLQQGFG 269


>gi|42572439|ref|NP_974315.1| NHL domain-containing protein [Arabidopsis thaliana]
 gi|332642057|gb|AEE75578.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 493

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 80/204 (39%), Positives = 119/204 (58%), Gaps = 6/204 (2%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           L  E+GY V TV++G+ + + P+  I      +L  +D   S    ++ PLS+ S  + +
Sbjct: 71  LQFENGYLVETVVEGNDIGVVPYK-IRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLV 129

Query: 86  AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
           AG   G  G++DG+P  ARF+ P+   +D KGN+YVAD  N  IRKI + GVTTIAGG S
Sbjct: 130 AGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKS 189

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
              G  DGP+++A FSNDF++ +V   C+LL+ D GN  +RQI+L  EDC     S  +L
Sbjct: 190 NIAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDCDYQDDSSISL 249

Query: 204 GAVSV---WVLVSVLSCLVSLVIG 224
             + +    VL+   +C++    G
Sbjct: 250 TDILLVIGAVLIGYATCMLQQGFG 273


>gi|22331093|ref|NP_188104.2| NHL domain-containing protein [Arabidopsis thaliana]
 gi|20260314|gb|AAM13055.1| unknown protein [Arabidopsis thaliana]
 gi|25083516|gb|AAN72090.1| unknown protein [Arabidopsis thaliana]
 gi|332642056|gb|AEE75577.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 492

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 80/204 (39%), Positives = 119/204 (58%), Gaps = 6/204 (2%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           L  E+GY V TV++G+ + + P+  I      +L  +D   S    ++ PLS+ S  + +
Sbjct: 71  LQFENGYLVETVVEGNDIGVVPYK-IRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLV 129

Query: 86  AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
           AG   G  G++DG+P  ARF+ P+   +D KGN+YVAD  N  IRKI + GVTTIAGG S
Sbjct: 130 AGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKS 189

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
              G  DGP+++A FSNDF++ +V   C+LL+ D GN  +RQI+L  EDC     S  +L
Sbjct: 190 NIAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDCDYQDDSSISL 249

Query: 204 GAVSV---WVLVSVLSCLVSLVIG 224
             + +    VL+   +C++    G
Sbjct: 250 TDILLVIGAVLIGYATCMLQQGFG 273


>gi|357485963|ref|XP_003613269.1| hypothetical protein MTR_5g034750 [Medicago truncatula]
 gi|355514604|gb|AES96227.1| hypothetical protein MTR_5g034750 [Medicago truncatula]
          Length = 589

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 85/202 (42%), Positives = 116/202 (57%), Gaps = 22/202 (10%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLS------ 77
           S +  E GY V TV DG +L I P++V +   + +L++LDS  S  Y +S  LS      
Sbjct: 76  SMMKFESGYNVETVFDGSKLGIEPYAV-EVLSNGELLILDSENSNIYKISSSLSLCKCAN 134

Query: 78  ------------EESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
                         S  K +AG  +G  G+ DG+   AR + PK   VD +GNIYVAD  
Sbjct: 135 VITSPMMSLVAPTNSRPKLVAGSAEGYSGHVDGKLREARMNHPKGITVDDRGNIYVADIM 194

Query: 124 NHVIRKITNLGVTTIAGGG-SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
           N  IRKI++ GVTTIAGG  S+  G  DGP++ A FSNDF++ +V   C+LL+ D GNQ 
Sbjct: 195 NMAIRKISDSGVTTIAGGKLSRGGGHVDGPSEEAKFSNDFDVVYVGSSCSLLVIDRGNQA 254

Query: 183 IRQINLKPEDCSKSSQSGSALG 204
           IR+I L+ +DC+   +SG  LG
Sbjct: 255 IREIQLRFDDCAYQYESGFPLG 276


>gi|9755656|emb|CAC01808.1| putative protein [Arabidopsis thaliana]
          Length = 733

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 74/169 (43%), Positives = 105/169 (62%), Gaps = 5/169 (2%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GY + TV DG +L I P+++   P   +LIVLDS  S  + +S PLS     K L+G 
Sbjct: 54  ESGYNMETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYGKPKLLSGS 113

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
            +G  G+ DG+   AR ++P+  A+D +GNIYVAD  N  IRKI++ GV+TIA GG    
Sbjct: 114 QEGYTGHVDGKLKEARMNRPRGLAMDDRGNIYVADTINMAIRKISDDGVSTIAAGGRWSG 173

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
           G  +   ++  FS+DF+L +V   C+LL+ D GNQLI++I L   DCS+
Sbjct: 174 GSKE---ESMRFSDDFDLIYVSSSCSLLVIDRGNQLIKEIQLHDHDCSQ 219


>gi|30685100|ref|NP_196993.2| NHL domain-containing protein [Arabidopsis thaliana]
 gi|332004703|gb|AED92086.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 754

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 74/169 (43%), Positives = 105/169 (62%), Gaps = 5/169 (2%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GY + TV DG +L I P+++   P   +LIVLDS  S  + +S PLS     K L+G 
Sbjct: 75  ESGYNMETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYGKPKLLSGS 134

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
            +G  G+ DG+   AR ++P+  A+D +GNIYVAD  N  IRKI++ GV+TIA GG    
Sbjct: 135 QEGYTGHVDGKLKEARMNRPRGLAMDDRGNIYVADTINMAIRKISDDGVSTIAAGGRWSG 194

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
           G  +   ++  FS+DF+L +V   C+LL+ D GNQLI++I L   DCS+
Sbjct: 195 GSKE---ESMRFSDDFDLIYVSSSCSLLVIDRGNQLIKEIQLHDHDCSQ 240


>gi|224140207|ref|XP_002323476.1| predicted protein [Populus trichocarpa]
 gi|222868106|gb|EEF05237.1| predicted protein [Populus trichocarpa]
          Length = 453

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 85/201 (42%), Positives = 119/201 (59%), Gaps = 5/201 (2%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E GYTV TV DG +L I P+SV   P S +L++LDS  S  Y +S   S+    K
Sbjct: 7   SMMKFEGGYTVETVFDGSKLGIEPYSVEVSP-SGELLLLDSENSNIYKISTQFSKYGRPK 65

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
            + G  +G  G+ DG+   AR + PK   VD  GNIYVAD  N  IRKI T+ GVTTIAG
Sbjct: 66  LITGSFEGYPGHVDGKLRDARMNHPKGLTVDEGGNIYVADTMNMAIRKISTDGGVTTIAG 125

Query: 141 G-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQS 199
           G  ++  G  DGP+++A FSNDF++ +    C+LLI D G+Q IR+I L  +DC+     
Sbjct: 126 GKWARGGGHVDGPSEDAKFSNDFDVVYSASSCSLLIIDRGSQAIREIQLHDDDCNYPHDD 185

Query: 200 GSALGAVSVWVLVSVLSCLVS 220
              L   ++ + ++ L+ LV+
Sbjct: 186 CFHLDLDNILINIAGLAVLVA 206


>gi|297807503|ref|XP_002871635.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317472|gb|EFH47894.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 704

 Score =  134 bits (337), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 70/168 (41%), Positives = 104/168 (61%), Gaps = 5/168 (2%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GY + TV DG +L I P+++   P   +LIVLDS  S  + +S PLS  +  K ++G 
Sbjct: 75  ESGYNIETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYAKPKLVSGS 134

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
            +G  G+ DG+   A+ ++P+  A+D  GNIYVAD +N  IRKI++ GV+TI  GG    
Sbjct: 135 QEGYTGHVDGKLKEAKMNRPRGLAIDDSGNIYVADTNNMAIRKISDDGVSTITAGGRWSG 194

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           G  +   ++  FS+DF+L +V   C+LL+ D GNQ+I++I L   DCS
Sbjct: 195 GSKE---ESMRFSDDFDLIYVSSSCSLLVIDRGNQMIKEIQLHDHDCS 239


>gi|317015034|gb|ADU85604.1| NHL repeat-containing protein, partial [Wolffia arrhiza]
          Length = 239

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 76/174 (43%), Positives = 109/174 (62%), Gaps = 3/174 (1%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
            E GY V T+ DG++L + PH++   P   +LI +DS  S    ++ PLS  S  + +AG
Sbjct: 60  FESGYLVETIADGNRLGLTPHAIRVSP-DGELIAVDSVNSNIVRITPPLSAFSRGRLVAG 118

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
              G  G  DG+P  ARF++P+  AVD +GNIYVAD SN  IRKI + GV+TIAGG +  
Sbjct: 119 SFQGRSGLIDGKPSEARFNQPRGVAVDRRGNIYVADVSNLAIRKIGDSGVSTIAGGKAGA 178

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQS 199
            G  DGP++ A FS DF++ +V  +C+LL+ D GN  +R+I L  +DC++   S
Sbjct: 179 AGFRDGPSEEARFSADFDVVYVKSLCSLLVVDRGNAALRKIFLHDDDCTQDFSS 232


>gi|302768697|ref|XP_002967768.1| hypothetical protein SELMODRAFT_67906 [Selaginella moellendorffii]
 gi|300164506|gb|EFJ31115.1| hypothetical protein SELMODRAFT_67906 [Selaginella moellendorffii]
          Length = 186

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 72/180 (40%), Positives = 106/180 (58%), Gaps = 6/180 (3%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
           L+ + +L+E G+TV T+ D  +   N  ++   P +  ++ LDSS +  + L  PLS+ S
Sbjct: 7   LLKTFVLVEPGFTVKTLFDLGKHSANAFALYPDPRNKFVLALDSSGNRIWKLRLPLSQNS 66

Query: 81  VVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
            ++  AG   G  GY DG    + F++P+S A+   G ++VAD  N  IRKI+  G VTT
Sbjct: 67  SLEAFAGSFVGESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKISKDGEVTT 126

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDCSKS 196
           IAGG S+K G ADGP   A FS++F L       +LLI+D GN+LIR+I +  P+ C  S
Sbjct: 127 IAGGSSRKPGFADGPGDTARFSSEFSLAC--SCGSLLIADRGNRLIREIQIDDPKSCDSS 184


>gi|168018878|ref|XP_001761972.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686689|gb|EDQ73076.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 386

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 72/167 (43%), Positives = 100/167 (59%), Gaps = 11/167 (6%)

Query: 31  GYTVTTVIDGHQLEINPHSV---IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
           GY VTTV++G+   ++ + +    DR   +  IVLDS++S    +  PLS++SVV+ +AG
Sbjct: 44  GYDVTTVLNGNLRGLSFYCIDEATDR-APAWAIVLDSTKSKVLRVQLPLSQDSVVEHIAG 102

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTTIAGG 141
             +G  GY DG  G A F+ PK   +D  GNIYVAD  N  IR IT  G    VTTIA G
Sbjct: 103 SLEGKAGYQDGRGGDALFNHPKMLTLDSDGNIYVADVRNTAIRMITTQGKHSFVTTIA-G 161

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           G  + G  DG  +  +FSNDF +T++   C LLI D GN+++R + L
Sbjct: 162 GMNRTGHNDGEGRVVTFSNDFGVTYLRKNCTLLIVDRGNRMVRAMKL 208


>gi|357516309|ref|XP_003628443.1| hypothetical protein MTR_8g058300 [Medicago truncatula]
 gi|355522465|gb|AET02919.1| hypothetical protein MTR_8g058300 [Medicago truncatula]
          Length = 160

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 59/122 (48%), Positives = 76/122 (62%), Gaps = 1/122 (0%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           G  G+ DG+   ARF  PK  A+D KGN+YVAD  N  IRKI + GVTTIAGG S   G 
Sbjct: 24  GRTGHVDGKLSDARFHYPKGIALDDKGNVYVADTQNMAIRKIGDAGVTTIAGGKSNVAGY 83

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSV 208
            DGP ++A  SNDF++ ++   C+LL+ D GN  +RQI L  EDC+  S S S  G+   
Sbjct: 84  RDGPGEDAKLSNDFDVVYIRPTCSLLVIDRGNAALRQIFLNQEDCNYQSSSISLTGS-EF 142

Query: 209 WV 210
           W+
Sbjct: 143 WI 144


>gi|125561542|gb|EAZ06990.1| hypothetical protein OsI_29235 [Oryza sativa Indica Group]
          Length = 498

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 58/100 (58%), Positives = 72/100 (72%), Gaps = 3/100 (3%)

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
            TTIAGG SK  GR DGPAQNA+FS DFEL +VP +CALLI+D GN+LIRQINLK EDC+
Sbjct: 222 TTTIAGGRSKGPGRKDGPAQNATFSPDFELVYVPKMCALLITDRGNRLIRQINLKREDCA 281

Query: 195 KSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRH 234
           + +Q G  LG  SV  +++VL  L+  V+GF  R +   H
Sbjct: 282 RETQPG--LGTTSV-SIIAVLCALLGSVVGFSVRHFYPAH 318


>gi|357516379|ref|XP_003628478.1| hypothetical protein MTR_8g058630 [Medicago truncatula]
 gi|355522500|gb|AET02954.1| hypothetical protein MTR_8g058630 [Medicago truncatula]
          Length = 150

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 58/116 (50%), Positives = 73/116 (62%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           G  G+ DG+   ARF  PK  A+D KGN+YVAD  N  IRKI + GVTTIAGG S   G 
Sbjct: 24  GRTGHVDGKLSDARFHYPKGIALDDKGNVYVADTQNMAIRKIGDAGVTTIAGGKSNVAGY 83

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
            DGP ++A  SNDF++ ++   C+LL+ D GN  +RQI L  EDC+  S S S  G
Sbjct: 84  RDGPGEDAKLSNDFDVVYIRPTCSLLVIDRGNAALRQIFLNQEDCNYQSSSISLTG 139


>gi|302821461|ref|XP_002992393.1| hypothetical protein SELMODRAFT_47902 [Selaginella moellendorffii]
 gi|300139809|gb|EFJ06543.1| hypothetical protein SELMODRAFT_47902 [Selaginella moellendorffii]
          Length = 183

 Score =  103 bits (258), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 68/180 (37%), Positives = 102/180 (56%), Gaps = 9/180 (5%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
           L+ + +L+E G+TV T+ D  +   N  ++   P    ++ LDSS +  + L  PLS+ S
Sbjct: 7   LLKTFVLVEPGFTVKTLFDLGKHSANAFALYPDPRKKFVLALDSSGNRIWKLRLPLSQNS 66

Query: 81  VVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
            ++  AG   G  GY DG    + F++P+S ++   G ++VAD  N  IRKI+  G VTT
Sbjct: 67  SLEAFAGSFVGESGYVDGPAAKSLFNRPQSLSICDNGAVFVADTRNLAIRKISKDGEVTT 126

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDCSKS 196
           IAGG S+K G A      A FS++F L       +LLI+D GN+LIR+I +  P+ C  S
Sbjct: 127 IAGGSSRKPGFA---GDTARFSSEFSL--ACSCGSLLIADCGNRLIREIQIDDPKSCDSS 181


>gi|302791603|ref|XP_002977568.1| hypothetical protein SELMODRAFT_417419 [Selaginella moellendorffii]
 gi|300154938|gb|EFJ21572.1| hypothetical protein SELMODRAFT_417419 [Selaginella moellendorffii]
          Length = 288

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 73/201 (36%), Positives = 107/201 (53%), Gaps = 25/201 (12%)

Query: 31  GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
           G+TV T+ D  +  +N   +   P +  ++  DSS +  + L  PLS+ S ++  AG   
Sbjct: 38  GFTVKTLFDLGKHSVNAFGLYPDPRNKFMLAPDSSGNRIWKLRLPLSQNSSLEAFAG-WE 96

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
            GY DG    + F++P+S A+   G I+VAD +   IRKI+   VTTIAGG S+K G AD
Sbjct: 97  SGYIDGPAAKSLFNRPQSLAMCGNGAIFVAD-TRMAIRKISK--VTTIAGGSSRKPGIAD 153

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDC--SKSSQSGSA----- 202
           G A+   FS++F L       +L I+D GN LIR+I +  P+ C  S S+ SGS      
Sbjct: 154 GTAR---FSSEFSL--ACSCGSLPIADRGNWLIREIQIDDPKSCDSSDSTVSGSQKCVVL 208

Query: 203 --------LGAVSVWVLVSVL 215
                   LGA  +W ++  L
Sbjct: 209 LGVCLGIPLGAFIIWKILDQL 229


>gi|238009310|gb|ACR35690.1| unknown [Zea mays]
          Length = 340

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 51/118 (43%), Positives = 70/118 (59%), Gaps = 14/118 (11%)

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           GN+YVAD +N  IRKI   GVTTIAGG S   G  DGP+++A FS DF++ +V  +C+LL
Sbjct: 2   GNVYVADTANLAIRKIGESGVTTIAGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLL 61

Query: 175 ISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
           + D GN  +R+I+L  EDC+              +   ++LS  + LVIG V   YI 
Sbjct: 62  VIDRGNAALRKISLPQEDCT--------------YQDSALLSSDLILVIGAVVAGYIF 105


>gi|413944258|gb|AFW76907.1| hypothetical protein ZEAMMB73_849037 [Zea mays]
          Length = 187

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 53/108 (49%), Positives = 65/108 (60%), Gaps = 3/108 (2%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GY V TV DG +L I PH+V   P + +L+VLDS  S  Y +  PLS  S  K LAG 
Sbjct: 73  EGGYAVDTVFDGSKLGIEPHAVEITP-AGNLLVLDSINSNIYRVQLPLSPYSRPKLLAGS 131

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
             G+ G+ DG    AR + PK F VD +GNIYVAD  N  IRKI++ G
Sbjct: 132 PKGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADGMNMAIRKISDTG 179


>gi|302768701|ref|XP_002967770.1| hypothetical protein SELMODRAFT_17931 [Selaginella moellendorffii]
 gi|300164508|gb|EFJ31117.1| hypothetical protein SELMODRAFT_17931 [Selaginella moellendorffii]
          Length = 128

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 54/123 (43%), Positives = 75/123 (60%), Gaps = 7/123 (5%)

Query: 79  ESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--LG 134
           +S ++ LAG   G  GY DG    + F++P+S A+   G ++VAD  N  IRKI+    G
Sbjct: 3   DSSLEALAGSLVGESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKISKDGEG 62

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDC 193
           +TTIAGG S+K G ADGP   A FS++F L       +LLI+D GN+LIR+I +  P+ C
Sbjct: 63  MTTIAGGSSRKPGFADGPGDTARFSSEFRLAC--SCGSLLIADRGNRLIREIQIDDPKSC 120

Query: 194 SKS 196
             S
Sbjct: 121 DSS 123


>gi|195609352|gb|ACG26506.1| hypothetical protein [Zea mays]
 gi|414870563|tpg|DAA49120.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 159

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 52/117 (44%), Positives = 73/117 (62%), Gaps = 5/117 (4%)

Query: 27  LLEDGYTVTTVIDGHQLEIN---PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +LEDGYTVTT  D + L  +   P++V+ R  + DL++LDS+ SA YT+S   S     +
Sbjct: 28  VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISL-SSSPGEPR 86

Query: 84  RLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
           RLAG     G++DG PG A FD+P+S AVD   N+YVAD+ +  +RK+   G  T A
Sbjct: 87  RLAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGERTRA 143


>gi|373488690|ref|ZP_09579354.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
 gi|372005635|gb|EHP06271.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
          Length = 438

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/138 (39%), Positives = 73/138 (52%), Gaps = 3/138 (2%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRL-AG-DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L V D++R     L  P S+ +V + L AG DG+ G  D    SA F+ P+  A+D  GN
Sbjct: 132 LYVSDATRHTISKL-VPNSDGTVTQSLLAGSDGISGSEDKSGSSASFNSPEGLALDASGN 190

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
           +YVAD  NH IRKI  LG  T   G ++  G  DG   +A F++   L F      L ++
Sbjct: 191 LYVADYGNHTIRKIDTLGAVTTFAGQAEASGTVDGDRLSARFNHPIGLAFNASYSVLYVA 250

Query: 177 DHGNQLIRQINLKPEDCS 194
           D GN  IR IN+K +  S
Sbjct: 251 DSGNHTIRAINIKSQTVS 268



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 3/107 (2%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
           S +   AGD   G ++G    ARF++P    +   G + VAD  NH IRK++  G  +  
Sbjct: 321 STLAGYAGD--TGTANGTGNGARFNQPVGITLHTSGYLLVADAYNHAIRKVSTSGSVSTL 378

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G S   G  D     A FS    +  V       ++D+ N LIR I
Sbjct: 379 AGESGVSGNEDESGSEAHFSRPSNIC-VDSSGIAYVTDYKNGLIRTI 424



 Score = 40.0 bits (92), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 50/111 (45%), Gaps = 8/111 (7%)

Query: 96  GEPG--SARFDKPKSFAVDMKGNI---YVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
           G PG  S+    P   AV    ++   YV D  +  IRKIT+ G  +   G +   G A+
Sbjct: 275 GSPGTSSSMLSSPNGVAVSTSDDVDTLYVTDYGSSTIRKITSSGGISTLAGYAGDTGTAN 334

Query: 151 GPAQNASFSNDFELTFVPHICA-LLISDHGNQLIRQINLKPEDCSKSSQSG 200
           G    A F+    +T   H    LL++D  N  IR+++      + + +SG
Sbjct: 335 GTGNGARFNQPVGITL--HTSGYLLVADAYNHAIRKVSTSGSVSTLAGESG 383


>gi|255531264|ref|YP_003091636.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
 gi|255344248|gb|ACU03574.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
          Length = 439

 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 53/142 (37%), Positives = 78/142 (54%), Gaps = 7/142 (4%)

Query: 64  SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
           S+    +T     + E +V   AGDG  G  +G    A+F  P+  AVD +GN+ VAD+ 
Sbjct: 101 SAEGPVFTYKAAPASEYIVSTFAGDGTAGLFNGAAADAQFRNPEGLAVDAQGNVIVADRQ 160

Query: 124 NHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
           NH IRKI+  G VTTIAG G+   G ADG    A FS+ ++L   P +  ++++D  N  
Sbjct: 161 NHTIRKISPAGQVTTIAGDGT--AGYADGTGTAAKFSSPWKLAIDP-LGNIIVADRDNFK 217

Query: 183 IRQINLKPEDCSKSSQSGSALG 204
           IR+I     D + ++ +GS  G
Sbjct: 218 IRKI---APDGAVTTLAGSTAG 236



 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 62/112 (55%), Gaps = 4/112 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +S    V  LAGDG  GY D     A+F  P    VD +GNI VAD+ NH IRKIT  G 
Sbjct: 274 VSPSGTVTTLAGDGTAGYLDATGVQAKFRNPSGLTVDQQGNIIVADRLNHRIRKITPAGT 333

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           V+TIAG G+   G  DG A  A F++ + +  V     +L+++  N  IR+I
Sbjct: 334 VSTIAGAGT--TGLLDGDALTAKFADPYGVA-VDAGGNILVAELTNARIRKI 382



 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 42/112 (37%), Positives = 61/112 (54%), Gaps = 5/112 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++ +  V  LAG    G++DG   +A+F +P     D +GNIYVAD + H IRK++  G 
Sbjct: 221 IAPDGAVTTLAGS-TAGFADGTGSAAKFMQPLDVVADAQGNIYVADNTAHRIRKVSPSGT 279

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           VTT+AG G+   G  D     A F N   LT V     ++++D  N  IR+I
Sbjct: 280 VTTLAGDGT--AGYLDATGVQAKFRNPSGLT-VDQQGNIIVADRLNHRIRKI 328



 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 63/114 (55%), Gaps = 6/114 (5%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++    V  +AG G  G  DG+  +A+F  P   AVD  GNI VA+ +N  IRKIT +G 
Sbjct: 328 ITPAGTVSTIAGAGTTGLLDGDALTAKFADPYGVAVDAGGNILVAELTNARIRKITPVGQ 387

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSN--DFELTFVPHICALLISDHGNQLIRQI 186
           V+T+AG      G ADG + NA F+   D ++    +I    +++H  ++IR I
Sbjct: 388 VSTLAG---SSAGFADGLSVNAKFNQPTDLDIDAKGNIYVAEVTNHRIRMIRLI 438


>gi|414870491|tpg|DAA49048.1| TPA: hypothetical protein ZEAMMB73_516870 [Zea mays]
          Length = 271

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 63/200 (31%), Positives = 100/200 (50%), Gaps = 42/200 (21%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GYTV  ++D  +L I P+S+    GS +++++DS+    Y+++ PLS +S  K +AG 
Sbjct: 105 ECGYTVEKILDYSKLGIEPYSMEVTQGS-EILIMDSN---IYSMALPLSRDSRPKLVAGS 160

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
            +G  G+ DG     R   PK   VD KG                             + 
Sbjct: 161 LEGFPGHVDGNLREGRIYHPKGVTVDDKG-----------------------------RG 191

Query: 147 GRADGPAQNASFSNDFELTF-VPHICALLISDHGNQLIRQINLKPEDCSKSSQS----GS 201
           G+ DGP+  A  S DFE+ + +   C+LL+ D GNQ IR+I L  +DC    ++    G 
Sbjct: 192 GQVDGPSDEAKSSTDFEVCYYIGSSCSLLVIDRGNQTIREIQLHFDDCVYQHEADFPLGV 251

Query: 202 ALGAVSVWV--LVSVLSCLV 219
           AL AV+ ++  ++++L C V
Sbjct: 252 ALLAVAAFLGYMLALLQCQV 271


>gi|431798812|ref|YP_007225716.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
 gi|430789577|gb|AGA79706.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
          Length = 428

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 61/172 (35%), Positives = 90/172 (52%), Gaps = 15/172 (8%)

Query: 37  VIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           +IDG  ++    NP  V   P    LIV D + ++       ++ +  V  + G G  G+
Sbjct: 121 LIDGSGIQAAFRNPEGVTMHPDGY-LIVTDRANNSIR----KVTTDGAVSTVLGTGNSGF 175

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGP 152
            +G   SA  D P    VDM+GNIYVAD+ NH+IRKI   G V+T+AG G  + G ADGP
Sbjct: 176 QNGPVASALLDYPWKSCVDMEGNIYVADRDNHMIRKIDPQGMVSTVAGTG--EAGFADGP 233

Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
           A+ A F    ++        L ++D+ N  IR+I +   D + S+ +GS  G
Sbjct: 234 AEEAQFDQPLDIAVTAE-GVLYVTDNRNHRIRKIEV---DGTVSTVAGSEQG 281



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 56/105 (53%), Gaps = 5/105 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
           V  +AGDG QG  DG+  +A+F+ P   +V   G + VAD SNH IR I    V TIAG 
Sbjct: 326 VSTVAGDGSQGTRDGQVMTAQFNNPYGISVADNGQLVVADLSNHKIRLIQGENVITIAG- 384

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
                G  DG    + F N  ++T+  H   + ++D GN  +R+I
Sbjct: 385 --SVAGFLDGVGVTSQFYNPTDVTY--HDGVIYVADLGNHRVRKI 425



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/98 (43%), Positives = 55/98 (56%), Gaps = 5/98 (5%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEGR 148
           QG  DG    A F  P    VD  GNIYVAD+ NH+IRKI  N G V+T+AG GS  +G 
Sbjct: 280 QGNQDGALEEATFRYPSGLDVDDMGNIYVADRINHLIRKIDLNAGQVSTVAGDGS--QGT 337

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            DG    A F+N + ++ V     L+++D  N  IR I
Sbjct: 338 RDGQVMTAQFNNPYGIS-VADNGQLVVADLSNHKIRLI 374



 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 55/108 (50%), Gaps = 4/108 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           VV  LAG    G  DG    A F  P+   +   G + V D++N+ IRK+T  G V+T+ 
Sbjct: 109 VVSTLAGSSDYGLIDGSGIQAAFRNPEGVTMHPDGYLIVTDRANNSIRKVTTDGAVSTVL 168

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G G+   G  +GP  +A     ++ + V     + ++D  N +IR+I+
Sbjct: 169 GTGNS--GFQNGPVASALLDYPWK-SCVDMEGNIYVADRDNHMIRKID 213


>gi|302821467|ref|XP_002992396.1| hypothetical protein SELMODRAFT_135182 [Selaginella moellendorffii]
 gi|300139812|gb|EFJ06546.1| hypothetical protein SELMODRAFT_135182 [Selaginella moellendorffii]
          Length = 161

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/143 (37%), Positives = 78/143 (54%), Gaps = 12/143 (8%)

Query: 74  FPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           FPL  +S ++  AG   G  GY DG    + F++P+S ++   G ++VAD  N  IRKI+
Sbjct: 14  FPL--DSSLEAFAGSFVGESGYVDGPAAKSLFNRPQSLSICDNGAVFVADTRNLAIRKIS 71

Query: 132 NL--GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL- 188
                VTTIAGG S+K G A      A FS++F L       +LLI+D GN+LIR+I + 
Sbjct: 72  KFLPTVTTIAGGSSRKPGFA---GDTARFSSEFSL--ACSCGSLLIADCGNRLIREIQID 126

Query: 189 KPEDCSKSSQSGSALGAVSVWVL 211
            P+ C  S  + S   +   W+ 
Sbjct: 127 DPKSCDSSDSAVSGDFSSFTWIF 149


>gi|330824371|ref|YP_004387674.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
 gi|329309743|gb|AEB84158.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
          Length = 362

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 67/106 (63%), Gaps = 4/106 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V  LAG+G  G++DG+  +ARF++P +  +D +G ++V D+ NH +R++T  G VTT+AG
Sbjct: 199 VDTLAGNGAPGFADGQRATARFNQPYNVRIDGQGRLWVPDQLNHAVRRVTPAGEVTTVAG 258

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G  K G ADGPA    F N   +  +P+  A++++D  N  +R +
Sbjct: 259 AG--KAGYADGPAATVRFDNPTGVAPLPN-GAVVVADRNNNRLRLV 301



 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 66/114 (57%), Gaps = 3/114 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           L     V  +AG G  G++DG   +ARF++P + AV+  G +YVAD++NH IRKI   G 
Sbjct: 83  LRSGGTVATVAGTGKAGFADGPAHAARFNEPIAVAVERSGAVYVADRNNHRIRKIRPDGT 142

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           V T+AGG S   G  DG  ++A F+  + +        L ++D+ N  IR+I+L
Sbjct: 143 VITLAGGDS--AGFVDGDLKSARFNQPYGVALDAAQTTLYVADYLNHAIRRIDL 194



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 72/137 (52%), Gaps = 13/137 (9%)

Query: 71  TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           TL FP      V+ LAGDG  G +DG PG  R ++P   +    G++Y AD+ NH +R +
Sbjct: 31  TLPFPE-----VRTLAGDGQPGAADGSPG--RVNRPHGLSYGNDGSLYFADRGNHQVRVL 83

Query: 131 TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            + G V T+AG G  K G ADGPA  A F+    +  V    A+ ++D  N  IR+I  +
Sbjct: 84  RSGGTVATVAGTG--KAGFADGPAHAARFNEPIAVA-VERSGAVYVADRNNHRIRKI--R 138

Query: 190 PEDCSKSSQSGSALGAV 206
           P+    +   G + G V
Sbjct: 139 PDGTVITLAGGDSAGFV 155



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 62/115 (53%), Gaps = 6/115 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V  +AG G  GY+DG   + RFD P   A    G + VAD++N+ +R +T  G V T+AG
Sbjct: 253 VTTVAGAGKAGYADGPAATVRFDNPTGVAPLPNGAVVVADRNNNRLRLVTPDGAVATLAG 312

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
            G  + G ADG A +A F+   ++ F   +  +L+S+     +R +   P+D  +
Sbjct: 313 AG--EAGFADGTAASARFNQPLDVEFDDSMSRVLVSEDKGHRLRVL---PKDTRR 362


>gi|223935789|ref|ZP_03627704.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223895390|gb|EEF61836.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 755

 Score = 77.4 bits (189), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 56/137 (40%), Positives = 77/137 (56%), Gaps = 8/137 (5%)

Query: 69  FYTLSF-PLSEES--VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH 125
           F +L+F PLS      V  LAG+  QG +DG   SARF+ P   AVD  GN+YVAD +NH
Sbjct: 11  FLSLAFVPLSHSQPLTVSTLAGNAGQGSADGNNSSARFNLPGGVAVDKTGNLYVADTANH 70

Query: 126 VIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
            IRKI+   V+T A G +   G A+G    A F N  +   V     + ++D GN +IR+
Sbjct: 71  TIRKISGGVVSTFA-GLAGVSGSANGKGSAARF-NQPQGVAVDTNGIVYVADTGNHIIRK 128

Query: 186 INLKPEDCSKSSQSGSA 202
           I L   D + S+ +G+A
Sbjct: 129 IAL---DGTVSTLAGAA 142



 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 48/132 (36%), Positives = 70/132 (53%), Gaps = 7/132 (5%)

Query: 61  VLDSSRSAFYTLSF-----PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
            +D+S + F + +F      ++    V  LAG  G  G ++G  G A+F  P+  AVD  
Sbjct: 377 AVDASGNVFVSDTFNHTIRKITAAGTVSTLAGTAGSSGTNNGVGGGAQFYAPQGIAVDTG 436

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           GN YVAD +N+VIRK+T+ G  T   G +  EG+ DG   NA FS   +   +     + 
Sbjct: 437 GNAYVADTANNVIRKVTSGGTVTTLAGTAGVEGQGDGTGSNAQFSGP-QAVALDGAANVY 495

Query: 175 ISDHGNQLIRQI 186
           +SD GN  IR+I
Sbjct: 496 VSDTGNHTIRKI 507



 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 56/117 (47%), Gaps = 13/117 (11%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           +S    V  LAG    G +DG  GSA+F  PK  AVD  GN++V+D  NH IRKIT  G 
Sbjct: 343 ISSGGAVTTLAGSASIGNADGTGGSAKFYWPKGTAVDASGNVFVSDTFNHTIRKITAAGT 402

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL------LISDHGNQLIRQI 186
            +   G +   G  +G    A F       + P   A+       ++D  N +IR++
Sbjct: 403 VSTLAGTAGSSGTNNGVGGGAQF-------YAPQGIAVDTGGNAYVADTANNVIRKV 452



 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 67/138 (48%), Gaps = 8/138 (5%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKS 108
            +D+ G  +L V D++      +S       VV   AG  GV G ++G+  +ARF++P+ 
Sbjct: 55  AVDKTG--NLYVADTANHTIRKIS-----GGVVSTFAGLAGVSGSANGKGSAARFNQPQG 107

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
            AVD  G +YVAD  NH+IRKI   G  +   G +   G  +    NA F     +    
Sbjct: 108 VAVDTNGIVYVADTGNHIIRKIALDGTVSTLAGAAGNPGTLNATGTNAQFYEPEAVAVNG 167

Query: 169 HICALLISDHGNQLIRQI 186
           +   + ++D  N  IRQ+
Sbjct: 168 NGSLIYVADTWNHEIRQV 185



 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 41/111 (36%), Positives = 53/111 (47%), Gaps = 13/111 (11%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           V  LAG  G  G ++G   +A+F +P   AV   G +YVAD  NH IR +T+ GV T   
Sbjct: 240 VTTLAGSPGNYGSTNGTGSAAQFYQPMGVAVAANGTVYVADNLNHTIRAVTSGGVVTTLA 299

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQI 186
           G +   G  DG   NA F       + P   A     + + D GN  IRQI
Sbjct: 300 GLAGNYGSKDGTGSNARF-------YAPQGVAVSGSTVFVVDTGNGTIRQI 343



 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 48/103 (46%), Gaps = 11/103 (10%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF- 158
           +ARF  P   AVD  GN+YVAD  NH IRKIT  G  +   G     G ADG  ++A F 
Sbjct: 537 TARFYSPSGLAVDSSGNVYVADTGNHTIRKITADGSVSTLAGLPGVWGNADGTNRDARFF 596

Query: 159 ----------SNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
                      N F +    H   +LI+   N ++  I  +P+
Sbjct: 597 QPEGISIDSQGNLFVMDSGNHTMRMLIASGTNWIVTTIAGQPD 639



 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 42/120 (35%), Positives = 58/120 (48%), Gaps = 19/120 (15%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---- 130
           ++ +  V  LAG  GV G +DG    ARF +P+  ++D +GN++V D  NH +R +    
Sbjct: 567 ITADGSVSTLAGLPGVWGNADGTNRDARFFQPEGISIDSQGNLFVMDSGNHTMRMLIASG 626

Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL------LISDHGNQLIR 184
           TN  VTTIA G     G ADG    A F       + P    L       ++D GN  IR
Sbjct: 627 TNWIVTTIA-GQPDLGGAADGTGNGAQF-------YYPGGLGLNNSGFFAVADSGNNTIR 678



 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 56/117 (47%), Gaps = 7/117 (5%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++    V  LAG  GV+G  DG   +A+F  P++ A+D   N+YV+D  NH IRKI+  G
Sbjct: 452 VTSGGTVTTLAGTAGVEGQGDGTGSNAQFSGPQAVALDGAANVYVSDTGNHTIRKISPGG 511

Query: 135 VTTIAGGGSKKEGRADGPAQN-----ASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             T   G     G  D    N     A F +   L  V     + ++D GN  IR+I
Sbjct: 512 AVTTFAGFPGHPGNLDSNMDNNGTNTARFYSPSGLA-VDSSGNVYVADTGNHTIRKI 567



 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 29/74 (39%), Positives = 44/74 (59%), Gaps = 15/74 (20%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS-------ARFDKPKSFAVDMKGN-IYVADKSNHVI 127
           ++ +  V  LAG      + G PG+       A+F +P++ AV+  G+ IYVAD  NH I
Sbjct: 129 IALDGTVSTLAG------AAGNPGTLNATGTNAQFYEPEAVAVNGNGSLIYVADTWNHEI 182

Query: 128 RKITNLG-VTTIAG 140
           R++T+ G VTT+AG
Sbjct: 183 RQVTSAGVVTTLAG 196



 Score = 40.4 bits (93), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 40/95 (42%), Gaps = 5/95 (5%)

Query: 96  GEPG----SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G PG     + F +P+  AV   GNIYVAD  N  IR I   G  T   G     G  +G
Sbjct: 196 GTPGVIGTGSLFYQPQGIAVGSDGNIYVADTGNGTIRVIPPGGSVTTLAGSPGNYGSTNG 255

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               A F     +    +   + ++D+ N  IR +
Sbjct: 256 TGSAAQFYQPMGVAVAAN-GTVYVADNLNHTIRAV 289


>gi|124002516|ref|ZP_01687369.1| hypothetical protein M23134_05219 [Microscilla marina ATCC 23134]
 gi|123992345|gb|EAY31713.1| hypothetical protein M23134_05219 [Microscilla marina ATCC 23134]
          Length = 2385

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 51/140 (36%), Positives = 71/140 (50%), Gaps = 8/140 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-----GVQGYSDGEPGSARFDKPKSFA 110
           + +L V D +  A   ++ P     VV+ +AG+      V G   G    A+F++P   A
Sbjct: 663 AGNLYVADKNNHAIRMITNPSGGSPVVRTIAGNSSYPTAVSGNVTGALAVAKFNEPSGVA 722

Query: 111 VDMKGNIYVADKSNHVIRKITNLGVTTIAG---GGSKKEGRADGPAQNASFSNDFELTFV 167
           VD  GNIYVADK+NH I+KI N  VTT+AG     +   GR DG A  A F     +   
Sbjct: 723 VDAAGNIYVADKNNHRIKKIANGMVTTLAGPMNDAASIPGRTDGAADAARFFFPTSVALD 782

Query: 168 PHICALLISDHGNQLIRQIN 187
                L ++D  N +IRQ+N
Sbjct: 783 ITGAQLYVADKLNNIIRQVN 802



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 43/113 (38%), Positives = 56/113 (49%), Gaps = 9/113 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-----VT 136
           V  +AG GV G+ D     A+F+ P   A+DM GN+YVADK+NH IR ITN       V 
Sbjct: 631 VSIVAGSGVAGFKDDNGSLAQFNYPSGLALDMAGNLYVADKNNHAIRMITNPSGGSPVVR 690

Query: 137 TIAGGGSKK---EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           TIAG  S      G   G    A F N+     V     + ++D  N  I++I
Sbjct: 691 TIAGNSSYPTAVSGNVTGALAVAKF-NEPSGVAVDAAGNIYVADKNNHRIKKI 742



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 43/76 (56%), Gaps = 3/76 (3%)

Query: 86  AGD---GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           AGD   G+ G+ DG   SA+F  P    V+  G++Y+AD  N VIRKI+   V TIAG  
Sbjct: 812 AGDVANGIAGHQDGNAASAKFRSPAGITVNAVGDVYIADTHNQVIRKISQGQVITIAGEV 871

Query: 143 SKKEGRADGPAQNASF 158
              +  +D  A++A F
Sbjct: 872 EVADDDSDVLARDAKF 887



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 37/102 (36%), Positives = 57/102 (55%), Gaps = 5/102 (4%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           + G +DG   +ARF  P S A+D+ G  +YVADK N++IR++      T+   G    G 
Sbjct: 760 IPGRTDGAADAARFFFPTSVALDITGAQLYVADKLNNIIRQVNTADGHTLTYAGDVANGI 819

Query: 149 A---DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           A   DG A +A F +   +T V  +  + I+D  NQ+IR+I+
Sbjct: 820 AGHQDGNAASAKFRSPAGIT-VNAVGDVYIADTHNQVIRKIS 860


>gi|296136309|ref|YP_003643551.1| NHL repeat containing protein [Thiomonas intermedia K12]
 gi|295796431|gb|ADG31221.1| NHL repeat containing protein [Thiomonas intermedia K12]
          Length = 366

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 46/117 (39%), Positives = 66/117 (56%), Gaps = 2/117 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S + VV  LAG  G  G+ DG    ARF+ P    +D +G IYV++  N+VIRKIT  G
Sbjct: 196 ISAKGVVSTLAGSPGDTGWRDGRGAQARFNTPVGLTLDAQGQIYVSEYFNNVIRKITPDG 255

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
             T   G   K G ADG  + A F +   L+F P   +L+++D GN  +R+I+ + E
Sbjct: 256 TVTTFAGKPGKGGFADGKVEEAQFLHPQTLSFAPD-GSLIVADTGNNRVRRISPQGE 311



 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 57/112 (50%), Gaps = 2/112 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S E  V  LAG   V+G++DG    ARF+ P    V+ KG +YVAD  N  +R+I+  G
Sbjct: 141 ISREGRVSTLAGKLEVEGFADGTGQQARFNHPVGLGVNAKGVVYVADAYNSTVRRISAKG 200

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           V +   G     G  DG    A F+    LT       + +S++ N +IR+I
Sbjct: 201 VVSTLAGSPGDTGWRDGRGAQARFNTPVGLTLDAQ-GQIYVSEYFNNVIRKI 251



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 1/95 (1%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP 152
           Y+ G+   ARF  P+  A+D +GNI+VAD  N V+RK+   G  +I  G  ++    +GP
Sbjct: 49  YTGGDFAQARFHDPRGMALDAQGNIFVADYVNSVVRKLGTDGQVSIVAGQVEQRDARNGP 108

Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           A  A F +  E   V     L +SD G+  +R I+
Sbjct: 109 ALQARFYSP-ECVAVATDGTLFVSDSGSNTVRCIS 142



 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 58/120 (48%), Gaps = 11/120 (9%)

Query: 80  SVVKRLAGDG----VQGY------SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           SVV++L  DG    V G        +G    ARF  P+  AV   G ++V+D  ++ +R 
Sbjct: 81  SVVRKLGTDGQVSIVAGQVEQRDARNGPALQARFYSPECVAVATDGTLFVSDSGSNTVRC 140

Query: 130 ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           I+  G  +   G  + EG ADG  Q A F++   L  V     + ++D  N  +R+I+ K
Sbjct: 141 ISREGRVSTLAGKLEVEGFADGTGQQARFNHPVGLG-VNAKGVVYVADAYNSTVRRISAK 199



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 44/73 (60%), Gaps = 2/73 (2%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++ +  V   AG  G  G++DG+   A+F  P++ +    G++ VAD  N+ +R+I+  G
Sbjct: 251 ITPDGTVTTFAGKPGKGGFADGKVEEAQFLHPQTLSFAPDGSLIVADTGNNRVRRISPQG 310

Query: 135 -VTTIAGGGSKKE 146
            V+T+AG G+ ++
Sbjct: 311 EVSTLAGTGASEK 323


>gi|445496654|ref|ZP_21463509.1| NHL repeat containing protein [Janthinobacterium sp. HH01]
 gi|444786649|gb|ELX08197.1| NHL repeat containing protein [Janthinobacterium sp. HH01]
          Length = 741

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/132 (40%), Positives = 70/132 (53%), Gaps = 8/132 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           + +L V D+  +A   ++     E VV  LAGDG+ G  DG   +A+F+ P   AVD  G
Sbjct: 168 AGNLYVADTGNNAIRKVT----PEGVVSTLAGDGLPGDKDGRGAAAQFNGPVGIAVDAAG 223

Query: 116 NIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
            +YVAD  N  IR+I  N  VTTIAGG   + G+ADG A  A F     L        L 
Sbjct: 224 VVYVADTYNDRIRRIAPNGDVTTIAGG--SRAGKADGAAAQALFDTPTGLALSA-AGDLY 280

Query: 175 ISDHGNQLIRQI 186
           I+D GN  IR++
Sbjct: 281 IADTGNHAIRKL 292



 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 47/132 (35%), Positives = 69/132 (52%), Gaps = 7/132 (5%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           +     LAG GV+GY++G   +A F+ P   A+D  GN+YVAD  N+ IRK+T  G V+T
Sbjct: 134 DGATTTLAG-GVEGYAEGAGKAAAFNTPSGLAIDAAGNLYVADTGNNAIRKVTPEGVVST 192

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI--NLKPEDCSK 195
           +AG G    G  DG    A F+    +  V     + ++D  N  IR+I  N      + 
Sbjct: 193 LAGDG--LPGDKDGRGAAAQFNGPVGIA-VDAAGVVYVADTYNDRIRRIAPNGDVTTIAG 249

Query: 196 SSQSGSALGAVS 207
            S++G A GA +
Sbjct: 250 GSRAGKADGAAA 261



 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 43/111 (38%), Positives = 59/111 (53%), Gaps = 6/111 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS-NHVIRKITNLGVTTIAG 140
           V  +AGDG+ G ++G   S RF  P    +D  GN+YVAD   N+ IRKI   G TT   
Sbjct: 82  VTSIAGDGLPGANNGHGRSTRFADPFGVVIDGAGNLYVADGGDNNSIRKIALDGATTTLA 141

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
           GG   EG A+G  + A+F+    L  +     L ++D GN  IR++   PE
Sbjct: 142 GGV--EGYAEGAGKAAAFNTPSGLA-IDAAGNLYVADTGNNAIRKVT--PE 187


>gi|302821973|ref|XP_002992647.1| hypothetical protein SELMODRAFT_7079 [Selaginella moellendorffii]
 gi|300139611|gb|EFJ06349.1| hypothetical protein SELMODRAFT_7079 [Selaginella moellendorffii]
          Length = 124

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 51/122 (41%), Positives = 69/122 (56%), Gaps = 9/122 (7%)

Query: 79  ESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
           +S ++  AG   G  GY DG    + F++P+S A+   G ++VAD  N  IRKI+  G V
Sbjct: 3   DSSLEAFAGSFVGESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKISKDGEV 62

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDCS 194
           TTIAGG S+K G A      A FS +F L       +LLI+D GN+LIR+I +  P  C 
Sbjct: 63  TTIAGGSSRKPGFA---GDTARFSGEFSLAC--SCGSLLIADRGNRLIREIQIDDPNSCD 117

Query: 195 KS 196
            S
Sbjct: 118 SS 119


>gi|325104251|ref|YP_004273905.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
 gi|324973099|gb|ADY52083.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
          Length = 532

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 71/214 (33%), Positives = 102/214 (47%), Gaps = 31/214 (14%)

Query: 15  LSLLFSLVSSGLLLEDGYTVTTV--------IDGHQLEI---NPHSV-IDRPGSSDLIVL 62
           L+L+  L +  +     YTVTTV        +DG+ L+    NP  + +D  G  D+I+ 
Sbjct: 6   LTLILMLGTYMVTKSQNYTVTTVAGSTTFGFVDGNGLDARFRNPDGILVDMNG--DIIIT 63

Query: 63  DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK 122
           D +  +       ++   VV  LAG GV GY++G+PG  +F+ P    VD  GNI V +K
Sbjct: 64  DRTNHSIR----KMTTAGVVSTLAGTGVSGYANGKPG--QFNTPWQSTVDAAGNIIVVEK 117

Query: 123 SNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
               IRKI   G V+TIAG GS   G +DG    A F N  +   V     + I+D  N+
Sbjct: 118 DGARIRKIALDGTVSTIAGTGS--AGYSDGAVSVARFDNALD-AVVDSDGNIFIADRNNR 174

Query: 182 LIRQI------NLKPEDCSKSSQSGSALGAVSVW 209
            +R+I      N      S  +  G+  G V VW
Sbjct: 175 RVRKITPGAGGNWTTATVSTVAGDGTTSGTV-VW 207



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 74/147 (50%), Gaps = 10/147 (6%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
           S+   V  +AG    G+ DG    ARF  P    VDM G+I + D++NH IRK+T  G V
Sbjct: 20  SQNYTVTTVAGSTTFGFVDGNGLDARFRNPDGILVDMNGDIIITDRTNHSIRKMTTAGVV 79

Query: 136 TTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP--ED 192
           +T+AG G    G A+G P Q   F+  ++ T V     +++ +     IR+I L      
Sbjct: 80  STLAGTGV--SGYANGKPGQ---FNTPWQST-VDAAGNIIVVEKDGARIRKIALDGTVST 133

Query: 193 CSKSSQSGSALGAVSVWVLVSVLSCLV 219
            + +  +G + GAVSV    + L  +V
Sbjct: 134 IAGTGSAGYSDGAVSVARFDNALDAVV 160



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 28/51 (54%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + V  +AG G  G  DG    A F++P     D  GNIYVAD  NH IRK+
Sbjct: 299 ATVTTIAGTGSTGRIDGLGNVATFNQPYDVVTDENGNIYVADNVNHSIRKL 349



 Score = 43.9 bits (102), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 57/121 (47%), Gaps = 14/121 (11%)

Query: 93  YSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--------TNLGVTTIAGG 141
           ++DGE G   +AR         D  GNI  AD S + IRKI        T   VTTIAG 
Sbjct: 248 FTDGESGKPLTARLGDVFGLNFDNDGNIIFADASFNRIRKITPGENGDWTTATVTTIAGT 307

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
           GS   GR DG    A+F+  +++        + ++D+ N  IR++   P     SS + +
Sbjct: 308 GS--TGRIDGLGNVATFNQPYDV-VTDENGNIYVADNVNHSIRKLVPSPLPVQLSSFTAT 364

Query: 202 A 202
           A
Sbjct: 365 A 365


>gi|373955434|ref|ZP_09615394.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
 gi|373892034|gb|EHQ27931.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
          Length = 459

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/112 (41%), Positives = 62/112 (55%), Gaps = 4/112 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++   V    AG G  GY DG   +A+F  PK  A D +GNIYVAD  N++IRKI+  G 
Sbjct: 189 ITAAGVTTTYAGTGTLGYLDGPAATAQFYAPKGVAADAQGNIYVADMGNNMIRKISAAGV 248

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           VTT+AG GS   G ADG   +A F +   L  V     + ++D G   IR++
Sbjct: 249 VTTLAGKGS--AGYADGTGADAVFKSPAGLA-VDASGNIYVADQGTNTIRKV 297



 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 45/113 (39%), Positives = 62/113 (54%), Gaps = 5/113 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +S   VV  LAG G  GY+DG    A F  P   AVD  GNIYVAD+  + IRK+T+ G 
Sbjct: 243 ISAAGVVTTLAGKGSAGYADGTGADAVFKSPAGLAVDASGNIYVADQGTNTIRKVTSAGV 302

Query: 135 VTTIAGGGSKKEGRADGPAQ-NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           VTT+A  G+   G+ D     +A FS+   +T V     + ++D  N  IR++
Sbjct: 303 VTTLA--GAAASGQVDATTNTDARFSSPSGVT-VDASGNVYVADLANHAIRKV 352



 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/125 (36%), Positives = 63/125 (50%), Gaps = 15/125 (12%)

Query: 69  FYTLSFPLSEESVVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           F T S   +  + V   AG+   G + +G   +A+F  P+    D +GN+YVAD  N+VI
Sbjct: 127 FKTNSATFAITATVSTFAGNANGGGFVNGTGVNAQFHNPQGICTDAQGNMYVADSYNNVI 186

Query: 128 RKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQ 181
           RKIT  GVTT    G+   G  DGPA  A F       + P   A      + ++D GN 
Sbjct: 187 RKITAAGVTTTY-AGTGTLGYLDGPAATAQF-------YAPKGVAADAQGNIYVADMGNN 238

Query: 182 LIRQI 186
           +IR+I
Sbjct: 239 MIRKI 243



 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/63 (46%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV  LAG    G  D    + ARF  P    VD  GN+YVAD +NH IRK+T+ G
Sbjct: 297 VTSAGVVTTLAGAAASGQVDATTNTDARFSSPSGVTVDASGNVYVADLANHAIRKVTSAG 356

Query: 135 VTT 137
           VTT
Sbjct: 357 VTT 359



 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 35/54 (64%), Gaps = 1/54 (1%)

Query: 78  EESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
             +++  LAG  G  G+++G   +A F+ P++  +D +GNIYV D  N++IRKI
Sbjct: 400 TTNIIYSLAGVAGTSGFANGTNINALFNGPQALTLDSQGNIYVVDYYNNMIRKI 453


>gi|255531263|ref|YP_003091635.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
 gi|255344247|gb|ACU03573.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
          Length = 646

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 45/107 (42%), Positives = 63/107 (58%), Gaps = 4/107 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           +V   AG G  G  +G   SARFD P+  A+D  GN+Y+ADK+N+VIRKIT+ G V+T A
Sbjct: 327 LVSNYAGTGTAGLVNGAKASARFDSPEGLAIDASGNMYIADKNNNVIRKITSTGTVSTFA 386

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G G    G ADG    A F+  +++  V     + ++D  N  IR+I
Sbjct: 387 GTGV--AGFADGAGSIAKFNGPWKVA-VDATGNVYVADRDNFKIRKI 430



 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 42/112 (37%), Positives = 61/112 (54%), Gaps = 5/112 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++    V   AG GV G++DG    A+F+ P   AVD  GN+YVAD+ N  IRKIT  G 
Sbjct: 376 ITSTGTVSTFAGTGVAGFADGAGSIAKFNGPWKVAVDATGNVYVADRDNFKIRKITPAGI 435

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           V+T+AG      G ADG    A F    ++   P    ++++D+ +  IR+I
Sbjct: 436 VSTLAG---STAGYADGTGSAAKFMQPLDVAIDPS-GNVIVADNTSHRIRKI 483



 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 61/112 (54%), Gaps = 5/112 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++   +V  LAG    GY+DG   +A+F +P   A+D  GN+ VAD ++H IRKIT  G 
Sbjct: 430 ITPAGIVSTLAGS-TAGYADGTGSAAKFMQPLDVAIDPSGNVIVADNTSHRIRKITAAGV 488

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           VTTIAG G+   G  +G    A F N   +  V     + ++D  N  IR+I
Sbjct: 489 VTTIAGNGT--AGYTNGTGTAAQFKNPSGVD-VDASGNIYVADRLNHRIRKI 537



 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/60 (48%), Positives = 38/60 (63%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           ++   VV  +AG+G  GY++G   +A+F  P    VD  GNIYVAD+ NH IRKIT  GV
Sbjct: 483 ITAAGVVTTIAGNGTAGYTNGTGTAAQFKNPSGVDVDASGNIYVADRLNHRIRKITTSGV 542



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 43/113 (38%), Positives = 61/113 (53%), Gaps = 5/113 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           ++   VV  LAG G  G +DG  GSA+F  P    VD+ GN+YVAD  +  IRKI++  V
Sbjct: 537 ITTSGVVSSLAGTGTSGTTDGAAGSAKFSDPYGVTVDVSGNVYVADLISSRIRKISSGQV 596

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           +T+AG      G  +G +  A F+   +L  V     + I+DH N  IR + L
Sbjct: 597 STLAG---TIPGYQNGTSTIAKFNQPTDL--VIQGSNIYIADHSNNSIRLVKL 644


>gi|431798813|ref|YP_007225717.1| NHL repeat protein [Echinicola vietnamensis DSM 17526]
 gi|430789578|gb|AGA79707.1| NHL repeat protein [Echinicola vietnamensis DSM 17526]
          Length = 655

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 42/105 (40%), Positives = 69/105 (65%), Gaps = 4/105 (3%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           + +AG G+ G+ +G   +A+FD P+  A+D +GNI+VAD+ N+VIRKI++ G V+T AG 
Sbjct: 336 ETIAGTGIAGFQNGSSSTAKFDNPEGIAIDQQGNIFVADRDNNVIRKISSSGDVSTFAGT 395

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G    G  DG A  A F++ +++  V +   ++++D GN  IR+I
Sbjct: 396 GIA--GFTDGVAGVAKFNSPWKVA-VDNQGNVIVADRGNHSIRKI 437



 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/144 (35%), Positives = 76/144 (52%), Gaps = 12/144 (8%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           NP  + ID+ G  ++ V D   +        +S    V   AG G+ G++DG  G A+F+
Sbjct: 358 NPEGIAIDQQG--NIFVADRDNNVIR----KISSSGDVSTFAGTGIAGFTDGVAGVAKFN 411

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
            P   AVD +GN+ VAD+ NH IRKIT N  V+T+AG      G  DG    A F    +
Sbjct: 412 SPWKVAVDNQGNVIVADRGNHSIRKITPNGTVSTLAG---TTNGYQDGSGNQAKFDQPTD 468

Query: 164 LTFVPHICALLISDHGNQLIRQIN 187
           +  +P+   ++I+D+ N  IR I+
Sbjct: 469 VAVLPN-GNIVIADNRNHCIRMID 491



 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/107 (38%), Positives = 58/107 (54%), Gaps = 4/107 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
           V  +AG G  GY DG    A+F  P     D  GN++VAD+ NH IRKI +   V+T+AG
Sbjct: 496 VSTIAGTGNGGYVDGAGSQAQFYYPSGIDTDPNGNLFVADRKNHAIRKIDSYHNVSTVAG 555

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G    EG  +G    A F + + +  V     +L++D  N +IR+IN
Sbjct: 556 G--NGEGIQNGGIAVAKFDDPYGVA-VGQNGKVLVADLDNNVIREIN 599



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/153 (30%), Positives = 77/153 (50%), Gaps = 9/153 (5%)

Query: 40  GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
           G Q +    S ID   + +L V D    A   +    S  +V     G+G +G  +G   
Sbjct: 512 GSQAQFYYPSGIDTDPNGNLFVADRKNHAIRKID---SYHNVSTVAGGNG-EGIQNGGIA 567

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
            A+FD P   AV   G + VAD  N+VIR+I    V+TI   GS  EG  DGP+  +  +
Sbjct: 568 VAKFDDPYGVAVGQNGKVLVADLDNNVIREINGDYVSTII--GSNGEGYIDGPSTASKMN 625

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPED 192
           +  ++  + +   ++ +D+GN L+R++ +K E+
Sbjct: 626 SPTDV--LVNGDEIIFADYGNHLVRKV-VKDEE 655


>gi|402814801|ref|ZP_10864394.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
 gi|402507172|gb|EJW17694.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
          Length = 571

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 44/98 (44%), Positives = 56/98 (57%), Gaps = 4/98 (4%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRA 149
           QG+ DGE    +FD+P   A D KGN+YVAD  NHVIRKI   G VTT+AG G    G  
Sbjct: 129 QGWEDGEQAQVQFDQPTGLAADKKGNLYVADAGNHVIRKIDGAGKVTTVAGDGIP--GWK 186

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           DG A  A F     +  V    A+ ++D  N +IR+I+
Sbjct: 187 DGAAGTARFHTPRAIA-VADDGAIYVTDSLNHVIRRID 223



 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 52/134 (38%), Positives = 68/134 (50%), Gaps = 11/134 (8%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V  +AGDG+ G+ DG  G+ARF  P++ AV   G IYV D  NHVIR+I  +G VTT+  
Sbjct: 174 VTTVAGDGIPGWKDGAAGTARFHTPRAIAVADDGAIYVTDSLNHVIRRIDAMGMVTTLTA 233

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFV----PHICALL------ISDHGNQLIRQINLKP 190
             S+    + G    A    D  LT      P   AL+      ++D GNQ IR I+LK 
Sbjct: 234 RSSRIVEYSPGSVTGAGDFKDGNLTEAMFNEPSGIALMNDGKLAVADTGNQRIRLIDLKQ 293

Query: 191 EDCSKSSQSGSALG 204
              S  + S S  G
Sbjct: 294 GSVSTIAGSSSTAG 307



 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 54/195 (27%), Positives = 84/195 (43%), Gaps = 27/195 (13%)

Query: 14  LLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS 73
           L   +F+  S   L+ DG          ++ +     ID    S   +  SS +A YTL 
Sbjct: 257 LTEAMFNEPSGIALMNDGKLAVADTGNQRIRL-----IDLKQGSVSTIAGSSSTAGYTLP 311

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
                  V + L   G  GY DG   ++ F+ P   A+  +  I VAD+ NHV+R I   
Sbjct: 312 ------GVQRVLYAPG--GYRDGAASNSIFNSPTGIAITNENGIVVADRWNHVVRYIYKG 363

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC--ALLISDHGNQLIRQIN---- 187
            V T++ G + K G  DG     S++   E T V  +   ++ +++  N  IR I     
Sbjct: 364 EVVTLS-GQAGKSGDLDGI---TSYAKLHEPTAVAVLANGSIAVAEGFNNAIRLIRRYEL 419

Query: 188 ----LKPEDCSKSSQ 198
               + P D S++SQ
Sbjct: 420 PECVMGPLDQSRASQ 434



 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 36/86 (41%), Gaps = 1/86 (1%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           R   P   A     +I V D  NHV+R++   G  T   G   K+G  DG      F   
Sbjct: 85  RLQFPTGIAAGENQSIIVTDTRNHVLRRLDAEGRITATIGMINKQGWEDGEQAQVQFDQP 144

Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
             L        L ++D GN +IR+I+
Sbjct: 145 TGLA-ADKKGNLYVADAGNHVIRKID 169


>gi|223934698|ref|ZP_03626618.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223896653|gb|EEF63094.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 847

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 51/127 (40%), Positives = 68/127 (53%), Gaps = 4/127 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           ++   VV  LAG   +G  +G   SARF  P++ AVD + NIYVAD  N VIRKIT  GV
Sbjct: 358 MTAAGVVTTLAGSPSEGSINGVTSSARFYSPQNVAVDGQNNIYVADTQNSVIRKITPFGV 417

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
            ++  G +   G ADG   NA FS    +  V     + ++D GN  IR+I       S 
Sbjct: 418 VSVLAGTTGVFGSADGSGANALFSGPQGIA-VDGGGNIYVADTGNSTIRKIT---PSGST 473

Query: 196 SSQSGSA 202
           S+ +GSA
Sbjct: 474 STLAGSA 480



 Score = 67.8 bits (164), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 41/112 (36%), Positives = 60/112 (53%), Gaps = 2/112 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   + + LAG  GVQG +DG    A F++P   A+D  GNIYV+D  +  IRK+T  G
Sbjct: 85  VTASGLCRTLAGTAGVQGSADGMGAQASFNQPSGIALDSDGNIYVSDYGSSTIRKVTQSG 144

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             T   G +   G  +    NA F +   L  V +   L ++D+GN LIR+I
Sbjct: 145 QVTTLAGMTGVTGSVNNTGTNALFFHPMGLA-VDNATNLYVADYGNHLIRKI 195



 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 48/133 (36%), Positives = 65/133 (48%), Gaps = 6/133 (4%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           G +++ V D+  S    ++ P     VV  LAG  GV G +DG   +A F  P+  AVD 
Sbjct: 395 GQNNIYVADTQNSVIRKIT-PFG---VVSVLAGTTGVFGSADGSGANALFSGPQGIAVDG 450

Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
            GNIYVAD  N  IRKIT  G T+   G +   G ADG    A F    +   V     +
Sbjct: 451 GGNIYVADTGNSTIRKITPSGSTSTLAGSAGNPGNADGAGITAQFYQP-QGVAVDSANNV 509

Query: 174 LISDHGNQLIRQI 186
            ++D GN  +R +
Sbjct: 510 YVADTGNHTVRMV 522



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 64/126 (50%), Gaps = 13/126 (10%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           ++  +AG   +G +DG  GSA+   P+  AVD  GN+YVAD  N++IR +T  G+     
Sbjct: 36  LIGTVAGYAGKGSADGVGGSAQLFGPQGVAVDGAGNVYVADTGNNIIRVVTASGLCRTLA 95

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICAL------LISDHGNQLIRQINLKPEDCS 194
           G +  +G ADG    ASF+        P   AL       +SD+G+  IR++    +  +
Sbjct: 96  GTAGVQGSADGMGAQASFNQ-------PSGIALDSDGNIYVSDYGSSTIRKVTQSGQVTT 148

Query: 195 KSSQSG 200
            +  +G
Sbjct: 149 LAGMTG 154



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/126 (32%), Positives = 61/126 (48%), Gaps = 2/126 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   +   LAG  G  G  DG    ARF+ P   AVD  GN+YV D +N  IRK+T+ G
Sbjct: 522 VTPGGISSTLAGLAGTFGTFDGTNAGARFNGPTGIAVDGAGNLYVTDYNNDTIRKVTSAG 581

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
             T   G +   G  DG   +A F     ++ V  +  L + D GN  +R++ L     +
Sbjct: 582 AVTTLAGWTGMWGSIDGAGNSALFFGPSGIS-VDALGNLYVIDSGNSTLRKLTLSGGTWT 640

Query: 195 KSSQSG 200
            S+ +G
Sbjct: 641 VSTVAG 646



 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 61/119 (51%), Gaps = 17/119 (14%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S   VV  LAG  G  G +DG   SARF  P+  AVD  G +++AD +N  IR +T  G
Sbjct: 303 ISPAGVVTTLAGLSGTAGSADGTNSSARFLGPQGVAVDSTGTVFIADTANSTIRVMTAAG 362

Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
            VTT+A  GS  EG  +G   +A F       + P   A      + ++D  N +IR+I
Sbjct: 363 VVTTLA--GSPSEGSINGVTSSARF-------YSPQNVAVDGQNNIYVADTQNSVIRKI 412



 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 65/132 (49%), Gaps = 4/132 (3%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           G  ++ V D+  S    ++ P    S +   AG+   G +DG   +A+F +P+  AVD  
Sbjct: 450 GGGNIYVADTGNSTIRKIT-PSGSTSTLAGSAGN--PGNADGAGITAQFYQPQGVAVDSA 506

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
            N+YVAD  NH +R +T  G+++   G +   G  DG    A F+    +  V     L 
Sbjct: 507 NNVYVADTGNHTVRMVTPGGISSTLAGLAGTFGTFDGTNAGARFNGPTGIA-VDGAGNLY 565

Query: 175 ISDHGNQLIRQI 186
           ++D+ N  IR++
Sbjct: 566 VTDYNNDTIRKV 577



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 68/148 (45%), Gaps = 8/148 (5%)

Query: 39  DGHQLEIN-PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG 96
           DG   + N P +V +D+ G  ++ V D+  +A   +   +   SV       G  G +D 
Sbjct: 215 DGLGGQFNEPEAVAVDQAG--NVYVADTGNAAIRMI---MPGGSVTTLAGAAGFVGSADA 269

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNA 156
              +A F +P    ++  GN+YVAD  N+ IR+I+  GV T   G S   G ADG   +A
Sbjct: 270 SGTNALFHQPAGIGINSAGNLYVADYFNNTIRQISPAGVVTTLAGLSGTAGSADGTNSSA 329

Query: 157 SFSNDFELTFVPHICALLISDHGNQLIR 184
            F    +   V     + I+D  N  IR
Sbjct: 330 RFLGP-QGVAVDSTGTVFIADTANSTIR 356



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 41/112 (36%), Positives = 54/112 (48%), Gaps = 4/112 (3%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +++   V  LAG  GV G  +    +A F  P   AVD   N+YVAD  NH+IRKIT   
Sbjct: 140 VTQSGQVTTLAGMTGVTGSVNNTGTNALFFHPMGLAVDNATNLYVADYGNHLIRKITPSN 199

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           V +   G +   G ADG      F N+ E   V     + ++D GN  IR I
Sbjct: 200 VVSTLAGVTGVPGSADG--LGGQF-NEPEAVAVDQAGNVYVADTGNAAIRMI 248



 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 30/72 (41%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           +L V+DS  S    L+      +V       GV G  DG    A F  P    V   G +
Sbjct: 618 NLYVIDSGNSTLRKLTLSGGTWTVSTVAGMPGVNGGIDGSGAGAEFYYPAGVTVSAAGYV 677

Query: 118 YVADKSNHVIRK 129
           YVAD  N+ IR 
Sbjct: 678 YVADAGNNTIRS 689


>gi|182412380|ref|YP_001817446.1| immunoglobulin I-set domain-containing protein [Opitutus terrae
           PB90-1]
 gi|177839594|gb|ACB73846.1| Immunoglobulin I-set domain protein [Opitutus terrae PB90-1]
          Length = 1130

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 44/103 (42%), Positives = 57/103 (55%), Gaps = 3/103 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           G  G +DG    ARF  P+  A+D  GN+YVAD +NH IRKI   G  T   G + + G 
Sbjct: 220 GSAGSADGPGDQARFRDPEGLAIDAAGNVYVADINNHTIRKINPAGEVTTLAGAAGESGF 279

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
           ADGPA NA F     L   P   A+ ++D  N+ IR+I+  PE
Sbjct: 280 ADGPAANARFFCPTSLAIDPA-GAIWVNDAINRAIRKIS--PE 319



 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 62/169 (36%), Positives = 88/169 (52%), Gaps = 14/169 (8%)

Query: 38  IDG---HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GY 93
           IDG   + L  NP +++  P S DL V DS+  A   ++       VV  LAG  ++ GY
Sbjct: 115 IDGVGSNALFHNPSALVVGP-SGDLYVADSNGHAIRKVT----PAGVVTTLAGGPLRYGY 169

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG-GGSKKEGRADG 151
            DG    A+F  P+  AV+  G I+V+D+S H IR++  LG V+T AG GGS   G ADG
Sbjct: 170 MDGPGTEAQFSYPRGIAVNATGVIFVSDRSAHTIRRVDQLGNVSTWAGHGGSA--GSADG 227

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
           P   A F  D E   +     + ++D  N  IR+IN   E  + +  +G
Sbjct: 228 PGDQARF-RDPEGLAIDAAGNVYVADINNHTIRKINPAGEVTTLAGAAG 275



 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 64/133 (48%), Gaps = 7/133 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMK 114
           S ++ V+D   S    +S     + +V   AG  G  G  DG   +ARF  P    +D  
Sbjct: 499 SGNIFVVDRGASTIRKIS-----QGIVTTFAGMPGETGQDDGAGAAARFRDPMGIVIDGA 553

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
            N+YVAD +N  IRK+T  GV T   G +  +G  DGP   ASF N + L   P+  AL 
Sbjct: 554 DNLYVADTNNWKIRKVTPAGVVTTFAGHTSTQGANDGPIGIASFFNPYGLAIGPN-GALY 612

Query: 175 ISDHGNQLIRQIN 187
           + D     +R I+
Sbjct: 613 VVDLAGDTLRMIS 625



 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/128 (33%), Positives = 60/128 (46%), Gaps = 2/128 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV  LAG   V G  DG   +A F  P +  V   G++YVAD + H IRK+T  G
Sbjct: 96  ITPAGVVTTLAGVANVHGCIDGVGSNALFHNPSALVVGPSGDLYVADSNGHAIRKVTPAG 155

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           V T   GG  + G  DGP   A FS    +  V     + +SD     IR+++      +
Sbjct: 156 VVTTLAGGPLRYGYMDGPGTEAQFSYPRGIA-VNATGVIFVSDRSAHTIRRVDQLGNVST 214

Query: 195 KSSQSGSA 202
            +   GSA
Sbjct: 215 WAGHGGSA 222



 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 48/102 (47%), Gaps = 1/102 (0%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           LAG+  +G  DG    ARF+     A+   G +YVAD +N  IR IT  GV T   G + 
Sbjct: 51  LAGEWNKGSRDGAAAVARFNGANGVAIAPNGLVYVADLANSTIRAITPAGVVTTLAGVAN 110

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             G  DG   NA F N   L   P    L ++D     IR++
Sbjct: 111 VHGCIDGVGSNALFHNPSALVVGPS-GDLYVADSNGHAIRKV 151



 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 3/106 (2%)

Query: 82  VKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           V  +AG DG  G +DG   SARF      AVD  GNI+V D+    IRKI+   VTT A 
Sbjct: 466 VTTIAGADGEDGDTDGPAASARFRAVDGLAVDSSGNIFVVDRGASTIRKISQGIVTTFA- 524

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G   + G+ DG    A F +   +  +     L ++D  N  IR++
Sbjct: 525 GMPGETGQDDGAGAAARFRDPMGI-VIDGADNLYVADTNNWKIRKV 569



 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 3/66 (4%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           P  E + +   AG+   G++DG   +ARF  P S A+D  G I+V D  N  IRKI+  G
Sbjct: 263 PAGEVTTLAGAAGE--SGFADGPAANARFFCPTSLAIDPAGAIWVNDAINRAIRKISPEG 320

Query: 135 -VTTIA 139
            VTT+A
Sbjct: 321 TVTTVA 326



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 46/99 (46%), Gaps = 3/99 (3%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           G  G+ DG   +A+F   +    D +GN+ V D     IRK+T  G  T   G   ++G 
Sbjct: 421 GYPGFRDGSGYAAQFRDLRGITPDKEGNLLVGD--GRTIRKVTLAGAVTTIAGADGEDGD 478

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            DGPA +A F     L  V     + + D G   IR+I+
Sbjct: 479 TDGPAASARFRAVDGLA-VDSSGNIFVVDRGASTIRKIS 516



 Score = 40.8 bits (94), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 49/111 (44%), Gaps = 8/111 (7%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKS 108
           VID  G+ +L V D++      ++       VV   AG    QG +DG  G A F  P  
Sbjct: 549 VID--GADNLYVADTNNWKIRKVT----PAGVVTTFAGHTSTQGANDGPIGIASFFNPYG 602

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR-ADGPAQNASF 158
            A+   G +YV D +   +R I+  G  T  GG S   G  ADG    A F
Sbjct: 603 LAIGPNGALYVVDLAGDTLRMISPDGFVTTLGGSSAHRGETADGIGTAARF 653



 Score = 40.0 bits (92), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 16/111 (14%)

Query: 33  TVTTVID--GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
           TVTTV D  G  + I+P+ V+    + D I    S S    ++ P   +S   R      
Sbjct: 321 TVTTVADTAGEGITIDPNGVL-YIAADDRIKRLESGSVLSVVAGP--TDSYTSR------ 371

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
               +G   +ARF +P   A+ + GN+YV D   + IR++T  G V+T+AG
Sbjct: 372 ----NGVGANARFVQPIGSALAVDGNLYVTDSGGYAIRRVTRSGEVSTLAG 418


>gi|348676369|gb|EGZ16187.1| hypothetical protein PHYSODRAFT_506757 [Phytophthora sojae]
          Length = 1981

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 51/126 (40%), Positives = 71/126 (56%), Gaps = 11/126 (8%)

Query: 70  YTLSFPLS--EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           Y+ +F  S   +  V  +AG+G  G+ DG   +ARF+ P+  AVD  G +YVAD +NH I
Sbjct: 190 YSAAFGASGRAQGRVTTVAGEGTAGFLDGPAATARFNHPRGVAVDSNGVVYVADTANHRI 249

Query: 128 RKI--TNLGVTTIAGGGSKKEGRADGPAQNA---SFSNDFEL--TFVPHICALLISDHGN 180
           RKI  T   V+T+AG G   EG ADG A +A   S+ +D  +  T       + ++D GN
Sbjct: 250 RKINPTTKMVSTLAGDGI--EGFADGAALSAARFSYPSDVAVLETNGGATVTVFVADTGN 307

Query: 181 QLIRQI 186
             IRQI
Sbjct: 308 HRIRQI 313



 Score = 55.5 bits (132), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 27/62 (43%), Positives = 33/62 (53%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G +DG+P  ARFD P   AVD  G ++VAD  NH+IR+I   G T    G       AD 
Sbjct: 344 GLADGDPLGARFDSPMGVAVDADGVVFVADTGNHLIRRIEPNGTTHTLAGSVVPSEDADT 403

Query: 152 PA 153
           P 
Sbjct: 404 PG 405



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 41/126 (32%), Positives = 61/126 (48%), Gaps = 21/126 (16%)

Query: 81  VVKRLAGDGVQGYSDGEP-GSARFDKPKSFAV-DMKG----NIYVADKSNHVIRKITNLG 134
           +V  LAGDG++G++DG    +ARF  P   AV +  G     ++VAD  NH IR+I +  
Sbjct: 258 MVSTLAGDGIEGFADGAALSAARFSYPSDVAVLETNGGATVTVFVADTGNHRIRQIKSGV 317

Query: 135 VTTIAG--------------GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN 180
           V+ ++G                S   G ADG    A F +   +  V     + ++D GN
Sbjct: 318 VSCLSGLCGAGVETVRLSQQPESPHAGLADGDPLGARFDSPMGVA-VDADGVVFVADTGN 376

Query: 181 QLIRQI 186
            LIR+I
Sbjct: 377 HLIRRI 382



 Score = 41.2 bits (95), Expect = 0.46,   Method: Composition-based stats.
 Identities = 29/107 (27%), Positives = 48/107 (44%), Gaps = 5/107 (4%)

Query: 66  RSAFYTLSFPLSEESV-----VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           R   Y   F  + E+V     V  LAG  V G+ DGE   A F+ P   A    G +YVA
Sbjct: 450 RRVTYDGGFASTIETVTSTNRVVTLAGSNVPGHIDGEGNEATFNAPAGVAFAADGRVYVA 509

Query: 121 DKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
             ++  +R+I+   + + +   S +  +   P+  AS+    +  F+
Sbjct: 510 SSTDCTVRQISPASLVSRSVTCSTRATQVLRPSGCASYEQPVDELFL 556



 Score = 38.9 bits (89), Expect = 2.4,   Method: Composition-based stats.
 Identities = 27/85 (31%), Positives = 40/85 (47%), Gaps = 18/85 (21%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-------------- 134
           G +G+ DG    A+FD P++ AV  +  + VAD   H +R++T  G              
Sbjct: 413 GERGFRDGNLTYAQFDSPRAIAVGAERTVVVAD--GHRVRRVTYDGGFASTIETVTSTNR 470

Query: 135 VTTIAGGGSKKEGRADGPAQNASFS 159
           V T+A  GS   G  DG    A+F+
Sbjct: 471 VVTLA--GSNVPGHIDGEGNEATFN 493


>gi|110639161|ref|YP_679370.1| hypothetical protein CHU_2779 [Cytophaga hutchinsonii ATCC 33406]
 gi|110281842|gb|ABG60028.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
          Length = 1051

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 59/111 (53%), Gaps = 2/111 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           ++   +V  +AG+G  G  D    +ARF+ P +  +D   NIY+AD  NH IRKIT  GV
Sbjct: 81  VNPSGIVSTIAGNGTSGLIDATGSAARFNLPAAVVLDAAQNIYIADNGNHCIRKITPAGV 140

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             I   GS   G  DG    A F+N + +  +     L ++D GN LIR+I
Sbjct: 141 -VITFAGSGTAGSNDGTGTAAQFNNPYGMA-IDASGNLYVADSGNNLIRKI 189



 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 50/143 (34%), Positives = 76/143 (53%), Gaps = 11/143 (7%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           NP+ + ID  G  +L V DS  +    L   ++   VV  +AG+   GY DG   +ARF 
Sbjct: 164 NPYGMAIDASG--NLYVADSGNN----LIRKITSSGVVTTIAGNTTPGYVDGTGTAARFY 217

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFE 163
            P +  +D+ GN ++ D  NH IRK+T+ G V+T+AG GS   G  +G    A F+  + 
Sbjct: 218 LPVNITLDVSGNFFITDNRNHRIRKMTSAGVVSTVAGSGS--AGYMNGTGVTAQFNRPYG 275

Query: 164 LTFVPHICALLISDHGNQLIRQI 186
           +  V     L ++D  N +IR+I
Sbjct: 276 IV-VDAFSNLYVTDTNNGVIRKI 297



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 52/84 (61%), Gaps = 3/84 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++   VV  +AG G  GY +G   +A+F++P    VD   N+YV D +N VIRKIT+ G 
Sbjct: 243 MTSAGVVSTVAGSGSAGYMNGTGVTAQFNRPYGIVVDAFSNLYVTDTNNGVIRKITSSGV 302

Query: 135 VTTIAGGGSKKEGRADGPAQNASF 158
           V+T AG G+   G ADGPA NA F
Sbjct: 303 VSTYAGTGT--PGFADGPAANAQF 324



 Score = 67.4 bits (163), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 60/112 (53%), Gaps = 4/112 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++   VV   AG G  G +DG   +A+F+ P   A+D  GN+YVAD  N++IRKIT+ G 
Sbjct: 135 ITPAGVVITFAGSGTAGSNDGTGTAAQFNNPYGMAIDASGNLYVADSGNNLIRKITSSGV 194

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           VTTIA  G+   G  DG    A F     +T         I+D+ N  IR++
Sbjct: 195 VTTIA--GNTTPGYVDGTGTAARFYLPVNITLDVS-GNFFITDNRNHRIRKM 243



 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 13/91 (14%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++   VV   AG G  G++DG   +A+F  P    ++  G++Y AD   H +RK+T  G 
Sbjct: 297 ITSSGVVSTYAGTGTPGFADGPAANAQFQWPTGLTINASGDLYEADNETHRVRKVTPAGI 356

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           V+T AG G            NA F+N   L+
Sbjct: 357 VSTFAGNG------------NAGFANTAALS 375



 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 42/88 (47%), Gaps = 4/88 (4%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           +A F      A D  GN+ VA   +H IRK+   G V+TIAG G+     A G A   + 
Sbjct: 51  TAEFYGLSGLAFDNSGNLIVAGYQDHSIRKVNPSGIVSTIAGNGTSGLIDATGSAARFNL 110

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
                L    +I    I+D+GN  IR+I
Sbjct: 111 PAAVVLDAAQNI---YIADNGNHCIRKI 135


>gi|194364186|ref|YP_002026796.1| SMP-30/gluconolaconase/LRE domain-containing protein
           [Stenotrophomonas maltophilia R551-3]
 gi|194346990|gb|ACF50113.1| SMP-30/Gluconolaconase/LRE domain protein [Stenotrophomonas
           maltophilia R551-3]
          Length = 693

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 59/152 (38%), Positives = 76/152 (50%), Gaps = 11/152 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           IDG  L+   N  S I      +L V D+   A   +      +  V  LAG G QGY+D
Sbjct: 114 IDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGYAD 168

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQ 154
           G    ARFD P   AVD +G +YVAD  N  IR I T+  V T+AGG  ++ G ADG   
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGG--ERPGLADGLGV 226

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            A F     L F  H  ALL++D  N  +R++
Sbjct: 227 AARFDTPVALAFDAH-GALLVADLFNNAVRRV 257



 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 43/106 (40%), Positives = 58/106 (54%), Gaps = 6/106 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           ++ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I   G VTT+AG
Sbjct: 103 IETVAGQG-EGRIDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGTDGQVTTLAG 161

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G   ++G ADGPA  A F     +  V     + ++D  N  IR I
Sbjct: 162 G---EQGYADGPAAQARFDAPMGIA-VDAQGQVYVADTFNDRIRVI 203



 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 58/114 (50%), Gaps = 6/114 (5%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD--KSNHVIRKITN 132
           PL+  + ++ LAGDG  G  DG    ARF  P +      G+IY  D   +N + R+  +
Sbjct: 41  PLAWTAQIEMLAGDGHPGDRDGASAQARFADPYALLRSADGSIYFTDAGDNNRIRRRQPD 100

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             + T+AG G   EGR DGPA  ASF+    +        L ++D GN  IR+I
Sbjct: 101 GRIETVAGQG---EGRIDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRI 150


>gi|409099442|ref|ZP_11219466.1| NHL repeat containing protein [Pedobacter agri PB92]
          Length = 754

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 48/112 (42%), Positives = 60/112 (53%), Gaps = 4/112 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +S    V  +AG G  G  DG   SA F+ P   AVD  GNIYVAD  N+ IRKIT  G 
Sbjct: 193 ISSTGFVTTVAGSGATGSLDGNAASATFNNPAGVAVDRAGNIYVADAGNNKIRKITPAGV 252

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           V+T+AG G+   G  DG   NA       +T  P +  L I+D+ N  IR+I
Sbjct: 253 VSTLAGNGT--AGSTDGTGTNARLYYPTGITIDP-LDNLYIADYRNNRIRKI 301



 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 58/162 (35%), Positives = 82/162 (50%), Gaps = 11/162 (6%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           NP  V +DR G  ++ V D+  +        ++   VV  LAG+G  G +DG   +AR  
Sbjct: 222 NPAGVAVDRAG--NIYVADAGNNKIR----KITPAGVVSTLAGNGTAGSTDGTGTNARLY 275

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFE 163
            P    +D   N+Y+AD  N+ IRKI+  G VTT+AG GS   G  +G   N+SF+    
Sbjct: 276 YPTGITIDPLDNLYIADYRNNRIRKISPSGVVTTLAGNGS--AGNLNGVGTNSSFNGPIM 333

Query: 164 LTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           L F      L I+D  N LIR+I+L      K+  +G A  A
Sbjct: 334 LAFDVQ-GNLFITDFQNNLIRKISLTGYTIDKNLPAGLAFDA 374



 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 52/88 (59%), Gaps = 4/88 (4%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           S  F  P     D+ G++Y+AD+ N+ I+KI++ G VTT+AG G+   G  DG A +A+F
Sbjct: 163 SGNFSSPGGLNFDLAGDLYLADQKNNKIKKISSTGFVTTVAGSGA--TGSLDGNAASATF 220

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           +N   +  V     + ++D GN  IR+I
Sbjct: 221 NNPAGVA-VDRAGNIYVADAGNNKIRKI 247



 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 33/79 (41%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
           V   AG G     +G   S+ F+ P     D + NIYV D  +  IRKIT  G+ +  G 
Sbjct: 63  VSTFAGSGAASSYNGTGLSSGFNMPSGIGSDAQSNIYVCDYGSGAIRKITPSGLVSTIGY 122

Query: 142 GSKKEGRADGPAQNASFSN 160
                G     A N   SN
Sbjct: 123 VGTPTGLTADAAGNTYISN 141


>gi|426402696|ref|YP_007021667.1| hypothetical protein Bdt_0693 [Bdellovibrio bacteriovorus str.
           Tiberius]
 gi|425859364|gb|AFY00400.1| hypothetical protein Bdt_0693 [Bdellovibrio bacteriovorus str.
           Tiberius]
          Length = 692

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 66/132 (50%), Gaps = 7/132 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
           + D+ ++DS+ S    +S       VV   AG  GV  ++DG   SARFD P    +D  
Sbjct: 69  AGDIFIVDSNASVIRKIS-----NGVVSTFAGKFGVFDHADGTGDSARFDYPTGITIDGS 123

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           GN++V + +NH IRKIT   V T   G     G ADG    A F+N  ++T         
Sbjct: 124 GNLFVTEGNNHTIRKITPAAVVTTVAGSPGNAGTADGTGSAARFNNPEDITLAAD-GNFY 182

Query: 175 ISDHGNQLIRQI 186
           I+D  N +IR++
Sbjct: 183 ITDKNNNMIRKM 194



 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 52/139 (37%), Positives = 66/139 (47%), Gaps = 18/139 (12%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           G+  L V DS     YT+   ++   VV  LAG  G+QG  DG    A F +     V  
Sbjct: 506 GAGSLYVSDSGN---YTVR-KVTLAGVVTTLAGQVGIQGSDDGTGTGATFSRVAGITVTP 561

Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA- 172
            GNI+VAD  N+VIRKIT  GV T   G + + G  DG   NA FS        PH  A 
Sbjct: 562 SGNIFVADTDNNVIRKITVAGVVTTFAGAAGQGGNDDGMGSNARFSQ-------PHFVAT 614

Query: 173 -----LLISDHGNQLIRQI 186
                L +++ G   IR+I
Sbjct: 615 DSSGNLYVAEWGEATIRKI 633



 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 58/114 (50%), Gaps = 2/114 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           L+   VV  LAGD    G +DG   SARF      A D  G++YV+D  N+ +RK+T  G
Sbjct: 468 LTPGGVVTSLAGDYDNSGSADGTGTSARFGGVAGIASDGAGSLYVSDSGNYTVRKVTLAG 527

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           V T   G    +G  DG    A+FS    +T  P    + ++D  N +IR+I +
Sbjct: 528 VVTTLAGQVGIQGSDDGTGTGATFSRVAGITVTPS-GNIFVADTDNNVIRKITV 580



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 1/111 (0%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           ++   VV   AGDG  G +DG   +A F+ P     D  GN++V   S   IRKIT  GV
Sbjct: 194 MTPAGVVTTFAGDGTYGCTDGTGAAAHFNYPTGIVGDSAGNLFVVCSSCSTIRKITPAGV 253

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            T   G +   G  DG    A FS    +T +     L ++D+GN  IR++
Sbjct: 254 VTTFAGLANATGALDGTGTAARFSWPIGIT-IDSSDNLYVADYGNSAIRKV 303



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 47/111 (42%), Positives = 57/111 (51%), Gaps = 3/111 (2%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S DL V DS  ++   ++       V   LAGD   G +DG   +A F  P+  A D  G
Sbjct: 342 SGDLFVTDSDNASIRKITPARVVTLVAGSLAGD-SDGSADGTGTAASFFSPEGVAADPAG 400

Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           N+YVAD  N  IRKIT  G VTTIA G   + G ADG    A FS   +LT
Sbjct: 401 NLYVADTMNRTIRKITPSGNVTTIA-GSPGQIGSADGTGAAARFSYPTKLT 450



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/140 (33%), Positives = 62/140 (44%), Gaps = 20/140 (14%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
           S +L V D   SA       ++  +VV   AG  G  G  DG   +ARF  P    +D  
Sbjct: 287 SDNLYVADYGNSAIR----KVTSSAVVSNFAGSYGDYGAVDGTGTAARFAGPAGIGIDAS 342

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGG--GSKKEGRADGPAQNASFSNDFELTFVPHICA 172
           G+++V D  N  IRKIT   V T+  G      +G ADG    ASF       F P   A
Sbjct: 343 GDLFVTDSDNASIRKITPARVVTLVAGSLAGDSDGSADGTGTAASF-------FSPEGVA 395

Query: 173 ------LLISDHGNQLIRQI 186
                 L ++D  N+ IR+I
Sbjct: 396 ADPAGNLYVADTMNRTIRKI 415



 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 53/108 (49%), Gaps = 4/108 (3%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           V  +AG  G  G +DG   +ARF  P    V   GNIY+AD+  + IRK+T  GV T   
Sbjct: 421 VTTIAGSPGQIGSADGTGAAARFSYPTKLTVAEDGNIYIADE--YRIRKLTPGGVVTSLA 478

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           G     G ADG   +A F     +       +L +SD GN  +R++ L
Sbjct: 479 GDYDNSGSADGTGTSARFGGVAGIA-SDGAGSLYVSDSGNYTVRKVTL 525



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 50/86 (58%), Gaps = 4/86 (4%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++  +VV  +AG  G  G +DG   +ARF+ P+   +   GN Y+ DK+N++IRK+T  G
Sbjct: 139 ITPAAVVTTVAGSPGNAGTADGTGSAARFNNPEDITLAADGNFYITDKNNNMIRKMTPAG 198

Query: 135 -VTTIAGGGSKKEGRADGPAQNASFS 159
            VTT AG G+   G  DG    A F+
Sbjct: 199 VVTTFAGDGT--YGCTDGTGAAAHFN 222



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 49/112 (43%), Gaps = 2/112 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV   AG     G  DG   +ARF  P    +D   N+YVAD  N  IRK+T+  
Sbjct: 248 ITPAGVVTTFAGLANATGALDGTGTAARFSWPIGITIDSSDNLYVADYGNSAIRKVTSSA 307

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           V +   G     G  DG    A F+    +  +     L ++D  N  IR+I
Sbjct: 308 VVSNFAGSYGDYGAVDGTGTAARFAGPAGIG-IDASGDLFVTDSDNASIRKI 358



 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 31/67 (46%), Positives = 38/67 (56%), Gaps = 2/67 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NL 133
           ++   VV   AG   QG +D   GS ARF +P   A D  GN+YVA+     IRKIT + 
Sbjct: 578 ITVAGVVTTFAGAAGQGGNDDGMGSNARFSQPHFVATDSSGNLYVAEWGEATIRKITPSA 637

Query: 134 GVTTIAG 140
            VTTIAG
Sbjct: 638 VVTTIAG 644



 Score = 44.3 bits (103), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 47/89 (52%), Gaps = 8/89 (8%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG---RADGPAQNAS 157
           ARF +P+   V+  G+I++ D +  VIRKI+N  V+T AG    K G    ADG   +A 
Sbjct: 56  ARFREPRDAVVNAAGDIFIVDSNASVIRKISNGVVSTFAG----KFGVFDHADGTGDSAR 111

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQI 186
           F     +T +     L +++  N  IR+I
Sbjct: 112 FDYPTGIT-IDGSGNLFVTEGNNHTIRKI 139


>gi|410694120|ref|YP_003624742.1| putative Soluble quinoprotein glucose dehydrogenase [Thiomonas sp.
           3As]
 gi|294340545|emb|CAZ88930.1| putative Soluble quinoprotein glucose dehydrogenase [Thiomonas sp.
           3As]
          Length = 366

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 46/117 (39%), Positives = 65/117 (55%), Gaps = 2/117 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S + VV  LAG  G  G+ DG    ARF+ P    +D +G IYV++  N+VIRKIT  G
Sbjct: 196 ISAKGVVSTLAGSPGDTGWRDGRGAQARFNTPVGLTLDAQGQIYVSEYFNNVIRKITPDG 255

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
             T   G   K G ADG    A F +   L+F P   +L+++D GN  +R+I+ + E
Sbjct: 256 TVTTFAGKPGKGGFADGNVGEALFLHPQTLSFAPD-GSLIVADTGNNRVRRISPQGE 311



 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 52/95 (54%), Gaps = 1/95 (1%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP 152
           Y+ G+   ARF  P+  A+D +GNI+VAD  N V+RK+   G  +I  G  ++    +GP
Sbjct: 49  YNGGDFAQARFHDPRGMALDAQGNIFVADYVNSVVRKLGTDGQVSIVAGQVEQRDARNGP 108

Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           A  A F +  E   V     L +SD G+  +R+I+
Sbjct: 109 ALQARFYSP-ECVAVATDGTLFVSDSGSNTVRRIS 142



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 11/121 (9%)

Query: 79  ESVVKRLAGDG----VQGY------SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            SVV++L  DG    V G        +G    ARF  P+  AV   G ++V+D  ++ +R
Sbjct: 80  NSVVRKLGTDGQVSIVAGQVEQRDARNGPALQARFYSPECVAVATDGTLFVSDSGSNTVR 139

Query: 129 KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           +I+  G  +   G  + EG ADG  Q A F++   L  V     + ++D  N  +R+I+ 
Sbjct: 140 RISREGRVSTLAGKLEVEGFADGTGQQARFNHPVGLD-VNAKGVVYVADAYNSTVRRISA 198

Query: 189 K 189
           K
Sbjct: 199 K 199



 Score = 40.0 bits (92), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 43/73 (58%), Gaps = 2/73 (2%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++ +  V   AG  G  G++DG  G A F  P++ +    G++ VAD  N+ +R+I+  G
Sbjct: 251 ITPDGTVTTFAGKPGKGGFADGNVGEALFLHPQTLSFAPDGSLIVADTGNNRVRRISPQG 310

Query: 135 -VTTIAGGGSKKE 146
            V+T+AG G+ ++
Sbjct: 311 EVSTLAGTGAGEK 323


>gi|42522312|ref|NP_967692.1| hypothetical protein Bd0727 [Bdellovibrio bacteriovorus HD100]
 gi|39574843|emb|CAE78685.1| hypothetical protein with NHL repeat [Bdellovibrio bacteriovorus
           HD100]
          Length = 709

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 66/132 (50%), Gaps = 7/132 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
           + D+ ++DS+ S    +S       VV   AG  GV  ++DG   SARFD P    +D  
Sbjct: 86  AGDIFIVDSNASVIRKIS-----NGVVSTFAGKFGVFDHADGTGDSARFDYPTGITIDGS 140

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           GN++V + +NH IRKIT   V T   G     G ADG    A F+N  ++T         
Sbjct: 141 GNLFVTEGNNHTIRKITPAAVVTTVAGSPGNAGTADGTGSAARFNNPEDITLAAD-GNFY 199

Query: 175 ISDHGNQLIRQI 186
           I+D  N +IR++
Sbjct: 200 ITDKNNNMIRKM 211



 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 52/139 (37%), Positives = 66/139 (47%), Gaps = 18/139 (12%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           G+  L V DS     YT+   ++   VV  LAG  G+QG  DG    A F +     V  
Sbjct: 523 GAGSLYVSDSGN---YTVR-KVTLAGVVTTLAGQVGIQGSDDGTGTGATFSRVAGITVTP 578

Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA- 172
            GNI+VAD  N+VIRKIT  GV T   G + + G  DG   NA FS        PH  A 
Sbjct: 579 SGNIFVADTDNNVIRKITVAGVVTTFAGAAGQGGNDDGMGSNARFSQ-------PHFVAT 631

Query: 173 -----LLISDHGNQLIRQI 186
                L +++ G   IR+I
Sbjct: 632 DSSGNLYVAEWGEATIRKI 650



 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 58/114 (50%), Gaps = 2/114 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           L+   VV  LAGD    G +DG   SARF      A D  G++YV+D  N+ +RK+T  G
Sbjct: 485 LTPGGVVTSLAGDYDNSGSADGTGTSARFGGVAGIASDGAGSLYVSDSGNYTVRKVTLAG 544

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           V T   G    +G  DG    A+FS    +T  P    + ++D  N +IR+I +
Sbjct: 545 VVTTLAGQVGIQGSDDGTGTGATFSRVAGITVTPS-GNIFVADTDNNVIRKITV 597



 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 53/111 (47%), Gaps = 1/111 (0%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           ++   VV   AGDG  G +DG   +A F+ P     D  GN++V   S   IRKIT  GV
Sbjct: 211 MTPAGVVTTFAGDGTYGCTDGTGAAAHFNYPTGIVGDSAGNLFVVCSSCSTIRKITPAGV 270

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            T   G +   G  DG    A FS    +T +     L ++D+ N  IR++
Sbjct: 271 VTTFAGQAYTTGAVDGTGTAARFSWPVGIT-IDSSDNLYVADYSNSAIRKV 320



 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 46/111 (41%), Positives = 57/111 (51%), Gaps = 3/111 (2%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S +L V DS  ++   ++       V   LAGD   G +DG   +A F  P+  A D  G
Sbjct: 359 SGNLFVTDSDNASIRKVTPARVVTLVAGSLAGD-SDGSADGTGTAASFHSPEGVAADPAG 417

Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           N+YVAD  N  IRKIT  G VTTIA G   + G ADG    A FS   +LT
Sbjct: 418 NLYVADTMNRTIRKITPSGNVTTIA-GSPGQIGSADGTGAAARFSYPTKLT 467



 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 63/134 (47%), Gaps = 8/134 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
           S +L V D S SA       ++  +VV   AG  G  G  DG   +ARF  P    +D  
Sbjct: 304 SDNLYVADYSNSAIR----KVTSSAVVSNFAGSYGDYGAVDGTGTAARFAGPAGVGIDAS 359

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGG--GSKKEGRADGPAQNASFSNDFELTFVPHICA 172
           GN++V D  N  IRK+T   V T+  G      +G ADG    ASF +   +   P    
Sbjct: 360 GNLFVTDSDNASIRKVTPARVVTLVAGSLAGDSDGSADGTGTAASFHSPEGVAADPA-GN 418

Query: 173 LLISDHGNQLIRQI 186
           L ++D  N+ IR+I
Sbjct: 419 LYVADTMNRTIRKI 432



 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 53/108 (49%), Gaps = 4/108 (3%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           V  +AG  G  G +DG   +ARF  P    V   GNIYVAD+  + IRK+T  GV T   
Sbjct: 438 VTTIAGSPGQIGSADGTGAAARFSYPTKLTVAEDGNIYVADE--YRIRKLTPGGVVTSLA 495

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           G     G ADG   +A F     +       +L +SD GN  +R++ L
Sbjct: 496 GDYDNSGSADGTGTSARFGGVAGIA-SDGAGSLYVSDSGNYTVRKVTL 542



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 50/86 (58%), Gaps = 4/86 (4%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++  +VV  +AG  G  G +DG   +ARF+ P+   +   GN Y+ DK+N++IRK+T  G
Sbjct: 156 ITPAAVVTTVAGSPGNAGTADGTGSAARFNNPEDITLAADGNFYITDKNNNMIRKMTPAG 215

Query: 135 -VTTIAGGGSKKEGRADGPAQNASFS 159
            VTT AG G+   G  DG    A F+
Sbjct: 216 VVTTFAGDGT--YGCTDGTGAAAHFN 239



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 50/112 (44%), Gaps = 2/112 (1%)

Query: 76  LSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV   AG     G  DG   +ARF  P    +D   N+YVAD SN  IRK+T+  
Sbjct: 265 ITPAGVVTTFAGQAYTTGAVDGTGTAARFSWPVGITIDSSDNLYVADYSNSAIRKVTSSA 324

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           V +   G     G  DG    A F+    +  +     L ++D  N  IR++
Sbjct: 325 VVSNFAGSYGDYGAVDGTGTAARFAGPAGVG-IDASGNLFVTDSDNASIRKV 375



 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 31/67 (46%), Positives = 38/67 (56%), Gaps = 2/67 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV   AG   QG +D   GS ARF +P   A D  GN+YVA+     IRKIT+  
Sbjct: 595 ITVAGVVTTFAGAAGQGGNDDGMGSNARFSQPHFVATDSSGNLYVAEWGEATIRKITSGA 654

Query: 135 -VTTIAG 140
            VTTIAG
Sbjct: 655 VVTTIAG 661



 Score = 44.3 bits (103), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 47/89 (52%), Gaps = 8/89 (8%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG---RADGPAQNAS 157
           ARF +P+   V+  G+I++ D +  VIRKI+N  V+T AG    K G    ADG   +A 
Sbjct: 73  ARFREPRDAVVNAAGDIFIVDSNASVIRKISNGVVSTFAG----KFGVFDHADGTGDSAR 128

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQI 186
           F     +T +     L +++  N  IR+I
Sbjct: 129 FDYPTGIT-IDGSGNLFVTEGNNHTIRKI 156


>gi|255532869|ref|YP_003093241.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
 gi|255345853|gb|ACU05179.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
          Length = 2296

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 45/107 (42%), Positives = 60/107 (56%), Gaps = 4/107 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           VV   AG GV+G+ DG   +A+F  P   A D  GN+YVAD+ NH IRKIT  G VTT+A
Sbjct: 642 VVSTFAGSGVKGFKDGTGTAAQFAFPTGLAADAAGNVYVADRDNHRIRKITAAGVVTTLA 701

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G G+     ADG   NA F+    +  V     + ++D  N  IR++
Sbjct: 702 GNGTAD--FADGTQANARFNMPVGVA-VDANGNVFVADRNNFRIRKV 745



 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 61/106 (57%), Gaps = 3/106 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++   VV  LAG+G   ++DG   +ARF+ P   AVD  GN++VAD++N  IRK+T  G 
Sbjct: 691 ITAAGVVTTLAGNGTADFADGTQANARFNMPVGVAVDANGNVFVADRNNFRIRKVTPAGV 750

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN 180
           V+T AG G+   G A+G + +A F   F L        L ISD GN
Sbjct: 751 VSTYAGKGT--SGFAEGLSGDAQFKLVFGLAMSSISGNLYISDQGN 794



 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 31/61 (50%), Positives = 40/61 (65%), Gaps = 2/61 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           VVKR  G  + GY++G   SA F+ P   A+D  GN Y+AD+ NH IRKI+  G VTT+A
Sbjct: 481 VVKRFVGKDL-GYANGTGTSAMFNSPAGIAMDAAGNTYLADRGNHRIRKISPAGVVTTLA 539

Query: 140 G 140
           G
Sbjct: 540 G 540



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 52/107 (48%), Gaps = 6/107 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V   AG G  GY++G   +A F      A+D  GN+YV D+  + IRKIT  G V+T AG
Sbjct: 589 VSTFAGTGTYGYANGTTTTATFTYLLGIAIDNSGNLYVTDQDVNSIRKITPGGVVSTFAG 648

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICA-LLISDHGNQLIRQI 186
            G K  G  DG    A F+  F           + ++D  N  IR+I
Sbjct: 649 SGVK--GFKDGTGTAAQFA--FPTGLAADAAGNVYVADRDNHRIRKI 691



 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 47/77 (61%), Gaps = 3/77 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +S   VV  LAG G  G++DG   +A+F+ P   AVD  GN+YV+D  N  +RKIT  G 
Sbjct: 529 ISPAGVVTTLAGSGTAGFADGAGAAAQFNDPSGVAVDAAGNVYVSDTRNFRVRKITPAGQ 588

Query: 135 VTTIAGGGSKKEGRADG 151
           V+T AG G+   G A+G
Sbjct: 589 VSTFAGTGT--YGYANG 603



 Score = 41.2 bits (95), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 4/68 (5%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           S+   V   AG GV GY +G+     +A+F        D   N+YV D  N+ IRKI   
Sbjct: 33  SQAQTVSPYAGTGVDGYYEGDNNDAITAQFSYIGGITTDANNNLYVTDSRNYCIRKIDQN 92

Query: 134 G-VTTIAG 140
           G V+ IAG
Sbjct: 93  GKVSVIAG 100


>gi|373953102|ref|ZP_09613062.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
 gi|373889702|gb|EHQ25599.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
          Length = 3731

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 47/112 (41%), Positives = 69/112 (61%), Gaps = 4/112 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +S   VV   AG GV G ++G   +A F+ P   A D++GN+YV+D ++++IRKIT  G 
Sbjct: 109 ISPVGVVSTFAGSGVAGSANGTGTAASFNNPFGIATDVQGNLYVSDVNSNLIRKITPGGV 168

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           VTT+AG GS   G  +G    ASF+  + LT       + ++D+GNQLIR+I
Sbjct: 169 VTTLAGSGSA--GSVNGTGTAASFNTPYSLT-TDMQGNVYVADYGNQLIRKI 217



 Score = 70.9 bits (172), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 45/112 (40%), Positives = 63/112 (56%), Gaps = 4/112 (3%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
            ++   +V   AG G  G  +G   +A F+ P     D +GNIYV+D +N+ IRKIT  G 
Sbjct: 1086 ITPSGLVTTFAGSGAAGSENGNGAAASFNNPFGLTTDAQGNIYVSDANNNTIRKITPSGV 1145

Query: 135  VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            VTT AG GS   G ADG    ASF++ + L        + ++D GNQ+IR+I
Sbjct: 1146 VTTFAGSGS--SGAADGIGMAASFNSPYGLATDAQ-GNIYVADFGNQVIRKI 1194



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 60/112 (53%), Gaps = 4/112 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++   VV   AG GV G  +G   +A F  P    +D +GN+YVAD  NH IRKIT  G 
Sbjct: 272 ITPAGVVTTFAGSGVPGALNGTGTAATFYNPTGVTMDAQGNVYVADSQNHSIRKITPAGV 331

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           VTT+AG GS   G A+G   NASF           +  L I+D  N LIR+I
Sbjct: 332 VTTLAGTGSM--GSANGAGTNASFYYP-NAVVADALGNLYIADTNNHLIRKI 380



 Score = 65.5 bits (158), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 41/111 (36%), Positives = 54/111 (48%), Gaps = 1/111 (0%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           ++   VV  LAG G  G +DG   +A F  P     D +GN+YVAD  N  IRKIT  GV
Sbjct: 657 ITPAGVVTTLAGSGDSGSADGTGTAASFHTPYDLTTDAQGNVYVADNFNQTIRKITREGV 716

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
                G S   G  +G A  A F N   +        + ++D+GN  IR+I
Sbjct: 717 VNTFAGTSGSSGFVNGTAAAAKFKNPIGIATDTQ-GNVYVADNGNLAIRKI 766



 Score = 63.5 bits (153), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 40/106 (37%), Positives = 53/106 (50%), Gaps = 1/106 (0%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           VV  LAG G  G  +G   +A F+ P S   DM+GN+YVAD  N +IRKIT  GV T   
Sbjct: 168 VVTTLAGSGSAGSVNGTGTAASFNTPYSLTTDMQGNVYVADYGNQLIRKITPAGVVTTLA 227

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G     G  +G    A F+    +        + ++D  NQ IR+I
Sbjct: 228 GTVGSSGFVNGTGTAAKFNYPRSVA-TDAAGNVYVADQVNQAIRKI 272



 Score = 62.0 bits (149), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 45/112 (40%), Positives = 60/112 (53%), Gaps = 4/112 (3%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
            ++   +V   AG G  G  +G   +A F  P     D +GN+YVAD  N+ IRKIT  G 
Sbjct: 1249 ITPAGLVTTFAGSGGIGALNGTGTAASFHNPTGITTDAQGNVYVADLYNNAIRKITPGGV 1308

Query: 135  VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            VTT+AG GS   G ADG   +ASF N   +     +  + + D  NQLIR+I
Sbjct: 1309 VTTLAGTGSI--GSADGVGTSASFYNPNAVA-TDAVGNIYVVDTYNQLIRKI 1357



 Score = 61.6 bits (148), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 40/113 (35%), Positives = 61/113 (53%), Gaps = 12/113 (10%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
            ++ ++VV  LAG+G  G ++G   +A F+ P     D  GN+YVAD  N++IRK+T+ G 
Sbjct: 1690 ITPDAVVTTLAGNGAAGSANGTGNAASFNHPWGIVADAAGNLYVADTYNNLIRKVTSAGS 1749

Query: 135  VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            VTT AG G+      DG    ASF+         +  A+ I   GN  + ++N
Sbjct: 1750 VTTFAGSGAASS--VDGTGTAASFN---------YPSAISIDASGNLYVAELN 1791



 Score = 61.6 bits (148), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 43/111 (38%), Positives = 61/111 (54%), Gaps = 1/111 (0%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
            ++   VV   AG G  G +DG   +A F+ P   A D +GNIYVAD  N VIRKIT  GV
Sbjct: 1140 ITPSGVVTTFAGSGSSGAADGIGMAASFNSPYGLATDAQGNIYVADFGNQVIRKITPDGV 1199

Query: 136  TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             T   G +   G  +G A  A F++ +++  V     + ++D  NQ+IR+I
Sbjct: 1200 VTTFAGTTGVAGNVNGAAAAAKFNSPYDVA-VDVTGNVYVADELNQVIRKI 1249



 Score = 60.8 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 44/113 (38%), Positives = 61/113 (53%), Gaps = 5/113 (4%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV  LAG  G  G+ +G   +A+F+ P+S A D  GN+YVAD+ N  IRKIT  G
Sbjct: 217 ITPAGVVTTLAGTVGSSGFVNGTGTAAKFNYPRSVATDAAGNVYVADQVNQAIRKITPAG 276

Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            VTT AG G    G  +G    A+F N   +T       + ++D  N  IR+I
Sbjct: 277 VVTTFAGSGVP--GALNGTGTAATFYNPTGVTMDAQ-GNVYVADSQNHSIRKI 326



 Score = 60.8 bits (146), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 45/113 (39%), Positives = 60/113 (53%), Gaps = 4/113 (3%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
            ++   VV  LAG    G +DG   +A F+ P   AVD  GN+YVAD  N ++RKIT  G 
Sbjct: 2130 ITAAGVVTTLAGKLTAGSADGVGAAASFNLPAGVAVDASGNVYVADLLNSMVRKITPDGT 2189

Query: 135  VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            VTT+A  GS   G ADG    A F     L  V     ++++D  N  IR+I+
Sbjct: 2190 VTTLA--GSTSAGSADGTGAAAGFHYPTNLQ-VDDQGNIIVADQLNNKIRKIS 2239



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 43/107 (40%), Positives = 60/107 (56%), Gaps = 5/107 (4%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           V   AG+ G+ G ++    +A F  P   AVD  GN+YVAD  N++IRKI+ +G V+T A
Sbjct: 60  VSTFAGNAGIAGNTNATGTAATFHSPFGVAVDASGNVYVADAGNNLIRKISPVGVVSTFA 119

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G G    G A+G    ASF+N F +        L +SD  + LIR+I
Sbjct: 120 GSGVA--GSANGTGTAASFNNPFGIATDVQ-GNLYVSDVNSNLIRKI 163



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 43/107 (40%), Positives = 58/107 (54%), Gaps = 5/107 (4%)

Query: 82   VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
            V  +AG  G+ G ++G   +A F  P     D  GN+Y+AD +N +IRKIT  G VTT A
Sbjct: 1037 VSTVAGSVGIAGKANGIGTAATFSGPSGVTTDASGNLYIADFNNRLIRKITPSGLVTTFA 1096

Query: 140  GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G G+   G  +G    ASF+N F LT       + +SD  N  IR+I
Sbjct: 1097 GSGAA--GSENGNGAAASFNNPFGLTTDAQ-GNIYVSDANNNTIRKI 1140



 Score = 58.2 bits (139), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 36/89 (40%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++ E VV   AG  G  G+ +G   +A+F  P   A D +GN+YVAD  N  IRKIT  G
Sbjct: 711 ITREGVVNTFAGTSGSSGFVNGTAAAAKFKNPIGIATDTQGNVYVADNGNLAIRKITPAG 770

Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDF 162
            VTT+AG G K        AQ   +  D+
Sbjct: 771 VVTTLAGSGFKDPFSVATDAQGNVYVMDY 799



 Score = 57.0 bits (136), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 50/84 (59%), Gaps = 3/84 (3%)

Query: 82   VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
            V   AG G     DG   +A F+ P + ++D  GN+YVA+ + +VIRKI+  G VTTIAG
Sbjct: 1750 VTTFAGSGAASSVDGTGTAASFNYPSAISIDASGNLYVAELNGNVIRKISPAGVVTTIAG 1809

Query: 141  GGSKKEGRADGPAQNASFSNDFEL 164
             G+   G A+G  + A+F N + +
Sbjct: 1810 SGA--SGIANGIGKAATFGNLYSI 1831



 Score = 54.7 bits (130), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 41/121 (33%), Positives = 66/121 (54%), Gaps = 7/121 (5%)

Query: 69   FYTLSFPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
            FY     ++ + +V   AG   G  G +DG   +A F  P+  A+D  GN++V +  N++
Sbjct: 1628 FYNQIRKITPDGLVTTFAGSLTGAPGATDGTGAAATFRSPRGMAIDALGNLFVVE-DNYL 1686

Query: 127  IRKIT-NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
            IRKIT +  VTT+AG G+   G A+G    ASF++ + +        L ++D  N LIR+
Sbjct: 1687 IRKITPDAVVTTLAGNGAA--GSANGTGNAASFNHPWGIV-ADAAGNLYVADTYNNLIRK 1743

Query: 186  I 186
            +
Sbjct: 1744 V 1744



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 30/60 (50%), Positives = 36/60 (60%), Gaps = 1/60 (1%)

Query: 100  SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
            +A F+ P   AVD  GNIYVAD+ N++IRKIT  GV T    G    G ADG    ASF+
Sbjct: 2100 AATFNHPTGLAVDAAGNIYVADQGNNMIRKITAAGVVTTL-AGKLTAGSADGVGAAASFN 2158



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 3/112 (2%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
            ++  +VV    G G  G ++G+  +A F  P   A+D  GN++VAD+  + IRKIT  G 
Sbjct: 1581 ITPATVVSTFVGTGSPGSTNGKGTAASFYVPYGMAIDAAGNLFVADQFYNQIRKITPDGL 1640

Query: 135  VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            VTT AG  +   G  DG    A+F +   +         ++ D  N LIR+I
Sbjct: 1641 VTTFAGSLTGAPGATDGTGAAATFRSPRGMAIDALGNLFVVED--NYLIRKI 1690



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 37/86 (43%), Positives = 48/86 (55%), Gaps = 4/86 (4%)

Query: 102  RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSN 160
            +F+ P   AVD+ GN+YVAD+ N VIRKIT  G VTT AG G    G  +G    ASF N
Sbjct: 1221 KFNSPYDVAVDVTGNVYVADELNQVIRKITPAGLVTTFAGSGGI--GALNGTGTAASFHN 1278

Query: 161  DFELTFVPHICALLISDHGNQLIRQI 186
               +T       + ++D  N  IR+I
Sbjct: 1279 PTGITTDAQ-GNVYVADLYNNAIRKI 1303



 Score = 50.8 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 43/123 (34%), Positives = 61/123 (49%), Gaps = 6/123 (4%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           VV   AG G     +G   +A F        D + N+YV D   ++IRKIT  G VTT+A
Sbjct: 610 VVSTFAGSGTAASVNGTGVAASFLHAYRLTTDAQSNVYVID--GNMIRKITPAGVVTTLA 667

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQS 199
           G G    G ADG    ASF   ++LT       + ++D+ NQ IR+I  +    + +  S
Sbjct: 668 GSGD--SGSADGTGTAASFHTPYDLTTDAQ-GNVYVADNFNQTIRKITREGVVNTFAGTS 724

Query: 200 GSA 202
           GS+
Sbjct: 725 GSS 727



 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 36/86 (41%), Positives = 47/86 (54%), Gaps = 4/86 (4%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           V  LAG  G  G ++G   +A F  P   A D  GNIYV+D +N++IRKI   G V+T A
Sbjct: 556 VSTLAGKAGSAGNANGTGTAATFSSPTGVATDPSGNIYVSDYNNNLIRKINLAGVVSTFA 615

Query: 140 GGGSKKEGRADGPAQNASFSNDFELT 165
           G G+       G A  ASF + + LT
Sbjct: 616 GSGTAASVNGTGVA--ASFLHAYRLT 639



 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 41/108 (37%), Positives = 59/108 (54%), Gaps = 4/108 (3%)

Query: 80   SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
            +VV   AG G  G  +    SA F+ P   AVD +GN YV D+ N+++RKIT  G V+T+
Sbjct: 2026 AVVTTFAGSGAAGSVNSTGTSATFNGPLDVAVDAEGNTYVLDQLNNLVRKITPAGVVSTL 2085

Query: 139  AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            AG GS         A  A+F++   L  V     + ++D GN +IR+I
Sbjct: 2086 AGSGSSGSANGAATA--ATFNHPTGLA-VDAAGNIYVADQGNNMIRKI 2130



 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 30/66 (45%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
            ++ +  V  LAG    G +DG   +A F  P +  VD +GNI VAD+ N+ IRKI+  G 
Sbjct: 2184 ITPDGTVTTLAGSTSAGSADGTGAAAGFHYPTNLQVDDQGNIIVADQLNNKIRKISPAGV 2243

Query: 135  VTTIAG 140
            VTTIAG
Sbjct: 2244 VTTIAG 2249



 Score = 48.1 bits (113), Expect = 0.004,   Method: Composition-based stats.
 Identities = 41/118 (34%), Positives = 57/118 (48%), Gaps = 29/118 (24%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++   VV  LAG G             F  P S A D +GN+YV D S  ++RKI   G 
Sbjct: 766 ITPAGVVTTLAGSG-------------FKDPFSVATDAQGNVYVMDYSTPILRKILPTGT 812

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
           VT +AG GS   G A+G    ++F       +VP+  A      + ++D GN LIR+I
Sbjct: 813 VTILAGDGSA--GSANGAGTVSNF-------YVPNALATDALGNIYVADAGNNLIRKI 861



 Score = 48.1 bits (113), Expect = 0.004,   Method: Composition-based stats.
 Identities = 27/51 (52%), Positives = 30/51 (58%)

Query: 81   VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            VV  LAG G  G +DG   SA F  P + A D  GNIYV D  N +IRKIT
Sbjct: 1308 VVTTLAGTGSIGSADGVGTSASFYNPNAVATDAVGNIYVVDTYNQLIRKIT 1358



 Score = 47.8 bits (112), Expect = 0.005,   Method: Composition-based stats.
 Identities = 33/89 (37%), Positives = 45/89 (50%), Gaps = 5/89 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIA 139
           V  LAGDG  G ++G    + F  P + A D  GNIYVAD  N++IRKIT  N  +T + 
Sbjct: 813 VTILAGDGSAGSANGAGTVSNFYVPNALATDALGNIYVADAGNNLIRKITTGNYSITPML 872

Query: 140 GGG---SKKEGRADGPAQNASFSNDFELT 165
             G    +  G   G    AS +  + +T
Sbjct: 873 PAGLNFDQSTGTISGTPTVASPATTYTIT 901



 Score = 47.8 bits (112), Expect = 0.005,   Method: Composition-based stats.
 Identities = 44/139 (31%), Positives = 63/139 (45%), Gaps = 17/139 (12%)

Query: 73   SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
            + P    S+V  + G+G  G  +G   +A  +       D+ GN++VAD  NH+IRKIT 
Sbjct: 1524 AVPAKTYSLVSTIVGNGSSGAVNGTGTAASLNLCDGLVFDLLGNMFVADFGNHMIRKITP 1583

Query: 133  LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN-----QLIRQI 186
               V+T  G GS   G  +G    ASF       +VP+   + I   GN     Q   QI
Sbjct: 1584 ATVVSTFVGTGSP--GSTNGKGTAASF-------YVPY--GMAIDAAGNLFVADQFYNQI 1632

Query: 187  NLKPEDCSKSSQSGSALGA 205
                 D   ++ +GS  GA
Sbjct: 1633 RKITPDGLVTTFAGSLTGA 1651



 Score = 44.3 bits (103), Expect = 0.056,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 34/61 (55%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
            +S   VV  +AG G  G ++G   +A F    S A D  G++YVAD+  ++IRKI   G 
Sbjct: 1798 ISPAGVVTTIAGSGASGIANGIGKAATFGNLYSIATDASGDVYVADQYKYIIRKIVGTGY 1857

Query: 136  T 136
            +
Sbjct: 1858 S 1858


>gi|329908045|ref|ZP_08274805.1| hypothetical protein IMCC9480_3485 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327546778|gb|EGF31712.1| hypothetical protein IMCC9480_3485 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 685

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 86/180 (47%), Gaps = 25/180 (13%)

Query: 25  GLLLEDGYTVTTVIDGHQLE---INPHSVIDRPGSSDLIVLDSSRSAFYTLSFP------ 75
           G+ +  G T    + G +LE   ++ +    R GS   +  D+S + +Y +  P      
Sbjct: 351 GVHVITGGTTLKTLAGKELERGMVDGNGAKARFGSLQGVTSDASGN-WYVVDTPHHMIRK 409

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++   VV +LAG+G  G  DG    A F+ P+    D  GN++VAD  N  IRKIT  G 
Sbjct: 410 ITPAGVVSQLAGNGKPGGVDGTGSGASFNYPRGIVADALGNLFVADTFNSRIRKITPAGV 469

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINL 188
           VTTIAG GS   G  DGP   A F       F P   A      L ++D GN  +R+I +
Sbjct: 470 VTTIAGAGS-SSGSTDGPGNIARF-------FEPEAIAIDASRNLYVADTGNHTVRKITV 521



 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 47/142 (33%), Positives = 71/142 (50%), Gaps = 9/142 (6%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFD 104
           P S+ IDR G+  L V D++      ++     + VV  +AG  GV G +DG   +ARF 
Sbjct: 120 PQSIAIDRAGT--LYVADTNNQTIRKIT----PQGVVTTIAGRVGVDGSTDGRGNAARFS 173

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
            P+  AVD+ G + V+D  NH +R I+  GV     G +   G  DG    A FSN   L
Sbjct: 174 YPQGIAVDVAGTVLVSDTYNHTVRTISPGGVVGTLAGSAGNFGVLDGVRSAARFSNPQGL 233

Query: 165 TFVPHICALLISDHGNQLIRQI 186
                   + ++D GN ++R++
Sbjct: 234 -VTDAARNIYVADAGNGVLRKV 254



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/113 (38%), Positives = 58/113 (51%), Gaps = 2/113 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S +  V  LAG     G +DG   +ARF+ P+S A+D  G +YVAD +N  IRKIT  G
Sbjct: 89  VSADGSVSTLAGAMSTPGAADGTGTAARFNYPQSIAIDRAGTLYVADTNNQTIRKITPQG 148

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           V T   G    +G  DG    A FS    +  V     +L+SD  N  +R I+
Sbjct: 149 VVTTIAGRVGVDGSTDGRGNAARFSYPQGIA-VDVAGTVLVSDTYNHTVRTIS 200



 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 53/105 (50%), Gaps = 6/105 (5%)

Query: 60  IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDM 113
           IV D+  + F   +F      ++   VV  +AG G   G +DG    ARF +P++ A+D 
Sbjct: 443 IVADALGNLFVADTFNSRIRKITPAGVVTTIAGAGSSSGSTDGPGNIARFFEPEAIAIDA 502

Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
             N+YVAD  NH +RKIT  GV +   G   K G  DG    A F
Sbjct: 503 SRNLYVADTGNHTVRKITVAGVVSTLAGSPGKYGSDDGTGAAARF 547



 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 45/135 (33%), Positives = 65/135 (48%), Gaps = 7/135 (5%)

Query: 58  DLIVLDSSRSAFYTLSF-----PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAV 111
           + I +D+SR+ +   +       ++   VV  LAG  G  G  DG   +ARF KP + AV
Sbjct: 496 EAIAIDASRNLYVADTGNHTVRKITVAGVVSTLAGSPGKYGSDDGTGAAARFLKPPAIAV 555

Query: 112 DMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC 171
           D  GN+ ++  +   IRKIT  GV T   G   + G  DG    A F N   L    +  
Sbjct: 556 DPSGNVVLSQPAYGTIRKITPGGVVTTLAGRVLETGALDGAGDAARFFNPQGLA-ADNAG 614

Query: 172 ALLISDHGNQLIRQI 186
            + ++D GN  IR+I
Sbjct: 615 NVYVADTGNNTIRRI 629



 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 44/86 (51%), Gaps = 1/86 (1%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
           ARF+ P   AVD  GN YVAD  NHVIRK++  G  +   G     G ADG    A F N
Sbjct: 60  ARFNFPIGIAVDGVGNRYVADWQNHVIRKVSADGSVSTLAGAMSTPGAADGTGTAARF-N 118

Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
             +   +     L ++D  NQ IR+I
Sbjct: 119 YPQSIAIDRAGTLYVADTNNQTIRKI 144



 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 40/84 (47%), Gaps = 1/84 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S   VV  LAG  G  G  DG   +ARF  P+    D   NIYVAD  N V+RK+T  G
Sbjct: 199 ISPGGVVGTLAGSAGNFGVLDGVRSAARFSNPQGLVTDAARNIYVADAGNGVLRKVTPAG 258

Query: 135 VTTIAGGGSKKEGRADGPAQNASF 158
           + T   G     G  DG    A F
Sbjct: 259 IVTTLAGSLANYGLKDGTGAAAGF 282



 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 3/78 (3%)

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
           S+ A+ T+   ++   VV  LAG  ++ G  DG   +ARF  P+  A D  GN+YVAD  
Sbjct: 564 SQPAYGTIR-KITPGGVVTTLAGRVLETGALDGAGDAARFFNPQGLAADNAGNVYVADTG 622

Query: 124 NHVIRKITNLG-VTTIAG 140
           N+ IR+IT  G V+T+AG
Sbjct: 623 NNTIRRITPSGQVSTVAG 640


>gi|408823744|ref|ZP_11208634.1| NHL repeat-containing protein [Pseudomonas geniculata N1]
          Length = 681

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 58/152 (38%), Positives = 75/152 (49%), Gaps = 11/152 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           +DG  L+   N  S I      +L V D+   A   +      +  V  LAG G QGY+D
Sbjct: 102 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGYAD 156

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQ 154
           G    ARFD P   AVD +G +YVAD  N  IR I T+  V T+AGG  ++ G ADG   
Sbjct: 157 GPAAQARFDAPMGIAVDAQGQVYVADTYNDRIRVIGTDGNVRTLAGG--ERPGMADGVGA 214

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            A F     L F     ALL++D  N  +R+I
Sbjct: 215 AARFDTPVALAFDAQ-GALLVADLFNNAVRRI 245



 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 58/106 (54%), Gaps = 6/106 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V+ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I   G VTT+AG
Sbjct: 91  VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGTDGQVTTLAG 149

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G   ++G ADGPA  A F     +  V     + ++D  N  IR I
Sbjct: 150 G---EQGYADGPAAQARFDAPMGIA-VDAQGQVYVADTYNDRIRVI 191



 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 59/114 (51%), Gaps = 6/114 (5%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK--SNHVIRKITN 132
           PL+  + ++ LAGDG  G  DG    ARF  P +      G++Y  D   +N + R++ +
Sbjct: 29  PLAWTAQIEPLAGDGHPGNRDGAAAQARFADPYALLRSADGSVYFTDAGDNNRIRRRLPD 88

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             V T+AG G   EGR DGPA  ASF+    +        L ++D GN  IR+I
Sbjct: 89  GRVETVAGQG---EGRVDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRI 138


>gi|225872818|ref|YP_002754275.1| hypothetical protein ACP_1177 [Acidobacterium capsulatum ATCC
           51196]
 gi|225794459|gb|ACO34549.1| hypothetical protein ACP_1177 [Acidobacterium capsulatum ATCC
           51196]
          Length = 855

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 49/140 (35%), Positives = 73/140 (52%), Gaps = 7/140 (5%)

Query: 49  SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKP 106
           S +   GS +L + D++ +        +S   VV  +AG+G +GYS DG    SA  D P
Sbjct: 82  SAVAYDGSGNLYIADTNNNVIR----EVSTTGVVTTIAGNGEEGYSGDGAAATSAMLDTP 137

Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
              AVD  GNIY+AD  N+ IR+++N  + T+AG G+      DG A  ++  +D     
Sbjct: 138 TGIAVDSNGNIYIADSHNNRIREVSNGIINTVAGNGTAGY-SGDGAAATSAMLDDPTAVA 196

Query: 167 VPHICALLISDHGNQLIRQI 186
           V     + I+D GNQ IR +
Sbjct: 197 VDASGNIYIADTGNQRIRNV 216



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/107 (38%), Positives = 59/107 (55%), Gaps = 6/107 (5%)

Query: 85  LAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG G  GYS   G   SA F  P + A D  GN+Y+AD +N+VIR+++  G VTTIAG 
Sbjct: 58  VAGTGSAGYSGNGGAATSAAFASPSAVAYDGSGNLYIADTNNNVIREVSTTGVVTTIAGN 117

Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G  +EG   DG A  ++  +      V     + I+D  N  IR+++
Sbjct: 118 G--EEGYSGDGAAATSAMLDTPTGIAVDSNGNIYIADSHNNRIREVS 162



 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 2/66 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           ++  +AG+G  GYS DG    SA  D P + AVD  GNIY+AD  N  IR +    + T+
Sbjct: 165 IINTVAGNGTAGYSGDGAAATSAMLDDPTAVAVDASGNIYIADTGNQRIRNVAAGTIHTV 224

Query: 139 AGGGSK 144
           AG G +
Sbjct: 225 AGNGEE 230



 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 60/111 (54%), Gaps = 5/111 (4%)

Query: 81  VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
            +  +AG+G +GYS DG   ++   D P   AVD  GNIY+AD  N+ IR+++   + T+
Sbjct: 220 TIHTVAGNGEEGYSGDGAAAASAELDTPTGIAVDSSGNIYIADSHNNRIREVSGGVINTV 279

Query: 139 AGGGSKK-EGRADGPAQNASFSNDFELTFVPHICA--LLISDHGNQLIRQI 186
           AG G+    G   G   +A+ +   + T V    A  + I+D  N+ +R+I
Sbjct: 280 AGSGAVTFPGSYSGDGGSATAATLAKPTGVALDAAGHVYIADTNNERLREI 330



 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 9/124 (7%)

Query: 81  VVKRLAGDGV---QGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           V+  +AG G     G   G+ GSA      KP   A+D  G++Y+AD +N  +R+I N  
Sbjct: 275 VINTVAGSGAVTFPGSYSGDGGSATAATLAKPTGVALDAAGHVYIADTNNERLREIANGV 334

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           + T+AG G +     DG A  ++  ND     V    ++ ++D  N+ +R   L P   +
Sbjct: 335 IATVAGNGQQGY-SGDGAAATSAALNDPRNASVNASGSVAVADTLNERVR--GLTPPTLT 391

Query: 195 KSSQ 198
            +SQ
Sbjct: 392 YASQ 395


>gi|223939820|ref|ZP_03631690.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223891508|gb|EEF57999.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 805

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 54/144 (37%), Positives = 78/144 (54%), Gaps = 16/144 (11%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSF 109
           ID+ G  +L+V D+  S    ++ P+    VV  LAG     G +DG    ARF+ P+  
Sbjct: 358 IDKAG--NLLVADTQNSEIRKIA-PVGTNYVVTTLAGFAQNVGGADGTNAVARFNSPRGI 414

Query: 110 AVDMKGNIYVADKSNHVIRKI----TNLGVTTIAGGGSKKEGRA---DGPAQNASFSNDF 162
           AVD  GN++VAD++N+ IRK+    TN  VTTIAG    + G A   DG   NA F+   
Sbjct: 415 AVDAAGNVFVADQNNNTIRKLTPVGTNWAVTTIAG----QAGMAYYGDGNGTNAYFNYPA 470

Query: 163 ELTFVPHICALLISDHGNQLIRQI 186
            +  V     L ++D GN +IR++
Sbjct: 471 GIA-VDASGNLFVTDAGNHVIRKL 493



 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 55/145 (37%), Positives = 73/145 (50%), Gaps = 26/145 (17%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMK 114
           S  L V DSS ++   ++ P+    VV  +AG   V G +DG  G A F++P   AVD  
Sbjct: 187 SGHLFVGDSSNNSIRKMT-PVGTNWVVTTIAGSSPVSGSNDGTNGFAHFNQPCGLAVDAA 245

Query: 115 GNIYVADKSNHVIRKITNLG----VTTIAGGGSKKEGRAD---GPAQNASFSNDFELTFV 167
           G+I+VAD  N  IRKIT+ G    VTTIAG    K G AD   G   NA F+        
Sbjct: 246 GSIFVADYFNSTIRKITSAGTNWLVTTIAG----KVGVADSAEGTGTNAVFN-------Y 294

Query: 168 PHICA------LLISDHGNQLIRQI 186
           PH  A      + ++D  N  IR++
Sbjct: 295 PHAVAVDTNGNVFVADSENYTIRKL 319



 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 45/112 (40%), Positives = 56/112 (50%), Gaps = 18/112 (16%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI----TNLGVTTIAG 140
           +AG    G +DG    ARF+ P++ A D  GNIYVAD  NHVIRK+    TN  VTTIA 
Sbjct: 42  IAGVASIGSNDGTNRVARFNNPEATASDSAGNIYVADSGNHVIRKMAAVGTNWVVTTIA- 100

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
           G     G ADG    A F       + P   A      + ++D  N  IR+I
Sbjct: 101 GLPGNPGSADGTNSAAQF-------YYPAAVAVDNSGNVFVADSANYTIRKI 145



 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/136 (36%), Positives = 69/136 (50%), Gaps = 8/136 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMK 114
           S ++ V DS+      ++ PL    VV  +AG  G  G SDG    A F  P+  A D  
Sbjct: 129 SGNVFVADSANYTIRKIA-PLGTNWVVTTIAGVPGYHGSSDGTNTDALFFYPEGLAFDSS 187

Query: 115 GNIYVADKSNHVIRKITNLG----VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
           G+++V D SN+ IRK+T +G    VTTIA G S   G  DG    A F+    L  V   
Sbjct: 188 GHLFVGDSSNNSIRKMTPVGTNWVVTTIA-GSSPVSGSNDGTNGFAHFNQPCGLA-VDAA 245

Query: 171 CALLISDHGNQLIRQI 186
            ++ ++D+ N  IR+I
Sbjct: 246 GSIFVADYFNSTIRKI 261



 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 45/117 (38%), Positives = 59/117 (50%), Gaps = 19/117 (16%)

Query: 81  VVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----V 135
           VV  +AG  G  G +DG   +A+F  P + AVD  GN++VAD +N+ IRKI  LG    V
Sbjct: 95  VVTTIAGLPGNPGSADGTNSAAQFYYPAAVAVDNSGNVFVADSANYTIRKIAPLGTNWVV 154

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
           TTIA G     G +DG   +A F       F P   A      L + D  N  IR++
Sbjct: 155 TTIA-GVPGYHGSSDGTNTDALF-------FYPEGLAFDSSGHLFVGDSSNNSIRKM 203



 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 49/98 (50%), Gaps = 6/98 (6%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTTIAGGGSKKEG 147
           G+ DG    ARF  P   A+D    +YV D+ N+ IRK+T LG    VTT+A G     G
Sbjct: 630 GFMDGTNSDARFAYPTGIAIDTNDTLYVTDQGNNTIRKVTPLGTNWMVTTLA-GIHASTG 688

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
            ADG    A F+  F +  +     L ++D  N  IR+
Sbjct: 689 SADGAGSAALFNGPFGIA-IDKTGNLFVADLQNSSIRK 725



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 7/111 (6%)

Query: 81  VVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
           +V  +AG  GV   ++G   +A F+ P + AVD  GN++VAD  N+ IRK+T  G   + 
Sbjct: 269 LVTTIAGKVGVADSAEGTGTNAVFNYPHAVAVDTNGNVFVADSENYTIRKLTPSGTNYVV 328

Query: 140 G---GGSKKEGRADGPAQNASFSNDFELTF-VPHICALLISDHGNQLIRQI 186
               G ++     DG    A F   F L   +     LL++D  N  IR+I
Sbjct: 329 STPIGQTRASNSTDGTNNAARFW--FLLGISIDKAGNLLVADTQNSEIRKI 377



 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 75/162 (46%), Gaps = 11/162 (6%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFD 104
           PH+V +D  G  ++ V DS       L+ P     VV    G       +DG   +ARF 
Sbjct: 295 PHAVAVDTNG--NVFVADSENYTIRKLT-PSGTNYVVSTPIGQTRASNSTDGTNNAARFW 351

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKI----TNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
                ++D  GN+ VAD  N  IRKI    TN  VTT+A G ++  G ADG    A F++
Sbjct: 352 FLLGISIDKAGNLLVADTQNSEIRKIAPVGTNYVVTTLA-GFAQNVGGADGTNAVARFNS 410

Query: 161 DFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
              +  V     + ++D  N  IR++     + + ++ +G A
Sbjct: 411 PRGIA-VDAAGNVFVADQNNNTIRKLTPVGTNWAVTTIAGQA 451



 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 8/114 (7%)

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI----TNLGVTTIAGGGSKKEGRA 149
           +DG   +ARF       VD  GN++VAD +N +IRKI    TN   TTIAG  +  +  A
Sbjct: 516 TDGTNANARFFIVDGITVDAAGNLFVADNNNCLIRKIAPVGTNWITTTIAGKLNSYD-FA 574

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
           DG   N  F+    +  V     + + D GN ++R+  L P   +  S + +A 
Sbjct: 575 DGVGTNILFNQPTGIA-VGKGGVVYVVDMGNNMVRK--LTPNGTNYVSSTVAAF 625


>gi|254522386|ref|ZP_05134441.1| NHL repeat containing protein [Stenotrophomonas sp. SKA14]
 gi|219719977|gb|EED38502.1| NHL repeat containing protein [Stenotrophomonas sp. SKA14]
          Length = 643

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 58/153 (37%), Positives = 76/153 (49%), Gaps = 11/153 (7%)

Query: 37  VIDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS 94
           +IDG  L+   N  S I      +L V D+   A   +      +  V  LAG GVQG+ 
Sbjct: 63  LIDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGI----DGQVTTLAG-GVQGHG 117

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPA 153
           DG    ARFD P   AVD +G +YVAD  N  IR I T+  V T+AGG  ++ G ADG  
Sbjct: 118 DGPVAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGG--ERPGLADGAG 175

Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             A F     L F     ALL++D  N  +R++
Sbjct: 176 DAARFDTPVALAFDAQ-GALLVADLFNNAVRRV 207



 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/106 (39%), Positives = 55/106 (51%), Gaps = 6/106 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V+ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I   G VTT+AG
Sbjct: 53  VETVAGQG-EGLIDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGIDGQVTTLAG 111

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G    +G  DGP   A F     +  V     + ++D  N  IR I
Sbjct: 112 G---VQGHGDGPVAQARFDAPMGIA-VDAQGQVYVADTFNDRIRVI 153



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 53/106 (50%), Gaps = 6/106 (5%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD--KSNHVIRKITNLGVTTIAGGG 142
           +AGDG  G  DG    ARF  P +      G++Y  D   +N + R++ +  V T+AG G
Sbjct: 1   MAGDGHPGDRDGASAQARFADPYALLRGADGSVYFTDAGDNNRIRRRLPDGRVETVAGQG 60

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
              EG  DGPA  ASF+    +        L ++D GN  IR+I +
Sbjct: 61  ---EGLIDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRIGI 102


>gi|171913873|ref|ZP_02929343.1| NHL repeat containing protein [Verrucomicrobium spinosum DSM 4136]
          Length = 370

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/130 (36%), Positives = 71/130 (54%), Gaps = 7/130 (5%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI--- 130
           ++ +  ++ +AG+G +GY+ DG P +A  FD P     D  GN Y+AD SNH IRK+   
Sbjct: 84  VTPDGKIETMAGNGKKGYTGDGGPATAASFDLPHEIRFDKAGNYYIADMSNHAIRKVDAK 143

Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
           T + +TT+AG G       DGP++ A       + F P    L I D GN +IR+I++K 
Sbjct: 144 TGI-ITTLAGTGRGGYTGDDGPSEKAELKQPHSIQFGPD-GDLYICDIGNHVIRKIDMKT 201

Query: 191 EDCSKSSQSG 200
              S  + +G
Sbjct: 202 GKISTFAGTG 211



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 58/117 (49%), Gaps = 6/117 (5%)

Query: 77  SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           + E  +   AG GV+G+S DG P + A+ D P        G I+  + +   IR++T  G
Sbjct: 29  AAEWTISTFAGTGVKGFSGDGGPATQAQMDNPFGVVRGPDGAIWYTEYTGQRIRRVTPDG 88

Query: 135 -VTTIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            + T+AG G KK    D GPA  ASF    E+ F        I+D  N  IR+++ K
Sbjct: 89  KIETMAGNG-KKGYTGDGGPATAASFDLPHEIRF-DKAGNYYIADMSNHAIRKVDAK 143



 Score = 37.4 bits (85), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 4/74 (5%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           +  ++  +AG G  G++ +G P   A    PK  A+D +GN ++AD  +H +R I     
Sbjct: 257 KAGIIHHIAGTGASGFTGNGGPAKLATLKGPKGIALDAEGNAWLADTESHTVRVIMAKTG 316

Query: 136 T--TIAGGGSKKEG 147
           T   +AG G K +G
Sbjct: 317 TLELLAGTGEKGDG 330


>gi|374606289|ref|ZP_09679172.1| copper amine oxidase domain-containing protein [Paenibacillus
           dendritiformis C454]
 gi|374388103|gb|EHQ59542.1| copper amine oxidase domain-containing protein [Paenibacillus
           dendritiformis C454]
          Length = 564

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/138 (36%), Positives = 73/138 (52%), Gaps = 13/138 (9%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           L +   V  +AG G+ G+ DG+   ARF++P+  AV   G++YVAD  NHVIR+I  N  
Sbjct: 165 LDKSGRVTTVAGSGLAGWKDGKGAEARFNEPRDVAVAEDGSLYVADALNHVIRRIDANGN 224

Query: 135 VTTI----------AGGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
           VTT+          A G     G  ADG    + F+    L F P    L++SD GNQ +
Sbjct: 225 VTTLNARSKRIVEYAPGAVTAAGDYADGKLMVSKFNEPSSLAFTPS-GDLVVSDTGNQRL 283

Query: 184 RQINLKPEDCSKSSQSGS 201
           R ++LK +  +  + +GS
Sbjct: 284 RLVDLKQKRVTTLAGAGS 301



 Score = 67.0 bits (162), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 58/96 (60%), Gaps = 4/96 (4%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADG 151
           ++DG+   ARF++P   A D +GN+Y+AD +NHVIRK+   G VTT+AG G    G  DG
Sbjct: 128 WADGKGTEARFNEPMGMAEDRQGNLYIADAANHVIRKLDKSGRVTTVAGSGLA--GWKDG 185

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
               A F+   ++  V    +L ++D  N +IR+I+
Sbjct: 186 KGAEARFNEPRDVA-VAEDGSLYVADALNHVIRRID 220



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/142 (33%), Positives = 73/142 (51%), Gaps = 14/142 (9%)

Query: 68  AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           A Y+  FP +      RL   G  GY DG+   A F+ P   A+  +G I VAD+ NH I
Sbjct: 303 ASYSYKFPDT------RLYAAG--GYRDGKASEALFNGPAGIALTGEGGIVVADRWNHAI 354

Query: 128 RKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN---QLIR 184
           R + +  V T++GGG  + G  +G A+ A+F    ++  + +   + ++D  N   +LIR
Sbjct: 355 RYLYDGKVYTLSGGG--RTGHQNGWAEQATFREPMDVAVLSN-GTIAVADGFNNSIRLIR 411

Query: 185 QINLKPEDCSKSSQSGSALGAV 206
           +  L  E  S ++ S SA  AV
Sbjct: 412 RYTLPEEVRSPAATSPSAQDAV 433



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 55/121 (45%), Gaps = 11/121 (9%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           PL  E++    AG GV G +DG    A+F  P        G + +AD  NH+IR+    G
Sbjct: 49  PLWYETI--DWAGSGVSGSADGRAEQAQFRYPAGLLAGKNGALLIADTYNHLIRRADAAG 106

Query: 135 -VTTIAG----GGSKKE---GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            V+T+AG        +E     ADG    A F+    +        L I+D  N +IR++
Sbjct: 107 QVSTLAGQVRFAAETREPNGSWADGKGTEARFNEPMGMA-EDRQGNLYIADAANHVIRKL 165

Query: 187 N 187
           +
Sbjct: 166 D 166


>gi|409204160|ref|ZP_11232357.1| putative outer membrane adhesin-like protein [Pseudoalteromonas
           flavipulchra JG1]
          Length = 2380

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 68/131 (51%), Gaps = 5/131 (3%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           ++DL + D +  A       ++  SVV   AG G  G ++G    A F+KP   A+D   
Sbjct: 735 NNDLFIADRNNHAIR----KVTSASVVTTFAGTGSAGSANGTGTQASFNKPYDIALDSAD 790

Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           N+YV++++ H IRKIT+ GV T   G +   G  DG A  A FS  F +  V     + +
Sbjct: 791 NLYVSEQTGHTIRKITSAGVVTTFAGSAGVSGSTDGLASVARFSQPFGIA-VDSNGIVYV 849

Query: 176 SDHGNQLIRQI 186
           +D GN  IR+I
Sbjct: 850 ADTGNHRIRKI 860



 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/113 (40%), Positives = 60/113 (53%), Gaps = 12/113 (10%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++ E VV  LAG G  G  +G   +A F+ PK+  VD  GN+YVAD SN+ IRKIT  G 
Sbjct: 588 ITPEGVVTTLAGSGTAGSDEGTGSAASFNFPKAVTVDASGNVYVADSSNNKIRKITPAGV 647

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           VTT AG GS   G  DG    A+F+        P    + I  +GN  + + N
Sbjct: 648 VTTFAGSGSP--GSTDGTGTAATFA-------AP--TGITIDSNGNLFVVETN 689



 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/116 (43%), Positives = 63/116 (54%), Gaps = 5/116 (4%)

Query: 72  LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LS   S E+ V  LAG    GY+DG   +A F  P   A D   N+YVAD SNHVIRKIT
Sbjct: 531 LSLNTSTEATVSTLAGQS-SGYADGTGTAASFKSPYDVATDSNNNVYVADYSNHVIRKIT 589

Query: 132 NLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             G VTT+AG G+   G  +G    ASF+    +T V     + ++D  N  IR+I
Sbjct: 590 PEGVVTTLAGSGTA--GSDEGTGSAASFNFPKAVT-VDASGNVYVADSSNNKIRKI 642



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 60/131 (45%), Gaps = 5/131 (3%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S ++ V DSS +        ++   VV   AG G  G +DG   +A F  P    +D  G
Sbjct: 626 SGNVYVADSSNNKIR----KITPAGVVTTFAGSGSPGSTDGTGTAATFAAPTGITIDSNG 681

Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           N++V + + H++RKIT  GV T   G     G  D    +A F+  +      +   L I
Sbjct: 682 NLFVVETNPHIVRKITPAGVVTTFAGSKGSSGFTDATGTSAKFNFPYNGGSSSN-NDLFI 740

Query: 176 SDHGNQLIRQI 186
           +D  N  IR++
Sbjct: 741 ADRNNHAIRKV 751



 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 70/134 (52%), Gaps = 10/134 (7%)

Query: 60  IVLDSSRSAFYTLSFP-----LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           I +DS+ + F   + P     ++   VV   AG  G  G++D    SA+F+ P +     
Sbjct: 675 ITIDSNGNLFVVETNPHIVRKITPAGVVTTFAGSKGSSGFTDATGTSAKFNFPYNGGSSS 734

Query: 114 KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
             ++++AD++NH IRK+T+   VTT AG GS   G A+G    ASF+  +++  +     
Sbjct: 735 NNDLFIADRNNHAIRKVTSASVVTTFAGTGSA--GSANGTGTQASFNKPYDIA-LDSADN 791

Query: 173 LLISDHGNQLIRQI 186
           L +S+     IR+I
Sbjct: 792 LYVSEQTGHTIRKI 805


>gi|329908044|ref|ZP_08274804.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327546777|gb|EGF31711.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 642

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 53/112 (47%), Gaps = 1/112 (0%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           ++   VV +LAG G  G  DG    A F  PK    D  GN++VAD  N  IRKIT  GV
Sbjct: 369 ITPAGVVSQLAGSGKWGSVDGTGAGASFSAPKGIVADPLGNVFVADTYNSTIRKITPAGV 428

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            T   G     G  DGP   A FS   E   +     L + D GN  IR+I+
Sbjct: 429 VTTIAGAPSSTGSTDGPGNLARFSGP-EAIAIDAQRNLYVGDTGNHTIRKIS 479



 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/144 (36%), Positives = 71/144 (49%), Gaps = 11/144 (7%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARF 103
           +P S+ IDR G   L V DS       +S     + VV  LAG  G  G +DG   +ARF
Sbjct: 78  SPESIAIDRAGM--LYVADSVNHTIRKVS----PQGVVTTLAGRAGEPGSADGRGSAARF 131

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
             PK  AVD+ GN+ V+D +NH IRKI+  G  T   G ++     DG    A F+  F 
Sbjct: 132 FDPKGVAVDVAGNVVVSDNANHTIRKISPDGNVTTLAGSARNASALDGIGSAARFA--FP 189

Query: 164 LTFVPHICA-LLISDHGNQLIRQI 186
              V      + + D GN L+R+I
Sbjct: 190 QALVTDAARNVYVVDRGNGLLRKI 213



 Score = 64.7 bits (156), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 62/114 (54%), Gaps = 4/114 (3%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G  DG   +ARF  P+S A+D  G +YVAD  NH IRK++  GV T   G + + G ADG
Sbjct: 65  GAVDGRGAAARFYSPESIAIDRAGMLYVADSVNHTIRKVSPQGVVTTLAGRAGEPGSADG 124

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
               A F  D +   V     +++SD+ N  IR+I+    D + ++ +GSA  A
Sbjct: 125 RGSAARFF-DPKGVAVDVAGNVVVSDNANHTIRKIS---PDGNVTTLAGSARNA 174



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 58/114 (50%), Gaps = 3/114 (2%)

Query: 74  FPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           + ++  S  KRLAG   +    +G    ARF+ P +   D  GN YVAD  N++IRKIT 
Sbjct: 312 YVVTNGSTFKRLAGLAPELALVNGNGAQARFNLPWAVTSDATGNWYVADAGNYMIRKITP 371

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            GV +    GS K G  DG    ASFS    +   P +  + ++D  N  IR+I
Sbjct: 372 AGVVSQL-AGSGKWGSVDGTGAGASFSAPKGIVADP-LGNVFVADTYNSTIRKI 423



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/113 (35%), Positives = 53/113 (46%), Gaps = 4/113 (3%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S   VV  LAG  G  G  DG   +AR   P+S +VD  GN+YV   S   +RKIT  G
Sbjct: 478 ISTSGVVSTLAGSPGRYGSEDGTGAAARLASPRSMSVDQAGNVYVI--SYRAVRKITPAG 535

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           V T   G +   G  D   ++A F     LT       + +SD     IR+I+
Sbjct: 536 VVTTWAGQALAYGNVDAVGEDARFGYLLALT-ADAAGNVYVSDTAATTIRKID 587



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 1/77 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV  +AG     G +DG    ARF  P++ A+D + N+YV D  NH IRKI+  G
Sbjct: 423 ITPAGVVTTIAGAPSSTGSTDGPGNLARFSGPEAIAIDAQRNLYVGDTGNHTIRKISTSG 482

Query: 135 VTTIAGGGSKKEGRADG 151
           V +   G   + G  DG
Sbjct: 483 VVSTLAGSPGRYGSEDG 499



 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 47/88 (53%), Gaps = 2/88 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           ARF  P S  VD  GN Y+AD++N+ IRK+   G V+T+AG  +   G  DG    A F 
Sbjct: 18  ARFAYPGSMVVDSAGNRYIADQNNNQIRKVRPDGSVSTLAGASASTPGAVDGRGAAARFY 77

Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
           +  E   +     L ++D  N  IR+++
Sbjct: 78  SP-ESIAIDRAGMLYVADSVNHTIRKVS 104



 Score = 42.4 bits (98), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S +  V  LAG      + DG   +ARF  P++   D   N+YV D+ N ++RKIT  G
Sbjct: 158 ISPDGNVTTLAGSARNASALDGIGSAARFAFPQALVTDAARNVYVVDRGNGLLRKITPAG 217

Query: 135 -VTTIAGG 141
            VTT+A G
Sbjct: 218 IVTTLASG 225


>gi|392540748|ref|ZP_10287885.1| Ig family protein [Pseudoalteromonas piscicida JCM 20779]
          Length = 2384

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/116 (42%), Positives = 61/116 (52%), Gaps = 5/116 (4%)

Query: 72  LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LS   S E+ V  LAG    GY+DG   +A F  P     D  GN+YVAD  NHVIRKIT
Sbjct: 535 LSLNTSTEATVSTLAGQS-SGYADGTGTAASFKSPYDLVTDSNGNVYVADYGNHVIRKIT 593

Query: 132 NLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             G VTT+AG GS   G  DG    ASF+    +T       + ++D  N  IR++
Sbjct: 594 PEGVVTTLAGSGSA--GSDDGTGSAASFNFPKAVTL-DSSGNVYVADSSNNNIRKV 646



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 65/131 (49%), Gaps = 5/131 (3%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           ++DL + D +  A       ++  SVV   AG G  G ++G    A F+KP   A D   
Sbjct: 739 NNDLFIADRNNHAIR----KVTSASVVTTFAGTGSAGSTNGTGTQASFNKPYDVAADSAD 794

Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           N+YV +++ H IRKIT+ GV T   G +   G  DG    A FS  + +  V     + +
Sbjct: 795 NLYVTEQAAHTIRKITSTGVVTTYAGSAGASGNTDGLVSVARFSQPYGIA-VDSNDVVYV 853

Query: 176 SDHGNQLIRQI 186
           +D GN  IR+I
Sbjct: 854 ADTGNHRIRKI 864



 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 44/113 (38%), Positives = 60/113 (53%), Gaps = 12/113 (10%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++ E VV  LAG G  G  DG   +A F+ PK+  +D  GN+YVAD SN+ IRK+T  G 
Sbjct: 592 ITPEGVVTTLAGSGSAGSDDGTGSAASFNFPKAVTLDSSGNVYVADSSNNNIRKVTPAGV 651

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           VTT AG G+   G  DG    A+F+        P    + I  +GN  + + N
Sbjct: 652 VTTFAGSGTY--GSDDGTGTAATFA-------AP--TGITIDSNGNLYVVETN 693



 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 1/106 (0%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           VV   AG G  G  DG   +A F  P    +D  GN+YV + + H++RKIT  GV T   
Sbjct: 651 VVTTFAGSGTYGSDDGTGTAATFAAPTGITIDSNGNLYVVETNPHIVRKITPAGVVTTFA 710

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G     G  D    +A+F+  +      +   L I+D  N  IR++
Sbjct: 711 GSKNSSGFTDATGTSATFNFPYNGGSNSN-NDLFIADRNNHAIRKV 755



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/134 (28%), Positives = 67/134 (50%), Gaps = 10/134 (7%)

Query: 60  IVLDSSRSAFYTLSFP-----LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           I +DS+ + +   + P     ++   VV   AG     G++D    SA F+ P +   + 
Sbjct: 679 ITIDSNGNLYVVETNPHIVRKITPAGVVTTFAGSKNSSGFTDATGTSATFNFPYNGGSNS 738

Query: 114 KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
             ++++AD++NH IRK+T+   VTT AG GS   G  +G    ASF+  +++        
Sbjct: 739 NNDLFIADRNNHAIRKVTSASVVTTFAGTGSA--GSTNGTGTQASFNKPYDVA-ADSADN 795

Query: 173 LLISDHGNQLIRQI 186
           L +++     IR+I
Sbjct: 796 LYVTEQAAHTIRKI 809


>gi|428179226|gb|EKX48098.1| hypothetical protein GUITHDRAFT_69001 [Guillardia theta CCMP2712]
          Length = 334

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 60/175 (34%), Positives = 80/175 (45%), Gaps = 18/175 (10%)

Query: 14  LLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS 73
            ++LL      GL  EDG        G Q   N    I      DL + D          
Sbjct: 25  WITLLAGCEEDGL--EDGV-------GDQARFNSPVGIVESKEGDLYLCDQENHCIR--- 72

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
             ++ +  V  LAG+G +GY DG+   AR + P    +D +GNI  AD  N  IRK++  
Sbjct: 73  -KVTRKGEVTTLAGNGEEGYHDGKGKDARLNIPTGLCMDAQGNIIFADSGNQRIRKVSPD 131

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G VTTIAG    K+G  DGPA  A F N      V    ++ +SD GN  IR+I+
Sbjct: 132 GTVTTIAG---SKKGFKDGPAGKALF-NYPAYVAVDSKGSIFVSDFGNHCIRKID 182



 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 48/117 (41%), Positives = 67/117 (57%), Gaps = 5/117 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +S +  V  +AG   +G+ DG  G A F+ P   AVD KG+I+V+D  NH IRKI   G 
Sbjct: 128 VSPDGTVTTIAGS-KKGFKDGPAGKALFNYPAYVAVDSKGSIFVSDFGNHCIRKIDGEGM 186

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
           VTT+AG G  K G ADG    A F N  +   +     + I+D+GNQ IR+++ + E
Sbjct: 187 VTTVAGNG--KMGWADGKGAKARF-NSPQGMCIDKEDTVYIADYGNQRIRKMSKEGE 240



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 61/118 (51%), Gaps = 14/118 (11%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           + +E  +  LAG    G  DG    ARF+ P       +G++Y+ D+ NH IRK+T  G 
Sbjct: 20  IDKEGWITLLAGCEEDGLEDGVGDQARFNSPVGIVESKEGDLYLCDQENHCIRKVTRKGE 79

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQIN 187
           VTT+AG G  +EG  DG  ++A       L     +C      ++ +D GNQ IR+++
Sbjct: 80  VTTLAGNG--EEGYHDGKGKDA------RLNIPTGLCMDAQGNIIFADSGNQRIRKVS 129



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 60/110 (54%), Gaps = 4/110 (3%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           E +V  +AG+G  G++DG+   ARF+ P+   +D +  +Y+AD  N  IRK++  G V T
Sbjct: 184 EGMVTTVAGNGKMGWADGKGAKARFNSPQGMCIDKEDTVYIADYGNQRIRKMSKEGEVVT 243

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +AG G  + G A G  Q A F     ++ V     + + D  N  +R+I+
Sbjct: 244 VAGSG--EPGFAHGHGQLARFRGPRSVS-VSQDGIVYVGDRENFRVRKIS 290



 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +S+E  V  +AG G  G++ G    ARF  P+S +V   G +YV D+ N  +RKI+  G 
Sbjct: 235 MSKEGEVVTVAGSGEPGFAHGHGQLARFRGPRSVSVSQDGIVYVGDRENFRVRKISEDGY 294

Query: 135 VTTIAGGG 142
           V T AG G
Sbjct: 295 VWTFAGMG 302


>gi|223937730|ref|ZP_03629631.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223893523|gb|EEF59983.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 1474

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 89/181 (49%), Gaps = 34/181 (18%)

Query: 43   LEINPHSVIDRPGSSDLIVL----DSSRSAFYTLSFPL--SEESVVKRLAGDGVQGYSDG 96
            L  NP  +    G+S L+V     + + SA  T ++ L   E +VV+  AG GV G+++G
Sbjct: 1002 LYTNPFPI---SGNSTLVVKAWLSNYAPSALATATYSLLDYEPTVVQTYAGTGVPGFTNG 1058

Query: 97   EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQN 155
                A+F+ P+   VD  G ++V+D  N+VIRKI TN  VTT AG G    G  DG   N
Sbjct: 1059 PNLKAQFNAPQGICVDGLGTLFVSDTGNNVIRKISTNGVVTTFAGSG--VAGTHDGVGTN 1116

Query: 156  ASFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDC-------SKSSQSGSA 202
            ASF         P   A      L ++D GN LIR++   P+         S  SQ G A
Sbjct: 1117 ASF-------LAPTGIALDSSNNLYVADSGNSLIRKV--TPDGLVTTLTTFSDPSQQGPA 1167

Query: 203  L 203
            L
Sbjct: 1168 L 1168



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 54/104 (51%), Gaps = 7/104 (6%)

Query: 86   AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK 144
            AGD + GY DG    ARF +PK+ A    G+I V+D +   IRKI + G VTT+AG G  
Sbjct: 1252 AGD-LSGYMDGPRLLARFQRPKALATASDGSIIVSDWTR--IRKIHSDGRVTTLAGAG-- 1306

Query: 145  KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
              G  +G    A+F N      V     +  +D  N  IR+I++
Sbjct: 1307 DLGLRNGSGLFAAF-NQLGAVTVDSAGNIYAADAANHSIRKISV 1349



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 29/50 (58%)

Query: 82   VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            V  LAG G  G  +G    A F++  +  VD  GNIY AD +NH IRKI+
Sbjct: 1299 VTTLAGAGDLGLRNGSGLFAAFNQLGAVTVDSAGNIYAADAANHSIRKIS 1348


>gi|344205849|ref|YP_004790990.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
 gi|343777211|gb|AEM49764.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
          Length = 693

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 58/152 (38%), Positives = 76/152 (50%), Gaps = 11/152 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           +DG  L+   N  S I      +L V D+   A   +S     +  V  LAG G QG++D
Sbjct: 114 VDGPALQASFNTPSGIAADVQGNLYVADTGNHAIRRIS----TDGQVTTLAG-GEQGHAD 168

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQ 154
           G    ARFD P   AVD +G +YVAD  N  IR I T+  V T+AGG   + G ADG   
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGG--DRPGFADGVGT 226

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            A F     L F  H  ALL++D  N  +R++
Sbjct: 227 AARFDTPVALAFDAH-GALLVADLFNNAVRRV 257



 Score = 66.6 bits (161), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 60/106 (56%), Gaps = 6/106 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V+ +AG G +G  DG    A F+ P   A D++GN+YVAD  NH IR+I+  G VTT+AG
Sbjct: 103 VETVAGQG-EGRVDGPALQASFNTPSGIAADVQGNLYVADTGNHAIRRISTDGQVTTLAG 161

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G   ++G ADGPA  A F     +  V     + ++D  N  IR I
Sbjct: 162 G---EQGHADGPAAQARFDAPMGIA-VDAQGQVYVADTFNDRIRVI 203



 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 42/115 (36%), Positives = 60/115 (52%), Gaps = 6/115 (5%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK--SNHVIRKITN 132
           PL+  + ++ LAGDG  G  DG    ARF  P +      G+IY  D   +N + R++ +
Sbjct: 41  PLAWTAQIEPLAGDGHPGDRDGGSAQARFADPYALLRGADGSIYFTDAGDNNRIRRRLPD 100

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             V T+AG G   EGR DGPA  ASF+    +        L ++D GN  IR+I+
Sbjct: 101 GRVETVAGQG---EGRVDGPALQASFNTPSGIAADVQ-GNLYVADTGNHAIRRIS 151


>gi|255589882|ref|XP_002535117.1| nhl repeat-containing protein, putative [Ricinus communis]
 gi|223523999|gb|EEF27264.1| nhl repeat-containing protein, putative [Ricinus communis]
          Length = 435

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 50/133 (37%), Positives = 67/133 (50%), Gaps = 8/133 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           + +L V D+  +A   +      +  V  LAGDG+ G  DG+   A+F+ P   AVD  G
Sbjct: 130 AGNLYVADTGNNAIRKIG----PDGTVSTLAGDGLAGDKDGKGAGAQFNGPIGVAVDAAG 185

Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
            +YV D  N  IR+I   G VTT+AGG  K+ G ADG    A F     +       AL 
Sbjct: 186 VVYVTDTYNDRIRRIAPDGTVTTVAGG--KRAGMADGAGAQALFDTPTGIALGAS-GALY 242

Query: 175 ISDHGNQLIRQIN 187
           I+D GN  IR+I 
Sbjct: 243 IADTGNSAIRKIG 255



 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 48/130 (36%), Positives = 68/130 (52%), Gaps = 7/130 (5%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS-NHVIRKITNL 133
           P    + +  +AGDG+ G S+G     RF  P   A+D +GN+YVAD   N+ IRKI   
Sbjct: 37  PAGWHARITTMAGDGLPGSSNGSGKRTRFADPFGVALDRQGNLYVADGGDNNSIRKIDLD 96

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           GVTT   GG+  EG A+G    A+F+    L  +     L ++D GN  IR+I     D 
Sbjct: 97  GVTTTLAGGT--EGYAEGAPTAAAFNTPSGLA-IDAAGNLYVADTGNNAIRKIG---PDG 150

Query: 194 SKSSQSGSAL 203
           + S+ +G  L
Sbjct: 151 TVSTLAGDGL 160



 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 42/109 (38%), Positives = 59/109 (54%), Gaps = 5/109 (4%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           + V   LAG G +GY++G P +A F+ P   A+D  GN+YVAD  N+ IRKI   G V+T
Sbjct: 96  DGVTTTLAG-GTEGYAEGAPTAAAFNTPSGLAIDAAGNLYVADTGNNAIRKIGPDGTVST 154

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +AG G    G  DG    A F+    +  V     + ++D  N  IR+I
Sbjct: 155 LAGDG--LAGDKDGKGAGAQFNGPIGVA-VDAAGVVYVTDTYNDRIRRI 200



 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 34/65 (52%), Gaps = 1/65 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++ +  V  +AG    G +DG    A FD P   A+   G +Y+AD  N  IRKI   G 
Sbjct: 200 IAPDGTVTTVAGGKRAGMADGAGAQALFDTPTGIALGASGALYIADTGNSAIRKIGKDGT 259

Query: 135 VTTIA 139
           V+T+A
Sbjct: 260 VSTVA 264


>gi|299473428|emb|CBN77825.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 2120

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 38/75 (50%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK 144
            G G  GY+DG   +A F+ P+  AVD   N+YVAD  NH IR+I+  G VTT+AG G  
Sbjct: 205 GGSGEAGYADGSGTAALFNDPQDVAVDAHANVYVADTGNHRIRRISPEGVVTTVAGDG-- 262

Query: 145 KEGRADGPAQNASFS 159
           +EG  DG A  ASFS
Sbjct: 263 EEGSDDGDAMEASFS 277



 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 39/135 (28%), Positives = 55/135 (40%), Gaps = 25/135 (18%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN----IYVADKSNHVIRKIT 131
           +S E VV  +AGDG +G  DG+   A F  P   A+    +    +YVAD +NH +RKI+
Sbjct: 249 ISPEGVVTTVAGDGEEGSDDGDAMEASFSFPGGIALYYDSSEGLVLYVADTNNHRLRKIS 308

Query: 132 --------------------NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC 171
                                   T +A   + + G ADG    A F     L       
Sbjct: 309 GDIANGAGTVTCHAGRCGNGTESATRMAAEATPEAGFADGDGSYARFDGPSGLA-AAEDG 367

Query: 172 ALLISDHGNQLIRQI 186
            L ++D  N LIR +
Sbjct: 368 TLFVADTNNHLIRMV 382



 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 84  RLAGDGV--QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
           R+A +     G++DG+   ARFD P   A    G ++VAD +NH+IR +  N  V T+ G
Sbjct: 334 RMAAEATPEAGFADGDGSYARFDGPSGLAAAEDGTLFVADTNNHLIRMVLANSTVFTLTG 393

Query: 141 GGSKKEGRADG 151
           G    E  A G
Sbjct: 394 GLEGAEVEAGG 404



 Score = 44.3 bits (103), Expect = 0.052,   Method: Composition-based stats.
 Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 17/84 (20%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------LG------V 135
           GV G++DG   +ARF+ P   ++   G ++VAD   H +R+I+        LG      V
Sbjct: 415 GVAGHTDGNLTAARFNYPADVSLGTNGTLFVADL--HSLRRISMPENPTIVLGVGFDGRV 472

Query: 136 TTIAGGGSKKEGRADGPAQNASFS 159
           TT AGG   + G ADG    A FS
Sbjct: 473 TTAAGGA--EPGEADGTGPEARFS 494


>gi|380692925|ref|ZP_09857784.1| hypothetical protein BfaeM_02978 [Bacteroides faecis MAJ27]
          Length = 454

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 69/131 (52%), Gaps = 5/131 (3%)

Query: 59  LIVLDSSRSAFYTLSFPLSEES--VVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKG 115
           L ++   +   Y +++  +  +  + +  AGD G  GY+ G+   ARF++P +  +D +G
Sbjct: 323 LYIIARKKHCIYKVAYNAATHTFGIPELFAGDYGESGYASGKGTGARFNQPSTPCLDPEG 382

Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           N+ + DK NH IRKIT  G  T+  G  +  G  DG    A F     +TF  +  AL++
Sbjct: 383 NLLIPDKMNHCIRKITPEGEVTLYAGQPQTSGHTDGLPDKAKFYEPEAVTFSGN--ALIV 440

Query: 176 SDHGNQLIRQI 186
           +D GN  +R +
Sbjct: 441 ADRGNHCVRNV 451


>gi|347754243|ref|YP_004861807.1| gluconolactonase [Candidatus Chloracidobacterium thermophilum B]
 gi|347586761|gb|AEP11291.1| Gluconolactonase [Candidatus Chloracidobacterium thermophilum B]
          Length = 1222

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 45/113 (39%), Positives = 65/113 (57%), Gaps = 7/113 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           L+ +  V  LAGDGV+G  DG    ARF  P++ AVD  G +YVAD +   +R+I+  G 
Sbjct: 333 LTLDGAVTTLAGDGVEGTRDGVGKQARFKAPRAVAVDASGTVYVADDAR--VRRISPGGM 390

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           V TIAGG   + G  DGPA+ A F     L     +  L ++D GN+ +R+++
Sbjct: 391 VVTIAGG---EPGCVDGPAEAARFDTLSGLAL-DRVGNLYLADAGNRRLRKLS 439



 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/110 (39%), Positives = 58/110 (52%), Gaps = 4/110 (3%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           E VV   AGD   G+ DG   SAR + P   A+D +G +Y+AD  NH IRK+   G + T
Sbjct: 616 ELVVTVTAGDSTPGFLDGVGTSARLNHPVGLALDAQGVLYIADHFNHAIRKLLPDGRLVT 675

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +AGGG +  G  DG    A F+    L  V     L ++DH N  IR++ 
Sbjct: 676 LAGGGQR--GFQDGYGPEAQFNGPLGLA-VGRDGELYVADHLNMRIRKVT 722



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 44/76 (57%), Gaps = 5/76 (6%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG- 142
           LAG G +G+ DG    A+F+ P   AV   G +YVAD  N  IRK+T  G V+T+AG G 
Sbjct: 676 LAGGGQRGFQDGYGPEAQFNGPLGLAVGRDGELYVADHLNMRIRKVTPDGYVSTLAGTGI 735

Query: 143 SKKEGRADGPAQNASF 158
           SK E   DG    ASF
Sbjct: 736 SKIE---DGSVATASF 748



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 5/98 (5%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRAD 150
           GY+DG    A+F +P   A D +GN+YVAD   H IR+I+  G V T+AG G  + G  D
Sbjct: 242 GYADGPLTVAKFLRPNGIACDPQGNLYVADFGGHRIRQISVDGLVRTLAGSG--QPGNRD 299

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
                A F+    + +      L ++D  N  +R++ L
Sbjct: 300 DLGLLAEFNGPRGIAYAAGY--LYVADLNNASVRRLTL 335



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 54/113 (47%), Gaps = 9/113 (7%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++ +  V  LAG G+    DG   +A F+ PK  AVDM G +YV D     +R IT  G 
Sbjct: 721 VTPDGYVSTLAGTGISKIEDGSVATASFEGPKGVAVDMHGVVYVTDGVT--VRTITPDGE 778

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTF-VPHICALLISDHGNQLIRQI 186
           V T+AG   +  G  DG    A F   + +   V  +C   ++D  N  +R I
Sbjct: 779 VRTLAG---QVRGFRDGIGTRAMFGWAYAIAVDVSGLC--FVTDAANHAVRCI 826



 Score = 44.7 bits (104), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 14/151 (9%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++ +  V+ LAG  V+G+ DG    A F    + AVD+ G  +V D +NH +R I   G 
Sbjct: 773 ITPDGEVRTLAGQ-VRGFRDGIGTRAMFGWAYAIAVDVSGLCFVTDAANHAVRCIFPDGT 831

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           V T+ GGG          A+  +F N   +    H   L ++D  N  I ++       +
Sbjct: 832 VKTVFGGGE---------ARQLNFPNGLAVDVFGH---LYVADTNNHRILRLTPNGNGYT 879

Query: 195 KSSQSGSALGAVSVWVLVSVLSCLVSLVIGF 225
            S   G   G  +   L + L     +V+GF
Sbjct: 880 ASLVCGVRRGRQTGSALEAELDSPRGIVVGF 910



 Score = 40.4 bits (93), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 39/71 (54%), Gaps = 2/71 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +S   +V  +AG G  G  DG   +ARFD     A+D  GN+Y+AD  N  +RK++  G 
Sbjct: 385 ISPGGMVVTIAG-GEPGCVDGPAEAARFDTLSGLALDRVGNLYLADAGNRRLRKLSRDGQ 443

Query: 135 VTTIAGGGSKK 145
           V+T+  G   K
Sbjct: 444 VSTLPVGPDDK 454


>gi|386716891|ref|YP_006183217.1| NHL repeat protein [Stenotrophomonas maltophilia D457]
 gi|384076453|emb|CCH11034.1| NHL repeat protein [Stenotrophomonas maltophilia D457]
          Length = 652

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 57/152 (37%), Positives = 75/152 (49%), Gaps = 11/152 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           +DG  L+   N  S I      +L V D+   A   +S     +  V  LAG G QG++D
Sbjct: 73  VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIS----TDGQVTTLAG-GEQGHAD 127

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQ 154
           G    ARFD P   AVD +G +YVAD  N  IR I T+  V T+AGG   + G ADG   
Sbjct: 128 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGG--DRPGLADGVGA 185

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            A F     L F     ALL++D  N  +R++
Sbjct: 186 AARFDTPVALAFDAQ-GALLVADLFNNAVRRV 216



 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 59/106 (55%), Gaps = 6/106 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V+ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I+  G VTT+AG
Sbjct: 62  VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRISTDGQVTTLAG 120

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G   ++G ADGPA  A F     +  V     + ++D  N  IR I
Sbjct: 121 G---EQGHADGPAAQARFDAPMGIA-VDAQGQVYVADTFNDRIRVI 162



 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 59/114 (51%), Gaps = 6/114 (5%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD--KSNHVIRKITNL 133
           ++  + ++ LAGDG  G  DG    ARF  P +      G++Y  D   +N + R++ + 
Sbjct: 1   MAWTAQIEPLAGDGHPGDRDGASAQARFADPYALLRSADGSVYFTDAGDNNRIRRRLPDG 60

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            V T+AG G   EGR DGPA  ASF+    +        L ++D GN  IR+I+
Sbjct: 61  RVETVAGQG---EGRVDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRIS 110


>gi|456737758|gb|EMF62435.1| Hypothetical protein EPM1_0109 [Stenotrophomonas maltophilia EPM1]
          Length = 693

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/152 (36%), Positives = 74/152 (48%), Gaps = 11/152 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           +DG  L+   N  S I      +L V D+   A   +      +  V  LAG G QG++D
Sbjct: 114 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGHAD 168

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQ 154
           G    ARFD P   AVD +G +YVAD  N  IR I T+  V T+AGG   + G ADG   
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGG--DRPGLADGVGA 226

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            A F     L F     ALL++D  N  +R++
Sbjct: 227 TARFDTPVALAFDAQ-GALLVADLFNNAVRRV 257



 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 58/106 (54%), Gaps = 6/106 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V+ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I   G VTT+AG
Sbjct: 103 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGTDGQVTTLAG 161

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G   ++G ADGPA  A F     +  V     + ++D  N  IR I
Sbjct: 162 G---EQGHADGPAAQARFDAPMGIA-VDAQGQVYVADTFNDRIRVI 203



 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 59/114 (51%), Gaps = 6/114 (5%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK--SNHVIRKITN 132
           PL+  + ++ LAGDG  G  DG    ARF  P +      G++Y  D   +N + R++ +
Sbjct: 41  PLAWTAQIEPLAGDGHPGDRDGASAQARFADPYALLRSADGSVYFTDAGDNNRIRRRLPD 100

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             V T+AG G   EGR DGPA  ASF+    +        L ++D GN  IR+I
Sbjct: 101 GRVETVAGQG---EGRVDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRI 150


>gi|424666862|ref|ZP_18103887.1| hypothetical protein A1OC_00420 [Stenotrophomonas maltophilia
           Ab55555]
 gi|401069531|gb|EJP78052.1| hypothetical protein A1OC_00420 [Stenotrophomonas maltophilia
           Ab55555]
          Length = 693

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/152 (36%), Positives = 74/152 (48%), Gaps = 11/152 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           +DG  L+   N  S I      +L V D+   A   +      +  V  LAG G QG++D
Sbjct: 114 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGI----DGQVTTLAG-GEQGHAD 168

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQ 154
           G    ARFD P   AVD +G +YVAD  N  IR I T+  V T+AGG   + G ADG   
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGG--DRPGLADGVGA 226

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            A F     L F     ALL++D  N  +R++
Sbjct: 227 AARFDTPVALAFDAQ-GALLVADLFNNAVRRV 257



 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 58/106 (54%), Gaps = 6/106 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V+ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I   G VTT+AG
Sbjct: 103 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGIDGQVTTLAG 161

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G   ++G ADGPA  A F     +  V     + ++D  N  IR I
Sbjct: 162 G---EQGHADGPAAQARFDAPMGIA-VDAQGQVYVADTFNDRIRVI 203



 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 60/116 (51%), Gaps = 6/116 (5%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK--SNHVIRKITN 132
           PL+  + ++ LAGDG  G  DG    ARF  P +      G++Y  D   +N + R++ +
Sbjct: 41  PLAWTAQIEPLAGDGHPGDRDGASAQARFADPYALLRSADGSVYFTDAGDNNRIRRRLPD 100

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
             V T+AG G   EGR DGPA  ASF+    +        L ++D GN  IR+I +
Sbjct: 101 GRVETVAGQG---EGRVDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRIGI 152


>gi|421861496|ref|ZP_16293498.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
 gi|410828922|dbj|GAC43935.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
          Length = 575

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 49/142 (34%), Positives = 71/142 (50%), Gaps = 21/142 (14%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           L +   V  +AG G+ G+ DG    ARF++P+  AV   G++YVAD  NHV+R+I  N  
Sbjct: 176 LDKSGRVTTVAGSGLAGWRDGTGSEARFNEPRDVAVAEDGSLYVADALNHVLRRIDANGN 235

Query: 135 VTT---------------IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHG 179
           VTT               +A GG      ADG    + F+    L F      L++SD G
Sbjct: 236 VTTLNARSKRIVEYAPGAVAAGGDY----ADGKLGESKFNEPSSLAFTSS-GDLVVSDTG 290

Query: 180 NQLIRQINLKPEDCSKSSQSGS 201
           NQ +R +NLK +  +  + +GS
Sbjct: 291 NQRLRLVNLKQKYVTTLAGAGS 312



 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/96 (38%), Positives = 57/96 (59%), Gaps = 4/96 (4%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADG 151
           ++DG+   ARF++P   A D +GN+Y+AD  NHVIRK+   G VTT+AG G    G  DG
Sbjct: 139 WTDGKGTEARFNQPMGMAEDRQGNLYIADAGNHVIRKLDKSGRVTTVAGSG--LAGWRDG 196

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
               A F+   ++  V    +L ++D  N ++R+I+
Sbjct: 197 TGSEARFNEPRDVA-VAEDGSLYVADALNHVLRRID 231



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 49/108 (45%), Gaps = 6/108 (5%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
            AG GV G  DG    A F  P        G + +AD  NH+IR+    G V+T+AG  +
Sbjct: 71  WAGSGVSGSVDGRAERAEFRYPSGLLAGKNGELLIADTYNHLIRRADAAGQVSTLAGQVA 130

Query: 144 KKEGR----ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           K   +     DG    A F+    +        L I+D GN +IR+++
Sbjct: 131 KMRQQYGSWTDGKGTEARFNQPMGMA-EDRQGNLYIADAGNHVIRKLD 177



 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 57/118 (48%), Gaps = 6/118 (5%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY D E   A F+ P   A+  +G I VAD+ NH IR + N  V T+   G    G  +G
Sbjct: 330 GYRDSEASEALFNGPAGIAITAEGGIIVADRWNHAIRYLFNGKVYTLG--GGGGTGHQNG 387

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGN---QLIRQINLKPEDCSKSSQSGSALGAV 206
            A+ A+F     +  + +   + ++D  N   +LIR+  L  E  S ++ S  A  AV
Sbjct: 388 WAEQATFREPVNVAVLSN-GTIAVADGFNNSIRLIRRYTLPEEARSSTAASPYARDAV 444


>gi|301122083|ref|XP_002908768.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099530|gb|EEY57582.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1981

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 44/108 (40%), Positives = 61/108 (56%), Gaps = 8/108 (7%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
           +AGDG  G+ DG   +++F+ P+  AVD  G +YVAD +NH IR I  T   V+T+AG G
Sbjct: 208 VAGDGSAGFQDGLAATSQFNHPRGVAVDSNGVVYVADTANHRIRIIDPTTKRVSTLAGDG 267

Query: 143 SKKEGRADGPAQNA---SFSNDFELTFV-PHICALLISDHGNQLIRQI 186
              EG  DGPA  A   S+ +D  +         + ++D GN  IRQI
Sbjct: 268 V--EGFVDGPALTAARFSYPSDVAVRESNSGTITVFVADTGNHRIRQI 313



 Score = 57.4 bits (137), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 28/62 (45%), Positives = 34/62 (54%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G +DG+P  ARFD P   AVD  G ++VAD  NH+IR+I   G T    GG      AD 
Sbjct: 344 GMADGDPLGARFDSPMGVAVDADGVVFVADTGNHLIRRIDLDGTTHTVAGGVVPSEDADT 403

Query: 152 PA 153
           P 
Sbjct: 404 PG 405



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 45/126 (35%), Positives = 62/126 (49%), Gaps = 20/126 (15%)

Query: 82  VKRLAGDGVQGYSDGEP-GSARFDKPKSFAVDMKGN----IYVADKSNHVIRKITNLGVT 136
           V  LAGDGV+G+ DG    +ARF  P   AV    +    ++VAD  NH IR+I N  VT
Sbjct: 260 VSTLAGDGVEGFVDGPALTAARFSYPSDVAVRESNSGTITVFVADTGNHRIRQIRNGVVT 319

Query: 137 TIAG-GGSKKE-------------GRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
            ++G  G+  E             G ADG    A F +   +  V     + ++D GN L
Sbjct: 320 CLSGLCGAGVETVRLSQQPEKPHAGMADGDPLGARFDSPMGVA-VDADGVVFVADTGNHL 378

Query: 183 IRQINL 188
           IR+I+L
Sbjct: 379 IRRIDL 384



 Score = 39.7 bits (91), Expect = 1.4,   Method: Composition-based stats.
 Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 14/90 (15%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-------NLGVTTIAG- 140
           G +G+ DG    ARF+ P++ A+  +  + V+++  H IR++T        +G  T A  
Sbjct: 413 GERGFRDGNLTFARFESPRAIAIGAQRTLIVSER--HRIRRVTYDGGSASTIGTVTAANR 470

Query: 141 ----GGSKKEGRADGPAQNASFSNDFELTF 166
                GS   G  DG    A+F+    +TF
Sbjct: 471 VVTLAGSNVPGHIDGEGNEATFNAPAGVTF 500


>gi|451980854|ref|ZP_21929239.1| conserved hypothetical protein, contains NHL repeats [Nitrospina
           gracilis 3/211]
 gi|451761976|emb|CCQ90480.1| conserved hypothetical protein, contains NHL repeats [Nitrospina
           gracilis 3/211]
          Length = 729

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 44/115 (38%), Positives = 63/115 (54%), Gaps = 4/115 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
           + +  ++  + G GV GYS DG P + A    P     D +GN+YVAD+ NHV+RK+ TN
Sbjct: 68  IDQNGMLTNVVGTGVSGYSGDGGPATEATLKVPAGLTFDKEGNLYVADRENHVVRKVDTN 127

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             +TT AG G        GPA  A  +   ++T V H   L ISD  N +IR+++
Sbjct: 128 GIITTFAGTGKAGYSGDKGPATQAKLNLPSDMT-VDHKGNLFISDRSNNVIRKVD 181



 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 64/114 (56%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ E  +  +AG+G   Y+ DG P   A F  P+S  +D +GN+Y+ D +N+VIRKI   
Sbjct: 528 VTPEGKIITIAGNGSSDYTGDGGPAKDASFRDPQSLTMDKEGNLYIGDTANNVIRKIDKN 587

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G VTT AG G+ +     GPA  A F +  ++ F P    L + + GN  +R+I
Sbjct: 588 GIVTTYAGNGNHEHSGDGGPALKAGFMSIGDIVFSPS-GELHVVEPGNHTVRKI 640



 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 60/115 (52%), Gaps = 4/115 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +    ++   AG G  GYS   G    A+ + P    VD KGN++++D+SN+VIRK+   
Sbjct: 124 VDTNGIITTFAGTGKAGYSGDKGPATQAKLNLPSDMTVDHKGNLFISDRSNNVIRKVDPS 183

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G +TT AG G++     + PA   +    F L    H   L I+D GN  IR+++
Sbjct: 184 GTITTYAGTGNEGYNGDNMPALRTNLDKPFGLAVDKH-GNLYIADRGNNRIRKVD 237



 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/80 (45%), Positives = 47/80 (58%), Gaps = 2/80 (2%)

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
            A D KGN+YV+D+ NH IRK+T  G + TIAG GS       GPA++ASF +   LT  
Sbjct: 507 LAFDKKGNLYVSDRINHQIRKVTPEGKIITIAGNGSSDYTGDGGPAKDASFRDPQSLTMD 566

Query: 168 PHICALLISDHGNQLIRQIN 187
                L I D  N +IR+I+
Sbjct: 567 KE-GNLYIGDTANNVIRKID 585



 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 61/122 (50%), Gaps = 7/122 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDM 113
           + DL+V+D S    Y +     + S V+ +AG+GV+ ++  +G    A  + P    VD 
Sbjct: 333 NGDLLVVDRSH---YRIRKMHQKGSRVETIAGNGVKNFAGDNGPATGANLEFPHGIVVDK 389

Query: 114 KGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNAS-FSNDFELTFVPHIC 171
           + N+  ADK ++ IR+I   G+ TT+ G G +     +GPA  AS F    EL     I 
Sbjct: 390 QDNVIFADKGHYRIRQIDPEGIITTVVGNGIRGNIGDNGPALEASVFPTILELNHKDEIF 449

Query: 172 AL 173
            L
Sbjct: 450 FL 451



 Score = 43.9 bits (102), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 50/89 (56%), Gaps = 5/89 (5%)

Query: 76  LSEESVVKRLAGD-GVQG-YSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-T 131
           ++ +  V+ +AG  GVQG + DG   + A   +P   A D KGN+Y+ D  N+ IRK+ T
Sbjct: 640 ITRDGKVELVAGRPGVQGLFGDGGKATEAMLKQPACIAFDSKGNMYITDMGNNRIRKVDT 699

Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSN 160
           N  +TT+AG GS   G  +G   N  F N
Sbjct: 700 NGIITTLAGRGSFGWGY-EGEEVNIYFQN 727



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 51/88 (57%), Gaps = 8/88 (9%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSN---D 161
           P   AVD +GN+YVAD++N+ IRKI  LG + T+ G G +     +G A+ A  +N    
Sbjct: 269 PTDVAVDDEGNVYVADRNNNRIRKINTLGMIRTVMGTGQQD---YNGDAEVARETNLHLP 325

Query: 162 FELTFVPHICALLISDHGNQLIRQINLK 189
           F +T + +   LL+ D  +  IR+++ K
Sbjct: 326 FAIT-LDNNGDLLVVDRSHYRIRKMHQK 352



 Score = 40.0 bits (92), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 59/122 (48%), Gaps = 21/122 (17%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLG--V 135
           +++ + G G Q Y +G+   AR      P +  +D  G++ V D+S++ IRK+   G  V
Sbjct: 298 MIRTVMGTGQQDY-NGDAEVARETNLHLPFAITLDNNGDLLVVDRSHYRIRKMHQKGSRV 356

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLK 189
            TIAG G K     +GPA  A+          PH         ++ +D G+  IRQI+  
Sbjct: 357 ETIAGNGVKNFAGDNGPATGANLE-------FPHGIVVDKQDNVIFADKGHYRIRQID-- 407

Query: 190 PE 191
           PE
Sbjct: 408 PE 409


>gi|290974765|ref|XP_002670115.1| predicted protein [Naegleria gruberi]
 gi|284083670|gb|EFC37371.1| predicted protein [Naegleria gruberi]
          Length = 784

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 77/148 (52%), Gaps = 7/148 (4%)

Query: 43  LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGS 100
           LE  P  V   P S+DL++ D        L+     +++V  + GD   GYS  +G    
Sbjct: 95  LEFQPGGVALYPNSNDLLISDPVGGVIVRLN-SKGIQTIVAGMKGD--LGYSGDNGLATR 151

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPAQNASFS 159
           AR + P   AV   G +Y+ADKSNHVIRKI+ N  ++TIAG G +      G A+ A  +
Sbjct: 152 ARLNTPTGVAVASNGEVYIADKSNHVIRKISLNGNISTIAGNGEEGFSGDGGNAKTAQLN 211

Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
           +   L+ +     L I+D  N +IR+I+
Sbjct: 212 SPIGLS-ISSTGELYIADSKNHVIRKID 238



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 58/109 (53%), Gaps = 4/109 (3%)

Query: 82  VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  +AG+G +G+S   G   +A+ + P   ++   G +Y+AD  NHVIRKI   G ++T 
Sbjct: 187 ISTIAGNGEEGFSGDGGNAKTAQLNSPIGLSISSTGELYIADSKNHVIRKIDANGIISTF 246

Query: 139 AGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG G+      DG  A+ A  +  + + F      + I+D  N LIR++
Sbjct: 247 AGNGTINGYGGDGSQAKQALLNTPYGVFFYESTGEVYIADTLNSLIRKV 295



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 44/166 (26%), Positives = 70/166 (42%), Gaps = 46/166 (27%)

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYV--- 119
           + SA++ +   +S   ++ + AG G  GYS DG     A+F++P   A    G+I++   
Sbjct: 405 AESAYHKIR-KISPNGIISQYAGTGDHGYSGDGSDAKLAKFNEPTRIACSKNGDIFIADL 463

Query: 120 --------------------------------------ADKSNHVIRKITNLGVTTIAGG 141
                                                 AD+S+HVIRKI+  GV +I  G
Sbjct: 464 FNGAIRRIFASNGTVDTVVSGLGSPQAVILTESGELLVADRSSHVIRKISTNGVMSIIAG 523

Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             +  G   DG A    FS   ++   P+   L I+D+ N LIR++
Sbjct: 524 VLEDGGFNGDGLATKTKFSGPQDIALAPN-GDLYIADYDNYLIRKL 568



 Score = 44.7 bits (104), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 58/122 (47%), Gaps = 7/122 (5%)

Query: 33  TVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
           TV TV+ G     +P +VI    S +L+V D S      +S       +   L   G  G
Sbjct: 477 TVDTVVSGLG---SPQAVILTE-SGELLVADRSSHVIRKISTNGVMSIIAGVLEDGGFNG 532

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADG 151
             DG     +F  P+  A+   G++Y+AD  N++IRK++  G +TT+AG G    G   G
Sbjct: 533 --DGLATKTKFSGPQDIALAPNGDLYIADYDNYLIRKLSKNGNITTVAGNGDSTVGEDFG 590

Query: 152 PA 153
            A
Sbjct: 591 IA 592



 Score = 40.4 bits (93), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 59/122 (48%), Gaps = 9/122 (7%)

Query: 76  LSEESVVKRLAGDG-VQGYS-DG-EPGSARFDKPKS-FAVDMKGNIYVADKSNHVIRKIT 131
           +    ++   AG+G + GY  DG +   A  + P   F  +  G +Y+AD  N +IRK++
Sbjct: 237 IDANGIISTFAGNGTINGYGGDGSQAKQALLNTPYGVFFYESTGEVYIADTLNSLIRKVS 296

Query: 132 NLGVTTIAGGGSKKEG--RADG--PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             G+ +   G     G  R D   PA +A  S    +  +  +  + I+D+GN  IR+++
Sbjct: 297 KSGIISTVAGVPNSSGYSREDENVPATSALLSAPTSVA-LSSLGEMFIADNGNLYIRKVD 355

Query: 188 LK 189
            K
Sbjct: 356 TK 357


>gi|323454387|gb|EGB10257.1| hypothetical protein AURANDRAFT_62887 [Aureococcus anophagefferens]
          Length = 434

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/138 (36%), Positives = 71/138 (51%), Gaps = 17/138 (12%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRL----------AGDGVQGYSDGEPGSARFDKPKSF 109
           + LD++R+  Y      S    V+RL          AGDG +GY+DG+ G ARFD+P + 
Sbjct: 247 VCLDAARNCLYVAD---SGNHCVRRLDLASGAVTTVAGDGTRGYADGDAGGARFDEPTAV 303

Query: 110 AVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
           A+D  G +YVAD+ N  +R +    V+T+AGGG+       G A  ASFS    L   P 
Sbjct: 304 ALDADGALYVADQENRRVRCLRGRTVSTLAGGGAGDSADGVGAA--ASFSAPNSLHLCPE 361

Query: 170 ICALLISDHGNQLIRQIN 187
           I  L + D   Q +R + 
Sbjct: 362 IRRLYVGD--KQCVRVVT 377



 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 65/134 (48%), Gaps = 4/134 (2%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV-DMK 114
           SS L+V DS R     +    +   V++ LAG G+ G  DG    A F  P   A     
Sbjct: 141 SSRLVVTDSRRCRVVEVCG--ASGRVLRVLAGCGMSGGRDGRADRATFAGPWGVAAHPAT 198

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           G++YV++     IR I    VTT+AG G +  GRADG    ASF+    +        L 
Sbjct: 199 GDVYVSEYFGCKIRVIRRGDVTTLAGSGPRG-GRADGVGPAASFAGPRGVCLDAARNCLY 257

Query: 175 ISDHGNQLIRQINL 188
           ++D GN  +R+++L
Sbjct: 258 VADSGNHCVRRLDL 271


>gi|455647406|gb|EMF26379.1| hypothetical protein H114_24527 [Streptomyces gancidicus BKS 13-15]
          Length = 627

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/123 (36%), Positives = 70/123 (56%), Gaps = 7/123 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+  P S   +V D++R     L+     E+V++R  G G +G  DG PG ARF +P
Sbjct: 186 PGKVLRLPSSGTFLVTDTTRHQVVELAE--DGETVLRRF-GSGGRGLEDGPPGRARFQEP 242

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGRA-DGPAQNASFSNDF 162
           +  A+   G + VAD  NH +R++   +  VTT+AG GS  + G A DGPA+  + S+ +
Sbjct: 243 QGLALLPDGAVVVADTVNHALRRLDPESGEVTTLAGTGSPWRPGEATDGPARAVNLSSPW 302

Query: 163 ELT 165
           ++T
Sbjct: 303 DVT 305



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 3/107 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V R AG G +   DG    A F +P   AV  +G ++VAD  +  +R++   G V T  G
Sbjct: 331 VAREAGTGGEQLVDGSAKEALFAQPSGLAVTAEG-LWVADSESSALRRLGPDGTVRTAVG 389

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            G    G  DGPA  A F +   +T +P   +L +SD  N  +R+ +
Sbjct: 390 AGLFAFGYQDGPADGALFQHPLGVTALPD-GSLAVSDTYNHALRRYD 435


>gi|325102830|ref|YP_004272484.1| cell surface receptor IPT/TIG domain protein [Pedobacter saltans
           DSM 12145]
 gi|324971678|gb|ADY50662.1| cell surface receptor IPT/TIG domain protein [Pedobacter saltans
           DSM 12145]
          Length = 491

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 57/173 (32%), Positives = 82/173 (47%), Gaps = 13/173 (7%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P+ V   P + DLIV D +  A   +S    +  +V ++AG G  G  DG+   A F+ P
Sbjct: 170 PNGVSFDPTTGDLIVADRTAQAIKRIS----KAGIVTKIAGTGTAGRVDGDISIASFNNP 225

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
              AVD  GNIYVAD  NH IRKI   TN   T   G G+   G  DG   N   +    
Sbjct: 226 YKTAVDKYGNIYVADNGNHRIRKIDLSTNTVSTIAGGAGAATSGYTDGIGANGLLNTPTG 285

Query: 164 LTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVS--VWVLVSV 214
           L  V     + ++D  N  +R+I     D   S+ +G+ +  ++  +W  V+V
Sbjct: 286 LA-VDDDLNVYVADAANHAVRKIT---PDGRVSTLAGNGIAGIADGIWPNVTV 334



 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 30/43 (69%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
            GY +G PG +RF  P   AVD +GN+++AD +N+VIRKI  +
Sbjct: 443 NGYVNGLPGISRFYNPTDVAVDEEGNVFIADMNNYVIRKIVKI 485



 Score = 40.0 bits (92), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 44/90 (48%), Gaps = 5/90 (5%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           ++ +  V  LAG+G+ G +DG   +   ++P +  +   G +Y AD+    IRKI    G
Sbjct: 307 ITPDGRVSTLAGNGIAGIADGIWPNVTVNRPTAVCMGKDGFLYSADRYGQRIRKIDVRTG 366

Query: 135 VT-TIAGGGSKKEGRA---DGPAQNASFSN 160
            T TIAG G    G     DG    A F+N
Sbjct: 367 KTVTIAGSGGNAAGTGGHVDGEVLKARFNN 396



 Score = 40.0 bits (92), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 52/112 (46%), Gaps = 7/112 (6%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD-----KSNHVIRKITNLGVTTIAGGGS 143
           G  G+ DGE   ARF+      +D  G IYV++       +H IR I N  V+TI G  +
Sbjct: 380 GTGGHVDGEVLKARFNNIWGMDIDKDGTIYVSELEGTAGKSHTIRMIKNGQVSTIGGPDA 439

Query: 144 -KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
               G  +G    + F N  ++  V     + I+D  N +IR+I    +D S
Sbjct: 440 FDNNGYVNGLPGISRFYNPTDVA-VDEEGNVFIADMNNYVIRKIVKIKKDTS 490


>gi|428179990|gb|EKX48859.1| hypothetical protein GUITHDRAFT_105483 [Guillardia theta CCMP2712]
          Length = 388

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 54/164 (32%), Positives = 74/164 (45%), Gaps = 17/164 (10%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR---------LAGDGVQGYSD 95
           + P+  I     + + VLD        L F +S  + VK          LAG G  GY D
Sbjct: 17  VGPNIYISDEKGNKIYVLDRFTGTVMIL-FTMSSNTTVKGNDVVSQVTLLAGTGEAGYKD 75

Query: 96  GEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITNLG------VTTIAGGGSKKEGR 148
           G   SA+F  P   AV  ++  + V+D+ NH IR IT         V T+AG      G 
Sbjct: 76  GIGSSAKFKSPSGLAVQSLRTGLCVSDRGNHAIRIITRESPDKPAVVDTVAGSDDSGAGF 135

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
           ADG   NA F++   L   P    +L++D GN  IR++ L  ED
Sbjct: 136 ADGEGPNALFNHPEGLAMSPDGSFILVADSGNHRIRKVLLSKED 179



 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 68/124 (54%), Gaps = 16/124 (12%)

Query: 80  SVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI------ 130
           +VV  +AG  D   G++DGE  +A F+ P+  A+   G+ I VAD  NH IRK+      
Sbjct: 120 AVVDTVAGSDDSGAGFADGEGPNALFNHPEGLAMSPDGSFILVADSGNHRIRKVLLSKED 179

Query: 131 TNLGVTTIAGGGSKK-----EGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIR 184
            NL V+TIAGG   +     +G  D     N+SF+   ++ F+P+   +L+ D GN  IR
Sbjct: 180 KNL-VSTIAGGNQNRNVTYSQGFNDADNGVNSSFNRPTDVAFLPNGEEVLVVDAGNHAIR 238

Query: 185 QINL 188
           ++ L
Sbjct: 239 KLTL 242



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 45/167 (26%), Positives = 74/167 (44%), Gaps = 13/167 (7%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-------VQGYSDGEPG 99
           P  +   P  S ++V DS       +     ++++V  +AG          QG++D + G
Sbjct: 148 PEGLAMSPDGSFILVADSGNHRIRKVLLSKEDKNLVSTIAGGNQNRNVTYSQGFNDADNG 207

Query: 100 -SARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRADGPAQN 155
            ++ F++P   A    G  + V D  NH IRK+T     VTTIAG G+   G  +G   N
Sbjct: 208 VNSSFNRPTDVAFLPNGEEVLVVDAGNHAIRKLTLSTGAVTTIAGNGNS--GSDNGFGTN 265

Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           A+F     +          I++   Q IRQ++L     +  + +G A
Sbjct: 266 ATFEFPMSIVISAGGKYAFIAEMEGQRIRQLHLSTGQVTTLAGTGDA 312


>gi|116749588|ref|YP_846275.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
 gi|116698652|gb|ABK17840.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
          Length = 1750

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 59/185 (31%), Positives = 85/185 (45%), Gaps = 17/185 (9%)

Query: 12  ITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPG------SSDLIVLDSS 65
           I L  LLF   SS   + D        DG  LE N    +D P       S ++ + D++
Sbjct: 2   IFLAILLFPGPSSAERVVDTVAGLQSDDGPALEAN----LDSPSGVAVDTSGNVYLSDTN 57

Query: 66  RSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKS 123
                 L     +   ++ LAG    GYS   G    A  ++P+  AVD  GN+Y AD +
Sbjct: 58  NHRIRKLDVAAGQ---IQTLAGGQSPGYSGDGGTAAKAGLNRPRGIAVDAAGNVYFADSN 114

Query: 124 NHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
           NH IRKI   G +TT+AG GS       GPA +A  +  F +   P    + ++D GN  
Sbjct: 115 NHCIRKIDTSGIITTVAGTGSAGSNGDGGPAASARLAYPFGIAVDPS-GNIYVADLGNHK 173

Query: 183 IRQIN 187
           +R+I+
Sbjct: 174 VRRID 178



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 56/106 (52%), Gaps = 4/106 (3%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG GV GYS  DG    AR + P + AVD   +IY+AD +NH IRK+   G +TT+AG 
Sbjct: 465 VAGKGVAGYSGDDGPAAEARLNNPSAIAVDGSESIYIADTNNHRIRKVDGGGTITTVAGN 524

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G+      DG +  A+  N      V     + I+D  N  +R ++
Sbjct: 525 GTPGY-SGDGASATAASLNFPNGVAVDADGNVFIADTSNHRVRMVD 569



 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 42/116 (36%), Positives = 60/116 (51%), Gaps = 4/116 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +    ++  +AG G  G + DG P  SAR   P   AVD  GNIYVAD  NH +R+I   
Sbjct: 121 IDTSGIITTVAGTGSAGSNGDGGPAASARLAYPFGIAVDPSGNIYVADLGNHKVRRIDAA 180

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           G ++T+AG G        GPA  A  ++   +  V     L ISD G  +IR++++
Sbjct: 181 GNISTVAGTGLLSRLGDGGPATEAGLTSPTGVA-VDGSGNLFISDSGRHVIRRVDV 235



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 58/108 (53%), Gaps = 3/108 (2%)

Query: 81  VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
            +  +AG+G  GYS DG   +A   + P   AVD  GN+++AD SNH +R + +  +TT+
Sbjct: 517 TITTVAGNGTPGYSGDGASATAASLNFPNGVAVDADGNVFIADTSNHRVRMVDSGVITTV 576

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG G+       G A +AS      + +V    AL I+D  N  +R++
Sbjct: 577 AGNGTPGYSGDGGAAVSASLKAPHGV-WVDATGALYIADAHNYRVRKV 623



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 63/134 (47%), Gaps = 9/134 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG-YSDGEPGS-ARFDKPKSFAVDM 113
           ++DL + D+     Y        + V+  +AG G  G   DG P + AR   P + AVD 
Sbjct: 273 NNDLYIADT-----YNQRIRKVTDGVINTVAGTGYGGSLEDGIPATGARLKSPVALAVDS 327

Query: 114 KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
           + NIY+AD  +H IR++   G + T+AG G        G A  A   +   L   P   +
Sbjct: 328 QNNIYIADTYSHRIRRVDAAGNIVTVAGKGVPGNAGDGGQAVAAILKSPHGLALGPD-NS 386

Query: 173 LLISDHGNQLIRQI 186
           L I+D  +  +R++
Sbjct: 387 LYIADRTDHRVRKV 400



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 55/114 (48%), Gaps = 5/114 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           + +  V+  +AG+G  GYS G+ G   SA    P    VD  G +Y+AD  N+ +RK+  
Sbjct: 567 MVDSGVITTVAGNGTPGYS-GDGGAAVSASLKAPHGVWVDATGALYIADAHNYRVRKVAG 625

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             + T+AG G+       G A  A F +   L  V     L ++D  N  +R++
Sbjct: 626 GNIVTVAGTGTPGYSGDGGLAAAADFRSVHGL-VVDGSGNLFVADMENSRVRKV 678



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 95  DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD-GP 152
           DG P + A    P   AVD  GN++++D   HVIR++   G      G  ++    D GP
Sbjct: 197 DGGPATEAGLTSPTGVAVDGSGNLFISDSGRHVIRRVDVGGTIDRVAGDYEQRYHGDGGP 256

Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           A +A   N + +   P+   L I+D  NQ IR++
Sbjct: 257 ALSAGLMNAYGVALGPN-NDLYIADTYNQRIRKV 289



 Score = 42.4 bits (98), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 58/115 (50%), Gaps = 4/115 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   V+  LAG G +G S     +A    D P + AV   G++Y +D  ++ +RKI   
Sbjct: 400 VTAAGVISTLAGTGEEGLSADGAAAAFANLDGPCAVAVGPSGSVYFSDSGSNRVRKIGLD 459

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G ++T+AG G       DGPA  A  +N   +  V    ++ I+D  N  IR+++
Sbjct: 460 GNLSTVAGKGVAGYSGDDGPAAEARLNNPSAIA-VDGSESIYIADTNNHRIRKVD 513



 Score = 41.6 bits (96), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 42/138 (30%), Positives = 64/138 (46%), Gaps = 13/138 (9%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVD 112
           GS +L + DS R     +         + R+AGD  Q Y  DG P  SA        A+ 
Sbjct: 216 GSGNLFISDSGRHVIRRVDV----GGTIDRVAGDYEQRYHGDGGPALSAGLMNAYGVALG 271

Query: 113 MKGNIYVADKSNHVIRKITNLGVTTIAG---GGSKKEGRADGPAQNASFSNDFELTFVPH 169
              ++Y+AD  N  IRK+T+  + T+AG   GGS ++G    PA  A   +   L  V  
Sbjct: 272 PNNDLYIADTYNQRIRKVTDGVINTVAGTGYGGSLEDGI---PATGARLKSPVALA-VDS 327

Query: 170 ICALLISDHGNQLIRQIN 187
              + I+D  +  IR+++
Sbjct: 328 QNNIYIADTYSHRIRRVD 345



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 54/107 (50%), Gaps = 4/107 (3%)

Query: 85  LAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGG 141
           +AG GV G +   G+  +A    P   A+    ++Y+AD+++H +RK+T  GV +T+AG 
Sbjct: 353 VAGKGVPGNAGDGGQAVAAILKSPHGLALGPDNSLYIADRTDHRVRKVTAAGVISTLAGT 412

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           G +        A  A+      +   P   ++  SD G+  +R+I L
Sbjct: 413 GEEGLSADGAAAAFANLDGPCAVAVGPS-GSVYFSDSGSNRVRKIGL 458


>gi|338536093|ref|YP_004669427.1| putative lipoprotein [Myxococcus fulvus HW-1]
 gi|337262189|gb|AEI68349.1| putative lipoprotein [Myxococcus fulvus HW-1]
          Length = 855

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 41/112 (36%), Positives = 62/112 (55%), Gaps = 7/112 (6%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
           + VKR+  +G     + E  +   + P   AVD  GN+YVAD  ++VIR+I   G   + 
Sbjct: 598 NAVKRVTREG-----EVETVATGLNGPMGIAVDAAGNVYVADTDHYVIRRIDVEGKVEVF 652

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
            GG+   G  DGPA+ A+F+    +T  P   ALL++D  N +IR+I+L  E
Sbjct: 653 AGGTP--GLQDGPAKQAAFNQPTGMTVTPDGTALLVADMNNGVIRRIDLVAE 702



 Score = 57.0 bits (136), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 59/166 (35%), Positives = 79/166 (47%), Gaps = 17/166 (10%)

Query: 27  LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLA 86
           L+ +G+ VTT + G  L   P  V      S L V++S  S    +   L+  SVV   A
Sbjct: 699 LVAEGHPVTT-LQGDWL-YRPSGVAVSADGSTLYVVESGMSRVVRIRDGLT--SVV---A 751

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI------TNLGVTTIAG 140
           G    GY DG P SA+F      AV   G++ VAD  N+ +R++      T   VTT+AG
Sbjct: 752 GT-TPGYRDGAPTSAQFLPYLGIAVLKDGSLAVADPGNYRVRRVVLEADGTARKVTTLAG 810

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G  + G ADGP   A       LT  P    L ++D GN L+R I
Sbjct: 811 NG--RYGHADGPGDKAELVLPAGLTVGPD-GRLYVADAGNGLVRAI 853



 Score = 39.3 bits (90), Expect = 1.9,   Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 26/50 (52%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAG+G  G++DG    A    P    V   G +YVAD  N ++R IT
Sbjct: 805 VTTLAGNGRYGHADGPGDKAELVLPAGLTVGPDGRLYVADAGNGLVRAIT 854


>gi|333383538|ref|ZP_08475196.1| hypothetical protein HMPREF9455_03362 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332827477|gb|EGK00223.1| hypothetical protein HMPREF9455_03362 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 434

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 3/131 (2%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           L +  + R   YT +   +E  +       G  G+SDG    A F+ P+   +D++GN+Y
Sbjct: 307 LYIAYTQRHCIYTYNLATNEHLLFA--GAPGQPGWSDGRVSDAEFNLPRQMTLDLEGNLY 364

Query: 119 VADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDH 178
           +AD+ NH IR I   G+ T   G   +EG  DG  + A F     +  V     + I+D+
Sbjct: 365 IADEGNHCIRMIDKDGIVTTPIGQPGQEGYMDGSPEIALFKKPRGVA-VDKNGDVYIADY 423

Query: 179 GNQLIRQINLK 189
           GN+ +R++ L+
Sbjct: 424 GNRCLRKLTLQ 434


>gi|312198348|ref|YP_004018409.1| serine/threonine protein kinase [Frankia sp. EuI1c]
 gi|311229684|gb|ADP82539.1| serine/threonine protein kinase [Frankia sp. EuI1c]
          Length = 782

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 55/102 (53%), Gaps = 3/102 (2%)

Query: 88  DGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKK 145
           DG+    DG P + A    P   AVD +GN+Y+AD  N  +R++   G +TT+AG G + 
Sbjct: 505 DGLTYAGDGVPATKAMLRSPNDLAVDGRGNLYIADGENERVRRVDTHGIITTVAGSGVEA 564

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            G  DGPA +A F     LT  P   +L + D GN  IR+I 
Sbjct: 565 GGDGDGPATDAFFKYTEGLTIGPD-GSLYVGDQGNSRIRRIT 605



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 8/86 (9%)

Query: 60  IVLDSSRSAFYTLSF------PLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAV 111
           I LD  R  FY   +       +    ++  +AG+G+ G++   G+  SA+ D P   AV
Sbjct: 400 ITLDLKRGYFYIADYYNERVRRVDHNGIITTVAGNGIAGFAGDGGQATSAQLDSPTDVAV 459

Query: 112 DMKGNIYVADKSNHVIRKITNLGVTT 137
              G+IY+AD +N+ +R +   G+ T
Sbjct: 460 AADGSIYIADSANNRVRHVDTHGIIT 485



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 41/81 (50%), Gaps = 3/81 (3%)

Query: 81  VVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           ++  +AG GV+  G  DG    A F   +   +   G++YV D+ N  IR+IT  G + T
Sbjct: 553 IITTVAGSGVEAGGDGDGPATDAFFKYTEGLTIGPDGSLYVGDQGNSRIRRITPQGRIIT 612

Query: 138 IAGGGSKKEGRADGPAQNASF 158
           IAG G++      GPA  A  
Sbjct: 613 IAGTGTRGYSGDGGPATKAQI 633



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 48/95 (50%), Gaps = 6/95 (6%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKP-KSFAVDMKGNIYVADKSNHVIRKITN 132
           ++ +  +  +AG G +GYS DG P + A+ D    + AVD  GN+Y +D+S   +R++  
Sbjct: 604 ITPQGRIITIAGTGTRGYSGDGGPATKAQIDSTGTAIAVDSAGNVYFSDQSR--VRRVDT 661

Query: 133 LGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
            GV TT+ G G        GPA  A     + L  
Sbjct: 662 HGVITTVVGTGESGLAGDGGPAVKAKIGLAYGLVL 696


>gi|325916326|ref|ZP_08178604.1| gluconolactonase [Xanthomonas vesicatoria ATCC 35937]
 gi|325537497|gb|EGD09215.1| gluconolactonase [Xanthomonas vesicatoria ATCC 35937]
          Length = 699

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 57/112 (50%), Gaps = 4/112 (3%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
           + + VV  LAGDG  G+ DG    ARF+ P   AVD +G +YVAD  N  IR I   G V
Sbjct: 153 TPQGVVTTLAGDGTAGFGDGAASQARFNGPMGVAVDAQGRVYVADTYNDRIRVIERDGQV 212

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            T+A  GS   G ADG  + A      +L    H   + ++D  N  IR+I 
Sbjct: 213 RTLA--GSALPGMADGVGEQARLDTPTDLKVDAH-GVVWVADMRNDAIRRIT 261



 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 48/123 (39%), Positives = 66/123 (53%), Gaps = 8/123 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V+ LAG G +G++DG   +A F+ P   A+D  GN+Y+AD  NH IRK T  G VTT+AG
Sbjct: 105 VQTLAG-GREGFADGIGAAAAFNTPSGIALDTAGNLYIADTGNHAIRKRTPQGVVTTLAG 163

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
            G+   G  DG A  A F+    +  V     + ++D  N  IR I     D    + +G
Sbjct: 164 DGTA--GFGDGAASQARFNGPMGVA-VDAQGRVYVADTYNDRIRVIE---RDGQVRTLAG 217

Query: 201 SAL 203
           SAL
Sbjct: 218 SAL 220



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 62/116 (53%), Gaps = 6/116 (5%)

Query: 72  LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS-NHVIR-K 129
           ++ PL   + ++ +AG+GV+G  D     A+F +P   A+D  G +Y+AD   N+ IR  
Sbjct: 40  VATPLGWAAQLQWVAGNGVRGAQDARAADAQFAEPYGLAIDTHGALYIADAGDNNRIRVM 99

Query: 130 ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           + N  V T+AGG   +EG ADG    A+F+    +        L I+D GN  IR+
Sbjct: 100 LPNGSVQTLAGG---REGFADGIGAAAAFNTPSGIAL-DTAGNLYIADTGNHAIRK 151



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 36/67 (53%), Gaps = 1/67 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +  +  V+ LAG  + G +DG    AR D P    VD  G ++VAD  N  IR+IT  G 
Sbjct: 206 IERDGQVRTLAGSALPGMADGVGEQARLDTPTDLKVDAHGVVWVADMRNDAIRRITPDGS 265

Query: 135 VTTIAGG 141
           V T+ GG
Sbjct: 266 VATLVGG 272


>gi|190572588|ref|YP_001970433.1| NHL repeat-containing protein [Stenotrophomonas maltophilia K279a]
 gi|190010510|emb|CAQ44119.1| putative NHL repeat protein [Stenotrophomonas maltophilia K279a]
          Length = 1267

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 56/152 (36%), Positives = 73/152 (48%), Gaps = 11/152 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           +DG  L+   N  S I      +L V D+   A   +      +  V  LAG G QG++D
Sbjct: 688 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGHAD 742

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQ 154
           G    ARFD P   AVD +G +YVAD  N  IR I   G V T+AGG   + G ADG   
Sbjct: 743 GPAVQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGMVRTLAGG--DRPGLADGVGA 800

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            A F     L F     ALL++D  N  +R++
Sbjct: 801 AARFDTPVALAFDAQ-GALLVADLFNNAVRRV 831



 Score = 66.6 bits (161), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 38/78 (48%), Positives = 49/78 (62%), Gaps = 5/78 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
           V+ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I T+  VTT+AG
Sbjct: 677 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGTDGQVTTLAG 735

Query: 141 GGSKKEGRADGPAQNASF 158
           G   ++G ADGPA  A F
Sbjct: 736 G---EQGHADGPAVQARF 750



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 41/114 (35%), Positives = 59/114 (51%), Gaps = 6/114 (5%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD--KSNHVIRKITN 132
           PL+  + ++ LAGDG  G  DG    ARF  P +      G++Y  D   +N + R++ +
Sbjct: 615 PLAWTAQIEPLAGDGHPGDRDGASAQARFADPYALLRGADGSVYFTDAGDNNRIRRRLPD 674

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             V T+AG G   EGR DGPA  ASF+    +        L ++D GN  IR+I
Sbjct: 675 GRVETVAGQG---EGRVDGPALQASFNTPSGIAADAQ-GNLYVADTGNHAIRRI 724


>gi|182414082|ref|YP_001819148.1| NHL repeat-containing protein [Opitutus terrae PB90-1]
 gi|177841296|gb|ACB75548.1| NHL repeat containing protein [Opitutus terrae PB90-1]
          Length = 963

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 45/118 (38%), Positives = 61/118 (51%), Gaps = 8/118 (6%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--- 131
           ++   VV  LAGD  + G +DG  G ARF  P   AVD  GNI+VAD  N+ IR++T   
Sbjct: 253 ITPTGVVLTLAGDPDLVGSADGTGGDARFSSPAGLAVDRDGNIFVADSLNNTIRRVTPLN 312

Query: 132 ---NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               LGV T   G +   G ADG    A F+  + +  V     + ++D GN  IR+I
Sbjct: 313 GPAPLGVVTTVAGQAGVTGSADGVGSQARFNLPYGIA-VDAAGNIFVADLGNTTIRKI 369



 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 2/111 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           ++    V  LAG+   G +DG    ARF+ P   AVD+ GN YVAD  N  IRKIT  GV
Sbjct: 369 IAPSGAVTTLAGEASVGTADGPGPMARFNYPNGVAVDLAGNTYVADTFNATIRKITPAGV 428

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            +   G + + G ADG    A F  +F L         + +   +  +R+I
Sbjct: 429 VSTLAGAAGQIGSADGTGSAARF--EFPLGIAVDRAGNVYTTANSATVRKI 477



 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 46/109 (42%), Positives = 56/109 (51%), Gaps = 15/109 (13%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
           LAG    G  DG+  +A F  P S AVD  GN++VAD +NH IRKIT  G V+T AG G 
Sbjct: 44  LAGRPFFGGLDGQGRAAGFTTPSSAAVDQAGNLFVADTTNHTIRKITPSGTVSTFAGMGG 103

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
           +  G  DG    A F         PH  A      L ++D GN  IR+I
Sbjct: 104 QP-GSVDGTGNAARF-------LSPHGVALDEAGNLYVADSGNNTIRKI 144



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 45/112 (40%), Positives = 55/112 (49%), Gaps = 3/112 (2%)

Query: 76  LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV  LAG   Q G +DG   +ARF+ P   AVD  GN+Y    S   +RKIT  G
Sbjct: 423 ITPAGVVSTLAGAAGQIGSADGTGSAARFEFPLGIAVDRAGNVYTTANS-ATVRKITPAG 481

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           V T   G S   G ADGP   A F+    L  V     L ++D  N  IRQI
Sbjct: 482 VVTTIAGVSGNFGSADGPGLAARFAFPNGLA-VATDGTLYVADEENSTIRQI 532



 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 63/131 (48%), Gaps = 2/131 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV  +AG  G  G +DG   +ARF  P   AV   G +YVAD+ N  IR+IT  G
Sbjct: 477 ITPAGVVTTIAGVSGNFGSADGPGLAARFAFPNGLAVATDGTLYVADEENSTIRQITPDG 536

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           + +   G   + G  DG    A F     LT +     L +SD G+  +R+I    E  +
Sbjct: 537 MVSTLAGSPAQRGGIDGTGTAARFVQPAGLT-IDAAGNLYVSDRGDFTVRKITPAGEVTT 595

Query: 195 KSSQSGSALGA 205
            + Q G A GA
Sbjct: 596 VAGQHGIAGGA 606



 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 54/143 (37%), Positives = 73/143 (51%), Gaps = 15/143 (10%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEES---VVKRLAGD-GVQGYSDGEPGSARFDKP 106
           +DR G  ++ V DS  +    ++ PL+  +   VV  +AG  GV G +DG    ARF+ P
Sbjct: 289 VDRDG--NIFVADSLNNTIRRVT-PLNGPAPLGVVTTVAGQAGVTGSADGVGSQARFNLP 345

Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS--NDFE 163
              AVD  GNI+VAD  N  IRKI   G VTT+A  G    G ADGP   A F+  N   
Sbjct: 346 YGIAVDAAGNIFVADLGNTTIRKIAPSGAVTTLA--GEASVGTADGPGPMARFNYPNGVA 403

Query: 164 LTFVPHICALLISDHGNQLIRQI 186
           +    +     ++D  N  IR+I
Sbjct: 404 VDLAGNT---YVADTFNATIRKI 423



 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 56/112 (50%), Gaps = 3/112 (2%)

Query: 76  LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++ + +V  LAG   Q G  DG   +ARF +P    +D  GN+YV+D+ +  +RKIT  G
Sbjct: 532 ITPDGMVSTLAGSPAQRGGIDGTGTAARFVQPAGLTIDAAGNLYVSDRGDFTVRKITPAG 591

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             T   G     G ADG    A F+    +  +     L ++D  N+ IRQI
Sbjct: 592 EVTTVAGQHGIAGGADGTGSAAQFAYAGGIA-IDRRGTLYVADSNNR-IRQI 641



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 57/112 (50%), Gaps = 2/112 (1%)

Query: 76  LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++    V   AG G Q G  DG   +ARF  P   A+D  GN+YVAD  N+ IRKIT  G
Sbjct: 89  ITPSGTVSTFAGMGGQPGSVDGTGNAARFLSPHGVALDEAGNLYVADSGNNTIRKITPTG 148

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           V +   G +   G ADG    A F++   +T  P    L ++D  N +IR I
Sbjct: 149 VVSTLAGQAGAAGSADGDGSAARFNHPTGVTAYPD-GTLFVADTQNHVIRTI 199



 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 54/106 (50%), Gaps = 3/106 (2%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           V   AG  G++G ++G   +A F  P++ AV  +GN+YV ++ +  IR IT  GV     
Sbjct: 205 VSTFAGKTGIRGNTNGTVDTALFALPRNIAV-FRGNLYVTEQESAAIRWITPTGVVLTLA 263

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G     G ADG   +A FS+   L  V     + ++D  N  IR++
Sbjct: 264 GDPDLVGSADGTGGDARFSSPAGLA-VDRDGNIFVADSLNNTIRRV 308



 Score = 37.7 bits (86), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 27/58 (46%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           RF+ P        G ++VAD  NHVIR IT  G  +   G +   G  +G    A F+
Sbjct: 171 RFNHPTGVTAYPDGTLFVADTQNHVIRTITPAGRVSTFAGKTGIRGNTNGTVDTALFA 228


>gi|444912221|ref|ZP_21232386.1| putative hemolysin [Cystobacter fuscus DSM 2262]
 gi|444717129|gb|ELW57964.1| putative hemolysin [Cystobacter fuscus DSM 2262]
          Length = 844

 Score = 67.8 bits (164), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 42/107 (39%), Positives = 58/107 (54%), Gaps = 5/107 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
           VKR++ DG    S       R + P   A D +GN+YV+D  N+ IR+IT  G TT+  G
Sbjct: 591 VKRVSPDG----SVSTLSHVRLNGPLGIAADAQGNVYVSDSDNYCIRRITPDGTTTVFAG 646

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
              + G  DG A  A F+    L FV     LL++D GN +IR+I+L
Sbjct: 647 AEMQPGGMDGSALQARFNQPAGL-FVTPAQELLVADLGNGVIRRIDL 692



 Score = 40.8 bits (94), Expect = 0.66,   Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 27/56 (48%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S    V  LAG G  G  DG    A F  P   AV   G +YVAD  N ++R IT
Sbjct: 788 ISPAGEVTTLAGSGRFGARDGRGADADFVVPGGLAVGTDGTLYVADSGNALLRAIT 843


>gi|29348037|ref|NP_811540.1| cell surface protein [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339939|gb|AAO77734.1| putative cell surface protein, have conserved domain [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 434

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 68/132 (51%), Gaps = 5/132 (3%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           L +  +++   YT +    E    K  AG  G  G++DG    A F+ P+  ++DM+GNI
Sbjct: 307 LYIAYAAKHVIYTYNIETGEH---KLFAGAFGQSGWNDGIATDAEFNSPRQMSLDMEGNI 363

Query: 118 YVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISD 177
           Y+AD  NH IR I   G+ T   G   + G ADG + + +  ND     V     + I+D
Sbjct: 364 YIADSGNHCIRMIDKNGIVTTPIGQPGEAGYADG-SPDMALLNDPRGVAVNSEGDVYIAD 422

Query: 178 HGNQLIRQINLK 189
            GN+ IR++ L+
Sbjct: 423 LGNRCIRKLTLQ 434


>gi|430809992|ref|ZP_19437107.1| NHL repeat containing protein [Cupriavidus sp. HMR-1]
 gi|429497580|gb|EKZ96110.1| NHL repeat containing protein [Cupriavidus sp. HMR-1]
          Length = 536

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 43/106 (40%), Positives = 59/106 (55%), Gaps = 8/106 (7%)

Query: 60  IVLDSSRSAFYTLSFP-----LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           IV+D + + F   ++      ++   VV  LAG G  G++DG   +A F++P   AVD  
Sbjct: 350 IVVDVAGNVFVADTYNHLVRMVTPTGVVTTLAGTGTAGFADGPAQAAAFNRPAGIAVDAA 409

Query: 115 GNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           GNIYV+D  N  IRKI+    VTT+A  GS   G ADG    ASF+
Sbjct: 410 GNIYVSDLGNARIRKISPANVVTTLA--GSTTRGSADGTGAAASFT 453



 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 53/88 (60%), Gaps = 4/88 (4%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           +A F  P    VD+ GN++VAD  NH++R +T  G VTT+AG G+   G ADGPAQ A+F
Sbjct: 341 AASFYYPSDIVVDVAGNVFVADTYNHLVRMVTPTGVVTTLAGTGT--AGFADGPAQAAAF 398

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           +    +  V     + +SD GN  IR+I
Sbjct: 399 NRPAGIA-VDAAGNIYVSDLGNARIRKI 425



 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/60 (51%), Positives = 38/60 (63%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           +S    V  LAG GV G +DG   +A F+ P   AVD  GN+YVAD  NH+IRKIT+ GV
Sbjct: 263 ISPYGAVTTLAGSGVAGSTDGTGTAASFNLPSGVAVDGAGNVYVADTYNHLIRKITSAGV 322



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 44/82 (53%), Gaps = 4/82 (4%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
           P   AVD  GN+YVAD  NH IRKI+  G VTT+AG G    G  DG    ASF N    
Sbjct: 239 PIDVAVDGTGNVYVADYGNHRIRKISPYGAVTTLAGSG--VAGSTDGTGTAASF-NLPSG 295

Query: 165 TFVPHICALLISDHGNQLIRQI 186
             V     + ++D  N LIR+I
Sbjct: 296 VAVDGAGNVYVADTYNHLIRKI 317



 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 33/62 (53%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           +S  +VV  LAG   +G +DG   +A F      A D  GN+Y  D  ++ +RK+T  GV
Sbjct: 425 ISPANVVTTLAGSTTRGSADGTGAAASFTSLLRIATDAAGNVYAVDAGSNAVRKVTPAGV 484

Query: 136 TT 137
            T
Sbjct: 485 VT 486


>gi|325104746|ref|YP_004274400.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
 gi|324973594|gb|ADY52578.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
          Length = 1769

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 36/84 (42%), Positives = 50/84 (59%), Gaps = 3/84 (3%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
            ++    V   AG+G  GY+DG   SA F+ P++  +D   NIYV++ SNH IRKIT  G 
Sbjct: 1079 ITPNGTVSTFAGNGTAGYADGPGTSAMFNSPQAMVMDASDNIYVSESSNHRIRKITPAGE 1138

Query: 135  VTTIAGGGSKKEGRADGPAQNASF 158
            V+T+AG G   +G  DG  + A F
Sbjct: 1139 VSTVAGKGV--QGNRDGTKEEAQF 1160



 Score = 66.2 bits (160), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 43/112 (38%), Positives = 62/112 (55%), Gaps = 5/112 (4%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG 134
            ++    V   AG+G  GY+DG    ARF++P     D  GN+YV++ SN+ IRKIT N  
Sbjct: 1026 ITPNGTVSTFAGNGTAGYADGVGVLARFNRPTGIITDALGNVYVSESSNY-IRKITPNGT 1084

Query: 135  VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            V+T AG G+   G ADGP  +A F N  +   +     + +S+  N  IR+I
Sbjct: 1085 VSTFAGNGTA--GYADGPGTSAMF-NSPQAMVMDASDNIYVSESSNHRIRKI 1133



 Score = 64.7 bits (156), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 38/85 (44%), Positives = 51/85 (60%), Gaps = 3/85 (3%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG 134
            ++ + +V  +AG G  G S+G   +A F+KP   A+D  GNIY+AD  NH IRKIT N  
Sbjct: 972  ITPQGLVSTIAGSGTNGNSNGVGTAASFNKPTGIAIDPWGNIYIADDLNHSIRKITPNGT 1031

Query: 135  VTTIAGGGSKKEGRADGPAQNASFS 159
            V+T AG G+   G ADG    A F+
Sbjct: 1032 VSTFAGNGTA--GYADGVGVLARFN 1054



 Score = 61.6 bits (148), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 41/116 (35%), Positives = 64/116 (55%), Gaps = 12/116 (10%)

Query: 80   SVVKRLAGDG--------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            S+++++  DG          GY+DG   +A+F+ P+  A+D  GNIY+A++    IRKIT
Sbjct: 914  SLIRKITADGQVSTFAGSTFGYADGIGTAAKFNGPQGMAIDASGNIYIAERGASRIRKIT 973

Query: 132  NLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
              G V+TIAG G+   G ++G    ASF+    +   P    + I+D  N  IR+I
Sbjct: 974  PQGLVSTIAGSGTN--GNSNGVGTAASFNKPTGIAIDPW-GNIYIADDLNHSIRKI 1026



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 40/102 (39%), Positives = 55/102 (53%), Gaps = 12/102 (11%)

Query: 52   DRPGSSDL------IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGS 100
            D PG+S +      +V+D+S + + + S       ++    V  +AG GVQG  DG    
Sbjct: 1098 DGPGTSAMFNSPQAMVMDASDNIYVSESSNHRIRKITPAGEVSTVAGKGVQGNRDGTKEE 1157

Query: 101  ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
            A+F  P   A+DM GNIYVA+ SNH IRKIT  G+T     G
Sbjct: 1158 AQFWGPVGIALDMSGNIYVAEWSNHRIRKITQ-GITLTGTPG 1198


>gi|322434821|ref|YP_004217033.1| hypothetical protein AciX9_1190 [Granulicella tundricola MP5ACTX9]
 gi|321162548|gb|ADW68253.1| NHL repeat containing protein [Granulicella tundricola MP5ACTX9]
          Length = 1631

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 42/110 (38%), Positives = 59/110 (53%), Gaps = 4/110 (3%)

Query: 81  VVKRLAGDGVQ--GYSDGEPGSAR-FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
           V+   AG+GV   GY DG P +A   + P   AV  KG IY+AD+ N +IRK+ N  ++T
Sbjct: 313 VITTFAGNGVTTGGYGDGGPAAAAMLNAPWGIAVSSKGEIYIADQGNSLIRKVVNGTIST 372

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +AG     +  ++ PA +    N      V     L ISD GN LIR++N
Sbjct: 373 VAGTHDAADTGSNPPAIHTQL-NSPAGVVVDVAGNLYISDSGNNLIRKVN 421



 Score = 45.4 bits (106), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 4/91 (4%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRAD-GPAQNA 156
           SA    P S A+D  GN++++D  N+VIRK+   +  ++TIAG   K     D GPA +A
Sbjct: 219 SATLSSPSSVALDGAGNVFISDTGNNVIRKVNVFDGTISTIAGQMGKNGYVGDGGPASSA 278

Query: 157 SFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             +    L F      L   D  N ++R+I+
Sbjct: 279 KLNGPNGLVFDAQ-GNLYFCDTNNNVVRRID 308



 Score = 42.0 bits (97), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 58/110 (52%), Gaps = 4/110 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVT-TI 138
           +  +AG  V  Y+ +G    + F  P    +D  GN+Y+ D SN  IRK+  + G+T TI
Sbjct: 143 ITTVAGSSVWIYNGEGVATQSPFFLPFGVTLDAAGNLYITDTSNTRIRKVDAVTGMTSTI 202

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           AG G+       GPA +A+ S+   +        + ISD GN +IR++N+
Sbjct: 203 AGNGTIGGTGDGGPATSATLSSPSSVAL-DGAGNVFISDTGNNVIRKVNV 251


>gi|290972390|ref|XP_002668936.1| predicted protein [Naegleria gruberi]
 gi|284082474|gb|EFC36192.1| predicted protein [Naegleria gruberi]
          Length = 727

 Score = 67.0 bits (162), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 58/176 (32%), Positives = 82/176 (46%), Gaps = 23/176 (13%)

Query: 27  LLEDGYTVTTVIDGHQ-------LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF----- 74
           +LE+G  VT   +G Q       L +N   V      S   V  SS++  Y   +     
Sbjct: 35  ILENGNIVTIAGNGQQGYSGDNGLAVNAQLV------SPSYVFVSSKNEVYIADYQNNRI 88

Query: 75  -PLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI- 130
             + E   +  +AG+G QGY+  +G   +A+   P S  V +K  +Y+AD  NH IRKI 
Sbjct: 89  RKILENGNIITIAGNGQQGYNGDNGLAVNAQLASPSSVCVSLKNEVYIADSGNHSIRKIL 148

Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            N  + TIAG G +     +GPA NA    D    FV     + I+D GN  IR+I
Sbjct: 149 ENGNIVTIAGNGQQGYNGDNGPAVNAPLF-DPSSVFVTPNDEVYIADTGNHRIRKI 203



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 61/114 (53%), Gaps = 8/114 (7%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSA----RFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
           E   +  +AG+G QGY +G+ G A     FD P S  V     +Y+AD  NH IRKI  N
Sbjct: 149 ENGNIVTIAGNGQQGY-NGDNGPAVNAPLFD-PSSVFVTPNDEVYIADTGNHRIRKILRN 206

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             + TIAG G +     +GPA NA  S+  ++ FV     + I+D+ N  IR+I
Sbjct: 207 GNIITIAGNGQQGYSGDNGPAVNAKLSSPVDV-FVSSKNEVYIADYQNTRIRKI 259



 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 54/105 (51%), Gaps = 4/105 (3%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
           +AG+G QGYS  +G   +A+   P    V  K  +Y+AD  N  IRKI  N  + TIAG 
Sbjct: 212 IAGNGQQGYSGDNGPAVNAKLSSPVDVFVSSKNEVYIADYQNTRIRKILENGNIITIAGN 271

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G    G  +G A NA   +   + FV     + I+D+ N  IR+I
Sbjct: 272 GKLGYGGDNGLAVNAKLYSPVGV-FVSQDNEVYIADYQNNRIRKI 315



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 3/89 (3%)

Query: 100 SARFDKPKS-FAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNAS 157
           +A+   P+S F       +Y+AD  NH +RKI  N  + TIAG G +     +G A NA 
Sbjct: 4   NAQLFNPRSIFVSSSNDEVYIADTGNHTVRKILENGNIVTIAGNGQQGYSGDNGLAVNAQ 63

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQI 186
             +     FV     + I+D+ N  IR+I
Sbjct: 64  LVSP-SYVFVSSKNEVYIADYQNNRIRKI 91



 Score = 41.2 bits (95), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 44/169 (26%), Positives = 74/169 (43%), Gaps = 20/169 (11%)

Query: 27  LLEDGYTVTTVIDGHQ-LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF------PLSEE 79
           +L +G  +T   +G Q    +    ++   SS + V  SS++  Y   +       + E 
Sbjct: 203 ILRNGNIITIAGNGQQGYSGDNGPAVNAKLSSPVDVFVSSKNEVYIADYQNTRIRKILEN 262

Query: 80  SVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
             +  +AG+G  GY   +G   +A+   P    V     +Y+AD  N+ IRKI  N  + 
Sbjct: 263 GNIITIAGNGKLGYGGDNGLAVNAKLYSPVGVFVSQDNEVYIADYQNNRIRKILENGNII 322

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           TIAG G   E   DG +        F+ +  PHI  +  S +G   +++
Sbjct: 323 TIAGNG---ELGYDGDSV-------FDFSKYPHIGPIYSSTYGKAKLKK 361


>gi|86605024|ref|YP_473787.1| NHL repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86553566|gb|ABC98524.1| NHL repeat domain protein [Synechococcus sp. JA-3-3Ab]
          Length = 637

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 50/84 (59%), Gaps = 3/84 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +S +  V   AG  V GY DGE   A+F +P   A+ + G++YVAD++N  IR+IT  G 
Sbjct: 408 ISRQRQVTTFAGTCVAGYRDGERDEAQFREPFGLALGLDGSLYVADRANRRIRRITPTGK 467

Query: 135 VTTIAGGGSKKEGRADGPAQNASF 158
           VTT AG G  + G ADGPA  A  
Sbjct: 468 VTTAAGTG--QPGSADGPADQAQL 489



 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 48/134 (35%), Positives = 65/134 (48%), Gaps = 6/134 (4%)

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-N 132
           F LS E  ++  AG G  G+ DG    A+FD P+    D KG ++VAD  NH +R I+  
Sbjct: 352 FRLSPEGELEVWAGSGKAGHRDGAADQAQFDSPQGLLWDPKGGLWVADSGNHCLRHISRQ 411

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL--KP 190
             VTT A  G+   G  DG    A F   F L       +L ++D  N+ IR+I    K 
Sbjct: 412 RQVTTFA--GTCVAGYRDGERDEAQFREPFGLALG-LDGSLYVADRANRRIRRITPTGKV 468

Query: 191 EDCSKSSQSGSALG 204
              + + Q GSA G
Sbjct: 469 TTAAGTGQPGSADG 482



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 62/121 (51%), Gaps = 12/121 (9%)

Query: 76  LSEESVVKRLAGDGV--------QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           +++   ++RL+ DG          GY DG    ARF      A D  G +++AD+ NH +
Sbjct: 505 IADRHRLRRLSADGQLTTLSRAEAGYRDGPLAEARFQTLAGLAFDSAGILWLADRDNHRL 564

Query: 128 RKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           R++   G V+T+A  G  + G  DGPA  A F    +L  +P   ++++ D G   +R++
Sbjct: 565 RRLQPNGQVSTLA--GQDEPGWQDGPASVARFEQPGDLLVLPD-GSVVVVDAGLPGLRRL 621

Query: 187 N 187
           +
Sbjct: 622 S 622



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 55/107 (51%), Gaps = 7/107 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V   AG G  G +DG    A+  +P + AVD +GN+++AD+  H +R+++  G +TT++ 
Sbjct: 468 VTTAAGTGQPGSADGPADQAQLLQPTALAVDREGNLWIADR--HRLRRLSADGQLTTLS- 524

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
               + G  DGP   A F     L F      L ++D  N  +R++ 
Sbjct: 525 --RAEAGYRDGPLAEARFQTLAGLAFD-SAGILWLADRDNHRLRRLQ 568



 Score = 44.3 bits (103), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
            AVD +G  Y+AD + H I +++  G   +   GS K G  DG A  A F +   L + P
Sbjct: 333 IAVDPEGFFYLADPAQHRIFRLSPEGELEVW-AGSGKAGHRDGAADQAQFDSPQGLLWDP 391

Query: 169 HICALLISDHGNQLIRQIN 187
               L ++D GN  +R I+
Sbjct: 392 K-GGLWVADSGNHCLRHIS 409


>gi|380693545|ref|ZP_09858404.1| hypothetical protein BfaeM_06134 [Bacteroides faecis MAJ27]
          Length = 454

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 57/98 (58%), Gaps = 1/98 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY +G   + R +KP+  A D  GN++V +++NH+IRKIT  G  T+  G  ++ G  DG
Sbjct: 358 GYVNGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKITPAGSATLYAGRPEQSGFGDG 417

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             + A F+    +T  P   ++ ++D  N +IR++ ++
Sbjct: 418 LPEEAKFNQPECVTVYPD-NSIYVADRDNHVIRRVTVE 454


>gi|302768699|ref|XP_002967769.1| hypothetical protein SELMODRAFT_409048 [Selaginella moellendorffii]
 gi|300164507|gb|EFJ31116.1| hypothetical protein SELMODRAFT_409048 [Selaginella moellendorffii]
          Length = 199

 Score = 67.0 bits (162), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 50/140 (35%), Positives = 69/140 (49%), Gaps = 13/140 (9%)

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           VTTIAGG S+K G ADGP   A FS++F L       +LLI+D GN+LIR+I +  +D  
Sbjct: 68  VTTIAGGSSRKPGFADGPGDTARFSSEFSLAC--SCGSLLIADRGNRLIREIQI--DDPK 123

Query: 195 KSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEALPNQSG 254
               S SA+     W L+ V+  L+ + +      +II     LD   +   + L N   
Sbjct: 124 SRDSSDSAVSGPQKWALIPVV--LLGICLEMPLGAFIIWKI--LDQRCRQAGDQLGNTER 179

Query: 255 -----ETSTDLLLRHQKRNC 269
                    DL +R QKR C
Sbjct: 180 LLAGVPEDDDLQVRQQKRTC 199


>gi|427400231|ref|ZP_18891469.1| hypothetical protein HMPREF9710_01065 [Massilia timonae CCUG 45783]
 gi|425720971|gb|EKU83886.1| hypothetical protein HMPREF9710_01065 [Massilia timonae CCUG 45783]
          Length = 367

 Score = 66.6 bits (161), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 44/77 (57%), Gaps = 1/77 (1%)

Query: 85  LAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
           LAGD  ++G  DG   +ARF  P+  A+D   N+YVAD  N VIRKIT  GV T   G +
Sbjct: 55  LAGDPTIEGTMDGAGTAARFKWPQGIAIDADDNLYVADTGNFVIRKITPAGVVTTVAGAA 114

Query: 144 KKEGRADGPAQNASFSN 160
              G  DG A NA F N
Sbjct: 115 GTSGFVDGDAGNARFGN 131



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 44/78 (56%), Gaps = 7/78 (8%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV  +AG  G  G+ DG+ G+ARF  P + AV+ +G IYVAD  N  IR IT+ G
Sbjct: 101 ITPAGVVTTVAGAAGTSGFVDGDAGNARFGNPVAVAVNRRGTIYVAD--NLRIRSITSAG 158

Query: 135 ----VTTIAGGGSKKEGR 148
                 TI  G +  +GR
Sbjct: 159 RVSTTATIPIGNNVSDGR 176



 Score = 44.7 bits (104), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 35/54 (64%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           +G ++G   +ARF++  +  VD +GN+Y AD  N+++R+IT  GV T   G ++
Sbjct: 273 RGSNNGVGTAARFEQVVALTVDPRGNVYAADAINNLVRRITPEGVVTTVAGSTR 326


>gi|298386893|ref|ZP_06996448.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
 gi|298260567|gb|EFI03436.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
          Length = 454

 Score = 66.6 bits (161), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 57/98 (58%), Gaps = 1/98 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY +G   + R +KP+  A D  GN++V +++NH+IRKIT  G  T+  G  ++ G  DG
Sbjct: 358 GYVNGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKITPAGSATLYAGRPEQSGFGDG 417

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             + A F+    +T  P   ++ ++D  N +IR++ ++
Sbjct: 418 LPEEAKFNQPECVTVYPD-NSIYVADRDNHVIRRVTVE 454


>gi|124006033|ref|ZP_01690870.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
 gi|123988440|gb|EAY28086.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
          Length = 789

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 63/184 (34%), Positives = 94/184 (51%), Gaps = 22/184 (11%)

Query: 12  ITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFY 70
           ITLL ++ SL +SG   ++ Y + TV+  +    N   V +D    +++ V D+ R+  Y
Sbjct: 9   ITLLCIV-SLEASG---QNQYKINTVLSNYD---NIWGVAVD--AQNNVYVTDAERNRIY 59

Query: 71  TLSFPLSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIR 128
            L       +++   AG G +GYS DG P +A   D P    VD  GN+Y AD SN V+R
Sbjct: 60  RLDAITGVRTII---AGTGTEGYSGDGGPATAAMLDYPTGITVDAAGNVYFADGSNDVVR 116

Query: 129 KI--TNLGVTTIAGGGSKKEGRADGPAQNAS--FSNDFELTFVPHICALLISDHGNQLIR 184
           KI  T   ++TIAG G++      G A  A   F +D  L    +I    I DH N  IR
Sbjct: 117 KIDATTHVISTIAGNGNRGFAGDGGQATAAQLHFPSDVALDTAGNI---YIVDHRNDRIR 173

Query: 185 QINL 188
           ++++
Sbjct: 174 KVDI 177



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 53/106 (50%), Gaps = 7/106 (6%)

Query: 85  LAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAG 140
            AGDG    +DG P   A    PK  A+D  GN+Y+AD  +  IRK+   T + +TTIAG
Sbjct: 225 FAGDG-NALNDGGPAHLASLRNPKGLAIDAAGNVYIADVLDDRIRKVDARTGI-ITTIAG 282

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G+       G A +A  +  F +  V  +  +  +D  N  +R++
Sbjct: 283 TGAGGYSGDGGVATSARINYPFRVA-VDALGNVYFTDWDNDALRKL 327


>gi|428175994|gb|EKX44881.1| hypothetical protein GUITHDRAFT_60229, partial [Guillardia theta
           CCMP2712]
          Length = 181

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 5/109 (4%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLG--VTTIAG- 140
           LAG    GY DG+    RF+ P   A    G+ + VAD +NH IR+I  +   VTTIAG 
Sbjct: 1   LAGPSSAGYQDGQAKIVRFNSPLDIAFSKDGSWLAVADTNNHAIRRIDAMTGVVTTIAGC 60

Query: 141 -GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
              S   G  DGPA  ASF++   +   P    + ++D  N +IRQI+L
Sbjct: 61  PRSSSCLGSQDGPASAASFNSPTSIALDPQDKFIFVADTSNNMIRQIDL 109



 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 42/116 (36%), Positives = 62/116 (53%), Gaps = 9/116 (7%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGR 148
           G  DG   +A F+ P S A+D +   I+VAD SN++IR+I  T + VTTIA  G+ + G 
Sbjct: 68  GSQDGPASAASFNSPTSIALDPQDKFIFVADTSNNMIRQIDLTAMTVTTIA--GATRSGY 125

Query: 149 ADGPAQNASFSNDFELTFVPHICA----LLISDHGNQLIRQINLKPEDCSKSSQSG 200
           ADG    + F N   +T  P        ++++D  N  IR+I+L     S  + SG
Sbjct: 126 ADGLLAASLFYNPTGITAHPDFTNENKLVMVADTQNHAIRRIDLFQGRVSTVAGSG 181


>gi|403382146|ref|ZP_10924203.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
           JC66]
          Length = 531

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 61/191 (31%), Positives = 81/191 (42%), Gaps = 25/191 (13%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEINPHS-----VIDRPGSSDLIVLDSSRSAFYTLSFP 75
           L  S  L EDG  V  + DG + E++  S      +D  G  +L V D    A   +   
Sbjct: 98  LAGSSFLQEDGQVVDALGDG-KGELSSFSEPAGLALDHNG--NLFVADKGNHAVRKVD-- 152

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
              E  V   AG GV G+ DG    + F  P+   V   G +YVAD  NHVIRKI   G 
Sbjct: 153 --AEGNVTTYAGQGVLGHKDGTAEESLFYAPEDVVVASDGTVYVADTLNHVIRKIDPEGK 210

Query: 135 -----------VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
                      +    G         DGP Q A F+    L  + H+  L ISD GN++I
Sbjct: 211 VSTLNALPQRYIEVFPGEAVLAGDYKDGPLQEAKFNEPTGLA-IDHLGNLYISDTGNRVI 269

Query: 184 RQINLKPEDCS 194
           R ++L  +  S
Sbjct: 270 RYMDLANDRVS 280



 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 62/125 (49%), Gaps = 7/125 (5%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
           S V+ +AG G  G  DG+   A F  P   A    G +YV+D  +H++R++ + GV+ +A
Sbjct: 40  SSVQTIAGTGSFGARDGDKAEASFRHPSGLAAAPDGTLYVSDTKSHLLRRLDHSGVSLLA 99

Query: 140 GG------GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           G       G   +   DG  + +SFS    L  + H   L ++D GN  +R+++ +    
Sbjct: 100 GSSFLQEDGQVVDALGDGKGELSSFSEPAGLA-LDHNGNLFVADKGNHAVRKVDAEGNVT 158

Query: 194 SKSSQ 198
           + + Q
Sbjct: 159 TYAGQ 163



 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 33/61 (54%), Gaps = 1/61 (1%)

Query: 92  GYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
           G+SDG     A F  P+  A+  +G + +AD  NH IR +    V T+AGG   + G+ D
Sbjct: 302 GFSDGHATEEALFMAPRGIAITEEGGLVIADSLNHAIRYLFEGRVITLAGGHEAEHGQQD 361

Query: 151 G 151
           G
Sbjct: 362 G 362


>gi|340788705|ref|YP_004754170.1| hypothetical protein CFU_3523 [Collimonas fungivorans Ter331]
 gi|340553972|gb|AEK63347.1| hypothetical protein CFU_3523 [Collimonas fungivorans Ter331]
          Length = 394

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 50/86 (58%), Gaps = 1/86 (1%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
           ARF++P   AVD  GN+Y+AD  NH IR IT+ GV T   G + + G+ DG A +A F+ 
Sbjct: 108 ARFNQPGGIAVDAGGNLYLADTQNHTIRMITSAGVVTTIAGSAGQAGQNDGTAGDARFNQ 167

Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
            + +        L ++D GN  +R+I
Sbjct: 168 PWGVAR-DGAGNLYVTDTGNATVRKI 192



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/93 (39%), Positives = 52/93 (55%), Gaps = 6/93 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
           + ++ V DS      T+S       VV  LAG  G  G +DG    ARF++P+  A D  
Sbjct: 297 AGNIRVSDSGNQLIRTVSL----TGVVSTLAGAAGTAGSTDGSGNKARFNQPEGIAADAA 352

Query: 115 GNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKE 146
            NIYVAD SN++IRK+T +  V+T+ GGG+   
Sbjct: 353 NNIYVADTSNNLIRKVTPDAQVSTLFGGGNSNS 385



 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV  +AG  G  G +DG  G ARF++P   A D  GN+YV D  N  +RKIT  G
Sbjct: 137 ITSAGVVTTIAGSAGQAGQNDGTAGDARFNQPWGVARDGAGNLYVTDTGNATVRKITAAG 196

Query: 135 VTT 137
           V T
Sbjct: 197 VVT 199



 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 43/86 (50%), Gaps = 1/86 (1%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
           ARF++P   A+D  GNI V+D  N +IR ++  GV +   G +   G  DG    A F N
Sbjct: 284 ARFNQPYGIALDSAGNIRVSDSGNQLIRTVSLTGVVSTLAGAAGTAGSTDGSGNKARF-N 342

Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
             E         + ++D  N LIR++
Sbjct: 343 QPEGIAADAANNIYVADTSNNLIRKV 368



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 29/43 (67%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           G SDG   + RF +P + AVD  GN++VAD +N+ IRKIT  G
Sbjct: 44  GSSDGSGSNVRFSQPGAIAVDAGGNLFVADTANNTIRKITAAG 86



 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 49/99 (49%), Gaps = 12/99 (12%)

Query: 101 ARFDKPKSFAVDMKGN-----------IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
           A+F+ P+   +D  GN           IY+AD +N+ IRK+   G  +   G +   G A
Sbjct: 218 AQFNLPRGITLDNAGNSTGTAVNLAVNIYIADSNNNTIRKLDQNGNVSTLAGTAGSSGSA 277

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           DG  Q A F+  + +        + +SD GNQLIR ++L
Sbjct: 278 DGSGQRARFNQPYGIAL-DSAGNIRVSDSGNQLIRTVSL 315


>gi|344924046|ref|ZP_08777507.1| NHL repeat containing protein [Candidatus Odyssella
           thessalonicensis L13]
          Length = 368

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 47/141 (33%), Positives = 69/141 (48%), Gaps = 10/141 (7%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
            DRP S   + +DSS + +    F      ++   V   LAG    GY +G  G+A+F  
Sbjct: 162 FDRPYS---VAVDSSGTVYVADFFNSRIRRITSAGVTSTLAGSSTGGYLEGTGGAAQFGT 218

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P   AVD  G +YV D     +RKIT+ GVT++   GS   G A+G   +A FS+ + + 
Sbjct: 219 PIDVAVDSSGTVYVTDTYTQRVRKITSGGVTSLL-AGSNTIGYAEGTGASARFSSPYGIA 277

Query: 166 FVPHICALLISDHGNQLIRQI 186
            V       ++D  N  IR+I
Sbjct: 278 -VDSSGTAYVADSDNHRIRKI 297



 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 60/112 (53%), Gaps = 5/112 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           ++   V   LAG   QGY++G    ARFD+P S AVD  G +YVAD  N  IR+IT+ GV
Sbjct: 136 ITSAGVTSLLAGS-AQGYAEGTGSGARFDRPYSVAVDSSGTVYVADFFNSRIRRITSAGV 194

Query: 136 T-TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           T T+A  GS   G  +G    A F    ++  V     + ++D   Q +R+I
Sbjct: 195 TSTLA--GSSTGGYLEGTGGAAQFGTPIDVA-VDSSGTVYVTDTYTQRVRKI 243



 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 59/106 (55%), Gaps = 3/106 (2%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           +V  LAG    GY++G   SARF+ P   AV   G IYVAD +NH IR I++ G T++  
Sbjct: 34  IVSTLAGS-TGGYAEGTGASARFNYPYGIAVHSSGTIYVADSANHRIRSISSAGTTSVF- 91

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            GS   G  +G   +A F+N + +  V     L +S++ N  IR+I
Sbjct: 92  AGSGTAGTTEGTGASAQFNNPYGVA-VDSSGTLYVSEYTNHRIRKI 136



 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 46/84 (54%), Gaps = 5/84 (5%)

Query: 60  IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           + +DSS + + T ++      ++   V   LAG    GY++G   SARF  P   AVD  
Sbjct: 222 VAVDSSGTVYVTDTYTQRVRKITSGGVTSLLAGSNTIGYAEGTGASARFSSPYGIAVDSS 281

Query: 115 GNIYVADKSNHVIRKITNLGVTTI 138
           G  YVAD  NH IRKIT+ G T++
Sbjct: 282 GTAYVADSDNHRIRKITSGGTTSL 305



 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 49/87 (56%), Gaps = 3/87 (3%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           SA+F+ P   AVD  G +YV++ +NH IRKIT+ GVT++  G    +G A+G    A F 
Sbjct: 106 SAQFNNPYGVAVDSSGTLYVSEYTNHRIRKITSAGVTSLLAG--SAQGYAEGTGSGARFD 163

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
             + +  V     + ++D  N  IR+I
Sbjct: 164 RPYSVA-VDSSGTVYVADFFNSRIRRI 189



 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 22/32 (68%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +F+ P    VD  G++Y+AD SNH+IRKI  L
Sbjct: 323 QFNYPAGIDVDSSGSLYIADSSNHLIRKIDAL 354


>gi|298387997|ref|ZP_06997545.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
 gi|298259263|gb|EFI02139.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
          Length = 439

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 73/134 (54%), Gaps = 14/134 (10%)

Query: 65  SRSAFYTLS-------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           S+SA Y          +P  +  + + + G G  G+ DG+   A+F++P+  A+D +GN+
Sbjct: 311 SKSAVYRFKLTRDADGWPALDGDIEEYIPGAGA-GFRDGDVQEAQFNEPRGIAIDKEGNL 369

Query: 118 YVADKSNHVIRKI-TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           Y+AD +NH IRK+ T L  VTTIAG G   +G  DG    A F   + + ++     L I
Sbjct: 370 YIADVNNHRIRKVDTKLNIVTTIAGSG---KGYKDGDPLEAQFDQPWGV-YLDKNEFLYI 425

Query: 176 SDHGNQLIRQINLK 189
           +D  N  IR++ ++
Sbjct: 426 ADQNNHCIRKLAIE 439


>gi|405345633|ref|ZP_11022426.1| hypothetical protein A176_0011 [Chondromyces apiculatus DSM 436]
 gi|397093682|gb|EJJ24379.1| hypothetical protein A176_0011 [Myxococcus sp. (contaminant ex DSM
           436)]
          Length = 870

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 52/148 (35%), Positives = 79/148 (53%), Gaps = 9/148 (6%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFD 104
           +P +V+  P + +L V DS       L     E   V+  AG G V G++DG P  ARF 
Sbjct: 611 SPTAVVAGP-TGELYVADSGNHVIRRLDRG-EEGWQVRTWAGQGFVAGFADGGPARARFS 668

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRADG-PAQNASFSND 161
           +P + AVD  GN+YVAD+ NH IR +      V T+AG G+   G AD    ++ASF+  
Sbjct: 669 RPMALAVDAAGNVYVADQDNHRIRMVRAGTREVVTLAGTGTL--GTADAVRGRDASFAAP 726

Query: 162 FELTFVPHICALLISDHGNQLIRQINLK 189
             L  +  +  L + D  +Q +R+++L+
Sbjct: 727 SALA-LGGVGTLYVLDTVSQRLRRVSLQ 753



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 38/121 (31%), Positives = 54/121 (44%), Gaps = 7/121 (5%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTTIAGGGSK 144
           G  G+ DG   +A F  P   AV   G + VAD  N+ IR I   G    V+TIAG G  
Sbjct: 538 GASGWQDGPADTAMFQSPTGLAVTHAGEVVVADTRNNRIRLIQQEGAGRTVSTIAGSG-- 595

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
           + G  DG    A   +   +   P    L ++D GN +IR+++   E     + +G    
Sbjct: 596 ELGHRDGAGSQALLRSPTAVVAGP-TGELYVADSGNHVIRRLDRGEEGWQVRTWAGQGFV 654

Query: 205 A 205
           A
Sbjct: 655 A 655



 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 44/155 (28%), Positives = 65/155 (41%), Gaps = 6/155 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI----TNLGVTT 137
           V  +AG G  G+ DG    A    P +      G +YVAD  NHVIR++        V T
Sbjct: 588 VSTIAGSGELGHRDGAGSQALLRSPTAVVAGPTGELYVADSGNHVIRRLDRGEEGWQVRT 647

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS 197
            AG G    G ADG    A FS    L  V     + ++D  N  IR +     +    +
Sbjct: 648 WAGQGFVA-GFADGGPARARFSRPMALA-VDAAGNVYVADQDNHRIRMVRAGTREVVTLA 705

Query: 198 QSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
            +G+   A +V    +  +   +L +G V   Y++
Sbjct: 706 GTGTLGTADAVRGRDASFAAPSALALGGVGTLYVL 740



 Score = 40.8 bits (94), Expect = 0.67,   Method: Composition-based stats.
 Identities = 44/139 (31%), Positives = 58/139 (41%), Gaps = 10/139 (7%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ---GYSDGEPGSARFDKPKSFAV 111
           G   L VLD+       +S  L     V  LAG G     G+ DG    ARF       +
Sbjct: 733 GVGTLYVLDTVSQRLRRVS--LQGSRAVVTLAGTGAGTPFGFQDGPGSDARFRAQLGMVM 790

Query: 112 DMKGNIYVADKSNHVIRKIT---NLGVTTI-AGGGSKKEGRADGPAQNASFSNDFELTFV 167
             +G + +AD +N  +RKI    N   T +    GS + G A G A  A  S    L F 
Sbjct: 791 GPQGELLLADTANLRLRKIIPGENAAATRVFTFAGSGRVGTALGRADAADLSAPVGLAFD 850

Query: 168 PHICALLISDHGNQLIRQI 186
                L +SD  NQ+IR +
Sbjct: 851 AG-GLLYVSDAFNQVIRVV 868


>gi|380692433|ref|ZP_09857292.1| NHL repeat-containing protein [Bacteroides faecis MAJ27]
          Length = 439

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 73/134 (54%), Gaps = 14/134 (10%)

Query: 65  SRSAFYTLS-------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           S+SA Y          +P  +  + + + G G  G+ DG+   A+F++P+  A+D +GN+
Sbjct: 311 SKSAVYRFKLTRDADGWPALDGDIEEYIPGAGA-GFRDGDVQEAQFNEPRGIAIDKEGNL 369

Query: 118 YVADKSNHVIRKI-TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           Y+AD +NH IRK+ T L  VTTIAG G   +G  DG    A F   + + ++     L I
Sbjct: 370 YIADVNNHRIRKVDTKLNIVTTIAGSG---KGYKDGDPLEAQFDQPWGV-YLDKNEFLYI 425

Query: 176 SDHGNQLIRQINLK 189
           +D  N  IR++ ++
Sbjct: 426 ADQNNHCIRKLAIE 439


>gi|115376708|ref|ZP_01463935.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
 gi|310822897|ref|YP_003955255.1| hypothetical protein STAUR_5665 [Stigmatella aurantiaca DW4/3-1]
 gi|115366257|gb|EAU65265.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
 gi|309395969|gb|ADO73428.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
          Length = 856

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 35/86 (40%), Positives = 48/86 (55%), Gaps = 1/86 (1%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
            + P   A D  GNIYVAD  NH IR+I   G  T+  G   + G+ DG A+ A F+   
Sbjct: 616 LNGPMGIAADASGNIYVADSDNHCIRRIAPDGTATVFAGAVMEPGQGDGTAKAARFNQPA 675

Query: 163 ELTFVPHICALLISDHGNQLIRQINL 188
            L F P    LL++D GN +IR+++L
Sbjct: 676 GLAFGPG-GELLVADLGNGVIRRVDL 700



 Score = 47.4 bits (111), Expect = 0.007,   Method: Composition-based stats.
 Identities = 40/112 (35%), Positives = 55/112 (49%), Gaps = 10/112 (8%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI------TNLG 134
           VV  +AG    GY+DG P S++F      AV   G++ VAD  N+ IR+I          
Sbjct: 747 VVSIIAGSS-PGYADGTPTSSQFLPYLGLAVLKDGSLAVADPGNYRIRRILFSAQGKATE 805

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           VTT+AG G  + G  DG  Q+A       L  V     L ++D GN L+R +
Sbjct: 806 VTTLAGSG--RFGSRDGEGQSADLVLPAGLA-VGEDGTLYVADAGNALLRAV 854



 Score = 38.1 bits (87), Expect = 4.3,   Method: Composition-based stats.
 Identities = 21/64 (32%), Positives = 31/64 (48%), Gaps = 10/64 (15%)

Query: 77  SEESVVKRLAGDGVQ----------GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           S+   ++R+A DG            G  DG   +ARF++P   A    G + VAD  N V
Sbjct: 635 SDNHCIRRIAPDGTATVFAGAVMEPGQGDGTAKAARFNQPAGLAFGPGGELLVADLGNGV 694

Query: 127 IRKI 130
           IR++
Sbjct: 695 IRRV 698


>gi|223939666|ref|ZP_03631540.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223891717|gb|EEF58204.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 452

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 47/133 (35%), Positives = 65/133 (48%), Gaps = 6/133 (4%)

Query: 75  PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--- 130
           P     VV  +AG  G+ G+ DG    A F  P   A+D  GN+YVAD  N+ IRK+   
Sbjct: 203 PDGTNWVVTTIAGAAGIHGFKDGTNNDALFYSPIGLALDSAGNMYVADTGNNAIRKLTLE 262

Query: 131 -TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            TN  VTTIAG  +++ G  DG    A F+   E   V     + ++D  N  IR++ + 
Sbjct: 263 GTNWVVTTIAGSTNQQNGSLDGTNNQALFTWP-ESPAVDSAGNVYVADSYNYTIRKVTVV 321

Query: 190 PEDCSKSSQSGSA 202
             D   S+  G A
Sbjct: 322 GTDYVVSTVGGRA 334



 Score = 53.9 bits (128), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 45/122 (36%), Positives = 61/122 (50%), Gaps = 11/122 (9%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---- 130
           + ++ VV  +AG +   G  DG    ARF  P   AVD  GN++VA+   + IRK+    
Sbjct: 88  IGKDWVVTTIAGTNQAYGTKDGTNAEARFTGPTGLAVDASGNVFVAEGYANTIRKLSPIG 147

Query: 131 TNLGVTTIAG-GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           TN  VTTIAG  GS   G ADG   +A F   +          L ++D G   IRQ  L+
Sbjct: 148 TNWIVTTIAGLAGS--SGSADGTNSDARFYLPYGNMACDTNGNLFVTD-GYDTIRQ--LR 202

Query: 190 PE 191
           P+
Sbjct: 203 PD 204



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/69 (43%), Positives = 39/69 (56%), Gaps = 5/69 (7%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTTIAGGGSKKEGRAD 150
           DG   +A F  P+S A D  GN+YVAD +  VIRK+  +G    VTTIA G ++  G  D
Sbjct: 50  DGTNTAAHFYSPQSVACDASGNVYVADYNGRVIRKVEVIGKDWVVTTIA-GTNQAYGTKD 108

Query: 151 GPAQNASFS 159
           G    A F+
Sbjct: 109 GTNAEARFT 117


>gi|153868931|ref|ZP_01998653.1| receptor protein kinase [Beggiatoa sp. PS]
 gi|152074497|gb|EDN71345.1| receptor protein kinase [Beggiatoa sp. PS]
          Length = 3115

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 42/112 (37%), Positives = 60/112 (53%), Gaps = 4/112 (3%)

Query: 81   VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
             +  +AG+G +GYS DG+P +A + + P    VD  GN+Y+ADK+NH IRK+   G+ T 
Sbjct: 2335 TITTVAGNGNKGYSGDGDPATAAQINTPTGLEVDSTGNLYIADKNNHRIRKVDTEGIITT 2394

Query: 139  AGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
              G  K     DG  A  A  S   ++  +     L I+D GN  IR+I  K
Sbjct: 2395 FTGTGKPGTATDGIIASVAQISQPTDVA-LDQYGNLYIADKGNDTIRKIGEK 2445



 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 59/116 (50%), Gaps = 4/116 (3%)

Query: 77   SEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
            S  S++  +AG+G  GY   +G    AR   P   AVD + N+Y+AD  NH IRK+   G
Sbjct: 2275 SANSIITTVAGNGRSGYQGDNGPATGARLSNPTGLAVDSQNNLYIADTDNHRIRKVDLTG 2334

Query: 135  -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             +TT+AG G+K       PA  A  +    L  V     L I+D  N  IR+++ +
Sbjct: 2335 TITTVAGNGNKGYSGDGDPATAAQINTPTGLE-VDSTGNLYIADKNNHRIRKVDTE 2389



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 57/105 (54%), Gaps = 4/105 (3%)

Query: 85   LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
            +AG G +G +  DG    A+   P+  A+D +GN+Y+AD  NH IRK+ + G +TT+AG 
Sbjct: 2163 VAGTGAKGSTGDDGPAIEAKLKNPQGTAIDHEGNLYIADTLNHRIRKVDSNGIITTVAGI 2222

Query: 142  GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G       +G A  A   N   + F  +   L I+D GN  IR++
Sbjct: 2223 GKAGNTGDNGLATAAKLRNPTAIVF-DNNGHLYIADSGNHRIRKV 2266



 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 63/132 (47%), Gaps = 19/132 (14%)

Query: 57   SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
            SD+++  +  SA +TL     EE+      G             A  + PKS + D  GN
Sbjct: 2098 SDIVIQPNDNSAIWTLV----EEAAAIEQTG-------------ANLNSPKSMSFDASGN 2140

Query: 117  IYVADKSNH-VIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
             Y+AD  NH ++++ T   +T +AG G+K     DGPA  A   N  + T + H   L I
Sbjct: 2141 AYIADSLNHRILKRDTQGNLTVVAGTGAKGSTGDDGPAIEAKLKNP-QGTAIDHEGNLYI 2199

Query: 176  SDHGNQLIRQIN 187
            +D  N  IR+++
Sbjct: 2200 ADTLNHRIRKVD 2211


>gi|427400920|ref|ZP_18892158.1| hypothetical protein HMPREF9710_01754 [Massilia timonae CCUG 45783]
 gi|425720099|gb|EKU83025.1| hypothetical protein HMPREF9710_01754 [Massilia timonae CCUG 45783]
          Length = 717

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 53/151 (35%), Positives = 74/151 (49%), Gaps = 17/151 (11%)

Query: 49  SVIDRPGSSDLI-----VLDSSRSAFYTLSFP-------LSEESVVKRLAGDGVQGYSDG 96
            VID PGS+        V   +R A Y            +  +  V  LAG G +G++DG
Sbjct: 63  GVIDGPGSASRFSDPYGVAIGARGAVYVADGGEANRIRLIQPDGAVSTLAG-GKEGFADG 121

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQN 155
              +A F  P + A+D +GN+YVAD  NH IRK+   G VTT+AG GS   G  DG  + 
Sbjct: 122 IGAAAAFHTPSALALDHEGNLYVADTGNHAIRKVAPDGTVTTVAGSGSP--GYLDGIGRA 179

Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           A F+    +  V     + ++D  N  IR+I
Sbjct: 180 AQFNGPVGIA-VDDAGIVYVADTYNDRIRRI 209



 Score = 60.1 bits (144), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 49/137 (35%), Positives = 67/137 (48%), Gaps = 10/137 (7%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
             +L V D+   A   ++     +  V  +AG G  GY DG   +A+F+ P   AVD  G
Sbjct: 139 EGNLYVADTGNHAIRKVA----PDGTVTTVAGSGSPGYLDGIGRAAQFNGPVGIAVDDAG 194

Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
            +YVAD  N  IR+I   G VTT+AG G  K G  DG   +A F     L        L 
Sbjct: 195 IVYVADTYNDRIRRIAPDGMVTTLAGNG--KPGLLDGALLDAGFDTPSALA-AGRDGTLY 251

Query: 175 ISDHGNQLIRQINLKPE 191
           ++D GN  +R+I  KP+
Sbjct: 252 VADTGNHAVRRI--KPD 266



 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 47/108 (43%), Positives = 58/108 (53%), Gaps = 8/108 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSN-HVIRKITNLG-VTTI 138
           V  LAGDG  G  DG PGSA RF  P   A+  +G +YVAD    + IR I   G V+T+
Sbjct: 53  VSLLAGDGRDGVIDG-PGSASRFSDPYGVAIGARGAVYVADGGEANRIRLIQPDGAVSTL 111

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AGG   KEG ADG    A+F     L    H   L ++D GN  IR++
Sbjct: 112 AGG---KEGFADGIGAAAAFHTPSALAL-DHEGNLYVADTGNHAIRKV 155


>gi|320105416|ref|YP_004181006.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
 gi|319923937|gb|ADV81012.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
          Length = 822

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 60/111 (54%), Gaps = 5/111 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           +  +AG GV G+S G+ G+A       P   AVD  GN+YVAD  NH IRKI+   +TT+
Sbjct: 149 IATIAGTGVAGFS-GDGGAAVLATLSHPTGLAVDTGGNLYVADTDNHRIRKISGTTITTV 207

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           AG G +      GPA  A++ +  +   V     L I+D  NQ IR ++ +
Sbjct: 208 AGSGEQGFAGDGGPA-TAAWLDSPDGVAVDATGNLYIADTHNQRIRVVSAE 257



 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 59/110 (53%), Gaps = 4/110 (3%)

Query: 80  SVVKRLAGDGVQGYS-DGEPGSARF-DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VT 136
           + +  +AG G QG++ DG P +A + D P   AVD  GN+Y+AD  N  IR ++  G ++
Sbjct: 202 TTITTVAGSGEQGFAGDGGPATAAWLDSPDGVAVDATGNLYIADTHNQRIRVVSAEGTIS 261

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           TIAG GS+      G A  AS +    L+ V  +  +  +D  N  IR I
Sbjct: 262 TIAGNGSRAYAGDGGSAVAASLARPRGLS-VDALGNIYFADSDNNRIRLI 310



 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 57/114 (50%), Gaps = 3/114 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S   ++  +AG G QG++   G   SA  D P   AVD  GN+Y++D  N  IR+++  
Sbjct: 88  VSAAGIITTVAGTGEQGFAGDGGSATSAWLDSPVGVAVDRVGNLYISDSHNQRIRRVSGG 147

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            + TIAG G        G A  A+ S+   L  V     L ++D  N  IR+I+
Sbjct: 148 TIATIAGTGVAGFSGDGGAAVLATLSHPTGLA-VDTGGNLYVADTDNHRIRKIS 200



 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 49/94 (52%)

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPA 153
           S+ E  +A    P   A D  GN+Y AD +NHV+RK++  G+ T   G  ++    DG +
Sbjct: 52  SNVEATTAPLASPAGIAYDASGNMYFADLNNHVVRKVSAAGIITTVAGTGEQGFAGDGGS 111

Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             +++ +      V  +  L ISD  NQ IR+++
Sbjct: 112 ATSAWLDSPVGVAVDRVGNLYISDSHNQRIRRVS 145



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/118 (35%), Positives = 64/118 (54%), Gaps = 6/118 (5%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +S E  +  +AG+G + Y+ G+ GSA      +P+  +VD  GNIY AD  N+ IR I  
Sbjct: 254 VSAEGTISTIAGNGSRAYA-GDGGSAVAASLARPRGLSVDALGNIYFADSDNNRIRLIAT 312

Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            G +TT+AG GS+ +G   G A +A+       T V  +    +SD  N +IR++ L 
Sbjct: 313 TGIITTVAGNGSQGDGGDGGSALDATLDTP-RATAVRALGIFDLSDTHNDIIREVGLN 369


>gi|383124185|ref|ZP_09944852.1| hypothetical protein BSIG_3780 [Bacteroides sp. 1_1_6]
 gi|251839311|gb|EES67395.1| hypothetical protein BSIG_3780 [Bacteroides sp. 1_1_6]
          Length = 454

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 57/98 (58%), Gaps = 1/98 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY +G   + R +KP+  A D  GN++V +++NH+IRKIT  G  ++  G  ++ G  DG
Sbjct: 358 GYINGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKITPAGSASLYAGQPEQSGFGDG 417

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             + A F+    +T  P   ++ ++D  N +IR++ ++
Sbjct: 418 LPEEAKFNQPECVTVYPD-NSVYVADRDNHVIRRVTVE 454


>gi|302848538|ref|XP_002955801.1| hypothetical protein VOLCADRAFT_119185 [Volvox carteri f.
           nagariensis]
 gi|300258994|gb|EFJ43226.1| hypothetical protein VOLCADRAFT_119185 [Volvox carteri f.
           nagariensis]
          Length = 1149

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 65/126 (51%), Gaps = 12/126 (9%)

Query: 73  SFPLSEESVVKRLAGDG---------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
           S  +S++  ++R+A DG           G++DG  GSARF   +  AVD  GNI+VAD S
Sbjct: 241 SLVVSDQHCLRRVASDGFVTTIAGGTTPGHADGPSGSARFYNLRGVAVDGDGNIFVADSS 300

Query: 124 NHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
           NH +R++T     V+T+  G     G  DG   +A F N            L+++D GN 
Sbjct: 301 NHCVRQLTAADSIVSTLV-GSPGNAGFRDGTGTDARFRNPCGAAVNLQDGTLVVADAGNN 359

Query: 182 LIRQIN 187
            +R+++
Sbjct: 360 RLRKVD 365



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 42/113 (37%), Positives = 63/113 (55%), Gaps = 9/113 (7%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKITNLG- 134
           S E+VV  LAG G  G+ DG    A+F  P   A+D + G++ V+D+  H +R++ + G 
Sbjct: 203 SGEAVV--LAGGGGYGHKDGPGRKAKFSCPMYLALDHRDGSLVVSDQ--HCLRRVASDGF 258

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           VTTIAGG +   G ADGP+ +A F N      V     + ++D  N  +RQ+ 
Sbjct: 259 VTTIAGGTTP--GHADGPSGSARFYN-LRGVAVDGDGNIFVADSSNHCVRQLT 308



 Score = 44.3 bits (103), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 49/92 (53%), Gaps = 6/92 (6%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           G  ++ V DSS      L+   + +S+V  L G  G  G+ DG    ARF  P   AV++
Sbjct: 290 GDGNIFVADSSNHCVRQLT---AADSIVSTLVGSPGNAGFRDGTGTDARFRNPCGAAVNL 346

Query: 114 K-GNIYVADKSNHVIRKI-TNLGVTTIAGGGS 143
           + G + VAD  N+ +RK+  +   TTIAG G+
Sbjct: 347 QDGTLVVADAGNNRLRKVDRDRLTTTIAGNGT 378


>gi|223934991|ref|ZP_03626910.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223896444|gb|EEF62886.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 1064

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 56/108 (51%), Gaps = 13/108 (12%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           LAG   +G ++G   S +F  P++ A+D+  N++VAD  NHVIRKI+  G+ T   G   
Sbjct: 52  LAGYAGRGSANGISSSVQFKAPQAIAIDISNNVFVADTENHVIRKISCTGIITTLAGSLG 111

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
             G  DG   NA F       F P   A      +L++D GN  +R+I
Sbjct: 112 THGSRDGSGTNALF-------FRPAGIAVDASGNVLVADTGNNTVRKI 152



 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 45/115 (39%), Positives = 59/115 (51%), Gaps = 7/115 (6%)

Query: 75  PLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--- 130
           P     +V  +AG  G  G  DG   SARF +P+S +VD  G +YVAD  NH IRKI   
Sbjct: 593 PSGTNWIVNTVAGLAGFWGSIDGTNTSARFFQPRSLSVDASGALYVADSGNHAIRKITPS 652

Query: 131 -TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
            TN  VTT+A G +   G  DG   NA FS+   ++       + ++D  N  IR
Sbjct: 653 GTNWVVTTVA-GLAGAAGSVDGTGINAEFSHPAGISLT-SAGIVYVADSDNNTIR 705



 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 63/132 (47%), Gaps = 6/132 (4%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
           S +L V+D+           ++   VV   AG  G  G ++G   +A F  P+   +D+ 
Sbjct: 246 SGNLYVVDTGNGTIR----KITSSGVVTTFAGSAGNYGATNGIGANALFYAPQGITIDLF 301

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           G +YVAD  NH IRKIT+ G  T   G +   G AD    +ASF N   +T       L 
Sbjct: 302 GCVYVADTGNHTIRKITSDGTVTTLAGLAGNYGSADSVNSSASFWNPQGIT-SDATGNLY 360

Query: 175 ISDHGNQLIRQI 186
           I+D GN  IR I
Sbjct: 361 IADTGNNTIRTI 372



 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 52/106 (49%), Gaps = 2/106 (1%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           V  +AG  GV G  D    +A F  P+   VD  GN+YV D  N  IRKIT+ GV T   
Sbjct: 213 VNTMAGSAGVYGNLDNSGANALFSGPQGLTVDSSGNLYVVDTGNGTIRKITSSGVVTTFA 272

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G +   G  +G   NA F     +T     C + ++D GN  IR+I
Sbjct: 273 GSAGNYGATNGIGANALFYAPQGITIDLFGC-VYVADTGNHTIRKI 317



 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 55/159 (34%), Positives = 78/159 (49%), Gaps = 11/159 (6%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDM 113
           G+ ++ V D+    F  L   ++    V  LAG+    G SDG   +ARF  P   AVD 
Sbjct: 519 GTGNVFVADT----FNNLIRKITPGGAVTTLAGNFENFGSSDGTNSNARFYWPSGVAVDN 574

Query: 114 KGNIYVADKSNHVIRKI----TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
            GN++VAD  NH IR++    TN  V T+A G +   G  DG   +A F     L+ V  
Sbjct: 575 AGNVFVADYMNHTIRELIPSGTNWIVNTVA-GLAGFWGSIDGTNTSARFFQPRSLS-VDA 632

Query: 170 ICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSV 208
             AL ++D GN  IR+I     +   ++ +G A  A SV
Sbjct: 633 SGALYVADSGNHAIRKITPSGTNWVVTTVAGLAGAAGSV 671



 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 54/112 (48%), Gaps = 2/112 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S   ++  LAG  G  G  DG   +A F +P   AVD  GN+ VAD  N+ +RKIT  G
Sbjct: 97  ISCTGIITTLAGSLGTHGSRDGSGTNALFFRPAGIAVDASGNVLVADTGNNTVRKITATG 156

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             T   G +   G  D    NA F     +  + +   + ++D GN  IR+I
Sbjct: 157 DVTTFAGSAGNYGSTDNLGTNALFYRPTGIA-IDNFNNIFVADTGNNTIRKI 207



 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 2/112 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++ +      AG  GV G ++G    A+F  P+  AVD  GN++VAD  N++IRKIT  G
Sbjct: 481 ITPDGAATTFAGSAGVSGTANGTNTDAQFYAPQGLAVDGTGNVFVADTFNNLIRKITPGG 540

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             T   G  +  G +DG   NA F     +  V +   + ++D+ N  IR++
Sbjct: 541 AVTTLAGNFENFGSSDGTNSNARFYWPSGVA-VDNAGNVFVADYMNHTIREL 591



 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 44/118 (37%), Positives = 56/118 (47%), Gaps = 14/118 (11%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS-------ARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +S   +V  LAG      S G PGS       A F  P+   VD  GNIYVAD  NH+IR
Sbjct: 426 ISPSGLVCTLAG------SIGHPGSVNNIGTNALFSGPQGITVDGVGNIYVADTLNHIIR 479

Query: 129 KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +IT  G  T   G +   G A+G   +A F     L  V     + ++D  N LIR+I
Sbjct: 480 RITPDGAATTFAGSAGVSGTANGTNTDAQFYAPQGLA-VDGTGNVFVADTFNNLIRKI 536



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 73/157 (46%), Gaps = 9/157 (5%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   I    + +L + D+  +   T++        V   AG    G +DG    ARF  
Sbjct: 347 NPQG-ITSDATGNLYIADTGNNTIRTIT----PGGSVTTFAGLPSIGSADGLSSDARFRF 401

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P++ AVD   N+YVAD +N  IRKI+  G+     G     G  +    NA FS    +T
Sbjct: 402 PQAVAVDAATNVYVADTANQTIRKISPSGLVCTLAGSIGHPGSVNNIGTNALFSGPQGIT 461

Query: 166 FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
            V  +  + ++D  N +IR+I     D + ++ +GSA
Sbjct: 462 -VDGVGNIYVADTLNHIIRRIT---PDGAATTFAGSA 494



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 41/113 (36%), Positives = 56/113 (49%), Gaps = 5/113 (4%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++ +  V  LAG  G  G +D    SA F  P+    D  GN+Y+AD  N+ IR IT  G
Sbjct: 317 ITSDGTVTTLAGLAGNYGSADSVNSSASFWNPQGITSDATGNLYIADTGNNTIRTITPGG 376

Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            VTT A  G    G ADG + +A F    +   V     + ++D  NQ IR+I
Sbjct: 377 SVTTFA--GLPSIGSADGLSSDARFRFP-QAVAVDAATNVYVADTANQTIRKI 426



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 49/106 (46%), Gaps = 2/106 (1%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           V   AG  G  G +D    +A F +P   A+D   NI+VAD  N+ IRKIT  G      
Sbjct: 158 VTTFAGSAGNYGSTDNLGTNALFYRPTGIAIDNFNNIFVADTGNNTIRKITPSGNVNTMA 217

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G +   G  D    NA FS    LT V     L + D GN  IR+I
Sbjct: 218 GSAGVYGNLDNSGANALFSGPQGLT-VDSSGNLYVVDTGNGTIRKI 262


>gi|114568006|ref|YP_755160.1| leucine-rich repeat-containing protein [Syntrophomonas wolfei
           subsp. wolfei str. Goettingen]
 gi|114338941|gb|ABI69789.1| Leucine-rich repeat (LRR) protein-like protein [Syntrophomonas
           wolfei subsp. wolfei str. Goettingen]
          Length = 1351

 Score = 64.7 bits (156), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 39/110 (35%), Positives = 62/110 (56%), Gaps = 4/110 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           ++  +AG+G  GYS DG P  SA+   P   A D  GN+Y+ DK N+ IRK+ N G ++T
Sbjct: 506 IISTVAGNGTAGYSGDGGPATSAQLKDPVGVACDKNGNLYIVDKDNNRIRKVDNTGKIST 565

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +AG G+        PA +A     + +TF  +   + I+D  N+ IR+++
Sbjct: 566 VAGNGTGGYAGDGSPATSAQIWVPYGVTF-DNNGNMYIADMNNKRIRKVD 614



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  +AG+G  GY+ DG P  SA+   P     D  GN+Y+AD +N  IRK+   G +TT+
Sbjct: 563 ISTVAGNGTGGYAGDGSPATSAQIWVPYGVTFDNNGNMYIADMNNKRIRKVDPSGIITTV 622

Query: 139 AGGGSKKEGRADGPAQNASFSN 160
           AG GS K     GPA+ A   N
Sbjct: 623 AGNGSWKYSGDGGPAEAAGLCN 644



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 58/109 (53%), Gaps = 7/109 (6%)

Query: 85  LAGDGVQGYS--DGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +AG   QG++  +G+ G   SAR + P   A D  GN+Y+AD+ NH IRK+   G ++T+
Sbjct: 451 VAGIVTQGWATYEGDGGLATSARLNYPFGVACDGSGNLYIADRGNHRIRKVDTSGIISTV 510

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           AG G+       GPA +A   +   +    +   L I D  N  IR+++
Sbjct: 511 AGNGTAGYSGDGGPATSAQLKDPVGVACDKN-GNLYIVDKDNNRIRKVD 558



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 43/81 (53%), Gaps = 3/81 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           ++  +AG+G  GYS DG P  SA+ + P+    D   N+Y+ D  N  IRK+   G ++T
Sbjct: 335 IISTVAGNGTAGYSGDGGPATSAQLNTPRGVVCDGADNLYIVDGDNQRIRKVDTSGKIST 394

Query: 138 IAGGGSKKEGRADGPAQNASF 158
           +AG G+       GPA  A  
Sbjct: 395 VAGNGTHWFEGDGGPATAAGL 415



 Score = 40.4 bits (93), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 60/114 (52%), Gaps = 11/114 (9%)

Query: 82  VKRLAGDGVQGYS-DGEPGSAR--FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           +  +AG+G   +  DG P +A   +D P   A D  GN+Y+A+ +++ IRK+ + G ++ 
Sbjct: 392 ISTVAGNGTHWFEGDGGPATAAGLYD-PIGVACDSSGNLYIAESNSNCIRKVDSSGKISV 450

Query: 138 IAG----GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +AG    G +  EG   G A +A  +  F +        L I+D GN  IR+++
Sbjct: 451 VAGIVTQGWATYEGDG-GLATSARLNYPFGVA-CDGSGNLYIADRGNHRIRKVD 502


>gi|182416423|ref|YP_001821489.1| NHL repeat-containing protein [Opitutus terrae PB90-1]
 gi|177843637|gb|ACB77889.1| NHL repeat containing protein [Opitutus terrae PB90-1]
          Length = 1026

 Score = 64.7 bits (156), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 63/112 (56%), Gaps = 1/112 (0%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           +S +  V  LAG G  G +DG+  SARF      AVD  GN+YVAD  NH IRKIT  GV
Sbjct: 187 VSPDGNVTILAGAGKAGSADGDSASARFSGSDGLAVDGTGNVYVADLFNHTIRKITPDGV 246

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            T   G +++ G ADG    A F    EL+ +     +L++D GN  IR+++
Sbjct: 247 VTTLAGVARESGFADGAGAAARFYYPRELS-IDAYGNILVADEGNCAIRKVS 297



 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 68/137 (49%), Gaps = 12/137 (8%)

Query: 58  DLIVLDSSRSAFYTLSF------PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFA 110
           D + +D+ R   Y + F       ++ E VV  LAG  G  G   G  G ARFD P + A
Sbjct: 109 DGLAIDA-RGNLYAVDFTDHTVRKITPEGVVTTLAGSAGDHGTQVGHGGEARFDSPMAVA 167

Query: 111 VDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
           VD   N+YV    +  IRK++  G VT +AG G  K G ADG + +A FS    L  V  
Sbjct: 168 VDRWDNLYVGQMGDGAIRKVSPDGNVTILAGAG--KAGSADGDSASARFSGSDGLA-VDG 224

Query: 170 ICALLISDHGNQLIRQI 186
              + ++D  N  IR+I
Sbjct: 225 TGNVYVADLFNHTIRKI 241



 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 45/119 (37%), Positives = 61/119 (51%), Gaps = 10/119 (8%)

Query: 78  EESVVKRLAG---DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           E  V  R AG   D   G +DG   SARF  P+S AVD  G +YVAD ++ VIRKIT  G
Sbjct: 22  ESYVFTRYAGNPNDTPYGSTDGFGDSARFAAPRSIAVDASGTLYVADAASSVIRKITAEG 81

Query: 135 VTTIAGGGSKKEGRADGPAQNASFS--NDFELTFVPHICALLISDHGNQLIRQINLKPE 191
           + T   G + + G ADG    A F   +   +    ++ A+  +DH    +R+I   PE
Sbjct: 82  MVTTFVGTAGQRGSADGIGAAARFQGIDGLAIDARGNLYAVDFTDH---TVRKIT--PE 135



 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/115 (35%), Positives = 57/115 (49%), Gaps = 10/115 (8%)

Query: 77  SEESVVKRLAGDGV------QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           S  S V+++  DGV      Q Y DGEP   R   P   A    G++Y+AD  N  IR+I
Sbjct: 563 SWSSAVRKITPDGVVTTVVRQPY-DGEP--ERLYLPFGIAAGHDGSLYIADTGNSTIRQI 619

Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
              G     GGG ++EG+ DG    A F N + +  V     L ++D GN L+R+
Sbjct: 620 RPDGSMVTIGGGMRQEGKQDGRGGEARFLNPYGVA-VDAAGHLYVADSGNNLVRK 673



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 55/117 (47%), Gaps = 12/117 (10%)

Query: 81  VVKRLAGDGVQGYSDGEPG-----------SARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           V+ ++A DG   +  G PG            ARF+ P   AV   G IYVAD  NH IR+
Sbjct: 456 VIAKIAPDGAVTFFAGSPGLSGSTDGNARTEARFNGPTGIAVGPSGTIYVADFDNHTIRQ 515

Query: 130 ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           I+  G+ +   G + + G ADG    A F     +T V     + ++D  +  +R+I
Sbjct: 516 ISPAGMVSTLAGAAGQPGTADGTGSAARFYAPAAVT-VDRAGMIYVADSWSSAVRKI 571



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 66/135 (48%), Gaps = 8/135 (5%)

Query: 60  IVLDSSRSAFYTLSF-----PLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDM 113
           I +D +R+ F   S+      ++   VV  +AG  GV G ++G   +ARF  P + A+D 
Sbjct: 384 IAIDQNRNVFVADSYNNVIRKITPGGVVTTVAGLGGVFGSAEGSGAAARFGVPAAVAIDA 443

Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQN-ASFSNDFELTFVPHICA 172
             N+YVA++  HVI KI   G  T   G     G  DG A+  A F+    +   P    
Sbjct: 444 AANLYVANRQTHVIAKIAPDGAVTFFAGSPGLSGSTDGNARTEARFNGPTGIAVGPS-GT 502

Query: 173 LLISDHGNQLIRQIN 187
           + ++D  N  IRQI+
Sbjct: 503 IYVADFDNHTIRQIS 517



 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 56/121 (46%), Gaps = 5/121 (4%)

Query: 38  IDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE 97
           +D  +  +     + R G  D+ V DS  S    ++   +  +   R  G    GY++G 
Sbjct: 318 VDAARFSLPRGVAVSRTG--DIYVADSGNSTVRRIAVGGAVTTFAGRPGG---PGYANGS 372

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNAS 157
             +A+F  P   A+D   N++VAD  N+VIRKIT  GV T   G     G A+G    A 
Sbjct: 373 SETAQFYFPTGIAIDQNRNVFVADSYNNVIRKITPGGVVTTVAGLGGVFGSAEGSGAAAR 432

Query: 158 F 158
           F
Sbjct: 433 F 433



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/142 (28%), Positives = 67/142 (47%), Gaps = 7/142 (4%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           P S+ +D  G+  L V D++ S    ++   +E  V   +   G +G +DG   +ARF  
Sbjct: 53  PRSIAVDASGT--LYVADAASSVIRKIT---AEGMVTTFVGTAGQRGSADGIGAAARFQG 107

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
               A+D +GN+Y  D ++H +RKIT  GV T   G +   G   G    A F +   + 
Sbjct: 108 IDGLAIDARGNLYAVDFTDHTVRKITPEGVVTTLAGSAGDHGTQVGHGGEARFDSPMAVA 167

Query: 166 FVPHICALLISDHGNQLIRQIN 187
            V     L +   G+  IR+++
Sbjct: 168 -VDRWDNLYVGQMGDGAIRKVS 188



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 53/118 (44%), Gaps = 21/118 (17%)

Query: 76  LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S   +V  LAG   Q G +DG   +ARF  P +  VD  G IYVAD  +  +RKIT  G
Sbjct: 516 ISPAGMVSTLAGAAGQPGTADGTGSAARFYAPAAVTVDRAGMIYVADSWSSAVRKITPDG 575

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP------HICALLISDHGNQLIRQI 186
           V T                    +  + E  ++P      H  +L I+D GN  IRQI
Sbjct: 576 VVTT--------------VVRQPYDGEPERLYLPFGIAAGHDGSLYIADTGNSTIRQI 619



 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 56/118 (47%), Gaps = 15/118 (12%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S   VV  +AG  G+ G  DG   +ARF  P+  AV   G+IYVAD  N  +R+I   G
Sbjct: 296 VSPSGVVSTVAGKTGLSGSDDGVD-AARFSLPRGVAVSRTGDIYVADSGNSTVRRIAVGG 354

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL------LISDHGNQLIRQI 186
             T   G     G A+G ++ A F       + P   A+       ++D  N +IR+I
Sbjct: 355 AVTTFAGRPGGPGYANGSSETAQF-------YFPTGIAIDQNRNVFVADSYNNVIRKI 405



 Score = 45.1 bits (105), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 43/85 (50%), Gaps = 2/85 (2%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           RF  P+  ++D  GNI VAD+ N  IRK++  GV +   G +   G  DG    A FS  
Sbjct: 268 RFYYPRELSIDAYGNILVADEGNCAIRKVSPSGVVSTVAGKTGLSGSDDG-VDAARFSLP 326

Query: 162 FELTFVPHICALLISDHGNQLIRQI 186
             +  V     + ++D GN  +R+I
Sbjct: 327 RGVA-VSRTGDIYVADSGNSTVRRI 350


>gi|158319004|ref|YP_001511512.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158114409|gb|ABW16606.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 892

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 52/146 (35%), Positives = 77/146 (52%), Gaps = 11/146 (7%)

Query: 45  INPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSA 101
           ++P+SV ID  G+  +  L S R    T +  +S+      LAG G  GYS  +G   +A
Sbjct: 576 LSPYSVAIDPQGTLFITSLSSDRIQKVTRTGEVSD------LAGTGADGYSGDNGPATAA 629

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSN 160
           + + P S   D  GNIY+ D  N+ IRKIT  G +TTIAG G+       GPA  A  ++
Sbjct: 630 KLNGPGSAVPDKNGNIYIPDAQNYRIRKITPDGIITTIAGTGTAGFSGDGGPATAAQINS 689

Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
             ++   P   ++ I+D+ N  IR+I
Sbjct: 690 AEKVAIGPD-GSIYIADYDNHRIRKI 714



 Score = 60.8 bits (146), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 40/120 (33%), Positives = 65/120 (54%), Gaps = 15/120 (12%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ + ++  +AG G+QGYS DG P +A + D P    +   G +Y+A+  ++ I+KIT  
Sbjct: 714 ITPDGIINTIAGTGLQGYSGDGGPATAAKLDGPNDVELGDDGTLYIANLGSNTIQKITKD 773

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQIN 187
           G VTT+AG G K      GPA  A  S       VP +       + I+D+GN  +R+++
Sbjct: 774 GIVTTVAGNGQKGFSGDGGPATAAQLS-------VPSVSLGNGGEIYIADYGNNRVRKVD 826



 Score = 54.3 bits (129), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 36/94 (38%), Positives = 52/94 (55%), Gaps = 5/94 (5%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ + ++  +AG G  G+S DG P +A + +  +  A+   G+IY+AD  NH IRKIT  
Sbjct: 658 ITPDGIITTIAGTGTAGFSGDGGPATAAQINSAEKVAIGPDGSIYIADYDNHRIRKITPD 717

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFS--NDFEL 164
           G + TIAG G +      GPA  A     ND EL
Sbjct: 718 GIINTIAGTGLQGYSGDGGPATAAKLDGPNDVEL 751



 Score = 48.5 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 40/113 (35%), Positives = 63/113 (55%), Gaps = 3/113 (2%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NL 133
           ++++ +V  +AG+G +G+S DG P +A      S ++   G IY+AD  N+ +RK+  N 
Sbjct: 770 ITKDGIVTTVAGNGQKGFSGDGGPATAAQLSVPSVSLGNGGEIYIADYGNNRVRKVDPNG 829

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            +TTIAG G++  G   G A  A F N+          AL I+D GN  +R+I
Sbjct: 830 TITTIAGTGAEGSGGDGGQATAAQF-NEPSSVAEDADGALYIADSGNNRLRRI 881


>gi|108761192|ref|YP_633033.1| lipoprotein [Myxococcus xanthus DK 1622]
 gi|108465072|gb|ABF90257.1| putative lipoprotein [Myxococcus xanthus DK 1622]
          Length = 868

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 35/89 (39%), Positives = 50/89 (56%), Gaps = 2/89 (2%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
            + P   A D  GN+YVAD  ++VIR+I   G   +  GG+   G  DGPA+ A+F+   
Sbjct: 629 LNGPMGIAADAAGNVYVADTDHYVIRRIDPEGKVEVFAGGTP--GLMDGPAKQAAFNQPT 686

Query: 163 ELTFVPHICALLISDHGNQLIRQINLKPE 191
            L   P   ALL++D  N +IR+I+L  E
Sbjct: 687 GLAVTPDGTALLVADMNNGVIRRIDLVAE 715



 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 49/166 (29%), Positives = 76/166 (45%), Gaps = 17/166 (10%)

Query: 27  LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLA 86
           L+ +G+ VTT + G  L   P  V      + L V++S  S    +      + +   +A
Sbjct: 712 LVAEGHPVTT-LQGDWL-YRPSGVAVSADGNTLFVVESGMSRVVRI-----RDGLTSVVA 764

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI------TNLGVTTIAG 140
           G    G+ DG P S++F      AV   G++ V+D  N+ +R++          VTT+AG
Sbjct: 765 GT-TPGFRDGAPESSQFLPYLGIAVLKDGSLAVSDPGNYRVRRVVLNADGNARKVTTLAG 823

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G  + G +DGP   A       LT  P    L ++D GN L+R I
Sbjct: 824 SG--RYGHSDGPGDAADLVLPAGLTVGPD-GRLYVADAGNSLVRAI 866



 Score = 40.0 bits (92), Expect = 1.1,   Method: Composition-based stats.
 Identities = 21/50 (42%), Positives = 26/50 (52%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAG G  G+SDG   +A    P    V   G +YVAD  N ++R IT
Sbjct: 818 VTTLAGSGRYGHSDGPGDAADLVLPAGLTVGPDGRLYVADAGNSLVRAIT 867


>gi|290995422|ref|XP_002680294.1| predicted protein [Naegleria gruberi]
 gi|284093914|gb|EFC47550.1| predicted protein [Naegleria gruberi]
          Length = 1170

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 55/168 (32%), Positives = 85/168 (50%), Gaps = 30/168 (17%)

Query: 48  HSVIDRPGSSDLIVLDS-----SRSAFYT-LSFPLSEESV---------VKRLAGD---- 88
           +S++DR   + +I LD+     + ++ YT +S   SE S          +K+L       
Sbjct: 281 NSLVDRVAGTGIIELDTDGKLGNLTSLYTPISVTQSESSSDIYIGTTIGIKKLTDSNKLI 340

Query: 89  ----GVQGYS--DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAG 140
               G Q YS  DG P +A R   P+  A+   G ++++DK +H IRKI + GV +T+AG
Sbjct: 341 SSFAGSQAYSSGDGYPATAARLSSPEGIAISTSGEVFISDKGSHTIRKIDSKGVISTVAG 400

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            GS   G  DGPA  A  +    L F P+   LLI++  N  IR+++ 
Sbjct: 401 TGS--AGYVDGPALKAKLNGPGFLAFTPN-GDLLITESSNNRIRKLSF 445


>gi|312200909|ref|YP_004020970.1| serine/threonine protein kinase [Frankia sp. EuI1c]
 gi|311232245|gb|ADP85100.1| serine/threonine protein kinase [Frankia sp. EuI1c]
          Length = 847

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 44/114 (38%), Positives = 64/114 (56%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S++  V  +AG+G  G+S DG P  SA  + P +  VD  GNIYV D +N+ IRKIT  
Sbjct: 554 ISKDGTVTPIAGNGTAGFSGDGGPATSAELNGPGTAVVDKNGNIYVPDTANNRIRKITPD 613

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G +TT+ G G+       GPA  A   N  E   V    +L ++D+ N+ IR++
Sbjct: 614 GKITTVVGNGTAGFSGDGGPATQAEI-NSVEGIAVGPDGSLYLADYSNERIRKV 666



 Score = 47.8 bits (112), Expect = 0.005,   Method: Composition-based stats.
 Identities = 31/88 (35%), Positives = 50/88 (56%), Gaps = 3/88 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  + G+G  G+S DG P + A  +  +  AV   G++Y+AD SN  IRK+T  
Sbjct: 610 ITPDGKITTVVGNGTAGFSGDGGPATQAEINSVEGIAVGPDGSLYLADYSNERIRKVTPD 669

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSN 160
           G ++TIAG G+K       PA +A  S+
Sbjct: 670 GIISTIAGTGTKGYTSTPTPALSAQISD 697



 Score = 45.1 bits (105), Expect = 0.034,   Method: Composition-based stats.
 Identities = 33/115 (28%), Positives = 61/115 (53%), Gaps = 5/115 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           + ++ ++   AG+G  G +   G+  +A    P   A+   G +Y+++  +  +RK+T  
Sbjct: 722 IGKDGILSPFAGNGKTGRTGDGGDAANATLSIP-DLALGPDGTVYISNYGSDTVRKVTPD 780

Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           GV TTIAG G++      GPA  A   +   +  V    A+ I+D+GN+ IR+++
Sbjct: 781 GVITTIAGTGAEGNTGDGGPATAAQLKSPSSVV-VDASGAVYIADNGNKEIRRVD 834



 Score = 40.4 bits (93), Expect = 0.75,   Method: Composition-based stats.
 Identities = 21/65 (32%), Positives = 36/65 (55%), Gaps = 2/65 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ + V+  +AG G +G + DG P +A +   P S  VD  G +Y+AD  N  IR++   
Sbjct: 777 VTPDGVITTIAGTGAEGNTGDGGPATAAQLKSPSSVVVDASGAVYIADNGNKEIRRVDPN 836

Query: 134 GVTTI 138
           G  ++
Sbjct: 837 GTISL 841



 Score = 38.9 bits (89), Expect = 2.2,   Method: Composition-based stats.
 Identities = 31/114 (27%), Positives = 56/114 (49%), Gaps = 5/114 (4%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ + ++  +AG G +GY S   P  SA+   P S  +   G IY+ +  +  ++KI   
Sbjct: 666 VTPDGIISTIAGTGTKGYTSTPTPALSAQISDPNSVVIADDGTIYIGNLGSDSVQKIGKD 725

Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G+ +  AG G        G A NA+ S   +L   P    + IS++G+  +R++
Sbjct: 726 GILSPFAGNGKTGRTGDGGDAANATLSIP-DLALGPD-GTVYISNYGSDTVRKV 777


>gi|124008176|ref|ZP_01692873.1| hypothetical protein M23134_05970 [Microscilla marina ATCC 23134]
 gi|123986275|gb|EAY26097.1| hypothetical protein M23134_05970 [Microscilla marina ATCC 23134]
          Length = 2548

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 46/119 (38%), Positives = 59/119 (49%), Gaps = 20/119 (16%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI- 138
           + V  +AGDG  G + G   SARF +P   AVD  GN+YVADK+NH I+K+ +L   T+ 
Sbjct: 715 ATVTTVAGDGTSGDAIGASTSARFREPSDVAVDFSGNLYVADKNNHKIKKV-DLNTNTVS 773

Query: 139 -----AGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
                A G     G  DG A  A F       F P   A      L ++D  N LIRQ+
Sbjct: 774 ILSGPAVGTVFPAGATDGTASIARF-------FFPTSIALDRSGNLFVADRHNNLIRQV 825



 Score = 64.3 bits (155), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 45/109 (41%), Positives = 57/109 (52%), Gaps = 17/109 (15%)

Query: 86  AGDGVQG---YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           AGD  Q    Y DG   SARF+ P    VDM G++YVAD  N VIRKI+   VTTIAG  
Sbjct: 836 AGDISQTNALYVDGAAASARFNHPTGITVDMVGDVYVADTRNQVIRKISEGQVTTIAGIA 895

Query: 143 SKKEGRADGPAQNASFS------NDFELTFVPHICALLISDHGNQLIRQ 185
           + + G  +G AQ A F+       D E         + + D  NQL+R+
Sbjct: 896 NDR-GLTNGTAQAAKFNYPGSVYADLEQN-------IYVGDKVNQLVRR 936



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 50/136 (36%), Positives = 72/136 (52%), Gaps = 15/136 (11%)

Query: 79  ESVVKRLAGD-GVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---- 132
           +S V  +AG  G  G  +   G+ ARF+ P   A+D  G +YVADK NHVIR I N    
Sbjct: 655 DSTVSIVAGSVGASGLVNDPVGTNARFNHPSGLAIDDAGILYVADKDNHVIRAIANPDGA 714

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFS--NDFELTFVPHICALLISDHGNQLIRQINLKP 190
             VTT+AG G+   G A G + +A F   +D  + F  +   L ++D  N  I++++L  
Sbjct: 715 ATVTTVAGDGT--SGDAIGASTSARFREPSDVAVDFSGN---LYVADKNNHKIKKVDLNT 769

Query: 191 EDCSKSSQSGSALGAV 206
              + S  SG A+G V
Sbjct: 770 N--TVSILSGPAVGTV 783



 Score = 54.7 bits (130), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 38/99 (38%), Positives = 60/99 (60%), Gaps = 4/99 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT-TIAGGGSKKEG-R 148
           G +DG    ARF  P S A+D  GN++VAD+ N++IR++ T+ G T T AG  S+     
Sbjct: 787 GATDGTASIARFFFPTSIALDRSGNLFVADRHNNLIRQVATSSGATSTYAGDISQTNALY 846

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            DG A +A F++   +T V  +  + ++D  NQ+IR+I+
Sbjct: 847 VDGAAASARFNHPTGIT-VDMVGDVYVADTRNQVIRKIS 884


>gi|223937279|ref|ZP_03629185.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223894064|gb|EEF60519.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 429

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 58/112 (51%), Gaps = 2/112 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV  LAG  G     DG   +A F +P+S AVD  GN+YVAD  NH IRKIT  G
Sbjct: 133 ITPAGVVSTLAGSAGNINSFDGTGINANFYQPRSLAVDNGGNVYVADTWNHTIRKITPAG 192

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + +   G +   G ADG    A F+    +  V +   L ++D  N  IR+I
Sbjct: 193 LVSTLAGLAGNPGSADGTNSKARFNRPSGIA-VDNATNLFVTDFHNHTIRKI 243



 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 45/118 (38%), Positives = 58/118 (49%), Gaps = 14/118 (11%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   +V  LAG  G  G +DG    ARF++P   AVD   N++V D  NH IRKIT  G
Sbjct: 188 ITPAGLVSTLAGLAGNPGSADGTNSKARFNRPSGIAVDNATNLFVTDFHNHTIRKITPGG 247

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH-ICA-----LLISDHGNQLIRQI 186
             T   G     G ADG       +N+    F P  I A     L ++D GNQ IR+I
Sbjct: 248 TVTTIAGLPGVWGNADG-------TNNVARFFQPQGIVADNAGNLFVADSGNQTIRKI 298



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/116 (36%), Positives = 52/116 (44%), Gaps = 13/116 (11%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT 136
           ++   +   AG    G  DG   SARF  P S A D  GNIYVAD  N  IRKIT  G  
Sbjct: 25  AQSLTINTFAGHDAPGSKDGLSSSARFRHPNSIAADSAGNIYVADTENSTIRKITPNGSV 84

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
           +   G +   G ADG   NA F       + P   A      + ++D  N  IR+I
Sbjct: 85  STFAGFAGTFGSADGVGTNALF-------YAPQGIAVDSAGFIYVADTANATIRKI 133



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 60/126 (47%), Gaps = 19/126 (15%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---- 130
           ++    V  +AG  GV G +DG    ARF +P+    D  GN++VAD  N  IRKI    
Sbjct: 243 ITPGGTVTTIAGLPGVWGNADGTNNVARFFQPQGIVADNAGNLFVADSGNQTIRKISPSG 302

Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIR 184
           TN  V+T+AG          G A NA+ +N+    + P   A      + ++D GN  IR
Sbjct: 303 TNWIVSTVAG--------LSGIAGNANGTNNTARFYFPADVAQDIAGYIYVADLGNNAIR 354

Query: 185 QINLKP 190
              + P
Sbjct: 355 TERIVP 360



 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 57/194 (29%), Positives = 79/194 (40%), Gaps = 32/194 (16%)

Query: 15  LSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD------------LIVL 62
           LS     V  GLL         V     L IN  +  D PGS D             I  
Sbjct: 5   LSFKNLFVGGGLLAS-----MAVAHAQSLTINTFAGHDAPGSKDGLSSSARFRHPNSIAA 59

Query: 63  DSSRSAFYTLSFPLSEESVVKRLAGDG----------VQGYSDGEPGSARFDKPKSFAVD 112
           DS+ + +   +    E S ++++  +G            G +DG   +A F  P+  AVD
Sbjct: 60  DSAGNIYVADT----ENSTIRKITPNGSVSTFAGFAGTFGSADGVGTNALFYAPQGIAVD 115

Query: 113 MKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
             G IYVAD +N  IRKIT  GV +   G +      DG   NA+F     L  V +   
Sbjct: 116 SAGFIYVADTANATIRKITPAGVVSTLAGSAGNINSFDGTGINANFYQPRSLA-VDNGGN 174

Query: 173 LLISDHGNQLIRQI 186
           + ++D  N  IR+I
Sbjct: 175 VYVADTWNHTIRKI 188


>gi|196230675|ref|ZP_03129536.1| Peptidase S53 propeptide [Chthoniobacter flavus Ellin428]
 gi|196225016|gb|EDY19525.1| Peptidase S53 propeptide [Chthoniobacter flavus Ellin428]
          Length = 1120

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 74/156 (47%), Gaps = 20/156 (12%)

Query: 34  VTTVIDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV- 90
           VT  + G Q    +N  ++   P  S ++++DS          PL    V+  LAG  + 
Sbjct: 631 VTMAMSGDQFRCVMNAGALTSAP--SSVLIVDS----------PL----VISTLAGQVLT 674

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
           QG +DG   +A+F  P   AVD  GNIYVAD +N  IRKIT  G  T   G     G  +
Sbjct: 675 QGTADGTGTAAQFGYPSGVAVDSSGNIYVADFNNDTIRKITPGGTVTTPYGQPGVIGATN 734

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G   NA+F N      +     + ++D GN  IR+I
Sbjct: 735 GTGTNATF-NTPNGVAIDSANNIYVADSGNSAIRKI 769



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/87 (42%), Positives = 51/87 (58%), Gaps = 7/87 (8%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
           ++++ V DS  SA   ++       VV  LAG  G+ G +DG    A F+ P+  AVD  
Sbjct: 753 ANNIYVADSGNSAIRKIT----PGRVVSTLAGQPGLSGSADGTT-KALFNNPQGVAVDSA 807

Query: 115 GNIYVADKSNHVIRKITNLG-VTTIAG 140
           GN+YVAD +N  IRKIT  G V+T+AG
Sbjct: 808 GNVYVADTTNETIRKITPAGVVSTLAG 834



 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 47/95 (49%), Gaps = 10/95 (10%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF---- 158
           F+ P S AVD  GN+YVAD  N V+RKIT  GV T   G +   GR DG    A F    
Sbjct: 851 FNGPSSVAVDSAGNVYVADLYNFVVRKITPGGVVTTPYGQAGMPGRLDGIGTAALFNAPI 910

Query: 159 ------SNDFELTFVPHICALLISDHGNQLIRQIN 187
                 +N+  +T       L  +  GN L+R++N
Sbjct: 911 GVAVDANNNLYITDSQIPPDLTSTSTGNNLVRRVN 945



 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 49/98 (50%), Gaps = 2/98 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           GV G ++G   +A F+ P   A+D   NIYVAD  N  IRKIT   V +   G     G 
Sbjct: 728 GVIGATNGTGTNATFNTPNGVAIDSANNIYVADSGNSAIRKITPGRVVSTLAGQPGLSGS 787

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           ADG  +  +  N+ +   V     + ++D  N+ IR+I
Sbjct: 788 ADGTTK--ALFNNPQGVAVDSAGNVYVADTTNETIRKI 823


>gi|302768705|ref|XP_002967772.1| hypothetical protein SELMODRAFT_409051 [Selaginella moellendorffii]
 gi|302826179|ref|XP_002994615.1| hypothetical protein SELMODRAFT_432523 [Selaginella moellendorffii]
 gi|300137312|gb|EFJ04320.1| hypothetical protein SELMODRAFT_432523 [Selaginella moellendorffii]
 gi|300164510|gb|EFJ31119.1| hypothetical protein SELMODRAFT_409051 [Selaginella moellendorffii]
          Length = 172

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 49/140 (35%), Positives = 68/140 (48%), Gaps = 13/140 (9%)

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           VTTIAGG S+K G ADGP   A FS++F L       +LLI+D GN+LI +I +  +D  
Sbjct: 41  VTTIAGGSSRKPGFADGPGDTARFSSEFSLAC--SFGSLLIADRGNRLIHEIQI--DDPK 96

Query: 195 KSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEALPNQSG 254
               S SA+     W L+ V+  L+ + +      +II     LD   +   + L N   
Sbjct: 97  SRDSSDSAVSGPQKWALIPVV--LLGICLEMPLGAFIIWKI--LDQRCRQAGDQLGNTER 152

Query: 255 -----ETSTDLLLRHQKRNC 269
                    DL +R QKR C
Sbjct: 153 LLAGVPEDDDLQVRQQKRTC 172


>gi|380692993|ref|ZP_09857852.1| cell surface protein [Bacteroides faecis MAJ27]
          Length = 457

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 61/114 (53%), Gaps = 2/114 (1%)

Query: 77  SEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           + E   +  AGD G  G++DG+   A+F+ P+   +D + N+Y+AD  NH IRK+T  GV
Sbjct: 345 TREGNFRIYAGDKGRAGHADGQETEAQFNSPRQLVLDEEENLYIADSGNHCIRKVTPEGV 404

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            +   G     G  DG  + A F+  + L  +     + I D  N+ +R+++++
Sbjct: 405 VSTVIGNPNSSGYKDGTPEIALFTEPWGLA-IDSEGTIYIGDKDNRCVRKLSIE 457


>gi|153868997|ref|ZP_01998705.1| protein kinase-like protein [Beggiatoa sp. PS]
 gi|152074439|gb|EDN71293.1| protein kinase-like protein [Beggiatoa sp. PS]
          Length = 1432

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 41/120 (34%), Positives = 59/120 (49%), Gaps = 4/120 (3%)

Query: 74  FPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           F L  E  +   AG G +G+   +G    A    P+  A+D +GN+Y+AD +NH IRKI 
Sbjct: 436 FKLDPEGNLTVFAGTGTKGFEGDNGPAIEANLSNPEGLAIDAQGNLYIADTNNHRIRKID 495

Query: 132 NLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
           + G +TT+ G G       +  A  A       + F  H     I+D GN  IR+IN +P
Sbjct: 496 SDGIITTVVGTGEAGYAGDNEFAIAAQLKKPTAIVF-DHNGHFYIADSGNNSIRKINYQP 554



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 66/135 (48%), Gaps = 9/135 (6%)

Query: 61  VLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDM 113
           + DS  ++   +++     PL+  S++  +AGDG  GYS  +G    AR   P S  VD 
Sbjct: 539 IADSGNNSIRKINYQPGTSPLNANSLITTIAGDGRSGYSGDNGPAIQARLGNPSSLVVDN 598

Query: 114 KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
           + N+Y+AD  NH IRKI   G +TT AG G K        A  A  +    L  V     
Sbjct: 599 ENNLYIADTDNHRIRKIDIRGNITTFAGSGYKGYSGDGNLAITARLNMPTGLA-VDGTGN 657

Query: 173 LLISDHGNQLIRQIN 187
           + I+D  N  IR+I+
Sbjct: 658 IFIADQNNHRIRKID 672



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 62/128 (48%), Gaps = 9/128 (7%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
           +   AG G +GYS DG    +AR + P   AVD  GNI++AD++NH IRKI   G+    
Sbjct: 621 ITTFAGSGYKGYSGDGNLAITARLNMPTGLAVDGTGNIFIADQNNHRIRKIDGEGIIRTF 680

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED------- 192
            G   +    DG   + +  N      +     L +++ GN  IR+I  K +D       
Sbjct: 681 TGTGVRGTATDGILASVAEINQPTDIALDQYGNLYLAEKGNHFIRKIGEKDQDGDEGAPH 740

Query: 193 CSKSSQSG 200
           CS +++ G
Sbjct: 741 CSTNNEMG 748



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 45/89 (50%), Gaps = 2/89 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           +A    PK    D  GN+Y+AD  NH I K+   G +T  AG G+K     +GPA  A+ 
Sbjct: 408 TAELKVPKDLIFDATGNLYIADILNHRIFKLDPEGNLTVFAGTGTKGFEGDNGPAIEANL 467

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
           SN   L  +     L I+D  N  IR+I+
Sbjct: 468 SNPEGLA-IDAQGNLYIADTNNHRIRKID 495


>gi|344924044|ref|ZP_08777505.1| NHL repeat containing protein [Candidatus Odyssella
           thessalonicensis L13]
          Length = 383

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 56/106 (52%), Gaps = 7/106 (6%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           LAG G  GY +G   +A+F  P S AVD  G +YV D +N+ IRKIT+ G T++  G   
Sbjct: 201 LAGSGATGYVEGTGSAAQFSTPTSVAVDSSGTVYVIDANNYRIRKITSGGTTSLFAG--S 258

Query: 145 KEGRADGPAQNASFSNDFEL----TFVPHICALLISDHGNQLIRQI 186
            +G A+G    A F N F L      V +   + ++D  N  IR I
Sbjct: 259 TQGYAEGTGSAARF-NFFNLIPSGITVDNAGTVYVADTFNYRIRTI 303



 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 3/97 (3%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
           +GY +G   +ARF+ P +  +D  G +YVAD  N+ IRKIT+ G T++   GS   G A+
Sbjct: 45  RGYQEGTGSAARFENPYAITIDSSGTLYVADNGNYRIRKITSSGTTSLL-AGSGTTGYAE 103

Query: 151 GPAQNASFSNDFELTFVPHICA-LLISDHGNQLIRQI 186
           G   +A F N F+          + +SD  N  +R+I
Sbjct: 104 GTGASAQF-NTFQWGIAADNSGNVYVSDTTNNRVRKI 139



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 60/110 (54%), Gaps = 10/110 (9%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           NP+++ ID  G+  L V D+     Y +   ++       LAG G  GY++G   SA+F+
Sbjct: 59  NPYAITIDSSGT--LYVADNGN---YRIR-KITSSGTTSLLAGSGTTGYAEGTGASAQFN 112

Query: 105 KPK-SFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK--KEGRADG 151
             +   A D  GN+YV+D +N+ +RKIT+ G T++  G +   +EG   G
Sbjct: 113 TFQWGIAADNSGNVYVSDTTNNRVRKITSGGTTSLLAGSTSGYQEGTGAG 162



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 48/102 (47%), Gaps = 3/102 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           LAG    GY +G    AR   P+  AV+  G +YVA  ++  IR IT+ G T++   GS 
Sbjct: 148 LAGS-TSGYQEGTGAGARLSSPRGLAVNSAGTVYVATATSERIRAITSGGTTSLL-AGSG 205

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             G  +G    A FS    +  V     + + D  N  IR+I
Sbjct: 206 ATGYVEGTGSAAQFSTPTSVA-VDSSGTVYVIDANNYRIRKI 246



 Score = 44.7 bits (104), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 5/56 (8%)

Query: 90  VQGYSDGEPGSARFD----KPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAG 140
            QGY++G   +ARF+     P    VD  G +YVAD  N+ IR IT  GVT T+AG
Sbjct: 259 TQGYAEGTGSAARFNFFNLIPSGITVDNAGTVYVADTFNYRIRTITPGGVTSTLAG 314


>gi|182414669|ref|YP_001819735.1| SMP-30/gluconolaconase/LRE domain-containing protein [Opitutus
           terrae PB90-1]
 gi|177841883|gb|ACB76135.1| SMP-30/Gluconolaconase/LRE domain protein [Opitutus terrae PB90-1]
          Length = 1292

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 52/106 (49%), Gaps = 4/106 (3%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQ 154
           DG    ARF+ PK  AVD  G +YVAD SNH+IRK+T  GV T   G     G +DG   
Sbjct: 209 DGAGSDARFNGPKGIAVDANGTVYVADTSNHIIRKVTPSGVVTTLAGSPGISGNSDGAGD 268

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
            A F+   ++  V     + + D    L R+I   PE    S  SG
Sbjct: 269 AARFNAPTDIA-VDDAGTIYVVDQSGSL-RKIT--PEGVVTSLASG 310



 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 55/111 (49%), Gaps = 2/111 (1%)

Query: 76  LSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S E VV  +AG     G  DG   +A F + K  AVD  GN++VA+ + HVIR+IT  G
Sbjct: 558 ISPEGVVTTVAGTANATGSIDGVGAAATFGEIKGLAVDRAGNVFVAENTTHVIRRITPDG 617

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
                GG +   G ADG    A F+  + L        L I D GN  IR+
Sbjct: 618 TVVTIGGLAGSIGTADGVGSAARFNEPWGLAL-DRFGHLYIVDSGNNTIRK 667



 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 41/78 (52%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           G+ G  DGE  +ARF+ P   AVD  GN+YVAD  NH IRKI   G  T   G S  EG 
Sbjct: 464 GLDGSLDGEKTAARFNAPHGLAVDQHGNLYVADTFNHSIRKIDAAGQVTTPYGVSGVEGT 523

Query: 149 ADGPAQNASFSNDFELTF 166
            DG    A F +   L F
Sbjct: 524 VDGIGNAARFGSPTALAF 541



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 47/101 (46%), Gaps = 13/101 (12%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY DGE   ARF +P   A D  GN+YV D   ++IRKIT     +   G     G  DG
Sbjct: 43  GYVDGEGTQARFHEPAGLACDADGNLYVVDPGTNLIRKITPAASVSTFAGTPTGWGLVDG 102

Query: 152 PAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
           PA +A F        +P   A      + I+D GN  IR I
Sbjct: 103 PAASARFG-------LPQGVAVGADGTVYIADTGNAAIRII 136



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 46/95 (48%), Gaps = 3/95 (3%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G  DG   SARF  P+  AV   G +Y+AD  N  IR IT  G   I  GG  + G  DG
Sbjct: 98  GLVDGPAASARFGLPQGVAVGADGTVYIADTGNAAIRIITPDGSVRILAGG--RSGSQDG 155

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               A+F N  E   V     + ++D GN  +R+I
Sbjct: 156 YGTGATF-NLPEAVAVNAAGVVYVADSGNNTVRRI 189



 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 51/98 (52%), Gaps = 3/98 (3%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           GV+G  DG   +ARF  P + A D  G+++VAD   H +R+I+  GV T   G +   G 
Sbjct: 519 GVEGTVDGIGNAARFGSPTALAFDRDGSLFVAD--GHRVRRISPEGVVTTVAGTANATGS 576

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            DG    A+F  + +   V     + ++++   +IR+I
Sbjct: 577 IDGVGAAATF-GEIKGLAVDRAGNVFVAENTTHVIRRI 613



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/76 (46%), Positives = 44/76 (57%), Gaps = 3/76 (3%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++   VV  LAG  G+ G SDG   +ARF+ P   AVD  G IYV D+S   +RKIT  G
Sbjct: 244 VTPSGVVTTLAGSPGISGNSDGAGDAARFNAPTDIAVDDAGTIYVVDQSGS-LRKITPEG 302

Query: 135 -VTTIAGGGSKKEGRA 149
            VT++A G S   G A
Sbjct: 303 VVTSLASGFSYPRGVA 318



 Score = 45.4 bits (106), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 47/106 (44%), Gaps = 2/106 (1%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQ 154
           DG+  +A F  P   AVD  GN++VAD     IRKI   G  +   G    +G  DG   
Sbjct: 416 DGQGSNAGFGNPTGIAVDAAGNLFVADF-KATIRKIDATGYVSTVAGAHGLDGSLDGEKT 474

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
            A F+    L    H   L ++D  N  IR+I+   +  +    SG
Sbjct: 475 AARFNAPHGLAVDQH-GNLYVADTFNHSIRKIDAAGQVTTPYGVSG 519



 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 46/91 (50%), Gaps = 15/91 (16%)

Query: 81  VVKRLAGDGVQ----------GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V++R+  DG            G +DG   +ARF++P   A+D  G++Y+ D  N+ IRK 
Sbjct: 609 VIRRITPDGTVVTIGGLAGSIGTADGVGSAARFNEPWGLALDRFGHLYIVDSGNNTIRKG 668

Query: 131 TNLGVTTIAGGGSKKE--GRADGPAQNASFS 159
             +G T  A     K+   +A G  + ASF+
Sbjct: 669 VIVGATAPA---ITKDPVAKAAGAGRQASFT 696



 Score = 40.4 bits (93), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 48/96 (50%), Gaps = 4/96 (4%)

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKK-EGRADG 151
           +DG   +A+F  P+   V   G ++VAD + +VIRK+T  L VTT AG   +      DG
Sbjct: 358 ADGVGLNAQFTSPRGLTVAADGTVFVADSNAYVIRKVTPGLLVTTWAGSLVRPIYQTVDG 417

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
              NA F N   +  V     L ++D     IR+I+
Sbjct: 418 QGSNAGFGNPTGIA-VDAAGNLFVADF-KATIRKID 451


>gi|365864816|ref|ZP_09404493.1| NHL repeat-containing protein [Streptomyces sp. W007]
 gi|364005745|gb|EHM26808.1| NHL repeat-containing protein [Streptomyces sp. W007]
          Length = 292

 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 51/136 (37%), Positives = 72/136 (52%), Gaps = 6/136 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  +AG G  GYS   G   SAR + P    VD   N+Y++D +NH +RK+T  
Sbjct: 20  VTADGTISTVAGTGTAGYSGDSGAATSARLNLPLGVVVDSADNLYISDYNNHRVRKVTPD 79

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
           G +TTIAG GS   G   GP+  A  SN F L  V  +  L I+DH N  IR+I    E 
Sbjct: 80  GEITTIAGTGSAGFGGDGGPSPVAQLSNPFGLA-VDCVDTLYIADHLNNRIRKI--ASER 136

Query: 193 CSKSSQSGSALGAVSV 208
            +   QSG+ +   +V
Sbjct: 137 MAGLPQSGTVVSWANV 152


>gi|405375498|ref|ZP_11029528.1| putative hemolysin [Chondromyces apiculatus DSM 436]
 gi|397086225|gb|EJJ17355.1| putative hemolysin [Myxococcus sp. (contaminant ex DSM 436)]
          Length = 855

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 42/113 (37%), Positives = 60/113 (53%), Gaps = 9/113 (7%)

Query: 80  SVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           + VKR+A  G V   + G  G      P   A D  GN+YV+D  ++VIR+I   G   +
Sbjct: 598 NAVKRVAAGGEVTTLASGLNG------PMGIATDAAGNVYVSDTDHYVIRRIDPEGKVEL 651

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
             G +   G  DGPA+ A+F+    LT  P   ALL++D  N +IR+I+L  E
Sbjct: 652 FAGSTP--GLQDGPAKQAAFNQPAGLTVTPDGSALLVADLNNGVIRRIDLVAE 702



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 52/166 (31%), Positives = 77/166 (46%), Gaps = 17/166 (10%)

Query: 27  LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLA 86
           L+ +G+ VTT + G  L   P  V        L V++S  S    +      + V   +A
Sbjct: 699 LVAEGHPVTT-LQGDWL-YRPSGVTVSEDGGTLYVVESGMSRVVRI-----RDGVTSVVA 751

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG------VTTIAG 140
           G    G+ DG P S++F      AV   G++ V+D  N+ +R+I   G      VTT+AG
Sbjct: 752 GT-TPGFRDGAPESSQFLPYLGIAVLKDGSLAVSDPGNYRVRRIHLDGNGQARKVTTLAG 810

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G+   G ADGP ++A       L   P    L ++D GN L+R I
Sbjct: 811 SGTY--GHADGPGEDAQLVLPAGLVVGPD-GRLYVADSGNALVRTI 853



 Score = 39.7 bits (91), Expect = 1.4,   Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 26/50 (52%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAG G  G++DG    A+   P    V   G +YVAD  N ++R IT
Sbjct: 805 VTTLAGSGTYGHADGPGEDAQLVLPAGLVVGPDGRLYVADSGNALVRTIT 854


>gi|291223251|ref|XP_002731624.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 434

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 54/180 (30%), Positives = 83/180 (46%), Gaps = 17/180 (9%)

Query: 33  TVTTVIDGHQLEINPHSVIDRPG-------SSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           T+   + G+Q  +   ++   P        +  L V DS+ S    ++    E   V   
Sbjct: 107 TLAGSLQGYQDGVGEKAMFHHPAGMSMHRKTRTLFVADSANSRIRAINVVTGE---VTTF 163

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVD--MKGNIYVADKSNHVIRKIT--NLGVTTIAGG 141
           AG G +   DG    A F  P++ AVD   K   +VAD  NH IR+++  +  VTTIAGG
Sbjct: 164 AGSGKEELKDGLKTIASFFNPQAVAVDHVYKDRFFVADTDNHAIREVSLPDGEVTTIAGG 223

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
              ++G  DG    A+F +   +T  P    L I+DH N  IR I ++ +  +  + SG 
Sbjct: 224 ---EKGFKDGKGTGATFYHPAGVTIDPIRNILFIADHYNHAIRMIGVESKIVTTLAGSGK 280



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 57/112 (50%), Gaps = 5/112 (4%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG-V 135
           E  +V  LAG G  G+ +G    A F+ P+  A D +  + YV +  N+ +R + + G V
Sbjct: 268 ESKIVTTLAGSGKPGFVNGMGNQAMFNYPEGMAYDTENKVLYVVEFDNNCVRIVDDEGEV 327

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            +  GG   +EG++DG  + A F +   LTF      + I+D  N  IR I+
Sbjct: 328 RSFVGG---REGKSDGLGEEAKFFHPTGLTFDEKEKIIYITDQYNHQIRGIS 376



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 37/127 (29%), Positives = 52/127 (40%), Gaps = 17/127 (13%)

Query: 60  IVLDSSRSAFYTLSFP------LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
           +  D+     Y + F       + +E  V+   G G +G SDG    A+F  P     D 
Sbjct: 299 MAYDTENKVLYVVEFDNNCVRIVDDEGEVRSFVG-GREGKSDGLGEEAKFFHPTGLTFDE 357

Query: 114 KGNI-YVADKSNHVIRKITNLGVTTIAGGG--------SKKEGRADGPAQNASFSNDFEL 164
           K  I Y+ D+ NH IR I+ +G  T   G         S KEG       + +F   F +
Sbjct: 358 KEKIIYITDQYNHQIRGISGIGAKTSPPGKITIEKIMTSVKEGYTGTAMMSLTFILFFAM 417

Query: 165 TFVPHIC 171
            FV  IC
Sbjct: 418 VFVC-IC 423


>gi|298386946|ref|ZP_06996500.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
 gi|298260096|gb|EFI02966.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
          Length = 457

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 60/114 (52%), Gaps = 2/114 (1%)

Query: 77  SEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           + E   +  AGD G  G++DG+   A+F  P+   +D + N+Y+AD  NH IRK+T  GV
Sbjct: 345 TREGNFRIYAGDKGRAGHADGQETEAQFSSPRQLVLDEEENLYIADSGNHCIRKVTPEGV 404

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            +   G     G  DG  + A F+  + L  +     + I D  N+ +R+++++
Sbjct: 405 VSTVIGNPNTSGYKDGTPEIALFTEPWGLA-IDSEGTIYIGDKDNRCVRKLSIE 457



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ E VV  + G+    GY DG P  A F +P   A+D +G IY+ DK N  +RK++
Sbjct: 399 VTPEGVVSTVIGNPNTSGYKDGTPEIALFTEPWGLAIDSEGTIYIGDKDNRCVRKLS 455


>gi|320106842|ref|YP_004182432.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
 gi|319925363|gb|ADV82438.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
          Length = 382

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 63/116 (54%), Gaps = 5/116 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLG 134
           L+  + V  LAG+G  GY+DG+  SARF+KP     +   G  Y+AD  N+ IR++ + G
Sbjct: 261 LNGANQVTTLAGNGTAGYADGQGNSARFNKPTQVVYNSGDGAYYIADTFNNCIRRMDSAG 320

Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            VTT AG G  + G  DG + +A F     +        L ++D GN  IR++++ 
Sbjct: 321 NVTTYAGIGG-QSGLVDGASTSAKFDKPTGIAIANGY--LYVADSGNNAIRRVDMN 373



 Score = 37.7 bits (86), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 43/165 (26%), Positives = 63/165 (38%), Gaps = 44/165 (26%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-----A 110
           + +L + D+  +     +  L   SV   LAG G  GY+DG P SA+F+ P        A
Sbjct: 54  NGNLYIADTGNNVIKMFNPSLGTLSV---LAGSGTAGYNDGSPASAQFNTPTGLSAVYTA 110

Query: 111 VDMKGN------------IYVADKSNHVIRKIT-----------------NLGVTTIAGG 141
            D++              I + D  N+ IR++                  N  V T+ GG
Sbjct: 111 QDIQSGPGFPPTRLRLIEIRINDTGNNAIRRMCIQAPGDQVTCQSVNGLPNGQVVTMGGG 170

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
                G ADG    + F++            LLI D GN  IR +
Sbjct: 171 ---SLGDADGNGSASRFNSPTGFA----TDGLLIGDTGNNSIRTM 208


>gi|302821389|ref|XP_002992357.1| hypothetical protein SELMODRAFT_430570 [Selaginella moellendorffii]
 gi|300139773|gb|EFJ06507.1| hypothetical protein SELMODRAFT_430570 [Selaginella moellendorffii]
          Length = 220

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/99 (40%), Positives = 56/99 (56%), Gaps = 8/99 (8%)

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDC 193
           VTTI GG S+K G ADGP   A FS+  E +      +LLI+D GN+LIR+I +  P+ C
Sbjct: 23  VTTIVGGSSRKPGFADGPGDTARFSS--ESSLACSCGSLLIADRGNRLIREIQIDDPKSC 80

Query: 194 SKSSQSGSALGAVSVWVLVSVLSCLVSLVIGFVARPYII 232
             S    SA+     W L+ V+  L+ + +G    P+II
Sbjct: 81  DSSD---SAVSGSQKWALIPVV--LLGVCLGMPLGPFII 114


>gi|297568314|ref|YP_003689658.1| hypothetical protein [Desulfurivibrio alkaliphilus AHT2]
 gi|296924229|gb|ADH85039.1| YD repeat protein [Desulfurivibrio alkaliphilus AHT2]
          Length = 2439

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 57/103 (55%), Gaps = 3/103 (2%)

Query: 79   ESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
            + ++  +AG G +G+S DG P   AR   P + A+   G+IY+AD  NH IR++   G +
Sbjct: 1244 DGIITTIAGTGYRGFSGDGGPADEARLFGPIAVAIGPDGSIYIADTYNHRIRRVGTDGII 1303

Query: 136  TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDH 178
            TT+AG GS       GPA  ASFS    +TF P     + ++H
Sbjct: 1304 TTVAGTGSLGYSGDGGPATEASFSTPSGITFGPDGSLYIAANH 1346



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 60/114 (52%), Gaps = 9/114 (7%)

Query: 81   VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
            +++ +AG G  G+S G   +AR  +P   ++D  G IY+AD  NH IR +   G +TT A
Sbjct: 1068 IIETIAGTGSWGHSGGLAVNARLYRPSCVSIDPDGTIYIADTGNHRIRSVGTDGIITTHA 1127

Query: 140  GGGSKKEGRA-------DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G G +  G         DGPA NA  ++  ++   P   ++ I+D  N  IR++
Sbjct: 1128 GSGFRAGGLGDSGYSGDDGPAVNARLNSPTDIALGPD-GSIYIADSINHRIRRV 1180



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 46/87 (52%), Gaps = 2/87 (2%)

Query: 101  ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
            AR D+P + A+   G+IY+AD  NH IR++   G +TTIAG G +      GPA  A   
Sbjct: 1212 ARLDRPHAVAIGPDGSIYIADTYNHRIRRVGTDGIITTIAGTGYRGFSGDGGPADEARLF 1271

Query: 160  NDFELTFVPHICALLISDHGNQLIRQI 186
                +   P   ++ I+D  N  IR++
Sbjct: 1272 GPIAVAIGPD-GSIYIADTYNHRIRRV 1297



 Score = 45.1 bits (105), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 27/62 (43%), Positives = 36/62 (58%), Gaps = 3/62 (4%)

Query: 84   RLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
            R  G G  GYS  DG   +AR + P   A+   G+IY+AD  NH IR++   G +TT+AG
Sbjct: 1132 RAGGLGDSGYSGDDGPAVNARLNSPTDIALGPDGSIYIADSINHRIRRVGTDGIITTVAG 1191

Query: 141  GG 142
             G
Sbjct: 1192 TG 1193



 Score = 44.7 bits (104), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 7/111 (6%)

Query: 79   ESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV- 135
            + ++  +AG G  GYS DG P + A F  P        G++Y+A  +NH IR++   G+ 
Sbjct: 1300 DGIITTVAGTGSLGYSGDGGPATEASFSTPSGITFGPDGSLYIA--ANHRIRRVGTDGII 1357

Query: 136  TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            TTIAG G +      GPA  A       ++  P   ++ ++D  N  IR I
Sbjct: 1358 TTIAGTGYRGFSGDGGPADEARLG-PRGVSLGPD-GSIYVADSNNHRIRHI 1406


>gi|291452573|ref|ZP_06591963.1| NHL repeat containing protein [Streptomyces albus J1074]
 gi|291355522|gb|EFE82424.1| NHL repeat containing protein [Streptomyces albus J1074]
          Length = 613

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 65/116 (56%), Gaps = 7/116 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S + +V D++R     L+     ESVV+R+ G+G +G +DG P  ARF +P+  A+  +G
Sbjct: 185 SGNFLVSDTTRHQLVELAE--DGESVVRRIGGEGERGLADGGPAEARFSEPQGLALTPEG 242

Query: 116 N-IYVADKSNHVIRKITNLG--VTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
           + + VAD  NH +R++      VTT AG G +  +     GPA+    S+ +++ +
Sbjct: 243 DAVIVADTVNHALRRVELASGVVTTPAGTGRQWWQGSPTSGPAREVDLSSPWDVAW 298



 Score = 45.4 bits (106), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 47/103 (45%), Gaps = 2/103 (1%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK 144
           AG   +G  DG    A F +P   A D    ++VAD     +R +   G V T+ G G  
Sbjct: 327 AGTTNEGLVDGPGAEAWFAQPSGLAADGDARLWVADSETSALRWVDPEGTVHTVVGTGLF 386

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             G  DG A  A F +   +T +P   ++ +SD  N  +R+ +
Sbjct: 387 DFGHRDGEAAQALFQHPLAVTVLPD-GSVAVSDTYNHALRRYD 428



 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 49/105 (46%), Gaps = 7/105 (6%)

Query: 87  GDGVQGYSDGEP--GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG-GGS 143
           GDG   Y   EP  G+ RF  P    +   GN  V+D + H + ++   G + +   GG 
Sbjct: 160 GDGP--YVAPEPVAGALRF--PGRAVLLPSGNFLVSDTTRHQLVELAEDGESVVRRIGGE 215

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            + G ADG    A FS    L   P   A++++D  N  +R++ L
Sbjct: 216 GERGLADGGPAEARFSEPQGLALTPEGDAVIVADTVNHALRRVEL 260


>gi|260821256|ref|XP_002605949.1| hypothetical protein BRAFLDRAFT_132236 [Branchiostoma floridae]
 gi|229291286|gb|EEN61959.1| hypothetical protein BRAFLDRAFT_132236 [Branchiostoma floridae]
          Length = 446

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 59/100 (59%), Gaps = 4/100 (4%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRA 149
           + DG+   ARF  P+   +D K N ++VAD  NHVIR+++  +  V T+AG   K++G  
Sbjct: 177 WKDGKGLEARFYHPQGLYLDAKQNRMFVADTDNHVIREMSMPDAVVKTVAGT-PKEKGLV 235

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           +G  + A F +  ++ + P+   L +SDH N  IR INL+
Sbjct: 236 NGQGRAAKFYHPTQMAYDPYTDILYVSDHFNHAIRTINLQ 275



 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 69/149 (46%), Gaps = 16/149 (10%)

Query: 42  QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPG 99
           Q+  +P++ I       L V D    A  T++    E  +   LAG   G  G+ DG+  
Sbjct: 249 QMAYDPYTDI-------LYVSDHFNHAIRTINLQGYEVDI---LAGSPKGEPGFKDGKGT 298

Query: 100 SARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNAS 157
           +AR + P+  A D    + YV +  N+ IR +T  G V T+AGG   + G  DG    A 
Sbjct: 299 AARLNYPEGIAFDTTHRVLYVVEFGNNCIRMVTPAGIVKTLAGG--PEPGYKDGSGSAAR 356

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQI 186
           F +   LT  P    + ++D  N L+R +
Sbjct: 357 FFHPTGLTLDPTNKVIYVTDQYNHLVRSV 385



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 40/82 (48%), Gaps = 7/82 (8%)

Query: 60  IVLDSSRSAFYTLSFP------LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
           I  D++    Y + F       ++   +VK LAG    GY DG   +ARF  P    +D 
Sbjct: 308 IAFDTTHRVLYVVEFGNNCIRMVTPAGIVKTLAGGPEPGYKDGSGSAARFFHPTGLTLDP 367

Query: 114 KGN-IYVADKSNHVIRKITNLG 134
               IYV D+ NH++R +T +G
Sbjct: 368 TNKVIYVTDQYNHLVRSVTAVG 389



 Score = 37.0 bits (84), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 47/102 (46%), Gaps = 13/102 (12%)

Query: 101 ARFDKPKSFAVDMKGN-IYVAD-------KSNHVIRKIT--NLGVTTIAGGGSKKEGRAD 150
           AR + P     D   N IYVAD        +N  IR+I+  +  +TT+AG     +G  D
Sbjct: 64  ARLNYPWGLVFDKSDNSIYVADCGCPNSPHTNERIRRISLSDSWITTLAG---SSQGNQD 120

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
           G  ++A F +   +        L + D  N +IRQ++ +  D
Sbjct: 121 GLGKDAHFHHTAGMVVDEDNAMLYVCDSSNNVIRQVSTRTGD 162


>gi|383454552|ref|YP_005368541.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
 gi|380728713|gb|AFE04715.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
          Length = 861

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 40/110 (36%), Positives = 61/110 (55%), Gaps = 9/110 (8%)

Query: 80  SVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           + VKR++ DG ++    G  G      P   A D  GN+YV+D  N VIR+I+  G   +
Sbjct: 605 NTVKRVSPDGKIRTVLTGLYG------PMGIAADALGNVYVSDTENAVIRRISPEGKAEV 658

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
             G +   G  DGPA +A+F+    L+F P   ALL++D  N +IR+I++
Sbjct: 659 FAGTT--WGYQDGPALSAAFNQPAGLSFTPDGTALLVADLNNSVIRRIDM 706


>gi|374311754|ref|YP_005058184.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
 gi|358753764|gb|AEU37154.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
          Length = 1456

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/152 (27%), Positives = 76/152 (50%), Gaps = 10/152 (6%)

Query: 40  GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GE 97
           G + EI    ++DR G  D+ ++D   +    ++   +   ++  +AG+G +G+S   G 
Sbjct: 82  GAKFEIPFGILVDRAG--DIFIVDPGNNNVRKIA---ASTGIITTVAGNGTRGFSGDGGA 136

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQN 155
             SA  ++P++ A+D  GN+Y+ D  N  +RK+  T   +TT+AG GS       GPA N
Sbjct: 137 ATSAELNQPQAVALDAAGNMYITDTINSRVRKVEATTGIITTVAGNGSVTSSGDGGPATN 196

Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           A       +        + ++D G   +R+I+
Sbjct: 197 AGLVQPAGIAL-DSANNIFVTDSGEPGVREIS 227



 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 61/111 (54%), Gaps = 5/111 (4%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
           ++  +AG+G  GY+ DG P  SA   +P+  A+D  G++Y AD  +HV+R+I      +T
Sbjct: 503 IITTIAGNGTSGYTGDGGPATSAGMYEPQGVALDNAGDVYFADFYHHVVRRIDAASGIIT 562

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           T+AG G++      G A +A   N  E   +     L I+D GN  +R+++
Sbjct: 563 TVAGNGTEGFSGDGGSATSAEL-NYPEGISLDSAANLYIADSGNYRVRKVD 612



 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 54/98 (55%), Gaps = 6/98 (6%)

Query: 77  SEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---T 131
           +   ++  +AG+G +G+S   G   SA  + P+  ++D   N+Y+AD  N+ +RK+   T
Sbjct: 556 AASGIITTVAGNGTEGFSGDGGSATSAELNYPEGISLDSAANLYIADSGNYRVRKVDAAT 615

Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
            + +TT+AG G+K +    G A NA F    ++ F  H
Sbjct: 616 GI-ITTVAGDGAKTDSGDGGLAVNAGFRAISDVKFDSH 652



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 49/154 (31%), Positives = 81/154 (52%), Gaps = 17/154 (11%)

Query: 41  HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEP 98
           HQL     S I   GS ++ ++D+ R A + L+      SVV   AG+G +G+S   G  
Sbjct: 249 HQL-----SGIAIDGSGNVFIVDTDRYAVFKLTTGTGTVSVV---AGNGTRGFSGDGGAA 300

Query: 99  GSARFDKPKSFAVDMKGNIYVAD---KSNHVIRKITNLG--VTTIAGGGSKKEGRADGPA 153
            SA    P++ A+D  GN+Y++D   +SN+ IRK+T     ++T+AG G+  E   DG A
Sbjct: 301 TSAELAYPQAVALDTAGNLYISDGDYRSNN-IRKVTAATGIISTVAGDGT-AEFSGDGGA 358

Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
              +  N+     +     L I+D  N ++R+++
Sbjct: 359 AVDARLNEPGGVALDTAGNLYIADVNNSVVREVS 392



 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 37/56 (66%), Gaps = 4/56 (7%)

Query: 81  VVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++  +AGDG   +S G+ G+A   R ++P   A+D  GN+Y+AD +N V+R+++ L
Sbjct: 340 IISTVAGDGTAEFS-GDGGAAVDARLNEPGGVALDTAGNLYIADVNNSVVREVSPL 394


>gi|116619658|ref|YP_821814.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116222820|gb|ABJ81529.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 585

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 68/137 (49%), Gaps = 8/137 (5%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG--SARFDKPKSFAV 111
           P + +L + DS+      +S      S +  +AG+G  G++  +    SA  + P   A+
Sbjct: 60  PKTGNLYIADSANHRIRMIS-----GSTISTVAGNGTAGFAGDKAAATSANLNTPSGVAL 114

Query: 112 DMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG-RADGPAQNASFSNDFELTFVPHI 170
           D  GN Y+AD  N VIRK+T   +TT+AG  ++  G + DG   N +  N+     V   
Sbjct: 115 DSSGNFYIADSLNSVIRKVTGGTITTVAGDYTQFPGDQGDGGQANVAVLNNPTSVMVDPA 174

Query: 171 CALLISDHGNQLIRQIN 187
               I+D GN  IR+++
Sbjct: 175 GNYYIADSGNNRIRKVD 191



 Score = 43.9 bits (102), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 45/136 (33%), Positives = 68/136 (50%), Gaps = 12/136 (8%)

Query: 59  LIVLDSS--RSAFYTLSFPLSEESVVKRL---AGDGVQGYS-DGEPGS-ARFDKPKSFAV 111
           L + D+S  R A Y    P +  +V   L   AG+   G++ DG   + ++ +KP   AV
Sbjct: 222 LYIADTSNNRIAKYA---PYTANNVAADLTNFAGNLTAGFAGDGNTATLSQLNKPVGIAV 278

Query: 112 DMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
           D  GN+Y+AD +N  IRK+   G +TTIAG G        GPA +A  S    +    + 
Sbjct: 279 DSAGNVYIADSNNGRIRKVGTDGIITTIAGKGGSGYSGDGGPATSAVLSFPRGIAVAAN- 337

Query: 171 CALLISDHGNQLIRQI 186
             + I+D  N +IR +
Sbjct: 338 GTVYIADTNNHVIRAL 353



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 42/138 (30%), Positives = 55/138 (39%), Gaps = 27/138 (19%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRL--------AGDGVQGYSD----GEPGSARFDKPK 107
           + LDSS + FY      S  SV++++        AGD  Q   D    G+   A  + P 
Sbjct: 112 VALDSSGN-FYIAD---SLNSVIRKVTGGTITTVAGDYTQFPGDQGDGGQANVAVLNNPT 167

Query: 108 SFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
           S  VD  GN Y+AD  N+ IRK+   G      G     GR   P   A F N       
Sbjct: 168 SVMVDPAGNYYIADSGNNRIRKVDTTGTINAYLGTLATGGRLRNPYALALFGN------- 220

Query: 168 PHICALLISDHGNQLIRQ 185
                L I+D  N  I +
Sbjct: 221 ----VLYIADTSNNRIAK 234


>gi|421740691|ref|ZP_16178929.1| NHL repeat protein [Streptomyces sp. SM8]
 gi|406690906|gb|EKC94689.1| NHL repeat protein [Streptomyces sp. SM8]
          Length = 444

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 65/116 (56%), Gaps = 7/116 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S + +V D++R     L+     ESVV+R+ G+G +G +DG P  ARF +P+  A+  +G
Sbjct: 16  SGNFLVSDTTRHQLVELAE--DGESVVRRIGGEGERGLADGGPAEARFSEPQGLALTPEG 73

Query: 116 N-IYVADKSNHVIRKITNLG--VTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
           + + VAD  NH +R++      VTT AG G +  +     GPA+    S+ +++ +
Sbjct: 74  DAVIVADTVNHALRRVELASGVVTTPAGTGRQWWQGSPTSGPAREVDLSSPWDVAW 129



 Score = 43.5 bits (101), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 46/103 (44%), Gaps = 2/103 (1%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK 144
           AG   +G  DG    A F +P   A D    ++VAD     +R +   G V T  G G  
Sbjct: 158 AGTTNEGLVDGPGAEAWFAQPSGLAADGDARLWVADSETSALRWVDPEGTVHTAVGTGLF 217

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             G  DG A  A F +   +T +P   ++ +SD  N  +R+ +
Sbjct: 218 DFGHRDGEAAQALFQHPLAVTVLPD-GSVAVSDTYNHALRRYD 259



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 3/91 (3%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG-GGSKKEGRADGPAQNAS 157
           G+ RF  P    +   GN  V+D + H + ++   G + +   GG  + G ADG    A 
Sbjct: 3   GALRF--PGRAVLLPSGNFLVSDTTRHQLVELAEDGESVVRRIGGEGERGLADGGPAEAR 60

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINL 188
           FS    L   P   A++++D  N  +R++ L
Sbjct: 61  FSEPQGLALTPEGDAVIVADTVNHALRRVEL 91


>gi|291612976|ref|YP_003523133.1| NHL repeat-containing protein [Sideroxydans lithotrophicus ES-1]
 gi|291583088|gb|ADE10746.1| NHL repeat-containing protein [Sideroxydans lithotrophicus ES-1]
          Length = 385

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 57/105 (54%), Gaps = 5/105 (4%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGS 143
           LA  GV G SDG   +ARF++P     D   N+YVAD +NH IRKI    G  T   G +
Sbjct: 285 LASSGV-GSSDGIGANARFNQPMGITTDGS-NLYVADTNNHTIRKIVIATGNVTTPAGAA 342

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
              G ADG    A F++ F + ++    AL ++D+ N  IR++ L
Sbjct: 343 LVPGTADGAGPTARFNHPFGIIYIN--GALYVADYTNGSIRKVQL 385



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 51/112 (45%), Gaps = 7/112 (6%)

Query: 81  VVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTT 137
           +V   AG  G    +DG   +ARF  P   A D   N+YV D   + IRKI      VTT
Sbjct: 43  LVSTFAGTSGSHAITDGIGATARFQIPAYIASDGT-NLYVTDMYANNIRKIVIATGQVTT 101

Query: 138 IAGGGSKKEGRAD---GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            AG  +   G  D   GP  +A F+N + +        L ++D  N  IRQI
Sbjct: 102 FAGSITGASGVTDAPSGPGTSALFNNPYGIVIDATNTKLYVADGYNHTIRQI 153


>gi|325186506|emb|CCA21046.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1881

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 63/124 (50%), Gaps = 11/124 (8%)

Query: 71  TLSFPLSE-----ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH 125
           T SFP S      +  V+  AG G +G++DGE  ++ FD P   A+D  G  YV D  NH
Sbjct: 214 TWSFPKSNSKTKPQGAVETFAG-GSRGHNDGEAVASLFDNPNDLAIDSTGVTYVVDTGNH 272

Query: 126 VIRKITNLG-VTTIAGGGSKKEGRADGPAQNA--SFSNDFELTFVPHICALLISDHGNQL 182
            IR+I     VTT A  G++  G  DGP   A  +F +   +    +   + ++D GN  
Sbjct: 273 CIRRIDQRSRVTTFA--GNRTRGFRDGPLDIAQYNFPSGIAVVNEQNRVTVYVADTGNHR 330

Query: 183 IRQI 186
           IR+I
Sbjct: 331 IRRI 334



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 58/117 (49%), Gaps = 6/117 (5%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN---IYVADKSNHVIRKITN 132
           + + S V   AG+  +G+ DG    A+++ P   AV  + N   +YVAD  NH IR+I +
Sbjct: 277 IDQRSRVTTFAGNRTRGFRDGPLDIAQYNFPSGIAVVNEQNRVTVYVADTGNHRIRRIRD 336

Query: 133 LGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             V  IAG        G +DG A  + F     L  +     L+++D GN LIR I+
Sbjct: 337 GQVACIAGRCDSIPHPGFSDGNASESRFDTPLGLA-IDVDGNLIVADSGNNLIRLID 392



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 34/50 (68%), Gaps = 1/50 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAG 140
           G+SDG    +RFD P   A+D+ GN+ VAD  N++IR I  +G+T T+AG
Sbjct: 353 GFSDGNASESRFDTPLGLAIDVDGNLIVADSGNNLIRLIDAVGLTRTLAG 402


>gi|383636240|ref|ZP_09950646.1| redoxin domain-containing protein [Streptomyces chartreusis NRRL
           12338]
          Length = 605

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 74/139 (53%), Gaps = 14/139 (10%)

Query: 39  DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
           DG  +   P   + R PG      S   +V D++R     L+     ESVV+R+ G G +
Sbjct: 165 DGPYVAPEPEPTVLRFPGKALLLPSGTFLVSDTTRHQLVELAE--DGESVVRRI-GSGTR 221

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSK--KEG 147
           G++DG  G+A F++P+  A+  +G++ VAD  NH +R+  +T   VTT+AG G +  +  
Sbjct: 222 GFADGPTGTAAFNEPQGLALLDEGSVVVADTVNHALRRLDLTTGEVTTLAGTGRQWWQGS 281

Query: 148 RADGPAQNASFSNDFELTF 166
              GPA+    S+ +++  
Sbjct: 282 PTSGPAREVDLSSPWDVAL 300


>gi|222054715|ref|YP_002537077.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
 gi|221564004|gb|ACM19976.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
          Length = 2393

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/121 (33%), Positives = 68/121 (56%), Gaps = 7/121 (5%)

Query: 73   SFPLSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            S  L+E+ ++  +AG+GV G++ G+ G A     + P+  AVD+ GNIY+AD +N+ IR+
Sbjct: 1554 SITLAEQGIINTVAGNGVAGFT-GDGGIAVGSSLNYPEDAAVDLDGNIYIADSANNRIRR 1612

Query: 130  I--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            +   +  + TIAG G  +     GPA  A  +  F +        + ISD+ N  +R+I+
Sbjct: 1613 VDKASGSIATIAGNGLAEFSGDGGPAYMAGLNRPFAVALDTS-GNIYISDNLNYRVRKID 1671

Query: 188  L 188
            L
Sbjct: 1672 L 1672


>gi|375146530|ref|YP_005008971.1| NHL repeat containing protein [Niastella koreensis GR20-10]
 gi|361060576|gb|AEV99567.1| NHL repeat containing protein [Niastella koreensis GR20-10]
          Length = 434

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 47/139 (33%), Positives = 66/139 (47%), Gaps = 15/139 (10%)

Query: 53  RPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK---SF 109
           + G+  ++V     +   T  F      VV  LAG G  G++DG+  +A F+        
Sbjct: 94  KTGTGPVVVTIGKNTGASTTDFKYLFSYVVTTLAGSGNAGFADGKGANASFNFAGVRCQL 153

Query: 110 AVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
           +VD  GN+YV D  N  IRKI   G VTTIAG G  + G  +GPA  A F+N        
Sbjct: 154 SVDNIGNVYVPDGGNQRIRKIAPDGTVTTIAGTG--ENGYNEGPASGAKFNNP------- 204

Query: 169 HICALLISDHGNQLIRQIN 187
             CA  +  +GN  + + N
Sbjct: 205 --CATAMDANGNMYVAERN 221



 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 32/50 (64%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  +AG G  GY++G   S RF+ P   AVD  GN+Y+ D +N+ +RKIT
Sbjct: 383 VTTIAGAGDSGYAEGTGSSVRFNGPTGIAVDKNGNMYILDMANNRVRKIT 432



 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 36/56 (64%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  V  +AG G  GY++G    A+F+ P + A+D  GN+YVA+++   IRKIT
Sbjct: 174 IAPDGTVTTIAGTGENGYNEGPASGAKFNNPCATAMDANGNMYVAERNGRRIRKIT 229



 Score = 40.4 bits (93), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 51/104 (49%), Gaps = 9/104 (8%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVA----DKSNHVIRKITN--LGVTTIAGGGSKK 145
           G+ DG    ARF+ P   A+D K N+YV      ++++ +R IT     VTTIAG G   
Sbjct: 334 GWVDGIGTVARFNNPWGIAMDNKSNLYVTGLGEGRNSNCVRMITPDVWNVTTIAGAG--D 391

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            G A+G   +  F+    +  V     + I D  N  +R+I ++
Sbjct: 392 SGYAEGTGSSVRFNGPTGIA-VDKNGNMYILDMANNRVRKITVE 434


>gi|393778566|ref|ZP_10366837.1| Ig family protein [Ralstonia sp. PBA]
 gi|392714468|gb|EIZ02071.1| Ig family protein [Ralstonia sp. PBA]
          Length = 342

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 57/111 (51%), Gaps = 1/111 (0%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           ++   +V   AG G  G +DG   +A F+ P   A+D  GN+YVAD  N+++R+IT  GV
Sbjct: 227 ITPGGLVTTFAGSGTTGSADGPGNTASFNSPGGVALDGDGNVYVADTGNNLLRRITRAGV 286

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            +   G +   G  +G    A F   + +  V     + ++D  N LIR++
Sbjct: 287 VSTLAGQAGVTGAQNGIGSAARFKQPYGVV-VDADGTVYVADTFNNLIRKV 336



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 56/163 (34%), Positives = 79/163 (48%), Gaps = 24/163 (14%)

Query: 47  PHSVI-DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFD 104
           P+ +I D+ G  +L V D        ++ P  E +    +AG G+  G  DG   +A FD
Sbjct: 148 PYGIIFDKNG--NLFVSDVQTGILRKIT-PAGETT---NIAGGGISSGPVDGNGAAAYFD 201

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFE 163
                A+D  GNIY  + S + IRKIT  G VTT AG G+   G ADGP   ASF++   
Sbjct: 202 GLGGIALDAAGNIYATELSGNRIRKITPGGLVTTFAGSGTT--GSADGPGNTASFNS--- 256

Query: 164 LTFVPHICAL------LISDHGNQLIRQINLKPEDCSKSSQSG 200
               P   AL       ++D GN L+R+I       + + Q+G
Sbjct: 257 ----PGGVALDGDGNVYVADTGNNLLRRITRAGVVSTLAGQAG 295



 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 38/82 (46%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT 136
           + ++ V   AG G  G + G   +A F  P     D  GN++V+D    ++RKIT  G T
Sbjct: 119 AAQAEVSLFAGSGAAGSATGIGSAASFSLPYGIIFDKNGNLFVSDVQTGILRKITPAGET 178

Query: 137 TIAGGGSKKEGRADGPAQNASF 158
           T   GG    G  DG    A F
Sbjct: 179 TNIAGGGISSGPVDGNGAAAYF 200


>gi|29347802|ref|NP_811305.1| NHL repeat-containing protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383120108|ref|ZP_09940841.1| hypothetical protein BSIG_4606 [Bacteroides sp. 1_1_6]
 gi|29339703|gb|AAO77499.1| NHL repeat-containing protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251838367|gb|EES66454.1| hypothetical protein BSIG_4606 [Bacteroides sp. 1_1_6]
          Length = 440

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 44/134 (32%), Positives = 72/134 (53%), Gaps = 13/134 (9%)

Query: 65  SRSAFYTLS-------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           S+SA Y          +P  +  + + + G G  G+ DG+   A+F +P+  A+D +GN+
Sbjct: 311 SKSAVYRFKLTRDADGWPALDGDIDEYIPGAGA-GFRDGDVQEAQFKEPRGIAIDKEGNL 369

Query: 118 YVADKSNHVIRKI-TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           Y+AD  N+ IRK+ T L  VTTIAG G+   G  DG    A F+  + + ++     L I
Sbjct: 370 YIADVGNNRIRKVDTKLNVVTTIAGSGA--AGYKDGDPLEAQFNQPWGV-YLDKNEFLYI 426

Query: 176 SDHGNQLIRQINLK 189
           +D  N  IR++ ++
Sbjct: 427 ADQNNHCIRKLAIE 440


>gi|302821969|ref|XP_002992645.1| hypothetical protein SELMODRAFT_6806 [Selaginella moellendorffii]
 gi|300139609|gb|EFJ06347.1| hypothetical protein SELMODRAFT_6806 [Selaginella moellendorffii]
          Length = 79

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/65 (49%), Positives = 43/65 (66%), Gaps = 1/65 (1%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEG 147
           G  GY DG    +RF++P+S A+   G ++VAD +N  IRKI+  G VTTIAGG S++ G
Sbjct: 15  GESGYVDGPAAKSRFNRPQSLAICDNGAVFVADTTNLAIRKISKDGEVTTIAGGSSRRPG 74

Query: 148 RADGP 152
            AD P
Sbjct: 75  IADSP 79


>gi|336180276|ref|YP_004585651.1| NHL repeat containing protein [Frankia symbiont of Datisca
           glomerata]
 gi|334861256|gb|AEH11730.1| NHL repeat containing protein [Frankia symbiont of Datisca
           glomerata]
          Length = 398

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 51/88 (57%), Gaps = 2/88 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           A  + P + AVD  GN+Y++D  NH IRKI  LG +TTIAG G+       GPA  A  +
Sbjct: 133 AELNSPATTAVDSDGNVYISDSHNHRIRKIDPLGIITTIAGTGTAGFSGDGGPATAAQLN 192

Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
             + L  V    ++ ISD+ NQ IR+I+
Sbjct: 193 EPYGLA-VATDDSIYISDYENQRIRKID 219



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/110 (38%), Positives = 59/110 (53%), Gaps = 4/110 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           ++  +AG G  G+S DG P +A + ++P   AV    +IY++D  N  IRKI  LG +TT
Sbjct: 167 IITTIAGTGTAGFSGDGGPATAAQLNEPYGLAVATDDSIYISDYENQRIRKIDPLGIITT 226

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           IAG G+       GPA  A   N   L  V     L IS+ GN  IR+++
Sbjct: 227 IAGTGTAGFSGDGGPALQAQIKNPNNLA-VAADGTLYISELGNARIRKVS 275



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 58/114 (50%), Gaps = 15/114 (13%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TT 137
           ++  +AG G  G+S DG P   A+   P + AV   G +Y+++  N  IRK++  GV TT
Sbjct: 223 IITTIAGTGTAGFSGDGGPALQAQIKNPNNLAVAADGTLYISELGNARIRKVSPNGVITT 282

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQI 186
           +AG G+   G   GPA  A          VP +       L I+D+ N+ IR++
Sbjct: 283 VAGNGTSGYGGDGGPATAAQLR-------VPSVALSPDGVLYIADYSNERIRRV 329


>gi|375148602|ref|YP_005011043.1| NHL repeat containing protein [Niastella koreensis GR20-10]
 gi|361062648|gb|AEW01640.1| NHL repeat containing protein [Niastella koreensis GR20-10]
          Length = 716

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 62/116 (53%), Gaps = 5/116 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   ++   AG+G +G+S   G+   A+  +P + AVD  GN+Y++D SN VIRK+   
Sbjct: 132 VTTSGIMSIFAGNGTEGFSGDGGQAAQAKLYRPTAVAVDKDGNLYISDASNKVIRKVNKQ 191

Query: 134 GVTTIAGGGSKKEGRA--DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           GV +   G   + G A   GPA  A  +    +  V +   + I+D  N ++R+IN
Sbjct: 192 GVISTIAGVPGRAGYAGDGGPATKALLTQPAGIA-VDYSGNIYIADPSNSVVRKIN 246



 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 46/119 (38%), Positives = 63/119 (52%), Gaps = 5/119 (4%)

Query: 76  LSEESVVKRLAG-DGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++++ V+  +AG  G  GY+ DG P + A   +P   AVD  GNIY+AD SN V+RKI  
Sbjct: 188 VNKQGVISTIAGVPGRAGYAGDGGPATKALLTQPAGIAVDYSGNIYIADPSNSVVRKINP 247

Query: 133 LGV-TTIAGGGSKKEGRADGPAQNASFS-NDFELTFVPHICALLISDHGNQLIRQINLK 189
            GV TT AG G+       GPA  A F     +   V     +  SD+ N  IR+I+ K
Sbjct: 248 AGVITTFAGNGTAGYSGDGGPAIKAQFQMGSPQGLAVDPAGNVYASDYQNHAIRKISSK 306



 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 47/136 (34%), Positives = 67/136 (49%), Gaps = 10/136 (7%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFD--KPKSFAV 111
           S ++ + D S S    ++       V+   AG+G  GYS DG P   A+F    P+  AV
Sbjct: 229 SGNIYIADPSNSVVRKIN----PAGVITTFAGNGTAGYSGDGGPAIKAQFQMGSPQGLAV 284

Query: 112 DMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
           D  GN+Y +D  NH IRKI++ G +TTIAG G+       GPA  A       +    + 
Sbjct: 285 DPAGNVYASDYQNHAIRKISSKGIITTIAGTGAPDYAGDGGPAILAKIWYPIGIA-TDNA 343

Query: 171 CALLISDHGNQLIRQI 186
             + I+D  N  IR+I
Sbjct: 344 GNVFITDSYNNTIREI 359



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 57/101 (56%), Gaps = 4/101 (3%)

Query: 89  GVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKK 145
           G  GY    G   +ARF++P++ A D   N+Y+AD  NHVIRK+ + G ++T+AG G+  
Sbjct: 33  GSPGYGGDAGPANNARFNQPRAVATDNDNNLYIADMRNHVIRKVNSNGIISTVAGNGTAG 92

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
                GPA  A  +    +T + +   + I+D  + +I+++
Sbjct: 93  TAGDGGPATAAQLAQPTGMT-IDNDGNIYIADFNSSVIKKV 132



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 48/89 (53%), Gaps = 2/89 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI-AGGGSKKEGRADGPAQNASF 158
           +A+  +P    +D  GNIY+AD ++ VI+K+T  G+ +I AG G++      G A  A  
Sbjct: 102 AAQLAQPTGMTIDNDGNIYIADFNSSVIKKVTTSGIMSIFAGNGTEGFSGDGGQAAQAKL 161

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
                +  V     L ISD  N++IR++N
Sbjct: 162 YRPTAVA-VDKDGNLYISDASNKVIRKVN 189


>gi|383784367|ref|YP_005468936.1| hypothetical protein LFE_1113 [Leptospirillum ferrooxidans C2-3]
 gi|383083279|dbj|BAM06806.1| hypothetical protein LFE_1113 [Leptospirillum ferrooxidans C2-3]
          Length = 382

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 49/152 (32%), Positives = 67/152 (44%), Gaps = 6/152 (3%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP  +   P    L V DS  +    +         V  LAG      +DG    A F +
Sbjct: 125 NPEGLAISPDGKTLYVADSRNNRIRKIDL---ATKTVSTLAGHAFASGNDGIGDHAGFAQ 181

Query: 106 PKSFAVDMKGN-IYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
           P+  A+   G  +Y+ D  N++IRK  +T L VTT+AG G+   G ADG    A+F    
Sbjct: 182 PRGLAISPDGKTLYIGDSGNNMIRKMDLTTLQVTTLAGQGALVPGHADGVGTQATFLEPR 241

Query: 163 ELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
            L   P    L I+D  N LIR++ L     S
Sbjct: 242 GLAISPDGQILYIADTRNHLIRKLVLATNSVS 273



 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 41/108 (37%), Positives = 53/108 (49%), Gaps = 5/108 (4%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT----NLGVTTIAGGGSKKE 146
           +G +DG   SARF+ P+       G+IYVAD  N +IRKIT       V TIAG      
Sbjct: 53  RGDNDGNGASARFEFPQGIVAAPDGSIYVADTGNDMIRKITMSSGTGTVETIAGVNHHAR 112

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
            R DG    A F+N   L   P    L ++D  N  IR+I+L  +  S
Sbjct: 113 FR-DGAGAAARFNNPEGLAISPDGKTLYVADSRNNRIRKIDLATKTVS 159


>gi|111219919|ref|YP_710713.1| serine/threonine protein kinase [Frankia alni ACN14a]
 gi|111147451|emb|CAJ59101.1| Putative serine/threonine protein kinase [Frankia alni ACN14a]
          Length = 779

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 43/128 (33%), Positives = 63/128 (49%), Gaps = 15/128 (11%)

Query: 42  QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS 100
           +L+    +V DR  S DL + DS  +    +        +V  +AG G +G+S DG P +
Sbjct: 622 RLQATTAAVADR--SGDLYIADSEANRIRRID----SRGIVTTVAGTGAKGFSGDGGPAT 675

Query: 101 -ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG-------GSKKEGRADGP 152
            A  + P   AVD  GN+Y+AD +NH IR++T  G+ T   G       GS K    +GP
Sbjct: 676 RAMLNNPTGVAVDRAGNLYIADSANHRIRRVTPDGIITAFAGSGAAFVWGSPKSDETNGP 735

Query: 153 AQNASFSN 160
           A +    N
Sbjct: 736 ATSVVLWN 743



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 64/132 (48%), Gaps = 8/132 (6%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           +++KR  G       DG P SA R     +   D  G++Y+AD   + IR+I + G VTT
Sbjct: 599 TLIKRGTGLSPSPPGDGGPASAGRLQATTAAVADRSGDLYIADSEANRIRRIDSRGIVTT 658

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS 197
           +AG G+K      GPA  A  +N   +  V     L I+D  N  IR++     D   ++
Sbjct: 659 VAGTGAKGFSGDGGPATRAMLNNPTGVA-VDRAGNLYIADSANHRIRRVT---PDGIITA 714

Query: 198 QSGSALGAVSVW 209
            +GS  GA  VW
Sbjct: 715 FAGS--GAAFVW 724



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 29/59 (49%), Positives = 35/59 (59%), Gaps = 2/59 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           V  +AG G  G S DG P   A F +  S AVD  GNIY++D+SN V R  T+  VTTI
Sbjct: 486 VSTVAGSGETGTSGDGGPALQAAFQRLGSVAVDAAGNIYISDESNRVRRVDTHGTVTTI 544



 Score = 37.0 bits (84), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 2/90 (2%)

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQ 154
           G  G A  ++ K   VD  GN+Y+AD+++  IR+I   G V+T+AG G        GPA 
Sbjct: 446 GPDGVAAINEFKDLTVDAAGNVYLADQADPRIRRIDRRGYVSTVAGSGETGTSGDGGPAL 505

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIR 184
            A+F        V     + ISD  N++ R
Sbjct: 506 QAAFQR-LGSVAVDAAGNIYISDESNRVRR 534


>gi|290995104|ref|XP_002680171.1| predicted protein [Naegleria gruberi]
 gi|284093791|gb|EFC47427.1| predicted protein [Naegleria gruberi]
          Length = 928

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 39/105 (37%), Positives = 56/105 (53%), Gaps = 4/105 (3%)

Query: 85  LAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG+G  G+   +G   +A+   P S  V     +Y+AD SNH IRKI   G + TIAG 
Sbjct: 46  IAGNGTAGFRGDNGPATNAQLYNPYSVFVSSNNEVYIADFSNHRIRKILENGKIVTIAGN 105

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G+      +GPA NA  +N + + FV     + I D+ N  IR+I
Sbjct: 106 GTGGFSGDNGPATNAQLNNPYSV-FVSSNNEVYIVDYNNHRIRKI 149



 Score = 60.5 bits (145), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 39/112 (34%), Positives = 59/112 (52%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G   +A+ + P S  V     +Y+ D +NH IRKI  N  
Sbjct: 95  ENGKIVTIAGNGTGGFSGDNGPATNAQLNNPYSVFVSSNNEVYIVDYNNHRIRKILKNGN 154

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+      +GPA NA  +N   + FV     + I+D+ N  IR+I
Sbjct: 155 IVTIAGNGTGGFSGDNGPATNAQLNNPMGV-FVSSNNEVYIADYYNHRIRKI 205



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 40/137 (29%), Positives = 60/137 (43%), Gaps = 19/137 (13%)

Query: 69  FYTLSFPLSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           F  L   L++   +  +AG+G  G+   +G   +A+   P S  V     +Y+ D +NH 
Sbjct: 340 FIDLKRSLTKNGNIVTIAGNGTAGFRGDNGPATNAQLYNPYSVFVSSNNEVYIVDYNNHR 399

Query: 127 IRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFEL----------------TFVPH 169
           IRKI  N  + TIAG G+      +GPA NA  +N   +                 FV  
Sbjct: 400 IRKILKNGNIVTIAGNGTGGFSGDNGPATNAQLNNPMVVIHLGPATNAQLNNPYSVFVSS 459

Query: 170 ICALLISDHGNQLIRQI 186
              + I D+ N  IR+I
Sbjct: 460 NNEVYIVDYNNHRIRKI 476



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 2/92 (2%)

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQ 154
           G   +A+ + P S  V     +Y+ D +NH IRKI   G + TIAG G+      +GPA 
Sbjct: 442 GPATNAQLNNPYSVFVSSNNEVYIVDYNNHRIRKILENGNIVTIAGNGTGGFSGDNGPAT 501

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           NA  +N   + FV     + I+D+ N  IR+I
Sbjct: 502 NAQLNNPMGV-FVSSNNEVYIADYYNHRIRKI 532



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 35/101 (34%), Positives = 51/101 (50%), Gaps = 17/101 (16%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG+G  G+S  +G   +A+ + P    V     +Y+AD  NH IRKI   G + TIAG 
Sbjct: 158 IAGNGTGGFSGDNGPATNAQLNNPMGVFVSSNNEVYIADYYNHRIRKILENGNIVTIAGN 217

Query: 142 GSKKEGRADGPAQNASFSND--FELTFVPHICALLISDHGN 180
           G+            A FS D  F++   PHI   L++ +GN
Sbjct: 218 GT------------AGFSGDSPFDIRTYPHIGNKLLTGNGN 246



 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 32/88 (36%), Positives = 44/88 (50%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
           SA+ + P    V     +Y+AD  N  IRKI  N  + TIAG G+      +GPA NA  
Sbjct: 7   SAQLNSPFGVFVSSNNEVYIADYGNQRIRKILKNGNIVTIAGNGTAGFRGDNGPATNAQL 66

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
            N + + FV     + I+D  N  IR+I
Sbjct: 67  YNPYSV-FVSSNNEVYIADFSNHRIRKI 93



 Score = 47.4 bits (111), Expect = 0.007,   Method: Composition-based stats.
 Identities = 33/98 (33%), Positives = 47/98 (47%), Gaps = 17/98 (17%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           E   +  +AG+G  G+S  +G   +A+ + P    V     +Y+AD  NH IRKI   G 
Sbjct: 478 ENGNIVTIAGNGTGGFSGDNGPATNAQLNNPMGVFVSSNNEVYIADYYNHRIRKILENGN 537

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND--FELTFVPHI 170
           + TIAG G+            A FS D  F++   PHI
Sbjct: 538 IVTIAGNGT------------AGFSGDSPFDIRTYPHI 563


>gi|373955343|ref|ZP_09615303.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
 gi|373891943|gb|EHQ27840.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
          Length = 929

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 39/87 (44%), Positives = 53/87 (60%), Gaps = 4/87 (4%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
           F++P    VD+ GN+YVAD + + I+KIT  G VT +AG GS   G  +G    ASF   
Sbjct: 199 FNQPNGVTVDLNGNVYVADAATNSIKKITPTGSVTVVAGNGSM--GSNNGIGSAASFYYP 256

Query: 162 FELTFVPHICALLISDHGNQLIRQINL 188
           F +T V     L +SD+GN LIR+I+L
Sbjct: 257 FSVT-VDGAGNLYVSDNGNNLIRKIDL 282



 Score = 53.9 bits (128), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 29/73 (39%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V  LAG G+  ++DG   +A F  P    +D  GN+Y+AD  N+ +RK+T LG VTT+AG
Sbjct: 286 VTTLAGSGMAAFADGTGTAASFYGPCGGTLDAMGNLYIADGVNNRVRKVTPLGVVTTVAG 345

Query: 141 GGSKKEGRADGPA 153
            G++     +G +
Sbjct: 346 NGTRATINGNGTS 358



 Score = 52.0 bits (123), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 35/78 (44%), Positives = 45/78 (57%), Gaps = 3/78 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V  +AG+G  G ++G   +A F  P S  VD  GN+YV+D  N++IRKI   G VTT+AG
Sbjct: 232 VTVVAGNGSMGSNNGIGSAASFYYPFSVTVDGAGNLYVSDNGNNLIRKIDLAGAVTTLAG 291

Query: 141 GGSKKEGRADGPAQNASF 158
            G      ADG    ASF
Sbjct: 292 SG--MAAFADGTGTAASF 307



 Score = 50.8 bits (120), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 37/108 (34%), Positives = 55/108 (50%), Gaps = 6/108 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           VV  +AG+G +   +G   SA  + P    +D+ G +YVA+   + IRK+   G  TI  
Sbjct: 339 VVTTVAGNGTRATINGNGTSASLNTPTGATIDIAGIVYVAELDGNCIRKVDPSGNVTIL- 397

Query: 141 GGSKKEGRADGPAQNASFS--NDFELTFVPHICALLISDHGNQLIRQI 186
            GS   G A+G    ASF   ND +      I    ++D+GN +IR+I
Sbjct: 398 AGSNVAGSANGIGTAASFRRPNDVQADQSGFI---YVTDYGNNVIRKI 442



 Score = 41.6 bits (96), Expect = 0.39,   Method: Composition-based stats.
 Identities = 23/53 (43%), Positives = 28/53 (52%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           V  LAG  V G ++G   +A F +P     D  G IYV D  N+VIRKI   G
Sbjct: 394 VTILAGSNVAGSANGIGTAASFRRPNDVQADQSGFIYVTDYGNNVIRKILTTG 446


>gi|114566902|ref|YP_754056.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
           str. Goettingen]
 gi|114337837|gb|ABI68685.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
           str. Goettingen]
          Length = 1030

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 41/116 (35%), Positives = 62/116 (53%), Gaps = 10/116 (8%)

Query: 81  VVKRLAGDGV------QGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++  +AG+GV       GYS   G   SA+ + P   A D  GN+Y+AD +NH IRK+  
Sbjct: 594 IISTVAGNGVLTGSYKSGYSGDGGSATSAQLNYPYGVAFDASGNMYIADSNNHCIRKVDT 653

Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           LG ++T AG G+       GPA +A  +N   L+F  +   + I+D  N  IR ++
Sbjct: 654 LGIISTAAGNGTYGYSGDGGPATSAQLNNPNGLSF-DNRGNMYIADTYNHRIRMVD 708



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 39/121 (32%), Positives = 63/121 (52%), Gaps = 5/121 (4%)

Query: 71  TLSFPLSEES-VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           T    L++ + ++  +AG+G  GYS   G   SA  + P   A D  GN+Y+AD SN  +
Sbjct: 471 TFRLTLTQNTGIISTVAGNGTAGYSGDGGLAASALLNYPHGLAFDGNGNLYIADASNRRV 530

Query: 128 RKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           RKI + G +TT+AG G+       G A  A  +  + + F  +   + I+D  N  IR++
Sbjct: 531 RKIDSAGIITTVAGNGTSGYSGDGGSAIAAKITCPYGVAFDSN-GNMYIADIFNHRIRKV 589

Query: 187 N 187
           +
Sbjct: 590 D 590



 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 39/118 (33%), Positives = 62/118 (52%), Gaps = 12/118 (10%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TT 137
           ++   AG+G  GYS DG P  SA+ + P   + D +GN+Y+AD  NH IR +   GV +T
Sbjct: 656 IISTAAGNGTYGYSGDGGPATSAQLNNPNGLSFDNRGNMYIADTYNHRIRMVDPNGVIST 715

Query: 138 IAGGGSKKEGRAD--------GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +AG G+  +   +        G A +A  +N   +TF      + I+D  N  IR+++
Sbjct: 716 VAGNGNSGDRYGNDGGYSGDGGLATSAQLNNPNGITF-DSSGNMYIADSNNNCIRKVD 772



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 38/130 (29%), Positives = 63/130 (48%), Gaps = 12/130 (9%)

Query: 81  VVKRLAGDGVQGYSDGEPG----------SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+  +AG+G  G   G  G          SA+ + P     D  GN+Y+AD +N+ IRK+
Sbjct: 712 VISTVAGNGNSGDRYGNDGGYSGDGGLATSAQLNNPNGITFDSSGNMYIADSNNNCIRKV 771

Query: 131 TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            + G ++T AG G+       GPA +A   N   +  + +   L I+D+ +  IR++ L 
Sbjct: 772 DHSGMISTFAGNGTSGHFGDGGPATSAQLRNPVGVA-LDNSGNLFIADYFDHSIRKVVLA 830

Query: 190 PEDCSKSSQS 199
            +  S  S +
Sbjct: 831 AQQNSTISPT 840



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 44/143 (30%), Positives = 71/143 (49%), Gaps = 16/143 (11%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAV 111
           G+ +L + D+S      +        ++  +AG+G  GYS G+ GSA   +   P   A 
Sbjct: 516 GNGNLYIADASNRRVRKID----SAGIITTVAGNGTSGYS-GDGGSAIAAKITCPYGVAF 570

Query: 112 DMKGNIYVADKSNHVIRKITNLG-VTTIAGG----GSKKEGRA--DGPAQNASFSNDFEL 164
           D  GN+Y+AD  NH IRK+   G ++T+AG     GS K G +   G A +A  +  + +
Sbjct: 571 DSNGNMYIADIFNHRIRKVDPAGIISTVAGNGVLTGSYKSGYSGDGGSATSAQLNYPYGV 630

Query: 165 TFVPHICALLISDHGNQLIRQIN 187
            F      + I+D  N  IR+++
Sbjct: 631 AFDAS-GNMYIADSNNHCIRKVD 652


>gi|428180908|gb|EKX49774.1| hypothetical protein GUITHDRAFT_67798, partial [Guillardia theta
           CCMP2712]
          Length = 247

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 60/106 (56%), Gaps = 4/106 (3%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
           +AG G +G  DG   S R + P+    D  G++Y+AD SNH IR++   ++ + T+AG G
Sbjct: 74  VAGCGRRGCQDGSHESGRLNGPRGLTFDPHGDLYIADSSNHRIRRLRSQDMFLETVAGSG 133

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            + E R DG   NASFS  + L +      L++S+  +  IR+++ 
Sbjct: 134 -EGESR-DGTLLNASFSFPYGLAWDALTNTLIVSEQESHKIRRVDF 177



 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 28/60 (46%), Positives = 35/60 (58%), Gaps = 1/60 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V  +AG G  G+ DGE   A F++P   A D +GNI+VAD  N  IR IT  G V T+ G
Sbjct: 182 VTTIAGTGECGHRDGEASQATFNEPCFIACDQQGNIFVADSRNDSIRCITRHGFVHTLVG 241



 Score = 40.4 bits (93), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 43/147 (29%), Positives = 65/147 (44%), Gaps = 17/147 (11%)

Query: 43  LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSAR 102
           L  +PH         DL + DSS      L    S++  ++ +AG G     DG   +A 
Sbjct: 98  LTFDPHG--------DLYIADSSNHRIRRLR---SQDMFLETVAGSGEGESRDGTLLNAS 146

Query: 103 FDKPKSFAVD-MKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFS 159
           F  P   A D +   + V+++ +H IR++      VTTIAG G  + G  DG A  A+F 
Sbjct: 147 FSFPYGLAWDALTNTLIVSEQESHKIRRVDFAANSVTTIAGTG--ECGHRDGEASQATF- 203

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
           N+           + ++D  N  IR I
Sbjct: 204 NEPCFIACDQQGNIFVADSRNDSIRCI 230


>gi|336178839|ref|YP_004584214.1| serine/threonine protein kinase [Frankia symbiont of Datisca
           glomerata]
 gi|334859819|gb|AEH10293.1| serine/threonine protein kinase [Frankia symbiont of Datisca
           glomerata]
          Length = 776

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 62/115 (53%), Gaps = 4/115 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +    V+  +AG G  GYS DG P +A +  +P   A D  GN+Y+AD +N+ IR++++ 
Sbjct: 603 IGTNDVITTVAGSGTSGYSGDGGPATAAQLARPGGLAADTAGNLYIADNANNRIRRVSSD 662

Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           GV  T+AG G+       GPA  A  +N   +  V     + I+D  N  +R+++
Sbjct: 663 GVIITVAGSGTSGYSGDGGPATAAQLANPGSVA-VTDDGRVYIADTDNNRVRRVD 716



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/99 (39%), Positives = 53/99 (53%), Gaps = 4/99 (4%)

Query: 92  GYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGR 148
           GYS DG P +A +  KP S  VD  G +Y+AD  NH IR+I TN  +TT+AG G+     
Sbjct: 563 GYSGDGGPATAAQLAKPTSVLVDADGTLYIADTGNHRIRRIGTNDVITTVAGSGTSGYSG 622

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             GPA  A  +    L        L I+D+ N  IR+++
Sbjct: 623 DGGPATAAQLARPGGLA-ADTAGNLYIADNANNRIRRVS 660



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 56/114 (49%), Gaps = 3/114 (2%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S + V+  +AG G  GYS DG P +A +   P S AV   G +Y+AD  N+ +R++   
Sbjct: 659 VSSDGVIITVAGSGTSGYSGDGGPATAAQLANPGSVAVTDDGRVYIADTDNNRVRRVDAD 718

Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           GV TT+AG          GPA  A       L        L I+D+ N  IR++
Sbjct: 719 GVITTVAGSDEAGYSGDGGPATAARLCEPNGLGLDTTERLLYITDNCNDRIRRV 772



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 54/106 (50%), Gaps = 4/106 (3%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG GV G+S  +G   +A   KP    VD  G +Y  D  N+ IR+I   G +TT+AG 
Sbjct: 444 VAGTGVAGFSGDNGPATAADLSKPDDPLVDNTGAVYFTDTGNNRIRRIGADGIITTVAGT 503

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G+      +GPA  A F+   ++        L I+D  N  IR+IN
Sbjct: 504 GTYGFSGDNGPAAQAHFATPTDVAR-DRAGNLYIADTDNNRIRRIN 548



 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +  + ++  +AG G  G+S  +G    A F  P   A D  GN+Y+AD  N+ IR+I  +
Sbjct: 491 IGADGIITTVAGTGTYGFSGDNGPAAQAHFATPTDVARDRAGNLYIADTDNNRIRRINVV 550

Query: 134 GVTTIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G  T   G        D GPA  A  +    +  V     L I+D GN  IR+I
Sbjct: 551 GTVTTVAGTGTPGYSGDGGPATAAQLAKPTSV-LVDADGTLYIADTGNHRIRRI 603


>gi|423215200|ref|ZP_17201728.1| hypothetical protein HMPREF1074_03260 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692463|gb|EIY85701.1| hypothetical protein HMPREF1074_03260 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 451

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 57/98 (58%), Gaps = 1/98 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY +G   + R ++P+  A D  GN++V +KS H+IRKIT  GV ++  G   + G  DG
Sbjct: 355 GYVNGSGATVRLNQPRQPAFDEDGNMFVPEKSAHIIRKITPAGVASLYAGIPGQSGFGDG 414

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             + A F++   +T  P   ++ ++D  N +IR++ ++
Sbjct: 415 LPELAKFNSPECVTVYPD-NSVYVADRENHVIRRVTVE 451



 Score = 44.3 bits (103), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 28/43 (65%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  G+ DG P  A+F+ P+   V    ++YVAD+ NHVIR++T
Sbjct: 407 GQSGFGDGLPELAKFNSPECVTVYPDNSVYVADRENHVIRRVT 449


>gi|86610239|ref|YP_479001.1| NHL repeat-containing protein [Synechococcus sp. JA-2-3B'a(2-13)]
 gi|86558781|gb|ABD03738.1| NHL repeat protein [Synechococcus sp. JA-2-3B'a(2-13)]
          Length = 649

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 46/78 (58%), Gaps = 3/78 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V   AG  + GY DG    A+F +P   A+   G++YVAD++N  IR+IT  G VTT+AG
Sbjct: 426 VSTFAGTCIAGYRDGGLDRAQFREPSGLALGSDGSLYVADRANRRIRRITPAGEVTTVAG 485

Query: 141 GGSKKEGRADGPAQNASF 158
            G  + G  DGPA+ A  
Sbjct: 486 TG--QPGSVDGPAEQAQL 501



 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 51/97 (52%), Gaps = 4/97 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRAD 150
           GY DG    ARF      A D  GN++VAD+ NH IR++   G V+T+A  G  + G  D
Sbjct: 541 GYRDGPLAEARFQTLGGLAFDAAGNLWVADRDNHRIRRLQPGGQVSTLA--GQNQPGWQD 598

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           GP   A F     L  +P   ++++ D G   +R+I+
Sbjct: 599 GPTSVARFDQPAGLAVLPD-GSVVVVDAGLPGLRRIS 634



 Score = 57.0 bits (136), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 61/134 (45%), Gaps = 6/134 (4%)

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-N 132
           F LS E  ++  AG G  G  DG    A+FD P+    D KG ++VAD  NH +R I   
Sbjct: 364 FRLSPEGELEVWAGTGQAGRRDGAANQAQFDSPQGLLWDPKGGLWVADSGNHCLRYINLQ 423

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE- 191
             V+T A  G+   G  DG    A F     L       +L ++D  N+ IR+I    E 
Sbjct: 424 RQVSTFA--GTCIAGYRDGGLDRAQFREPSGLALG-SDGSLYVADRANRRIRRITPAGEV 480

Query: 192 -DCSKSSQSGSALG 204
              + + Q GS  G
Sbjct: 481 TTVAGTGQPGSVDG 494



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 55/107 (51%), Gaps = 7/107 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V  +AG G  G  DG    A+  +P + AVD +GN+++AD   H +R+++  G VTT++G
Sbjct: 480 VTTVAGTGQPGSVDGPAEQAQLLQPTALAVDKEGNLWIAD--GHRLRRLSAGGRVTTLSG 537

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
               + G  DGP   A F     L F      L ++D  N  IR++ 
Sbjct: 538 ---AEPGYRDGPLAEARFQTLGGLAFD-AAGNLWVADRDNHRIRRLQ 580



 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 2/81 (2%)

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
            AVD +G  Y+AD++ H I +++  G   +   G+ + GR DG A  A F +   L + P
Sbjct: 345 IAVDPEGAFYLADRAQHRIFRLSPEGELEVW-AGTGQAGRRDGAANQAQFDSPQGLLWDP 403

Query: 169 HICALLISDHGNQLIRQINLK 189
               L ++D GN  +R INL+
Sbjct: 404 K-GGLWVADSGNHCLRYINLQ 423



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 32/64 (50%), Gaps = 2/64 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVTTIA 139
           V  LAG    G+ DG    ARFD+P   AV   G++ V D     +R+I+ +  G  T  
Sbjct: 585 VSTLAGQNQPGWQDGPTSVARFDQPAGLAVLPDGSVVVVDAGLPGLRRISRVASGPVTPT 644

Query: 140 GGGS 143
            GGS
Sbjct: 645 PGGS 648


>gi|359151030|ref|ZP_09183771.1| Redoxin domain-containing protein [Streptomyces sp. S4]
          Length = 568

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 64/116 (55%), Gaps = 7/116 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S + +V D++R     L+     ESVV+R+ G+G +G +DG P  ARF +P+   +  +G
Sbjct: 140 SGNFLVSDTTRHQLVELAE--DGESVVRRIGGEGERGLADGGPAEARFSEPQGLVLTPEG 197

Query: 116 N-IYVADKSNHVIRKITNLG--VTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
           + + VAD  NH +R++      VTT AG G +  +     GPA+    S+ +++ +
Sbjct: 198 DAVIVADTVNHALRRVELASGVVTTPAGTGRQWWQGSPTSGPAREVDLSSPWDVAW 253



 Score = 43.9 bits (102), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 46/103 (44%), Gaps = 2/103 (1%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK 144
           AG   +G  DG    A F +P   A D    ++VAD     +R +   G V T  G G  
Sbjct: 282 AGTTNEGLVDGPGAEAWFAQPSGLAADGDARLWVADSETSALRWVDPEGTVHTAVGTGLF 341

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             G  DG A  A F +   +T +P   ++ +SD  N  +R+ +
Sbjct: 342 DFGHRDGEAAQALFQHPLAVTVLPD-GSVAVSDTYNHALRRYD 383



 Score = 40.4 bits (93), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 49/105 (46%), Gaps = 7/105 (6%)

Query: 87  GDGVQGYSDGEP--GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG-GGS 143
           GDG   Y   EP  G+ RF  P    +   GN  V+D + H + ++   G + +   GG 
Sbjct: 115 GDGP--YVAPEPVAGALRF--PGRAVLLPSGNFLVSDTTRHQLVELAEDGESVVRRIGGE 170

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            + G ADG    A FS    L   P   A++++D  N  +R++ L
Sbjct: 171 GERGLADGGPAEARFSEPQGLVLTPEGDAVIVADTVNHALRRVEL 215


>gi|288921012|ref|ZP_06415304.1| serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288347587|gb|EFC81872.1| serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 882

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 45/117 (38%), Positives = 64/117 (54%), Gaps = 8/117 (6%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  +AG GV GYS DG P +A +    +  AV   G+IY+AD  NH IRKI+  
Sbjct: 648 IAADGTISTIAGTGVAGYSGDGGPATAAQLKSAEKLAVAPDGSIYIADYENHRIRKISTD 707

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFS--NDFELTFVPHICALLISDHGNQLIRQIN 187
           G +TTIAG G        GPA  A+ +  ND E   V     L I++ G+  I++IN
Sbjct: 708 GIITTIAGTGVDGYSGEGGPATAATLNGPNDVE---VASDGTLYIANLGSDTIQKIN 761



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/89 (38%), Positives = 48/89 (53%), Gaps = 2/89 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           +A+ D P S   D KGNIY+ D  N+ IRKI   G ++TIAG G        GPA  A  
Sbjct: 618 AAKLDGPGSAVTDGKGNIYIPDARNYRIRKIAADGTISTIAGTGVAGYSGDGGPATAAQL 677

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
            +  +L   P   ++ I+D+ N  IR+I+
Sbjct: 678 KSAEKLAVAPD-GSIYIADYENHRIRKIS 705



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/160 (33%), Positives = 79/160 (49%), Gaps = 13/160 (8%)

Query: 36  TVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF------PLSEESVVKRLAGDG 89
           T +DG+  E  P +     G +D+ V  +S    Y  +        ++ +  +  +AG G
Sbjct: 716 TGVDGYSGEGGPATAATLNGPNDVEV--ASDGTLYIANLGSDTIQKINTDGTIHTVAGTG 773

Query: 90  VQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKE 146
            QGYS DG P + A+   P S ++   G +Y+AD  N+ IRK+  N  +TTIAG GS+  
Sbjct: 774 EQGYSGDGGPATKAQLSIP-SVSLGRNGTLYIADYGNNRIRKVDANGTITTIAGTGSEGS 832

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               GPA  A F+ D          AL I+D GN  IR+I
Sbjct: 833 SGDGGPATEAKFT-DPSSVVEDSTGALYIADSGNNRIRRI 871


>gi|357008076|ref|ZP_09073075.1| copper amine oxidase domain-containing protein [Paenibacillus elgii
           B69]
          Length = 533

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 53/94 (56%), Gaps = 4/94 (4%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPA 153
           DG+  ++ F++P+  A D  GN+YVAD  NH IRKI   G V+T+AG G    GR DG  
Sbjct: 119 DGKSDASLFNEPQGLAADANGNVYVADSGNHAIRKIDTAGQVSTVAGNGLL--GRKDGEG 176

Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           ++A F    ++  V     L ++D  N  IR I+
Sbjct: 177 KDALFYRPTDVA-VAADGTLYVADSLNHAIRSIS 209



 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 61/212 (28%), Positives = 95/212 (44%), Gaps = 35/212 (16%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG---------VQG-YSDGEPG 99
           V+D  G  +LIV DS       +     ++  V  LAG G         VQG ++DG   
Sbjct: 255 VLDAKG--NLIVSDSGNQRIRYIDL---QQGKVTTLAGGGQAATNKELHVQGGFADGSAS 309

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
            ARF  P   A+  +G + +AD  NH IR + +  V+TIAG   +  G ADG   +A+  
Sbjct: 310 DARFSFPMGLALTEEGGLVIADSQNHAIRYLLDGQVSTIAGAADRITGNADGIEGSAALH 369

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLK--PEDCSKSSQSGSALGAVSVWVLVSVLSC 217
              ++  +    ++L +D  N  +R+++L   P+   K           +V V+V     
Sbjct: 370 RPMDVAVLAD-GSILAADTYNNKLREVSLYRLPDGLPKDG---------TVKVVVDRQQ- 418

Query: 218 LVSLVIGFVARPYIIRHTGRLDPPLQHDMEAL 249
                I F A+P I+   GR   P++   EAL
Sbjct: 419 -----IAFEAQPEIV--NGRTMVPIRAITEAL 443



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 60/133 (45%), Gaps = 13/133 (9%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG------- 134
           V  +AG+G+ G  DGE   A F +P   AV   G +YVAD  NH IR I+  G       
Sbjct: 160 VSTVAGNGLLGRKDGEGKDALFYRPTDVAVAADGTLYVADSLNHAIRSISPSGEVKTLNA 219

Query: 135 -----VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
                V    G  S     ADG  ++A F+    L        L++SD GNQ IR I+L+
Sbjct: 220 LSPRVVELFPGQVSPAGDFADGDLKSAKFNEPTALVLDAK-GNLIVSDSGNQRIRYIDLQ 278

Query: 190 PEDCSKSSQSGSA 202
               +  +  G A
Sbjct: 279 QGKVTTLAGGGQA 291



 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 60/120 (50%), Gaps = 11/120 (9%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           PL+E +    LAG+G  G +DG   SA F  P    V   G + V+D  N +IRK++   
Sbjct: 40  PLTEMAT---LAGNGGLGSADGAGVSASFRMPGGLTVLKDGTVLVSDSRNQLIRKLSQGT 96

Query: 135 VTTIAGGGSKKEGRA-------DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           V+T AG   K++ +        DG + +AS  N+ +         + ++D GN  IR+I+
Sbjct: 97  VSTFAGAAYKQDSKGFPVGALLDGKS-DASLFNEPQGLAADANGNVYVADSGNHAIRKID 155


>gi|29349488|ref|NP_812991.1| hypothetical protein BT_4080 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|298384984|ref|ZP_06994543.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|383120427|ref|ZP_09941155.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
 gi|29341397|gb|AAO79185.1| putative cell surface protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251840523|gb|EES68605.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
 gi|298262128|gb|EFI04993.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 456

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 35/100 (35%), Positives = 52/100 (52%), Gaps = 2/100 (2%)

Query: 88  DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKE 146
           +G  GY DG  G A F+KP    VD +GNI + D  NH IRKIT + G  +   G  +  
Sbjct: 355 EGKSGYLDGPIGQAMFNKPGQMCVDSEGNIILTDTENHCIRKITMSTGYVSTLAGKPQNS 414

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G  +G A++A F     +  + +   + I D  N+ IR++
Sbjct: 415 GYVNGSAEDAQFKKPLGIC-IDNDDVMYIGDSENRAIRRL 453


>gi|261404592|ref|YP_003240833.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
           Y412MC10]
 gi|261281055|gb|ACX63026.1| copper amine oxidase domain protein [Paenibacillus sp. Y412MC10]
          Length = 1280

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 57/104 (54%), Gaps = 4/104 (3%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           A  ++P   AVD KG +Y+AD  NH IRKI   G VTTIAG G  + G  DG AQ+A F 
Sbjct: 138 AFLNRPSGIAVDGKGQVYIADSGNHAIRKIDQAGRVTTIAGNG--RIGLKDGKAQDALFH 195

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
              ++  V     L ++D  N +IR+I+   E  +  S S  A+
Sbjct: 196 EPQDVA-VTEDGILYVADTLNHVIRRISPDGEVTTIGSPSTRAV 238



 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 47/93 (50%), Gaps = 3/93 (3%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY DG   +ARF+ P+  A  ++  + +AD  NH +R++ +  V+T+AGG     G ADG
Sbjct: 318 GYLDGAADAARFNSPRGLAWSVEDGLLIADSHNHAVRQLKDERVSTMAGG---TRGYADG 374

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
                 F+   ++        L I+D  N  +R
Sbjct: 375 IESEVRFNAPADIAVASDSGELFIADQRNGAVR 407



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 69/155 (44%), Gaps = 15/155 (9%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           +N  S I   G   + + DS   A   +     +   V  +AG+G  G  DG+   A F 
Sbjct: 140 LNRPSGIAVDGKGQVYIADSGNHAIRKID----QAGRVTTIAGNGRIGLKDGKAQDALFH 195

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK--EGRAD-----GPAQNAS 157
           +P+  AV   G +YVAD  NHVIR+I+  G  T  G  S +  + RA      G  +N S
Sbjct: 196 EPQDVAVTEDGILYVADTLNHVIRRISPDGEVTTIGSPSTRAVQVRAGVIALAGDYKNGS 255

Query: 158 FS----NDFELTFVPHICALLISDHGNQLIRQINL 188
            +    N+     +     L +SD GN  IR I+ 
Sbjct: 256 LAEAQFNEPAGLALDGKGNLYVSDSGNHAIRYIDF 290


>gi|118161380|gb|ABK64062.1| conserved hypothetical protein [Janthinobacterium lividum]
          Length = 779

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 47/120 (39%), Positives = 63/120 (52%), Gaps = 12/120 (10%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV-------I 127
           ++ +  V  LAG     G SDG+  SARF      AVD  G IYVAD  +++       I
Sbjct: 269 ITPDGTVTTLAGISRTYGTSDGDAKSARFGGISGIAVDTAGAIYVADAGSNINGIVGSRI 328

Query: 128 RKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           RK+T  G V+T+AGG     G ADG    A+FS+   LT +     L ++D GN  IR+I
Sbjct: 329 RKLTPAGIVSTLAGG--SAYGSADGAGAVATFSSLNGLT-IDKQGNLFVADEGNHTIRKI 385



 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 51/104 (49%), Gaps = 13/104 (12%)

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPA 153
           +DG    ARF   ++ A D  GN+Y+ D  NH +RKIT  GV T   G +   G ADG  
Sbjct: 634 ADGSLAQARFGYLRAIAFDASGNLYLTDSLNHNVRKITPAGVVTTLAGTTGVAGDADGRG 693

Query: 154 QNASFSNDFELTFVPHICAL------LISDHGNQLIRQINLKPE 191
             ASF+        PH  AL       ++D  N L+R+I+   E
Sbjct: 694 SAASFNG-------PHGIALDKAGNVYVADTENNLVRRISTSGE 730



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/118 (34%), Positives = 58/118 (49%), Gaps = 8/118 (6%)

Query: 76  LSEESVVKRLAGDGVQ---GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV----IR 128
           ++   VV  LAG   Q     +DG   +ARF+ P   AVD  GN+YV+D +  +    +R
Sbjct: 385 ITPAGVVSTLAGTASQLRPAVADGIGAAARFNLPYGLAVDGAGNVYVSDSNPGLQLNGVR 444

Query: 129 KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           KIT  G  T   G +   G  DG A  A F+    +        L ++D GNQLIR++
Sbjct: 445 KITPAGEVTTITGSNAGIGVTDGLASEARFAGPQAIALHSD-GTLFVADTGNQLIRRV 501



 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 49/91 (53%), Gaps = 2/91 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           L+   +V  LAG    G +DG    A F       +D +GN++VAD+ NH IRKIT  G 
Sbjct: 331 LTPAGIVSTLAGGSAYGSADGAGAVATFSSLNGLTIDKQGNLFVADEGNHTIRKITPAGV 390

Query: 135 VTTIAGGGSK-KEGRADGPAQNASFSNDFEL 164
           V+T+AG  S+ +   ADG    A F+  + L
Sbjct: 391 VSTLAGTASQLRPAVADGIGAAARFNLPYGL 421



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 71/155 (45%), Gaps = 12/155 (7%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           +S   ++  LAG    G +DG+  +A FD   +  VD  G++YV   ++  IRKIT  G 
Sbjct: 217 ISPAGLLTTLAGSSSCGAADGKGTAASFDSMLAITVDASGSVYVGGGAS--IRKITPDGT 274

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL-------IRQINL 188
            T   G S+  G +DG A++A F        V    A+ ++D G+ +       IR+  L
Sbjct: 275 VTTLAGISRTYGTSDGDAKSARFGG-ISGIAVDTAGAIYVADAGSNINGIVGSRIRK--L 331

Query: 189 KPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSLVI 223
            P     +   GSA G+      V+  S L  L I
Sbjct: 332 TPAGIVSTLAGGSAYGSADGAGAVATFSSLNGLTI 366



 Score = 43.9 bits (102), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 33/50 (66%), Gaps = 1/50 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAG 140
           G +DG    ARF  P++ A+   G ++VAD  N +IR+++ +GVT T+AG
Sbjct: 463 GVTDGLASEARFAGPQAIALHSDGTLFVADTGNQLIRRVSAVGVTSTLAG 512



 Score = 41.2 bits (95), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 8/117 (6%)

Query: 76  LSEESVVKRLAGDGVQGY----SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S   V   LAG+   G      DG    A F +P    V   G  Y+AD   + IR++ 
Sbjct: 501 VSAVGVTSTLAGNPGHGSFLSSGDGTGAQATFAQPAGIVVGSDGVAYIADAFRNTIRRVG 560

Query: 132 NLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           N G VTT+AG    +    DG    A FS+   +  +     L ++D+  Q +R+++
Sbjct: 561 NDGVVTTLAGSYENRSQPTDGQGAKAGFSSTNGIA-IDGNATLYVADY--QAVRKVD 614


>gi|290974673|ref|XP_002670069.1| predicted protein [Naegleria gruberi]
 gi|284083624|gb|EFC37325.1| predicted protein [Naegleria gruberi]
          Length = 821

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 47/152 (30%), Positives = 77/152 (50%), Gaps = 21/152 (13%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEE-----------SVVKRLAGDGVQGYSDGEPG 99
           ++ PG   L+    + +  Y + FPLS +            +    AG+G  G + GE G
Sbjct: 101 VENPG---LLAFSPNETTLY-VCFPLSNKVQRMDFLPNGTEIFSDFAGNGNSG-NTGEGG 155

Query: 100 ---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPAQN 155
              +A+ ++P   AVD  G++Y++D  N +IRK++ + G+ T   G S+      G A+N
Sbjct: 156 LAINAQLNQPSGVAVDSNGDVYISDSGNGLIRKVSISSGIITKFAGTSQGFAGDGGLAKN 215

Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           A  SN   L F P+   L I+D  N ++R+IN
Sbjct: 216 AKLSNPRGLNFGPN-GDLYIADSDNNVVRRIN 246



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 38/146 (26%), Positives = 68/146 (46%), Gaps = 8/146 (5%)

Query: 44  EINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSA 101
           ++N  S +    + D+ + DS       +S       ++ + AG   QG++   G   +A
Sbjct: 161 QLNQPSGVAVDSNGDVYISDSGNGLIRKVSI---SSGIITKFAGTS-QGFAGDGGLAKNA 216

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSN 160
           +   P+       G++Y+AD  N+V+R+I + G +TTIAG  +       G A+ A   N
Sbjct: 217 KLSNPRGLNFGPNGDLYIADSDNNVVRRINSSGIITTIAGDTTSGYSGDGGDAKLAKMKN 276

Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
              +  V     + ISD  N +IR++
Sbjct: 277 PINV-IVSKTNEIFISDADNNVIRKV 301



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 51/98 (52%), Gaps = 9/98 (9%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP  +   P + DL + DS  +    ++       ++  +AGD   GYS G+ G A+  K
Sbjct: 220 NPRGLNFGP-NGDLYIADSDNNVVRRIN----SSGIITTIAGDTTSGYS-GDGGDAKLAK 273

Query: 106 ---PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
              P +  V     I+++D  N+VIRK++N  ++TIAG
Sbjct: 274 MKNPINVIVSKTNEIFISDADNNVIRKVSNGNISTIAG 311



 Score = 45.4 bits (106), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 51/91 (56%), Gaps = 3/91 (3%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG-GSKKEGRADGPAQNASF 158
           A+ + P S  V     I +AD SN+VIRK+   G + TIAG  GS+     +G A ++  
Sbjct: 496 AKLNAPNSVKVSSSDEIVIADTSNNVIRKVFKNGTIITIAGTIGSQGYSGDNGLAISSVL 555

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLK 189
           S+  ++   P+   + I+D GN +IR+I+ K
Sbjct: 556 SSPSDVAIAPN-GEVFIADWGNHVIRKIDTK 585



 Score = 40.4 bits (93), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 47/94 (50%), Gaps = 8/94 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYS--DGEPGSARF 103
           P+SV  +  SSD IV+  + +      F   +   +  +AG  G QGYS  +G   S+  
Sbjct: 501 PNSV--KVSSSDEIVIADTSNNVIRKVF---KNGTIITIAGTIGSQGYSGDNGLAISSVL 555

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
             P   A+   G +++AD  NHVIRKI   G+ T
Sbjct: 556 SSPSDVAIAPNGEVFIADWGNHVIRKIDTKGIIT 589


>gi|29349195|ref|NP_812698.1| hypothetical protein BT_3787 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29341103|gb|AAO78892.1| conserved hypothetical protein with NHL repeat [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 452

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 35/98 (35%), Positives = 52/98 (53%), Gaps = 1/98 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY +G   +ARFD P+  A D  GN++V +   H IRKIT  G  ++  G   + G  DG
Sbjct: 356 GYVNGSGVTARFDNPRQPAFDQDGNMFVPEYGRHTIRKITPTGEVSLYAGLPGQAGFTDG 415

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             + A F N  E   V    +L ++D  N LIR++ ++
Sbjct: 416 LPEKARF-NKPECVTVYLDNSLYVADRDNHLIRRVTVE 452



 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           P  E S+   L G    G++DG P  ARF+KP+   V +  ++YVAD+ NH+IR++T
Sbjct: 396 PTGEVSLYAGLPGQA--GFTDGLPEKARFNKPECVTVYLDNSLYVADRDNHLIRRVT 450


>gi|325104254|ref|YP_004273908.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
 gi|324973102|gb|ADY52086.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
          Length = 439

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 84/171 (49%), Gaps = 24/171 (14%)

Query: 29  EDGYTVTTV--------IDGHQLEI---NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPL 76
           E  Y V+T+        IDG+ L+    NP  + +D  G  D+I+ D +  +       +
Sbjct: 117 ESNYIVSTIAGSTTFGFIDGNGLDARFRNPDGILVDMNG--DIIITDRTNHSIR----KM 170

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
           +   VV  LAG GV GY++G+PG  +F+ P    +D  GNI V +K    IRKI   G V
Sbjct: 171 TTAGVVSTLAGTGVSGYANGKPG--QFNTPWQSTMDAAGNIIVIEKDGGRIRKIAPDGAV 228

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + IAG GS   G  DG    A F++  +   V     + ++D  N  IR+I
Sbjct: 229 SLIAGTGS--LGFTDGNVSVARFNHALD-GVVDSEGNIFVADRNNYRIRKI 276



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 53/103 (51%), Gaps = 7/103 (6%)

Query: 42  QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA 101
           Q      S +D  G+  +I  D  R         ++ +  V  +AG G  G++DG    A
Sbjct: 194 QFNTPWQSTMDAAGNIIVIEKDGGRIR------KIAPDGAVSLIAGTGSLGFTDGNVSVA 247

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
           RF+      VD +GNI+VAD++N+ IRKIT  G V+T  G G+
Sbjct: 248 RFNHALDGVVDSEGNIFVADRNNYRIRKITPAGMVSTFMGDGT 290



 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 33/52 (63%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ V  +AG+G  G  DG   +A F++P    +D KG+IYVAD  NH IRKI
Sbjct: 385 KATVSTIAGNGTAGRGDGLGHAATFNQPYDVVMDAKGDIYVADNVNHSIRKI 436



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 62/125 (49%), Gaps = 15/125 (12%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-- 130
           +++  V+  + G   +GY DG PG   + +         D +GNI +AD SN+ IRKI  
Sbjct: 319 ITKSGVMTTIVGS-TKGYDDGTPGNPLTVKLGDIFGLNFDNEGNILLADASNNRIRKITP 377

Query: 131 ------TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
                 T   V+TIAG G+   GR DG    A+F+  +++        + ++D+ N  IR
Sbjct: 378 GVGNDWTKATVSTIAGNGT--AGRGDGLGHAATFNQPYDVVMDAK-GDIYVADNVNHSIR 434

Query: 185 QINLK 189
           +I  K
Sbjct: 435 KIIYK 439


>gi|301064161|ref|ZP_07204608.1| RHS repeat-associated core domain protein [delta proteobacterium
           NaphS2]
 gi|300441781|gb|EFK06099.1| RHS repeat-associated core domain protein [delta proteobacterium
           NaphS2]
          Length = 2050

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 58/110 (52%), Gaps = 3/110 (2%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTT 137
           ++  +AG G  G+S DG P   A    P   A+D  GN+Y+ D  N+ +RK+ TN  +TT
Sbjct: 804 IITTVAGSGDYGFSGDGGPAIEASLSFPMGIAIDSAGNLYILDSDNNRVRKVDTNGIITT 863

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +AG G+       GPA  AS S+      +     L ISD GN  IR+++
Sbjct: 864 VAGNGNWSYNGDGGPAVEASLSSAASGIAIDSAGNLYISDTGNYCIRKVD 913



 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 39/110 (35%), Positives = 56/110 (50%), Gaps = 4/110 (3%)

Query: 81   VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTT 137
            ++  +AG+GV G+S DG P   A        A+D  GN+Y+ D SNH +RK+ TN  +TT
Sbjct: 917  IITTVAGNGVAGFSGDGGPAVEASLGWAMGIAIDSAGNLYILDGSNHRVRKVDTNGIITT 976

Query: 138  IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            +AG          GPA  AS      +  +     L ISD  N  IR+++
Sbjct: 977  VAGSDDYGFSGDGGPAIEASLGYAVGIA-IDSAENLYISDSSNHCIRRVD 1025



 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 38/136 (27%), Positives = 64/136 (47%), Gaps = 9/136 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG--SARFDKPKSFAVD 112
           + +L +LDS  +    +        ++  +AG+G   Y+ DG P   ++        A+D
Sbjct: 839 AGNLYILDSDNNRVRKVD----TNGIITTVAGNGNWSYNGDGGPAVEASLSSAASGIAID 894

Query: 113 MKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC 171
             GN+Y++D  N+ IRK+ TN  +TT+AG G        GPA  AS      +  +    
Sbjct: 895 SAGNLYISDTGNYCIRKVDTNGIITTVAGNGVAGFSGDGGPAVEASLGWAMGIA-IDSAG 953

Query: 172 ALLISDHGNQLIRQIN 187
            L I D  N  +R+++
Sbjct: 954 NLYILDGSNHRVRKVD 969



 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 4/110 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTT 137
           V+  +AG+G   Y+ DG P   A        A D  GN+++AD  N+ IRK+ TN  +TT
Sbjct: 748 VLTTVAGNGNWSYNGDGGPAIEASLRSSYGIATDSAGNLHIADWGNNRIRKVDTNGIITT 807

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +AG G        GPA  AS S    +  +     L I D  N  +R+++
Sbjct: 808 VAGSGDYGFSGDGGPAIEASLSFPMGIA-IDSAGNLYILDSDNNRVRKVD 856



 Score = 38.9 bits (89), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 57/126 (45%), Gaps = 7/126 (5%)

Query: 81   VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
            ++  +AG    G+S DG P   A        A+D   N+Y++D SNH IR++   G + T
Sbjct: 973  IITTVAGSDDYGFSGDGGPAIEASLGYAVGIAIDSAENLYISDSSNHCIRRVDTGGIIAT 1032

Query: 138  IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK---PEDCS 194
            +AG G         PA+ +       +  V     L ISD  N+ I ++ +    P   S
Sbjct: 1033 VAGNGIYGSIEDGVPAKKSPLGYPVGVA-VDTAGNLYISDEANERILKVGVPHNLPIPSS 1091

Query: 195  KSSQSG 200
             S ++G
Sbjct: 1092 ASMETG 1097


>gi|380693812|ref|ZP_09858671.1| hypothetical protein BfaeM_07498 [Bacteroides faecis MAJ27]
          Length = 456

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 35/100 (35%), Positives = 52/100 (52%), Gaps = 2/100 (2%)

Query: 88  DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKE 146
           +G  GY DG  G A F+KP    VD +GNI + D  NH IRKIT + G  +   G  +  
Sbjct: 355 EGKSGYLDGPIGQAMFNKPGQMCVDSEGNIILTDTENHCIRKITMSTGYVSTLAGKPQNS 414

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G  +G A++A F     +  + +   + I D  N+ IR++
Sbjct: 415 GYVNGSAEDAQFKKPLGIC-IDNDDVMYIGDSENRAIRRL 453


>gi|290975761|ref|XP_002670610.1| predicted protein [Naegleria gruberi]
 gi|284084171|gb|EFC37866.1| predicted protein [Naegleria gruberi]
          Length = 308

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 42/112 (37%), Positives = 59/112 (52%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G   +A+F+ P S  V  K  +Y+ D SNH IRKI  N  
Sbjct: 39  ENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITDYSNHRIRKILENGN 98

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+      +GPA NA   N   + FV     + I+D  N  IR+I
Sbjct: 99  IITIAGNGTVGFSGDNGPATNAQLYNPSSV-FVSSNNEVYIADFCNHRIRKI 149



 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/112 (36%), Positives = 58/112 (51%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G   +A+F+ P S  V  K  +Y+ D SNH IRKI  N  
Sbjct: 151 ENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITDYSNHRIRKILENGN 210

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+      +GPA NA   N   + FV     +  +D  N  IR+I
Sbjct: 211 IITIAGNGTVGFSGDNGPATNAQLYNPSSV-FVSSNNEVYFTDQHNNRIRKI 261



 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G   +A+   P S  V     +Y+AD  NH IRKI  N  
Sbjct: 95  ENGNIITIAGNGTVGFSGDNGPATNAQLYNPSSVFVSSNNEVYIADFCNHRIRKILENGN 154

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+      +GPA NA F+    + FV     + I+D+ N  IR+I
Sbjct: 155 IVTIAGNGNYGFSGDNGPATNAQFNYPCSV-FVSSKNEVYITDYSNHRIRKI 205



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 45/87 (51%), Gaps = 2/87 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           A+   P S  V     +Y+AD  NH IRKI  N  + TIAG G+      +GPA NA F+
Sbjct: 8   AQLFNPSSVFVSSNNEVYIADFCNHRIRKILENGNIVTIAGNGNYGFSGDNGPATNAQFN 67

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
               + FV     + I+D+ N  IR+I
Sbjct: 68  YPCSV-FVSSKNEVYITDYSNHRIRKI 93



 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 50/103 (48%), Gaps = 4/103 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G   +A+   P S  V     +Y  D+ N+ IRKI  N  
Sbjct: 207 ENGNIITIAGNGTVGFSGDNGPATNAQLYNPSSVFVSSNNEVYFTDQHNNRIRKILENGN 266

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISD 177
           + TIAG G+      +GPA NA   N   + FV     + I+D
Sbjct: 267 IITIAGNGNYGFSGDNGPATNAQLYNPNSV-FVSSNNEVYITD 308


>gi|351729452|ref|ZP_08947143.1| NHL repeat-containing protein [Acidovorax radicis N35]
          Length = 659

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 42/69 (60%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
           V  ++DG+  +ARF +PK+ AVD  GN+YVAD  NH++R+I+  G  T   G     G  
Sbjct: 44  VSNHADGDGPAARFYQPKAVAVDTAGNMYVADTENHLVRRISPTGAVTTLAGKPGVCGNQ 103

Query: 150 DGPAQNASF 158
           DG +  A+F
Sbjct: 104 DGTSDTATF 112



 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 53/111 (47%), Gaps = 9/111 (8%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH-------VIRKITNL 133
           V  LAG  G  G++DG+  +A+F    + A D  GN+YV D   H        IRKIT  
Sbjct: 482 VTLLAGSLGTVGHADGQGAAAQFSALGNMATDAAGNVYVVDGLFHEVNKIGPTIRKITPS 541

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
           G V+T+AG  +   G ADG    A F+ D    +     AL +   GN  I
Sbjct: 542 GMVSTLAGNPAAAPGYADGTGAAAVFTVDLGFRYPQQTAALAVDAQGNVYI 592



 Score = 43.9 bits (102), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 46/106 (43%), Gaps = 10/106 (9%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG--GSKKEGRADGPAQNASFSNDFE 163
           P   A D  G +Y AD ++HV+R+    G  T+  G  G   +G  DG A  A F     
Sbjct: 179 PVHLATDSTGALYAADPNDHVVRRYATDGQATVVSGTVGPNNQGTTDGTASQAKFGAFKA 238

Query: 164 LTFVPHICALLISDHGNQLIRQIN--------LKPEDCSKSSQSGS 201
           + F       ++ + G   IR+I         ++ + C++S   GS
Sbjct: 239 MAFDAADRLFVLDNTGLPAIRRIGSDGSATTLVRAQGCTESRAPGS 284



 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 11/67 (16%)

Query: 76  LSEESVVKRLAGD--GVQGYSDGEPGSARF---------DKPKSFAVDMKGNIYVADKSN 124
           ++   +V  LAG+     GY+DG   +A F          +  + AVD +GN+Y+ D ++
Sbjct: 538 ITPSGMVSTLAGNPAAAPGYADGTGAAAVFTVDLGFRYPQQTAALAVDAQGNVYITDGAH 597

Query: 125 HVIRKIT 131
           HV+R+IT
Sbjct: 598 HVVRRIT 604



 Score = 40.8 bits (94), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 30/61 (49%), Gaps = 2/61 (3%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           P    ++V  LA     G  DG+ GSARFD P + A    G +YV D  N  +R +   G
Sbjct: 321 PQGTSTLVAGLAS--APGADDGQGGSARFDAPGALAWSPSGTLYVRDDGNRSLRTVMPDG 378

Query: 135 V 135
           V
Sbjct: 379 V 379



 Score = 38.5 bits (88), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 9/68 (13%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD--------KSNHV 126
           +S    V  LAG  GV G  DG   +A F +P S AVD  G +YVA+         + + 
Sbjct: 84  ISPTGAVTTLAGKPGVCGNQDGTSDTATFCRPSSIAVDKAGTVYVAETRPSSTTLSAANP 143

Query: 127 IRKITNLG 134
           IRKI+  G
Sbjct: 144 IRKISTTG 151


>gi|222056335|ref|YP_002538697.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
 gi|221565624|gb|ACM21596.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
          Length = 372

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 43/118 (36%), Positives = 59/118 (50%), Gaps = 10/118 (8%)

Query: 76  LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITN 132
           L+    V  LAG   Q G +DG   +ARF+ P     D   N+YVAD  N++IRK  IT 
Sbjct: 44  LTLTGTVSVLAGQAPQMGTADGTGSAARFNAPSGITTDGT-NLYVADTGNNLIRKVVITT 102

Query: 133 LGVTTIAG----GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             VTT+AG    G ++  G  DG    A F+  F +T       L ++D  N  IR++
Sbjct: 103 GAVTTLAGTVGTGTAQTSGSTDGTGSAAKFNAPFAIT--TDGTNLYVADTNNNTIRKV 158



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 67/152 (44%), Gaps = 10/152 (6%)

Query: 40  GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV---QGYSDG 96
           G     N  S I   G+ +L V D+  +    +       + +    G G     G +DG
Sbjct: 67  GSAARFNAPSGITTDGT-NLYVADTGNNLIRKVVITTGAVTTLAGTVGTGTAQTSGSTDG 125

Query: 97  EPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPAQ 154
              +A+F+ P  FA+   G N+YVAD +N+ IRK+    G  T   G     G ADG   
Sbjct: 126 TGSAAKFNAP--FAITTDGTNLYVADTNNNTIRKVVIATGTVTTLAGSVGIPGSADGIGP 183

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
              F++   +T       L +SD GN+ IR++
Sbjct: 184 AGLFNSPGGIT--TDGTNLYVSDTGNRTIRKV 213



 Score = 40.8 bits (94), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 50/164 (30%), Positives = 71/164 (43%), Gaps = 14/164 (8%)

Query: 33  TVTTV-----IDGHQLEINPHSVIDRPG--SSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           TVTT+     I G    I P  + + PG  ++D   L  S +   T+   +     V  L
Sbjct: 164 TVTTLAGSVGIPGSADGIGPAGLFNSPGGITTDGTNLYVSDTGNRTIRKVVIATGAVTTL 223

Query: 86  AGD-GVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKIT-NLGVTTIAGGG 142
           AG  G  G +DG   SA F     F +   G +++VAD  N  IRKI    G+ T   G 
Sbjct: 224 AGSAGTPGSTDGVGPSALFGTV--FGITTDGTSLFVADTDNSTIRKIVIATGMVTTLAGS 281

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +   G ADG    A F+  F +T       L ++D     IR++
Sbjct: 282 AGVSGIADGTGSTAKFNAPFGIT--TDGTNLYVTDSRQGSIRKV 323


>gi|168701257|ref|ZP_02733534.1| NHL repeat containing protein [Gemmata obscuriglobus UQM 2246]
          Length = 358

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 45/111 (40%), Positives = 58/111 (52%), Gaps = 7/111 (6%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTT 137
           ++R AG G +GY+ DG     A FD PK   +D  GN++V D  N VIRKI      VTT
Sbjct: 248 IERFAGTGKKGYTGDGTKALDATFDGPKEIDIDKDGNVFVVDTENEVIRKIDAKSGVVTT 307

Query: 138 IAGGG-SKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           IAG G +K  G  D GPA  A+      +   P   AL I D  +  IR++
Sbjct: 308 IAGKGRTKTPGLGDNGPATGATLGRPHGVAVGPD-GALYIGDTNSHRIRKV 357



 Score = 40.8 bits (94), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 58/113 (51%), Gaps = 5/113 (4%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG-VT 136
           V+    G G +G++ DG P + A  D+P   A D  GN+Y +D  NH++RK+    G +T
Sbjct: 18  VITSPVGSGKKGFAGDGGPAAKAELDQPFDVAFDKAGNLYFSDTFNHLVRKVDAKTGTIT 77

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           T+AG G K  G   G A  AS +  + +        L I D  N  +R+++ K
Sbjct: 78  TVAGNGRKGFGGDGGKATEASLNEPYGIELDAD-GNLYIVDRLNFCVRKVDGK 129


>gi|196229731|ref|ZP_03128595.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
 gi|196226057|gb|EDY20563.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
          Length = 360

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 64/132 (48%), Gaps = 7/132 (5%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--- 130
           ++ +  +   AG G +GYS DG P   A F+ P     D  GN Y+AD +NH IR++   
Sbjct: 74  VTPDGKIHTAAGIGQKGYSGDGGPALEATFNLPHEIRFDRAGNYYIADMANHAIRRVDAK 133

Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
           T + +TT AG G        GPA  A       + F P   +L I D GN  IR++++  
Sbjct: 134 TKI-ITTFAGTGKPGYSGDGGPAAQAQLKQPHSIQFGPD-GSLYICDVGNNCIRKVDMTT 191

Query: 191 EDCSKSSQSGSA 202
              S  + +G A
Sbjct: 192 GTISTFAGTGKA 203



 Score = 44.3 bits (103), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 64/143 (44%), Gaps = 10/143 (6%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARF 103
           PH +  DR G  +  + D +  A   +    ++  ++   AG G  GYS DG P + A+ 
Sbjct: 106 PHEIRFDRAG--NYYIADMANHAIRRVD---AKTKIITTFAGTGKPGYSGDGGPAAQAQL 160

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
            +P S      G++Y+ D  N+ IRK+  T   ++T AG G   +     P +       
Sbjct: 161 KQPHSIQFGPDGSLYICDVGNNCIRKVDMTTGTISTFAGTGKAGDTPDGSPIEGTPLKGP 220

Query: 162 FELTFVPHICALLISDHGNQLIR 184
             + F       L++  GNQ+++
Sbjct: 221 RSMDFDKEGNLWLVTREGNQVLK 243



 Score = 41.2 bits (95), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 9/92 (9%)

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADK 122
           +R     L F  ++  ++   AG G +G++ +G P   A    PK  AVD +GN+++AD 
Sbjct: 235 TREGNQVLKFD-AKTGIISIAAGTGKKGFTGNGGPALEATLSGPKGIAVDAQGNVWLADT 293

Query: 123 SNHVIRKIT--NLGVTTIAGGGSKKEGRADGP 152
            +H IR+I      +  +AG G     R DGP
Sbjct: 294 ESHSIREINAKTGAIELVAGDGQ----RGDGP 321


>gi|111219920|ref|YP_710714.1| serine/threonine protein kinase [Frankia alni ACN14a]
 gi|111147452|emb|CAJ59102.1| Putative serine/threonine protein kinase [Frankia alni ACN14a]
          Length = 725

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/86 (38%), Positives = 49/86 (56%), Gaps = 2/86 (2%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
            +KP + AVD  GN+Y+ DKSN  +R++   G VTT+AG G +      GPA  A  ++ 
Sbjct: 395 LNKPDAMAVDTAGNLYIVDKSNQRVRRVDRDGVVTTVAGNGIRGFTGDGGPAIRAELADP 454

Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
             +  V     + ISD GNQ +R++N
Sbjct: 455 AGIA-VDAAGDIYISDQGNQRVRRVN 479



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/126 (30%), Positives = 61/126 (48%), Gaps = 15/126 (11%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +  + VV  +AG+G++G++ DG P   A    P   AVD  G+IY++D+ N  +R++   
Sbjct: 422 VDRDGVVTTVAGNGIRGFTGDGGPAIRAELADPAGIAVDAAGDIYISDQGNQRVRRVNPA 481

Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELT-----------FVPHICALLISDHGNQ 181
           GV TT AG G       +GP +   FS D  L            +V     + I D  N 
Sbjct: 482 GVITTFAGTGVFGFSGENGP-KIGGFSGDGVLARQAMLDEPSALWVDRAGNVYICDGSND 540

Query: 182 LIRQIN 187
            IR+++
Sbjct: 541 RIRKVD 546



 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 1/89 (1%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           +AR   P+S AVD+ G +YV D+ N+ +R+I   G +TT AG G+       GPA  A+ 
Sbjct: 571 AARLQWPESLAVDVAGTMYVTDQGNNRVRRIDTHGIITTAAGTGTMGFSGDGGPATRAAI 630

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
                   V     + ++D     IR+I+
Sbjct: 631 HTVGAGVTVDDAGNIYLADPQVNRIRRID 659



 Score = 45.4 bits (106), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 41/127 (32%), Positives = 58/127 (45%), Gaps = 17/127 (13%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG--------------SARFDKPKSFAVDMKGNIYVAD 121
           ++   V+   AG GV G+S GE G               A  D+P +  VD  GN+Y+ D
Sbjct: 478 VNPAGVITTFAGTGVFGFS-GENGPKIGGFSGDGVLARQAMLDEPSALWVDRAGNVYICD 536

Query: 122 KSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGN 180
            SN  IRK+   GV +T+ G G    G   GPA  A       L  V     + ++D GN
Sbjct: 537 GSNDRIRKVDTAGVISTVVGSGGDGFGGDGGPATAARLQWPESLA-VDVAGTMYVTDQGN 595

Query: 181 QLIRQIN 187
             +R+I+
Sbjct: 596 NRVRRID 602



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 39/119 (32%), Positives = 53/119 (44%), Gaps = 6/119 (5%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +    ++   AG G  G+S DG P +  A         VD  GNIY+AD   + IR+I  
Sbjct: 601 IDTHGIITTAAGTGTMGFSGDGGPATRAAIHTVGAGVTVDDAGNIYLADPQVNRIRRIDT 660

Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
            G +TTIAG G          A +A+ S    L F  +   LL+S+     I  I L P
Sbjct: 661 HGIITTIAGTGVAGNAGNGQRALDAAISYPSSLLF--YRGYLLVSEVNGSRILAIALGP 717


>gi|153867703|ref|ZP_01997987.1| NHL repeat protein [Beggiatoa sp. SS]
 gi|152144978|gb|EDN72014.1| NHL repeat protein [Beggiatoa sp. SS]
          Length = 263

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 64/112 (57%), Gaps = 4/112 (3%)

Query: 86  AGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
           AGDG  G++ DG P +A + + P    +D  GN+Y+AD  NH IRKI + G ++T+AG G
Sbjct: 11  AGDGNLGFAGDGGPATAAQLNSPLGMGIDSAGNLYIADTENHRIRKIDSSGNISTVAGDG 70

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           ++     DG A  ++  N+     +     L I+D GN ++R++++   + S
Sbjct: 71  NRGY-SGDGAAAVSAKLNNPMWVSLDSAGNLYIADTGNNVVRKLDIASGNIS 121



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 55/110 (50%), Gaps = 8/110 (7%)

Query: 82  VKRLAGD---GVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VT 136
           +  +AGD   GV G  DG P +A +   P    +D  GN+Y+AD SNH IRK+   G +T
Sbjct: 120 ISTVAGDSSVGVAG--DGGPATAAQLSYPTGIDIDTAGNLYIADSSNHRIRKVDTTGNIT 177

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           TIAG G+      DG    A+  N      V     + I+D  N  IR++
Sbjct: 178 TIAGDGTPGF-AGDGKIATAAQLNAPTQVMVDSTGQVYIADTSNHRIRKV 226



 Score = 43.9 bits (102), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 3/61 (4%)

Query: 82  VKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
           +  +AGDG  G++ DG+   +A+ + P    VD  G +Y+AD SNH IRK+ T+  +TT+
Sbjct: 176 ITTIAGDGTPGFAGDGKIATAAQLNAPTQVMVDSTGQVYIADTSNHRIRKVSTDNIITTM 235

Query: 139 A 139
           A
Sbjct: 236 A 236


>gi|290995070|ref|XP_002680154.1| predicted protein [Naegleria gruberi]
 gi|284093774|gb|EFC47410.1| predicted protein [Naegleria gruberi]
          Length = 636

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 61/114 (53%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   ++  +AG G  GYS DG    SA+ + P   A+   G IY+AD +N+ IRKIT  
Sbjct: 401 ITTSGIISTIAGTGTSGYSGDGSSATSAQLNSPYGIAISSSGEIYIADTNNNRIRKITTS 460

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++TIAG G++        A +A   N + +  +     + ++D+ N  IR+I
Sbjct: 461 GIISTIAGTGTQGYSGDGSSATSAQLYNPYGVA-ISSRGEIYVADYNNNRIRKI 513



 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 60/114 (52%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   ++  +AG G QGYS DG    SA+   P   A+  +G IYVAD +N+ IRKIT  
Sbjct: 457 ITTSGIISTIAGTGTQGYSGDGSSATSAQLYNPYGVAISSRGEIYVADYNNNRIRKITTS 516

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++TIAG G+         A +A   N + +  +     + I+D  N  IR++
Sbjct: 517 GIISTIAGTGTSGYSGDGSSAISAQLYNPYGVA-ISSSDEIYITDTNNNRIRKL 569



 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 16/144 (11%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   ++  +AG+G  GYS DG    SA+ + P   A+     IY+AD  N+ IRKIT  
Sbjct: 239 ITTSGIISTIAGNGTAGYSGDGSSATSAQLNSPSGIAISSSDEIYIADMFNNRIRKITTS 298

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
           G ++TIAG G+       G + + S +   +L F P+  A      + I+D  N  IR+I
Sbjct: 299 GIISTIAGTGTS------GYSGDGSSATSIQLYF-PYGVAVSLSDEIYIADMFNNRIRKI 351

Query: 187 NLKPEDCSKSSQSGSALGAVSVWV 210
                  + +   G  L A + ++
Sbjct: 352 TTSGIISTIAGGIGDGLSATTAYI 375



 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 56/117 (47%), Gaps = 12/117 (10%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++   ++  +AG+G  GYS     SA+   P   A+      Y+AD +NH IRKIT  G 
Sbjct: 129 ITTSGIISTIAGNGTAGYSGDVATSAKLYYPSGIAISSSDETYIADTNNHRIRKITTSGI 188

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQI 186
           ++TIAG G+       G + + S +   +L +   +       + I D  N  IR+I
Sbjct: 189 ISTIAGNGTA------GYSGDGSSAKSAQLYYPSGVAISSSDEIYIVDRSNNRIRKI 239



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   ++  +AG+G  GYS DG    SA+   P   A+     IY+ D+SN+ IRKIT  
Sbjct: 183 ITTSGIISTIAGNGTAGYSGDGSSAKSAQLYYPSGVAISSSDEIYIVDRSNNRIRKITTS 242

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++TIAG G+      DG +  ++  N      +     + I+D  N  IR+I
Sbjct: 243 GIISTIAGNGTAGY-SGDGSSATSAQLNSPSGIAISSSDEIYIADMFNNRIRKI 295



 Score = 50.1 bits (118), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 59/123 (47%), Gaps = 24/123 (19%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   ++  +AG+G  GYS DG    SA+   P   A+     IY+ D+SN+ IRKIT  
Sbjct: 73  ITTSGIISTIAGNGTAGYSGDGSSAKSAQLYYPSGVAISSSDEIYIVDRSNNRIRKITTS 132

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELT---FVPHICAL------LISDHGNQLI 183
           G ++TIAG G+            A +S D   +   + P   A+       I+D  N  I
Sbjct: 133 GIISTIAGNGT------------AGYSGDVATSAKLYYPSGIAISSSDETYIADTNNHRI 180

Query: 184 RQI 186
           R+I
Sbjct: 181 RKI 183



 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 54/112 (48%), Gaps = 6/112 (5%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++   ++  +AG    G  DG   +  +    +F     G IY+AD +NH IRKIT  G 
Sbjct: 351 ITTSGIISTIAG----GIGDGLSATTAYINAITFEFSSSGEIYIADTNNHRIRKITTSGI 406

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           ++TIAG G+         A +A  ++ + +  +     + I+D  N  IR+I
Sbjct: 407 ISTIAGTGTSGYSGDGSSATSAQLNSPYGIA-ISSSGEIYIADTNNNRIRKI 457



 Score = 40.0 bits (92), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 44/93 (47%), Gaps = 12/93 (12%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           SA+   P   A+      Y+AD +NH IRKIT  G ++TIAG G+       G + + S 
Sbjct: 43  SAKLYYPSGIAISSSDETYIADTNNHRIRKITTSGIISTIAGNGTA------GYSGDGSS 96

Query: 159 SNDFELTFVPHIC-----ALLISDHGNQLIRQI 186
           +   +L +   +       + I D  N  IR+I
Sbjct: 97  AKSAQLYYPSGVAISSSDEIYIVDRSNNRIRKI 129


>gi|116624994|ref|YP_827150.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116228156|gb|ABJ86865.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 912

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/108 (39%), Positives = 62/108 (57%), Gaps = 5/108 (4%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  LAG GV GYS DG P  SA+ + P++ AVD  GN+YVAD +N+ +RKI   G +TT+
Sbjct: 511 ISTLAGTGVSGYSGDGGPATSAQLNGPQAVAVDGSGNVYVADTANNRVRKIGPTGLITTV 570

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG G        GPA +A   N   L     +  + I+D G+  +R++
Sbjct: 571 AGNGIGGFSGDGGPATSAQVGNPNGLAL-DSVGNVFITD-GSARVRKL 616



 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 50/105 (47%), Gaps = 3/105 (2%)

Query: 87  GDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           GDG QG++   G P       P S  VD  GN+Y AD  N+ IRK++   V T AG G  
Sbjct: 294 GDGTQGFAGDGGAPNKVEMSLPTSVQVDSSGNLYFADSLNNRIRKLSGGNVNTYAGNGIV 353

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
                 G A NA  +    +  V     L +SD  N L+R+++ K
Sbjct: 354 ARSGDGGAATNAQLNTPLGVA-VDAAGNLYVSDTLNNLVRRVDTK 397



 Score = 53.9 bits (128), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 59/109 (54%), Gaps = 4/109 (3%)

Query: 81  VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTT 137
           V+  +AG G  G++   G   SA+ + P S A+D  GN+Y+A+ SN+ IRK+ TN  ++T
Sbjct: 454 VMSTVAGSGTSGFAGDGGAATSAQLNAPFSVALDAAGNLYIAEFSNNRIRKVATNGNIST 513

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +AG G        GPA +A   N  +   V     + ++D  N  +R+I
Sbjct: 514 LAGTGVSGYSGDGGPATSAQL-NGPQAVAVDGSGNVYVADTANNRVRKI 561



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 45/85 (52%), Gaps = 1/85 (1%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           + + P+  AVD  GN+Y+AD  NH +RK++   ++T+AG G+       G A +A  +  
Sbjct: 422 QLNNPQGLAVDSAGNLYIADTQNHRVRKVSGGVMSTVAGSGTSGFAGDGGAATSAQLNAP 481

Query: 162 FELTFVPHICALLISDHGNQLIRQI 186
           F +        L I++  N  IR++
Sbjct: 482 FSVAL-DAAGNLYIAEFSNNRIRKV 505



 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 67/133 (50%), Gaps = 10/133 (7%)

Query: 76  LSEESVVKRLAGDG---VQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ + ++   AG+G   V G+    G    A    P + AVD   N+Y+A  +++ +R++
Sbjct: 113 VNPQGIISTFAGNGNVSVPGFWGDSGAATDASIHLPVAIAVDSSNNVYIAASADNTVRRV 172

Query: 131 TNLGVTTI-AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           T  G+  I AG G K      G A  A  +   ++TF P    LLI+D GN +IRQ+   
Sbjct: 173 TTDGIINIFAGAGYKGYYGDAGAAGLAGLTGPQDITFGPK-GVLLIADTGNAVIRQVG-- 229

Query: 190 PEDCSKSSQSGSA 202
             D   S+ SG+A
Sbjct: 230 -TDGVISTVSGNA 241



 Score = 43.9 bits (102), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 5/117 (4%)

Query: 32  YTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS--FPLSEESVVKRLAGDG 89
           YT++TV  G        ++    G    I L      F + +  F +     +  +AG+ 
Sbjct: 11  YTISTVAGGAPPPTPVAALSTSIGQPRKIALSGGNMYFSSGNSVFKIDGSGTLTLVAGNS 70

Query: 90  VQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
             G+S DG P  +A+ + P+  A+D  GN+Y+AD  N+ +RK+   G ++T AG G+
Sbjct: 71  RAGFSGDGGPAVNAQLNSPQGVALDSAGNLYIADSQNNRVRKVNPQGIISTFAGNGN 127



 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 3/62 (4%)

Query: 82  VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTI 138
           V   AG+G+   S   G   +A+ + P   AVD  GN+YV+D  N+++R++   GV TT 
Sbjct: 344 VNTYAGNGIVARSGDGGAATNAQLNTPLGVAVDAAGNLYVSDTLNNLVRRVDTKGVITTF 403

Query: 139 AG 140
           AG
Sbjct: 404 AG 405



 Score = 37.0 bits (84), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 44/95 (46%), Gaps = 1/95 (1%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           SA+   P   A+D  GN+++ D S  V +   +  ++TIAGGG++      G A  A  +
Sbjct: 587 SAQVGNPNGLALDSVGNVFITDGSARVRKLFISGIISTIAGGGNRGYSGDGGNAFAAQLN 646

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
               L  +    AL ++D  N  +R + +     S
Sbjct: 647 GPSGLA-INSTGALFVADALNNAVRMLQISASGIS 680


>gi|290972086|ref|XP_002668792.1| predicted protein [Naegleria gruberi]
 gi|284082314|gb|EFC36048.1| predicted protein [Naegleria gruberi]
          Length = 679

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/112 (36%), Positives = 57/112 (50%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G   +A+F+ P S  V  K  +Y+ D SNH IRKI  N  
Sbjct: 39  ENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITDYSNHSIRKILENGN 98

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+       GPA NA   N   + FV     +  +D  N  IR+I
Sbjct: 99  IITIAGNGTVGFSGDSGPATNAQLYNPSSV-FVSSKNEVYFTDQHNNRIRKI 149



 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/112 (37%), Positives = 57/112 (50%), Gaps = 5/112 (4%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G   +A+   P S  V     +Y+ D SNH IRKI  N  
Sbjct: 151 ENGNIITIAGNGTYGFSGDNGPATNAQLYNPYSVFVSSNNEVYITDYSNHRIRKILENGN 210

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+      +GPA NA   N     FV +   + ISD  NQ IR+I
Sbjct: 211 IVTIAGNGNYGFSGDNGPATNAQL-NRPNSVFVSN-NEVYISDQSNQRIRKI 260



 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 57/112 (50%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S   G   +A+   P S  V  K  +Y  D+ N+ IRKI  N  
Sbjct: 95  ENGNIITIAGNGTVGFSGDSGPATNAQLYNPSSVFVSSKNEVYFTDQHNNRIRKILENGN 154

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+      +GPA NA   N + + FV     + I+D+ N  IR+I
Sbjct: 155 IITIAGNGTYGFSGDNGPATNAQLYNPYSV-FVSSNNEVYITDYSNHRIRKI 205



 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/112 (36%), Positives = 60/112 (53%), Gaps = 6/112 (5%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G   +A+ ++P S  V     +Y++D+SN  IRKI  N  
Sbjct: 207 ENGNIVTIAGNGNYGFSGDNGPATNAQLNRPNSVFVS-NNEVYISDQSNQRIRKILENGN 265

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+      +GPA NA   N     FV +   + ISD  NQ IR+I
Sbjct: 266 IITIAGNGNYGFSGDNGPATNAQL-NRPNSVFVSN-NEVYISDQSNQRIRKI 315



 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 45/87 (51%), Gaps = 2/87 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           A+   P S  V     +Y+AD  NH IRKI  N  + TIAG G+      +GPA NA F+
Sbjct: 8   AQLFNPSSVFVSSNNEVYIADFCNHRIRKILENGNIVTIAGNGNYGFSGDNGPATNAQFN 67

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
               + FV     + I+D+ N  IR+I
Sbjct: 68  YPCSV-FVSSKNEVYITDYSNHSIRKI 93



 Score = 45.1 bits (105), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 52/104 (50%), Gaps = 18/104 (17%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G   +A+ ++P S  V     +Y++D+SN  IRKI  N  
Sbjct: 262 ENGNIITIAGNGNYGFSGDNGPATNAQLNRPNSVFVS-NNEVYISDQSNQRIRKILENGN 320

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND--FELTFVPHICALLIS 176
           + TIAG G+            A FS D  F +   PHI   L++
Sbjct: 321 IVTIAGNGT------------AGFSGDSPFNIIMYPHIGNKLLT 352


>gi|290982061|ref|XP_002673749.1| predicted protein [Naegleria gruberi]
 gi|284087335|gb|EFC41005.1| predicted protein [Naegleria gruberi]
          Length = 185

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/126 (34%), Positives = 62/126 (49%), Gaps = 6/126 (4%)

Query: 74  FPLSEESVVKRLAGDGV-QGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIR--- 128
           F    + V + +AG+G+ Q   DG  G+ A  + P+  A+DM GN+Y+A++  H IR   
Sbjct: 16  FTSHAQYVTQTVAGNGISQSLGDGFLGTQASLNSPRDVALDMYGNMYIAEEKAHKIRFIN 75

Query: 129 KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           K T + ++T+ G G        G A NA     F L        L ISD GN +IR IN+
Sbjct: 76  KTTGI-ISTLVGNGIAGSVGDGGLAINAQLHLPFSLAIDSTGQNLFISDSGNHVIRHINM 134

Query: 189 KPEDCS 194
                S
Sbjct: 135 TTRIIS 140


>gi|116620673|ref|YP_822829.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116223835|gb|ABJ82544.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 912

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 49/188 (26%), Positives = 87/188 (46%), Gaps = 10/188 (5%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTL---SFPLS 77
           L+  G +    Y ++T+  G        +     G    + +D++ + +++     F + 
Sbjct: 10  LLFCGSVWAQQYVISTIAGGAAPPTPVAAAKASVGDPTRVAVDAAGNVYFSSLHSIFKVD 69

Query: 78  EESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
               + R AG+G  G S   G+  SA+   P   AVD  GNI+VAD+  +V+R+I   G 
Sbjct: 70  SSGTMTRFAGNGRPGNSGDGGQASSAQLLFPMGLAVDAAGNIFVADRDANVVRRIATTGI 129

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           + T+AG G+       GPA  A  +  F +  V     + I+D  N ++R++     D +
Sbjct: 130 IQTVAGNGTPGYQGDGGPATTAQLNAPFAVA-VDGQGNVFIADTNNVVVRRVT---PDGT 185

Query: 195 KSSQSGSA 202
            S+ +GS 
Sbjct: 186 ISTYAGSG 193



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 37/115 (32%), Positives = 65/115 (56%), Gaps = 4/115 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ + V+  +AG G  G+S   G   +A+ ++P+  AVD +GN+YVAD  N+ IRKI  L
Sbjct: 524 VTTDGVIHTIAGFGTPGFSGDGGAATAAKLNRPRGVAVDAQGNVYVADTGNNRIRKIDPL 583

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G ++T+AG GS +    DG A     + D     V     + +++ G+  +R+++
Sbjct: 584 GNISTVAGDGSTEFIPGDGIATQQGLT-DPRGVAVDRAGNIYVAETGHNRVRRVS 637



 Score = 58.2 bits (139), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 39/114 (34%), Positives = 66/114 (57%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   +++ +AG+G  GY  DG P + A+ + P + AVD +GN+++AD +N V+R++T  
Sbjct: 124 IATTGIIQTVAGNGTPGYQGDGGPATTAQLNAPFAVAVDGQGNVFIADTNNVVVRRVTPD 183

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++T AG G++      G A+NA F    E   V     L I+D  N  IR++
Sbjct: 184 GTISTYAGSGARGFAGDGGAARNAWFDGP-EGVAVDANGVLYIADTFNGRIRRV 236



 Score = 50.8 bits (120), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 39/116 (33%), Positives = 62/116 (53%), Gaps = 7/116 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++ +  +   AG G +G++ G+ G+AR   FD P+  AVD  G +Y+AD  N  IR++  
Sbjct: 180 VTPDGTISTYAGSGARGFA-GDGGAARNAWFDGPEGVAVDANGVLYIADTFNGRIRRVAA 238

Query: 133 LG-VTTIAGGGSKKE-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G +TT AG GS    G  +GP  +A+ S   ++  V       I+D GN  +R +
Sbjct: 239 DGTITTAAGVGSTGVFGGDNGPPASAALSLPTDVA-VDRSGNPYIADFGNSRVRMV 293



 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 39/105 (37%), Positives = 56/105 (53%), Gaps = 4/105 (3%)

Query: 86  AGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAGGG 142
           AG+G   Y  DG P + A  ++P+  A D  GN Y+AD  ++V+RK+T  GV  TIAG G
Sbjct: 478 AGNGNASYFGDGLPATQASVNQPEGVAGDAAGNTYIADTFDNVVRKVTTDGVIHTIAGFG 537

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +      DG A  A+  N      V     + ++D GN  IR+I+
Sbjct: 538 TPGFS-GDGGAATAAKLNRPRGVAVDAQGNVYVADTGNNRIRKID 581



 Score = 47.0 bits (110), Expect = 0.007,   Method: Composition-based stats.
 Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 2/61 (3%)

Query: 85  LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           LAG+G+  +  DG   + A+FD P   A+D  GN+ + D  N  +RKI+   +TTIAG G
Sbjct: 366 LAGNGLASFGGDGAAATTAQFDTPTGVAIDAGGNVLIVDSQNQRLRKISRGVITTIAGTG 425

Query: 143 S 143
           +
Sbjct: 426 T 426



 Score = 46.6 bits (109), Expect = 0.011,   Method: Composition-based stats.
 Identities = 28/73 (38%), Positives = 34/73 (46%), Gaps = 9/73 (12%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG-------- 140
           GV G  +G P SA    P   AVD  GN Y+AD  N  +R + N  +TT+AG        
Sbjct: 252 GVFGGDNGPPASAALSLPTDVAVDRSGNPYIADFGNSRVRMVANGVITTVAGRNNGAPIV 311

Query: 141 -GGSKKEGRADGP 152
            G      R DGP
Sbjct: 312 EGEEAVNARLDGP 324



 Score = 43.5 bits (101), Expect = 0.10,   Method: Composition-based stats.
 Identities = 39/129 (30%), Positives = 60/129 (46%), Gaps = 8/129 (6%)

Query: 82  VKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  +AGDG   +   DG         P+  AVD  GNIYVA+  ++ +R+++  G +TTI
Sbjct: 586 ISTVAGDGSTEFIPGDGIATQQGLTDPRGVAVDRAGNIYVAETGHNRVRRVSTGGTITTI 645

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
           AG G        G    A  +  + +  V     + ++D GN  IR   L P   S + Q
Sbjct: 646 AGNGQCCYTGDGGLGTAAQLNQPWGIA-VDSAGNIYVADSGNNAIRL--LAP--VSANIQ 700

Query: 199 SGSALGAVS 207
            G+ + A S
Sbjct: 701 VGAVVNAAS 709



 Score = 41.6 bits (96), Expect = 0.38,   Method: Composition-based stats.
 Identities = 43/147 (29%), Positives = 63/147 (42%), Gaps = 18/147 (12%)

Query: 17  LLFSLVSSGL--LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF 74
           LL +L  +GL     DG   TT     Q +      ID  G  +++++DS       +S 
Sbjct: 362 LLTTLAGNGLASFGGDGAAATTA----QFDTPTGVAIDAGG--NVLIVDSQNQRLRKIS- 414

Query: 75  PLSEESVVKRLAGDGVQGYSDGE---PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
                 V+  +AG G  G+ +GE   P +A+ + P+  A D  GN +VAD  N  +R+  
Sbjct: 415 ----RGVITTIAGTGTAGF-NGEVVLPATAQLNTPRGVAADAYGNYFVADTGNRRVREGQ 469

Query: 132 NLGVTTIAGGGSKKEGRADG-PAQNAS 157
             G      G        DG PA  AS
Sbjct: 470 PGGNFFTKAGNGNASYFGDGLPATQAS 496



 Score = 38.9 bits (89), Expect = 2.2,   Method: Composition-based stats.
 Identities = 26/79 (32%), Positives = 38/79 (48%), Gaps = 2/79 (2%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S    +  +AG+G   Y+ DG  G+A + ++P   AVD  GNIYVAD  N+ IR +  +
Sbjct: 636 VSTGGTITTIAGNGQCCYTGDGGLGTAAQLNQPWGIAVDSAGNIYVADSGNNAIRLLAPV 695

Query: 134 GVTTIAGGGSKKEGRADGP 152
                 G          GP
Sbjct: 696 SANIQVGAVVNAASNLPGP 714


>gi|294817139|ref|ZP_06775781.1| Receptor protein kinase [Streptomyces clavuligerus ATCC 27064]
 gi|294321954|gb|EFG04089.1| Receptor protein kinase [Streptomyces clavuligerus ATCC 27064]
          Length = 510

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 64/114 (56%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  + G GV G+S DG P +A R ++P   AVD  G +YV + +N+ IRK+   
Sbjct: 126 ITADGTISTVVGTGVAGFSGDGGPATAARLNRPFGVAVDSTGVLYVTEYNNNRIRKVAAD 185

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++T+AG G+      DGPA +A  +  + +  V    A+ I+D  N  IR+I
Sbjct: 186 GTISTVAGTGATGNRGDDGPAASAQLNRPYGVA-VDGAGAVYIADTDNHRIRKI 238



 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 62/108 (57%), Gaps = 4/108 (3%)

Query: 82  VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  +AG GV G++  +G   +A+ ++P   AVD  G +Y +D +NH +RKIT  G V+T+
Sbjct: 20  ISTVAGTGVAGFTGDNGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRKITTDGKVSTV 79

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG GS      +GPA +A  +   E+  V    A+ ++D  N  +R+I
Sbjct: 80  AGIGSAGYRGDNGPAVSAQLNCPREVA-VDGAGAVYVTDAENHRVRKI 126



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 60/114 (52%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  +AG G  G    DG   SA+ ++P   AVD  G +Y+AD  NH IRKIT  
Sbjct: 182 VAADGTISTVAGTGATGNRGDDGPAASAQLNRPYGVAVDGAGAVYIADTDNHRIRKITAD 241

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++T+AG G+       GPA  A  +    +  +     L ++D+GN  IR+I
Sbjct: 242 GTISTVAGTGTGGFDGDGGPATAARLNRPMGV-VIDSTGTLYVADYGNHRIRKI 294



 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  V  +AG G  GY   +G   SA+ + P+  AVD  G +YV D  NH +RKIT  
Sbjct: 70  ITTDGKVSTVAGIGSAGYRGDNGPAVSAQLNCPREVAVDGAGAVYVTDAENHRVRKITAD 129

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++T+ G G        GPA  A  +  F +  V     L ++++ N  IR++
Sbjct: 130 GTISTVVGTGVAGFSGDGGPATAARLNRPFGVA-VDSTGVLYVTEYNNNRIRKV 182



 Score = 45.4 bits (106), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           +AR ++P    +D  G +YVAD  NH IRKIT  G + T+AG G+   G   GP  +A  
Sbjct: 264 AARLNRPMGVVIDSTGTLYVADYGNHRIRKITTDGKIGTVAGTGTGGFGGDGGPPASAQV 323

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           +N   +  V  +  L I+D+GN  +R++
Sbjct: 324 NNSLGVA-VDCVDTLYIADYGNHRVRKV 350


>gi|326445969|ref|ZP_08220703.1| hypothetical protein SclaA2_33112 [Streptomyces clavuligerus ATCC
           27064]
          Length = 508

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 64/114 (56%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  + G GV G+S DG P +A R ++P   AVD  G +YV + +N+ IRK+   
Sbjct: 124 ITADGTISTVVGTGVAGFSGDGGPATAARLNRPFGVAVDSTGVLYVTEYNNNRIRKVAAD 183

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++T+AG G+      DGPA +A  +  + +  V    A+ I+D  N  IR+I
Sbjct: 184 GTISTVAGTGATGNRGDDGPAASAQLNRPYGVA-VDGAGAVYIADTDNHRIRKI 236



 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 62/108 (57%), Gaps = 4/108 (3%)

Query: 82  VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  +AG GV G++  +G   +A+ ++P   AVD  G +Y +D +NH +RKIT  G V+T+
Sbjct: 18  ISTVAGTGVAGFTGDNGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRKITTDGKVSTV 77

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG GS      +GPA +A  +   E+  V    A+ ++D  N  +R+I
Sbjct: 78  AGIGSAGYRGDNGPAVSAQLNCPREVA-VDGAGAVYVTDAENHRVRKI 124



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 60/114 (52%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  +AG G  G    DG   SA+ ++P   AVD  G +Y+AD  NH IRKIT  
Sbjct: 180 VAADGTISTVAGTGATGNRGDDGPAASAQLNRPYGVAVDGAGAVYIADTDNHRIRKITAD 239

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++T+AG G+       GPA  A  +    +  +     L ++D+GN  IR+I
Sbjct: 240 GTISTVAGTGTGGFDGDGGPATAARLNRPMGV-VIDSTGTLYVADYGNHRIRKI 292



 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  V  +AG G  GY   +G   SA+ + P+  AVD  G +YV D  NH +RKIT  
Sbjct: 68  ITTDGKVSTVAGIGSAGYRGDNGPAVSAQLNCPREVAVDGAGAVYVTDAENHRVRKITAD 127

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++T+ G G        GPA  A  +  F +  V     L ++++ N  IR++
Sbjct: 128 GTISTVVGTGVAGFSGDGGPATAARLNRPFGVA-VDSTGVLYVTEYNNNRIRKV 180



 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           +AR ++P    +D  G +YVAD  NH IRKIT  G + T+AG G+   G   GP  +A  
Sbjct: 262 AARLNRPMGVVIDSTGTLYVADYGNHRIRKITTDGKIGTVAGTGTGGFGGDGGPPASAQV 321

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           +N   +  V  +  L I+D+GN  +R++
Sbjct: 322 NNSLGVA-VDCVDTLYIADYGNHRVRKV 348


>gi|411003633|ref|ZP_11379962.1| hypothetical protein SgloC_12561 [Streptomyces globisporus C-1027]
          Length = 605

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 59/112 (52%), Gaps = 7/112 (6%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
            +V D++R     L    ++   V+R  G G +G SDG PG ARF +P+  AV   G + 
Sbjct: 188 FLVSDTTRHRLVELD---ADGETVRRHFGTGERGLSDGGPGEARFSEPQGLAVLPDGRVA 244

Query: 119 VADKSNHVIRKIT-NLGVT-TIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
           VAD  NH IR +    GVT T+AG G +  +     GPA+    S+ +++ +
Sbjct: 245 VADTVNHAIRALDLTTGVTSTLAGTGRQWWQGTPTSGPARGVDLSSPWDVAW 296


>gi|254390149|ref|ZP_05005369.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
           27064]
 gi|197703856|gb|EDY49668.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
           27064]
          Length = 527

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 64/114 (56%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  + G GV G+S DG P +A R ++P   AVD  G +YV + +N+ IRK+   
Sbjct: 143 ITADGTISTVVGTGVAGFSGDGGPATAARLNRPFGVAVDSTGVLYVTEYNNNRIRKVAAD 202

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++T+AG G+      DGPA +A  +  + +  V    A+ I+D  N  IR+I
Sbjct: 203 GTISTVAGTGATGNRGDDGPAASAQLNRPYGVA-VDGAGAVYIADTDNHRIRKI 255



 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 62/108 (57%), Gaps = 4/108 (3%)

Query: 82  VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  +AG GV G++  +G   +A+ ++P   AVD  G +Y +D +NH +RKIT  G V+T+
Sbjct: 37  ISTVAGTGVAGFTGDNGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRKITTDGKVSTV 96

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG GS      +GPA +A  +   E+  V    A+ ++D  N  +R+I
Sbjct: 97  AGIGSAGYRGDNGPAVSAQLNCPREVA-VDGAGAVYVTDAENHRVRKI 143



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 60/114 (52%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  +AG G  G    DG   SA+ ++P   AVD  G +Y+AD  NH IRKIT  
Sbjct: 199 VAADGTISTVAGTGATGNRGDDGPAASAQLNRPYGVAVDGAGAVYIADTDNHRIRKITAD 258

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++T+AG G+       GPA  A  +    +  +     L ++D+GN  IR+I
Sbjct: 259 GTISTVAGTGTGGFDGDGGPATAARLNRPMGV-VIDSTGTLYVADYGNHRIRKI 311



 Score = 53.9 bits (128), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  V  +AG G  GY   +G   SA+ + P+  AVD  G +YV D  NH +RKIT  
Sbjct: 87  ITTDGKVSTVAGIGSAGYRGDNGPAVSAQLNCPREVAVDGAGAVYVTDAENHRVRKITAD 146

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++T+ G G        GPA  A  +  F +  V     L ++++ N  IR++
Sbjct: 147 GTISTVVGTGVAGFSGDGGPATAARLNRPFGVA-VDSTGVLYVTEYNNNRIRKV 199



 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           +AR ++P    +D  G +YVAD  NH IRKIT  G + T+AG G+   G   GP  +A  
Sbjct: 281 AARLNRPMGVVIDSTGTLYVADYGNHRIRKITTDGKIGTVAGTGTGGFGGDGGPPASAQV 340

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           +N   +  V  +  L I+D+GN  +R++
Sbjct: 341 NNSLGVA-VDCVDTLYIADYGNHRVRKV 367


>gi|116619867|ref|YP_822023.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116223029|gb|ABJ81738.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 592

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 54/112 (48%), Gaps = 2/112 (1%)

Query: 77  SEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +  +++   AG+G QG+S   G+   A F+ P + AVD  G +YV D+SN  IR+I   G
Sbjct: 184 AASNIISTFAGNGSQGFSGDGGQAAQASFNNPTALAVDAAGTVYVTDQSNQRIRRIDTGG 243

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           V T   G        DG +  A+  N      +     L I D  NQ +R++
Sbjct: 244 VITTVAGNGNAAFSGDGGSATAASLNYPGGIVLDSSGTLYIVDSVNQRVRKV 295



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 63/125 (50%), Gaps = 4/125 (3%)

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADK 122
           SR     L+  ++    +  +AG+G  G++   G+   A+ ++      D  GN+Y+A++
Sbjct: 3   SRFVLPLLAAAIASGQTITTVAGNGTAGFAGDTGQATQAQINRAVGLVTDANGNLYLAEE 62

Query: 123 SNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
            N+ +RK+   GV TT+AG G+       GPA  A  +    L  +     + +SD GN+
Sbjct: 63  LNNRVRKVDTGGVITTLAGIGTAGFAGDGGPAAQAQLNGPLGLC-IDGSGNIYVSDQGNK 121

Query: 182 LIRQI 186
            +R+I
Sbjct: 122 RVRKI 126



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 47/88 (53%), Gaps = 5/88 (5%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           F  P   AVD  GN+ + D+  H IRK+   +N+ ++T AG GS+      G A  ASF+
Sbjct: 155 FAIPIRCAVDKSGNLLIVDQGAHKIRKVEAASNI-ISTFAGNGSQGFSGDGGQAAQASFN 213

Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
           N   L  V     + ++D  NQ IR+I+
Sbjct: 214 NPTALA-VDAAGTVYVTDQSNQRIRRID 240



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 51/108 (47%), Gaps = 3/108 (2%)

Query: 81  VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           V+  +AG+G   +S   G   +A  + P    +D  G +Y+ D  N  +RK++   ++T+
Sbjct: 244 VITTVAGNGNAAFSGDGGSATAASLNYPGGIVLDSSGTLYIVDSVNQRVRKVSGTTISTV 303

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG G+       G A  A  +N F +T       L + D  N  +R+I
Sbjct: 304 AGTGTAGFSGDGGAALQAQLNNPFPIT-ADGSGNLYVGDVSNNRVRKI 350



 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 47/130 (36%), Positives = 65/130 (50%), Gaps = 16/130 (12%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           V+  LAG G  G++ DG P + A+ + P    +D  GNIYV+D+ N  +RKI   G +TT
Sbjct: 75  VITTLAGIGTAGFAGDGGPAAQAQLNGPLGLCIDGSGNIYVSDQGNKRVRKIAPSGTITT 134

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPE 191
           +AG GS   G   G A  A F+       +P  CA      LLI D G   IR++     
Sbjct: 135 VAGSGSAASGGDGGAATAAGFA-------IPIRCAVDKSGNLLIVDQGAHKIRKVEAASN 187

Query: 192 DCSKSSQSGS 201
             S  + +GS
Sbjct: 188 IISTFAGNGS 197



 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 11/88 (12%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESV----VKRLAGDGVQGYSDGEPGSA---RF 103
           ++ PG    IVLDSS + +   S       V    +  +AG G  G+S G+ G+A   + 
Sbjct: 268 LNYPGG---IVLDSSGTLYIVDSVNQRVRKVSGTTISTVAGTGTAGFS-GDGGAALQAQL 323

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + P     D  GN+YV D SN+ +RKIT
Sbjct: 324 NNPFPITADGSGNLYVGDVSNNRVRKIT 351


>gi|423214677|ref|ZP_17201205.1| hypothetical protein HMPREF1074_02737 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692583|gb|EIY85820.1| hypothetical protein HMPREF1074_02737 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 419

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/152 (30%), Positives = 73/152 (48%), Gaps = 20/152 (13%)

Query: 56  SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
           S + IV D +++     S       +S +   K +AGDGV+G  Y DGEPG   SA+   
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGEPGNPLSAKVGA 329

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT--------NLGVTTIAGGGSKKEGRADGPAQNAS 157
               A    G +Y++D + + IRK+T           + TIAG G  K G +DG    A+
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETIAGSG--KAGFSDGKGLKAT 387

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           F+  +E+        + ++   N LIR+I +K
Sbjct: 388 FNQPYEIIITEDCKTMYVAGAVNYLIRRITVK 419


>gi|262405589|ref|ZP_06082139.1| NHL repeat containing protein [Bacteroides sp. 2_1_22]
 gi|294644531|ref|ZP_06722285.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
 gi|294805793|ref|ZP_06764670.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
 gi|345510784|ref|ZP_08790345.1| hypothetical protein BSAG_00303 [Bacteroides sp. D1]
 gi|262356464|gb|EEZ05554.1| NHL repeat containing protein [Bacteroides sp. 2_1_22]
 gi|292640084|gb|EFF58348.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
 gi|294447014|gb|EFG15604.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
 gi|345454355|gb|EEO48593.2| hypothetical protein BSAG_00303 [Bacteroides sp. D1]
          Length = 419

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/152 (30%), Positives = 73/152 (48%), Gaps = 20/152 (13%)

Query: 56  SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
           S + IV D +++     S       +S +   K +AGDGV+G  Y DGEPG   SA+   
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGEPGNPLSAKVGA 329

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT--------NLGVTTIAGGGSKKEGRADGPAQNAS 157
               A    G +Y++D + + IRK+T           + TIAG G  K G +DG    A+
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETIAGSG--KAGFSDGKGLKAT 387

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           F+  +E+        + ++   N LIR+I +K
Sbjct: 388 FNQPYEIIITEDCKTMYVAGAVNYLIRRITVK 419


>gi|452992643|emb|CCQ95895.1| hypothetical protein CULT_320031 [Clostridium ultunense Esp]
          Length = 329

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/138 (33%), Positives = 68/138 (49%), Gaps = 21/138 (15%)

Query: 53  RPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSAR---------- 102
           R    ++ +LDS   A   ++    ++  V+ +AG G  GY+ G+ G A+          
Sbjct: 182 RGPEGNIYILDSKNYAVRKINI---DKETVELIAGTGKPGYT-GDGGDAKDATFGGNKES 237

Query: 103 -FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSN 160
            FD P S ++D  GNIYV D  NHV+R ITN G V TIAG  +  +GR     +N +   
Sbjct: 238 YFDGPWSLSLDEVGNIYVGDTQNHVVRMITNKGKVYTIAGNINVIKGR-----KNRTNEK 292

Query: 161 DFELTFVPHICALLISDH 178
           D     +P IC++   D 
Sbjct: 293 DLFKINLPLICSMDYYDR 310


>gi|336403610|ref|ZP_08584322.1| hypothetical protein HMPREF0127_01635 [Bacteroides sp. 1_1_30]
 gi|335945536|gb|EGN07347.1| hypothetical protein HMPREF0127_01635 [Bacteroides sp. 1_1_30]
          Length = 419

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/152 (30%), Positives = 73/152 (48%), Gaps = 20/152 (13%)

Query: 56  SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
           S + IV D +++     S       +S +   K +AGDGV+G  Y DGEPG   SA+   
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGEPGNPLSAKVGA 329

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT--------NLGVTTIAGGGSKKEGRADGPAQNAS 157
               A    G +Y++D + + IRK+T           + TIAG G  K G +DG    A+
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETIAGSG--KAGFSDGKGLKAT 387

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           F+  +E+        + ++   N LIR+I +K
Sbjct: 388 FNQPYEIIITEDCKTMYVAGAVNYLIRRITVK 419


>gi|428185691|gb|EKX54543.1| hypothetical protein GUITHDRAFT_63359, partial [Guillardia theta
           CCMP2712]
          Length = 305

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/143 (34%), Positives = 70/143 (48%), Gaps = 9/143 (6%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           ++P  +   P  S L+V DS      ++S+    E VV   AG G +G+ DG    A+FD
Sbjct: 57  LHPKGICPLP--SGLLVCDSGHHRIRSVSY--DGERVVA-FAGSGKRGHRDGPVQVAQFD 111

Query: 105 KPKSFAV-DMKGNIYVADKSNHVIRKITNLGVTTIAGGGS--KKEGRADGPAQNASFSND 161
            P S  V     +I VAD  N+ +R+I N  VTT+AGG    +  G  DG ++ A F   
Sbjct: 112 TPCSICVCPSDKSIIVADSGNNAVRRIANGMVTTLAGGSGPDRAGGFVDGESEGAKFRRP 171

Query: 162 FELTFVPHICALLISDHGNQLIR 184
             + F      LL+ D GN  +R
Sbjct: 172 TFVMFDKEE-TLLVIDSGNHCLR 193



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 58/136 (42%), Gaps = 18/136 (13%)

Query: 56  SSDLIVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
           +S ++V D+  +A   ++      PL E    K  A             SA +  PK   
Sbjct: 16  TSVVVVADTGNNALRLVALGRQGGPLVEGICTKSFAA------------SASWLHPKGIC 63

Query: 111 VDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
             +   + V D  +H IR ++  G   +A  GS K G  DGP Q A F     +   P  
Sbjct: 64  -PLPSGLLVCDSGHHRIRSVSYDGERVVAFAGSGKRGHRDGPVQVAQFDTPCSICVCPSD 122

Query: 171 CALLISDHGNQLIRQI 186
            +++++D GN  +R+I
Sbjct: 123 KSIIVADSGNNAVRRI 138



 Score = 44.3 bits (103), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 61/128 (47%), Gaps = 4/128 (3%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           L+V+DS       +S    E   V+ LAG    G +DG   +   + P++  +   G+I 
Sbjct: 182 LLVIDSGNHCLRVMSPDWKE---VRTLAGGPKAGGTDGAVDTCELNHPEASCLLEDGSIL 238

Query: 119 VADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDH 178
           +AD+ N+ IR++     +  +  G+   G  DG  + ++F+    +  +     ++ISD 
Sbjct: 239 IADRENNKIRRLDGDLRSLSSWAGNGCWGATDGLIEESTFNKPCGVCCLEDG-TIVISDS 297

Query: 179 GNQLIRQI 186
           GN  IR +
Sbjct: 298 GNNCIRLV 305


>gi|329935738|ref|ZP_08285543.1| hypothetical protein SGM_1035 [Streptomyces griseoaurantiacus M045]
 gi|329304829|gb|EGG48702.1| hypothetical protein SGM_1035 [Streptomyces griseoaurantiacus M045]
          Length = 617

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 64/113 (56%), Gaps = 7/113 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S +L+V D++R     L+     ES V+R+ G G +G++DG   +A F +P+  A+   G
Sbjct: 198 SGNLLVSDTTRHQLVELA--ADGESEVRRI-GSGARGFADGTAEAAAFSEPQGLALLEDG 254

Query: 116 NIYVADKSNHVIRK--ITNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFEL 164
           ++ VAD  NH +R+  +   GVTT+AG G +  +     GPA+    S+ +++
Sbjct: 255 SVVVADTVNHALRRLDVATGGVTTLAGTGRQWWQGSPTSGPAREVDLSSPWDV 307



 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 28/102 (27%), Positives = 52/102 (50%), Gaps = 3/102 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           GDG     + EP + RF  P    +   GN+ V+D + H + ++   G + +   GS   
Sbjct: 173 GDGPYVAPEPEPTTLRF--PGKALLLPSGNLLVSDTTRHQLVELAADGESEVRRIGSGAR 230

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           G ADG A+ A+FS    L  +    +++++D  N  +R++++
Sbjct: 231 GFADGTAEAAAFSEPQGLALL-EDGSVVVADTVNHALRRLDV 271


>gi|399034109|ref|ZP_10732477.1| gluconolactonase [Flavobacterium sp. CF136]
 gi|398067543|gb|EJL59038.1| gluconolactonase [Flavobacterium sp. CF136]
          Length = 418

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/168 (28%), Positives = 76/168 (45%), Gaps = 19/168 (11%)

Query: 38  IDGHQLEINPHSVIDRP----------GSSDLIVLDSSRSAFYTLS-----FPLSEESVV 82
           ++G+  +  P  V+ RP          G++  + LDS  + + +         L+   ++
Sbjct: 212 VNGYIYKFTPSGVLTRPRNNEGALIRVGAASGMCLDSKNNLYISEDATSKIMKLTPTGIL 271

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
              AG    G+ +G    A+F KP     D   N+YV D+ N+ IRKI   G V+T+AG 
Sbjct: 272 TIFAGANGTGFQNGNIKEAKFSKPSGIVFDKMDNLYVVDRYNNRIRKIAVDGTVSTVAGS 331

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G +  G  DG    A+F+   E+T +     L I +  N  IR I  K
Sbjct: 332 GIR--GNKDGITSEATFNFPQEIT-IDKYNNLYIVEPDNNRIRMITSK 376



 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 33/59 (55%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++ +  V  +AG G++G  DG    A F+ P+   +D   N+Y+ +  N+ IR IT+ G
Sbjct: 319 IAVDGTVSTVAGSGIRGNKDGITSEATFNFPQEITIDKYNNLYIVEPDNNRIRMITSKG 377


>gi|290986743|ref|XP_002676083.1| predicted protein [Naegleria gruberi]
 gi|284089683|gb|EFC43339.1| predicted protein [Naegleria gruberi]
          Length = 733

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 56/108 (51%), Gaps = 4/108 (3%)

Query: 82  VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
           +  +AG G  G+S  +G   +A+   P S  V     +Y+AD+SNH IRKI  N  + TI
Sbjct: 43  IATIAGKGTCGFSGDNGPATNAQIYYPSSVFVSSNNEVYIADQSNHRIRKILENGNIVTI 102

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG G       +GPA NA     + + FV     + I D+GN  +R+I
Sbjct: 103 AGNGIGGFSGDNGPATNAQIYYPYSV-FVSSNNVVYIVDYGNNRVRKI 149



 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 57/112 (50%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG 134
           E   +  +AG+G+ G+S  +G   +A+   P S  V     +Y+ D  N+ +RKI  N  
Sbjct: 95  ENGNIVTIAGNGIGGFSGDNGPATNAQIYYPYSVFVSSNNVVYIVDYGNNRVRKILGNGN 154

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+      +GPA NA  +N   + FV     + I+D  N  IR+I
Sbjct: 155 IVTIAGNGTSGFSGDNGPATNAQLNNPVGV-FVSSNNEVYIADQSNHRIRKI 205



 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 59/105 (56%), Gaps = 5/105 (4%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
           +AG+G  G+S  +G   +A+ + P    V     +Y+AD+SNH IRKI  N  + TIAG 
Sbjct: 158 IAGNGTSGFSGDNGPATNAQLNNPVGVFVSSNNEVYIADQSNHRIRKILENGNIVTIAGN 217

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G+   G  +GPA NA     + + FV +   + I D+GN  IR+I
Sbjct: 218 GTGGFGGDNGPATNAQLYIPYSV-FVSN-NEVYIVDYGNNRIRKI 260



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 48/91 (52%), Gaps = 18/91 (19%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
           +AG+G  G+S  +G   +A+ ++P S  V     +Y+AD +NH IRKI  N  + TIAG 
Sbjct: 269 IAGNGTSGFSGDNGPATNAQLNRPSSVFVS-NNEVYIADLNNHKIRKILENGNIITIAGN 327

Query: 142 GSKKEGRADGPAQNASFSND--FELTFVPHI 170
           G+K             FS D  F++   PHI
Sbjct: 328 GTK------------GFSGDSPFDIRMYPHI 346



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 5/108 (4%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           A+   P    V     +Y+AD SNH IRKI  N  + TIAG G+      +GPA NA   
Sbjct: 8   AQLYNPVGIFVSSNNEVYIADYSNHRIRKILKNGNIATIAGKGTCGFSGDNGPATNAQIY 67

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVS 207
               + FV     + I+D  N  IR+I    E+ +  + +G+ +G  S
Sbjct: 68  YPSSV-FVSSNNEVYIADQSNHRIRKI---LENGNIVTIAGNGIGGFS 111


>gi|320103301|ref|YP_004178892.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
 gi|319750583|gb|ADV62343.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
          Length = 676

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 4/111 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI--TNLGVTTI 138
           +K + G+G  G  DG    A F++P+   +D ++  +YVAD  NH IR I  T   VTT+
Sbjct: 236 LKAVVGNGKIGMRDGAYERASFNRPQGIRLDALRNRLYVADTENHAIRAIDLTTRSVTTV 295

Query: 139 AGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           AG G     G   GPA+    ++ ++L  +P     L++  G   I +I+ 
Sbjct: 296 AGTGEMVYPGLPGGPARRFGLNSPWDLVQIPETNQFLVAMAGTHQIYKIDF 346



 Score = 40.4 bits (93), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 42/82 (51%), Gaps = 7/82 (8%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           L V D+  +    + F  +    ++ L GD  +G +D  P   RFD+P   +V + G +Y
Sbjct: 441 LYVADTYNNKIKVIDFS-ARTKTIRTLVGDRRRGDTDDPP---RFDQPGGLSV-LGGQLY 495

Query: 119 VADKSNHVIR--KITNLGVTTI 138
           VAD +NH IR   +  L V T+
Sbjct: 496 VADTNNHKIRVVDLATLAVKTL 517


>gi|441163236|ref|ZP_20968263.1| hypothetical protein SRIM_29923 [Streptomyces rimosus subsp.
           rimosus ATCC 10970]
 gi|440616375|gb|ELQ79517.1| hypothetical protein SRIM_29923 [Streptomyces rimosus subsp.
           rimosus ATCC 10970]
          Length = 238

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 64/117 (54%), Gaps = 4/117 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ + ++  +AG+G+ GY SDG P +A +   P   A D  GN+Y+AD  N+ +RK+   
Sbjct: 71  VTPQGIITTVAGNGIAGYVSDGGPATATQLSSPCGVAADGAGNLYIADLGNNRVRKVDTK 130

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G +TT+AG G+       GPA  A+  N      V     + I+D+ N  +R+++ K
Sbjct: 131 GIITTVAGNGTGGYVSDGGPAT-ATQLNGPHSVAVDRDGNVYIADYHNHRVRKVDSK 186



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 40/62 (64%), Gaps = 2/62 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +  + ++  +AG+G  GY SDG P +A + + P S AVD  GN+Y+AD  NH +RK+ + 
Sbjct: 127 VDTKGIITTVAGNGTGGYVSDGGPATATQLNGPHSVAVDRDGNVYIADYHNHRVRKVDSK 186

Query: 134 GV 135
           G+
Sbjct: 187 GL 188


>gi|357485931|ref|XP_003613253.1| hypothetical protein MTR_5g034550 [Medicago truncatula]
 gi|355514588|gb|AES96211.1| hypothetical protein MTR_5g034550 [Medicago truncatula]
          Length = 154

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 63/124 (50%), Gaps = 7/124 (5%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
           + +G  G+ D +   AR + PK   VD +GNIYVAD  N  IRKI+ LG          +
Sbjct: 11  SAEGYSGHVDEKLREARMNHPKGITVDDRGNIYVADIINMAIRKIS-LGNNMTYLSFLYE 69

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           E      +    +   F++ +V    +LL+ D G Q IR+I L+ +DC+   +S   LG 
Sbjct: 70  E------SLILFYLLLFDVIYVGSSYSLLVIDRGKQAIREIQLRFDDCAYQYESRFPLGK 123

Query: 206 VSVW 209
           ++ +
Sbjct: 124 LNKF 127


>gi|424869138|ref|ZP_18292858.1| uncharacterized protein [Leptospirillum sp. Group II 'C75']
 gi|124514239|gb|EAY55753.1| conserved protein of unknown function [Leptospirillum rubarum]
 gi|387220840|gb|EIJ75456.1| uncharacterized protein [Leptospirillum sp. Group II 'C75']
          Length = 389

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/153 (32%), Positives = 64/153 (41%), Gaps = 8/153 (5%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP  +   P    L V DS  +    +         V  +AG       DG    A F+ 
Sbjct: 132 NPEGMAISPDGKTLYVADSRNNMIRKIDL---ATKTVSTIAGHSFPSSGDGVGKEAGFET 188

Query: 106 PKSFAVDMKGN-IYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           P+  A+   G  +YVAD  N+ IRKI   TN  VTT+AG G    G ADG    A+F   
Sbjct: 189 PRGLAISPDGKTLYVADSGNNAIRKIDLATNT-VTTLAGAGKLMSGSADGVGVQATFHEP 247

Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
             L        L I+D  N LIR++ L     S
Sbjct: 248 RSLAISGDGQVLYIADTRNNLIRKMVLATNSVS 280



 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 5/111 (4%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
           Y DG   +ARF+ P+  A+   G  +YVAD  N++IRKI      V+TIAG      G  
Sbjct: 120 YRDGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKIDLATKTVSTIAGHSFPSSG-- 177

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
           DG  + A F     L   P    L ++D GN  IR+I+L     +  + +G
Sbjct: 178 DGVGKEAGFETPRGLAISPDGKTLYVADSGNNAIRKIDLATNTVTTLAGAG 228



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 54/121 (44%), Gaps = 18/121 (14%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---------------- 134
           +G  DG    ARF+ P+       G IY+AD  N +IRKI++ G                
Sbjct: 47  RGAIDGPGREARFEFPQGILAAPDGTIYIADTGNDMIRKISSNGLSGATPTPAELHHLSV 106

Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
            V  IAG   +   R DG   NA F+N   +   P    L ++D  N +IR+I+L  +  
Sbjct: 107 DVEDIAGSDHRARYR-DGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKIDLATKTV 165

Query: 194 S 194
           S
Sbjct: 166 S 166


>gi|410478931|ref|YP_006766568.1| hypothetical protein LFML04_1390 [Leptospirillum ferriphilum ML-04]
 gi|406774183|gb|AFS53608.1| hypothetical protein LFML04_1390 [Leptospirillum ferriphilum ML-04]
          Length = 427

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/153 (32%), Positives = 64/153 (41%), Gaps = 8/153 (5%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP  +   P    L V DS  +    +         V  +AG       DG    A F+ 
Sbjct: 170 NPEGMAISPDGKTLYVADSRNNMIRKIDL---ATKTVSTIAGHSFPSSGDGVGKEAGFET 226

Query: 106 PKSFAVDMKGN-IYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           P+  A+   G  +YVAD  N+ IRKI   TN  VTT+AG G    G ADG    A+F   
Sbjct: 227 PRGLAISPDGKTLYVADSGNNAIRKIDLATNT-VTTLAGAGKLMSGSADGVGVQATFHEP 285

Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
             L        L I+D  N LIR++ L     S
Sbjct: 286 RSLAISGDGQVLYIADTRNNLIRKMVLATNSVS 318



 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 5/111 (4%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
           Y DG   +ARF+ P+  A+   G  +YVAD  N++IRKI      V+TIAG      G  
Sbjct: 158 YRDGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKIDLATKTVSTIAGHSFPSSG-- 215

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
           DG  + A F     L   P    L ++D GN  IR+I+L     +  + +G
Sbjct: 216 DGVGKEAGFETPRGLAISPDGKTLYVADSGNNAIRKIDLATNTVTTLAGAG 266



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 54/121 (44%), Gaps = 18/121 (14%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---------------- 134
           +G  DG    ARF+ P+       G IY+AD  N +IRKI++ G                
Sbjct: 85  RGAIDGPGREARFEFPQGILAAPDGTIYIADTGNDMIRKISSNGLSGATPTPAELHHLSV 144

Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
            V  IAG   +   R DG   NA F+N   +   P    L ++D  N +IR+I+L  +  
Sbjct: 145 DVEDIAGSDHRARYR-DGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKIDLATKTV 203

Query: 194 S 194
           S
Sbjct: 204 S 204


>gi|318059218|ref|ZP_07977941.1| hypothetical protein SSA3_14816 [Streptomyces sp. SA3_actG]
 gi|318079124|ref|ZP_07986456.1| hypothetical protein SSA3_21010 [Streptomyces sp. SA3_actF]
          Length = 610

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 67/124 (54%), Gaps = 8/124 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG +  +V D++R A   L    ++   V+R  G G +G  DG+   ARF++P
Sbjct: 183 PGKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGSGERGLRDGDAAGARFNEP 238

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGR-ADGPAQNASFSNDF 162
           +  A+   G + VAD  NH +R++   +  V+T+AG G +  +G    GPA+  S S+ +
Sbjct: 239 QGLALLPDGAVVVADTVNHALRRVDLASGEVSTLAGTGRQWWQGEPVAGPAREVSLSSPW 298

Query: 163 ELTF 166
           ++ +
Sbjct: 299 DVAW 302


>gi|197118445|ref|YP_002138872.1| lipoprotein [Geobacter bemidjiensis Bem]
 gi|197087805|gb|ACH39076.1| repeat-containing lipoprotein, putative [Geobacter bemidjiensis
           Bem]
          Length = 652

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/108 (39%), Positives = 59/108 (54%), Gaps = 8/108 (7%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
           V  LAG  GV G ++G  G+ARF  P   A D   N+YVAD  NH +RKI   +  VTT+
Sbjct: 162 VTTLAGTAGVVGSTNGTGGAARFSSPSGVATDGA-NLYVADSLNHRVRKIVLASAAVTTL 220

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG G +    ADG    ASF++   +       +L ++D GN  +R+I
Sbjct: 221 AGSGLQD--FADGTGIAASFNSPRGI--ATDGVSLYLADQGNSAVRRI 264



 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 56/124 (45%), Gaps = 8/124 (6%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEG 147
           G  G +DG   +A F  P     D + N+YVAD +NH +R+I+   G  T   G   + G
Sbjct: 375 GTVGSADGVGVAAAFTSPYDLTTDGR-NVYVADTNNHTVRQISIATGAVTTLAGTPDRAG 433

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS----KSSQSGSAL 203
             DG    ASF   F          L +SD GN  +R+I +     S     +  +GSA 
Sbjct: 434 SGDGTGAAASFR--FPSGLTTDGTNLFVSDTGNNTVRKIVIATGAVSTLAGTAGATGSAD 491

Query: 204 GAVS 207
           GA S
Sbjct: 492 GAGS 495



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 45/102 (44%), Gaps = 2/102 (1%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEG 147
           G  G +DG    ARF  P   A D   ++YV D  N VIRK+    G T+   G +   G
Sbjct: 58  GSAGSTDGAASQARFSIPGDVASDGT-SLYVVDTGNSVIRKVVLATGATSTVAGLAGITG 116

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             DG    A F+    +        L +SD GN  IR IN+ 
Sbjct: 117 SGDGTGAVARFNYPSGIALSSDNSTLYVSDTGNNTIRSINVA 158



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 11/112 (9%)

Query: 82  VKRLAGDG--VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTT 137
           V  LAGDG    G +D   G+ARF  P    ++  G +YVAD  N ++RK  +     +T
Sbjct: 314 VSTLAGDGGGSPGSTDAVGGAARFSAPAGLVLN-AGVLYVADTGNDLLRKVAVATGASST 372

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           +AG      G ADG    A+F++ ++LT       + ++D  N  +RQI++ 
Sbjct: 373 LAG----TVGSADGVGVAAAFTSPYDLTTDGR--NVYVADTNNHTVRQISIA 418



 Score = 45.1 bits (105), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 53/123 (43%), Gaps = 15/123 (12%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           V  LAG  G  G +DG   +ARF  P     D   N+YVAD  N+ IRKI       IA 
Sbjct: 477 VSTLAGTAGATGSADGAGSAARFHSPNGLTTDGT-NLYVADSGNNAIRKI------VIAT 529

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
           G           A +A  S+ + +T       L I+D GN  I +  L     S  + +G
Sbjct: 530 GAVSTA-----VASSAGLSSPYGVT--TDGTTLFITDSGNHRICKFTLAGSAFSSLTVTG 582

Query: 201 SAL 203
           +A 
Sbjct: 583 AAF 585


>gi|444306537|ref|ZP_21142300.1| NHL repeat containing protein [Arthrobacter sp. SJCon]
 gi|443481173|gb|ELT44105.1| NHL repeat containing protein [Arthrobacter sp. SJCon]
          Length = 673

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 52/166 (31%), Positives = 78/166 (46%), Gaps = 10/166 (6%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDL-IVLDSSRSAFYTLSF-PLSEESVVKRLA 86
           ED    T  + GH LE++  S  D   S  L  V+ +        SF P+S E  V  +A
Sbjct: 324 EDAAGFTGRLSGHPLEVSLSSPWDLVWSRKLNAVVIAMAGTHQIFSFDPVSGE--VDIIA 381

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGVTTIAGG 141
           G+G++G  DG    A F +P   A D  GNI+VAD     +RK+      ++ V +  G 
Sbjct: 382 GNGLEGLLDGPAHEAWFAQPSGLAEDGDGNIWVADSETSALRKLVIDDAGSITVESALGK 441

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G    G  DGPA  A   +   +T +P   ++ I+D  N  +R+ +
Sbjct: 442 GLFDFGFRDGPAVEARLQHPLGVTVLPD-GSVAIADTYNGAVRRYD 486


>gi|294630411|ref|ZP_06708971.1| NHL repeat protein [Streptomyces sp. e14]
 gi|292833744|gb|EFF92093.1| NHL repeat protein [Streptomyces sp. e14]
          Length = 609

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 66/115 (57%), Gaps = 7/115 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S + +V D++R     L+     ESVV+R+ G G +G++DG    A F++P+  A+   G
Sbjct: 193 SGNFLVSDTTRHQLVELA--ADGESVVRRI-GSGARGFADGAAEDAAFNEPQGLALLDDG 249

Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGRA-DGPAQNASFSNDFELTF 166
            + VAD  NH +R+       +TT+AG G +  +G A +GPA+  + S+ +++ +
Sbjct: 250 AVVVADTVNHALRRFDPATGRITTLAGTGRQWMQGEATEGPAREVNLSSPWDVAW 304



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 50/101 (49%), Gaps = 3/101 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           GDG     + EP + RF  P    +   GN  V+D + H + ++   G + +   GS   
Sbjct: 168 GDGPYVAPEPEPTTLRF--PGKALLLPSGNFLVSDTTRHQLVELAADGESVVRRIGSGAR 225

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G ADG A++A+F+    L  +    A++++D  N  +R+ +
Sbjct: 226 GFADGAAEDAAFNEPQGLALLDD-GAVVVADTVNHALRRFD 265


>gi|290990564|ref|XP_002677906.1| NHL domain-containing protein kinase [Naegleria gruberi]
 gi|284091516|gb|EFC45162.1| NHL domain-containing protein kinase [Naegleria gruberi]
          Length = 2733

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 58/109 (53%), Gaps = 6/109 (5%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TT 137
           +  +AG G  GYS G+ G   SA+F +P+  AV   G IY+AD  NHV+RKI+   V +T
Sbjct: 548 IATVAGSGASGYS-GDGGLLTSAKFQQPQGVAVSSNGEIYIADTENHVVRKISTSNVLST 606

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           IAG GS       G A  A   +   +  +     + I+D+ N  IR+I
Sbjct: 607 IAGTGSYGYNGDGGLAITAKLFSPIGVA-ISSSGEVFIADNNNHRIRKI 654



 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 57/113 (50%), Gaps = 6/113 (5%)

Query: 81  VVKRLAGDGVQ----GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
           V++R+    +     G  DG P +  + + +SF +   G IY+AD  NH IRK++ LG +
Sbjct: 762 VIRRVISGNISTIAGGVGDGGPATGGYIQAQSFDISSTGEIYIADTENHRIRKVSTLGKI 821

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           +TIAG G+       G A  A  S+   +  V     + I+D  N  IR++ L
Sbjct: 822 STIAGTGAMGYSGDGGLAITAKLSSPVGVA-VSSTGEVFIADRDNHRIRKVTL 873



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 61/116 (52%), Gaps = 8/116 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +S  +V+  +AG G  GY +G+ G   +A+   P   A+   G +++AD +NH IRKI  
Sbjct: 598 ISTSNVLSTIAGTGSYGY-NGDGGLAITAKLFSPIGVAISSSGEVFIADNNNHRIRKIAK 656

Query: 133 LG-VTTIAGGG-SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G ++TI G G +  +G  D  A NA   N   +T  P    L I+D  N  IR++
Sbjct: 657 DGYISTIVGTGLNGYDGDGD-LATNAKLGNPQGVTLSP-TGELFIADQNNHRIRKV 710



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/109 (34%), Positives = 56/109 (51%), Gaps = 7/109 (6%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           +  +AG G  GYS G+ G   +A+   P   AV   G +++AD+ NH IRK+T  G +TT
Sbjct: 821 ISTIAGTGAMGYS-GDGGLAITAKLSSPVGVAVSSTGEVFIADRDNHRIRKVTLSGIITT 879

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           IAG G+         A+ A  ++   +T       + I+D  N  IRQI
Sbjct: 880 IAGNGTSGFNSDGIEAKFAQLNSPSSVTISGG--EIYIADTNNHRIRQI 926



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 65/132 (49%), Gaps = 7/132 (5%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKG 115
           D+ + D+  +    +SF      ++  +AG G+ G + DG    SA+ + P   AV   G
Sbjct: 471 DVYIADTYNNRIRKVSF---STGIISTVAGTGIAGQNGDGNLATSAQLNFPSGVAVSSNG 527

Query: 116 NIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           +IY+AD  NH IRKI TN  + T+AG G+       G   +A F    +   V     + 
Sbjct: 528 DIYIADTENHRIRKIETNGYIATVAGSGASGYSGDGGLLTSAKFQQP-QGVAVSSNGEIY 586

Query: 175 ISDHGNQLIRQI 186
           I+D  N ++R+I
Sbjct: 587 IADTENHVVRKI 598



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 63/114 (55%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++++  +  + G G+ GY  DG+  + A+   P+   +   G +++AD++NH IRK+ + 
Sbjct: 654 IAKDGYISTIVGTGLNGYDGDGDLATNAKLGNPQGVTLSPTGELFIADQNNHRIRKVASN 713

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++T+AG G+         A NA  ++   + F   I  + I+D  N++IR++
Sbjct: 714 GYISTVAGNGNFGFSGDGDLATNAELNSPSGIAF-SSIGTMYIADRLNRVIRRV 766



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/65 (43%), Positives = 41/65 (63%), Gaps = 4/65 (6%)

Query: 81  VVKRLAGDGVQGY-SDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           ++  +AG+G  G+ SDG E   A+ + P S  +   G IY+AD +NH IR+I+N G + T
Sbjct: 876 IITTIAGNGTSGFNSDGIEAKFAQLNSPSSVTIS-GGEIYIADTNNHRIRQISNTGIIKT 934

Query: 138 IAGGG 142
           IAG G
Sbjct: 935 IAGNG 939



 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 57/120 (47%), Gaps = 17/120 (14%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++   ++  +AG G  GY DG+     +A+   P    V+  G++Y+AD  N+ IRKI+ 
Sbjct: 230 IATNGIITTIAGTGTHGY-DGDGALAINAQLYSPTGVVVNSNGDVYIADSGNNRIRKISK 288

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
             ++T+AG G        G + +A  +        P   A      ++I+D  N  IR+I
Sbjct: 289 GYISTVAGNGIAGYSGDGGLSTSAQLA-------TPQSVAINLNGEIIIADSNNNRIRKI 341



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 56/122 (45%), Gaps = 8/122 (6%)

Query: 72  LSFP----LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNH 125
           L+FP    ++    +  +AG G   Y   +     A+   P   A+D  G  Y++D  NH
Sbjct: 166 LNFPSSIAITTNGNISTIAGTGTGSYGGDNAIATGAQLYAPACIAIDSNGEFYISDSMNH 225

Query: 126 VIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
            IRKI TN  +TTIAG G+         A NA   +   +  V     + I+D GN  IR
Sbjct: 226 RIRKIATNGIITTIAGTGTHGYDGDGALAINAQLYSPTGVV-VNSNGDVYIADSGNNRIR 284

Query: 185 QI 186
           +I
Sbjct: 285 KI 286



 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 64/143 (44%), Gaps = 14/143 (9%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARF 103
           NP  V   P + +L + D +          ++    +  +AG+G  G+S DG+  + A  
Sbjct: 685 NPQGVTLSP-TGELFIADQNNHRIR----KVASNGYISTVAGNGNFGFSGDGDLATNAEL 739

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
           + P   A    G +Y+AD+ N VIR++ +  ++TIAGG         G  Q  SF     
Sbjct: 740 NSPSGIAFSSIGTMYIADRLNRVIRRVISGNISTIAGGVGDGGPATGGYIQAQSFD---- 795

Query: 164 LTFVPHICALLISDHGNQLIRQI 186
              +     + I+D  N  IR++
Sbjct: 796 ---ISSTGEIYIADTENHRIRKV 815



 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 31/46 (67%), Gaps = 1/46 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSK 144
           +A+   P   A+ + G IY+AD +N+ IRKI TN  +TTIAG G++
Sbjct: 107 AAQLFYPYGVALSLGGEIYIADSNNNRIRKIATNGIITTIAGTGTR 152



 Score = 37.7 bits (86), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 59/116 (50%), Gaps = 17/116 (14%)

Query: 80  SVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-V 135
           ++V R+ G   +GY+  D    SA  +  +S AV+  G++Y+AD  N+ IRK++ + G +
Sbjct: 435 NIVSRVVGS--EGYNGDDIAATSALLNYARSIAVNSNGDVYIADTYNNRIRKVSFSTGII 492

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQI 186
           +T+AG G        G   + + +   +L F   +       + I+D  N  IR+I
Sbjct: 493 STVAGTG------IAGQNGDGNLATSAQLNFPSGVAVSSNGDIYIADTENHRIRKI 542


>gi|302520584|ref|ZP_07272926.1| NHL repeat containing protein [Streptomyces sp. SPB78]
 gi|302429479|gb|EFL01295.1| NHL repeat containing protein [Streptomyces sp. SPB78]
          Length = 634

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 66/124 (53%), Gaps = 8/124 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG +  +V D++R A   L    ++   V+R  G G +G  DG+   ARF++P
Sbjct: 183 PGKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGSGERGLRDGDAAGARFNEP 238

Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSK-KEGR-ADGPAQNASFSNDF 162
           +  A+   G + VAD  NH +R++      V+T+AG G +  +G    GPA+  S S+ +
Sbjct: 239 QGLALLPDGAVVVADTVNHALRRVDPASGEVSTLAGTGRQWWQGEPVAGPAREVSLSSPW 298

Query: 163 ELTF 166
           ++ +
Sbjct: 299 DVAW 302


>gi|428183007|gb|EKX51866.1| hypothetical protein GUITHDRAFT_84912 [Guillardia theta CCMP2712]
          Length = 351

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 1/105 (0%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGS 143
           LAG G  G++DG+  S+ F++P   +    G  +YVAD  N+ +RK+      T+   G 
Sbjct: 64  LAGTGTPGFADGDWNSSSFNQPCGVSTSPDGQWLYVADSGNNRVRKVNTTSGATLTLVGK 123

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
              G ADG A   S S    +   P    L I+D  +Q IRQ+ L
Sbjct: 124 GDLGSADGEAATVSLSTPLSVAASPDGRYLYIADSLSQRIRQVRL 168



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 49/89 (55%), Gaps = 5/89 (5%)

Query: 102 RFDKPKSFAVDMKG-NIYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRADGPAQNASF 158
           RF+ P   A+      +YV D+ NH IRK++  +   TT+AG G+   G ADG   ++SF
Sbjct: 25  RFNCPYGLALSPDDLWLYVGDRDNHRIRKVSMRDGSTTTLAGTGTP--GFADGDWNSSSF 82

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
           +    ++  P    L ++D GN  +R++N
Sbjct: 83  NQPCGVSTSPDGQWLYVADSGNNRVRKVN 111



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 1/106 (0%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGS 143
           L G G  G +DGE  +     P S A    G  +Y+AD  +  IR++      T    GS
Sbjct: 120 LVGKGDLGSADGEAATVSLSTPLSVAASPDGRYLYIADSLSQRIRQVRLADGWTRTLAGS 179

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             +  +DGP   +SF+    +   P    + ++D  N+ +R+I ++
Sbjct: 180 GNQSFSDGPPSESSFNTPAAIAVSPDGLTVYVADLNNERVRKIAVR 225


>gi|329927682|ref|ZP_08281814.1| copper amine oxidase N-terminal domain protein [Paenibacillus sp.
           HGF5]
 gi|328938318|gb|EGG34710.1| copper amine oxidase N-terminal domain protein [Paenibacillus sp.
           HGF5]
          Length = 1251

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 56/104 (53%), Gaps = 4/104 (3%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           A  ++P   AVD KG +Y+AD  NH IRKI   G V TIAG G  + G  DG AQ+A F 
Sbjct: 109 AFLNRPSGIAVDGKGQVYIADSGNHAIRKIDQAGRVNTIAGNG--RIGLKDGKAQDALFH 166

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
              ++  V     L ++D  N +IR+I+   E  +  S S  A+
Sbjct: 167 EPQDVA-VTEDGILYVADTLNHVIRRISPDGEVTTIGSPSTRAV 209



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 40/142 (28%), Positives = 60/142 (42%), Gaps = 17/142 (11%)

Query: 55  GSSDLIVLDSSRSAFYTLSF------------PLSEESVVKRLAGDGVQGYSDGEPGSAR 102
           G  +L V DS   A   + F            P S+      L  D   GY DG   +AR
Sbjct: 242 GKGNLYVSDSGNQAIRYIDFGKGTVSTAAGSVPSSDGYAKDALYAD--PGYLDGAADAAR 299

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
           F+ P+  A  ++  + +AD  NH +R++ +  VTT+ GG     G ADG      F+   
Sbjct: 300 FNSPRGLAWSVEDGLLIADSHNHAVRQLKDERVTTMVGG---TRGYADGIESEVRFNAPA 356

Query: 163 ELTFVPHICALLISDHGNQLIR 184
           ++        L I+D  N  +R
Sbjct: 357 DIGVASDSGELFIADQRNGAVR 378



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 49/155 (31%), Positives = 70/155 (45%), Gaps = 15/155 (9%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           +N  S I   G   + + DS   A   +     +   V  +AG+G  G  DG+   A F 
Sbjct: 111 LNRPSGIAVDGKGQVYIADSGNHAIRKID----QAGRVNTIAGNGRIGLKDGKAQDALFH 166

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK--EGRAD-----GPAQNAS 157
           +P+  AV   G +YVAD  NHVIR+I+  G  T  G  S +  + RA      G  +N S
Sbjct: 167 EPQDVAVTEDGILYVADTLNHVIRRISPDGEVTTIGSPSTRAVQVRAGVIALAGDYKNGS 226

Query: 158 FS----NDFELTFVPHICALLISDHGNQLIRQINL 188
            +    N+     +     L +SD GNQ IR I+ 
Sbjct: 227 LAEAQFNEPAGLALDGKGNLYVSDSGNQAIRYIDF 261


>gi|410098158|ref|ZP_11293137.1| hypothetical protein HMPREF1076_02315 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409222431|gb|EKN15373.1| hypothetical protein HMPREF1076_02315 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 446

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 4/104 (3%)

Query: 89  GVQ---GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
           GVQ   G+ DGE   A F++PK    D  G +YVAD  NHVIRKIT  GV +   G +  
Sbjct: 344 GVQNETGWQDGEREEALFNEPKQICFDQDGVMYVADAGNHVIRKITRDGVVSTVIGIAGV 403

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           +G  DG  ++A F     +  +     + + D  N  +R++ ++
Sbjct: 404 KGYVDGSPEDALFQYPTGVA-IDKEGTIYVGDARNNCVRKLAIE 446



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ + VV  + G  GV+GY DG P  A F  P   A+D +G IYV D  N+ +RK+
Sbjct: 388 ITRDGVVSTVIGIAGVKGYVDGSPEDALFQYPTGVAIDKEGTIYVGDARNNCVRKL 443



 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 44/93 (47%), Gaps = 12/93 (12%)

Query: 73  SFPLSEESVVKRLAGDGVQGYS------DGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           +F  + ++ V  +AG  V G        DG    A FD+P    VD + NI+V+++  H 
Sbjct: 127 TFSYTTQTTVSTIAGKPVGGDDGHGEMIDGTLAEASFDQPWFLCVDAEKNIFVSERLGHA 186

Query: 127 IRKITNLG--VTTIAGGGSKKEGRADGPAQNAS 157
           +R+I      V+T+  G    EG    P  +A+
Sbjct: 187 VRQINEEKNVVSTLVKG----EGNCQNPNASAT 215


>gi|428186137|gb|EKX54988.1| hypothetical protein GUITHDRAFT_59145, partial [Guillardia theta
           CCMP2712]
          Length = 231

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/142 (29%), Positives = 64/142 (45%), Gaps = 19/142 (13%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
            + V   AG G+QG+ DG  G ARF +P    +  +  +YVAD +NH IR I  +   V 
Sbjct: 17  NNFVSYFAGSGIQGFRDGAAGDARFSEPLGI-IHFRNKLYVADSNNHAIRSIDPISGEVA 75

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLKP 190
           T+A  G+++ G  DG A  +       L   P          L ++D GN +I+ ++   
Sbjct: 76  TVA--GNQQRGNLDGAASQSQLDTPTSLLNRPEGVVFDSHGHLYVADTGNHVIKMLDFSS 133

Query: 191 --------EDCSKSSQSGSALG 204
                    + S +S  GS  G
Sbjct: 134 GKVLRTIGTEVSSTSNRGSRFG 155


>gi|302543897|ref|ZP_07296239.1| putative NHL repeat protein [Streptomyces hygroscopicus ATCC 53653]
 gi|302461515|gb|EFL24608.1| putative NHL repeat protein [Streptomyces himastatinicus ATCC
           53653]
          Length = 608

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 64/117 (54%), Gaps = 8/117 (6%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
           PG +  +V D++R     L+     ESV++R+ G G +G +DG    ARF +P+  A   
Sbjct: 184 PGGT-FLVSDTTRHQLVELA--ADGESVLRRI-GTGERGLADGRADRARFSEPQGLAALP 239

Query: 114 KGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
            G + VAD  NH +R+       VTT+AG G +  +   A+GPA++   S+ ++L +
Sbjct: 240 DGTVAVADTVNHALRRFDPATGEVTTLAGTGRQWWQGSPAEGPARDIDLSSPWDLAW 296


>gi|332027687|gb|EGI67755.1| NHL repeat-containing protein 2 [Acromyrmex echinatior]
          Length = 686

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 89/197 (45%), Gaps = 20/197 (10%)

Query: 5   FVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDS 64
            +Y+ +AI     L  + + GL L+  + +        L     +V+     + L++ DS
Sbjct: 180 LLYTKIAIAYFKSLKQMSNHGLPLKPAHHLLPSSKDGLLFPGKLTVLQLEQETKLVISDS 239

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
             +     +    E   V+ + G   QG+ DG+  +ARF+ P+   V +   IYVAD +N
Sbjct: 240 GNNRIVITN----EHGRVEHVIGGCNQGFKDGDFKNARFNSPQGVCV-LNNIIYVADNNN 294

Query: 125 HVIRKI--TNLGVTTIAGGGSKKEGRADG----------PAQNASFSNDFELTFVPHICA 172
           H IRKI      V+TIAG GS+   R  G          P   A + ++++ T VP    
Sbjct: 295 HAIRKIDLAKRIVSTIAGTGSQGCDRKGGKHGTDQALSSPWDVAIYHHEYKGTMVP---V 351

Query: 173 LLISDHGNQLIRQINLK 189
           LLI+  G   I  + L+
Sbjct: 352 LLIAIAGTHQIWALFLE 368


>gi|206602867|gb|EDZ39348.1| Conserved protein of unknown function [Leptospirillum sp. Group II
           '5-way CG']
          Length = 374

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 64/153 (41%), Gaps = 8/153 (5%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP  +   P    L + DS  +    +         V  +AG       DG    A F+ 
Sbjct: 117 NPEGMAISPDGKTLYIADSRNNMIRKIDL---ATKTVSTIAGHSFPSSGDGVGKEAGFET 173

Query: 106 PKSFAVDMKGN-IYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           P+  A+   G  +YVAD  N+ IRKI   TN  VTT+AG G    G ADG    A+F   
Sbjct: 174 PRGLAISPDGKTLYVADSGNNAIRKIDLATNT-VTTLAGAGKLMSGSADGVGVQATFHEP 232

Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
             L        L I+D  N LIR++ L     S
Sbjct: 233 RSLAISGDGQVLYIADTRNNLIRKMVLATNSVS 265



 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 55/111 (49%), Gaps = 5/111 (4%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
           Y DG   +ARF+ P+  A+   G  +Y+AD  N++IRKI      V+TIAG      G  
Sbjct: 105 YRDGVGANARFNNPEGMAISPDGKTLYIADSRNNMIRKIDLATKTVSTIAGHSFPSSG-- 162

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
           DG  + A F     L   P    L ++D GN  IR+I+L     +  + +G
Sbjct: 163 DGVGKEAGFETPRGLAISPDGKTLYVADSGNNAIRKIDLATNTVTTLAGAG 213



 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 3/106 (2%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGR 148
           +G  DG   +ARF+ P+       G IY+AD  N +IRK+  +   V  +AG   +   R
Sbjct: 47  RGAIDGAGTNARFEFPQGILAAPDGTIYIADTGNDMIRKMNPSTKSVENVAGSDHRARYR 106

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
            DG   NA F+N   +   P    L I+D  N +IR+I+L  +  S
Sbjct: 107 -DGVGANARFNNPEGMAISPDGKTLYIADSRNNMIRKIDLATKTVS 151


>gi|224536973|ref|ZP_03677512.1| hypothetical protein BACCELL_01849 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521412|gb|EEF90517.1| hypothetical protein BACCELL_01849 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 446

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 51/97 (52%), Gaps = 1/97 (1%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
           +GY+DG    A+F++P+    D   N YV D++N++IRK+   G  +   G  ++ G AD
Sbjct: 347 KGYTDGPGSKAQFNEPQQGCFDNDDNFYVCDQNNNLIRKVEPSGQVSTFAGRREEWGWAD 406

Query: 151 GP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G   + A F   F + +  +     I+D  N+ IR I
Sbjct: 407 GDLRKEARFDRPFGIAYNRNTSEFYIADKNNKRIRII 443


>gi|410096564|ref|ZP_11291551.1| hypothetical protein HMPREF1076_00729 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409226528|gb|EKN19437.1| hypothetical protein HMPREF1076_00729 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 457

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 59/117 (50%), Gaps = 1/117 (0%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S P   ++ ++ + G G     +G   +A+  + +  A D +GN+Y+AD +NHVI K+  
Sbjct: 342 STPWITQNEIEHIVGTGKGAAKEGNGKAAQLGEIEGMAADQEGNVYLADYTNHVIWKVDE 401

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
               TI  G   + G  DG  Q A F+  +++   P    L ++D  N LIR I ++
Sbjct: 402 EFNATIFAGVPGESGYKDGKPQEALFNKPYDVAATPD-GILYVADTYNYLIRCIAIQ 457


>gi|196228192|ref|ZP_03127059.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
 gi|196227595|gb|EDY22098.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
          Length = 357

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 46/141 (32%), Positives = 76/141 (53%), Gaps = 8/141 (5%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
           + ++ ++  +AG G +G + DG P +A  F+   +F V   G++ +AD  N+++R+I   
Sbjct: 71  VGKDGILHVIAGTGAKGATGDGGPATAATFNGIHNFVVLRNGDLLLADSFNNLLRRIDAK 130

Query: 133 LGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
            GV TT+AGG  K      GPA++A FS   E+   P    L  +D GN+ +R I+L   
Sbjct: 131 TGVITTVAGGVKKGFAGDGGPAKDALFSTLIEIALNPAGTKLYCADIGNRRVRCIDLATN 190

Query: 192 DCSKSSQSGSAL----GAVSV 208
             +  + +G A     GAV+V
Sbjct: 191 SITTVAGNGKAGVPADGAVAV 211



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 39/110 (35%), Positives = 53/110 (48%), Gaps = 7/110 (6%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVT 136
           +K + G G +G S D  PG  A  + PK  A+D  G++ +AD  NHVIR+    T L +T
Sbjct: 248 IKTVVGTGAKGLSGDAGPGLQATMNGPKYIALDRDGSVLIADAENHVIRRYSPKTGL-IT 306

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            +AG G    G  DG            +T  P   AL I+D  N  I +I
Sbjct: 307 RVAGTGKPGNGGLDGDPLQCQLHRPHGVTVGPD-GALYITDSYNDRILKI 355


>gi|153808835|ref|ZP_01961503.1| hypothetical protein BACCAC_03135 [Bacteroides caccae ATCC 43185]
 gi|149128661|gb|EDM19879.1| NHL repeat protein [Bacteroides caccae ATCC 43185]
          Length = 426

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 50/98 (51%), Gaps = 1/98 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY DG+  +ARF+ P     D+ G +Y+AD SNH IR I   G  +   G   + G  DG
Sbjct: 330 GYEDGKRLNARFNFPSQICFDLDGIMYIADSSNHCIRSIDREGAVSTVIGVPGRAGYVDG 389

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
              +A F   + +  V     + I+D  N+ IR++ ++
Sbjct: 390 TPDDALFDEPWGVA-VDEEGTIYIADTKNKCIRKLAIQ 426



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 26/42 (61%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G  GY DG P  A FD+P   AVD +G IY+AD  N  IRK+
Sbjct: 382 GRAGYVDGTPDDALFDEPWGVAVDEEGTIYIADTKNKCIRKL 423


>gi|440789641|gb|ELR10946.1| NHL repeat containing 2 isoform 3, putative [Acanthamoeba
           castellanii str. Neff]
          Length = 934

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 74/161 (45%), Gaps = 9/161 (5%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           PH V +DR G   + V DS       LS   +  + +   + D   G  DG+  +ARF  
Sbjct: 298 PHGVAVDREGDR-MFVADSGHHRILVLSLDGAFRTAIG--SDDATTGLVDGDYATARFHS 354

Query: 106 PKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKK-EGRADGPAQNASFSNDF 162
           P   +   +  +YVAD  NH IR  ++    V T+AG G +  E +  G A   S SN +
Sbjct: 355 PLGLSYAGEDKLYVADSENHCIRCVRLAEERVVTVAGTGQRGYERKGGGRALEWSLSNPW 414

Query: 163 ELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
           ++    H   L I+  G   I + N + E+ S  S SGS L
Sbjct: 415 DV--ASHGHDLYIAMAGTHQIWKYNEQSEEISLVSGSGSEL 453


>gi|386841352|ref|YP_006246410.1| hypothetical protein SHJG_5269 [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|374101653|gb|AEY90537.1| hypothetical protein SHJG_5269 [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|451794647|gb|AGF64696.1| hypothetical protein SHJGH_5033 [Streptomyces hygroscopicus subsp.
           jinggangensis TL01]
          Length = 611

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 64/113 (56%), Gaps = 7/113 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S   +V D++R     L+     E+VV+R+ G G +G++DG P  A F +P+  A+   G
Sbjct: 189 SGTFLVSDTTRHQLVELAE--DAETVVRRI-GSGTRGFADGGPREASFSEPQGLALLDDG 245

Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGRA-DGPAQNASFSNDFEL 164
           ++ VAD  NH +R++      VTT+AG G +  +G A  GPA+    S+ +++
Sbjct: 246 SVVVADTVNHALRRLDPATGEVTTLAGTGKQWWQGSATSGPAREVDLSSPWDV 298


>gi|393789630|ref|ZP_10377750.1| hypothetical protein HMPREF1068_04030 [Bacteroides nordii
           CL02T12C05]
 gi|392650346|gb|EIY44015.1| hypothetical protein HMPREF1068_04030 [Bacteroides nordii
           CL02T12C05]
          Length = 435

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 1/98 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY+DG    A FD P+   +  + ++++AD +NHVIRKIT  GV +   G +   G  DG
Sbjct: 339 GYADGPCEYALFDTPRQIILTAENDLFLADSNNHVIRKITQDGVVSTVIGQAGMTGNQDG 398

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             + A F+  F +  V     + I D  NQ IR++ ++
Sbjct: 399 TPEVALFNEPFGVA-VDTDGTIYIGDSKNQSIRRLAIE 435



 Score = 44.7 bits (104), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 51/122 (41%), Gaps = 22/122 (18%)

Query: 31  GYTVTTVIDGHQLEINPHS----VIDRPGSSD----LIVLDSSRSAFYTLSFPL----SE 78
            YT    I  + L+   H     +++ PG +D      + D+ R    T    L    S 
Sbjct: 311 AYTNAHCIYTYNLKTGAHKLFAGMVNTPGYADGPCEYALFDTPRQIILTAENDLFLADSN 370

Query: 79  ESVVKRLAGDGV----------QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
             V++++  DGV           G  DG P  A F++P   AVD  G IY+ D  N  IR
Sbjct: 371 NHVIRKITQDGVVSTVIGQAGMTGNQDGTPEVALFNEPFGVAVDTDGTIYIGDSKNQSIR 430

Query: 129 KI 130
           ++
Sbjct: 431 RL 432


>gi|423220897|ref|ZP_17207391.1| hypothetical protein HMPREF1061_04164 [Bacteroides caccae
           CL03T12C61]
 gi|392622375|gb|EIY16503.1| hypothetical protein HMPREF1061_04164 [Bacteroides caccae
           CL03T12C61]
          Length = 426

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 50/98 (51%), Gaps = 1/98 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY DG+  +ARF+ P     D+ G +Y+AD SNH IR I   G  +   G   + G  DG
Sbjct: 330 GYEDGKRLNARFNFPSQICFDLDGIMYIADSSNHCIRSIDREGAVSTVIGVPGRAGYVDG 389

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
              +A F   + +  V     + I+D  N+ IR++ ++
Sbjct: 390 TPDDALFDEPWGVA-VDEEGTIYIADTKNKCIRKLAIQ 426



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 26/42 (61%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G  GY DG P  A FD+P   AVD +G IY+AD  N  IRK+
Sbjct: 382 GRAGYVDGTPDDALFDEPWGVAVDEEGTIYIADTKNKCIRKL 423


>gi|288923319|ref|ZP_06417453.1| serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288345335|gb|EFC79730.1| serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 752

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 52/103 (50%), Gaps = 3/103 (2%)

Query: 101 ARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           A FD  +   +D  +G IYVAD  N+ IR+I   G +TT+AG G+       GPA  AS 
Sbjct: 415 AEFDSARDLTIDTTRGVIYVADTENNRIRRIDRTGRITTVAGNGTAGFDGDGGPATAASL 474

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
                +   P    L I+D+GN  IR+I       + + QSGS
Sbjct: 475 QEPSGVVVAPD-GTLFIADNGNHRIRRIGTNGVITTIAGQSGS 516



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 73/153 (47%), Gaps = 28/153 (18%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRL---------AGDGVQGY-SDGEPGSA-RFDKPKS 108
           + +D++R   Y      +E + ++R+         AG+G  G+  DG P +A    +P  
Sbjct: 423 LTIDTTRGVIYVAD---TENNRIRRIDRTGRITTVAGNGTAGFDGDGGPATAASLQEPSG 479

Query: 109 FAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG----GGSKKEGRADG--------PAQN 155
             V   G +++AD  NH IR+I TN  +TTIAG    GG   E   DG        PA N
Sbjct: 480 VVVAPDGTLFIADNGNHRIRRIGTNGVITTIAGQSGSGGFAGEVSEDGLIFAGDGLPAVN 539

Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           A  +N      +    +LL++D  NQ IR+I L
Sbjct: 540 AKLNNP-NSLLLDDDGSLLVADGTNQRIRRIGL 571



 Score = 40.4 bits (93), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 44/81 (54%), Gaps = 4/81 (4%)

Query: 82  VKRLAGDGVQGYS-DGEPGS-ARFDKPKS-FAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           +  +AG G  G+S DG P + A+     +  AVD  G IY++D +N  IR+IT  G +TT
Sbjct: 642 ISTVAGTGDAGFSGDGGPATRAKISTVGADVAVDSTGTIYLSDPANSRIRRITPDGIITT 701

Query: 138 IAGGGSKKEGRADGPAQNASF 158
           IAG G  +     GPA  A  
Sbjct: 702 IAGTGVAQYSGNGGPAVEAEL 722



 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 1/89 (1%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           +A    P + A    G+IY+ D+ N  +R+I   G ++T+AG G        GPA  A  
Sbjct: 606 AATLSYPCALARGPDGSIYLVDQDNFRLRRIDPAGRISTVAGTGDAGFSGDGGPATRAKI 665

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
           S       V     + +SD  N  IR+I 
Sbjct: 666 STVGADVAVDSTGTIYLSDPANSRIRRIT 694


>gi|383111216|ref|ZP_09932033.1| hypothetical protein BSGG_2316 [Bacteroides sp. D2]
 gi|313694781|gb|EFS31616.1| hypothetical protein BSGG_2316 [Bacteroides sp. D2]
          Length = 419

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 20/152 (13%)

Query: 56  SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
           S + IV D +++     S       +S +   K +AGDGV+G  Y DG+PG   SA+   
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGDPGNPLSAKVGA 329

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT--------NLGVTTIAGGGSKKEGRADGPAQNAS 157
               A    G +Y++D + + IRK+T           + TIAG G  K G +DG    A+
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDASGDYSKGTLETIAGSG--KSGFSDGKGLKAT 387

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           F+  +E+        + ++   N LIR+I +K
Sbjct: 388 FNQPYEIIITKDCKTMYVAGAVNYLIRRITVK 419


>gi|442323754|ref|YP_007363775.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
 gi|441491396|gb|AGC48091.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
          Length = 906

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 41/109 (37%), Positives = 54/109 (49%), Gaps = 7/109 (6%)

Query: 82  VKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTT 137
           V  LAG   Q G+ DG    AR  +P   AVD  GNI+VAD  NH +R+I       VTT
Sbjct: 479 VSTLAGVAFQDGFVDGPVARARLRRPVGVAVDGLGNIFVADTGNHAVRRIAPDAARTVTT 538

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           IAG G+   G  +GP    +  +   +   P    L ++D GN  I +I
Sbjct: 539 IAGLGTP--GVGEGPGATTALRSPQSIAVAPD-GTLYVADTGNHRIVRI 584



 Score = 55.1 bits (131), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 43/131 (32%), Positives = 58/131 (44%), Gaps = 7/131 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTI 138
           V  +AG G  G  +G   +     P+S AV   G +YVAD  NH I +I   G   V+T 
Sbjct: 536 VTTIAGLGTPGVGEGPGATTALRSPQSIAVAPDGTLYVADTGNHRIVRIARDGRWTVSTF 595

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS--KS 196
           AG    ++GRADG    A F     L F      L ++D  N  + +I  +    +   S
Sbjct: 596 AGSREGRQGRADGTGPAARFQTPTSLVFAG--TDLYVTDTFNHRLARITPQARVSTVIGS 653

Query: 197 SQSGSALGAVS 207
             SGS  G  S
Sbjct: 654 RGSGSTNGPAS 664



 Score = 50.1 bits (118), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 39/110 (35%), Positives = 56/110 (50%), Gaps = 8/110 (7%)

Query: 82  VKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTI 138
           V   AG  +G QG +DG   +ARF  P S  V    ++YV D  NH + +IT    V+T+
Sbjct: 592 VSTFAGSREGRQGRADGTGPAARFQTPTSL-VFAGTDLYVTDTFNHRLARITPQARVSTV 650

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            G  S+  G  +GPA  A       + F     AL + D GN+LIR++ +
Sbjct: 651 IG--SRGSGSTNGPASQARLHRPTAVAFGDG--ALWVVDTGNRLIRRVAM 696



 Score = 40.4 bits (93), Expect = 0.76,   Method: Composition-based stats.
 Identities = 27/87 (31%), Positives = 43/87 (49%), Gaps = 3/87 (3%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P S AV + G + + D  N  IR +    V T AG G+   G  DG A+ A+FS    + 
Sbjct: 725 PMSGAVHVDGRLLLTDTGNERIRMLVGGRVRTYAGSGA--HGARDGTAEQATFSLPTGIA 782

Query: 166 FVPHICALLISDHGNQLIRQINLKPED 192
            +P+   +L+ D G   +R++ +   D
Sbjct: 783 ALPN-GNMLVVDQGASTLRELRMPAAD 808


>gi|443625938|ref|ZP_21110373.1| putative NHL repeat protein [Streptomyces viridochromogenes Tue57]
 gi|443340614|gb|ELS54821.1| putative NHL repeat protein [Streptomyces viridochromogenes Tue57]
          Length = 632

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 73/139 (52%), Gaps = 14/139 (10%)

Query: 39  DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
           DG  +   P   + R PG      S D +V D++R     L+     E+VV+R+ G G +
Sbjct: 169 DGPYVAPEPEPTVLRFPGKVLLLPSGDFLVSDTTRHQLVELAG--DGETVVRRI-GQGSR 225

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGR 148
           G +DG    ARF +P+  A+   G + VAD  NH++R+        TT+AG G +  +G+
Sbjct: 226 GSTDGYADRARFQEPQGLALLPDGTVAVADTVNHLVRRFDPETGRTTTLAGTGIQWMQGQ 285

Query: 149 A-DGPAQNASFSNDFELTF 166
           A  GP ++ S S+ +++ +
Sbjct: 286 ATSGPGRDVSLSSPWDVAW 304



 Score = 37.7 bits (86), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 51/117 (43%), Gaps = 5/117 (4%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           GDG     + EP   RF  P    +   G+  V+D + H + ++   G T +   G    
Sbjct: 168 GDGPYVAPEPEPTVLRF--PGKVLLLPSGDFLVSDTTRHQLVELAGDGETVVRRIGQGSR 225

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
           G  DG A  A F     L  +P    + ++D  N L+R+ +  PE    ++ +G+ +
Sbjct: 226 GSTDGYADRARFQEPQGLALLPD-GTVAVADTVNHLVRRFD--PETGRTTTLAGTGI 279


>gi|365121900|ref|ZP_09338810.1| hypothetical protein HMPREF1033_02156 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643613|gb|EHL82920.1| hypothetical protein HMPREF1033_02156 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 443

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 52/104 (50%), Gaps = 13/104 (12%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY DG    A F++P+   +D   N+Y+AD  NHVIRKIT  G  +   G +   G  DG
Sbjct: 347 GYLDGIGSYAMFNQPRQLILDEDDNLYIADTENHVIRKITPQGQVSTVIGQAGVAGYQDG 406

Query: 152 PAQNASFSNDFELTFVPH-IC-----ALLISDHGNQLIRQINLK 189
             + A F         PH +C      + I D+ NQ IR++ ++
Sbjct: 407 DPEVALFDR-------PHGVCINKEGIIYIGDYENQCIRRLAIE 443


>gi|290989086|ref|XP_002677176.1| predicted protein [Naegleria gruberi]
 gi|284090782|gb|EFC44432.1| predicted protein [Naegleria gruberi]
          Length = 2103

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 62/115 (53%), Gaps = 6/115 (5%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +S   ++  +AG+G  G+  G+ G A     ++P   A+   G +Y+AD  NH IRK++N
Sbjct: 525 VSNNGIITTIAGNGTAGFC-GDGGLAVNTCLNRPNGIAISSSGELYIADYGNHRIRKVSN 583

Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G +TTIAG G+         A NAS  +  +++   +   + I+D GN  IR+I
Sbjct: 584 NGIITTIAGNGNTIYNGDGIDAANASLYSPVDVSIGAN-NEIYIADAGNYRIRKI 637



 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 46/153 (30%), Positives = 72/153 (47%), Gaps = 9/153 (5%)

Query: 38  IDGHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
           ID  +  +N PH V   P S ++   D +          +     +  +AG+G   Y+ D
Sbjct: 105 IDATKATVNSPHGVAFHPTSGEVYFADYANHRIR----KIYSNGTITTIAGNGEPNYNGD 160

Query: 96  GEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPA 153
             P  SA+   P   AV   G +++AD  N+ IRKI+N G +TTIAG G+       GPA
Sbjct: 161 NIPATSAQLGYPIGIAVSSGGEVFIADTLNNRIRKISNNGMITTIAGNGTGGYSGDGGPA 220

Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            NA  +  F +  +     ++ +D  N  IR++
Sbjct: 221 VNAMLNTPFGVA-IDSDGNIVFADLVNNRIRKV 252



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 65/132 (49%), Gaps = 8/132 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S +LIV +++ S  + +   L+   +V  +AG    G  DG   S  F     F +   G
Sbjct: 790 SENLIVCETTSS--WKIRKILATTGMVSTIAG----GIGDGLNASNAFLVSTLFDISNSG 843

Query: 116 NIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
            IY+AD  NH IRKI TN  + TIAG G        G A +A  +N  ++  V     L+
Sbjct: 844 EIYIADTGNHRIRKIFTNGTIITIAGNGIGGYAGDGGQATSAQLNNPKDVA-VSSNGELI 902

Query: 175 ISDHGNQLIRQI 186
           I+D+ N  IR++
Sbjct: 903 IADYSNNRIRKV 914



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 44/125 (35%), Positives = 62/125 (49%), Gaps = 35/125 (28%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLG-VT 136
           ++  +AGDG  GY+ DG   + A  + P   A+D   N  Y+AD +N+ IRK+++ G +T
Sbjct: 313 IISSIAGDGQSGYNYDGIDATIATLNNPVGIAIDSSNNEFYIADSNNNRIRKVSSSGKIT 372

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL-----ISDHGNQL--------I 183
           TIAGG S             SF N+     VP I A +     +S + NQL         
Sbjct: 373 TIAGGTS-------------SFGNN-----VPAIPAFISPIGSMSLNNNQLSFADSSGQF 414

Query: 184 RQINL 188
           RQINL
Sbjct: 415 RQINL 419



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 56/115 (48%), Gaps = 5/115 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI-T 131
           +S   ++  +AG+G  GY+ DG +   A  + P   A     G +Y AD +NH IRKI +
Sbjct: 83  ISNNGIISTIAGNGQAGYNGDGIDATKATVNSPHGVAFHPTSGEVYFADYANHRIRKIYS 142

Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           N  +TTIAG G       + PA +A       +  V     + I+D  N  IR+I
Sbjct: 143 NGTITTIAGNGEPNYNGDNIPATSAQLGYPIGIA-VSSGGEVFIADTLNNRIRKI 196



 Score = 45.1 bits (105), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 38/140 (27%), Positives = 61/140 (43%), Gaps = 10/140 (7%)

Query: 76  LSEESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
           +S   ++  +AG+G   Y+ DG +  +A    P   ++     IY+AD  N+ IRKI TN
Sbjct: 581 VSNNGIITTIAGNGNTIYNGDGIDAANASLYSPVDVSIGANNEIYIADAGNYRIRKIFTN 640

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFS-------NDFELTFVPHICALLISDHGNQLIRQ 185
             + TIAG G+      +G   NA+         N   + F   +  ++   + N  I  
Sbjct: 641 GTIVTIAGTGTNGFSGDNGLGSNATIGYPSSVLFNSGNVYFTDIVYCVIRKIYSNGTITT 700

Query: 186 INLKPEDCSKSSQSGSALGA 205
           I+ K   C+     G A  A
Sbjct: 701 ISGKAGTCTYGGDGGKASNA 720



 Score = 41.6 bits (96), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 44/158 (27%), Positives = 74/158 (46%), Gaps = 10/158 (6%)

Query: 31  GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
           GYT  T+I  +    NP+S+  +  ++D+ V+ +S S  Y             +LAG   
Sbjct: 432 GYT--TIIQTYS---NPNSI--KFFNNDMYVMYTSNSVSYIDRIVNLNYGPAVKLAGAIF 484

Query: 91  QGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGR 148
               +    SA  + P    ++   G + ++D +NH IRK++N G +TTIAG G+     
Sbjct: 485 YNGDNIPATSAIMNYPSHCVINSTNGELIISDMNNHRIRKVSNNGIITTIAGNGTAGFCG 544

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             G A N   +    +  +     L I+D+GN  IR++
Sbjct: 545 DGGLAVNTCLNRPNGIA-ISSSGELYIADYGNHRIRKV 581



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 44/84 (52%), Gaps = 4/84 (4%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGS 143
           AG    G   G+  +A+   P   A+   G+IY++D  NH IR I+++   ++ IAG G 
Sbjct: 705 AGTCTYGGDGGKASNAQLSYPAGIAISSTGDIYISDNYNHRIRVISSVTGIISNIAGTG- 763

Query: 144 KKEGRADGPAQN-ASFSNDFELTF 166
           + E   DG  ++  +F+    LTF
Sbjct: 764 RSEYNGDGLHESITNFAYPVGLTF 787



 Score = 37.0 bits (84), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 51/107 (47%), Gaps = 4/107 (3%)

Query: 82  VKRLAGDGVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           +  +AG G     DG P  +A  +  +  A    G I +AD  N+ IRKI+N G ++TIA
Sbjct: 36  ITTVAGGG-NSVGDGLPAVNALLNSVRDVAY-YSGVIVIADTLNNRIRKISNNGIISTIA 93

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G G          A  A+ ++   + F P    +  +D+ N  IR+I
Sbjct: 94  GNGQAGYNGDGIDATKATVNSPHGVAFHPTSGEVYFADYANHRIRKI 140


>gi|37522270|ref|NP_925647.1| hypothetical protein gll2701 [Gloeobacter violaceus PCC 7421]
 gi|35213270|dbj|BAC90642.1| gll2701 [Gloeobacter violaceus PCC 7421]
          Length = 351

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 55/114 (48%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +  +  ++ +AG G   Y  DG P   ARF+ P   AVD  GN+ VAD  NH IR I   
Sbjct: 124 VGTDGRIETIAGTGEADYGGDGGPAKQARFNDPAGVAVDALGNVLVADTYNHRIRTIGPD 183

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G + T+AG G        GPA  A     + +   P    +LI+D GN  IR I
Sbjct: 184 GTIRTVAGTGQAGYSGDGGPATAARLDFPWGVAVAPD-GRILIADTGNNRIRSI 236



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 53/177 (29%), Positives = 88/177 (49%), Gaps = 16/177 (9%)

Query: 17  LLFSLVSSGLLLEDGYTVTTVIDGHQLEIN-----PHSVIDRPGSSDLIVLDSSRSAFYT 71
           + F+L+ + L     + + +V+ G+ +++      P  V+  P + +L V++ SR+    
Sbjct: 13  VWFALLGTAL---GSFALRSVVAGNHIKVAEEPLYPVFVLPAP-AGELFVVEHSRNQILK 68

Query: 72  LSFPLSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           L  P    +V+   AG+G   YS DG P +         A D  GN+Y+AD ++H +R++
Sbjct: 69  L-VPGRLPTVI---AGNGTSDYSGDGGPATRAGLFMMGIARDRAGNLYIADHNHHRVRRV 124

Query: 131 TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
              G + TIAG G    G   GPA+ A F ND     V  +  +L++D  N  IR I
Sbjct: 125 GTDGRIETIAGTGEADYGGDGGPAKQARF-NDPAGVAVDALGNVLVADTYNHRIRTI 180



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 55/114 (48%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +  +  ++ +AG G  GYS DG P +A R D P   AV   G I +AD  N+ IR I   
Sbjct: 180 IGPDGTIRTVAGTGQAGYSGDGGPATAARLDFPWGVAVAPDGRILIADTGNNRIRSIGPD 239

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G + T+AG G    G   GPA  A      +L    H   L ++D  N  +R+I
Sbjct: 240 GTIRTVAGTGQAGFGGDGGPAVKARLERP-QLAVADHRGNLFVADTNNNRVRRI 292



 Score = 40.0 bits (92), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 48/88 (54%), Gaps = 11/88 (12%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           AR ++P+    D +GN++VAD +N+ +R+I   G ++T+AGG        + PA  A+  
Sbjct: 263 ARLERPQLAVADHRGNLFVADTNNNRVRRIAPDGTISTVAGG--------EPPA--AARL 312

Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
           ND     V     L I+D GN  + +I+
Sbjct: 313 NDPFAVGVDERGHLYIADTGNFRVLKID 340


>gi|406836139|ref|ZP_11095733.1| NHL repeat containing protein [Schlesneria paludicola DSM 18645]
          Length = 365

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 67/135 (49%), Gaps = 13/135 (9%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKG 115
           D+I++    +A Y +     +E  V+ LAG G +GY+ DG  G  A+ + PK  A+D +G
Sbjct: 236 DVILVLREGNAVYRID---RKEKSVRHLAGTGKKGYAGDGGDGKLAQVNGPKGIAIDHQG 292

Query: 116 NIYVADKSNHVIRKITNL--GVTTIAGGGSKKEGRADGPAQNASFS--NDFELTFVPHIC 171
           NI + D  NHVIR I  L   + T+ G G+      DGP  N      N     FV    
Sbjct: 293 NILLCDTENHVIRIIERLTGKIDTLVGDGTI----GDGPDGNPRHCRLNRPHGVFVALDG 348

Query: 172 ALLISDHGNQLIRQI 186
            + I D GN  IR++
Sbjct: 349 TVYIGDSGNHKIRKL 363



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 50/100 (50%), Gaps = 6/100 (6%)

Query: 94  SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRA 149
            DG P   A+F +P     D  GN Y++D  + VIR+I   T + +TT+AG         
Sbjct: 99  GDGGPALRAKFHEPHEIRFDRNGNYYISDMKSDVIRRIDAKTQI-ITTVAGTAKPGFTGD 157

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            GPA  A F+N   ++       LLI D  N  +RQ++L+
Sbjct: 158 GGPATKAEFNNPIAVSLDGD-ARLLICDIKNHRVRQVDLE 196



 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 82  VKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  L GDG  G   DG P   R ++P    V + G +Y+ D  NH IRK+T
Sbjct: 314 IDTLVGDGTIGDGPDGNPRHCRLNRPHGVFVALDGTVYIGDSGNHKIRKLT 364



 Score = 41.2 bits (95), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 53/115 (46%), Gaps = 6/115 (5%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTI 138
           +  +AG GV  +S DG P   A    P    +   G +Y  D +NHVIR++    G  T 
Sbjct: 28  IATIAGTGVDEHSGDGGPALKAGLSNPFGLEIAPDGMLYFCDFTNHVIRRMDLKTGFLTT 87

Query: 139 AGGGSKKEGRA--DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
             G  +  G A   GPA  A F    E+ F  +     ISD  + +IR+I+ K +
Sbjct: 88  VAGTPRNPGFAGDGGPALRAKFHEPHEIRFDRN-GNYYISDMKSDVIRRIDAKTQ 141


>gi|326432343|gb|EGD77913.1| NHL repeat protein [Salpingoeca sp. ATCC 50818]
          Length = 1505

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 39/114 (34%), Positives = 57/114 (50%), Gaps = 9/114 (7%)

Query: 81   VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV--- 135
            V+  +AG GV G+S   G P  AR D P+  AV   G++ +AD  NH +R+  N+GV   
Sbjct: 1391 VITTVAGTGVAGFSGDGGLPTDARLDTPRGVAVHSSGSLAIADSGNHRVREF-NIGVGAA 1449

Query: 136  ---TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               TT AG G +        A + + +    +TF P    +L  D  N+ +RQI
Sbjct: 1450 GIITTTAGNGQRGYNGDGMVATDTALNFPTGITFSPLTDNVLFVDRRNRRVRQI 1503



 Score = 37.7 bits (86), Expect = 5.1,   Method: Composition-based stats.
 Identities = 36/117 (30%), Positives = 51/117 (43%), Gaps = 27/117 (23%)

Query: 84   RLAGDGVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK-------ITNLGV 135
            R+AGD      DG+    AR + P   AV     +Y+AD  NH +R        IT +  
Sbjct: 1343 RVAGD----EGDGQLAIGARLNMPAGIAV-YDHMLYIADSGNHRVRAVDLYTQVITTVAG 1397

Query: 136  TTIAG----GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            T +AG    GG   + R D P   A  S+           +L I+D GN  +R+ N+
Sbjct: 1398 TGVAGFSGDGGLPTDARLDTPRGVAVHSSG----------SLAIADSGNHRVREFNI 1444


>gi|116622401|ref|YP_824557.1| Ig domain-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116225563|gb|ABJ84272.1| Ig domain protein, group 1 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 2770

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 63/111 (56%), Gaps = 5/111 (4%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV-T 136
           ++  +AG G +GYS DG+P + A+ + P   A+D  GN+++AD  NH +R++  N GV T
Sbjct: 141 IITTVAGTGERGYSGDGQPATLAKINSPYHIALDGHGNLFIADDGNHRVRRVDGNSGVIT 200

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           T+AG G+      D  A +A   N   +  +     L I+D+GN  +R ++
Sbjct: 201 TVAGTGNAGYNGDDQQATHADLQNPRGV-LIDASGNLYIADYGNHRVRVVD 250



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 10/114 (8%)

Query: 82  VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTI 138
           +  +AG G+QG+S   G   +AR   P   A+   G++Y+AD  NH IRK+   GV TT+
Sbjct: 25  ISTVAGTGIQGFSGDSGPATAARLFDPTDVAIHPNGDLYIADTYNHRIRKVDKNGVITTV 84

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQIN 187
           AG G        G A +   +   EL     I       L I+D G+  IR+++
Sbjct: 85  AGTGQATN--VGGDANDNILAVSAELNHPSGIAFDTAGNLYIADTGHDRIRRVD 136



 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 52/112 (46%), Gaps = 4/112 (3%)

Query: 81  VVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-T 137
           V+  +AG G  GY+  D +   A    P+   +D  GN+Y+AD  NH +R +   GV  T
Sbjct: 198 VITTVAGTGNAGYNGDDQQATHADLQNPRGVLIDASGNLYIADYGNHRVRVVDATGVIHT 257

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            AG G        G A  A+      L        + ++D  +Q +RQ+N++
Sbjct: 258 FAGTGVYGFSGDGGAAMAANLKGPIGLG-TDAAGNIYVADGQDQRVRQVNIQ 308



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 46/90 (51%), Gaps = 3/90 (3%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNAS 157
           SA  + P   A D  GN+Y+AD  +  IR++  +   +TT+AG G +       PA  A 
Sbjct: 105 SAELNHPSGIAFDTAGNLYIADTGHDRIRRVDGVTGIITTVAGTGERGYSGDGQPATLAK 164

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQIN 187
            ++ + +    H   L I+D GN  +R+++
Sbjct: 165 INSPYHIALDGH-GNLFIADDGNHRVRRVD 193



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 12/113 (10%)

Query: 81  VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVT 136
           V+   AG GV G+S   G   +A    P     D  GNIYVAD  +  +R+  I    ++
Sbjct: 254 VIHTFAGTGVYGFSGDGGAAMAANLKGPIGLGTDAAGNIYVADGQDQRVRQVNIQTRVIS 313

Query: 137 TIAGGGSKKEGRADGPAQNASFSN---DFELTFVPHICALLISDHGNQLIRQI 186
           T+ G G        GPA +A+ S+    F+ T       + I+ +G+ +IR +
Sbjct: 314 TVVGNGQSGYSGDGGPATDATISSASATFDAT-----GRIYIAQYGSNVIRSV 361


>gi|290992224|ref|XP_002678734.1| predicted protein [Naegleria gruberi]
 gi|284092348|gb|EFC45990.1| predicted protein [Naegleria gruberi]
          Length = 1366

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 57/165 (34%), Positives = 80/165 (48%), Gaps = 18/165 (10%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
           L   Y +TTV  G +   +P   I +  + DLIV D+   +   +S+      VV R+AG
Sbjct: 399 LPSSYYITTVARGLK---DPQK-ISKLSNGDLIVSDTGDHSIKKISY---STGVVTRIAG 451

Query: 88  DGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG---G 141
            GV G+S DG   + A+F+KP   AV +   IY+AD  NH IR I  N  ++T+ G   G
Sbjct: 452 TGVAGFSGDGGLATLAQFNKPYGIAVTINDEIYIADSLNHRIRFIDVNGNISTVVGTSIG 511

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            S   G A     NA    D  L+       L I+D  N  IR++
Sbjct: 512 FSGDGGLATAAKLNAPM--DVSLSASGD---LYIADRDNYRIRKV 551



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 50/188 (26%), Positives = 85/188 (45%), Gaps = 8/188 (4%)

Query: 2   ASHFVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIV 61
           +++ V  SLAI L+ +   ++  G  L       +   G   E      + R  + D++V
Sbjct: 9   SANIVGFSLAIVLIMMAMVVMVKGTSLNPYSVKISTFAGGNGEFKDPQKLARLSNGDILV 68

Query: 62  LDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYV 119
            D+   A       ++   V+  +AG GV G++  +G   +A+ +KP   AV     IY 
Sbjct: 69  TDALGHAIK----KINGTGVITTIAGTGVAGFAGDNGPAINAQVNKPYGIAVSSNDEIYF 124

Query: 120 ADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDH 178
           AD  NH IRKI   G +TT+ G G+       G A + + +   +++       L I+D 
Sbjct: 125 ADSMNHRIRKIDITGNITTVVGTGANTFSGDGGLATDCTMNTPMDVSLSAS-GELYIADM 183

Query: 179 GNQLIRQI 186
            N  IR++
Sbjct: 184 YNYRIRKV 191


>gi|299144811|ref|ZP_07037879.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
 gi|298515302|gb|EFI39183.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
          Length = 459

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 58/124 (46%), Gaps = 19/124 (15%)

Query: 85  LAGD-GVQGYSDGEPGSARFDKP--------KSFAVDMKGNIY---VADKSNHVIRKITN 132
            AG+ G  GY D    SARFD P        + +  + + +IY   + D +NH IRKIT 
Sbjct: 336 FAGEFGSYGYKDASQNSARFDNPMQGCFVLNEEYVAEQRADIYDFYLTDAANHCIRKITP 395

Query: 133 LG-VTTIAGGGSKK-----EGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
            G VTT AG GS        G  DG P + A F+    L +        + D+GN  +R 
Sbjct: 396 DGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLCYEESTGTFYVGDNGNHRVRT 455

Query: 186 INLK 189
           I L+
Sbjct: 456 IALQ 459


>gi|326435583|gb|EGD81153.1| hypothetical protein PTSG_13151 [Salpingoeca sp. ATCC 50818]
          Length = 1551

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 39/114 (34%), Positives = 57/114 (50%), Gaps = 9/114 (7%)

Query: 81   VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV--- 135
            V+  +AG GV G+S   G P  AR D P+  AV   G++ +AD  NH +R+  N+GV   
Sbjct: 1437 VITTVAGTGVAGFSGDGGLPTDARLDTPRGVAVHSSGSLAIADSGNHRVREF-NIGVGAA 1495

Query: 136  ---TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               TT AG G +        A + + +    +TF P    +L  D  N+ +RQI
Sbjct: 1496 GIITTTAGNGQRGYNGDGMVATDTALNFPTGITFSPLTDNVLFVDRRNRRVRQI 1549



 Score = 38.9 bits (89), Expect = 2.1,   Method: Composition-based stats.
 Identities = 43/146 (29%), Positives = 63/146 (43%), Gaps = 30/146 (20%)

Query: 58   DLIVLDSSRSAFYTLSFPLSEESVVK---RLAGDGVQGYSDGEPG-SARFDKPKSFAVDM 113
            +L+  D    A Y ++   +  SVV    R+AGD      DG+    AR + P   AV  
Sbjct: 1360 NLLFSDEENHAVYMVNPHTTIISVVAGTPRVAGD----EGDGQLAIGARLNMPAGIAV-Y 1414

Query: 114  KGNIYVADKSNHVIRK-------ITNLGVTTIAG----GGSKKEGRADGPAQNASFSNDF 162
               +Y+AD  NH +R        IT +  T +AG    GG   + R D P   A  S+  
Sbjct: 1415 DHMLYIADSGNHRVRAVDLYTQVITTVAGTGVAGFSGDGGLPTDARLDTPRGVAVHSSG- 1473

Query: 163  ELTFVPHICALLISDHGNQLIRQINL 188
                     +L I+D GN  +R+ N+
Sbjct: 1474 ---------SLAIADSGNHRVREFNI 1490


>gi|293372451|ref|ZP_06618835.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
 gi|292632634|gb|EFF51228.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
          Length = 446

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 58/124 (46%), Gaps = 19/124 (15%)

Query: 85  LAGD-GVQGYSDGEPGSARFDKP--------KSFAVDMKGNIY---VADKSNHVIRKITN 132
            AG+ G  GY D    SARFD P        + +  + + +IY   + D +NH IRKIT 
Sbjct: 323 FAGEFGSYGYKDASQNSARFDNPMQGCFVLNEEYVAEQRADIYDFYLTDAANHCIRKITP 382

Query: 133 LG-VTTIAGGGSKK-----EGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
            G VTT AG GS        G  DG P + A F+    L +        + D+GN  +R 
Sbjct: 383 DGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLCYEESTGTFYVGDNGNHRVRT 442

Query: 186 INLK 189
           I L+
Sbjct: 443 IALQ 446


>gi|325965074|ref|YP_004242980.1| thiol-disulfide isomerase-like thioredoxin [Arthrobacter
           phenanthrenivorans Sphe3]
 gi|323471161|gb|ADX74846.1| thiol-disulfide isomerase-like thioredoxin [Arthrobacter
           phenanthrenivorans Sphe3]
          Length = 674

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 50/166 (30%), Positives = 76/166 (45%), Gaps = 10/166 (6%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDL-IVLDSSRSAFYTLSF-PLSEESVVKRLA 86
           ED    +  + GH L++   S  D   S  L  V+ +        SF PL+    V  +A
Sbjct: 325 EDAAGFSGQLVGHPLDVALSSPWDVVWSRKLNAVVIAMAGTHQIFSFDPLT--GSVAIVA 382

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGVTTIAGG 141
           G+G++G  DG P  A F +P   A D  GNI+VAD     +RK+       + V +  G 
Sbjct: 383 GNGLEGLLDGAPHEAWFAQPSGLAEDADGNIWVADSETSALRKLVIDDDGGVAVESAIGK 442

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G    G  DGPA  A   +   +T +P   ++ I+D  N  +R+ +
Sbjct: 443 GLFDFGFRDGPAAEARLQHPLGVTVLPD-GSVAIADTYNGAVRRYD 487


>gi|290975813|ref|XP_002670636.1| predicted protein [Naegleria gruberi]
 gi|284084197|gb|EFC37892.1| predicted protein [Naegleria gruberi]
          Length = 2442

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 66/140 (47%), Gaps = 11/140 (7%)

Query: 82   VKRLAGDGVQG-YSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
            +  +AG G+ G Y DG   + A+   P    V   G+IY+AD  NH IRK+ +N  +TTI
Sbjct: 919  ITTVAGSGLAGFYGDGGLATNAKLSYPMGIYVSKNGDIYIADYGNHRIRKVFSNGTITTI 978

Query: 139  AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
            AG GS         A   + ++   +   P+   + I+D GN LIR+I L   DC    +
Sbjct: 979  AGTGSINYNGDIQEATLTTLNSPRGVYMSPNEDEIYIADSGNNLIRKIKL---DCPSDYE 1035

Query: 199  SGSALGAVSVWVLVSVLSCL 218
                LG       V ++ C 
Sbjct: 1036 LDKRLGEC-----VQIVKCF 1050



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 60/114 (52%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
           ++ + V+  +AG    G+   +G   +A  + P+S  V+ KG IYV+D  NH +RKI  N
Sbjct: 857 ITSDGVISLVAGSYYGGFGGDNGPAANAVLNSPESLFVNSKGEIYVSDTVNHRVRKIDLN 916

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             +TT+AG G        G A NA  S    + +V     + I+D+GN  IR++
Sbjct: 917 GTITTVAGSGLAGFYGDGGLATNAKLSYPMGI-YVSKNGDIYIADYGNHRIRKV 969



 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 64/148 (43%), Gaps = 22/148 (14%)

Query: 48  HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-----VQGYSDGEPGSAR 102
           + VI  P +S++ +L S R               ++++  +G     V G  DG      
Sbjct: 785 NQVIIHPQTSEMFILSSDR---------------IRKVNTNGTISTVVGGIGDGAQAIYA 829

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT-IAGGGSKKEGRADGPAQNASFSND 161
              P    V   G IY+ D  NH IR IT+ GV + +AG      G  +GPA NA   N 
Sbjct: 830 LINPTDIHVTNNGTIYLTDSLNHCIRMITSDGVISLVAGSYYGGFGGDNGPAANAVL-NS 888

Query: 162 FELTFVPHICALLISDHGNQLIRQINLK 189
            E  FV     + +SD  N  +R+I+L 
Sbjct: 889 PESLFVNSKGEIYVSDTVNHRVRKIDLN 916



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 61/111 (54%), Gaps = 9/111 (8%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKS-FAVDMKG-NIYVADKSNHVIRKITNLGVTTI 138
           +V  +AG GV GYS G+ G A     +  + + M G ++Y+AD +N+VIRK++N  VTT+
Sbjct: 703 IVSTIAGTGVLGYS-GDGGLATNANFRWIYGLWMVGSDLYIADSTNNVIRKLSNNIVTTV 761

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI--SDHGNQLIRQIN 187
           AG G+        PA +A+ +   ++   P    + I  SD     IR++N
Sbjct: 762 AGTGNANFNGDGIPALSANINTANQVIIHPQTSEMFILSSDR----IRKVN 808



 Score = 43.5 bits (101), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 57/114 (50%), Gaps = 5/114 (4%)

Query: 76  LSEESVVKRLAG-DGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           L  + +V+R+AG  G+ G+S   G   +A        +V   G+IYVA+  N  IRKI+N
Sbjct: 529 LGTDGIVRRIAGITGICGFSGDGGLATNALMGVSWGLSVSPSGDIYVAESINSRIRKISN 588

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             +TT+AG         D  A +AS  N     F     A  ++D GN+ +R++
Sbjct: 589 GIITTVAGTSVAGYNGDDIAATSASL-NGPPAVFATS-TAYYLADAGNRRVRKV 640



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 52/110 (47%), Gaps = 6/110 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPGSAR----FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT 136
           ++  +AG+G  G S+    +         P++  V    ++Y AD SN   R +TN  V+
Sbjct: 646 IITTIAGNGASGSSEENINATSAIIGTSSPRALWVSNDDSVYYADVSNGKTRVVTNGIVS 705

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           TIAG G        G A NA+F   + L  V     L I+D  N +IR++
Sbjct: 706 TIAGTGVLGYSGDGGLATNANFRWIYGLWMVGS--DLYIADSTNNVIRKL 753


>gi|291000698|ref|XP_002682916.1| dual specificity protein phosphatase [Naegleria gruberi]
 gi|284096544|gb|EFC50172.1| dual specificity protein phosphatase [Naegleria gruberi]
          Length = 1502

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 4/105 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
           +   AG+G +G S G      FD P    V +  N++V+D  NH + +I N G  T A G
Sbjct: 421 ITNFAGNGNRGCSTGRVTDVMFDTPTGICV-IGDNVFVSDTGNHRVVRINN-GFVTNAAG 478

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             +  G  DG  QN+  ++  +L    H   L I+D GN  IRQ+
Sbjct: 479 NPRTAGFMDGKVQNSILNSPTDL--FSHENTLYINDFGNSRIRQV 521


>gi|260904571|ref|ZP_05912893.1| NHL repeat-containing protein [Brevibacterium linens BL2]
          Length = 647

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 53/179 (29%), Positives = 89/179 (49%), Gaps = 31/179 (17%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V + P S +L+V DS   +   + +  S +++++R+ G G +G +DG+  SA F +P
Sbjct: 186 PGKVTELP-SGNLLVADSGHHSL--VEYDASGQNIIRRI-GTGERGANDGDFTSASFSEP 241

Query: 107 KSFAV---DMKG----NIYVADKSNHVIRKITNLG---VTTIAGGGSK------------ 144
               V   D+      ++ VAD  NH +R I NL    VTT+AG GS+            
Sbjct: 242 GGITVLPDDVAAKAGYHLVVADTVNHTLRGI-NLDTETVTTVAGTGSQHMVGAIDNVVGT 300

Query: 145 --KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
             + GR DGPA +   S+ +++ ++P    ++++  GN  I   +  PE  S    SG+
Sbjct: 301 HGELGRYDGPALDVKLSSPWDVLYIPATAEVVVAMAGNHTIWSFD--PETGSIRLLSGT 357



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 76/161 (47%), Gaps = 9/161 (5%)

Query: 39  DGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           DG  L++   +P  V+  P +++++V  +     ++      E   ++ L+G   +G +D
Sbjct: 308 DGPALDVKLSSPWDVLYIPATAEVVVAMAGNHTIWSFD---PETGSIRLLSGTMNEGLAD 364

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPA 153
           GE  +A F +     +   G++++AD     IR++  +   V+T+ G G    G  DGPA
Sbjct: 365 GEAEAAWFAQTSGLDLSSDGDVFIADSETSAIRRLDPSTGAVSTLIGEGLFDFGFRDGPA 424

Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
             A   +   +  +P   ++ I+D  N  IR+ +    + S
Sbjct: 425 AEARLQHPLGVRSLPD-GSIAIADTYNGAIRRYDFTTNEVS 464


>gi|223993873|ref|XP_002286620.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977935|gb|EED96261.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1937

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 47/150 (31%), Positives = 75/150 (50%), Gaps = 24/150 (16%)

Query: 73  SFPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ L+ +  V  +AG   G +G+ DG    ARF  P+  AVD  G +YVAD  NH IR I
Sbjct: 218 NYTLNPQGFVFTMAGSTSGEEGFVDGVESDARFRHPEGVAVDHDGYVYVADTGNHAIRMI 277

Query: 131 TNLG-VTTIAGGG--SKKEGRADGPAQNASFSN---DFELTFVPH--------------- 169
           +  G V+T+AG G    K+G A+  AQ +S ++     +  + P+               
Sbjct: 278 SPSGRVSTLAGNGVPGYKDGLANSGAQLSSPTDIAVWRDWAWWPYENPVDPDSFIYKNGN 337

Query: 170 -ICALLISDHGNQLIRQINLKPEDCSKSSQ 198
              ALL++D  N  +R+I    E+ S++ +
Sbjct: 338 GTLALLVADTDNHRVRKITGDIEEDSETGE 367



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 25/50 (50%), Positives = 35/50 (70%), Gaps = 1/50 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           GY+DGE   ARFD P+   V   G ++VAD +NH++R+I+ LG V T+AG
Sbjct: 392 GYADGEKDEARFDSPQGITVASDGRVFVADTNNHLVREISRLGTVVTVAG 441



 Score = 38.1 bits (87), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 45/155 (29%), Positives = 59/155 (38%), Gaps = 45/155 (29%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAV-------------DMKGNIY--- 118
           +S    V  LAG+GV GY DG   S A+   P   AV             D    IY   
Sbjct: 277 ISPSGRVSTLAGNGVPGYKDGLANSGAQLSSPTDIAVWRDWAWWPYENPVDPDSFIYKNG 336

Query: 119 -------VADKSNHVIRKIT---------------NLGVTTIAG-----GGSKKEGRADG 151
                  VAD  NH +RKIT               N+ V   AG       + + G ADG
Sbjct: 337 NGTLALLVADTDNHRVRKITGDIEEDSETGEKMWNNVRVECFAGRCGSVSSNPEPGYADG 396

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               A F +   +T V     + ++D  N L+R+I
Sbjct: 397 EKDEARFDSPQGIT-VASDGRVFVADTNNHLVREI 430


>gi|333025645|ref|ZP_08453709.1| hypothetical protein STTU_3149 [Streptomyces sp. Tu6071]
 gi|332745497|gb|EGJ75938.1| hypothetical protein STTU_3149 [Streptomyces sp. Tu6071]
          Length = 610

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 67/124 (54%), Gaps = 8/124 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG +  +V D++R A   L    ++   V+R  G G +G  DG+   ARF++P
Sbjct: 183 PAKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGSGERGLLDGDATGARFNEP 238

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGR-ADGPAQNASFSNDF 162
           +  A+   G + VAD  NH +R++   +  V+T+AG G +  +G    GPA+  S S+ +
Sbjct: 239 QGLALLPDGAVVVADTVNHALRRVDLASGEVSTLAGTGRQWWQGEPVAGPAREVSLSSPW 298

Query: 163 ELTF 166
           ++ +
Sbjct: 299 DVAW 302


>gi|421097867|ref|ZP_15558546.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
           str. 200901122]
 gi|410799150|gb|EKS01231.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
           str. 200901122]
          Length = 357

 Score = 57.0 bits (136), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 60/108 (55%), Gaps = 5/108 (4%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
           GV G+ +G+  ++ F  P    +D + N+YV + SNH IRKI  N G V+T++GG     
Sbjct: 162 GVDGFQNGDRLNSLFKSPFFMDLDRERNLYVGELSNHAIRKINLNSGTVSTLSGG---VL 218

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           G  DG   +A F +   +T+     +LL++D  N  IR+I+LK    S
Sbjct: 219 GYLDGDLASAQFKSPLGITYDQKTDSLLVADIQNHNIRKIDLKASTVS 266



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 2/95 (2%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R    + I LD      +     ++   V+   AG G++   DG   +A F+ 
Sbjct: 21  NPSIFMGRAWVENTI-LDCILKECHLCKLKVTNNPVISLFAGTGIKESIDGTTQTASFNT 79

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           P    +D  GNI+V+D++ ++IRKI   G VTT++
Sbjct: 80  PFGLELDTFGNIFVSDQTANLIRKIDRFGNVTTLS 114


>gi|290975887|ref|XP_002670673.1| predicted protein [Naegleria gruberi]
 gi|284084234|gb|EFC37929.1| predicted protein [Naegleria gruberi]
          Length = 1074

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 75/143 (52%), Gaps = 9/143 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFD 104
           P+ +     S ++I++D+S +        +S    +  +AG G  GYS  +G+  +A+F+
Sbjct: 152 PYGLALNSSSGEVIIVDTSNNVIR----KVSSIGNITTIAGTGAAGYSGDNGQATNAKFN 207

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFE 163
            P++ A    G ++VAD  NH IRKI+N G VTT+AG G+         A+NA  +    
Sbjct: 208 APRA-AFYSNGELFVADSRNHRIRKISNSGIVTTVAGTGTAGFNGDSILAKNAQLNYPSG 266

Query: 164 LTFVPHICALLISDHGNQLIRQI 186
           ++ V     + ISD  N  IR+I
Sbjct: 267 IS-VNSNGEIFISDSVNNRIRKI 288



 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 62/117 (52%), Gaps = 16/117 (13%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           +  +AG GV G+S G+ G   SA  + P   AV+  G + ++D +NH IRK++  G +TT
Sbjct: 828 ISTVAGSGVAGFS-GDGGLATSATLNNPSYVAVNSNGELLISDTNNHRIRKVSLNGIITT 886

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTF-----VPHICALLISDHGNQLIRQINLK 189
           IAG G+      +G   NAS    ++L +     V     L I+D  N  IR++N+ 
Sbjct: 887 IAGNGTAGY---NGEGNNASL---YQLNYPSGLVVSSTGDLFIADSYNHRIRKLNVN 937



 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 65/134 (48%), Gaps = 12/134 (8%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GN 116
           D+ ++DS  S    LS        +  +AG    G  DG      +   +SFA+  K G 
Sbjct: 758 DVFIIDSFNSRIRKLS-----SGKISTVAG----GLGDGSSAVNSYLNSQSFAISSKSGE 808

Query: 117 IYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
           I++AD +NH IRKI TN  ++T+AG G        G A +A+ +N      V     LLI
Sbjct: 809 IFIADSNNHRIRKIATNGDISTVAGSGVAGFSGDGGLATSATLNNP-SYVAVNSNGELLI 867

Query: 176 SDHGNQLIRQINLK 189
           SD  N  IR+++L 
Sbjct: 868 SDTNNHRIRKVSLN 881



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 64/120 (53%), Gaps = 10/120 (8%)

Query: 76  LSEESVVKRLAGDGVQGYS---DGEPG---SARFDKPKSFAVDMKGN-IYVADKSNHVIR 128
           +S    +  +AG+G   Y     G+ G   +A+   P + AV++  N IY+AD  NH IR
Sbjct: 598 VSTSGYISTIAGNGTVAYKATFSGDNGLAINAQLFVPFAVAVNLTNNDIYIADSGNHRIR 657

Query: 129 KITNLG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           K+++    +TT+AG G+      +G A NA  +  F ++ + +   + ISD  NQ +R++
Sbjct: 658 KVSSSSGIITTVAGTGTSGFSGDNGLATNAKLNFPFSIS-IGNSGEIYISDQYNQRVRKV 716



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 3/120 (2%)

Query: 69  FYTLSFPLSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
            Y  S+    +S +  +AG G  G    DG   + R   P S  +   G+ Y +D SNH 
Sbjct: 424 MYLGSYLSKIKSSLTAVAGTGAIGADSGDGLAITERLHNPNSIFISSNGDSYFSDSSNHK 483

Query: 127 IRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           IRK++N  +TTIAG G+      DG +  ++  N+ +   V     +  SD  N  IR+I
Sbjct: 484 IRKLSNGYITTIAGTGTSGY-SGDGSSATSAKLNNPKGVVVSSSGEIYFSDSENHRIRKI 542



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 49/100 (49%), Gaps = 9/100 (9%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARF 103
           NP+S+     + D    DSS      LS        +  +AG G  GYS DG    SA+ 
Sbjct: 462 NPNSIFISS-NGDSYFSDSSNHKIRKLS-----NGYITTIAGTGTSGYSGDGSSATSAKL 515

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
           + PK   V   G IY +D  NH IRKI+  G ++T+AG G
Sbjct: 516 NNPKGVVVSSSGEIYFSDSENHRIRKISTGGIISTVAGSG 555



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 63/129 (48%), Gaps = 19/129 (14%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKIT- 131
           +S   ++  +AG+G  GY +GE  +A   + + P    V   G++++AD  NH IRK+  
Sbjct: 878 VSLNGIITTIAGNGTAGY-NGEGNNASLYQLNYPSGLVVSSTGDLFIADSYNHRIRKLNV 936

Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTF-----VPHICALLISDHGNQLIRQI 186
           N  ++T AG G        G + + S     +L F     V  +  L ISD  N  IR++
Sbjct: 937 NGTISTSAGNG------IAGLSGDGSLPTSAQLNFPTGLAVSSVGELYISDDSNNRIRKV 990

Query: 187 NLKPEDCSK 195
           +L    CS+
Sbjct: 991 SLT---CSQ 996



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 55/109 (50%), Gaps = 4/109 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
           V   AG G   +S DG   + +  + P    V+  G  ++AD SNHVIRK+ TN  ++TI
Sbjct: 69  VSTFAGVGTSSFSGDGALATQSEINFPYGVIVNSLGETFIADTSNHVIRKVSTNGKISTI 128

Query: 139 AGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG  S      DG  A NA  ++ + L        ++I D  N +IR++
Sbjct: 129 AGTASSYGYSGDGGLATNALLNSPYGLALNSSSGEVIIVDTSNNVIRKV 177



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 53/100 (53%), Gaps = 6/100 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFD 104
           P +V     ++D+ + DS       +S   S   ++  +AG G  G+S  +G   +A+ +
Sbjct: 634 PFAVAVNLTNNDIYIADSGNHRIRKVS---SSSGIITTVAGTGTSGFSGDNGLATNAKLN 690

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
            P S ++   G IY++D+ N  +RK+   G ++TIAG G+
Sbjct: 691 FPFSISIGNSGEIYISDQYNQRVRKVAANGYISTIAGSGA 730



 Score = 41.2 bits (95), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 28/113 (24%), Positives = 54/113 (47%), Gaps = 8/113 (7%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSAR-FDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++    +  +AG G  G++ DG   ++  F+ P   + +  G++++ D  N  IRK+++ 
Sbjct: 716 VAANGYISTIAGSGAIGFNGDGLAATSTCFNYPSGVSSNSNGDVFIIDSFNSRIRKLSSG 775

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            ++T+AG      G  DG +   S+ N            + I+D  N  IR+I
Sbjct: 776 KISTVAG------GLGDGSSAVNSYLNSQSFAISSKSGEIFIADSNNHRIRKI 822



 Score = 37.0 bits (84), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 42/92 (45%), Gaps = 1/92 (1%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQ 154
           DG  G+A    P   A+ +   I +AD  N  IRK+++  V+T AG G+      DG   
Sbjct: 29  DGMTGNALLRYPYGIALGLNSEILIADTFNQRIRKVSSSDVSTFAGVGTSSF-SGDGALA 87

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             S  N      V  +    I+D  N +IR++
Sbjct: 88  TQSEINFPYGVIVNSLGETFIADTSNHVIRKV 119


>gi|290973961|ref|XP_002669715.1| predicted protein [Naegleria gruberi]
 gi|284083266|gb|EFC36971.1| predicted protein [Naegleria gruberi]
          Length = 710

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 57/105 (54%), Gaps = 5/105 (4%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG+G +G+S  +G   +A+ + P    V     +Y+AD SNHVIRKI+  G + TIAG 
Sbjct: 144 IAGNGTKGFSGDNGPATNAQLNGPAGVFVS-NNEVYIADYSNHVIRKISQNGTIVTIAGN 202

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G       +G A NA   N    TFV     + ISD  N +IR+I
Sbjct: 203 GKPGFSGDNGLATNAQLYNP-SGTFVSSNNEVYISDCFNHVIRKI 246



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 59/112 (52%), Gaps = 5/112 (4%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           +   +  +AG+G  G+   +G   +A+ + P++  V     +Y+AD  N  IRK+  N  
Sbjct: 81  QNGTIVTIAGNGKPGFGGDNGLATNAQLNYPRNVYVSSNNEVYIADFCNQRIRKVLQNGN 140

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+K     +GPA NA   N     FV +   + I+D+ N +IR+I
Sbjct: 141 IITIAGNGTKGFSGDNGPATNAQL-NGPAGVFVSN-NEVYIADYSNHVIRKI 190



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S+   +  +AG+G  G+S  +G   +A+   P    V     +Y++D  NHVIRKI   
Sbjct: 190 ISQNGTIVTIAGNGKPGFSGDNGLATNAQLYNPSGTFVSSNNEVYISDCFNHVIRKILQN 249

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G + TIAG G       +G A NA   +   + FV     + ISD  N  IR++
Sbjct: 250 GTIVTIAGNGKGGFSGDNGLATNAQLYSPLGV-FVSSNNEVYISDCFNHRIRKV 302



 Score = 38.1 bits (87), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 46/98 (46%), Gaps = 17/98 (17%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +   +  +AG+G  G+S  +G   +A+   P    V     +Y++D  NH IRK+ + G 
Sbjct: 248 QNGTIVTIAGNGKGGFSGDNGLATNAQLYSPLGVFVSSNNEVYISDCFNHRIRKVLHNGN 307

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND--FELTFVPHI 170
           + TIAG G+              FS D  F+++  PH 
Sbjct: 308 IVTIAGNGT------------PGFSGDSPFDISLYPHF 333



 Score = 37.4 bits (85), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 27/49 (55%), Gaps = 1/49 (2%)

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSK 144
           E  SA+ + P S  V     +++AD SN  IRK+  N  + TIAG G K
Sbjct: 2   EATSAQLNNPFSVFVSSNNEVFIADYSNQSIRKVLQNGNIVTIAGNGDK 50


>gi|323343289|ref|ZP_08083516.1| hypothetical protein HMPREF0663_10051 [Prevotella oralis ATCC
           33269]
 gi|323095108|gb|EFZ37682.1| hypothetical protein HMPREF0663_10051 [Prevotella oralis ATCC
           33269]
          Length = 436

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 47/182 (25%), Positives = 76/182 (41%), Gaps = 23/182 (12%)

Query: 30  DGYTVTTVIDGHQLEINPHSVIDRP------GSSDLIVLDSSRSAFYTLSFP-------- 75
           DGY  T    G  ++INP +    P      G S  +  +        +SF         
Sbjct: 256 DGYIYTRYFYGDIIKINPRTYEVTPVYKTQEGDSYGLTFNPLHPNILYMSFRENAGTMAN 315

Query: 76  -------LSEESVVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
                     E+  +RL+   + G + DG+   A+F  P     D  GNIYVAD++NH I
Sbjct: 316 SICSIDVTDPENTYRRLSSSNISGGHRDGKLEKAQFHNPTQIYCDADGNIYVADRNNHCI 375

Query: 128 RKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           R+I+   +     G  + +G  DG    A F N+     +    A+ ++D GN  +R++ 
Sbjct: 376 RRISPDDMVETVLGMPETKGWKDGAKSEALF-NEPTGIGIGKDGAVYVADWGNGRVRKLT 434

Query: 188 LK 189
           + 
Sbjct: 435 IN 436


>gi|322435234|ref|YP_004217446.1| hypothetical protein AciX9_1613 [Granulicella tundricola MP5ACTX9]
 gi|321162961|gb|ADW68666.1| NHL repeat containing protein [Granulicella tundricola MP5ACTX9]
          Length = 781

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 7/112 (6%)

Query: 81  VVKRLAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGV 135
           ++  +AG+G   ++       SA    P+   +D  GN+ +AD +NH IR+I   T L +
Sbjct: 225 IITTIAGNGTPAFTSDNIAATSATLYLPRGITLDPSGNLLIADSANHRIRRIDAVTGL-I 283

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           TT+AG G++       PA  AS      +   P     L +D  NQ IRQI+
Sbjct: 284 TTLAGDGTQTYAGDSTPATTASLDTPRAIALSPATLPTL-TDTANQRIRQID 334



 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 58/115 (50%), Gaps = 5/115 (4%)

Query: 77  SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TN 132
           +   ++  +AG G  G S DG   + A+ D P + A+D   N+Y+AD  NH+IR++  T 
Sbjct: 107 AATQIITTVAGTGTPGRSPDGTLATKAQLDTPTAIALDSSQNLYIADTRNHIIRRVDATT 166

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             +TT+AG G++      GPA  A       L        L ++D  N  IR+I+
Sbjct: 167 HLITTLAGTGTQGFSGDAGPALAAQIDTPTGLALDAS-NNLYLADTHNHRIRRID 220



 Score = 37.7 bits (86), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 46/98 (46%), Gaps = 4/98 (4%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG--PAQNASFSNDFE 163
           P     D +GN+Y+A+ + H+I +++  G  TI  G   +    DG  P Q A   +   
Sbjct: 25  PSGLVYDAQGNLYLAETAAHLILRVSPSGALTIVAGTGTQGYAGDGTLPTQ-ALLDSPTA 83

Query: 164 LTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
           L   P    L ++D  N  IR+I+   +  +  + +G+
Sbjct: 84  LAITP-TGDLYLADTHNHAIRRIDAATQIITTVAGTGT 120


>gi|451982091|ref|ZP_21930422.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
 gi|451760645|emb|CCQ91702.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
          Length = 724

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/120 (29%), Positives = 63/120 (52%), Gaps = 14/120 (11%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           + +  ++  +AG+G  G+S  +G    A  + P    +D+KGN+Y+AD++NH +R++   
Sbjct: 81  IGKNGIITTVAGNGNAGFSGDEGPALEAALNFPAGLCLDLKGNLYIADRNNHRVRRVDTK 140

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQIN 187
           G +TT+AG G    G  +GPA+      +  L F   +       + ISD  N  + ++N
Sbjct: 141 GIITTVAGTGEPDFGMEEGPAE------EIPLHFPSDVACDSQGQVYISDRSNNRVLKMN 194



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 6/132 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++ + ++  +AG+G   +S G+ G A       P     D +GN+Y+AD++N+ +RK+  
Sbjct: 249 VTPDGIITTVAGEGTHFFS-GDFGPATRCSLAYPTDVVTDDRGNLYIADRNNNRVRKVDT 307

Query: 133 LGVTTIAGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
            G+ T   G  K E   D   A   S    F L F P    LLI D  +  +R ++L   
Sbjct: 308 NGIITTVMGTGKNEYNGDNEIASETSLHLPFALAFTPD-QHLLIVDRNHHRVRSMHLVQH 366

Query: 192 DCSKSSQSGSAL 203
                + +G AL
Sbjct: 367 TVQTVAGNGQAL 378



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 3/68 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           + +  ++  +AG GVQG+S DG P   A+   P   A D +GN+Y  D+ N+ +R++   
Sbjct: 636 VDKNGIITTVAGTGVQGFSGDGGPAVQAQLKSPYRMAFDSEGNLYFTDRDNNRVRRVDTH 695

Query: 134 G-VTTIAG 140
           G +TT+AG
Sbjct: 696 GTITTLAG 703



 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 54/113 (47%), Gaps = 8/113 (7%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +  +  ++ LA D      DGE        P   AV+ KG ++V+D  +  IRKI N G 
Sbjct: 532 VDAQGSIQTLAKDAWLAIEDGEV------HPNGLAVNDKGEVFVSDSGSSKIRKIDNEGN 585

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           VTT AG GS ++    GPA  A   +   L F P    L IS+     IR+++
Sbjct: 586 VTTYAGDGSFEDKGDGGPALLAGIRSPGGLVFSPS-GELYISEENTHRIRKVD 637



 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 12/86 (13%)

Query: 110 AVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
           AVD +GNI+++ +S + IRKI   G +TT+AG G+      +GPA  A+      L F  
Sbjct: 61  AVDKEGNIFISHRSKNRIRKIGKNGIITTVAGNGNAGFSGDEGPALEAA------LNFPA 114

Query: 169 HIC-----ALLISDHGNQLIRQINLK 189
            +C      L I+D  N  +R+++ K
Sbjct: 115 GLCLDLKGNLYIADRNNHRVRRVDTK 140



 Score = 44.7 bits (104), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 57/115 (49%), Gaps = 4/115 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  +AG G+ GY    G    A    P    VD  GN+Y+AD+ N+ +RK+T  
Sbjct: 193 MNPQGQIVTVAGLGMAGYGGDFGPAIDALLKYPFGIHVDESGNLYIADRGNNRVRKVTPD 252

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G +TT+AG G+       GPA   S +   ++        L I+D  N  +R+++
Sbjct: 253 GIITTVAGEGTHFFSGDFGPATRCSLAYPTDVV-TDDRGNLYIADRNNNRVRKVD 306



 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 52/115 (45%), Gaps = 4/115 (3%)

Query: 76  LSEESVVKRLAGDG-VQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +  E  V   AGDG  +   DG P   A    P        G +Y+++++ H IRK+   
Sbjct: 580 IDNEGNVTTYAGDGSFEDKGDGGPALLAGIRSPGGLVFSPSGELYISEENTHRIRKVDKN 639

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G +TT+AG G +      GPA  A   + + + F      L  +D  N  +R+++
Sbjct: 640 GIITTVAGTGVQGFSGDGGPAVQAQLKSPYRMAFDSE-GNLYFTDRDNNRVRRVD 693



 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 2/82 (2%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
           P   A D +G +Y++D+SN+ + K+   G + T+AG G    G   GPA +A     F +
Sbjct: 169 PSDVACDSQGQVYISDRSNNRVLKMNPQGQIVTVAGLGMAGYGGDFGPAIDALLKYPFGI 228

Query: 165 TFVPHICALLISDHGNQLIRQI 186
             V     L I+D GN  +R++
Sbjct: 229 -HVDESGNLYIADRGNNRVRKV 249


>gi|307202136|gb|EFN81636.1| NHL repeat-containing protein 2 [Harpegnathos saltator]
          Length = 688

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 20/143 (13%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           LI+ DS  S     +    +   ++ + G   QG+ DG+  SA+F+ P+   + +   IY
Sbjct: 236 LIISDSGNSRILITN----QHGEIEHIIGGTNQGFEDGDFKSAKFNSPQGVCM-LDDVIY 290

Query: 119 VADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADG----------PAQNASFSNDFELTF 166
           VAD +NH IRKI  +   V+TIAG GS+   R  G          P   A + ++++ T 
Sbjct: 291 VADNNNHAIRKINLSEKSVSTIAGTGSQSHDRCGGRSGTNQDLSSPWDVAIYYHEYKNTT 350

Query: 167 VPHICALLISDHGNQLIRQINLK 189
           VP    LLI+  G   I  + LK
Sbjct: 351 VP---VLLIAIAGTHQIWALFLK 370



 Score = 38.5 bits (88), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 7/104 (6%)

Query: 92  GYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
           G  DG+  SA+   P   A + ++  IYVAD  NH I+ +       +A G  +      
Sbjct: 459 GDMDGKEYSAKLQHPLGIAWNHLEKQIYVADTYNHKIKSVD------VATGYCRTLFGDR 512

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
            P    SF+   +L   P    L ++D  N  ++ INLK ++ S
Sbjct: 513 KPNSMFSFNEPSDLAVSPDGNILYVADTNNHALKVINLKSKEIS 556


>gi|290982352|ref|XP_002673894.1| predicted protein [Naegleria gruberi]
 gi|284087481|gb|EFC41150.1| predicted protein [Naegleria gruberi]
          Length = 2807

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 40/109 (36%), Positives = 56/109 (51%), Gaps = 4/109 (3%)

Query: 82  VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  +AG G  GYS   G   +A+   P++ AV   G IY+AD  NH IRKI   G ++TI
Sbjct: 572 ISTIAGTGSVGYSGDGGLATNAQLYYPQTVAVSSSGEIYIADAYNHRIRKINTSGYISTI 631

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           AG GS       G A +A     F +  +  +  + I+D  N  IR+IN
Sbjct: 632 AGTGSVGYSGDGGLATSAQLYYPFSVA-ISSVGEIYIADTYNHRIRKIN 679



 Score = 53.5 bits (127), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 32/97 (32%), Positives = 51/97 (52%), Gaps = 2/97 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRAD 150
           G+ DG+  +  F    SF   + G I +AD +NH IRKIT LG ++TI+G G+      +
Sbjct: 802 GFGDGDMATTSFISAYSFEFTLNGEIIIADSNNHRIRKITTLGYISTISGTGTAGYNGDE 861

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             A N+  +N   +    +   + I+D  N  IR++N
Sbjct: 862 ILATNSQLNNPNGIALSSN-SEIYIADTNNHRIRKVN 897



 Score = 50.8 bits (120), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 36/95 (37%), Positives = 48/95 (50%), Gaps = 14/95 (14%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           SA+ + P+S A+   G IY+AD +NH IRKI   G ++TIAG GS       G A NA  
Sbjct: 536 SAQLNYPRSVAISSSGEIYIADFNNHRIRKINISGYISTIAGTGSVGYSGDGGLATNAQL 595

Query: 159 SNDFELTFVPHICA------LLISDHGNQLIRQIN 187
                  + P   A      + I+D  N  IR+IN
Sbjct: 596 -------YYPQTVAVSSSGEIYIADAYNHRIRKIN 623



 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 2/89 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           SA+ + P S AV   G IY+ D +N+ IRKI   G ++TIAG G+         A +A  
Sbjct: 704 SAQLNYPFSVAVSSVGEIYIVDTNNYRIRKINTSGYISTIAGTGTGGYNGDSILATSAQL 763

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
           +  + LT +     ++++D+ N  IR+IN
Sbjct: 764 NYPYGLT-ISSTSEIIVADYYNHRIRKIN 791



 Score = 47.0 bits (110), Expect = 0.009,   Method: Composition-based stats.
 Identities = 36/129 (27%), Positives = 63/129 (48%), Gaps = 7/129 (5%)

Query: 82  VKRLAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  ++G G  GY+  E    +++ + P   A+     IY+AD +NH IRK+   G ++TI
Sbjct: 846 ISTISGTGTAGYNGDEILATNSQLNNPNGIALSSNSEIYIADTNNHRIRKVNASGYISTI 905

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
           AG G+      DG    ++  N      +     +LI+D+ N  IR+I     +   S+ 
Sbjct: 906 AGTGTGGYN-GDGVLATSAQLNYPNGIAIQENGEILIADNNNHRIRKIR---TNGYISTI 961

Query: 199 SGSALGAVS 207
           +GS +G  +
Sbjct: 962 AGSGIGGFT 970



 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 35/108 (32%), Positives = 50/108 (46%), Gaps = 10/108 (9%)

Query: 82  VKRLAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  +AG G  GY+       SA+ + P    +     I VAD  NH IRKI   G ++TI
Sbjct: 740 ISTIAGTGTGGYNGDSILATSAQLNYPYGLTISSTSEIIVADYYNHRIRKINTSGYISTI 799

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG      G  DG     SF + +   F  +   ++I+D  N  IR+I
Sbjct: 800 AG------GFGDGDMATTSFISAYSFEFTLN-GEIIIADSNNHRIRKI 840



 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 41/109 (37%), Positives = 55/109 (50%), Gaps = 4/109 (3%)

Query: 82  VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  +AG G  GYS   G   SA+   P S A+   G IY+AD  NH IRKI   G ++TI
Sbjct: 628 ISTIAGTGSVGYSGDGGLATSAQLYYPFSVAISSVGEIYIADTYNHRIRKINTSGYISTI 687

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +G GS       G A +A  +  F +  V  +  + I D  N  IR+IN
Sbjct: 688 SGTGSGGYSGDGGLATSAQLNYPFSVA-VSSVGEIYIVDTNNYRIRKIN 735



 Score = 45.4 bits (106), Expect = 0.027,   Method: Composition-based stats.
 Identities = 38/117 (32%), Positives = 53/117 (45%), Gaps = 21/117 (17%)

Query: 82  VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  +AG G  GYS   G   SA+   P   ++   G IY+AD  NH IRK+T  G ++TI
Sbjct: 251 ISTVAGSGTAGYSGDGGLATSAQLYYPLGVSISSAGEIYIADYYNHRIRKVTTSGYISTI 310

Query: 139 AG--------GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           AG        GG     +   P   ++ SN            + I+D  N  IR+IN
Sbjct: 311 AGTTYGFSGDGGLATSAQLYYPNGVSTGSNG----------EIFIADTYNYRIRKIN 357



 Score = 43.1 bits (100), Expect = 0.13,   Method: Composition-based stats.
 Identities = 38/113 (33%), Positives = 60/113 (53%), Gaps = 5/113 (4%)

Query: 80  SVVKRLAGDG-VQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
           SV  ++AG G + GY  DG    SAR + P   A    G++Y+AD +N+ IRK+T  G +
Sbjct: 457 SVASKVAGTGTLNGYGGDGNLATSARLNYPYGTAFSSNGDMYIADTNNNRIRKVTTSGYI 516

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           +T+AG G+       G A +A  +    +  +     + I+D  N  IR+IN+
Sbjct: 517 STVAGSGTGGYSGDGGLATSAQLNYPRSVA-ISSSGEIYIADFNNHRIRKINI 568



 Score = 42.7 bits (99), Expect = 0.15,   Method: Composition-based stats.
 Identities = 27/88 (30%), Positives = 46/88 (52%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           SA+ + P S ++   G IY+AD  N  IRK+T  G ++T+AG G+       G A +A  
Sbjct: 215 SAQVNYPFSVSISSTGEIYIADYYNQRIRKVTTYGFISTVAGSGTAGYSGDGGLATSAQL 274

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
                ++ +     + I+D+ N  IR++
Sbjct: 275 YYPLGVS-ISSAGEIYIADYYNHRIRKV 301



 Score = 40.0 bits (92), Expect = 1.1,   Method: Composition-based stats.
 Identities = 31/88 (35%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
           SA+   PK  A+     IY+AD  NH IRK+ TN  ++T+AG G+   G   G A +A  
Sbjct: 159 SAQLYYPKGVAISPSNEIYIADTYNHRIRKVNTNGYISTVAGSGTGGYGGDGGLATSAQV 218

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           +  F ++ +     + I+D+ NQ IR++
Sbjct: 219 NYPFSVS-ISSTGEIYIADYYNQRIRKV 245



 Score = 39.7 bits (91), Expect = 1.2,   Method: Composition-based stats.
 Identities = 50/198 (25%), Positives = 88/198 (44%), Gaps = 28/198 (14%)

Query: 10  LAITLLSLLFSLVSSGLLLEDG----YTVTTVIDG------------HQLEINPHSVIDR 53
           + + LL++L  +V S   ++      Y +TT+  G             QL I     I  
Sbjct: 1   MKVVLLAILLCIVMSAKFIDTQSLLLYNITTIAGGGSVVGDGLLATNAQLAIPAGVAISS 60

Query: 54  PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFA 110
            G  ++I+ DS+ +    +S        +  +AG G   +  G+ G   +A+ + P + +
Sbjct: 61  IG--EIIIADSNNNRIRKIS----TNGYISTIAGTGSASFG-GDGGLAINAQLNSPSAVS 113

Query: 111 VDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
           +   G++Y++D  N  IRKI TN  ++TIAG G+       G A +A       +   P 
Sbjct: 114 ISSNGDMYISDTGNQRIRKISTNGYISTIAGSGTGGYSGDGGLATSAQLYYPKGVAISPS 173

Query: 170 ICALLISDHGNQLIRQIN 187
              + I+D  N  IR++N
Sbjct: 174 -NEIYIADTYNHRIRKVN 190


>gi|290978635|ref|XP_002672041.1| predicted protein [Naegleria gruberi]
 gi|284085614|gb|EFC39297.1| predicted protein [Naegleria gruberi]
          Length = 706

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 39/119 (32%), Positives = 61/119 (51%), Gaps = 5/119 (4%)

Query: 75  PLSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT 131
           P   E  +  +AG+G++G+   +G    A+   P   AV+   N +Y+AD  NH IRK++
Sbjct: 5   PKGMEQTIWTIAGNGIKGFYGDNGLAIEAKLSSPYGVAVNPYSNDVYIADYRNHCIRKVS 64

Query: 132 NLG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            L   +TTIAG G        G A +A  +    ++  P    L I+D+ N  IR+I+L
Sbjct: 65  ALDNKITTIAGTGVAGYSGDGGLALHAQLNCPLSVSIHPKNEELYIADYSNHRIRKISL 123



 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 70/143 (48%), Gaps = 14/143 (9%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARF 103
           P  V   P + D+ ++D   +    +S   +   ++   AG+G  G+  GE G   +A+ 
Sbjct: 154 PQCVKIHPITFDIYIIDFVNNKIRRIS---NSSGIISTFAGNGTAGFC-GEGGFATNAQL 209

Query: 104 DKPKSFAVD-MKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSN 160
           + P     +   G++Y+AD +NH +RK+      +TT+AG G  K G +DG   +A  + 
Sbjct: 210 NGPSGLDFNPTTGDVYIADSNNHRVRKVNCKSGIITTLAGTG--KAGYSDGI--DAQLNY 265

Query: 161 DFELTFVPHICALLISDHGNQLI 183
            ++++F      + ++D  N  I
Sbjct: 266 PYDVSFCTRGQIIYVTDRSNNRI 288



 Score = 45.4 bits (106), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 68/149 (45%), Gaps = 10/149 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK- 105
           P SV   P + +L + D S      +S      S V   AG+   G S G+ G A F + 
Sbjct: 96  PLSVSIHPKNEELYIADYSNHRIRKISLTYGTISTV---AGNNTGGTS-GDGGLALFAQL 151

Query: 106 --PKSFAVD-MKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSN 160
             P+   +  +  +IY+ D  N+ IR+I+N    ++T AG G+       G A NA  + 
Sbjct: 152 YFPQCVKIHPITFDIYIIDFVNNKIRRISNSSGIISTFAGNGTAGFCGEGGFATNAQLNG 211

Query: 161 DFELTFVPHICALLISDHGNQLIRQINLK 189
              L F P    + I+D  N  +R++N K
Sbjct: 212 PSGLDFNPTTGDVYIADSNNHRVRKVNCK 240


>gi|239989050|ref|ZP_04709714.1| hypothetical protein SrosN1_17240 [Streptomyces roseosporus NRRL
           11379]
 gi|291446053|ref|ZP_06585443.1| alkyl hydroperoxide reductase/Thiol specific antioxidant/Mal
           allergen [Streptomyces roseosporus NRRL 15998]
 gi|291349000|gb|EFE75904.1| alkyl hydroperoxide reductase/Thiol specific antioxidant/Mal
           allergen [Streptomyces roseosporus NRRL 15998]
          Length = 605

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 58/112 (51%), Gaps = 7/112 (6%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
            +V D++R     L    ++   V+R  G G +G +DG P  ARF +P+  AV   G + 
Sbjct: 188 FLVSDTTRHRLVELD---ADGETVRRHFGTGERGLNDGGPDEARFSEPQGLAVLPDGRVA 244

Query: 119 VADKSNHVIRKIT-NLGVT-TIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
           VAD  NH IR +    GVT T+AG G +  +     GPA+    S+ ++L +
Sbjct: 245 VADTVNHAIRALDLTTGVTSTLAGTGRQWWQGTPTSGPAREVDLSSPWDLAW 296


>gi|375146761|ref|YP_005009202.1| NHL repeat containing protein [Niastella koreensis GR20-10]
 gi|361060807|gb|AEV99798.1| NHL repeat containing protein [Niastella koreensis GR20-10]
          Length = 439

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 83/196 (42%), Gaps = 19/196 (9%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEI---NPHSVI----DRPGSSDLIVLDSSRSAFYTLS 73
           L S    + D   +T  ++G  L++   N   ++     + GS   +V   + SA  T  
Sbjct: 55  LFSGSNYINDTNQITVTLNGKPLKVVGVNGSQIMAVVPKKAGSGHFVVKIGADSAVSTGI 114

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFD-------KPKSFAVDMKGNIYVADKSNHV 126
           F       V   AG G  G+++G    A FD       +     VD   N+YVAD  NH 
Sbjct: 115 FNYVYTRTVSTFAGSGKAGFANGMGTDAMFDFGGQSWYRSMGIIVDDNLNLYVADPGNHC 174

Query: 127 IRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           IRKI +N  VTT  G  S   G ADG    A FS  +++ F      +   D  N  IR+
Sbjct: 175 IRKIDSNANVTTFCGSPS-GAGYADGKGTAAKFSLPYDVAFDAQ-GNIWCVDPANWDIRK 232

Query: 186 INLKPEDCSKSSQSGS 201
           I   P+  + +   GS
Sbjct: 233 I--APDGTATTWAWGS 246



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/109 (35%), Positives = 51/109 (46%), Gaps = 14/109 (12%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA---------DKSNHVIRKI--TNLGVTT 137
           GV GY +G    A+FD P   A+D  GN+YVA           S+  IR I   +  V+T
Sbjct: 331 GVPGYVNGAAAVAKFDHPWGLAIDTAGNLYVAGNGTYDGNTGNSDQAIRYIEAASFNVST 390

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            AG GS   G AD   + A+FS    +  V     L + D  N  IR+I
Sbjct: 391 FAGSGS--AGYADAVGEAAAFSAPTGIA-VDKNGTLYVLDKNNNRIRKI 436



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 28/49 (57%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V   AG G  GY+D    +A F  P   AVD  G +YV DK+N+ IRKI
Sbjct: 388 VSTFAGSGSAGYADAVGEAAAFSAPTGIAVDKNGTLYVLDKNNNRIRKI 436



 Score = 43.5 bits (101), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 30/53 (56%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           GY+DG+  +A+F  P   A D +GNI+  D +N  IRKI   G  T    GS+
Sbjct: 195 GYADGKGTAAKFSLPYDVAFDAQGNIWCVDPANWDIRKIAPDGTATTWAWGSQ 247


>gi|88601128|gb|ABD46562.1| NHL repeat-containing protein [Vermamoeba vermiformis]
          Length = 309

 Score = 56.6 bits (135), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 33/85 (38%), Positives = 45/85 (52%), Gaps = 1/85 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++ + VV  LAG  G  G  DG    ARF+ P    VD  GN+YVAD  N+ +RK+T  G
Sbjct: 29  ITPDGVVVTLAGHPGHWGSRDGVGSKARFNGPSGLDVDTDGNVYVADYYNNTMRKVTPEG 88

Query: 135 VTTIAGGGSKKEGRADGPAQNASFS 159
           + T   G   + G  DG  + A F+
Sbjct: 89  IVTTIAGHVGQWGSTDGSGEKARFN 113



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 64/114 (56%), Gaps = 11/114 (9%)

Query: 82  VKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           V  +AG  +Q G +DG+  +ARF++P   +VD  GN++V+D  NH IRKI+ LG VTTIA
Sbjct: 202 VTTIAGRVMQPGSADGKLTAARFNQPSGISVDKIGNLFVSDYYNHTIRKISPLGEVTTIA 261

Query: 140 GGGSKK---EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
           G    +   EG  D    N  F N  +     +IC     D  N ++R++ ++P
Sbjct: 262 GMFGHQGAVEGFGDHIRLNHPFRNTIDSAGNLYIC-----DEYNSIVRKL-IRP 309



 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 47/95 (49%), Gaps = 1/95 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G  DG    ARF  P   AV   G I+V D+ NH IR I+ +G  T   G   + G ADG
Sbjct: 158 GSEDGNGQQARFFGPSGIAVAPDGTIFVCDRYNHTIRSISIVGEVTTIAGRVMQPGSADG 217

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               A F+    ++ V  I  L +SD+ N  IR+I
Sbjct: 218 KLTAARFNQPSGIS-VDKIGNLFVSDYYNHTIRKI 251



 Score = 40.0 bits (92), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 39/81 (48%), Gaps = 1/81 (1%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P+  A+D   NIY+ + +N+ IRKIT  GV     G     G  DG    A F+    L 
Sbjct: 5   PRGIAIDKHDNIYICEWNNNTIRKITPDGVVVTLAGHPGHWGSRDGVGSKARFNGPSGLD 64

Query: 166 FVPHICALLISDHGNQLIRQI 186
            V     + ++D+ N  +R++
Sbjct: 65  -VDTDGNVYVADYYNNTMRKV 84


>gi|393782357|ref|ZP_10370541.1| hypothetical protein HMPREF1071_01409 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673185|gb|EIY66648.1| hypothetical protein HMPREF1071_01409 [Bacteroides salyersiae
           CL02T12C01]
          Length = 446

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 51/98 (52%), Gaps = 1/98 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY+DG    A F++P+   V     +Y+AD +NHVIRKI+  G+ +   G + + G  DG
Sbjct: 350 GYADGPCEYAMFNEPRQMIVSADNELYLADTNNHVIRKISQDGIVSTVIGLAGQSGFMDG 409

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             + A F   F +        + I D  NQ +R++ ++
Sbjct: 410 TPEEALFDKPFGVALDTD-GTIYIGDSENQCVRRLAIE 446


>gi|251771416|gb|EES51995.1| conserved protein of unknown function [Leptospirillum
           ferrodiazotrophum]
          Length = 373

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 58/117 (49%), Gaps = 3/117 (2%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRK--ITNLGVTT 137
            V  LAG    G +DG  G+A F  P++ A+   G  +Y++D  N++IRK  +    VTT
Sbjct: 148 AVTTLAGRAFPGSNDGTGGAAGFYGPRALALTPDGKTLYISDSGNNMIRKLDVATGTVTT 207

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           +AG G+   G  DG    ASF +   +        L ++D  N LIR+I L     S
Sbjct: 208 VAGKGALAPGMDDGIGGAASFRDPRGIALSSDGSVLYVADTRNNLIRKIVLATNAVS 264



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 56/112 (50%), Gaps = 5/112 (4%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRA 149
           + DG   +ARF+ P+  A+   G  +Y+AD  N+ IRK  +++  VTT+AG      G  
Sbjct: 104 FRDGNGTAARFNNPEGLAISSDGKTLYIADSRNNRIRKMDLSSGAVTTLAG--RAFPGSN 161

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
           DG    A F     L   P    L ISD GN +IR++++     +  +  G+
Sbjct: 162 DGTGGAAGFYGPRALALTPDGKTLYISDSGNNMIRKLDVATGTVTTVAGKGA 213



 Score = 47.8 bits (112), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 48/102 (47%), Gaps = 5/102 (4%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTTIAGGGSKKE 146
           +G  DG    ARF+  +       G +YVAD  N +IRK+T  G    VTTIAG      
Sbjct: 44  RGAVDGTGSQARFEYVQGIVAAPDGTLYVADTGNDLIRKVTVSGGTATVTTIAGVNHHAR 103

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            R DG    A F+N   L        L I+D  N  IR+++L
Sbjct: 104 FR-DGNGTAARFNNPEGLAISSDGKTLYIADSRNNRIRKMDL 144


>gi|307110397|gb|EFN58633.1| hypothetical protein CHLNCDRAFT_140870 [Chlorella variabilis]
          Length = 415

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 90/202 (44%), Gaps = 25/202 (12%)

Query: 54  PGSSDL--IVLDSSRSAFYTLSFPLSEESVVK---------RLAG-DGVQGYSDGEPGSA 101
           PGS     I +D  R   Y  +   S  +V +          +AG DG  GY DG  G A
Sbjct: 42  PGSGRFHNIAVDDPRGPHYVYAVDTSRHTVGRIALDTGTWETVAGLDGTPGYRDGPEGPA 101

Query: 102 R----FDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQN 155
                F++P +      G++ VAD  N  IR+I      V+T+A G   + G ADGPA  
Sbjct: 102 GQPALFNRPSAICQMPHGHLAVADTGNACIRQIDAATKQVSTLA-GRCGEPGAADGPAAE 160

Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC-SKSSQSGSALGAVSV-WVLVS 213
           A F +  +     + C++ + D     +R + +   +C   S+ SG  +   SV W+L  
Sbjct: 161 AQFGSSIKSIACAN-CSVFVGDVSTGRLRLVRVDDAECLGASNPSGRFITRESVKWLLAG 219

Query: 214 VLSCLVSLVIGFVARPYIIRHT 235
           V+  L  L +G   R Y+ RH 
Sbjct: 220 VV-MLSILALG--GRRYLQRHA 238


>gi|182437571|ref|YP_001825290.1| hypothetical protein SGR_3778 [Streptomyces griseus subsp. griseus
           NBRC 13350]
 gi|178466087|dbj|BAG20607.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
           NBRC 13350]
          Length = 605

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 58/112 (51%), Gaps = 7/112 (6%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           L+V D++R     L    ++   V+R  G G +G  DG P  ARF +P+  AV   G + 
Sbjct: 188 LLVSDTTRHRLVELD---ADGETVRRHFGTGERGLRDGGPDEARFSEPQGLAVLPDGRVA 244

Query: 119 VADKSNHVIRKIT-NLGVT-TIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
           VAD  NH IR +    GVT T+AG G +  +     GPA+    S+ +++ +
Sbjct: 245 VADTVNHAIRALDLTTGVTSTLAGTGRQWWQGTPTSGPAREVDLSSPWDVAW 296


>gi|456874551|gb|EMF89837.1| hypothetical protein LEP1GSC005_3835 [Leptospira santarosai str.
           ST188]
          Length = 356

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI  N G V+T++GG     
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TL 217

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   +T+     +LL++D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274



 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 78/203 (38%), Gaps = 48/203 (23%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I LD      Y     ++   VV   AG G++   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
           P    VD  GNI+V+D+  ++IRKI   G VTT+                          
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTSLLLEDPSGIKFDPITGDKYVSCK 138

Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
                            AG  S  +G  +G   N+SF   F +  +     L + + GN 
Sbjct: 139 DGAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197

Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
            IR+INL     S S+ SG  LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGTLG 218



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 6/81 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G  GY DG+  SA+F  P     D K + + VAD  +H IRKI   N  V+T 
Sbjct: 209 VSTLSG-GTLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267

Query: 139 AGGGSKKEGRADGPAQNASFS 159
            G G   +   DG   NAS +
Sbjct: 268 VGNGIGAD--IDGNGLNASLN 286


>gi|290977087|ref|XP_002671270.1| predicted protein [Naegleria gruberi]
 gi|284084837|gb|EFC38526.1| predicted protein [Naegleria gruberi]
          Length = 1095

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 53/161 (32%), Positives = 75/161 (46%), Gaps = 13/161 (8%)

Query: 31  GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
           G+T  T ++   L  +P  +     +SDL + D        +S        +  +AG G 
Sbjct: 319 GFTDNTTVNLATLA-SPELMNFAKNTSDLYIADKGNHRIRKIS-----NGFITTIAGQGS 372

Query: 91  QGYSDGEPGSARFD---KPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKK 145
             Y  GE   +R     KPK  A+D  GNIY+AD SNH +RKI+ L   +TTIAG GS  
Sbjct: 373 PSYC-GENVDSRLSALSKPKGAALDSLGNIYIADTSNHRVRKISYLDGTITTIAGTGSFG 431

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
                  A +A  +    + F   I  + I+D GN  IR+I
Sbjct: 432 YNGDGILATSAQVNKPTGIAF-DSIGNIYIADSGNNRIRKI 471



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 43/111 (38%), Positives = 57/111 (51%), Gaps = 4/111 (3%)

Query: 79  ESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV 135
           +  +  +AG G  GY+ DG    SA+ +KP   A D  GNIY+AD  N+ IRKI TN  +
Sbjct: 418 DGTITTIAGTGSFGYNGDGILATSAQVNKPTGIAFDSIGNIYIADSGNNRIRKILTNGTI 477

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            TIAG G         PA NA  ++   +T V     + I+D  N  IR I
Sbjct: 478 LTIAGVGLGGYNGDGIPATNAKLNSPVSVT-VDSNDLVYITDTYNHRIRLI 527



 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 48/144 (33%), Positives = 67/144 (46%), Gaps = 9/144 (6%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DG-EPGSARF 103
           NP S    P  S L + DS+      + FP      +  +AG G  GY+ DG E   A+ 
Sbjct: 56  NPASSALGPDGS-LFIADSNNHRIRQV-FP---NGTITTIAGTGFSGYNGDGIEAIRAQL 110

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
             P S  V+  G +  +D  N+ IRKI TN  + TI G G       +G A+NA+ +  +
Sbjct: 111 KNPVSVVVNSMGEVLFSDNGNNRIRKILTNGTIITIVGTGVDSFSGDNGLARNAAINYPY 170

Query: 163 ELTFVPHICALLISDHGNQLIRQI 186
            LT       LL +D  N  IRQ+
Sbjct: 171 GLTLNSKE-ELLFTDVNNNRIRQV 193



 Score = 42.4 bits (98), Expect = 0.20,   Method: Composition-based stats.
 Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 4/106 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
           +  +AG GV     G    A    P S A+   G++++AD +NH IR++  N  +TTIAG
Sbjct: 35  ITTIAGGGV--CDGGLAIKASLANPASSALGPDGSLFIADSNNHRIRQVFPNGTITTIAG 92

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G          A  A   N   +  V  +  +L SD+GN  IR+I
Sbjct: 93  TGFSGYNGDGIEAIRAQLKNPVSVV-VNSMGEVLFSDNGNNRIRKI 137



 Score = 41.2 bits (95), Expect = 0.51,   Method: Composition-based stats.
 Identities = 31/94 (32%), Positives = 47/94 (50%), Gaps = 11/94 (11%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNA 156
           P + + + P+S A D   N+Y+AD  N+ IRK+ TN  + T+AG G+         A NA
Sbjct: 550 PNNTKLNYPQSIAFDSSNNMYIADTYNNRIRKMFTNGTIITVAGTGTLGYNGDGIEATNA 609

Query: 157 SFSNDFELTFVPHIC----ALLISDHGNQLIRQI 186
                 +L +   I      LLI+D  N  IR++
Sbjct: 610 ------QLKYPQGIAIDGDELLIADSYNNRIRRV 637



 Score = 38.9 bits (89), Expect = 2.6,   Method: Composition-based stats.
 Identities = 24/60 (40%), Positives = 35/60 (58%), Gaps = 3/60 (5%)

Query: 85  LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG G+ GY+ DG P + A+ + P S  VD    +Y+ D  NH IR I   G ++T+ GG
Sbjct: 480 IAGVGLGGYNGDGIPATNAKLNSPVSVTVDSNDLVYITDTYNHRIRLILPSGNISTVIGG 539



 Score = 37.7 bits (86), Expect = 5.8,   Method: Composition-based stats.
 Identities = 28/88 (31%), Positives = 44/88 (50%), Gaps = 4/88 (4%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQN-ASFS 159
           A+ + P   AV   G +Y +D SN + +  TN  +TT AG  +   G  D    N A+ +
Sbjct: 276 AKINSPYGVAVSSLGEVYFSD-SNRIRKVSTNGTITTFAG--TSLYGFTDNTTVNLATLA 332

Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
           +   + F  +   L I+D GN  IR+I+
Sbjct: 333 SPELMNFAKNTSDLYIADKGNHRIRKIS 360


>gi|197120448|ref|YP_002132399.1| fibronectin type III domain-containing protein [Anaeromyxobacter
           sp. K]
 gi|196170297|gb|ACG71270.1| Fibronectin type III domain protein [Anaeromyxobacter sp. K]
          Length = 460

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/112 (37%), Positives = 57/112 (50%), Gaps = 8/112 (7%)

Query: 80  SVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGV 135
           + V  LAG   Q G +DG   SA F  P+  A D    +YVAD  N+ IR +   T + V
Sbjct: 244 AAVTTLAGSTTQAGAADGPGASATFRWPRGVATDGT-TLYVADTMNNAIRALDLATGV-V 301

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +T+AG  +   G ADG    A FS  + L    H   LL++D GN  IR ++
Sbjct: 302 STLAGDPASYAGYADGRGTAARFSAPYGLAM--HGGELLVADSGNSAIRAVD 351



 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 57/112 (50%), Gaps = 8/112 (7%)

Query: 82  VKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTT 137
           V  LAGD     G+ DG   +A F+ P+    D    +YVAD +N+++R+I      VTT
Sbjct: 190 VTTLAGDPRTYAGHQDGIGTAASFNSPRGITSDGT-YLYVADTANNLVRRIDLATAAVTT 248

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           +AG  + + G ADGP  +A+F   +          L ++D  N  IR ++L 
Sbjct: 249 LAGS-TTQAGAADGPGASATFR--WPRGVATDGTTLYVADTMNNAIRALDLA 297


>gi|422003150|ref|ZP_16350382.1| hypothetical protein LSS_06614 [Leptospira santarosai serovar
           Shermani str. LT 821]
 gi|417258114|gb|EKT87507.1| hypothetical protein LSS_06614 [Leptospira santarosai serovar
           Shermani str. LT 821]
          Length = 356

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI  N G V+T++GG     
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TL 217

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   +T+     +LL++D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274



 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 78/203 (38%), Gaps = 48/203 (23%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I LD      Y     ++   VV   AG G++   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
           P    VD  GNI+V+D+  ++IRKI   G VTT+                          
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTSLLLEDPSGIKFDPITGDKYVSCK 138

Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
                            AG  S  +G  +G   N+SF   F +  +     L + + GN 
Sbjct: 139 DSAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197

Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
            IR+INL     S S+ SG  LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGTLG 218



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 6/81 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G  GY DG+  SA+F  P     D K + + VAD  +H IRKI   N  V+T 
Sbjct: 209 VSTLSG-GTLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267

Query: 139 AGGGSKKEGRADGPAQNASFS 159
            G G   +   DG   NAS +
Sbjct: 268 VGNGIGAD--IDGNGLNASLN 286


>gi|326778226|ref|ZP_08237491.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Streptomyces griseus XylebKG-1]
 gi|326658559|gb|EGE43405.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Streptomyces griseus XylebKG-1]
          Length = 615

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 58/112 (51%), Gaps = 7/112 (6%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           L+V D++R     L    ++   V+R  G G +G  DG P  ARF +P+  AV   G + 
Sbjct: 198 LLVSDTTRHRLVELD---ADGETVRRHFGTGERGLRDGGPDEARFSEPQGLAVLPDGRVA 254

Query: 119 VADKSNHVIRKIT-NLGVT-TIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
           VAD  NH IR +    GVT T+AG G +  +     GPA+    S+ +++ +
Sbjct: 255 VADTVNHAIRALDLTTGVTSTLAGTGRQWWQGTPTSGPAREVDLSSPWDVAW 306


>gi|29348305|ref|NP_811808.1| hypothetical protein BT_2896 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383121762|ref|ZP_09942466.1| hypothetical protein BSIG_1754 [Bacteroides sp. 1_1_6]
 gi|29340208|gb|AAO78002.1| NHL repeat protein [Bacteroides thetaiotaomicron VPI-5482]
 gi|251841348|gb|EES69429.1| hypothetical protein BSIG_1754 [Bacteroides sp. 1_1_6]
          Length = 452

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 1/103 (0%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
           R AG+GV G  DG+  S+R D+P   AVD +GN+ V       +RKIT  G  +   G  
Sbjct: 348 RFAGNGVWGIIDGQGVSSRMDQPNQIAVDAEGNLLVTTVYGRTVRKITPEGYVSTYAGIG 407

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            + G  DG A  A F+  + +  +     + + D  N  IR I
Sbjct: 408 YQTGYVDGLAAEAKFNKPYGIA-IDAQGNVYVGDCENWRIRVI 449



 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 52/106 (49%), Gaps = 13/106 (12%)

Query: 31  GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF-----PLSEESVVKRL 85
           G  V  +IDG  +     S +D+P   + I +D+  +   T  +      ++ E  V   
Sbjct: 351 GNGVWGIIDGQGVS----SRMDQP---NQIAVDAEGNLLVTTVYGRTVRKITPEGYVSTY 403

Query: 86  AGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           AG G Q GY DG    A+F+KP   A+D +GN+YV D  N  IR I
Sbjct: 404 AGIGYQTGYVDGLAAEAKFNKPYGIAIDAQGNVYVGDCENWRIRVI 449


>gi|410448598|ref|ZP_11302672.1| hypothetical protein LEP1GSC068_3151 [Leptospira sp. Fiocruz
           LV3954]
 gi|410017668|gb|EKO79726.1| hypothetical protein LEP1GSC068_3151 [Leptospira sp. Fiocruz
           LV3954]
          Length = 356

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI  N G V+T++GG     
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TL 217

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   +T+     +LL++D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274



 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 78/203 (38%), Gaps = 48/203 (23%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I LD      Y     ++   VV   AG G++   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
           P    VD  GNI+V+D+  ++IRKI   G VTT+                          
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTSLLLEDPSGIKFDPITGDKYVSCK 138

Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
                            AG  S  +G  +G   N+SF   F +  +     L + + GN 
Sbjct: 139 DSAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197

Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
            IR+INL     S S+ SG  LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGTLG 218



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 40/81 (49%), Gaps = 6/81 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G  GY DG+  SA+F  P     D K + + VAD  +H IRKI   N  V+T 
Sbjct: 209 VSTLSG-GTLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267

Query: 139 AGGGSKKEGRADGPAQNASFS 159
            G G       DG   NAS +
Sbjct: 268 VGNGIG--ANIDGNGLNASLN 286


>gi|359685014|ref|ZP_09255015.1| hypothetical protein Lsan2_10264 [Leptospira santarosai str.
           2000030832]
          Length = 356

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI  N G V+T++GG     
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TL 217

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   +T+     +LL++D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 56/203 (27%), Positives = 78/203 (38%), Gaps = 48/203 (23%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I LD      Y     ++   VV   AG G+Q   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTI-LDCILKECYLCKLKVTNNPVVSLFAGTGIQESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
           P    VD  GNI+V+D+  ++IRKI   G VTT+                          
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTSLLLEDPSGIKFDPITGDKYVSCK 138

Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
                            AG  S  +G  +G   N+SF   F +  +     L + + GN 
Sbjct: 139 DSAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197

Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
            IR+INL     S S+ SG  LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGTLG 218



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 6/81 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G  GY DG+  SA+F  P     D K + + VAD  +H IRKI   N  V+T 
Sbjct: 209 VSTLSG-GTLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267

Query: 139 AGGGSKKEGRADGPAQNASFS 159
            G G   +   DG   NAS +
Sbjct: 268 VGNGIGAD--IDGNGLNASLN 286


>gi|398331208|ref|ZP_10515913.1| hypothetical protein LalesM3_04029 [Leptospira alexanderi serovar
           Manhao 3 str. L 60]
          Length = 357

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
           ++G+ +G+  ++ F  P    +D + N+YV +  NH IRKI  N G V+T++GG S   G
Sbjct: 163 IEGFQNGDRLNSLFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGTVSTLSGGVS---G 219

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
             DG   +A F +   +T+     +LL++D  N  IR+I+LK    + S+  G+ +GA
Sbjct: 220 YLDGDLASAQFKSPLGITYDQKTDSLLVADLQNHRIRKIDLKTS--TVSTFLGNGIGA 275



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 2/94 (2%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R    + I LD      Y     ++   V+   AG G+    DG   +A F+ 
Sbjct: 22  NPSIFMSRAWVENTI-LDCILKECYLCKLKVTNNPVISLFAGTGINVSIDGTTQTASFNT 80

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           P    +D  GNI+V+D+S ++IRKI   G +TT+
Sbjct: 81  PFGLELDTFGNIFVSDQSANLIRKIDRFGNITTL 114



 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 57/108 (52%), Gaps = 5/108 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAG 140
           V  L+G GV GY DG+  SA+F  P     D K + + VAD  NH IRKI +L  +T++ 
Sbjct: 210 VSTLSG-GVSGYLDGDLASAQFKSPLGITYDQKTDSLLVADLQNHRIRKI-DLKTSTVST 267

Query: 141 G-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             G+      DG   NASF+    ++F  +   + +SD  +  IR ++
Sbjct: 268 FLGNGIGASIDGKGLNASFNGPAFISF-DNSGYMFVSDTNSNKIRIVD 314


>gi|418754100|ref|ZP_13310334.1| hypothetical protein LEP1GSC179_2637 [Leptospira santarosai str.
           MOR084]
 gi|409965522|gb|EKO33385.1| hypothetical protein LEP1GSC179_2637 [Leptospira santarosai str.
           MOR084]
          Length = 356

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI  N G V+T++GG     
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TL 217

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   +T+     +LL++D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 78/203 (38%), Gaps = 48/203 (23%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I+ D      Y     ++   VV   AG G++   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTII-DCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
           P    VD  GNI+V+D+  ++IRKI   G VTT+                          
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTSLLLEDPSGIKFDPITGDKYVSCK 138

Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
                            AG  S  +G  +G   N+SF   F +  +     L + + GN 
Sbjct: 139 DSAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197

Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
            IR+INL     S S+ SG  LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGTLG 218



 Score = 40.0 bits (92), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 6/81 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G  GY DG+  SA+F  P     D K + + VAD  +H IRKI   N  V+T 
Sbjct: 209 VSTLSG-GTLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267

Query: 139 AGGGSKKEGRADGPAQNASFS 159
            G G   +   DG   NAS +
Sbjct: 268 VGNGIGAD--IDGNGLNASLN 286


>gi|332668794|ref|YP_004451801.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Cellulomonas fimi ATCC 484]
 gi|332337831|gb|AEE44414.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Cellulomonas fimi ATCC 484]
          Length = 639

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/163 (26%), Positives = 79/163 (48%), Gaps = 26/163 (15%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG + L+V D+       L+     E++V+R+ G G +G+ DG P  ARF +P
Sbjct: 181 PAKAVALPGGT-LLVADAGHHGLAELA--ADGETLVRRI-GSGERGFVDGGPDEARFSEP 236

Query: 107 KSFAVDMKG-------NIYVADKSNHVIR--KITNLGVTTIAGGGSK-KEGRADG----- 151
              A+   G       ++ VAD  NH +R  ++++  VTT+AG G +   G AD      
Sbjct: 237 NGLALVPDGLRATLGYDVLVADTVNHALRGVRLSDGSVTTVAGTGEQLMVGAADNVRGGG 296

Query: 152 -------PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
                  PA++   S+ +++ + P + A +++  GN  +   +
Sbjct: 297 TGAGYDGPARDVRLSSPWDVAWSPVLGAFVVAMAGNHTLWTFD 339



 Score = 38.1 bits (87), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 48/106 (45%), Gaps = 3/106 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
           V+ +AG   +G  DG    A F +P   +V   G ++VAD     +R +   +  V T  
Sbjct: 343 VRHVAGTMNEGLLDGPGEGAWFAQPSGLSVGPDGRVWVADSETSALRWLDPADGTVGTAV 402

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           G G    G  DG A  A   +   +  +P   ++L++D  N  +R+
Sbjct: 403 GEGLFDFGHRDGRADEARMQHPLGVAALPD-GSVLVADTYNGAVRR 447


>gi|32472743|ref|NP_865737.1| hypothetical protein RB3815 [Rhodopirellula baltica SH 1]
 gi|32443980|emb|CAD73422.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
          Length = 351

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 54/102 (52%), Gaps = 5/102 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-----VTTIAGGGSKKE 146
           GY DG   SARF+K  +  +D +  +Y++D +NH +R++         V T AG G +  
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTTDGEWMVDTYAGQGKEGP 165

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
              +   ++A+F     +T       LLI+D GNQ++R I+L
Sbjct: 166 ATDNVDRRDATFHEPISVTLDAEGNRLLIADIGNQVVRSIDL 207



 Score = 44.7 bits (104), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 4/95 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           + T + G   ++     +D  G+  L+VL+ + +    +         +  LAG G +G 
Sbjct: 211 LVTTLAGRTSKLKDPRAVDLDGNRRLLVLERNGNRLRRVK----SNGDITTLAGSGKKGK 266

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +DG+   A F+ PK   V   G +Y+AD  NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 48/78 (61%), Gaps = 6/78 (7%)

Query: 110 AVDMKGN--IYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
           AVD+ GN  + V +++ + +R++ + G +TT+AG G  K+G+ADG A+ ASF+    +  
Sbjct: 227 AVDLDGNRRLLVLERNGNRLRRVKSNGDITTLAGSG--KKGKADGDAKQASFNGPKHMDV 284

Query: 167 VPHICALLISDHGNQLIR 184
            P    + I+D  N L+R
Sbjct: 285 APD-GRVYIADDVNHLVR 301


>gi|418747486|ref|ZP_13303786.1| hypothetical protein LEP1GSC163_2583 [Leptospira santarosai str.
           CBC379]
 gi|410791609|gb|EKR89564.1| hypothetical protein LEP1GSC163_2583 [Leptospira santarosai str.
           CBC379]
          Length = 356

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI  N G V+T++GG     
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TL 217

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   +T+     +LL++D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 78/203 (38%), Gaps = 48/203 (23%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I+ D      Y     ++   VV   AG G++   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTII-DCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
           P    VD  GNI+V+D+  ++IRKI   G VTT+                          
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTNLLLEDPSGIKFDPITGDKYVSCK 138

Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
                            AG  S  +G  +G   N+SF   F +  +     L + + GN 
Sbjct: 139 DSAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197

Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
            IR+INL     S S+ SG  LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGTLG 218



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 6/81 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G  GY DG+  SA+F  P     D K + + VAD  +H IRKI   N  V+T 
Sbjct: 209 VSTLSG-GTLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267

Query: 139 AGGGSKKEGRADGPAQNASFS 159
            G G   +   DG   NAS +
Sbjct: 268 VGNGIGAD--IDGNGLNASLN 286


>gi|290985345|ref|XP_002675386.1| predicted protein [Naegleria gruberi]
 gi|284088982|gb|EFC42642.1| predicted protein [Naegleria gruberi]
          Length = 788

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/109 (35%), Positives = 56/109 (51%), Gaps = 4/109 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           ++  +AG G +G+S DG P + A+ + P    V     +Y+ D  N  IRKI   G + T
Sbjct: 241 IINTIAGTGEEGFSGDGGPATNAQLNVPCGVFVSATNEVYIVDSGNCRIRKILRNGIINT 300

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           IAG G +      GPA NA  SN + + FV     + I+D  N  IR+I
Sbjct: 301 IAGTGEEGYSGDGGPAINAQISNVYNI-FVSQNDEVYIADTNNHRIRKI 348



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 6/106 (5%)

Query: 85  LAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           +AG G  G+S G+ G   +A+   P S  V     +Y+AD  N+ +RKI   G + TIAG
Sbjct: 189 IAGTGEDGFS-GDGGLAINAKLGYPCSVFVSKDDEVYIADSQNNRVRKILRNGIINTIAG 247

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G +      GPA NA  +    + FV     + I D GN  IR+I
Sbjct: 248 TGEEGFSGDGGPATNAQLNVPCGV-FVSATNEVYIVDSGNCRIRKI 292



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 47/95 (49%), Gaps = 17/95 (17%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           ++  +AG G +GYS DG P  +A+     +  V     +Y+AD +NH IRKI   G + T
Sbjct: 297 IINTIAGTGEEGYSGDGGPAINAQISNVYNIFVSQNDEVYIADTNNHRIRKILKDGTIET 356

Query: 138 IAGGGSKKEGRADGPAQNASFSND--FELTFVPHI 170
           IAG G K             FS D  F+ +  PHI
Sbjct: 357 IAGNGEK------------GFSGDSPFDFSSHPHI 379



 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 42/88 (47%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
           SA  D P S AV     +Y+AD   + IRKI  N  + TIAG G +      GPA NA  
Sbjct: 94  SAMLDFPTSIAVSSDNEVYIADLGIYRIRKILRNGNIVTIAGTGEEGFSGDGGPAINAQI 153

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           S  + +    +   L   D+ N  IR+I
Sbjct: 154 SAVWNIALSQN-NELYFVDYSNCRIRKI 180



 Score = 38.5 bits (88), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 49/105 (46%), Gaps = 4/105 (3%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG G +G+S DG P  +A+     + A+     +Y  D SN  IRKI   G + TIAG 
Sbjct: 133 IAGTGEEGFSGDGGPAINAQISAVWNIALSQNNELYFVDYSNCRIRKILRNGTIVTIAGT 192

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G        G A NA       + FV     + I+D  N  +R+I
Sbjct: 193 GEDGFSGDGGLAINAKLGYPCSV-FVSKDDEVYIADSQNNRVRKI 236


>gi|227537797|ref|ZP_03967846.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227242411|gb|EEI92426.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 441

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 1/104 (0%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           KR+   G  G+ DG    A F+ PK    D  GN+++AD  NH IR ++  G+ +   G 
Sbjct: 336 KRINAPGGSGFRDGPVADAVFNFPKDIKFDRAGNMFIADYGNHCIRMLSADGIVSTVAGQ 395

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             K G  DG    + F+  + +  V     + I+D  N  IR++
Sbjct: 396 PTKAGYKDGGPVESQFNQPWGVA-VNDQGDIYIADWNNARIRKL 438


>gi|290994196|ref|XP_002679718.1| predicted protein [Naegleria gruberi]
 gi|284093336|gb|EFC46974.1| predicted protein [Naegleria gruberi]
          Length = 762

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 57/105 (54%), Gaps = 4/105 (3%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG+G  G+S  DG   +A+ + P S  V  K  +++AD+ NH IRKI   G + TIAG 
Sbjct: 74  IAGNGKGGFSGDDGLATNAQLNCPSSVFVSNKNEVFIADQYNHRIRKILESGRIVTIAGN 133

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G +     +GPA +A  +    + FV ++  +   D  N  +R+I
Sbjct: 134 GEEGFSGDNGPATSARLNCPMSI-FVSNMNEVYFVDSNNNRVRKI 177



 Score = 45.1 bits (105), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 35/96 (36%), Positives = 46/96 (47%), Gaps = 13/96 (13%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   V  +AG G QG+S DG P + A+   P    V +   IY AD +N+ IRK+  N  
Sbjct: 283 ENGNVVTVAGTGQQGFSGDGGPATRAQLACPTGLFVTLNNEIYFADSANNRIRKVLENGN 342

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
           + TIAG G K     D P         F+ +  PHI
Sbjct: 343 IVTIAGNG-KHGYSGDAP---------FDFSLHPHI 368



 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 41/87 (47%), Gaps = 2/87 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           A+   P    V     IY+AD+ NH +RKI   G + TIAG G       DG A NA   
Sbjct: 36  AKLCYPFGVFVSPTNEIYIADQYNHRVRKILESGRIVTIAGNGKGGFSGDDGLATNAQL- 94

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
           N     FV +   + I+D  N  IR+I
Sbjct: 95  NCPSSVFVSNKNEVFIADQYNHRIRKI 121



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 34/60 (56%), Gaps = 3/60 (5%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG+G +G+S  +G   SAR + P S  V     +Y  D +N+ +RKI   G + TIAG 
Sbjct: 130 IAGNGEEGFSGDNGPATSARLNCPMSIFVSNMNEVYFVDSNNNRVRKILENGIIVTIAGN 189


>gi|290983166|ref|XP_002674300.1| predicted protein [Naegleria gruberi]
 gi|284087889|gb|EFC41556.1| predicted protein [Naegleria gruberi]
          Length = 1435

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 54/174 (31%), Positives = 83/174 (47%), Gaps = 27/174 (15%)

Query: 31  GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL----- 85
           G ++T + +G+ L+    + ++RP       + SS+   Y   F     S++++L     
Sbjct: 140 GTSITGLYNGNNLK-GTLTALNRPIGLFYTEIFSSQGELY---FADESNSLIRKLVIANG 195

Query: 86  -----AGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-T 137
                AG+ V GYS   GE  SA    P S  V   G IY+AD  N VIRK++NLG+  T
Sbjct: 196 TVSNVAGNVVAGYSGDGGEATSASLRYPSSVYVSNSGEIYIADTFNCVIRKVSNLGIIGT 255

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQI 186
           +AG G +      G  Q++  + + +L F   I       L I D GN  IR++
Sbjct: 256 VAGVGGQC-----GYNQDSINATESKLNFPTGITVSDSGDLYIVDKGNHRIRKV 304



 Score = 37.0 bits (84), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 43/167 (25%), Positives = 75/167 (44%), Gaps = 23/167 (13%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARF 103
           P+ +  +P + DL + D+       +    S    + R  G G+ GY +GE     + + 
Sbjct: 450 PYEIAIQPSTQDLFISDTHNHRILRVD---STSGRIYRFMGSGIPGY-NGEGSDLLNLQL 505

Query: 104 DKPK--SFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           + P+  SF+ D+   + V+D  NH IRK+ N   +TI G    + G  + P+      +D
Sbjct: 506 NNPQGLSFSGDI---LIVSDTFNHRIRKVFNSTSSTILG---TQFGGKEPPS--TLVLSD 557

Query: 162 FELTFVPHIC-----ALLISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
             L F P         L ++D  N LIR  ++  +    +SQ+ S +
Sbjct: 558 IMLKF-PRFALLKGTILAVADTKNYLIRDFDISTQSVYGNSQTLSTI 603



 Score = 37.0 bits (84), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 3/91 (3%)

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG-SKKEGRADG-PAQ 154
           EP  A+       A + K  ++ +D  NH I KI N  +  +AG G +  +   DG  A 
Sbjct: 384 EPTIAKLKYTYGVATNSKNEVFFSDAYNHRIVKIENGVLKQVAGKGVAGFDNNIDGLLAT 443

Query: 155 NASFSNDFELTFVPHICALLISD-HGNQLIR 184
           + S +  +E+   P    L ISD H ++++R
Sbjct: 444 DVSLNYPYEIAIQPSTQDLFISDTHNHRILR 474


>gi|290975594|ref|XP_002670527.1| predicted protein [Naegleria gruberi]
 gi|284084087|gb|EFC37783.1| predicted protein [Naegleria gruberi]
          Length = 443

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G   SA+   P    V     +Y++D  N+ IRK+  N  
Sbjct: 39  ENGNIVTIAGNGTAGFSGDNGPATSAQLYNPFGIFVSSNNEVYISDSGNNRIRKVLENGN 98

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+      +GPA +A   N F + FV     + ISD  N +IR+I
Sbjct: 99  IITIAGNGTVGFSGDNGPATSAQLYNPFGI-FVSANNEVYISDSNNNIIRKI 149



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 46/87 (52%), Gaps = 2/87 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           A+ + P+   V     +Y+AD+ N  IRK+   G + TIAG G+      +GPA +A   
Sbjct: 8   AQLNYPRCVFVSSNNEVYIADEGNQRIRKVVENGNIVTIAGNGTAGFSGDNGPATSAQLY 67

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
           N F + FV     + ISD GN  IR++
Sbjct: 68  NPFGI-FVSSNNEVYISDSGNNRIRKV 93



 Score = 44.7 bits (104), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 17/98 (17%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G   SA+   P    V     +Y++D +N++IRKI  N  
Sbjct: 95  ENGNIITIAGNGTVGFSGDNGPATSAQLYNPFGIFVSANNEVYISDSNNNIIRKILENGN 154

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND--FELTFVPHI 170
           + TIAG G++             FS D  F++   PH+
Sbjct: 155 IVTIAGNGTR------------GFSGDSPFDIRMYPHV 180


>gi|300772471|ref|ZP_07082341.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300760774|gb|EFK57600.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 441

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 1/104 (0%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           KR+   G  G+ DG    A F+ PK    D  GN+++AD  NH IR ++  G+ +   G 
Sbjct: 336 KRINAPGGSGFRDGPVADAVFNFPKDIKFDRAGNMFIADYGNHCIRMLSADGIVSTVAGQ 395

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             K G  DG    + F+  + +  V     + I+D  N  IR++
Sbjct: 396 PTKAGYKDGGPVESQFNQPWGVA-VNDQGDIYIADWNNARIRKL 438


>gi|290971645|ref|XP_002668597.1| predicted protein [Naegleria gruberi]
 gi|284082068|gb|EFC35853.1| predicted protein [Naegleria gruberi]
          Length = 223

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 64/119 (53%), Gaps = 14/119 (11%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S+  +++ +AG+G +G+S  DG    A+ + P S  V+  G IY+ D +N  IRK+   
Sbjct: 86  VSKSGIIETIAGNGNEGFSGDDGLATQAQLNCPMSTFVNSNGEIYITDSNNFRIRKVQRN 145

Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQI 186
           G+  TIAG G+      DG   + S + + +L +  H+       + ISD GN  IR++
Sbjct: 146 GIIKTIAGNGN------DGFEGDGSSATNAQLNYPMHVFVSSNDEMYISDSGNNAIRKV 198



 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 54/106 (50%), Gaps = 7/106 (6%)

Query: 85  LAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           +AG+G  G S G+ G   +A  + P +  V     +Y+ D +NH IRK++  G + TIAG
Sbjct: 40  IAGNGKLGCS-GDGGLAINAELNCPMNVFV-FNEEVYITDSANHRIRKVSKSGIIETIAG 97

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G++     DG A  A  +     TFV     + I+D  N  IR++
Sbjct: 98  NGNEGFSGDDGLATQAQLNCPMS-TFVNSNGEIYITDSNNFRIRKV 142



 Score = 37.0 bits (84), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 3/75 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +    ++K +AG+G  G+  DG   + A+ + P    V     +Y++D  N+ IRK++  
Sbjct: 142 VQRNGIIKTIAGNGNDGFEGDGSSATNAQLNYPMHVFVSSNDEMYISDSGNNAIRKVSRN 201

Query: 134 G-VTTIAGGGSKKEG 147
           G + T+AG G   +G
Sbjct: 202 GIIETVAGEGPYSQG 216


>gi|386722458|ref|YP_006188784.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus K02]
 gi|384089583|gb|AFH61019.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus K02]
          Length = 537

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 72/159 (45%), Gaps = 18/159 (11%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G+S+G    AR + P   AV  +G + +AD  NH +R +    +TT+AG G +K G  DG
Sbjct: 302 GFSNGAALQARLNYPMGIAVTEEGGLLIADSQNHAVRYLFGGQLTTLAGAGEQKTGLLDG 361

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK-PEDCSKSSQSGSALGAVSVWV 210
               A  +   ++  +     L+     N+L R    + PED      +G AL  V    
Sbjct: 362 MEGKAGLNRPADVAVLGDGSVLVADSFNNRLRRLTGYRLPEDLP----AGDALKVV---- 413

Query: 211 LVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEAL 249
                  L + V+ F A+P I    GRL  P++   EAL
Sbjct: 414 -------LDAGVMKFDAQPEITE--GRLMVPVRAVTEAL 443



 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 55/189 (29%), Positives = 79/189 (41%), Gaps = 21/189 (11%)

Query: 29  EDGYTVTTVIDGH---QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           + GY V  ++DG     L   P  +   P  S L V D+   A   +      +  V  +
Sbjct: 109 DKGYPVGGLLDGAANASLFQEPLGLSAGPDGS-LYVADAGNHAIRRID----AKGNVTTV 163

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
           AG G  G  DG+  +A F +P   A    G +YVAD   H IR+I+  G  T     SK+
Sbjct: 164 AGSGRLGVKDGKGAAAEFYRPGDVAAAPDGTLYVADTLGHTIRRISPQGEVTTLTAPSKR 223

Query: 146 -----EGR-------ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
                 G+       ADGP   A F+    +        L +SD GNQ +R I+L     
Sbjct: 224 VVEATPGQVAAAGDFADGPLAQAKFNEPTGIALDAK-GNLYVSDSGNQRVRYIDLAKGTV 282

Query: 194 SKSSQSGSA 202
           +  +  G+A
Sbjct: 283 TTVAGGGTA 291



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 52/111 (46%), Gaps = 8/111 (7%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
           AG G  G ++G    A F  P   AV   G   V+D  N VIRK+T   V  +AG   +K
Sbjct: 48  AGTGRLGSANGTGLEASFRVPAGLAVLPDGTAAVSDSRNGVIRKLTGGRVDVLAGVFYRK 107

Query: 146 E-------GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           +       G  DG A  + F     L+  P   +L ++D GN  IR+I+ K
Sbjct: 108 DDKGYPVGGLLDGAANASLFQEPLGLSAGPD-GSLYVADAGNHAIRRIDAK 157


>gi|116625892|ref|YP_828048.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116229054|gb|ABJ87763.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 2384

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/131 (32%), Positives = 65/131 (49%), Gaps = 9/131 (6%)

Query: 78  EESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TN 132
           +  ++  +AG GVQG++   G  GSA+   P   A D  GN+Y+AD  N  IR+I   T 
Sbjct: 138 QTGMISTVAGTGVQGFAGDGGLAGSAQLSHPAGIAFDSAGNLYIADMGNFRIRRIDGQTG 197

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP-HICALLISDHGNQLIRQINLKPE 191
           + +TTIAG GS         A  A  S    +   P     LL+S+ G   IR+++L   
Sbjct: 198 V-ITTIAGDGSNTTSPDGVLAAGAGLSQPIWV--APDRSGGLLVSEMGAMRIRRVDLAGG 254

Query: 192 DCSKSSQSGSA 202
             +  + +G+A
Sbjct: 255 GLTTVAGNGNA 265



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 71/138 (51%), Gaps = 12/138 (8%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVD 112
           S+++ + DS  +    + +P     ++ ++AG+G    S G+ GSA     ++P   A D
Sbjct: 62  SNNVFIADSGNNRVVVVQYP---SGILYQIAGNGTA-TSSGDGGSALQSSVNRPMGLAAD 117

Query: 113 MKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
             GN+Y+++   + IR+I   T + ++T+AG G +      G A +A  S+   + F   
Sbjct: 118 FNGNLYISEFQGNRIRRIDMQTGM-ISTVAGTGVQGFAGDGGLAGSAQLSHPAGIAF-DS 175

Query: 170 ICALLISDHGNQLIRQIN 187
              L I+D GN  IR+I+
Sbjct: 176 AGNLYIADMGNFRIRRID 193


>gi|196233467|ref|ZP_03132310.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
 gi|196222463|gb|EDY16990.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
          Length = 509

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 10/105 (9%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
            AG+  +G +DG   +A+F  P   A+D  GN+ VAD++N  +R+I+  G T    G + 
Sbjct: 205 FAGE-TEGAADGAGVTAQFKNPSGLAIDEHGNVDVADRTNQTLRRISPSGETETIAGTAG 263

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           + G ADG  + A F       + P    L I +HGN  +   + K
Sbjct: 264 EHGTADGAGKAARF-------WYPQ--GLAIDEHGNVYVADTSAK 299



 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/98 (38%), Positives = 53/98 (54%), Gaps = 7/98 (7%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           NP  + ID  G+ D  V D +      +S P  E   +   AG+   G +DG   +ARF 
Sbjct: 224 NPSGLAIDEHGNVD--VADRTNQTLRRIS-PSGETETIAGTAGE--HGTADGAGKAARFW 278

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGG 141
            P+  A+D  GN+YVAD S   +RK+T N  V+T+AGG
Sbjct: 279 YPQGLAIDEHGNVYVADTSAKSVRKVTPNGQVSTLAGG 316



 Score = 45.4 bits (106), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 60/132 (45%), Gaps = 12/132 (9%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           D+ V DSS      ++     +  V  LAG  G  G  +G    ARF  P   A+   G 
Sbjct: 330 DVFVTDSSNHVIQKIA----HDGKVSVLAGKPGESGAKNGHGEEARFHFPNGIAIGADGA 385

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA--LL 174
           +YV + S+  IRKI   G  T   G  ++EGR DG    A F   + LT +       L 
Sbjct: 386 LYVVEHSS--IRKIDGKGNVTTFAGSLEEEGRNDGTGAAARF---WGLTAIAADARGNLY 440

Query: 175 ISDHGNQLIRQI 186
           ++DH  + +R+I
Sbjct: 441 VADHELKNLRRI 452



 Score = 37.7 bits (86), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 22/68 (32%), Positives = 35/68 (51%), Gaps = 10/68 (14%)

Query: 78  EESVVKRLAGDG----------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           E S ++++ G G           +G +DG   +ARF    + A D +GN+YVAD     +
Sbjct: 390 EHSSIRKIDGKGNVTTFAGSLEEEGRNDGTGAAARFWGLTAIAADARGNLYVADHELKNL 449

Query: 128 RKITNLGV 135
           R+I+  GV
Sbjct: 450 RRISRSGV 457


>gi|421614357|ref|ZP_16055418.1| NHL repeat containing protein [Rhodopirellula baltica SH28]
 gi|408494877|gb|EKJ99474.1| NHL repeat containing protein [Rhodopirellula baltica SH28]
          Length = 351

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 5/102 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-----VTTIAGGGSKKE 146
           GY DG   SARF+K  +  +D +  +Y++D +NH +R++         V T AG G +  
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHCVRRLIQTSDGEWMVDTYAGQGEEGP 165

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
              +   ++A+F     +T       LLI+D GNQ++R I+ 
Sbjct: 166 ATDNVDRRDATFHEPISVTLDAEGNRLLIADIGNQVVRSIDF 207



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 47/95 (49%), Gaps = 4/95 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           + T + G + ++     +D  G+  L+VL+ + +    L         +  LAG G +G 
Sbjct: 211 LVTTLAGRKSKLKDPRAVDLDGNGRLLVLERNGNRLRRLE----SNGDITTLAGSGKKGT 266

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +DG+   A F+ PK   V   G +Y+AD  NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 49/85 (57%), Gaps = 4/85 (4%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           ++   P++  +D  G + V +++ + +R++ +N  +TT+AG G  K+G ADG A+ ASF+
Sbjct: 220 SKLKDPRAVDLDGNGRLLVLERNGNRLRRLESNGDITTLAGSG--KKGTADGDAKQASFN 277

Query: 160 NDFELTFVPHICALLISDHGNQLIR 184
               +   P    + I+D  N L+R
Sbjct: 278 GPKHMDVAPD-GRVYIADDVNHLVR 301


>gi|158315874|ref|YP_001508382.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158111279|gb|ABW13476.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 772

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/127 (37%), Positives = 65/127 (51%), Gaps = 15/127 (11%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +     +  +AG G  G+S DG P + A+ D+P S AV   G +YVAD  NH +R+I   
Sbjct: 436 VDRAGTITTVAGTGADGFSGDGGPATEAQLDEPTSVAVAPDGTLYVADTRNHRVRRIGRD 495

Query: 134 G-VTTIAG-------GGSKKEGRA---DG-PAQNASFSNDFELTFVPHICALLISDHGNQ 181
           G +TTIAG       G   ++G A   DG PA NA   N      +    +LLI+D  N 
Sbjct: 496 GIITTIAGQDEFGFAGEVSEDGLAYSGDGLPAVNAKL-NYPNTVLMETDGSLLIADGENN 554

Query: 182 LIRQINL 188
            +R+I L
Sbjct: 555 RVRRIGL 561



 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 61/125 (48%), Gaps = 5/125 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKS-FAVDMKGNIYVADKSNHVIRKITN 132
           ++ +  +  LAG G  GYS DG P   A+ +   +   VD  GN+Y++D  ++ +R+I  
Sbjct: 615 IAGDGTISTLAGTGKTGYSGDGGPADQAQINAVGADLVVDAAGNVYLSDPGSNRVRRIAP 674

Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
            G +TTIAG G  K     GPA  A       L     +  L I+D  +  +R + L P 
Sbjct: 675 DGTITTIAGTGVSKYSGNGGPATAAELVYPGGLAL-DQLGNLYIADGIDSRVRAVRLPPG 733

Query: 192 DCSKS 196
            C+ +
Sbjct: 734 SCAAA 738



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 1/88 (1%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           SARF  P + A    G++YVAD+ NH +R+I   G ++T+AG G        GPA  A  
Sbjct: 585 SARFSYPSALARGPDGSLYVADQDNHRVRRIAGDGTISTLAGTGKTGYSGDGGPADQAQI 644

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           +       V     + +SD G+  +R+I
Sbjct: 645 NAVGADLVVDAAGNVYLSDPGSNRVRRI 672



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 56/101 (55%), Gaps = 3/101 (2%)

Query: 88  DGVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKK 145
           DG+    DG P  +A+ + P +  ++  G++ +AD  N+ +R+I   G +TTIAG G++ 
Sbjct: 516 DGLAYSGDGLPAVNAKLNYPNTVLMETDGSLLIADGENNRVRRIGLDGIITTIAGTGAEG 575

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G   GPA +A FS    L   P   +L ++D  N  +R+I
Sbjct: 576 FGGDGGPATSARFSYPSALARGPD-GSLYVADQDNHRVRRI 615


>gi|302552781|ref|ZP_07305123.1| NHL repeat containing protein [Streptomyces viridochromogenes DSM
           40736]
 gi|302470399|gb|EFL33492.1| NHL repeat containing protein [Streptomyces viridochromogenes DSM
           40736]
          Length = 605

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 71/139 (51%), Gaps = 14/139 (10%)

Query: 39  DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
           DG  +   P   + R PG      S + +V D++R     L+     ESVV+R+ G G +
Sbjct: 165 DGPYVAPEPEPTVLRFPGKALLLPSGNFLVSDTTRHQLVELAG--DGESVVRRI-GSGAR 221

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSK--KEG 147
           G++DG    A F++P+   +   G++ VAD  NH +R+  +    VTT+AG G +  +  
Sbjct: 222 GFADGSADEAAFNEPQGLTLLDDGSVVVADTVNHALRRLDLATGEVTTLAGTGRQWWQGS 281

Query: 148 RADGPAQNASFSNDFELTF 166
              GPA+    S+ +++ +
Sbjct: 282 PTSGPAREVDLSSPWDVAW 300



 Score = 38.9 bits (89), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 30/114 (26%), Positives = 55/114 (48%), Gaps = 3/114 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           GDG     + EP   RF  P    +   GN  V+D + H + ++   G + +   GS   
Sbjct: 164 GDGPYVAPEPEPTVLRF--PGKALLLPSGNFLVSDTTRHQLVELAGDGESVVRRIGSGAR 221

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
           G ADG A  A+F+    LT +    +++++D  N  +R+++L   + +  + +G
Sbjct: 222 GFADGSADEAAFNEPQGLTLLDD-GSVVVADTVNHALRRLDLATGEVTTLAGTG 274


>gi|290995436|ref|XP_002680301.1| predicted protein [Naegleria gruberi]
 gi|284093921|gb|EFC47557.1| predicted protein [Naegleria gruberi]
          Length = 699

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 59/112 (52%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G+ G+S  +G   +A+   P S  V     +Y+AD+ NH IRKI  N  
Sbjct: 95  ENGNIITIAGNGIHGFSGDNGLATNAQLYTPCSVFVSSNNEVYIADQGNHRIRKILENGN 154

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G       +G A NA  ++ + + FV     + I+D+ N  IR+I
Sbjct: 155 IVTIAGNGIHGFSGDNGLATNAQLNSSYSV-FVSSNNEVYIADYFNNRIRKI 205



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G +G S  +G   +A+ ++P S  V     +Y+AD+ N+ IRKI  N  
Sbjct: 39  ENGTIVTIAGNGTKGSSGDNGLATNAQLNRPYSVFVSSNNEVYIADQGNNRIRKILENGN 98

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G       +G A NA       + FV     + I+D GN  IR+I
Sbjct: 99  IITIAGNGIHGFSGDNGLATNAQLYTPCSV-FVSSNNEVYIADQGNHRIRKI 149



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 48/88 (54%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           SA+ ++P +  V     +Y+AD  N+ IRKI   G + TIAG G+K     +G A NA  
Sbjct: 7   SAQLNRPNNVFVSSNNEVYIADCFNNRIRKILENGTIVTIAGNGTKGSSGDNGLATNAQL 66

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           +  + + FV     + I+D GN  IR+I
Sbjct: 67  NRPYSV-FVSSNNEVYIADQGNNRIRKI 93



 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 56/109 (51%), Gaps = 17/109 (15%)

Query: 77  SEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNL 133
           +E   +  +AG+G+ G++  +G   +AR + P S  V     +Y+AD  N+ IRKI  N 
Sbjct: 226 NENGNIITIAGNGIHGFNGDNGLATNARLNHPFSVFVSSNNEVYIADYYNNRIRKILENG 285

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSND--FELTFVPHICALLISDHGN 180
            + TIAG G+            A FS D  F++   PHI   L++ +GN
Sbjct: 286 NIITIAGNGT------------AGFSGDSPFDIRTYPHIGNKLLTGNGN 322



 Score = 43.5 bits (101), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 58/132 (43%), Gaps = 24/132 (18%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI----- 130
           E   +  +AG+G+ G+S  +G   +A+ +   S  V     +Y+AD  N+ IRKI     
Sbjct: 151 ENGNIVTIAGNGIHGFSGDNGLATNAQLNSSYSVFVSSNNEVYIADYFNNRIRKILENGN 210

Query: 131 ----------------TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
                            N  + TIAG G       +G A NA  ++ F + FV     + 
Sbjct: 211 IITIAGNGTHGFNGDNENGNIITIAGNGIHGFNGDNGLATNARLNHPFSV-FVSSNNEVY 269

Query: 175 ISDHGNQLIRQI 186
           I+D+ N  IR+I
Sbjct: 270 IADYYNNRIRKI 281


>gi|302036371|ref|YP_003796693.1| hypothetical protein NIDE1006 [Candidatus Nitrospira defluvii]
 gi|300604435|emb|CBK40767.1| conserved protein of unknown function, contains NHL repeats
           [Candidatus Nitrospira defluvii]
          Length = 404

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 51/97 (52%), Gaps = 5/97 (5%)

Query: 95  DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADG 151
           DG P   AR + P + AVD  GN+Y+AD  NH +RK+      +T +AG G  +     G
Sbjct: 137 DGGPARRARLNFPSAVAVDRAGNLYIADTMNHRVRKVDGATGIITNVAGTGQARYSGDGG 196

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           PA +A+ +    L       AL I+D  N  +R+++L
Sbjct: 197 PAVSAAINEPTGLAVSDE--ALYIADQSNNRVRRVDL 231



 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 39/72 (54%), Gaps = 7/72 (9%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKEGRADGPAQNAS 157
            ++PK   VD KGN+Y+AD  NHV+R++      +TT+AG    G+      + PA  A 
Sbjct: 34  LNEPKGLCVDRKGNLYIADSENHVVRRVDRATGIITTVAGVCPSGTTGPAVPEPPA--AE 91

Query: 158 FSNDFELTFVPH 169
            +ND E  F  H
Sbjct: 92  ETNDDEDPFADH 103


>gi|290978674|ref|XP_002672060.1| predicted protein [Naegleria gruberi]
 gi|284085634|gb|EFC39316.1| predicted protein [Naegleria gruberi]
          Length = 1219

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 46/204 (22%), Positives = 86/204 (42%), Gaps = 18/204 (8%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEIN------PHSVIDRPGSSDLIVLDSSRSAFYTLSF 74
           ++++G +L    T T   +G  +E        PH V     + ++ + D+  +    +  
Sbjct: 388 ILTNGTILTIAGTGTKGYNGDGIEATSAQLNTPHDVALNLATGEIYIADTENNRIRKIL- 446

Query: 75  PLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVD-MKGNIYVADKSNHVIRKIT 131
                  +  +AG G  GY+ DG     A  ++P    +D   G ++ +D  N  IR+++
Sbjct: 447 ---TNGTITTIAGTGDYGYNGDGIMAVDAWLNEPSGVEIDSTSGEVFFSDTENFRIRRVS 503

Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP- 190
           N G+ T   G  K +   DG A + + +   E+ +VP    +  +D GN  IR+  +   
Sbjct: 504 NSGIITTIAGTGKSKFNGDGMATDTNLATPTEIQYVPSTTEIYFADSGNNRIRKFTINGM 563

Query: 191 ----EDCSKSSQSGSALGAVSVWV 210
                  S S  +G  + A   W+
Sbjct: 564 MTTIAGTSTSGYNGDNMPATRAWL 587



 Score = 50.8 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 38/116 (32%), Positives = 58/116 (50%), Gaps = 16/116 (13%)

Query: 82  VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITNLG-VTT 137
           +  +AG G+ GY+  D    SA+ + P    +D   G I+++D  NH IRKI   G +TT
Sbjct: 114 ITTIAGTGIAGYNGDDKSADSAQLNTPSGIVIDPNNGEIFISDSKNHRIRKILQNGKITT 173

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQINL 188
           IAG G   E   +G    A F+      ++P+        L I+D  N  IR+++L
Sbjct: 174 IAGTG---EAGYNGDGIEAKFAK----LYLPNGIDLYEKELFIADQNNHRIRKVSL 222



 Score = 48.1 bits (113), Expect = 0.004,   Method: Composition-based stats.
 Identities = 46/168 (27%), Positives = 69/168 (41%), Gaps = 8/168 (4%)

Query: 27  LLEDGYTVTTVIDGHQLEIN----PHSVIDRPGSSDLIVLDS---SRSAFYTLSFPLSEE 79
           +LE G  VT    G     N      S ++ P S  L  L     S S  Y +    +E 
Sbjct: 278 ILESGIIVTIAGTGEAGFDNILNATQSKVNHPNSVTLNELGEVFISDSQNYRVRKITTES 337

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
            ++  + G+G + Y      +     P+    +  G  + AD SNH IRKI TN  + TI
Sbjct: 338 GIITTIVGNGFEKYCTDLASNTPLFYPRGIISNENGEFFYADSSNHCIRKILTNGTILTI 397

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG G+K        A +A  +   ++        + I+D  N  IR+I
Sbjct: 398 AGTGTKGYNGDGIEATSAQLNTPHDVALNLATGEIYIADTENNRIRKI 445



 Score = 42.7 bits (99), Expect = 0.15,   Method: Composition-based stats.
 Identities = 44/155 (28%), Positives = 69/155 (44%), Gaps = 12/155 (7%)

Query: 39  DGHQLEIN---PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS- 94
           DG   + N   P  +   P ++++   DS  +     +       ++  +AG    GY+ 
Sbjct: 522 DGMATDTNLATPTEIQYVPSTTEIYFADSGNNRIRKFTI----NGMMTTIAGTSTSGYNG 577

Query: 95  DGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADG 151
           D  P + A  + P     D K N +Y+AD SNH IRKI TN  +TTIAG G       + 
Sbjct: 578 DNMPATRAWLNFPVGVTYDPKTNQVYIADLSNHRIRKILTNGTITTIAGTGKGGYNGNNL 637

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            A +   +    +   P+   ++ SD  N +IR I
Sbjct: 638 TALSTQVNGPHGMILQPN-GDVIFSDSTNNVIRII 671



 Score = 40.0 bits (92), Expect = 0.95,   Method: Composition-based stats.
 Identities = 33/112 (29%), Positives = 52/112 (46%), Gaps = 5/112 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           ++ E +++ LAG   QG  DG   +     P   A  +   + +AD +NH IRK+ TN  
Sbjct: 1   MTSEQIIRNLAG---QGICDGHLSTNAPVNPSDCAF-LDEEMLIADATNHRIRKVFTNGT 56

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G          A +A  +N   +    +   + I+D  N  IR+I
Sbjct: 57  IVTIAGNGFAGYNGDGLDATSAQLNNPVGIYVDTNSREVYIADSNNHRIRKI 108


>gi|116620348|ref|YP_822504.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116223510|gb|ABJ82219.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 915

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/114 (31%), Positives = 58/114 (50%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ + V+  + G+G  G S  +G   SAR   P+   VD  GN+Y+ D  N+ +R++T  
Sbjct: 566 VTADGVISTVVGNGTVGASGDEGAAASARLASPRGLTVDDNGNLYIGDTGNNRVRQVTAD 625

Query: 134 GVT-TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G+  TIAG G        GPA +A+      L F+    AL  +D  N  +R++
Sbjct: 626 GIMHTIAGTGPAGFAGDGGPAADAALDGPAGL-FLDGSGALYFADSNNNRVRRL 678



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 47/139 (33%), Positives = 73/139 (52%), Gaps = 10/139 (7%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKS 108
           +DR G+  L ++D+S          L+ +  V+ +AG+G  GY+     +  A+   P +
Sbjct: 491 LDRGGT--LYIVDTSNHRV----LRLAADGTVQTVAGNGSGGYAGDGGSARFAQLRVPGA 544

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
            A D KGN+Y+AD  NH IRK+T  GV +T+ G G+      +G A +A  ++   LT V
Sbjct: 545 CAFDGKGNLYIADTGNHSIRKVTADGVISTVVGNGTVGASGDEGAAASARLASPRGLT-V 603

Query: 168 PHICALLISDHGNQLIRQI 186
                L I D GN  +RQ+
Sbjct: 604 DDNGNLYIGDTGNNRVRQV 622



 Score = 50.4 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 5/107 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           +  +AG G  G  DG P +A +F      A D  GN+YVAD +NH +RK++  G VTTIA
Sbjct: 29  IDTVAGSGRIG--DGGPATAAQFSDMSGIAADRLGNLYVADTNNHRVRKVSLGGTVTTIA 86

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G G        G A NA  +  + L  +     + ++D GN+ +R+I
Sbjct: 87  GTGIAGFSGDGGLAVNARLNLPYGLA-LDDFGNIYVADLGNERVRRI 132



 Score = 45.8 bits (107), Expect = 0.021,   Method: Composition-based stats.
 Identities = 29/67 (43%), Positives = 41/67 (61%), Gaps = 5/67 (7%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           V  +AG G+ G+S G+ G   +AR + P   A+D  GNIYVAD  N  +R+I + G + T
Sbjct: 82  VTTIAGTGIAGFS-GDGGLAVNARLNLPYGLALDDFGNIYVADLGNERVRRIGSDGAIVT 140

Query: 138 IAGGGSK 144
           IAG G +
Sbjct: 141 IAGNGRR 147



 Score = 45.1 bits (105), Expect = 0.035,   Method: Composition-based stats.
 Identities = 34/114 (29%), Positives = 58/114 (50%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +  +  +  +AG+G +  S DG  P       P++ A+D KGN+Y+A+   H +R+++  
Sbjct: 132 IGSDGAIVTIAGNGRRASSPDGAGPLDTSLLSPRNIAIDGKGNLYIAEFEGHRVRRLSAD 191

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G + T+AG G    G     +  A  +    L F     AL I+D GN ++R+I
Sbjct: 192 GRLVTVAGTGIAGLGGDGFASVKAQVNYPAGLAF-DRAGALYIADSGNNVVRKI 244



 Score = 42.0 bits (97), Expect = 0.25,   Method: Composition-based stats.
 Identities = 46/141 (32%), Positives = 65/141 (46%), Gaps = 20/141 (14%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVD 112
           G+ +++V D    AF      ++   V++ +AG G  G S DG    SA    PK   +D
Sbjct: 437 GAGNVMVAD----AFNHRVVAINALHVLRPVAGTGTGGVSADGTAALSAALRGPKGVCLD 492

Query: 113 MKGNIYVADKSNH-VIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC 171
             G +Y+ D SNH V+R   +  V T+AG GS       G A+ A          VP  C
Sbjct: 493 RGGTLYIVDTSNHRVLRLAADGTVQTVAGNGSGGYAGDGGSARFAQLR-------VPGAC 545

Query: 172 A------LLISDHGNQLIRQI 186
           A      L I+D GN  IR++
Sbjct: 546 AFDGKGNLYIADTGNHSIRKV 566



 Score = 41.2 bits (95), Expect = 0.48,   Method: Composition-based stats.
 Identities = 35/110 (31%), Positives = 53/110 (48%), Gaps = 5/110 (4%)

Query: 82  VKRLAGDG-VQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTI 138
           +  +AGDG      DG    SA   +P + A+D KGN+Y+AD     +R++   GV TT+
Sbjct: 347 IDTVAGDGYAHSVGDGAAATSAILHQPSAIALDSKGNLYIADTGTQRVREVLPNGVMTTL 406

Query: 139 AGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           AG G       D  PA  A  ++   L  V     ++++D  N  +  IN
Sbjct: 407 AGTGVVGRDTGDAVPAGIAPLNSPMGLA-VDGAGNVMVADAFNHRVVAIN 455



 Score = 40.4 bits (93), Expect = 0.71,   Method: Composition-based stats.
 Identities = 29/76 (38%), Positives = 34/76 (44%), Gaps = 3/76 (3%)

Query: 81  VVKRLAGDGVQGYSDGEP---GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
           V+  LAG GV G   G+    G A  + P   AVD  GN+ VAD  NH +  I  L V  
Sbjct: 402 VMTTLAGTGVVGRDTGDAVPAGIAPLNSPMGLAVDGAGNVMVADAFNHRVVAINALHVLR 461

Query: 138 IAGGGSKKEGRADGPA 153
              G       ADG A
Sbjct: 462 PVAGTGTGGVSADGTA 477


>gi|428177953|gb|EKX46830.1| hypothetical protein GUITHDRAFT_70199, partial [Guillardia theta
           CCMP2712]
          Length = 305

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/165 (31%), Positives = 76/165 (46%), Gaps = 12/165 (7%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPL----SEESVVKRLAGDGVQGYSDGEPGSAR 102
           P  V   P  S L+V DS       ++       S +  V  +AG+G  G +DG   +A+
Sbjct: 41  PSDVALSPDESFLVVADSRNHVIRRINKGAQGWSSNDVTVTTIAGNGQPGLADGSATTAQ 100

Query: 103 FDKPKSFAVDMKGN-IYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRADGPAQNASFS 159
           F KP    +   G  + VAD +N+ IRK+T     V TIAG     +G  DG    + F 
Sbjct: 101 FQKPLGVDISPDGTWVAVADSANNRIRKVTLATGNVETIAG---FTQGYQDGVGTLSKFW 157

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
               ++F P    L ++D  N  IR I+L     S +++SG  LG
Sbjct: 158 LPASVSFHPQGTGLAVADTNNNRIRWISL--PSYSVTTKSGRCLG 200



 Score = 37.7 bits (86), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 3/106 (2%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT--NLGVTTIAGGGSKKEG 147
           QGY DG    ++F  P S +   +G  + VAD +N+ IR I+  +  VTT +G      G
Sbjct: 144 QGYQDGVGTLSKFWLPASVSFHPQGTGLAVADTNNNRIRWISLPSYSVTTKSGRCLGIAG 203

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
            A+G  + A++++   +   P    + I+D  N +IR+  +   D 
Sbjct: 204 YANGLGKTAAYNSPQYVRVSPDGVWIYIADGQNNVIRRARVDNGDV 249


>gi|332878080|ref|ZP_08445810.1| NHL repeat protein [Capnocytophaga sp. oral taxon 329 str. F0087]
 gi|357047673|ref|ZP_09109271.1| NHL repeat protein [Paraprevotella clara YIT 11840]
 gi|332684042|gb|EGJ56909.1| NHL repeat protein [Capnocytophaga sp. oral taxon 329 str. F0087]
 gi|355529361|gb|EHG98795.1| NHL repeat protein [Paraprevotella clara YIT 11840]
          Length = 461

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 57/114 (50%), Gaps = 2/114 (1%)

Query: 77  SEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           + E   K  AG  G  G++DG    A+F  P+   +D + N+Y+AD  N+ IRKIT  GV
Sbjct: 349 TREGNFKIYAGMQGRPGHADGLASDAQFHSPRQLVLDEEENLYIADSENNCIRKITPEGV 408

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            +   G   K G  DG    A F+  + L  +     + + D  N  +RQ++++
Sbjct: 409 VSTVIGIPGKSGYKDGTPDVALFTQPWGLA-IDSEGIIYVGDKDNLCVRQLSIE 461


>gi|29347940|ref|NP_811443.1| hypothetical protein BT_2530 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383121619|ref|ZP_09942326.1| hypothetical protein BSIG_4948 [Bacteroides sp. 1_1_6]
 gi|29339842|gb|AAO77637.1| conserved hypothetical protein, with a conserved domain
           [Bacteroides thetaiotaomicron VPI-5482]
 gi|251837946|gb|EES66035.1| hypothetical protein BSIG_4948 [Bacteroides sp. 1_1_6]
          Length = 441

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 1/104 (0%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           KRL   G   + DG    A F+ PK    D  GN++VAD  NH IR I+   + T   G 
Sbjct: 336 KRLNAPGGSAFRDGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMISADNIVTTVAGQ 395

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               G  DG    + F N + +  V     + I+D GN  IR++
Sbjct: 396 PGVAGYKDGGPVESLFKNPWGVA-VNEQGDIYIADWGNARIRKL 438


>gi|365861024|ref|ZP_09400809.1| alkyl hydroperoxide reductase [Streptomyces sp. W007]
 gi|364009547|gb|EHM30502.1| alkyl hydroperoxide reductase [Streptomyces sp. W007]
          Length = 560

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 7/112 (6%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
            +V D++R     L    ++   V+R  G G +G SDG P  ARF +P+  AV   G + 
Sbjct: 143 FLVSDTTRHRLVELD---ADGETVRRHFGTGERGLSDGGPDEARFSEPQGLAVLPDGRVA 199

Query: 119 VADKSNHVIRKIT-NLGVT-TIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
           VAD  NH +R +    GVT T+AG G +  +     GPA+    S+ +++ +
Sbjct: 200 VADTVNHALRALDLTTGVTSTLAGTGRQWWQGTPTSGPAREVDLSSPWDVAW 251


>gi|421111981|ref|ZP_15572448.1| hypothetical protein LEP1GSC071_4063 [Leptospira santarosai str.
           JET]
 gi|410802632|gb|EKS08783.1| hypothetical protein LEP1GSC071_4063 [Leptospira santarosai str.
           JET]
          Length = 356

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 67/119 (56%), Gaps = 7/119 (5%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKE 146
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI  N G V+T++GG     
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---IL 217

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   +T+     +LL++D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 218 GYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLK--NATVSTFVGNGIGA 274



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 78/203 (38%), Gaps = 48/203 (23%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I LD      Y     ++   VV   AG G++   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI-------------------------- 138
           P    VD  GNI+V+D+  ++IRKI   G VTT+                          
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKIDRSGNVTTLSTSLLLEDPSGIKFDPITGDKYVSCK 138

Query: 139 -----------------AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
                            AG  S  +G  +G   N+SF   F +  +     L + + GN 
Sbjct: 139 DSAQIFKIDPLDQFSLYAGNSSGVDGFQNGDRLNSSFKGPFFMD-LDRERNLYVGELGNH 197

Query: 182 LIRQINLKPEDCSKSSQSGSALG 204
            IR+INL     S S+ SG  LG
Sbjct: 198 AIRKINLNSG--SVSTLSGGILG 218



 Score = 41.2 bits (95), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 42/81 (51%), Gaps = 6/81 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G+ GY DG+  SA+F  P     D K + + VAD  +H IRKI   N  V+T 
Sbjct: 209 VSTLSG-GILGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKNATVSTF 267

Query: 139 AGGGSKKEGRADGPAQNASFS 159
            G G   +   DG   NAS +
Sbjct: 268 VGNGIGAD--IDGNGLNASLN 286


>gi|290978943|ref|XP_002672194.1| predicted protein [Naegleria gruberi]
 gi|284085769|gb|EFC39450.1| predicted protein [Naegleria gruberi]
          Length = 1483

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 60/131 (45%), Gaps = 6/131 (4%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S+  IV+  S++    L F L EE  +  LAG    G+ D  P       P    +    
Sbjct: 733 STKYIVISDSKNN--RLRFYLKEEKTIYTLAG----GFGDNGPSENGIGTPFGVTISPNN 786

Query: 116 NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
            IYV+D  N++IR+I N  +TT+AG G+      D  A  AS ++   +   P    ++ 
Sbjct: 787 EIYVSDVLNNLIRRIKNGKITTVAGSGAYGYNGDDISALKASLASPSNIVVNPLTNDIIF 846

Query: 176 SDHGNQLIRQI 186
           SD  N  IR I
Sbjct: 847 SDTNNHRIRII 857


>gi|390957686|ref|YP_006421443.1| gluconolactonase [Terriglobus roseus DSM 18391]
 gi|390412604|gb|AFL88108.1| gluconolactonase [Terriglobus roseus DSM 18391]
          Length = 738

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/128 (35%), Positives = 68/128 (53%), Gaps = 6/128 (4%)

Query: 76  LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S   V+  +AG G QGY    G   +A  + P + AV   G++Y AD  NH IR+I + 
Sbjct: 96  VSSGGVLAVVAGTGHQGYVGDGGAATAAELNAPTAVAVAPDGSVYFADSGNHCIRRIASG 155

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
            +TT+AG G+   G   GPA  A F +   L F     +L ++D GN+ +R+I   P   
Sbjct: 156 VITTVAGNGAPGFGGDGGPAMVARFRSPGGLAFAAD-GSLYVADTGNRRVRKI---PPGG 211

Query: 194 SKSSQSGS 201
           S S+ +G+
Sbjct: 212 SVSTIAGT 219



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 51/112 (45%), Gaps = 1/112 (0%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           L   S V  LAG G  G +  G         P+  A D +GN+YVAD  NH I ++++ G
Sbjct: 41  LDPGSRVGTLAGSGRDGRTVAGTAQDVALGWPRGLAYDHEGNLYVADSRNHQIDRVSSGG 100

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           V  +  G   +    DG A  A+  N      V    ++  +D GN  IR+I
Sbjct: 101 VLAVVAGTGHQGYVGDGGAATAAELNAPTAVAVAPDGSVYFADSGNHCIRRI 152



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 45/87 (51%), Gaps = 2/87 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           ARF  P   A    G++YVAD  N  +RKI   G V+TIAG G++ +    G A  ASF 
Sbjct: 178 ARFRSPGGLAFAADGSLYVADTGNRRVRKIPPGGSVSTIAGTGTEDDAGDGGVATAASFR 237

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
           +   L  +P    LLI+D     +R +
Sbjct: 238 SPGALQVLPD-GRLLIADREAYRVRAL 263


>gi|302821385|ref|XP_002992355.1| hypothetical protein SELMODRAFT_6804 [Selaginella moellendorffii]
 gi|300139771|gb|EFJ06505.1| hypothetical protein SELMODRAFT_6804 [Selaginella moellendorffii]
          Length = 78

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 48/77 (62%), Gaps = 4/77 (5%)

Query: 79  ESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
           +S ++  AG   G  GY DG    +RF++P+S A+   G ++V D +N  IRKI+  G V
Sbjct: 3   DSSLEAFAGSHVGESGYVDGPAAKSRFNRPQSLAICDNGAVFV-DTTNLAIRKISKNGEV 61

Query: 136 TTIAGGGSKKEGRADGP 152
           TTIAGG S++ G AD P
Sbjct: 62  TTIAGGSSRRPGIADSP 78


>gi|290971688|ref|XP_002668616.1| predicted protein [Naegleria gruberi]
 gi|284082092|gb|EFC35872.1| predicted protein [Naegleria gruberi]
          Length = 348

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 57/105 (54%), Gaps = 5/105 (4%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG+G +G+S  +G   +A+ + P    V     +Y+AD SNHVIRKI+  G + TIAG 
Sbjct: 143 IAGNGTKGFSGDNGPATNAQLNGPAGVFVS-NNEVYIADYSNHVIRKISQNGTIVTIAGN 201

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G       +G A NA   N    TFV     + ISD  N +IR+I
Sbjct: 202 GKPGFSGDNGLATNAQLYNP-SGTFVSSNNEVYISDCFNHVIRKI 245



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 58/114 (50%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S+   +  +AG+G  G+S  +G   +A+   P    V     +Y++D  NHVIRKI+  
Sbjct: 189 ISQNGTIVTIAGNGKPGFSGDNGLATNAQLYNPSGTFVSSNNEVYISDCFNHVIRKISQN 248

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G + TIAG G       +GPA NA   +   + FV     + ISD  N  IR+I
Sbjct: 249 GTIVTIAGNGKGGFSGDNGPATNAQLYSPLGV-FVSSDNEVYISDCFNHRIRKI 301



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 59/112 (52%), Gaps = 5/112 (4%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           +   +  +AG+G  G+   +G   +A+ + P++  V     +Y+AD  N  IRK+  N  
Sbjct: 80  QNGTIVTIAGNGKPGFGGDNGLATNAQLNYPRNVYVSSNNEVYIADFCNQRIRKVLQNGN 139

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+K     +GPA NA   N     FV +   + I+D+ N +IR+I
Sbjct: 140 IITIAGNGTKGFSGDNGPATNAQL-NGPAGVFVSN-NEVYIADYSNHVIRKI 189


>gi|345008031|ref|YP_004810385.1| redoxin domain-containing protein [Streptomyces violaceusniger Tu
           4113]
 gi|344034380|gb|AEM80105.1| Redoxin domain protein [Streptomyces violaceusniger Tu 4113]
          Length = 615

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 63/117 (53%), Gaps = 8/117 (6%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
           PG +  +V D++R     L+     ESVV+R+ G G +G +DG    ARF +P+  A+  
Sbjct: 195 PGGT-FLVSDTTRHQLVELA--ADGESVVRRI-GAGERGLTDGTGERARFSEPQGLALLP 250

Query: 114 KGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
            G + VAD  NH +R+       VTT+AG G +  +    +GPA+    S+ +++ +
Sbjct: 251 DGTVAVADTVNHALRRFDPATGEVTTLAGTGRQWWQGSPTEGPAREVDLSSPWDVAW 307


>gi|262195555|ref|YP_003266764.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
 gi|262078902|gb|ACY14871.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
          Length = 404

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/113 (37%), Positives = 61/113 (53%), Gaps = 6/113 (5%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRAD 150
           G++DG    ARFD+P   AV +   IYVAD +NH IRKIT  G V+T+AG G      +D
Sbjct: 241 GFADGNGAEARFDRPLGVAV-VDDEIYVADSANHRIRKITLDGEVSTLAGTGVAAS--SD 297

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQL-IRQINLKPEDCSKSSQSGSA 202
           G    A+F   + +T       L +++ G+   +R+I+L  E     + SG A
Sbjct: 298 GSLAEATFDTPYAMTRDSD-GNLYVTELGDSFRVRKIDLVAETVETIAGSGMA 349



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 33/64 (51%), Gaps = 3/64 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV-IRKITNLG--VTTI 138
           V  LAG GV   SDG    A FD P +   D  GN+YV +  +   +RKI  +   V TI
Sbjct: 284 VSTLAGTGVAASSDGSLAEATFDTPYAMTRDSDGNLYVTELGDSFRVRKIDLVAETVETI 343

Query: 139 AGGG 142
           AG G
Sbjct: 344 AGSG 347



 Score = 38.1 bits (87), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 35/83 (42%), Gaps = 15/83 (18%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
           LAG    G +DG    A    P   AV   GN+Y+A+  N  IRKIT  G V+T+     
Sbjct: 86  LAGGEEAGTTDGTRAVATLRNPAGLAVGPNGNVYIAEFDNDRIRKITPEGTVSTL----- 140

Query: 144 KKEGRADGPAQNASFSNDFELTF 166
                     Q A F   F L F
Sbjct: 141 ---------TQAAGFDRPFGLAF 154


>gi|357412450|ref|YP_004924186.1| alkyl hydroperoxide reductase [Streptomyces flavogriseus ATCC
           33331]
 gi|320009819|gb|ADW04669.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Streptomyces flavogriseus ATCC 33331]
          Length = 603

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 7/112 (6%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
            +V DS+R     L    ++   V+   G G +G++DG    ARF +P+  AV   G I 
Sbjct: 188 FLVSDSTRHRLVELE---ADGETVRGHFGTGERGFADGGREEARFSEPQGLAVLPDGRIA 244

Query: 119 VADKSNHVIRKIT-NLGVTTIAGGGSKK--EGRA-DGPAQNASFSNDFELTF 166
           VAD  NH IR +    GVTT   G  ++  +G A DGPA     S+ +++ +
Sbjct: 245 VADTVNHAIRALDLTTGVTTTLAGTGRQWWQGSATDGPATEVDLSSPWDIAW 296


>gi|290976072|ref|XP_002670765.1| predicted protein [Naegleria gruberi]
 gi|284084327|gb|EFC38021.1| predicted protein [Naegleria gruberi]
          Length = 854

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 60/112 (53%), Gaps = 5/112 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTT 137
           +  +AG G  GY +G+ G A       P+  AV   G +Y++D +N++IRK+  N  + T
Sbjct: 73  ISTIAGTGSAGY-NGDGGLAIETMLSSPQGVAVSESGEVYLSDSTNNIIRKVYLNGTIQT 131

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           IAG  ++     +GPA N  F +   L+  P    L I+D  N ++R+++L 
Sbjct: 132 IAGSRTQGYSGDNGPAVNCQFFSPQGLSLSPSNSDLYIADTFNNVVRRLDLN 183



 Score = 42.0 bits (97), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 3/96 (3%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           SA+   PK   V   G +++AD  N+ IRK+ + G + TIAG  S       G + +A  
Sbjct: 319 SAQLSSPKGLVVSSSGEMFLADSGNNRIRKVFSNGIIVTIAGTSSVGFSGDGGLSTDAQL 378

Query: 159 SNDFELTF--VPHICALLISDHGNQLIRQINLKPED 192
           +N   L    +  +  + ISD GN  IR++    E+
Sbjct: 379 NNPVNLAIRQLRSLSEIYISDAGNHAIRKLTAFCEE 414



 Score = 37.4 bits (85), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 26/107 (24%), Positives = 46/107 (42%), Gaps = 5/107 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA--RFD 104
           P  +   P +SDL + D+  +    L         ++   GDG +GY+    G+A   F+
Sbjct: 155 PQGLSLSPSNSDLYIADTFNNVVRRLDL---NTGYIRPFVGDGGRGYTSPGDGNAFGWFN 211

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
             +   +   G +   D  N++IRK+   G+  ++G G       DG
Sbjct: 212 VIRGLCLSPNGELIFTDTENNMIRKVYANGLYDVSGRGQTNSYLGDG 258


>gi|298387816|ref|ZP_06997366.1| hypothetical protein HMPREF9007_04616 [Bacteroides sp. 1_1_14]
 gi|298259421|gb|EFI02295.1| hypothetical protein HMPREF9007_04616 [Bacteroides sp. 1_1_14]
          Length = 440

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 48/104 (46%), Gaps = 1/104 (0%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           KRL   G   + DG    A F+ PK    D  GN++VAD  NH IR ++   + T   G 
Sbjct: 335 KRLNAPGGSAFRDGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMVSADNIVTTVAGQ 394

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               G  DG    + F N + +  V     + I+D GN  IR++
Sbjct: 395 PGVAGYKDGGPVESLFKNPWGVA-VNEQGDIYIADWGNARIRKL 437


>gi|289664052|ref|ZP_06485633.1| putative NHL repeat protein [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 263

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 44/84 (52%), Gaps = 3/84 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           L+ +  V  LAGDGV G  +G     RF+ P   AVD +G +YVAD  N  I  I   G 
Sbjct: 48  LTPQGKVTTLAGDGVAGDRNGAAAQVRFNGPVGVAVDAQGRVYVADTYNDRIGVIETDGQ 107

Query: 135 VTTIAGGGSKKEGRADGPAQNASF 158
           V T+AGGG    G ADG    A F
Sbjct: 108 VRTLAGGG--LPGMADGIGTQAWF 129



 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 29/59 (49%), Positives = 39/59 (66%), Gaps = 2/59 (3%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
           LAG G +G++DG   +A F+ P    +D  GN+Y+AD  NH IRK+T  G VTT+AG G
Sbjct: 4   LAG-GREGFADGIGVAAAFNTPSGIMLDTAGNLYIADTGNHAIRKLTPQGKVTTLAGDG 61


>gi|440714668|ref|ZP_20895245.1| NHL repeat containing protein [Rhodopirellula baltica SWK14]
 gi|436440433|gb|ELP33756.1| NHL repeat containing protein [Rhodopirellula baltica SWK14]
          Length = 283

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 53/102 (51%), Gaps = 5/102 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-----VTTIAGGGSKKE 146
           GY DG   SARF+K  +  +D +  +Y++D +NH +R++         V T AG G    
Sbjct: 38  GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTTDGEWMVDTYAGQGKAGP 97

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
              +   ++A+F     +T       LLI+D GNQ++R I+L
Sbjct: 98  ATNNVDRRDATFHEPISVTLDAEGNRLLIADIGNQVVRSIDL 139



 Score = 44.7 bits (104), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 4/95 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           + T + G   ++     +D  G+  L+VL+ + +    +         +  LAG G +G 
Sbjct: 143 LVTTLAGRTSKLKDPRAVDLDGNRRLLVLERNGNRLRRVE----SNGDITTLAGSGKKGT 198

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +DG+   A F+ PK   V   G +Y+AD  NH++R
Sbjct: 199 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 233



 Score = 41.2 bits (95), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 52/88 (59%), Gaps = 8/88 (9%)

Query: 100 SARFDKPKSFAVDMKGN--IYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNA 156
           +++   P+  AVD+ GN  + V +++ + +R++ +N  +TT+AG G  K+G ADG A+ A
Sbjct: 151 TSKLKDPR--AVDLDGNRRLLVLERNGNRLRRVESNGDITTLAGSG--KKGTADGDAKQA 206

Query: 157 SFSNDFELTFVPHICALLISDHGNQLIR 184
           SF+    +   P    + I+D  N L+R
Sbjct: 207 SFNGPKHMDVAPD-GRVYIADDVNHLVR 233


>gi|417302109|ref|ZP_12089222.1| NHL repeat containing protein [Rhodopirellula baltica WH47]
 gi|327541551|gb|EGF28082.1| NHL repeat containing protein [Rhodopirellula baltica WH47]
          Length = 351

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 5/102 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-----VTTIAGGGSKKE 146
           GY DG   SARF+K  +  +D +  +Y++D +NH +R++         V T AG G +  
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTSDGEWMVDTYAGQGKEGP 165

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
              +   ++A+F     +T       LLI+D GNQ++R I+ 
Sbjct: 166 ATNNVDRRDATFHEPISVTLDAEGNRLLIADIGNQVVRSIDF 207



 Score = 45.4 bits (106), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 47/95 (49%), Gaps = 4/95 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           + T + G + ++     +D  G+  L+VL+ + +    +         +  LAG G +G 
Sbjct: 211 LVTTLAGRKSKLKDPRAVDLDGNGRLLVLERNGNRLRRVE----SNGDITTLAGSGKKGT 266

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +DG+   A F+ PK   V   G +Y+AD  NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301



 Score = 43.5 bits (101), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 49/85 (57%), Gaps = 4/85 (4%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           ++   P++  +D  G + V +++ + +R++ +N  +TT+AG G  K+G ADG A+ ASF+
Sbjct: 220 SKLKDPRAVDLDGNGRLLVLERNGNRLRRVESNGDITTLAGSG--KKGTADGDAKQASFN 277

Query: 160 NDFELTFVPHICALLISDHGNQLIR 184
               +   P    + I+D  N L+R
Sbjct: 278 GPKHMDVAPD-GRVYIADDVNHLVR 301


>gi|372267905|ref|ZP_09503953.1| hypothetical protein AlS89_08394 [Alteromonas sp. S89]
          Length = 2227

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 55/109 (50%), Gaps = 4/109 (3%)

Query: 81   VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
            ++  +AG G +G+S DG P + A    P    +D +GN+Y+ D  N  IR++   G +TT
Sbjct: 1056 IISTVAGSGARGFSGDGGPATDASLSAPSGITLDPEGNLYIVDSGNRRIRRVGVDGRITT 1115

Query: 138  IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            IAG G  K     G A  A F +   + +      + ISD G   +R+I
Sbjct: 1116 IAGNGGYKNTGDGGSALEAGFDDPLGIAYAAD-GGIYISDSGEGGVRRI 1163



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 54/109 (49%), Gaps = 8/109 (7%)

Query: 81   VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
            V+   AGDG      G    A  + P        G+IY+AD  NH IR++   G ++T+A
Sbjct: 1008 VIDTFAGDG------GPATDASLNAPYDVICGPHGSIYIADSRNHRIRRVDVNGIISTVA 1061

Query: 140  GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            G G++      GPA +AS S    +T  P    L I D GN+ IR++ +
Sbjct: 1062 GSGARGFSGDGGPATDASLSAPSGITLDPE-GNLYIVDSGNRRIRRVGV 1109



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 59/121 (48%), Gaps = 9/121 (7%)

Query: 71   TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            T++  L+    V+  +GDG    +      A  D P        G+IYVAD  N  +R+I
Sbjct: 941  TIAGTLASNVSVRGFSGDGAIATA------ATMDHPYGLDFCDDGSIYVADTGNDRVRRI 994

Query: 131  TNLGV-TTIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
               GV TTIAG         D GPA +AS +  +++   PH  ++ I+D  N  IR++++
Sbjct: 995  DRRGVITTIAGSEVIDTFAGDGGPATDASLNAPYDVICGPH-GSIYIADSRNHRIRRVDV 1053

Query: 189  K 189
             
Sbjct: 1054 N 1054



 Score = 37.0 bits (84), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 57/117 (48%), Gaps = 11/117 (9%)

Query: 82   VKRLAGDGVQGYSD-GEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
            +  +AG+G  GY + G+ GSA    FD P   A    G IY++D     +R+I T+  + 
Sbjct: 1113 ITTIAGNG--GYKNTGDGGSALEAGFDDPLGIAYAADGGIYISDSGEGGVRRIGTDGTIV 1170

Query: 137  TIAGGGSKKEGRA--DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
            T+AG  S   G     GPA  A  +   ++   P   +L + D  N  IR+++  PE
Sbjct: 1171 TVAGNKSPYWGFGGDGGPAIQADMTGVTDVAIGPE-GSLYLVDAYNFRIRKVS-NPE 1225


>gi|290985545|ref|XP_002675486.1| predicted protein [Naegleria gruberi]
 gi|284089082|gb|EFC42742.1| predicted protein [Naegleria gruberi]
          Length = 819

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 41/112 (36%), Positives = 56/112 (50%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           ++  +  +AG G +G+S DG+P  SA+   P S  V     ++ AD  N+ IRKI   G 
Sbjct: 448 KDGTIVTIAGTGEEGFSGDGDPATSAQLSHPCSVFVSSTNEVFFADSGNYRIRKILRNGN 507

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G K       PA NA  S   +  FV     +  SD GN  IR+I
Sbjct: 508 IVTIAGTGEKGYSGDGRPAINAQISY-VQNIFVSQNDEIYFSDFGNHRIRKI 558



 Score = 54.3 bits (129), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 40/105 (38%), Positives = 55/105 (52%), Gaps = 4/105 (3%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG G +G+S DG P  SA+ D P    V     +Y+ D +NH IRKI   G + TIAG 
Sbjct: 567 IAGTGEKGFSGDGGPATSAQLDSPCGVFVSNNDEVYIVDYNNHRIRKILRNGIINTIAGT 626

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G +      GPA NA  ++   + FV     + I + GN  IR+I
Sbjct: 627 GEEGFSGDGGPAINAQVNHPCGV-FVSSTNEVYIMNSGNYRIRKI 670



 Score = 50.8 bits (120), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 39/105 (37%), Positives = 54/105 (51%), Gaps = 4/105 (3%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG G +GYS DG P  +A+    ++  V     IY +D  NH IRKI   G + TIAG 
Sbjct: 511 IAGTGEKGYSGDGRPAINAQISYVQNIFVSQNDEIYFSDFGNHRIRKILRNGTIVTIAGT 570

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G K      GPA +A   +   + FV +   + I D+ N  IR+I
Sbjct: 571 GEKGFSGDGGPATSAQLDSPCGV-FVSNNDEVYIVDYNNHRIRKI 614



 Score = 48.5 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 38/109 (34%), Positives = 55/109 (50%), Gaps = 4/109 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTT 137
           ++  +AG G +G+S DG P  +A+ + P    V     +Y+ +  N+ IRKI  N  +TT
Sbjct: 619 IINTIAGTGEEGFSGDGGPAINAQVNHPCGVFVSSTNEVYIMNSGNYRIRKILRNANITT 678

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           IAG G K      G A NA  S   +  FV     + I+D  N  IR+I
Sbjct: 679 IAGTGVKGYSGDGGLAINAQISY-VDNIFVSRNDEVYIADTENHRIRKI 726



 Score = 46.2 bits (108), Expect = 0.015,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 51/96 (53%), Gaps = 11/96 (11%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           +  +AG GV+GYS G+ G   +A+    ++  V     +Y+AD +NH IRKI   G + T
Sbjct: 293 ITTIAGTGVKGYS-GDGGLAINAQISYVENIFVSQNDEVYIADTNNHRIRKILKDGTIET 351

Query: 138 IAGGGSKKEGRADGP---AQNASFSNDFELTFVPHI 170
           IAG G K  G  D P   + +    ND+  T +P I
Sbjct: 352 IAGNGEKGFG-GDSPFDFSSHPHIGNDY--TIIPKI 384



 Score = 45.4 bits (106), Expect = 0.025,   Method: Composition-based stats.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 4/105 (3%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG G +GYS DG P  +A+     +  V     +Y +D  NH IRKI   G + TIAG 
Sbjct: 128 IAGTGEEGYSGDGGPAINAQISAVNNIFVSQNDEVYFSDFRNHRIRKILRNGTIVTIAGT 187

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G +      GPA NA  +    + FV +   + I D+ +  IR++
Sbjct: 188 GEQGFSGDGGPAINAKLNTPCGV-FVSNNDEVYIVDYKSHRIRKM 231



 Score = 45.1 bits (105), Expect = 0.031,   Method: Composition-based stats.
 Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 4/105 (3%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG G QG+S DG P  +A+ + P    V     +Y+ D  +H IRK+   G + TIAG 
Sbjct: 184 IAGTGEQGFSGDGGPAINAKLNTPCGVFVSNNDEVYIVDYKSHRIRKMLQDGTIITIAGT 243

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G +  G   GPA +A  S+   + FV     + I+D  N  IR+I
Sbjct: 244 GEQGFGGDGGPATSAQLSHPCGV-FVSSTNEVYITDSYNYRIRKI 287



 Score = 43.9 bits (102), Expect = 0.068,   Method: Composition-based stats.
 Identities = 32/88 (36%), Positives = 43/88 (48%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           SA+F+ P    V     +Y++D  N+ IRKI   G + TIAG G +      GPA NA  
Sbjct: 89  SAQFNHPCDVFVSSTNEVYISDFGNYRIRKILRNGNIVTIAGTGEEGYSGDGGPAINAQI 148

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           S    + FV     +  SD  N  IR+I
Sbjct: 149 SAVNNI-FVSQNDEVYFSDFRNHRIRKI 175



 Score = 43.5 bits (101), Expect = 0.10,   Method: Composition-based stats.
 Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 11/96 (11%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           +  +AG GV+GYS G+ G   +A+     +  V     +Y+AD  NH IRKI   G + T
Sbjct: 676 ITTIAGTGVKGYS-GDGGLAINAQISYVDNIFVSRNDEVYIADTENHRIRKILRNGTIKT 734

Query: 138 IAGGGSKKEGRADGP---AQNASFSNDFELTFVPHI 170
           IAG G +  G  D P   + +    ND+  T +P I
Sbjct: 735 IAGNGEEGFG-GDSPFDFSSHPHIGNDY--TIIPKI 767



 Score = 42.7 bits (99), Expect = 0.17,   Method: Composition-based stats.
 Identities = 33/88 (37%), Positives = 41/88 (46%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           SA+   P    V     +Y+ D  N+ IRKI   G +TTIAG G K      G A NA  
Sbjct: 257 SAQLSHPCGVFVSSTNEVYITDSYNYRIRKILRNGNITTIAGTGVKGYSGDGGLAINAQI 316

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           S   E  FV     + I+D  N  IR+I
Sbjct: 317 SY-VENIFVSQNDEVYIADTNNHRIRKI 343



 Score = 40.0 bits (92), Expect = 1.0,   Method: Composition-based stats.
 Identities = 29/87 (33%), Positives = 41/87 (47%), Gaps = 2/87 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           A  + P    V     +Y+AD  NH +RKI   G + TIAG G +       PA +A  S
Sbjct: 417 AMLNCPGGVFVAPNDEVYMADCQNHRVRKILKDGTIVTIAGTGEEGFSGDGDPATSAQLS 476

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
           +   + FV     +  +D GN  IR+I
Sbjct: 477 HPCSV-FVSSTNEVFFADSGNYRIRKI 502


>gi|408828763|ref|ZP_11213653.1| hypothetical protein SsomD4_16351 [Streptomyces somaliensis DSM
           40738]
          Length = 614

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 62/115 (53%), Gaps = 7/115 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S +L+V D++R     ++     E+VV+R  GDG +G  DG   SARF +P+       G
Sbjct: 197 SGNLLVSDTTRHRLVEVA--PDGETVVRRY-GDGRRGLVDGPAESARFSEPQGMCALPDG 253

Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
            I VAD  NH +R +      V+T+AG G++  +     GPA+  + S+ +++ +
Sbjct: 254 RIVVADTVNHALRALDPETGAVSTLAGTGAQWMQGSPTSGPAREVALSSPWDVAW 308



 Score = 42.4 bits (98), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 63/135 (46%), Gaps = 5/135 (3%)

Query: 68  AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP-KSFAVDMKGNIYVADKSNHV 126
           A  TL   L  E   K     G   Y   EP +     P K+ A++  GN+ V+D + H 
Sbjct: 151 AIGTLVAELEAEHGAKGTLRRGDGPYVPPEPTATHLRFPGKALALE-SGNLLVSDTTRHR 209

Query: 127 IRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + ++   G T +   G  + G  DGPA++A FS    +  +P    ++++D  N  +R  
Sbjct: 210 LVEVAPDGETVVRRYGDGRRGLVDGPAESARFSEPQGMCALPD-GRIVVADTVNHALRA- 267

Query: 187 NLKPEDCSKSSQSGS 201
            L PE  + S+ +G+
Sbjct: 268 -LDPETGAVSTLAGT 281


>gi|159471860|ref|XP_001694074.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158277241|gb|EDP03010.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1591

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 60/125 (48%), Gaps = 10/125 (8%)

Query: 73   SFPLSEESVVKRLAGDG---------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
            S  +S++  ++R+A DG         + G+ DG   +ARF   +  AVD +GN Y +D S
Sbjct: 1221 SLVVSDQHCLRRVASDGFVTTIAGSSMPGHLDGPAATARFYNLRGVAVDGEGNCYCSDSS 1280

Query: 124  NHVIRKITNL-GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
            NH +R +    G+ +   G   + G  DG    A F N   +       +L ++D  N  
Sbjct: 1281 NHCVRLLHAADGMVSTFAGSPGQAGFRDGAGTEARFRNPCGIAINLQDGSLAVADAENNR 1340

Query: 183  IRQIN 187
            +R+I+
Sbjct: 1341 LRRID 1345



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 48/84 (57%), Gaps = 8/84 (9%)

Query: 79   ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKITNLG-VT 136
            E+VV  LAG G  G+ DG    A+F  P   AVD +  ++ V+D+  H +R++ + G VT
Sbjct: 1185 EAVV--LAGGGGYGHRDGPGRKAKFACPMYLAVDARDSSLVVSDQ--HCLRRVASDGFVT 1240

Query: 137  TIAGGGSKKEGRADGPAQNASFSN 160
            TIA  GS   G  DGPA  A F N
Sbjct: 1241 TIA--GSSMPGHLDGPAATARFYN 1262


>gi|344923600|ref|ZP_08777061.1| NHL repeat containing protein [Candidatus Odyssella
           thessalonicensis L13]
          Length = 1226

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 64/135 (47%), Gaps = 8/135 (5%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           GS  + V DS+ +        ++    V  LAG GV  Y++G   +ARF  P+   VD+ 
Sbjct: 35  GSGTVYVADSNNNVIK----KITPAGTVSVLAGSGVSDYAEGTGTAARFSYPEGVGVDVA 90

Query: 115 G-NIYVADKSNHVIRKI-TNLGVTT-IAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC 171
           G  +YVAD  N VIRKI T+  VT+ +AG         DG    A F+    +  V    
Sbjct: 91  GTTVYVADSQNGVIRKIDTSTRVTSLVAGSPGALNAHVDGTYTTARFAYPTSVA-VDSSS 149

Query: 172 ALLISDHGNQLIRQI 186
            L I D  N  IR+I
Sbjct: 150 NLYIGDTLNHCIRKI 164



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/50 (54%), Positives = 32/50 (64%), Gaps = 3/50 (6%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAGGG 142
           DG   +ARF  P S AVD   N+Y+ D  NH IRKI   N+ VTT+AGGG
Sbjct: 129 DGTYTTARFAYPTSVAVDSSSNLYIGDTLNHCIRKIAPGNV-VTTLAGGG 177



 Score = 44.7 bits (104), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 50/107 (46%), Gaps = 3/107 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           S  F+     AVD  G +YVAD +N+VI+KIT  G V+ +AG G        G A   S+
Sbjct: 22  SGGFNFASGVAVDGSGTVYVADSNNNVIKKITPAGTVSVLAGSGVSDYAEGTGTAARFSY 81

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
                +        + ++D  N +IR+I+      S  + S  AL A
Sbjct: 82  PEGVGVDVAG--TTVYVADSQNGVIRKIDTSTRVTSLVAGSPGALNA 126


>gi|290995663|ref|XP_002680402.1| predicted protein [Naegleria gruberi]
 gi|284094023|gb|EFC47658.1| predicted protein [Naegleria gruberi]
          Length = 974

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 99/198 (50%), Gaps = 25/198 (12%)

Query: 30  DGYTVTTVIDGHQLEINP------HSVIDRPGSSDLIVLDS------SRSAFYTLSFPLS 77
           +G   T V  G + + +P      +++I  PG+   I+ DS      S  +F+ +   L+
Sbjct: 210 EGRIATVVGSGQKGQESPDGVLAVNAIIGAPGN---IIFDSIGNMYLSDRSFHKVRKVLT 266

Query: 78  EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
             ++V  +AG+G+  Y+ DG    SA   +P   A+   G +Y+A+   H IRK+ TN  
Sbjct: 267 NGTIVT-IAGNGMSAYNGDGILAVSASLFRPSGLALSSTGELYIAESYGHRIRKVLTNGT 325

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           + TIAG G        G A NA   +  E  F+     LLISD GN+ IR+I+L   D +
Sbjct: 326 IITIAGTGVAGYEGDGGLAVNA-LLDAPESIFLSSDNYLLISDFGNKRIRKISLL--DGT 382

Query: 195 KSSQSGSALG---AVSVW 209
            S+ +G  LG   A SV+
Sbjct: 383 ISTIAGDGLGDGKAASVY 400



 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 61/134 (45%), Gaps = 30/134 (22%)

Query: 77  SEESVVKRLAGDGVQ-GYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           S  +++   AG   Q    DG+P  +    +P+   +   G +Y+AD S   I+K+++ G
Sbjct: 86  STNNIMSTFAGTNNQVNPEDGQPAVNTPLAEPRGLEISSNGEVYIADYSTCKIKKVSSQG 145

Query: 135 V-TTIAGGGSKKEGRADGPAQN------------ASFSNDFELTFVPHIC------ALLI 175
           + TTIAG          GP QN             +F+N+ +L+F   I        L I
Sbjct: 146 IMTTIAG---------TGPGQNNDRRLVSVYTGDGAFANETQLSFPQAIAFHPITGELHI 196

Query: 176 SDHGNQLIRQINLK 189
           ++  +  IR+I L+
Sbjct: 197 AETQSYKIRKITLE 210



 Score = 37.4 bits (85), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 36/66 (54%), Gaps = 2/66 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           ++  LAG  V G+S DG P + A F       +D   +IYV+D  NH IRKI+ +  T  
Sbjct: 663 IIVTLAGSNVAGFSGDGGPAADATFGTLGGIYLDSNDDIYVSDPDNHRIRKISLMCPTDY 722

Query: 139 AGGGSK 144
           A  GS+
Sbjct: 723 ALNGSE 728


>gi|290982388|ref|XP_002673912.1| predicted protein [Naegleria gruberi]
 gi|284087499|gb|EFC41168.1| predicted protein [Naegleria gruberi]
          Length = 2313

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 7/106 (6%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGG 142
           LAG  ++G++DG   +A  + P S ++ +   +Y +D+ NH IR ++ +   VTTI+G G
Sbjct: 500 LAGGSLKGFADGVGSNANLNYPDSISIGLNNMLYFSDRDNHAIRSVSTINALVTTISGSG 559

Query: 143 SKKEGRADGPAQNASFS--NDFELTFVPHICALLISDHGNQLIRQI 186
                  +GPA  A  +     E+     I   +  D GNQ IR+I
Sbjct: 560 IAGYTGDEGPAIYAKLNLPGSIEVALNGDI---IFMDKGNQRIRKI 602



 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 2/67 (2%)

Query: 85  LAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           +AG G  G+S   G   SA  + P +   D  GN+ ++D SN+ IRK+TN  ++T+AG  
Sbjct: 281 VAGTGTSGFSGDGGVATSALLNGPSALTFDSSGNMLISDSSNNRIRKVTNGIISTLAGTS 340

Query: 143 SKKEGRA 149
           ++  G  
Sbjct: 341 NRNFGNG 347



 Score = 45.8 bits (107), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 54/108 (50%), Gaps = 5/108 (4%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           VV  +AG G  GYS DG     A+ + P+   +   G I V+D  NH IRKI N  ++TI
Sbjct: 82  VVSTIAGTGNAGYSGDGANALFAQLNSPQGIGLLSGGAIIVSDTLNHRIRKIENGIISTI 141

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG GS     A G A +A  +    L    +   +  +D  N +IR+I
Sbjct: 142 AGTGSPGY-TASGTATSALINTPLGLAVYNN--EVYFADSLNHVIRKI 186



 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 56/112 (50%), Gaps = 17/112 (15%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
            ++V  ++G G+ GY+ G+ G A + K   P S  V + G+I   DK N  IRKIT  G 
Sbjct: 549 NALVTTISGSGIAGYT-GDEGPAIYAKLNLPGSIEVALNGDIIFMDKGNQRIRKITKYGN 607

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + T+ G GS            + F+N   LT +     + I+D GN LI+ +
Sbjct: 608 IFTLYGNGST-----------SVFNNANGLT-LGGSGEIYIADSGNNLIKML 647



 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 2/78 (2%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT 136
           E  ++  +AG G  GY+  G   SA  + P   AV     +Y AD  NHVIRKI++ G  
Sbjct: 134 ENGIISTIAGTGSPGYTASGTATSALINTPLGLAV-YNNEVYFADSLNHVIRKISSSGSI 192

Query: 137 TIAGGGSKKEGRADGPAQ 154
           +   G     G    P Q
Sbjct: 193 SNEQGVGATSGSQTLPLQ 210



 Score = 40.4 bits (93), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 50/109 (45%), Gaps = 7/109 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDK--PKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTT 137
           ++R+   GV     G   S   D    KSF +   G IY+AD  NH I K  I    +TT
Sbjct: 440 LRRIESSGVLKLVVGSCNSGNQDYFLSKSFDISSDGIIYIADYYNHRIAKFVIGGTSLTT 499

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +AGG  K  G ADG   NA+ +    ++ +     L  SD  N  IR +
Sbjct: 500 LAGGSLK--GFADGVGSNANLNYPDSIS-IGLNNMLYFSDRDNHAIRSV 545


>gi|290992324|ref|XP_002678784.1| predicted protein [Naegleria gruberi]
 gi|284092398|gb|EFC46040.1| predicted protein [Naegleria gruberi]
          Length = 2331

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 58/107 (54%), Gaps = 2/107 (1%)

Query: 82  VKRLAGDGVQGYSDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           +  +AG+G  GYSD  +P  A F+   S  ++  G++Y++D +N+VIRK+++ G  T   
Sbjct: 509 ISSVAGNGYLGYSDNTQPLRASFNLAVSVLINNVGDMYISDYANNVIRKMSSSGAITTIA 568

Query: 141 GGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G +K  G AD   A+ A  +  F L +      L  +D  N  IR++
Sbjct: 569 GQAKISGFADSLNAKTALLNGPFGLYYQESTQTLFFADSLNGRIRKL 615



 Score = 40.8 bits (94), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 59/109 (54%), Gaps = 5/109 (4%)

Query: 81  VVKRLAGDGVQG-YSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTT 137
           V+  +AG+G +  Y DG  G SA  + P    +   G+IY+AD  N  IRK+ ++  +++
Sbjct: 452 VLSTIAGNGGERFYGDGGLGTSASLNYPYGIHLHTDGSIYIADTYNSKIRKLDSSKKISS 511

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +AG G         P + ASF+    +  + ++  + ISD+ N +IR++
Sbjct: 512 VAGNGYLGYSDNTQPLR-ASFNLAVSV-LINNVGDMYISDYANNVIRKM 558


>gi|262405960|ref|ZP_06082510.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|262356835|gb|EEZ05925.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
          Length = 459

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 42/124 (33%), Positives = 58/124 (46%), Gaps = 19/124 (15%)

Query: 85  LAGD-GVQGYSDGEPGSARFDKP--------KSFAVDMKGNIY---VADKSNHVIRKITN 132
            AG+ G  GY D    SARFD P        + +  + + ++Y   + D +NH IRKIT 
Sbjct: 336 FAGEFGSYGYKDASQNSARFDNPMQGCFVLNEEYVAEQRLDVYDFYLTDAANHCIRKITP 395

Query: 133 LG-VTTIAGGGSKK-----EGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
            G VTT AG GS        G  DG P + A F+    L +        + D+GN  +R 
Sbjct: 396 DGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLCYEESTGTFYVGDNGNHRVRT 455

Query: 186 INLK 189
           I L+
Sbjct: 456 IALQ 459


>gi|380692527|ref|ZP_09857386.1| hypothetical protein BfaeM_00899 [Bacteroides faecis MAJ27]
          Length = 441

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 48/104 (46%), Gaps = 1/104 (0%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           KRL   G   + DG    A F+ PK    D  GN++VAD  NH IR ++   + T   G 
Sbjct: 336 KRLNAPGGSAFRDGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMVSADNIVTTVAGQ 395

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               G  DG    + F N + +  V     + I+D GN  IR++
Sbjct: 396 PGVAGYKDGGPLESLFKNPWGVA-VNEQGDIYIADWGNARIRKL 438


>gi|290985668|ref|XP_002675547.1| predicted protein [Naegleria gruberi]
 gi|284089144|gb|EFC42803.1| predicted protein [Naegleria gruberi]
          Length = 1037

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/111 (36%), Positives = 62/111 (55%), Gaps = 4/111 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           V+ + G+G+ G+S DGE   +A+ ++P    VD  G +   D SN+ IR I+N G ++TI
Sbjct: 28  VEVIGGNGISGFSGDGELAINAKLNRPTKIVVDSAGRVVFCDNSNNRIRMISNNGTISTI 87

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           AG G+       G A NA  +    LT +  I  +LI+D  N  IR+I+L 
Sbjct: 88  AGSGTGFVLGDGGLATNAILNMPTGLT-INSIGEILIADTSNFKIRKIDLN 137



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 42/142 (29%), Positives = 71/142 (50%), Gaps = 8/142 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG-SARFDK 105
           P+ V+    S D+ + D+S      +S   S   ++  +AG G  G+ D      A+ + 
Sbjct: 445 PYGVV-TSSSGDVFIADTSNCRIRKVS---SSTGIITTVAG-GTCGFGDNVLAVDAQLNT 499

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
           P   +V+ KG +++AD +NH IRK+++ G ++TIAG G        G A NA+     ++
Sbjct: 500 PYGISVNSKGELFIADTNNHRIRKVSSSGFISTIAGNGVGGFSGDGGLATNANLFKPSKV 559

Query: 165 TFVPHICALLISDHGNQLIRQI 186
             V  I  + I+D     IR+I
Sbjct: 560 V-VNSIGEIFIADSSTNRIRKI 580



 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 57/115 (49%), Gaps = 5/115 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +S    +  +AG G  G+  G+ G   +A  + P    ++  G I +AD SN  IRKI  
Sbjct: 78  ISNNGTISTIAGSGT-GFVLGDGGLATNAILNMPTGLTINSIGEILIADTSNFKIRKIDL 136

Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G +TTIAG G++      G A +A  +   +++  P    + I+D  N  +R+I
Sbjct: 137 NGIITTIAGSGNQGFEGDGGLATSAPLNFPADVSVHPTTNEVFIADSNNHCVRKI 191



 Score = 43.5 bits (101), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
           +A   KP    V+  G I++AD S + IRKI TN  + TIAG G+      +  A N+  
Sbjct: 550 NANLFKPSKVVVNSIGEIFIADSSTNRIRKILTNGTIITIAGNGNSGFNGDEADATNSQL 609

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
            + + +  +     + ISD GN  IR++
Sbjct: 610 GSPYGIA-LSSTGEIYISDQGNNRIRKL 636



 Score = 42.4 bits (98), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 54/115 (46%), Gaps = 4/115 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           +S   ++  + G+ +Q            + PK  A+   G +Y AD   + IRKIT  G 
Sbjct: 305 VSLNGIITTVVGEYLQSQDGKLATDVVLNTPKGVAISPSGEVYFADSDKYAIRKITLSGT 364

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           + TIAG G    G A     ++  SN +++ F+  +    I+D  N  IR+I++ 
Sbjct: 365 IITIAGSG--LYGFAGDNGYSSQLSNSYDI-FINSLGEAFIADTYNHRIRKIDVN 416



 Score = 41.6 bits (96), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 73/150 (48%), Gaps = 14/150 (9%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFD 104
           P  V   P ++++ + DS+      +   L+  ++       G QG S   G   ++R +
Sbjct: 166 PADVSVHPTTNEVFIADSNNHCVRKI---LTNGTIFTVAGICGAQGQSSDGGLAINSRLN 222

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGS---KKEGRADGPAQNASFS 159
            P   A+   G+I++A++ N  +RK+++    ++T AG G      +G+A   A NAS  
Sbjct: 223 YPIGVAISSTGDIFIAERDNSKVRKVSSSTGIISTFAGNGVFGFMGDGKA---AVNASIY 279

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLK 189
           N + + F        ISD+G+  IR+++L 
Sbjct: 280 NPYNVAF-NSAGEAFISDYGSGKIRKVSLN 308


>gi|111219921|ref|YP_710715.1| ascorbate-dependent monooxygenase [Frankia alni ACN14a]
 gi|111147453|emb|CAJ59103.1| Putative ascorbate-dependent monooxygenase [Frankia alni ACN14a]
          Length = 865

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 46/68 (67%), Gaps = 3/68 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   V+  +AG GV G+S DG P  +A+ D+P   A+D +G++Y+AD+ NH IR++   
Sbjct: 558 INAAGVITTIAGTGVAGFSGDGGPAVAAQLDQPAGIALDNRGDLYIADRLNHRIRRVDPR 617

Query: 134 G-VTTIAG 140
           G +TT+AG
Sbjct: 618 GIITTVAG 625



 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/135 (31%), Positives = 62/135 (45%), Gaps = 8/135 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG-YSDGEPGS-ARFDKPKSFAVDM 113
           + DL + D   +    +        ++   AG   +G + DG P + A  D P   A D 
Sbjct: 665 AGDLFIADQGENRIRKVD----ARGIISTFAGSSGRGSFGDGGPATDALLDVPFGVAADA 720

Query: 114 KGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
            G++Y+AD  N  IRK+   GV TT+AG   +      GPA  AS   D     V  +  
Sbjct: 721 AGDVYIADTDNSRIRKVDTHGVITTVAGNRLRGFAGDGGPAVKASL-QDPRGIAVDAVGN 779

Query: 173 LLISDHGNQLIRQIN 187
           L I+D GN  IR+++
Sbjct: 780 LYITDRGNSRIRKVD 794



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 7/134 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDM 113
           + D+ + D+  S    +        V+  +AG+ ++G++ DG P   A    P+  AVD 
Sbjct: 721 AGDVYIADTDNSRIRKVD----THGVITTVAGNRLRGFAGDGGPAVKASLQDPRGIAVDA 776

Query: 114 KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
            GN+Y+ D+ N  IRK+   G +TT+AG G        G A NA          V H   
Sbjct: 777 VGNLYITDRGNSRIRKVDTHGIITTLAGSGRPGSAGDGGLAGNAELGRPDGAVGVDHEGN 836

Query: 173 LLISDHGNQLIRQI 186
           +  SD  +  IR +
Sbjct: 837 VFFSDRASGRIRVV 850


>gi|307182571|gb|EFN69764.1| NHL repeat-containing protein 2 [Camponotus floridanus]
          Length = 687

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 52/197 (26%), Positives = 85/197 (43%), Gaps = 20/197 (10%)

Query: 5   FVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDS 64
            +Y+ + I     L  + +  L L   Y + +      L     +++     + LI+ DS
Sbjct: 182 LLYARVMIAYFKSLNQISNHSLPLRPAYHLLSSFKNGLLFPGKLAILSSEHGTKLIISDS 241

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
             +     +    +   V+   G   QG+ DG   +ARF+ P+   V +   IYVAD +N
Sbjct: 242 GNNRIVIAT----KHGEVEHFIGGCNQGFKDGSFKNARFNSPQGVCV-LNNTIYVADNNN 296

Query: 125 HVIRKI--TNLGVTTIAGGGSKKEGRADG----------PAQNASFSNDFELTFVPHICA 172
           H IRKI  T   V+TIAG GS+      G          P   A + ++++ + VP    
Sbjct: 297 HAIRKINLTENNVSTIAGTGSQGYDHKGGKQGINQVLSSPWDVAIYHHEYKDSIVP---V 353

Query: 173 LLISDHGNQLIRQINLK 189
           LLI+  G   I  + L+
Sbjct: 354 LLIAIAGTHQIWALFLE 370


>gi|358461043|ref|ZP_09171215.1| serine/threonine protein kinase [Frankia sp. CN3]
 gi|357074242|gb|EHI83734.1| serine/threonine protein kinase [Frankia sp. CN3]
          Length = 866

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 39/114 (34%), Positives = 60/114 (52%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++++ VV  +AG+   G+S DG P +A +   P   A D  GN+Y+ D  N+ +RKI   
Sbjct: 574 IAKDGVVTPIAGNAQDGFSGDGGPATAAQLYGPGRVAWDKAGNLYIPDTQNYRVRKIDPS 633

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G +TT+ G G+       GPA  A   N  E   V     L ++D+ NQ IR++
Sbjct: 634 GKITTVVGIGTAGYSGDGGPATQAQI-NGVEGIAVTADGTLYLADYDNQRIRKV 686



 Score = 50.8 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 35/115 (30%), Positives = 60/115 (52%), Gaps = 7/115 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++ + ++  +AG G +GYS G P +A   + D P S ++   G IY A+  +  ++KI  
Sbjct: 686 VTPDGIITTIAGTGEKGYS-GTPTTATQAKLDGPNSISLADDGTIYFANLGSDTVQKIDK 744

Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G +TT AG G        GPA +A+ S      F+ H   + I  +G++ IR++
Sbjct: 745 AGMLTTFAGNGKTGRTGDGGPATSATLS--IPDVFLGHDGTVYICAYGSETIRKV 797



 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 29/67 (43%), Positives = 41/67 (61%), Gaps = 3/67 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-N 132
           ++ + ++  +AG G +GY+ DG P  +A+   P S  VD  G IYVAD  N VIR+I  N
Sbjct: 797 VTSDGIITTIAGTGAEGYTGDGGPANAAQLSDPTSVVVDAGGAIYVADNGNKVIRRIDPN 856

Query: 133 LGVTTIA 139
             +TTIA
Sbjct: 857 GTITTIA 863



 Score = 48.1 bits (113), Expect = 0.004,   Method: Composition-based stats.
 Identities = 37/115 (32%), Positives = 62/115 (53%), Gaps = 5/115 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           + +  ++   AG+G  G + DG P  SA    P  F +   G +Y+    +  IRK+T+ 
Sbjct: 742 IDKAGMLTTFAGNGKTGRTGDGGPATSATLSIPDVF-LGHDGTVYICAYGSETIRKVTSD 800

Query: 134 GV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G+ TTIAG G++      GPA  A  S+   +  V    A+ ++D+GN++IR+I+
Sbjct: 801 GIITTIAGTGAEGYTGDGGPANAAQLSDPTSVV-VDAGGAIYVADNGNKVIRRID 854



 Score = 44.7 bits (104), Expect = 0.039,   Method: Composition-based stats.
 Identities = 34/97 (35%), Positives = 50/97 (51%), Gaps = 13/97 (13%)

Query: 82  VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  + G G  GYS DG P + A+ +  +  AV   G +Y+AD  N  IRK+T  G +TTI
Sbjct: 636 ITTVVGIGTAGYSGDGGPATQAQINGVEGIAVTADGTLYLADYDNQRIRKVTPDGIITTI 695

Query: 139 AGGGSK---------KEGRADGPAQNASFSNDFELTF 166
           AG G K          + + DGP  + S ++D  + F
Sbjct: 696 AGTGEKGYSGTPTTATQAKLDGP-NSISLADDGTIYF 731


>gi|294647916|ref|ZP_06725468.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
 gi|294806312|ref|ZP_06765159.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
 gi|345510467|ref|ZP_08790034.1| hypothetical protein BSAG_00676 [Bacteroides sp. D1]
 gi|292636824|gb|EFF55290.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
 gi|294446568|gb|EFG15188.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
 gi|295085612|emb|CBK67135.1| IPT/TIG domain./NHL repeat. [Bacteroides xylanisolvens XB1A]
 gi|345454427|gb|EEO48966.2| hypothetical protein BSAG_00676 [Bacteroides sp. D1]
          Length = 476

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 42/124 (33%), Positives = 58/124 (46%), Gaps = 19/124 (15%)

Query: 85  LAGD-GVQGYSDGEPGSARFDKP--------KSFAVDMKGNIY---VADKSNHVIRKITN 132
            AG+ G  GY D    SARFD P        + +  + + ++Y   + D +NH IRKIT 
Sbjct: 353 FAGEFGSYGYKDASQNSARFDNPMQGCFVLNEEYVAEQRLDVYDFYLTDAANHCIRKITP 412

Query: 133 LG-VTTIAGGGSKK-----EGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
            G VTT AG GS        G  DG P + A F+    L +        + D+GN  +R 
Sbjct: 413 DGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLCYEESTGTFYVGDNGNHRVRT 472

Query: 186 INLK 189
           I L+
Sbjct: 473 IALQ 476


>gi|290999745|ref|XP_002682440.1| predicted protein [Naegleria gruberi]
 gi|284096067|gb|EFC49696.1| predicted protein [Naegleria gruberi]
          Length = 731

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 64/118 (54%), Gaps = 5/118 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S   ++  +AG GV GYS DG P  + + + P    +D   NI +AD++NH IR I+N 
Sbjct: 228 VSNSGIISTVAGTGVSGYSGDGIPANTTKLNTPNGITIDSNDNIIIADRNNHRIRLISNS 287

Query: 134 G--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
              ++T+AG G+      +  A +A  S   ++T + +   L+I+D  N +IR + L 
Sbjct: 288 SGIISTLAGNGTTGSRDEEVLATSAKLSRPADVT-IGYDGELIITDTDNFVIRIVKLN 344



 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 43/149 (28%), Positives = 77/149 (51%), Gaps = 12/149 (8%)

Query: 76  LSEESVVKRLAGDGVQGYSDGE---PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-- 130
           +S   ++  +AG G  GY +G+      A+   P   +VD KGN+Y+ADK NH IRKI  
Sbjct: 508 VSNNGIISTIAGTGSAGY-NGDVIMATEAKLYLPHGVSVDNKGNVYIADKQNHRIRKILA 566

Query: 131 -TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            T + ++TIAG G       +  A  +  ++ +++T       + I+D  N  IR+I   
Sbjct: 567 STGM-ISTIAGTGQAGFNDDNMSALESRVNSPYDVTVDESGQVIYIADTNNHKIRRI--- 622

Query: 190 PEDCSKSSQSGSALGAVSVWVLVSVLSCL 218
            ++ + ++ +G+ +G  +   ++S  S L
Sbjct: 623 -QNGNLTTIAGNGIGGYNQDGILSTQSQL 650



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 62/114 (54%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
           +S++  VK +AG+G+ GY+ DG     A+ + P   A D  GNIY++D  NH +R + TN
Sbjct: 396 ISKDGSVKTIAGNGIGGYNGDGMLAIDAQLNYPHGVASDSIGNIYISDSYNHRVRIVFTN 455

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             ++TIAG G+    +    A ++  +  F +    +   L ISD  N  +R++
Sbjct: 456 GTISTIAGNGNSGFNKDGIQATSSQLNYPFGIALNGN-DELFISDRSNHRVRKV 508



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 57/115 (49%), Gaps = 9/115 (7%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGN-IYVADKSNHVIRKITN 132
           L+   ++  +AG G  G++D    +  +R + P    VD  G  IY+AD +NH IR+I N
Sbjct: 565 LASTGMISTIAGTGQAGFNDDNMSALESRVNSPYDVTVDESGQVIYIADTNNHKIRRIQN 624

Query: 133 LGVTTIAG---GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
             +TTIAG   GG  ++G     +Q       ++++  P    + I D  N  IR
Sbjct: 625 GNLTTIAGNGIGGYNQDGILSTQSQ---LYYPYDVSIDPVSGKIFIGDASNFRIR 676



 Score = 43.5 bits (101), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 64/115 (55%), Gaps = 16/115 (13%)

Query: 81  VVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
            +  +AG+G  G++ DG +  S++ + P   A++    ++++D+SNH +RK++N G ++T
Sbjct: 457 TISTIAGNGNSGFNKDGIQATSSQLNYPFGIALNGNDELFISDRSNHRVRKVSNNGIIST 516

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQI 186
           IAG GS       G   +   + + +L ++PH  +      + I+D  N  IR+I
Sbjct: 517 IAGTGSA------GYNGDVIMATEAKL-YLPHGVSVDNKGNVYIADKQNHRIRKI 564


>gi|289667480|ref|ZP_06488555.1| putative NHL repeat protein [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 310

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 44/84 (52%), Gaps = 3/84 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           L+ +  V  LAGDGV G  +G     RF+ P   AVD +G +YVAD  N  I  I   G 
Sbjct: 95  LTPQGKVTTLAGDGVAGDRNGAAAQVRFNGPVGVAVDAQGRVYVADTYNDRIGVIETDGQ 154

Query: 135 VTTIAGGGSKKEGRADGPAQNASF 158
           V T+AGGG    G ADG    A F
Sbjct: 155 VRTLAGGG--LPGMADGIGTQAWF 176



 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 42/68 (61%), Gaps = 2/68 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           L  +  V  LAG G +G++DG   +A F+ P    +D  GN+Y+AD  NH IRK+T  G 
Sbjct: 42  LGADGHVHTLAG-GREGFADGIGVAAAFNTPSGIMLDTAGNLYIADTGNHAIRKLTPQGK 100

Query: 135 VTTIAGGG 142
           VTT+AG G
Sbjct: 101 VTTLAGDG 108



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 10/101 (9%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTTIAGGGSKK 145
           ++G +DG    A+FD P   A D  G +Y+AD  +    +I  LG    V T+AGG   +
Sbjct: 1   MRGVADGRAADAQFDDPYGLATDAHGTLYIADGGDD--NRIHGLGADGHVHTLAGG---R 55

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           EG ADG    A+F N      +     L I+D GN  IR++
Sbjct: 56  EGFADGIGVAAAF-NTPSGIMLDTAGNLYIADTGNHAIRKL 95


>gi|441163235|ref|ZP_20968262.1| leucine-rich repeat-containing protein [Streptomyces rimosus subsp.
           rimosus ATCC 10970]
 gi|440616374|gb|ELQ79516.1| leucine-rich repeat-containing protein [Streptomyces rimosus subsp.
           rimosus ATCC 10970]
          Length = 553

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 56/114 (49%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   ++  +AG+G  GY SDG P    R   P   A+D  G++Y+ D  NH IRK+T+ 
Sbjct: 24  VTPNGLITTVAGNGTAGYVSDGGPALGTRLHYPWGLALDEAGSLYIGDGHNHRIRKVTSD 83

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G +TT+AG G+       GPA        + +        L I+D  N  +R +
Sbjct: 84  GIITTVAGNGTAGYVDDGGPAAGTRLYYPYGIAL-DRGGNLYIADCNNHRVRGV 136



 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 35/58 (60%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ + ++  +AG+G  GY  DG P +  R   P   A+D  GN+Y+AD +NH +R +T
Sbjct: 80  VTSDGIITTVAGNGTAGYVDDGGPAAGTRLYYPYGIALDRGGNLYIADCNNHRVRGVT 137



 Score = 45.4 bits (106), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 2/79 (2%)

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
             VD +G++Y+AD+ NH +RK+T  G +TT+AG G+       GPA        + L   
Sbjct: 3   LVVDAEGSLYIADRYNHRVRKVTPNGLITTVAGNGTAGYVSDGGPALGTRLHYPWGLAL- 61

Query: 168 PHICALLISDHGNQLIRQI 186
               +L I D  N  IR++
Sbjct: 62  DEAGSLYIGDGHNHRIRKV 80


>gi|290979906|ref|XP_002672674.1| predicted protein [Naegleria gruberi]
 gi|284086252|gb|EFC39930.1| predicted protein [Naegleria gruberi]
          Length = 747

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 70/132 (53%), Gaps = 8/132 (6%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           DL + DS   A   +     E  ++  +AG+G+ G+ DG    A+F+     +V   G++
Sbjct: 512 DLYICDSINHAIRKID---HETGIISTIAGNGIAGFKDGNASIAQFNSNLGLSVLPNGDL 568

Query: 118 YVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALL 174
            +AD +N+ IRK   ++  V TIAGG +   G  DG  A +A  ++  ++ +      + 
Sbjct: 569 LIADYNNNRIRKYLASSKQVLTIAGGLAGYSG--DGQLATSAQLNHPTDVAYNASTGDVF 626

Query: 175 ISDHGNQLIRQI 186
           I+D GN++IR+I
Sbjct: 627 IADFGNKVIRKI 638



 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 47/123 (38%), Positives = 68/123 (55%), Gaps = 11/123 (8%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKITN 132
           L+    V  +AG G+ GYS DG+   SA+ + P   A +   G++++AD  N VIRKI+N
Sbjct: 582 LASSKQVLTIAG-GLAGYSGDGQLATSAQLNHPTDVAYNASTGDVFIADFGNKVIRKISN 640

Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSN--DFELTFVPHICALLISDHGNQLIRQINL 188
               ++TIAG G  +    +G A  AS SN   + L+  P    L ISD  N LIR+IN+
Sbjct: 641 STGIISTIAGSGLSEY---NGDAMPASISNISPYGLSIHPITGELFISDSVNYLIRKINV 697

Query: 189 KPE 191
             +
Sbjct: 698 NNQ 700



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 77/156 (49%), Gaps = 24/156 (15%)

Query: 40  GHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV------VKRLAGDGVQG 92
           G++  +N P+SV     S D+ +     + F  L     ++ V      V  LAGD    
Sbjct: 439 GNETALNYPYSVTQSEISGDVFI----GTTFKILKVSKKDKRVSTVAGSVSTLAGDNFNS 494

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRA 149
            +      ++ ++P   A  + G++Y+ D  NH IRKI   T + ++TIAG G    G  
Sbjct: 495 IA------SQLNEPAGLAFAINGDLYICDSINHAIRKIDHETGI-ISTIAGNG--IAGFK 545

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           DG A  A F+++  L+ +P+   LLI+D+ N  IR+
Sbjct: 546 DGNASIAQFNSNLGLSVLPN-GDLLIADYNNNRIRK 580


>gi|255530479|ref|YP_003090851.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
 gi|255343463|gb|ACU02789.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
          Length = 453

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 48/104 (46%), Gaps = 1/104 (0%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           KRL   G   + DG    A F+ PK    D  GN++VAD  NH IR ++   + T   G 
Sbjct: 348 KRLNAPGGSAFRDGPLADALFNDPKEIKFDNSGNMFVADYGNHCIRMVSADNIVTTVAGQ 407

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             K G  DG    + F+  + +  V     + I+D  N  IR++
Sbjct: 408 PGKSGYKDGGPVESLFNQPWGVA-VNEQGDIYIADWSNARIRKL 450


>gi|449137215|ref|ZP_21772546.1| NHL repeat containing protein [Rhodopirellula europaea 6C]
 gi|448884292|gb|EMB14794.1| NHL repeat containing protein [Rhodopirellula europaea 6C]
          Length = 351

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 53/102 (51%), Gaps = 5/102 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGVTTIAGGGSKKE 146
           GY DG   SARF+K  +  +D    +Y++D +NH +R++         V T AG G +  
Sbjct: 106 GYVDGPARSARFNKLHNLMIDSNDVLYLSDHANHSVRRLMESEDGKWVVDTYAGNGKRGP 165

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
              +   ++A+F     +T       LLI+D GNQ+IR ++L
Sbjct: 166 AVDNVDRRDATFHEPISVTLDVKDNRLLIADIGNQVIRSLDL 207



 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 47/95 (49%), Gaps = 4/95 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           + ++I G + ++     +D  G   L+VL+ + +    +         +  LAG G +G 
Sbjct: 211 MVSIIAGQKSKLKDPRAVDLDGERRLLVLERNGNRLRRVE----SNGDITTLAGSGKKGT 266

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +DG+   A F+ PK   V   G +Y+AD  NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301



 Score = 40.8 bits (94), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 49/86 (56%), Gaps = 4/86 (4%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
            ++   P++  +D +  + V +++ + +R++ +N  +TT+AG G  K+G ADG A+ ASF
Sbjct: 219 KSKLKDPRAVDLDGERRLLVLERNGNRLRRVESNGDITTLAGSG--KKGTADGDAKQASF 276

Query: 159 SNDFELTFVPHICALLISDHGNQLIR 184
           +    +   P    + I+D  N L+R
Sbjct: 277 NGPKHMDVAPD-GRVYIADDVNHLVR 301


>gi|111219555|ref|YP_710349.1| serine/threonine-protein kinase [Frankia alni ACN14a]
 gi|111147087|emb|CAJ58734.1| putative serine/threonine-protein kinase (partial match) [Frankia
           alni ACN14a]
          Length = 899

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 34/89 (38%), Positives = 51/89 (57%), Gaps = 2/89 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
           +A+ D P S A D  GNIY+ D  N+ IRK+T  GV +T+AG G+       GPA  A  
Sbjct: 632 AAQLDGPGSTARDKAGNIYIGDAKNNRIRKVTPAGVISTVAGTGTAGYSGDGGPATAAQL 691

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
           ++  ++T  P   ++  SD+ N  IR+I+
Sbjct: 692 NSAEKVTTGPD-GSVYFSDYDNHRIRKID 719



 Score = 51.2 bits (121), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 48/141 (34%), Positives = 77/141 (54%), Gaps = 10/141 (7%)

Query: 55  GSSDLIVLDSSRSAFYTLSF----PLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKS 108
           G +D+ + D     F  L+      ++ + ++  +AG G  GY+ DG P  SA+ +KP S
Sbjct: 749 GPNDITMTDDGTLYFVDLTSETIQKVTPDGIISTIAGTGEAGYTGDGGPARSAKLNKP-S 807

Query: 109 FAVDMKG-NIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
            A+   G   Y+AD +N+ +RKI   G +TTIAG G++  G   GPA  A F N   +  
Sbjct: 808 LAIGPDGETFYIADYNNNRVRKIDPNGIITTIAGTGTEGSGGDGGPATAAQFKNPSSVV- 866

Query: 167 VPHICALLISDHGNQLIRQIN 187
           V    A+ ++D+GN  +R+I+
Sbjct: 867 VDGSGAVYVADNGNDRVRRID 887



 Score = 43.9 bits (102), Expect = 0.075,   Method: Composition-based stats.
 Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 13/99 (13%)

Query: 81  VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           V+  +AG G  GYS DG P +A + +  +       G++Y +D  NH IRKI   G +TT
Sbjct: 667 VISTVAGTGTAGYSGDGGPATAAQLNSAEKVTTGPDGSVYFSDYDNHRIRKIDPAGIITT 726

Query: 138 IAGGG---------SKKEGRADGPAQNASFSNDFELTFV 167
             G G         +  + R DGP  + + ++D  L FV
Sbjct: 727 YVGTGVAGYTGAGGAATQARIDGP-NDITMTDDGTLYFV 764



 Score = 42.7 bits (99), Expect = 0.18,   Method: Composition-based stats.
 Identities = 28/88 (31%), Positives = 43/88 (48%), Gaps = 2/88 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFS 159
           AR D P    +   G +Y  D ++  I+K+T  G+ +TIAG G        GPA++A   
Sbjct: 745 ARIDGPNDITMTDDGTLYFVDLTSETIQKVTPDGIISTIAGTGEAGYTGDGGPARSAKL- 803

Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
           N   L   P      I+D+ N  +R+I+
Sbjct: 804 NKPSLAIGPDGETFYIADYNNNRVRKID 831


>gi|290996598|ref|XP_002680869.1| predicted protein [Naegleria gruberi]
 gi|284094491|gb|EFC48125.1| predicted protein [Naegleria gruberi]
          Length = 1407

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 53/201 (26%), Positives = 92/201 (45%), Gaps = 14/201 (6%)

Query: 8   SSLAITLLSLLFSLVSSGLLLEDGY-TVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
           S L I  +S+ +++ +       GY ++++ ID     I+ +S      S DL + D+S 
Sbjct: 21  SGLVIGAISVKYNITTIA-----GYNSLSSGIDAKMANIDVNSCFKDLSSGDLFISDNSN 75

Query: 67  SAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
                +    +    V  +AG GV GY+  + +  +A+ + P    V     IY+AD  N
Sbjct: 76  HRIVRV---FAANGTVLTIAGTGVSGYNGDNIQATTAQLNTPTGVFV-FNSEIYIADSQN 131

Query: 125 HVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
             +RKI TN  + TIAG G+      DG     ++ N     FV     + IS++ N  I
Sbjct: 132 SRVRKIQTNGNIVTIAGNGNAGYN-GDGMLATNAYLNSPVDVFVSSNGNVYISEYQNHYI 190

Query: 184 RQINLKPEDCSKSSQSGSALG 204
           R +N+     +  + +G+ +G
Sbjct: 191 RMVNVSTGVITTVAGNGTQIG 211



 Score = 41.2 bits (95), Expect = 0.49,   Method: Composition-based stats.
 Identities = 39/110 (35%), Positives = 55/110 (50%), Gaps = 10/110 (9%)

Query: 85  LAGDGVQGYSDG--EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
           +AG GV GY+    +  +A+ + P S   + K  +Y++D  N  IRKI TN  + TIAG 
Sbjct: 441 IAGTGVAGYNSDYMDASTAQLNYPSS-VFEFKNEVYISDSVNRRIRKIFTNGTIVTIAGT 499

Query: 142 GSK--KEGRADGPAQNASFSNDFELT--FVPHICALLISDHGNQLIRQIN 187
           GS+    G       NA  +  +  T  FV     + I D  N LIR+IN
Sbjct: 500 GSQPPSSGYLGDDGVNALSARLYFPTGIFVTSANEVFIVD--NFLIRKIN 547



 Score = 39.3 bits (90), Expect = 1.6,   Method: Composition-based stats.
 Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 13/77 (16%)

Query: 81  VVKRLAGDGVQ----------GYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           V+  +AG+G Q          GY+ DG P + AR   P+   V     IY+AD  N  IR
Sbjct: 199 VITTVAGNGTQIGTSGTGLGFGYNGDGIPATYARLTNPQGIFVTSNNEIYIADAGNFRIR 258

Query: 129 KI-TNLGVTTIAGGGSK 144
           K+ TN  + T+AG G +
Sbjct: 259 KVLTNGTIITVAGTGEE 275



 Score = 39.3 bits (90), Expect = 2.0,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 39/62 (62%), Gaps = 3/62 (4%)

Query: 85  LAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
           +AG G +GY+ DG    +A+ D P   +VD  G I++A+  ++ +RK+ TN  + TIAG 
Sbjct: 269 VAGTGEEGYNGDGMLATAAKLDYPYGVSVDSNGEIWIAELGSNRLRKVLTNGTIVTIAGT 328

Query: 142 GS 143
           G+
Sbjct: 329 GT 330


>gi|290972323|ref|XP_002668903.1| predicted protein [Naegleria gruberi]
 gi|284082440|gb|EFC36159.1| predicted protein [Naegleria gruberi]
          Length = 296

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 53/94 (56%), Gaps = 4/94 (4%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG--GGSKKEGRADGPAQN 155
           +A  + P   AV   G++Y+ADKSNHVIRK++ L   +TTIAG  G ++    ++  A N
Sbjct: 108 NAELNFPSGVAVHSNGDVYIADKSNHVIRKVSALNGKITTIAGIAGETELNKYSNSLATN 167

Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            + ++   L        ++ISD  N +IR++ L 
Sbjct: 168 TTLNSPQYLAVNSSTAEVIISDTNNNVIRKVYLN 201


>gi|385680260|ref|ZP_10054188.1| NHL repeat-containing protein [Amycolatopsis sp. ATCC 39116]
          Length = 597

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 53/111 (47%), Gaps = 8/111 (7%)

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNA 156
           +P   RF  P    V  +G I VAD  NH I +  + G T I   GS   GRADG    A
Sbjct: 163 QPADLRF--PSKAVVTAEGRILVADTGNHSIAEFASDGETLIRRFGSGARGRADGAFDVA 220

Query: 157 SFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSGS 201
           SF+    L  +P   A      LL++D  N L+R ++L   + +  + +G+
Sbjct: 221 SFAEPSGLALLPAEVADQAGYHLLVADTANHLLRGVDLVTGEVTTVAGTGN 271



 Score = 43.9 bits (102), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 63/121 (52%), Gaps = 11/121 (9%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV------DMKG-NIYVADKSNH 125
            F    E++++R  G G +G +DG    A F +P   A+      D  G ++ VAD +NH
Sbjct: 193 EFASDGETLIRRF-GSGARGRADGAFDVASFAEPSGLALLPAEVADQAGYHLLVADTANH 251

Query: 126 VIRKITNLG--VTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
           ++R +  +   VTT+AG G++ ++G  DG A +   ++ +++ +       +++  GN  
Sbjct: 252 LLRGVDLVTGEVTTVAGTGNQWRDGETDGDALSIDLTSPWDVIWWEPAGGAVVAMAGNHT 311

Query: 183 I 183
           +
Sbjct: 312 L 312


>gi|418677567|ref|ZP_13238841.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
           str. RM52]
 gi|418687786|ref|ZP_13248945.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
           str. Moskva]
 gi|418742122|ref|ZP_13298495.1| putative lipoprotein [Leptospira kirschneri serovar Valbuzzi str.
           200702274]
 gi|400320757|gb|EJO68617.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
           str. RM52]
 gi|410738110|gb|EKQ82849.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
           str. Moskva]
 gi|410750480|gb|EKR07460.1| putative lipoprotein [Leptospira kirschneri serovar Valbuzzi str.
           200702274]
          Length = 358

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 41/119 (34%), Positives = 65/119 (54%), Gaps = 9/119 (7%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
           + G  +G+  ++ FD P    +D++ N+YV + SNH IRKI NL    V+T +GG S   
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSGMVSTFSGGIS--- 219

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   + +     +LL +D  +  IR+INLK    + S+  G+ +GA
Sbjct: 220 GYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKINLKTS--TVSTLLGNGIGA 276



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++++ VV   AG GV    DG   +A F  P    VD  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGIISTASFKTPFGLEVDTSGNIFVS 95

Query: 121 DKSNHVIRKI 130
           D+  ++IRKI
Sbjct: 96  DQMTNLIRKI 105


>gi|326435919|gb|EGD81489.1| hypothetical protein PTSG_02206 [Salpingoeca sp. ATCC 50818]
          Length = 6977

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 63/144 (43%), Gaps = 9/144 (6%)

Query: 46   NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
            +P  V+  P + D++V D   +    ++     E  V    G G +G+ DG     R   
Sbjct: 912  SPQDVLSLP-NGDIVVCDRDNARVRVIT----PEGKVSTALGVGKRGHQDGVGAGVRLAG 966

Query: 106  PKSFAVDMKGNIYVADKSNHVIRKITNL--GVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
            P+       G+  V D  NH IR++++    V TIAG    + G  DG A  A F     
Sbjct: 967  PRGMTQLASGSFVVTDAENHCIREVSSALAKVETIAG--CTRAGVRDGDAAAAEFRYPTH 1024

Query: 164  LTFVPHICALLISDHGNQLIRQIN 187
               +P    +LI+D GN  IR I+
Sbjct: 1025 ALEIPRQKLILITDTGNHTIRAIS 1048


>gi|326431736|gb|EGD77306.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
          Length = 1384

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 9/114 (7%)

Query: 81   VVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---- 134
            V+  LAG GV G+    G P +AR + P+  A    G I +AD  N  IR + N+G    
Sbjct: 1270 VITALAGTGVAGFRGDGGAPMAARLNTPRGIAAMPAGEIAIADTGNSRIRTL-NVGGQGA 1328

Query: 135  --VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
              + T+AG G++      G A  A+ +    +TF P    ++  D  N+ +RQI
Sbjct: 1329 GVIETVAGTGARGFSGDGGVATEANLNFPTGITFSPSTNNIVFVDRRNRRVRQI 1382


>gi|225420274|ref|ZP_03762577.1| hypothetical protein CLOSTASPAR_06617 [Clostridium asparagiforme
           DSM 15981]
 gi|225041091|gb|EEG51337.1| hypothetical protein CLOSTASPAR_06617 [Clostridium asparagiforme
           DSM 15981]
          Length = 336

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 55/144 (38%), Gaps = 43/144 (29%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR-------------- 128
           + LAG G +GY D   G A F  P   A D  GN+Y+AD  N VIR              
Sbjct: 150 RTLAGRGREGYEDNRGGRALFSGPTGLAADDAGNLYIADTGNDVIRRLRPDGMVDTYLRG 209

Query: 129 -------------------------KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
                                    K+ N  +   AGG    EG+ADG    A FS+   
Sbjct: 210 LSGPVGLCWHEGALYVSDTGNHRVLKVENGEIVWTAGG---AEGQADGGFGQARFSSPTY 266

Query: 164 LTFVPHICALLISDHGNQLIRQIN 187
           L  V     L +SD GN  +R+I 
Sbjct: 267 LA-VSEQGTLYVSDTGNAAVRKIE 289



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 29/50 (58%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           G +G +DG  G ARF  P   AV  +G +YV+D  N  +RKI N  V T+
Sbjct: 247 GAEGQADGGFGQARFSSPTYLAVSEQGTLYVSDTGNAAVRKIENGTVATV 296



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 53/109 (48%), Gaps = 6/109 (5%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           K L G+ + GY DG     R ++P       +G   V+D+ NH++R I   G  T+AG G
Sbjct: 98  KGLYGEPLGGYGDGTRSEMRMEEPWDIVPYGEGYA-VSDRENHMVRYIDAQGSRTLAGRG 156

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
             +EG  D     A FS    L        L I+D GN +IR+  L+P+
Sbjct: 157 --REGYEDNRGGRALFSGPTGLA-ADDAGNLYIADTGNDVIRR--LRPD 200


>gi|386849777|ref|YP_006267790.1| Teneurin-4 [Actinoplanes sp. SE50/110]
 gi|359837281|gb|AEV85722.1| Teneurin-4 [Actinoplanes sp. SE50/110]
          Length = 631

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/66 (43%), Positives = 42/66 (63%), Gaps = 2/66 (3%)

Query: 80  SVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
           + V  +AG G  G S DG   S A+ + P+   VD  G+IYVAD  NH++RKI+N  ++T
Sbjct: 172 AAVSVIAGTGTAGTSPDGTAASLAKLNAPRDVKVDASGDIYVADTGNHLVRKISNGTIST 231

Query: 138 IAGGGS 143
           +AG G+
Sbjct: 232 VAGTGA 237



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 42/81 (51%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P   AVD  G +Y+AD  N+V+R++    +TT+AG G      ADG    +   +D E  
Sbjct: 36  PGGVAVDPAGTMYIADSGNNVVRQVAAGIITTVAGTGVAGGTPADGGLATSGNLSDPEDV 95

Query: 166 FVPHICALLISDHGNQLIRQI 186
            V     L I+D G+  IR++
Sbjct: 96  TVDSGGVLYIADTGHHRIRRV 116



 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 6/113 (5%)

Query: 78  EESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
              ++  +AG GV G +  + G   S     P+   VD  G +Y+AD  +H IR++    
Sbjct: 61  AAGIITTVAGTGVAGGTPADGGLATSGNLSDPEDVTVDSGGVLYIADTGHHRIRRVAGGV 120

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +TT+AG G        G A     S+ + +     +  + ++D GN  I  IN
Sbjct: 121 ITTVAGTGVAGS-LMTGVAATTLLSSPYGIDTANGM--VYVADTGNNRILMIN 170



 Score = 37.0 bits (84), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 50/109 (45%), Gaps = 5/109 (4%)

Query: 81  VVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLGVTTI 138
           V+  +AG GV G    G   +     P  + +D   G +YVAD  N+ I  I    V+ I
Sbjct: 120 VITTVAGTGVAGSLMTGVAATTLLSSP--YGIDTANGMVYVADTGNNRILMINGAAVSVI 177

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           AG G+      DG A + +  N      V     + ++D GN L+R+I+
Sbjct: 178 AGTGTAGT-SPDGTAASLAKLNAPRDVKVDASGDIYVADTGNHLVRKIS 225


>gi|312198347|ref|YP_004018408.1| serine/threonine protein kinase [Frankia sp. EuI1c]
 gi|311229683|gb|ADP82538.1| serine/threonine protein kinase [Frankia sp. EuI1c]
          Length = 814

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 37/118 (31%), Positives = 65/118 (55%), Gaps = 6/118 (5%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S +  +  +AG+G  G+S DG P +A   +KP + A+   G +Y+ D  N  +R+++  
Sbjct: 530 ISPDGTITTVAGNGQPGFSGDGGPATAAMLNKPVAVAIGPGGTLYIVDTFNMRVRQVSPD 589

Query: 134 G-VTTIAGGGSKKEGRAD--GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           G + TIAG G +    AD  GPA NA+      +  +     L I+D+GN ++R++ +
Sbjct: 590 GIIQTIAGSGERPWNPADDGGPATNAALWYPSGIA-IDSAGDLFIADNGNDIVRRVGV 646



 Score = 51.2 bits (121), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 39/112 (34%), Positives = 59/112 (52%), Gaps = 4/112 (3%)

Query: 79  ESVVKRLAGD-GVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT 136
           + ++  +AG  G   + DG+P + A   KP + A+D +G IY+AD  NH IR+I   GV 
Sbjct: 647 DGIITTVAGRFGYGSWGDGKPATQAMISKPFNVALDRQGRIYIADSYNHKIRRIGLDGVI 706

Query: 137 -TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            TIAG G        G A  A+  +   +T V     + I+D GN  +R+I+
Sbjct: 707 ETIAGTGVAGYSGDGGKATAATLRDPRGVT-VDAAGNVYITDSGNNRVRRID 757



 Score = 50.4 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 12/86 (13%)

Query: 79  ESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
           + V++ +AG GV GYS   G+  +A    P+   VD  GN+Y+ D  N+ +R+I   G +
Sbjct: 703 DGVIETIAGTGVAGYSGDGGKATAATLRDPRGVTVDAAGNVYITDSGNNRVRRIDTAGII 762

Query: 136 TTIAGG---GSKKEG------RADGP 152
           TT+AG    GSK         R DGP
Sbjct: 763 TTVAGTAPPGSKSTDTSATALRPDGP 788



 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 41/118 (34%), Positives = 59/118 (50%), Gaps = 10/118 (8%)

Query: 73  SFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           SFP      V  +AG G  G++ DG P + A+ D P   A+D  GN+Y AD  N+ +R+I
Sbjct: 477 SFP------VVAIAGTGQAGFAGDGGPAAQAQLDHPYGPAMDGFGNLYFADFDNNRVRRI 530

Query: 131 TNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +  G +TT+AG G        GPA  A  +    +   P    L I D  N  +RQ++
Sbjct: 531 SPDGTITTVAGNGQPGFSGDGGPATAAMLNKPVAVAIGPG-GTLYIVDTFNMRVRQVS 587



 Score = 45.4 bits (106), Expect = 0.023,   Method: Composition-based stats.
 Identities = 33/118 (27%), Positives = 58/118 (49%), Gaps = 6/118 (5%)

Query: 76  LSEESVVKRLAGDGVQGYS---DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S + +++ +AG G + ++   DG P + A    P   A+D  G++++AD  N ++R++ 
Sbjct: 586 VSPDGIIQTIAGSGERPWNPADDGGPATNAALWYPSGIAIDSAGDLFIADNGNDIVRRVG 645

Query: 132 NLGVTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
             G+ T   G        DG PA  A  S  F +  +     + I+D  N  IR+I L
Sbjct: 646 VDGIITTVAGRFGYGSWGDGKPATQAMISKPFNVA-LDRQGRIYIADSYNHKIRRIGL 702


>gi|337745906|ref|YP_004640068.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus KNP414]
 gi|336297095|gb|AEI40198.1| copper amine oxidase domain protein [Paenibacillus mucilaginosus
           KNP414]
          Length = 537

 Score = 53.9 bits (128), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 47/159 (29%), Positives = 72/159 (45%), Gaps = 18/159 (11%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G+S+G    AR + P   AV  +G + +AD  NH +R +    ++T+AG G +K G  DG
Sbjct: 302 GFSNGAALQARLNYPMGIAVTEEGGLLIADSQNHAVRYLFGGQLSTLAGAGEQKMGLLDG 361

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK-PEDCSKSSQSGSALGAVSVWV 210
               A  +   ++  +     L+     N+L R    + PED      +G AL  V    
Sbjct: 362 MEGKAGLNRPADVAVLGDGSVLVADSFNNRLRRLTGYRLPEDLP----AGDALKVV---- 413

Query: 211 LVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEAL 249
                  L + V+ F A+P I    GRL  P++   EAL
Sbjct: 414 -------LDAGVMKFDAQPEITE--GRLMVPVRAVTEAL 443



 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 55/189 (29%), Positives = 79/189 (41%), Gaps = 21/189 (11%)

Query: 29  EDGYTVTTVIDGH---QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           + GY V  ++DG     L   P  +   P  S L V D+   A   +      +  V  +
Sbjct: 109 DKGYPVGGLLDGAANASLFQEPLGLSAGPDGS-LYVADAGNHAIRRID----AKGNVTTV 163

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
           AG G  G  DG+  +A F +P   A    G +YVAD   H IR+I+  G  T     S++
Sbjct: 164 AGSGRLGAKDGKGAAAEFYRPGDVAAAPDGTLYVADTLGHTIRRISPQGEVTTLTAPSRR 223

Query: 146 -----EGR-------ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
                 G+       ADGP   A F+    +        L +SD GNQ IR I+L     
Sbjct: 224 VVEATPGQVAAAGDFADGPLAQAKFNEPTGIALDAK-GNLYVSDSGNQRIRYIDLAKGTV 282

Query: 194 SKSSQSGSA 202
           +  +  G+A
Sbjct: 283 TTVAGGGTA 291



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 52/111 (46%), Gaps = 8/111 (7%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
           AG G  G ++G    A F  P   AV   G   V+D  N VIRK+T   V  +AG   +K
Sbjct: 48  AGTGRLGSANGTGLEASFRVPAGLAVLPDGTAAVSDSRNGVIRKLTGGRVDVLAGVFYRK 107

Query: 146 E-------GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           +       G  DG A  + F     L+  P   +L ++D GN  IR+I+ K
Sbjct: 108 DDKGYPVGGLLDGAANASLFQEPLGLSAGPD-GSLYVADAGNHAIRRIDAK 157


>gi|379719861|ref|YP_005311992.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus 3016]
 gi|378568533|gb|AFC28843.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus 3016]
          Length = 535

 Score = 53.9 bits (128), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 55/189 (29%), Positives = 79/189 (41%), Gaps = 21/189 (11%)

Query: 29  EDGYTVTTVIDGH---QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           + GY V  ++DG     L   P  +   P  S L V D+   A   +      +  V  +
Sbjct: 109 DKGYPVGGLLDGAANASLFQEPLGLSAGPDGS-LYVADAGNHAIRRID----AKGNVTTV 163

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
           AG G  G  DG+  +A F +P   A    G +YVAD   H IR+I+  G  T     S++
Sbjct: 164 AGSGRLGAKDGKGAAAEFYRPGDVAAAPDGTLYVADTLGHTIRRISPQGEVTTLTAPSRR 223

Query: 146 -----EGR-------ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
                 G+       ADGP   A F+    +        L +SD GNQ IR I+L     
Sbjct: 224 VVEATPGQVAAAGDFADGPLAQAKFNEPTGIALDAK-GNLYVSDSGNQRIRYIDLAKGTV 282

Query: 194 SKSSQSGSA 202
           +  +  G+A
Sbjct: 283 TTVAGGGTA 291



 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 68/159 (42%), Gaps = 20/159 (12%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G+S+G    AR + P   AV  +G + +AD  NH +R +    ++T+AG G +K G  DG
Sbjct: 302 GFSNGAALQARLNYPMGIAVTEEGGLLIADSQNHDVRYLFGGQLSTLAGAGEQKMGLLDG 361

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK-PEDCSKSSQSGSALGAVSVWV 210
               A  +   ++  +     L+     N+L R    + PED                  
Sbjct: 362 MEGKAGLNRPADVAVLGDGSVLVADSFNNRLRRLTGYRLPEDLP---------------- 405

Query: 211 LVSVLSCLVSLVIGFVARPYIIRHTGRLDPPLQHDMEAL 249
              V   L + V+ F A+P I    GRL  P++   EAL
Sbjct: 406 -AGVKVVLDAGVMKFDAQPEITE--GRLMVPVRAITEAL 441



 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 52/111 (46%), Gaps = 8/111 (7%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
           AG G  G ++G    A F  P   AV   G   V+D  N VIRK+T   V  +AG   +K
Sbjct: 48  AGTGRLGSANGTGLEASFRVPAGLAVLPDGTAAVSDSRNGVIRKLTGGRVDVLAGVFYRK 107

Query: 146 E-------GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           +       G  DG A  + F     L+  P   +L ++D GN  IR+I+ K
Sbjct: 108 DDKGYPVGGLLDGAANASLFQEPLGLSAGPD-GSLYVADAGNHAIRRIDAK 157


>gi|455790313|gb|EMF42184.1| putative lipoprotein [Leptospira interrogans serovar Lora str. TE
           1992]
          Length = 294

 Score = 53.9 bits (128), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKI 130
           D+ N++IRKI
Sbjct: 96  DQINNLIRKI 105



 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
           + G  +G+  ++ FD P    +D + N+YV + SNH IRKI  N G V+T++GG S   G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---G 220

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
             DG   +A F +   + +   + +LL +D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 221 YLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI   N  V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269

Query: 139 AGGGSKKEGRADGPAQNASF 158
            G G   +   DG   NASF
Sbjct: 270 LGNGIGAD--VDGNGTNASF 287


>gi|399157177|ref|ZP_10757244.1| NHL repeat containing protein, partial [SAR324 cluster bacterium
           SCGC AAA001-C10]
          Length = 352

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 52/108 (48%), Gaps = 6/108 (5%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTI 138
           + V  LAG    G +D    SA+F+ PK    D   N+YVAD SNH IRKI  + GV T 
Sbjct: 238 TAVTTLAGQSDNGSTDATGTSAQFNYPKGITTDGT-NLYVADYSNHRIRKIVISTGVVTT 296

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             G S  +G  D    +ASF   +          L ++D  N  IR+I
Sbjct: 297 LAGSS--QGSTDATGTSASFY--YPSGITTDGTNLYVADRYNHRIRKI 340


>gi|158315873|ref|YP_001508381.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158111278|gb|ABW13475.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 850

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 42/109 (38%), Positives = 56/109 (51%), Gaps = 4/109 (3%)

Query: 82  VKRLAGDGVQGYS-DGEP-GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           V  +AG G   +S DG P GSA  + P     D  GNIYVAD  N+ +R+IT  G +TTI
Sbjct: 517 VVTVAGTGEAAFSGDGGPAGSAALNGPFGMVADWAGNIYVADFDNNRVRRITADGTITTI 576

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           AG G        GPA  A       +  +     +LI+D  NQ IR+++
Sbjct: 577 AGTGEAGFSGDGGPATQARLRQPAAVA-LDSAGNILIADTFNQRIRRVD 624



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 36/98 (36%), Positives = 56/98 (57%), Gaps = 10/98 (10%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARF 103
           P SV +DRPG   L + DS  +    +      + V++ +AG G+ GYS DG P + A  
Sbjct: 711 PISVAMDRPGR--LYIADSGNNRIRRIGL----DGVIETVAGTGLPGYSGDGGPATRATL 764

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
             P+  AVD +G I++ D++N  IR++   G +TT+AG
Sbjct: 765 RSPRGVAVDARGAIFITDRTNRRIRRVDPSGIITTVAG 802



 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 36/114 (31%), Positives = 56/114 (49%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  +AG G  G+S DG P + AR  +P + A+D  GNI +AD  N  IR++   
Sbjct: 567 ITADGTITTIAGTGEAGFSGDGGPATQARLRQPAAVALDSAGNILIADTFNQRIRRVDPS 626

Query: 134 GVTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G  T   G   +    DG PA  A+      +   P    + I+D GN  IR++
Sbjct: 627 GTITTVAGKDDRGFSEDGVPATEATLWYPGGVVADP-TGNIYIADSGNNRIRRV 679



 Score = 48.1 bits (113), Expect = 0.004,   Method: Composition-based stats.
 Identities = 36/104 (34%), Positives = 53/104 (50%), Gaps = 3/104 (2%)

Query: 86  AGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAGGGS 143
            GDG   + DG P + A    P S A+D  G +Y+AD  N+ IR+I   GV  T+AG G 
Sbjct: 690 GGDGEGAFGDGGPAADALLAFPISVAMDRPGRLYIADSGNNRIRRIGLDGVIETVAGTGL 749

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
                  GPA  A+  +   +  V    A+ I+D  N+ IR+++
Sbjct: 750 PGYSGDGGPATRATLRSPRGVA-VDARGAIFITDRTNRRIRRVD 792



 Score = 47.4 bits (111), Expect = 0.007,   Method: Composition-based stats.
 Identities = 37/110 (33%), Positives = 55/110 (50%), Gaps = 4/110 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  +AG   +G+S DG P + A    P     D  GNIY+AD  N+ IR++   G + T+
Sbjct: 629 ITTVAGKDDRGFSEDGVPATEATLWYPGGVVADPTGNIYIADSGNNRIRRVGTDGIIQTV 688

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           AGG  +      GPA +A  +    +  +     L I+D GN  IR+I L
Sbjct: 689 AGGDGEGAFGDGGPAADALLAFPISVA-MDRPGRLYIADSGNNRIRRIGL 737


>gi|418714767|ref|ZP_13275259.1| putative lipoprotein [Leptospira interrogans str. UI 08452]
 gi|410788941|gb|EKR82646.1| putative lipoprotein [Leptospira interrogans str. UI 08452]
          Length = 358

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKI 130
           D+ N++IRKI
Sbjct: 96  DQINNLIRKI 105



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
           + G  +G+  ++ FD P    +D + N+YV + SNH IRKI  N G V+T++GG S   G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---G 220

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
             DG   +A F +   + +   + +LL +D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 221 YLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI   N  V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269

Query: 139 AGGGSKKEGRADGPAQNASF 158
            G G   +   DG   NASF
Sbjct: 270 LGNGIGAD--VDGNGTNASF 287


>gi|345851882|ref|ZP_08804843.1| hypothetical protein SZN_18986 [Streptomyces zinciresistens K42]
 gi|345636654|gb|EGX58200.1| hypothetical protein SZN_18986 [Streptomyces zinciresistens K42]
          Length = 468

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 63/114 (55%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  +AG G  G+  DG P  +A+ + P+  AVD  G +Y+AD +NH IRK+T  
Sbjct: 28  ITTDGRITTVAGTGAAGFRGDGGPAVAAQLNGPREVAVDGAGAVYIADSNNHRIRKVTPD 87

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G ++T+AG G+       GPA  A  +    +T V     L ++D+ N  IR+I
Sbjct: 88  GKISTVAGTGAGGFRGDGGPATAAQLNLPLGVT-VDGAGVLHVADYYNHRIRRI 140



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 46/85 (54%), Gaps = 2/85 (2%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
            ++P   AVD  G +YVAD S H +R+IT  G +TT+AG G+       GPA  A  +  
Sbjct: 1   MNRPYGIAVDSTGTVYVADFSGHRVRRITTDGRITTVAGTGAAGFRGDGGPAVAAQLNGP 60

Query: 162 FELTFVPHICALLISDHGNQLIRQI 186
            E+  V    A+ I+D  N  IR++
Sbjct: 61  REVA-VDGAGAVYIADSNNHRIRKV 84



 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 46/94 (48%), Gaps = 14/94 (14%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
           +A+ + P    VD  G ++VAD  NH IR+IT  GV +T+AG G+       GPA  A  
Sbjct: 110 AAQLNLPLGVTVDGAGVLHVADYYNHRIRRITADGVISTVAGTGAAGFRGDGGPAATAQL 169

Query: 159 SNDFELTFVPHICA------LLISDHGNQLIRQI 186
           +        PH  A      L I+D  N  +R++
Sbjct: 170 NG-------PHGVALNAAGDLCIADLQNHRVRKV 196


>gi|418707100|ref|ZP_13267936.1| putative lipoprotein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|410763306|gb|EKR34037.1| putative lipoprotein [Leptospira interrogans serovar Hebdomadis
           str. R499]
          Length = 358

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKI 130
           D+ N++IRKI
Sbjct: 96  DQINNLIRKI 105



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
           + G  +G+  ++ FD P    +D + N+YV + SNH IRKI  N G V+T++GG S   G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---G 220

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
             DG   +A F +   + +   + +LL +D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 221 YLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI   N  V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269

Query: 139 AGGGSKKEGRADGPAQNASF 158
            G G   +   DG   NASF
Sbjct: 270 LGNGIGAD--VDGNGTNASF 287


>gi|417772389|ref|ZP_12420278.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Pomona]
 gi|418684105|ref|ZP_13245295.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Kennewicki LC82-25]
 gi|400324210|gb|EJO76509.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Kennewicki LC82-25]
 gi|409945760|gb|EKN95775.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Pomona]
 gi|455670390|gb|EMF35375.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Fox 32256]
          Length = 358

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKI 130
           D+ N++IRKI
Sbjct: 96  DQINNLIRKI 105



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 57/100 (57%), Gaps = 5/100 (5%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEGRA 149
           G  +G+  ++ FD P    +D + N+YV + SNH IRKI  N G V+T++GG S   G  
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---GYL 222

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           DG   +A F +   + +   + +LL +D  +  IR+I+LK
Sbjct: 223 DGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK 262



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI   N  V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269

Query: 139 AGGGSKKEGRADGPAQNASF 158
            G G   +   DG   NASF
Sbjct: 270 LGNGIGTD--VDGNGTNASF 287


>gi|24215844|ref|NP_713325.1| hypothetical protein LA_3145 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45656856|ref|YP_000942.1| hypothetical protein LIC10968 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|386074980|ref|YP_005989298.1| hypothetical protein LIF_A2535 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|417765006|ref|ZP_12412972.1| putative lipoprotein [Leptospira interrogans serovar Bulgarica str.
           Mallika]
 gi|418666993|ref|ZP_13228411.1| putative lipoprotein [Leptospira interrogans serovar Pyrogenes str.
           2006006960]
 gi|418699338|ref|ZP_13260303.1| putative lipoprotein [Leptospira interrogans serovar Bataviae str.
           L1111]
 gi|418723579|ref|ZP_13282417.1| putative lipoprotein [Leptospira interrogans str. UI 12621]
 gi|418727930|ref|ZP_13286513.1| putative lipoprotein [Leptospira interrogans str. UI 12758]
 gi|421083673|ref|ZP_15544545.1| putative lipoprotein [Leptospira santarosai str. HAI1594]
 gi|421102269|ref|ZP_15562874.1| putative lipoprotein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|421122244|ref|ZP_15582528.1| putative lipoprotein [Leptospira interrogans str. Brem 329]
 gi|24197042|gb|AAN50343.1| hypothetical protein LA_3145 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45600092|gb|AAS69579.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|353458770|gb|AER03315.1| hypothetical protein LIF_A2535 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|400352606|gb|EJP04785.1| putative lipoprotein [Leptospira interrogans serovar Bulgarica str.
           Mallika]
 gi|409962948|gb|EKO26678.1| putative lipoprotein [Leptospira interrogans str. UI 12621]
 gi|410344796|gb|EKO95953.1| putative lipoprotein [Leptospira interrogans str. Brem 329]
 gi|410367934|gb|EKP23317.1| putative lipoprotein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410433778|gb|EKP78117.1| putative lipoprotein [Leptospira santarosai str. HAI1594]
 gi|410757315|gb|EKR18927.1| putative lipoprotein [Leptospira interrogans serovar Pyrogenes str.
           2006006960]
 gi|410761708|gb|EKR27881.1| putative lipoprotein [Leptospira interrogans serovar Bataviae str.
           L1111]
 gi|410777322|gb|EKR57287.1| putative lipoprotein [Leptospira interrogans str. UI 12758]
 gi|456822102|gb|EMF70597.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
           LT1962]
          Length = 358

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKI 130
           D+ N++IRKI
Sbjct: 96  DQINNLIRKI 105



 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
           + G  +G+  ++ FD P    +D + N+YV + SNH IRKI  N G V+T++GG S   G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---G 220

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
             DG   +A F +   + +   + +LL +D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 221 YLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI   N  V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269

Query: 139 AGGGSKKEGRADGPAQNASF 158
            G G   +   DG   NASF
Sbjct: 270 LGNGIGAD--VDGNGTNASF 287


>gi|290975268|ref|XP_002670365.1| predicted protein [Naegleria gruberi]
 gi|284083923|gb|EFC37621.1| predicted protein [Naegleria gruberi]
          Length = 1034

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 39/105 (37%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVD 112
            ++D++ +D+ R     LS     + +V  LAG    G++ DG P  SA+ + P   ++ 
Sbjct: 15  ANNDILFIDNDR--LRKLS-----QGIVSTLAGTKEAGFNGDGIPAVSAQLNSPLGVSMS 67

Query: 113 MKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNA 156
             G IY+ADK NH IRKI   G +TTIAG G   E +   PA++A
Sbjct: 68  KGGEIYIADKLNHRIRKIDIFGKITTIAGTGQAGESQ---PAESA 109



 Score = 43.9 bits (102), Expect = 0.070,   Method: Composition-based stats.
 Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 5/109 (4%)

Query: 80  SVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
           S++  LAG G  GY  DG   + A+ + P++  +   G + ++D  NH IR I +  + T
Sbjct: 143 SLIVTLAGTGEPGYGGDGSLATQAQLNGPEAVILSGTGAVIISDTYNHAIRTILDNNLIT 202

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           IAG      G +DG  +   +     LT  P+  ALLI+D  N  I+ +
Sbjct: 203 IAGTLGTN-GDSDGSLETKLYY-PMGLTITPYQ-ALLIADSWNDKIKML 248


>gi|418707795|ref|ZP_13268612.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. UI 08368]
 gi|410771897|gb|EKR47094.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. UI 08368]
 gi|456968671|gb|EMG09839.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. LT2186]
          Length = 358

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKI 130
           D+ N++IRKI
Sbjct: 96  DQINNLIRKI 105



 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 62/115 (53%), Gaps = 5/115 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRAD 150
           G  +G+  ++ FD P    +D + N+YV + SNH IRKI  N G  +I  GG    G  D
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSILSGG--ISGYLD 223

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G   +A F +   + +   + +LL +D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 224 GDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276



 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 31/73 (42%), Positives = 39/73 (53%), Gaps = 5/73 (6%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKK 145
           G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI   N  V+T+ G G   
Sbjct: 217 GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTLLGNGIGA 276

Query: 146 EGRADGPAQNASF 158
           +   DG   NASF
Sbjct: 277 D--VDGNGTNASF 287


>gi|417763451|ref|ZP_12411429.1| putative lipoprotein [Leptospira interrogans str. 2002000624]
 gi|417776933|ref|ZP_12424763.1| putative lipoprotein [Leptospira interrogans str. 2002000621]
 gi|418671130|ref|ZP_13232484.1| putative lipoprotein [Leptospira interrogans str. 2002000623]
 gi|409940657|gb|EKN86296.1| putative lipoprotein [Leptospira interrogans str. 2002000624]
 gi|410573292|gb|EKQ36344.1| putative lipoprotein [Leptospira interrogans str. 2002000621]
 gi|410581799|gb|EKQ49606.1| putative lipoprotein [Leptospira interrogans str. 2002000623]
          Length = 358

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKI 130
           D+ N++IRKI
Sbjct: 96  DQINNLIRKI 105



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
           + G  +G+  ++ FD P    +D + N+YV + +NH IRKI  N G V+T++GG S   G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELNNHTIRKINLNSGTVSTLSGGIS---G 220

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
             DG   +A F +   + +   + +LL +D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 221 YLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI   N  V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269

Query: 139 AGGGSKKEGRADGPAQNASF 158
            G G   +   DG   NASF
Sbjct: 270 LGNGIGAD--VDGNGTNASF 287


>gi|417783289|ref|ZP_12431009.1| putative lipoprotein [Leptospira interrogans str. C10069]
 gi|409953418|gb|EKO07917.1| putative lipoprotein [Leptospira interrogans str. C10069]
          Length = 358

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKI 130
           D+ N++IRKI
Sbjct: 96  DQINNLIRKI 105



 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 67/118 (56%), Gaps = 7/118 (5%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEG 147
           + G  +G+  ++ FD P    +D + N+YV + SNH IRKI  N G V+T++GG S   G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGMVSTLSGGIS---G 220

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
             DG   +A F +   + +   + +LL +D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 221 YLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 44/81 (54%), Gaps = 6/81 (7%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTT 137
           +V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI   N  V+T
Sbjct: 210 MVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVST 268

Query: 138 IAGGGSKKEGRADGPAQNASF 158
           + G G   +   DG   NASF
Sbjct: 269 LLGNGIGAD--VDGNGTNASF 287


>gi|430744164|ref|YP_007203293.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
 gi|430015884|gb|AGA27598.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
          Length = 358

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 59/117 (50%), Gaps = 6/117 (5%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
            +  ++  +AG G +G+S G+ G    A+ D+P    +D +GN+Y AD+ N  +R++  +
Sbjct: 72  GKSGIITTVAGSGTKGFS-GDGGVALKAKLDEPYGIVLDSRGNLYFADRLNRRVRRVDAD 130

Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            G +TTIAG GSK      GP   A       +        L I+D  +  +R ++L
Sbjct: 131 SGMITTIAGDGSKTYSGDGGPGARAGLVEPNGVALDSQEARLFIADVADHRVRVVDL 187



 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 58/115 (50%), Gaps = 7/115 (6%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI---TN 132
           +  ++   AG G +GYS DG P +A  F+ PK FA+D  G++ + D  N  IR+I   T 
Sbjct: 245 QTGLITTRAGTGAKGYSGDGGPATAATFNGPKEFAIDRAGDLLIVDTENQAIRRIDARTG 304

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           L + T+AG G +      G A +A       +   P   A+ I D GN  IR++ 
Sbjct: 305 L-IRTLAGNGQRGGEGDGGAATSALLDRPHGVAVGPD-GAVYIGDTGNHRIRKVG 357



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 43/89 (48%), Gaps = 3/89 (3%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASF 158
           A  + P   A D +GN+Y++D  NH IR++      +TT+AG G+K      G A  A  
Sbjct: 41  ATLNMPFDVAFDSRGNLYLSDTMNHCIRRVDGKSGIITTVAGSGTKGFSGDGGVALKAKL 100

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
              + +        L  +D  N+ +R+++
Sbjct: 101 DEPYGIVLDSR-GNLYFADRLNRRVRRVD 128


>gi|75909875|ref|YP_324171.1| NHL repeat-containing protein [Anabaena variabilis ATCC 29413]
 gi|75703600|gb|ABA23276.1| NHL repeat protein [Anabaena variabilis ATCC 29413]
          Length = 503

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 76/175 (43%), Gaps = 14/175 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+  PG   L V DS         F       +  L G+G  G +DG    A+F  P
Sbjct: 175 PGKVLATPGG--LFVADSGHHRIVVSDF----NGEILHLIGNGKSGLTDGNFQEAQFSAP 228

Query: 107 KSFAVDMKGNI-YVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRA--DGPAQNASFSND 161
           +  A DM+  I YVAD  NHV+R+  I    V TIAG G +        G     + ++ 
Sbjct: 229 QGMAFDMENQILYVADTDNHVVRRADIQQQTVETIAGTGEQSRNIQPHGGAGLETALNSP 288

Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLS 216
           ++L  V +  +L I+  G   I Q++L P    K+     A G    ++  SV +
Sbjct: 289 WDLVKVGN--SLYIAMAGTHQIWQMDL-PSGFVKTYAGTGAEGCFDGYLTESVFA 340



 Score = 38.5 bits (88), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 35/62 (56%), Gaps = 3/62 (4%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
           + + GDG  G  +G+  + RF +P   +  M   +Y++D +NHVIR++    L VTT+  
Sbjct: 431 QTVLGDGSAGLQNGQGKNTRFFEPSGLSA-MDSYLYISDTNNHVIRRVDLRTLEVTTMQF 489

Query: 141 GG 142
            G
Sbjct: 490 NG 491


>gi|298245110|ref|ZP_06968916.1| NHL repeat containing protein [Ktedonobacter racemifer DSM 44963]
 gi|297552591|gb|EFH86456.1| NHL repeat containing protein [Ktedonobacter racemifer DSM 44963]
          Length = 496

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 76/154 (49%), Gaps = 22/154 (14%)

Query: 27  LLEDGYTVTTVIDGHQLEIN----PHSVIDRPG-------SSDLIVLDSSRSAFYTLSFP 75
           +LE+ +    ++D  QL  +    P S +  PG       +  LI+ DS+       +  
Sbjct: 148 MLEE-FEARGLLDHRQLRFSAATAPSSFLAFPGKLAVDAQADRLIISDSAHHRLVETNL- 205

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNL 133
              +  V+ + G GVQG +DG    A+F+ P+  A+ +   +YVAD  NH+IR++     
Sbjct: 206 ---QGRVRTIIGSGVQGQADGSFAEAQFNHPQGVAL-VNDLLYVADTDNHLIRRVDLRTK 261

Query: 134 GVTTIAGGGSKK---EGRADGPAQNASFSNDFEL 164
            V T+AG G +      R  GPA++ + S+ ++L
Sbjct: 262 QVETLAGTGEQNGMVRTRLQGPARSIALSSPWDL 295



 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 3/48 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR 128
           V+ LAG G  G  +G    A+F++P   A+   GN +Y+AD +NH IR
Sbjct: 435 VRTLAGTGEAGIHNGPAEQAQFNEPGDLAI--TGNTLYIADTNNHAIR 480


>gi|298388124|ref|ZP_06997668.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|298259116|gb|EFI01996.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 435

 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 52/94 (55%), Gaps = 2/94 (2%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP 152
           +++G    ARF++P     D  GN+YV+D+ N+ IRKIT  G  TI   G++ EG  +G 
Sbjct: 341 FANGIGEEARFNQPCQGVFDEDGNLYVSDRDNNCIRKITPDGNVTIY-AGNRSEGLVNGL 399

Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
              +SF     LT       + ++DH N +IR+I
Sbjct: 400 PLKSSFRRPEGLTRSKD-GVIYVADHDNHVIRKI 432



 Score = 38.9 bits (89), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 26/46 (56%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            AG+  +G  +G P  + F +P+       G IYVAD  NHVIRKI
Sbjct: 387 YAGNRSEGLVNGLPLKSSFRRPEGLTRSKDGVIYVADHDNHVIRKI 432


>gi|290996889|ref|XP_002681014.1| predicted protein [Naegleria gruberi]
 gi|284094637|gb|EFC48270.1| predicted protein [Naegleria gruberi]
          Length = 1023

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 60/117 (51%), Gaps = 6/117 (5%)

Query: 72  LSFPLSEESVVKRLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S  +++ES V  +AG GV  GY   +   A    P S  +   G+IY+AD SNH +R++
Sbjct: 11  VSTTVAQESFVSTIAGGGVCDGYLATQ---ASLAYPGSPTIGPDGSIYIADSSNHRVRQV 67

Query: 131 T-NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             N  +TTIAG G         PA  A   N   +  V  I  + ISD+GN  IR++
Sbjct: 68  YPNGTITTIAGTGISGYNGDVIPATRAQLKNPVSVA-VNSIGEVFISDNGNNRIRKV 123



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 56/111 (50%), Gaps = 4/111 (3%)

Query: 79  ESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV 135
              +  +AG G+ GY+ D  P + A+   P S AV+  G ++++D  N+ IRK+ TN  +
Sbjct: 70  NGTITTIAGTGISGYNGDVIPATRAQLKNPVSVAVNSIGEVFISDNGNNRIRKVLTNGTI 129

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            T AG G        G A NA  +  + +     I  L+ISD  +  IR++
Sbjct: 130 ITFAGSGQTTFSGDYGLATNAGINYPYGIALT-SIEELIISDVNHNRIRKV 179



 Score = 40.8 bits (94), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 28/42 (66%), Gaps = 2/42 (4%)

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSK 144
           KPK   +D  GNIY+AD +N+ +RKI+ L   + TIAG GS 
Sbjct: 356 KPKGAVIDSLGNIYIADTNNNRVRKISYLDGTINTIAGTGSN 397



 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 51/109 (46%), Gaps = 13/109 (11%)

Query: 85  LAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGG 142
           +AG G  G++ D     A+   P S  +D   N+Y+AD  NH IRKI  N  +TTI G G
Sbjct: 427 VAGVGSSGFNGDILATDAKLSNPVSVTIDSNDNVYIADTYNHRIRKILQNGNLTTIVGLG 486

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQI 186
           S       G   +   SN  +L +   I       + I+D  N  IR++
Sbjct: 487 SS------GFNGDYLLSNGTKLNYPQSIAFDSNGNMYIADMNNNRIRKM 529



 Score = 40.4 bits (93), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 29/42 (69%), Gaps = 1/42 (2%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRK-ITNLGVTTIAGGG 142
           + + P+S A D  GN+Y+AD +N+ IRK +TN  + T+AG G
Sbjct: 501 KLNYPQSIAFDSNGNMYIADMNNNRIRKMLTNGTIITVAGTG 542



 Score = 38.9 bits (89), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 7/48 (14%)

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
           G PG    DK         G++Y ADK +H IRKI+N  +T++AG GS
Sbjct: 299 GNPGQMFIDKNS-------GDVYFADKGSHRIRKISNGFITSVAGSGS 339


>gi|424667903|ref|ZP_18104928.1| hypothetical protein A1OC_01487 [Stenotrophomonas maltophilia
           Ab55555]
 gi|401068165|gb|EJP76689.1| hypothetical protein A1OC_01487 [Stenotrophomonas maltophilia
           Ab55555]
          Length = 470

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 66/147 (44%), Gaps = 11/147 (7%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S+ L + S+  R AG G     DG    A F +P + AV ++  +YVAD     IR +  
Sbjct: 301 SYHLGQRSLQWR-AGSGAIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358

Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
            G  V T+ G G  + G  DGP   AS      +   P    L I+D GN  +R + L  
Sbjct: 359 RGDLVQTLVGQGPWRFGDEDGPRAQASLQFPQAIALSPDAPLLWIADTGNGRLRTLRLGG 418

Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
                +P        +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLHGPAGLAVGAGAVWI 445



 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 12/103 (11%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
           + R  G G   + DG    A F +P++  ++ +  +YVAD  NH +R+I  L G+     
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDALYVADTGNHAVRRINLLTGIVDTLC 255

Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
           G  +     +GP AQ    S D       H   L I+D  NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPVGLAIAD--NQL 289


>gi|283779991|ref|YP_003370746.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
 gi|283438444|gb|ADB16886.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
          Length = 364

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 37/83 (44%), Positives = 49/83 (59%), Gaps = 9/83 (10%)

Query: 77  SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI---T 131
           ++  ++  +AG G +G++ DG P   A F  P S A+D   NIYV D  NH IRKI   T
Sbjct: 135 AKTGIISTIAGSGKEGFAGDGGPAKEAVFSIPHSIALDADDNIYVCDLGNHRIRKIDAKT 194

Query: 132 NLGVTTIAGGGSK---KEGRADG 151
            L ++TIAG G K   KEG+ DG
Sbjct: 195 GL-ISTIAGNGGKTLPKEGKIDG 216



 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 45/118 (38%), Positives = 61/118 (51%), Gaps = 23/118 (19%)

Query: 85  LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIA 139
           +AG G  GY+ DG P + A  ++P     D  GN+   +  N VIRK+   T + ++TIA
Sbjct: 86  VAGSGKMGYAGDGGPATEALLNEPYEVRFDSHGNMIFVEMKNFVIRKVDAKTGI-ISTIA 144

Query: 140 GGGSKKEGRA--DGPAQNASFSNDFELTFVPHICAL------LISDHGNQLIRQINLK 189
           G G  KEG A   GPA+ A FS       +PH  AL       + D GN  IR+I+ K
Sbjct: 145 GSG--KEGFAGDGGPAKEAVFS-------IPHSIALDADDNIYVCDLGNHRIRKIDAK 193



 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 49/96 (51%), Gaps = 9/96 (9%)

Query: 38  IDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
           IDG    + P +V ID+  +   I L    S  + L+   SE   +  +AG G +G+S D
Sbjct: 214 IDGTMSLLGPRAVCIDK--NVMWIALREGHS-VWKLNLATSE---LTHIAGTGKKGFSGD 267

Query: 96  GEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G P   A FD PK  AV   G + V D  NHVIRKI
Sbjct: 268 GGPAKDATFDGPKGVAVCPDGGVVVVDTENHVIRKI 303



 Score = 37.0 bits (84), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 34/134 (25%), Positives = 58/134 (43%), Gaps = 6/134 (4%)

Query: 71  TLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           TL+  L E   +  +AG G    G + G         P    +   G +Y+ +  NH +R
Sbjct: 16  TLALTL-EAGTIDTVAGTGKPADGPASGLGVETNVGDPFGVEIGPDGALYITEVRNHRVR 74

Query: 129 KI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           ++      +TT+AG G        GPA  A  +  +E+ F  H   ++  +  N +IR++
Sbjct: 75  RLDLKTGAMTTVAGSGKMGYAGDGGPATEALLNEPYEVRFDSH-GNMIFVEMKNFVIRKV 133

Query: 187 NLKPEDCSKSSQSG 200
           + K    S  + SG
Sbjct: 134 DAKTGIISTIAGSG 147


>gi|190573634|ref|YP_001971479.1| hypothetical protein Smlt1644 [Stenotrophomonas maltophilia K279a]
 gi|190011556|emb|CAQ45175.1| conserved hypothetical protein [Stenotrophomonas maltophilia K279a]
          Length = 470

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 66/147 (44%), Gaps = 11/147 (7%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S+ L + S+  R AG G     DG    A F +P + AV ++  +YVAD     IR +  
Sbjct: 301 SYHLGQRSLQWR-AGSGAIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358

Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
            G  V T+ G G  + G  DGP   AS      +   P    L I+D GN  +R + L  
Sbjct: 359 RGDLVQTLVGQGPWRFGDEDGPRAQASLQFPQAIALSPDAPLLWIADTGNGRLRTLRLGG 418

Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
                +P        +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLHGPAGLAVGAGAVWI 445



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 12/103 (11%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
           + R  G G   + DG    A F +P++  ++ +  +YVAD  NH +R+I  L G+     
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDALYVADTGNHAVRRINLLTGIVDTLC 255

Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
           G  +     +GP AQ    S D       H   L I+D  NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPVGLAIAD--NQL 289


>gi|284038675|ref|YP_003388605.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
 gi|283817968|gb|ADB39806.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
          Length = 359

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 38/97 (39%), Positives = 46/97 (47%), Gaps = 5/97 (5%)

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           F ++ +  +   AG G  GY DG    A+F  P   A D  GN+YVAD  N  IRKI+  
Sbjct: 73  FKVTPQGNLSLYAGSGGTGYQDGSLDKAKFLWPYGLAFDRAGNLYVADSGNQAIRKISPD 132

Query: 134 G-VTTIAGG----GSKKEGRADGPAQNASFSNDFELT 165
           G VTT AG      S      DG  + A F N   LT
Sbjct: 133 GQVTTFAGQPYDVTSITNVSVDGIGKEARFYNPLVLT 169



 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 4/96 (4%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPA 153
           +G   SAR+    S   D   N+Y+ +    ++RKI   G V+ +   GS+  G  DGP 
Sbjct: 267 NGPINSARYGLITSLRFDSNDNLYIGETGAGIVRKIATDGQVSDVT--GSRLGGYKDGPL 324

Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           Q A F +  +L F P   +L ++D+ N  IR+I  +
Sbjct: 325 QAAEFGSVEDLAFSPS-GSLYVADNRNGAIRKITFE 359



 Score = 38.1 bits (87), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 9/60 (15%)

Query: 81  VVKRLAGDG---------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +V+++A DG         + GY DG   +A F   +  A    G++YVAD  N  IRKIT
Sbjct: 298 IVRKIATDGQVSDVTGSRLGGYKDGPLQAAEFGSVEDLAFSPSGSLYVADNRNGAIRKIT 357



 Score = 37.4 bits (85), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 3/82 (3%)

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
           +P S A D   N+Y AD    + +      ++  AG G    G  DG    A F   + L
Sbjct: 51  RPVSVAFDKANNLYFADGIARIFKVTPQGNLSLYAGSGG--TGYQDGSLDKAKFLWPYGL 108

Query: 165 TFVPHICALLISDHGNQLIRQI 186
            F      L ++D GNQ IR+I
Sbjct: 109 AF-DRAGNLYVADSGNQAIRKI 129


>gi|290973047|ref|XP_002669261.1| predicted protein [Naegleria gruberi]
 gi|284082806|gb|EFC36517.1| predicted protein [Naegleria gruberi]
          Length = 618

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 48/87 (55%), Gaps = 3/87 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G   +A+   P S  V +   +Y+AD+ NH IRKI  N  
Sbjct: 148 ENGNIITIAGNGTDGFSGDNGPATNAQLYYPSSVFVSLTNEVYIADQHNHRIRKILENGN 207

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND 161
           + TIAG G+      +GPA NA  +++
Sbjct: 208 IITIAGNGTYGFSGDNGPATNAQLNHN 234



 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 42/87 (48%), Gaps = 2/87 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           A+   P    V     +Y+AD  NH +RKI  N  + TIAG G+      +GPA NA F 
Sbjct: 8   AQLFNPFGVFVSSNNEVYIADFCNHAVRKILENGNIVTIAGNGTAGFSGDNGPATNAQF- 66

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
           N     FV     + I+D  N  IR+I
Sbjct: 67  NYPSSVFVSSNNEVCIADLHNHRIRKI 93



 Score = 44.3 bits (103), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 46/95 (48%), Gaps = 9/95 (9%)

Query: 61  VLDSSRSAFYTLSF------PLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVD 112
           V  SS +  Y   F       + E   +  +AG+G  G+S  +G   +A+F+ P S  V 
Sbjct: 16  VFVSSNNEVYIADFCNHAVRKILENGNIVTIAGNGTAGFSGDNGPATNAQFNYPSSVFVS 75

Query: 113 MKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKE 146
               + +AD  NH IRKI   G + TIAG G+  +
Sbjct: 76  SNNEVCIADLHNHRIRKILENGSIITIAGNGTHNQ 110


>gi|429737840|ref|ZP_19271682.1| IPT/TIG domain protein [Prevotella saccharolytica F0055]
 gi|429161715|gb|EKY04093.1| IPT/TIG domain protein [Prevotella saccharolytica F0055]
          Length = 438

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 54/110 (49%), Gaps = 2/110 (1%)

Query: 80  SVVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           +  KRL+   + G + DG    A+F  P     D  GNIYVAD+ NH IR+IT   +   
Sbjct: 329 TTYKRLSSSNISGGHRDGPLSEAQFRNPSQIFCDNDGNIYVADRGNHCIRRITPENMVET 388

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
             G    +G  DG  ++A F N+     +    ++ ++D  N  +R++ +
Sbjct: 389 VLGMPGTKGWKDGKKEDALF-NEPTGIGIAQDGSVYVADFKNGRVRKLTI 437



 Score = 41.2 bits (95), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 35/57 (61%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ E++V+ + G  G +G+ DG+   A F++P    +   G++YVAD  N  +RK+T
Sbjct: 380 ITPENMVETVLGMPGTKGWKDGKKEDALFNEPTGIGIAQDGSVYVADFKNGRVRKLT 436


>gi|456733989|gb|EMF58811.1| Hypothetical protein EPM1_4079 [Stenotrophomonas maltophilia EPM1]
          Length = 470

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 66/147 (44%), Gaps = 11/147 (7%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S+ L + S+  R AG G     DG    A F +P + AV ++  +YVAD     IR +  
Sbjct: 301 SYHLGQRSLQWR-AGSGAIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358

Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
            G  V T+ G G  + G  DGP   AS      +   P    L I+D GN  +R + L  
Sbjct: 359 RGDLVQTLVGQGPWRFGDEDGPRAQASLQFPQAIALSPDAPLLWIADTGNGRLRTLRLGG 418

Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
                +P        +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLHGPAGLAVGAGAVWI 445



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 12/103 (11%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
           + R  G G   + DG    A F +P++  ++ +  +YVAD  NH +R+I  L G+     
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDALYVADTGNHAVRRINLLTGIVDTLC 255

Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
           G  +     +GP AQ    S D       H   L I+D  NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPVGLAIAD--NQL 289


>gi|29830929|ref|NP_825563.1| hypothetical protein SAV_4386 [Streptomyces avermitilis MA-4680]
 gi|29608042|dbj|BAC72098.1| hypothetical protein SAV_4386 [Streptomyces avermitilis MA-4680]
          Length = 341

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 63/114 (55%), Gaps = 7/114 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S + +V D++R     L+     ESVV+R+ G G++G++DG   +A F +P+   +  + 
Sbjct: 189 SGNFLVSDTTRHQLVELA--ADGESVVRRI-GSGIRGFADGPADAASFSEPQGMTLLDED 245

Query: 116 NIYVADKSNHVIRK--ITNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELT 165
            + VAD  NH +R+  +    VTT+AG G +  +   A GPA+    S+ +++ 
Sbjct: 246 VVVVADTVNHALRRLDLATGEVTTLAGTGRQWWQGSPASGPAREVDLSSPWDVA 299



 Score = 41.6 bits (96), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 55/114 (48%), Gaps = 3/114 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           GDG     + EP   RF  P    +   GN  V+D + H + ++   G + +   GS   
Sbjct: 164 GDGPYVAPEPEPTVLRF--PGKALLLPSGNFLVSDTTRHQLVELAADGESVVRRIGSGIR 221

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
           G ADGPA  ASFS    +T +     ++++D  N  +R+++L   + +  + +G
Sbjct: 222 GFADGPADAASFSEPQGMTLLDEDV-VVVADTVNHALRRLDLATGEVTTLAGTG 274


>gi|292491957|ref|YP_003527396.1| hypothetical protein Nhal_1896 [Nitrosococcus halophilus Nc4]
 gi|291580552|gb|ADE15009.1| YD repeat protein [Nitrosococcus halophilus Nc4]
          Length = 2539

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 62/113 (54%), Gaps = 3/113 (2%)

Query: 76   LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
            +  + VV   AG+G+ G+S  G+  +    +P + A+D  GN+Y+A+  N+VIR++   G
Sbjct: 1294 IGTDGVVSIAAGNGIAGFSGTGDLATQAMMRPFAIALDSDGNLYIAESQNNVIRRVGPDG 1353

Query: 135  -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             +T +AG G++      G A  A  ++  +L   P   +L I+D+ N  IR +
Sbjct: 1354 LITLVAGNGTRGFSGDGGLATQARLNHPNDLAIGPD-GSLYIADYDNHRIRLV 1405



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/141 (34%), Positives = 73/141 (51%), Gaps = 17/141 (12%)

Query: 77   SEESVVKRL---------AGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSN 124
            S+ +V++R+         AG+G +G+S G+ G    AR + P   A+   G++Y+AD  N
Sbjct: 1341 SQNNVIRRVGPDGLITLVAGNGTRGFS-GDGGLATQARLNHPNDLAIGPDGSLYIADYDN 1399

Query: 125  HVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
            H IR ++  G +TTIAG GS+  G   GPA  A   N   +T V    +L I+D  N  I
Sbjct: 1400 HRIRLVSPNGIITTIAGNGSRGYGGDGGPATQAMLRNPQGIT-VASDSSLYIADRRNHRI 1458

Query: 184  RQINLKPEDCSKSSQSGSALG 204
            R+++  PE    +      LG
Sbjct: 1459 RKVS--PEGIITTVAGNGILG 1477



 Score = 41.2 bits (95), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 4/58 (6%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +S E ++  +AG+G+ GY DG+ G    A+ + P + A+    N+Y+AD  NH IR++
Sbjct: 1461 VSPEGIITTVAGNGILGY-DGDGGISTGAKLNLPIAVALSPNENLYIADYYNHRIRRV 1517


>gi|290991466|ref|XP_002678356.1| predicted protein [Naegleria gruberi]
 gi|284091968|gb|EFC45612.1| predicted protein [Naegleria gruberi]
          Length = 1017

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 53/94 (56%), Gaps = 4/94 (4%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG--GGSKKEGRADGPAQN 155
           +A  + P   AV   G++Y+ADKSNHVIRK++ L   +TTIAG  G ++    ++  A N
Sbjct: 108 NAELNFPSGVAVHSNGDVYIADKSNHVIRKVSALNGKITTIAGIAGETELNKYSNSLATN 167

Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            + ++   L        ++ISD  N +IR++ L 
Sbjct: 168 TTLNSPQYLAVNSSTAEVIISDTNNNVIRKVYLN 201



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 44/129 (34%), Positives = 65/129 (50%), Gaps = 17/129 (13%)

Query: 68  AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSN 124
           A Y  SF  S  + +  +AG G  G   G+ G   SAR  KP S  ++ + N+Y+ D  N
Sbjct: 447 AMYLDSFLGSLATKITVVAGTGKLGGYAGDGGLATSARIQKPTSVVLNDQ-NLYIVDTLN 505

Query: 125 HVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISD 177
           H IRK+  T   +TTIAG G+       G + +   +   +L +  H+       + ISD
Sbjct: 506 HRIRKVSLTFGNITTIAGIGTA------GFSGDGGLATKAKLNYPTHMAISASGEIFISD 559

Query: 178 HGNQLIRQI 186
           +GNQ IR+I
Sbjct: 560 NGNQRIRKI 568



 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 59/119 (49%), Gaps = 8/119 (6%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--T 131
           ++    +  +AG+G+ G+S  +G    A F+     AV   G++YVAD  NH IRKI  +
Sbjct: 568 IATNGKISTIAGNGIVGFSGDNGLATKATFNSRNGIAVASNGDVYVADTRNHRIRKISVS 627

Query: 132 NLGVTTIAGGGS----KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           N  ++T AG GS       G   G A +A  +  + +        + I+D GN  IR++
Sbjct: 628 NGFISTFAGNGSVAYQATFGGDGGLAVSAKLNLPYSVAINNATNEVYITDSGNHRIRKV 686



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 57/117 (48%), Gaps = 9/117 (7%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
           +S    +  +AG G  G S   G   SA+ + P S A+ +   I + D  NH IRK+ +N
Sbjct: 255 ISTNGKITTIAGTGTAGLSGDGGLATSAKLNYPNSVALGLNNEILIVDTLNHRIRKLFSN 314

Query: 133 LGVTTIAGGGSKKEGRAD-GPAQNA--SFSNDFELTFVPHICALLISDHGNQLIRQI 186
             + +IAG G+ +    D G A NA  +  ND  +T         ISD GN  IR++
Sbjct: 315 GTIISIAGNGTTQGFSGDGGNALNALLNLPNDVVMTLNGE---YFISDFGNHRIRKV 368



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 8/102 (7%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ--GYSDGEPGSAR 102
           +N +S    P S + I+ DS+ +    L   +S   ++  +AG G    G  +     A+
Sbjct: 820 LNSNSFTTTP-SGEFIIADSNNN----LIRKISTSGIISTIAGTGAATFGGDNANATIAK 874

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGS 143
            + P + AV   G I++AD +NH IRKI  N  +TTIAG G+
Sbjct: 875 LNNPLNVAVSSSGEIFIADTNNHRIRKIFLNGTITTIAGNGT 916



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 59/123 (47%), Gaps = 4/123 (3%)

Query: 82  VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
           +  +AG G  G+S   G    A+ + P   A+   G I+++D  N  IRKI TN  ++TI
Sbjct: 518 ITTIAGIGTAGFSGDGGLATKAKLNYPTHMAISASGEIFISDNGNQRIRKIATNGKISTI 577

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
           AG G       +G A  A+F N      V     + ++D  N  IR+I++     S  + 
Sbjct: 578 AGNGIVGFSGDNGLATKATF-NSRNGIAVASNGDVYVADTRNHRIRKISVSNGFISTFAG 636

Query: 199 SGS 201
           +GS
Sbjct: 637 NGS 639



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 7/70 (10%)

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
           +Y AD  N  +RKI+N  +TTIAG      G  DG A  +++ N    T  P     +I+
Sbjct: 784 VYFADSLNSRVRKISNGKITTIAG------GIGDGLAATSAYLNSNSFTTTPS-GEFIIA 836

Query: 177 DHGNQLIRQI 186
           D  N LIR+I
Sbjct: 837 DSNNNLIRKI 846



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 56/125 (44%), Gaps = 26/125 (20%)

Query: 79   ESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
               +  +AG+G  GYS DG +    + + P + AV   G I++ D  NH IRKI   G +
Sbjct: 905  NGTITTIAGNGTAGYSGDGLDSTKCQLNYPSAVAVSSGGEIFIVDTHNHRIRKIAIDGII 964

Query: 136  TTIAGGGSKKEGRADGPAQNASFSNDFEL-----------TFVPHICALLISDHGNQLIR 184
            +TIAG G             A F+ D +L             +       IS+ GN+ IR
Sbjct: 965  STIAGNGI------------AGFNGDGKLPINTQLNYPTGIVIASSGEAYISEEGNRRIR 1012

Query: 185  QINLK 189
            +I L+
Sbjct: 1013 KIYLQ 1017



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 2/99 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRAD 150
           G  DG   ++ +    SF     G   +AD +N++IRKI+  G ++TIAG G+   G  +
Sbjct: 808 GIGDGLAATSAYLNSNSFTTTPSGEFIIADSNNNLIRKISTSGIISTIAGTGAATFGGDN 867

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             A  A  +N   +  V     + I+D  N  IR+I L 
Sbjct: 868 ANATIAKLNNPLNVA-VSSSGEIFIADTNNHRIRKIFLN 905


>gi|456988165|gb|EMG23306.1| hypothetical protein LEP1GSC150_2908 [Leptospira interrogans
           serovar Copenhageni str. LT2050]
          Length = 231

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKI 130
           D+ N++IRKI
Sbjct: 96  DQINNLIRKI 105


>gi|428162807|gb|EKX31917.1| hypothetical protein GUITHDRAFT_57527, partial [Guillardia theta
           CCMP2712]
          Length = 203

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 58/113 (51%), Gaps = 5/113 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIR--KITNLGVTTI 138
           V  LAG G  G++DGE  +A F+ P + A     +I  +AD  NH +R   ++N+ ++T+
Sbjct: 54  VGTLAGSGYVGFADGEARAAMFNHPIALATSPDNSIVLIADGFNHRVRSFNVSNMSISTL 113

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
           AG G    G  DG   +A F+    L+F      + ++D  N  IR I L  +
Sbjct: 114 AGDGGA--GFQDGIGTDAKFNFPSALSFFGDGTKVAVTDMYNNKIRIITLDSK 164



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 56/115 (48%), Gaps = 7/115 (6%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGN---IYVADKSNHVIRKIT--NLGVTTIAGGGSKKEG 147
           Y DG   +A F+ P   A+   G+   + V++  NHV+R I   +L V T+AG G    G
Sbjct: 7   YIDGNLSNACFNGPSGVAILQLGSDTLVLVSEVGNHVVRYINVISLQVGTLAGSG--YVG 64

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
            ADG A+ A F++   L   P    +LI+D  N  +R  N+     S  +  G A
Sbjct: 65  FADGEARAAMFNHPIALATSPDNSIVLIADGFNHRVRSFNVSNMSISTLAGDGGA 119


>gi|255070387|ref|XP_002507275.1| predicted protein [Micromonas sp. RCC299]
 gi|226522550|gb|ACO68533.1| predicted protein [Micromonas sp. RCC299]
          Length = 131

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 57/108 (52%), Gaps = 4/108 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           V  ++G G+ G+ DG+   + F  P   AV   G++YVAD  N+ IR++++ G VT IAG
Sbjct: 6   VTSISGSGIPGFRDGQGEKSHFSSPAGVAVSSDGSVYVADSGNNRIRRVSSKGHVTWIAG 65

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            G    G +D     A F+    +  +P    + ++D  N  IR ++L
Sbjct: 66  SG--LAGFSDDKLLRAEFNRPQGIVTIP-TGLIFVADTLNHRIRLVSL 110



 Score = 38.1 bits (87), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S +  V  +AG G+ G+SD +   A F++P+       G I+VAD  NH IR ++
Sbjct: 54  VSSKGHVTWIAGSGLAGFSDDKLLRAEFNRPQGIVTIPTGLIFVADTLNHRIRLVS 109


>gi|421117009|ref|ZP_15577380.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
           Fiocruz LV133]
 gi|410011441|gb|EKO69561.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
           Fiocruz LV133]
          Length = 358

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTISTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKI 130
           D+ N++IRKI
Sbjct: 96  DQINNLIRKI 105



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 57/100 (57%), Gaps = 5/100 (5%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEGRA 149
           G  +G+  ++ FD P    +D + N+YV + SNH IRKI  N G V+T++GG S   G  
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---GYL 222

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           DG   +A F +   + +   + +LL +D  +  IR+I+LK
Sbjct: 223 DGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK 262



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI   N  V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269

Query: 139 AGGGSKKEGRADGPAQNASF 158
            G G   +   DG   NASF
Sbjct: 270 LGNGIGTD--VDGNGTNASF 287


>gi|418688943|ref|ZP_13250072.1| putative lipoprotein [Leptospira interrogans str. FPW2026]
 gi|421128059|ref|ZP_15588277.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. 2006006986]
 gi|421134520|ref|ZP_15594655.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. Andaman]
 gi|400361935|gb|EJP17894.1| putative lipoprotein [Leptospira interrogans str. FPW2026]
 gi|410021314|gb|EKO88104.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. Andaman]
 gi|410434526|gb|EKP83664.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. 2006006986]
          Length = 358

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 66/116 (56%), Gaps = 7/116 (6%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEGRA 149
           G  +G+  ++ FD P    +D + N+YV + SNH IRKI  N G V+T++GG S   G  
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLSGGIS---GYL 222

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           DG   +A F +   + +   + +LL +D  +  IR+I+LK  + + S+  G+ +GA
Sbjct: 223 DGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLK--NSTVSTLLGNGIGA 276



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTISTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKI 130
           D+ N++IRKI
Sbjct: 96  DQINNLIRKI 105



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI   N  V+T+
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDLKNSTVSTL 269

Query: 139 AGGGSKKEGRADGPAQNASF 158
            G G   +   DG   NASF
Sbjct: 270 LGNGIGAD--VDGNGTNASF 287


>gi|392945711|ref|ZP_10311353.1| serine/threonine protein kinase [Frankia sp. QA3]
 gi|392289005|gb|EIV95029.1| serine/threonine protein kinase [Frankia sp. QA3]
          Length = 859

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 34/89 (38%), Positives = 50/89 (56%), Gaps = 2/89 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
           +A+ + P S   D  GNIY+ D  N+ IRKI+  GV +TIAG G+       GPA  A  
Sbjct: 592 AAQLNGPGSTTRDKAGNIYIGDAQNNRIRKISPAGVISTIAGTGTAGYSGDGGPATAAQL 651

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
           +N  ++T  P   ++  SD+ N  IR+I+
Sbjct: 652 NNAEQVTTGPD-GSVYFSDYENHRIRKID 679



 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 34/110 (30%), Positives = 54/110 (49%), Gaps = 4/110 (3%)

Query: 81  VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TT 137
           ++    G GV GY+   G    AR D P    +   G +Y AD  +  I+K+T  G+ ++
Sbjct: 683 IITTYVGTGVAGYTGAGGPATQARIDGPHEITMTDDGTLYFADLRSETIQKVTPDGIISS 742

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +AG G        GPA++A   N   L+  P    L ++D+ N  IR+I+
Sbjct: 743 VAGTGEAGYAGDGGPARSAKL-NGPSLSIGPDGRTLYLADYHNNRIRKID 791



 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 33/93 (35%), Positives = 45/93 (48%), Gaps = 3/93 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S   V+  +AG G  GYS DG P +A + +  +       G++Y +D  NH IRKI   
Sbjct: 622 ISPAGVISTIAGTGTAGYSGDGGPATAAQLNNAEQVTTGPDGSVYFSDYENHRIRKIDPA 681

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           G +TT  G G      A GPA  A      E+T
Sbjct: 682 GIITTYVGTGVAGYTGAGGPATQARIDGPHEIT 714



 Score = 47.8 bits (112), Expect = 0.005,   Method: Composition-based stats.
 Identities = 42/116 (36%), Positives = 65/116 (56%), Gaps = 6/116 (5%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKG-NIYVADKSNHVIRKITN 132
           ++ + ++  +AG G  GY+ DG P  SA+ + P S ++   G  +Y+AD  N+ IRKI  
Sbjct: 734 VTPDGIISSVAGTGEAGYAGDGGPARSAKLNGP-SLSIGPDGRTLYLADYHNNRIRKIDP 792

Query: 133 LGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            G+ TT+AG G+   G   GPA  A   N   +  V    AL I+D+GN  +R+I+
Sbjct: 793 SGIITTVAGIGTAGSGGDGGPATAAQLKNPTSVV-VDGAGALYIADNGNARVRRID 847


>gi|116672456|ref|YP_833389.1| NHL repeat-containing protein [Arthrobacter sp. FB24]
 gi|116612565|gb|ABK05289.1| NHL repeat containing protein [Arthrobacter sp. FB24]
          Length = 672

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 50/166 (30%), Positives = 77/166 (46%), Gaps = 10/166 (6%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTL-SF-PLSEESVVKRLA 86
           ED    T  +  H L++   S  D   S+ L  +  + +  + + SF P+S    V  LA
Sbjct: 323 EDAAGFTGRLSDHPLDVALSSPWDVVWSAKLNAVVVAMAGVHQIFSFDPIS--GAVSILA 380

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTI---AGG 141
           G+G++G  DG    A F +    A D  GNI+VAD     +RK  I + G  T+    G 
Sbjct: 381 GNGLEGLLDGAAHEAWFAQSSGLAEDADGNIWVADSETSALRKLVIDDAGTVTVESAVGK 440

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G    G  DGPA  A   +   +T +P   ++ I+D  N  +R+ +
Sbjct: 441 GLFDFGFRDGPAAEARLQHPLGVTVLPD-GSVAIADTYNGAVRRYD 485


>gi|408824784|ref|ZP_11209674.1| NHL repeat containing protein [Pseudomonas geniculata N1]
          Length = 470

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 11/147 (7%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S+ L + S+  R AG G     DG    A F +P + AV ++  +YVAD     IR +  
Sbjct: 301 SYHLGQRSLQWR-AGSGSIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358

Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
            G  V T+ G G  + G  DGP  +AS      +   P    L I+D GN  +R + L  
Sbjct: 359 RGDLVQTLVGQGPWRFGSEDGPRADASLQFPQAIALSPDAPLLWIADTGNGRLRTLRLGG 418

Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
                +P        +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLHGPAGLAVGAGAVWI 445



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 12/103 (11%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
           + R  G G   + DG    A F +P++  ++ +  +YVAD  NH +R+I  L G+     
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDALYVADTGNHAVRRINLLTGIVDTLC 255

Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
           G  +     +GP AQ    S D       H   L I+D  NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPVGLAIAD--NQL 289


>gi|290977429|ref|XP_002671440.1| predicted protein [Naegleria gruberi]
 gi|284085009|gb|EFC38696.1| predicted protein [Naegleria gruberi]
          Length = 1494

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 63/122 (51%), Gaps = 6/122 (4%)

Query: 81  VVKRLAGDGVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
           +V  +AG   +   DG     A+ + P   A+   G +Y++D SN VIRKI TN  ++T+
Sbjct: 473 LVSTIAGTISKYTGDGSSAIQAQLNGPTGVAISPNGEVYISDSSNDVIRKIDTNGAISTV 532

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
            G  S   G  D  A+ A  +N   + F+P+   L+ISD  N  IR+ ++   + +  + 
Sbjct: 533 VGSSS---GFLDASARRAQLANPMGIAFLPN-GDLIISDAKNNRIRKFSISSGNVTTIAG 588

Query: 199 SG 200
           +G
Sbjct: 589 TG 590



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 35/130 (26%), Positives = 55/130 (42%), Gaps = 1/130 (0%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG-SARFDKPKSFAVDMKGN 116
           D+ ++DSS++    + +  +  +        G    + G P  SA F  P+  AV     
Sbjct: 127 DVFIVDSSQNRVKKVFYNSTINAFNMITVAGGGSSLASGIPALSATFTAPRCIAVSPDDE 186

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
           +YV D  N++I+KI N  +  + G G K        A N   S+   + F P    L  S
Sbjct: 187 VYVCDAFNYLIKKIVNDTIYNVIGTGIKGNSVDGTLANNTKISSPSSIAFHPITKELFYS 246

Query: 177 DHGNQLIRQI 186
           D    +IR I
Sbjct: 247 DLLANVIRYI 256


>gi|389875076|ref|YP_006374432.1| NHL repeat-containing protein [Tistrella mobilis KA081020-065]
 gi|388532256|gb|AFK57450.1| NHL repeat-containing protein [Tistrella mobilis KA081020-065]
          Length = 487

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 60/109 (55%), Gaps = 6/109 (5%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGG 141
           R  G G  G  DG   SARF +P+   +   G I+VAD  NH +R+I   +  VTTIAG 
Sbjct: 210 RRVGSGEPGLVDGPAESARFQRPQGL-IAADGAIFVADTWNHAVRRIDVASGEVTTIAGT 268

Query: 142 GSKKE-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G +    +A  PA +++ ++ ++L +   +  L +++ G   + +I+L+
Sbjct: 269 GRRGPILKAPAPAIDSALASPWDLEYREGV--LYVANAGTHQLARIDLE 315



 Score = 44.7 bits (104), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--V 135
           E  V++ LAG G +   DG    A   +P + A+  +  ++  D     +R++   G  V
Sbjct: 315 ETGVLEPLAGTGAEALVDGPADQAALAQPSALAL-KEDRLWFIDAETSSLRRLDLAGGTV 373

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +T+ G G  + G  DGPA+ A F +   L F  +   +L++D  N  IR ++
Sbjct: 374 STLVGDGLFEFGMVDGPARTARFQHPLGLCFDMN-GNILVADAYNDAIRVVD 424


>gi|423216521|ref|ZP_17203044.1| hypothetical protein HMPREF1074_04576 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392690731|gb|EIY83987.1| hypothetical protein HMPREF1074_04576 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 589

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 69/144 (47%), Gaps = 18/144 (12%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE---PGSARFDKPKSFAVD-MKG 115
           +++ + R   Y ++        V ++A DG  G +DGE   P +A+  +P+   VD  + 
Sbjct: 453 LIVGAQRDGLYAVA----PNGTVTKIACDG-NGNTDGEANKPMTAKLIQPEGIVVDKTRN 507

Query: 116 NIYVADKSNHVIRKI-------TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
           +IY  D  N+ IR+I       T+  V+TIAG G+K  G  DG    A  +    +T   
Sbjct: 508 DIYFTDGYNNKIRRIRPGKNGYTDATVSTIAGSGTK--GNQDGDGATAQLAMPHGITMTA 565

Query: 169 HICALLISDHGNQLIRQINLKPED 192
               +  SD  N +IR+I +  +D
Sbjct: 566 DGNTIYFSDLDNYIIRKITITEKD 589


>gi|320107313|ref|YP_004182903.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
 gi|319925834|gb|ADV82909.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
          Length = 1821

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/95 (36%), Positives = 52/95 (54%), Gaps = 3/95 (3%)

Query: 95  DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGP 152
           DG P SA RF+ P   A+D  G++Y+AD++NH IR IT  G ++T+AG G  +     G 
Sbjct: 341 DGGPASAGRFNNPLGLAIDSGGSLYIADQANHRIRMITAGGSLSTVAGTGVPRYNGDGGL 400

Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           A  A       +  V     L I+D  N L+R+++
Sbjct: 401 AVAAQLQQPAAVA-VDVARNLYIADSNNHLVRKVS 434



 Score = 47.8 bits (112), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 67/140 (47%), Gaps = 14/140 (10%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARF---DKPKSFA 110
           G+ DL + D+  +A   L+       ++  +AG  GV G S G+ GSA     + P   A
Sbjct: 244 GAGDLYIADTRNNAIRKLTL---ATGILSTIAGRLGVPG-SSGDGGSATLATLNAPGGLA 299

Query: 111 VDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRAD-GPAQNASFSNDFELTF 166
           +D  G +Y+AD  N  IR+I   T + +T  AG  +  +   D GPA    F+N   L  
Sbjct: 300 MDAAGFLYIADTGNDTIRRIDPSTGM-ITLFAGIPTVADFAGDGGPASAGRFNNPLGLA- 357

Query: 167 VPHICALLISDHGNQLIRQI 186
           +    +L I+D  N  IR I
Sbjct: 358 IDSGGSLYIADQANHRIRMI 377



 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 62/113 (54%), Gaps = 10/113 (8%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
           +  +AG GV  Y +G+ G   +A+  +P + AVD+  N+Y+AD +NH++RK+++    ++
Sbjct: 383 LSTVAGTGVPRY-NGDGGLAVAAQLQQPAAVAVDVARNLYIADSNNHLVRKVSSTTGLIS 441

Query: 137 TIAG--GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +I G  G +   G  +GPA  A  +  + L        L  +D  N  IR+++
Sbjct: 442 SIVGIPGAAAYSGD-NGPANVAKINGPYALAL-DSAGDLYFADLLNNRIRKVS 492



 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 13/79 (16%)

Query: 89  GVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----LGVTTIAGGG 142
           G   YS  +G    A+ + P + A+D  G++Y AD  N+ IRK++N    L  T I    
Sbjct: 448 GAAAYSGDNGPANVAKINGPYALALDSAGDLYFADLLNNRIRKVSNSYATLQYTPI---- 503

Query: 143 SKKEGRADGPAQNASFSND 161
             + GR   P Q+ +F ND
Sbjct: 504 --RVGRTSAP-QSQTFEND 519


>gi|456865078|gb|EMF83443.1| putative lipoprotein [Leptospira weilii serovar Topaz str. LT2116]
          Length = 357

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/119 (32%), Positives = 66/119 (55%), Gaps = 9/119 (7%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
           + G+ +G+  ++ F  P    +D + N+YV +  NH IRKI NL    V+T++GG S   
Sbjct: 163 IDGFQNGDRLNSLFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSETVSTLSGGVS--- 218

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   +T+     +LL++D  +  IR+I+LK    + S+  G+ +GA
Sbjct: 219 GYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKTS--TVSTFLGNGIGA 275



 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 47/94 (50%), Gaps = 2/94 (2%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R    + I LD      +     ++   V+   AG G+    DG   +A F+ 
Sbjct: 22  NPSIFMSRAWLENTI-LDCILKECHLCKLKVTNNPVISLFAGTGINQSVDGTTQTASFNT 80

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           P    +D  GNI+V+D++ ++IRKI + G VTT+
Sbjct: 81  PFGLELDTFGNIFVSDQAANLIRKIDHFGNVTTL 114



 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 70/133 (52%), Gaps = 8/133 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAG 140
           V  L+G GV GY DG+  SA+F  P     D K + + VAD  +H IRKI +L  +T++ 
Sbjct: 210 VSTLSG-GVSGYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKI-DLKTSTVST 267

Query: 141 G-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE-DCSKSSQ 198
             G+      DG   NASF N      + +   + +SD  ++ IR ++  P+ + S  + 
Sbjct: 268 FLGNGIGASIDGKGLNASF-NGPAFISLDNSGYMFVSDTNSKKIRIVD--PDLNVSTITH 324

Query: 199 SGSALGAVSVWVL 211
           + +A+GAV V  L
Sbjct: 325 TFAAIGAVKVDCL 337


>gi|359728310|ref|ZP_09267006.1| hypothetical protein Lwei2_15904 [Leptospira weilii str.
           2006001855]
 gi|417778551|ref|ZP_12426356.1| putative lipoprotein [Leptospira weilii str. 2006001853]
 gi|410781344|gb|EKR65918.1| putative lipoprotein [Leptospira weilii str. 2006001853]
          Length = 357

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/119 (32%), Positives = 66/119 (55%), Gaps = 9/119 (7%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
           + G+ +G+  ++ F  P    +D + N+YV +  NH IRKI NL    V+T++GG S   
Sbjct: 163 IDGFQNGDRLNSLFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSETVSTLSGGVS--- 218

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   +T+     +LL++D  +  IR+I+LK    + S+  G+ +GA
Sbjct: 219 GYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDLKTS--TVSTFLGNGIGA 275



 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 2/94 (2%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R    + I LD      +     ++ + V+   AG G+    DG   +A F+ 
Sbjct: 22  NPSIFMSRAWLENAI-LDCILKECHLCKLKVTNKPVISLFAGTGINQSVDGTTQTASFNT 80

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           P    +D  GNI+V+D++ ++IRKI + G VTT+
Sbjct: 81  PFGLELDTFGNIFVSDQAANLIRKIDHFGNVTTL 114



 Score = 44.7 bits (104), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 70/133 (52%), Gaps = 8/133 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAG 140
           V  L+G GV GY DG+  SA+F  P     D K + + VAD  +H IRKI +L  +T++ 
Sbjct: 210 VSTLSG-GVSGYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKI-DLKTSTVST 267

Query: 141 G-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE-DCSKSSQ 198
             G+      DG   NASF N      + +   + +SD  ++ IR ++  P+ + S  + 
Sbjct: 268 FLGNGIGASIDGKGLNASF-NGPAFISLDNSGYMFVSDTNSKKIRIVD--PDLNVSTITH 324

Query: 199 SGSALGAVSVWVL 211
           + +A+GAV +  L
Sbjct: 325 TFAAIGAVKIDCL 337


>gi|425734502|ref|ZP_18852820.1| NHL repeat-containing protein [Brevibacterium casei S18]
 gi|425481116|gb|EKU48277.1| NHL repeat-containing protein [Brevibacterium casei S18]
          Length = 666

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 45/155 (29%), Positives = 74/155 (47%), Gaps = 26/155 (16%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV---D 112
           S +L+V DS   +   + +    ++V++R+ G G +G +DG+  SA+F +P    V   D
Sbjct: 208 SGNLLVADSGHHSL--VEYTPDGQTVLRRI-GTGERGLTDGDFASAQFSEPGGITVLPED 264

Query: 113 MKG----NIYVADKSNHVIRKITNLG--VTTIAGGGSKKE--------------GRADGP 152
           +       + VAD  NHV+R I   G  V TIAG G +                GR  GP
Sbjct: 265 IAARAGYQLVVADTVNHVLRGIDLDGETVRTIAGTGEQHMVGAIDNVRGTHGALGRYSGP 324

Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           A +   S+ +++ FVP    ++++  GN  I   +
Sbjct: 325 ALDVKLSSPWDVLFVPATGEVVVAMAGNHTIWSFD 359



 Score = 41.2 bits (95), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 36/151 (23%), Positives = 68/151 (45%), Gaps = 6/151 (3%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           +P  V+  P + +++V  +     ++   P++ +  ++ L+G   +G  +G+  SA F +
Sbjct: 332 SPWDVLFVPATGEVVVAMAGNHTIWSFD-PVTGD--IRLLSGTMNEGLVNGDAESAWFAQ 388

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSNDFE 163
                +   G + VAD     IR++       TT+ G G    G  DGPA  A   +   
Sbjct: 389 SSGLDLHPDGGVIVADSETSAIRRLDPASGEATTLVGTGLFDFGFRDGPAAEARLQHPLG 448

Query: 164 LTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           +  +P   +L I+D  N  IR+ +    + S
Sbjct: 449 VRTLPD-GSLAIADTYNGAIRRYDFTTNEVS 478



 Score = 40.4 bits (93), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 47/101 (46%), Gaps = 6/101 (5%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P    V   GN+ VAD  +H + + T  G T +   G+ + G  DG   +A FS    +T
Sbjct: 200 PGKVTVLPSGNLLVADSGHHSLVEYTPDGQTVLRRIGTGERGLTDGDFASAQFSEPGGIT 259

Query: 166 FVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSG 200
            +P   A      L+++D  N ++R I+L  E     + +G
Sbjct: 260 VLPEDIAARAGYQLVVADTVNHVLRGIDLDGETVRTIAGTG 300


>gi|156399714|ref|XP_001638646.1| predicted protein [Nematostella vectensis]
 gi|156225768|gb|EDO46583.1| predicted protein [Nematostella vectensis]
          Length = 286

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 57/110 (51%), Gaps = 6/110 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT--NLGVTTI 138
           V   AG+G +G  DGE   A+F  P+S ++D +G+ ++V D  NH IR I+  +  V T+
Sbjct: 131 VTTFAGNGSEGLVDGERTRAQFKGPQSLSLDDEGDRLFVGDTDNHAIRVISLKDGSVQTL 190

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            GG     G  DG    + F +   + +      L +SDH N +IR + +
Sbjct: 191 VGG---SLGFKDGVGLKSKFYHPTGIAYDRENDILYVSDHYNHVIRAVKV 237



 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 48/106 (45%), Gaps = 7/106 (6%)

Query: 102 RFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASF 158
           R +     A+D +  + Y+AD  N  IRK+      VTT AG GS  EG  DG    A F
Sbjct: 95  RINPNTGMAIDTRKRVMYLADVGNSRIRKVDMATAEVTTFAGNGS--EGLVDGERTRAQF 152

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
                L+       L + D  N  IR I+LK  D S  +  G +LG
Sbjct: 153 KGPQSLSLDDEGDRLFVGDTDNHAIRVISLK--DGSVQTLVGGSLG 196


>gi|359777541|ref|ZP_09280822.1| hypothetical protein ARGLB_073_01730 [Arthrobacter globiformis NBRC
           12137]
 gi|359305319|dbj|GAB14651.1| hypothetical protein ARGLB_073_01730 [Arthrobacter globiformis NBRC
           12137]
          Length = 666

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/166 (30%), Positives = 77/166 (46%), Gaps = 10/166 (6%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDL-IVLDSSRSAFYTLSF-PLSEESVVKRLA 86
           ED    T  +  H L++   S  D   SS L  V+ +        S+ PL+ +  V  LA
Sbjct: 317 EDAAGFTGRLGDHPLDVALSSPWDVVWSSKLNAVVIAMAGVHQIFSYEPLTGD--VSILA 374

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTI---AGG 141
           G+G++G  DG    A F +P   A D  GNI+VAD     +RK  I++ G  T+    G 
Sbjct: 375 GNGLEGLLDGPAHEAWFAQPSGLAEDADGNIWVADSETSALRKLVISDDGAVTVESAVGK 434

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G    G  DG A  A   +   +T +P   ++ I+D  N  +R+ +
Sbjct: 435 GLFDFGFRDGEASEARLQHPLGVTVLPD-GSVAIADTYNGAVRRYD 479



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 9/68 (13%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSNHVIRKIT--NLGVTT 137
           G G +G++DG  G ARF++P+         A     ++ +AD  NH +R ++  +  VTT
Sbjct: 239 GSGEKGHADGAAGIARFNEPQGLVLLPEDVAAKTGYDVVIADSVNHRLRGLSLADGSVTT 298

Query: 138 IAGGGSKK 145
           +AG G ++
Sbjct: 299 LAGSGVQR 306


>gi|351729451|ref|ZP_08947142.1| NHL repeat-containing protein [Acidovorax radicis N35]
          Length = 663

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/66 (42%), Positives = 35/66 (53%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP 152
           Y D    +ARF+ P++ AVD  GN YVAD  NH++RKI   G  T   G     G ADG 
Sbjct: 48  YLDAAGTAARFNAPQAVAVDTAGNTYVADTGNHLVRKIDASGNVTTLAGQEGVCGNADGT 107

Query: 153 AQNASF 158
              A+ 
Sbjct: 108 GTAATL 113



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 35/95 (36%), Positives = 47/95 (49%), Gaps = 9/95 (9%)

Query: 74  FPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH------- 125
           F ++    V   AG+   +G++DG  GSA F    S A+D  GN+YV D   H       
Sbjct: 480 FKINPTGAVSLFAGNPNARGHADGPAGSALFAGLGSMALDPAGNLYVVDGVYHFITGVGP 539

Query: 126 VIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
            IRKIT  G V+T+AG  +   G  DGP   A  +
Sbjct: 540 TIRKITPAGMVSTLAGRANVPPGLVDGPVSQAQLT 574



 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 50/99 (50%), Gaps = 7/99 (7%)

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG-SKKEGRADGP-----AQNASFSND 161
            A+   G++ +AD  NH IR+++  G +TT AGGG ++ +  + GP     A  A+  N 
Sbjct: 1   MALTPTGDVLIADPGNHTIRRLSPTGQLTTFAGGGPTRSDASSPGPRYLDAAGTAARFNA 60

Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
            +   V       ++D GN L+R+I+      + + Q G
Sbjct: 61  PQAVAVDTAGNTYVADTGNHLVRKIDASGNVTTLAGQEG 99



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 35/74 (47%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G++DG   +ARFD P + A+   G +YV D++N  IR I   G+    G       +  G
Sbjct: 341 GWADGSGSTARFDGPGALAMGKSGTLYVRDRNNAAIRTIQPDGMVGTLGRVKNPCSQITG 400

Query: 152 PAQNASFSNDFELT 165
           P    +F     LT
Sbjct: 401 PINEVAFCGPEALT 414



 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 35/68 (51%), Gaps = 12/68 (17%)

Query: 82  VKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV-----------ADKSNHVIRK 129
           V  LAG +GV G +DG   +A    P S AVD  GN+YV           A+ + + IRK
Sbjct: 91  VTTLAGQEGVCGNADGTGTAATLCSPTSIAVDKDGNVYVSEWAPLTQTLPAEPTGNPIRK 150

Query: 130 ITNLGVTT 137
           IT  GV +
Sbjct: 151 ITPAGVVS 158


>gi|254524393|ref|ZP_05136448.1| NHL repeat protein [Stenotrophomonas sp. SKA14]
 gi|219721984|gb|EED40509.1| NHL repeat protein [Stenotrophomonas sp. SKA14]
          Length = 470

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/147 (30%), Positives = 65/147 (44%), Gaps = 11/147 (7%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S+ L + S+  R AG G     DG    A F +P + AV ++  +YVAD     IR +  
Sbjct: 301 SYHLGQRSLQWR-AGSGAIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358

Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
            G  V T+ G G  + G  DGP   A       +   P    L I+D GN  +R + L  
Sbjct: 359 RGDLVQTLVGQGPWRFGNEDGPRAQARLQFPQAIALSPDAPLLWIADSGNGRLRTLRLGG 418

Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
                +P        +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLQGPAGLAVGAGAVWI 445



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 12/103 (11%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
           + R  G G   + DG    A F +P++  ++ +  +YVAD  NH +R+I  L G+     
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDALYVADTGNHAVRRINLLTGIVDTLC 255

Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
           G  +     +GP AQ    S D       H   L I+D  NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPIGLAIAD--NQL 289


>gi|421094122|ref|ZP_15554843.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
 gi|410363263|gb|EKP14295.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
 gi|456886734|gb|EMF97860.1| putative lipoprotein [Leptospira borgpetersenii str. 200701203]
          Length = 358

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 59/109 (54%), Gaps = 7/109 (6%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKK 145
           G+ G+ +G+  ++ F  P    +D + N+YV +  NH IRKI NL    V+T++GG S  
Sbjct: 163 GIGGFQNGDRLNSLFKGPLFMDLDRERNLYVGELGNHTIRKI-NLNSETVSTLSGGVS-- 219

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
            G  DG   +A F +   + +     +LL++D  N  IR+I+LK    S
Sbjct: 220 -GYLDGDLTSAQFKSPSGIIYDQKTDSLLVADLQNHRIRKIDLKTSTVS 267



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 1/81 (1%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
           ++LD      +T    ++   VV   AG G++   DG   +A F  P S  +D  GNI+V
Sbjct: 35  MILDCILKECHTCKLKVTNNPVVSPFAGTGIRESIDGTTQTASFKTPFSLELDTFGNIFV 94

Query: 120 ADKSNHVIRKITNLG-VTTIA 139
           +D  +++IRKI   G VTT++
Sbjct: 95  SDSDSNLIRKIDRFGNVTTLS 115



 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 42/80 (52%), Gaps = 6/80 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G GV GY DG+  SA+F  P     D K + + VAD  NH IRKI      V+T+
Sbjct: 211 VSTLSG-GVSGYLDGDLTSAQFKSPSGIIYDQKTDSLLVADLQNHRIRKIDLKTSTVSTL 269

Query: 139 AGGGSKKEGRADGPAQNASF 158
            G G   E   DG   NASF
Sbjct: 270 LGNG--IEASIDGKGLNASF 287



 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 1/62 (1%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVT 136
           + S V  L G+G++   DG+  +A FD P   ++D  G ++V+D S++ IR +  +L V 
Sbjct: 262 KTSTVSTLLGNGIEASIDGKGLNASFDGPAFISLDNSGYMFVSDTSSNKIRIVDPDLNVF 321

Query: 137 TI 138
           TI
Sbjct: 322 TI 323


>gi|421091013|ref|ZP_15551797.1| putative lipoprotein [Leptospira kirschneri str. 200802841]
 gi|410000210|gb|EKO50873.1| putative lipoprotein [Leptospira kirschneri str. 200802841]
          Length = 358

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 65/119 (54%), Gaps = 9/119 (7%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
           + G  +G+  ++ FD P    +D++ N+YV + SNH IRKI NL    V+T +GG S   
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSGMVSTFSGGIS--- 219

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   + +     +LL +D  +  IR+I+LK    + S+  G+ +GA
Sbjct: 220 GYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKIDLKTS--TVSTLLGNGIGA 276



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++++ VV   AG GV    DG   +A F  P    VD  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95

Query: 121 DKSNHVIRKI 130
           D+  ++IRKI
Sbjct: 96  DQMTNLIRKI 105


>gi|429202583|ref|ZP_19193964.1| antioxidant, AhpC/TSA family [Streptomyces ipomoeae 91-03]
 gi|428661888|gb|EKX61363.1| antioxidant, AhpC/TSA family [Streptomyces ipomoeae 91-03]
          Length = 619

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 50/175 (28%), Positives = 83/175 (47%), Gaps = 20/175 (11%)

Query: 39  DGHQLEINPHSVIDR-PGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
           DG  +   P   + R PG +      D +V D++R     L+     E+VV+R  G G +
Sbjct: 170 DGPYVPPEPDPTVLRFPGKALLLPDGDFLVSDTTRHRLVRLA--PDGETVVRRY-GTGER 226

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG-VTTIAGGGSK--KEG 147
           G+ DG    ARF +P+  A+   G++ VAD  NH +R++  + G V T+AG G +  +  
Sbjct: 227 GFVDGPADHARFSEPQGLALLDSGDVVVADTVNHALRRLDPDTGHVATLAGTGRQWWQGS 286

Query: 148 RADGPAQNASFSNDFELT-FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
              GPA     S+ +++  F   +   +   H  QL       P DC+ +  +G+
Sbjct: 287 PTSGPAWQIDLSSPWDVAVFGGKVWIAMAGVH--QL---WTYDPADCTVAVAAGT 336


>gi|348175195|ref|ZP_08882089.1| NHL repeat-containing protein [Saccharopolyspora spinosa NRRL
           18395]
          Length = 630

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/147 (29%), Positives = 77/147 (52%), Gaps = 16/147 (10%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  VI  PG  +L+V DS+  +   + F    E+VV+R+ G G +G +DG   +A F +P
Sbjct: 194 PGKVIALPGG-NLLVSDSANHSL--VEFAADGETVVRRI-GTGQRGRADGV--AAEFAEP 247

Query: 107 KSFAV-------DMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSK-KEGRADGPAQNA 156
              A+        +  ++ VAD  NH++R  ++ +  VTT+AG G + ++G   GPA   
Sbjct: 248 AGLALLPQRVAEQVGYDVVVADTVNHLLRGVRLVDGQVTTLAGTGEQWRDGSDSGPALET 307

Query: 157 SFSNDFELTFVPHICALLISDHGNQLI 183
             ++ ++L +      ++I+  GN  +
Sbjct: 308 PLTSPWDLAWWEPAGGVVIAMAGNHTL 334



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 41/80 (51%), Gaps = 8/80 (10%)

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-- 172
           GN+ V+D +NH + +    G T +   G+ + GRADG A  A F+    L  +P   A  
Sbjct: 203 GNLLVSDSANHSLVEFAADGETVVRRIGTGQRGRADGVA--AEFAEPAGLALLPQRVAEQ 260

Query: 173 ----LLISDHGNQLIRQINL 188
               ++++D  N L+R + L
Sbjct: 261 VGYDVVVADTVNHLLRGVRL 280



 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 46/108 (42%), Gaps = 6/108 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT----NLGVTT 137
           V+R AG  V+G  DG    A F +    A D    +++AD     +R +        V T
Sbjct: 344 VRRFAGTTVEGLRDGAADEAFFAQTSGLA-DGGDRLWLADSETSALRWVEAADRGFAVRT 402

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
             G G    G ADG A  A F +   +  +P   ++ + D  N  +R+
Sbjct: 403 AVGTGLFDFGHADGTADKALFQHPLGVAVLPD-GSVAVCDTYNGAVRR 449


>gi|322788140|gb|EFZ13922.1| hypothetical protein SINV_02735 [Solenopsis invicta]
          Length = 717

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 44/153 (28%), Positives = 72/153 (47%), Gaps = 20/153 (13%)

Query: 49  SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKS 108
           +V+     + L++ DS  +     +    E   V+ + G   QG+ DG+  +ARF+ P+ 
Sbjct: 255 TVLQSEQGTKLVISDSGNNRIVITN----EHGRVEHVIGGCSQGFKDGDFKNARFNSPQG 310

Query: 109 FAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADG----------PAQNA 156
               +   IY+AD +NH IRKI  +   V+TIAG G +   R  G          P   A
Sbjct: 311 VCA-LNNTIYIADNNNHAIRKINLSEKTVSTIAGTGLQGCDRNGGGHGTDQALSSPWDVA 369

Query: 157 SFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            + ++++ T VP    LLI+  G   I  + L+
Sbjct: 370 IYHHEYKGTSVP---VLLIAIAGTHQIWALFLE 399


>gi|116625316|ref|YP_827472.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116228478|gb|ABJ87187.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 981

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/68 (42%), Positives = 42/68 (61%), Gaps = 3/68 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ + ++  +AG+G +G+S DG P +A  F  P   AVD  GN+YV D SN  +RKI   
Sbjct: 226 VTPDGIITTIAGNGTRGFSGDGGPATAATFRGPIGLAVDAFGNVYVTDNSNGRVRKIDAA 285

Query: 134 G-VTTIAG 140
           G +TT AG
Sbjct: 286 GTITTYAG 293



 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S   ++   AG+G  G++   G   SA+ + P   A+D  GN+Y+ D SN+ +RK+T  
Sbjct: 116 ISAAGIISTYAGNGTAGFAGDSGAATSAQLNGPTDVAIDGNGNLYICDSSNNRVRKVTPG 175

Query: 134 G-VTTIAGGG 142
           G +TT AG G
Sbjct: 176 GTITTFAGNG 185



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 51/101 (50%), Gaps = 10/101 (9%)

Query: 88  DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKE 146
           DGVQ  +   P       P   AVD +GN+Y+   S   +RK+T  G+ TTIAG G++  
Sbjct: 192 DGVQAATTAVP------SPAGIAVDAQGNVYI--SSAVRVRKVTPDGIITTIAGNGTRGF 243

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
               GPA  A+F     L  V     + ++D+ N  +R+I+
Sbjct: 244 SGDGGPATAATFRGPIGLA-VDAFGNVYVTDNSNGRVRKID 283



 Score = 41.2 bits (95), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 8/116 (6%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPG-----SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
               +  +AG+G  GY+ DG P      +         AVD  GN Y AD SN+VIRKI+
Sbjct: 58  SSGTLSTIAGNGNFGYAGDGGPAINATLNLGAGGLAGLAVDGLGNAYFADSSNNVIRKIS 117

Query: 132 NLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             G ++T AG G+       G A +A  +   ++  +     L I D  N  +R++
Sbjct: 118 AAGIISTYAGNGTAGFAGDSGAATSAQLNGPTDVA-IDGNGNLYICDSSNNRVRKV 172



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 3/81 (3%)

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
            A+D +G +Y+ D +   +RKI++  ++TIAG G+       GPA NA+ +         
Sbjct: 36  LAIDGQGTVYLTDSATARVRKISSGTLSTIAGNGNFGYAGDGGPAINATLNLGAGGLAGL 95

Query: 169 HICAL---LISDHGNQLIRQI 186
            +  L     +D  N +IR+I
Sbjct: 96  AVDGLGNAYFADSSNNVIRKI 116


>gi|153806007|ref|ZP_01958675.1| hypothetical protein BACCAC_00252 [Bacteroides caccae ATCC 43185]
 gi|149130684|gb|EDM21890.1| IPT/TIG domain protein [Bacteroides caccae ATCC 43185]
          Length = 440

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/50 (54%), Positives = 31/50 (62%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           VV+ +AG    GY D   GSA F  P+  AVD  G IYV DK N+VIRKI
Sbjct: 388 VVRLVAGGTQAGYVDANAGSAAFSGPQDLAVDKNGVIYVYDKKNNVIRKI 437



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 53/120 (44%), Gaps = 15/120 (12%)

Query: 55  GSSDLIVLDSSRSAFYTLS-FPLSEESVVKRLAG--DGVQGYSDGEPGSARF-------- 103
           G  +L +  ++R    T+  F  S  +VV  LAG  +G  GY DG    A F        
Sbjct: 98  GDGELTITIANREPVRTIEKFTYSFSAVVTTLAGSANGEPGYQDGVGSEALFFFDAAKAE 157

Query: 104 ----DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
                K  S  VD  GN+YV D  N+ +RKIT  G  T   G +  +G  DG    A F+
Sbjct: 158 PAEDWKKGSVCVDDDGNVYVGDCVNYCVRKITPDGTVTTLAGLAGNKGCIDGTGVQARFN 217



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 13/114 (11%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA--------DKSNHVIR--KITN 132
           + L G G  GY +G    A F  P   A+D  G+IYVA        +  +  IR   +TN
Sbjct: 327 ENLIGKGTSGYLNGSFEDALFTFPSDLAIDSNGDIYVAGNGAWDGGENLDQSIRLLDMTN 386

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             V  +AGG   + G  D  A +A+FS   +L  V     + + D  N +IR+I
Sbjct: 387 RVVRLVAGG--TQAGYVDANAGSAAFSGPQDLA-VDKNGVIYVYDKKNNVIRKI 437


>gi|398339828|ref|ZP_10524531.1| hypothetical protein LkirsB1_10395 [Leptospira kirschneri serovar
           Bim str. 1051]
          Length = 358

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 65/119 (54%), Gaps = 9/119 (7%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
           + G  +G+  ++ FD P    +D++ N+YV + SNH IRKI NL    V+T +GG S   
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSGMVSTFSGGIS--- 219

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   + +     +LL +D  +  IR+I+LK    + S+  G+ +GA
Sbjct: 220 GYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKIDLKTS--TVSTLLGNGIGA 276



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++++ VV   AG GV    DG   +A F  P    VD  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95

Query: 121 DKSNHVIRKI 130
           D+  ++IRKI
Sbjct: 96  DQMTNLIRKI 105


>gi|423298428|ref|ZP_17276486.1| hypothetical protein HMPREF1070_05151 [Bacteroides ovatus
           CL03T12C18]
 gi|392663340|gb|EIY56891.1| hypothetical protein HMPREF1070_05151 [Bacteroides ovatus
           CL03T12C18]
          Length = 507

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 36/67 (53%), Gaps = 2/67 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY DG    ARF + +    D  GNIY+ D+ NH IR + N GV+T+ GG     G  +G
Sbjct: 413 GYWDGIFAEARFLRGEQMCSDDAGNIYIVDQDNHAIRMVNNTGVSTVIGGNGS--GAVNG 470

Query: 152 PAQNASF 158
             ++A  
Sbjct: 471 VGKDAKL 477


>gi|158520859|ref|YP_001528729.1| YD repeat-containing protein [Desulfococcus oleovorans Hxd3]
 gi|158509685|gb|ABW66652.1| YD repeat protein [Desulfococcus oleovorans Hxd3]
          Length = 2831

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 54/116 (46%), Gaps = 7/116 (6%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
            +S + +++  AG GV     G+ G    AR   P   AVD  GN+++AD  N  IRK+  
Sbjct: 1629 ISPDGIIEAFAGMGVDAGYSGDGGLAVDARLQSPTGLAVDKTGNLFIADSGNFSIRKVDP 1688

Query: 133  LGVTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             GV T   GG+      DG PA +A   +  E+T        L    G   IR+IN
Sbjct: 1689 KGVITTIAGGNGPGYSGDGWPAVDAQLQSISEITLDSSGNLYLT---GYDHIRKIN 1741



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 52/110 (47%), Gaps = 9/110 (8%)

Query: 82   VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
            +K  AGDG      G    AR   P+   +D  GNIY+AD  NH +R+I+  G +   AG
Sbjct: 1587 MKVFAGDG------GPAIEARLYHPQGMEIDASGNIYIADTDNHCVRRISPDGIIEAFAG 1640

Query: 141  GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             G       DG  A +A   +   L  V     L I+D GN  IR+++ K
Sbjct: 1641 MGVDAGYSGDGGLAVDARLQSPTGLA-VDKTGNLFIADSGNFSIRKVDPK 1689


>gi|423219560|ref|ZP_17206056.1| hypothetical protein HMPREF1061_02829 [Bacteroides caccae
           CL03T12C61]
 gi|392624765|gb|EIY18843.1| hypothetical protein HMPREF1061_02829 [Bacteroides caccae
           CL03T12C61]
          Length = 436

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/50 (54%), Positives = 31/50 (62%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           VV+ +AG    GY D   GSA F  P+  AVD  G IYV DK N+VIRKI
Sbjct: 384 VVRLVAGGTQAGYVDANAGSAAFSGPQDLAVDKNGVIYVYDKKNNVIRKI 433



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 53/120 (44%), Gaps = 15/120 (12%)

Query: 55  GSSDLIVLDSSRSAFYTL-SFPLSEESVVKRLAG--DGVQGYSDGEPGSARF-------- 103
           G  +L +  ++R    T+  F  S  +VV  LAG  +G  GY DG    A F        
Sbjct: 94  GDGELTITIANREPVRTIDKFTYSFSAVVTTLAGSANGEPGYQDGVGSEALFFFDAAKAE 153

Query: 104 ----DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
                K  S  VD  GN+YV D  N+ +RKIT  G  T   G +  +G  DG    A F+
Sbjct: 154 PAEDWKKGSVCVDDDGNVYVGDCVNYCVRKITPDGTVTTLAGLAGNKGCIDGTGVQARFN 213



 Score = 42.4 bits (98), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 13/114 (11%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA--------DKSNHVIR--KITN 132
           + L G G  GY +G    A F  P   A+D  G+IYVA        +  +  IR   +TN
Sbjct: 323 ENLIGKGTSGYLNGSFEDALFTFPSDLAIDSNGDIYVAGNGAWDGGENLDQSIRLLDMTN 382

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             V  +AGG   + G  D  A +A+FS   +L  V     + + D  N +IR+I
Sbjct: 383 RVVRLVAGG--TQAGYVDANAGSAAFSGPQDLA-VDKNGVIYVYDKKNNVIRKI 433


>gi|430741385|ref|YP_007200514.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
 gi|430013105|gb|AGA24819.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
          Length = 361

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 44/118 (37%), Positives = 62/118 (52%), Gaps = 4/118 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TN 132
           + ++  V+ +AG G  G S DG P + A F+   S AV   G+I V+D  N+ +RKI   
Sbjct: 73  IDKDGRVQTVAGTGKNGDSGDGGPATKAEFNGMHSLAVMKNGDILVSDTWNNRVRKIDAR 132

Query: 133 LGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            GV TTIAG G K      GPA  A F   + +       AL ++D  N+ IR ++LK
Sbjct: 133 TGVITTIAGTGKKGFSGDGGPATAAEFGGIYCIALDEPGQALYLADLDNRRIRAVDLK 190



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 71/147 (48%), Gaps = 12/147 (8%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SAR 102
           I+P +V    G  ++ +LD   +A   +         ++ L G G +G S DG PG  A+
Sbjct: 218 IDPRAVA-TDGKGNVYILDRGGNALRVVD----RSGKIRTLVGTGEKGASGDGGPGRQAK 272

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSK-KEGRADGPAQNASFS 159
            + PK   V+ +G++ +AD  NH IR  + +   + ++AG G K   G    PA+ A F+
Sbjct: 273 LNGPKHLCVNPQGDVIIADTENHRIRLYRPSEGTIESLAGTGRKGTAGVGKAPAE-AEFN 331

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
               +T  P    L ISD  N  I ++
Sbjct: 332 QPHGVTLAPD-GVLFISDSSNHRILKL 357



 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 43/157 (27%), Positives = 73/157 (46%), Gaps = 12/157 (7%)

Query: 50  VIDRPGSS-DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDG-EPGSARFDKP 106
            +D PG +  L  LD+ R     L     +  +V  LAG+G +G   DG +  ++    P
Sbjct: 166 ALDEPGQALYLADLDNRRIRAVDL-----KTGLVSTLAGNGHKGIPQDGADARTSPMIDP 220

Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           ++ A D KGN+Y+ D+  + +R +   G + T+ G G K      GP + A  +    L 
Sbjct: 221 RAVATDGKGNVYILDRGGNALRVVDRSGKIRTLVGTGEKGASGDGGPGRQAKLNGPKHLC 280

Query: 166 FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
             P    ++I+D  N  IR    +P + +  S +G+ 
Sbjct: 281 VNPQ-GDVIIADTENHRIRL--YRPSEGTIESLAGTG 314


>gi|421129432|ref|ZP_15589632.1| putative lipoprotein [Leptospira kirschneri str. 2008720114]
 gi|410358807|gb|EKP05916.1| putative lipoprotein [Leptospira kirschneri str. 2008720114]
          Length = 358

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 65/119 (54%), Gaps = 9/119 (7%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
           + G  +G+  ++ FD P    +D++ N+YV + SNH IRKI NL    V+T +GG S   
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSGMVSTFSGGIS--- 219

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   + +     +LL +D  +  IR+I+LK    + S+  G+ +GA
Sbjct: 220 GYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKIDLKTS--TVSTLLGNGIGA 276



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 37/70 (52%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++++ VV   AG GV    DG   +A F  P    VD  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95

Query: 121 DKSNHVIRKI 130
           D+  ++IRKI
Sbjct: 96  DQMTNLIRKI 105


>gi|290975027|ref|XP_002670245.1| predicted protein [Naegleria gruberi]
 gi|284083802|gb|EFC37501.1| predicted protein [Naegleria gruberi]
          Length = 1363

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/165 (32%), Positives = 78/165 (47%), Gaps = 18/165 (10%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
           L   Y +TTV    +   +P   I +  + D+IV D+   +   +S+      V+ R+AG
Sbjct: 393 LPSKYYITTVATDFK---DPQK-IAKLSNGDIIVSDTGDHSIKKISY---STGVISRIAG 445

Query: 88  DGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG---G 141
            GV G+S DG   S A+ +KP   A+     IY+AD  NH IR +  N  ++T+AG   G
Sbjct: 446 TGVAGFSGDGGLASQAQLNKPYGIAITANDEIYIADNLNHRIRFVDVNGNISTVAGTSIG 505

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            S   G A     NA    D  L+       L I+D  N  IR++
Sbjct: 506 FSGDSGLATAAKLNAPM--DVSLSASGD---LYIADRDNYRIRRV 545



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 22/88 (25%), Positives = 48/88 (54%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           +A+ + P   ++   G++Y+AD+ N+ IR++++  ++T AG G        G A +A+ S
Sbjct: 515 AAKLNAPMDVSLSASGDLYIADRDNYRIRRVSSGVISTFAGNGQSGYSGDGGQATSAALS 574

Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
             + +  +     + ISD  N  +R+++
Sbjct: 575 QAYGVKVING--EVYISDSNNYKVRKVD 600



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 57/109 (52%), Gaps = 5/109 (4%)

Query: 81  VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           V+   AG+G  GYS   G+  SA   +     V + G +Y++D +N+ +RK+   G +TT
Sbjct: 549 VISTFAGNGQSGYSGDGGQATSAALSQAYGVKV-INGEVYISDSNNYKVRKVDGSGIITT 607

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           IAG G+         A +++ ++  ++  +P    ++I+D  N  IR +
Sbjct: 608 IAGSGAAPFNGDGLMATSSNMNHPTDVALLPS-GEMIIADTDNYRIRMV 655



 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 57/121 (47%), Gaps = 6/121 (4%)

Query: 12  ITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYT 71
           + ++++L  +V+   L      ++T   G+    +P   + R  + D++V D+   A   
Sbjct: 13  VVMMAMLVYMVNGAPLNTYSVKISTFAGGNGEFKDPQK-LARLSNGDILVSDALGHAIKK 71

Query: 72  LSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           ++       V+  +AG+G  G+   +G   +A  +KP   AV     IY AD  N+ IRK
Sbjct: 72  INV---TSGVITTIAGNGTAGFGGDNGPAVNALVNKPFGIAVSQTDEIYFADSGNNRIRK 128

Query: 130 I 130
           I
Sbjct: 129 I 129


>gi|290984233|ref|XP_002674832.1| predicted protein [Naegleria gruberi]
 gi|284088424|gb|EFC42088.1| predicted protein [Naegleria gruberi]
          Length = 2158

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 40/121 (33%), Positives = 57/121 (47%), Gaps = 5/121 (4%)

Query: 69  FYTLSFPLSEESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           FY       E   V  +AG G   Y+ DG +  +A+ ++P    V     IY+ DK NH 
Sbjct: 169 FYNHKVKKVESGTVTTIAGTGTANYNGDGIQASTAQLNEPHGVFV-RNDKIYIVDKKNHR 227

Query: 127 IRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           IR I   G ++T+AG G+   G  D  A  +S   D    +V     + ISDH N  IR+
Sbjct: 228 IRMIQPDGKISTVAGNGTLGFG-PDNVAAKSSMLQDPVFVYVTQDEVIYISDHENNRIRR 286

Query: 186 I 186
           +
Sbjct: 287 V 287



 Score = 43.5 bits (101), Expect = 0.098,   Method: Composition-based stats.
 Identities = 34/100 (34%), Positives = 47/100 (47%), Gaps = 4/100 (4%)

Query: 89  GVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           GV G  D  G  G  + D P +  V   G  Y+AD  NH ++K+ +  VTTIAG G+   
Sbjct: 134 GVPGCGDAIGATGVTQLDHPPAIFVPYTGLYYIADFYNHKVKKVESGTVTTIAGTGTANY 193

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
              DG   + +  N+    FV +   + I D  N  IR I
Sbjct: 194 -NGDGIQASTAQLNEPHGVFVRND-KIYIVDKKNHRIRMI 231


>gi|374988208|ref|YP_004963703.1| hypothetical protein SBI_05452 [Streptomyces bingchenggensis BCW-1]
 gi|297158860|gb|ADI08572.1| hypothetical protein SBI_05452 [Streptomyces bingchenggensis BCW-1]
          Length = 609

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 61/117 (52%), Gaps = 8/117 (6%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
           PG +  +V D++R     L+     ESV++R+ G G +G  DG    ARF +P+  A+  
Sbjct: 183 PGGT-FLVSDTTRHQLVELA--ADGESVLRRI-GSGERGLVDGPADRARFSEPQGLALLP 238

Query: 114 KGNIYVADKSNHVIRK--ITNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
            G + VAD  NH +R   +    V T+AG G +  +   A GPA+    S+ +++ +
Sbjct: 239 DGTVIVADTVNHALRSLDLATGEVRTVAGTGRQWWQGSPAAGPAREVDLSSPWDVAW 295



 Score = 38.5 bits (88), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 50/121 (41%), Gaps = 1/121 (0%)

Query: 68  AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           A  TL   L  E   K     G   Y   EP +     P    +   G   V+D + H +
Sbjct: 138 ALETLVAELEAEHGAKGTLRRGDGPYVPPEPVATELRFPGKALLLPGGTFLVSDTTRHQL 197

Query: 128 RKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            ++   G + +   GS + G  DGPA  A FS    L  +P    ++++D  N  +R ++
Sbjct: 198 VELAADGESVLRRIGSGERGLVDGPADRARFSEPQGLALLPD-GTVIVADTVNHALRSLD 256

Query: 188 L 188
           L
Sbjct: 257 L 257


>gi|347756512|ref|YP_004864075.1| Thiol-disulfide isomerase / thioredoxin [Candidatus
           Chloracidobacterium thermophilum B]
 gi|347589029|gb|AEP13558.1| Thiol-disulfide isomerase / thioredoxin [Candidatus
           Chloracidobacterium thermophilum B]
          Length = 676

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 53/106 (50%), Gaps = 9/106 (8%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLG---VTTIAGGG 142
           G G  G  +G    A F  P+  A+D  G+ +YVAD  NH++R++ NL    V T+AG G
Sbjct: 255 GSGAPGTRNGPYNFAEFQYPQGMALD--GDFLYVADTGNHLLRRV-NLKTRMVETVAGTG 311

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
             +  R  GP  +   S+ ++L    H   L I+  G   I Q NL
Sbjct: 312 KNERSRQTGPGTSVGLSSPWDLAI--HQRTLFIAMAGAHQIWQYNL 355



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 56/125 (44%), Gaps = 5/125 (4%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGV 135
           +  VV   AG G +G  DG   +A F +P   + D K  +YVAD     +R I      V
Sbjct: 356 DTGVVGPYAGTGAEGRQDGTLETAVFAQPSGLSTDGK-RLYVADSEISAVRAIDLATGQV 414

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
           TT+AGG     G  +G  +NA F +   +        L ++D  N  +R I+L+    S 
Sbjct: 415 TTLAGGDLFDFGDTNGKGENARFQHPLGVAAAER--KLYVADTYNHKLRTIDLRTRFVSN 472

Query: 196 SSQSG 200
              SG
Sbjct: 473 LIGSG 477


>gi|290995458|ref|XP_002680312.1| predicted protein [Naegleria gruberi]
 gi|284093932|gb|EFC47568.1| predicted protein [Naegleria gruberi]
          Length = 744

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 46/88 (52%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
           +A F  P S  V   G +Y+AD  NH IRK+ TN  + TIAG G       +GPA NA  
Sbjct: 26  NANFKGPSSVFVSSNGEVYIADIGNHKIRKVLTNGKMVTIAGNGQNGFSGDNGPATNAML 85

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           +    + FV     + I+D+ N  IR+I
Sbjct: 86  NYPIGV-FVSPYNEVYIADYFNNRIRKI 112



 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 77/155 (49%), Gaps = 16/155 (10%)

Query: 41  HQLEINPHSVIDRPG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS 94
           ++L IN  ++++RP       + ++ + D +R+    +    S  ++V  +AG+G +GY 
Sbjct: 133 NRLAIN--AMLNRPSGVFVTLNDEVYIADQNRNKIRKIE---SNGNIVT-IAGNGKEGYG 186

Query: 95  DGE-PG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADG 151
           D   P  +A  ++P S  V     +Y+AD  NH IRKI  N  + TIAG G       + 
Sbjct: 187 DDNVPAVNAMLNRPSSVFVTSNNEVYIADTGNHRIRKILRNGNIVTIAGNGLVGFHEDNI 246

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            A N S  N  +  FV     + I+D GN  IR+I
Sbjct: 247 LATNTSL-NFPKSVFVSSNGEVYIADTGNHRIRKI 280



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 53/105 (50%), Gaps = 5/105 (4%)

Query: 85  LAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
           +AG+G+ G+ +      +   + PKS  V   G +Y+AD  NH IRKI  N  + TIAG 
Sbjct: 233 IAGNGLVGFHEDNILATNTSLNFPKSVFVSSNGEVYIADTGNHRIRKILANGNIATIAGN 292

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G +  G  + PA +    N     FV     + I+D  NQ IR+I
Sbjct: 293 GKEGYGDDNVPAIDTML-NRPSSVFVSS-NEVYIADQSNQRIRKI 335



 Score = 41.2 bits (95), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 14/92 (15%)

Query: 82  VKRLAGDGVQGYSDGE-PG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI 138
           +  +AG+G +GY D   P      ++P S  V     +Y+AD+SN  IRKI  N  + TI
Sbjct: 286 IATIAGNGKEGYGDDNVPAIDTMLNRPSSVFVS-SNEVYIADQSNQRIRKILQNGNIVTI 344

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
           AG G  +EG +       SF   F+++  PHI
Sbjct: 345 AGNG--REGNSGD-----SF---FDISRYPHI 366



 Score = 40.8 bits (94), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 4/105 (3%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGG 141
           +AG+G  G+S  +G   +A  + P    V     +Y+AD  N+ IRKI TN  + TIAG 
Sbjct: 65  IAGNGQNGFSGDNGPATNAMLNYPIGVFVSPYNEVYIADYFNNRIRKILTNGNIITIAGN 124

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               +G  +  A NA   N     FV     + I+D     IR+I
Sbjct: 125 NVNGKGVYNRLAINAML-NRPSGVFVTLNDEVYIADQNRNKIRKI 168


>gi|116754580|ref|YP_843698.1| NHL repeat-containing protein [Methanosaeta thermophila PT]
 gi|116666031|gb|ABK15058.1| NHL repeat containing protein [Methanosaeta thermophila PT]
          Length = 504

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 54/102 (52%), Gaps = 7/102 (6%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGP 152
           DG+ G AR   P       +G+IYVAD  NH I+K  +++  + T AG GS   G ADGP
Sbjct: 400 DGDFGRARLQHPLGLFYK-EGSIYVADTYNHRIKKADLSSGSIHTTAGTGSP--GFADGP 456

Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
              A+F+    LTF+    +L I+D  N  +R  + +  D S
Sbjct: 457 GAQAAFNEPSGLTFLGD--SLFIADTNNHAVRIYDQRSGDVS 496



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 54/105 (51%), Gaps = 5/105 (4%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSK 144
           G G +GYSDG+   AR  +P+  A+ +   +Y+AD  NH++R   +    +  +AG G  
Sbjct: 230 GSGERGYSDGDFSEARLFRPQGIAI-VGDVVYIADTGNHMVRAADLRRRTLVRMAGTGKS 288

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           +     G     S S+ ++L FV     L I+  G+  I +++L+
Sbjct: 289 RHPGLGGRGAEVSLSSPWDLVFVQD--HLYIAMAGSHQIWRMDLE 331


>gi|290971764|ref|XP_002668649.1| predicted protein [Naegleria gruberi]
 gi|284082135|gb|EFC35905.1| predicted protein [Naegleria gruberi]
          Length = 556

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 50/166 (30%), Positives = 76/166 (45%), Gaps = 29/166 (17%)

Query: 48  HSVIDRPGSSDLIVLDS-----SRSAFYT-LSFPLSEESV---------VKRLAGD---- 88
           +S++DR   + +I LD+     + ++ YT +S   SE S          +K+L       
Sbjct: 338 NSLVDRVAGTGIIELDTDGKLGNLTSLYTPISVTQSESSSDIYIGTTIGIKKLTDSNKLI 397

Query: 89  ----GVQGYS--DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAG 140
               G Q YS  DG P +A R   P+  A+   G ++++DK +H IRKI + GV + +AG
Sbjct: 398 SSFAGSQAYSSGDGYPATAARLSSPEGIAISTSGEVFISDKGSHTIRKIDSKGVISNVAG 457

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            GS   G  DGPA       D       H   L       Q+IR++
Sbjct: 458 TGS--AGYVDGPALKPKQPIDTAALLRKHARNLYPDSSDRQIIRKL 501



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 37/87 (42%), Gaps = 2/87 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           SAR   P   AV  K  I++ D +N  IRKI N     + G G         PA  AS  
Sbjct: 94  SARLSYPVCIAVSPKDEIFICDNNN--IRKIVNDTFYNVVGNGQLGNSPDGTPAVTASIK 151

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
             + + F P    +  SD  N +IR +
Sbjct: 152 TAYGIAFHPVTGEMYFSDINNNVIRYV 178


>gi|398335921|ref|ZP_10520626.1| hypothetical protein LkmesMB_09964 [Leptospira kmetyi serovar
           Malaysia str. Bejo-Iso9]
          Length = 947

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 60/118 (50%), Gaps = 7/118 (5%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKE 146
           G  G +DG   +ARF+ PK  A D    +YVAD  NH IR  KI++  VTTIAG G+   
Sbjct: 722 GTSGNADGVGTAARFNAPKGIATDGT-YLYVADTGNHKIRKIKISDNTVTTIAGSGT--A 778

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
           G  DG    A F+    L +  +   L ++D  +  I+ ++L     + ++ +G   G
Sbjct: 779 GTLDGLGTAAKFNQPSHLVYDSN--KLYVTDTNSNNIKLVDLTTSPVTVTTIAGDPAG 834


>gi|375146499|ref|YP_005008940.1| cell surface receptor IPT/TIG domain-containing protein [Niastella
           koreensis GR20-10]
 gi|361060545|gb|AEV99536.1| cell surface receptor IPT/TIG domain protein [Niastella koreensis
           GR20-10]
          Length = 500

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 42/76 (55%), Gaps = 4/76 (5%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           G  ++ V D S +  Y +  P  + S++   AG    GY DG+P  ARF  P+   +D  
Sbjct: 426 GVGNIYVTDDSTNGIYRVD-PNGKLSLI---AGGVRSGYIDGKPQDARFSGPRGIVIDAS 481

Query: 115 GNIYVADKSNHVIRKI 130
           GN++VAD  N+ IRKI
Sbjct: 482 GNLFVADIGNNCIRKI 497



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 53/110 (48%), Gaps = 14/110 (12%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           VV  +AG G  GY DG   +A+F  P+   +D  GNIYVA+  N  IRKI   G VTT  
Sbjct: 253 VVSTIAGSGTYGYLDGPGLTAQFHFPQGLTLDNAGNIYVAEYQNQSIRKIDPAGNVTTF- 311

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
                     +GP    ++ ND     V     + ++D  N  I +I+L+
Sbjct: 312 --------FINGPQGYVAYPND----VVVDASNVYVTDQSNNRICKISLQ 349



 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 44/134 (32%), Positives = 67/134 (50%), Gaps = 5/134 (3%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEE-SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           S  + V+  ++ A    SF    + + V   AG    G  D    +A+F  P   A+D  
Sbjct: 173 SGKVTVVSCNKMAVSDTSFIYQTKVNSVITFAGQQNPGAVDASGINAQFWSPDDIALDAA 232

Query: 115 GNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
           GN+ V++  N  IR+IT LG V+TIAG G+   G  DGP   A F     LT   +   +
Sbjct: 233 GNLIVSEIGNSKIRRITPLGVVSTIAGSGT--YGYLDGPGLTAQFHFPQGLTL-DNAGNI 289

Query: 174 LISDHGNQLIRQIN 187
            ++++ NQ IR+I+
Sbjct: 290 YVAEYQNQSIRKID 303



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 6/101 (5%)

Query: 31  GYTVTTVIDGHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG 89
           G   T  I+G Q  +  P+ V+    +S++ V D S +    +S    +  ++  L+G+G
Sbjct: 306 GNVTTFFINGPQGYVAYPNDVVVD--ASNVYVTDQSNNRICKISL---QTGLLSVLSGNG 360

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
             G  DG+P  A+F +P   A+D   N+ +ADK N  +RK+
Sbjct: 361 NWGMVDGDPQQAQFYQPAKMALDNNNNLIIADKINGRVRKV 401


>gi|290982010|ref|XP_002673724.1| NHL repeat-containing protein [Naegleria gruberi]
 gi|284087309|gb|EFC40980.1| NHL repeat-containing protein [Naegleria gruberi]
          Length = 456

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/126 (34%), Positives = 62/126 (49%), Gaps = 7/126 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNL--GVT 136
           +K + G+G+ GY  G+ G+A       P S AVD  GNIY+AD  NH IRK T L   ++
Sbjct: 115 IKTIVGNGMPGYG-GDSGAATSTSLLFPTSIAVDSSGNIYIADTQNHRIRKFTALTGKLS 173

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKS 196
           TIAG   +     +  A  A  S+   +  +  I  + I+D  N  IR+I +     S  
Sbjct: 174 TIAGTEEEGYNGDNIIATTAKLSSPTGVA-LDSIGNVYIADSQNHRIRKITISNGKISTI 232

Query: 197 SQSGSA 202
           +  G A
Sbjct: 233 AGDGDA 238



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 6/108 (5%)

Query: 85  LAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAG 140
           +AG   +GY+       +A+   P   A+D  GN+Y+AD  NH IRKIT  N  ++TIAG
Sbjct: 175 IAGTEEEGYNGDNIIATTAKLSSPTGVALDSIGNVYIADSQNHRIRKITISNGKISTIAG 234

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICA--LLISDHGNQLIRQI 186
            G       DG +  AS  N+     V    +  +  +D  N  IR+I
Sbjct: 235 DGDAGSPLYDGLSAIASSLNNPTSVAVDQSSSNTVYFADTNNSKIRKI 282



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 49/96 (51%), Gaps = 6/96 (6%)

Query: 96  GEPGSARF---DKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRAD 150
           G P +  F   D P   A+   G++YVADKSN  I+KI  T   ++TIAGG        +
Sbjct: 361 GFPSTTEFACLDSPFGIAIGPNGDVYVADKSNARIKKINPTTKTISTIAGGSGAGFDGDN 420

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           GPA  +S  ND     V    A+ I+D  N  IR++
Sbjct: 421 GPAILSSL-NDPCGIAVQSNGAVYIADTKNNRIRKV 455


>gi|434403812|ref|YP_007146697.1| thiol-disulfide isomerase-like thioredoxin [Cylindrospermum
           stagnale PCC 7417]
 gi|428258067|gb|AFZ24017.1| thiol-disulfide isomerase-like thioredoxin [Cylindrospermum
           stagnale PCC 7417]
          Length = 506

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 67/148 (45%), Gaps = 13/148 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+  P    L + DS        SF       V  + G G  G++DG    A+F  P
Sbjct: 177 PGKVLATPAG--LFIADSGHHRLVMSSF----NGEVFHIIGTGKSGFTDGNFSEAQFSAP 230

Query: 107 KSFAVDMKGNI-YVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRA--DGPAQNASFSND 161
           +  A D +  I YVAD  NH +RK  I    V TIAG G +        G A   + ++ 
Sbjct: 231 QGMAYDAENQILYVADTENHALRKVDIKRQVVGTIAGTGEQSHNTRPHSGAALETALNSP 290

Query: 162 FELTFVPHICALLISDHGNQLIRQINLK 189
           ++L  + +  +L I+  GN  I Q++L+
Sbjct: 291 WDLEKIGN--SLFIAMAGNHQIWQLDLE 316



 Score = 40.8 bits (94), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 37/129 (28%), Positives = 55/129 (42%), Gaps = 7/129 (5%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLG 134
           E  +VK  AG G +G  DG    + F +P     D K  +++AD     IR +    N  
Sbjct: 316 ENHIVKTYAGTGGEGCVDGNFTESAFAQPSGITTDGK-ELFIADSEISSIRGVELGENGK 374

Query: 135 VTTIAGGGSKKE-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           V T+ GGG   + G  DG   +    +   + F  +   L ++D  N  I+ +N    DC
Sbjct: 375 VRTVCGGGFLFDFGDIDGQYFDVRLQHCLGVDFFQN--HLWVTDTYNHKIKLVNPITGDC 432

Query: 194 SKSSQSGSA 202
                 GSA
Sbjct: 433 QTVLGDGSA 441



 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 35/62 (56%), Gaps = 3/62 (4%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
           + + GDG  G  +G+  + RF +P   +  +   +Y++D +NHVIR++    L VTT+  
Sbjct: 433 QTVLGDGSAGLQNGQGKNTRFFEPSGLSA-IDSYLYISDTNNHVIRRVDLETLEVTTLQF 491

Query: 141 GG 142
            G
Sbjct: 492 SG 493


>gi|356571070|ref|XP_003553704.1| PREDICTED: NHL repeat-containing protein 2-like [Glycine max]
          Length = 1098

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 56/113 (49%), Gaps = 4/113 (3%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVT 136
           + + ++   G +G  DG    A F++P+  A + K NI YVAD  NH +R+I   N  V 
Sbjct: 642 NFIVQIGSSGEEGLQDGSFDDATFNRPQGLAYNAKKNILYVADTENHALREIDFVNEKVR 701

Query: 137 TIAGGGSKKEGRADGPAQNASFSND-FELTFVPHICALLISDHGNQLIRQINL 188
           T+AG G+K      G   ++   N  +++ F P    + I+  G   I + NL
Sbjct: 702 TLAGNGTKGSDYVGGGKGDSQLLNSPWDVCFHPFDEKIYIAMAGQHQIWEHNL 754



 Score = 41.6 bits (96), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 41/76 (53%), Gaps = 5/76 (6%)

Query: 115 GNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
           G IY+AD  NH I+K+  T+  V+TIAG G  K G  DG A  A  S    +    +   
Sbjct: 857 GEIYIADSYNHKIKKLDPTSKRVSTIAGTG--KAGFKDGTAVKAQLSEPSGIV-EGNKGR 913

Query: 173 LLISDHGNQLIRQINL 188
           L I+D  N LIR ++L
Sbjct: 914 LFIADTNNSLIRYLDL 929


>gi|283781091|ref|YP_003371846.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
 gi|283439544|gb|ADB17986.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
          Length = 782

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 31/67 (46%), Positives = 39/67 (58%), Gaps = 3/67 (4%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
           G GV G +DG    A FD P+  A+D +  +YVAD  NH++RKI  T   VTTIAG G +
Sbjct: 332 GSGVIGKADGSFAEASFDHPQGCALDGE-TLYVADTENHLLRKIDLTKKTVTTIAGTGKQ 390

Query: 145 KEGRADG 151
            E    G
Sbjct: 391 AESAWPG 397



 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 4/49 (8%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           VK +AG G  G SD     A FD+P   A+ +   +YVAD +NH+IRK+
Sbjct: 578 VKTIAGSGSPGTSDD---PATFDEPAGLAL-VGETLYVADTNNHLIRKV 622


>gi|290989149|ref|XP_002677205.1| predicted protein [Naegleria gruberi]
 gi|284090811|gb|EFC44461.1| predicted protein [Naegleria gruberi]
          Length = 691

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 30/68 (44%), Positives = 43/68 (63%), Gaps = 5/68 (7%)

Query: 81  VVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VT 136
           ++  +AG G +GY DG+ G    AR   P   AV+ +G++Y +D  N+VIRKI+  G +T
Sbjct: 226 LIHTVAGFGEKGY-DGDGGFAFYARIRCPTGIAVNQRGDVYFSDSGNNVIRKISTCGIIT 284

Query: 137 TIAGGGSK 144
           TIAG G K
Sbjct: 285 TIAGTGEK 292



 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 63/115 (54%), Gaps = 8/115 (6%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVT 136
           ++  +AG G +G++ DG   + ++   P   A    G++Y+AD+ NH IRK++ +   ++
Sbjct: 111 IITTIAGCGERGFNRDGCVATVSKLSYPTRIARCENGDLYIADRHNHRIRKVSAVDGSIS 170

Query: 137 TIAGGGSKKEG-RADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           TIAG G  +EG   DG PA +A       +        L ISD+ N  IR+++L+
Sbjct: 171 TIAGTG--REGYNGDGIPAIHADLYFPTGIAIDEREQELFISDYHNNRIRRVSLR 223



 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 54/113 (47%), Gaps = 3/113 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S   ++  +AG G +G++  +  +     + P   A+  +G I   D  N+ I KI N 
Sbjct: 277 ISTCGIITTIAGTGEKGFNGDQMDALDVMLNGPSGIAISQEGEILFTDIYNNRICKI-NR 335

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             T +   G+ +EG  DGP + A F+N  ++        + + D GN  IR+I
Sbjct: 336 DRTLVTIAGTTEEGYQDGPVRMAKFNNPSDIAIDNETGDVYVVDGGNNYIRKI 388


>gi|441185716|ref|ZP_20970504.1| hypothetical protein SRIM_41417, partial [Streptomyces rimosus
           subsp. rimosus ATCC 10970]
 gi|440613960|gb|ELQ77297.1| hypothetical protein SRIM_41417, partial [Streptomyces rimosus
           subsp. rimosus ATCC 10970]
          Length = 108

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 47/78 (60%), Gaps = 3/78 (3%)

Query: 79  ESVVKRLAGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGV 135
           +  +  +AG+GV G+ SDG  G+  R   P+   VD  GN+++AD+ NH IRK+T N  +
Sbjct: 18  DGSIATVAGNGVAGFISDGGHGALTRVYYPQGVTVDKNGNLFIADRYNHRIRKVTPNGTI 77

Query: 136 TTIAGGGSKKEGRADGPA 153
           TT+AG G+       GPA
Sbjct: 78  TTVAGDGNAGYISDGGPA 95


>gi|428169673|gb|EKX38605.1| hypothetical protein GUITHDRAFT_56349, partial [Guillardia theta
           CCMP2712]
          Length = 210

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 46/161 (28%), Positives = 73/161 (45%), Gaps = 10/161 (6%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP  +     S  +++ D   S    L  PL +   V  +AG    GY DG    A+FD 
Sbjct: 26  NPVGISVSADSQYVVIADYLNSLIRLLFVPLLQ---VTTVAGTSGSGYEDGTCAMAKFDG 82

Query: 106 PKS--FAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           P    + +D K  + +AD  N+ IR   +++  V+TIAG G+   G+ DG    A+F+  
Sbjct: 83  PIDVVWGID-KSLVLIADFWNNRIRSLNLSSCSVSTIAGSGAF--GQLDGSTAAATFAGP 139

Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
             L        LL+++     +R I+L     +  + SG A
Sbjct: 140 CSLALTSQGSKLLVAEFSGHSVRMIDLSKNTVTTLAGSGIA 180



 Score = 43.9 bits (102), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 46/87 (52%), Gaps = 5/87 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  +AG G  G  DG   +A F  P S A+  +G+ + VA+ S H +R I  +   VTT+
Sbjct: 115 VSTIAGSGAFGQLDGSTAAATFAGPCSLALTSQGSKLLVAEFSGHSVRMIDLSKNTVTTL 174

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELT 165
           AG G    G  DG    A F+N ++LT
Sbjct: 175 AGSG--IAGLQDGVGLIAQFTNPWDLT 199


>gi|418737091|ref|ZP_13293489.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
           str. 200801910]
 gi|410747250|gb|EKR00156.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
           str. 200801910]
          Length = 358

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 58/109 (53%), Gaps = 7/109 (6%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKK 145
           G+ G+ +G+  ++ F  P    +D + N+YV +  NH IRKI NL    V+T++GG S  
Sbjct: 163 GIGGFQNGDRLNSLFKGPLFMDLDRERNLYVGELGNHTIRKI-NLNSETVSTLSGGVS-- 219

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
            G  DG   +A F     + +     +LL++D  N  IR+I+LK    S
Sbjct: 220 -GYLDGDLTSAQFKFPSGIIYDQKTDSLLVADLQNHRIRKIDLKTSTVS 267



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 1/81 (1%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
           ++LD      +T    ++   VV   AG G++   DG   +A F  P S  +D  GNI+V
Sbjct: 35  MILDCILKECHTCKLKVTNNPVVSPFAGTGIRESIDGTTQTASFKTPFSLELDTFGNIFV 94

Query: 120 ADKSNHVIRKITNLG-VTTIA 139
           +D  +++IRKI   G VTT++
Sbjct: 95  SDSDSNLIRKIDRFGNVTTLS 115



 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 42/80 (52%), Gaps = 6/80 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V  L+G GV GY DG+  SA+F  P     D K + + VAD  NH IRKI      V+T+
Sbjct: 211 VSTLSG-GVSGYLDGDLTSAQFKFPSGIIYDQKTDSLLVADLQNHRIRKIDLKTSTVSTL 269

Query: 139 AGGGSKKEGRADGPAQNASF 158
            G G   E   DG   NASF
Sbjct: 270 LGNG--IEASIDGKGLNASF 287



 Score = 38.5 bits (88), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 1/62 (1%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVT 136
           + S V  L G+G++   DG+  +A FD P   ++D  G ++V+D S++ IR +  +L V 
Sbjct: 262 KTSTVSTLLGNGIEASIDGKGLNASFDGPAFISLDNSGYMFVSDTSSNKIRIVDPDLNVF 321

Query: 137 TI 138
           TI
Sbjct: 322 TI 323


>gi|387219837|gb|AFJ69627.1| nhl repeat containing 2 [Nannochloropsis gaditana CCMP526]
          Length = 295

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKI 130
           V RLAG G +GY DG    A FD+P+   VD K G  +V D +NH IR +
Sbjct: 229 VSRLAGSGKRGYRDGAGAEAEFDEPRGLCVDGKGGQAFVCDSNNHCIRTV 278


>gi|297582981|ref|YP_003698761.1| NHL repeat containing protein [Bacillus selenitireducens MLS10]
 gi|297141438|gb|ADH98195.1| NHL repeat containing protein [Bacillus selenitireducens MLS10]
          Length = 414

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 48/97 (49%), Gaps = 4/97 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G  DG    A F  P+  AV   G I VAD  NH +R IT   V  +AG G    G  +G
Sbjct: 320 GSDDGPVSEAGFHFPRGVAVLSSGAILVADTYNHRLRLITEDEVLPVAGHGVH--GMVNG 377

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           P ++A F   + +        +L++DH N +IR++ L
Sbjct: 378 PVEDALFDGPYHVAVFGE--RILVTDHWNHMIREVEL 412



 Score = 40.0 bits (92), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 70/195 (35%), Gaps = 63/195 (32%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---------SARFD 104
           P     ++ DS       +S     E+   R   DG   Y  GEP           A FD
Sbjct: 111 PFDDGFLITDSGNHTLRYVSMSGQSETFAGRY--DGYDEY--GEPEGRFDHGTGEEAGFD 166

Query: 105 KPKSFAVDMK-GNIYVADKSNHVIRKITNLG-VTTIA----------------------- 139
            P     D   G +YVAD  N  IR++T  G V+T+A                       
Sbjct: 167 TPLGLVYDEDSGLLYVADAGNGAIRRVTEDGEVSTVAEDLDYPTDLILLDGSLIVSEARA 226

Query: 140 -----------------GGGSKKE-----GR-ADGPAQNASFSNDFELTFVPHICALLIS 176
                            GGG ++      GR ADG  + A F+  F L  +     ++++
Sbjct: 227 HVLTLVDPESGETEHLAGGGYEEADGELVGRFADGSGEGAGFNEPFGLAVLEE--TIVVA 284

Query: 177 DHGNQLIRQINLKPE 191
           D GNQ IRQ+ L  E
Sbjct: 285 DSGNQRIRQVTLDGE 299



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 28/108 (25%), Positives = 43/108 (39%), Gaps = 9/108 (8%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG-------SK 144
           GY +G P   +F+ P    V       + D  NH +R ++  G +    G         +
Sbjct: 92  GYQNGNPDDTKFNHPAGI-VPFDDGFLITDSGNHTLRYVSMSGQSETFAGRYDGYDEYGE 150

Query: 145 KEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
            EGR D G  + A F     L +      L ++D GN  IR++    E
Sbjct: 151 PEGRFDHGTGEEAGFDTPLGLVYDEDSGLLYVADAGNGAIRRVTEDGE 198


>gi|290976760|ref|XP_002671107.1| predicted protein [Naegleria gruberi]
 gi|284084673|gb|EFC38363.1| predicted protein [Naegleria gruberi]
          Length = 1349

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 67/154 (43%), Gaps = 2/154 (1%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQG 92
           V T   G+  +  P S     G   L    +S    +   F  +  + ++R+  D  V  
Sbjct: 454 VGTGTQGNSGDYGPASQCQLNGPQGLATPPTSMKGQFLFDFADTNNNRIRRIQNDLTVIP 513

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP 152
                P +   + P        G++Y+A + NH I K +N  ++T+AG G        GP
Sbjct: 514 LLSNNPNNDGINGPNGIMYTPNGDLYIASQGNHKIMKYSNSQLSTVAGNGYAGYSGDGGP 573

Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           A+NA  +   ++T V     + I+D GN+ +R++
Sbjct: 574 ARNAKLNYPADVT-VSSTGEVYIADWGNRAVRKV 606



 Score = 42.4 bits (98), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 4/68 (5%)

Query: 79  ESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-- 134
            S +  +AG+G  GYS DG P  +A+ + P    V   G +Y+AD  N  +RK+   G  
Sbjct: 553 NSQLSTVAGNGYAGYSGDGGPARNAKLNYPADVTVSSTGEVYIADWGNRAVRKVFRNGTI 612

Query: 135 VTTIAGGG 142
           +T I  GG
Sbjct: 613 ITLIVSGG 620



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 44/165 (26%), Positives = 72/165 (43%), Gaps = 36/165 (21%)

Query: 4   HFVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVID--RPGSSDLIV 61
           +F+ S L  T  + L+ +  SG   +                NPH+ +    P    + +
Sbjct: 325 NFIDSDLPSTSTTPLYGMYDSGFQFD----------------NPHNALSVYMPSPQQMRL 368

Query: 62  LDSSRSAFYTLSFPLSEESVVK---------RLAGDGVQ-GYS-DGEPGS-ARFDKPKSF 109
             S++     L FP+   +VVK         R+AG   Q GY+ D  P + A   +P + 
Sbjct: 369 TSSNQ-----LYFPIFNSAVVKVDLNSGLVTRIAGSITQFGYNQDSIPATIALMTRPNTV 423

Query: 110 AVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPA 153
                G++ ++D  N+ IRKI + G+ TTI G G++      GPA
Sbjct: 424 LQLDSGDLIISDTGNNRIRKIDSNGIITTIVGTGTQGNSGDYGPA 468


>gi|229822379|ref|YP_002883905.1| alkyl hydroperoxide reductase [Beutenbergia cavernae DSM 12333]
 gi|229568292|gb|ACQ82143.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Beutenbergia cavernae DSM 12333]
          Length = 641

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 59/132 (44%), Gaps = 3/132 (2%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S DL     + +  +TL     E+  + RLAG   +G  DG    A F +P   AV   G
Sbjct: 324 SPDLAAFVVAMAGNHTLWAFDDEQGSLTRLAGTEHEGLRDGPAAEAWFAQPSGLAVAQDG 383

Query: 116 NIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
            I+VAD     +R +   G  V T  G G  + G  DG A  A   +   +  +P   ++
Sbjct: 384 RIWVADSETSALRWLDPAGGDVHTAVGQGLFEFGHRDGAADQALLQHPLGVAALPD-ASV 442

Query: 174 LISDHGNQLIRQ 185
           L++D  N  +R+
Sbjct: 443 LVADTYNGALRR 454



 Score = 40.4 bits (93), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 38/162 (23%), Positives = 72/162 (44%), Gaps = 36/162 (22%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA---VD 112
           + +L+V D+   +   L      E++V+R+ G G +G  DG P  ARF +P       V+
Sbjct: 189 NGNLLVADAGHHSLAELG--PDGETLVRRV-GSGERGLVDGGPNVARFSEPNGLCLVPVE 245

Query: 113 MKG----NIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRADGPAQNAS--------- 157
           ++     ++ VAD  NH +R  ++ +  V+T+AG G +      G   NAS         
Sbjct: 246 LRARLGYDVLVADTVNHALRGVRLADGHVSTVAGTGQQY---VVGAPDNASDPHGGTHPV 302

Query: 158 ------------FSNDFELTFVPHICALLISDHGNQLIRQIN 187
                        S+ +++ + P + A +++  GN  +   +
Sbjct: 303 GFGDQFVPTSVKLSSPWDVAWSPDLAAFVVAMAGNHTLWAFD 344



 Score = 38.1 bits (87), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 52/109 (47%), Gaps = 8/109 (7%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNAS 157
           PG+ RF   K+ A++  GN+ VAD  +H + ++   G T +   GS + G  DG    A 
Sbjct: 175 PGTLRFPA-KAIALE-NGNLLVADAGHHSLAELGPDGETLVRRVGSGERGLVDGGPNVAR 232

Query: 158 FSNDFELTFVP-HICA-----LLISDHGNQLIRQINLKPEDCSKSSQSG 200
           FS    L  VP  + A     +L++D  N  +R + L     S  + +G
Sbjct: 233 FSEPNGLCLVPVELRARLGYDVLVADTVNHALRGVRLADGHVSTVAGTG 281


>gi|290985160|ref|XP_002675294.1| predicted protein [Naegleria gruberi]
 gi|284088889|gb|EFC42550.1| predicted protein [Naegleria gruberi]
          Length = 1229

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 54/105 (51%), Gaps = 5/105 (4%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLGVTTIA 139
           +V   +G  + GY DG   +ARF  P    +D   G++YV+D  N VIRKI+N  V+T+A
Sbjct: 608 IVSTFSGGTISGYQDGPIQTARFYFPNGLDIDTTNGDLYVSDTFNCVIRKISNGVVSTVA 667

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
            G  +     +G    AS    + L++      ++ + + N+LI 
Sbjct: 668 -GQYQNTNYQNGNLSTASIGYPYYLSYYN---GMIFTVYDNELIE 708



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 35/134 (26%), Positives = 65/134 (48%), Gaps = 9/134 (6%)

Query: 76  LSEESVVKRLAGDGVQ---GYSDGEPGSARFDKPKSFAV-DMKGNIYVADKSNHVIR--K 129
           L    V+  +AGDG     GY DG    A F    S +  D    ++++D +N  +R   
Sbjct: 230 LLSNGVMSVIAGDGNNYNNGYRDGNATQALFYYIPSLSFKDSTQELFISDLNNQRVRVLN 289

Query: 130 ITNLGVTTIAGGGSKKEGRADGPAQNAS--FSNDFELTFVPHICALLISDHGNQLIRQIN 187
           + +  V+T+ G       R   P QN    +   +++ ++P +  LLI+D  N++IR+++
Sbjct: 290 MNSNQVSTLLGNDINSYERTYLP-QNGKDVYVRPYQIQYLPQLDQLLIADLDNRVIRKVD 348

Query: 188 LKPEDCSKSSQSGS 201
           L+ +    +  +GS
Sbjct: 349 LQNQQAWTTIIAGS 362



 Score = 37.0 bits (84), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 37/136 (27%), Positives = 66/136 (48%), Gaps = 13/136 (9%)

Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICA 172
            G+IY++   N++++ ++N  ++ IAG G+    G  DG A  A F     L+F      
Sbjct: 216 NGDIYLS-AYNNIVKLLSNGVMSVIAGDGNNYNNGYRDGNATQALFYYIPSLSFKDSTQE 274

Query: 173 LLISDHGNQLIRQINLKPEDCSKSSQSGSALGA-VSVWVLVSVLSCLVSLVIGFVARPYI 231
           L ISD  NQ +R +N+       S+Q  + LG  ++ +    +      + +    RPY 
Sbjct: 275 LFISDLNNQRVRVLNM------NSNQVSTLLGNDINSYERTYLPQNGKDVYV----RPYQ 324

Query: 232 IRHTGRLDPPLQHDME 247
           I++  +LD  L  D++
Sbjct: 325 IQYLPQLDQLLIADLD 340


>gi|74317309|ref|YP_315049.1| hypothetical protein Tbd_1291 [Thiobacillus denitrificans ATCC
           25259]
 gi|74056804|gb|AAZ97244.1| conserved hypothetical protein containing twin-arginine
           translocation pathway signal [Thiobacillus denitrificans
           ATCC 25259]
          Length = 510

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 48/109 (44%), Gaps = 8/109 (7%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-------VTT 137
           + G     + DG    ARF+       D  GN+YVAD+ NHVIRKIT          V+T
Sbjct: 332 MVGQKQGAFVDGPANEARFNGCSQIDYDNAGNLYVADRFNHVIRKITPSADPMVGKMVST 391

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            A G   + GR  GPA  A F   + L+       + I D  N  +  I
Sbjct: 392 YA-GVPMQSGRISGPASKAKFYEPWGLSVDRKNNLIFIGDTANSRVAVI 439



 Score = 43.5 bits (101), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 63/141 (44%), Gaps = 14/141 (9%)

Query: 59  LIVLDSSRSAFYTLSFPLS-------------EESVVKRLAGDGVQGYSDGEPGSARFDK 105
           L  +  +++A  TLS P S             EE      AG+   G  +G   SA F  
Sbjct: 41  LTAVKGAQAALPTLSTPFSVATNEGFPLPKSPEEIWSLDYAGNLFPGMRNGPALSAFFYP 100

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
                ++  G+I VAD  N +IR I+NLG  TI  G  ++    DG A + +F++  ++ 
Sbjct: 101 ATGMYIEDNGSIKVADARNSLIRNISNLGDVTIFSGSMQQYPFKDGSAVDTNFNSPNDID 160

Query: 166 FVPHICALLISDHGNQLIRQI 186
            +      ++ D  N  IR++
Sbjct: 161 KLSD-GTYVVGDRENNAIRRV 180


>gi|296124331|ref|YP_003632109.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
 gi|296016671|gb|ADG69910.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
          Length = 581

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 30/71 (42%), Positives = 42/71 (59%), Gaps = 5/71 (7%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAG 140
           ++ G G  G  DG   SA+FD P+  A+D  GN +YVAD  NH++R +  T   V+T+AG
Sbjct: 283 KVIGSGKIGAKDGPAESAQFDHPQGMALD--GNTLYVADTENHLLRTVNLTTWEVSTLAG 340

Query: 141 GGSKKEGRADG 151
            G +  GR  G
Sbjct: 341 TGEQARGRDRG 351


>gi|290995346|ref|XP_002680256.1| predicted protein [Naegleria gruberi]
 gi|284093876|gb|EFC47512.1| predicted protein [Naegleria gruberi]
          Length = 1226

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 62/131 (47%), Gaps = 6/131 (4%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           DL   DS+  A       +  + V+  L G+G  G+ DG+  SA F+ P S A+   G+ 
Sbjct: 398 DLYFCDSNNHAIR----KIDTQGVITTLTGNGTAGFQDGDAASALFNSPASIALLPNGDF 453

Query: 118 YVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLI 175
            VAD +N  IRK   +   V TIAG  +      +G A  A  +    + +      +LI
Sbjct: 454 LVADFTNSRIRKYVASTKQVVTIAGNSTLGFYGDNGLASKALLNKPTGVFYNSTSGDILI 513

Query: 176 SDHGNQLIRQI 186
           +D+ N  IR+I
Sbjct: 514 ADYFNFRIRKI 524



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 46/102 (45%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           +AG G    S+    +A+F +P+   V     +Y+ D  N++IRKI N  V  I G G +
Sbjct: 40  VAGGGSSLSSNISALNAKFSQPRCVVVSPNNEVYICDTGNYMIRKIVNGVVYNIVGNGQQ 99

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
                   A NA  S    + F P    L  SD+ N  I+ +
Sbjct: 100 GSSTDGTLAVNAKVSLVLSVAFHPVSGELFFSDYSNNAIKYV 141



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 4/82 (4%)

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFE 163
           +P   A    G++Y  D +NH IRKI   GV TT+ G G+   G  DG A +A F++   
Sbjct: 387 QPAGMAFASNGDLYFCDSNNHAIRKIDTQGVITTLTGNGT--AGFQDGDAASALFNSPAS 444

Query: 164 LTFVPHICALLISDHGNQLIRQ 185
           +  +P+    L++D  N  IR+
Sbjct: 445 IALLPN-GDFLVADFTNSRIRK 465



 Score = 45.1 bits (105), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 69/146 (47%), Gaps = 10/146 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY--SDGEPGSARFD 104
           P S+   P + D +V D + S      +  S + VV  +AG+   G+   +G    A  +
Sbjct: 442 PASIALLP-NGDFLVADFTNSRI--RKYVASTKQVVT-IAGNSTLGFYGDNGLASKALLN 497

Query: 105 KPKS-FAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSND 161
           KP   F     G+I +AD  N  IRKI+N    +TTIAG GS      DG A  ++    
Sbjct: 498 KPTGVFYNSTSGDILIADYFNFRIRKISNNTGIITTIAGSGSTSY-NGDGLAATSTSMAP 556

Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
           + L   P    L  +D+ N L+R+I+
Sbjct: 557 YGLAVNPISGELFFTDNANFLVRRID 582


>gi|357400623|ref|YP_004912548.1| hypothetical protein SCAT_3039 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
 gi|386356676|ref|YP_006054922.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
 gi|337767032|emb|CCB75743.1| conserved protein of unknown function [Streptomyces cattleya NRRL
           8057 = DSM 46488]
 gi|365807185|gb|AEW95401.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
          Length = 613

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/133 (27%), Positives = 59/133 (44%)

Query: 68  AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           A  TL   L  E   K     G   Y   EP       P    +   GN+ V+D + H +
Sbjct: 146 AIATLVEELEAEHAAKGTLRRGDGPYVPPEPVPTDLRFPGKALLLPTGNLLVSDSTRHAL 205

Query: 128 RKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            ++   G T +   GS + G ADGPA  A FS    L  +P   +++++D  N  +R+++
Sbjct: 206 VELAPDGETVVRRIGSGERGLADGPADRARFSEPQGLALLPDGESVIVADTVNHALRRVD 265

Query: 188 LKPEDCSKSSQSG 200
           L   + +  + +G
Sbjct: 266 LASGEVTTVAGTG 278


>gi|302551068|ref|ZP_07303410.1| NHL repeat-containing protein [Streptomyces viridochromogenes DSM
           40736]
 gi|302468686|gb|EFL31779.1| NHL repeat-containing protein [Streptomyces viridochromogenes DSM
           40736]
          Length = 498

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 59/109 (54%), Gaps = 5/109 (4%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTT 137
           +  +AG G+ G++ D +P  +A+ + P + AVD  GN+++AD SNH +R++      +TT
Sbjct: 244 ITTVAGTGIAGFNGDNQPAVNAQLNAPHTVAVDSGGNVFIADTSNHRVRRVDAATRQITT 303

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +AG G+       GPA     ++   +  V     L I+D  N  +R++
Sbjct: 304 VAGIGTAGFNGDGGPAIGVQLNSPVGVA-VDSGGGLFIADASNYRVRKV 351



 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 66/136 (48%), Gaps = 15/136 (11%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--T 131
           ++ E  +  +AG+G  G+S   G    A+ + P+  AVD  GN++ +D SNH +R++   
Sbjct: 10  VTREPAIYIVAGNGTAGFSSDYGPAVCAQLNYPRKVAVDSSGNLFFSDYSNHRVRRVDAA 69

Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQI 186
              +TT+AG G+      +G  Q A +S    L +   +       L I+D  N  +R++
Sbjct: 70  TQQITTVAGNGTAGF---NGDNQPAVYS---RLHYPRGVAVDAGGNLFIADCNNHRVRRV 123

Query: 187 NLKPEDCSKSSQSGSA 202
           +      +  + +G A
Sbjct: 124 DAATRQITTVAGTGIA 139



 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 31/125 (24%), Positives = 60/125 (48%), Gaps = 5/125 (4%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTT 137
           +  +AG G+ G++ D +P  +A+ + P   AVD  GN+++ D +N  +R++      +TT
Sbjct: 130 ITTVAGTGIAGFNGDNQPAVNAQLNSPIGVAVDAGGNLFITDFNNQRVRRVDAATRQITT 189

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS 197
           +AG G+      +  A NA       +        L I D+ N  +R+++      +  +
Sbjct: 190 VAGTGTAGFNGDNQSAVNAQLYGPVGVA-ADAGGNLFIGDYANHRVRRVDAATRQITTVA 248

Query: 198 QSGSA 202
            +G A
Sbjct: 249 GTGIA 253



 Score = 37.0 bits (84), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 39/66 (59%), Gaps = 5/66 (7%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT--- 136
           +  +AG G  G++ DG P    + + P   AVD  G +++AD SN+ +RK++++  T   
Sbjct: 301 ITTVAGIGTAGFNGDGGPAIGVQLNSPVGVAVDSGGGLFIADASNYRVRKVSDVASTARF 360

Query: 137 TIAGGG 142
           ++A GG
Sbjct: 361 SVAPGG 366


>gi|384568064|ref|ZP_10015168.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
           glauca K62]
 gi|384523918|gb|EIF01114.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
           glauca K62]
          Length = 634

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 42/138 (30%), Positives = 61/138 (44%), Gaps = 15/138 (10%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           DL+ L   +      +F  SE +   R A D   G+ DG     RF  P    V  +G +
Sbjct: 156 DLVALHEEKGTLRRGAFDPSEAA---RAAADNRGGWLDG----LRF--PSKAVVTAEGRV 206

Query: 118 YVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA----- 172
            VAD +NH I +  +   T +   G  + G  DG    A F+    LT +P   A     
Sbjct: 207 LVADTANHSIAEFASDVETLLRRFGDGRRGSVDGAFDVARFAEPSGLTLLPTHVADRVGY 266

Query: 173 -LLISDHGNQLIRQINLK 189
            LLI+D  N L+R ++L+
Sbjct: 267 HLLIADTANHLLRGVDLR 284


>gi|336399828|ref|ZP_08580628.1| NHL repeat containing protein [Prevotella multisaccharivorax DSM
           17128]
 gi|336069564|gb|EGN58198.1| NHL repeat containing protein [Prevotella multisaccharivorax DSM
           17128]
          Length = 494

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 53/106 (50%), Gaps = 10/106 (9%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-----TNLGVTTIAG--GGS 143
           +G+ DG   +A+F+ P+       G +Y+AD  NH IR I      N  VTT  G  G S
Sbjct: 392 KGWEDGSLKNAKFNYPRQICFTKDGRLYIADSQNHCIRSIDTTQGANARVTTPIGVPGSS 451

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
              G ADG  + A F+    +        + I+D  NQ+IR++++K
Sbjct: 452 ---GYADGGVELAKFNIPMGVAVSSDGEKVYIADMKNQVIRELSIK 494


>gi|324997987|ref|ZP_08119099.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Pseudonocardia sp. P1]
          Length = 614

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 47/89 (52%), Gaps = 3/89 (3%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRAD 150
           Y+DG  GSARF +P+   V     + VAD  NH +R+++  +  V+T+AG GS+   R D
Sbjct: 231 YTDGPAGSARFSEPQGLLVLDPSTVLVADTVNHAVRRVSLDDGTVSTVAGTGSQLRERVD 290

Query: 151 GPAQNASFSNDFELT-FVPHICALLISDH 178
                A  S+ ++L  +  H+   +   H
Sbjct: 291 PGGTAAELSSPWDLAPWDGHVVVAMAGSH 319



 Score = 40.8 bits (94), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 42/175 (24%), Positives = 72/175 (41%), Gaps = 19/175 (10%)

Query: 28  LEDGYTVTTVIDGHQLE--INPHSVIDR--------PGSSDLIVLDSSRSAFYTLSFPLS 77
           L+DG   T    G QL   ++P              P    ++V  +     +T+  P+S
Sbjct: 270 LDDGTVSTVAGTGSQLRERVDPGGTAAELSSPWDLAPWDGHVVVAMAGSHQLWTVD-PVS 328

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-----TN 132
            ++ V  LAG   +G  DG    A   +P   A    G ++VAD     +R++       
Sbjct: 329 GQARV--LAGTTNEGLRDGSFAEAFLAQPSGLATGPDGTLWVADSEISALRRVDVDPGAG 386

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             V+T  G G  + G  DGPA  A   +   +  +P   ++ ++D  N  +R+ +
Sbjct: 387 PAVSTAVGQGLFEFGHRDGPAAEALLQHPLGVAVLPD-GSVAVADTYNGAVRRFD 440


>gi|108707787|gb|ABF95582.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
           protein, expressed [Oryza sativa Japonica Group]
          Length = 772

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 54/105 (51%), Gaps = 4/105 (3%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAGGGS 143
           G   +G  DG   +A F++P+  A + K NI YVAD  NH +R+I  +   V T+AG G+
Sbjct: 610 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINFVSETVKTLAGNGT 669

Query: 144 K-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           K  + R  G   N   ++ +++ + P    L I+  G   I + N
Sbjct: 670 KGSDYRGGGQGTNQVLNSPWDVCYDPSKETLYIAMAGQHQIWKHN 714



 Score = 41.6 bits (96), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 42/96 (43%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P   A+D+  N      SNH    +TNL    I   GS +EG  DG    ASF+    L 
Sbjct: 574 PGKLAIDVLNNRLFISDSNHNRIVVTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQGLA 633

Query: 166 FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
           +      L ++D  N  +R+IN   E     + +G+
Sbjct: 634 YNSKKNILYVADTENHALREINFVSETVKTLAGNGT 669


>gi|290997788|ref|XP_002681463.1| predicted protein [Naegleria gruberi]
 gi|284095087|gb|EFC48719.1| predicted protein [Naegleria gruberi]
          Length = 2297

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 39/113 (34%), Positives = 54/113 (47%), Gaps = 3/113 (2%)

Query: 78  EESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           E   +    GDG +G S DG  P  A      +  V   G++Y+ADK+ H IRK++N  +
Sbjct: 42  ERFTITSYVGDGNEGNSGDGVSPSHASIGYVDAIEVGSNGDLYIADKTYHRIRKVSNGII 101

Query: 136 TTIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            TIAG GS      D G A +A  +    L F      L + D  N  IR+I+
Sbjct: 102 KTIAGTGSSGGYFGDNGVATSAKLNKPQALAFYSGDGTLYLGDSLNYRIRKIS 154



 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 34/145 (23%), Positives = 67/145 (46%), Gaps = 8/145 (5%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY--SDGEPGSARF 103
           +P+ V   P + +L + D+  +    +      ++ +  + G GV GY    G+   A+ 
Sbjct: 239 SPNDVAISPTTGELFIADTGNNVIRKVGL----DNKIITVVGTGVSGYLGDGGQAKQAQL 294

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAGGGSKKEGRADGP-AQNASFSND 161
             P S A    G + ++D  N+VIRK+ + G+  TIAG         DG  + +A   + 
Sbjct: 295 SSPTSIAFTSAGEMLISDSDNYVIRKVYSNGIIRTIAGSARNSGSVGDGTDSLSAQIDSV 354

Query: 162 FELTFVPHICALLISDHGNQLIRQI 186
           + +++      + I+D  N  +R++
Sbjct: 355 YSISYSNISNEVFIADTSNFRVRKL 379



 Score = 46.2 bits (108), Expect = 0.014,   Method: Composition-based stats.
 Identities = 32/115 (27%), Positives = 56/115 (48%), Gaps = 2/115 (1%)

Query: 76  LSEESVVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S   ++  + G G +G  ++G   S+   +    AV   G +Y +D + H I KI+   
Sbjct: 153 ISTNQIISTVIGKGTKGALTEGTATSSTIGQVLGLAVSSSGVLYFSDYTYHCIGKISGTV 212

Query: 135 VTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           V+  AG  S    G  +G A +A  ++  ++   P    L I+D GN +IR++ L
Sbjct: 213 VSVAAGTCSLLGYGGDNGVATSAKLNSPNDVAISPTTGELFIADTGNNVIRKVGL 267


>gi|290985802|ref|XP_002675614.1| predicted protein [Naegleria gruberi]
 gi|284089211|gb|EFC42870.1| predicted protein [Naegleria gruberi]
          Length = 848

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 61/113 (53%), Gaps = 9/113 (7%)

Query: 32  YTVTTVIDG-HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
           +T+ T+I   +Q++  P  +I  P S DL +  +  +  +  S   S E  V  +AG+G 
Sbjct: 521 FTIQTIIQAPNQVQFLPQQLIYGP-SCDLYIACTLENQIFKYS---SSEGFV-LIAGNGK 575

Query: 91  QGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
            GY+   G    AR + PK+ AV+  G +Y+AD  N V+RKI +N  + T+ G
Sbjct: 576 VGYTGDYGPAIYARLNNPKAIAVNKFGEVYIADSFNKVVRKIFSNETIVTLIG 628


>gi|158319005|ref|YP_001511513.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158114410|gb|ABW16607.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 807

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 48/96 (50%), Gaps = 4/96 (4%)

Query: 95  DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA--DG 151
           DG P  +A    P   A+   G +YVAD S H +R+IT  G  T+  G  ++ G    DG
Sbjct: 642 DGGPAVNATLTSPSGLALADDGTLYVADSSAHTVRRITKDGKITLVAGKPEQSGYEGDDG 701

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           PA +A  ++  +L   P    L I+D  N +IR + 
Sbjct: 702 PAVDALLASPQDLALGPA-GELYIADTYNNVIRMVT 736



 Score = 37.7 bits (86), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 56/118 (47%), Gaps = 6/118 (5%)

Query: 76  LSEESVVKRLAGDGVQ-GYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++++  +  +AG   Q GY   DG    A    P+  A+   G +Y+AD  N+VIR +T 
Sbjct: 678 ITKDGKITLVAGKPEQSGYEGDDGPAVDALLASPQDLALGPAGELYIADTYNNVIRMVTP 737

Query: 133 LG-VTTIAGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            G +TT AG        ADG  A     S+   L  V    A+ +S+    ++R+I L
Sbjct: 738 DGKITTFAGSDESTADDADGALATETRISSPTGLA-VDTSGAVYVSEGSYGVVRRIAL 794


>gi|290971766|ref|XP_002668650.1| predicted protein [Naegleria gruberi]
 gi|284082136|gb|EFC35906.1| predicted protein [Naegleria gruberi]
          Length = 728

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 40/113 (35%), Positives = 56/113 (49%), Gaps = 6/113 (5%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKI-TNL 133
           E   +  +AG G  G+S G+ G A  +K   P+   V     +Y+AD+ NH IRKI  N 
Sbjct: 207 ENGNIVTIAGKGTAGFS-GDNGLATKEKLNFPRCVFVSSNNEVYIADQINHRIRKILENG 265

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            + TIAG G       +G A NA   N     FV     + I+++GN  IR+I
Sbjct: 266 NIVTIAGNGPYGFCGDNGLATNAQL-NSPAGVFVSSNNEIYIAEYGNHRIRKI 317



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 6/106 (5%)

Query: 85  LAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
           +AG G  G+S G+ G A  +K   P+   V     +Y+AD+ NH IRKI  N  + TIAG
Sbjct: 102 IAGKGTAGFS-GDNGLATKEKLNFPRCVFVSSNNEVYIADQINHRIRKILENGNIVTIAG 160

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G       +G A NA   N     FV     + I+D+ N  IR+I
Sbjct: 161 NGPYGFCGDNGLATNAQL-NSPAGVFVSSNNEIYIADYDNHRIRKI 205



 Score = 45.1 bits (105), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 51/112 (45%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+   +G   +A+ + P    V     IY+AD  NH IRKI  N  
Sbjct: 151 ENGNIVTIAGNGPYGFCGDNGLATNAQLNSPAGVFVSSNNEIYIADYDNHRIRKILENGN 210

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G+      +G A      N     FV     + I+D  N  IR+I
Sbjct: 211 IVTIAGKGTAGFSGDNGLATKEKL-NFPRCVFVSSNNEVYIADQINHRIRKI 261



 Score = 43.9 bits (102), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 44/88 (50%), Gaps = 2/88 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASF 158
           SA+   P+S  V     +Y+AD  NH IRKI  N  + TIAG G+      +G A  A  
Sbjct: 7   SAQLYSPESVFVSSNNEVYIADYGNHRIRKILENGNIVTIAGNGTAGFSGDNGIATKAQL 66

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           +    + FV     + I+D+ N  IR+I
Sbjct: 67  NGPVGV-FVSSNNEVYIADYDNHRIRKI 93



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 50/112 (44%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+S  +G    A+ + P    V     +Y+AD  NH IRKI  N  
Sbjct: 39  ENGNIVTIAGNGTAGFSGDNGIATKAQLNGPVGVFVSSNNEVYIADYDNHRIRKILENGN 98

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +  IAG G+      +G A      N     FV     + I+D  N  IR+I
Sbjct: 99  IVIIAGKGTAGFSGDNGLATKEKL-NFPRCVFVSSNNEVYIADQINHRIRKI 149



 Score = 41.6 bits (96), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 45/98 (45%), Gaps = 17/98 (17%)

Query: 78  EESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           E   +  +AG+G  G+   +G   +A+ + P    V     IY+A+  NH IRKI  N  
Sbjct: 263 ENGNIVTIAGNGPYGFCGDNGLATNAQLNSPAGVFVSSNNEIYIAEYGNHRIRKILENGN 322

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND--FELTFVPHI 170
           + T+AG G+            A FS D  F++   PH 
Sbjct: 323 IVTVAGNGT------------AGFSGDSPFDIRMYPHF 348


>gi|428182271|gb|EKX51132.1| hypothetical protein GUITHDRAFT_103053 [Guillardia theta CCMP2712]
          Length = 1472

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 50/97 (51%), Gaps = 4/97 (4%)

Query: 82   VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIA 139
            V+ LAG G  G+ DG   SA F  P+   +D    +Y+AD  +H IR   I+   V+T+A
Sbjct: 1328 VQTLAGSGANGFVDGVGTSAEFHSPRDIVLD-GDKLYIADSGSHRIRTLSISTRDVSTLA 1386

Query: 140  GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
            G  + K+G  DG    A F + F L     + +L +S
Sbjct: 1387 GY-AWKQGTQDGKGTFARFLSPFSLALSSAVNSLALS 1422


>gi|418693423|ref|ZP_13254475.1| putative lipoprotein [Leptospira kirschneri str. H1]
 gi|409958780|gb|EKO17669.1| putative lipoprotein [Leptospira kirschneri str. H1]
          Length = 358

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 63/119 (52%), Gaps = 9/119 (7%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
           + G  +G+   + FD P    +D + N+YV + SNH IRKI NL    V+T +GG S   
Sbjct: 164 LSGLQNGDRLDSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLDSGTVSTFSGGIS--- 219

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   + +     +LL +D  +  IR+I+LK    + S+  G+ +GA
Sbjct: 220 GYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKIDLKTS--TVSTLLGNGIGA 276



 Score = 47.8 bits (112), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 36/70 (51%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95

Query: 121 DKSNHVIRKI 130
           D+  ++IRKI
Sbjct: 96  DQITNLIRKI 105


>gi|424835068|ref|ZP_18259738.1| NHL repeat-containing protein [Clostridium sporogenes PA 3679]
 gi|365978195|gb|EHN14287.1| NHL repeat-containing protein [Clostridium sporogenes PA 3679]
          Length = 467

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 75/172 (43%), Gaps = 22/172 (12%)

Query: 23  SSGLLLEDGYTVTTVIDGHQLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
              ++LE+       I   ++ IN + +I     G + +++LD +     T+        
Sbjct: 162 EENIILENKCEKEEYIYPSKISINENGLIAFSEKGKNRIVILDQNLEKIKTI-------- 213

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTI 138
                 G    G  DG+   A+F  P+     +K  IYVAD  NH IR+  +    V TI
Sbjct: 214 ------GSSEYGLKDGDFREAQFKAPEGLRF-IKNKIYVADTENHFIRECDLEKEMVKTI 266

Query: 139 AGGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           AG G K+    A G A N S ++ ++L  +     + I+  GN  I + N+K
Sbjct: 267 AGNGQKEYSPMAKGDALNVSLNSPWDLEILED--CIYIAMAGNHQIWKYNMK 316


>gi|290975266|ref|XP_002670364.1| predicted protein [Naegleria gruberi]
 gi|284083922|gb|EFC37620.1| predicted protein [Naegleria gruberi]
          Length = 1584

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 38/112 (33%), Positives = 51/112 (45%), Gaps = 8/112 (7%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV- 135
           +E S     +GDG      G    A+ + P   ++   G IY+AD  NH IRKI + GV 
Sbjct: 158 NEHSDCDGFSGDG------GLASRAKLNSPNGLSISQNGEIYIADTLNHRIRKINSYGVI 211

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +TIAG G    G  DG     S  N      +     + I+D  N  IR+IN
Sbjct: 212 STIAGTGRASFG-GDGGFAVLSQLNSPNGVHISQNGEIYIADTLNHRIRKIN 262



 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 47/101 (46%), Gaps = 3/101 (2%)

Query: 88  DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKE 146
           DG  G   G    A+ + P   ++   G IY+AD  NH IRKI + GV +TIAG G    
Sbjct: 559 DGFSG-DGGLASRAKLNSPNGLSISQNGEIYIADTLNHRIRKINSYGVISTIAGTGRASF 617

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G  DG     S  N      +     + I+D  N  IR+IN
Sbjct: 618 G-GDGGFAVLSQLNSPNGVHISQNGEIYIADTLNHRIRKIN 657



 Score = 48.5 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 36/110 (32%), Positives = 55/110 (50%), Gaps = 6/110 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-T 136
           V+  +AG G   +  G+ G A   + + P    +   G IY+AD  NH IRKI + GV +
Sbjct: 210 VISTIAGTGRASFG-GDGGFAVLSQLNSPNGVHISQNGEIYIADTLNHRIRKINSYGVIS 268

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           T+AG G    G   G A N+  ++ + +  V     + I+D  N  IR+I
Sbjct: 269 TVAGTGRASFGGDGGLAINSQLNSPYGV-HVSQNGEIYIADTLNHRIRKI 317



 Score = 48.5 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 36/110 (32%), Positives = 55/110 (50%), Gaps = 6/110 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-T 136
           V+  +AG G   +  G+ G A   + + P    +   G IY+AD  NH IRKI + GV +
Sbjct: 605 VISTIAGTGRASFG-GDGGFAVLSQLNSPNGVHISQNGEIYIADTLNHRIRKINSYGVIS 663

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           T+AG G    G   G A N+  ++ + +  V     + I+D  N  IR+I
Sbjct: 664 TVAGTGRASFGGDGGLAINSQLNSPYGV-HVSQNGEIYIADTLNHRIRKI 712



 Score = 43.1 bits (100), Expect = 0.11,   Method: Composition-based stats.
 Identities = 41/111 (36%), Positives = 56/111 (50%), Gaps = 6/111 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
           V+  +AG G   +  G+ G   +++ + P    V   G IY+AD  NH IRKI  N  +T
Sbjct: 266 VISTVAGTGRASFG-GDGGLAINSQLNSPYGVHVSQNGEIYIADTLNHRIRKIFVNGTIT 324

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICALLISDHGNQLIRQI 186
           TIAG  S      DG    AS  N  +  FV P+   +LI+D  N  IR+I
Sbjct: 325 TIAGSSSDGSFGGDGGLSIASRLNSPKGVFVSPNNNEILIADTSNNRIRKI 375



 Score = 43.1 bits (100), Expect = 0.11,   Method: Composition-based stats.
 Identities = 41/111 (36%), Positives = 56/111 (50%), Gaps = 6/111 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
           V+  +AG G   +  G+ G   +++ + P    V   G IY+AD  NH IRKI  N  +T
Sbjct: 661 VISTVAGTGRASFG-GDGGLAINSQLNSPYGVHVSQNGEIYIADTLNHRIRKIFVNGTIT 719

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICALLISDHGNQLIRQI 186
           TIAG  S      DG    AS  N  +  FV P+   +LI+D  N  IR+I
Sbjct: 720 TIAGSSSDGSFGGDGGLSIASRLNSPKGVFVSPNNNEILIADTSNNRIRKI 770


>gi|421106984|ref|ZP_15567544.1| putative lipoprotein [Leptospira kirschneri str. H2]
 gi|410007898|gb|EKO61577.1| putative lipoprotein [Leptospira kirschneri str. H2]
          Length = 358

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 63/119 (52%), Gaps = 9/119 (7%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKE 146
           + G  +G+   + FD P    +D + N+YV + SNH IRKI NL    V+T +GG S   
Sbjct: 164 LSGLQNGDRLDSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLDSGTVSTFSGGIS--- 219

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGA 205
           G  DG   +A F +   + +     +LL +D  +  IR+I+LK    + S+  G+ +GA
Sbjct: 220 GYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKIDLKTS--TVSTLLGNGIGA 276



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 36/70 (51%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95

Query: 121 DKSNHVIRKI 130
           D+  ++IRKI
Sbjct: 96  DQITNLIRKI 105


>gi|320107007|ref|YP_004182597.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
 gi|319925528|gb|ADV82603.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
          Length = 710

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 65/129 (50%), Gaps = 17/129 (13%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           LS       +AGDG QG++ DG P  SA  D P   AV  +G +++AD  NH +R ++  
Sbjct: 73  LSTTGTWTTVAGDGRQGFAGDGGPAISAELDTPMGIAVSAEGTLWIADAHNHRVRLVSPN 132

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
           G ++TIAG      G      Q  + + D +        AL I+D  +  IR+++   E+
Sbjct: 133 GIISTIAG----SSGSGTVLRQPVALAFDVD-------GALYIADAADHRIRRVD---EN 178

Query: 193 CSKSSQSGS 201
            S S+ +G+
Sbjct: 179 GSISTVAGT 187



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 43/81 (53%), Gaps = 4/81 (4%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           + E   +  +AG G QGYS  E  +  AR D P S A D  G +  AD+ NH IR++   
Sbjct: 175 VDENGSISTVAGTGDQGYSGDEGPAILARIDSPSSLAFDAAGRLLFADRRNHRIRRVDVD 234

Query: 134 G-VTTIAGGGSKKEGRADGPA 153
           G V T+AG GS   G  DG A
Sbjct: 235 GTVHTVAGDGS-ASGEGDGMA 254



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 4/102 (3%)

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFE 163
           KP   A D  GN+YVAD   HVIR+++  G  TT+AG G +      GPA +A       
Sbjct: 48  KPAGMAYDSAGNLYVADARGHVIRRLSTTGTWTTVAGDGRQGFAGDGGPAISAELDTPMG 107

Query: 164 LTFVPHICALLISDHGNQLIRQI--NLKPEDCSKSSQSGSAL 203
           +  V     L I+D  N  +R +  N      + SS SG+ L
Sbjct: 108 IA-VSAEGTLWIADAHNHRVRLVSPNGIISTIAGSSGSGTVL 148



 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 2/85 (2%)

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFE 163
           +P + A D+ G +Y+AD ++H IR++   G ++T+AG G +     +GPA  A   +   
Sbjct: 150 QPVALAFDVDGALYIADAADHRIRRVDENGSISTVAGTGDQGYSGDEGPAILARIDSPSS 209

Query: 164 LTFVPHICALLISDHGNQLIRQINL 188
           L F      LL +D  N  IR++++
Sbjct: 210 LAF-DAAGRLLFADRRNHRIRRVDV 233


>gi|325110177|ref|YP_004271245.1| hypothetical protein Plabr_3626 [Planctomyces brasiliensis DSM
           5305]
 gi|324970445|gb|ADY61223.1| NHL repeat containing protein [Planctomyces brasiliensis DSM 5305]
          Length = 370

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 9/128 (7%)

Query: 69  FYTLSFP--LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
            + L  P  +  E  V  + GDG  GY+  +G    A+ ++P    +   G +YV + +N
Sbjct: 17  LFVLQLPATVQAEGKVITILGDGKAGYTGDNGSADEAQCEQPFGLVIGPDGALYVCETAN 76

Query: 125 HVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
           HVIR+I   TN  VTT+AG G K      G A  A     +E+ F      +   +  N 
Sbjct: 77  HVIRRIDLKTNK-VTTVAGTGEKGYSGDGGSALEAKLDEPYEIRFDKQ-GNMFFVEMLNN 134

Query: 182 LIRQINLK 189
           ++R+++ K
Sbjct: 135 IVRRVDAK 142



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 52/114 (45%), Gaps = 11/114 (9%)

Query: 78  EESVVKRLAG-DGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           E  ++  +AG  G   Y  DG P   AR   PK  +VD +GNI++AD  +H IR+I    
Sbjct: 255 ERGLLHHVAGLGGKNAYEGDGRPARLARLAGPKGISVDKQGNIFLADTESHTIRRIDGQT 314

Query: 135 --VTTIAGGGSKKEGRADGPAQN--ASFSNDFELTFVPHICALLISDHGNQLIR 184
             +TT+ G G K     DGP  N      N     FV     + I D  N  +R
Sbjct: 315 GIITTLVGNGKK----GDGPDGNPLECKLNRPHGVFVAADGKVYIGDSSNHRVR 364



 Score = 44.3 bits (103), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 57/113 (50%), Gaps = 7/113 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV-T 136
           V  +AG G +GYS G+ GSA   + D+P     D +GN++  +  N+++R++    GV +
Sbjct: 89  VTTVAGTGEKGYS-GDGGSALEAKLDEPYEIRFDKQGNMFFVEMLNNIVRRVDAKTGVIS 147

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           T+AG G K        A  A+F+    + F      L I D GN  +R +  +
Sbjct: 148 TVAGTGEKGFSGDGKAATEATFNRPHSICF-DSAGHLYICDIGNHRVRVVEAR 199



 Score = 37.7 bits (86), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 34/56 (60%), Gaps = 2/56 (3%)

Query: 77  SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  V+  +AG G +G+S DG+  + A F++P S   D  G++Y+ D  NH +R +
Sbjct: 141 AKTGVISTVAGTGEKGFSGDGKAATEATFNRPHSICFDSAGHLYICDIGNHRVRVV 196


>gi|290973007|ref|XP_002669241.1| predicted protein [Naegleria gruberi]
 gi|284082786|gb|EFC36497.1| predicted protein [Naegleria gruberi]
          Length = 747

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 43/126 (34%), Positives = 63/126 (50%), Gaps = 7/126 (5%)

Query: 82  VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TI 138
           +  +AG GV G S DG+  + A+ + P+  A+   G IY AD SN  IR+I   G+  TI
Sbjct: 244 INTIAGTGVLGLSGDGDLATKAQLNTPRGIAISSTGEIYFADTSNQRIRRIALNGIIDTI 303

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQ 198
           AG G  + G  DG     +  N      +     + I+D  NQ IR+I L   + + S+ 
Sbjct: 304 AGTGDPRFG-GDGDLATKAQLNSPRGIAISSTGEIYIADTYNQRIRRIAL---NTNISTF 359

Query: 199 SGSALG 204
           +GS  G
Sbjct: 360 AGSGFG 365



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 4/115 (3%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG 134
           E  ++  +AG G   +  DG+  + A+ + P+  A+   G IY+AD  NH IR+I  N  
Sbjct: 128 ENGIITTIAGTGDSRFGGDGDLATKAQLNSPRGIAISSTGEIYIADTYNHRIRRIALNGT 187

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           + T+AG G  + G     A  A  +    +  +     + I+D  N+ IR+I L 
Sbjct: 188 INTVAGTGDSRFGGDGDLATKAQLNYPMGIA-ISSTGEIYIADTFNERIRRIALN 241



 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 50/104 (48%), Gaps = 4/104 (3%)

Query: 86  AGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGG 142
           +G G  GY    G    A  + P S A    G IY+AD  NH IRKI+ N  +TTIAG G
Sbjct: 362 SGFGYSGYVGDGGLSTDALLNTPLSVACSSNGEIYIADTYNHRIRKISLNNTITTIAGTG 421

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
                   G A NA  S+  ++  V     +  SD+ N  IR+I
Sbjct: 422 DSGFSGDGGLAINARLSSPADI-VVNSNGVIYFSDYDNNRIRKI 464


>gi|405960880|gb|EKC26754.1| NHL repeat-containing protein 2 [Crassostrea gigas]
          Length = 699

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TN 132
           LS++ VV+   G   +G+ +G     RFD P+   V MK N IY+AD  NH IRKI    
Sbjct: 240 LSKDGVVQHCIGGKERGFKNGTFTQCRFDSPQ--GVVMKENVIYLADTENHAIRKIDLQK 297

Query: 133 LGVTTIAGGGSKKEGRADG 151
             VTTIAG G +   +  G
Sbjct: 298 QEVTTIAGNGKQGNDKFGG 316


>gi|430749689|ref|YP_007212597.1| copper amine oxidase family protein [Thermobacillus composti KWC4]
 gi|430733654|gb|AGA57599.1| copper amine oxidase family protein [Thermobacillus composti KWC4]
          Length = 187

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 45/104 (43%), Gaps = 17/104 (16%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           +AG G+ G  DG      F  P   AV   G IYVAD  N+VIRKI N  VTT+      
Sbjct: 4   IAGTGILGLKDGSASQENFYYPMDVAVSPSGVIYVADTLNYVIRKIENGRVTTLN----- 58

Query: 145 KEGRADGPAQNASFSNDFELTFVP----HICALLISDHGNQLIR 184
                   AQ+   +  F     P       A LI+D GN  IR
Sbjct: 59  --------AQSGRVAEVFPGVVEPAGDYRDGAFLIADGGNNRIR 94


>gi|168183170|ref|ZP_02617834.1| NHL repeat protein [Clostridium botulinum Bf]
 gi|237796201|ref|YP_002863753.1| NHL repeat-containing protein [Clostridium botulinum Ba4 str. 657]
 gi|182673679|gb|EDT85640.1| NHL repeat protein [Clostridium botulinum Bf]
 gi|229261965|gb|ACQ52998.1| NHL repeat protein [Clostridium botulinum Ba4 str. 657]
          Length = 467

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 45/171 (26%), Positives = 76/171 (44%), Gaps = 22/171 (12%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEESV 81
             ++LE+       I   ++ IN + +I     G + +++LD +                
Sbjct: 163 KNIILENKCEKEEYIYPSKISINENGLIAFSEKGKNRIVILDQNLEKI------------ 210

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIA 139
             ++ G G  G  DG+   A+F  P+     ++  IYVAD  NH IR+  +    V TIA
Sbjct: 211 --KIIGSGKYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHSIRECDLEKEIVKTIA 267

Query: 140 GGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G G K+    A G A N S ++ ++L  +     + I+  GN  I + N+K
Sbjct: 268 GNGQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMK 316


>gi|303248818|ref|ZP_07335068.1| YD repeat protein [Desulfovibrio fructosovorans JJ]
 gi|302489759|gb|EFL49690.1| YD repeat protein [Desulfovibrio fructosovorans JJ]
          Length = 2115

 Score = 50.1 bits (118), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 30/76 (39%), Positives = 45/76 (59%), Gaps = 3/76 (3%)

Query: 71  TLSFPLSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIR 128
           T  F L  +  ++ +AG+G   Y+ DG P +A     P   AVD +GN+Y+AD+ N+ IR
Sbjct: 846 TKVFRLDTDGTIRVVAGNGSTHYNGDGIPATAASLYHPNGLAVDAQGNLYIADQYNNRIR 905

Query: 129 KITNLG-VTTIAGGGS 143
           K+   G +TT AG G+
Sbjct: 906 KVDQNGIITTFAGTGA 921


>gi|290983036|ref|XP_002674235.1| predicted protein [Naegleria gruberi]
 gi|284087824|gb|EFC41491.1| predicted protein [Naegleria gruberi]
          Length = 737

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/113 (37%), Positives = 61/113 (53%), Gaps = 7/113 (6%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           +  +AG G  G+S G+ G   +A+   P++  V  +G I +AD SNHVIRKIT  G + T
Sbjct: 90  IHTIAGTGTSGHS-GDGGLAVNAKLFHPRTVTVTSEGEILIADTSNHVIRKITKDGYIFT 148

Query: 138 IAGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           IAG  S      DG  A++A  S+   +   P    + I ++G   IR+IN K
Sbjct: 149 IAGKPSTSGYGGDGGLAKDALLSSPQCVRISPSG-DIFIVEYGGARIRKINSK 200



 Score = 40.4 bits (93), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 50/106 (47%), Gaps = 6/106 (5%)

Query: 85  LAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           +AG    GYS G+ G   +A         +     IY+AD  N VIRKI + G + TIAG
Sbjct: 37  IAGSLTSGYS-GDGGLAINALLATSMGVTISPNREIYIADTGNSVIRKINSTGYIHTIAG 95

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G+       G A NA   +   +T       +LI+D  N +IR+I
Sbjct: 96  TGTSGHSGDGGLAVNAKLFHPRTVTVTSEG-EILIADTSNHVIRKI 140


>gi|320103824|ref|YP_004179415.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
 gi|319751106|gb|ADV62866.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
          Length = 417

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 54/113 (47%), Gaps = 4/113 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
            V+ +AG G +G+S DG P  +A  D+P   A    G+IY  D+ N  +RKI      +T
Sbjct: 127 TVEVIAGTGRKGFSGDGGPARAAELDEPYGVAPAPDGSIYFVDRLNRRVRKIDGATGIIT 186

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           T+AG G  +     GPA  A       L   P    LLI+D     IR ++L 
Sbjct: 187 TVAGTGRPETSGDGGPANRAGLVEPNGLALDPKAEDLLIADVQACRIRVVDLN 239



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 9/119 (7%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNL 133
            + ++  +AG+G +GYS DG P   A F+ PK   V  +G + + D  N  IR       
Sbjct: 296 RDGIITTVAGNGRKGYSGDGGPALEATFNGPKELDVSPEGLVAIVDTENQAIRLYNPKTR 355

Query: 134 GVTTIAGGGSK-KEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
            VTT+AG G + +EG  DG PA  A       + F P    + ISD  N  IR +   P
Sbjct: 356 VVTTLAGNGQRGREG--DGVPATAARLDRPHGIAFGPG-GVIAISDTNNHRIRLVKSVP 411



 Score = 40.4 bits (93), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 62/149 (41%), Gaps = 8/149 (5%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPGS-AR 102
           + P+ +   P + DL++ D        +        V+   AG G +    DG P   A 
Sbjct: 209 VEPNGLALDPKAEDLLIADVQACRIRVVDL---NTGVISTFAGTGRKARDGDGGPAQQAS 265

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSN 160
               ++  V   G +Y+ ++  + +R +      +TT+AG G K      GPA  A+F+ 
Sbjct: 266 ILGARAVRVAPDGTVYILEREGNSLRAVDPRDGIITTVAGNGRKGYSGDGGPALEATFNG 325

Query: 161 DFELTFVPHICALLISDHGNQLIRQINLK 189
             EL   P     ++ D  NQ IR  N K
Sbjct: 326 PKELDVSPEGLVAIV-DTENQAIRLYNPK 353



 Score = 37.7 bits (86), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 3/90 (3%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNAS 157
           SAR ++P     D  GN+  +D +NH +++    N  V  IAG G K      GPA+ A 
Sbjct: 91  SARLNQPFDCRYDAVGNLVFSDTANHQLKRWNQANDTVEVIAGTGRKGFSGDGGPARAAE 150

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQIN 187
               + +   P   ++   D  N+ +R+I+
Sbjct: 151 LDEPYGVAPAPD-GSIYFVDRLNRRVRKID 179



 Score = 37.0 bits (84), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 48/97 (49%), Gaps = 13/97 (13%)

Query: 41  HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPG 99
            +L+++P  ++         ++D+   A    +    +  VV  LAG+G +G   DG P 
Sbjct: 327 KELDVSPEGLV--------AIVDTENQAIRLYN---PKTRVVTTLAGNGQRGREGDGVPA 375

Query: 100 SA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           +A R D+P   A    G I ++D +NH IR + ++ V
Sbjct: 376 TAARLDRPHGIAFGPGGVIAISDTNNHRIRLVKSVPV 412


>gi|348508723|ref|XP_003441903.1| PREDICTED: NHL repeat-containing protein 2-like [Oreochromis
           niloticus]
          Length = 719

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 55/177 (31%), Positives = 80/177 (45%), Gaps = 34/177 (19%)

Query: 16  SLLFSLVSSG-----LLLEDG----YTVTTVIDGHQLEIN------PHSVIDRPG----- 55
           +LLFSLV  G     +L  D     Y    ++  H + I       P S++  PG     
Sbjct: 168 NLLFSLVGEGHRDRLMLFTDCALRYYGGRGLLKAHTVGIKLYKDALPPSILSFPGKVAID 227

Query: 56  --SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
             +  L++ D+       +S        +  + G    G  DG+   A F+ P+  A+  
Sbjct: 228 NSTKRLVIADTGHHRILVVS----STGDLLHVIGGPESGRKDGDVSEASFNSPQGVAI-- 281

Query: 114 KGN-IYVADKSNHVIRKITNL--GVTTIAGGGSK---KEGRADGPAQNASFSNDFEL 164
           KG+ +YVAD  NH+IRKI  L   V+T+AG G++   KEG A GP Q  S   D  L
Sbjct: 282 KGDTVYVADTENHLIRKIDLLEGKVSTLAGVGTQGTDKEGGAMGPQQPISSPWDVML 338


>gi|72094692|ref|XP_795849.1| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
           purpuratus]
          Length = 706

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/140 (29%), Positives = 60/140 (42%), Gaps = 12/140 (8%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-------GYSDGEPGSARFDKPKSFAV 111
           + V DS  S+   +SF   ++  VK + G  +        G SDG+   A+   P   A 
Sbjct: 407 MFVADSESSSIRRVSF---KDGAVKNVVGGEMDPMNLFAYGDSDGKGLEAKLQHPLGVAW 463

Query: 112 DMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
           D    ++VAD  NH I+ I        T AG G   +G  D     A F+    L   P 
Sbjct: 464 DHSKLLFVADSYNHKIKMIDPEERYCATYAGTGEPGKGGDDEHILKAQFNEPGGLAISPC 523

Query: 170 ICALLISDHGNQLIRQINLK 189
            C + ++D  N  IR I++K
Sbjct: 524 GCKIYVADTNNHTIRCIDIK 543



 Score = 45.4 bits (106), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 11/102 (10%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V   P  + L V D+  +    ++     + VV+   G    G++DG    ARF  P
Sbjct: 212 PGKVTTNPEGTLLAVSDTGHNRVIIVAL----DGVVQHCIGGPETGFNDGLYREARFHSP 267

Query: 107 K--SFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
           +   +A D+   IYVAD  NH IRKI      VTTIAG G +
Sbjct: 268 QGLCWAQDV---IYVADTENHAIRKIDLKEQCVTTIAGTGEQ 306


>gi|348689157|gb|EGZ28971.1| hypothetical protein PHYSODRAFT_474561 [Phytophthora sojae]
          Length = 98

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/69 (43%), Positives = 39/69 (56%), Gaps = 2/69 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
           V GY+DG   +  F  P+S AV   G++YVAD +N VI K T+ GV T+   GS   G  
Sbjct: 11  VSGYADGYGNAVLFKNPRSLAVGSDGHLYVADDNNRVI-KFTSGGVMTVL-AGSTSSGLV 68

Query: 150 DGPAQNASF 158
           DG   +A F
Sbjct: 69  DGTGTSARF 77


>gi|381205317|ref|ZP_09912388.1| NHL repeat containing protein [SAR324 cluster bacterium JCVI-SC
           AAA005]
          Length = 458

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 62/122 (50%), Gaps = 6/122 (4%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           +AG G  GY+DG   SA+F+ P +F  + K +++VAD  N+ IR+I +L    +      
Sbjct: 271 IAGGGSIGYTDGSGISAQFNAPNAFT-NTKDSLFVADTCNNTIRRI-DLDTNIVTSLLPP 328

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
           + G   G + +   ++  +LT   +   L I+D GN  I++I+L   DC +   +     
Sbjct: 329 EPGCGSGSSDSDKLNSPTDLTTNGY--QLYIADKGNSAIKRIDLN--DCKEQEWTNCTFD 384

Query: 205 AV 206
            +
Sbjct: 385 KI 386


>gi|290995833|ref|XP_002680487.1| serine/threonine kinase [Naegleria gruberi]
 gi|284094108|gb|EFC47743.1| serine/threonine kinase [Naegleria gruberi]
          Length = 1078

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 8/120 (6%)

Query: 39  DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP 98
           D +  +IN H +  R  SSD ++   + +   TL+ P+S  +  +    +G     DG  
Sbjct: 133 DVYICDINNHRI--RKVSSDGVISTIAGTGRTTLN-PISPNTYRQEGGYNG-----DGLA 184

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
            +   + PK   VD KG++Y AD  N +IRK++N  +TTIAG          G A N +F
Sbjct: 185 TATDLNYPKYAQVDSKGDVYFADTYNGLIRKVSNGYITTIAGSKQNSTVNTFGMALNTTF 244


>gi|290973162|ref|XP_002669318.1| predicted protein [Naegleria gruberi]
 gi|284082864|gb|EFC36574.1| predicted protein [Naegleria gruberi]
          Length = 832

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 61/117 (52%), Gaps = 6/117 (5%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITN 132
           + +  ++  + G G +GYS  +G    A  D P S A   + G++   D +N+ IRKI+N
Sbjct: 255 IDKNGIISTIIG-GSKGYSGDEGNAADAMIDGPYSLAFHPVSGDLTFVDINNYRIRKISN 313

Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            G ++TIAG G K      G A NA       ++F P+   L I++  N  IR+I+L
Sbjct: 314 KGIISTIAGNGEKGSIGDGGSALNAQIYYSVSISFSPN-GELYIANEWNNRIRKISL 369



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 56/116 (48%), Gaps = 2/116 (1%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S    V  +AG G  G+S DG P   A+   P S A +  G ++++D  N+ IRKI   
Sbjct: 199 ISTNGNVYTVAGTGEFGFSGDGGPAFDAQLFFPSSIAFNNGGELFISDLGNNRIRKIDKN 258

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G+ +   GGSK     +G A +A     + L F P    L   D  N  IR+I+ K
Sbjct: 259 GIISTIIGGSKGYSGDEGNAADAMIDGPYSLAFHPVSGDLTFVDINNYRIRKISNK 314



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 50/146 (34%), Positives = 66/146 (45%), Gaps = 21/146 (14%)

Query: 55  GSSDLIVLDSSRSAFYTLSFP----LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSF 109
           G SDL+ L SS    +    P    + +   +  LAG+G +   DG P    RF    S 
Sbjct: 119 GPSDLL-LTSSGELLFMDGGPRLKKIDKNGFLTCLAGNGSEILPDGFPAKDTRFTSIASI 177

Query: 110 AVD-MKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
           A   + G IY+++  NH IR+I TN  V T+AG G        GPA +A         F 
Sbjct: 178 AQHPVTGEIYLSETGNHTIRRISTNGNVYTVAGTGEFGFSGDGGPAFDAQL-------FF 230

Query: 168 PHICA------LLISDHGNQLIRQIN 187
           P   A      L ISD GN  IR+I+
Sbjct: 231 PSSIAFNNGGELFISDLGNNRIRKID 256



 Score = 41.6 bits (96), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 45/173 (26%), Positives = 84/173 (48%), Gaps = 29/173 (16%)

Query: 23  SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
           S G   ++G     +IDG      P+S+   P S DL  +D +    Y +   +S + ++
Sbjct: 268 SKGYSGDEGNAADAMIDG------PYSLAFHPVSGDLTFVDINN---YRIR-KISNKGII 317

Query: 83  KRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
             +AG+G +G S G+ GSA   +     S +    G +Y+A++ N+ IRKI+  G+ +  
Sbjct: 318 STIAGNGEKG-SIGDGGSALNAQIYYSVSISFSPNGELYIANEWNNRIRKISLSGIISTY 376

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICAL------LISDHGNQLIRQI 186
            GG+  +G           ++ + + F+P   ++      LI+D  + LIR++
Sbjct: 377 SGGTFGDGYD---------ASSWGVLFLPQGVSITPNGDVLIADSKHALIRKL 420



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 47/98 (47%), Gaps = 3/98 (3%)

Query: 89  GVQGYSDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
           G     DG  P  A + +P S  +   G++++AD  ++ IRK++N  V+TIAG    +  
Sbjct: 46  GTNSVGDGLPPDGATYVRPVSVKIAQNGDVFIADHDHYTIRKVSNGKVSTIAGKNLVQGD 105

Query: 148 RADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIR 184
             DG  A NA  S   +L        LL  D G +L +
Sbjct: 106 SGDGTLAANALLSGPSDLLLTSS-GELLFMDGGPRLKK 142


>gi|406947628|gb|EKD78528.1| NHL repeat containing protein [uncultured bacterium]
          Length = 810

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 52/113 (46%), Gaps = 15/113 (13%)

Query: 91  QGYSDGEP-----GSAR-----FDKPKSFAVDMKGNI-YVADKSNHVIRKIT----NLGV 135
            GY +GE      G +R     F +P   AVD  G   YVAD  N VIR++T    N G 
Sbjct: 488 NGYQEGEACPNQFGQSRKNCAYFSRPGGIAVDSNGKYAYVADTGNEVIRRVTLYGKNKGK 547

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           T +  G  ++ G  D   ++A F+    +T       L ++D  N  IRQ+ +
Sbjct: 548 TKLIAGSPQESGYQDATRKDAQFNVPIAVTIDNADNYLYVADRDNHTIRQVRI 600



 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 44/93 (47%), Gaps = 5/93 (5%)

Query: 60  IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           IV D + S +   +F      ++   VV  + G G  G + G   S +F  P +  +D  
Sbjct: 51  IVSDGAGSFYLADTFNGVIRKINSNGVVSTVVGAGGYGDTTGSGSSTKFALPAAVGLDSS 110

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
           GN+Y+AD  N  I+K     V+TIA    + EG
Sbjct: 111 GNVYIADTGNGKIKKFNGSTVSTIASDFDRPEG 143



 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 50/96 (52%), Gaps = 6/96 (6%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR--KITNLGVTTIAGGGSKKEGR 148
           GY D     A+F+ P +  +D   N +YVAD+ NH IR  +I++  VTT+ G  SKK G 
Sbjct: 559 GYQDATRKDAQFNVPIAVTIDNADNYLYVADRDNHTIRQVRISDGQVTTLVGNPSKK-GY 617

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
            DG    A  +   E+ +  +   +  S+ G  ++R
Sbjct: 618 LDGKFAEAYLNLPVEVFY--NRGNVYFSEAGTHMVR 651



 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 34/53 (64%), Gaps = 3/53 (5%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
           +  VK +AGDG +GY++G+  + +FD P   A   KG N+ +AD  N +IRK+
Sbjct: 657 DEAVKLVAGDGNRGYNNGDRDNTQFDNPVGLA--RKGDNLLIADSLNDLIRKV 707



 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 43/94 (45%), Gaps = 16/94 (17%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           A FD P+    D  G+ Y+AD  N VIRKI + G V+T+ G G    G   G   +  F+
Sbjct: 43  AYFDFPEDIVSDGAGSFYLADTFNGVIRKINSNGVVSTVVGAGGY--GDTTGSGSSTKFA 100

Query: 160 NDFELTFVPHICAL------LISDHGNQLIRQIN 187
                  +P    L       I+D GN  I++ N
Sbjct: 101 -------LPAAVGLDSSGNVYIADTGNGKIKKFN 127



 Score = 37.4 bits (85), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 58/120 (48%), Gaps = 9/120 (7%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKEG 147
           +GY DG+   A  + P     + +GN+Y ++   H++R + + G   V  +AG G++  G
Sbjct: 615 KGYLDGKFAEAYLNLPVEVFYN-RGNVYFSEAGTHMVR-VADRGDEAVKLVAGDGNR--G 670

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVS 207
             +G   N  F N   L        LLI+D  N LIR+++L   +    ++   A+  VS
Sbjct: 671 YNNGDRDNTQFDNPVGLARKGD--NLLIADSLNDLIRKVDLGDGENIPYTEDAPAVSGVS 728


>gi|290970060|ref|XP_002668023.1| predicted protein [Naegleria gruberi]
 gi|284081057|gb|EFC35279.1| predicted protein [Naegleria gruberi]
          Length = 318

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 56/112 (50%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
               +  +AG G+ GY+ DG +  +A+   P    +D K  +Y +D SNH IRKI +N  
Sbjct: 81  NNGTIVTIAGTGIAGYNGDGIDATTAQLFSPYGIGIDSKDLVYFSDSSNHRIRKILSNGT 140

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG GS       G A  A     F LT  P+   ++ +D  N  IR+I
Sbjct: 141 IVTIAGIGSNGFSGDGGLATKAKIYLPFGLTISPN-DDVIFADVNNNRIRKI 191



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/66 (45%), Positives = 42/66 (63%), Gaps = 5/66 (7%)

Query: 79  ESVVKRLAGDGVQGYSDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VT 136
           +S++K +AG   QG  DG    SA  + P S A++  G +Y+ADKS+H IRK+ N G + 
Sbjct: 30  KSIIKTIAG---QGVCDGISATSASLNGPSSTAINSLGEVYIADKSHHRIRKVYNNGTIV 86

Query: 137 TIAGGG 142
           TIAG G
Sbjct: 87  TIAGTG 92


>gi|290986699|ref|XP_002676061.1| predicted protein [Naegleria gruberi]
 gi|284089661|gb|EFC43317.1| predicted protein [Naegleria gruberi]
          Length = 819

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/64 (46%), Positives = 41/64 (64%), Gaps = 4/64 (6%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
           +AG+G QGY+ +G  G  +   +P S  +D  GNIY ADK N +IRKI  +N  V T+AG
Sbjct: 207 IAGNGKQGYNGEGLSGLDSALKEPYSVDMDSDGNIYFADKGNSIIRKILASNGAVITVAG 266

Query: 141 GGSK 144
            G+K
Sbjct: 267 NGTK 270



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 34/112 (30%), Positives = 57/112 (50%), Gaps = 3/112 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK-ITNLG 134
           LS  +V+  + G      +DG     +   P     D+ G++ +AD+ NH IRK ++N  
Sbjct: 477 LSNGTVIT-VVGSTTGSTADGPALQTKLTIPNGIGFDLDGSLLIADRYNHRIRKLLSNGT 535

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG G   +G  D  A N+S +    +  +P+   +LI+D  N  IR++
Sbjct: 536 LITIAGTGVSGKGGDDILATNSSLNQPSGVCTMPN-GDILIADRANHQIRRL 586



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 56/116 (48%), Gaps = 7/116 (6%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
            +K +AG G  GY+ DG P  S+  + P++  +     IY+ D SN  IRKI N G + T
Sbjct: 591 TIKTIAGTGSSGYNGDGLPATSSTLNYPQTVTM-FHNEIYMVDTSNSRIRKILNNGTMIT 649

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI--NLKPE 191
           I G G + +         +   + + +TF      + ISD+ N  I+ +  N  PE
Sbjct: 650 IVGNGIRGDSLDGLDPLESKLKSPYHVTFDSR-GRMYISDYSNHKIKVVDFNSSPE 704



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 50/106 (47%), Gaps = 6/106 (5%)

Query: 84  RLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
           R AG G QGY+ DG+       + P S  +   G +Y +D SN+++RK+   G      G
Sbjct: 96  RFAGTGFQGYNGDGKLALETDLNYPGSLMM-YDGELYFSDLSNYLVRKVDRNGYVRTVAG 154

Query: 142 GSKKEGRADGPAQNASFSNDFEL-TFVPHICALLISDHGNQLIRQI 186
             K  G AD    N   +N +    +V  +  +LI+D     IR++
Sbjct: 155 SIKSYGYADNVLANE--TNIYPTGIYVTKLGEVLIADSSIHRIRKV 198


>gi|289774196|ref|ZP_06533574.1| NHL repeat containing protein [Streptomyces lividans TK24]
 gi|289704395|gb|EFD71824.1| NHL repeat containing protein [Streptomyces lividans TK24]
          Length = 605

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 62/115 (53%), Gaps = 7/115 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S   +V D++R     L+     ES V+R+ G G +G++DG    A F +P+  A+   G
Sbjct: 189 SGTFLVSDTTRHQLVELAE--DGESPVRRI-GSGRRGFTDGPADRAEFSEPQGLALLGDG 245

Query: 116 NIYVADKSNHVIRKITNLG--VTTIAGGGSK-KEGR-ADGPAQNASFSNDFELTF 166
           ++ VAD  NH +R+   +   VTT+AG G + + G  + G A+  S S+ +++  
Sbjct: 246 SVVVADTVNHALRRYEPVTGEVTTLAGTGRQHRPGEPSRGAARAVSLSSPWDVAL 300


>gi|21218701|ref|NP_624480.1| hypothetical protein SCO0142 [Streptomyces coelicolor A3(2)]
 gi|6066645|emb|CAB58305.1| conserved hypothetical protein SCJ33.06c [Streptomyces coelicolor
           A3(2)]
          Length = 601

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 62/115 (53%), Gaps = 7/115 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S   +V D++R     L+     ES V+R+ G G +G++DG    A F +P+  A+   G
Sbjct: 185 SGTFLVSDTTRHQLVELAE--DGESPVRRI-GSGRRGFTDGPADRAEFSEPQGLALLGDG 241

Query: 116 NIYVADKSNHVIRKITNLG--VTTIAGGGSK-KEGR-ADGPAQNASFSNDFELTF 166
           ++ VAD  NH +R+   +   VTT+AG G + + G  + G A+  S S+ +++  
Sbjct: 242 SVVVADTVNHALRRYEPVTGEVTTLAGTGRQHRPGEPSRGAARAVSLSSPWDVAL 296


>gi|418721636|ref|ZP_13280812.1| hypothetical protein LEP1GSC101_0633 [Leptospira borgpetersenii
           str. UI 09149]
 gi|410741989|gb|EKQ90740.1| hypothetical protein LEP1GSC101_0633 [Leptospira borgpetersenii
           str. UI 09149]
          Length = 246

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 1/81 (1%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
           ++LD      +T    ++   VV   AG G++   DG   +A F  P S  +D  GNI+V
Sbjct: 35  MILDCILKECHTCKLKVTNNPVVSPFAGTGIRESIDGTTQTASFKTPFSLELDTFGNIFV 94

Query: 120 ADKSNHVIRKITNLG-VTTIA 139
           +D  +++IRKI   G VTT++
Sbjct: 95  SDSDSNLIRKIDRFGNVTTLS 115



 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 41/73 (56%), Gaps = 7/73 (9%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKK 145
           G+ G+ +G+  ++ F  P    +D + N+YV +  NH IRKI NL    V+T++GG S  
Sbjct: 163 GIGGFQNGDRLNSLFKGPLFMDLDRERNLYVGELGNHTIRKI-NLNSETVSTLSGGVS-- 219

Query: 146 EGRADGPAQNASF 158
            G  DG   +A F
Sbjct: 220 -GYLDGDLTSAQF 231


>gi|348681648|gb|EGZ21464.1| hypothetical protein PHYSODRAFT_329405 [Phytophthora sojae]
          Length = 145

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 52/108 (48%), Gaps = 8/108 (7%)

Query: 60  IVLDSSRSAFY-----TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           I +DSS S +       L   ++ +  VK +AG G +G+ +G   SARF  P +      
Sbjct: 3   IAIDSSDSIYVLDMHRILKVTVTLDGDVKVVAGSGAKGFVNGFGESARFSTPWALTFGSD 62

Query: 115 GNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
           G +YV D  N  IRK  IT   V T A G  +  G  +G   NA+F N
Sbjct: 63  GYLYVPDLDNDCIRKVDITTTEVMTYA-GICQTSGAVNGLTTNATFDN 109


>gi|392403177|ref|YP_006439789.1| NHL repeat containing protein [Turneriella parva DSM 21527]
 gi|390611131|gb|AFM12283.1| NHL repeat containing protein [Turneriella parva DSM 21527]
          Length = 392

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 58/114 (50%), Gaps = 11/114 (9%)

Query: 81  VVKRLAGDG----VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLG 134
           VV  LAG G    + G  DG   +ARF   ++   D   N+YVAD SN+ IRK  I +  
Sbjct: 107 VVSTLAGPGPGTSLSGDVDGIGEAARFSSMRAITTDGT-NLYVADNSNNKIRKIVIASRA 165

Query: 135 VTTIAG--GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           VT +AG   G    G  DG A +A F+N   +  V     L ++D  N+ +R+I
Sbjct: 166 VTVLAGPAAGDTTSGDTDGTANDARFNNPQGI--VTDGTNLFVADSLNRKVRKI 217



 Score = 45.1 bits (105), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 48/105 (45%), Gaps = 7/105 (6%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG--G 141
           AGD   G +DG    ARF+ P+    D   N++VAD  N  +RKI      V+T+AG   
Sbjct: 174 AGDTTSGDTDGTANDARFNNPQGIVTDGT-NLFVADSLNRKVRKIVIASGIVSTLAGPAQ 232

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G    G  DG A  A F     +T       L + D  N  IR+I
Sbjct: 233 GVTGSGDTDGSANTARFGLPGAMT--TDGVNLYLCDSSNHKIRKI 275



 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 46/101 (45%), Gaps = 25/101 (24%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNAS 157
           SARF+ P     D   ++YVAD SN  IRK  I    VTT+A           GPAQ A+
Sbjct: 304 SARFNNPLGITTDGT-SLYVADTSNQKIRKIVIATGAVTTVA-----------GPAQGAT 351

Query: 158 FSNDFELT------FVPHICA-----LLISDHGNQLIRQIN 187
            S D E        F PH        L + ++ N  IR+IN
Sbjct: 352 TSGDTEGVGTSARFFNPHGITTDGYRLFVGEYWNHKIRRIN 392



 Score = 43.5 bits (101), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 39/127 (30%), Positives = 62/127 (48%), Gaps = 7/127 (5%)

Query: 64  SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
           + RS+   L+  L+ +++     G    G +DG   +ARF+       D + NIY  D  
Sbjct: 36  AGRSSNCALTLSLTVDTLAGPSQGTAASGDTDGTGSAARFNAVTGVTTDGQ-NIYSTDYF 94

Query: 124 NHVIRK--ITNLGVTTIA--GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHG 179
            H IRK  I++  V+T+A  G G+   G  DG  + A FS+   +T       L ++D+ 
Sbjct: 95  GHKIRKTVISSGVVSTLAGPGPGTSLSGDVDGIGEAARFSSMRAIT--TDGTNLYVADNS 152

Query: 180 NQLIRQI 186
           N  IR+I
Sbjct: 153 NNKIRKI 159


>gi|290983716|ref|XP_002674574.1| predicted protein [Naegleria gruberi]
 gi|284088165|gb|EFC41830.1| predicted protein [Naegleria gruberi]
          Length = 2271

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 6/107 (5%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
            AG G  GYS DG  G SA  + P + A D  GN+ ++D  N+ IRK+ N  ++T+AG  
Sbjct: 291 FAGSGTSGYSGDGGLGTSALLNGPSALAFDSSGNLLISDSFNNRIRKVANGTISTLAGNS 350

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPH---ICALLISDHGNQLIRQI 186
           ++  G        ASFS+   + +  +      +LISD  N ++R++
Sbjct: 351 NRNFGNG-VLGTLASFSSPNSVYYTGNDDSAGGILISDTNNHVLRRL 396



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 54/108 (50%), Gaps = 6/108 (5%)

Query: 81  VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           ++  +AG GV  YS   G   +A+ + P+   +   G I  +D  NH IRKI N  ++T+
Sbjct: 84  IMSTIAGTGVANYSGDGGAAVNAQLNSPQGIGILSTGAIVFSDTLNHCIRKIENGIISTL 143

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG GS   G   G A +A  +    L    +   +  ++ GN LIR+I
Sbjct: 144 AGNGSP--GLTVGSAISAQLNTPTALIVASN--DIYFAESGNHLIRRI 187



 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 54/123 (43%), Gaps = 19/123 (15%)

Query: 77  SEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +  ++V  LAGD   G+  DG P  S+R D P    + + G I   D+ N  IRKIT  G
Sbjct: 557 TSSALVTTLAGDRFNGFLGDGGPAKSSRLDSPGPIQLTLGGEIIFMDRGNQRIRKITKYG 616

Query: 135 -VTTIAGGGSKKEGR-ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
            + TI G GS    + A+G     S               + I+D GN LI+   L    
Sbjct: 617 NIFTIYGNGSTSVFKDANGLTLGGS-------------GEIYIADSGNNLIKM--LSAAK 661

Query: 193 CSK 195
           CS 
Sbjct: 662 CSN 664



 Score = 44.7 bits (104), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 2/107 (1%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
           E  ++  LAG+G  G + G   SA+ + P +  V    +IY A+  NH+IR+I++ G+ T
Sbjct: 136 ENGIISTLAGNGSPGLTVGSAISAQLNTPTALIV-ASNDIYFAESGNHLIRRISSSGMLT 194

Query: 138 IA-GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
           I  G  S   G +D     ASF  +  +  V H   L  SD  N  I
Sbjct: 195 IYIGESSGTSGSSDTGTAGASFKMNTPMGIVFHAGNLYFSDSKNYRI 241


>gi|348690525|gb|EGZ30339.1| hypothetical protein PHYSODRAFT_295167 [Phytophthora sojae]
          Length = 383

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/71 (43%), Positives = 39/71 (54%), Gaps = 3/71 (4%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEG 147
           V GY+DG   +  F KP+S AV    ++YVAD  N+ IRK+  T   VTT A G     G
Sbjct: 11  VSGYADGYGDAVFFKKPRSLAVGSDADLYVADVGNYYIRKVRLTTKEVTTYA-GTCTSSG 69

Query: 148 RADGPAQNASF 158
             DG A +A F
Sbjct: 70  ATDGVATSALF 80



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 60  IVLDSSRSAFYTLS---FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           IV DSS + + T +      +    +  LAG    G  DG   SARF  P + A+   G+
Sbjct: 261 IVTDSSDNIYVTDNNRVMKFTSGGAMTVLAGSTSSGLVDGTGTSARFYHPDALAIGSDGD 320

Query: 117 IYVADKSNHVIRK---ITNLGVTTIAGG 141
           +YVADK+N+ IR+   IT    T++A G
Sbjct: 321 LYVADKANYCIRRLNLITKEVTTSVASG 348


>gi|386717879|ref|YP_006184205.1| hypothetical protein SMD_1470 [Stenotrophomonas maltophilia D457]
 gi|384077441|emb|CCH12027.1| Alr1013 protein [Stenotrophomonas maltophilia D457]
          Length = 470

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/147 (30%), Positives = 65/147 (44%), Gaps = 11/147 (7%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S+ L + S+  R AG G     DG    A F +P + AV ++  +YVAD     IR +  
Sbjct: 301 SYHLGQRSLQWR-AGSGSIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358

Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
            G  V T+ G G  + G  DGP   AS      +        L I+D GN  +R + L  
Sbjct: 359 RGDLVQTLVGQGPWRFGSEDGPRAQASLQYPQAIALSADAPLLWIADTGNGRLRTLRLGG 418

Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
                +P        +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLHGPAGLAVGAGAVWI 445



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 48/103 (46%), Gaps = 12/103 (11%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
           + R  G G   + DG    A F +P++  ++ + ++YVAD  NH +R+I  L G+     
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDSLYVADTGNHAVRRINLLTGIVDTLC 255

Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
           G  +     +GP AQ    S D       H   L I+D  NQL
Sbjct: 256 GNGRPGAPMEGPVAQARQVSLD-------HPVGLAIAD--NQL 289


>gi|443288042|ref|ZP_21027136.1| Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Micromonospora lupini str. Lupac 08]
 gi|385881808|emb|CCH22229.1| Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Micromonospora lupini str. Lupac 08]
          Length = 612

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 49/92 (53%), Gaps = 6/92 (6%)

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP-HICA- 172
           GN+ V+D + H + ++   G T +   GS + GRADGPA  A+FS    L  +P H+   
Sbjct: 185 GNLLVSDSARHSLVEVAPDGETPVRRIGSGERGRADGPAAAATFSEPQGLCLLPTHVAEV 244

Query: 173 ----LLISDHGNQLIRQINLKPEDCSKSSQSG 200
               L+++D  N L+R + L+  +    + SG
Sbjct: 245 AGYDLVVADTVNHLLRGVRLESGEVVTVAGSG 276



 Score = 44.7 bits (104), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 42/174 (24%), Positives = 72/174 (41%), Gaps = 12/174 (6%)

Query: 25  GLLLEDGYTVTTVIDGHQLE--INPHS----VIDRPGSSDLIVLDS----SRSAFYTLSF 74
           G+ LE G  VT    G Q    ++ H+     +D     DL   D     + +  + L +
Sbjct: 261 GVRLESGEVVTVAGSGRQWRASVDDHAHDALAVDLSSPWDLAWYDDKLIIAMAGIHQLWW 320

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNL 133
               +      AG  V+   DG    A   +P   +V   G  +++AD     +R + N 
Sbjct: 321 FDPIKRTAGMYAGTTVEALRDGPLAEAWMAQPSGLSVSADGVRLWIADSETSAVRYVENG 380

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            + T  G G    G  DGPA++A   +   +  +P   ++LI+D  N  +R+ +
Sbjct: 381 VLGTAVGQGLFDFGHVDGPAESALLQHPLGVCALPD-GSVLIADTYNGAVRRFD 433



 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 70/138 (50%), Gaps = 21/138 (15%)

Query: 47  PHSVIDRPGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS 100
           P + +  PG +      +L+V DS+R +   ++     E+ V+R+ G G +G +DG   +
Sbjct: 169 PETALRFPGKAVLLPDGNLLVSDSARHSLVEVAP--DGETPVRRI-GSGERGRADGPAAA 225

Query: 101 ARFDKPKSFAV------DMKG-NIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRADG 151
           A F +P+   +      ++ G ++ VAD  NH++R  ++ +  V T+AG G +     D 
Sbjct: 226 ATFSEPQGLCLLPTHVAEVAGYDLVVADTVNHLLRGVRLESGEVVTVAGSGRQWRASVDD 285

Query: 152 PAQNA---SFSNDFELTF 166
            A +A     S+ ++L +
Sbjct: 286 HAHDALAVDLSSPWDLAW 303


>gi|390342982|ref|XP_001201679.2| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
           purpuratus]
          Length = 338

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/140 (29%), Positives = 60/140 (42%), Gaps = 12/140 (8%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-------GYSDGEPGSARFDKPKSFAV 111
           + V DS  S+   ++F   ++  VK + G  +        G SDG+   AR   P   A 
Sbjct: 39  MFVADSESSSIRRVAF---KDGAVKNVVGGEMDPMNLFAYGDSDGKGLEARLQHPLGVAW 95

Query: 112 DMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
           D    ++VAD  NH I+ I        T AG G   +G  D     A F+    L   P 
Sbjct: 96  DHSKLLFVADSYNHKIKMIDPEERYCATYAGTGEPGKGGDDEHILKAQFNEPGGLAISPC 155

Query: 170 ICALLISDHGNQLIRQINLK 189
            C + ++D  N  IR I++K
Sbjct: 156 GCKIYVADTNNHTIRCIDIK 175


>gi|297848174|ref|XP_002891968.1| hypothetical protein ARALYDRAFT_892829 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337810|gb|EFH68227.1| hypothetical protein ARALYDRAFT_892829 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1058

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 4/115 (3%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLG 134
           E + + ++   G +G+ DG    A F++P+  A + K N +YVAD  NH +R+I   N  
Sbjct: 600 EGNFIVQIGSTGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREIDFVNER 659

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSND-FELTFVPHICALLISDHGNQLIRQINL 188
           V T+AG G+K      G        N  +++ F P    + ++  G   I + N+
Sbjct: 660 VRTLAGNGTKGSDYQGGRKGTKQLLNSPWDVCFEPVNEKVYVAMAGQHQIWEYNV 714



 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 43/103 (41%), Gaps = 2/103 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G +DG         P        G IY+ D  NH I+K+  +    +   G+ K G  DG
Sbjct: 794 GDNDGVGAEVLLQHPLGVLCAKDGQIYLTDSYNHKIKKLDPITKRVVTVAGTGKAGFKDG 853

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDC 193
             + A  S    L    +   L ++D  N LIR I+L K ED 
Sbjct: 854 KVKGAQLSEPAGLAITEN-GRLFVADTNNSLIRYIDLNKGEDA 895



 Score = 38.5 bits (88), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 2/56 (3%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           P+++  V   +AG G  G+ DG+   A+  +P   A+   G ++VAD +N +IR I
Sbjct: 834 PITKRVVT--VAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI 887


>gi|441163237|ref|ZP_20968264.1| NHL repeat-containing protein [Streptomyces rimosus subsp. rimosus
           ATCC 10970]
 gi|440616376|gb|ELQ79518.1| NHL repeat-containing protein [Streptomyces rimosus subsp. rimosus
           ATCC 10970]
          Length = 551

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 45/79 (56%), Gaps = 2/79 (2%)

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
            AVD  GN+Y+AD +N  +RK+T  G +TT+AG G+       GPA ++  +    +T  
Sbjct: 1   MAVDAAGNLYIADSANQRVRKVTPQGTITTVAGTGTAGYTSDGGPATSSQLNTPAYVTL- 59

Query: 168 PHICALLISDHGNQLIRQI 186
                L I++ G+Q IR++
Sbjct: 60  DDAGNLYIAESGSQRIRKV 78



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 46/81 (56%), Gaps = 3/81 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ +  +  +AG G  GY SDG P  S++ + P    +D  GN+Y+A+  +  IRK+T  
Sbjct: 22  VTPQGTITTVAGTGTAGYTSDGGPATSSQLNTPAYVTLDDAGNLYIAESGSQRIRKVTTD 81

Query: 134 G-VTTIAGGGSKKEGRADGPA 153
           G +TT+AG G+       GPA
Sbjct: 82  GIITTVAGNGTAGYVDDGGPA 102



 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ + ++  +AG+G  GY  DG P +A R   P+  A+D  GN+Y+AD  N+ +R +T
Sbjct: 78  VTTDGIITTVAGNGTAGYVDDGGPATATRLYGPRGVALDRAGNLYIADGDNNRVRGVT 135


>gi|344206806|ref|YP_004791947.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
 gi|343778168|gb|AEM50721.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
          Length = 470

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/147 (30%), Positives = 65/147 (44%), Gaps = 11/147 (7%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S+ L + S+  R AG G     DG    A F +P + AV ++  +YVAD     IR +  
Sbjct: 301 SYHLGQRSLQWR-AGSGSIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358

Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-- 188
            G  V T+ G G  + G  DGP   AS      +        L I+D GN  +R + L  
Sbjct: 359 RGDLVQTLVGQGPWRFGSEDGPRAQASLQYPQAIALSADAPLLWIADTGNGRLRTLRLGG 418

Query: 189 -----KPEDCSKSSQSGSALGAVSVWV 210
                +P        +G A+GA +VW+
Sbjct: 419 GELTTQPLPRRLHGPAGLAVGAGAVWI 445



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 48/103 (46%), Gaps = 12/103 (11%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
           + R  G G   + DG    A F +P++  ++ + ++YVAD  NH +R+I  L G+     
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDSLYVADTGNHAVRRINLLTGIVDTLC 255

Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
           G  +     +GP AQ    S D       H   L I+D  NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPVGLAIAD--NQL 289


>gi|392404857|ref|YP_006441469.1| NHL repeat containing protein [Turneriella parva DSM 21527]
 gi|390612811|gb|AFM13963.1| NHL repeat containing protein [Turneriella parva DSM 21527]
          Length = 697

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 7/100 (7%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAG--GGSKKE 146
           +G +DG   +ARF+ P+    D   N+YVAD +N  IRK  I+   VTTIAG   GS   
Sbjct: 478 RGDTDGTGTAARFNVPEGITTDGT-NLYVADSTNSKIRKIVISTRVVTTIAGPAQGSAAT 536

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G  D     A F+    +T+      L I+D  N  IR++
Sbjct: 537 GDTDATGNAARFNKPTGITY--DGTNLFIADGNNNKIRKL 574



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 44/97 (45%), Gaps = 14/97 (14%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPG-----------SARFDKPKSFAVDMKGNIYVADKS 123
           P++  + V   AG G   Y    PG           SARF + K    D   N+YV D  
Sbjct: 333 PITLSANVSTFAGPGPGCYFPYNPGCPEGDLDATGTSARFRQIKGVTTDGT-NLYVVDTG 391

Query: 124 NHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASF 158
           N+ IRKI  +   VT +AGG S + G ADG    A F
Sbjct: 392 NNNIRKIVISTGAVTKLAGGTSTEFGDADGTGSTARF 428



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 65/141 (46%), Gaps = 17/141 (12%)

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG----DGVQGYSDGEPGSARFDKPKSFAVD 112
           ++L V DS+ S    +   +    VV  +AG        G +D    +ARF+KP     D
Sbjct: 501 TNLYVADSTNSKIRKI---VISTRVVTTIAGPAQGSAATGDTDATGNAARFNKPTGITYD 557

Query: 113 MKGNIYVADKSNHVIRK--ITNLGVTTIAG--GGSKKEGRADGPAQNASFSNDFELTFVP 168
              N+++AD +N+ IRK  I+   VTTIAG   G+   G  D     A F +   +T   
Sbjct: 558 GT-NLFIADGNNNKIRKLVISTGVVTTIAGPSQGTITSGDTDAVGNAARFYSPVGIT--T 614

Query: 169 HICALLISD---HGNQLIRQI 186
               L ++D   + N  IR+I
Sbjct: 615 DRTNLFVADGTGNRNNKIRKI 635


>gi|336313754|ref|ZP_08568676.1| thiol-disulfide isomerase-like thioredoxin [Rheinheimera sp. A13L]
 gi|335881693|gb|EGM79570.1| thiol-disulfide isomerase-like thioredoxin [Rheinheimera sp. A13L]
          Length = 498

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/127 (32%), Positives = 62/127 (48%), Gaps = 10/127 (7%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTT 137
            +VKRL G G+    DG   S+ F  P+   +  K  +YVAD  NH IR+I  +   V T
Sbjct: 222 KLVKRL-GSGIAELKDGHSDSSAFSSPQGLVLTDKA-LYVADTGNHAIRRIDLSTFQVNT 279

Query: 138 IAGGGSKKEGR---ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
           IAG G   +GR      PAQ  S  + ++L    +   L I+  G+  I  ++LK  + +
Sbjct: 280 IAGNGELAQGRLISGSTPAQ-VSLRSPWDLALDNN--TLYIAMAGSHQIWTLDLKSSELN 336

Query: 195 KSSQSGS 201
             + +G 
Sbjct: 337 LFAGTGQ 343



 Score = 37.4 bits (85), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 62/128 (48%), Gaps = 9/128 (7%)

Query: 69  FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVI 127
            +TL    SE ++    AG G +   DG+   A F++P   A  ++GN ++VAD     +
Sbjct: 325 IWTLDLKSSELNL---FAGTGQEALLDGKRRDAAFNQPSGLA--LRGNKLWVADAEASAV 379

Query: 128 RKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           R+I  ++  V T+ G G  + G  DG  + A   ++ ++  +     L ++D  N  IR 
Sbjct: 380 RQIDLSSGKVDTLVGQGLFEFGLKDGGFKRALLQHNKDVVALDK-NTLAVADTYNHKIRL 438

Query: 186 INLKPEDC 193
           ++L  +  
Sbjct: 439 LDLDKQQV 446


>gi|359493626|ref|XP_003634638.1| PREDICTED: LOW QUALITY PROTEIN: protein argonaute 1B-like [Vitis
           vinifera]
          Length = 867

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 30/45 (66%)

Query: 123 SNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
           SN  IRKI + GVTTI GG S   G  +GP ++A FS DF++ +V
Sbjct: 10  SNLAIRKIGDSGVTTIVGGKSNVAGYGNGPCEDAKFSRDFDVVYV 54


>gi|290972869|ref|XP_002669173.1| predicted protein [Naegleria gruberi]
 gi|284082716|gb|EFC36429.1| predicted protein [Naegleria gruberi]
          Length = 1010

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 58/132 (43%), Gaps = 23/132 (17%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDM 113
           S DL + DSSR       F +     +   AG G  GYS DG     + F+ P   A+  
Sbjct: 75  SDDLFISDSSRHVI----FRMFSNGTISLFAGIGFAGYSKDGYSALDSLFNSPNGIAMSP 130

Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
            GN+Y+AD  N  IR ++N  V++I G  SK  G A                 V     +
Sbjct: 131 NGNLYIADSQNDKIRIVSNGIVSSIDGTFSKPLGVA-----------------VSANNLV 173

Query: 174 LISDHGNQLIRQ 185
            I+D GN LI++
Sbjct: 174 YIADTGNNLIKK 185


>gi|119961614|ref|YP_949427.1| NHL repeat-containing protein [Arthrobacter aurescens TC1]
 gi|403528903|ref|YP_006663790.1| NHL repeat-containing protein [Arthrobacter sp. Rue61a]
 gi|119948473|gb|ABM07384.1| putative NHL repeat protein [Arthrobacter aurescens TC1]
 gi|403231330|gb|AFR30752.1| NHL repeat-containing protein [Arthrobacter sp. Rue61a]
          Length = 657

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 58/117 (49%), Gaps = 7/117 (5%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---- 130
           PL+ E  V  LAG G++G  DG+   A F +    A+D + NI+VAD     +R++    
Sbjct: 359 PLANE--VSILAGSGLEGLLDGKAEEAWFAQSSGLAIDGEDNIWVADSETSSLRRLVISD 416

Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           + + V T  G G    G  DG A  A   +   +T +P   ++ I+D  N  +R+ +
Sbjct: 417 SGVSVETAIGKGLFDFGFRDGEASEARLQHPLGVTVLPD-NSVAIADTYNGAVRRYD 472



 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 54/108 (50%), Gaps = 13/108 (12%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK-------S 108
           + + +V+DS       L   L     V+R+ G G +GY DG+   A+F++P+        
Sbjct: 188 NGNYLVVDSGHHRLVELRPDLE---TVERVIGSGTKGYLDGQSEIAQFNEPQGVTLLPSE 244

Query: 109 FAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAGGGSKKEGRADGPAQ 154
            A  +  +  VAD  NH +R +T  +  V T+AG G ++   A GPA+
Sbjct: 245 LAWKLGYDAVVADTVNHRLRGVTLSSGYVQTLAGNGVQRLLDA-GPAR 291


>gi|428166914|gb|EKX35881.1| hypothetical protein GUITHDRAFT_79273, partial [Guillardia theta
           CCMP2712]
          Length = 318

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/174 (25%), Positives = 78/174 (44%), Gaps = 14/174 (8%)

Query: 40  GHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP 98
           G +  IN P+ V   P  + +IV D+  +    +     E      LAG  + G ++G  
Sbjct: 107 GTKARINTPYGVKVSPCGNYVIVSDTGNNMIRKVDI---ESGYTNTLAGQSLAGTANGVG 163

Query: 99  GSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI---------TNLGVTTIAGGGSKKEGR 148
             A+F+ P    VD    + YV+D+ N+ IRKI         T+   + +   GS   G 
Sbjct: 164 TLAQFNMPVDVTVDWNETVAYVSDQGNNCIRKIDLLTAALDWTSATPSLVVVAGSGVAGL 223

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
            D    +A F N   +    +  +LL++D  +  IR+I+L   + +  + +G+A
Sbjct: 224 TDAVGLSAQFYNPTGVAVDWYGASLLVADSMDSTIRRIDLMTSEVTTLAGNGNA 277



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 55/119 (46%), Gaps = 5/119 (4%)

Query: 71  TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRK 129
           T +FP++  + +    GDG  G   G    AR + P    V   GN + V+D  N++IRK
Sbjct: 84  TATFPITAVAWI----GDGTAGNVQGYGTKARINTPYGVKVSPCGNYVIVSDTGNNMIRK 139

Query: 130 ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           +      T    G    G A+G    A F+   ++T   +     +SD GN  IR+I+L
Sbjct: 140 VDIESGYTNTLAGQSLAGTANGVGTLAQFNMPVDVTVDWNETVAYVSDQGNNCIRKIDL 198



 Score = 43.5 bits (101), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 65/143 (45%), Gaps = 15/143 (10%)

Query: 60  IVLDSSRSAFYTLSFP---LSE----ESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAV 111
           IV   SR+  Y+ ++    + E      +V  +AG  G   Y D    +ARF       V
Sbjct: 3   IVPSKSRNVLYSTTWDNYCVKEIDLTTKIVTTIAGLCGTNQYVDSTGTNARF--SNMIGV 60

Query: 112 DMKGN---IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
           D+  N   +YVAD  N+ +R+I      +T +A  G    G   G    A  +  + +  
Sbjct: 61  DLTKNEQYLYVADVGNNKLRRIDTATFPITAVAWIGDGTAGNVQGYGTKARINTPYGVKV 120

Query: 167 VPHICALLISDHGNQLIRQINLK 189
            P    +++SD GN +IR+++++
Sbjct: 121 SPCGNYVIVSDTGNNMIRKVDIE 143



 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 55/112 (49%), Gaps = 18/112 (16%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNLG--VTTIAGG 141
           +AG GV G +D    SA+F  P   AVD  G ++ VAD  +  IR+I  +   VTT+AG 
Sbjct: 215 VAGSGVAGLTDAVGLSAQFYNPTGVAVDWYGASLLVADSMDSTIRRIDLMTSEVTTLAGN 274

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICAL-------LISDHGNQLIRQI 186
           G+       G   N  ++ND E T VP   AL        +SD     IR++
Sbjct: 275 GNA------GFIDNL-YANDAEFT-VPFGVALSRDGKYVFVSDQNRNNIRKM 318


>gi|428174200|gb|EKX43097.1| hypothetical protein GUITHDRAFT_153351 [Guillardia theta CCMP2712]
          Length = 392

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 63/144 (43%), Gaps = 7/144 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P ++   P  + LIV D       T +    + S   +  G+G +GY DG   +A F+ P
Sbjct: 150 PSALALAPNQATLIVSDLELHQLLTFNMADGKTS---KFVGNGQRGYQDGA--TASFNGP 204

Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
           +       G  I VAD  NHV+RKI     +T+   G +   G  DG   +A F + + +
Sbjct: 205 RGLTFSPDGTYILVADTGNHVVRKIDVGSAMTSTLAGAAGMAGYQDGMGVDARFQSPYGI 264

Query: 165 TFVPHICALLISDHGNQLIRQINL 188
           T         +S+     IR I+L
Sbjct: 265 TIDSVGQFAYVSETDGNRIRMIDL 288



 Score = 44.7 bits (104), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 49/94 (52%), Gaps = 7/94 (7%)

Query: 103 FDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           F  P   +    GN I V D  N+V+R + +  V+TIAG G  ++G ADG A +A F++ 
Sbjct: 46  FSGPSGISTWANGNSILVVDTGNNVVRLLASAQVSTIAGNG--RQGVADGQATSAEFNSP 103

Query: 162 FELT----FVPHICALLISDHGNQLIRQINLKPE 191
            ++     F   +   L+SD  N+ IR + +  E
Sbjct: 104 SDVATKTDFNTGMVTALVSDTNNKKIRFLYVGQE 137



 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 72/167 (43%), Gaps = 13/167 (7%)

Query: 39  DGHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE 97
           DG     N P  +   P  + ++V D+       +    +  S +   AG  + GY DG 
Sbjct: 195 DGATASFNGPRGLTFSPDGTYILVADTGNHVVRKIDVGSAMTSTLAGAAG--MAGYQDGM 252

Query: 98  PGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPA 153
              ARF  P    +D  G   YV++   + IR I   TN+ V++I+G      G  DG  
Sbjct: 253 GVDARFQSPYGITIDSVGQFAYVSETDGNRIRMIDLATNV-VSSISGSSLGLAGSQDGEG 311

Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQI----NLKPEDCSKS 196
             A FS    +T + +   LL+SD  N  IR +    N+K + C  S
Sbjct: 312 YTARFSGPKGVTLM-NDQVLLVSDSNNNRIRAVATKDNIKIDLCVSS 357


>gi|423301816|ref|ZP_17279839.1| hypothetical protein HMPREF1057_02980 [Bacteroides finegoldii
           CL09T03C10]
 gi|408470907|gb|EKJ89439.1| hypothetical protein HMPREF1057_02980 [Bacteroides finegoldii
           CL09T03C10]
          Length = 469

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 49/96 (51%), Gaps = 2/96 (2%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPA 153
           DG+ G +  + P    +D + NIYVAD+ N  IRKI+ + G  T   G ++ EG  DG  
Sbjct: 375 DGDLGGSTLNHPCQICMDSQNNIYVADRYNCCIRKISLDAGTITTFAGKAQTEGYQDGLI 434

Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             A F +   +  V     + ++D  N  IR+I L+
Sbjct: 435 SEALFDDPMGIA-VDKNGVIYVADSDNYAIRRIALE 469



 Score = 44.3 bits (103), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 24/41 (58%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +GY DG    A FD P   AVD  G IYVAD  N+ IR+I
Sbjct: 426 TEGYQDGLISEALFDDPMGIAVDKNGVIYVADSDNYAIRRI 466


>gi|290987128|ref|XP_002676275.1| predicted protein [Naegleria gruberi]
 gi|284089876|gb|EFC43531.1| predicted protein [Naegleria gruberi]
          Length = 1182

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/136 (27%), Positives = 65/136 (47%), Gaps = 9/136 (6%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAV 111
           P   +   +DSS     T+ + +S   +V   AG G  GY +D  P  SA  + PK  A 
Sbjct: 441 PNVGEYYFVDSS-----TVKYLISSSGIVITKAGSGTTGYNADNIPATSAHLNLPKGIAR 495

Query: 112 DMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
              G++Y+ D  NH IRK+  N  + T+AG G+         A +A+ +N + +    + 
Sbjct: 496 STNGDLYITDSGNHRIRKVFFNQTIITVAGTGTSTYNGDGLLATSANINNPYGIVLTSN- 554

Query: 171 CALLISDHGNQLIRQI 186
             ++ S++    +R+I
Sbjct: 555 NEVIFSEYEGHRVRKI 570



 Score = 37.7 bits (86), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 54/110 (49%), Gaps = 6/110 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTT 137
           +K +AG G+ G       + ARF+ P + A      +Y++D ++HVIRK+   +NL ++T
Sbjct: 100 IKTIAGVGIPGSVTTTIATLARFNYPSALAFQNSDVMYLSDTNSHVIRKVILSSNL-IST 158

Query: 138 IAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            AG  G       DG A NA  +    +   P    L I++  N  IR +
Sbjct: 159 YAGQVGVPGSIGDDGVATNALLNTPGGIVISPVNNNLYIAETSNNKIRVV 208


>gi|86738767|ref|YP_479167.1| serine/threonine protein kinase [Frankia sp. CcI3]
 gi|86565629|gb|ABD09438.1| serine/threonine protein kinase [Frankia sp. CcI3]
          Length = 863

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 59/115 (51%), Gaps = 4/115 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S + ++    G GV GY+ DG P +A + + P    +   G +Y AD ++  I+K+T  
Sbjct: 682 ISPQGIITTYVGTGVAGYTGDGGPATAAKINGPNDLQMTDDGTLYFADLASDTIQKVTPD 741

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G +TT+AG G        GPA  A   N   LT  P    L ++D+ N  IR+++
Sbjct: 742 GIITTVAGTGEGGFSGDGGPATRARL-NVPSLTVGPDGRTLYLADYRNHRIRRVD 795



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 36/89 (40%), Positives = 52/89 (58%), Gaps = 2/89 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           +A+ D P S A D  GNIY+ D  N+ IRKI+  G +TTIAG G    G   GPA  A  
Sbjct: 596 AAQLDGPGSTARDKAGNIYIGDAKNNRIRKISPAGIITTIAGTGDAGYGGDGGPATAAKI 655

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
           ++  ++T  P   ++ +SD+ N  IR+I+
Sbjct: 656 NSAEKVTTGPD-GSVYLSDYENHRIRKIS 683



 Score = 40.4 bits (93), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 37/89 (41%), Positives = 49/89 (55%), Gaps = 4/89 (4%)

Query: 101 ARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
           AR + P S  V   G  +Y+AD  NH IR++   GV TTIAG G +  G   GPA  A F
Sbjct: 765 ARLNVP-SLTVGPDGRTLYLADYRNHRIRRVDPNGVITTIAGTGGEGSGGDGGPATAAQF 823

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
            N   +  V    AL I+D+GN  +R+I+
Sbjct: 824 KNPSSVA-VDGSGALYIADNGNDRVRRID 851



 Score = 37.4 bits (85), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 26/41 (63%), Gaps = 1/41 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIA 139
           +A+F  P S AVD  G +Y+AD  N  +R+I  N  +TT+A
Sbjct: 820 AAQFKNPSSVAVDGSGALYIADNGNDRVRRIDPNGTITTVA 860


>gi|428214156|ref|YP_007087300.1| thiol-disulfide isomerase-like thioredoxin [Oscillatoria acuminata
           PCC 6304]
 gi|428002537|gb|AFY83380.1| thiol-disulfide isomerase-like thioredoxin [Oscillatoria acuminata
           PCC 6304]
          Length = 506

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 51/109 (46%), Gaps = 9/109 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+    SS L + DS        +        V+ + G+G+ G +DG    A+F  P
Sbjct: 178 PGKVLADAASSRLFIADSGHHRIIVTTL----NGTVQSIIGNGIPGLTDGSFSEAQFFGP 233

Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGP 152
           +  + D     +YVAD  NH IR+I  T   V T+AG G  ++ R  GP
Sbjct: 234 QGMSWDANSQRLYVADTENHAIRQIDFTTEQVQTLAGTG--EQNRTQGP 280



 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 30/48 (62%), Gaps = 1/48 (2%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           K + GDG  G  DG+  ++RF +P   ++   G +Y+AD +NH IR++
Sbjct: 437 KTVLGDGTPGLLDGKNTTSRFFEPGGLSI-ASGILYIADTNNHRIRRV 483


>gi|12323029|gb|AAG51506.1|AC058785_9 hypothetical protein [Arabidopsis thaliana]
          Length = 1041

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 65/140 (46%), Gaps = 16/140 (11%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLG 134
           E + + ++   G +G+ DG    A F++P+  A + K N +YVAD  NH +R+I   N  
Sbjct: 597 EGNFIVQIGSSGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREIDFVNER 656

Query: 135 VTTIAG---------GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           V T+AG         GG K   +A    QN      +E + +  I  +     GN   R 
Sbjct: 657 VQTLAGNGTKGSDYQGGRKGTKQASDLIQNIGQHQIWEYSVLDGITRVF---SGNGYERN 713

Query: 186 IN-LKPEDCSKSSQSGSALG 204
           +N   P+  S +  SG +LG
Sbjct: 714 LNGSTPQTTSFAQPSGISLG 733



 Score = 40.8 bits (94), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 38/79 (48%), Gaps = 2/79 (2%)

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           G IY+ D  NH I+K+  +    +   G+ K G  DG  + A  S    L    +   L 
Sbjct: 800 GQIYLTDSYNHKIKKLDPVTKRVVTLAGTGKAGFKDGKVKGAQLSEPAGLAITEN-GRLF 858

Query: 175 ISDHGNQLIRQINL-KPED 192
           ++D  N LIR I+L K ED
Sbjct: 859 VADTNNSLIRYIDLNKGED 877



 Score = 40.4 bits (93), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 3/55 (5%)

Query: 79  ESVVKR---LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + V KR   LAG G  G+ DG+   A+  +P   A+   G ++VAD +N +IR I
Sbjct: 816 DPVTKRVVTLAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI 870


>gi|428172619|gb|EKX41527.1| hypothetical protein GUITHDRAFT_153825 [Guillardia theta CCMP2712]
          Length = 285

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 65/144 (45%), Gaps = 14/144 (9%)

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMK 114
            ++ + D+   A   ++        V  +AG   G  G+ DGE  SARF+ P   AV  +
Sbjct: 133 ENIFIADTYNHAIRHITRDEDRNWTVTLIAGSRSGQSGFEDGEGESARFNCPTGLAVVNE 192

Query: 115 GNIYVADKSNHVIRKITNLG----VTTIAGGGSK-KEGR-------ADGPAQNASFSNDF 162
             I ++D SN  IR +  +     VTT+ G  SK  EGR        DGP   A  +   
Sbjct: 193 QEILISDFSNSAIRLLRRVRGDWLVTTVVGKTSKDSEGRMLPESGYVDGPVDQARLNRPH 252

Query: 163 ELTFVPHICALLISDHGNQLIRQI 186
            + +     ++LI+D  N  IR++
Sbjct: 253 GVAWDEASRSVLIADCMNHRIRRV 276



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 63/133 (47%), Gaps = 18/133 (13%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAV-DMKG--NIYVADKSNHVIRK--ITN-LG--VT 136
           +AG G +G+ DG  G  +F  P      DM G  ++ VAD  N+ IRK  ITN +G  V 
Sbjct: 39  IAGHGAKGHVDGPVGECKFCYPHDVLFQDMDGAEHLLVADFDNNCIRKVFITNGVGEHVV 98

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED---- 192
           T AG    K G  DG  + A F++   ++       + I+D  N  IR I  + ED    
Sbjct: 99  TWAGSNLSKPGLRDGSIKEALFNHPGGISADQSGENIFIADTYNHAIRHIT-RDEDRNWT 157

Query: 193 -----CSKSSQSG 200
                 S+S QSG
Sbjct: 158 VTLIAGSRSGQSG 170


>gi|198421727|ref|XP_002130106.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 454

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/146 (24%), Positives = 62/146 (42%), Gaps = 3/146 (2%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP  +     +  L + D+   A   ++   +   V     G  ++G+ DG    A +  
Sbjct: 176 NPQQLEYDDTNDRLFIADTDNHAIRIINVDSATPKVTTLTGGPRMEGFKDGSFAEAMWRH 235

Query: 106 PKSFAVDMKGNI-YVADKSNHVIR--KITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
           P   A D++ ++ YV+D  NH +R   +    VTTIAGGG   +          + +   
Sbjct: 236 PTGMAYDVRKDVLYVSDHYNHAVRMLDLKKKTVTTIAGGGEPGDVNQRIFGHGLAMNYPE 295

Query: 163 ELTFVPHICALLISDHGNQLIRQINL 188
            +   P    L +++ GN  IR I+L
Sbjct: 296 GIAIDPEHEVLYVAEFGNNCIRMISL 321



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 62/147 (42%), Gaps = 6/147 (4%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNI 117
           + + DS  +A   +    +E +      GD  + + DG   +A  + P+    D     +
Sbjct: 132 MYIADSGNNAIRLMDLGTNEVTTFAGTNGD--KAFKDGSLLTAGLNNPQQLEYDDTNDRL 189

Query: 118 YVADKSNHVIRKITNLGVT---TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           ++AD  NH IR I     T   T   GG + EG  DG    A + +   + +      L 
Sbjct: 190 FIADTDNHAIRIINVDSATPKVTTLTGGPRMEGFKDGSFAEAMWRHPTGMAYDVRKDVLY 249

Query: 175 ISDHGNQLIRQINLKPEDCSKSSQSGS 201
           +SDH N  +R ++LK +  +  +  G 
Sbjct: 250 VSDHYNHAVRMLDLKKKTVTTIAGGGE 276


>gi|317419409|emb|CBN81446.1| NHL repeat-containing protein 2 [Dicentrarchus labrax]
          Length = 719

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 55/176 (31%), Positives = 78/176 (44%), Gaps = 32/176 (18%)

Query: 16  SLLFSLVSSG-----LLLEDG----YTVTTVIDGHQLEIN------PHSVIDRPGSSDLI 60
           +LLFS+V  G     +L  D     Y    ++  H ++I       P S +  PG    +
Sbjct: 168 NLLFSIVGEGHRDRLMLFTDCALRYYGEQGLLKTHAVKIKLYRDSLPTSFLSFPGK---V 224

Query: 61  VLDSSRSAFYTLS------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +DSS                LS    +  + G    G  DG+   A F+ P+  A+  K
Sbjct: 225 AIDSSEKRLAIADTGHHRILLLSSTGQLLHVIGGPESGRRDGDLSEASFNSPQGVAI--K 282

Query: 115 GN-IYVADKSNHVIRKITNLG--VTTIAG---GGSKKEGRADGPAQNASFSNDFEL 164
           G+ +YVAD  NH+IRKI  L   V+T+AG    G+ K+G A GP Q  S   D  L
Sbjct: 283 GDTVYVADTENHLIRKIDLLEGRVSTLAGVGIQGTDKDGGAMGPQQPISSPWDVTL 338


>gi|348681654|gb|EGZ21470.1| hypothetical protein PHYSODRAFT_329412 [Phytophthora sojae]
          Length = 251

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 52/162 (32%), Positives = 79/162 (48%), Gaps = 19/162 (11%)

Query: 12  ITLLSLLFSLVSSGLLLE--DGYTVTTVIDGHQLEINPHSVIDRPGSSDL------IVLD 63
           IT+ SL  SLV  G+ ++  +   V+ + +   ++I P   +    +S +      I +D
Sbjct: 61  ITIASLADSLVLQGITVDSSENVYVSRLTEFDVVKIAPDGSVSIFANSSMFTLAMGIAID 120

Query: 64  SSRSAFYT-----LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           SS + + T     L F L+ E  V  +AG  VQG+ +     ARF  P +  +   GN+Y
Sbjct: 121 SSDNIYITDQHRILKFTLAGEMGV--VAGR-VQGFLNALGELARFSTPWALTIGSDGNLY 177

Query: 119 VADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASF 158
           V D  N+ IRK+  T   VTT A G     G  DG A +A+F
Sbjct: 178 VVDSDNNCIRKVDLTTREVTTYA-GICLTSGTTDGLATDATF 218


>gi|168698189|ref|ZP_02730466.1| NHL repeat containing protein [Gemmata obscuriglobus UQM 2246]
          Length = 358

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 2/103 (1%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASF 158
           A F+   S AV   G +Y+AD  N+ +R        V   AG G+K      GPA++A F
Sbjct: 99  ATFNGMHSLAVGPGGTVYLADTWNNRVRVFDPEAGTVKAFAGTGTKGFSGDGGPAKDAKF 158

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
              F + F P    L ++D  N+ IR+I++K +  + ++ +G 
Sbjct: 159 GGVFCVAFDPAQKNLYVTDLDNRRIRKIDMKTQTVTTAAGNGE 201



 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 65/148 (43%), Gaps = 9/148 (6%)

Query: 42  QLEINPHS-VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS 100
           Q  ++P +  +D+ G  +L VL+    A   +      +  ++ +AG G  G   G+   
Sbjct: 214 QPLVDPRAHAVDKDG--NLWVLERGGHALRVVD----PKGTIRTVAGTGKAGRGTGKALE 267

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNH-VIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           A  + PK   VD  G + +AD  NH V+R     G +T +AG G K      G    A F
Sbjct: 268 AAMNGPKHLCVDKDGTVLIADTENHRVVRFDPKAGTLTAVAGTGKKGNSLGGGDPLKAEF 327

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
           +    +T  P    + ISD  N  + +I
Sbjct: 328 NQPHGVTVHPKTGDIYISDANNGRVLKI 355



 Score = 40.8 bits (94), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 71/159 (44%), Gaps = 25/159 (15%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPGSARFD---KPKSF 109
           P   +L V D        +     +   V   AG+G +G   DGE  +AR      P++ 
Sbjct: 168 PAQKNLYVTDLDNRRIRKIDM---KTQTVTTAAGNGEKGVPKDGE--NARTQPLVDPRAH 222

Query: 110 AVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
           AVD  GN++V ++  H +R +   G + T+AG G  K GR  G A  A+ +         
Sbjct: 223 AVDKDGNLWVLERGGHALRVVDPKGTIRTVAGTG--KAGRGTGKALEAAMNGP------K 274

Query: 169 HIC-----ALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           H+C      +LI+D  N   R +   P+  + ++ +G+ 
Sbjct: 275 HLCVDKDGTVLIADTENH--RVVRFDPKAGTLTAVAGTG 311



 Score = 37.7 bits (86), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 31/72 (43%), Positives = 36/72 (50%), Gaps = 5/72 (6%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVD-MKGNIYVADKSNHVIRKI--TN 132
           E   VK  AG G +G+S DG P   A+F      A D  + N+YV D  N  IRKI    
Sbjct: 131 EAGTVKAFAGTGTKGFSGDGGPAKDAKFGGVFCVAFDPAQKNLYVTDLDNRRIRKIDMKT 190

Query: 133 LGVTTIAGGGSK 144
             VTT AG G K
Sbjct: 191 QTVTTAAGNGEK 202


>gi|170066897|ref|XP_001868267.1| NHL repeat containing 2 [Culex quinquefasciatus]
 gi|167863075|gb|EDS26458.1| NHL repeat containing 2 [Culex quinquefasciatus]
          Length = 734

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLG 134
           SE +V+ R+ G    G+ DG+   ARF+ P+  A      I+VAD  NH IR+I   +  
Sbjct: 260 SEGTVLHRIGGKK-SGFVDGDFRKARFNAPQGLAFQNDDVIFVADNENHAIRRIDLKSKQ 318

Query: 135 VTTIAGGGSKKEGRADG 151
           VTT+AG G +   R  G
Sbjct: 319 VTTVAGSGQQGCDRIGG 335


>gi|430743901|ref|YP_007203030.1| thiol-disulfide isomerase-like thioredoxin [Singulisphaera
           acidiphila DSM 18658]
 gi|430015621|gb|AGA27335.1| thiol-disulfide isomerase-like thioredoxin [Singulisphaera
           acidiphila DSM 18658]
          Length = 696

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 67/149 (44%), Gaps = 9/149 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+    S  L + D++ +          +  VV    G G  G  DG+   A F++P
Sbjct: 229 PGKVLADEKSKQLFIADTAHNRIVLTDLDGRKSVVV----GSGGIGMVDGDYAKAEFNRP 284

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRAD-GPAQNASFSNDFE 163
           +   + +   +YVAD  NH IR I      V+T+AG G +   R+  G  +  S S+ ++
Sbjct: 285 QGLCL-VDDTLYVADTENHAIRAIHLKTKQVSTVAGTGQQGHRRSGAGAGKATSLSSPWD 343

Query: 164 LTFVPHICALLISDHGNQLIRQINLKPED 192
           L  +P    L I+  G   I + ++ P D
Sbjct: 344 LVLIPGTKTLAIAMAGTHQIWRYDI-PSD 371


>gi|153864226|ref|ZP_01997193.1| NHL repeat protein [Beggiatoa sp. SS]
 gi|152146280|gb|EDN72808.1| NHL repeat protein [Beggiatoa sp. SS]
          Length = 313

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/107 (34%), Positives = 53/107 (49%), Gaps = 5/107 (4%)

Query: 84  RLAGD-GVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           R+AG  GV G+S   G    A+ + PK   V   G +Y+ DK NH IRK+   G +TTI 
Sbjct: 45  RIAGTLGVAGFSGDGGRAVEAKLNNPKRIVVADDGVVYITDKGNHRIRKVDTSGIITTIV 104

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G G    G   G A  A  ++  ++  +    +L I D  N  IR++
Sbjct: 105 GNGVASFGGDGGEAPGAQLNSPSDIA-IDDSGSLYIIDTNNHRIRKV 150



 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 50/96 (52%), Gaps = 8/96 (8%)

Query: 40  GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGE 97
           G QL       ID  GS  L ++D++      +       +++  LAG GV G+S  DG+
Sbjct: 120 GAQLNSPSDIAIDDSGS--LYIIDTNNHRIRKVE---KATNIITTLAG-GVSGFSGDDGQ 173

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
             SA+F+ P   A+  +G++ +AD  NH +R IT +
Sbjct: 174 ALSAQFNTPDGIAISDEGHLLIADSENHRVRMITGV 209



 Score = 41.2 bits (95), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 22/94 (23%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT-----------NLGVTTIAG-GGSKKEGRADGPA 153
           P   A+D  GN+Y+++  NHVIRK+             LGV   +G GG   E + + P 
Sbjct: 12  PNGVAIDKAGNLYISEAGNHVIRKVDANTQSIFRIAGTLGVAGFSGDGGRAVEAKLNNPK 71

Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +            V     + I+D GN  IR+++
Sbjct: 72  R----------IVVADDGVVYITDKGNHRIRKVD 95


>gi|395226792|ref|ZP_10405227.1| Cadherin domain-containing protein, partial [Thiovulum sp. ES]
 gi|394444937|gb|EJF05961.1| Cadherin domain-containing protein, partial [Thiovulum sp. ES]
          Length = 1326

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 41/68 (60%), Gaps = 2/68 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG 134
           ++++  + ++ G G  G++DG+   A+   P  F  D   NI+VAD +N  +RKIT N  
Sbjct: 205 INKDGEISKIVG-GSYGFADGDSSIAQLKSPSGFDFDKNWNIFVADTNNGRVRKITPNGD 263

Query: 135 VTTIAGGG 142
           VTTIAG G
Sbjct: 264 VTTIAGDG 271


>gi|428171420|gb|EKX40337.1| hypothetical protein GUITHDRAFT_75685, partial [Guillardia theta
           CCMP2712]
          Length = 284

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 42/77 (54%), Gaps = 5/77 (6%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGG 141
           LAGDG  G  +G   +A F  P+  A   K + IYV+D  N +IRKI      VTT AG 
Sbjct: 181 LAGDGNVGSKNGRGTAATFYTPQGIAYSSKQSAIYVSDYQNSLIRKIEVNTREVTTAAG- 239

Query: 142 GSKKEGRADGPAQNASF 158
            S   G ADGPA +A F
Sbjct: 240 -SLSVGCADGPATSAQF 255


>gi|288923318|ref|ZP_06417452.1| serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288345334|gb|EFC79729.1| serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 849

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 37/115 (32%), Positives = 60/115 (52%), Gaps = 4/115 (3%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +  + ++  LAG DG   + DG P S A    P + A+D  G +Y+AD SN+ IR+I   
Sbjct: 678 VGSDGIITTLAGQDGEGSFGDGGPASKALLAFPLAVALDRFGRLYIADTSNNRIRRIGLD 737

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G + T+AG G        GPA  A+  +   +T V     + I+D  N+ +R+++
Sbjct: 738 GRIETVAGNGRPGLSGDGGPATKATLRSPRGVT-VDAAGTIYITDRTNRRVRRVD 791



 Score = 47.0 bits (110), Expect = 0.008,   Method: Composition-based stats.
 Identities = 42/141 (29%), Positives = 69/141 (48%), Gaps = 10/141 (7%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSAR-FDKPKS 108
           +DR G  DL++ D+    F      +    ++  +AG+G   +S DG P +A     P  
Sbjct: 603 LDRDG--DLLIADT----FNQRIRRVDAAGIITTVAGNGEHAFSGDGGPATAAALWYPGG 656

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
            AVD  G +++AD +N+ IR++ + G +TT+AG   +      GPA  A  +    +  +
Sbjct: 657 VAVDGNGTVFIADTANNRIRRVGSDGIITTLAGQDGEGSFGDGGPASKALLAFPLAVA-L 715

Query: 168 PHICALLISDHGNQLIRQINL 188
                L I+D  N  IR+I L
Sbjct: 716 DRFGRLYIADTSNNRIRRIGL 736



 Score = 41.6 bits (96), Expect = 0.36,   Method: Composition-based stats.
 Identities = 32/90 (35%), Positives = 48/90 (53%), Gaps = 2/90 (2%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNAS 157
            +A  + P     D  GN+YVAD  N+ +R+I+  G + T+AG G K  G   GPA +A 
Sbjct: 535 ATASVNAPFGLVTDRAGNVYVADSGNNRVRRISPDGTIVTVAGTGVKGFGGDGGPAVDAQ 594

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQIN 187
                 +T +     LLI+D  NQ IR+++
Sbjct: 595 LYQPSAVT-LDRDGDLLIADTFNQRIRRVD 623


>gi|290971380|ref|XP_002668485.1| predicted protein [Naegleria gruberi]
 gi|284081915|gb|EFC35741.1| predicted protein [Naegleria gruberi]
          Length = 277

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 8/120 (6%)

Query: 39  DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP 98
           D +  +IN H +  R  SSD ++   + +   TL+ P+S  +  +    +G     DG  
Sbjct: 133 DVYICDINNHRI--RKVSSDGVISTIAGTGRTTLN-PISPNTYRQEGGYNG-----DGLA 184

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
            +   + PK   VD KG++Y AD  N +IRK++N  +TTIAG          G A N +F
Sbjct: 185 TATDLNYPKYAQVDSKGDVYFADTYNGLIRKVSNGYITTIAGSKQNSTVNTFGMALNTTF 244


>gi|345482264|ref|XP_001607897.2| PREDICTED: NHL repeat-containing protein 2-like [Nasonia
           vitripennis]
          Length = 768

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 42/73 (57%), Gaps = 1/73 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           + ++  V+ + G    G+ DG+  +ARF+ P+   V +K +++VAD  NH IRKI     
Sbjct: 329 MGKDGKVEHVVGGYSPGFKDGDFETARFNAPQGVCV-LKDDVFVADNENHAIRKINLKTK 387

Query: 136 TTIAGGGSKKEGR 148
           T I+  G+  +GR
Sbjct: 388 TVISIAGTGSQGR 400


>gi|170759724|ref|YP_001788082.1| NHL repeat-containing protein [Clostridium botulinum A3 str. Loch
           Maree]
 gi|169406713|gb|ACA55124.1| NHL repeat protein [Clostridium botulinum A3 str. Loch Maree]
          Length = 464

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 44/171 (25%), Positives = 75/171 (43%), Gaps = 22/171 (12%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEESV 81
             ++LE+ Y     +   ++ IN + +I     G + +++LD +     T+         
Sbjct: 160 ENIILENKYEKEEYMYPSKISINENGLIAFSEKGKNRIVILDQNLEKIKTI--------- 210

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIA 139
                G    G  DG+    +F  P+     ++  IYVAD  NH IR+  +    V TIA
Sbjct: 211 -----GSSEYGLKDGDFKETQFKAPEGLRF-IENKIYVADTENHSIRECDLEKEIVKTIA 264

Query: 140 GGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G G K+    A G A N S ++ ++L  +     + I+  GN  I + N+K
Sbjct: 265 GNGQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMK 313


>gi|30696124|ref|NP_564718.2| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
 gi|17529226|gb|AAL38840.1| unknown protein [Arabidopsis thaliana]
 gi|332195279|gb|AEE33400.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
          Length = 1055

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLG 134
           E + + ++   G +G+ DG    A F++P+  A + K N +YVAD  NH +R+I   N  
Sbjct: 597 EGNFIVQIGSSGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREIDFVNER 656

Query: 135 VTTIAGGGSK 144
           V T+AG G+K
Sbjct: 657 VQTLAGNGTK 666



 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 38/79 (48%), Gaps = 2/79 (2%)

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           G IY+ D  NH I+K+  +    +   G+ K G  DG  + A  S    L    +   L 
Sbjct: 814 GQIYLTDSYNHKIKKLDPVTKRVVTLAGTGKAGFKDGKVKGAQLSEPAGLAITEN-GRLF 872

Query: 175 ISDHGNQLIRQINL-KPED 192
           ++D  N LIR I+L K ED
Sbjct: 873 VADTNNSLIRYIDLNKGED 891



 Score = 40.4 bits (93), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 3/55 (5%)

Query: 79  ESVVKR---LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + V KR   LAG G  G+ DG+   A+  +P   A+   G ++VAD +N +IR I
Sbjct: 830 DPVTKRVVTLAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI 884


>gi|290975356|ref|XP_002670409.1| predicted protein [Naegleria gruberi]
 gi|284083967|gb|EFC37665.1| predicted protein [Naegleria gruberi]
          Length = 2200

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 37/111 (33%), Positives = 50/111 (45%), Gaps = 8/111 (7%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV- 135
           +E S     +GDG      G    A+ + P   ++   G IY+AD  NH IRKI + GV 
Sbjct: 158 NEHSDCDGFSGDG------GLASRAKLNSPNGLSISQNGEIYIADTLNHRIRKINSYGVI 211

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +TIAG G    G  DG     S  N      +     + I+D  N  IR+I
Sbjct: 212 STIAGTGRASFG-GDGGFAVLSPLNSPNGVHISQNGEIYIADTLNHRIRKI 261



 Score = 46.6 bits (109), Expect = 0.012,   Method: Composition-based stats.
 Identities = 36/110 (32%), Positives = 53/110 (48%), Gaps = 6/110 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
           V+  +AG G   +  G+ G A     + P    +   G IY+AD  NH IRKI  N  +T
Sbjct: 210 VISTIAGTGRASFG-GDGGFAVLSPLNSPNGVHISQNGEIYIADTLNHRIRKIFVNGLIT 268

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           T+AG G    G   G A N+  ++ + +  V     + I+D  N  IR+I
Sbjct: 269 TVAGTGRASFGGDGGLAINSQLNSPYGV-HVSQSGEIYIADTLNHRIRKI 317



 Score = 42.4 bits (98), Expect = 0.21,   Method: Composition-based stats.
 Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 6/111 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
           ++  +AG G   +  G+ G   +++ + P    V   G IY+AD  NH IRKI  N  +T
Sbjct: 266 LITTVAGTGRASFG-GDGGLAINSQLNSPYGVHVSQSGEIYIADTLNHRIRKIFVNGTIT 324

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICALLISDHGNQLIRQI 186
           TIAG  S      DG    AS  N  +  FV P+   +LI+D  N  IR+I
Sbjct: 325 TIAGSSSDGSFGGDGGLSIASRLNSPKGVFVSPNNNEILIADTSNNRIRKI 375


>gi|290978083|ref|XP_002671766.1| predicted protein [Naegleria gruberi]
 gi|284085337|gb|EFC39022.1| predicted protein [Naegleria gruberi]
          Length = 258

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 57/108 (52%), Gaps = 5/108 (4%)

Query: 82  VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +  +AG+G  G+S   G   +A+ + P    V +K  +Y+AD  NH IRK++N G + TI
Sbjct: 38  IMTVAGNGELGFSSDGGLAINAKLNYPNGVFV-VKEEVYIADYHNHRIRKVSNNGIIETI 96

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG G       +GPA NA   N     FV     + ISD+ N  +R++
Sbjct: 97  AGNGEGGFEGDNGPAINAKL-NYPTSVFVSENGEVYISDYLNNRVRKV 143



 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 57/115 (49%), Gaps = 7/115 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-T 131
           +S   +++ +AG+G  G+ +G+ G   +A+ + P S  V   G +Y++D  N+ +RK+  
Sbjct: 87  VSNNGIIETIAGNGEGGF-EGDNGPAINAKLNYPTSVFVSENGEVYISDYLNNRVRKVLQ 145

Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           N  + TIAG G        G A NA  +    +        + I+D  N  IR++
Sbjct: 146 NGNIVTIAGNGKLGCSGDGGLAINAELNCPMNVFVFNE--EVYITDSANHRIRKV 198



 Score = 40.0 bits (92), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S+  +++ +AG+G +G+S  DG    A+ + P S  V+  G IY+ D +N  IRK+
Sbjct: 198 VSKSGIIETIAGNGNEGFSGDDGLATQAQLNCPMSTFVNSNGEIYITDSNNFRIRKV 254



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 57/111 (51%), Gaps = 8/111 (7%)

Query: 85  LAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           +AG+G  G S G+ G   +A  + P +  V     +Y+ D +NH IRK++  G + TIAG
Sbjct: 152 IAGNGKLGCS-GDGGLAINAELNCPMNVFV-FNEEVYITDSANHRIRKVSKSGIIETIAG 209

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
            G++     DG A  A  +     TFV     + I+D  N  IR++ LK E
Sbjct: 210 NGNEGFSGDDGLATQAQLNCPMS-TFVNSNGEIYITDSNNFRIRKV-LKVE 258


>gi|406901359|gb|EKD44040.1| NHL repeat containing protein, partial [uncultured bacterium]
          Length = 667

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 68/135 (50%), Gaps = 7/135 (5%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGN-IY 118
           V D+       ++   S++   K +AG   + G+ DG   +A F+ P +  +D   N +Y
Sbjct: 531 VADTGNEVIRRVTLTGSKQGQTKLIAGSPTESGFKDGTKTAAEFNVPIALTIDSADNYLY 590

Query: 119 VADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
           VAD+ NH IRK  I++  VTT+ G  S   G  DG  ++A  +   E+ +  +   +  S
Sbjct: 591 VADRDNHAIRKVRISDGKVTTVTGNPS-TPGYLDGRLEDAYLNYPVEVYY--NRGNIYFS 647

Query: 177 DHGNQLIRQINLKPE 191
           + G Q +R +++  +
Sbjct: 648 ESGTQRVRVVDMADD 662



 Score = 44.3 bits (103), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 42/98 (42%), Gaps = 5/98 (5%)

Query: 96  GEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT----NLGVTTIAGGGSKKEGRAD 150
           G    A F +P    VD  G   YVAD  N VIR++T      G T +  G   + G  D
Sbjct: 507 GRKNCAYFSRPGGIVVDGSGKYAYVADTGNEVIRRVTLTGSKQGQTKLIAGSPTESGFKD 566

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           G    A F+    LT       L ++D  N  IR++ +
Sbjct: 567 GTKTAAEFNVPIALTIDSADNYLYVADRDNHAIRKVRI 604



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 46/93 (49%), Gaps = 5/93 (5%)

Query: 60  IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           I+ D + + + T +F      +    +V  + G G  G  +G   +++F  P + AVD  
Sbjct: 51  IIADGAGNFYVTDTFNGVIRKIDANGIVSTVVGQGGYGDVNGSATTSKFAHPSAVAVDDS 110

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
           GN+Y+AD  N  I+K +   VTT+     + EG
Sbjct: 111 GNVYIADAGNGKIKKFSGGRVTTLKSDLDRPEG 143



 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 39/84 (46%), Gaps = 12/84 (14%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           A FD P+    D  GN YV D  N VIRKI   G V+T+ G G    G  +G A  + F+
Sbjct: 43  AYFDFPEDIIADGAGNFYVTDTFNGVIRKIDANGIVSTVVGQGG--YGDVNGSATTSKFA 100

Query: 160 NDFELTFVPHICALLISDHGNQLI 183
                    H  A+ + D GN  I
Sbjct: 101 ---------HPSAVAVDDSGNVYI 115


>gi|290976488|ref|XP_002670972.1| predicted protein [Naegleria gruberi]
 gi|284084536|gb|EFC38228.1| predicted protein [Naegleria gruberi]
          Length = 704

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 60/113 (53%), Gaps = 5/113 (4%)

Query: 81  VVKRLAGD-GVQGYSDGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
           ++  +AG  G  G++D     SA+ + P++ A+   G+IY+AD  N+ IRKIT     ++
Sbjct: 211 IITTIAGTAGSSGFTDNVLSTSAKLNGPQAVAIMSNGDIYIADTQNNRIRKITAATGIIS 270

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           TI G G+      DG A  ++  N     F+     L I+D  N  +R+I+L+
Sbjct: 271 TICGTGNGGIA-GDGSAATSAMINSPRDLFLGLQNDLYIADSWNHRLRRIDLR 322


>gi|340370782|ref|XP_003383925.1| PREDICTED: NHL repeat-containing protein 2-like [Amphimedon
           queenslandica]
          Length = 730

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/90 (35%), Positives = 47/90 (52%), Gaps = 6/90 (6%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
           +VK++ G G  G+ DG    A F+ P+   +  +  +YVAD  NH+IRKI  ++  V T+
Sbjct: 259 LVKQVYGSGSPGFKDGRGKEAEFNCPQGLVI-CEECVYVADTENHLIRKISLSDDFVLTV 317

Query: 139 AGGGSK---KEGRADGPAQNASFSNDFELT 165
           AG G +   KEG   G  Q  S   D  + 
Sbjct: 318 AGTGYQGNDKEGGKVGKEQEISSPWDLAIN 347



 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 47/107 (43%), Gaps = 5/107 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT--NLGVTTIAGGGSKKEGR 148
           G  DG+    +   P   A D    + Y+AD  NH I+ +       +T+AG GS   G 
Sbjct: 459 GDVDGKGRDVKLQHPMGVAWDDTNQLLYIADSFNHKIKVVNPKTKVCSTLAGTGSP--GL 516

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
            DG  + A FS    L        L ++D  N  IR ++LK +  S+
Sbjct: 517 VDGSFEVAQFSEPAGLCMSQEGDTLYVADTNNHAIRILDLKEKKVSQ 563


>gi|194365180|ref|YP_002027790.1| NHL repeat containing protein [Stenotrophomonas maltophilia R551-3]
 gi|194347984|gb|ACF51107.1| NHL repeat containing protein [Stenotrophomonas maltophilia R551-3]
          Length = 470

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 4/118 (3%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S+ L + S+  R AG G     DG    A F +P + AV ++  +YVAD     IR +  
Sbjct: 301 SYHLGQRSLQWR-AGSGSIDERDGSGHLAAFAQPTALAV-VQQVLYVADALGSSIRALQL 358

Query: 133 LG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            G  V T+ G G  + G  DGP   AS      +   P    L I+D GN  +R + L
Sbjct: 359 RGDLVQTLVGQGPWRFGNEDGPRTQASLQFPQAMALSPDAPLLWIADTGNGRLRTLRL 416



 Score = 40.4 bits (93), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 48/103 (46%), Gaps = 12/103 (11%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAG 140
           + R  G G   + DG    A F +P++  ++ + ++YVAD  NH +R+I  L G+     
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDSLYVADTGNHAVRRINLLTGIVDTLC 255

Query: 141 GGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
           G  +     +GP AQ    S D       H   L I+D  NQL
Sbjct: 256 GNGRPGAPVEGPVAQARQVSLD-------HPVGLAIAD--NQL 289


>gi|116620267|ref|YP_822423.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116223429|gb|ABJ82138.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 380

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 56/106 (52%), Gaps = 6/106 (5%)

Query: 85  LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
           +AG G  GYS DG P   A+   PK  A    G++Y+AD  +H IR++   +  +TT+AG
Sbjct: 276 VAGTGESGYSGDGGPAKLAKLSGPKGIAWAPDGSLYLADTESHTIRRVDLKSGVITTVAG 335

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G + +G  DG A+    S    + FV    A+ I+D  +  +R +
Sbjct: 336 TGKRGDG-PDGDARMCQLSRPHGI-FVSAAGAVFIADSESHRVRAL 379



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 61/116 (52%), Gaps = 5/116 (4%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNL 133
           + +V+   AG G +GYS DG P   A  ++P     D  GN+++A+  NHV+R++     
Sbjct: 98  KTNVISTAAGSGEKGYSGDGGPALGAALNEPYEVRFDRVGNMFIAEMQNHVVRRVDAKTR 157

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            ++T+AG G+   G   GPA  A       + F      LLI D GN  IR+++LK
Sbjct: 158 VISTVAGTGTAGFGGDGGPATAALLRQPHSIAFDGE-GRLLICDIGNHRIRRVDLK 212


>gi|407780651|ref|ZP_11127872.1| NHL repeat-containing protein [Oceanibaculum indicum P24]
 gi|407208878|gb|EKE78785.1| NHL repeat-containing protein [Oceanibaculum indicum P24]
          Length = 498

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 33/60 (55%), Gaps = 3/60 (5%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVTTIAGGGSK 144
           G G  G++ G    ARF  P+  A D  G IYVAD  NH IR+I      VTT+AG G +
Sbjct: 217 GSGEAGFASGPADRARFRDPQGLAADT-GAIYVADTGNHAIRRIDRQTGQVTTLAGNGRR 275


>gi|222053203|ref|YP_002535565.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
 gi|221562492|gb|ACM18464.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
          Length = 155

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 43/108 (39%), Positives = 55/108 (50%), Gaps = 10/108 (9%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTI 138
           V  LAG  GV G++DG   +A F +      D   N+YVAD SN+ IRK  I +  VTT+
Sbjct: 54  VTTLAGTAGVYGWADGIGTAALFGRISDITTDGA-NLYVAD-SNNTIRKIVIASGAVTTL 111

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AGG     G  DG   +ASF     +T      +L I+D  N  IR I
Sbjct: 112 AGG---STGSTDGVGVSASFGYPSGIT--TDGASLFITDTFNDTIRAI 154


>gi|17228508|ref|NP_485056.1| hypothetical protein alr1013 [Nostoc sp. PCC 7120]
 gi|17130359|dbj|BAB72970.1| alr1013 [Nostoc sp. PCC 7120]
          Length = 503

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/101 (34%), Positives = 46/101 (45%), Gaps = 9/101 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+  P  S L V DS         F       +  L G+G  G +DG    A+F  P
Sbjct: 175 PGKVLATP--SGLFVADSGHHRIVVSDF----NGEILHLIGNGKSGLTDGNFQEAQFSAP 228

Query: 107 KSFAVDMKGNI-YVADKSNHVIRK--ITNLGVTTIAGGGSK 144
           +  A DM+  I Y+AD  NH +R+  I    V TIAG G +
Sbjct: 229 QGMAFDMENQILYLADTENHALRRVDINQQTVETIAGTGEQ 269



 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 3/62 (4%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
           + + GDG+ G  +G+  + RF +P   +  +   +Y++D +NHVIR++    L VTT+  
Sbjct: 431 QTVLGDGLAGLQNGQGKNTRFFEPSGLSA-IDSYLYISDTNNHVIRRVDLRTLEVTTMQF 489

Query: 141 GG 142
            G
Sbjct: 490 NG 491


>gi|428225769|ref|YP_007109866.1| NHL repeat containing protein [Geitlerinema sp. PCC 7407]
 gi|427985670|gb|AFY66814.1| NHL repeat containing protein [Geitlerinema sp. PCC 7407]
          Length = 498

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGV 135
           E  V+   G G  GY DG   +A+F  P+    D  G+ +YVAD  NH+IR +   +  V
Sbjct: 197 EGKVRATVGSGTPGYQDGPWETAQFRAPQGLTYDPAGDRLYVADTGNHLIRCVHGRSRTV 256

Query: 136 TTIAGGGSKKE 146
           TT+AG G +  
Sbjct: 257 TTLAGTGQQNR 267



 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 53/130 (40%), Gaps = 15/130 (11%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT---NLGV 135
           E  V  L G G +   DG    A F +P   A+D    +YVAD  +  +R I+      V
Sbjct: 311 EDAVGTLLGTGAEACIDGTASEATFAQPSGLAIDGD-TLYVADSESSSVRAISLADPPSV 369

Query: 136 TTIAGGG-----SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
            T+ G G       +EGR D     A   +   L   P    L I+D  N  I+++NL  
Sbjct: 370 QTLCGSGGLFDFGDREGRGD----RARLQHCLGLAHGP--GNLWIADTYNHKIKRLNLAE 423

Query: 191 EDCSKSSQSG 200
             C     SG
Sbjct: 424 GHCVNIVGSG 433



 Score = 40.8 bits (94), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 1/78 (1%)

Query: 110 AVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
           A+  + +++VAD  +H +  I  L     A  GS   G  DGP + A F     LT+ P 
Sbjct: 174 AIAQENHLFVADTGHHRL-VIATLEGKVRATVGSGTPGYQDGPWETAQFRAPQGLTYDPA 232

Query: 170 ICALLISDHGNQLIRQIN 187
              L ++D GN LIR ++
Sbjct: 233 GDRLYVADTGNHLIRCVH 250


>gi|296122042|ref|YP_003629820.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
 gi|296014382|gb|ADG67621.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
          Length = 425

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 63/143 (44%), Gaps = 10/143 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFD 104
           PH +   P + D+++ D        +     +  ++  LAG G  G+S DG P   A+ +
Sbjct: 170 PHELRFTP-AGDIVIADMRTHTIRKID---GKTGMISTLAGTGTAGFSGDGGPAEKAQLN 225

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
            P S  +D  G++ + D  NH +RK+   T L ++T  G G +K  +   P      +  
Sbjct: 226 MPHSIQIDPAGDLLICDTGNHRVRKVDMKTGL-ISTAYGTGERKPAKDGDPQVGTPLNGP 284

Query: 162 FELTFVPHICALLISDHGNQLIR 184
             + F P    +L    GN + R
Sbjct: 285 RSIDFTPEGDMILALREGNAVYR 307



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 60/118 (50%), Gaps = 8/118 (6%)

Query: 78  EESVVKRLAGD-GVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI---T 131
           E+     +AG  G +G++ DG P + A  ++P        G+I +AD   H IRKI   T
Sbjct: 139 EKQTATTIAGTPGKKGFAGDGGPATKALMNEPHELRFTPAGDIVIADMRTHTIRKIDGKT 198

Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            + ++T+AG G+       GPA+ A  +    +   P    LLI D GN  +R++++K
Sbjct: 199 GM-ISTLAGTGTAGFSGDGGPAEKAQLNMPHSIQIDP-AGDLLICDTGNHRVRKVDMK 254



 Score = 44.7 bits (104), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 64/156 (41%), Gaps = 22/156 (14%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK--------RLAGDGVQGYSDG 96
           +N    ID     D+I+     +A Y   FP  E  ++          L GDG+      
Sbjct: 281 LNGPRSIDFTPEGDMILALREGNAVY--RFPKGEAKLIHIAGVGGKPSLVGDGIDARK-- 336

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPA 153
               A    PK  AVD  G+IY+AD   H IR I   T L + T+ G G   +G  DG A
Sbjct: 337 ----AILGAPKGAAVDANGDIYLADTETHTIRVIRAKTGL-IETVIGDGKAGDG-PDGEA 390

Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           +     N     F+     LL+ D  N  +R + L+
Sbjct: 391 KTCRL-NRPHGVFITKEGLLLVGDSSNNKVRVLPLR 425



 Score = 43.9 bits (102), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 52/170 (30%), Positives = 78/170 (45%), Gaps = 18/170 (10%)

Query: 48  HSVIDRPGS-----SDLIVLDS-----SRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DG 96
            SVI RPG      S+L+ L +     S  AF  L+   + E  VK + G G  G++ D 
Sbjct: 44  RSVIMRPGQTSFGPSNLMRLAALCVLLSLFAFAELNVAHAGE--VKTIFGSGKDGFNGDQ 101

Query: 97  EPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRAD-GP 152
           +P       +P    +   G +Y  + + H+IR+  +     TTIAG   KK    D GP
Sbjct: 102 QPFLETHSGQPFGLVIGPDGALYFCEYTGHIIRRLDLEKQTATTIAGTPGKKGFAGDGGP 161

Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           A  A  +   EL F P    ++I+D     IR+I+ K    S  + +G+A
Sbjct: 162 ATKALMNEPHELRFTP-AGDIVIADMRTHTIRKIDGKTGMISTLAGTGTA 210


>gi|148380724|ref|YP_001255265.1| NHL repeat protein [Clostridium botulinum A str. ATCC 3502]
 gi|153933286|ref|YP_001385008.1| NHL repeat-containing protein [Clostridium botulinum A str. ATCC
           19397]
 gi|153935024|ref|YP_001388478.1| NHL repeat-containing protein [Clostridium botulinum A str. Hall]
 gi|148290208|emb|CAL84327.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
           3502]
 gi|152929330|gb|ABS34830.1| NHL repeat protein [Clostridium botulinum A str. ATCC 19397]
 gi|152930938|gb|ABS36437.1| NHL repeat protein [Clostridium botulinum A str. Hall]
          Length = 467

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 6/109 (5%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGG 141
           ++ G G  G  DG+   A+F  P+     ++  IYVAD  NH IR+  +    V TIAG 
Sbjct: 211 KIIGSGEYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHSIRECDLEKEIVKTIAGN 269

Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G K+    A G A N S ++ ++L  +     + I+  GN  I + N++
Sbjct: 270 GQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMR 316


>gi|387819032|ref|YP_005679379.1| hypothetical protein H04402_02846 [Clostridium botulinum H04402
           065]
 gi|322807076|emb|CBZ04650.1| gll0387 protein [Clostridium botulinum H04402 065]
          Length = 467

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 6/109 (5%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGG 141
           ++ G G  G  DG+   A+F  P+     ++  IYVAD  NH IR+  +    V TIAG 
Sbjct: 211 KIIGSGEYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHSIRECDLEKEIVKTIAGN 269

Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G K+    A G A N S ++ ++L  +     + I+  GN  I + N++
Sbjct: 270 GQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMR 316


>gi|453054414|gb|EMF01867.1| redoxin domain-containing protein [Streptomyces mobaraensis NBRC
           13819 = DSM 40847]
          Length = 608

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 64/124 (51%), Gaps = 11/124 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG    +V D++R     L+     E+VV+R+ G G +G+ D     ARF +P
Sbjct: 183 PGKALALPGGG-YLVSDTTRHQLVELA--ADGETVVRRI-GTGERGFGDS---PARFSEP 235

Query: 107 KSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSK-KEGRA-DGPAQNASFSNDF 162
           +  A+   G + VAD  NH +R        V T+AG G +  +G A +GPA+  + S+ +
Sbjct: 236 QGLALLPDGRVAVADTVNHALRVYDPETHEVGTLAGTGRQWWQGAATEGPAREVALSSPW 295

Query: 163 ELTF 166
           +L +
Sbjct: 296 DLAW 299


>gi|392945712|ref|ZP_10311354.1| protein kinase family protein,NHL repeat protein [Frankia sp. QA3]
 gi|392289006|gb|EIV95030.1| protein kinase family protein,NHL repeat protein [Frankia sp. QA3]
          Length = 770

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 48/96 (50%), Gaps = 2/96 (2%)

Query: 94  SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG-RADG 151
            DG P  +A   +P   A+   G++YVAD  N  +R+I   G  T+  G     G R DG
Sbjct: 604 GDGGPAINATLYQPSGLALGDDGSLYVADTGNETVRRIDPSGRITLVAGRPGTYGHRGDG 663

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            A  A+  +D     +    +L I+D GN +IR+++
Sbjct: 664 KAATAALLDDPRGLALGPDGSLYIADSGNDVIRRVD 699



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 58/114 (50%), Gaps = 11/114 (9%)

Query: 81  VVKRLAGDGVQGYSDG------EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           V+  +AG G   YSDG      +   A  D P + AVD  G +YVA+   + +R+I    
Sbjct: 533 VITTVAGAGPD-YSDGFTADVGDATKAHLDDPSAVAVDDDGVLYVAE--GYRVRRIAQGR 589

Query: 135 VTTIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +TT+AG  ++     D GPA NA+      L       +L ++D GN+ +R+I+
Sbjct: 590 ITTVAGKSTESGSAGDGGPAINATLYQPSGLALG-DDGSLYVADTGNETVRRID 642



 Score = 44.7 bits (104), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 46/90 (51%), Gaps = 3/90 (3%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGP-AQNAS 157
           +A  D P+  A+   G++Y+AD  N VIR++   G +TT+AG GS      DG  A    
Sbjct: 668 AALLDDPRGLALGPDGSLYIADSGNDVIRRVDRHGIITTVAGTGSYSADDRDGALATQTR 727

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQIN 187
            SN   +   P   A+ I+  G+  +R++ 
Sbjct: 728 LSNPAGVLVDP-TGAIYIACSGDGTVRRVG 756


>gi|397781307|ref|YP_006545780.1| Tripartite motif-containing protein 71 [Methanoculleus bourgensis
            MS2]
 gi|396939809|emb|CCJ37064.1| Tripartite motif-containing protein 71 AltName: Full=Lin-41 homolog
            [Methanoculleus bourgensis MS2]
          Length = 1996

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 34/56 (60%), Gaps = 3/56 (5%)

Query: 99   GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA--DGP 152
            G+ +FDKP   AVD  G+IYVAD +NH I+K  + G T +   GSK  G    +GP
Sbjct: 1394 GNGQFDKPWGIAVDAAGDIYVADYNNHRIQKFDSAG-TFVTAWGSKGSGSGQFNGP 1448



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 34/56 (60%), Gaps = 3/56 (5%)

Query: 99   GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA--DGP 152
            G+ +FDKP   AVD  G+IYVAD +NH I+K  + G T I   GS+  G    +GP
Sbjct: 1676 GNGQFDKPWGIAVDTAGDIYVADYNNHRIQKFDSAG-TFITAWGSEGSGSGQFNGP 1730



 Score = 41.6 bits (96), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 45/106 (42%), Gaps = 11/106 (10%)

Query: 100  SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
            + +FD+P   A D+ GN+YV+D  NH I+K  + G   T  G      G+ D P   A  
Sbjct: 1348 NGQFDRPTGIATDISGNVYVSDYYNHRIQKFDSTGAFLTTWGAYGTGNGQFDKPWGIA-- 1405

Query: 159  SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
                    V     + ++D+ N  I++ +      +     GS  G
Sbjct: 1406 --------VDAAGDIYVADYNNHRIQKFDSAGTFVTAWGSKGSGSG 1443



 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 11/107 (10%)

Query: 99   GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNAS 157
            G+ +FD+    A D+ GN+YV+D  NH I+K  + G   T  G      G+ D P   A 
Sbjct: 1629 GNGQFDRLTGIATDISGNVYVSDYYNHRIQKFDSTGAFLTTWGAYGTGNGQFDKPWGIA- 1687

Query: 158  FSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
                     V     + ++D+ N  I++ +      +     GS  G
Sbjct: 1688 ---------VDTAGDIYVADYNNHRIQKFDSAGTFITAWGSEGSGSG 1725



 Score = 37.4 bits (85), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 18/32 (56%)

Query: 103  FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
            F  P   A D  GN+YVAD SNH IR   + G
Sbjct: 1304 FSYPTGIATDATGNVYVADYSNHCIRVFNSTG 1335


>gi|302786076|ref|XP_002974809.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
 gi|300157704|gb|EFJ24329.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
          Length = 1049

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 47/103 (45%), Gaps = 5/103 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRA 149
           G  DG    A F  P        G +YVAD  NH I+   + +  V T+AG G    G  
Sbjct: 791 GDKDGPASQALFQHPLGILSSANGAVYVADTYNHKIKLMDLASKTVRTVAGTGVA--GYE 848

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
           DG    A FS    L   P+  +L ++D  N +IR +  +P+D
Sbjct: 849 DGKGVKAQFSEPAGLALGPN-GSLFVADTNNNVIRLLKPRPDD 890



 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 32/48 (66%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            V+ +AG GV GY DG+   A+F +P   A+   G+++VAD +N+VIR
Sbjct: 835 TVRTVAGTGVAGYEDGKGVKAQFSEPAGLALGPNGSLFVADTNNNVIR 882



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 5/105 (4%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGS 143
           G G +G +DG    A F++P+  A D + NI YVAD  NH +R++      V T+AG G 
Sbjct: 607 GSGAEGLTDGSFSLASFNRPQGVAYDTEKNILYVADTENHALREVDFVKETVRTLAGTGE 666

Query: 144 KK-EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +  + R     Q+   ++ ++L   P    L I+  G   I + N
Sbjct: 667 RGFDYRGGKAGQSQPLNSPWDLCLHPD-GMLYIAMAGEHQIWEFN 710



 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 34/135 (25%), Positives = 58/135 (42%), Gaps = 7/135 (5%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG--YSDGEPGSAR-FDKPKSFAVDMKG 115
           L V D+   A   + F    +  V+ LAG G +G  Y  G+ G ++  + P    +   G
Sbjct: 638 LYVADTENHALREVDF---VKETVRTLAGTGERGFDYRGGKAGQSQPLNSPWDLCLHPDG 694

Query: 116 NIYVADKSNHVIRKITNL-GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
            +Y+A    H I +     GV  +  G   +         N+S++    L+  P +  L 
Sbjct: 695 MLYIAMAGEHQIWEFNRSNGVAKVFSGDGYERNLNGSKGSNSSYAQPSGLSLSPDMDHLY 754

Query: 175 ISDHGNQLIRQINLK 189
           I+D  +  +R +NLK
Sbjct: 755 IADSESSSVRSVNLK 769


>gi|312200910|ref|YP_004020971.1| serine/threonine protein kinase [Frankia sp. EuI1c]
 gi|311232246|gb|ADP85101.1| serine/threonine protein kinase [Frankia sp. EuI1c]
          Length = 804

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/99 (37%), Positives = 45/99 (45%), Gaps = 4/99 (4%)

Query: 94  SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA--D 150
            DG P + A    P   AV   G+IYVAD     IR+IT  G      G S K+G     
Sbjct: 637 GDGGPATKASLPSPSGLAVTADGSIYVADDYLDTIREITPDGTIRTVAGTSGKDGETGDG 696

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           GPA  A  S    L   P   +L I+D  N  IR+I+ K
Sbjct: 697 GPAARALLSGPTGLALGPD-GSLYITDTHNAKIRRIDPK 734


>gi|290991612|ref|XP_002678429.1| predicted protein [Naegleria gruberi]
 gi|284092041|gb|EFC45685.1| predicted protein [Naegleria gruberi]
          Length = 992

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 43/142 (30%), Positives = 69/142 (48%), Gaps = 8/142 (5%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE--PGSARFDKPKSFAVDMKG 115
           DL++ DSS    + +   L+  +V+ R+AG G + Y+       +A   +P    +   G
Sbjct: 547 DLLIADSSN---HVIRKVLTNGTVI-RVAGTGTRAYNGDNILAVNAHLSEPSGIHILSNG 602

Query: 116 NIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
            I  +DK N+ +RKI TN  + TIAG G+      + PA +  F     L   P   ++ 
Sbjct: 603 EILFSDKYNYRVRKILTNGTIITIAGIGTYGYNGENLPALSTKFFGVTGLALSPVDGSIY 662

Query: 175 ISDHGNQLIRQINLKPEDCSKS 196
           ++D  N  IR+I   P D S+S
Sbjct: 663 LADTSNHRIRKIT-DPCDVSRS 683



 Score = 43.5 bits (101), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 86  AGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGG 142
           AG G+ GY  G+    + +   P    V   G++ +AD SNHVIRK+ TN  V  +AG G
Sbjct: 515 AGTGIYGYDPGDVLAVNTKLFFPNGLDVYPNGDLLIADSSNHVIRKVLTNGTVIRVAGTG 574

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICA---LLISDHGNQLIRQI 186
           ++     +  A NA  S    +    HI +   +L SD  N  +R+I
Sbjct: 575 TRAYNGDNILAVNAHLSEPSGI----HILSNGEILFSDKYNYRVRKI 617


>gi|289707042|ref|ZP_06503373.1| antioxidant, AhpC/TSA family [Micrococcus luteus SK58]
 gi|289556228|gb|EFD49588.1| antioxidant, AhpC/TSA family [Micrococcus luteus SK58]
          Length = 703

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 51/114 (44%), Gaps = 6/114 (5%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-----LGVTTIAG 140
           AG  ++G  DG+ G A F +P   +V   G ++VAD     +R +         V T  G
Sbjct: 399 AGTALEGLLDGDAGRAWFAQPSGLSVGADGALWVADSETSAVRWVRTGEDGRREVGTAVG 458

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
            G    G  DG A  A   +   +T +P   ++LI+D  N  IR+     ED +
Sbjct: 459 AGLFDFGHVDGEADRARLQHPLGVTALPD-GSVLIADTYNGAIRRYAPAGEDAA 511


>gi|290993799|ref|XP_002679520.1| predicted protein [Naegleria gruberi]
 gi|284093137|gb|EFC46776.1| predicted protein [Naegleria gruberi]
          Length = 1414

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 65/139 (46%), Gaps = 21/139 (15%)

Query: 64  SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP------------GSARFDKPKSFAV 111
           +SR   Y +   L    V++++A DG+     G P              A+ + P+  AV
Sbjct: 62  NSRGDMYVVDAQLQ---VIRKIAKDGITTTIAGFPQDAGYNGDNILANKAKLNYPRGVAV 118

Query: 112 DMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGP-AQNASFSNDFELTFVPH 169
           +    I+++D  N+ IRKI+N G ++T+AG G  + G  D   A N  F N   L +   
Sbjct: 119 NENDEIFISDSGNYRIRKISNSGIISTVAGTG--ENGFMDHVLAINGKFGNPSHLLYTNS 176

Query: 170 ICALLISDHGNQLIRQINL 188
              L I+D  N  IR+++ 
Sbjct: 177 --TLYINDQSNNKIRKLDF 193



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 43/70 (61%), Gaps = 2/70 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S+  ++  +AG G  GY D    + ++ + PK   V   G +++AD +N+ +RKI+  G
Sbjct: 850 VSKSGIISTIAGLGAAGYIDNVLATESQLNAPKGVVVAPSGEVFIADSNNNKVRKISTSG 909

Query: 135 -VTTIAGGGS 143
            ++TIAGGGS
Sbjct: 910 IISTIAGGGS 919



 Score = 44.3 bits (103), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 47/107 (43%), Gaps = 3/107 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           +  L G G Q Y DG P        P + A + +G++YV D    VIRKI   G+TT   
Sbjct: 32  ISLLVGGGNQ-YGDGLPADKINLYNPTAIAFNSRGDMYVVDAQLQVIRKIAKDGITTTIA 90

Query: 141 GGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G  +  G   D    N +  N      V     + ISD GN  IR+I
Sbjct: 91  GFPQDAGYNGDNILANKAKLNYPRGVAVNENDEIFISDSGNYRIRKI 137



 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
           +A    P   +++ +G +Y+AD SNH IRK++  G ++TIAG G+   G  D      S 
Sbjct: 820 TAHLWNPIGISLNSEGEVYIADLSNHRIRKVSKSGIISTIAGLGA--AGYIDNVLATESQ 877

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQI 186
            N  +   V     + I+D  N  +R+I
Sbjct: 878 LNAPKGVVVAPSGEVFIADSNNNKVRKI 905


>gi|227540165|ref|ZP_03970214.1| conserved protein of hypothetical function [Sphingobacterium
           spiritivorum ATCC 33300]
 gi|227240026|gb|EEI90041.1| conserved protein of hypothetical function [Sphingobacterium
           spiritivorum ATCC 33300]
          Length = 658

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 47/137 (34%), Positives = 62/137 (45%), Gaps = 15/137 (10%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS---NHVIRKIT-----NLGVT 136
           L G G  GY DG    A+ +      VD KG++Y  D +    + IRKIT     N  V+
Sbjct: 381 LIGSGGAGYVDGSLADAKTNGAWGVVVDNKGDLYFTDNNATGGNRIRKITFDASGNGTVS 440

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC--- 193
           TIAG  S   G ADG    ASF+    L    +   L +S+     IR+I+L        
Sbjct: 441 TIAGQAS--TGNADGIGAAASFNRPSGLAIDKNNNILYVSEWIGNRIRRIDLSTNRVELL 498

Query: 194 --SKSSQSGSALGAVSV 208
             S ++QSG   G  SV
Sbjct: 499 AGSATAQSGDVEGIGSV 515



 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 50  VIDRPGSSDLIVLDSSRSA---FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           V+D  G  DL   D++ +       ++F  S    V  +AG    G +DG   +A F++P
Sbjct: 406 VVDNKG--DLYFTDNNATGGNRIRKITFDASGNGTVSTIAGQASTGNADGIGAAASFNRP 463

Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFS--- 159
              A+D   NI YV++   + IR+I   TN  V  +AG  + + G  +G    A F+   
Sbjct: 464 SGLAIDKNNNILYVSEWIGNRIRRIDLSTNR-VELLAGSATAQSGDVEGIGSVARFNQLG 522

Query: 160 NDFELTFVPHICALLISDHGNQL 182
           N  EL+  P    L +S + ++ 
Sbjct: 523 NALELS--PDGKKLFVSGNSSRF 543


>gi|326436774|gb|EGD82344.1| NHL repeat protein [Salpingoeca sp. ATCC 50818]
          Length = 1484

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 38/107 (35%), Positives = 51/107 (47%), Gaps = 18/107 (16%)

Query: 92   GYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAGGGSKKEG 147
            GYS  DG    AR ++P+  A+  KG +Y+AD  NH IR +      +TT+AG GS    
Sbjct: 1326 GYSGDDGLSIGARLNRPRGIALS-KGVLYIADSGNHRIRAVDMRTQVITTVAGTGSAGFS 1384

Query: 148  RADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINL 188
               G   NA+         VPH  A      L I+D GN  +R  N+
Sbjct: 1385 GDGGLPTNAALR-------VPHGVAVHSSGTLAIADSGNHRVRVFNI 1424



 Score = 43.5 bits (101), Expect = 0.090,   Method: Composition-based stats.
 Identities = 34/114 (29%), Positives = 51/114 (44%), Gaps = 10/114 (8%)

Query: 81   VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---- 134
            V+  +AG G  G+S   G P +A    P   AV   G + +AD  NH +R + N+G    
Sbjct: 1371 VITTVAGTGSAGFSGDGGLPTNAALRVPHGVAVHSSGTLAIADSGNHRVR-VFNIGSSTA 1429

Query: 135  --VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
              + TI G G       D  A  +S +    +TF      ++ +D  N  IRQ+
Sbjct: 1430 GVIETITGDGEHGYN-GDNAATASSLNFPTSVTFSARTGNVVFADTRNNRIRQV 1482



 Score = 42.7 bits (99), Expect = 0.17,   Method: Composition-based stats.
 Identities = 29/107 (27%), Positives = 50/107 (46%), Gaps = 5/107 (4%)

Query: 85   LAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
            +AG G  G S   G   SA+ ++P+    D++GN+Y+ ++ NH +  +       I G G
Sbjct: 1204 VAGTGEMGASGVGGAATSAQLNRPRCAIADVEGNVYITEEGNHRVSIVDVASKKLIVGAG 1263

Query: 143  SKKEGR--ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            +   G     G   NA+  +   L +     +LL SD  N ++  +N
Sbjct: 1264 TGNPGHRGMGGTPGNAAIHSPVGLAWADD-GSLLFSDEENHVVYMVN 1309



 Score = 37.7 bits (86), Expect = 5.1,   Method: Composition-based stats.
 Identities = 30/122 (24%), Positives = 56/122 (45%), Gaps = 7/122 (5%)

Query: 86   AGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
            AG G  G+    G PG+A    P   A    G++  +D+ NHV+  +     + ++  G 
Sbjct: 1262 AGTGNPGHRGMGGTPGNAAIHSPVGLAWADDGSLLFSDEENHVVYMVNRRSLIISVVAGT 1321

Query: 143  SKKEGRA--DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
             +  G +  DG +  A  +    +     +  L I+D GN  IR ++++ +  +  + +G
Sbjct: 1322 PRFAGYSGDDGLSIGARLNRPRGIALSKGV--LYIADSGNHRIRAVDMRTQVITTVAGTG 1379

Query: 201  SA 202
            SA
Sbjct: 1380 SA 1381


>gi|242017281|ref|XP_002429120.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212513984|gb|EEB16382.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 791

 Score = 47.8 bits (112), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 35/68 (51%), Gaps = 2/68 (2%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVT 136
           E VVK   G   QG+ DG    A FD P   A   +  +Y+AD +NH IR   +  + VT
Sbjct: 237 EGVVKYTIGGKEQGFKDGNFQEALFDTPHGLAFFNESILYIADTNNHAIRLVDLNKMIVT 296

Query: 137 TIAGGGSK 144
           T+ G G +
Sbjct: 297 TVVGNGER 304



 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 61/124 (49%), Gaps = 11/124 (8%)

Query: 76  LSEESVVKRLAGDGVQGYSDGE-PGSARFDKPK--SFAVDMKGNIYVADKSNHVIRKI-- 130
           + +E V   +AG+G +   + + P  A F +P   +++ D+K  +++AD  +  IR++  
Sbjct: 487 MYKEKVCLAIAGNGKEENKNNDLPNKAGFAQPCGITYSEDLKV-LFIADTESSSIRRLSL 545

Query: 131 TNLGVTTIAGGGSKKE-----GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           T   VT +AGG    E     G  DG   N    +   LT+     +L + D  N  I+Q
Sbjct: 546 TKGSVTHLAGGSRFPEDLFSFGDVDGKGYNVRLQHPLGLTWNKKRKSLFVCDSYNHKIKQ 605

Query: 186 INLK 189
           IN++
Sbjct: 606 INVE 609


>gi|296128041|ref|YP_003635291.1| alkyl hydroperoxide reductase [Cellulomonas flavigena DSM 20109]
 gi|296019856|gb|ADG73092.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Cellulomonas flavigena DSM 20109]
          Length = 639

 Score = 47.8 bits (112), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 51/106 (48%), Gaps = 3/106 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
           V  LAG   +G  DG P  A F +P   +VD  G +++AD     +R +   +  V ++ 
Sbjct: 353 VTHLAGTMNEGLVDGTPQEAWFAQPSGLSVDAAGRLWLADAETSALRWVDPADASVHSVV 412

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           G G    G  DG A  A F +   +  +P   ++L++D  N  +R+
Sbjct: 413 GTGLFDFGHRDGAADQALFQHPLGVAALPD-GSVLVADTYNGALRR 457



 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 56/107 (52%), Gaps = 13/107 (12%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   ++ PG + L+    + +  +TL+   ++   + R  G G +G  DG P  ARF +P
Sbjct: 181 PAKAVELPGGTFLV----ADAGHHTLTEVAADGETLVRRIGSGERGLVDGGPDDARFSEP 236

Query: 107 KSFAV---DMKG----NIYVADKSNHVIR--KITNLGVTTIAGGGSK 144
               +   +++G    ++ VAD  NH +R  ++++  VTT+AG G +
Sbjct: 237 NGLGLVPDELRGRVGYDVLVADTVNHALRGVRLSDGHVTTLAGTGEQ 283



 Score = 37.0 bits (84), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 49/109 (44%), Gaps = 10/109 (9%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY-VADKSNHVIRKITNLGVTTIAGGGSKK 145
           GDG       +P + RF    + AV++ G  + VAD  +H + ++   G T +   GS +
Sbjct: 164 GDGPYVAPTPQPTTLRF---PAKAVELPGGTFLVADAGHHTLTEVAADGETLVRRIGSGE 220

Query: 146 EGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINL 188
            G  DG   +A FS    L  VP          +L++D  N  +R + L
Sbjct: 221 RGLVDGGPDDARFSEPNGLGLVPDELRGRVGYDVLVADTVNHALRGVRL 269


>gi|162448821|ref|YP_001611188.1| NHL repeat-containing protein [Sorangium cellulosum So ce56]
 gi|161159403|emb|CAN90708.1| hypothetical protein with NHL repeat [Sorangium cellulosum So ce56]
          Length = 1889

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/62 (43%), Positives = 36/62 (58%), Gaps = 4/62 (6%)

Query: 92   GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRA 149
            G  DG   +ARF +P     D +GN+Y+AD  NH +R+  I    VTT+AG G+   G A
Sbjct: 949  GNVDGVGTTARFAEPSGIVADGRGNLYIADAFNHSLRRFEIATSMVTTVAGNGTA--GSA 1006

Query: 150  DG 151
            DG
Sbjct: 1007 DG 1008



 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 54/103 (52%), Gaps = 9/103 (8%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKK 145
           G +  ++G    ARF +P   A D     +YVAD++   +R+I  T+  VTTIA  GS  
Sbjct: 795 GARALTNGSYAQARFIQPGDMAYDASTRMLYVADRT--AVRRIDLTSGAVTTIA--GSTV 850

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            G  DG    A F+    +T   H  AL I+D G++ IR++ L
Sbjct: 851 SGTTDGVGTAARFNTISGITL--HDGALYIADRGSRTIRKLTL 891



 Score = 41.2 bits (95), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 38/65 (58%), Gaps = 3/65 (4%)

Query: 80   SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD--KSNHVIRKITNLGVT- 136
            S+V  +AG+G  G +DG   +AR   P++ A+D  G++++ +  K   V R    LG T 
Sbjct: 992  SMVTTVAGNGTAGSADGVGTAARLTSPQALAIDGSGSVFIGEGNKPGRVRRFDPALGATV 1051

Query: 137  TIAGG 141
            T+AGG
Sbjct: 1052 TVAGG 1056


>gi|256393018|ref|YP_003114582.1| NHL repeat containing protein [Catenulispora acidiphila DSM 44928]
 gi|256359244|gb|ACU72741.1| NHL repeat containing protein [Catenulispora acidiphila DSM 44928]
          Length = 1046

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 49/89 (55%), Gaps = 2/89 (2%)

Query: 100  SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASF 158
            SA+ + P+  AVD  G+++VAD  ++ IR++T  G ++T+    +   G + G A  AS 
Sbjct: 940  SAQLNDPQGVAVDGAGDLFVADTFDNAIREVTPDGTISTVVNSSAAPGGESSGAAPTASH 999

Query: 159  SND-FELTFVPHICALLISDHGNQLIRQI 186
             N  + +T  P    L I+D  N +I Q+
Sbjct: 1000 LNTPYAVTVDPSTDLLYIADTRNSVIAQV 1028



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 38/57 (66%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           L+   ++ RL G G  GYS DG P   A+ ++P   A+D +G++Y+AD +N+VIR++
Sbjct: 849 LTPRGLLLRLIGTGRAGYSGDGRPSPLAQLNQPIGLALDAQGDLYIADSANNVIRRV 905


>gi|111219554|ref|YP_710348.1| protein serine/threonine kinase [Frankia alni ACN14a]
 gi|111147086|emb|CAJ58733.1| putative Protein serine/threonine kinase [Frankia alni ACN14a]
          Length = 765

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 57/114 (50%), Gaps = 4/114 (3%)

Query: 78  EESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           E+  +  +AG   +  S G+ G   +A   +P   A+   G++YVAD+    +R+I   G
Sbjct: 583 EKGRITTVAGKATEYGSAGDGGPAVNATLYQPSGLALGDDGSLYVADRGEDTVRRIDRSG 642

Query: 135 -VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            +T +AG       R DG A  A+  +D     +    +L I+D GN +IR+++
Sbjct: 643 RITLVAGRPGTYGNRGDGRAATAALLDDPTGLALGPDGSLYIADAGNDVIRRVD 696



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 56/114 (49%), Gaps = 11/114 (9%)

Query: 81  VVKRLAGDGVQGYSDG------EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           V+  +AG G   Y+DG      +   A  D P + AVD  G +YVA+   + +R++    
Sbjct: 530 VITTVAGAGPD-YADGFTADVGDATKAHLDDPSAVAVDDDGVLYVAE--GYRVRRVEKGR 586

Query: 135 VTTIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +TT+AG  ++     D GPA NA+      L       +L ++D G   +R+I+
Sbjct: 587 ITTVAGKATEYGSAGDGGPAVNATLYQPSGLALG-DDGSLYVADRGEDTVRRID 639



 Score = 40.4 bits (93), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 30/53 (56%), Gaps = 1/53 (1%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADG 151
           +A  D P   A+   G++Y+AD  N VIR++   G +TT+AG  S   G  DG
Sbjct: 665 AALLDDPTGLALGPDGSLYIADAGNDVIRRVDGHGIITTVAGTASYSAGDHDG 717


>gi|288801999|ref|ZP_06407440.1| conserved hypothetical protein [Prevotella melaninogenica D18]
 gi|288335434|gb|EFC73868.1| conserved hypothetical protein [Prevotella melaninogenica D18]
          Length = 486

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 4/103 (3%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT---TIAGGGSKKE 146
           +G+ DG   +A+F  P+       G +Y+AD  NH IR I T LG     T   G  +  
Sbjct: 384 KGWEDGALKNAKFCYPRQMTFTKDGKLYIADSGNHCIRMIDTTLGKNARVTTPIGVPQSA 443

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G  DG  + A F+    +        + ++D  NQ+IR++++K
Sbjct: 444 GFQDGGVELAKFNWPTGVAVSADGSTVYVADSKNQVIRELSIK 486


>gi|281415417|ref|ZP_06247159.1| thiol-disulfide isomerase-like thioredoxin [Micrococcus luteus NCTC
           2665]
          Length = 699

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 51/114 (44%), Gaps = 6/114 (5%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-----LGVTTIAG 140
           AG  ++G  DG+ G A F +P   +V   G ++VAD     +R +         V T  G
Sbjct: 395 AGTALEGLLDGDAGRAWFAQPSGLSVGADGALWVADSETSAVRWVRTGEDGRREVGTAVG 454

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
            G    G  DG A  A   +   +T +P   ++LI+D  N  IR+     ED +
Sbjct: 455 AGLFDFGHVDGEADRARLQHALGVTALPD-GSVLIADTYNGAIRRYAPAGEDAA 507


>gi|239918387|ref|YP_002957945.1| thiol-disulfide isomerase-like thioredoxin [Micrococcus luteus NCTC
           2665]
 gi|239839594|gb|ACS31391.1| thiol-disulfide isomerase-like thioredoxin [Micrococcus luteus NCTC
           2665]
          Length = 699

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 51/114 (44%), Gaps = 6/114 (5%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-----LGVTTIAG 140
           AG  ++G  DG+ G A F +P   +V   G ++VAD     +R +         V T  G
Sbjct: 395 AGTALEGLLDGDAGRAWFAQPSGLSVGADGALWVADSETSAVRWVRTGEDGRREVGTAVG 454

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
            G    G  DG A  A   +   +T +P   ++LI+D  N  IR+     ED +
Sbjct: 455 AGLFDFGHVDGEADRARLQHALGVTALPD-GSVLIADTYNGAIRRYAPAGEDAA 507


>gi|410896258|ref|XP_003961616.1| PREDICTED: NHL repeat-containing protein 2-like [Takifugu rubripes]
          Length = 716

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 45/79 (56%), Gaps = 8/79 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSK---K 145
           G  DG+   A F+ P+  A+  KG+ +YVAD  NH+IRKI  +   V+T+AG G +   K
Sbjct: 262 GRKDGDLSEASFNSPQGVAI--KGDTVYVADTENHLIRKIDLSAGKVSTLAGTGEQGTDK 319

Query: 146 EGRADGPAQNASFSNDFEL 164
           +G A GP Q  S   D  L
Sbjct: 320 DGGAPGPEQPISSPWDVTL 338


>gi|290982235|ref|XP_002673836.1| protein kinase [Naegleria gruberi]
 gi|284087422|gb|EFC41092.1| protein kinase [Naegleria gruberi]
          Length = 1543

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 68/160 (42%), Gaps = 26/160 (16%)

Query: 45  INPH---SVIDRP-----GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
           IN H   S  +RP     G   + V DS      ++S        VK  AG G+ G++ D
Sbjct: 588 INDHALNSTFNRPTGLSFGDGHVYVADSGNHMIRSISITTKS---VKTFAGTGIPGFNGD 644

Query: 96  GEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK-ITNLGVTTIAGGGSKKEGRADGPA 153
           G+     + + P      +   I VAD  NH IR  I N  V TIAG      G  +GPA
Sbjct: 645 GKLLLETQLNSPMDVVFTVTMGILVADTFNHRIRLLIPNFIVKTIAGSSVDGSGGDEGPA 704

Query: 154 QNASFSNDFELTFVPH-----ICALLISDHGNQLIRQINL 188
            N+         F PH     + A+ I+D  N  IR I++
Sbjct: 705 INSEL-------FYPHSISFGVTAIYIADTFNNKIRAISI 737



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 38/139 (27%), Positives = 60/139 (43%), Gaps = 7/139 (5%)

Query: 48  HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK 107
           HS +D  G   +++ D+ R+    + +P    +V+  LAG    G  D  P         
Sbjct: 773 HSYVDEGGIERVLISDN-RNDRLRVYYP--GNNVIHSLAG----GIGDFGPSENAIGNCL 825

Query: 108 SFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
             +V    +IY+ D  N++IR I N  ++TIAG G       +  A  AS +    + + 
Sbjct: 826 GISVAPNNDIYLTDVKNNLIRLIRNGIISTIAGNGGYGFNLDNIYATQASLAYPSGIVYS 885

Query: 168 PHICALLISDHGNQLIRQI 186
                L+ SD  N  IR I
Sbjct: 886 ESTGNLIFSDTNNHRIRVI 904


>gi|223939922|ref|ZP_03631790.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223891413|gb|EEF57906.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 359

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/136 (29%), Positives = 66/136 (48%), Gaps = 12/136 (8%)

Query: 60  IVLDSSRSAFY------TLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAV 111
           + LD + + F+      T+    ++  +V  +AG+G  G+S DG P + A+ ++P S   
Sbjct: 113 VRLDQAENLFWVERLSHTVRKCDAKTGIVTTIAGNGTAGFSGDGGPATKAQMNEPHSIGF 172

Query: 112 DMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPA-QNASFSNDFELTFVP 168
           D  GN+Y+ D  NH IRK+   +  ++T  G G +K    DG A  NA  +    L F P
Sbjct: 173 DKAGNLYICDVRNHRIRKVDMKSGMISTFCGTGERKP-TPDGTAIGNAPLNGPRALDFGP 231

Query: 169 HICALLISDHGNQLIR 184
                +    GN + +
Sbjct: 232 DGNLWVALREGNAVYK 247



 Score = 45.1 bits (105), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 41/151 (27%), Positives = 70/151 (46%), Gaps = 11/151 (7%)

Query: 44  EINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSA- 101
           E+N  + I R     L + D+       ++     +  +  +AG G  G+S DG P +A 
Sbjct: 50  ELNGPTGIVRGSDGSLYICDTENHRIRKVT----ADGNISTVAGTGEHGWSGDGGPATAA 105

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASF 158
           + ++P    +D   N++  ++ +H +RK    T + VTTIAG G+       GPA  A  
Sbjct: 106 KLNEPYEVRLDQAENLFWVERLSHTVRKCDAKTGI-VTTIAGNGTAGFSGDGGPATKAQM 164

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLK 189
           +    + F      L I D  N  IR++++K
Sbjct: 165 NEPHSIGF-DKAGNLYICDVRNHRIRKVDMK 194



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 43/86 (50%), Gaps = 3/86 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           +   AG+G +G+S DG     A  + P        G++Y+ D  NH IRK+T  G ++T+
Sbjct: 28  ISNFAGNGKKGFSGDGAAAKGAELNGPTGIVRGSDGSLYICDTENHRIRKVTADGNISTV 87

Query: 139 AGGGSKKEGRADGPAQNASFSNDFEL 164
           AG G        GPA  A  +  +E+
Sbjct: 88  AGTGEHGWSGDGGPATAAKLNEPYEV 113



 Score = 41.2 bits (95), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 43/88 (48%), Gaps = 5/88 (5%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SAR 102
           +N    +D     +L V     +A Y +     E+  +  +AG G  G++ +G P  +A 
Sbjct: 221 LNGPRALDFGPDGNLWVALREGNAVYKIDM---EKGRIFHVAGTGKNGFTGNGGPAKAAT 277

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           F  PK  +V   GN++VAD  NH IR I
Sbjct: 278 FKGPKGLSVASNGNVFVADTENHAIRMI 305


>gi|168180827|ref|ZP_02615491.1| NHL repeat protein [Clostridium botulinum NCTC 2916]
 gi|421838034|ref|ZP_16272034.1| NHL repeat-containing protein [Clostridium botulinum CFSAN001627]
 gi|182668214|gb|EDT80193.1| NHL repeat protein [Clostridium botulinum NCTC 2916]
 gi|409739668|gb|EKN40278.1| NHL repeat-containing protein [Clostridium botulinum CFSAN001627]
          Length = 467

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 6/109 (5%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGG 141
           ++ G G  G  DG+   A+F  P+     ++  IYVAD  NH IR+  +    V TIAG 
Sbjct: 211 KIIGSGEYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHSIRECDLEKEIVKTIAGN 269

Query: 142 GSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G K+    A G A N S ++ ++L  +     + I+  GN  I + N++
Sbjct: 270 GQKEYSPMAKGDALNVSLNSPWDLEILKG--CIYIAMAGNHQIWKYNMR 316


>gi|436838118|ref|YP_007323334.1| NHL repeat-containing protein [Fibrella aestuarina BUZ 2]
 gi|384069531|emb|CCH02741.1| NHL repeat-containing protein [Fibrella aestuarina BUZ 2]
          Length = 375

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 42/127 (33%), Positives = 54/127 (42%), Gaps = 18/127 (14%)

Query: 74  FPLSEESVVKRLAG----DGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           F ++    V RL+G    DG    GY DG    A+F  P S  +D  G +Y+ D  NH I
Sbjct: 141 FRINSAGQVTRLSGGPDPDGGYSFGYRDGPLARAKFSNPWSIIMDKAGMLYIGDGLNHCI 200

Query: 128 RKI---TNLGVTTIAGGGSK-----KEGRADGPAQNASFSNDFELTFVPHICALLISDHG 179
           RK+   TN  V+T AG   +     K     GPA       D  L        L I D G
Sbjct: 201 RKVDLSTNT-VSTFAGKPLENTAVCKYADGKGPAAQVCTVEDLTLD---QDGNLFIPDWG 256

Query: 180 NQLIRQI 186
           N  +R +
Sbjct: 257 NNSVRMV 263


>gi|290977893|ref|XP_002671671.1| predicted protein [Naegleria gruberi]
 gi|284085242|gb|EFC38927.1| predicted protein [Naegleria gruberi]
          Length = 3330

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 56/110 (50%), Gaps = 4/110 (3%)

Query: 81   VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
            ++  LAG+G+  +S DG   + A  + P    V   G IY++D  NH IR I   G ++T
Sbjct: 2358 LISTLAGNGIANFSGDGLAATEASLNSPSGVFVSEFGKIYISDSGNHRIRAILPNGIIST 2417

Query: 138  IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            + G G       +G A NAS +N + +    +   L+ISD  N  IR I+
Sbjct: 2418 VGGNGIPGFSGDNGLATNASLNNPYGIV-ETYSGDLIISDSDNNRIRLID 2466



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 37/144 (25%), Positives = 64/144 (44%), Gaps = 10/144 (6%)

Query: 46   NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
            NP+ +++   S DLI+ DS  +    +        ++  LAG G +G+ DG    A F+ 
Sbjct: 2440 NPYGIVE-TYSGDLIISDSDNNRIRLID----RYGIITTLAGTGEKGFQDGFFLDALFNN 2494

Query: 106  PKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
            P          +Y++D     IR+  + +  V TI G G K     + P      +N   
Sbjct: 2495 PSQLFY-FHSRLYISDTYGQRIREANLESKTVKTIIGTGEKGYNGDNFPT-TTQLNNPLS 2552

Query: 164  LTFVPHICALLISDHGNQLIRQIN 187
            + F+ +   + I+D  N+ IR +N
Sbjct: 2553 M-FITYNNIIYIADTDNKKIRMLN 2575



 Score = 37.4 bits (85), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 5/66 (7%)

Query: 74   FPLSEES-VVKRLAGD-GVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            +  SEES  V+R+AG     GY+ DG+P   +   +P+S  V   G +Y+ D +NH+IR+
Sbjct: 2025 YSASEESGTVQRIAGSLSSSGYNGDGKPAIDSLLSQPQSVFVK-DGEVYICDTNNHLIRR 2083

Query: 130  ITNLGV 135
            I   G+
Sbjct: 2084 IDRNGI 2089


>gi|433458786|ref|ZP_20416675.1| NHL repeat containing protein [Arthrobacter crystallopoietes
           BAB-32]
 gi|432192788|gb|ELK49609.1| NHL repeat containing protein [Arthrobacter crystallopoietes
           BAB-32]
          Length = 680

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 57/121 (47%), Gaps = 11/121 (9%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--- 131
           PL+ E  V  LAG G++G  DGE  +A F +    A D  G+I++AD     +R +T   
Sbjct: 378 PLTGELAV--LAGTGLEGLQDGEANAAWFAQSSGLAQDANGSIWIADSETSALRVLTFSD 435

Query: 132 -----NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
                 + V T  G G    G  DG A  A   +   +T +P   ++ ++D  N  +R+ 
Sbjct: 436 VNGSERVQVQTAIGTGLFDFGFRDGGAAEARLQHPLGVTVLPD-GSVAVADTYNGAVRRY 494

Query: 187 N 187
           +
Sbjct: 495 D 495



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 9/75 (12%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV---DMKG----NIYVADKSNHVIRKIT- 131
           S V    G G +G+ DG+  +A+F++P+  A+   D++     ++ +AD  NH +R +T 
Sbjct: 219 STVLNTYGSGSKGWQDGDAEAAQFNEPQGLALLPEDVRAKAGYDVVIADSVNHRLRGLTL 278

Query: 132 -NLGVTTIAGGGSKK 145
               V TIAG G ++
Sbjct: 279 ATGAVVTIAGNGVQR 293


>gi|375098226|ref|ZP_09744491.1| Peroxiredoxin [Saccharomonospora marina XMU15]
 gi|374658959|gb|EHR53792.1| Peroxiredoxin [Saccharomonospora marina XMU15]
          Length = 673

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 50/100 (50%), Gaps = 9/100 (9%)

Query: 97  EPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQN 155
           +PGS  RF  P    V  +G + VAD + H + +  +   T +   GS   G ADG    
Sbjct: 228 QPGSGLRF--PAKAVVTAEGRVLVADTAQHSVAEFASDAETLVRRFGSGVRGHADGAFDV 285

Query: 156 ASFSNDFELTFVPHICA------LLISDHGNQLIRQINLK 189
           A+F+    LT +P   A      L+++D GN L+R ++L+
Sbjct: 286 AAFAEPSGLTLLPTEIAERVGYHLVVADTGNHLLRGVDLR 325



 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 61/121 (50%), Gaps = 11/121 (9%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSNH 125
            F    E++V+R  G GV+G++DG    A F +P          A  +  ++ VAD  NH
Sbjct: 259 EFASDAETLVRRF-GSGVRGHADGAFDVAAFAEPSGLTLLPTEIAERVGYHLVVADTGNH 317

Query: 126 VIRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
           ++R +      VTT+AG G + ++G   G A + + ++ +++ +      ++++  GN  
Sbjct: 318 LLRGVDLRTGAVTTVAGTGEQWRDGADAGTALDVALTSPWDVRWWEAAGGVVVAMAGNHT 377

Query: 183 I 183
           +
Sbjct: 378 L 378


>gi|345000781|ref|YP_004803635.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Streptomyces sp. SirexAA-E]
 gi|344316407|gb|AEN11095.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Streptomyces sp. SirexAA-E]
          Length = 608

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
            +V D++R     L    ++   V+   G G +G +DG PG ARF +P+  A+   G I 
Sbjct: 193 FLVSDTTRHRLVELD---ADGETVRGHFGTGERGLTDGGPGEARFSEPQGLALLPDGRIA 249

Query: 119 VADKSNHVIRKI 130
           VAD  NH IR +
Sbjct: 250 VADTVNHAIRAL 261


>gi|194877446|ref|XP_001973882.1| GG16549 [Drosophila erecta]
 gi|190657069|gb|EDV54282.1| GG16549 [Drosophila erecta]
          Length = 717

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 55/122 (45%), Gaps = 19/122 (15%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
           VV+   G    G+ DG   +ARF+ P+  A   +  + VAD  NH IR+I  TN  V T+
Sbjct: 257 VVQHKIGGYQPGFVDGNSTAARFNNPQGIAFLNENTLIVADTKNHAIRQISLTNGMVETL 316

Query: 139 AGGGSKKEGRADG----------PAQNASF-SNDFELTF------VPHICALLISDHGNQ 181
           AG G +   R  G          P   A F + D +++F      +P    +LIS  G  
Sbjct: 317 AGTGHQGNERIGGRLGPLQPLSSPWDVAIFRTRDMDMSFHLDERNIPEKTIVLISMAGTH 376

Query: 182 LI 183
            I
Sbjct: 377 QI 378


>gi|290986946|ref|XP_002676184.1| NHL repeat-containing protein [Naegleria gruberi]
 gi|284089785|gb|EFC43440.1| NHL repeat-containing protein [Naegleria gruberi]
          Length = 1290

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 37/136 (27%), Positives = 65/136 (47%), Gaps = 9/136 (6%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAV 111
           P   +   +DSS     T+ + +S   +V   AG G  GY +D  P  SA  + PK  A 
Sbjct: 438 PNVGEYYFVDSS-----TVKYLISSSGIVITKAGSGTTGYNADNIPATSAHLNFPKGIAR 492

Query: 112 DMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
              G++++ D  NH IRK+  N  + T+AG G+         A +A+ +N + +    + 
Sbjct: 493 STNGDLFITDSGNHRIRKVFLNQTIITVAGTGTSAYNGDGLLATSANINNPYGIVLTSNN 552

Query: 171 CALLISDHGNQLIRQI 186
             ++ S++    +R+I
Sbjct: 553 -EVIFSEYEGHRVRKI 567


>gi|195456364|ref|XP_002075106.1| GK23381 [Drosophila willistoni]
 gi|194171191|gb|EDW86092.1| GK23381 [Drosophila willistoni]
          Length = 721

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 56/115 (48%), Gaps = 5/115 (4%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
           + VV+   G    G+ DG+  ++RF+ P+  A   +  + VAD  NH +R+I+  G  V 
Sbjct: 259 DGVVEHKIGGFESGFIDGDLTTSRFNNPQGIAFLDEDTLIVADTKNHALRQISLQGGIVE 318

Query: 137 TIAGGG---SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           T+AG G   S++ G   GP Q+ S   D  +   P +      D  N L + I L
Sbjct: 319 TLAGTGHQCSERTGGRTGPVQSMSSPWDVAIFRTPDMDMSFHLDERNVLEKTIIL 373


>gi|406944252|gb|EKD76066.1| Serine/threonine protein kinase, partial [uncultured bacterium]
          Length = 212

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV--DMKGNIYVADKSNHVIRKITNL 133
           +  + ++  +AG G  GY DG    A F +P+  A+  D    +++AD +N+VIRKI + 
Sbjct: 100 IGTDGILTNVAGAGDYGYRDGSSDYALFAQPQDIAIYGDTASELFIADTNNNVIRKIKDG 159

Query: 134 GVTTIAGGGSKKEGRA 149
            V+T+  G S  +G A
Sbjct: 160 EVSTLLSGLSSPQGVA 175


>gi|290994118|ref|XP_002679679.1| predicted protein [Naegleria gruberi]
 gi|284093297|gb|EFC46935.1| predicted protein [Naegleria gruberi]
          Length = 1272

 Score = 47.4 bits (111), Expect = 0.007,   Method: Composition-based stats.
 Identities = 43/114 (37%), Positives = 58/114 (50%), Gaps = 9/114 (7%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMK-GNIYVADKSNHVIRKI---TN 132
           +++  +AG    G  DG+  SA    F +     +  K GNIYV DKSN  IRKI   TN
Sbjct: 193 NLITTIAGIRAAGTFDGDGKSALQTSFFRMTDVTISKKNGNIYVTDKSNQRIRKISSSTN 252

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           L V+TIAG GS     +   A NAS  N   L    +   ++ S+ GN  IR++
Sbjct: 253 L-VSTIAGNGSIGFVDSMSNAINASLFNPMYLVLTDND-EIIFSEFGNNRIRKL 304


>gi|116007528|ref|NP_001036460.1| CG12547, isoform A [Drosophila melanogaster]
 gi|442622271|ref|NP_001260704.1| CG12547, isoform B [Drosophila melanogaster]
 gi|17945781|gb|AAL48938.1| RE33981p [Drosophila melanogaster]
 gi|30923523|gb|EAA46001.1| CG12547, isoform A [Drosophila melanogaster]
 gi|220948326|gb|ACL86706.1| CG12547-PA [synthetic construct]
 gi|440214080|gb|AGB93239.1| CG12547, isoform B [Drosophila melanogaster]
          Length = 717

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 40/127 (31%), Positives = 57/127 (44%), Gaps = 19/127 (14%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNL 133
           L+   VV+   G    G+ DG+   ARF+ P+  A   +  + VAD  NH IR+I  TN 
Sbjct: 252 LTAGGVVQHRIGGHQPGFVDGDLTVARFNNPQGIAFLNENVLIVADTKNHAIRQISLTNA 311

Query: 134 GVTTIAGGGSKKEGRADG----------PAQNASF-SNDFELTF------VPHICALLIS 176
            V T+AG G +   R  G          P   A F + D +++F      +P    +LIS
Sbjct: 312 MVETLAGTGIQGNDRIGGRLGPLQPLSSPWDVAIFRTRDMDMSFHLDERNIPEKTIILIS 371

Query: 177 DHGNQLI 183
             G   I
Sbjct: 372 MAGTHQI 378


>gi|284097436|ref|ZP_06385541.1| NHL repeat containing protein [Candidatus Poribacteria sp. WGA-A3]
 gi|283831050|gb|EFC35055.1| NHL repeat containing protein [Candidatus Poribacteria sp. WGA-A3]
          Length = 387

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 9/108 (8%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIAG 140
           +R AGDG      G    A  + P + AV   G +Y+AD  NH IR++      ++TIAG
Sbjct: 113 QRFAGDG------GPAVEAALNFPSAVAVAEDGTVYIADTWNHRIRRVDPGTGAISTIAG 166

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            G  K    +GPA  A+ +    L        L I+D  N  +R+++L
Sbjct: 167 TGQAKCYGDNGPAVKAALNEPVALAL-DGPDRLYIADQSNNRVRKLDL 213



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 67/155 (43%), Gaps = 14/155 (9%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVD 112
           G   L + D S +    L        V+  + G G  GY+ DG PG       P   AVD
Sbjct: 194 GPDRLYIADQSNNRVRKLDL---TSGVMTTVVGTGESGYNGDGAPGPETALAGPSGLAVD 250

Query: 113 MKGNIYVADKSNHVIRKITNL--GVTTIAGGGSKKEGRADGPAQNAS---FSNDFELTFV 167
            +GN+Y+AD  +  IRK       V T+AGG    +     P +N S    S  + +   
Sbjct: 251 QEGNLYIADTFSGRIRKWDRHTGTVETVAGGTGAFQFT---PGENESSPNLSRPYAIALH 307

Query: 168 PHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           P    L I+D  N LIR  +L+  + S  + +G A
Sbjct: 308 PD-GRLFITDSDNHLIRVWDLQKREMSLLAGNGKA 341



 Score = 45.4 bits (106), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 25/58 (43%), Positives = 36/58 (62%), Gaps = 11/58 (18%)

Query: 96  GEPG---------SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV-TTIAGGG 142
           GEPG         +A  ++PK+ A+D  GN+Y+AD  NH++RK+    G+ TTIAG G
Sbjct: 1   GEPGWGGDAGPATAAMLNEPKTVALDAAGNLYIADSENHLVRKVDARTGIMTTIAGTG 58


>gi|116621305|ref|YP_823461.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116224467|gb|ABJ83176.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 1042

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 49/163 (30%), Positives = 63/163 (38%), Gaps = 48/163 (29%)

Query: 71  TLSFPLSEESVVKRLAGD-------GVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADK 122
           TLS  L   +   +LAG        GV    DG P  SA  + P++   D  G IY+AD 
Sbjct: 8   TLSAALCLTTAHAQLAGSYVSRTIAGVFPLGDGGPATSAILETPQAVVADSSGTIYIADA 67

Query: 123 SNHVIRKI-------------------------------------TNLG-VTTIAGGGSK 144
            N VIRK+                                     T+ G VTT+AG G+ 
Sbjct: 68  GNGVIRKVSRGIISTVAGYTGYISDLKLDSSGNLYLAGGNNVFKLTSAGKVTTVAGNGTS 127

Query: 145 KEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
                D GPA NA FS  + +        L I D GN  IR++
Sbjct: 128 GTYTGDGGPAINAGFSGAYAIAL-DSTGNLYICDSGNNAIRKV 169



 Score = 44.3 bits (103), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 3/65 (4%)

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           F L+    V  +AG+G  G   G+ G   +A F    + A+D  GN+Y+ D  N+ IRK+
Sbjct: 110 FKLTSAGKVTTVAGNGTSGTYTGDGGPAINAGFSGAYAIALDSTGNLYICDSGNNAIRKV 169

Query: 131 TNLGV 135
           T  G+
Sbjct: 170 TPDGI 174



 Score = 43.5 bits (101), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 30/114 (26%), Positives = 56/114 (49%), Gaps = 6/114 (5%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   +++  AGDG +G++ DG   + A F       +D   N+YV D +N  +R I ++
Sbjct: 280 IAPNGIIQAFAGDGTEGFAGDGSSATLAHFSGVTGLGMDPNNNLYVVDGNNERVRMINSV 339

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G + T+AG   +     DG A  A+  +  + T      ++  +D  N  +R+I
Sbjct: 340 GSIATVAG---RAHFSGDGGAATAATLHRPQGTVTGTDGSIYFTDTANHRVRKI 390



 Score = 40.0 bits (92), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 50/107 (46%), Gaps = 9/107 (8%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKR--LAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
           +++L V+D +      ++   S  +V  R   +GDG      G   +A   +P+      
Sbjct: 320 NNNLYVVDGNNERVRMINSVGSIATVAGRAHFSGDG------GAATAATLHRPQGTVTGT 373

Query: 114 KGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
            G+IY  D +NH +RKI   G +TTIAG G        GPA  A+ S
Sbjct: 374 DGSIYFTDTANHRVRKIGTDGKITTIAGTGDLGFSGDGGPATQATMS 420



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 40/71 (56%), Gaps = 3/71 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +  +  +  +AG G  G+S DG P + A    P + A+D   N+YV D+    +RKIT  
Sbjct: 390 IGTDGKITTIAGTGDLGFSGDGGPATQATMSFPDALAIDSTNNLYVIDQKQLRVRKITPT 449

Query: 134 G-VTTIAGGGS 143
           G ++T+AG G+
Sbjct: 450 GTISTVAGNGT 460


>gi|423300918|ref|ZP_17278942.1| hypothetical protein HMPREF1057_02083 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472253|gb|EKJ90781.1| hypothetical protein HMPREF1057_02083 [Bacteroides finegoldii
           CL09T03C10]
          Length = 459

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 38/98 (38%), Positives = 52/98 (53%), Gaps = 2/98 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           G  G+ +G+   AR  +P    +D +GNIYVA +  H I KIT  GV T    G+   G 
Sbjct: 361 GKTGFVEGQGNDARLQEPCQMDLDEEGNIYVAVRKKHRIAKITPDGVVTSY-TGTGTSGT 419

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            DGP   A F++   + F P   AL +SD+ N  IR+I
Sbjct: 420 TDGPLDKAQFNHPEGVQFGPD-GALYVSDYWNHKIRKI 456


>gi|390960114|ref|YP_006423871.1| gluconolactonase [Terriglobus roseus DSM 18391]
 gi|390415032|gb|AFL90536.1| gluconolactonase [Terriglobus roseus DSM 18391]
          Length = 1836

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 5/98 (5%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRA-D 150
           Y+ G+  ++    P   A+D  GN+YVAD  N+VIR+I   GV TT+AG  +     A D
Sbjct: 228 YAGGDVATS-LSSPAGLAIDGAGNLYVADSGNNVIRRIDTNGVMTTVAGQVANSSPYAGD 286

Query: 151 G-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G  A  A+ +  + +   P    L I+D GN  +R+++
Sbjct: 287 GLLATAATLNAPWSVALDPD-GNLFIADTGNNAVRRVD 323



 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 43/147 (29%), Positives = 67/147 (45%), Gaps = 18/147 (12%)

Query: 46  NPHSVIDRPGSSDLIVL----DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA 101
           +P  V+    S+D  VL    D S S +  +S P           GDG      G   +A
Sbjct: 345 DPSGVLYVADSADNTVLRVNADGSFSTYVGISTPRGGS-----FDGDG------GAATAA 393

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFS 159
             + P + A D  GN+Y+AD +N+ +R++  L   +TT+AG G          +  ASFS
Sbjct: 394 HLNGPAALAFDPAGNLYIADSANNRVRRVDALSKLITTVAGNGGTVMDDTVMNSDIASFS 453

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
             + L F+ +   L + D  +  +RQI
Sbjct: 454 GPYAL-FLDNAANLYVGDLFHNRVRQI 479


>gi|290997412|ref|XP_002681275.1| predicted protein [Naegleria gruberi]
 gi|284094899|gb|EFC48531.1| predicted protein [Naegleria gruberi]
          Length = 887

 Score = 47.4 bits (111), Expect = 0.007,   Method: Composition-based stats.
 Identities = 31/96 (32%), Positives = 51/96 (53%), Gaps = 4/96 (4%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI---AGGGSKKEGRADG 151
           +G+P  A      S  ++  G++++ D +NHVIR+   LG +TI   AG G+ K    DG
Sbjct: 645 NGKPLDATLGYINSLLINDNGDLFMVDTTNHVIRQFNLLGTSTIATTAGVGTSKGYSGDG 704

Query: 152 PAQNASFSNDFELTFVPHI-CALLISDHGNQLIRQI 186
            +  +S  +    + + H   AL+  D GN LIR++
Sbjct: 705 KSAKSSLLSLNLNSTLLHYNGALMFFDGGNNLIRKV 740


>gi|187778661|ref|ZP_02995134.1| hypothetical protein CLOSPO_02256 [Clostridium sporogenes ATCC
           15579]
 gi|187772286|gb|EDU36088.1| NHL repeat protein [Clostridium sporogenes ATCC 15579]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 42/153 (27%), Positives = 69/153 (45%), Gaps = 22/153 (14%)

Query: 42  QLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
           ++ IN + +I     G + +++LD +     T+              G    G  DG+  
Sbjct: 181 KISINENGLIAFSEKGRNRIVILDQNLEKIKTI--------------GSSEYGLKDGDFK 226

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEG-RADGPAQNA 156
            A+F  P+     ++  IYVAD  NH IR+  +    V TIAG G K+    A G A N 
Sbjct: 227 EAQFKAPEGLRF-IENKIYVADTENHSIRECDLEKEMVKTIAGNGQKEYSPMAKGDALNV 285

Query: 157 SFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           S ++ ++L  +     + I+  GN  I + N+K
Sbjct: 286 SLNSPWDLEILED--CIYIAMAGNHQIWKYNMK 316


>gi|410938688|ref|ZP_11370531.1| putative lipoprotein [Leptospira noguchii str. 2006001870]
 gi|410786234|gb|EKR75182.1| putative lipoprotein [Leptospira noguchii str. 2006001870]
          Length = 358

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 36/70 (51%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + +V   AG G     DG   +A F  P    +D  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITNKPIVSLFAGTGAAVSVDGTTSTASFKTPFGLEIDTSGNIFVS 95

Query: 121 DKSNHVIRKI 130
           D++ ++IRKI
Sbjct: 96  DQTANLIRKI 105



 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 55/102 (53%), Gaps = 5/102 (4%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLG-VTTIAGGGSKKEGRADGP 152
           +G+  ++ F+ P    +D + N+YV +  NH IRKI  N G V+T++GG S   G  DG 
Sbjct: 169 NGDRLNSLFNGPFFMDIDPERNLYVGELGNHTIRKIDLNSGNVSTLSGGIS---GYLDGD 225

Query: 153 AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
             +A F +   + +     +LL +D  +  IR+INLK    S
Sbjct: 226 LASARFKSPLGIAYNHKTDSLLAADIQDHRIRKINLKTSTVS 267


>gi|290992574|ref|XP_002678909.1| predicted protein [Naegleria gruberi]
 gi|284092523|gb|EFC46165.1| predicted protein [Naegleria gruberi]
          Length = 1626

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 2/87 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFS 159
           A  + P+   VD    +Y AD  NH IRKI   G V TIAG  +      + P   A  +
Sbjct: 122 ASANSPRGITVDSNDEVYFADSKNHRIRKILKDGTVITIAGTKTGGYNGDNIPQTTAQLN 181

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
           N +++  +    A+ I+D+GN  IR+I
Sbjct: 182 NPYDVK-IGSDGAIYIADYGNNRIRKI 207



 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 53/111 (47%), Gaps = 3/111 (2%)

Query: 78  EESVVKRLAGDGVQGYS-DGEP-GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           ++  V  +AG    GY+ D  P  +A+ + P    +   G IY+AD  N+ IRKI N  +
Sbjct: 153 KDGTVITIAGTKTGGYNGDNIPQTTAQLNNPYDVKIGSDGAIYIADYGNNRIRKIANNTI 212

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           TT+AG G+      +    +A  S    +  +     + ISD  N  IR++
Sbjct: 213 TTVAGNGTIGSDSDNAEGTSAKLSGPSGIA-ISSTGDVYISDTNNNKIRKL 262



 Score = 45.1 bits (105), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 53/111 (47%), Gaps = 4/111 (3%)

Query: 79  ESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
            + +  +AG+G  G  SD   G SA+   P   A+   G++Y++D +N+ IRK++  G +
Sbjct: 209 NNTITTVAGNGTIGSDSDNAEGTSAKLSGPSGIAISSTGDVYISDTNNNKIRKLSTNGII 268

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           TT AG G          A+ A  S    L        L I+D  N  IRQ+
Sbjct: 269 TTFAGTGVSGFFGDKADAKRARLSGPRGLAITAS-GVLYIADTNNNRIRQV 318



 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 39/62 (62%), Gaps = 3/62 (4%)

Query: 86  AGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGG 142
           AG G++ Y+ DG +   A  D P   A++  G +++AD++N+ +RK+ TN  + TIAG G
Sbjct: 497 AGTGLKPYNGDGIQATCANLDTPYGIALNSDGELFIADQNNYRVRKVFTNGTIVTIAGNG 556

Query: 143 SK 144
            +
Sbjct: 557 ER 558



 Score = 37.7 bits (86), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 57/115 (49%), Gaps = 4/115 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SD-GEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITN 132
           +S+++++   +G+  + Y SD G    +R   P+  ++D   G+IY AD +N  + KIT+
Sbjct: 318 VSQQNIISTFSGNDDKVYCSDAGLALGSRTPSPRGISLDPTTGDIYFADSTNQRLSKITS 377

Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G +  ++G G       +  A  A  +N   +T  P    L  +D  N  IR+I
Sbjct: 378 DGEIELVSGTGEANFFGENVVATTAKLNNPSGVTVDPKTGDLYFADTKNNRIRKI 432


>gi|357391051|ref|YP_004905892.1| hypothetical protein KSE_41500 [Kitasatospora setae KM-6054]
 gi|311897528|dbj|BAJ29936.1| hypothetical protein KSE_41500 [Kitasatospora setae KM-6054]
          Length = 608

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 39/119 (32%), Positives = 61/119 (51%), Gaps = 18/119 (15%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG-EPGSARFDKPKSFAVDMKG--- 115
           +V DS   A   L      E+V++R+ GDGV+G  DG EP   RF +P+  A+   G   
Sbjct: 189 LVADSGHHALVELD--ADGETVLRRI-GDGVRGLVDGPEP---RFSEPQGLALVPAGLAP 242

Query: 116 ----NIYVADKSNHVIR--KITNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
               ++ VAD  NH +R  ++ +  VTT+AG G +  +     GPA     S+ +++ F
Sbjct: 243 ELGYDVVVADTVNHALRGVRLADGSVTTLAGTGKQWWQGSPTAGPALGVDLSSPWDVAF 301



 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 51/109 (46%), Gaps = 4/109 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT--NLGVTTI 138
           V+  AG   +G  DG PG A F +P   AV   G  ++VAD     +R ++     V T 
Sbjct: 326 VRVAAGTTNEGLVDGPPGEAWFAQPSGLAVSADGERLWVADSETSALRWVSRGTHEVRTA 385

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            G G    G  DG A  A   +   +T +P   ++ +SD  NQ +R+ +
Sbjct: 386 VGTGLFDFGHRDGAAGQALLQHPLGVTVLPD-GSVAVSDTYNQALRRYD 433


>gi|269926393|ref|YP_003323016.1| alkyl hydroperoxide reductase [Thermobaculum terrenum ATCC BAA-798]
 gi|269790053|gb|ACZ42194.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Thermobaculum terrenum ATCC BAA-798]
          Length = 493

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI-TNLG-VTTI 138
           ++R+ G+G +G  DG    A+F+ P+  A+  +GN +YVAD  NH +R +  N G V TI
Sbjct: 210 IRRIWGNGEEGLVDGSASEAKFNHPQGMAI--RGNELYVADTENHALRLLHLNEGKVETI 267

Query: 139 AGGGSK 144
           AG G +
Sbjct: 268 AGTGEQ 273


>gi|307727531|ref|YP_003910744.1| NHL repeat containing protein [Burkholderia sp. CCGE1003]
 gi|307588056|gb|ADN61453.1| NHL repeat containing protein [Burkholderia sp. CCGE1003]
          Length = 396

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 65/124 (52%), Gaps = 10/124 (8%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           +   V  +AG G  GYS DG P + A    P   A+D  GN+++A+   HV+RK+  + G
Sbjct: 80  DNGTVTIIAGTGQAGYSGDGGPATQAMLHYPTGVALDRAGNVFIAEPFAHVVRKVDRHTG 139

Query: 135 V-TTIAGGGSKK-EGRAD-GPAQNAS-FSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
           V TT+AG G+     R D GPA  A+ ++ND     V     L I+D     IR+++ + 
Sbjct: 140 VITTVAGPGTPDVPNRGDGGPATQATMYANDVA---VNREGDLYIADSAAGRIRRVDHQT 196

Query: 191 EDCS 194
            + S
Sbjct: 197 GNIS 200



 Score = 38.1 bits (87), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 33/65 (50%), Gaps = 6/65 (9%)

Query: 98  PGSARFDKP----KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADG 151
           PGS   D P    ++ AVD  GNIY A+ +  V+R+I   N  VT IAG G        G
Sbjct: 41  PGSGGGDAPAIQAEAIAVDRWGNIYTAEPTYSVVRRIDHDNGTVTIIAGTGQAGYSGDGG 100

Query: 152 PAQNA 156
           PA  A
Sbjct: 101 PATQA 105


>gi|344923601|ref|ZP_08777062.1| hypothetical protein COdytL_03000 [Candidatus Odyssella
           thessalonicensis L13]
          Length = 1165

 Score = 47.0 bits (110), Expect = 0.009,   Method: Composition-based stats.
 Identities = 28/79 (35%), Positives = 41/79 (51%), Gaps = 1/79 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAG 140
           V  +AG G  G+++G  GSA+F+ P+   V   G + VAD +N  IR +    G  T   
Sbjct: 100 VATVAGTGTPGHTNGPGGSAQFNMPQGITVMNDGRLLVADTNNDRIRLVDPGNGFQTSDY 159

Query: 141 GGSKKEGRADGPAQNASFS 159
            G+   G  DG A  A+F+
Sbjct: 160 AGTGSSGYTDGAAAGATFT 178



 Score = 44.7 bits (104), Expect = 0.041,   Method: Composition-based stats.
 Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 20/151 (13%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS-------NHVIR 128
           L+  ++V   AG G  G  DG   +A+   P   A D  G IY+           N  +R
Sbjct: 32  LTPWNIVTTFAGSGTAGLVDGTGAAAQLANPWGLAYDGGGYIYITQGQGFMNGFINSAVR 91

Query: 129 K--ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +  +    V T+AG G+   G  +GP  +A F+    +T V +   LL++D  N  IR +
Sbjct: 92  RLDLNTRYVATVAGTGTP--GHTNGPGGSAQFNMPQGIT-VMNDGRLLVADTNNDRIRLV 148

Query: 187 N----LKPEDCSKSSQS----GSALGAVSVW 209
           +     +  D + +  S    G+A GA   W
Sbjct: 149 DPGNGFQTSDYAGTGSSGYTDGAAAGATFTW 179



 Score = 41.6 bits (96), Expect = 0.36,   Method: Composition-based stats.
 Identities = 34/102 (33%), Positives = 42/102 (41%), Gaps = 5/102 (4%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGS 143
           AG G  GY+DG    A F  P   A    G IYVAD +   IR I  +   VTT AG  S
Sbjct: 160 AGTGSSGYTDGAAAGATFTWPMGMATGPDGTIYVADMAR--IRSINPSTNQVTTFAGTVS 217

Query: 144 KKEGRA-DGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
                A DGP     F  +  +        + + D G+   R
Sbjct: 218 TTMSDATDGPQNGPRFQYNMSVALNAAGDKVYVGDRGHCYFR 259



 Score = 40.8 bits (94), Expect = 0.65,   Method: Composition-based stats.
 Identities = 18/39 (46%), Positives = 23/39 (58%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G  +G   +A FD P     D  G++Y+AD  NHVIRKI
Sbjct: 332 GLVNGSGATALFDGPSGMVQDADGSLYIADSYNHVIRKI 370


>gi|189462806|ref|ZP_03011591.1| hypothetical protein BACCOP_03504 [Bacteroides coprocola DSM 17136]
 gi|189430422|gb|EDU99406.1| NHL repeat protein [Bacteroides coprocola DSM 17136]
          Length = 500

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%), Gaps = 9/64 (14%)

Query: 76  LSEESVVKRLAGDG-------VQGYSDGE-PGSARFDKPKSFAVDMKGNI-YVADKSNHV 126
           L+ E VV   AG G        +GY DG   G ARF+ P+  A D + N+ YV D  NH+
Sbjct: 412 LTPEGVVTTYAGRGSANLNNEARGYVDGTLRGEARFNNPRGLAYDEQNNVFYVGDMDNHI 471

Query: 127 IRKI 130
           IRKI
Sbjct: 472 IRKI 475



 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 38/131 (29%), Positives = 56/131 (42%), Gaps = 20/131 (15%)

Query: 86  AGDGVQGYSDGEPGSARFDKP------------KSFAVDMKGNIYVADKSNHVIRKITNL 133
            G G   + DG   +AR + P            K+   D+  + Y  D  NH +R +T  
Sbjct: 357 GGVGQSAWVDGVGANARLNSPYQGVFVKNPEYVKANKSDVY-DFYFTDSGNHCVRVLTPE 415

Query: 134 GV-TTIAGGGS-----KKEGRADGPAQ-NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           GV TT AG GS     +  G  DG  +  A F+N   L +        + D  N +IR+I
Sbjct: 416 GVVTTYAGRGSANLNNEARGYVDGTLRGEARFNNPRGLAYDEQNNVFYVGDMDNHIIRKI 475

Query: 187 NLKPEDCSKSS 197
            ++ ED +  S
Sbjct: 476 AMEDEDSAHES 486


>gi|449684055|ref|XP_002166330.2| PREDICTED: NHL repeat-containing protein 2-like [Hydra
           magnipapillata]
          Length = 193

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 62/135 (45%), Gaps = 17/135 (12%)

Query: 28  LEDGYTVTTVIDGHQLEINPHS-----VIDRPGSSDLIVLDSSRSAFYTLSFP------L 76
           L+ GY  T    G   EI  ++     +I  P     +V D   +  Y   F       +
Sbjct: 16  LKTGYVTTLSSQGFGAEIGLNNQPVNLIIHYPEG---LVYDPDYNLLYVCEFENHNIRVV 72

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGV 135
           S + +VK LAG+  +G  DG   +ARF  P   A D K   +YV+D+ NH++R +T +G 
Sbjct: 73  SMQGLVKTLAGE-FKGKKDGVGRTARFYYPSGLAFDRKNRMLYVSDQYNHLVRTVTAIG- 130

Query: 136 TTIAGGGSKKEGRAD 150
           + +     K+E   D
Sbjct: 131 SNVEYSNIKQEAVLD 145



 Score = 43.5 bits (101), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 45/169 (26%), Positives = 76/169 (44%), Gaps = 18/169 (10%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG-VTTIAGGGSK 144
           G G +   + +P +     P+    D   N+ YV +  NH IR ++  G V T+AG   +
Sbjct: 28  GFGAEIGLNNQPVNLIIHYPEGLVYDPDYNLLYVCEFENHNIRVVSMQGLVKTLAG---E 84

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI----------NLKPEDCS 194
            +G+ DG  + A F     L F      L +SD  N L+R +          N+K E   
Sbjct: 85  FKGKKDGVGRTARFYYPSGLAFDRKNRMLYVSDQYNHLVRTVTAIGSNVEYSNIKQEAVL 144

Query: 195 KSSQSGSALGAVS--VWVLVSVLSCLVSLVIG-FVARPYIIRHTGRLDP 240
            S+     +  V+    + ++++ C  SL++G F+ARP + +    L P
Sbjct: 145 DSTDILKKVKTVADHSSLTLNIMLCSASLLVGIFIARPCLSKLIKLLRP 193


>gi|226950178|ref|YP_002805269.1| NHL repeat-containing protein [Clostridium botulinum A2 str. Kyoto]
 gi|226843692|gb|ACO86358.1| NHL repeat protein [Clostridium botulinum A2 str. Kyoto]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 44/171 (25%), Positives = 74/171 (43%), Gaps = 22/171 (12%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEESV 81
             ++LE+       +   ++ IN   +I     G + +++LD +     T+         
Sbjct: 163 KNIILENKCEKEEYMYPSKISINEDGLIAFSEKGKNRIVILDQNLEKIKTI--------- 213

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIA 139
                G    G  DG+   A+F  P+     ++  IYVAD  NH IR+  +    V TIA
Sbjct: 214 -----GSSEYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHFIRECDLEKEIVKTIA 267

Query: 140 GGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G G K+    A G A N S ++ ++L  +     + I+  GN  I + N+K
Sbjct: 268 GNGQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMK 316


>gi|225450563|ref|XP_002277564.1| PREDICTED: NHL repeat-containing protein 2-like [Vitis vinifera]
          Length = 1078

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 4/108 (3%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAG 140
           ++   G +G  DG    A F++P+  A + K N+ YVAD  NH +R+I   N  V T+AG
Sbjct: 627 QIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAG 686

Query: 141 GGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            G+K  + +  G       ++ +++ F P    + I+  G   I + N
Sbjct: 687 NGTKGSDYQGGGKGATQLLNSPWDVCFEPINEIVYIAMAGQHQIWEHN 734



 Score = 42.4 bits (98), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 5/104 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
           G  DG         P   +    G IYVAD  NH I+K+      V+T+AG G  K G  
Sbjct: 815 GDHDGVGSEVLLQHPLGVSCGKDGQIYVADSYNHKIKKLDPATGRVSTLAGTG--KAGFK 872

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           DG A  A  S    +  V +   L I+D  N +IR ++LK ++ 
Sbjct: 873 DGRALAAQLSEPSGIVEVEN-GVLFIADTNNSVIRYLDLKKKEA 915


>gi|300771890|ref|ZP_07081761.1| NHL repeat containing protein [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300761276|gb|EFK58101.1| NHL repeat containing protein [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 658

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 60/133 (45%), Gaps = 15/133 (11%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS---NHVIRKIT-----NLGVT 136
           L G G  GY DG    A+ +      VD KG++Y  D +    + IRKIT     N  V+
Sbjct: 381 LIGSGGAGYVDGSLADAKTNGAWGVVVDNKGDLYFTDNNATGGNRIRKITFDASGNGTVS 440

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC--- 193
           TIAG  S   G ADG    ASF+    L    +   L +S+     IR+I+L        
Sbjct: 441 TIAGQAS--TGNADGIGAAASFNRPSGLAIDKNNNILYVSEWIGNRIRRIDLATNRVELL 498

Query: 194 --SKSSQSGSALG 204
             S ++QSG   G
Sbjct: 499 AGSATAQSGDVEG 511



 Score = 41.2 bits (95), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 50  VIDRPGSSDLIVLDSSRSA---FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           V+D  G  DL   D++ +       ++F  S    V  +AG    G +DG   +A F++P
Sbjct: 406 VVDNKG--DLYFTDNNATGGNRIRKITFDASGNGTVSTIAGQASTGNADGIGAAASFNRP 463

Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNASFS--- 159
              A+D   NI YV++   + IR+I   TN  V  +AG  + + G  +G    A F+   
Sbjct: 464 SGLAIDKNNNILYVSEWIGNRIRRIDLATNR-VELLAGSATAQSGDVEGIGGVARFNQLG 522

Query: 160 NDFELTFVPHICALLISDHGNQL 182
           N  EL+  P    L +S + ++ 
Sbjct: 523 NALELS--PDGKKLFVSGNSSRF 543


>gi|170755613|ref|YP_001782372.1| NHL repeat-containing protein [Clostridium botulinum B1 str. Okra]
 gi|429244666|ref|ZP_19208103.1| NHL repeat-containing protein [Clostridium botulinum CFSAN001628]
 gi|169120825|gb|ACA44661.1| NHL repeat protein [Clostridium botulinum B1 str. Okra]
 gi|428758305|gb|EKX80740.1| NHL repeat-containing protein [Clostridium botulinum CFSAN001628]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 44/171 (25%), Positives = 74/171 (43%), Gaps = 22/171 (12%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEESV 81
             ++LE+       +   ++ IN   +I     G + +++LD +     T+         
Sbjct: 163 ENIILENKCEKEEYMYPSKISINEDGLIAFSEKGKNRIVILDQNLEKIKTI--------- 213

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIA 139
                G    G  DG+   A+F  P+     ++  IYVAD  NH IR+  +    V TIA
Sbjct: 214 -----GSSKYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHFIRECDLEKEIVKTIA 267

Query: 140 GGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G G K+    A G A N S ++ ++L  +     + I+  GN  I + N+K
Sbjct: 268 GNGQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMK 316


>gi|153938821|ref|YP_001392044.1| NHL repeat-containing protein [Clostridium botulinum F str.
           Langeland]
 gi|384463040|ref|YP_005675635.1| NHL repeat protein [Clostridium botulinum F str. 230613]
 gi|152934717|gb|ABS40215.1| NHL repeat protein [Clostridium botulinum F str. Langeland]
 gi|295320057|gb|ADG00435.1| NHL repeat protein [Clostridium botulinum F str. 230613]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 44/171 (25%), Positives = 74/171 (43%), Gaps = 22/171 (12%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVI--DRPGSSDLIVLDSSRSAFYTLSFPLSEESV 81
             ++LE+       +   ++ IN   +I     G + +++LD +     T+         
Sbjct: 163 ENIILENKCEKEEYMYPSKISINEDGLIAFSEKGKNRIVILDQNLEKIKTI--------- 213

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIA 139
                G    G  DG+   A+F  P+     ++  IYVAD  NH IR+  +    V TIA
Sbjct: 214 -----GSSEYGLKDGDFKEAQFKAPEGLRF-IENKIYVADTENHFIRECDLEKEIVKTIA 267

Query: 140 GGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G G K+    A G A N S ++ ++L  +     + I+  GN  I + N+K
Sbjct: 268 GNGQKEYSPMAKGDALNVSLNSPWDLEILKD--CIYIAMAGNHQIWKYNMK 316


>gi|238062013|ref|ZP_04606722.1| LOW QUALITY PROTEIN: alkyl hydroperoxide reductase/thiol specific
           antioxidant/Mal allergen [Micromonospora sp. ATCC 39149]
 gi|237883824|gb|EEP72652.1| LOW QUALITY PROTEIN: alkyl hydroperoxide reductase/thiol specific
           antioxidant/Mal allergen [Micromonospora sp. ATCC 39149]
          Length = 582

 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 38/139 (27%), Positives = 71/139 (51%), Gaps = 21/139 (15%)

Query: 46  NPHSVIDRPGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
            P + +  PG +      +L+V DS+R +   L+     E++V+R+ G GV+G +DG  G
Sbjct: 138 EPETALRFPGKAIALDGGNLLVSDSARHSLVELAP--DGETLVRRI-GSGVRGRADGPAG 194

Query: 100 SARFDKPKSFA------VDMKG-NIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRAD 150
           +A F +P+          ++ G ++ VAD  NH++R  ++    V T+AG G +     D
Sbjct: 195 AAAFAEPQGLCRLPEHVAEVAGYDVVVADTVNHLLRGVRLATGEVVTVAGTGRQWRSTVD 254

Query: 151 GPAQNA---SFSNDFELTF 166
             A ++     S+ ++L +
Sbjct: 255 DHAHDSLSVDLSSPWDLAW 273



 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 7/98 (7%)

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNA 156
           EP +A     K+ A+D  GN+ V+D + H + ++   G T +   GS   GRADGPA  A
Sbjct: 138 EPETALRFPGKAIALD-GGNLLVSDSARHSLVELAPDGETLVRRIGSGVRGRADGPAGAA 196

Query: 157 SFSNDFELTFVPHICA------LLISDHGNQLIRQINL 188
           +F+    L  +P   A      ++++D  N L+R + L
Sbjct: 197 AFAEPQGLCRLPEHVAEVAGYDVVVADTVNHLLRGVRL 234



 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 46/103 (44%), Gaps = 2/103 (1%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           AG  V+   DG    A   +P   AV   G  ++VAD     +R + N  + T  G G  
Sbjct: 302 AGTTVEALRDGPLAEAWLAQPSGLAVSADGARLWVADSETSAVRYVENGVLGTAVGQGLF 361

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             G  DGPA  A   +   +  +P   ++LI+D  N  +R+ +
Sbjct: 362 DFGHVDGPADRALLQHPLGVCALPD-GSVLIADTYNGAVRRFD 403


>gi|290996648|ref|XP_002680894.1| predicted protein [Naegleria gruberi]
 gi|284094516|gb|EFC48150.1| predicted protein [Naegleria gruberi]
          Length = 966

 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 41/72 (56%), Gaps = 5/72 (6%)

Query: 78  EESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           E  +    AG+G  G+S G+ G   +A+ + P+  AV   G +Y+AD  N  +RK++  G
Sbjct: 429 EIYIAVTFAGNGTSGFS-GDGGLAINAQLNSPRCVAVSGSGEVYIADSMNSRVRKVSTNG 487

Query: 135 -VTTIAGGGSKK 145
            +TTIAG G K 
Sbjct: 488 IITTIAGNGDKN 499



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 57/125 (45%), Gaps = 6/125 (4%)

Query: 68  AFYTLSFPLSEESVVKRLAG-DGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSN 124
           +F  L   +    ++  +AG  G  GY+       S+   + +SFA+   G+  +AD   
Sbjct: 203 SFNCLLRKIGANGIISTIAGVYGFCGYNSDNVIATSSYLSRFRSFAIASNGDFIIADYET 262

Query: 125 HVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
           H IR+I  +N  + TIAG G+       G A NA  S  F +T       +  S  G  +
Sbjct: 263 HRIRRIFKSNSTIITIAGSGTAGYSGDGGLAINALISYPFSITTTSKG-DIFFSSDGECV 321

Query: 183 IRQIN 187
           +R+I+
Sbjct: 322 VRKID 326


>gi|145487484|ref|XP_001429747.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124396841|emb|CAK62349.1| unnamed protein product [Paramecium tetraurelia]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 41/73 (56%), Gaps = 2/73 (2%)

Query: 78  EESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
           ++ + K +AG  G  G+ DG  G   F+ P    +D+ GNI+V D +NH IR IT  G V
Sbjct: 190 QKKITKTIAGICGQSGFKDGLLGMNLFNTPDQMGIDVLGNIFVNDFNNHFIRMITLDGYV 249

Query: 136 TTIAGGGSKKEGR 148
            T+  G  +++ R
Sbjct: 250 NTLISGSCRQDVR 262


>gi|428206267|ref|YP_007090620.1| NHL repeat containing protein [Chroococcidiopsis thermalis PCC
           7203]
 gi|428008188|gb|AFY86751.1| NHL repeat containing protein [Chroococcidiopsis thermalis PCC
           7203]
          Length = 502

 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 57/109 (52%), Gaps = 9/109 (8%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNL---GVTTIAGGG 142
           G G  G +DG+   A+F  P+   +D +  + Y+AD  NH IR+I NL    V TIAG G
Sbjct: 212 GTGKPGLTDGDFTQAQFFAPQGMVLDDENQLLYLADTENHAIRRI-NLPLQRVETIAGTG 270

Query: 143 SKKEG--RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            +        G  Q  + ++ ++LT + +   L I+  GN  I Q++L+
Sbjct: 271 EQSRNIRPYGGIGQKIALNSPWDLTMLKNY--LFIAMAGNHQIWQMDLE 317


>gi|290976432|ref|XP_002670944.1| predicted protein [Naegleria gruberi]
 gi|284084508|gb|EFC38200.1| predicted protein [Naegleria gruberi]
          Length = 1514

 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 68/137 (49%), Gaps = 10/137 (7%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMK 114
           +S+L + D++    Y +   LS  +++  +AG+G  G+ D    + A+       AVD  
Sbjct: 222 NSELYIADANN---YRIRKILSNGTIIT-IAGNGGYGFIDNVLATNAKLAFVNGIAVDTN 277

Query: 115 GNIYVADK----SNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
           G +Y+ +     SNH IRKI TN  + T +G GS+     +  A +A F   + ++    
Sbjct: 278 GEVYITESENGFSNHRIRKILTNGTIITFSGTGSRGYYGDNVEAADAKFYTPYFVSVRQT 337

Query: 170 ICALLISDHGNQLIRQI 186
              +LISD GN  +R +
Sbjct: 338 TGEVLISDTGNNFLRLV 354



 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 51/114 (44%), Gaps = 3/114 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S+  ++  +AG G  G++         +   P + A D++  + V+D  NHVIRKI   
Sbjct: 69  ISKNGMIDSIAGTGKFGFNGDNLFATDTQLSNPFAVAFDLETQLIVSDSGNHVIRKILRN 128

Query: 134 GVTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G   I  G  +     DG  A+ A  +N   +        L I+D  N  IR+I
Sbjct: 129 GTMVIIAGTGQSGYNGDGIDAKIAKLNNPSGIVVDSKTGELFIADTANNRIRKI 182



 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 55/100 (55%), Gaps = 7/100 (7%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DG-EPGSARF 103
           NP  ++    + +L + D++ +    +   L+  +++  +AG G  GY+ DG +  SA+ 
Sbjct: 156 NPSGIVVDSKTGELFIADTANNRIRKI---LTNGTIIT-IAGTGETGYNGDGIDATSAQI 211

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGG 142
                 A+D+   +Y+AD +N+ IRKI +N  + TIAG G
Sbjct: 212 RGVYGIALDVNSELYIADANNYRIRKILSNGTIITIAGNG 251



 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 5/106 (4%)

Query: 85  LAGDGVQGYS-DG-EPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKI-TNLGVTTIAG 140
           +AG G  GY+ DG +   A+ + P    VD K G +++AD +N+ IRKI TN  + TIAG
Sbjct: 134 IAGTGQSGYNGDGIDAKIAKLNNPSGIVVDSKTGELFIADTANNRIRKILTNGTIITIAG 193

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            G          A +A     + +    +   L I+D  N  IR+I
Sbjct: 194 TGETGYNGDGIDATSAQIRGVYGIALDVNS-ELYIADANNYRIRKI 238



 Score = 40.4 bits (93), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 41/82 (50%), Gaps = 2/82 (2%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
           PK       G+IYVA+   H I +I+  G + +IAG G       +  A +   SN F +
Sbjct: 45  PKGTYFSANGDIYVAETQGHRISRISKNGMIDSIAGTGKFGFNGDNLFATDTQLSNPFAV 104

Query: 165 TFVPHICALLISDHGNQLIRQI 186
            F      L++SD GN +IR+I
Sbjct: 105 AFDLET-QLIVSDSGNHVIRKI 125


>gi|290987760|ref|XP_002676590.1| predicted protein [Naegleria gruberi]
 gi|284090193|gb|EFC43846.1| predicted protein [Naegleria gruberi]
          Length = 836

 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 31/73 (42%), Positives = 43/73 (58%), Gaps = 6/73 (8%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMK-GNIYVADKSNHVIRKI-T 131
           +   +V+ +AG G QG+S G+ G   +A+ +KP S   D   GNIY+AD  NH IR I  
Sbjct: 501 TSNEMVEVIAGIGFQGFS-GDGGLAINAQLNKPHSVHFDKNSGNIYIADSGNHRIRMIFP 559

Query: 132 NLGVTTIAGGGSK 144
           N  +TTI G G+ 
Sbjct: 560 NGTITTIVGSGTN 572



 Score = 40.4 bits (93), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 41/93 (44%), Gaps = 2/93 (2%)

Query: 96  GEPGSARFDKPKS-FAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRADGPA 153
           G P S     P   F +D    IY+AD  NH I K T N  V  IAG G +      G A
Sbjct: 465 GPPSSIILKNPTGIFCLDSFSTIYIADTDNHRILKYTSNEMVEVIAGIGFQGFSGDGGLA 524

Query: 154 QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            NA  +    + F  +   + I+D GN  IR I
Sbjct: 525 INAQLNKPHSVHFDKNSGNIYIADSGNHRIRMI 557


>gi|290970020|ref|XP_002668012.1| predicted protein [Naegleria gruberi]
 gi|284081028|gb|EFC35268.1| predicted protein [Naegleria gruberi]
          Length = 243

 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 52/112 (46%), Gaps = 4/112 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
           E   +  +AG+G  GYS  +G   +A+  +P S  V     +Y+AD  NH IRKI   G 
Sbjct: 115 ENGRIVTIAGNGQPGYSGDNGPATTAQLHRPYSVFVSANNEVYIADTFNHSIRKIDESGN 174

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + TIAG      G   G A NA   N     F+     + I++  N   R+I
Sbjct: 175 IETIAGNEQPGYGGDGGYATNAQL-NHPSGVFISTNYEIYITETNNHTTRKI 225



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 50/104 (48%), Gaps = 3/104 (2%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           +AG+G  G    +G   S++   P+S  V     I +AD  NH IRKI N  + TIAG G
Sbjct: 67  IAGNGQTGCGGDNGSATSSQLYYPQSVFVSTNNEICIADTFNHRIRKIENGRIVTIAGNG 126

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
                  +GPA  A     + + FV     + I+D  N  IR+I
Sbjct: 127 QPGYSGDNGPATTAQLHRPYSV-FVSANNEVYIADTFNHSIRKI 169



 Score = 44.7 bits (104), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 4/105 (3%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG+G QGY   +G   SA+ + P    V     +++ADK+N+ IRKI   G +  IAG 
Sbjct: 11  IAGNGKQGYGGDNGLATSAQLNYPSGVFVSTNNEVFIADKNNNRIRKIVKNGNIVLIAGN 70

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G    G  +G A ++      +  FV     + I+D  N  IR+I
Sbjct: 71  GQTGCGGDNGSATSSQLYYP-QSVFVSTNNEICIADTFNHRIRKI 114


>gi|290996951|ref|XP_002681045.1| predicted protein [Naegleria gruberi]
 gi|284094668|gb|EFC48301.1| predicted protein [Naegleria gruberi]
          Length = 263

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 7/90 (7%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAV--DMKGN---IYVADKSNHVIRKI 130
           +   V+ + G+G  G++  D +   A  ++P+S  V  + KG    +Y+AD SNH IRK+
Sbjct: 156 QNGTVETICGNGDNGFNGDDIKAVDASLNEPQSIYVTRNTKGTDHILYIADTSNHRIRKV 215

Query: 131 TNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
            N  ++TI G G  K      PA+ AS +N
Sbjct: 216 ENGIISTICGNGESKYNGDLIPAKEASLNN 245



 Score = 37.0 bits (84), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 38/77 (49%), Gaps = 10/77 (12%)

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP-------H 169
           IY+ D +NH IRKI N  V TI G G       D  A +AS  N+ +  +V        H
Sbjct: 142 IYICDSNNHKIRKIQNGTVETICGNGDNGFNGDDIKAVDASL-NEPQSIYVTRNTKGTDH 200

Query: 170 ICALLISDHGNQLIRQI 186
           I  L I+D  N  IR++
Sbjct: 201 I--LYIADTSNHRIRKV 215


>gi|269837841|ref|YP_003320069.1| alkyl hydroperoxide reductase [Sphaerobacter thermophilus DSM
           20745]
 gi|269787104|gb|ACZ39247.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Sphaerobacter thermophilus DSM 20745]
          Length = 494

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 42/128 (32%), Positives = 56/128 (43%), Gaps = 7/128 (5%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGN 116
           L V DS  SA  T++FP  +E  V+ L G G+  + D  G     R   P + A    G 
Sbjct: 356 LYVADSETSAVRTVTFPPGDE--VRTLVGTGLFDFGDVDGIGDEVRLQHPLAVACG-DGV 412

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
           +YVAD  NH I+++           GS + G ADG    A F     L+       L I+
Sbjct: 413 LYVADSYNHKIKRLDPATRRCETWLGSGEPGDADGTGTAARFHEPGGLSLAG--SRLYIA 470

Query: 177 DHGNQLIR 184
           D  N  IR
Sbjct: 471 DTNNHAIR 478



 Score = 42.4 bits (98), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 64/158 (40%), Gaps = 12/158 (7%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+    +  LIV DS            S+  V+    G G  G  DG    A F  P
Sbjct: 180 PGKVLADAATDRLIVADSGHHRLIVARLDGSDAWVI----GSGEAGLRDGPGAIAAFHDP 235

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG--SKKEGRADGPAQNASFSNDF 162
           +  A+     +YVAD  NHV+R++      V TIAG G       RA G A+     + +
Sbjct: 236 QGLAL-YGDTLYVADTRNHVVRRVDLAARTVETIAGTGHIGWSYARA-GAARAIDLRSPW 293

Query: 163 ELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
           +L    H   L I+  G   I  ++L   +    + SG
Sbjct: 294 DLVL--HDGVLYIAMAGMHQIWALDLDEAELRPYAGSG 329


>gi|423216520|ref|ZP_17203043.1| hypothetical protein HMPREF1074_04575 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392690730|gb|EIY83986.1| hypothetical protein HMPREF1074_04575 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 423

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 70/162 (43%), Gaps = 31/162 (19%)

Query: 41  HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG---E 97
           H +E +P          +LIV  + R  F    F ++ E  +  +AG   +G SDG    
Sbjct: 275 HAMEFDPEG--------NLIVSSNDRGQF----FMVTPEKEIICIAGS-EKGDSDGTSGN 321

Query: 98  PGSARFDKPKSFAVDMKGNIYVAD------KSNHVIRKIT-------NLGVTTIAGGGSK 144
           P SA+F +   FA+D +G IY  D       S   I++IT       +  V T+   GS 
Sbjct: 322 PRSAKFYQLYGFAIDSEGTIYTVDGNDGNVGSGQKIKRITRGKKGYEDGTVVTLV--GSA 379

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
                DG    A F N +++       AL +SD  N  IR+I
Sbjct: 380 GGAIVDGSVDEAVFGNPYDIILDEANRALYVSDRSNNAIRKI 421


>gi|290969880|ref|XP_002667977.1| predicted protein [Naegleria gruberi]
 gi|284080923|gb|EFC35233.1| predicted protein [Naegleria gruberi]
          Length = 366

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 11/109 (10%)

Query: 89  GVQGYSDGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG--GSKK 145
           G  G ++G  P +ARF+ P   A+   G ++VADK+NH+IR+I   G   +  G  G++ 
Sbjct: 125 GTSGSTNGNTPTAARFNNPYGMALAADGTMFVADKANHIIRRINPDGSVELYAGTVGTQC 184

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALL-------ISDHGNQLIRQIN 187
                 PA     S+   +TF      ++       + D+GN  IR+I+
Sbjct: 185 PSATASPAC-GDGSDRLSVTFTNPNAVIVDSNNNVYVFDYGNFRIRKID 232


>gi|290994368|ref|XP_002679804.1| predicted protein [Naegleria gruberi]
 gi|284093422|gb|EFC47060.1| predicted protein [Naegleria gruberi]
          Length = 1359

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 46/161 (28%), Positives = 71/161 (44%), Gaps = 23/161 (14%)

Query: 38  IDGHQLEINPHSVIDRPGS-----SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
           I GH   I  +S  + P       S + V DS+      ++       +V  +AG G  G
Sbjct: 530 IKGHIDGIALNSTFNHPSGIHAEYSAIYVADSNNHVIRKITL---STGIVSTVAGSGEPG 586

Query: 93  YS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
           Y+ DG+ P   +F+ P    +  +  + VAD +NH IR IT++ V T+AG G       +
Sbjct: 587 YNGDGKLPLETQFNNPMG-VISAQMGLIVADTNNHRIRVITDVTVKTLAGTGIDGSSGDE 645

Query: 151 GPAQNASFSNDFELTFVPHICA-----LLISDHGNQLIRQI 186
           GPA  +         F PH  +     + ++D  N  IR I
Sbjct: 646 GPAIRSQL-------FYPHSVSYGYSDIYVADTFNNKIRAI 679



 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 55/110 (50%), Gaps = 4/110 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSAR--FDKPKSFAV-DMKGNIYVADKSNHVIRKITNLG-VT 136
           +V  +AG+G  G+   +  S +     P   A   + G+I  +D++NH IR +   G ++
Sbjct: 795 IVHTIAGNGGYGFYADDINSLQTSLAYPSGIAYSKITGDIIFSDRNNHRIRIVYKNGTIS 854

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           T+AG G K       PA+NA  ++   L  +P    ++ SD  N  IR+I
Sbjct: 855 TLAGTGIKGFSGDGLPAKNAQLNSPTFLKLIPPYEDIIFSDSNNGRIRRI 904


>gi|357417484|ref|YP_004930504.1| NHL repeat containing protein [Pseudoxanthomonas spadix BD-a59]
 gi|355335062|gb|AER56463.1| NHL repeat containing protein [Pseudoxanthomonas spadix BD-a59]
          Length = 474

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 45/105 (42%), Gaps = 3/105 (2%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGS 143
           AG G  G  DG    A F +P   A  ++  +YVAD     IR +   G    T+ G G 
Sbjct: 318 AGSGQLGQRDGAGILASFAQPSGLA-QVQQALYVADTLGSSIRGVQLRGDLTQTLVGQGM 376

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            + G ADG  Q A       L   P    L I+D GN  IR + L
Sbjct: 377 WEFGDADGQRQQARLQAPEALALDPDAPLLWIADTGNGCIRSLRL 421



 Score = 41.2 bits (95), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 4/76 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVTTIA 139
           V R  G+G     DGEP  A F +P   ++ ++  +YVAD  NH++R+I NL  G     
Sbjct: 200 VLRRFGNGTADALDGEPEIASFCRPNGLSL-LREELYVADTGNHLLRRI-NLRSGAVQTV 257

Query: 140 GGGSKKEGRADGPAQN 155
            G       A+GP Q+
Sbjct: 258 LGNRLTRTPAEGPIQS 273


>gi|257069643|ref|YP_003155898.1| NHL repeat protein [Brachybacterium faecium DSM 4810]
 gi|256560461|gb|ACU86308.1| NHL repeat protein [Brachybacterium faecium DSM 4810]
          Length = 697

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 3/120 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
           LAG   +G  DG   ++ + +P        G I VAD     +R +   ++ VTT+ G G
Sbjct: 352 LAGTTQEGLVDGPAVTSWWAQPSGLDELPDGRIVVADSETSAVRVLDPRSMQVTTLVGEG 411

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
               G  DGPA  A   +   +T +P    + +SD  N  IR +   P +  +++  G A
Sbjct: 412 LFDFGHVDGPASTARLQHPLAVTALPD-GRIAVSDTYNGAIRLVEEAPAEGERATTGGIA 470


>gi|196009344|ref|XP_002114537.1| hypothetical protein TRIADDRAFT_11855 [Trichoplax adhaerens]
 gi|190582599|gb|EDV22671.1| hypothetical protein TRIADDRAFT_11855 [Trichoplax adhaerens]
          Length = 507

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 55/111 (49%), Gaps = 10/111 (9%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEG 147
           +G+ DG    ARF+ P+       G  I+VAD +NH IRKI      VTTIAG G++   
Sbjct: 214 EGFVDGVFQKARFNNPQGITCSRNGKTIFVADTNNHAIRKIDLEYCEVTTIAGTGNQGND 273

Query: 148 RADGPA-QNASFSNDFELTFVPHI----CALLISDHGNQLIRQINLKPEDC 193
              G A +  + ++ ++LT  P        L I+  G+  I  + L  EDC
Sbjct: 274 LVGGGAGKMQALNSPWDLTLGPSCEDQEVVLYIAMAGSHQIWALAL--EDC 322


>gi|427430403|ref|ZP_18920257.1| hypothetical protein C882_1420 [Caenispirillum salinarum AK4]
 gi|425878863|gb|EKV27574.1| hypothetical protein C882_1420 [Caenispirillum salinarum AK4]
          Length = 498

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 58/118 (49%), Gaps = 9/118 (7%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLG 134
           S+  V+ R  G G+ G  DG+   A F  P+  A   +  I+VAD  NH++R+I   +  
Sbjct: 204 SDGRVLARY-GTGLPGTDDGQAERAAFTSPQGLAC-TEDAIFVADTGNHLVRRIDLCSGA 261

Query: 135 VTTIAG---GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           VTT+AG    G     R   PA   + ++ +++   P    L I++ G   I  +N++
Sbjct: 262 VTTVAGLRFRGLPVSMRQ--PAAETALASPWDIEITPDGGTLFIANAGTHQILAMNMR 317



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 58/141 (41%), Gaps = 12/141 (8%)

Query: 54  PGSSDLIVLDSSRSAFYTL---SFPLSEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKS 108
           P +S L   DS  SA   +    + L + + V+ L G G+   G+ DG   +AR   P  
Sbjct: 352 PDASALYFADSETSAVRKVILDGWRLGQTARVETLVGRGLFDFGHDDGPMTAARMQHPLG 411

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR--ADGPAQNASFSNDFELTF 166
            AV   G + VAD  NH IR I     T      + K GR    G A          L  
Sbjct: 412 VAVMPDGRVAVADSYNHAIRLIDEAAGTV----ETLKTGRLHCTGAACRRPLWEPAGLWP 467

Query: 167 VPHICALLISDHGNQLIRQIN 187
            P    LL+SD  N  I +++
Sbjct: 468 TPK-GRLLVSDTNNHRIVEVD 487


>gi|296089788|emb|CBI39607.3| unnamed protein product [Vitis vinifera]
          Length = 1002

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 4/108 (3%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAG 140
           ++   G +G  DG    A F++P+  A + K N+ YVAD  NH +R+I   N  V T+AG
Sbjct: 551 QIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAG 610

Query: 141 GGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            G+K  + +  G       ++ +++ F P    + I+  G   I + N
Sbjct: 611 NGTKGSDYQGGGKGATQLLNSPWDVCFEPINEIVYIAMAGQHQIWEHN 658



 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 5/104 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
           G  DG         P   +    G IYVAD  NH I+K+      V+T+AG G  K G  
Sbjct: 739 GDHDGVGSEVLLQHPLGVSCGKDGQIYVADSYNHKIKKLDPATGRVSTLAGTG--KAGFK 796

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           DG A  A  S    +  V +   L I+D  N +IR ++LK ++ 
Sbjct: 797 DGRALAAQLSEPSGIVEVEN-GVLFIADTNNSVIRYLDLKKKEA 839


>gi|254447301|ref|ZP_05060768.1| NHL repeat containing protein [gamma proteobacterium HTCC5015]
 gi|198263440|gb|EDY87718.1| NHL repeat containing protein [gamma proteobacterium HTCC5015]
          Length = 2515

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 40/72 (55%), Gaps = 3/72 (4%)

Query: 76   LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
            ++ E  +++ AG   +GYS   G    A  + P+  AV   G +++AD  NH IRK+   
Sbjct: 1213 ITPEGEIEKYAGKETRGYSGDGGAATEAELNLPRGMAVSSGGTLFIADWGNHRIRKVAPN 1272

Query: 134  G-VTTIAGGGSK 144
            G +TT+AG G +
Sbjct: 1273 GIITTVAGNGEE 1284


>gi|413923862|gb|AFW63794.1| hypothetical protein ZEAMMB73_706161, partial [Zea mays]
          Length = 139

 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 1/50 (2%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSE 78
           E GYTV TV DG +L I P+SV +   S +L+V+DS  S  Y ++ PLS 
Sbjct: 75  EGGYTVETVFDGSKLGIEPYSV-EVTQSGELLVMDSMNSNIYRMALPLSR 123


>gi|299145011|ref|ZP_07038079.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
 gi|298515502|gb|EFI39383.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
          Length = 459

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 53/95 (55%), Gaps = 2/95 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G+++G+   A+ ++P    +D +GNIYVA +  H I KIT  G+ T    G+   G  DG
Sbjct: 364 GFTEGQGNEAKLEEPCQMDLDEEGNIYVAVRKKHRIAKITPDGMLTNY-TGTGTSGTTDG 422

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           P   A F++   + F P   AL ISD+ N  IR+I
Sbjct: 423 PLDKAQFNHPEGVQFGPD-GALYISDYWNHKIRKI 456


>gi|428308867|ref|YP_007119844.1| thiol-disulfide isomerase-like thioredoxin [Microcoleus sp. PCC
           7113]
 gi|428250479|gb|AFZ16438.1| thiol-disulfide isomerase-like thioredoxin [Microcoleus sp. PCC
           7113]
          Length = 515

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 57/113 (50%), Gaps = 7/113 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTI 138
           V+++ G G  G +DG    A+F  P+   +D +   +YVAD  NH +R++   N  V TI
Sbjct: 213 VQQVIGTGKPGLTDGSFEEAQFFAPQGMTLDSENQLLYVADTENHCLRQVDLKNQQVKTI 272

Query: 139 AGGGSKKEG--RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           AG G +        G A     ++ +++  V H   LLI+  G+  I ++ L+
Sbjct: 273 AGTGEQSHHIRPHSGKALETRLNSPWDVEKVGH--CLLIAMAGSHQIWEMQLE 323



 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 3/58 (5%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTI 138
           + + G G  G+ DG+   ARF +P   +  +  ++YVAD +NH IR I    L VTT+
Sbjct: 440 QTMIGQGTAGHQDGQSIKARFSEPSGLSA-IAAHLYVADTNNHAIRCIALDTLEVTTL 496



 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 31/129 (24%), Positives = 54/129 (41%), Gaps = 7/129 (5%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK---ITNLG 134
           E   ++  AG G +   DG P  A F +P     D +  +Y+AD     IR    + +L 
Sbjct: 323 ETGWLRTYAGTGAEACLDGTPDQAAFAQPSGLTTDGR-ELYIADSEVSSIRAVGLVDHLP 381

Query: 135 VTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           V T+ G G     G  DG   +    +   + +  +   L ++D  N  I++++ +   C
Sbjct: 382 VRTVCGSGELFGFGDVDGEDADVRLQHALGVEYTQNY--LWVADTYNHKIKRVDPRTGTC 439

Query: 194 SKSSQSGSA 202
                 G+A
Sbjct: 440 QTMIGQGTA 448


>gi|320103300|ref|YP_004178891.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
 gi|319750582|gb|ADV62342.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
          Length = 698

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 59/125 (47%), Gaps = 8/125 (6%)

Query: 83  KRL--AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTI 138
           KRL   G G  G++DG   +A F++ +   + +   +YVAD  NH IR   +T   V T+
Sbjct: 270 KRLDVIGSGATGFADGSFETASFNRQQGIRL-VGETLYVADTENHAIRAANLTTRTVATV 328

Query: 139 AGGGSKKEGRA---DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
           AG G +   R     GPA+    ++ ++L  +P    L+I+  G   I   +    +   
Sbjct: 329 AGTGKQTYHRGWGVRGPAKTTGLNSPWDLVKIPDEDRLIIAMAGPHQIWDYDFSKGEIGL 388

Query: 196 SSQSG 200
            S +G
Sbjct: 389 WSGTG 393


>gi|326672463|ref|XP_003199672.1| PREDICTED: NHL repeat-containing protein 2-like [Danio rerio]
          Length = 719

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 8/79 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAG---GGSKK 145
           G  DG    A+F  P+   +  KG+ +YVAD  NH++RKI  T   V+T+AG    G+ K
Sbjct: 262 GKRDGNLSEAQFSSPQGLFI--KGDTVYVADTENHLVRKINLTEGKVSTLAGIGVQGTDK 319

Query: 146 EGRADGPAQNASFSNDFEL 164
           EG A GP Q  S   D  L
Sbjct: 320 EGGALGPQQPISSPWDVTL 338


>gi|326672461|ref|XP_003199671.1| PREDICTED: NHL repeat-containing protein 2-like [Danio rerio]
          Length = 719

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 8/79 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAG---GGSKK 145
           G  DG    A+F  P+   +  KG+ +YVAD  NH++RKI  T   V+T+AG    G+ K
Sbjct: 262 GKRDGNLSEAQFSSPQGLFI--KGDTVYVADTENHLVRKINLTEGKVSTLAGIGVQGTDK 319

Query: 146 EGRADGPAQNASFSNDFEL 164
           EG A GP Q  S   D  L
Sbjct: 320 EGGALGPQQPISSPWDVTL 338


>gi|220914451|ref|YP_002489760.1| NHL repeat containing protein [Arthrobacter chlorophenolicus A6]
 gi|219861329|gb|ACL41671.1| NHL repeat containing protein [Arthrobacter chlorophenolicus A6]
          Length = 674

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 52/111 (46%), Gaps = 6/111 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGVT 136
           V  +AG+G++G  DG    + F +    A D  GNI+VAD     +R +      +L V 
Sbjct: 378 VAIVAGNGLEGLLDGPAHESWFAQSSGLAEDADGNIWVADSETSALRTLVIDDTGSLTVK 437

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           T  G G    G  DG A  A   +   +T +P   ++ I+D  N  +R+ +
Sbjct: 438 TAVGKGLFDFGFRDGTAAEARLQHPLGVTVLPD-GSVAIADTYNGAVRRFD 487


>gi|290986797|ref|XP_002676110.1| predicted protein [Naegleria gruberi]
 gi|284089710|gb|EFC43366.1| predicted protein [Naegleria gruberi]
          Length = 1218

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 42/68 (61%), Gaps = 4/68 (5%)

Query: 85  LAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           + G GV+GY+  +     A+ +KP S +VD  G+IY++D S   +RKI+N  ++TIAG G
Sbjct: 675 ICGTGVEGYNGDDILATDAQLNKPTSVSVDQNGDIYISDNSR--LRKISNGVISTIAGNG 732

Query: 143 SKKEGRAD 150
           +  +   D
Sbjct: 733 NTTDIEND 740



 Score = 43.9 bits (102), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 73/162 (45%), Gaps = 21/162 (12%)

Query: 39  DGHQLEINPHSVIDR-------PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GV 90
           DG+  +   +SVI R       P +S++I+ DS  S        +S   ++  +AG  GV
Sbjct: 475 DGYNAK---YSVISRATGIDFLPSTSEMIIADSLNSCIR----KISTLGIITTIAGKPGV 527

Query: 91  QGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKE 146
            GYS GE   A       P    V   G IY +  + H ++KI+ N  +TTI G G    
Sbjct: 528 AGYS-GENVRAIDSMIGNPVRVLVSKMGEIYFSTNT-HRVQKISLNGNITTICGTGVDDY 585

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
               G A  A+ SN   + F      L ++ +GN  IR +NL
Sbjct: 586 SGDGGNAIEAAISNPNGMAFSSDEELLYVACNGNNAIRVVNL 627



 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 47/99 (47%), Gaps = 1/99 (1%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           G  G++DG+   ++  +P    + +  ++  AD++NH IRKI+N  V TI G G    G 
Sbjct: 137 GSSGFNDGQAKLSKVSRPFGIGL-INSDVIFADRNNHRIRKISNGAVVTICGNGINNFGG 195

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            + PA  A  +    L        +   D  N  IR+I+
Sbjct: 196 DNRPALTARINTPTGLFVNSTNGDIYFCDSLNHRIRKID 234



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 56/111 (50%), Gaps = 7/111 (6%)

Query: 80  SVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT 136
           +++  +AG+G  GYS G+ G   +A  + P        G +Y+ D +N+VIR + + G  
Sbjct: 238 AIISTIAGNGDVGYS-GDGGLATNAALNFPTGVVSTPDGTLYIID-NNNVIRMVNSSGYI 295

Query: 137 TIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +  GG        D GPA NA F+    ++ V     + ++D GN  IR+I
Sbjct: 296 STIGGSMDSGNYGDGGPAINAKFAYPTGIS-VSKNGDVYVTDTGNLKIRKI 345



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 9/102 (8%)

Query: 93  YSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRAD 150
           Y DG P  +A+F  P   +V   G++YV D  N  IRKI+   G+     G +   G   
Sbjct: 307 YGDGGPAINAKFAYPTGISVSKNGDVYVTDTGNLKIRKISKQTGIIETVIGNTNVYGGDG 366

Query: 151 GPAQNASFSNDFE------LTFVPHICALLISDHGNQLIRQI 186
            PA  A+     +      +T++ H C +   D  NQ++  I
Sbjct: 367 LPASKAALGYISQVVSIDGITYISHDCRIRKVDK-NQIVSTI 407


>gi|195164496|ref|XP_002023083.1| GL21149 [Drosophila persimilis]
 gi|194105168|gb|EDW27211.1| GL21149 [Drosophila persimilis]
          Length = 595

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 39/127 (30%), Positives = 58/127 (45%), Gaps = 19/127 (14%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NL 133
           +S + VV+   G    G+ DG+   +RF+ P+  A   +  + VAD  NH +RKI+  N 
Sbjct: 252 VSADGVVEHKIGGLQAGFVDGKLALSRFNSPQGVAFLNESTLIVADTENHALRKISLENE 311

Query: 134 GVTTIAGGGSKKEGRADG----------PAQNASF-SNDFELTF------VPHICALLIS 176
            V T+AG G +   R  G          P   A F + D +++F      VP    +LIS
Sbjct: 312 LVETLAGTGHQGNERIGGRVGPLQALSSPWDLAVFRTRDMDMSFHLDERNVPEKTIILIS 371

Query: 177 DHGNQLI 183
             G   I
Sbjct: 372 MAGTHQI 378


>gi|241584846|ref|XP_002403871.1| NHL repeat-containing protein [Ixodes scapularis]
 gi|215500274|gb|EEC09768.1| NHL repeat-containing protein [Ixodes scapularis]
          Length = 621

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 57/124 (45%), Gaps = 16/124 (12%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKK---E 146
           G+ DG    ARF+ P+         +YVAD  NH +R++      V T+AG G +    E
Sbjct: 280 GWRDGALNDARFNGPQGVVWRDPHFVYVADTGNHTVREVDLEQAQVRTVAGTGQRGSDLE 339

Query: 147 GRADGPAQNASFSNDF----ELTFVP-----HICALLISDHGNQLIRQINLKPEDCSKSS 197
           G   GP Q  S   D     ++ F+       + AL + D  +QL  +++L    C + +
Sbjct: 340 GGRQGPQQPLSSPWDLCLVEDILFIAMAGSHQLWALALRD--SQLFGKLHLGAGTCLRIA 397

Query: 198 QSGS 201
            SGS
Sbjct: 398 GSGS 401



 Score = 38.5 bits (88), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 59/144 (40%), Gaps = 24/144 (16%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-------GYSDGEPGSARFDKPKSFAV 111
           L + DS  SA  TLS        VK L G  +        G +DG     R   P     
Sbjct: 429 LCIADSESSAIRTLSL---RTGAVKNLVGGALNPTDLFCFGDADGSALDVRLQHPLGVCW 485

Query: 112 DM-KGNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
              K  +YVAD  NH IRK  +     TT+AG G    G A G     SFS++ +     
Sbjct: 486 SSDKQLLYVADSYNHKIRKVDVQKRLCTTLAGTGVA--GDATG-----SFSDEVQFDEPG 538

Query: 169 HICA----LLISDHGNQLIRQINL 188
            +C     L ++D  N  ++ ++L
Sbjct: 539 GLCVVGSRLYVADTNNHCVKLVHL 562


>gi|427706390|ref|YP_007048767.1| alkyl hydroperoxide reductase [Nostoc sp. PCC 7107]
 gi|427358895|gb|AFY41617.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Nostoc sp. PCC 7107]
          Length = 505

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 55/112 (49%), Gaps = 7/112 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTI 138
           V  + G G  G +DG   +A+F  P+    D +  I Y+AD  NH +R++      V TI
Sbjct: 206 VLHIIGTGKSGLTDGSFSTAQFSAPQGMVFDSENQIFYIADTENHALRRVDLKQQIVETI 265

Query: 139 AGGGSKKEG--RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           AG G +        G A   + ++ ++L  V +I  L I+  G+  I Q+NL
Sbjct: 266 AGTGEQSRNIHPHSGVALETALNSPWDLVKVGNI--LFIAMAGSHQIWQMNL 315



 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 18/58 (31%), Positives = 33/58 (56%), Gaps = 3/58 (5%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
           + + GDG  G  DG+  ++RF +P   +  +   +Y++D +NH IR++      VTT+
Sbjct: 433 QTILGDGFAGLQDGQGKNSRFFEPSGLSA-INSYLYISDTNNHAIRRVDLNTFEVTTL 489


>gi|390369862|ref|XP_798415.2| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
           purpuratus]
          Length = 322

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 11/102 (10%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V   P  + L V D+  +    ++     + VV+   G    G++DG    ARF  P
Sbjct: 173 PGKVTTNPEGTLLAVSDTGHNRVIIVAL----DGVVQHCIGGPETGFNDGLYQEARFHSP 228

Query: 107 K--SFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
           +   +A D+   IYVAD  NH IRKI      VTTIAG G +
Sbjct: 229 QGLCWAQDV---IYVADTENHAIRKIDLKEKRVTTIAGTGEQ 267


>gi|134103740|ref|YP_001109401.1| NHL repeat-containing protein [Saccharopolyspora erythraea NRRL
           2338]
 gi|291007734|ref|ZP_06565707.1| NHL repeat-containing protein [Saccharopolyspora erythraea NRRL
           2338]
 gi|133916363|emb|CAM06476.1| NHL repeat containing protein [Saccharopolyspora erythraea NRRL
           2338]
          Length = 619

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 35/135 (25%), Positives = 69/135 (51%), Gaps = 13/135 (9%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV------- 111
           L+V DS+  +   + F    E+ V+R+ G G +G +DG    A F +P   A+       
Sbjct: 192 LLVSDSAHHSL--VEFTADGETPVRRI-GSGERGRADGAADEASFAEPAGLALLPEDVAA 248

Query: 112 DMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVP 168
           ++  ++ VAD  NH++R  ++ +  V T+AG G + ++G   GPA     ++ +++ +  
Sbjct: 249 EVGYDVVVADTVNHLLRGVQLADGEVVTVAGTGEQWRDGSDAGPALETPLTSPWDVAWWE 308

Query: 169 HICALLISDHGNQLI 183
               ++I+  GN  +
Sbjct: 309 PAGGVVIAMAGNHTL 323



 Score = 44.3 bits (103), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 54/108 (50%), Gaps = 8/108 (7%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           GDG    ++ E  + RF   K   +D  G + V+D ++H + + T  G T +   GS + 
Sbjct: 164 GDGPYVPAETENTTLRF-PAKVLPLD-GGTLLVSDSAHHSLVEFTADGETPVRRIGSGER 221

Query: 147 GRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINL 188
           GRADG A  ASF+    L  +P   A      ++++D  N L+R + L
Sbjct: 222 GRADGAADEASFAEPAGLALLPEDVAAEVGYDVVVADTVNHLLRGVQL 269



 Score = 40.8 bits (94), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 52/117 (44%), Gaps = 6/117 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTT 137
           V+R AG  V+G  DG+   A F +    A D    +++AD     +R +   G    V T
Sbjct: 333 VRRFAGTTVEGLRDGDADEAFFAQTSGLA-DGGDRLWLADSETSALRWVEAGGDGFAVRT 391

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
             G G    G ADG A  A F +   +  +P   ++ + D  N  IR+ +   ++ S
Sbjct: 392 AVGAGLFDFGHADGAADQALFQHPLGVAVLPD-GSVAVCDTYNGAIRRYDPASDEVS 447


>gi|147818279|emb|CAN64724.1| hypothetical protein VITISV_026725 [Vitis vinifera]
          Length = 423

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 3/64 (4%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAG 140
           ++   G +G  DG    A F++P+  A + K N+ YVAD  NH +R+I   N  V T+AG
Sbjct: 54  QIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAG 113

Query: 141 GGSK 144
            G+K
Sbjct: 114 NGTK 117



 Score = 42.0 bits (97), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 5/104 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
           G  DG         P   +    G IYVAD  NH I+K+      V+T+AG G  K G  
Sbjct: 160 GDHDGVGSEVLLQHPLGVSCGKDGQIYVADSYNHKIKKLDPATGRVSTLAGTG--KAGFK 217

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           DG A  A  S    +  V +   L I+D  N +IR ++LK ++ 
Sbjct: 218 DGRALAAQLSEPSGIVEVEN-GVLFIADTNNSVIRYLDLKKKEA 260


>gi|183222153|ref|YP_001840149.1| hypothetical protein LEPBI_I2796 [Leptospira biflexa serovar Patoc
           strain 'Patoc 1 (Paris)']
 gi|189912214|ref|YP_001963769.1| pentraxin-like protein [Leptospira biflexa serovar Patoc strain
           'Patoc 1 (Ames)']
 gi|167776890|gb|ABZ95191.1| Pentraxin-related protein [Leptospira biflexa serovar Patoc strain
           'Patoc 1 (Ames)']
 gi|167780575|gb|ABZ98873.1| Hypothetical protein LEPBI_I2796 [Leptospira biflexa serovar Patoc
           strain 'Patoc 1 (Paris)']
          Length = 1237

 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 58/129 (44%), Gaps = 10/129 (7%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNL 133
           L+  + V  +AGDGVQ  +DG   SA+F+ P     D   N+++++   H IR   I+  
Sbjct: 476 LTGTNAVDTIAGDGVQASTDGFGTSAQFNDPHGITTDGT-NLFISEWLGHRIRHLDISTN 534

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
            VTT+ G      G AD    N  F   F          + ISD  N  IR   L+P + 
Sbjct: 535 KVTTLVG---SSPGYADNATGNGLF--QFPGFITSDGQKVYISDQANHSIRF--LEPAEI 587

Query: 194 SKSSQSGSA 202
              S  GSA
Sbjct: 588 LHYSFDGSA 596



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 49/106 (46%), Gaps = 10/106 (9%)

Query: 36  TVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGY 93
           T + G+ +  N +S        +LI   + R+A  TL  SFP +  S    + G  V   
Sbjct: 206 TCVTGYIVSGNLYSTTGANLGPNLI---NRRTAVKTLAGSFPTNTPSGAGAVFGGAVP-- 260

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTI 138
                 +ARF+ P   A D    IYV D +N VIRKI  L G TTI
Sbjct: 261 -SATASAARFNSPAMLATDSN-FIYVVDSTNSVIRKIDKLTGTTTI 304


>gi|428317077|ref|YP_007114959.1| NHL repeat containing protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428240757|gb|AFZ06543.1| NHL repeat containing protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 546

 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 35/100 (35%), Positives = 47/100 (47%), Gaps = 7/100 (7%)

Query: 48  HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK 107
           +SVI     S L + DS  +     +     E  V  + G G  G +DG+   A F  P+
Sbjct: 219 NSVIQNLVGSCLFIADSGHNRIVVSTL----EGEVLHVIGSGKPGLTDGDFEEAEFFAPQ 274

Query: 108 SFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGSK 144
             A D +  I YVAD  NH +RKI  T   V T+AG G +
Sbjct: 275 GMAFDAESQILYVADTENHALRKIDFTTQRVETVAGTGEQ 314



 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 37/135 (27%), Positives = 62/135 (45%), Gaps = 8/135 (5%)

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKG 115
           S+L V DS  S+   +   + E+  V+ + G G + G+ D +  + +        V+   
Sbjct: 395 SELFVADSEISSIRAVG--IDEDPKVRTVCGSGELFGFGDKDGRADQVRLQHCLGVEYAQ 452

Query: 116 N-IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICAL 173
           N ++VAD  NH I++I + G T     G    G  D     A F     L+ + PH   L
Sbjct: 453 NYLWVADTYNHKIKRIDHRGGTCRTMIGDGTAGLVDAKGLKARFFEPSGLSAIGPH---L 509

Query: 174 LISDHGNQLIRQINL 188
            ++D  N +IR++ L
Sbjct: 510 FVADTNNHVIRRVEL 524


>gi|381163732|ref|ZP_09872962.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
           azurea NA-128]
 gi|379255637|gb|EHY89563.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
           azurea NA-128]
          Length = 633

 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 47/102 (46%), Gaps = 9/102 (8%)

Query: 97  EPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPA 153
           E G+AR D    P    V  +G + VAD +NH I +  +   T +   G    G  DG  
Sbjct: 182 EDGAARIDGLRFPSKAVVTAEGRVLVADTANHSIAEFASDAETLLRRFGDGHRGAVDGAF 241

Query: 154 QNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLK 189
             A F+    LT +P   A      L+++D  N L+R ++L+
Sbjct: 242 DVARFAEPSGLTLLPTHVADQVGYHLIVADTANHLLRGVDLR 283



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 60/121 (49%), Gaps = 11/121 (9%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV------DMKG-NIYVADKSNH 125
            F    E++++R  GDG +G  DG    ARF +P    +      D  G ++ VAD +NH
Sbjct: 217 EFASDAETLLRRF-GDGHRGAVDGAFDVARFAEPSGLTLLPTHVADQVGYHLIVADTANH 275

Query: 126 VIRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
           ++R +      V T+AG G++ ++G   GPA     ++ ++  +      ++++  GN  
Sbjct: 276 LLRGVDLRTGSVRTVAGTGAQWRDGDDSGPATEVDLTSPWDARWWDAAGGVVVAMAGNHT 335

Query: 183 I 183
           +
Sbjct: 336 L 336


>gi|315500945|ref|YP_004079832.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ mal
           allergen [Micromonospora sp. L5]
 gi|315407564|gb|ADU05681.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Micromonospora sp. L5]
          Length = 610

 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 37/126 (29%), Positives = 67/126 (53%), Gaps = 18/126 (14%)

Query: 46  NPHSVIDRPGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
            P +V+  PG +      +L+V DS+R +   L+     E+VV+R+ G G +G +DG   
Sbjct: 166 EPDTVLRFPGKALPLDGGNLLVSDSARHSLVELA--ADGETVVRRI-GTGSRGRTDGPAS 222

Query: 100 SARFDKPKSFAV------DMKG-NIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRAD 150
           +A F +P+   +      ++ G ++ VAD  NH++R  ++++  V T+AG G +     D
Sbjct: 223 AATFAEPQGLCLLPPHVAEVAGYDVVVADTVNHLLRGVRLSSGEVVTVAGTGRQWRSTVD 282

Query: 151 GPAQNA 156
             A +A
Sbjct: 283 DHAHDA 288



 Score = 42.7 bits (99), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 6/80 (7%)

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICA- 172
           GN+ V+D + H + ++   G T +   G+   GR DGPA  A+F+    L  + PH+   
Sbjct: 183 GNLLVSDSARHSLVELAADGETVVRRIGTGSRGRTDGPASAATFAEPQGLCLLPPHVAEV 242

Query: 173 ----LLISDHGNQLIRQINL 188
               ++++D  N L+R + L
Sbjct: 243 AGYDVVVADTVNHLLRGVRL 262



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 2/103 (1%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSK 144
           AG  V+   DG        +P   +V   G+ ++VAD     IR +++  + T  G G  
Sbjct: 330 AGTTVEALRDGPLPDVWMAQPSGLSVSADGSRLWVADSETSAIRWVSDGEMHTAVGQGLF 389

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             G  DGPA  A   +   +  +P   ++LI+D  N  +R+ +
Sbjct: 390 DFGHVDGPADQALLQHPLGVCALPD-GSVLIADTYNGAVRRFD 431


>gi|285018581|ref|YP_003376292.1| nhl repeat protein [Xanthomonas albilineans GPE PC73]
 gi|283473799|emb|CBA16302.1| hypothetical nhl repeat protein [Xanthomonas albilineans GPE PC73]
          Length = 473

 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 35/65 (53%), Gaps = 3/65 (4%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVTTIAGG 141
           R  G G   + DGE G A F +P+  A+  +G +YVAD  NH +R+I  L   V T+ G 
Sbjct: 201 RQFGTGTADFIDGELGQAAFHRPQGLALQ-RGVLYVADTGNHALRRIHLLTGQVDTLCGN 259

Query: 142 GSKKE 146
           G   E
Sbjct: 260 GRAGE 264



 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 59/139 (42%), Gaps = 11/139 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIA 139
           ++R AG G     DG    A F +P S AV +   +YV D     +R +   G  V T+ 
Sbjct: 311 LRRCAGAGTLELRDGSAHLAAFAQPCSLAV-VNQVMYVCDALGSSVRSMQLRGDLVQTLL 369

Query: 140 GG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS- 197
           GG G    G  DGP   A       +   P    L I+D GN  +R + L   + S  + 
Sbjct: 370 GGQGVWDFGHEDGPRSRARLQFPQAIALSPDALHLWIADSGNGSLRSLRLGGGELSTVAL 429

Query: 198 ------QSGSALGAVSVWV 210
                  +G A+ A +VW+
Sbjct: 430 PRRLHGPAGLAVSAGAVWI 448


>gi|255542920|ref|XP_002512523.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
 gi|223548484|gb|EEF49975.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
          Length = 1016

 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 3/68 (4%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVT 136
           + + ++   G +G  DG    A F++P+  A + K N+ YVAD  NH +R+I   N  V 
Sbjct: 561 NFIVQIGSTGEEGLRDGPFDEATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNEMVR 620

Query: 137 TIAGGGSK 144
           T+AG G+K
Sbjct: 621 TLAGNGTK 628



 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 42/81 (51%), Gaps = 5/81 (6%)

Query: 115 GNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
           G IYVAD  NH I+K+      V+TIAG G  K G  DG A  A  S    +    +   
Sbjct: 776 GQIYVADSYNHKIKKLDPATKRVSTIAGTG--KAGFKDGKALAAQLSEPSGIIEAEN-GR 832

Query: 173 LLISDHGNQLIRQINLKPEDC 193
           L+I+D  N +IR ++L  E+ 
Sbjct: 833 LIIADTNNSIIRYLDLNKEEA 853


>gi|423214448|ref|ZP_17200976.1| hypothetical protein HMPREF1074_02508 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692863|gb|EIY86099.1| hypothetical protein HMPREF1074_02508 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 464

 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 53/95 (55%), Gaps = 2/95 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G+++G+   A+ ++P    +D +GNIYVA +  H I KIT  G+ T    G+   G  DG
Sbjct: 369 GFTEGQGNEAKLEEPCQMDLDEEGNIYVAVRRKHRIAKITPDGMLTNY-TGTGTSGTTDG 427

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           P   A F++   + F P   AL ISD+ N  IR+I
Sbjct: 428 PLDKAQFNHPEGVQFGPD-GALYISDYWNHKIRKI 461


>gi|408792280|ref|ZP_11203890.1| concanavalin A-like lectin/glucanases family protein [Leptospira
           meyeri serovar Hardjo str. Went 5]
 gi|408463690|gb|EKJ87415.1| concanavalin A-like lectin/glucanases family protein [Leptospira
           meyeri serovar Hardjo str. Went 5]
          Length = 1234

 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 37/129 (28%), Positives = 62/129 (48%), Gaps = 10/129 (7%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNL 133
           L+  + V  +AGDGVQ  +DG   +A+F+ P     D   N+++++ S H IR   +T  
Sbjct: 475 LTGTNAVDTIAGDGVQASTDGIGTAAQFNDPHGLTTDGT-NLFISEWSGHKIRHLNLTTN 533

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
            VTT+ G      G +D    N     +F    +     + I+D GN  +R I  +P + 
Sbjct: 534 KVTTLVG---SVSGYSDNTGGNGLL--NFPGFLLSDGLKVYIADTGNHSLRSI--EPSEV 586

Query: 194 SKSSQSGSA 202
            + +  G+A
Sbjct: 587 LRYTFDGNA 595



 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 53/117 (45%), Gaps = 16/117 (13%)

Query: 97  EPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRA----- 149
            P +ARF+ PK    D  GN IYVAD  N VIRKI  + G TTI  GG+   G       
Sbjct: 262 SPTAARFNGPKMITTD--GNFIYVADTMNAVIRKIDKSSGTTTILAGGNTGGGTVCPGTV 319

Query: 150 -----DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
                DG    A F+    LT   +   L + +     IR++NL     S  + SGS
Sbjct: 320 TTNCQDGVGTAAQFNAITGLTTDGN--NLFVLEASGNRIRKVNLATGVVSTFAGSGS 374


>gi|418461883|ref|ZP_13032943.1| Peroxiredoxin [Saccharomonospora azurea SZMC 14600]
 gi|359738010|gb|EHK86922.1| Peroxiredoxin [Saccharomonospora azurea SZMC 14600]
          Length = 633

 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 47/102 (46%), Gaps = 9/102 (8%)

Query: 97  EPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPA 153
           E G+AR D    P    V  +G + VAD +NH I +  +   T +   G    G  DG  
Sbjct: 182 EDGAARIDGLRFPSKAVVTAEGRVLVADTANHSIAEFASDAETLLRRFGDGHRGAVDGAF 241

Query: 154 QNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLK 189
             A F+    LT +P   A      L+++D  N L+R ++L+
Sbjct: 242 DVARFAEPSGLTLLPTHVADQVGYHLIVADTANHLLRGVDLR 283



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 60/121 (49%), Gaps = 11/121 (9%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV------DMKG-NIYVADKSNH 125
            F    E++++R  GDG +G  DG    ARF +P    +      D  G ++ VAD +NH
Sbjct: 217 EFASDAETLLRRF-GDGHRGAVDGAFDVARFAEPSGLTLLPTHVADQVGYHLIVADTANH 275

Query: 126 VIRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
           ++R +      V T+AG G++ ++G   GPA     ++ ++  +      ++++  GN  
Sbjct: 276 LLRGVDLRTGSVRTVAGTGAQWRDGDDSGPATEVDLTSPWDARWWDAAGGVVVAMAGNHT 335

Query: 183 I 183
           +
Sbjct: 336 L 336


>gi|395218326|ref|ZP_10401983.1| alkyl hydroperoxide reductase [Pontibacter sp. BAB1700]
 gi|394454578|gb|EJF09204.1| alkyl hydroperoxide reductase [Pontibacter sp. BAB1700]
          Length = 474

 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 63/127 (49%), Gaps = 4/127 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNL 133
           LS++  V  + G G +G+++G    A F++P   A+     +YVAD  NH IRK+   + 
Sbjct: 197 LSQQGQVLEVIGSGERGFNNGGYADATFNEPHGLAL-HGSTLYVADAKNHAIRKVDLQHR 255

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
            V+T AG G  +    D   +     N      + +  ++ I++ GN  I +++L+ E  
Sbjct: 256 MVSTAAGTGELEYYFHDDKRREPVNPNS-PWDLLIYGRSMFIANAGNHQILRMDLETEQV 314

Query: 194 SKSSQSG 200
            + + SG
Sbjct: 315 YRFAGSG 321


>gi|334116916|ref|ZP_08491008.1| NHL repeat containing protein [Microcoleus vaginatus FGP-2]
 gi|333461736|gb|EGK90341.1| NHL repeat containing protein [Microcoleus vaginatus FGP-2]
          Length = 546

 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 35/100 (35%), Positives = 47/100 (47%), Gaps = 7/100 (7%)

Query: 48  HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK 107
           +SVI     S L + DS  +     +     E  V  + G G  G +DG+   A F  P+
Sbjct: 219 NSVIQNLVGSCLFIADSGHNRIVVSTL----EGEVLHVIGSGQPGLTDGDFEEAEFFAPQ 274

Query: 108 SFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGSK 144
             A D +  I YVAD  NH +RKI  T   V T+AG G +
Sbjct: 275 GMAFDAESQILYVADTENHALRKIDFTTQRVETVAGTGEQ 314



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 37/135 (27%), Positives = 60/135 (44%), Gaps = 8/135 (5%)

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKG 115
           S+L V DS  S+   +   + E+  V+ + G G + G+ D +             V+   
Sbjct: 395 SELFVADSEISSIRAVG--IDEDPKVRTVCGSGELFGFGDKDGRGDEVRLQHCLGVEYAQ 452

Query: 116 N-IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICAL 173
           N ++VAD  NH I++I + G T     G    G  D     A F     L+ + PH   L
Sbjct: 453 NYLWVADTYNHKIKRIDHRGGTCRTMIGEGTAGLVDAKGLKARFFEPSGLSAIGPH---L 509

Query: 174 LISDHGNQLIRQINL 188
            ++D  N +IR++ L
Sbjct: 510 FVADTNNHVIRRVEL 524


>gi|256556932|gb|ACU83555.1| thiol-disulfide isomerase/thioredoxin [uncultured bacterium
           HF130_AEPn_1]
          Length = 497

 Score = 45.8 bits (107), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 7/113 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR--KITNLGVTTI 138
           VK + G G +G  DG+   ARF +P   A D K + +Y+AD  NH I+   + +  V T+
Sbjct: 221 VKMVIGSGKEGNKDGDIKVARFRRPHGLAFDEKNDLLYIADTDNHSIKSLDLKSKKVLTL 280

Query: 139 AGGGSK--KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           +G G +  K    +  A+    ++ ++L        LLI+  G   + +++LK
Sbjct: 281 SGNGERGFKRKAENAKAKGHPMASPWDLQLKGE--KLLIAMAGTHQLWEMDLK 331


>gi|302760605|ref|XP_002963725.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
 gi|300168993|gb|EFJ35596.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
          Length = 1052

 Score = 45.4 bits (106), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 32/48 (66%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            V+ +AG GV GY DG+   A+F +P   A+   G+++VAD +N++IR
Sbjct: 838 TVRTVAGTGVAGYEDGKSVKAQFSEPAGLALGPNGSLFVADTNNNLIR 885



 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 5/105 (4%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGS 143
           G G +G +DG    A F++P+  A D + NI YVAD  NH +R++      V T+AG G 
Sbjct: 610 GSGSEGLTDGSFSLASFNRPQGVAYDTEKNILYVADTENHALREVDFVKETVRTLAGTGE 669

Query: 144 KK-EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           +  + R     Q+   ++ ++L   P    L I+  G   I + N
Sbjct: 670 RGFDYRGGKAGQSQPLNSPWDLCLHPD-GMLYIAMAGEHQIWEFN 713



 Score = 45.1 bits (105), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 46/101 (45%), Gaps = 7/101 (6%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRA 149
           G  DG    A F  P        G IYVAD  NH I+   + +  V T+AG G    G  
Sbjct: 794 GDKDGPASQALFQHPLGILSSANGAIYVADSYNHKIKLMDLASKTVRTVAGTGVA--GYE 851

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
           DG +  A FS    L   P+  +L ++D  N LIR   LKP
Sbjct: 852 DGKSVKAQFSEPAGLALGPN-GSLFVADTNNNLIRL--LKP 889



 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 34/135 (25%), Positives = 58/135 (42%), Gaps = 7/135 (5%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG--YSDGEPGSAR-FDKPKSFAVDMKG 115
           L V D+   A   + F    +  V+ LAG G +G  Y  G+ G ++  + P    +   G
Sbjct: 641 LYVADTENHALREVDF---VKETVRTLAGTGERGFDYRGGKAGQSQPLNSPWDLCLHPDG 697

Query: 116 NIYVADKSNHVIRKITNL-GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
            +Y+A    H I +     GV  +  G   +         N+S++    L+  P +  L 
Sbjct: 698 MLYIAMAGEHQIWEFNRSNGVAKVFSGDGYERNLNGSKGSNSSYAQPSGLSLSPDMDHLY 757

Query: 175 ISDHGNQLIRQINLK 189
           I+D  +  +R +NLK
Sbjct: 758 IADSESSSVRSVNLK 772


>gi|308799191|ref|XP_003074376.1| NHL repeat (ISS) [Ostreococcus tauri]
 gi|116000547|emb|CAL50227.1| NHL repeat (ISS) [Ostreococcus tauri]
          Length = 1783

 Score = 45.4 bits (106), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 57/193 (29%), Positives = 85/193 (44%), Gaps = 32/193 (16%)

Query: 34  VTTVIDGHQLEINP-HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
           VTTV+    L   P   V+D     D+ VL  S+     +S  +S  +V          G
Sbjct: 285 VTTVV--ASLPFAPIELVVDESNGGDMYVLGQSQHGVMKIS--VSTLAVTTIAGSQTTSG 340

Query: 93  YSDGEPG-SARFDKPKSFAVD-MKGNIYVADKSNHVIRKI--TNLGVTTIAGGGS----- 143
           + D   G SARF  P+  A+D +   +YVAD  NH +R I  +   VTT+ G GS     
Sbjct: 341 FVDHNTGTSARFTLPRGLALDSLNSKLYVADTGNHAVRMIDLSTGVVTTVLGDGSPALNA 400

Query: 144 ---KKEGRADGPAQ-----NASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSK 195
               K+G    PA+       +++ D  L+       LL+SD G   +R++ L       
Sbjct: 401 STLNKDGVLSTPARFNDPVGIAYNYDSALS----SGVLLVSDAGTHQLRKLIL------N 450

Query: 196 SSQSGSALGAVSV 208
            S +G+A   V+V
Sbjct: 451 DSTAGNAATVVTV 463



 Score = 42.0 bits (97), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 8/68 (11%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGS--------ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S V   AG+G +G +DG+  +        A F  P S A+   G++ VAD  N  +R I
Sbjct: 221 KSEVTNFAGNGTEGSTDGKTSANAELGETQALFKGPTSVALLASGDVLVADTGNKAVRNI 280

Query: 131 TNLGVTTI 138
           +N  VTT+
Sbjct: 281 SNGDVTTV 288


>gi|158318845|ref|YP_001511353.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158114250|gb|ABW16447.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 770

 Score = 45.4 bits (106), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 45/155 (29%), Positives = 64/155 (41%), Gaps = 10/155 (6%)

Query: 33  TVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
           T+TTV     L   P +V   PG   LI    S   +      L      + +AG    G
Sbjct: 567 TITTVPTPAALR-RPRAVTTGPGGVLLIADTDSHRVWR-----LGPGETARVIAGTAEPG 620

Query: 93  YS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRA 149
           YS DG P + A   +P+S AVD  G + +AD     IR++ + G + T+AG        A
Sbjct: 621 YSGDGGPATRAAIGRPQSLAVDGAGRLLIADPDQRRIRRVDHAGRIGTMAGTAYGGRPAA 680

Query: 150 DG-PAQNASFSNDFELTFVPHICALLISDHGNQLI 183
           DG PA          L   P     L    G++++
Sbjct: 681 DGAPATGTDVGTPTGLAVGPDGTVYLADPAGDRVL 715


>gi|452910311|ref|ZP_21958992.1| hypothetical protein C884_01866 [Kocuria palustris PEL]
 gi|452834558|gb|EME37358.1| hypothetical protein C884_01866 [Kocuria palustris PEL]
          Length = 632

 Score = 45.4 bits (106), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 44/73 (60%), Gaps = 9/73 (12%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-------NIYVADKSNHVIR--KITN 132
           V+ + GDG +G +DG P  A F++P+  A+  +G       ++ VAD  NH +R  ++++
Sbjct: 207 VRSVIGDGTRGPADGGPEEAHFNEPRGLALLPQGVREQVGYDVIVADTVNHRLRGVRLSD 266

Query: 133 LGVTTIAGGGSKK 145
             VTT+AG G ++
Sbjct: 267 GEVTTLAGSGVQR 279



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 59/147 (40%), Gaps = 9/147 (6%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           +P   +    +  L+V  S     +T      E +V    AG G +G  DG+P  + F +
Sbjct: 307 SPWDTVWSTAADTLVVAMSGTHQIFTFDPRTGELAVA---AGTGDEGLVDGDPSQSWFAQ 363

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNL-----GVTTIAGGGSKKEGRADGPAQNASFSN 160
                    G I+ AD  +  +R I        GV T AG G    G  DG +  A   +
Sbjct: 364 TSGLVEAPDGTIWAADSESSALRSIAVADGALSGVATAAGLGLYDFGFRDGDSTQARLQH 423

Query: 161 DFELTFVPHICALLISDHGNQLIRQIN 187
              +  +P   ++ ++D  N  IR+ +
Sbjct: 424 PLGVAVLPD-GSIAVADTYNGAIRRWD 449


>gi|290990712|ref|XP_002677980.1| predicted protein [Naegleria gruberi]
 gi|284091590|gb|EFC45236.1| predicted protein [Naegleria gruberi]
          Length = 1555

 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 28/113 (24%), Positives = 54/113 (47%), Gaps = 4/113 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPG--SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTT 137
           +++ +AG+G+QGY+       +A+ + P    V+    ++ +D  N+ +R+IT + G+  
Sbjct: 548 IIQTIAGNGIQGYNGDNINALTAQLNSPTYLQVNYYHELFFSDTFNNRLRQITLSTGIIQ 607

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKP 190
              G   K    DGPA +         +       +  +D GN  IR ++L+P
Sbjct: 608 TIAGSGVKGFNGDGPALSTHLDTPMGFSLNSDW-IIYFADSGNNRIRYVDLQP 659



 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 68/149 (45%), Gaps = 21/149 (14%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEES-------------VVKRLAGDGVQGYSDGEPGSA--- 101
           D +VL+S +  F T S+    +S             VV  +AG G  G+ DG+ G A   
Sbjct: 456 DQLVLNSPKGIFATTSYVYISDSGNHRIIKVHRTSGVVIGVAGCGTSGF-DGDGGLATLA 514

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           + + P    +D    +YVAD +NH IR   IT+  + TIAG G +     +  A  A  +
Sbjct: 515 KLNSPTGIYLD-NNVLYVADTNNHRIRAVNITSGIIQTIAGNGIQGYNGDNINALTAQLN 573

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINL 188
           +   L  V +   L  SD  N  +RQI L
Sbjct: 574 SPTYLQ-VNYYHELFFSDTFNNRLRQITL 601



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 38/76 (50%), Gaps = 6/76 (7%)

Query: 81  VVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLG 134
           +++ +AG GV+G++ DG   S   D P  F+++    IY AD  N+ IR +      N  
Sbjct: 605 IIQTIAGSGVKGFNGDGPALSTHLDTPMGFSLNSDWIIYFADSGNNRIRYVDLQPLFNDP 664

Query: 135 VTTIAGGGSKKEGRAD 150
           VT     GS  +G  D
Sbjct: 665 VTVTTHSGSGAKGYTD 680



 Score = 37.4 bits (85), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 41/72 (56%), Gaps = 5/72 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSD-GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++++ +++ +AG+G  G+S   +P ++  + P S  V  KG   ++D  N+ IRKI+ + 
Sbjct: 886 INKKGLLETVAGNGFTGFSQTSDPKTSSLNSPSS-VVSYKGQTIISDSKNNFIRKISTIA 944

Query: 135 ---VTTIAGGGS 143
                 IAG GS
Sbjct: 945 SHPFDVIAGDGS 956


>gi|380511998|ref|ZP_09855405.1| nhl repeat protein [Xanthomonas sacchari NCPPB 4393]
          Length = 473

 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVTTIA 139
           V R  G G   + DG  G A F +P+  A++ +G +YVAD  NH +R+I  L   V T+ 
Sbjct: 199 VLRQFGIGTADFIDGAIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLLTGQVDTLC 257

Query: 140 GGGSKK---EGRADGP 152
           G G      EGR + P
Sbjct: 258 GNGRAGEPVEGRVEQP 273



 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 61/148 (41%), Gaps = 12/148 (8%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S+ L   S+  R AG G     DG    A F +P S A  ++  +YV D     +R +  
Sbjct: 303 SYDLGSRSLRWR-AGAGALELRDGSGHLAAFAQPCSLAA-VQQVLYVCDALGSAVRSMQL 360

Query: 133 LG--VTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            G  V T+ GG G    G  DGP   A       +   P    L I+D GN  +R + L 
Sbjct: 361 RGDLVQTLLGGQGPWDFGNEDGPRSRARLQFPQAIALSPDAPLLWIADSGNGSLRSLRLG 420

Query: 190 PEDCSKS-------SQSGSALGAVSVWV 210
             + S +          G A+ A +VW+
Sbjct: 421 GGELSTAVLPRRLHGPGGLAVSAGTVWI 448


>gi|152968196|ref|YP_001363980.1| NHL repeat-containing protein [Kineococcus radiotolerans SRS30216]
 gi|151362713|gb|ABS05716.1| putative NHL repeat protein [Kineococcus radiotolerans SRS30216]
          Length = 584

 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 55/111 (49%), Gaps = 12/111 (10%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAV---DMKG----NIYVADKSNHVIRKITNL 133
           V  R  G G +G +DG   S  F  P+  AV   D+      ++ VAD  NH++R + + 
Sbjct: 187 VGDRPIGSGARGRTDGPAPS--FAAPQGVAVLPPDLAAALGYDLVVADTGNHLLRGVRSA 244

Query: 134 G--VTTIAGGGSKKEGRA-DGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
               TTIAG G+   GR+  GPA   + S+ +++ + P    LL++  G  
Sbjct: 245 DGRTTTIAGTGAPARGRSGGGPALATALSSPWDVAWWPAFGELLVAVAGTH 295


>gi|126179662|ref|YP_001047627.1| NHL repeat-containing protein [Methanoculleus marisnigri JR1]
 gi|125862456|gb|ABN57645.1| NHL repeat containing protein [Methanoculleus marisnigri JR1]
          Length = 487

 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           E   V   AGDG +GY DG  G AR ++P    VD  G  Y+AD  NH++R
Sbjct: 422 ETGWVLTRAGDGDRGYRDGLSGEARLNEPGGL-VDFGGLWYIADTGNHIVR 471



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 18/107 (16%)

Query: 105 KPKSFAVDMKG-NIYVADKSNHVIRKITNLGVTTIAGG--------GSKKEGRADGPAQN 155
           +P   A D  G  ++++D  +H I          +AGG        GS   G ADGP   
Sbjct: 173 RPGKVAADNAGMRLFISDTGHHRI---------VVAGGDGKILETIGSGAAGNADGPFDE 223

Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           A+F     L F      L ++D GN  IR+++      + ++ +G A
Sbjct: 224 AAFYLPEGLAFDEEAGILYVADTGNHTIRRVSFTERRVATNAGTGLA 270



 Score = 37.7 bits (86), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 53/121 (43%), Gaps = 12/121 (9%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGG- 142
           G G  G +DG    A F  P+  A D +  I YVAD  NH IR++  T   V T AG G 
Sbjct: 210 GSGAAGNADGPFDEAAFYLPEGLAFDEEAGILYVADTGNHTIRRVSFTERRVATNAGTGL 269

Query: 143 ---SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQS 199
              S  EG   GP    + +   +L  +     L I+  G   I +++L   +    + S
Sbjct: 270 AAPSFGEG---GPGTGTALNAPRDLALLG--GHLYIAMAGANQIWRMDLATYEVEPYAGS 324

Query: 200 G 200
           G
Sbjct: 325 G 325


>gi|195119117|ref|XP_002004078.1| GI18254 [Drosophila mojavensis]
 gi|193914653|gb|EDW13520.1| GI18254 [Drosophila mojavensis]
          Length = 734

 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 38/73 (52%), Gaps = 2/73 (2%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTI 138
           VVK   G    G+ DG    +RF+ P+  A   K  + VAD  NH +R+I+  N  V T+
Sbjct: 255 VVKYKIGGLEAGFVDGSLNMSRFNNPQGVAFLDKNTLIVADTDNHALRQISLNNGVVETL 314

Query: 139 AGGGSKKEGRADG 151
           AG GS+   R  G
Sbjct: 315 AGTGSQGNERTGG 327


>gi|167041535|gb|ABZ06284.1| putative NHL repeat protein [uncultured marine microorganism
           HF4000_008B14]
          Length = 426

 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 67/145 (46%), Gaps = 10/145 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+    S+ L + DS+ +     +     +    ++ G G  G  DG   +A FD P
Sbjct: 111 PGKVVADADSNRLFIADSNHNRIIVTTL----DGDALKVVGSGEPGMVDGSFAAASFDHP 166

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGR-ADGPAQNASFSNDFE 163
           +   + +   +YVAD  NH IRK+      V TIAG G++   R   GP  +   ++ ++
Sbjct: 167 QGMVL-VGDLLYVADAENHAIRKVDLAAEQVETIAGTGTQGSMREGTGPGLHTELNSPWD 225

Query: 164 LTFVPHICALLISDHGNQLIRQINL 188
           L F  H  +L I+  G   +  ++L
Sbjct: 226 LAF--HEGSLYIAMAGLHQLWSMDL 248



 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 58/133 (43%), Gaps = 7/133 (5%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGN 116
           L   DS  SA    S  L    +V+ + G G+  + D  G   + R   P   A    G 
Sbjct: 287 LYFADSETSA--VRSADLDPHGMVRTIIGLGLFEFGDVDGADHNIRLQHPIGVA-HHDGT 343

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLIS 176
           IY+AD  NH I+K+  +  ++    G+ + G  DGP   A FS    L+       + I+
Sbjct: 344 IYLADTYNHKIKKVLPVMQSSFTMLGNGQPGDRDGPGNQAQFSEPSGLSI--DGGKIYIA 401

Query: 177 DHGNQLIRQINLK 189
           D  N  IR  +L+
Sbjct: 402 DTNNHAIRVADLE 414


>gi|423072738|ref|ZP_17061487.1| fibronectin type III domain protein [Desulfitobacterium hafniense
           DP7]
 gi|361856502|gb|EHL08400.1| fibronectin type III domain protein [Desulfitobacterium hafniense
           DP7]
          Length = 1926

 Score = 45.4 bits (106), Expect = 0.024,   Method: Composition-based stats.
 Identities = 26/77 (33%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 92  GYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
           GYS DG P +A +   P + A D  GN+Y++D  N  IRK+   G+ T   G        
Sbjct: 285 GYSGDGGPATAAQLKTPNALAFDGAGNMYISDSGNRRIRKVDPQGIITTIAGKETNGDPT 344

Query: 150 DGPAQNASFSNDFELTF 166
            GPA+++       L F
Sbjct: 345 PGPAEDSKIPGTHWLGF 361


>gi|219667746|ref|YP_002458181.1| hypothetical protein Dhaf_1699 [Desulfitobacterium hafniense DCB-2]
 gi|219538006|gb|ACL19745.1| protein of unknown function DUF1533 [Desulfitobacterium hafniense
           DCB-2]
          Length = 1916

 Score = 45.4 bits (106), Expect = 0.024,   Method: Composition-based stats.
 Identities = 26/77 (33%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 92  GYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
           GYS DG P +A +   P + A D  GN+Y++D  N  IRK+   G+ T   G        
Sbjct: 275 GYSGDGGPATAAQLKTPNALAFDGAGNMYISDSGNRRIRKVDPQGIITTIAGKETNGDPT 334

Query: 150 DGPAQNASFSNDFELTF 166
            GPA+++       L F
Sbjct: 335 PGPAEDSKIPGTHWLGF 351


>gi|145592666|ref|YP_001156963.1| alkyl hydroperoxide reductase [Salinispora tropica CNB-440]
 gi|145302003|gb|ABP52585.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Salinispora tropica CNB-440]
          Length = 612

 Score = 45.4 bits (106), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 49/103 (47%), Gaps = 2/103 (1%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSK 144
           AG  V+   DG    A   +P   +V   G+ ++VAD     +R + N  +TT  G G  
Sbjct: 332 AGSTVEARKDGPLAEAWLAQPSGLSVSADGSRLWVADSETSAVRYVENGILTTAVGQGLF 391

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           + G  DGPA  A   +   +  +P   ++LI+D  N  +R+ +
Sbjct: 392 EFGHVDGPAAQALLQHPLGVCALPD-GSVLIADTYNGAVRRYD 433



 Score = 42.0 bits (97), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 69/139 (49%), Gaps = 21/139 (15%)

Query: 46  NPHSVIDRPGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
            P + +  PG S      +L+V DS+R +   L+     E++++R+ G G +G +DG   
Sbjct: 168 EPETTLRFPGKSVVLDNGNLLVSDSARHSLAELAP--DGETLLRRI-GTGARGRADGTAT 224

Query: 100 SARFDKPKSFAV------DMKG-NIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRAD 150
           +A F +P+   +       + G ++ VAD  NH++R  ++    V T+AG G +     D
Sbjct: 225 AATFAEPQGLCLLPAQVSRLVGYDLVVADTVNHLLRGVRLATGEVVTVAGTGRQWRSTVD 284

Query: 151 GPAQNA---SFSNDFELTF 166
             A +A     S+ ++L +
Sbjct: 285 DHAHDALSVDLSSPWDLAW 303



 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 45/88 (51%), Gaps = 7/88 (7%)

Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
           KS  +D  GN+ V+D + H + ++   G T +   G+   GRADG A  A+F+    L  
Sbjct: 178 KSVVLD-NGNLLVSDSARHSLAELAPDGETLLRRIGTGARGRADGTATAATFAEPQGLCL 236

Query: 167 VPHICA------LLISDHGNQLIRQINL 188
           +P   +      L+++D  N L+R + L
Sbjct: 237 LPAQVSRLVGYDLVVADTVNHLLRGVRL 264


>gi|428184600|gb|EKX53455.1| hypothetical protein GUITHDRAFT_101156 [Guillardia theta CCMP2712]
          Length = 367

 Score = 45.4 bits (106), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           VV  +AG GV GY +G   + + + P   A+     IYV+D+ NH IR++  +   T  G
Sbjct: 12  VVCSIAGTGVPGYYEGNRMTCKMNCPSGIAMGQGDCIYVSDRQNHCIRRLPGIEEQTAPG 71

Query: 141 ---------GGSKKEGRADGPAQNASFS 159
                    G  +K G  DG  + A F+
Sbjct: 72  KEDDVSTFAGVPRKSGMEDGHNRRALFN 99



 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 43/86 (50%), Gaps = 2/86 (2%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P    VD +  +YV++   H IR++   G + I  G S+K G+ DG   N S  +   ++
Sbjct: 226 PVGICVDEQNCVYVSEWGGHRIRRVQPDGSSVIWVGTSQK-GKLDGKGINGSLFHPAGMS 284

Query: 166 FVPHICALLISDHGNQLIRQINLKPE 191
            V     + ++D GN  +R+IN   E
Sbjct: 285 -VDTKGIVYVADFGNHCVRRINTTGE 309


>gi|160942444|ref|ZP_02089750.1| hypothetical protein CLOBOL_07327 [Clostridium bolteae ATCC
           BAA-613]
 gi|158434643|gb|EDP12410.1| hypothetical protein CLOBOL_07327 [Clostridium bolteae ATCC
           BAA-613]
          Length = 329

 Score = 45.4 bits (106), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 40/73 (54%), Gaps = 3/73 (4%)

Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
           +G +YVAD  N+ I K  +  V   AG G  ++G ADGP   A FS    +T      AL
Sbjct: 212 EGALYVADTGNNRIVKAMDGAVVWSAGTG--EDGFADGPVSQAMFSGPQRIT-AAEDGAL 268

Query: 174 LISDHGNQLIRQI 186
            +SD GN ++R+I
Sbjct: 269 YVSDTGNSVVRKI 281



 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 49/109 (44%), Gaps = 19/109 (17%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
           AG G  G++DG    A F  P+       G +YV+D  N V+RKI    V+T+       
Sbjct: 237 AGTGEDGFADGPVSQAMFSGPQRITAAEDGALYVSDTGNSVVRKIWGDNVSTL------- 289

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDH---GNQLIRQINLKPE 191
                 PA +A+   D +LT       +++ DH    +  +R++ + P 
Sbjct: 290 ------PAADAA---DPDLTIASPEGMMVMGDHIYLCDSFLRKVFVLPR 329



 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 46/97 (47%), Gaps = 4/97 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           GY DG  G     +P      + G   ++D+SNHV+R I+  G  T AG G  K G  D 
Sbjct: 99  GYRDGSLGEMLIGEPWDMIPYLDGYA-LSDRSNHVVRYISPEGAMTAAGTG--KAGYEDN 155

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            A  A F+    L   P    L I+D  N ++R ++L
Sbjct: 156 LASKALFAAPTGLAAGPD-GILYIADTDNDVVRSLSL 191


>gi|221633398|ref|YP_002522623.1| hypothetical protein trd_1418 [Thermomicrobium roseum DSM 5159]
 gi|221156519|gb|ACM05646.1| conserved hypothetical protein [Thermomicrobium roseum DSM 5159]
          Length = 443

 Score = 45.4 bits (106), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 65/148 (43%), Gaps = 9/148 (6%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEE 79
           L  +G+   + + + +V++  Q  +  P  +   P    L+V D+              +
Sbjct: 94  LARTGVTTSEPFALLSVLERPQGPLAFPGKLAVDPSRDRLVVSDTGHHRLVIARL----D 149

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTT 137
             V  + GDG  G  DG    ARF +P+  A+ +    +VAD+ NH IR+I      V T
Sbjct: 150 GTVTAVIGDGRPGLVDGTFAEARFREPQGIAL-VGETCFVADRGNHAIRRIDLAAGTVET 208

Query: 138 IAGGGSKKEGR-ADGPAQNASFSNDFEL 164
           +AG G   +G  + GPA      + + L
Sbjct: 209 LAGTGRLGQGMLSAGPAHQVDLRSPWAL 236



 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 41/92 (44%), Gaps = 14/92 (15%)

Query: 52  DRPGSSDLIVLDSSRSAFYTLSFPLSEESV---VKRL----------AGDGVQGYSDGEP 98
           DR G+ D ++L       +T    L  ++    +KRL           G G  G+ DG P
Sbjct: 333 DRDGTGDTVLLQHPLDVAWTGEELLVADTYNHKIKRLDPVARRCSSWLGTGQPGHEDGPP 392

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
             ARF +P   A      +YVAD +NH +R I
Sbjct: 393 ERARFWEPSGLATTFD-RVYVADTNNHAVRVI 423


>gi|290971625|ref|XP_002668588.1| predicted protein [Naegleria gruberi]
 gi|284082057|gb|EFC35844.1| predicted protein [Naegleria gruberi]
          Length = 731

 Score = 45.4 bits (106), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 46/90 (51%), Gaps = 1/90 (1%)

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNA 156
            P +   + P        G++Y+A + N+ I K +N  ++T+AG G        GPA+NA
Sbjct: 142 NPNNDGINGPNGIMYTPNGDLYIASQGNNKIMKYSNSQLSTVAGNGYAGYSGDGGPARNA 201

Query: 157 SFSNDFELTFVPHICALLISDHGNQLIRQI 186
             +   ++T V     + I+D GN+ +R++
Sbjct: 202 KLNYPADVT-VSSTGEVYIADWGNRAVRKV 230



 Score = 41.2 bits (95), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 36/72 (50%), Gaps = 4/72 (5%)

Query: 79  ESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-- 134
            S +  +AG+G  GYS DG P  +A+ + P    V   G +Y+AD  N  +RK+   G  
Sbjct: 177 NSQLSTVAGNGYAGYSGDGGPARNAKLNYPADVTVSSTGEVYIADWGNRAVRKVFRNGTI 236

Query: 135 VTTIAGGGSKKE 146
           +T I  G    E
Sbjct: 237 ITLIVSGNGLSE 248


>gi|451341293|ref|ZP_21911747.1| NHL repeat containing protein [Amycolatopsis azurea DSM 43854]
 gi|449415822|gb|EMD21660.1| NHL repeat containing protein [Amycolatopsis azurea DSM 43854]
          Length = 602

 Score = 45.4 bits (106), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 48/114 (42%), Gaps = 6/114 (5%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP 152
           Y   E   A    P       +G I VAD  NH I +  + G T I   GS   G  DGP
Sbjct: 157 YVPAEEQRAELRFPSKAVTTAEGRILVADTGNHSIVEFASDGETIIRRFGSGSRGAQDGP 216

Query: 153 AQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSG 200
              ASF+    +T +P+  A       +++D     +R I+L   + S  + +G
Sbjct: 217 FDLASFTEPSGITLLPYEVAERAGYHAVVADTAGHRLRGIDLITGEVSTVAGTG 270


>gi|159035770|ref|YP_001535023.1| alkyl hydroperoxide reductase [Salinispora arenicola CNS-205]
 gi|157914605|gb|ABV96032.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Salinispora arenicola CNS-205]
          Length = 612

 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 8/93 (8%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           RF  P    V   GN+ V+D + H I ++   G   +   G+   GRADGPA  A+F+  
Sbjct: 174 RF--PGKAVVLGNGNLLVSDSARHSIAELAPDGERVVRRIGTGARGRADGPATAATFAEP 231

Query: 162 FELTFVPHICA------LLISDHGNQLIRQINL 188
             L  +P   A      L+++D  N L+R + L
Sbjct: 232 QGLCLLPAHVARLVDYDLVVADTVNHLLRGVRL 264



 Score = 43.9 bits (102), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 63/124 (50%), Gaps = 15/124 (12%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF----- 109
           G+ +L+V DS+R +   L+     E VV+R+ G G +G +DG   +A F +P+       
Sbjct: 183 GNGNLLVSDSARHSIAELAP--DGERVVRRI-GTGARGRADGPATAATFAEPQGLCLLPA 239

Query: 110 --AVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRADGPAQNA---SFSNDF 162
             A  +  ++ VAD  NH++R  ++    V T+AG G +     D  A +A     S+ +
Sbjct: 240 HVARLVDYDLVVADTVNHLLRGVRLATGEVVTVAGTGRQWRSTVDDHAHDALSVDLSSPW 299

Query: 163 ELTF 166
           +L +
Sbjct: 300 DLAW 303



 Score = 42.0 bits (97), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 48/103 (46%), Gaps = 2/103 (1%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSK 144
           AG  V+   DG    A   +P   +V   G+ ++VAD     IR + +  + T  G G  
Sbjct: 332 AGSTVEALKDGPLAEAWLAQPSGLSVSADGSRLWVADSETSAIRYVQDGVLNTAVGQGLF 391

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           + G  DGPA  A   +   +  +P   ++LI+D  N  +R+ +
Sbjct: 392 EFGHVDGPAAQALLQHPLGVCALPD-GSVLIADTYNGAVRRYD 433


>gi|254383375|ref|ZP_04998727.1| NHL repeat containing protein [Streptomyces sp. Mg1]
 gi|194342272|gb|EDX23238.1| NHL repeat containing protein [Streptomyces sp. Mg1]
          Length = 607

 Score = 45.1 bits (105), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 57/115 (49%), Gaps = 12/115 (10%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S + +V DS+R     L+     ESVV+R+ G G +G+     G   F +P+  A+   G
Sbjct: 191 SGNFLVSDSTRHQLVELA--ADAESVVRRI-GSGERGF-----GPDAFSEPQGLALLPDG 242

Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
           ++ VAD  NH +R+       V T+AG G +  +     GPA     S+ +++ +
Sbjct: 243 SVVVADTVNHALRRFDPATGSVETVAGTGRQWWQGSPTSGPALEVDLSSPWDVAW 297


>gi|449670570|ref|XP_002158620.2| PREDICTED: NHL repeat-containing protein 2-like [Hydra
           magnipapillata]
          Length = 629

 Score = 45.1 bits (105), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 43/82 (52%), Gaps = 7/82 (8%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTI 138
           VVKR+ G G  G+ DGE   ++F  P+  +  ++  I+VAD  NH IR+I   N    TI
Sbjct: 244 VVKRI-GTGFPGFKDGEAHVSQFSSPQGLSW-IENVIFVADTENHSIREINVANWITKTI 301

Query: 139 AGGGSK---KEGRADGPAQNAS 157
            G G +   KEG   G  Q  S
Sbjct: 302 VGNGKQGKDKEGGKLGIEQEIS 323



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 13/102 (12%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTTIAGGGSKKEGRA 149
           G  DG   +A+   P   A  + G++YVAD  NH I+   ++NL  TT+AG G    G  
Sbjct: 449 GDVDGIGLNAKLQHPLDVAA-VNGHLYVADSYNHKIKVVNMSNLSCTTLAGNGVV--GIK 505

Query: 150 DGPAQNASFSNDFELTFVPHIC---ALLISDHGNQLIRQINL 188
           +G   +A F+    +     IC    + I+D  N LIR +NL
Sbjct: 506 NGDFLSAEFNEPGGI-----ICLDDTIYIADTNNHLIRVLNL 542



 Score = 38.5 bits (88), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 29/44 (65%), Gaps = 1/44 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           LAG+GV G  +G+  SA F++P    + +   IY+AD +NH+IR
Sbjct: 496 LAGNGVVGIKNGDFLSAEFNEPGGI-ICLDDTIYIADTNNHLIR 538


>gi|354615998|ref|ZP_09033698.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Saccharomonospora paurometabolica YIM 90007]
 gi|353219649|gb|EHB84187.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Saccharomonospora paurometabolica YIM 90007]
          Length = 727

 Score = 45.1 bits (105), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 44/90 (48%), Gaps = 6/90 (6%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P       +G + VAD ++H + +  +   T +   G  + G ADG    A+F+    LT
Sbjct: 241 PSKAVATAEGRVLVADTAHHSVVEFASDAETVVRRFGDGERGAADGAFDVATFAEPAGLT 300

Query: 166 FVPHICA------LLISDHGNQLIRQINLK 189
            +P   A      LL++D GN L+R I+L+
Sbjct: 301 LLPPEVAERVGYHLLVADTGNHLLRGIDLR 330



 Score = 45.1 bits (105), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 33/122 (27%), Positives = 62/122 (50%), Gaps = 11/122 (9%)

Query: 72  LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSN 124
           + F    E+VV+R  GDG +G +DG    A F +P          A  +  ++ VAD  N
Sbjct: 263 VEFASDAETVVRRF-GDGERGAADGAFDVATFAEPAGLTLLPPEVAERVGYHLLVADTGN 321

Query: 125 HVIRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQ 181
           H++R I      V T+AG G++ ++G   GPA  ++ ++ +++ +      ++++  GN 
Sbjct: 322 HLLRGIDLRTGSVRTVAGTGAQWRDGADSGPATGSALTSPWDVRWWEPAGGVVVAMAGNH 381

Query: 182 LI 183
            +
Sbjct: 382 TL 383


>gi|290978007|ref|XP_002671728.1| predicted protein [Naegleria gruberi]
 gi|284085299|gb|EFC38984.1| predicted protein [Naegleria gruberi]
          Length = 2520

 Score = 45.1 bits (105), Expect = 0.030,   Method: Composition-based stats.
 Identities = 33/109 (30%), Positives = 53/109 (48%), Gaps = 15/109 (13%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI---------TNLGVTT 137
           G G  GY+DG   +A ++ P   A     ++Y+AD+ NHV+R +          NL   +
Sbjct: 228 GVGSAGYADGSAFNALYNAPTGVAYGPNKDLYIADRGNHVVRVVKFSYGTTGTQNLATIS 287

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +  G +K  G +D    N ++  D +++   HI    I D GN  IR+I
Sbjct: 288 LFSGVAKTPG-SDLVKMNNTY--DVDVSSEGHI---FILDTGNLRIRKI 330



 Score = 37.0 bits (84), Expect = 7.8,   Method: Composition-based stats.
 Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 11/75 (14%)

Query: 77  SEESVVKRL----------AGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNH 125
           S+  +VKRL          AG G+ G       + A  + P    V  +G + +AD  NH
Sbjct: 55  SKNHLVKRLDLTTLKVTIVAGTGIAGNDTNTTATGAMLNYPGGICVTSEGKLLIADTLNH 114

Query: 126 VIRKITNLGVTTIAG 140
           VIR+++N  + TI G
Sbjct: 115 VIREVSNGQIKTIIG 129


>gi|290991506|ref|XP_002678376.1| NHL repeat domain-containing protein [Naegleria gruberi]
 gi|284091988|gb|EFC45632.1| NHL repeat domain-containing protein [Naegleria gruberi]
          Length = 985

 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 63/140 (45%), Gaps = 12/140 (8%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV-QGYSDG-EPGSARFDKPKSFAVDM 113
           + D+ ++DS     Y +    +  +V+    G+G+ + Y +    G+A F+ P  F   +
Sbjct: 341 TGDIYIIDSQN---YRIRKITASTNVISTFLGNGLSRNYGENISAGAASFNSPVGFTFSL 397

Query: 114 ---KGNIYVADKSNHVIRKITNL----GVTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
                 IYVAD  NH IR  +       + T+AG      G  + PA ++SF+    L F
Sbjct: 398 AYPNNVIYVADTGNHRIRSFSQFYSGRNIVTVAGLPRPGLGGENLPALSSSFNFPSNLVF 457

Query: 167 VPHICALLISDHGNQLIRQI 186
            P    LLI D  N  IR +
Sbjct: 458 HPISGDLLIVDTQNHRIRSM 477



 Score = 44.3 bits (103), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 7/95 (7%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE---PGSARF 103
           P +++  P S DL+++D+      ++S      S VK +AG G+ GY +GE       + 
Sbjct: 452 PSNLVFHPISGDLLIVDTQNHRIRSMS---KSTSFVKTIAGTGIAGY-NGEGMLSNMTKM 507

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           + P   AV   G I  AD  N+++R I   G+ +I
Sbjct: 508 NSPSGIAVLSTGEIIFADTFNNLVRMINLQGIVSI 542



 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 53/109 (48%), Gaps = 15/109 (13%)

Query: 89  GVQGYSDGEPG------SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
           G+QG S G+ G      SA  + P+S A +  G+++++D  N+ I+K+   N  + TIAG
Sbjct: 253 GIQGQS-GDSGEGYLATSAFLNGPQSVAFNSNGDMFISDSKNNKIKKVFFNNGTIITIAG 311

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVP---HICALLISDHGNQLIRQI 186
            GS       G   NA F+     TF+        + I D  N  IR+I
Sbjct: 312 TGS---ANFTGDGSNAKFATLNLPTFISVSNSTGDIYIIDSQNYRIRKI 357


>gi|162457162|ref|YP_001619529.1| hypothetical protein sce8877 [Sorangium cellulosum So ce56]
 gi|161167744|emb|CAN99049.1| Hypothetical protein sce8877 [Sorangium cellulosum So ce56]
          Length = 127

 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 52/106 (49%), Gaps = 3/106 (2%)

Query: 85  LAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
           +AG G +G   DG P  A+ D P+  A+   G + +A++  + +R+ T  G + T+AG G
Sbjct: 1   MAGTGNEGDRGDGGPAVAKLDVPRRLALGPDGRLRIAEQGGNRVRRSTPDGTIATVAGTG 60

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           +       GPA  A+ ++ F +   P   A   SD  +    QI L
Sbjct: 61  APGYSGDGGPATLATSTSPFSIALGPR-WATPASDPASNQTEQIEL 105


>gi|380016202|ref|XP_003692077.1| PREDICTED: membrane-associated protein Hem-like [Apis florea]
          Length = 1439

 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 38/136 (27%), Positives = 62/136 (45%), Gaps = 12/136 (8%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIA 139
           ++ + G     + DG+  +ARF+ P+   + +   +YVAD  NH IRKI  +   VTT+ 
Sbjct: 256 IQHVIGGSNPDFRDGDFENARFNAPQGVCI-LDTFVYVADNENHAIRKIDLIKKIVTTVV 314

Query: 140 GGGSKKEGRADGP-AQNASFSNDFELTFVPH------ICALLISDHGNQLIRQINLKPED 192
           G G +      G   +N   S+ ++L    H      I  LLI+  G   I  + LK   
Sbjct: 315 GTGIQGHDYVGGKIGKNQILSSPWDLAIYEHEYNKNSIPILLIAMAGTHQIWALFLKDTI 374

Query: 193 CSKSSQ--SGSALGAV 206
             K  +  +G+ +  V
Sbjct: 375 WWKKREYKAGTCIAIV 390


>gi|269796827|ref|YP_003316282.1| thiol-disulfide isomerase-like thioredoxin [Sanguibacter keddieii
           DSM 10542]
 gi|269099012|gb|ACZ23448.1| thiol-disulfide isomerase-like thioredoxin [Sanguibacter keddieii
           DSM 10542]
          Length = 634

 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 61/145 (42%), Gaps = 10/145 (6%)

Query: 67  SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP--GSARFDKPKSFAVDMKGNIYVADKSN 124
           S    L   L EE   K     G   Y   EP  G+ RF  P +      GN+ VAD  +
Sbjct: 144 STLEILVRELVEEHSAKGTLHRGSGPYVPAEPTPGTLRF--PAAVIALESGNLLVADAGH 201

Query: 125 HVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDH 178
           H + ++   G T +   GS + G +DG  Q+A+FS    L  VP          +L++D 
Sbjct: 202 HSLAELAADGETLVRRIGSGERGLSDGGRQDATFSEPNGLCLVPATLRDQLGYDVLVADT 261

Query: 179 GNQLIRQINLKPEDCSKSSQSGSAL 203
            N  +R + L     +  + +G  L
Sbjct: 262 VNHALRGVRLSDGHVTTVAGTGDQL 286



 Score = 40.4 bits (93), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 70/147 (47%), Gaps = 24/147 (16%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV---- 111
           S +L+V D+   +   L+     E++V+R+ G G +G SDG    A F +P    +    
Sbjct: 191 SGNLLVADAGHHSLAELAA--DGETLVRRI-GSGERGLSDGGRQDATFSEPNGLCLVPAT 247

Query: 112 --DMKG-NIYVADKSNHVIR--KITNLGVTTIAGGGSK-KEGRADG-----------PAQ 154
             D  G ++ VAD  NH +R  ++++  VTT+AG G +   G A+            PA 
Sbjct: 248 LRDQLGYDVLVADTVNHALRGVRLSDGHVTTVAGTGDQLMVGGAENVVPETTAPDATPAL 307

Query: 155 NASFSNDFELTFVPHICALLISDHGNQ 181
               ++ +++ +   + A L++  GN 
Sbjct: 308 RHRLTSPWDVVWSERLGAFLVAMAGNH 334



 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 26/112 (23%), Positives = 54/112 (48%), Gaps = 6/112 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGV 135
           VV+++ G   +G  DG    A F +P   +V   G++++AD     +R++      +  +
Sbjct: 345 VVEQVGGTQNEGLLDGPLAQAWFAQPSGLSVGPDGSVWLADAETSALRRVDVADDGSATI 404

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           T++ G G    G  DG A  A   +   +  +P   +++++D  N  +R+ +
Sbjct: 405 TSLVGQGLFDFGHRDGAAAQALLQHPLGVAALPD-GSVVVTDTYNGALRRYD 455


>gi|395225280|ref|ZP_10403807.1| NHL repeat protein, partial [Thiovulum sp. ES]
 gi|394446527|gb|EJF07348.1| NHL repeat protein, partial [Thiovulum sp. ES]
          Length = 4349

 Score = 45.1 bits (105), Expect = 0.032,   Method: Composition-based stats.
 Identities = 26/59 (44%), Positives = 35/59 (59%), Gaps = 1/59 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRA 149
           G  DG   + RF  P+  A D +GN++VAD +N  I+KIT  G VTT A G ++  G A
Sbjct: 795 GDEDGNLSTVRFYNPRGLAFDEEGNLFVADSNNSKIKKITPDGDVTTFASGVNEPYGLA 853


>gi|424792718|ref|ZP_18218917.1| hypothetical protein XTG29_01651 [Xanthomonas translucens pv.
           graminis ART-Xtg29]
 gi|422796954|gb|EKU25368.1| hypothetical protein XTG29_01651 [Xanthomonas translucens pv.
           graminis ART-Xtg29]
          Length = 473

 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 41/76 (53%), Gaps = 4/76 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIA 139
           V R  G G   + DG  G A F +P+  A++ +G +YVAD  NH +R+I  L   V T+ 
Sbjct: 199 VLRQFGLGTADFIDGGIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLLSGHVDTLC 257

Query: 140 GGGSKKEGRADGPAQN 155
           G G   E   +GP Q+
Sbjct: 258 GNGRACE-PVEGPVQH 272



 Score = 45.1 bits (105), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 62/148 (41%), Gaps = 12/148 (8%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S+ L   S+  R AG G     DG    A F +P S A  ++  +YV D     +R +  
Sbjct: 303 SYELGNRSLRWR-AGAGALELRDGSGHLAAFAQPCSLAA-VQQALYVCDALGSAVRALQL 360

Query: 133 LG--VTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            G  V T+ GG G    G  DGP   A       +   P    L I+D GN  +R + L 
Sbjct: 361 RGDLVQTLLGGQGPWDFGNEDGPRSRARLQFPQAIALSPDAPLLWIADSGNGSLRSLRLG 420

Query: 190 PEDCSKS-------SQSGSALGAVSVWV 210
             D S +         +G A+ A +VW+
Sbjct: 421 GGDLSTTPLPRRLHGPAGLAVSAGTVWI 448


>gi|219851969|ref|YP_002466401.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219546228|gb|ACL16678.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 676

 Score = 45.1 bits (105), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 41/89 (46%), Gaps = 9/89 (10%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
           G  +F++P   AVD  GNIYV D +N  I+K    G  T   GGS   G  DG   N   
Sbjct: 40  GDGQFNQPSGVAVDSDGNIYVVDTNNFRIQKFNATGGFTTQWGGS---GPGDGQFNNP-- 94

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
               E   V +   + I+D  N  I++ N
Sbjct: 95  ----EGVAVDNNGNVYIADRDNNRIQKFN 119



 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 16/37 (43%), Positives = 25/37 (67%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           PG  +F+ P+  AVD  GN+Y+AD+ N+ I+K  + G
Sbjct: 86  PGDGQFNNPEGVAVDNNGNVYIADRDNNRIQKFNSSG 122



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 46/106 (43%), Gaps = 9/106 (8%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
           G  +F++P   A+D  GN+YV DK N+ I+K  + G   +  G    EG  DG     S 
Sbjct: 134 GDGQFNQPSGVALDSAGNVYVTDKQNNRIQKFNSSGGFLMKWG---SEGSGDGQVHWPS- 189

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
                   V +  ++ + D  N  I++ N      +K    G+  G
Sbjct: 190 -----GVAVDNTGSVYVVDSYNHRIQKFNATGGFITKWGSEGTGDG 230



 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 21/32 (65%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G  +F++P   AVD  GN+YVAD  N+ I+K 
Sbjct: 569 GDGQFNRPSGVAVDSAGNVYVADSGNNRIQKF 600


>gi|402571882|ref|YP_006621225.1| hypothetical protein Desmer_1347 [Desulfosporosinus meridiei DSM
           13257]
 gi|402253079|gb|AFQ43354.1| conserved repeat protein [Desulfosporosinus meridiei DSM 13257]
          Length = 2025

 Score = 45.1 bits (105), Expect = 0.034,   Method: Composition-based stats.
 Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 11/107 (10%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNAS 157
           G   FD P++ AVD   NIY+ D + H I+K++++G +    G    + G+ + P   A 
Sbjct: 688 GQWFFDVPRAAAVDSSSNIYIVDSNRHQIKKLSSMGEILATWGSYGNENGQMNVPGHIAL 747

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
            S D           + ++D GN  I++ +   E   K   SGS  G
Sbjct: 748 DSED----------NVYVADTGNNRIQKFSSTGEYLMKFGSSGSGEG 784



 Score = 43.1 bits (100), Expect = 0.13,   Method: Composition-based stats.
 Identities = 28/73 (38%), Positives = 37/73 (50%), Gaps = 6/73 (8%)

Query: 89   GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEG 147
            G  G   GE     F++P+  AVD +GNIYVAD +NH I+K    G   T  G    + G
Sbjct: 966  GSNGSDQGE-----FNRPQGIAVDSEGNIYVADSNNHRIQKFNAAGAFMTTWGSFGTELG 1020

Query: 148  RADGPAQNASFSN 160
            + + P   A  SN
Sbjct: 1021 QFNSPKGIAVDSN 1033



 Score = 38.9 bits (89), Expect = 2.3,   Method: Composition-based stats.
 Identities = 19/48 (39%), Positives = 26/48 (54%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           G+ +  +     G  +F  PKS  VD  GNIYVAD +N  I+K  + G
Sbjct: 770 GEYLMKFGSSGSGEGQFRNPKSVTVDSAGNIYVADTTNKRIQKFDSDG 817



 Score = 38.5 bits (88), Expect = 3.0,   Method: Composition-based stats.
 Identities = 26/61 (42%), Positives = 35/61 (57%), Gaps = 5/61 (8%)

Query: 96   GEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK--KEGRADG 151
            G PG+  ++F  P   AVD  GN+YVAD S + IRK T  G T +A  G++    G+ D 
Sbjct: 1156 GSPGAGDSQFLFPHGIAVDSAGNVYVADTSANWIRKYTPEG-TLLAKWGTRGNSAGQFDN 1214

Query: 152  P 152
            P
Sbjct: 1215 P 1215



 Score = 38.1 bits (87), Expect = 4.2,   Method: Composition-based stats.
 Identities = 25/60 (41%), Positives = 33/60 (55%), Gaps = 6/60 (10%)

Query: 85   LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGS 143
            LA  G +G S G+     FD P   A D  GN+YVAD +N+ I+K+     VT  + GGS
Sbjct: 1199 LAKWGTRGNSAGQ-----FDNPYGIAADSAGNVYVADTNNNRIQKLAPAYTVTFDSNGGS 1253



 Score = 38.1 bits (87), Expect = 4.4,   Method: Composition-based stats.
 Identities = 25/79 (31%), Positives = 38/79 (48%), Gaps = 3/79 (3%)

Query: 77  SEESVVKRLA--GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           SE + +++ A  GD ++ +  G      F  P   A D +GNIYV+D +N+ I K  + G
Sbjct: 304 SENNRIQKFASSGDFLKKWGRGGSADGEFSLPFGIARDSEGNIYVSDTTNNRIEKFDSTG 363

Query: 135 V-TTIAGGGSKKEGRADGP 152
              T  G   K   + D P
Sbjct: 364 AFVTTWGSFGKDNHQLDMP 382



 Score = 37.0 bits (84), Expect = 7.9,   Method: Composition-based stats.
 Identities = 15/34 (44%), Positives = 22/34 (64%)

Query: 101  ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
             +F+ PK  AVD  GN+YVAD  N  ++K  ++G
Sbjct: 1020 GQFNSPKGIAVDSNGNVYVADVENDRVQKFDSMG 1053


>gi|218190788|gb|EEC73215.1| hypothetical protein OsI_07299 [Oryza sativa Indica Group]
          Length = 1103

 Score = 45.1 bits (105), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 55/110 (50%), Gaps = 9/110 (8%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAGGGS 143
           G   +G  DG   +A F++P+  A + K NI YVAD  NH +R+I  +   V T+AG G+
Sbjct: 622 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINFVSETVKTLAGNGT 681

Query: 144 K-KEGRADGPAQNAS-----FSNDFELTFVPHICALLISDHGNQLIRQIN 187
           K  + R  G   N +      ++ +++ + P    L I+  G   I + N
Sbjct: 682 KGSDYRGGGQGTNQACFFMVLNSPWDVCYDPSKETLYIAMAGQHQIWKHN 731



 Score = 43.5 bits (101), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 71/184 (38%), Gaps = 35/184 (19%)

Query: 30  DGYTVTTVIDGHQLEIN-----------PHSVIDRPGSSDLIVLDSSRSAFYTLSF---- 74
           DG T     DG++  +N           P  +   P   +LIV DS  S+   ++     
Sbjct: 734 DGVTEVLSGDGYERNLNGSSSTRTSFAQPSGISLAPELQELIVADSESSSIRVVNLKSGG 793

Query: 75  --------PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
                   P+  E++ +        G  DG         P          IYVAD  NH 
Sbjct: 794 SRSLAGGDPMIPENLFRF-------GDHDGTGSDVLLQHPLGVVYASDNQIYVADSYNHK 846

Query: 127 IRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
           I+++      VTTIAG G  + G  DGPA +A  S    L  V     LL++D  N  IR
Sbjct: 847 IKRLDPVTRKVTTIAGTG--RAGYKDGPALSAQLSEPAGLVEVGD-GRLLVADTNNSTIR 903

Query: 185 QINL 188
            I L
Sbjct: 904 YIVL 907



 Score = 41.6 bits (96), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 63/141 (44%), Gaps = 13/141 (9%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG--YSDGEPGSAR------FDKPKSFA 110
           L V D+   A   ++F +SE   VK LAG+G +G  Y  G  G+ +       + P    
Sbjct: 653 LYVADTENHALREINF-VSE--TVKTLAGNGTKGSDYRGGGQGTNQACFFMVLNSPWDVC 709

Query: 111 VD-MKGNIYVADKSNHVIRKITNL-GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
            D  K  +Y+A    H I K   L GVT +  G   +       +   SF+    ++  P
Sbjct: 710 YDPSKETLYIAMAGQHQIWKHNTLDGVTEVLSGDGYERNLNGSSSTRTSFAQPSGISLAP 769

Query: 169 HICALLISDHGNQLIRQINLK 189
            +  L+++D  +  IR +NLK
Sbjct: 770 ELQELIVADSESSSIRVVNLK 790



 Score = 40.8 bits (94), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 42/96 (43%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P   A+D+  N      SNH    +TNL    I   GS +EG  DG    ASF+    L 
Sbjct: 586 PGKLAIDVLNNRLFISDSNHNRIVVTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQGLA 645

Query: 166 FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
           +      L ++D  N  +R+IN   E     + +G+
Sbjct: 646 YNSKKNILYVADTENHALREINFVSETVKTLAGNGT 681


>gi|149176433|ref|ZP_01855047.1| serine/threonine protein kinase [Planctomyces maris DSM 8797]
 gi|148844785|gb|EDL59134.1| serine/threonine protein kinase [Planctomyces maris DSM 8797]
          Length = 368

 Score = 45.1 bits (105), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 4/68 (5%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTT 137
           +  +AG G +G+S DG P  SA F +P S A+D   N+Y+ D  NH IR++  T   V+T
Sbjct: 138 ISTVAGTGEKGFSGDGGPATSATFSRPHSIALDNDNNLYICDIGNHRIRRVDLTTGTVST 197

Query: 138 IAGGGSKK 145
            +G G +K
Sbjct: 198 FSGTGERK 205



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 59/124 (47%), Gaps = 5/124 (4%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL--GVT 136
            VK LAG G  G++ DG P   A+  +P    +   G +YV +  +HVIR+I      ++
Sbjct: 23  TVKTLAGTGKVGHTGDGGPAVKAQVGEPYGLTLGPDGALYVCEIKSHVIRRIDEKTGKIS 82

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKS 196
           T+AG G K      G A  A  +  +E+ F      +   +  N ++R+++ K    S  
Sbjct: 83  TVAGSGKKGYSGDSGLALAAELNEPYEVRF-DKAGNMYFVEMINNIVRRVDAKTGKISTV 141

Query: 197 SQSG 200
           + +G
Sbjct: 142 AGTG 145



 Score = 38.9 bits (89), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 1/63 (1%)

Query: 70  YTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +T+     ++  ++   GDG +G   DG+P   R  +P    V   GN+Y+ D   + +R
Sbjct: 301 HTIRVIRKKDGTIETAVGDGKKGDGPDGDPAHCRMARPHGVYVGPNGNVYIGDSETYRVR 360

Query: 129 KIT 131
           K+T
Sbjct: 361 KLT 363



 Score = 38.5 bits (88), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 37/130 (28%), Positives = 56/130 (43%), Gaps = 20/130 (15%)

Query: 39  DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL----------AGD 88
           DG ++E  P   ++ P + D     S + + Y     L E + V R+          AG 
Sbjct: 209 DGAKVEGTP---LNGPRALDFFTDGSGKGSLY---LALREGNKVYRIDLDDQTLHHIAGT 262

Query: 89  GVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
           G +GY+   G    A    PK  +V   G++Y+AD  +H IR I   +  + T  G G K
Sbjct: 263 GKKGYTGHGGPAKKATLSGPKGISVAPNGDVYLADTESHTIRVIRKKDGTIETAVGDGKK 322

Query: 145 KEGRADGPAQ 154
            +G    PA 
Sbjct: 323 GDGPDGDPAH 332


>gi|145340877|ref|XP_001415544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575767|gb|ABO93836.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1675

 Score = 45.1 bits (105), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 68/159 (42%), Gaps = 21/159 (13%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGD-----GVQGYS----DGEPGSARFDKPKSFA 110
           +V D S  A Y L    +  S +   +G      G QG S    +  P SARF+ P+  A
Sbjct: 186 MVFDESDGAIYFLGSSQNHVSKIAYGSGTVSTYAGSQGTSGLVDNVTPSSARFNAPEGLA 245

Query: 111 VDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGS--------KKEGRADGPAQ-NASF 158
           +D    I YVAD  NH +R I  +   VTT+ G G+          +G    PA+ N   
Sbjct: 246 LDGANRILYVADTGNHAVRAIDLSTGAVTTVLGDGTISASGAVLDSDGILSTPARFNYPA 305

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSS 197
              + L        LL++D G   IR+I L     S ++
Sbjct: 306 GIAYNLDSSLSSGVLLVTDRGTHQIRKIILTDATASNAA 344


>gi|224125014|ref|XP_002319481.1| predicted protein [Populus trichocarpa]
 gi|222857857|gb|EEE95404.1| predicted protein [Populus trichocarpa]
          Length = 1065

 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 3/68 (4%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVT 136
           + + ++   G +G  DG    A F++P+  A + K NI YVAD  NH +R++   +  V 
Sbjct: 608 NFIAQIGSSGEEGLRDGSFDDATFNRPQGLAYNSKKNILYVADTENHALREVDFVSEKVR 667

Query: 137 TIAGGGSK 144
           T+AG G+K
Sbjct: 668 TLAGNGTK 675



 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 40/76 (52%), Gaps = 5/76 (6%)

Query: 115 GNIYVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
           G IY+AD  NH I+K  +    VTTIAG G  K G  DG A  A  S    L    +   
Sbjct: 824 GLIYIADSYNHKIKKLDLATKRVTTIAGTG--KAGFKDGKALTAQLSEPAGLIEAEN-GR 880

Query: 173 LLISDHGNQLIRQINL 188
           L+I+D  N +IR ++L
Sbjct: 881 LIIADTNNSVIRYLDL 896


>gi|156406833|ref|XP_001641249.1| predicted protein [Nematostella vectensis]
 gi|156228387|gb|EDO49186.1| predicted protein [Nematostella vectensis]
          Length = 708

 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 36/89 (40%), Positives = 46/89 (51%), Gaps = 9/89 (10%)

Query: 77  SEESVVKRLAG--DGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-- 131
           S E VV  + G  DG   G+ DG    A+F  P+  A+ +   IYVAD  NH IR+I   
Sbjct: 229 STEGVVHEVIGGEDGFSAGFQDGTFKEAKFHAPQGVAM-LGEVIYVADTENHAIREINLD 287

Query: 132 NLGVTTIAGGGSK---KEGRADGPAQNAS 157
           +  VTT+AG G +   KEG   G  Q  S
Sbjct: 288 SKKVTTVAGTGRQGADKEGGRQGREQTIS 316



 Score = 37.4 bits (85), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 31/122 (25%), Positives = 51/122 (41%), Gaps = 11/122 (9%)

Query: 84  RLAGDG-VQGYSDGEPGSARFDKPKSFAVDMKGNI---YVADKSNHVIRKI-TNLGVTTI 138
           R AG G  +  ++  P  A F +P   AV +   +   +VAD  +  +R +    G T  
Sbjct: 372 RFAGSGNEENRNNSYPHKAAFAQPSGLAVAISNTLRCLFVADSESSSVRCVDIETGATKA 431

Query: 139 AGGGSKKE------GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
             G  +        G  DG   +A F +   + + P    LL++D  N  I+ +  K + 
Sbjct: 432 VVGADRDPMNLFAFGDVDGKGVDAKFQHPLGVAWNPQEDHLLVADSYNHKIKTVEPKQKT 491

Query: 193 CS 194
           CS
Sbjct: 492 CS 493


>gi|312198573|ref|YP_004018634.1| serine/threonine protein kinase [Frankia sp. EuI1c]
 gi|311229909|gb|ADP82764.1| serine/threonine protein kinase [Frankia sp. EuI1c]
          Length = 797

 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 42/116 (36%), Positives = 57/116 (49%), Gaps = 18/116 (15%)

Query: 87  GDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTIAG 140
           G G  G+S G+ GSA   R D P   AV   G ++VAD  N +IR++ ++G   V T+AG
Sbjct: 449 GTGEAGFS-GDTGSASRSRIDTPTGLAVGPDGALFVADSGNGLIRRV-DVGADLVETVAG 506

Query: 141 GGSKKEGRADGPAQNASFSND-FELTF-------VPHICALLISDHGNQLIRQINL 188
           G ++   R DG       S D  EL         V     LLI+D GN  + +I L
Sbjct: 507 GAARA--RPDGAEPRRWTSLDPAELVLRGPRGLTVDAAGRLLIADTGNHRVLRIGL 560


>gi|290995061|ref|XP_002680150.1| predicted protein [Naegleria gruberi]
 gi|284093769|gb|EFC47406.1| predicted protein [Naegleria gruberi]
          Length = 880

 Score = 45.1 bits (105), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 42/120 (35%), Positives = 59/120 (49%), Gaps = 29/120 (24%)

Query: 86  AGDGVQGYS-DGEPG-SARFDKPKSFAVD-MKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           AG G  G+S DG    +A+   P+  +VD  KG++Y+AD  N+ IRK++N  ++TIAG G
Sbjct: 410 AGTGSGGFSGDGVLAINAKLYFPRGVSVDPFKGDVYIADSYNNRIRKVSNGFISTIAGTG 469

Query: 143 SKKEGRADGPAQNASFSNDFELTF-----VPHICA-------LLISDHGNQLIRQINLKP 190
           S            A F+ D EL        P+  A       + I D  N  IR  N+KP
Sbjct: 470 S------------AGFTGDGELAIAAKLDTPYSVAYSNTSGLVYILDTNNARIR--NIKP 515


>gi|440733650|ref|ZP_20913345.1| nhl repeat protein [Xanthomonas translucens DAR61454]
 gi|440359672|gb|ELP96967.1| nhl repeat protein [Xanthomonas translucens DAR61454]
          Length = 473

 Score = 45.1 bits (105), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 62/148 (41%), Gaps = 12/148 (8%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S+ L   S+  R AG G     DG    A F +P S A  ++  +YV D     +R +  
Sbjct: 303 SYELGNRSLRWR-AGAGALELRDGSGHLAAFAQPCSLAA-VQQALYVCDALGSAVRSLQL 360

Query: 133 LG--VTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            G  V T+ GG G    G  DGP   A       +   P    L I+D GN  +R + L 
Sbjct: 361 RGDLVQTLLGGQGPWDFGNEDGPRSRARLQFPQAIALSPDAPLLWIADSGNGSLRSLRLG 420

Query: 190 PEDCSKS-------SQSGSALGAVSVWV 210
             D S +         +G A+ A +VW+
Sbjct: 421 GGDLSTTPLPRRLHGPAGLAVSAGTVWI 448



 Score = 44.7 bits (104), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 4/74 (5%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGG 141
           R  G G   + DG  G A F +P+  A++ +G +YVAD  NH +R+I  L   V T+ G 
Sbjct: 201 RQFGLGTADFIDGGIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLLSGHVDTLCGN 259

Query: 142 GSKKEGRADGPAQN 155
           G   E   +GP Q+
Sbjct: 260 GRAGE-PVEGPVQH 272


>gi|222624800|gb|EEE58932.1| hypothetical protein OsJ_10596 [Oryza sativa Japonica Group]
          Length = 1017

 Score = 45.1 bits (105), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 55/110 (50%), Gaps = 9/110 (8%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAGGGS 143
           G   +G  DG   +A F++P+  A + K NI YVAD  NH +R+I  +   V T+AG G+
Sbjct: 536 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINFVSETVKTLAGNGT 595

Query: 144 K-KEGRADGPAQNAS-----FSNDFELTFVPHICALLISDHGNQLIRQIN 187
           K  + R  G   N +      ++ +++ + P    L I+  G   I + N
Sbjct: 596 KGSDYRGGGQGTNQACFFMVLNSPWDVCYDPSKETLYIAMAGQHQIWKHN 645



 Score = 43.5 bits (101), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 71/184 (38%), Gaps = 35/184 (19%)

Query: 30  DGYTVTTVIDGHQLEIN-----------PHSVIDRPGSSDLIVLDSSRSAFYTLSF---- 74
           DG T     DG++  +N           P  +   P   +LIV DS  S+   ++     
Sbjct: 648 DGVTEVLSGDGYERNLNGSSSTRTSFAQPSGISLAPELQELIVADSESSSIRVVNLKSGG 707

Query: 75  --------PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
                   P+  E++ +        G  DG         P          IYVAD  NH 
Sbjct: 708 SRSLAGGDPMIPENLFRF-------GDHDGTGSDVLLQHPLGVVYASDNQIYVADSYNHK 760

Query: 127 IRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
           I+++      VTTIAG G  + G  DGPA +A  S    L  V     LL++D  N  IR
Sbjct: 761 IKRLDPVTRKVTTIAGTG--RAGYKDGPALSAQLSEPAGLVEVGD-GRLLVADTNNSTIR 817

Query: 185 QINL 188
            I L
Sbjct: 818 YIVL 821



 Score = 41.2 bits (95), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 63/141 (44%), Gaps = 13/141 (9%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG--YSDGEPGSAR------FDKPKSFA 110
           L V D+   A   ++F +SE   VK LAG+G +G  Y  G  G+ +       + P    
Sbjct: 567 LYVADTENHALREINF-VSE--TVKTLAGNGTKGSDYRGGGQGTNQACFFMVLNSPWDVC 623

Query: 111 VD-MKGNIYVADKSNHVIRKITNL-GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVP 168
            D  K  +Y+A    H I K   L GVT +  G   +       +   SF+    ++  P
Sbjct: 624 YDPSKETLYIAMAGQHQIWKHNTLDGVTEVLSGDGYERNLNGSSSTRTSFAQPSGISLAP 683

Query: 169 HICALLISDHGNQLIRQINLK 189
            +  L+++D  +  IR +NLK
Sbjct: 684 ELQELIVADSESSSIRVVNLK 704



 Score = 40.8 bits (94), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 42/96 (43%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P   A+D+  N      SNH    +TNL    I   GS +EG  DG    ASF+    L 
Sbjct: 500 PGKLAIDVLNNRLFISDSNHNRIVVTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQGLA 559

Query: 166 FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
           +      L ++D  N  +R+IN   E     + +G+
Sbjct: 560 YNSKKNILYVADTENHALREINFVSETVKTLAGNGT 595


>gi|406909052|gb|EKD49392.1| lipoprotein, partial [uncultured bacterium]
          Length = 552

 Score = 44.7 bits (104), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 49/100 (49%), Gaps = 6/100 (6%)

Query: 95  DGEPGS-----ARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           DGE G+     A F+KP +  V+  G   YV D  N+++R+I      T    G++  G 
Sbjct: 340 DGETGANKSGCAYFNKPSAIVVNRAGTKAYVTDSGNNMLREINLSTGKTRKLAGTRSAGF 399

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            +G    A+F+   +L        L ++D GN  IR+INL
Sbjct: 400 KNGFGPLATFNGPADLDLDAAGANLYVADKGNHAIRRINL 439



 Score = 42.4 bits (98), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 46/179 (25%), Positives = 78/179 (43%), Gaps = 33/179 (18%)

Query: 47  PHSVIDRPGSSDLIV-LDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           PHS++  P    L   +++    + T+S        V+ +AG  +  Y +G+  S RF  
Sbjct: 229 PHSIVWNPTKPILYAAINNQLIEYNTVS------GKVRFMAGHPMDNYVEGQGSSVRFSN 282

Query: 106 PKSFAVDMKGN-IYVADKSNHVIRKITNLGVTT--IAGGG----------SKKEGRADGP 152
           P    +   GN +Y+AD+ NH IRK++     T  I G G          + +EG   GP
Sbjct: 283 P-DIDISPDGNWLYIADQYNHRIRKMSTKTKKTEYITGAGEVNFYSPNYSAYQEG---GP 338

Query: 153 AQNASFSNDFELTFVPHICALLI---------SDHGNQLIRQINLKPEDCSKSSQSGSA 202
               + +N     +     A+++         +D GN ++R+INL      K + + SA
Sbjct: 339 CDGETGANKSGCAYFNKPSAIVVNRAGTKAYVTDSGNNMLREINLSTGKTRKLAGTRSA 397



 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 70/192 (36%), Gaps = 61/192 (31%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
           V++R G+    V DS  +    ++    +    ++LAG    G+ +G    A F+ P   
Sbjct: 360 VVNRAGTK-AYVTDSGNNMLREINLSTGK---TRKLAGTRSAGFKNGFGPLATFNGPADL 415

Query: 110 AVDMKG-NIYVADKSNHVIRKIT--NLGVTTIAG-------------------------- 140
            +D  G N+YVADK NH IR+I    L V+T+ G                          
Sbjct: 416 DLDAAGANLYVADKGNHAIRRINLATLNVSTVVGKGKPGYQEGSKKTALLQMPTGIEVRG 475

Query: 141 ---------------------------GGSKKEGRADGPAQNASFSNDFELTFVPHICA- 172
                                       GS K GR  GP+  + F+    L   P   + 
Sbjct: 476 NNMIFWSESGSQRIRMYNTTENQTRLFAGSGKRGRLTGPSTRSQFNEPKGLAINPKDGSK 535

Query: 173 LLISDHGNQLIR 184
           L I+D+ N  I+
Sbjct: 536 LYIADYYNDSIK 547


>gi|398337816|ref|ZP_10522521.1| hypothetical protein LkmesMB_21054 [Leptospira kmetyi serovar
           Malaysia str. Bejo-Iso9]
          Length = 359

 Score = 44.7 bits (104), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 1/100 (1%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           G  G  +G+   A F+ P     D + N+Y+ +  NH IRK  NL   +++       G 
Sbjct: 164 GTAGLQNGDNSGALFNGPFFMDFDRERNLYLGELGNHDIRKF-NLNFQSVSTLSGSSLGY 222

Query: 149 ADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            DG    A F +   L +     +LL+ D  +  IR+INL
Sbjct: 223 QDGSVSTALFKSPIGLVYDRKKNSLLVVDIQDHRIRKINL 262



 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 33/69 (47%)

Query: 62  LDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
           LD      Y     ++   VV   AG G     DG+  +A F  P    +D  GNIYV++
Sbjct: 37  LDCILKECYLCRMKVTGNPVVTLFAGTGTASSVDGDISTATFQNPFGIELDSFGNIYVSE 96

Query: 122 KSNHVIRKI 130
           ++ + IRKI
Sbjct: 97  QNVNKIRKI 105


>gi|157112324|ref|XP_001657496.1| hypothetical protein AaeL_AAEL000965 [Aedes aegypti]
 gi|108883769|gb|EAT47994.1| AAEL000965-PA [Aedes aegypti]
          Length = 812

 Score = 44.7 bits (104), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 3/73 (4%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
           ++ R+ G    G+ DG+   ARF+ P+  A      ++VAD  NH IRKI   +  VTT+
Sbjct: 345 ILYRIGGKRA-GFVDGDFRKARFNAPQGVAFQNDTTLFVADNENHAIRKIDLKSQQVTTV 403

Query: 139 AGGGSKKEGRADG 151
            G G +   R  G
Sbjct: 404 VGNGQQGCDRIGG 416



 Score = 38.1 bits (87), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 59/129 (45%), Gaps = 12/129 (9%)

Query: 85  LAGDG-VQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRK--ITNLGVTTIAG 140
           +AG+G  +  ++  P +A F +P   A++     IY+AD  +  +RK  +++  V  +AG
Sbjct: 489 IAGNGREENRNNSYPNNAAFSQPSGLAINKDAKEIYLADSESSSVRKMSLSDGKVLAVAG 548

Query: 141 GGSK-----KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE---D 192
           G          G  DG   +A F +   + +      + ++D  N  I++IN        
Sbjct: 549 GDRNPLDLFSFGDIDGKQYSAKFQHPLGVAYNSKDNCIYVADTYNHKIKKINASTNVATT 608

Query: 193 CSKSSQSGS 201
           CS   +SGS
Sbjct: 609 CSFQDESGS 617


>gi|433676068|ref|ZP_20508221.1| RING finger protein nhl-1 [Xanthomonas translucens pv. translucens
           DSM 18974]
 gi|430818813|emb|CCP38486.1| RING finger protein nhl-1 [Xanthomonas translucens pv. translucens
           DSM 18974]
          Length = 473

 Score = 44.7 bits (104), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 4/74 (5%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGG 141
           R  G G   + DG  G A F +P+  A++ +G +YVAD  NH +R+I  L   V T+ G 
Sbjct: 201 RQFGLGTADFIDGGIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLLSGHVDTLCGN 259

Query: 142 GSKKEGRADGPAQN 155
           G   E   +GP Q+
Sbjct: 260 GRAGE-PVEGPVQH 272



 Score = 44.3 bits (103), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 62/148 (41%), Gaps = 12/148 (8%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S+ L   S+  R AG G     DG    A F +P S A  ++  +YV D     +R +  
Sbjct: 303 SYELGNRSLRWR-AGAGALELRDGSGHLAAFAQPCSVAA-VQQALYVCDALGSAVRSLQL 360

Query: 133 LG--VTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            G  V T+ GG G    G  DGP   A       +   P    L I+D GN  +R + L 
Sbjct: 361 RGDLVQTLLGGQGPWDFGNEDGPRSRARLQFPQAIALSPDAPLLWIADSGNGSLRSLRLG 420

Query: 190 PEDCSKS-------SQSGSALGAVSVWV 210
             D S +         +G A+ A +VW+
Sbjct: 421 GGDLSTTPLPRRLHGPAGLAVSAGTVWI 448


>gi|171913874|ref|ZP_02929344.1| NHL repeat containing protein [Verrucomicrobium spinosum DSM 4136]
          Length = 358

 Score = 44.7 bits (104), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 47/106 (44%), Gaps = 2/106 (1%)

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQ 154
           +P  A F+      +   G   + D  NH +R++   +  VTTIAG G    G   GPA 
Sbjct: 93  DPQKALFNGMHDIQITNDGKAILGDSFNHRVRQLDLKSGTVTTIAGTGKAAFGGDGGPAT 152

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSG 200
            ASF+     T  P      ++D GN   R+I+L     S  + SG
Sbjct: 153 AASFNITMTATLSPDRQRYYVADIGNHRTREIDLATGRISTVAGSG 198


>gi|290975165|ref|XP_002670314.1| predicted protein [Naegleria gruberi]
 gi|284083871|gb|EFC37570.1| predicted protein [Naegleria gruberi]
          Length = 1504

 Score = 44.7 bits (104), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           ++++  V  +AG G+ GYS  G+P  +    P    V   G+IY++D  NH IR+I N  
Sbjct: 471 ITQDGFVVNVAGTGISGYSLIGDPLLSYLSNPNGIVVLKNGDIYISDTDNHCIRQIKNGT 530

Query: 135 VTTIAG 140
           ++   G
Sbjct: 531 LSNFVG 536



 Score = 37.7 bits (86), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 40/72 (55%), Gaps = 5/72 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDM--KGNIYVADKSNHVIRKIT 131
           +S E +V  + G G  G++  +  +  A  + P + AV    + + Y  D  N +IRK++
Sbjct: 789 ISIEGIVSTIVGKGF-GFNGNKVRNEDALLNLPSNMAVSSLNENDFYFCDTGNRLIRKVS 847

Query: 132 NLGVTTIAGGGS 143
           N  VTTIAGGG+
Sbjct: 848 NGYVTTIAGGGN 859


>gi|427731644|ref|YP_007077881.1| thiol-disulfide isomerase-like thioredoxin [Nostoc sp. PCC 7524]
 gi|427367563|gb|AFY50284.1| thiol-disulfide isomerase-like thioredoxin [Nostoc sp. PCC 7524]
          Length = 505

 Score = 44.7 bits (104), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 3/58 (5%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
           + + GDG  G  DG+  SARF +P   +  M  N+Y+ D +NH IR++    L VTT+
Sbjct: 433 QTILGDGNAGLQDGQGKSARFFEPSGLSA-MDSNLYICDTNNHAIRRVDLNTLEVTTL 489



 Score = 44.3 bits (103), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 35/67 (52%), Gaps = 5/67 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG---VTT 137
           V  L G G  G++DG    A+F  P+    D    I YVAD  NH +R+I NL    V T
Sbjct: 206 VLHLIGTGKSGFTDGAFDEAQFSAPQGMTFDAANQILYVADTDNHAVRQI-NLKRQIVET 264

Query: 138 IAGGGSK 144
           IAG G +
Sbjct: 265 IAGTGEQ 271


>gi|184200016|ref|YP_001854223.1| hypothetical protein KRH_03700 [Kocuria rhizophila DC2201]
 gi|183580246|dbj|BAG28717.1| hypothetical protein [Kocuria rhizophila DC2201]
          Length = 690

 Score = 44.7 bits (104), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 50/88 (56%), Gaps = 14/88 (15%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV-------DMKGNIYVADKSNHVIRK 129
           S+ + V+R  G G +G+SDG    ARF++P+  AV       ++  ++ VAD  NH +R 
Sbjct: 205 SDLTTVRRAIGSGARGWSDGPRDVARFNEPQGIAVLPGRLRSEVGYDLVVADTVNHRLRG 264

Query: 130 ITNLG---VTTIAGGGSKK---EGRADG 151
           + +LG   V T+AG G ++   + RA G
Sbjct: 265 V-DLGSGSVRTLAGNGVQRLLDDARARG 291



 Score = 38.1 bits (87), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 16/46 (34%), Positives = 26/46 (56%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LAG G +G  DG    + F +P   A D  G+++VAD  +  +R++
Sbjct: 345 LAGTGDEGLRDGSAHESWFAQPSGLAQDAHGDVWVADSESSALRRV 390


>gi|158318814|ref|YP_001511322.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158114219|gb|ABW16416.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 732

 Score = 44.7 bits (104), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 25/59 (42%), Positives = 37/59 (62%), Gaps = 3/59 (5%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           +AG G  GYS DG P   A   +P++ AVD +G + +AD+ +  IR++   G +TTIAG
Sbjct: 581 VAGSGTPGYSGDGGPAVHASLTRPQAVAVDAQGRLLIADQEHRRIRRVDTTGRITTIAG 639



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 53/107 (49%), Gaps = 5/107 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
           ++R++ DG        PG     +P+S  VD  G IY+AD  NH I ++    G   +AG
Sbjct: 527 LRRVSPDGRIETMPALPG---LRQPRSVTVDPHGVIYLADTGNHRIWRLDPGDGARVVAG 583

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            G+       GPA +AS +   +   V     LLI+D  ++ IR+++
Sbjct: 584 SGTPGYSGDGGPAVHASLTRP-QAVAVDAQGRLLIADQEHRRIRRVD 629


>gi|158315875|ref|YP_001508383.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158111280|gb|ABW13477.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 756

 Score = 44.7 bits (104), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 36/90 (40%), Positives = 48/90 (53%), Gaps = 10/90 (11%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           L  +  V  +AG G QG SDG+ G    AR     + AVD  G +Y+ + SN  IR+I  
Sbjct: 463 LGLDGKVSVVAGTGAQG-SDGDGGPAVQARLRGLDAVAVDGAGTVYLGEDSNGKIRRIGR 521

Query: 133 LGV-TTIAGGGSK--KEGRADGPAQNASFS 159
            GV TT+ GGG++  +EG    PA   S S
Sbjct: 522 DGVITTVVGGGTRLAREGE---PATEVSLS 548


>gi|296221249|ref|XP_002756652.1| PREDICTED: NHL repeat-containing protein 2 [Callithrix jacchus]
          Length = 726

 Score = 44.7 bits (104), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNTIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAQGEQQ--PISSPWDVVF 343


>gi|219851088|ref|YP_002465520.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219545347|gb|ACL15797.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 831

 Score = 44.7 bits (104), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 54/118 (45%), Gaps = 9/118 (7%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           + +F  P+  A+D  GN+YVAD+SNH + K T++G    A G    +G  DG        
Sbjct: 173 NGQFSYPEGIAIDGTGNVYVADESNHRVEKFTSIGTFLTAWG---TKGSEDGQFAYP--- 226

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSC 217
              +   V  +  + ++D GN  +++        +    SG+  G  S  V ++V S 
Sbjct: 227 ---DGVAVDSLGNVYVADSGNNRVQKFTSSGAFITDWGSSGTGSGQFSSPVDIAVDSS 281



 Score = 44.7 bits (104), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 54/118 (45%), Gaps = 9/118 (7%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           + +F  P+  A+D  GN+YVAD+SNH + K T++G    A G    +G  DG        
Sbjct: 473 NGQFSYPEGIAIDGTGNVYVADESNHRVEKFTSIGTFLTAWG---TKGSEDGQFAYP--- 526

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSC 217
              +   V  +  + ++D GN  +++        +    SG+  G  S  V ++V S 
Sbjct: 527 ---DGVAVDSLGNVYVADSGNNRVQKFTSSGAFITDWGSSGTGSGQFSSPVDIAVDSS 581



 Score = 43.5 bits (101), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 27/48 (56%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           GS +F  P   A+D  GN+YV D+ NH ++K T+ G+     G S  E
Sbjct: 125 GSGQFSSPYGVAIDSAGNVYVVDRGNHRVQKFTSSGIFVAKWGSSGSE 172



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 27/48 (56%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           GS +F  P   A+D  GN+YV D+ NH ++K T+ G+     G S  E
Sbjct: 425 GSGQFSSPYGVAIDGAGNVYVVDRGNHRVQKFTSSGIFVAKWGSSGSE 472



 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 51/108 (47%), Gaps = 9/108 (8%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           + +F  P   AVD  GN+YVAD+ N+ ++K T+ G T +A  G+   G       +  FS
Sbjct: 79  NGQFRTPTGIAVDSAGNVYVADRDNNRVQKFTSTG-TYLAKWGTSGTG-------SGQFS 130

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVS 207
           + + +  +     + + D GN  +++        +K   SGS  G  S
Sbjct: 131 SPYGVA-IDSAGNVYVVDRGNHRVQKFTSSGIFVAKWGSSGSENGQFS 177



 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 51/108 (47%), Gaps = 9/108 (8%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFS 159
           + +F  P   AVD  GN+YVAD+ N+ ++K T+ G T +A  G+   G       +  FS
Sbjct: 379 NGQFRTPTGIAVDSAGNVYVADRDNNRVQKFTSTG-TYLAKWGTSGTG-------SGQFS 430

Query: 160 NDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVS 207
           + + +  +     + + D GN  +++        +K   SGS  G  S
Sbjct: 431 SPYGVA-IDGAGNVYVVDRGNHRVQKFTSSGIFVAKWGSSGSENGQFS 477


>gi|257057817|ref|YP_003135649.1| Peroxiredoxin [Saccharomonospora viridis DSM 43017]
 gi|256587689|gb|ACU98822.1| Peroxiredoxin [Saccharomonospora viridis DSM 43017]
          Length = 644

 Score = 44.7 bits (104), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 45/94 (47%), Gaps = 8/94 (8%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           RF  P    V  +G I VAD +NH + +  +   T +   G  + G  DG    A F+  
Sbjct: 194 RF--PAKAVVTAEGRILVADTANHSVAEFASDAETLLRRFGDGRRGAVDGAFDVARFAEP 251

Query: 162 FELTFVP-HICA-----LLISDHGNQLIRQINLK 189
             LT +P HI       LL++D  N L+R ++L+
Sbjct: 252 SGLTLLPTHIADEVGYHLLVADTANHLLRGVDLR 285



 Score = 38.5 bits (88), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 30/121 (24%), Positives = 59/121 (48%), Gaps = 11/121 (9%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSNH 125
            F    E++++R  GDG +G  DG    ARF +P          A ++  ++ VAD +NH
Sbjct: 219 EFASDAETLLRRF-GDGRRGAVDGAFDVARFAEPSGLTLLPTHIADEVGYHLLVADTANH 277

Query: 126 VIRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
           ++R +      V T+AG G + ++G   G A     ++ +++ +      ++++  GN  
Sbjct: 278 LLRGVDLRTGAVRTVAGTGRQWRDGDDSGAALEVDLTSPWDVRWWDVAGGVVVAMAGNHT 337

Query: 183 I 183
           +
Sbjct: 338 L 338


>gi|383829668|ref|ZP_09984757.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
           xinjiangensis XJ-54]
 gi|383462321|gb|EID54411.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
           xinjiangensis XJ-54]
          Length = 681

 Score = 44.7 bits (104), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 43/90 (47%), Gaps = 6/90 (6%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P    V  +G + VAD +NH I +  +   T +   G  + G  DG    A F+    LT
Sbjct: 227 PSRAVVTAEGRVLVADAANHSIAEFASDAETLLRRFGDGERGAVDGAFDIARFAEPSGLT 286

Query: 166 FVP-HICA-----LLISDHGNQLIRQINLK 189
            +P HI       LL++D  N L+R I+L+
Sbjct: 287 LLPTHIAEQVGYHLLVADTANHLLRAIDLR 316



 Score = 41.2 bits (95), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 31/120 (25%), Positives = 60/120 (50%), Gaps = 11/120 (9%)

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSNHV 126
           F    E++++R  GDG +G  DG    ARF +P          A  +  ++ VAD +NH+
Sbjct: 251 FASDAETLLRRF-GDGERGAVDGAFDIARFAEPSGLTLLPTHIAEQVGYHLLVADTANHL 309

Query: 127 IRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
           +R I      V T+AG G + ++G   G A +   ++ +++ +   +  ++++  GN  +
Sbjct: 310 LRAIDLRTGAVRTVAGTGRQWRDGDDSGAALDVDLTSPWDVRWWDTVGGVVVAMAGNHTL 369



 Score = 37.0 bits (84), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 34/136 (25%), Positives = 54/136 (39%), Gaps = 20/136 (14%)

Query: 70  YTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           +TLS     E+ V R AG  V+G  DG    A F +    AVD    +++ D     +R 
Sbjct: 367 HTLSLFDPVEATVSRFAGTTVEGLRDGAATEAFFAQTSGLAVDGD-RLWLVDAETSALRY 425

Query: 130 ITNLGVTTIAGGGSKKE------------------GRADGPAQNASFSNDFELTFVPHIC 171
           +  +G T  AG   K                    G  DG A  A   +   +T +P   
Sbjct: 426 VERVGDTGDAGATDKTAESGGFVVRTAVGQDLFTFGHRDGAADTALLQHPLGVTVLPD-G 484

Query: 172 ALLISDHGNQLIRQIN 187
           ++ ++D  N  +R+ +
Sbjct: 485 SIGVADTYNGAVRRFD 500


>gi|378548897|ref|ZP_09824113.1| hypothetical protein CCH26_02380 [Citricoccus sp. CH26A]
          Length = 708

 Score = 44.7 bits (104), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 12/111 (10%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----------- 134
           AG G++G  DG+P  A F +    A    G I+VAD  +  +R+I+              
Sbjct: 396 AGTGLEGLEDGDPTVAWFAQSSGLAESADGTIWVADSESSALRRISPADPAAADGALSRR 455

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
           V++  G G    G  DG A  A F +   +  +P   ++L++D  N  IR+
Sbjct: 456 VSSAVGTGLFDFGFRDGAADQARFQHPLGVAALPD-GSVLVADTYNGAIRR 505


>gi|290969729|ref|XP_002667939.1| predicted protein [Naegleria gruberi]
 gi|284080810|gb|EFC35195.1| predicted protein [Naegleria gruberi]
          Length = 152

 Score = 44.7 bits (104), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 42/82 (51%), Gaps = 2/82 (2%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
           P+   V     +Y+AD  NH +RKI   G + TIAG G +     D PA NA  S+   +
Sbjct: 41  PEGVFVAPNDEVYMADIQNHRVRKILKDGTIVTIAGTGEEGFSGDDDPAINAQLSHPCGV 100

Query: 165 TFVPHICALLISDHGNQLIRQI 186
            FV     + ISD GN  IR+I
Sbjct: 101 -FVSLTNEVYISDFGNYRIRKI 121



 Score = 41.6 bits (96), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 3/85 (3%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           ++  +  +AG G +G+S D +P  +A+   P    V +   +Y++D  N+ IRKI  N  
Sbjct: 67  KDGTIVTIAGTGEEGFSGDDDPAINAQLSHPCGVFVSLTNEVYISDFGNYRIRKILRNGN 126

Query: 135 VTTIAGGGSKKEGRADGPAQNASFS 159
           + TI G G K      GPA NA + 
Sbjct: 127 IVTIVGTGVKGYSGDGGPAINAQYQ 151


>gi|383120609|ref|ZP_09941337.1| hypothetical protein BSIG_2379 [Bacteroides sp. 1_1_6]
 gi|382985045|gb|EES68418.2| hypothetical protein BSIG_2379 [Bacteroides sp. 1_1_6]
          Length = 483

 Score = 44.7 bits (104), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 40/89 (44%), Gaps = 4/89 (4%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARF 103
           NPH +        L + D   +    +    PL    V   +   G++GY DG P  A F
Sbjct: 394 NPHQICFTE-DGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMKGYKDGGPDIALF 452

Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           + P   AV   G I YVAD  N VIRK++
Sbjct: 453 NHPFGVAVSADGQIVYVADTGNKVIRKLS 481



 Score = 43.5 bits (101), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 50/103 (48%), Gaps = 5/103 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGVTTIAGGGSKKE 146
           G+ DG   +A+FD P        G +Y+AD  N+ IR I      +  + T   G    +
Sbjct: 381 GWEDGLLRNAKFDNPHQICFTEDGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMK 440

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G  DG    A F++ F +        + ++D GN++IR+++++
Sbjct: 441 GYKDGGPDIALFNHPFGVAVSADGQIVYVADTGNKVIRKLSIQ 483


>gi|375103275|ref|ZP_09749538.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
           cyanea NA-134]
 gi|374664007|gb|EHR63885.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
           cyanea NA-134]
          Length = 638

 Score = 44.7 bits (104), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 53/114 (46%), Gaps = 13/114 (11%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
           V R   DG +G+ DG     RF  P    V  +G + VAD +NH + +  +   T +   
Sbjct: 177 VARADTDG-RGHLDG----LRF--PSKAVVTAEGRVLVADTANHSVAEFASDAETLLRRF 229

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLK 189
           G  + G  DG    A F+    LT +P   A      LL++D  N L+R ++L+
Sbjct: 230 GDGRRGAVDGAFDVARFAEPSGLTLLPTHVADEVGYHLLVADTANHLLRGVDLR 283



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 60/121 (49%), Gaps = 11/121 (9%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSNH 125
            F    E++++R  GDG +G  DG    ARF +P          A ++  ++ VAD +NH
Sbjct: 217 EFASDAETLLRRF-GDGRRGAVDGAFDVARFAEPSGLTLLPTHVADEVGYHLLVADTANH 275

Query: 126 VIRKI--TNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQL 182
           ++R +      V T+AG G + ++G   GPA     ++ +++ +      ++++  GN  
Sbjct: 276 LLRGVDLRTGAVRTVAGTGRQWRDGDDSGPALGVDLTSPWDVRWWDAANGVVVAMAGNHT 335

Query: 183 I 183
           +
Sbjct: 336 L 336


>gi|326431538|gb|EGD77108.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
          Length = 1455

 Score = 44.7 bits (104), Expect = 0.045,   Method: Composition-based stats.
 Identities = 32/116 (27%), Positives = 52/116 (44%), Gaps = 4/116 (3%)

Query: 81   VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVT 136
            V+    G G  G+    G   +A+   P   A D +G +Y+AD  NHV+  +   +L + 
Sbjct: 1222 VLSVFVGTGEAGHRGMGGSSRAAQIHSPHGLAFDSRGTLYIADTENHVVYGVDRRSLVID 1281

Query: 137  TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
             +AG   +K    DG     ++ N      V     L I+D GN  IR ++L  ++
Sbjct: 1282 VVAGTPFRKGYHGDGRPATTAWLNAPTGIAVRGAGDLYIADTGNNRIRYVDLDSDN 1337



 Score = 38.9 bits (89), Expect = 2.5,   Method: Composition-based stats.
 Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 3/81 (3%)

Query: 80   SVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
            S +  +AG   +G +   G   +A+ D+P   AVD  GN+Y+++K NH +  +  + GV 
Sbjct: 1164 STITTIAGTKTRGSAGVGGLATAAQLDRPTCVAVDTIGNVYISEKGNHRVSVVDASTGVL 1223

Query: 137  TIAGGGSKKEGRADGPAQNAS 157
            ++  G  +   R  G +  A+
Sbjct: 1224 SVFVGTGEAGHRGMGGSSRAA 1244


>gi|149689662|ref|XP_001495693.1| PREDICTED: NHL repeat-containing protein 2 [Equus caballus]
          Length = 726

 Score = 44.7 bits (104), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 44/80 (55%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGTFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A+G  Q    S+ +++ F
Sbjct: 326 GGAEGEQQ--PISSPWDVVF 343


>gi|432922359|ref|XP_004080313.1| PREDICTED: NHL repeat-containing protein 2-like [Oryzias latipes]
          Length = 720

 Score = 44.7 bits (104), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 35/84 (41%), Positives = 43/84 (51%), Gaps = 8/84 (9%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLG--VTTIAGG-- 141
           G    G  DG    A F+ P+  A   KG+ +YVAD  NH+IRKI   G  V+TIAG   
Sbjct: 257 GGPASGRKDGSLSEASFNSPQGVA--FKGDAVYVADTENHLIRKIDLSGGRVSTIAGAGV 314

Query: 142 -GSKKEGRADGPAQNASFSNDFEL 164
            G+ +EG A G  Q  S   D  L
Sbjct: 315 QGTDREGGAVGVQQPISSPWDVTL 338


>gi|395226993|ref|ZP_10405365.1| hypothetical protein ThvES_00021150, partial [Thiovulum sp. ES]
 gi|394444658|gb|EJF05823.1| hypothetical protein ThvES_00021150, partial [Thiovulum sp. ES]
          Length = 240

 Score = 44.7 bits (104), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 30/50 (60%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V+   G G+ GY DG   +++ +KP+    D KGN+ V D  N +IR+I+
Sbjct: 168 VQNFVGSGIAGYQDGVGFNSQLNKPRGMTFDEKGNLLVVDSYNEMIREIS 217


>gi|219853189|ref|YP_002467621.1| PKD domain-containing protein [Methanosphaerula palustris E1-9c]
 gi|219547448|gb|ACL17898.1| PKD domain containing protein [Methanosphaerula palustris E1-9c]
          Length = 930

 Score = 44.7 bits (104), Expect = 0.045,   Method: Composition-based stats.
 Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 3/65 (4%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           G+ +  +     G+ +F+ P   AVD  GNIYV D  NH ++K T+ G      GG   E
Sbjct: 76  GEFITQWGSEGSGNGQFENPGGIAVDSAGNIYVTDTWNHRVQKFTSTGEFITKWGG---E 132

Query: 147 GRADG 151
           G  DG
Sbjct: 133 GSGDG 137



 Score = 42.0 bits (97), Expect = 0.24,   Method: Composition-based stats.
 Identities = 37/110 (33%), Positives = 53/110 (48%), Gaps = 18/110 (16%)

Query: 44  EIN-PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA 101
           E+N PH V +D  G+  +I   ++R   +T        S  + +A  G +G  DG+    
Sbjct: 326 ELNHPHGVALDDAGNVYIIDTWNNRVQIFT--------STGEFIAKWGSKGSEDGQ---- 373

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
            F  P S AVD  GN+YV D  N+ ++K T+ G      GG   EG  DG
Sbjct: 374 -FIYPYSIAVDSAGNVYVVDTGNNRVQKFTSTGTFITQWGG---EGFGDG 419



 Score = 38.9 bits (89), Expect = 2.1,   Method: Composition-based stats.
 Identities = 30/101 (29%), Positives = 48/101 (47%), Gaps = 16/101 (15%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
           +DR G+  ++   ++R   +T     S  + + +  G G +   DG+     F+ P   A
Sbjct: 476 VDRAGNVYVVDTGNNRVQIFT-----STGTFIAQWGGSGSR---DGQ-----FNYPGGIA 522

Query: 111 VDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           VD  GN+YV D+SN+  +K T+ G      G    EG  DG
Sbjct: 523 VDSAGNVYVVDESNNRFQKFTSTGEFITKWG---SEGLGDG 560



 Score = 37.4 bits (85), Expect = 6.2,   Method: Composition-based stats.
 Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 11/76 (14%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           + ++ V+D S + F    F  + E + K     G +G  DGE     F  P+  AVD  G
Sbjct: 526 AGNVYVVDESNNRFQ--KFTSTGEFITKW----GSEGLGDGE-----FTYPRDVAVDSGG 574

Query: 116 NIYVADKSNHVIRKIT 131
           N+Y+ D+SN  I+K +
Sbjct: 575 NVYIVDESNSRIQKFS 590



 Score = 37.4 bits (85), Expect = 6.6,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 36/74 (48%), Gaps = 11/74 (14%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA-- 139
           + +  GDG         G   F+ P   AVD  GN+YV D  N+ ++  T+ G T IA  
Sbjct: 455 ITKWGGDG--------SGVGEFNYPYGIAVDRAGNVYVVDTGNNRVQIFTSTG-TFIAQW 505

Query: 140 GGGSKKEGRADGPA 153
           GG   ++G+ + P 
Sbjct: 506 GGSGSRDGQFNYPG 519


>gi|308178347|ref|YP_003917753.1| NHL repeat-containing protein [Arthrobacter arilaitensis Re117]
 gi|307745810|emb|CBT76782.1| NHL repeat-containing protein [Arthrobacter arilaitensis Re117]
          Length = 635

 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 88/204 (43%), Gaps = 46/204 (22%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK-------PKS 108
           + + +V DS       L+   S+ S V R  G GV+GY+DG+  +A+F++       P  
Sbjct: 190 NGNFLVGDSGHHRLVELA---SDLSTVVRTIGAGVKGYADGDAQTAQFNELQGLTALPAE 246

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSKKEGRADGPAQ----------- 154
            A ++  ++ VAD  NH +R + NL    V T+AG G ++   ++   Q           
Sbjct: 247 LAAEVGYDVIVADTVNHRLRSV-NLETGVVGTLAGNGVQRLLDSENARQEVQRTELGTDA 305

Query: 155 -NASFSNDFELTFVPHICALLISDHGNQLIRQINLK-------------------PEDCS 194
            N S S+ +++ + P    ++I+  G   I   N +                   P++  
Sbjct: 306 TNISLSSPWDVLYHPS-GQVIIAMAGTHQIFAFNPRTSAVSVFAGTGLEGLADGAPDEAW 364

Query: 195 KSSQSGSALGAVSVWVLVSVLSCL 218
            +  SG AL   +VW+  S  S L
Sbjct: 365 FAQSSGLALDGQNVWIADSETSAL 388



 Score = 37.0 bits (84), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 49/109 (44%), Gaps = 8/109 (7%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           G+G     +   G  RF  P        GN  V D  +H + ++ +   T +   G+  +
Sbjct: 165 GNGPYVAPEAREGDLRF--PGKAIALANGNFLVGDSGHHRLVELASDLSTVVRTIGAGVK 222

Query: 147 GRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLK 189
           G ADG AQ A F+    LT +P   A      ++++D  N  +R +NL+
Sbjct: 223 GYADGDAQTAQFNELQGLTALPAELAAEVGYDVIVADTVNHRLRSVNLE 271


>gi|423072880|ref|ZP_17061629.1| repeat protein [Desulfitobacterium hafniense DP7]
 gi|361856375|gb|EHL08282.1| repeat protein [Desulfitobacterium hafniense DP7]
          Length = 2017

 Score = 44.3 bits (103), Expect = 0.049,   Method: Composition-based stats.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 9/102 (8%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
           AG  + G+        +F  P+S AVD  GNIYV+D  N+ I+K  + G         KK
Sbjct: 254 AGSFIMGWGGAGIDDGQFATPRSIAVDSAGNIYVSDSMNNRIQKFASSGTFL------KK 307

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            GR  G + +  F   + +  V H   + +SD GN  I + +
Sbjct: 308 WGR--GGSADGEFYQPYGVA-VDHEGTIYVSDTGNNRIEKFD 346



 Score = 42.4 bits (98), Expect = 0.19,   Method: Composition-based stats.
 Identities = 22/49 (44%), Positives = 27/49 (55%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
           G  +F  PKS AVD  GNIYVAD +N  I+K  + G      G S+  G
Sbjct: 772 GDGQFRNPKSVAVDNDGNIYVADTTNKRIQKFDSDGTLITKWGDSQDNG 820



 Score = 42.4 bits (98), Expect = 0.20,   Method: Composition-based stats.
 Identities = 29/73 (39%), Positives = 36/73 (49%), Gaps = 6/73 (8%)

Query: 89   GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEG 147
            G  G  +GE     F++P   AVD  GNIYVAD +NH I+K    GV  T  G      G
Sbjct: 956  GSNGSDEGE-----FNRPYGIAVDSDGNIYVADSNNHRIQKFNAAGVFITTWGSYGTGLG 1010

Query: 148  RADGPAQNASFSN 160
            + + P   A  SN
Sbjct: 1011 QFNSPKGIAVDSN 1023



 Score = 41.6 bits (96), Expect = 0.33,   Method: Composition-based stats.
 Identities = 33/82 (40%), Positives = 43/82 (52%), Gaps = 7/82 (8%)

Query: 85   LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK-ITNLGVTTIAGGGS 143
            LA  G +G S G+     FD P   AVD  GNIYVAD +N+ ++K      VT  + GGS
Sbjct: 1189 LAKWGTRGNSAGQ-----FDNPSGIAVDNDGNIYVADTNNNRVQKSAQTYTVTFDSNGGS 1243

Query: 144  KKEGRADGPAQNASFSNDFELT 165
              E   +  A+NA+ S   E T
Sbjct: 1244 AVEAIVN-IAKNATVSLPIEPT 1264



 Score = 39.7 bits (91), Expect = 1.4,   Method: Composition-based stats.
 Identities = 25/86 (29%), Positives = 43/86 (50%), Gaps = 11/86 (12%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
           FD P + AVD  GNIY+ D + H I+K+++ G +    G     EG+ + P   A  S++
Sbjct: 682 FDVPAAAAVDSNGNIYIVDSNRHHIKKLSSTGEILATWGSYGNAEGQLNVPGHIAIDSDN 741

Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
                      + ++D GN  I++ +
Sbjct: 742 ----------NVYVADTGNNRIQKFS 757



 Score = 38.1 bits (87), Expect = 4.1,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 99   GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS-KKEGRADGPA 153
            G  +F+ PK  AVD  GN+YVAD  N  ++K  ++G    A G +   EG    P+
Sbjct: 1008 GLGQFNSPKGIAVDSNGNVYVADIENDRVQKFDSMGGNPEAFGSTGTGEGEFKRPS 1063



 Score = 37.4 bits (85), Expect = 7.6,   Method: Composition-based stats.
 Identities = 23/64 (35%), Positives = 31/64 (48%), Gaps = 5/64 (7%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           G  G +DGE     F +P   AVD +G IYV+D  N+ I K    GV   + G    E +
Sbjct: 309 GRGGSADGE-----FYQPYGVAVDHEGTIYVSDTGNNRIEKFDAAGVFVTSWGNGSDEDQ 363

Query: 149 ADGP 152
            + P
Sbjct: 364 LNMP 367


>gi|290997640|ref|XP_002681389.1| hypothetical protein NAEGRDRAFT_78414 [Naegleria gruberi]
 gi|284095013|gb|EFC48645.1| hypothetical protein NAEGRDRAFT_78414 [Naegleria gruberi]
          Length = 630

 Score = 44.3 bits (103), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 35/96 (36%), Positives = 53/96 (55%), Gaps = 14/96 (14%)

Query: 91  QGYSDGEPG---SARFDKPKSFAVDMK-GNIYVADKSNHVIRKITNLG-VTTIAG---GG 142
           QGYS G+ G   +A  + P   A D+K G++Y++D  N +IRK+   G +TTIAG   GG
Sbjct: 107 QGYS-GDGGLAVNAALNWPYDVACDLKTGDVYISDNENQLIRKVFKNGTITTIAGSRSGG 165

Query: 143 SKKEGR-ADGPAQN----ASFSNDFELTFVPHICAL 173
           + ++G  A     N     S S++  L F  +IC +
Sbjct: 166 TTQDGSLATQTVINRPMMMSLSSNGNLYFTSNICTI 201



 Score = 43.9 bits (102), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 24/89 (26%), Positives = 49/89 (55%), Gaps = 2/89 (2%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA--DGPAQNAS 157
           +A  + P    ++ +G I++ ++++HV+RKI   G   +  G   ++G +   G A NA+
Sbjct: 61  NAMLNNPFGICLNDEGEIFIVERASHVVRKILTNGTIIVFAGIVNQQGYSGDGGLAVNAA 120

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQI 186
            +  +++        + ISD+ NQLIR++
Sbjct: 121 LNWPYDVACDLKTGDVYISDNENQLIRKV 149


>gi|290996232|ref|XP_002680686.1| predicted protein [Naegleria gruberi]
 gi|284094308|gb|EFC47942.1| predicted protein [Naegleria gruberi]
          Length = 655

 Score = 44.3 bits (103), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 65/114 (57%), Gaps = 11/114 (9%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAV-DMKGNIYVADKSNHVIRKI---TNLG 134
           +V RL G G+ G++ DG   +  + + PK  ++  + G+IY AD SN++IR +   TN+ 
Sbjct: 77  IVTRLIGTGIGGFNGDGLLANLTQVNSPKGVSISQISGDIYFADSSNNLIRVMNAATNI- 135

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           V+TIAG GS   G ++G A ++  +   E  +      LL +D  N  IR+++L
Sbjct: 136 VSTIAGTGS--FGFSEGNATSSKLNKPTEPVYWNG--QLLFADSLNYRIRKVDL 185



 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 51/100 (51%), Gaps = 7/100 (7%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S D+   DSS +    ++   +  ++V  +AG G  G+S+G   S++ +KP    V   G
Sbjct: 113 SGDIYFADSSNNLIRVMN---AATNIVSTIAGTGSFGFSEGNATSSKLNKPTE-PVYWNG 168

Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPA 153
            +  AD  N+ IRK+  +N  ++T+ G G   E   DG A
Sbjct: 169 QLLFADSLNYRIRKVDLSNGMISTLTGTG-ISEFNGDGLA 207



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 48/101 (47%), Gaps = 6/101 (5%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           DL   DS   +F    +  S +SV+  L G     Y DG    +    P+  +V   G++
Sbjct: 225 DLYFADSD--SFRIRKYIQSTKSVIT-LVG-STNNYRDGLVNQSLLSYPEGVSVASNGDV 280

Query: 118 YVADKSNHVIRK--ITNLGVTTIAGGGSKKEGRADGPAQNA 156
             +D SNHVIRK  I+   V+TIAG GS      +  A N+
Sbjct: 281 IFSDNSNHVIRKISISKGMVSTIAGTGSAGAAIDNVDATNS 321


>gi|290999615|ref|XP_002682375.1| serine/threonine-protein kinase [Naegleria gruberi]
 gi|284096002|gb|EFC49631.1| serine/threonine-protein kinase [Naegleria gruberi]
          Length = 1357

 Score = 44.3 bits (103), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 117 IYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALL 174
           +Y++D  NH+IRK++  G +TT+ G G+      DG    +   +N   L+  P+   L 
Sbjct: 29  LYISDTFNHMIRKVSKNGTLTTLVGVGAAGSNDDDGLLGIDTQLNNPNGLSISPNGELLF 88

Query: 175 ISDHGNQLIRQINLKPEDCS--KSSQSGSALGAVSVWVLVSVLS 216
           I+D  N+LIR +N+K   C+  K   S   L  ++  V VS+ S
Sbjct: 89  IADTDNKLIRVLNMKTMICNVFKMFTSSGVLDTLTKPVGVSITS 132


>gi|452951305|gb|EME56755.1| NHL repeat-containing protein [Amycolatopsis decaplanina DSM 44594]
          Length = 602

 Score = 44.3 bits (103), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 48/105 (45%), Gaps = 8/105 (7%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           RF  P       +G I VAD  NH I +  + G T I   GS + G  DGP   ASF+  
Sbjct: 168 RF--PSKAVTTAEGRILVADTGNHSIVEFASDGETIIRRFGSGERGAQDGPFDLASFTEP 225

Query: 162 FELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSG 200
             +T +P+  A       +++D     +R I+L   + S  + +G
Sbjct: 226 SGITLLPYEVAERAGYHAVVADTAGHRLRGIDLITGEVSTVAGTG 270


>gi|451981204|ref|ZP_21929574.1| NHL repeat containing protein [Nitrospina gracilis 3/211]
 gi|451761567|emb|CCQ90828.1| NHL repeat containing protein [Nitrospina gracilis 3/211]
          Length = 488

 Score = 44.3 bits (103), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 65/128 (50%), Gaps = 10/128 (7%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKS--FAVDMKGNIYVADKSNHVIRKI--TNLG 134
           E  ++   G G  G +DG    A F +P+   F  D+   ++VAD  NH++RKI   +  
Sbjct: 206 EGTIRHRIGSGAVGKADGGFEEASFYRPQGLCFQDDV---VWVADTENHLLRKIDLNSKQ 262

Query: 135 VTTIAGGGSKKEG-RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           VTT+AG G +    R   PA+  + S+ ++++   H  AL  ++ G   I + ++  +  
Sbjct: 263 VTTVAGTGDQGGFLRDTHPAKETALSSPWDVSL--HEGALYFANAGTHQIGRYHIAADTV 320

Query: 194 SKSSQSGS 201
            + + +G+
Sbjct: 321 EQFAGTGA 328



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 48/110 (43%), Gaps = 6/110 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG---VTTI 138
           V++ AG G +   DG    A F +P    V   G +++AD     IR I   G   V T 
Sbjct: 320 VEQFAGTGAEALQDGPRLQAPFAQPSGLTVG-DGKLFLADSETSAIRSIELGGQGKVETY 378

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            G G    G  DG  + A   +   + +V    A+ I+D  N  IR ++L
Sbjct: 379 VGTGLFDFGDRDGVGKEAVLQHPLGVHYVE--GAVFIADSYNHKIRVLDL 426


>gi|312372811|gb|EFR20689.1| hypothetical protein AND_19679 [Anopheles darlingi]
          Length = 745

 Score = 44.3 bits (103), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 35/73 (47%), Gaps = 2/73 (2%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
            V+   G    G+ DG    ARF+ P+  A      +YVAD  NH IR+I      V+T+
Sbjct: 258 TVRYTIGGKSSGFVDGGFREARFNAPQGVAFRGGDELYVADNENHAIRRIDLRTRTVSTV 317

Query: 139 AGGGSKKEGRADG 151
           AG G +   R  G
Sbjct: 318 AGNGVQGNDRTGG 330


>gi|310657686|ref|YP_003935407.1| exported protein of unknown function [[Clostridium] sticklandii]
 gi|308824464|emb|CBH20502.1| exported protein of unknown function [[Clostridium] sticklandii]
          Length = 780

 Score = 44.3 bits (103), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 48/102 (47%), Gaps = 13/102 (12%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
           GY DGE   A F   +    D +  ++Y+AD  N+ IRK+ N  V+T+AG G+  E    
Sbjct: 566 GYKDGESQEALFQSIRGITFDDRTKSLYIADTYNNRIRKVQNGIVSTVAGNGATGEPEFG 625

Query: 151 GPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQIN 187
             A N   S        PH        ++ISD  N ++ +++
Sbjct: 626 EKATNTYLSR-------PHDVIIKNDEIIISDSWNNIVSKVD 660


>gi|312198349|ref|YP_004018410.1| serine/threonine protein kinase [Frankia sp. EuI1c]
 gi|311229685|gb|ADP82540.1| serine/threonine protein kinase [Frankia sp. EuI1c]
          Length = 789

 Score = 44.3 bits (103), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 49/94 (52%), Gaps = 5/94 (5%)

Query: 74  FPLSEESVVKRLAGDGVQGYS-DGEPGSAR-FDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           F    +  V  LAG G++G S DG P +A   + P   A+  KG++YV D  + V R  T
Sbjct: 485 FRRGPDGKVSVLAGTGIRGSSGDGGPATAAMLNGPTGLALGPKGDLYVLD-GDRVRRIAT 543

Query: 132 NLGVTTIAGGGSKKEGRADG--PAQNASFSNDFE 163
           +  +TT+AGGGS       G  PA +AS    FE
Sbjct: 544 SGKITTVAGGGSDSGPSPTGPVPATSASLGFLFE 577


>gi|256380948|ref|YP_003104608.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Actinosynnema mirum DSM 43827]
 gi|255925251|gb|ACU40762.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Actinosynnema mirum DSM 43827]
          Length = 612

 Score = 44.3 bits (103), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 67/135 (49%), Gaps = 15/135 (11%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG------EPGSARFDKPKSFAVD 112
           L+V DS+  +   L+     E+V++R+ G G +G +DG      EP       P   A  
Sbjct: 195 LLVSDSAHHSLVELA--ADGETVLRRV-GTGERGRADGLEPTFSEPAGIAL-LPPEVAAT 250

Query: 113 MKGNIYVADKSNHVIRKITNLG---VTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVP 168
              +  VAD  NH++R + NL    VTT+AG G + ++G  DGPA     ++ +++ +  
Sbjct: 251 TGYHAVVADTVNHLLRGL-NLDTGEVTTVAGTGEQWRDGATDGPADEIPLTSPWDVVWWE 309

Query: 169 HICALLISDHGNQLI 183
              A+ I+  GN  +
Sbjct: 310 PANAVAIALAGNHTL 324



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 51/107 (47%), Gaps = 3/107 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAG 140
           ++RLAG  V+G +DG    A F +    A D +  +++ D     +R +  +  V T  G
Sbjct: 334 LERLAGTTVEGLNDGPADQAFFAQTSGLASDGE-RLWLVDSETSALRWLDPDRTVHTAVG 392

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            G    G  DGPA  A   +   +T +P   ++ I+D  N  +R+ +
Sbjct: 393 KGLFDFGHRDGPADQALLQHPLGVTALPD-GSVAIADTYNGAVRRYD 438


>gi|406947273|gb|EKD78225.1| lipoprotein [uncultured bacterium]
          Length = 883

 Score = 44.3 bits (103), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 51/94 (54%), Gaps = 6/94 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV--DM 113
           + D+ VLD+  +        +  +  ++R+AG G  GY +    +A+F +P+  AV  D 
Sbjct: 50  ADDIYVLDTINNRIE----KVKSDGSLERVAGSGEYGYKNDSALNAQFAEPQDIAVFGDN 105

Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
              I++AD +N+VIRK+ N  V+T   G S  +G
Sbjct: 106 AEQIFIADTNNNVIRKLENGVVSTFLTGLSGPKG 139



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 3/52 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKIT 131
           VV  L+G+G  GY +G   +A +  PK F  D + G +YVAD  N VIR I+
Sbjct: 833 VVMTLSGNGNVGYVNGTDTAAEWHNPKGF--DFRAGKLYVADTMNDVIRTIS 882


>gi|158521169|ref|YP_001529039.1| YD repeat-containing protein [Desulfococcus oleovorans Hxd3]
 gi|158509995|gb|ABW66962.1| YD repeat protein [Desulfococcus oleovorans Hxd3]
          Length = 2961

 Score = 44.3 bits (103), Expect = 0.054,   Method: Composition-based stats.
 Identities = 24/67 (35%), Positives = 43/67 (64%), Gaps = 3/67 (4%)

Query: 79   ESVVKRLAGDGVQGYS-DGEPGSAR-FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-V 135
            + ++  +AG+G++G   DG    A   + P+S  VD+KGN+Y +++ ++ +RKI   G +
Sbjct: 1769 DGIISTVAGNGLEGNGPDGISALATPLNDPESIVVDVKGNLYFSERDHYKVRKIDPSGII 1828

Query: 136  TTIAGGG 142
            TT+AG G
Sbjct: 1829 TTVAGNG 1835


>gi|431895427|gb|ELK04943.1| NHL repeat-containing protein 2 [Pteropus alecto]
          Length = 726

 Score = 44.3 bits (103), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 39/75 (52%), Gaps = 6/75 (8%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+TIAG    GS KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEMVSTIAGIGIQGSDKE 325

Query: 147 GRADGPAQNASFSND 161
           G A G  Q  S   D
Sbjct: 326 GGAKGEQQPISSPWD 340


>gi|269957976|ref|YP_003327765.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Xylanimonas cellulosilytica DSM 15894]
 gi|269306657|gb|ACZ32207.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Xylanimonas cellulosilytica DSM 15894]
          Length = 688

 Score = 44.3 bits (103), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 56/107 (52%), Gaps = 13/107 (12%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   I  PG + L+V D+   +   L+     E++V+R+ G G +G  DG P  ARF +P
Sbjct: 178 PAKAITLPGGT-LLVADAGHHSLAELA--PDGETLVRRI-GSGQRGLLDGGPDEARFAEP 233

Query: 107 KSFAV---DMKG----NIYVADKSNHVIR--KITNLGVTTIAGGGSK 144
               +   D++G    ++ VAD  NH +R  ++ +  V T+AG G +
Sbjct: 234 NGLCLVPEDLRGRLGYDVVVADTVNHALRGVRLADGLVVTLAGTGEQ 280


>gi|344274719|ref|XP_003409162.1| PREDICTED: NHL repeat-containing protein 2 [Loxodonta africana]
          Length = 726

 Score = 44.3 bits (103), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 40/78 (51%), Gaps = 6/78 (7%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGTFSESTFNSPQGVAI-MNNVIYVADTENHLIRKIDLEAEIVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFEL 164
           G A G  Q  S   D  L
Sbjct: 326 GGAKGEQQPISSPWDVVL 343


>gi|116751477|ref|YP_848164.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
 gi|116700541|gb|ABK19729.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
          Length = 1763

 Score = 44.3 bits (103), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 45/167 (26%), Positives = 74/167 (44%), Gaps = 27/167 (16%)

Query: 41  HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS 100
           +  E+N  + +      DL + DS       +        ++  +AG+G  GYS G+ G+
Sbjct: 468 NHTEVNWPTGVAVDAKGDLYIADSGNCRVRKVD----SYGIITTVAGNGSCGYS-GDGGA 522

Query: 101 A---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGG----SKKEGRA--- 149
           A   +   P   AVD + N+Y++D  +H IRK+   GV TT+AG G      ++G     
Sbjct: 523 AVEAQLGYPAFVAVDSRENLYISDTESHRIRKVGPDGVITTVAGSGLCHLKVEDGYGGYQ 582

Query: 150 ----------DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
                      GPA  A  +N   +        L I+D GN+ +R++
Sbjct: 583 EYDAPCFDGDQGPADQAKLNNPSGVA-TDASGNLYIADTGNRRLRKV 628



 Score = 42.4 bits (98), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 36/65 (55%), Gaps = 6/65 (9%)

Query: 88  DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKE 146
           DG QG +D     A+ + P   A D  GN+Y+AD  N  +RK+   G ++T+AG G    
Sbjct: 590 DGDQGPAD----QAKLNNPSGVATDASGNLYIADTGNRRLRKVAPSGTISTVAGDGELGS 645

Query: 147 GRADG 151
           G ADG
Sbjct: 646 G-ADG 649



 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 42/149 (28%), Positives = 66/149 (44%), Gaps = 7/149 (4%)

Query: 42  QLEINPHSVI-DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG-YSDGEPG 99
           QL ++P  V+ D  G  +L V D S          +     V  +AG   +G   DG P 
Sbjct: 654 QLHLSPVGVVADSRG--NLFVSDLSHGEGNGRIRKIDPTGFVTTVAGHDWEGTLGDGGPA 711

Query: 100 S-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNAS 157
           + A    P+  AVD  GN+Y AD   + +R +  +G +TT+AG      G   G A  A 
Sbjct: 712 TQAYIGSPRGLAVDRGGNLYFADPWINRVRAVNTMGIITTVAGSDDGTFGEDGGLALRAR 771

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQI 186
            +   ++   P    L  ++  NQ +R++
Sbjct: 772 LAFPMDVAVDPR-GDLYATEQWNQRVRKV 799


>gi|354612773|ref|ZP_09030715.1| NHL repeat containing protein [Saccharomonospora paurometabolica
           YIM 90007]
 gi|353222911|gb|EHB87206.1| NHL repeat containing protein [Saccharomonospora paurometabolica
           YIM 90007]
          Length = 436

 Score = 44.3 bits (103), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 53/103 (51%), Gaps = 13/103 (12%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG 140
           +AG G  G S DG P   A    P + A+D  GN+Y+ D  +  +RK+  T+  ++T+AG
Sbjct: 159 VAGTGQNGDSGDGGPAVDAELRHPTTVALDRHGNVYLTDPESRRVRKVSATDHTISTVAG 218

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
            G + +    GPA +A      EL F P+  A  +  HGN  +
Sbjct: 219 TGREGDSGDGGPATDA------ELRF-PNCVA--VDGHGNVFL 252



 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 59/115 (51%), Gaps = 5/115 (4%)

Query: 77  SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TN 132
           + +  +  +AG G +G S DG P + A    P   AVD  GN+++ D  +H +RK+  T+
Sbjct: 208 ATDHTISTVAGTGREGDSGDGGPATDAELRFPNCVAVDGHGNVFLTDPRSHRVRKVSATD 267

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             +TT+AG G + +G   GPA +A       L  V     L   D GN  +R+++
Sbjct: 268 HTITTVAGTGREGDGGDGGPATDAEVRYPNSLA-VDGAGNLYFGDTGNHRVRRVS 321



 Score = 40.4 bits (93), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 36/80 (45%), Gaps = 11/80 (13%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT--TIAGGGSKKEGRADGPAQNASFSN 160
           F +P   A+D +GN YVAD     +R++T  G +   +AG G   +    GPA +A    
Sbjct: 122 FGRPWGLALDTQGNFYVADADGCQVRRVTADGASFAAVAGTGQNGDSGDGGPAVDAELR- 180

Query: 161 DFELTFVPHICALLISDHGN 180
                   H   + +  HGN
Sbjct: 181 --------HPTTVALDRHGN 192



 Score = 37.4 bits (85), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 38/83 (45%), Gaps = 4/83 (4%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE 163
           P   AVD  GN+Y+AD     +RK+  T+  + TI G G   +G   GPA          
Sbjct: 353 PVGLAVDGAGNLYIADPDTCRVRKVSATDHTIDTIVGNGHAGDGDERGPATGLPLDPGGV 412

Query: 164 LTFVPHICALLISDHGNQLIRQI 186
           +T       L ++D G   IR++
Sbjct: 413 VTDGRE--NLFVADFGQYRIRRV 433


>gi|298383938|ref|ZP_06993499.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|298263542|gb|EFI06405.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 483

 Score = 44.3 bits (103), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 50/103 (48%), Gaps = 5/103 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-----NLGVTTIAGGGSKKE 146
           G+ DG   +A+FD P        G +Y+AD  N+ IR I      +  + T   G    +
Sbjct: 381 GWEDGLLRNAKFDNPHQICFTEDGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMK 440

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G  DG    A F++ F +        + I+D GN++IR+++++
Sbjct: 441 GYKDGGPDIALFNHPFGVAVSADGQIVYIADTGNKVIRKLSIQ 483



 Score = 44.3 bits (103), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 40/89 (44%), Gaps = 4/89 (4%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARF 103
           NPH +        L + D   +    +    PL    V   +   G++GY DG P  A F
Sbjct: 394 NPHQICFTE-DGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMKGYKDGGPDIALF 452

Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           + P   AV   G I Y+AD  N VIRK++
Sbjct: 453 NHPFGVAVSADGQIVYIADTGNKVIRKLS 481


>gi|167534306|ref|XP_001748831.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772793|gb|EDQ86441.1| predicted protein [Monosiga brevicollis MX1]
          Length = 5844

 Score = 44.3 bits (103), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 52/112 (46%), Gaps = 3/112 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-NLGVTTIAGGGSKKEGRAD 150
            + DG    A F +P        G+  VAD +NH IR I+ +L       G   + G A+
Sbjct: 825 AHRDGRLSDACFHRPSGICQLEDGSFLVADSANHCIRSISRSLSTVKTFAGQPGQAGGAE 884

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDCSKSSQSGS 201
           G    A F+   +L  +     +LISD  N  IR + L K ++ + ++ +GS
Sbjct: 885 GAVSKAQFNQPSDLLVLDEK-RILISDEANNSIRLLQLQKDKNWTVTTVAGS 935



 Score = 40.8 bits (94), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 47/106 (44%), Gaps = 11/106 (10%)

Query: 57   SDLIVLDSSR--------SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK- 107
            SDL+VLD  R        ++   L     +   V  +AG    G++DG  GS + ++P+ 
Sbjct: 896  SDLLVLDEKRILISDEANNSIRLLQLQKDKNWTVTTVAGSRTAGFADGRRGSTQLNQPRG 955

Query: 108  -SFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADG 151
              F    +  + +AD  NH IR + T+     +  G   + GR  G
Sbjct: 956  MCFLNQRRTVVAIADAGNHCIRALNTSDWSLKLVAGVPGRAGRRYG 1001


>gi|184201467|ref|YP_001855674.1| hypothetical protein KRH_18210 [Kocuria rhizophila DC2201]
 gi|183581697|dbj|BAG30168.1| hypothetical protein [Kocuria rhizophila DC2201]
          Length = 246

 Score = 44.3 bits (103), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 43/93 (46%), Gaps = 7/93 (7%)

Query: 55  GSSDLIVLDSSRSAFYTLSF----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
           G   L  +D   S+   L+     P  +  VV  +   G  GY DG    AR D P+   
Sbjct: 111 GDGRLWFVDRDSSSLRCLTIDTDRPDGDPHVVTVVGRHGA-GYQDGPATEARLDHPEDLQ 169

Query: 111 VDMKGNIYVADKSNHVIR--KITNLGVTTIAGG 141
           +   G++ VAD  NH +R   +T+  VTTI GG
Sbjct: 170 MLYDGSVVVADTGNHALRHVDVTDGEVTTICGG 202



 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 28/116 (24%), Positives = 52/116 (44%), Gaps = 11/116 (9%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--------N 132
           V++  AG G  G  DG    A+F    S      G ++  D+ +  +R +T        +
Sbjct: 79  VLQEWAGTGEPGRRDGRVAQAKFAATVSITEGGDGRLWFVDRDSSSLRCLTIDTDRPDGD 138

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
             V T+   G    G  DGPA  A   +  +L  + +  +++++D GN  +R +++
Sbjct: 139 PHVVTVV--GRHGAGYQDGPATEARLDHPEDLQML-YDGSVVVADTGNHALRHVDV 191


>gi|219852941|ref|YP_002467373.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219547200|gb|ACL17650.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 579

 Score = 44.3 bits (103), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 55/120 (45%), Gaps = 13/120 (10%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           LA  G +G  DG+     FD P   AVD + N+YV D  N+ ++K T+ G T +A  G++
Sbjct: 176 LAKWGTEGGGDGQ-----FDYPTGIAVDSENNVYVVDSYNNRVQKFTSNG-TFLAKWGAR 229

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
             G  DG      F++  E   V     + ++D GN  I +        +K    GS  G
Sbjct: 230 --GSGDG-----EFADFPEEIAVDSTGNVFVTDTGNNRIEKFTSNGTFLAKWGGRGSGDG 282



 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 5/72 (6%)

Query: 82  VKRLAGDGV--QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
           V+R    G+    +  G  G  +F  P   AVD  GN+YVAD  N+ ++K ++ G T +A
Sbjct: 119 VQRFTSTGIFLARWGTGGTGDGQFKSPSGVAVDSAGNVYVADMYNYRVQKFSSAG-TLLA 177

Query: 140 GGGSKKEGRADG 151
             G+  EG  DG
Sbjct: 178 KWGT--EGGGDG 187


>gi|397779898|ref|YP_006544371.1| NHL repeat-containing protein 2 [Methanoculleus bourgensis MS2]
 gi|396938400|emb|CCJ35655.1| NHL repeat-containing protein 2 [Methanoculleus bourgensis MS2]
          Length = 483

 Score = 44.3 bits (103), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 38/131 (29%), Positives = 60/131 (45%), Gaps = 12/131 (9%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKIT--NL 133
           S +  +  + G G  G +DG    A F  P+  A D + G +YVAD  NH IR+++  +L
Sbjct: 196 SRDGAILGVIGTGAPGNADGSSSEAAFYLPEGLAFDEEAGVLYVADAGNHTIRQVSWPDL 255

Query: 134 GVTTIAGGG----SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            V T+AG G    S  EG   GP    + +   +L  +     L I+  G   + +++L 
Sbjct: 256 RVETVAGTGLEAPSPGEG---GPGTGVALNAPRDLALMGDY--LYIAMAGANQVWRMDLA 310

Query: 190 PEDCSKSSQSG 200
                  + SG
Sbjct: 311 THTLEPYAGSG 321


>gi|380693174|ref|ZP_09858033.1| hypothetical protein BfaeM_04235 [Bacteroides faecis MAJ27]
          Length = 469

 Score = 44.3 bits (103), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 51/117 (43%), Gaps = 22/117 (18%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKG--------------NIYVADKSNHVIRKITNLGVT 136
           + + +G  GSARF+KP+ F V +K               + Y  D+ NH I K+   GV 
Sbjct: 351 KSFKEGTGGSARFNKPQ-FGVFVKNEKYGEGIGPDKDQYDFYFCDRENHCIWKLDPHGVA 409

Query: 137 TIAGGGSKKE------GRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           ++A G S +       G  DG P   A F+    L + P      I D  N+ IR +
Sbjct: 410 SLAAGRSNENADGKIWGYVDGNPLHEARFNQPAGLAYDPDTDMFYIGDIDNKGIRYM 466


>gi|348688202|gb|EGZ28016.1| hypothetical protein PHYSODRAFT_321722 [Phytophthora sojae]
          Length = 490

 Score = 44.3 bits (103), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 80/205 (39%), Gaps = 42/205 (20%)

Query: 11  AITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEIN---PHSVIDRPGSSDLIVLDSSRS 67
           A+T    +F+   SG   + G+      DG  LE N   P  V +    S +IV D+   
Sbjct: 124 AVTDDVEMFAFAGSG---KKGWR-----DGLALEANFNSPSGVCEYADGS-IIVADTGNH 174

Query: 68  AFYTLSFPLSEESVVKRLAGD---------GVQG----------YSDGEPGSARFDKPKS 108
               +    + + VVK +AG          GVQG          Y DGE   + F  P +
Sbjct: 175 CIRQIRRGANGKLVVKTIAGAYASFETKRVGVQGAGIKDQRTAGYRDGE--RSLFRSPST 232

Query: 109 FAVDMKGNIYVADKSNHVIRKI-------TNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
                 G + VAD  N+ IR +       +   V T+ G    + G ADG  + A F   
Sbjct: 233 VLAGPSGELLVADTMNNCIRGLLPPPDGTSPWRVKTVCG--QTRPGHADGNCEVALFDQP 290

Query: 162 FELTFVPHICALLISDHGNQLIRQI 186
             L +        ++D GN  IRQ+
Sbjct: 291 LSLCWGEDSNTFFVADRGNACIRQV 315


>gi|290976772|ref|XP_002671113.1| predicted protein [Naegleria gruberi]
 gi|284084679|gb|EFC38369.1| predicted protein [Naegleria gruberi]
          Length = 520

 Score = 44.3 bits (103), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 53/109 (48%), Gaps = 6/109 (5%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTT 137
           +  +AG G  GYS G+ G   +A    PKS A+   G I+  D  N VIR+IT  G +TT
Sbjct: 255 ITTIAGTGAYGYS-GDGGLAINAVIRYPKSIAIGKHGEIFFTDSDNQVIRRITPDGIITT 313

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           IAG G+       G A +A  S    +  V     +   D+ N  +R++
Sbjct: 314 IAGTGNFGYSGDGGLATSADISKPTGIA-VDSNGTIYFCDNNNNRVRKL 361



 Score = 37.0 bits (84), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 55/123 (44%), Gaps = 5/123 (4%)

Query: 69  FYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNH 125
           FY     + +   V  +AG G  G S G  G   +A    P    V+ K  IY+ +   H
Sbjct: 70  FYGRVRKVDKNGYVTTIAGFGTSGDSSGNGGLATNAMLGYPAGITVNDKDEIYLVEYGTH 129

Query: 126 VIRKI-TNLGVTTIAGG-GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
           ++RKI  N  +   AG       G  +G A NA F++ + ++  P    + ISD  N  I
Sbjct: 130 MVRKILVNGTIQEFAGILNGGGFGGDNGLATNAIFNHPYGVSVNPLTGDVYISDFSNNRI 189

Query: 184 RQI 186
           R++
Sbjct: 190 RKV 192


>gi|195387367|ref|XP_002052367.1| GJ17513 [Drosophila virilis]
 gi|194148824|gb|EDW64522.1| GJ17513 [Drosophila virilis]
          Length = 736

 Score = 44.3 bits (103), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 2/73 (2%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTI 138
           VV+   G    G+ DG   +ARF+ P+  A   K  + VAD  NH +R+I+  +  V T+
Sbjct: 255 VVEYKVGGLEAGFVDGNLNTARFNNPQGVAFLDKDTLIVADTDNHALRQISLKSGAVETL 314

Query: 139 AGGGSKKEGRADG 151
           AG G +   R  G
Sbjct: 315 AGTGCQGNDRTGG 327


>gi|219852041|ref|YP_002466473.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219546300|gb|ACL16750.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 930

 Score = 44.3 bits (103), Expect = 0.062,   Method: Composition-based stats.
 Identities = 29/99 (29%), Positives = 47/99 (47%), Gaps = 8/99 (8%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           G  ++ +     G  +F  P   AVD  GN+YVAD  N+ I+K T++G+     G S   
Sbjct: 76  GTFIKKWGSSGSGDGQFSSPSGVAVDSAGNVYVADTGNNRIQKFTSMGIFIKQWGSS--- 132

Query: 147 GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
               G      FS+ F +  V +   + ++D GN  I++
Sbjct: 133 ----GSGNGQFFSSPFGVA-VDNAGNVYVADTGNNRIQK 166


>gi|383115566|ref|ZP_09936322.1| hypothetical protein BSGG_2564 [Bacteroides sp. D2]
 gi|313695029|gb|EFS31864.1| hypothetical protein BSGG_2564 [Bacteroides sp. D2]
          Length = 459

 Score = 43.9 bits (102), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 37/95 (38%), Positives = 51/95 (53%), Gaps = 2/95 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G+ +G+   AR ++P    +D +GNIYVA +  H I KIT  GV T    G+   G  DG
Sbjct: 364 GFVEGQGNDARLEEPCQMDLDEEGNIYVAVRKKHRIAKITPDGVVTSY-TGTGTSGTTDG 422

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           P   A F++   L F     AL +SD+ N  IR+I
Sbjct: 423 PLDKAQFNHPEGLQFGLD-GALYVSDYWNHKIRKI 456


>gi|219852087|ref|YP_002466519.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219546346|gb|ACL16796.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 668

 Score = 43.9 bits (102), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 39/135 (28%), Positives = 62/135 (45%), Gaps = 16/135 (11%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
           +D+PGS   + +DS R   Y   +  +   V     G  +Q  S G  G  +FD P   A
Sbjct: 139 LDQPGS---VAVDS-RGQIYVADWGNNRVQVFNSTGGYLMQWGSSGS-GDGQFDGPNGIA 193

Query: 111 VDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE-GRADGPAQNASFSNDFELTFVPH 169
           +D  GN+YV D  N+ I++  + G   +  G S  E G+ + P   A  SND        
Sbjct: 194 IDSTGNVYVTDAYNNRIQEFNSTGGYLMQWGSSGSEAGQFEIPQGIAMDSND-------- 245

Query: 170 ICALLISDHGNQLIR 184
              + ++D GN++ +
Sbjct: 246 --NVYVADSGNRVQK 258



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 14/37 (37%), Positives = 23/37 (62%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           PG+ +F  P   A+D  GN+Y+ D  N+ ++K T+ G
Sbjct: 462 PGAGQFSGPNGIALDSTGNVYITDWGNNRVQKFTSTG 498



 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 46/107 (42%), Gaps = 11/107 (10%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAGGGSKKEGRADGPAQNAS 157
           G  +F+ P   AVD  G +YVAD +N+ I+K  +  G  T  G    +EG+ D P   A 
Sbjct: 88  GDGQFNIPYDIAVDSVGYVYVADMNNNRIQKFNSTGGYLTQWGTKGSEEGQLDQPGSVA- 146

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
                    V     + ++D GN  ++  N       +   SGS  G
Sbjct: 147 ---------VDSRGQIYVADWGNNRVQVFNSTGGYLMQWGSSGSGDG 184


>gi|298387118|ref|ZP_06996672.1| hypothetical protein HMPREF9007_03883 [Bacteroides sp. 1_1_14]
 gi|298260268|gb|EFI03138.1| hypothetical protein HMPREF9007_03883 [Bacteroides sp. 1_1_14]
          Length = 469

 Score = 43.9 bits (102), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 51/117 (43%), Gaps = 22/117 (18%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKG--------------NIYVADKSNHVIRKITNLGVT 136
           + + +G  GSARF+KP+ F V +K               + Y  D+ NH I K+   GV 
Sbjct: 351 KSFKEGTGGSARFNKPQ-FGVFVKNEKYGEGIGPDNDQYDFYFCDRENHCIWKLDPHGVA 409

Query: 137 TIAGGGSKKE------GRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           ++A G S +       G  DG P   A F+    L + P      I D  N+ IR +
Sbjct: 410 SLAAGRSNENADGKIWGYVDGNPLHEARFNQPAGLAYDPDTDMFYIGDIDNKGIRYM 466


>gi|119569872|gb|EAW49487.1| NHL repeat containing 2, isoform CRA_a [Homo sapiens]
          Length = 683

 Score = 43.9 bits (102), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+TIAG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTIAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343


>gi|290979059|ref|XP_002672252.1| predicted protein [Naegleria gruberi]
 gi|284085827|gb|EFC39508.1| predicted protein [Naegleria gruberi]
          Length = 1239

 Score = 43.9 bits (102), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 53/108 (49%), Gaps = 10/108 (9%)

Query: 39  DGHQLEINPHS-----VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           DG+ L  N  +     V++R G  +L  ++        LSF  +    +  +AG+G  GY
Sbjct: 42  DGNALVTNLKNPRGLFVVERNGG-ELYFVEEENHIIRKLSFVTNN---ITTIAGNGTAGY 97

Query: 94  SDGEPGSARFDKPKSFAV-DMKGNIYVADKSNHVIRKITNLGVTTIAG 140
           SDG    +  + P S  V +  G IY+AD  N  +R++ N  + TIAG
Sbjct: 98  SDGVATLSMLNYPHSVHVSNTTGEIYIADTFNCKLRRVFNGQMETIAG 145



 Score = 38.1 bits (87), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 11/105 (10%)

Query: 55  GSS-DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKS-FAVD 112
           GSS D I + + +  F   S  LS+ ++V     D   G  DG         P+  F V+
Sbjct: 3   GSSLDTIYVSTGKQIFNFTSDLLSKVTIVAGTFVDSWNG--DGNALVTNLKNPRGLFVVE 60

Query: 113 MKG-NIYVADKSNHVIRK---ITNLGVTTIAGGGSKKEGRADGPA 153
             G  +Y  ++ NH+IRK   +TN  +TTIAG G+   G +DG A
Sbjct: 61  RNGGELYFVEEENHIIRKLSFVTN-NITTIAGNGTA--GYSDGVA 102


>gi|383124929|ref|ZP_09945589.1| hypothetical protein BSIG_1326 [Bacteroides sp. 1_1_6]
 gi|251840920|gb|EES69001.1| hypothetical protein BSIG_1326 [Bacteroides sp. 1_1_6]
          Length = 469

 Score = 43.9 bits (102), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 51/117 (43%), Gaps = 22/117 (18%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKG--------------NIYVADKSNHVIRKITNLGVT 136
           + + +G  GSARF+KP+ F V +K               + Y  D+ NH I K+   GV 
Sbjct: 351 KSFKEGTGGSARFNKPQ-FGVFVKNEKYGEGIGPDNDQYDFYFCDRENHCIWKLDPHGVA 409

Query: 137 TIAGGGSKKE------GRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           ++A G S +       G  DG P   A F+    L + P      I D  N+ IR +
Sbjct: 410 SLAAGRSNENADGKIWGYVDGNPLHEARFNQPAGLAYDPDTDMFYIGDIDNKGIRYM 466


>gi|29348756|ref|NP_812259.1| hypothetical protein BT_3347 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340662|gb|AAO78453.1| Quinoprotein amine dehydrogenase [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 469

 Score = 43.9 bits (102), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 51/117 (43%), Gaps = 22/117 (18%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKG--------------NIYVADKSNHVIRKITNLGVT 136
           + + +G  GSARF+KP+ F V +K               + Y  D+ NH I K+   GV 
Sbjct: 351 KSFKEGTGGSARFNKPQ-FGVFVKNEKYGEGIGPDNDQYDFYFCDRENHCIWKLDPHGVA 409

Query: 137 TIAGGGSKKE------GRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           ++A G S +       G  DG P   A F+    L + P      I D  N+ IR +
Sbjct: 410 SLAAGRSNENADGKIWGYVDGNPLHEARFNQPAGLAYDPDTDMFYIGDIDNKGIRYM 466


>gi|261855185|ref|YP_003262468.1| redoxin [Halothiobacillus neapolitanus c2]
 gi|261835654|gb|ACX95421.1| Redoxin domain protein [Halothiobacillus neapolitanus c2]
          Length = 498

 Score = 43.9 bits (102), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 36/60 (60%), Gaps = 3/60 (5%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
           GDG  G++DG   +A F++P+  A  +  ++YVAD  N  IR+I   +  V+T+ G G +
Sbjct: 234 GDGKAGFADGSFATAEFNRPQGIAW-LGNDLYVADTDNQRIRRIDFASQTVSTVVGNGQR 292


>gi|386384782|ref|ZP_10070129.1| redoxin domain-containing protein [Streptomyces tsukubaensis
           NRRL18488]
 gi|385667750|gb|EIF91146.1| redoxin domain-containing protein [Streptomyces tsukubaensis
           NRRL18488]
          Length = 591

 Score = 43.9 bits (102), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 18/115 (15%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S +L+V DS+R     L      E+VV+R+ G            +  F++P+  A+   G
Sbjct: 185 SGNLLVSDSTRHQLVELE--PDGETVVRRIGG------------ADEFNEPQGLALLPDG 230

Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
            + VAD  NH IR    ++  V T+AG G +  +   A+GPA     S+ ++L +
Sbjct: 231 RVAVADTVNHRIRAYDPSSGAVETLAGTGKQWWQGSPAEGPALTVDLSSPWDLAW 285



 Score = 42.7 bits (99), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 50/112 (44%), Gaps = 3/112 (2%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLG-V 135
           E   V+R AG   +G  DG    A F +P   AV   G  +++AD     +R I   G V
Sbjct: 306 EAGTVRRAAGTTNEGLVDGPADEAWFAQPSGLAVAADGERLWIADSETSALRWIGRDGSV 365

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            +  G G    G  DG A  A F +   +T +P   ++ + D  N  +R+ +
Sbjct: 366 HSAVGTGLFDFGHRDGDAAQALFQHPLGVTALPD-GSVAVCDTYNHALRRYD 416


>gi|412985326|emb|CCO20351.1| predicted protein [Bathycoccus prasinos]
          Length = 897

 Score = 43.9 bits (102), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 60/123 (48%), Gaps = 18/123 (14%)

Query: 81  VVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRK--ITNLGVT 136
           VV+ +AG G+ GY D   G+ ARF++P    +   G+ ++V+D  N+ IRK  + +  VT
Sbjct: 220 VVRTVAGSGLTGYHDDLVGTKARFNQPMDVVMHPDGHTLFVSDAMNNCIRKVDVQSGEVT 279

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI-----------CALL-ISDHGNQLIR 184
           T+   G K  G  DG   NA F     ++   +            C  L ++D GN  +R
Sbjct: 280 TLT--GDKLPGMIDGIKSNARFLQPMGMSCSTNTVLDGGGKSSNKCTFLAVADAGNHAVR 337

Query: 185 QIN 187
            +N
Sbjct: 338 LVN 340



 Score = 43.5 bits (101), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI-TNLG-VTTIAGGGSKKEGRA 149
           + DG    ARF+KP        G +I++AD  NH IR+I    G VTT+AG G+   G A
Sbjct: 118 FHDGPSLEARFNKPFGVVASPDGRDIFIADTFNHRIRRIDVEFGTVTTLAGTGA--SGSA 175

Query: 150 DGPAQ--NASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           DG A+  +A+F     +   P    + ++D  N  IR I L
Sbjct: 176 DGLAKRGDATFKFPSGVAVSPDGGFVYVADTKNHKIRVIIL 216


>gi|428304139|ref|YP_007140964.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
 gi|428245674|gb|AFZ11454.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
          Length = 537

 Score = 43.9 bits (102), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 7/101 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  VI    S+ L + D+        S     ++V+    G G   + DG+   A+F +P
Sbjct: 206 PSKVIACQQSNSLFIADTGHHRLVIASLNGETQAVI----GTGSPAWVDGDLEIAQFCEP 261

Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
                D +   IYVAD  NH++RKI      V+TIAG G++
Sbjct: 262 MGMVFDYEQQVIYVADTVNHLLRKIDLKTRQVSTIAGNGTQ 302



 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 63/141 (44%), Gaps = 10/141 (7%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDM-KG 115
           +L + DS  S+   ++  L +  +V+ + G G + G+ D +             +    G
Sbjct: 384 ELFIADSESSSIRAVT--LGDFPIVRTICGSGQLFGFGDVDGIGENVRLQHCLGITYGAG 441

Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
            ++V D  NH ++++  T     TI G G  K G  DG   +  FS    L F  +   L
Sbjct: 442 YLWVTDTYNHKLKRVNPTTGECQTICGSG--KAGLQDGFGTDVYFSEPSGLAFACNY--L 497

Query: 174 LISDHGNQLIRQINLKPEDCS 194
            I+D  N  IR+INL  ++ +
Sbjct: 498 YIADSNNHAIRRINLNSQEVT 518


>gi|427719807|ref|YP_007067801.1| NHL repeat containing protein [Calothrix sp. PCC 7507]
 gi|427352243|gb|AFY34967.1| NHL repeat containing protein [Calothrix sp. PCC 7507]
          Length = 505

 Score = 43.9 bits (102), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 3/58 (5%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
           GDG  G  DGE  ++RF +P   +V +  ++YVAD +NHVIR++      VTT+   G
Sbjct: 437 GDGSAGLQDGEGKNSRFFEPSGLSV-INSHLYVADTNNHVIRRVDLNTFEVTTLKFPG 493



 Score = 43.5 bits (101), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 61/139 (43%), Gaps = 11/139 (7%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           +S L + DS        SF    +  +  + G G  G+ DG+   A+F  P+  A D + 
Sbjct: 184 ASGLFIADSGHHRLVRSSF----DGEILHVIGTGKPGFIDGDFSEAQFFAPQGMAFDEEN 239

Query: 116 NI-YVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFE--LTFVPHI 170
            I Y+AD  NH +R++   +  V TIAG G  K+ R   P   A    +       V   
Sbjct: 240 QILYIADTENHALRRVDLRHQLVKTIAGTG--KQSRNIHPHSGAGLETELNSPWDLVKVG 297

Query: 171 CALLISDHGNQLIRQINLK 189
             L I+  G   I Q++L+
Sbjct: 298 NNLFIAMAGPHQIWQMDLE 316



 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 32/129 (24%), Positives = 53/129 (41%), Gaps = 7/129 (5%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---G 134
           E  ++   AG G +   DG    + F +P     D K  +Y+AD     IR +  +    
Sbjct: 316 ETGIINTYAGTGAEACVDGSLTESAFAQPSGITTDGK-ELYIADSEVSSIRSVGIVEPRQ 374

Query: 135 VTTIAG-GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           V T+ G GG    G  DG  +N    +   + +  +   L ++D  N  I+ ++    +C
Sbjct: 375 VKTVCGSGGLFSFGDVDGQGENVRLQHCLGVEYAQNY--LWVADTYNHKIKLVSPSTRNC 432

Query: 194 SKSSQSGSA 202
                 GSA
Sbjct: 433 QTVLGDGSA 441


>gi|119569873|gb|EAW49488.1| NHL repeat containing 2, isoform CRA_b [Homo sapiens]
          Length = 726

 Score = 43.9 bits (102), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+TIAG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTIAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343


>gi|336322170|ref|YP_004602138.1| alkyl hydroperoxide reductase [[Cellvibrio] gilvus ATCC 13127]
 gi|336105751|gb|AEI13570.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [[Cellvibrio] gilvus ATCC 13127]
          Length = 638

 Score = 43.9 bits (102), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 30/116 (25%), Positives = 53/116 (45%), Gaps = 8/116 (6%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---- 133
           +E  ++ +AG   +G  DG    A F +P   AVD +G I++AD     +R +  +    
Sbjct: 342 KEPSLRHVAGTMNEGLLDGPLADAWFAQPSGLAVDDEGGIWLADSETSALRVVEPVHGSA 401

Query: 134 ---GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               V T+ G G  + G  DG A  A   +   +  +P   ++ ++D  N  +R+ 
Sbjct: 402 GDGSVRTVVGAGLFEFGHRDGAADEARLQHPLGVAVLPD-GSVAVADTYNGAVRRF 456



 Score = 40.8 bits (94), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 39/159 (24%), Positives = 72/159 (45%), Gaps = 30/159 (18%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV------ 111
           +L+V D+   +   L+     +++V+R+ G G +G  DG P  ARF +P    +      
Sbjct: 187 NLLVADAGHHSLAELA--PDGQTLVRRI-GSGERGLVDGGPDDARFSEPNGLCLVPDELR 243

Query: 112 -DMKGNIYVADKSNHVIR--KITNLGVTTIAG-----------------GGSKKEGRADG 151
             +  ++ VAD  NH +R  ++ +  V+T+AG                 GG  + GRA  
Sbjct: 244 PWVGYDVLVADTVNHALRGVRLADGHVSTVAGTGEQFMVGAADNVLPGWGGDDRSGRAYA 303

Query: 152 PAQNA-SFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            A  A   S+ +++T+     A +++  GN  +   + K
Sbjct: 304 GAPLAVRLSSPWDVTWSAAWGAFVVAMAGNHTLWAFDPK 342



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 47/110 (42%), Gaps = 8/110 (7%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNAS 157
           PG+ RF  P        GN+ VAD  +H + ++   G T +   GS + G  DG   +A 
Sbjct: 171 PGTLRF--PAKAVALPDGNLLVADAGHHSLAELAPDGQTLVRRIGSGERGLVDGGPDDAR 228

Query: 158 FSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSGS 201
           FS    L  VP          +L++D  N  +R + L     S  + +G 
Sbjct: 229 FSEPNGLCLVPDELRPWVGYDVLVADTVNHALRGVRLADGHVSTVAGTGE 278


>gi|291235470|ref|XP_002737667.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 638

 Score = 43.9 bits (102), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQ 154
           G  RF  P    +D KGN+ VADK+NH I+ +T  G      G    +G++DG  Q
Sbjct: 511 GDCRFKSPYGITIDSKGNVIVADKNNHCIQVLTGEGEFLFKFGS---QGKSDGQLQ 563


>gi|29349707|ref|NP_813210.1| hypothetical protein BT_4299 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|298383965|ref|ZP_06993526.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|383120644|ref|ZP_09941370.1| hypothetical protein BSIG_2353 [Bacteroides sp. 1_1_6]
 gi|29341617|gb|AAO79404.1| putative cell surface protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251840310|gb|EES68392.1| hypothetical protein BSIG_2353 [Bacteroides sp. 1_1_6]
 gi|298263569|gb|EFI06432.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 495

 Score = 43.9 bits (102), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 3/76 (3%)

Query: 59  LIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           + + DS  S    +  + P    +V   +   G +GY DG P  A+F  P   AV+  G 
Sbjct: 418 MYIADSGNSCIRVIDTTMPKERAAVTTPIGLPGAEGYKDGGPEIAKFHFPCGVAVNSDGT 477

Query: 117 I-YVADKSNHVIRKIT 131
           I YVAD  N VIRK++
Sbjct: 478 IVYVADTQNKVIRKLS 493



 Score = 37.0 bits (84), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 30/119 (25%), Positives = 51/119 (42%), Gaps = 8/119 (6%)

Query: 79  ESVVKRLAGDGV---QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-- 133
           ES   +   +GV   +G+ DG   +A+F  P+       G +Y+AD  N  IR I     
Sbjct: 377 ESYAGKAIYEGVMNGKGWEDGLLKNAKFRHPRQICFTDDGKMYIADSGNSCIRVIDTTMP 436

Query: 134 ---GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
                 T   G    EG  DG  + A F     +        + ++D  N++IR+++++
Sbjct: 437 KERAAVTTPIGLPGAEGYKDGGPEIAKFHFPCGVAVNSDGTIVYVADTQNKVIRKLSIE 495


>gi|290976601|ref|XP_002671028.1| predicted protein [Naegleria gruberi]
 gi|284084593|gb|EFC38284.1| predicted protein [Naegleria gruberi]
          Length = 925

 Score = 43.9 bits (102), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 55/115 (47%), Gaps = 3/115 (2%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S++     +AG GV GYS DG+  + A+       +V   G + +AD SNH IRKI   
Sbjct: 512 ISQDGSSTVIAGTGVAGYSGDGQLATQAQLFNVSGVSVLPNGEVVIADTSNHRIRKINID 571

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           G + TI G G       +  A N+  +   ++   P    + I D  N  IR+IN
Sbjct: 572 GIIQTICGTGVAGYSGENILATNSQLNLPIDVEVSPISSEIFIVDTNNHRIRKIN 626



 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 8/109 (7%)

Query: 38  IDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
           ++  Q+ I+P  V +DR  + ++ V D   S    +     + + +   AG+G+ GYS D
Sbjct: 50  VNAGQVSISPTKVKVDRK-TGNVYVSDRINSRIRKIDINTGKATTI---AGNGIIGYSGD 105

Query: 96  GEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGG 142
           G+  + A  + P    +    +IY AD  NH IRKI  N  + TIAG G
Sbjct: 106 GQLATQANINNPGKVELSSTNDIYFADTENHRIRKIFANGTIITIAGCG 154



 Score = 38.9 bits (89), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 45/91 (49%), Gaps = 3/91 (3%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGP-AQNAS 157
           GS   D+P   A+   G IYVAD   H + +I+  G +T+  G        DG  A  A 
Sbjct: 482 GSKTSDRPHGIAI-FNGMIYVADPKTHRVIRISQDGSSTVIAGTGVAGYSGDGQLATQAQ 540

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINL 188
             N   ++ +P+   ++I+D  N  IR+IN+
Sbjct: 541 LFNVSGVSVLPN-GEVVIADTSNHRIRKINI 570


>gi|168702582|ref|ZP_02734859.1| NHL repeat [Gemmata obscuriglobus UQM 2246]
          Length = 599

 Score = 43.9 bits (102), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 51/98 (52%), Gaps = 10/98 (10%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
           V D PG   L + DS+           ++  V+    G G  G +DG    A+FD P+  
Sbjct: 269 VADEPGRR-LFIADSTHHRVVVTDLAGNKLHVI----GTGAPGKTDGPFDKAQFDDPQGM 323

Query: 110 AVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
           AV  +G+ ++VAD+ NH++R+I      V TIAG G++
Sbjct: 324 AV--RGDTLFVADRKNHLVREIDLKAKTVATIAGNGTQ 359


>gi|407792655|ref|ZP_11139692.1| NHL repeat containing protein [Idiomarina xiamenensis 10-D-4]
 gi|407217768|gb|EKE87600.1| NHL repeat containing protein [Idiomarina xiamenensis 10-D-4]
          Length = 510

 Score = 43.9 bits (102), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 5/89 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
            V+R+ G G   ++DG   +A F+ P+  A   +  +YVAD  NH +R I  ++  VTT+
Sbjct: 230 TVQRI-GTGNAAFNDGSASAASFNSPQGLAF-AEQTLYVADTGNHALRAIDLSSWQVTTV 287

Query: 139 AGGGS-KKEGRADGPAQNASFSNDFELTF 166
           AG G   +   A G A   +  + +++T 
Sbjct: 288 AGTGQLGRNYNASGKATAVALRSPWDITM 316



 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 58/136 (42%), Gaps = 22/136 (16%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
           LAG G +   DG    A F++P   AV +   +YVAD     IR I  T   V T+ G G
Sbjct: 344 LAGSGREALLDGTAKQAAFNQPSGLAV-VGDTLYVADAEASAIRAIDLTTQKVRTVVGQG 402

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICA--------LLISDHGNQLIRQINLKPEDCS 194
             + G  DG         DF    + H  A        L+++D  N  +R ++L  +  S
Sbjct: 403 LFEFGSKDG---------DFARALLQHPLAVAALTPDQLVVADTYNHQLRLLDLGTQQVS 453

Query: 195 KSSQSGSAL--GAVSV 208
               S S L  G VS+
Sbjct: 454 TLHLSDSLLEPGGVSI 469


>gi|284044371|ref|YP_003394711.1| NHL repeat containing protein [Conexibacter woesei DSM 14684]
 gi|283948592|gb|ADB51336.1| NHL repeat containing protein [Conexibacter woesei DSM 14684]
          Length = 623

 Score = 43.9 bits (102), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 52/104 (50%), Gaps = 6/104 (5%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGS 143
            AG+GV         + R ++P+  +    G   +AD  N+ IR++   G +TT+AG G+
Sbjct: 117 FAGEGVPATD----ATVRLNQPRGVSPTGDGGFLIADTLNNRIRRVDAAGLITTVAGTGA 172

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
                  GPA  A+ +    +  +P   + LI+D GN  IR+++
Sbjct: 173 AAFTGDGGPATAAALNAPQGVAALPD-GSFLIADTGNHRIRRVD 215


>gi|290974456|ref|XP_002669961.1| predicted protein [Naegleria gruberi]
 gi|284083515|gb|EFC37217.1| predicted protein [Naegleria gruberi]
          Length = 1106

 Score = 43.5 bits (101), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 35/120 (29%), Positives = 58/120 (48%), Gaps = 2/120 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +S + +V  +AG   + GY+  +P  +  + P S +    G+++  D+ N+VIRKI+N  
Sbjct: 806 VSSDGIVSTIAGVSQIHGYNLYDPQESILNGPTSLSSAKNGDLFFIDQKNYVIRKISNGI 865

Query: 135 VTTIAGGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           VTTI G  +++    +G   +    S    L   P   AL+ SD     I   NL  E+ 
Sbjct: 866 VTTIIGTPNERCDMEEGKRGRETCLSMPQSLAVSPGGDALVFSDLSTIGIFHYNLNDENA 925



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 50/100 (50%), Gaps = 10/100 (10%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSARFD 104
           NP  V++R G     +  + RS    +    +E  V   + G+    ++ DG+ G+    
Sbjct: 528 NPRDVMERNG----YLYYAERSIICKIDLVSNELEV---MVGNRQTSFNGDGKKGNETSI 580

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGG 142
            P+S  V  +G+IY AD SNH IRK T     V+TIAG G
Sbjct: 581 GPESIYVTEEGDIYFADISNHRIRKWTRKTDLVSTIAGTG 620


>gi|290971759|ref|XP_002668647.1| predicted protein [Naegleria gruberi]
 gi|284082132|gb|EFC35903.1| predicted protein [Naegleria gruberi]
          Length = 355

 Score = 43.5 bits (101), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 56/114 (49%), Gaps = 6/114 (5%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFD--KPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +    ++  +AGDG++GY DG+   A     K        K  IY+AD  ++ IRKI   
Sbjct: 130 IDRNGIITTIAGDGIEGY-DGDNQLATLSSLKHPCSVFQYKDEIYIADSFSNRIRKIDRD 188

Query: 134 GVT-TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G+  TIAG G +        A + S +N   L FV H   +  SD  NQ+IR+I
Sbjct: 189 GIIHTIAGNGIEGYNGDGRNATDTSLNNPNSL-FV-HNDEIYFSDLDNQVIRKI 240



 Score = 37.4 bits (85), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 53/114 (46%), Gaps = 4/114 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +  + ++  +AG+G++GY+ DG   +    + P S  V     IY +D  N VIRKI   
Sbjct: 185 IDRDGIIHTIAGNGIEGYNGDGRNATDTSLNNPNSLFV-HNDEIYFSDLDNQVIRKILPN 243

Query: 134 G-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G +  IAG   KK    D      +  +  +  FV     +  +D+ N  +R+I
Sbjct: 244 GIIKNIAGIPGKKGYNGDYKLATRTTLSWPKGIFVDSKNQVYFADYYNHCVRKI 297



 Score = 37.0 bits (84), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 41/94 (43%), Gaps = 13/94 (13%)

Query: 81  VVKRLAG-DGVQGYSDGEPGSAR--FDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
           ++K +AG  G +GY+     + R     PK   VD K  +Y AD  NH +RKI  N  + 
Sbjct: 245 IIKNIAGIPGKKGYNGDYKLATRTTLSWPKGIFVDSKNQVYFADYYNHCVRKIFKNGNMK 304

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
            IAG G       D P         F+    PHI
Sbjct: 305 IIAGIGDVFGYSGDVP---------FDFQQYPHI 329


>gi|441145773|ref|ZP_20963891.1| hypothetical protein SRIM_07703 [Streptomyces rimosus subsp.
           rimosus ATCC 10970]
 gi|440620877|gb|ELQ83900.1| hypothetical protein SRIM_07703 [Streptomyces rimosus subsp.
           rimosus ATCC 10970]
          Length = 615

 Score = 43.5 bits (101), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 62/124 (50%), Gaps = 13/124 (10%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG +  +V D++R     L+     E+V++R+ G G +G+     G+  F +P
Sbjct: 185 PGKALRLPGGT-FLVSDTTRHQLVELA--ADGETVLRRI-GSGERGF-----GADAFSEP 235

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGRA-DGPAQNASFSNDF 162
           +  A+   G + VAD  NH IR        V T+AG G +  +G A  GPA+    S+ +
Sbjct: 236 QGLALLPDGKVAVADTVNHAIRTFDPETGEVATVAGTGKQWWQGSATSGPAREVDLSSPW 295

Query: 163 ELTF 166
           ++ +
Sbjct: 296 DVAW 299


>gi|302535478|ref|ZP_07287820.1| NHL repeat containing protein [Streptomyces sp. C]
 gi|302444373|gb|EFL16189.1| NHL repeat containing protein [Streptomyces sp. C]
          Length = 612

 Score = 43.5 bits (101), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 12/115 (10%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S +L+V DS+R     L      E+VV+R+ G G +G+     G   F +P+  A+   G
Sbjct: 198 SGNLLVSDSTRHQLVELE--ADGETVVRRI-GSGERGF-----GPDAFSEPQGLALLPDG 249

Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
            + VAD  NH +R +      V T+AG G +  +     GPA     S+ +++ +
Sbjct: 250 RVVVADTVNHALRVLDPETGAVETVAGTGRQWWQGSPTSGPALEVDLSSPWDVAW 304


>gi|295394954|ref|ZP_06805167.1| NHL repeat containing protein [Brevibacterium mcbrellneri ATCC
           49030]
 gi|294972287|gb|EFG48149.1| NHL repeat containing protein [Brevibacterium mcbrellneri ATCC
           49030]
          Length = 655

 Score = 43.5 bits (101), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 53/120 (44%), Gaps = 23/120 (19%)

Query: 87  GDGVQGYSDGEPGSARFDKP-------KSFAVDMKGNIYVADKSNHVIRKIT--NLGVTT 137
           G G +G +DG+  +A+F +P          A  +  ++  AD  NH +R I      VTT
Sbjct: 231 GSGERGNADGDFATAQFSEPGGVAQLPPEIAQQVGYHLVAADTVNHTLRGINLETQTVTT 290

Query: 138 IAGGGSKKE--------------GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
           +AG G +                GR DGPA     S+ +++ + P    ++I+  GN  +
Sbjct: 291 VAGTGEQYMVGASDNVPDQPGFFGRYDGPATAVKLSSPWDVVYAPKTSNVVIAMAGNHTM 350



 Score = 40.8 bits (94), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 36/150 (24%), Positives = 67/150 (44%), Gaps = 12/150 (8%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD- 104
           +P  V+  P +S++++  +     +T   P++    V  ++G   +G  D    +  FD 
Sbjct: 327 SPWDVVYAPKTSNVVIAMAGNHTMWTFD-PVA--GTVAHISGSLNEGLRDTSAAATPFDL 383

Query: 105 -----KPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNAS 157
                +P    +   GN++VAD     +RK+  +   VT+  G G    G  DGPA++A 
Sbjct: 384 DEWFAQPSGVRLASDGNVWVADSETSALRKLDPSTGAVTSYVGVGLFDFGFQDGPAESAR 443

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQIN 187
             +      +P    + I+D  N  IR+ +
Sbjct: 444 LQHPLGALELPD-GTVAIADTYNGAIRRFD 472


>gi|326434843|gb|EGD80413.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
          Length = 1449

 Score = 43.5 bits (101), Expect = 0.089,   Method: Composition-based stats.
 Identities = 32/112 (28%), Positives = 48/112 (42%), Gaps = 4/112 (3%)

Query: 81   VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-- 136
            V+    G G  G+    G   +A+   P   A D +G +Y+AD  NHV+  +    V   
Sbjct: 1216 VLSVFVGTGEAGHRGMGGSSRAAQIHSPHGLAFDSRGTLYIADTENHVVYGVDRRSVVID 1275

Query: 137  TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
             +AG   +K    DG     ++ N      V     L I+D GN  IR ++L
Sbjct: 1276 VVAGTPFRKGYHGDGRPATTAWLNAPTGIAVRGAGDLYIADKGNNRIRYVDL 1327



 Score = 39.7 bits (91), Expect = 1.5,   Method: Composition-based stats.
 Identities = 43/155 (27%), Positives = 71/155 (45%), Gaps = 14/155 (9%)

Query: 45   INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSA-- 101
            +N  + I   G+ DL + D   +    +   L    ++  L G G  G   DG   +A  
Sbjct: 1298 LNAPTGIAVRGAGDLYIADKGNNRIRYVD--LDSYKIIDTLVGTGRYGGDVDGSTTTAAL 1355

Query: 102  --RFDKPKSFAVDMKGNIYV-ADKSNHVIRKITNLG----VTTIAGGGSKKEGRADGPAQ 154
                D P+  AV+ KG + V AD + HV+R ++  G    VT +AG GS+     DG   
Sbjct: 1356 ETNLDHPEGVAVNAKGTMLVIADTNKHVLRSVSLDGGDSPVTVLAGTGSRG-FNGDGNEP 1414

Query: 155  NAS-FSNDFELTFVPHICALLISDHGNQLIRQINL 188
            NA+   +   + +     A++  D  N+ IR+++L
Sbjct: 1415 NATNLHSPVSVAYDIGERAIVFVDQQNRRIRRVHL 1449



 Score = 39.3 bits (90), Expect = 1.7,   Method: Composition-based stats.
 Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 3/81 (3%)

Query: 80   SVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVT 136
            S +  +AG   +G +   G   +A+ D+P   AVD  GN+Y+++K NH +  +  + GV 
Sbjct: 1158 STITTIAGTKTRGSTGVGGLATAAQLDRPTCVAVDTMGNVYISEKGNHRVSVVDASTGVL 1217

Query: 137  TIAGGGSKKEGRADGPAQNAS 157
            ++  G  +   R  G +  A+
Sbjct: 1218 SVFVGTGEAGHRGMGGSSRAA 1238


>gi|345882347|ref|ZP_08833836.1| hypothetical protein HMPREF0666_00012 [Prevotella sp. C561]
 gi|345045612|gb|EGW49528.1| hypothetical protein HMPREF0666_00012 [Prevotella sp. C561]
          Length = 496

 Score = 43.5 bits (101), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 49/104 (47%), Gaps = 6/104 (5%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-----TNLGVTTIAGGGSKK 145
           +G+ DG   +A+F  P+       G +Y+AD  N  IR I      N  VTT   G  + 
Sbjct: 394 KGWEDGALKNAKFCFPRQMTFTKDGKLYIADSGNQCIRMIDTTQGKNARVTTPI-GVPQS 452

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            G  DG  + A F+    +        + ++D  NQ+IR++++K
Sbjct: 453 AGFQDGGVELAKFNWPTGVAVSADGSTVYVADSKNQVIRELSIK 496


>gi|390956314|ref|YP_006420071.1| hypothetical protein Terro_0387 [Terriglobus roseus DSM 18391]
 gi|390411232|gb|AFL86736.1| hypothetical protein Terro_0387 [Terriglobus roseus DSM 18391]
          Length = 1224

 Score = 43.5 bits (101), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 44/89 (49%), Gaps = 6/89 (6%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASF 158
           A    P   +VD  G++Y  D SNHV+RK+T     +TTIAG GS         A++  F
Sbjct: 262 ANIGWPSGMSVDANGDLYFVDISNHVVRKVTKSTGLITTIAGAGSNFPYNTVQLAKDTEF 321

Query: 159 SNDFELTFVPHICALLISDHG---NQLIR 184
            +  +L  +P    LL+ D G    Q+ R
Sbjct: 322 ISPEDLLILPS-GDLLVDDSGVGAGQIFR 349


>gi|380694052|ref|ZP_09858911.1| hypothetical protein BfaeM_08738 [Bacteroides faecis MAJ27]
          Length = 495

 Score = 43.5 bits (101), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 3/76 (3%)

Query: 59  LIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           + + DS  S    +  + P    +V   +   G +GY DG P  A+F  P   AV+  G 
Sbjct: 418 MYIADSGNSCIRVVDTTMPKERAAVTTPIGLPGAEGYKDGGPEIAKFHFPCGVAVNSDGT 477

Query: 117 I-YVADKSNHVIRKIT 131
           I YVAD  N VIRK++
Sbjct: 478 IVYVADTQNKVIRKLS 493


>gi|334335815|ref|YP_004540967.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ Mal
           allergen [Isoptericola variabilis 225]
 gi|334106183|gb|AEG43073.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Isoptericola variabilis 225]
          Length = 668

 Score = 43.5 bits (101), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 47/186 (25%), Positives = 73/186 (39%), Gaps = 35/186 (18%)

Query: 48  HSVIDRPGSSDLIVLDS-SRSAFYTLSFPLSEESVVKRLAGDG----------------- 89
           H V+D P   +L+   S +  A+ TL     E  VV ++AG+G                 
Sbjct: 112 HPVLDDP---ELVTWGSYTARAWPTLVVVDPEGYVVAQMAGEGHASAIAALVRELVGEHE 168

Query: 90  VQG--------YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
            +G        Y   EP S     P        GN+ VAD  +H + ++   G T +   
Sbjct: 169 AKGTLHRGDAPYVPPEPTSGTLRFPAKAVALPNGNLLVADAGHHSLAELAPDGETLVRRI 228

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSK 195
           GS   G  DG A  A FS    L  VP +        +L++D  N  +R + L+  + + 
Sbjct: 229 GSGARGLVDGAADVARFSEPNGLCLVPEVLRATLGYDVLVADTVNHALRGVRLETGEVTT 288

Query: 196 SSQSGS 201
            + +G 
Sbjct: 289 VAGTGE 294



 Score = 38.9 bits (89), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 40/91 (43%), Gaps = 9/91 (9%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV----DMKGNIYVADKSNHVIRKI 130
           P  E   V+ LAG   +G  DG  G+A F +P   A     D  G++++AD     +R +
Sbjct: 358 PGPESGTVRLLAGTMNEGLEDGPGGTAWFAQPSGLAPVLGSDADGSLWLADAETSALRLV 417

Query: 131 T-----NLGVTTIAGGGSKKEGRADGPAQNA 156
           T      + V T  G G    G  DG A  A
Sbjct: 418 TPAADGQVQVRTAVGQGLFDFGHRDGAADQA 448


>gi|326923951|ref|XP_003208196.1| PREDICTED: NHL repeat-containing protein 2-like [Meleagris
           gallopavo]
          Length = 727

 Score = 43.5 bits (101), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 42/79 (53%), Gaps = 8/79 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT-NLG-VTTIAG---GGSKK 145
           G  DG    A F+ P+  A+  K N IYVAD  NH+IRKI  +L  VTT+AG    G  K
Sbjct: 262 GRKDGRFSEAAFNSPQGVAI--KNNVIYVADTENHLIRKIDLDLEIVTTVAGIGIQGVDK 319

Query: 146 EGRADGPAQNASFSNDFEL 164
           EG A G  Q  S   D  L
Sbjct: 320 EGGAKGEEQPISSPWDVVL 338


>gi|390559793|ref|ZP_10244078.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
 gi|390173637|emb|CCF83377.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
          Length = 442

 Score = 43.5 bits (101), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 50/100 (50%), Gaps = 9/100 (9%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+     + L + DS  +         +E  ++    G G  G +DG+  +A+F+ P
Sbjct: 128 PGKVLADAAGNRLFIADSGHNRIVVTRLDGAEPWMI----GSGQPGLADGDAATAQFNHP 183

Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAGGGS 143
           +  A  + G+I Y+AD  NHVIR++   +  V TIAG G 
Sbjct: 184 QGMA--LAGDILYLADTRNHVIRRVDLASRQVETIAGTGE 221



 Score = 37.7 bits (86), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 61/145 (42%), Gaps = 12/145 (8%)

Query: 49  SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKP 106
           S ID  G+  L V DS  SA  T   P     +V+ + G G+  + D  G     R   P
Sbjct: 295 SGIDTDGTR-LYVADSETSAIRTADLP--PRDLVQTIVGIGLFEFGDVDGTGNQVRLQHP 351

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFEL 164
              AV   G +YVAD  NH I+++  T     T  G G+   G  DG  + A F     +
Sbjct: 352 LGLAVG-DGVVYVADSYNHKIKRLYPTERRCETWLGDGTP--GDRDGVREAARFHEPGGV 408

Query: 165 TFVPHICALLISDHGNQLIRQINLK 189
           +       L I+D  N  IR   L+
Sbjct: 409 SLAGD--RLYIADTNNHAIRVAELE 431


>gi|124009915|ref|ZP_01694581.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
 gi|123984066|gb|EAY24439.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
          Length = 768

 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 50/92 (54%), Gaps = 4/92 (4%)

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQ 154
           + GSA  + P    +D+ GN+Y+A+ S +V+RK+  ++  + TIAGGG+   G   G A 
Sbjct: 274 DAGSATINGPTGIHIDVSGNLYIAEFSGNVVRKVNASDNKINTIAGGGASDPGNG-GLAT 332

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           +A  S  F L  +     + I +  N  IR++
Sbjct: 333 SAVVSLPFGL-VIDAAGDMYIPEFTNGKIRKV 363


>gi|297687408|ref|XP_002821208.1| PREDICTED: NHL repeat-containing protein 2 [Pongo abelii]
          Length = 726

 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDIVF 343


>gi|373253064|ref|ZP_09541182.1| thiol-disulfide isomerase-like thioredoxin [Nesterenkonia sp. F]
          Length = 659

 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 50/110 (45%), Gaps = 7/110 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG------V 135
           ++ LAG G++G +DG    A F +    A D  G++++AD     +R +          V
Sbjct: 360 LRVLAGTGLEGLTDGSADEAWFAQTSGLAADAAGDVWIADSETSSLRVLRTAEDPKAPRV 419

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
            TI G G    G  DG    A   +   +  +P   ++L++D  N  IR+
Sbjct: 420 ETIVGEGMFDFGFRDGDPAQARLQHPLGVAALPD-GSVLVADTYNGAIRR 468


>gi|449435944|ref|XP_004135754.1| PREDICTED: NHL repeat-containing protein 2-like [Cucumis sativus]
          Length = 1086

 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 6/77 (7%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAG 140
           ++   G  G  DG    A F++P+  A + K N+ YVAD  NH +R++  +   V T+AG
Sbjct: 639 QIGSTGEDGLRDGNFDDATFNRPQGLAYNAKKNLLYVADTENHALREVDFVKERVRTLAG 698

Query: 141 GGSK---KEGRADGPAQ 154
            GSK    +G  +G +Q
Sbjct: 699 DGSKGSDYQGGKEGTSQ 715



 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 41/83 (49%), Gaps = 6/83 (7%)

Query: 115 GNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA 172
           G IYVAD  NH ++ +  +   VTTIAG G  K G  DG A  A  S    +T       
Sbjct: 850 GQIYVADSYNHKVKMLDPVSKKVTTIAGTG--KAGFKDGTALEAQLSEPSGITEAGG--R 905

Query: 173 LLISDHGNQLIRQINLKPEDCSK 195
           L I+D  N +IR + L   + S+
Sbjct: 906 LFIADTNNNVIRYLYLNNREQSQ 928


>gi|158293254|ref|XP_314572.4| AGAP010610-PA [Anopheles gambiae str. PEST]
 gi|157016870|gb|EAA09865.4| AGAP010610-PA [Anopheles gambiae str. PEST]
          Length = 730

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 40/156 (25%), Positives = 64/156 (41%), Gaps = 20/156 (12%)

Query: 2   ASHFVYSSLAITLLSLLFSLVSSGLLLED----GYTVTTVIDGHQLEINPHSVIDRPGSS 57
           A  F      I   SL  +LVSSG +  +    G    +V  G          +   G+ 
Sbjct: 201 AIRFYREKGEIKRHSLPINLVSSGAIASNMKYPGKVACSVPTGAGGSEEQLFAVSDSGNH 260

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
            ++++DS+ +  + +              G    G+ DG    ARF+ P+  A      +
Sbjct: 261 RVLIVDSAGTVLHKV--------------GGKQSGFVDGNFTKARFNAPQGVAFQGTDVV 306

Query: 118 YVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADG 151
           +VAD  NH +R+I      V+TIAG G++   R  G
Sbjct: 307 FVADNENHAVRRIDLKARLVSTIAGNGTQGNDRTGG 342


>gi|57529823|ref|NP_001006504.1| NHL repeat-containing protein 2 [Gallus gallus]
 gi|82081010|sp|Q5ZI67.1|NHLC2_CHICK RecName: Full=NHL repeat-containing protein 2
 gi|53136494|emb|CAG32576.1| hypothetical protein RCJMB04_29n5 [Gallus gallus]
          Length = 727

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 45/82 (54%), Gaps = 12/82 (14%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLG---VTTIAG---GGSK 144
           G  DG    A F+ P+  A+  K N IYVAD  NH+IRKI +L    VTT+AG    G  
Sbjct: 262 GRKDGRFSEAAFNSPQGVAI--KNNVIYVADTENHLIRKI-DLELEIVTTVAGIGIQGVD 318

Query: 145 KEGRADGPAQNASFSNDFELTF 166
           KEG A G  Q    S+ +++ F
Sbjct: 319 KEGGAKGEEQ--PISSPWDVVF 338


>gi|345324001|ref|XP_001513492.2| PREDICTED: NHL repeat-containing protein 2-like [Ornithorhynchus
           anatinus]
          Length = 725

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 47/89 (52%), Gaps = 20/89 (22%)

Query: 81  VVKRLAGDGVQGYSDGEPGSAR---------FDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
           VVK+   +G   YS G P S R         F+ P+  A+  K NI YVAD  NH+IRKI
Sbjct: 250 VVKK---NGQIQYSIGGPNSGRKDGTFSESAFNSPQGVAI--KNNIIYVADTENHLIRKI 304

Query: 131 --TNLGVTTIAG---GGSKKEGRADGPAQ 154
                 V+T+AG    G+ KEG A G AQ
Sbjct: 305 DLEVEQVSTVAGVGVQGTDKEGGAKGEAQ 333


>gi|149279484|ref|ZP_01885614.1| hypothetical protein PBAL39_16831 [Pedobacter sp. BAL39]
 gi|149229777|gb|EDM35166.1| hypothetical protein PBAL39_16831 [Pedobacter sp. BAL39]
          Length = 742

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 46/106 (43%), Gaps = 1/106 (0%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
           R AG+G  G +DG   SA    PK F  D KGN+Y +D  +   RK+  +  T       
Sbjct: 638 RWAGNGNFGNADGPIKSAMLVSPKGFVGDGKGNMYFSDGFSQNTRKVDLIQNTISTVSAQ 697

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
              G  DG    A F     +  +     + I D GN  IR++ LK
Sbjct: 698 TTRGFKDGGLNEAQFMQRDAMA-MDKDGNIYIVDMGNNAIRKLFLK 742


>gi|73667964|ref|YP_303979.1| hypothetical protein Mbar_A0416 [Methanosarcina barkeri str.
           Fusaro]
 gi|72395126|gb|AAZ69399.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
          Length = 346

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 30/112 (26%), Positives = 47/112 (41%), Gaps = 21/112 (18%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
           GS +F  P   AVD  GN+YV D  NH I+K  + G      G +  + R          
Sbjct: 46  GSGQFACPTGVAVDSSGNVYVTDTGNHRIQKFNSTGGYLTQWGSNGTDNRQ--------- 96

Query: 159 SNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
                  F+P+  A      + ++D GN+ I++ N      ++   SG+  G
Sbjct: 97  ------FFLPYGVAVDSSGNVYVADKGNKCIQKFNSNGGHLTQWGSSGNGNG 142



 Score = 40.8 bits (94), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 25/36 (69%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           G+ +F+ P S AVD  GN+YVADK N  I+K  ++G
Sbjct: 234 GNGQFEFPLSIAVDSSGNVYVADKYNQRIQKFNSIG 269



 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 11/90 (12%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
           G+ +F+ P+  AVD  GN+YVAD  N+ I+K  + G   +   GS   G       N  F
Sbjct: 187 GNGQFNDPEGVAVDSSGNVYVADSGNNRIQKFNSTG-GYLTQWGSYGSG-------NGQF 238

Query: 159 SNDFELTF-VPHICALLISDHGNQLIRQIN 187
             +F L+  V     + ++D  NQ I++ N
Sbjct: 239 --EFPLSIAVDSSGNVYVADKYNQRIQKFN 266


>gi|42476013|ref|NP_940916.2| NHL repeat-containing protein 2 [Homo sapiens]
 gi|74762548|sp|Q8NBF2.1|NHLC2_HUMAN RecName: Full=NHL repeat-containing protein 2
 gi|21748831|dbj|BAC03493.1| unnamed protein product [Homo sapiens]
          Length = 726

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343


>gi|403259474|ref|XP_003922237.1| PREDICTED: NHL repeat-containing protein 2 [Saimiri boliviensis
           boliviensis]
          Length = 726

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343


>gi|114632912|ref|XP_508046.2| PREDICTED: NHL repeat-containing protein 2 [Pan troglodytes]
 gi|410221294|gb|JAA07866.1| NHL repeat containing 2 [Pan troglodytes]
 gi|410252040|gb|JAA13987.1| NHL repeat containing 2 [Pan troglodytes]
 gi|410252042|gb|JAA13988.1| NHL repeat containing 2 [Pan troglodytes]
 gi|410252044|gb|JAA13989.1| NHL repeat containing 2 [Pan troglodytes]
 gi|410252046|gb|JAA13990.1| NHL repeat containing 2 [Pan troglodytes]
 gi|410292446|gb|JAA24823.1| NHL repeat containing 2 [Pan troglodytes]
 gi|410341451|gb|JAA39672.1| NHL repeat containing 2 [Pan troglodytes]
 gi|410341453|gb|JAA39673.1| NHL repeat containing 2 [Pan troglodytes]
          Length = 726

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343


>gi|397510538|ref|XP_003825652.1| PREDICTED: NHL repeat-containing protein 2 [Pan paniscus]
          Length = 726

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343


>gi|449524308|ref|XP_004169165.1| PREDICTED: NHL repeat-containing protein 2-like, partial [Cucumis
           sativus]
          Length = 892

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 6/77 (7%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAG 140
           ++   G  G  DG    A F++P+  A + K N+ YVAD  NH +R++  +   V T+AG
Sbjct: 639 QIGSTGEDGLRDGNFDDATFNRPQGLAYNAKKNLLYVADTENHALREVDFVKERVRTLAG 698

Query: 141 GGSK---KEGRADGPAQ 154
            GSK    +G  +G +Q
Sbjct: 699 DGSKGSDYQGGKEGTSQ 715


>gi|224052833|ref|XP_002198291.1| PREDICTED: NHL repeat-containing protein 2 [Taeniopygia guttata]
          Length = 727

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 44/81 (54%), Gaps = 10/81 (12%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSK---K 145
           G  DG    A F+ P+  A+  K N IYVAD  NH+IRKI      VTT+AG G +   K
Sbjct: 262 GRRDGRFSEAAFNSPQGIAI--KNNVIYVADTENHLIRKIDLELQMVTTVAGIGVQGVDK 319

Query: 146 EGRADGPAQNASFSNDFELTF 166
           EG A G  Q    S+ +++ F
Sbjct: 320 EGGAKGEEQ--PISSPWDVVF 338


>gi|197121735|ref|YP_002133686.1| hypothetical protein AnaeK_1325 [Anaeromyxobacter sp. K]
 gi|196171584|gb|ACG72557.1| YD repeat protein [Anaeromyxobacter sp. K]
          Length = 2350

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 58/115 (50%), Gaps = 9/115 (7%)

Query: 79   ESVVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLG- 134
            + +++R AG  +     G+ G ARF K   P   +V   G++Y+AD  N  IR++   G 
Sbjct: 1230 DGIIERFAG--LSSGFGGDGGPARFAKLRGPNGVSVGPDGSVYIADTYNARIRRVDPSGI 1287

Query: 135  VTTIAGGG-SKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            + +IAG G ++ +   D GPA  A  +  ++    P    L I+D  N  IR++ 
Sbjct: 1288 IESIAGSGFAQPQFWGDGGPALAARLNGTWQAVVAPD-GDLFIADSFNARIRRVG 1341


>gi|298705700|emb|CBJ28938.1| NHL repeat-containing protein [Ectocarpus siliculosus]
          Length = 631

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 42/92 (45%), Gaps = 5/92 (5%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV--TTIAGGGSKKEGRADGPAQNAS 157
           SA    PK  AVD   NIYVA+     IR I+  G+  T +  G +  EG   GPA  A 
Sbjct: 535 SAVLAYPKGIAVDADDNIYVAEFQGEKIRNISTSGIISTCVGTGSTAYEGDG-GPASEAG 593

Query: 158 FSNDFELTFVPHICA--LLISDHGNQLIRQIN 187
                 + F+P      + I+D G+  +R + 
Sbjct: 594 LDLPNAIAFMPSTPGSRMAIADFGHNAVRVVT 625


>gi|290983130|ref|XP_002674282.1| predicted protein [Naegleria gruberi]
 gi|284087871|gb|EFC41538.1| predicted protein [Naegleria gruberi]
          Length = 1572

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 42/80 (52%), Gaps = 1/80 (1%)

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           G++Y+AD S   IRKI N G+ ++  G  +K    DG A + S  + F +        + 
Sbjct: 74  GSMYIADASMAAIRKIDNNGIISLVAGNYRKTIPGDGSALSVSVISPFSMVLTQDETIMY 133

Query: 175 ISDHGNQLIRQINLKPEDCS 194
           I+  GN  IR++++K  + S
Sbjct: 134 IA-QGNGCIRKLDIKNGNIS 152



 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 52/123 (42%), Gaps = 11/123 (8%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGN-I 117
           ++L  SRS+   L         +  +AG  GV GY DG+     F +  S AVD   N I
Sbjct: 590 LILSHSRSSIRMLK---KSSGFITNIAGTFGVSGYKDGKTSLFSF-RLNSLAVDRNDNSI 645

Query: 118 YVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISD 177
           Y++D  N  IRK+     T     G    G +   + + S   D  +TFV     + +  
Sbjct: 646 YISDNGNLAIRKL-----TPYCDDGWILSGNSTDCSPDISIKFDNTITFVDSTVLIAVKY 700

Query: 178 HGN 180
           H N
Sbjct: 701 HSN 703


>gi|391330742|ref|XP_003739813.1| PREDICTED: NHL repeat-containing protein 2-like [Metaseiulus
           occidentalis]
          Length = 685

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 6/88 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
           V+ + G G  G+ DG  G  RF  P+       G I+VAD  NH IR++  +   V+T+ 
Sbjct: 258 VQLIVGTGEAGFLDGAYGICRFSSPQGVCF-YDGGIFVADAGNHAIRRVDFSTKCVSTVV 316

Query: 140 GGGSKK---EGRADGPAQNASFSNDFEL 164
           G G +     G  DG  Q  S   D  L
Sbjct: 317 GTGKQGVDLVGNLDGNVQEISTPWDVVL 344


>gi|444916089|ref|ZP_21236213.1| hypothetical protein D187_08495 [Cystobacter fuscus DSM 2262]
 gi|444712768|gb|ELW53683.1| hypothetical protein D187_08495 [Cystobacter fuscus DSM 2262]
          Length = 2336

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 61/118 (51%), Gaps = 10/118 (8%)

Query: 76   LSEESVVKRLAGD-GVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
            ++ E ++  LAG    +G+   +G   +A  + P   AV   G++YV+D +NH +R+I +
Sbjct: 1111 VTPEGIIHTLAGKPNARGFCGDNGLASAACLNGPWDVAVGKAGDVYVSDSANHRVRRIGS 1170

Query: 133  LG-VTTIAGGGSKK--EGRA---DGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
             G +TT+AG G     EG +    GPAQ A  S    L        L I+DH +++ R
Sbjct: 1171 NGRITTVAGTGDDGSLEGISIGDGGPAQQALLSAPKGLALDSE-GNLYIADHFSRVRR 1227



 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 62/135 (45%), Gaps = 14/135 (10%)

Query: 82   VKRLAGDG----VQGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
            +  +AG G    ++G S G+ G A+      PK  A+D +GN+Y+AD  + V R   N  
Sbjct: 1174 ITTVAGTGDDGSLEGISIGDGGPAQQALLSAPKGLALDSEGNLYIADHFSRVRRVDANGI 1233

Query: 135  VTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN-----L 188
            +TT AG         +G PA    F +   L   P   +  +SD  N  +R+++     L
Sbjct: 1234 ITTYAGQLEASGFSGNGTPALQGKFDSPTGLAVGPD-GSCYVSDEWNHSVRRVSYPGAFL 1292

Query: 189  KPEDCSKSSQSGSAL 203
            +  +    SQ GS L
Sbjct: 1293 RENEAWVPSQDGSEL 1307



 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 9/92 (9%)

Query: 100  SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT-TIAGGGSKKEGRA----DGPAQ 154
            SAR   P+  A+  +GN+Y+AD  N  +R +T  G+  T+AG   K   R     +G A 
Sbjct: 1081 SARLSYPRGIALGKEGNLYIADFDNDRVRYVTPEGIIHTLAG---KPNARGFCGDNGLAS 1137

Query: 155  NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             A  +  +++  V     + +SD  N  +R+I
Sbjct: 1138 AACLNGPWDVA-VGKAGDVYVSDSANHRVRRI 1168


>gi|358458536|ref|ZP_09168745.1| NHL repeat containing protein [Frankia sp. CN3]
 gi|357078265|gb|EHI87715.1| NHL repeat containing protein [Frankia sp. CN3]
          Length = 674

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 50/94 (53%), Gaps = 6/94 (6%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRAD---GPAQN 155
           A  + P+  AV   G +Y+AD +N+ +RKI  ++  +TT+AG G+     +D   G A  
Sbjct: 461 AGLNSPQGIAVTSDGTVYIADTANNRVRKIDPSSGTITTVAGTGASTGSVSDDDGGLATQ 520

Query: 156 ASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           A  S   ++   P   AL I D G+  IR+++ +
Sbjct: 521 ADLSAPADVAVGPG-GALYIVDTGHDRIRKVDAQ 553



 Score = 41.2 bits (95), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 61/138 (44%), Gaps = 6/138 (4%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           + +  VV   AG GV  +S DG P   A+   P   AV   G +Y+ D  N  IRKI   
Sbjct: 374 IDKAGVVSDFAGTGVSTFSGDGGPAIRAQVGFPNGVAVTDDGTVYIDDNHNDRIRKIDPS 433

Query: 134 G-VTTIAG---GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           G + TIAG   G        D  A   +  N  +   V     + I+D  N  +R+I+  
Sbjct: 434 GTIRTIAGIGTGDGHGTFSGDNSAATKAGLNSPQGIAVTSDGTVYIADTANNRVRKIDPS 493

Query: 190 PEDCSKSSQSGSALGAVS 207
               +  + +G++ G+VS
Sbjct: 494 SGTITTVAGTGASTGSVS 511


>gi|219852663|ref|YP_002467095.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219546922|gb|ACL17372.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 522

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 32/50 (64%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           L GD V   S    G+++F++P S AVD  G++YVAD  N+ I+K T+ G
Sbjct: 216 LTGDLVATRSISNSGASQFNRPTSVAVDTGGSVYVADTGNNRIQKFTSSG 265



 Score = 42.4 bits (98), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 9/105 (8%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNAS 157
           G+ +F+ P+  AVD  GN+YVAD  N+ I K T+ G + T  G    + G+ + P   A 
Sbjct: 134 GTNQFNSPEGIAVDGAGNVYVADTGNNRIEKFTSSGDIVTSWGSYGSEVGQFNRPTSVAV 193

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
            +          I  + ++D GN  I++  L  +  +  S S S 
Sbjct: 194 DNT--------GIGYIYVADTGNNRIQKFTLTGDLVATRSISNSG 230



 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 41/159 (25%), Positives = 65/159 (40%), Gaps = 30/159 (18%)

Query: 51  IDRPGSSDLIVLDSSRSAFYT-LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
           +D  G+  +  +++ R +F+T  S P    S+ +  +G G            +F  P   
Sbjct: 50  VDGTGNVYVADMNNDRISFFTKASLPQMPSSIGRIGSGHG------------QFFYPHGV 97

Query: 110 AVDMKGNIYVADKSNHVIRKIT-NLGVTT---IAGGGSKKEGRADGPAQNASFSNDFELT 165
           AVD  GN+YVAD  NH I+K T N    T   I G G+ +    +G A            
Sbjct: 98  AVDSTGNVYVADTGNHQIQKFTVNGNFNTQWGIKGSGTNQFNSPEGIA------------ 145

Query: 166 FVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
            V     + ++D GN  I +     +  +     GS +G
Sbjct: 146 -VDGAGNVYVADTGNNRIEKFTSSGDIVTSWGSYGSEVG 183


>gi|290976426|ref|XP_002670941.1| predicted protein [Naegleria gruberi]
 gi|284084505|gb|EFC38197.1| predicted protein [Naegleria gruberi]
          Length = 872

 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 76/160 (47%), Gaps = 13/160 (8%)

Query: 34  VTTVIDGHQLEINPHSV--IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-V 90
           VT+ IDG +   +P +   +      ++   D +R +  T++       ++K +AG+G +
Sbjct: 284 VTSSIDGERALDSPLTAFGVSTSNEGEVYFADYARVSVVTIN------GLIKHVAGNGKL 337

Query: 91  QGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGR 148
           Q   DG   + ++  KP+  AV   G ++++D +   IRK++  GV  TIAGG     G 
Sbjct: 338 QYLGDGNLATQSQLFKPRGVAVSTSGELFISDSAQDTIRKVSTEGVINTIAGGTFPLGGY 397

Query: 149 -ADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             DG    N   +N   L F      L +SD  N  +R+I
Sbjct: 398 IQDGVDPLNTFLNNPSALEFSSSSGELFLSDSHNFRVRKI 437



 Score = 40.8 bits (94), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 79  ESVVKRLAGDGVQGYSDGE--PGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITNLG- 134
           E +++ +AG G+ G++       SA+ + P    V  + G I + D +NH IRKI + G 
Sbjct: 538 EGIIQTIAGTGIPGFNGDNILATSAQLNSPTDIEVSPVSGEIIIVDSNNHRIRKIDSFGN 597

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           ++TIAG G          +   S +    L+       + I+D+ N+ IR++
Sbjct: 598 ISTIAGTGIASLSSHGKLSIYGSLNTPTTLSISAFSGHVYIADYENRRIRKL 649



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 53/109 (48%), Gaps = 3/109 (2%)

Query: 82  VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTI 138
           V  +AG+G  GYS DG   + A+        V   G + +AD +NH IRK+   G + TI
Sbjct: 485 VTLIAGNGTAGYSGDGSLATQAQLFNVSGVTVLPNGEVLIADTNNHRIRKVNLEGIIQTI 544

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           AG G       +  A +A  ++  ++   P    ++I D  N  IR+I+
Sbjct: 545 AGTGIPGFNGDNILATSAQLNSPTDIEVSPVSGEIIIVDSNNHRIRKID 593



 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 3/75 (4%)

Query: 117 IYVAD-KSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           +YVAD +SN VIR   N G VT IAG G+         A  A   N   +T +P+   +L
Sbjct: 465 VYVADSQSNRVIRVDKNDGSVTLIAGNGTAGYSGDGSLATQAQLFNVSGVTVLPN-GEVL 523

Query: 175 ISDHGNQLIRQINLK 189
           I+D  N  IR++NL+
Sbjct: 524 IADTNNHRIRKVNLE 538


>gi|290979154|ref|XP_002672299.1| predicted protein [Naegleria gruberi]
 gi|284085875|gb|EFC39555.1| predicted protein [Naegleria gruberi]
          Length = 430

 Score = 42.7 bits (99), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 64/119 (53%), Gaps = 12/119 (10%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++  +++  +AG+    YS G+ G   +A F  P   A+ M G +++ D +N+VIRK+++
Sbjct: 23  INTNNLITTIAGNNQIAYS-GDGGLAIAASFRIPYGVALGMNGELFITDTNNNVIRKVSS 81

Query: 133 -LG-VTTIAG---GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            +G +TTIAG   GG   +G   G A  A  S+ + +        + + D GN  +R++
Sbjct: 82  TIGIITTIAGSVTGGYSGDG---GLAIAAKLSSPYGIVVDQLNGTIYVCDFGNSRVRKL 137


>gi|219851898|ref|YP_002466330.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219546157|gb|ACL16607.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 387

 Score = 42.7 bits (99), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 39/84 (46%), Gaps = 9/84 (10%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
           FDKP   AVD  GN+YVAD  N+ ++K T+ G       GS   G A        F    
Sbjct: 233 FDKPSGVAVDSAGNVYVADWGNNWVQKFTSTGGFITGWNGSGLAGGA--------FQYPV 284

Query: 163 ELTFVPHICALLISDHGNQLIRQI 186
            +  V     + ++D+GN  I++ 
Sbjct: 285 SIA-VDSTGNVYVADYGNNRIQKF 307



 Score = 41.6 bits (96), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 28/47 (59%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
           F+KP   AVD  GN+YVAD +N+ I+K T+ G    +  GS   G A
Sbjct: 139 FNKPGGVAVDSVGNVYVADSANNQIQKFTSTGGFITSWNGSASAGGA 185



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 50/115 (43%), Gaps = 11/115 (9%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
           F+ P   AVD  GN+YVAD  N  I+K T+ G   T   G +   G  D P+  A     
Sbjct: 186 FEYPGGVAVDSAGNVYVADTFNGQIQKFTSTGEFITRWNGSASAGGVFDKPSGVA----- 240

Query: 162 FELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLS 216
                V     + ++D GN  +++        +  + SG A GA    V ++V S
Sbjct: 241 -----VDSAGNVYVADWGNNWVQKFTSTGGFITGWNGSGLAGGAFQYPVSIAVDS 290


>gi|355783113|gb|EHH65034.1| hypothetical protein EGM_18374, partial [Macaca fascicularis]
          Length = 723

 Score = 42.7 bits (99), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 264 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAERVSTVAGIGIQGTDKE 322

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 323 GGAKGEQQ--PISSPWDVVF 340


>gi|383418201|gb|AFH32314.1| NHL repeat-containing protein 2 [Macaca mulatta]
          Length = 726

 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAERVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343


>gi|302864652|ref|YP_003833289.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Micromonospora aurantiaca ATCC 27029]
 gi|302567511|gb|ADL43713.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Micromonospora aurantiaca ATCC 27029]
          Length = 611

 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 6/80 (7%)

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV-PHICA- 172
           GN+ V+D + H + ++   G T +   G+   GR DGPA  A+F+    L  + PH+   
Sbjct: 183 GNLLVSDSARHSLVELAADGETVVRRIGAGSRGRTDGPASAATFAEPQGLCLLPPHVAEV 242

Query: 173 ----LLISDHGNQLIRQINL 188
               ++++D  N L+R + L
Sbjct: 243 AGYDVVVADTVNHLLRGVRL 262



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 2/103 (1%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSK 144
           AG  V+   DG        +P   +V   G+ ++VAD     IR +++  + T  G G  
Sbjct: 330 AGTTVEALRDGPLPDVWMAQPSGLSVSADGSRLWVADSETSAIRWVSDGEMHTAVGQGLF 389

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             G  DGPA  A   +   +  +P   ++LI+D  N  +R+ +
Sbjct: 390 DFGHVDGPADQALLQHPLGVCALPD-GSVLIADTYNGAVRRFD 431


>gi|380812570|gb|AFE78159.1| NHL repeat-containing protein 2 [Macaca mulatta]
          Length = 726

 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAERVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343


>gi|330464985|ref|YP_004402728.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ mal
           allergen [Verrucosispora maris AB-18-032]
 gi|328807956|gb|AEB42128.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ mal
           allergen [Verrucosispora maris AB-18-032]
          Length = 611

 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 44/93 (47%), Gaps = 8/93 (8%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSND 161
           RF  P    V   G + V+D + H + ++   G T +   G+   GR DG A  A+FS  
Sbjct: 172 RF--PGKLVVGPDGTLLVSDSARHRVVELAADGETLLRAIGTGSRGRVDGTADTATFSEP 229

Query: 162 FELTFV-PHICA-----LLISDHGNQLIRQINL 188
             L  + PH+       L+++D  N L+R + L
Sbjct: 230 QGLCLLPPHVAEVAGYDLVVADTVNHLLRGVRL 262



 Score = 40.8 bits (94), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 49/110 (44%), Gaps = 2/110 (1%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSK 144
           AG  V+   DG        +P   AV   G  ++VAD  +  IR + +  + T  G G  
Sbjct: 330 AGTTVEALRDGPLAETWLAQPSGLAVSADGRRLWVADSESSAIRYVEDDVMGTAVGQGLF 389

Query: 145 KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
             G  DGPA  A   +   +  +P   ++LI+D  N  +R+ +   ++ S
Sbjct: 390 DFGHVDGPAAQALLQHPLGVCALPD-GSVLIADTYNGAVRRYDPATDEVS 438



 Score = 37.7 bits (86), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 37/140 (26%), Positives = 64/140 (45%), Gaps = 23/140 (16%)

Query: 46  NPHSVIDRPGS------SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
            P + +  PG         L+V DS+R     L+   ++   + R  G G +G  DG   
Sbjct: 166 EPQTTLRFPGKLVVGPDGTLLVSDSARHRVVELA---ADGETLLRAIGTGSRGRVDGTAD 222

Query: 100 SARFDKPKSFAV------DMKG-NIYVADKSNHVIRKITNLG---VTTIAGGGSKKEGRA 149
           +A F +P+   +      ++ G ++ VAD  NH++R +  LG   V T+AG G +     
Sbjct: 223 TATFSEPQGLCLLPPHVAEVAGYDLVVADTVNHLLRGV-RLGSGEVVTVAGTGRQWRSTV 281

Query: 150 DGPAQNA---SFSNDFELTF 166
           D  A +A     S+ ++L +
Sbjct: 282 DDHAHDALSVDLSSPWDLAW 301


>gi|109090607|ref|XP_001091193.1| PREDICTED: NHL repeat-containing protein 2 [Macaca mulatta]
          Length = 726

 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAERVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343


>gi|395646873|ref|ZP_10434733.1| PKD domain containing protein [Methanofollis liminatans DSM 4140]
 gi|395443613|gb|EJG08370.1| PKD domain containing protein [Methanofollis liminatans DSM 4140]
          Length = 2489

 Score = 42.7 bits (99), Expect = 0.15,   Method: Composition-based stats.
 Identities = 19/44 (43%), Positives = 26/44 (59%)

Query: 86   AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            +G  +  +    PG + F  PK  AVD  GN+YVAD +NH I+K
Sbjct: 1671 SGTFITTWGSEPPGDSEFSYPKGVAVDDAGNVYVADTNNHRIQK 1714



 Score = 41.2 bits (95), Expect = 0.49,   Method: Composition-based stats.
 Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 3/65 (4%)

Query: 87   GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
            G  +  +  G  G   FD+P    VD  GN++V+D  N+ I+K T+ G      G +   
Sbjct: 1151 GTFITKWGSGGTGDGEFDRPHGIVVDADGNVFVSDAGNNNIQKFTSTGTFITKWGTA--- 1207

Query: 147  GRADG 151
            G  DG
Sbjct: 1208 GSGDG 1212



 Score = 40.8 bits (94), Expect = 0.57,   Method: Composition-based stats.
 Identities = 32/111 (28%), Positives = 47/111 (42%), Gaps = 11/111 (9%)

Query: 99   GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNAS 157
            G  +FD P+   +D  G+IYVAD +N  I+K  + G   T  G G   +G  D P     
Sbjct: 1116 GDGQFDFPQGITIDADGSIYVADNANQRIQKFDSNGTFITKWGSGGTGDGEFDRPHG--- 1172

Query: 158  FSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSV 208
                     V     + +SD GN  I++        +K   +GS  G  +V
Sbjct: 1173 -------IVVDADGNVFVSDAGNNNIQKFTSTGTFITKWGTAGSGDGQFNV 1216



 Score = 40.4 bits (93), Expect = 0.75,   Method: Composition-based stats.
 Identities = 25/72 (34%), Positives = 34/72 (47%), Gaps = 5/72 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
           V++  GDG    + G  GS   +F+ P   AVD   NIYV D  NH ++K  + G     
Sbjct: 117 VQKFDGDGTYIATWGTKGSGEGQFESPGGIAVDNASNIYVTDTVNHRVQKFDSTGTFVTQ 176

Query: 140 GGGSKKEGRADG 151
            G    +G  DG
Sbjct: 177 WG---NQGAGDG 185



 Score = 39.7 bits (91), Expect = 1.5,   Method: Composition-based stats.
 Identities = 17/36 (47%), Positives = 25/36 (69%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           G A+F +P+  AVD  G++YVAD  N+ I+K +N G
Sbjct: 42  GDAQFIRPEGIAVDPGGDVYVADPGNNRIQKFSNTG 77



 Score = 37.7 bits (86), Expect = 4.6,   Method: Composition-based stats.
 Identities = 26/57 (45%), Positives = 29/57 (50%), Gaps = 5/57 (8%)

Query: 91   QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
            QG  DG+ G      P S AVD  GN+YV D  N  I+K  N G T IA  GS   G
Sbjct: 1539 QGSGDGQFGLG----PDSVAVDRAGNLYVTDPINSRIQKFDNSG-TFIAAWGSYGAG 1590



 Score = 37.7 bits (86), Expect = 5.0,   Method: Composition-based stats.
 Identities = 37/119 (31%), Positives = 54/119 (45%), Gaps = 19/119 (15%)

Query: 89   GVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS--KK 145
            G  GYS DGE     F+ P   AVD   N+YV D  N  I+K  + G T IA  GS    
Sbjct: 1063 GSSGYSEDGE-----FNMPHGIAVDSDSNVYVTDTWNSRIQKFDSTG-TFIAKWGSYGTG 1116

Query: 146  EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
            +G+ D P Q  +   D          ++ ++D+ NQ I++ +      +K    G+  G
Sbjct: 1117 DGQFDFP-QGITIDAD---------GSIYVADNANQRIQKFDSNGTFITKWGSGGTGDG 1165



 Score = 37.7 bits (86), Expect = 5.7,   Method: Composition-based stats.
 Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 6/65 (9%)

Query: 89   GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEG 147
            G  G  DG+     F+ P+  AVD +GN++VAD  NH I+   TN    T  G     EG
Sbjct: 1205 GTAGSGDGQ-----FNVPRGIAVDSRGNVFVADSLNHRIQIFDTNGTFLTEFGSYGTGEG 1259

Query: 148  RADGP 152
              + P
Sbjct: 1260 EFNEP 1264



 Score = 37.0 bits (84), Expect = 8.9,   Method: Composition-based stats.
 Identities = 22/69 (31%), Positives = 38/69 (55%), Gaps = 3/69 (4%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK-- 144
           GD +  +     G+ +F  P+  AVD +G +YV +++N+ ++K    G T IA  G+K  
Sbjct: 77  GDFITKWGAPGTGAGQFSYPRGVAVDGEGYVYVVEQTNNRVQKFDGDG-TYIATWGTKGS 135

Query: 145 KEGRADGPA 153
            EG+ + P 
Sbjct: 136 GEGQFESPG 144


>gi|327277470|ref|XP_003223487.1| PREDICTED: NHL repeat-containing protein 2-like [Anolis
           carolinensis]
          Length = 724

 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 71/159 (44%), Gaps = 35/159 (22%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
           VI   G   ++V+  +    YT+  P S              G  DG    A F+ P+  
Sbjct: 235 VIADTGHHRILVVKKTGQILYTVGGPDS--------------GRKDGTYSEATFNSPQGV 280

Query: 110 AVDMKGNI-YVADKSNHVIRKITNLG---VTTIAG---GGSKKEGRADGPAQNASFSNDF 162
           A+  K NI YVAD  NH+IRK+ +L    V+T+AG    G+ KEG A G  Q    S+ +
Sbjct: 281 AI--KNNIIYVADTENHLIRKV-DLQLEMVSTVAGVGIQGTDKEGGARGDEQ--PISSPW 335

Query: 163 ELTFVPHICA------LLISDHGNQLIRQINLKPEDCSK 195
           ++ F   + A      L I+  G   I QI     DC K
Sbjct: 336 DVAFGTSVSATHEDDVLWIAMAG---IHQIWALMLDCGK 371


>gi|290997205|ref|XP_002681172.1| predicted protein [Naegleria gruberi]
 gi|284094795|gb|EFC48428.1| predicted protein [Naegleria gruberi]
          Length = 2212

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 42/130 (32%), Positives = 58/130 (44%), Gaps = 24/130 (18%)

Query: 85  LAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGG 142
           LAGDG QG YS       +  +P+   V   G+IY+ D  N+VIRKI + G ++T  G G
Sbjct: 480 LAGDGYQGIYS-------QLSQPRGVCVGPTGDIYIVDAGNYVIRKIDSNGIISTFIGDG 532

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDH------------GNQLIRQINLKP 190
               G  DG A  A       +T + +   LLISD              NQ IR+   K 
Sbjct: 533 V--SGYRDGDALTARIGFASAITCLSN-GDLLISDSISDGMLFFGNYLNNQRIRKFTAKT 589

Query: 191 EDCSKSSQSG 200
              +  + +G
Sbjct: 590 NQVTTVAGTG 599



 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 58/124 (46%), Gaps = 3/124 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGS 143
           LAG G    S+     A+   P    VD   + IYV+D   + IRKI N  + TIAG G 
Sbjct: 151 LAGGGSNLSSNISALDAKLGAPFGLLVDEANDVIYVSDTYKYTIRKIVNGTIYTIAGTGV 210

Query: 144 KKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           + +   +G  A +AS +    LTF P    L+  +   + +R+I  +    +  S +  A
Sbjct: 211 EDKNTPEGSLAASASIAKVLTLTFHPLTRELIYLEETGR-VRRITNEGRIFTLYSGTNGA 269

Query: 203 LGAV 206
           +G V
Sbjct: 270 IGMV 273



 Score = 41.2 bits (95), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 56/113 (49%), Gaps = 6/113 (5%)

Query: 82  VKRLAGDGVQGYS-DGEPGS-ARFDKPKS-FAVDMKGNIYVADKSNHVIRKITNLG--VT 136
           V  +AG GV+ YS DG P   A  + P   +  +  G+I   D  N+ IRKI+N    +T
Sbjct: 592 VTTVAGTGVRSYSGDGGPAIIAPLNGPTGVYYNETNGDIIFCDSENYRIRKISNSTGLIT 651

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           TIAG G       DG A  A+    F ++  P    +  SD  + ++R+I+ K
Sbjct: 652 TIAGTGVYSY-NGDGLAALATNIAPFGISVHPITDEIYFSDMNSFMVRKIDSK 703



 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 50/120 (41%), Gaps = 14/120 (11%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK------------SNHVIR 128
           ++    GDGV GY DG+  +AR     +      G++ ++D             +N  IR
Sbjct: 524 IISTFIGDGVSGYRDGDALTARIGFASAITCLSNGDLLISDSISDGMLFFGNYLNNQRIR 583

Query: 129 KIT--NLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           K T     VTT+AG G +      GPA  A  +    + +      ++  D  N  IR+I
Sbjct: 584 KFTAKTNQVTTVAGTGVRSYSGDGGPAIIAPLNGPTGVYYNETNGDIIFCDSENYRIRKI 643



 Score = 37.7 bits (86), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 32/119 (26%), Positives = 50/119 (42%), Gaps = 9/119 (7%)

Query: 76   LSEESVVKRLAGDGVQGY--------SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
            +S   ++ R+AG    GY              +  F    SF     G + +AD +N+VI
Sbjct: 923  ISGTGILTRVAGKKECGYVIDSMLLNQTTFSTNTTFGYSISFTYMSNGEMLIADTNNNVI 982

Query: 128  RKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            RK+   G +TI  G        D  A+ A  +N   L+ +     ++ SD GN  IR +
Sbjct: 983  RKVDLNGYSTIIAGNGTAGFNGDSDAKQAYLNNPQGLSVLSD-GRIIFSDSGNDRIRML 1040


>gi|108707786|gb|ABF95581.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
           protein, expressed [Oryza sativa Japonica Group]
          Length = 1064

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 53/167 (31%), Positives = 66/167 (39%), Gaps = 17/167 (10%)

Query: 30  DGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSS------RSAFYTLSFPLSEESVVK 83
           DGY        + L IN H V+     S  I L+S         + Y L   L     ++
Sbjct: 735 DGYERNLNGSSYPLCINEH-VVYVSFYSHGIALESHFYCECWNCSLYPLLCYLGPTWFLQ 793

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGG 141
            L GD      DG         P          IYVAD  NH I+++      VTTIAG 
Sbjct: 794 YLFGD-----HDGTGSDVLLQHPLGVVYASDNQIYVADSYNHKIKRLDPVTRKVTTIAGT 848

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           G  + G  DGPA +A  S    L  V     LL++D  N  IR I L
Sbjct: 849 G--RAGYKDGPALSAQLSEPAGLVEVGD-GRLLVADTNNSTIRYIVL 892



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           G   +G  DG   +A F++P+  A + K NI YVAD  NH +R +T
Sbjct: 610 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALRTLT 655


>gi|332212861|ref|XP_003255539.1| PREDICTED: NHL repeat-containing protein 2 [Nomascus leucogenys]
          Length = 726

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAGKVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343


>gi|144899324|emb|CAM76188.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense
           MSR-1]
          Length = 477

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 32/116 (27%), Positives = 54/116 (46%), Gaps = 11/116 (9%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV-DMKGNIYVADKSNHVIRKI--TNLG 134
           +  +++ LAG G +   DGE G +   +P   A+ D  G++Y AD     +R++  +   
Sbjct: 312 DTGMIRMLAGSGGENLGDGEAGQSLLAQPSGLALSDDGGSLYFADAETSALRRVCLSTGR 371

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA---LLISDHGNQLIRQIN 187
           V T+ G G    G  +GP   A   +   ++     CA   L I+D  N  IR ++
Sbjct: 372 VDTLVGQGLFDFGHKNGPLAQARLQHPLGVS-----CANGRLFIADSYNHAIRVVD 422



 Score = 38.1 bits (87), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 33/65 (50%), Gaps = 9/65 (13%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           G G+ G  DG    A F+ P+  A+D +  IYVAD  NH+IR+I          GG +  
Sbjct: 210 GSGLAGRDDGI--DASFNGPEGVAID-RAFIYVADTRNHLIRRIDR------QSGGDRHH 260

Query: 147 GRADG 151
            RA G
Sbjct: 261 RRAQG 265


>gi|124003658|ref|ZP_01688507.1| NHL repeat [Microscilla marina ATCC 23134]
 gi|123991227|gb|EAY30679.1| NHL repeat [Microscilla marina ATCC 23134]
          Length = 508

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 55/116 (47%), Gaps = 5/116 (4%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGV 135
           + ++VKR AG G +  +DG    + F++P S  V     +YVAD     IR I      V
Sbjct: 347 QTNLVKRFAGSGREALTDGSFRKSAFNQP-SGLVKNGNTLYVADSEASAIRAIDLAKGEV 405

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
           +T  G G  + G  DG A  A   +   +TF  +   L I+D  N  I+  +LK +
Sbjct: 406 STPLGKGLFEFGDVDGKANKARLQHAVGVTFRAN--KLYIADTYNGKIKTFDLKTQ 459



 Score = 40.0 bits (92), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 57/124 (45%), Gaps = 20/124 (16%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNLGVTTIAGG-- 141
           G G QG  DG+   A F +P   A  +KG+ +YVAD  N+VIR++      V TIAG   
Sbjct: 245 GSGTQGLKDGDFDKATFYEPHGLA--LKGDFLYVADTKNNVIRRVDLKRKKVKTIAGDGR 302

Query: 142 -----GSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKS 196
                G ++ G A  P        D E+ +        I++ GN  I  ++L+     + 
Sbjct: 303 LDYYFGKERWGVAVNPNSPWDLWIDGEVMY--------IANAGNHQILTMDLQTNLVKRF 354

Query: 197 SQSG 200
           + SG
Sbjct: 355 AGSG 358


>gi|440794391|gb|ELR15552.1| haloacid dehalogenase family hydrolase family protein [Acanthamoeba
           castellanii str. Neff]
          Length = 731

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 31/130 (23%), Positives = 58/130 (44%), Gaps = 6/130 (4%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFA---VDMKGNIYVADKSNHVIRKITNLG-- 134
           + ++ +   G  G+ DG    A+F++ +  A    D    +YVAD  NH +R +  L   
Sbjct: 256 AFLEEIGTRGSLGFKDGSYAEAKFNRLQGVAYHKTDEHERLYVADAENHALRVVDLLAKT 315

Query: 135 VTTIAGGGSKKEGRADGPA-QNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           VTT+AG G++      G + +    S  +++   P    L ++  G   I +++L     
Sbjct: 316 VTTLAGDGTQGNDFVGGKSGREQQLSTPWDVALSPDGQQLFVAMAGTHQIWEVSLDSSAV 375

Query: 194 SKSSQSGSAL 203
           +  S  G  +
Sbjct: 376 TNYSGDGQEM 385


>gi|148654423|ref|YP_001274628.1| NHL repeat-containing protein [Roseiflexus sp. RS-1]
 gi|148566533|gb|ABQ88678.1| NHL repeat containing protein [Roseiflexus sp. RS-1]
          Length = 1146

 Score = 42.7 bits (99), Expect = 0.17,   Method: Composition-based stats.
 Identities = 47/187 (25%), Positives = 75/187 (40%), Gaps = 36/187 (19%)

Query: 11  AITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHS--VIDRPGSSDLIVLDSSRSA 68
           A+     +  LV++ +  E G     ++    + I+P    ++    +  LIV D S + 
Sbjct: 818 AVAATGSIIRLVATDIFGEFGNAPGQLVQPRGIAIDPQGNIIVSDSANHRLIVFDPSGTP 877

Query: 69  FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
             T+                G  G  DG+     F +P+  AVD +GNIYVAD  N  I 
Sbjct: 878 IRTI----------------GSFGSGDGQ-----FYEPRGVAVDAQGNIYVADTWNARIV 916

Query: 129 KITNLGVTTIAGG------GSKKEGRADGPAQNASFSNDFELTFVPHICA------LLIS 176
           K+   G    + G      G  +     G  Q+A+ +   +L F P   A      + I+
Sbjct: 917 KLDPQGTFLASWGVGRDDFGDGRRASPTGGTQDANLARPLDL-FGPRGVAVDADGNVYIA 975

Query: 177 DHGNQLI 183
           D GN+ I
Sbjct: 976 DTGNKRI 982


>gi|219851408|ref|YP_002465840.1| carbohydrate binding family 6 [Methanosphaerula palustris E1-9c]
 gi|219545667|gb|ACL16117.1| Carbohydrate binding family 6 [Methanosphaerula palustris E1-9c]
          Length = 627

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 27/48 (56%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           G  +F+ P+  AVD  GNIYVAD  N  ++K T+ G    A G +  E
Sbjct: 44  GDGQFNYPQGIAVDGAGNIYVADNGNKQVQKFTSSGTFVTAWGTAGPE 91



 Score = 38.5 bits (88), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 29/66 (43%), Gaps = 3/66 (4%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKK 145
           +G  V  +    P   RF +P   AVD +GN+YV D     ++K T  G      G    
Sbjct: 78  SGTFVTAWGTAGPEDDRFSEPSGIAVDSEGNVYVTDTGKFRVQKFTPTGTLITEWG---T 134

Query: 146 EGRADG 151
           EG  DG
Sbjct: 135 EGTGDG 140


>gi|421872066|ref|ZP_16303685.1| NHL repeat family protein [Brevibacillus laterosporus GI-9]
 gi|372458678|emb|CCF13234.1| NHL repeat family protein [Brevibacillus laterosporus GI-9]
          Length = 633

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 36/73 (49%), Gaps = 16/73 (21%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG------- 141
           G +G  DG+     F++P   A+D K NIYV D++NH I+K  N G      G       
Sbjct: 487 GQKGTKDGQ-----FNQPMQLAIDSKDNIYVVDRNNHRIQKFDNSGRFLAKWGTNGGAGA 541

Query: 142 ----GSKKEGRAD 150
               G+ KEG+ D
Sbjct: 542 TDPLGNWKEGKGD 554



 Score = 40.4 bits (93), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 45/104 (43%), Gaps = 11/104 (10%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
           S  F  P   AVD KGNI+VAD  NH I+K     +  T  G    K+G+ + P Q A  
Sbjct: 446 SGFFQFPAGVAVDSKGNIFVADTLNHRIQKFNPFFIYMTEWGQKGTKDGQFNQPMQLAID 505

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
           S D           + + D  N  I++ +      +K   +G A
Sbjct: 506 SKD----------NIYVVDRNNHRIQKFDNSGRFLAKWGTNGGA 539



 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 40/86 (46%), Gaps = 11/86 (12%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
           F  P + A D+ GNIYVAD  NH I K+   G V    GG   K G+ + P   A     
Sbjct: 62  FRTPVAMAKDLTGNIYVADMGNHRIVKMNKSGKVLETFGGLGDKPGKFNMPFGVA----- 116

Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
                +     +L++D GN  I++ +
Sbjct: 117 -----IDKEGNILVADTGNYRIQKFD 137



 Score = 37.7 bits (86), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 22/35 (62%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
            +F  P+  A+D  G +YVAD  NH I+K TN GV
Sbjct: 400 GQFIVPRQVAIDAYGFVYVADSVNHRIQKFTNTGV 434


>gi|326781271|ref|ZP_08240536.1| Ricin B lectin [Streptomyces griseus XylebKG-1]
 gi|326661604|gb|EGE46450.1| Ricin B lectin [Streptomyces griseus XylebKG-1]
          Length = 508

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 2/87 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFS 159
           A+ +KP   AVD  G +YVA+   H +R++   GV TT+ G G+       GPA  A  +
Sbjct: 263 AQLNKPYGIAVDSTGTLYVAEYGGHRVRRVGADGVITTVVGTGAAGSAGDGGPAAEAQLN 322

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
             F LT V  + AL I+DH N  +R++
Sbjct: 323 CPFGLT-VDSVDALYIADHLNHRVRKV 348



 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 2/88 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFS 159
           A+ + P+  AVD  G +Y+AD   + +R++   GV  T+AG G        GPA  A  +
Sbjct: 95  AQLNCPRQIAVDGDGALYIADAGGNRVRRVGADGVIVTVAGTGVAGSAGDGGPAAKAQLN 154

Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
             F +  V     L +++ G   +R++ 
Sbjct: 155 KPFGVA-VDGEGVLYVAEFGGHRVRRVG 181



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 30/55 (54%), Gaps = 2/55 (3%)

Query: 82  VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +  +AG G  G+   DG   SA+   P   AV   G +YV+D SNH +R+IT  G
Sbjct: 18  ISTIAGSGTAGFGGDDGAAVSAQLKHPYEMAVSGTGILYVSDYSNHRVRRITTDG 72



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 37/62 (59%), Gaps = 2/62 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +  + V+  +AG GV G + DG P + A+ +KP   AVD +G +YVA+   H +R++   
Sbjct: 124 VGADGVIVTVAGTGVAGSAGDGGPAAKAQLNKPFGVAVDGEGVLYVAEFGGHRVRRVGAD 183

Query: 134 GV 135
           GV
Sbjct: 184 GV 185


>gi|299142776|ref|ZP_07035905.1| NHL repeat-containing protein [Prevotella oris C735]
 gi|298575805|gb|EFI47682.1| NHL repeat-containing protein [Prevotella oris C735]
          Length = 448

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 12/105 (11%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
           V++ AG    G  DG   SA F  P   A D+ GN+YV +     IRKI  T+  VTT+A
Sbjct: 339 VEQFAGFNGVGSQDGPRMSASFKNPTGMAFDVDGNMYVTESMGFTIRKIGHTDGMVTTVA 398

Query: 140 GGGSKKE-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
           G  +K +  + +G     +F+  ++++         + D GN  I
Sbjct: 399 GIYTKSDNNKVEGLPLETTFNYPYDIS---------VDDEGNFYI 434


>gi|281423977|ref|ZP_06254890.1| NHL repeat-containing domain protein [Prevotella oris F0302]
 gi|281401902|gb|EFB32733.1| NHL repeat-containing domain protein [Prevotella oris F0302]
          Length = 469

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 12/105 (11%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
           V++ AG    G  DG   SA F  P   A D+ GN+YV +     IRKI  T+  VTT+A
Sbjct: 360 VEQFAGFNGVGSQDGPRMSASFKNPTGMAFDVDGNMYVTESMGFTIRKIGHTDGMVTTVA 419

Query: 140 GGGSKKE-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
           G  +K +  + +G     +F+  ++++         + D GN  I
Sbjct: 420 GIYTKSDNNKVEGLPLETTFNYPYDIS---------VDDEGNFYI 455


>gi|290973472|ref|XP_002669472.1| basal body protein NBP-1 [Naegleria gruberi]
 gi|284083020|gb|EFC36728.1| basal body protein NBP-1 [Naegleria gruberi]
          Length = 2352

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 85  LAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGG 142
           +AG G +G++ D     A+ D P    ++    +Y+ D  N+ +RKI + GV  TIAG G
Sbjct: 76  IAGTGYEGFNGDILAKQAKLDNPGG-VLEYNNELYIMDYGNNRVRKINSEGVLVTIAGTG 134

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           ++     +G A +AS +  + +  +P    + I+++    IR+I
Sbjct: 135 TRSSAGDNGAATSASLNGPWGI-HIPSNGDIYITEYVGNKIRKI 177



 Score = 42.4 bits (98), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 52/106 (49%), Gaps = 12/106 (11%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGS 143
            +GDG      G   SA+   P S  V+  G I++AD  N  IRKI TN  +TTIAG GS
Sbjct: 195 FSGDG------GNAVSAKLKSPWSVFVNAIGEIFIADTDNDRIRKIATNGIITTIAGSGS 248

Query: 144 KKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
                +DG  A  AS      + F+     L I++     IR+++L
Sbjct: 249 ST---SDGVLATTASLKKPTSV-FISPANELFIAEADGGRIRKVDL 290



 Score = 37.4 bits (85), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 39/79 (49%), Gaps = 19/79 (24%)

Query: 117 IYVADKSNHVIRKITNLGVTTIAGGGSK--------KEGRADGPAQNASFSNDFELTFVP 168
           +Y+++ S H ++K + LG+  IAG G +        K+ + D P     ++N+       
Sbjct: 55  VYMSEYSGHRVKKKSYLGLVVIAGTGYEGFNGDILAKQAKLDNPGGVLEYNNE------- 107

Query: 169 HICALLISDHGNQLIRQIN 187
               L I D+GN  +R+IN
Sbjct: 108 ----LYIMDYGNNRVRKIN 122


>gi|149177401|ref|ZP_01856005.1| hypothetical protein PM8797T_18906 [Planctomyces maris DSM 8797]
 gi|148843734|gb|EDL58093.1| hypothetical protein PM8797T_18906 [Planctomyces maris DSM 8797]
          Length = 683

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 61/131 (46%), Gaps = 11/131 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  ++  P    L + DS+ +     S     +  +  + G G  G  DG+  +A FD P
Sbjct: 223 PGKLLADPAQQRLFISDSNHNRIVIASL----DGKLIDVIGSGQIGNKDGDYKTASFDHP 278

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGR-ADGPAQNASFSNDFE 163
           +  A+ +   +YVAD  NH IR +      V+T+AG G +   R A G  + A+ ++ + 
Sbjct: 279 QGMAL-VGNTLYVADTENHSIRTVDLDKKQVSTLAGTGEQARFRSAGGKLKEAALNSPWA 337

Query: 164 LTFVP---HIC 171
           L  +    +IC
Sbjct: 338 LAAIDDVLYIC 348


>gi|428162955|gb|EKX32055.1| hypothetical protein GUITHDRAFT_58597, partial [Guillardia theta
           CCMP2712]
          Length = 186

 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 53/105 (50%), Gaps = 5/105 (4%)

Query: 85  LAGDGVQGYSDGEPG-SARFDKPKSFAVDMKG-NIYVADKSNHVIRKI--TNLGVTTIAG 140
           LAGDG  G  D   G +A+F  P   AVD  G ++ VAD  +H IR I  ++  V+T+ G
Sbjct: 83  LAGDGTNGLVDNVNGAAAKFYNPTGVAVDFAGQSLLVADSRDHTIRIIDLSSTAVSTLVG 142

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQ 185
            G      +  P  +A+ S  F ++        L+SD GN  IR+
Sbjct: 143 TGVAGGEDSIDP-NSATLSLPFGVSVTRDGRWGLVSDQGNSGIRR 186


>gi|290988149|ref|XP_002676784.1| predicted protein [Naegleria gruberi]
 gi|284090388|gb|EFC44040.1| predicted protein [Naegleria gruberi]
          Length = 2167

 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 45/101 (44%), Gaps = 17/101 (16%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVT---TIAGGGSKKEG-- 147
           Y DG    A+F+ P   A      IY+ADK NHVIR I  +G++   ++  G +K  G  
Sbjct: 231 YIDGNTTIAKFNSPTGVAYGPSKEIYIADKMNHVIRMIEFVGMSGTVSVISGIAKSSGND 290

Query: 148 --RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             R + P   A  S             + I D GN  IR++
Sbjct: 291 TLRLNSPVDVAVNSQK----------EVFILDSGNNRIRKL 321



 Score = 41.2 bits (95), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 1/65 (1%)

Query: 77  SEESVVKRLAGDGVQGY-SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           S    +  +AG G  G  ++    SA+   P   AV + G + +AD SNH IR+++N  +
Sbjct: 69  SNSGAITIVAGTGAFGSDTNTNATSAKLYSPSGLAVTIDGKLLIADTSNHAIREVSNGII 128

Query: 136 TTIAG 140
            TIAG
Sbjct: 129 RTIAG 133


>gi|242041143|ref|XP_002467966.1| hypothetical protein SORBIDRAFT_01g037250 [Sorghum bicolor]
 gi|241921820|gb|EER94964.1| hypothetical protein SORBIDRAFT_01g037250 [Sorghum bicolor]
          Length = 360

 Score = 42.4 bits (98), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 42/99 (42%), Gaps = 5/99 (5%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRA 149
           G  DG         P   A      +Y+AD  NH I+K+      VTTIAG G  + G  
Sbjct: 92  GDYDGTGSDVLLQHPLGVAYASDNQVYIADSYNHKIKKLDPVTRKVTTIAGTG--RAGYK 149

Query: 150 DGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
           DGP   A  S    L  V     LL++D  N  IR I L
Sbjct: 150 DGPGLAAQLSEPAGLVEVGD-GRLLVADTNNNTIRYITL 187


>gi|157376750|ref|YP_001475350.1| hypothetical protein Ssed_3618 [Shewanella sediminis HAW-EB3]
 gi|157319124|gb|ABV38222.1| hypothetical protein Ssed_3618 [Shewanella sediminis HAW-EB3]
          Length = 362

 Score = 42.4 bits (98), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 44/98 (44%), Gaps = 6/98 (6%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G+ +     A   +P     D  GNI++ADK  H IRKI   G+ +   G ++     D 
Sbjct: 62  GFENHSAKKAELSRPHQAIADSDGNIFIADKDAHAIRKIDLNGMISTYVGTNESGFNGDS 121

Query: 152 PAQ---NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
             +     SF N   LT      AL I D GN +IR++
Sbjct: 122 LKRTLTQLSFPNGIWLTASE---ALFILDSGNDMIRRV 156



 Score = 37.4 bits (85), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 7/97 (7%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           G G++  +DG      FD+  + AV+ KG ++V D+  +++ K++  G   I  G  K+ 
Sbjct: 201 GRGIRVLADG------FDELANLAVNAKGELFVTDRDANLVYKVSANGDKEIVAGTGKRS 254

Query: 147 GRADGP-AQNASFSNDFELTFVPHICALLISDHGNQL 182
           G  DG  A + + +    + F P+    L +  G Q+
Sbjct: 255 GCGDGCLATDTALNQVRGIWFHPNGGYFLATHKGGQI 291


>gi|290997291|ref|XP_002681215.1| predicted protein [Naegleria gruberi]
 gi|284094838|gb|EFC48471.1| predicted protein [Naegleria gruberi]
          Length = 1196

 Score = 42.4 bits (98), Expect = 0.19,   Method: Composition-based stats.
 Identities = 44/146 (30%), Positives = 65/146 (44%), Gaps = 21/146 (14%)

Query: 61  VLDSSRSAFYTLSFPLSE--------ESVVKRLAGDG----VQGYSDGEPGSARFDK--- 105
           ++ +  SAF  +S  +SE        +  +K++  +G    + G S G  G     K   
Sbjct: 128 IISTYGSAFSPVSLTMSETDDVLVGTDYSIKQIFKNGTNVRIAGASYGYGGDGSLAKDCK 187

Query: 106 ---PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
              P+     + G+IY AD  NH IRKI   G VTTIAG G+       G A NA     
Sbjct: 188 ITSPRGITAAI-GSIYFADAFNHRIRKIDKDGIVTTIAGTGTLGFSGDGGLAVNAQLYTP 246

Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
             L  +P    + I+D  N  IR+++
Sbjct: 247 AGLDVLP-TGVVFIADVNNHRIRRVS 271



 Score = 42.0 bits (97), Expect = 0.25,   Method: Composition-based stats.
 Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 14/104 (13%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           + ++ +V  +AG G  G+S G+ G   +A+   P    V   G +++AD +NH IR+++ 
Sbjct: 214 IDKDGIVTTIAGTGTLGFS-GDGGLAVNAQLYTPAGLDVLPTGVVFIADVNNHRIRRVSK 272

Query: 133 LG-VTTIAGGGSKKE---------GRADGPAQNASFSNDFELTF 166
            G +TT+AG G+            GR  GP+  +    + E+ F
Sbjct: 273 DGIITTVAGTGTSGTSGDGSLATLGRIYGPSSISICPTNGEIIF 316


>gi|417404183|gb|JAA48864.1| Putative nhl repeat-containing protein 2 [Desmodus rotundus]
          Length = 725

 Score = 42.4 bits (98), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 39/75 (52%), Gaps = 6/75 (8%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSND 161
           G A G  Q  S   D
Sbjct: 326 GGAKGEEQPISSPWD 340


>gi|406832294|ref|ZP_11091888.1| NHL repeat containing protein [Schlesneria paludicola DSM 18645]
          Length = 707

 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 4/84 (4%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
           G G  G+ DG    A+FD P+   + +   +YVAD  NH++R I      V+T AG G +
Sbjct: 268 GSGAIGHKDGGYAVAQFDHPQGMTL-VGDTLYVADTENHLLRAIDLKQKHVSTFAGVGEQ 326

Query: 145 KEGRADGPA-QNASFSNDFELTFV 167
              RA G   +  + ++ ++LT V
Sbjct: 327 ARTRAPGGTLRETALNSPWDLTVV 350


>gi|37519956|ref|NP_923333.1| hypothetical protein gll0387 [Gloeobacter violaceus PCC 7421]
 gi|35210948|dbj|BAC88328.1| gll0387 [Gloeobacter violaceus PCC 7421]
          Length = 544

 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 41/82 (50%), Gaps = 5/82 (6%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI--TNLGVTTIAGGGS 143
           G G+ G+ DG    A F  P+  A+   G  ++V D  NH++RKI      V+T AG G 
Sbjct: 251 GSGIPGWRDGNFEEAEFWAPQGLALSADGRTLFVCDTENHLLRKIDLVRRQVSTFAGTGE 310

Query: 144 KK--EGRADGPAQNASFSNDFE 163
           +    G+ +GP      ++ ++
Sbjct: 311 QSLGYGQVEGPGLEIPLNSPWD 332


>gi|20092638|ref|NP_618713.1| cell surface protein [Methanosarcina acetivorans C2A]
 gi|19917919|gb|AAM07193.1| cell surface protein [Methanosarcina acetivorans C2A]
          Length = 526

 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           +  +  G  G+ +F  P   AVD  GNIYVAD +NH ++K+ + GV
Sbjct: 84  IDKWGSGGSGNGQFYTPNGVAVDSMGNIYVADYNNHRVQKLDSTGV 129



 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 40/161 (24%), Positives = 68/161 (42%), Gaps = 25/161 (15%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSAR--FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           S  + + +L  DG      G  G++R  F+ P+  AVD  GNIYV D  N  I K    G
Sbjct: 163 SRSNRIVKLNKDGNYLTQWGSKGASRNQFNDPEGIAVDSSGNIYVVDSGNSRIMKFDGTG 222

Query: 135 VT-TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN------ 187
              T  G   +++G+   P   A          +    A+ ++D GN+ I++ +      
Sbjct: 223 TYLTEWGTPGQEDGQFRSPHGIA----------IDSSGAIYVTDTGNRRIQKFDSTGSYV 272

Query: 188 ---LKPEDCSKSSQSGSAL---GAVSVWVLVSVLSCLVSLV 222
              + PE+     Q+   +    + +V+V+ S   C+   V
Sbjct: 273 TKWVSPENGDGKFQNPVGIVVDSSNNVYVVDSFYHCVFQFV 313


>gi|300791003|ref|YP_003771294.1| NHL repeat-containing protein [Amycolatopsis mediterranei U32]
 gi|384154546|ref|YP_005537362.1| NHL repeat-containing protein [Amycolatopsis mediterranei S699]
 gi|399542881|ref|YP_006555543.1| NHL repeat-containing protein [Amycolatopsis mediterranei S699]
 gi|299800517|gb|ADJ50892.1| NHL repeat-containing protein [Amycolatopsis mediterranei U32]
 gi|340532700|gb|AEK47905.1| NHL repeat-containing protein [Amycolatopsis mediterranei S699]
 gi|398323651|gb|AFO82598.1| NHL repeat-containing protein [Amycolatopsis mediterranei S699]
          Length = 605

 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 28/111 (25%), Positives = 50/111 (45%), Gaps = 8/111 (7%)

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNA 156
           +PG  RF  P       +G + VAD  +H I +  + G T I   GS   G  DGP   A
Sbjct: 163 QPGELRF--PSKAVATAEGRVLVADTGHHAIVEFASDGETVIRRFGSGARGSQDGPFDIA 220

Query: 157 SFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSGS 201
           +F+    +  +P+  A       +++D     +R ++L   + +  + +G+
Sbjct: 221 TFTEPSGIALLPYDIAERVGYHAVVADTAGHRLRGLDLITGEVTTVAGTGA 271



 Score = 41.2 bits (95), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 49/111 (44%), Gaps = 6/111 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG----VTT 137
           ++R AG  V+G  DG+ G A F +    AVD    +++ D     +R I   G    V T
Sbjct: 322 IRRFAGTTVEGLRDGDVGEAFFAQTSGLAVDGD-KLWLVDAETSALRWIEPAGESFTVQT 380

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
             G      G  DGPA  A   +   L  +P    + I+D  N  +R+ ++
Sbjct: 381 AIGVDLFSFGHTDGPADQALLQHPLGLAVLPG-DKIAIADTYNGAVRRFDV 430


>gi|290972811|ref|XP_002669144.1| predicted protein [Naegleria gruberi]
 gi|284082687|gb|EFC36400.1| predicted protein [Naegleria gruberi]
          Length = 1362

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 46/97 (47%), Gaps = 4/97 (4%)

Query: 86  AGDGVQGYSDGEP--GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
           AG G+ GYS       S  F+ P   A + +  +++AD +NH IRK+        AG G+
Sbjct: 429 AGTGLVGYSGNYTLRNSTLFNSPNKLAFNSRNELFIADSNNHRIRKVVGGMTFDYAGNGN 488

Query: 144 KKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHG 179
                 D G A NA  +N   L  +P+   LLI+  G
Sbjct: 489 PTGSDNDGGSALNAVLANPTSLFVLPND-DLLIASTG 524



 Score = 40.8 bits (94), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 55/111 (49%), Gaps = 6/111 (5%)

Query: 91  QGYS-DGEPG-SARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLG--VTTIAGGGSKK 145
           +GYS DG    +A+ + P S  VD   G+++  D  NH IRKI+     V+T+AG G + 
Sbjct: 107 RGYSGDGSLAVNAKINGPTSVIVDSSNGDVFFTDDWNHCIRKISASSGIVSTVAGTGVQG 166

Query: 146 EGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKS 196
                G A +   S    + F      +   D+GN +IR++N+     +K+
Sbjct: 167 FDGNTGNALSMRLSIPRSIRFTSAT-TMTFLDYGNCIIRELNISSSQITKT 216


>gi|426253136|ref|XP_004020256.1| PREDICTED: NHL repeat-containing protein 2 [Ovis aries]
          Length = 726

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESSFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGDEQ--PISSPWDVVF 343



 Score = 38.1 bits (87), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 52/104 (50%), Gaps = 5/104 (4%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLGVTTIAGGGSKKEG 147
           G++ Y D  P S     P   AVD   N + +AD  +H I  +   G T  + GG    G
Sbjct: 210 GIKLYKDSLPPSPLL-FPGKIAVDHVSNRLVIADTGHHRILVVWKNGQTQHSIGGPNP-G 267

Query: 148 RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
           R DG    +SF++   +  + +I  + ++D  N LIR+I+L+ E
Sbjct: 268 RKDGIFSESSFNSPQGVAIMNNI--IYVADTENHLIRKIDLEAE 309


>gi|139949007|ref|NP_001077192.1| NHL repeat-containing protein 2 [Bos taurus]
 gi|166233893|sp|A4IF69.1|NHLC2_BOVIN RecName: Full=NHL repeat-containing protein 2
 gi|134025213|gb|AAI34431.1| NHLRC2 protein [Bos taurus]
 gi|296472617|tpg|DAA14732.1| TPA: NHL repeat-containing protein 2 [Bos taurus]
          Length = 726

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESSFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGDEQ--PISSPWDVVF 343


>gi|149278854|ref|ZP_01884989.1| hypothetical protein PBAL39_06746 [Pedobacter sp. BAL39]
 gi|149230473|gb|EDM35857.1| hypothetical protein PBAL39_06746 [Pedobacter sp. BAL39]
          Length = 460

 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 49/111 (44%), Gaps = 14/111 (12%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKG----------NIYVADKSNHVIRKITNLG-VTT 137
           G  GY+DG   SA  + P    V +K           + Y  DK NH IR +T  G V+T
Sbjct: 347 GTAGYADGIGTSALLNWPVQ-GVFVKNPNYEGEEDPYDFYFCDKENHAIRTLTPKGRVST 405

Query: 138 IAG-GGSKKEGRADGPAQ-NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            AG G +   G ADG  +  A F     + +        I D GN LIR+I
Sbjct: 406 YAGRGNNGTSGYADGDLRTEARFKQPQAIVYDEERKCFYIGDTGNNLIRKI 456


>gi|302036773|ref|YP_003797095.1| hypothetical protein NIDE1420 [Candidatus Nitrospira defluvii]
 gi|190343188|gb|ACE75576.1| putative NHL repeat protein [Candidatus Nitrospira defluvii]
 gi|300604837|emb|CBK41169.1| protein of unknown function, contains NHL repeats [Candidatus
           Nitrospira defluvii]
          Length = 989

 Score = 42.4 bits (98), Expect = 0.23,   Method: Composition-based stats.
 Identities = 22/46 (47%), Positives = 24/46 (52%), Gaps = 5/46 (10%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           G +G  DGE     F  P   AVD  GNIYVAD  NH I+K    G
Sbjct: 160 GQEGVEDGE-----FSSPTGIAVDKDGNIYVADTDNHSIQKFDKDG 200



 Score = 39.3 bits (90), Expect = 1.8,   Method: Composition-based stats.
 Identities = 36/130 (27%), Positives = 55/130 (42%), Gaps = 16/130 (12%)

Query: 82  VKRLAGDGVQGYSDGEPGSA-------RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           V+R   DG    + G+ G A       +FD P     D  GNIYV D +N  I+K    G
Sbjct: 241 VQRFDTDGNPMKAWGKFGFAWRGADMNKFDAPWGVTTDQDGNIYVTDTNNARIQKFKGDG 300

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCS 194
              +  G   ++G  DG      F     + FV +     ++D GN  I++ + +    +
Sbjct: 301 TPLLKWG---RDGSFDGA---FFFPRGVAVDFVGNT---YVADEGNNRIQKFDTRGSFLT 351

Query: 195 KSSQSGSALG 204
           K  + GS  G
Sbjct: 352 KWGREGSGPG 361



 Score = 38.1 bits (87), Expect = 3.7,   Method: Composition-based stats.
 Identities = 33/108 (30%), Positives = 49/108 (45%), Gaps = 11/108 (10%)

Query: 82  VKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
           +++  GDG      G  GS    F  P+  AVD  GN YVAD+ N+ I+K    G     
Sbjct: 293 IQKFKGDGTPLLKWGRDGSFDGAFFFPRGVAVDFVGNTYVADEGNNRIQKFDTRGSFLTK 352

Query: 140 GGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            G   +EG   GP Q   F   + +T    +  + + D GN  I++ +
Sbjct: 353 WG---REG--SGPGQ---FKAPWGVT-CDALGNVYVVDQGNHRIQKFD 391



 Score = 37.4 bits (85), Expect = 6.0,   Method: Composition-based stats.
 Identities = 30/95 (31%), Positives = 39/95 (41%), Gaps = 13/95 (13%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA-GGGSKKEGRADGPAQNAS 157
           G  +F  P     D  GN+YV D+ NH I+K    G    A G   K EG+ + P+  A 
Sbjct: 359 GPGQFKAPWGVTCDALGNVYVVDQGNHRIQKFDGNGTFLCAWGNRGKTEGQLNFPSGVA- 417

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINLKPED 192
                    V    A+ + D GN   R I   P D
Sbjct: 418 ---------VDKEGAVYVVDSGNH--RVIKYVPTD 441


>gi|380694022|ref|ZP_09858881.1| hypothetical protein BfaeM_08588 [Bacteroides faecis MAJ27]
          Length = 490

 Score = 42.0 bits (97), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 42/89 (47%), Gaps = 4/89 (4%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLS--EESVVKRLAGDGVQGYSDGEPGSARF 103
           NPH +        L + D   +    +   LS  + +V   +   G++GY DG P  A F
Sbjct: 401 NPHQICFTE-DGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGMKGYKDGGPDIAMF 459

Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           + P   AV   G I YVAD  N VIRK++
Sbjct: 460 NHPFGVAVSADGQIVYVADTGNKVIRKLS 488



 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 9/106 (8%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV------TTIAGGGS 143
           +G+ DG   +A+F+ P        G +Y+AD  N+ IR I T L +      T I   G 
Sbjct: 387 KGWEDGLLRNAKFNNPHQICFTEDGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGM 446

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           K  G  DG    A F++ F +        + ++D GN++IR+++++
Sbjct: 447 K--GYKDGGPDIAMFNHPFGVAVSADGQIVYVADTGNKVIRKLSIE 490


>gi|288921011|ref|ZP_06415303.1| serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288347586|gb|EFC81871.1| serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 801

 Score = 42.0 bits (97), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 40/89 (44%), Gaps = 3/89 (3%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA--DGPAQNASF 158
           A F  P   A+   G +YVAD S H +R+I+  G  T+  G     G +   G A +A  
Sbjct: 643 ATFTSPSGLALADDGTLYVADSSAHTVRRISTDGKITLVAGTPGAYGYSGDGGAATSAKL 702

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQIN 187
            N   L   P    L I+D  N +IR + 
Sbjct: 703 YNPHSLAVGPE-GELYIADTYNDVIRVVT 730



 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 56/121 (46%), Gaps = 14/121 (11%)

Query: 76  LSEESVVKRLAG-DGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +S +  +  +AG  G  GYS   G   SA+   P S AV  +G +Y+AD  N VIR +T 
Sbjct: 672 ISTDGKITLVAGTPGAYGYSGDGGAATSAKLYNPHSLAVGPEGELYIADTYNDVIRVVTT 731

Query: 133 LG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----ALLISDHGNQLIRQI 186
            G + T AG        ADG     + + + E ++   +      A+ +S+     +R+I
Sbjct: 732 DGKINTFAGTDDVDANTADG-----ALATETEFSYPSGLAIDASGAVYVSEPDRGTVRRI 786

Query: 187 N 187
            
Sbjct: 787 G 787


>gi|29349674|ref|NP_813177.1| hypothetical protein BT_4266 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29341584|gb|AAO79371.1| conserved hypothetical protein with NHL repeat [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 490

 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 9/106 (8%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGV------TTIAGGGS 143
           +G+ DG   +A+F+ P        G +Y+AD  N+ IR I T L +      T I   G 
Sbjct: 387 KGWEDGLLRNAKFNNPHQICFTEDGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGM 446

Query: 144 KKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
           K  G  DG    A F++ F +        + ++D GN++IR+++++
Sbjct: 447 K--GYKDGGPDIAMFNHPFGVAVSADGQIVYVADTGNKVIRKLSIE 490



 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 42/89 (47%), Gaps = 4/89 (4%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLS--EESVVKRLAGDGVQGYSDGEPGSARF 103
           NPH +        L + D   +    +   LS  + +V   +   G++GY DG P  A F
Sbjct: 401 NPHQICFTE-DGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGMKGYKDGGPDIAMF 459

Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           + P   AV   G I YVAD  N VIRK++
Sbjct: 460 NHPFGVAVSADGQIVYVADTGNKVIRKLS 488


>gi|347757619|ref|YP_004865181.1| NHL repeat family protein [Micavibrio aeruginosavorus ARL-13]
 gi|347590137|gb|AEP09179.1| NHL repeat family protein [Micavibrio aeruginosavorus ARL-13]
          Length = 495

 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 51/116 (43%), Gaps = 27/116 (23%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFP----------------------LSEESVVKRLAGDGVQ 91
           P  +D+  L  +R+   TLSFP                      L     VK   G G  
Sbjct: 170 PPFTDISTLVPARTKTSTLSFPARIEDAGDFIFVADSGHNRILVLDRTGAVKMTIGSGEL 229

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---GVTTIAGGGSK 144
           G+ DG    A+F+ P+   + +  N+Y+AD  NH++R + NL    V T++G G +
Sbjct: 230 GHQDGSFVMAKFNHPRGMVL-IGRNLYIADTDNHMLR-VANLDSGSVMTLSGNGKR 283


>gi|430743486|ref|YP_007202615.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
 gi|430015206|gb|AGA26920.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
          Length = 307

 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 54/124 (43%), Gaps = 14/124 (11%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           K L   G  GY+ GE     F KP++ A+D +  IYVAD  NH I+    +G      G 
Sbjct: 169 KWLRQWGGHGYALGE-----FLKPRALAIDDQDRIYVADSCNHRIQVFDTMGKLLFHWG- 222

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
                R   P Q    +  ++L   P   AL + ++GN  +++ +L+         +G  
Sbjct: 223 ----TRGGAPGQ---MNYPYDLAIGPD-HALYVCEYGNHRVQKFSLEGRSLGVWGSAGRG 274

Query: 203 LGAV 206
            G +
Sbjct: 275 PGQL 278


>gi|440911916|gb|ELR61535.1| NHL repeat-containing protein 2, partial [Bos grunniens mutus]
          Length = 760

 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 283 GRKDGIFSESSFNSPQGVAI-MNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDKE 341

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 342 GGAKGDEQ--PISSPWDVVF 359


>gi|339008739|ref|ZP_08641312.1| cell surface protein [Brevibacillus laterosporus LMG 15441]
 gi|338774539|gb|EGP34069.1| cell surface protein [Brevibacillus laterosporus LMG 15441]
          Length = 633

 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 13/68 (19%)

Query: 96  GEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKI-----------TNLGVTTIAGGG 142
           G+ G+   +F++P   A+D K NIYV D++NH I+K            TN GV      G
Sbjct: 487 GQKGTKEGQFNQPMQLAIDSKDNIYVVDRNNHRIQKFDNSGRFLAKWGTNGGVGATDQLG 546

Query: 143 SKKEGRAD 150
           + KEG+ D
Sbjct: 547 NWKEGKGD 554



 Score = 40.4 bits (93), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 32/63 (50%), Gaps = 1/63 (1%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASF 158
           S  F  P   AVD KGNI+VAD  NH I+K     +  T  G    KEG+ + P Q A  
Sbjct: 446 SGFFQFPAGVAVDSKGNIFVADTLNHRIQKFNPFFIYMTEWGQKGTKEGQFNQPMQLAID 505

Query: 159 SND 161
           S D
Sbjct: 506 SKD 508



 Score = 38.1 bits (87), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 40/86 (46%), Gaps = 11/86 (12%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSND 161
           F  P + A D+ GNIYVAD  NH I K+   G V    GG   K G+ + P   A     
Sbjct: 62  FRTPVAMAKDLTGNIYVADMGNHRIVKMNKSGKVLETFGGLGDKPGKFNMPFGVA----- 116

Query: 162 FELTFVPHICALLISDHGNQLIRQIN 187
                +     +L++D GN  I++ +
Sbjct: 117 -----IDKEGNILVADTGNYRIQKFD 137



 Score = 37.7 bits (86), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 22/35 (62%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
            +F  P+  A+D  G +YVAD  NH I+K TN GV
Sbjct: 400 GQFIVPRQVAIDAYGFVYVADSVNHRIQKFTNTGV 434


>gi|182440597|ref|YP_001828316.1| hypothetical protein SGR_6804 [Streptomyces griseus subsp. griseus
           NBRC 13350]
 gi|178469113|dbj|BAG23633.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC
           13350]
          Length = 508

 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 2/88 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFS 159
           A+ + P+  AVD  G +Y+AD   + +R++   GV  T+AG G        GPA  A  +
Sbjct: 95  AQLNCPRQIAVDGDGALYIADAGGNRVRRVGADGVMVTVAGTGVAGSAGDGGPAAKAQLN 154

Query: 160 NDFELTFVPHICALLISDHGNQLIRQIN 187
             F +  V     L +++ G   +R++ 
Sbjct: 155 KPFGVA-VDGEGVLYVAEFGGHRVRRVG 181



 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 2/87 (2%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV-TTIAGGGSKKEGRADGPAQNASFS 159
           A+ +KP   AVD  G +YVA+   H +R++   GV +T+AG G+       GPA  A  +
Sbjct: 263 AQLNKPYGIAVDSTGTLYVAEYGGHRVRRVGADGVISTVAGTGAAGSAGDGGPAAEAQLN 322

Query: 160 NDFELTFVPHICALLISDHGNQLIRQI 186
             F LT V  + AL I+DH N  +R++
Sbjct: 323 CPFGLT-VDSVDALYIADHLNHRVRKV 348



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 33/62 (53%), Gaps = 5/62 (8%)

Query: 78  EESVVKRL---AGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +E  V R+   AG G  G+   DG   SA+   P   AV   G +YV+D SNH +R+IT 
Sbjct: 11  DEGFVPRISAIAGSGTAGFGGDDGAAVSAQLKHPYEMAVSGTGILYVSDYSNHRVRRITT 70

Query: 133 LG 134
            G
Sbjct: 71  DG 72



 Score = 38.1 bits (87), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 37/62 (59%), Gaps = 2/62 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +  + V+  +AG GV G + DG P + A+ +KP   AVD +G +YVA+   H +R++   
Sbjct: 124 VGADGVMVTVAGTGVAGSAGDGGPAAKAQLNKPFGVAVDGEGVLYVAEFGGHRVRRVGAD 183

Query: 134 GV 135
           GV
Sbjct: 184 GV 185


>gi|425444428|ref|ZP_18824479.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9443]
 gi|389735830|emb|CCI00727.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9443]
          Length = 358

 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 26/44 (59%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGG 142
           G  +FD P   AVD  GNIYVAD+ NH ++   + GV   A GG
Sbjct: 245 GDGQFDDPVGIAVDSGGNIYVADRDNHRVQVFNSGGVFQSAFGG 288


>gi|348578653|ref|XP_003475097.1| PREDICTED: NHL repeat-containing protein 2-like [Cavia porcellus]
          Length = 726

 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 39/71 (54%), Gaps = 6/71 (8%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   I+VAD  NH+IRKI  +   V+T+AG    G+ KE
Sbjct: 267 GRKDGLFSESAFNSPQGVAM-MDNIIFVADTENHLIRKIDLIAEKVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNAS 157
           G A G  Q  S
Sbjct: 326 GGAKGEEQPIS 336


>gi|390341274|ref|XP_003725419.1| PREDICTED: tripartite motif-containing protein 71-like
           [Strongylocentrotus purpuratus]
          Length = 778

 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 15/110 (13%)

Query: 82  VKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVTT 137
           ++    DG   +  G PG+   +FD+P   AV+ +GNI +ADK NH  +  KI+   +  
Sbjct: 540 IQIFNADGSFSHKFGSPGTRNGQFDRPAGVAVNGEGNIIIADKDNHRCQVFKISGQFLFK 599

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
               GSK          N  F+  +++  V     +L+SD  N  I+Q N
Sbjct: 600 FGEKGSK----------NGQFNYPWDVA-VNSEGKILVSDTRNHRIQQFN 638


>gi|433610083|ref|YP_007042452.1| Alkyl hydroperoxide reductase/ thiol specific antioxidant/ Mal
           allergen [Saccharothrix espanaensis DSM 44229]
 gi|407887936|emb|CCH35579.1| Alkyl hydroperoxide reductase/ thiol specific antioxidant/ Mal
           allergen [Saccharothrix espanaensis DSM 44229]
          Length = 630

 Score = 42.0 bits (97), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 67/142 (47%), Gaps = 15/142 (10%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG------EPGSARFDKPKSF 109
           S  ++V DS+      L      E+ ++R+ G G +G  DG      EP       P   
Sbjct: 209 SGTILVSDSAHHRLVELD--TDGETALRRI-GTGERGRRDGLNPSFSEPAGLAL-LPAEV 264

Query: 110 AVDMKGNIYVADKSNHVIRKITNLG---VTTIAGGGSK-KEGRADGPAQNASFSNDFELT 165
           A  +  ++ VAD  NH++R + NL    VTT+AG G + + G  DGPA +   ++ +++ 
Sbjct: 265 AARVGYHVVVADTVNHLLRGL-NLDTGEVTTVAGTGEQWRGGETDGPALDIPLTSPWDVA 323

Query: 166 FVPHICALLISDHGNQLIRQIN 187
           +      ++I+  GN  +   N
Sbjct: 324 WWEPAGGVVIALAGNHTLGLFN 345



 Score = 40.4 bits (93), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 53/121 (43%), Gaps = 10/121 (8%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKE 146
           GDG   Y    P       P    +   G I V+D ++H + ++   G T +   G+ + 
Sbjct: 184 GDGP--YVPPAPADTELRFPAKALLTPSGTILVSDSAHHRLVELDTDGETALRRIGTGER 241

Query: 147 GRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQLIRQINLKPEDCSKSSQSG 200
           GR DG   N SFS    L  +P   A      ++++D  N L+R +NL   + +  + +G
Sbjct: 242 GRRDG--LNPSFSEPAGLALLPAEVAARVGYHVVVADTVNHLLRGLNLDTGEVTTVAGTG 299

Query: 201 S 201
            
Sbjct: 300 E 300


>gi|374312921|ref|YP_005059351.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
 gi|358754931|gb|AEU38321.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
          Length = 654

 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 47/83 (56%), Gaps = 7/83 (8%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVD 112
           P ++  I +D+S +A+ T     S    V +L+G G V    +G  G A  D+P+S A+D
Sbjct: 374 PTAAGNIAIDTSGNAWVT-----SGNFSVTKLSGTGTVLSVVNGSSGGA-LDEPQSLAID 427

Query: 113 MKGNIYVADKSNHVIRKITNLGV 135
             GN++V D  N+ + +I++ GV
Sbjct: 428 GSGNVWVNDFENNSVTEISSTGV 450


>gi|290995072|ref|XP_002680155.1| predicted protein [Naegleria gruberi]
 gi|284093775|gb|EFC47411.1| predicted protein [Naegleria gruberi]
          Length = 1951

 Score = 42.0 bits (97), Expect = 0.29,   Method: Composition-based stats.
 Identities = 32/107 (29%), Positives = 50/107 (46%), Gaps = 8/107 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG 140
           +  + GDG          SA+   P   A+     IYVAD +N+ IRKIT  G ++TIAG
Sbjct: 142 INAITGDG------SSATSAQLYNPYGVAISSSDEIYVADYNNNRIRKITTSGIISTIAG 195

Query: 141 GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            G+         A +A   N + +  +     + I+D  N  IR+++
Sbjct: 196 TGTSGYSGDGSSAISAQLYNPYGVA-ISSSDEIYITDTNNNRIRKLS 241



 Score = 40.4 bits (93), Expect = 0.79,   Method: Composition-based stats.
 Identities = 27/71 (38%), Positives = 39/71 (54%), Gaps = 5/71 (7%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKS--NHVIRKIT 131
           ++   ++  +AG G  GY+ DG    SA+   P   A+     IY+AD    N+ IRKIT
Sbjct: 60  ITTSGIISTIAGTGTAGYNGDGSSATSAKLYFPYGVAISSSDEIYIADMDMFNNRIRKIT 119

Query: 132 NLG-VTTIAGG 141
             G ++TIAGG
Sbjct: 120 TSGIISTIAGG 130


>gi|351706822|gb|EHB09741.1| NHL repeat-containing protein 2 [Heterocephalus glaber]
          Length = 726

 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 29/68 (42%), Positives = 37/68 (54%), Gaps = 6/68 (8%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGLFSESTFNSPQGVAM-MDNVIYVADTENHLIRKIDLEAERVSTVAGIGVQGTDKE 325

Query: 147 GRADGPAQ 154
           G A G  Q
Sbjct: 326 GGAKGEQQ 333


>gi|430760419|ref|YP_007216276.1| YD repeat protein [Thioalkalivibrio nitratireducens DSM 14787]
 gi|430010043|gb|AGA32795.1| YD repeat protein [Thioalkalivibrio nitratireducens DSM 14787]
          Length = 2395

 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 62/134 (46%), Gaps = 10/134 (7%)

Query: 60   IVLDSSRSAFYTLS------FPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAV 111
            + +D+  S F+T        F  + +  V  +AG+G + ++ DG P  +A F  P+  AV
Sbjct: 1125 VAVDAGGSVFFTRRNGGPRIFRANPDGTVHHVAGNGSRTFAGDGGPAVNASFIDPRGIAV 1184

Query: 112  DMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
               G++YV D   H +R+I   G + T+AG G    G   GPA  A     +++   P  
Sbjct: 1185 GADGSLYVGDFRTHRVRRIGPDGIIRTVAGNGGSGIGGIGGPAVQARIDRTYDVAVGPE- 1243

Query: 171  CALLISDHGNQLIR 184
              L I  H   + R
Sbjct: 1244 GDLYIDSHNYTVFR 1257


>gi|290972172|ref|XP_002668832.1| predicted protein [Naegleria gruberi]
 gi|284082360|gb|EFC36088.1| predicted protein [Naegleria gruberi]
          Length = 2254

 Score = 42.0 bits (97), Expect = 0.31,   Method: Composition-based stats.
 Identities = 23/60 (38%), Positives = 38/60 (63%), Gaps = 5/60 (8%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKS-FAVDMKGNIYVADKSNHVIRKITNLGVTT 137
           +  +AG   +G SDG+ G   +++ + P+  F  D+ G+++VAD +NH IRKI N GV +
Sbjct: 86  ISTIAGQMTKG-SDGDGGLATASQMNSPRGVFFDDVTGSLFVADTNNHKIRKIDNFGVIS 144


>gi|290982604|ref|XP_002674020.1| predicted protein [Naegleria gruberi]
 gi|284087607|gb|EFC41276.1| predicted protein [Naegleria gruberi]
          Length = 499

 Score = 41.6 bits (96), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 45/97 (46%), Gaps = 4/97 (4%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
           +V+D++    Y   F  +   VV    GD ++ +     G  +FD PK    D+ GN+ V
Sbjct: 296 LVIDTTNEDIYICDFD-NHRIVVLSKNGDFLRDFGSKGSGDTQFDGPKDIDFDLDGNLVV 354

Query: 120 ADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNA 156
           +D+ NH I  ++ L  + I      + G  + P Q A
Sbjct: 355 SDRENHRILVLSKLNGSLIR---KIENGHGNQPGQFA 388


>gi|256833654|ref|YP_003162381.1| alkyl hydroperoxide reductase [Jonesia denitrificans DSM 20603]
 gi|256687185|gb|ACV10078.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Jonesia denitrificans DSM 20603]
          Length = 628

 Score = 41.6 bits (96), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 27/116 (23%), Positives = 55/116 (47%), Gaps = 11/116 (9%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSF-------AVDMKGNIYVADKSNHVIR--KITNLGVTT 137
           G G +G  DG   +A+F++P          A ++  ++ VAD  NH +R   ++   V T
Sbjct: 228 GAGERGLVDGPAHTAQFNEPNGLLVLPPDVAAEVGYDVVVADTVNHCVRSLSLSTGAVGT 287

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDC 193
           +AG G  ++    GP      ++ +++ +  H   ++I+  GN  +  ++ +   C
Sbjct: 288 LAGNG--RQYMVGGPDNEGRLTSPWDVVWSAHRKQVIIAMAGNHTLWTLDPRTGQC 341



 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 6/80 (7%)

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICA-- 172
           G   VAD  +H I  +   GVT  A  G+ + G  DGPA  A F+    L  +P   A  
Sbjct: 201 GGYLVADAGHHQIVHVAADGVTEQARYGAGERGLVDGPAHTAQFNEPNGLLVLPPDVAAE 260

Query: 173 ----LLISDHGNQLIRQINL 188
               ++++D  N  +R ++L
Sbjct: 261 VGYDVVVADTVNHCVRSLSL 280



 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 49/109 (44%), Gaps = 5/109 (4%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAV---DMKGNIYVADKSNHVIRKI-TNLGVTTI 138
           +  AG   +G  DG    + F +P   AV   D    ++VAD     +R I T+  V T 
Sbjct: 342 QWFAGTMNEGLVDGPVAQSWFAQPSGLAVTGVDADERVWVADSETSALRWIDTSDTVHTA 401

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            G G    G  DG A  A F +   +  +P   +L+++D  N  +R+ +
Sbjct: 402 VGAGLFDFGHRDGDADQALFQHPLAVAALPD-NSLVVADTYNGALRRYD 449


>gi|290978917|ref|XP_002672181.1| predicted protein [Naegleria gruberi]
 gi|284085756|gb|EFC39437.1| predicted protein [Naegleria gruberi]
          Length = 1457

 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 8/60 (13%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKIT-NLGVTTIAGGG 142
            AGDG      G    A F++P   A+D   G++YV D+ NH IRKIT +  ++T+AG G
Sbjct: 125 FAGDG------GMAIKALFNRPVHIAIDHSNGDLYVTDQGNHAIRKITWDKTISTVAGIG 178


>gi|374849371|dbj|BAL52388.1| PKD domain protein [uncultured candidate division OP1 bacterium]
 gi|374857216|dbj|BAL60069.1| hypothetical protein HGMM_OP4C705 [uncultured candidate division
           OP1 bacterium]
          Length = 635

 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNAS 157
           PG  +F+ P   AVD  GN+YV D  NH I+K  + G   +   G++  G       +  
Sbjct: 384 PGDGQFNGPTDIAVDAAGNVYVVDSGNHRIQKFDSTG-KFLGKWGTRGSG-------DGQ 435

Query: 158 FSNDFELTFVPHICALLISDHGNQLIRQINL 188
           F     +        + ++D GN  I++ ++
Sbjct: 436 FETPIGIALDGSGKFIYVADKGNHRIQKFDI 466



 Score = 41.6 bits (96), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 43/86 (50%), Gaps = 16/86 (18%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV----QGYSDG-EPGSARFD 104
           ++DR G+     +   RS F         E +  R   DG+    Q   DG EPG+  F 
Sbjct: 271 LVDRQGNRSEQSVLRQRSTF---------EFIAGRKVADGLFFLDQFGQDGSEPGA--FR 319

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKI 130
           +P+  AVD KG+IYVAD  NH I++ 
Sbjct: 320 RPQGIAVDSKGSIYVADTENHRIQRF 345



 Score = 40.4 bits (93), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 48/109 (44%), Gaps = 9/109 (8%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
           G  +F +P++ AVD  GN+YV+D  NH I+K    G   +  G   + G A G      F
Sbjct: 504 GDGQFFEPQAIAVDGAGNVYVSDTGNHRIQKFDANGKFLLKWG---RNGLAQG-----QF 555

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVS 207
                L F      LL+ D  N  +++ N       +  + G+  G ++
Sbjct: 556 DVPRGLAFTKQ-GILLVVDQNNNRVQEFNADGTFVRQWGEQGNGEGELN 603



 Score = 37.7 bits (86), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 42/146 (28%), Positives = 59/146 (40%), Gaps = 21/146 (14%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRK--ITNLGV 135
           +S  K L   G +G  DG+     F+ P   A+D  G  IYVADK NH I+K  I+   V
Sbjct: 417 DSTGKFLGKWGTRGSGDGQ-----FETPIGIALDGSGKFIYVADKGNHRIQKFDISGPTV 471

Query: 136 TTIAGGGSK-------KEGRADGPAQNASFSNDFELTFVPHICA------LLISDHGNQL 182
             +   GS+         GR   P             F P   A      + +SD GN  
Sbjct: 472 RFVGKWGSECNLTVTPPTGRCIDPDGGGPLQTGDGQFFEPQAIAVDGAGNVYVSDTGNHR 531

Query: 183 IRQINLKPEDCSKSSQSGSALGAVSV 208
           I++ +   +   K  ++G A G   V
Sbjct: 532 IQKFDANGKFLLKWGRNGLAQGQFDV 557


>gi|124008663|ref|ZP_01693353.1| cohesin domain protein [Microscilla marina ATCC 23134]
 gi|123985739|gb|EAY25612.1| cohesin domain protein [Microscilla marina ATCC 23134]
          Length = 899

 Score = 41.6 bits (96), Expect = 0.37,   Method: Composition-based stats.
 Identities = 38/115 (33%), Positives = 63/115 (54%), Gaps = 17/115 (14%)

Query: 81  VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHV-IRKI--TNLGV 135
           V+  +AG GV G++   G    A+ + PK+ A+D + N+Y+AD   H+ IRK+  T   +
Sbjct: 136 VITPVAGTGVAGFAGDGGLATQAQLNTPKAIALDAENNLYIAD---HIRIRKVDATTGII 192

Query: 136 TTIAGGGSKKEGRADG-PAQNASFS--NDFELTFVPHICALLISDHGNQLIRQIN 187
           TT+AG G+      DG PA  A+ +  +D  L  + +I  + +  H    IR+I+
Sbjct: 193 TTVAGSGTLT---TDGVPALEANLAPIHDLALDAIGNIYVVEVLKH---RIRKID 241



 Score = 38.5 bits (88), Expect = 2.9,   Method: Composition-based stats.
 Identities = 28/77 (36%), Positives = 38/77 (49%), Gaps = 6/77 (7%)

Query: 81  VVKRLAGDGVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVT 136
           ++  +AG G    +DG P   A        A+D  GNIYV +   H IRKI   TN+ +T
Sbjct: 191 IITTVAGSGTLT-TDGVPALEANLAPIHDLALDAIGNIYVVEVLKHRIRKIDAQTNI-IT 248

Query: 137 TIAGGGSKKEGRADGPA 153
           T+AG   +      GPA
Sbjct: 249 TVAGSAQQTSNNPIGPA 265


>gi|423214678|ref|ZP_17201206.1| hypothetical protein HMPREF1074_02738 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692584|gb|EIY85821.1| hypothetical protein HMPREF1074_02738 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 593

 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 43/74 (58%), Gaps = 2/74 (2%)

Query: 72  LSFPLSEES-VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LSF   + S  V  +AG+     +DG+  +A F  P+  A+   G+I++A++ N+VIRK+
Sbjct: 281 LSFTYEDLSYTVATVAGNSATTSTDGKGTAASFKFPQGLALAPNGDIWIAERGNNVIRKM 340

Query: 131 -TNLGVTTIAGGGS 143
                V+T+A  G+
Sbjct: 341 DQEYNVSTVAKSGT 354



 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 3/54 (5%)

Query: 79  ESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +K +AG  G +G  DG   +A F+ P S  V  K  +Y+AD+ NH+IR IT
Sbjct: 539 KGTLKTIAGIPGTKGKIDGSALTATFNCPASVLVSDK--VYIADEQNHLIRTIT 590



 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 15/35 (42%), Positives = 21/35 (60%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
           F  P S   D  GN+Y+AD+ N  ++KIT+ G  T
Sbjct: 399 FKSPMSVTFDSNGNMYIADRDNKAVKKITSGGTVT 433


>gi|294644532|ref|ZP_06722286.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
 gi|294805794|ref|ZP_06764671.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
 gi|336403611|ref|ZP_08584323.1| hypothetical protein HMPREF0127_01636 [Bacteroides sp. 1_1_30]
 gi|345510783|ref|ZP_08790344.1| hypothetical protein BSAG_00302 [Bacteroides sp. D1]
 gi|229442801|gb|EEO48592.1| hypothetical protein BSAG_00302 [Bacteroides sp. D1]
 gi|292640085|gb|EFF58349.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
 gi|294447015|gb|EFG15605.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
 gi|335945537|gb|EGN07348.1| hypothetical protein HMPREF0127_01636 [Bacteroides sp. 1_1_30]
          Length = 593

 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 43/74 (58%), Gaps = 2/74 (2%)

Query: 72  LSFPLSEES-VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LSF   + S  V  +AG+     +DG+  +A F  P+  A+   G+I++A++ N+VIRK+
Sbjct: 281 LSFTYEDLSYTVATVAGNSATTSTDGKGTAASFKFPQGLALAPNGDIWIAERGNNVIRKM 340

Query: 131 -TNLGVTTIAGGGS 143
                V+T+A  G+
Sbjct: 341 DQEYNVSTVAKSGT 354



 Score = 37.7 bits (86), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 3/54 (5%)

Query: 79  ESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +K +AG  G +G  DG   +A F+ P S  V  K  +Y+AD+ NH+IR IT
Sbjct: 539 KGTLKTIAGIPGTKGKIDGSALTATFNCPASVLVSDK--VYIADEQNHLIRTIT 590



 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 15/35 (42%), Positives = 21/35 (60%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
           F  P S   D  GN+Y+AD+ N  ++KIT+ G  T
Sbjct: 399 FKSPMSVTFDSNGNMYIADRDNKAVKKITSGGTVT 433


>gi|206603181|gb|EDZ39661.1| NHL repeat domain protein [Leptospirillum sp. Group II '5-way CG']
          Length = 494

 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 37/69 (53%), Gaps = 4/69 (5%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKS--FAVDMKGNIYVADKSNHVIRKI--TNL 133
           E++ ++ + G G+ G +DG    A+F  P+   F  D      VAD  NH++R++     
Sbjct: 210 EKAALREVIGQGIPGSADGSFDQAQFRDPQGVRFCPDDPDTAIVADTGNHLLRRVDFRRR 269

Query: 134 GVTTIAGGG 142
            VTTIAG G
Sbjct: 270 SVTTIAGTG 278


>gi|163841552|ref|YP_001625957.1| hypothetical protein RSal33209_2819 [Renibacterium salmoninarum
           ATCC 33209]
 gi|162955028|gb|ABY24543.1| conserved hypothetical protein [Renibacterium salmoninarum ATCC
           33209]
          Length = 425

 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 42/76 (55%), Gaps = 10/76 (13%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKS-------FAVDMKGNIYVADKSNHVIRKI- 130
           E+VV+   G G +G++DG    ARF++P+         A  +  ++ VAD  NH +R I 
Sbjct: 88  ETVVRAF-GTGTKGFADGAAQMARFNEPQGLVLLPEVLAAQLGYDVVVADSVNHRLRGIS 146

Query: 131 -TNLGVTTIAGGGSKK 145
            T+  V T+AG G ++
Sbjct: 147 LTDGSVRTLAGNGVQR 162



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 51/111 (45%), Gaps = 6/111 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGVT--- 136
           V  LAG G++G  DG    A F +    AVD  G+I+VA      +R  ++T+  V    
Sbjct: 226 VSILAGTGLEGLLDGAADQAWFAQSSGLAVDANGDIWVAASETSALRVLRVTDGRVNRVE 285

Query: 137 TIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
           T  G G    G  DG A  A   +   +  +P   ++ I+D  N  +R+ +
Sbjct: 286 TAVGEGLFDFGFQDGDASAARLQHPLGVAALPD-GSVAIADTYNGAVRRYD 335


>gi|395502099|ref|XP_003755423.1| PREDICTED: NHL repeat-containing protein 2 [Sarcophilus harrisii]
          Length = 1044

 Score = 41.6 bits (96), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 36/90 (40%), Positives = 49/90 (54%), Gaps = 15/90 (16%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI---TNLGVTTIAG---GGSK 144
           G  DG    + F+ P+  A+  K NI YVAD  NH+IRKI   T + V+T+AG    G+ 
Sbjct: 587 GRKDGTFLESTFNSPQGVAI--KNNIIYVADTENHLIRKIDLETKM-VSTVAGIGIQGTD 643

Query: 145 KEGRADGPAQNASFSNDFELTF---VPHIC 171
           KEG A G  Q    S+ +++ F   V  IC
Sbjct: 644 KEGGAKGEEQ--PISSPWDVVFGISVSEIC 671


>gi|290990638|ref|XP_002677943.1| predicted protein [Naegleria gruberi]
 gi|284091553|gb|EFC45199.1| predicted protein [Naegleria gruberi]
          Length = 1317

 Score = 41.6 bits (96), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 39/67 (58%), Gaps = 2/67 (2%)

Query: 76  LSEESVVKRLAG-DGVQGY-SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +S+E ++  L G  GV GY ++ +     F++PK       G++YV D  N++IR+I+  
Sbjct: 202 VSKEGIISTLYGIPGVSGYNTESDASKTLFNQPKDIHYTNDGDLYVFDSKNNLIRRISKG 261

Query: 134 GVTTIAG 140
            +TTI G
Sbjct: 262 IITTIVG 268



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 27/96 (28%), Positives = 45/96 (46%), Gaps = 1/96 (1%)

Query: 92  GYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD 150
           G  DG P + A+  +  SF V ++G + +AD  +HVIRK++  G+ +   G     G   
Sbjct: 163 GLGDGLPATEAKIQEVSSFFVSVEGEMVIADTQDHVIRKVSKEGIISTLYGIPGVSGYNT 222

Query: 151 GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
               + +  N  +     +   L + D  N LIR+I
Sbjct: 223 ESDASKTLFNQPKDIHYTNDGDLYVFDSKNNLIRRI 258


>gi|427384743|ref|ZP_18881248.1| hypothetical protein HMPREF9447_02281 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728004|gb|EKU90863.1| hypothetical protein HMPREF9447_02281 [Bacteroides oleiciplenus YIT
           12058]
          Length = 486

 Score = 41.6 bits (96), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 51/113 (45%), Gaps = 18/113 (15%)

Query: 92  GYSDGEPGSARFDKPKS--------FAVDMKGN---IYVADKSNHVIRKITNLG-VTTIA 139
           G+ DG   SAR   P          +  + K +    Y  D+ NH IR +T  G VTT A
Sbjct: 361 GWLDGVGASARLSSPYQGIFVKNPDYVAEGKEDQYDFYFCDRDNHAIRVLTPEGIVTTFA 420

Query: 140 GGGS-----KKEGRADGPAQ-NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G GS     K EG  +G  + +A F+    L +     A  I + GN L+R+I
Sbjct: 421 GRGSTSVDGKPEGYINGAIRADARFNKPVSLAYDEKEDAFYIGEVGNNLLRKI 473


>gi|350403739|ref|XP_003486886.1| PREDICTED: NHL repeat-containing protein 2-like [Bombus impatiens]
          Length = 696

 Score = 41.6 bits (96), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 40/136 (29%), Positives = 62/136 (45%), Gaps = 12/136 (8%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIA 139
           V+ + G     + DG+  +A+F+ P+  +  +   IYVAD  NH IRKI  +   V TIA
Sbjct: 259 VQHIIGGPNPDFKDGDFETAKFNAPQGVST-LGSLIYVADTKNHAIRKIDLMRRIVITIA 317

Query: 140 GGGSKKEGRADGP-AQNASFSNDFELTFVPH------ICALLISDHGNQLIRQINLKPED 192
           G G K      G   ++   S+ ++L    H         LLI+  GN  I  + L+   
Sbjct: 318 GTGIKGHDYVGGKIGKDQVLSSPWDLAVYKHEHDNNITPILLIAMAGNHQIWALFLEDTI 377

Query: 193 CSKSSQ--SGSALGAV 206
             K+ +   G+ L  V
Sbjct: 378 WWKNKKYNEGTCLAIV 393


>gi|219853190|ref|YP_002467622.1| SMP-30/Gluconolaconase/LRE domain-containing protein
           [Methanosphaerula palustris E1-9c]
 gi|219547449|gb|ACL17899.1| SMP-30/Gluconolaconase/LRE domain protein [Methanosphaerula
           palustris E1-9c]
          Length = 343

 Score = 41.6 bits (96), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 1/52 (1%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRA 149
           PG  +F  P+  AVD  GN+YV D  N+ I+K T+ G T I   G++  G A
Sbjct: 105 PGDGQFKSPEGIAVDNAGNVYVVDTVNNQIQKFTSDG-TFITKWGTRASGIA 155


>gi|402571883|ref|YP_006621226.1| hypothetical protein Desmer_1348 [Desulfosporosinus meridiei DSM
           13257]
 gi|402253080|gb|AFQ43355.1| conserved repeat protein [Desulfosporosinus meridiei DSM 13257]
          Length = 1977

 Score = 41.6 bits (96), Expect = 0.39,   Method: Composition-based stats.
 Identities = 31/86 (36%), Positives = 39/86 (45%), Gaps = 2/86 (2%)

Query: 77  SEESVVKRLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLG 134
           S   + KR AG GV   +     G+ +F  P   AVD  GN+YVAD  N  I+K  +N  
Sbjct: 112 SNHRIQKRNAGTGVWTTWGSSGSGNKQFSSPSGVAVDADGNVYVADTFNSRIQKFDSNGN 171

Query: 135 VTTIAGGGSKKEGRADGPAQNASFSN 160
             T  G     EG+   P   A  SN
Sbjct: 172 YLTQWGSLGSDEGQLKYPFDLAVSSN 197



 Score = 40.8 bits (94), Expect = 0.58,   Method: Composition-based stats.
 Identities = 26/62 (41%), Positives = 34/62 (54%), Gaps = 4/62 (6%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTI---AGGGSKKEGRADGPAQ 154
           G+ +F  P   AVD  GN+YVAD SNH I+K     GV T    +G G+K+     G A 
Sbjct: 88  GAGKFYSPAGVAVDGAGNVYVADTSNHRIQKRNAGTGVWTTWGSSGSGNKQFSSPSGVAV 147

Query: 155 NA 156
           +A
Sbjct: 148 DA 149



 Score = 39.3 bits (90), Expect = 1.7,   Method: Composition-based stats.
 Identities = 19/39 (48%), Positives = 23/39 (58%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
           G  +F    S AVD   NIYVAD  NH I+++TNL   T
Sbjct: 556 GDGQFSGIYSLAVDRNDNIYVADSQNHRIQRLTNLYTVT 594



 Score = 37.7 bits (86), Expect = 5.4,   Method: Composition-based stats.
 Identities = 24/73 (32%), Positives = 34/73 (46%), Gaps = 1/73 (1%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIAGGGSKKEGR 148
           V+ +     G+ +FD      VD  GN+YVAD +N  I+K T+  G  T  G     +G+
Sbjct: 455 VRSWGSQGSGNGQFDNMNGIGVDSSGNVYVADTNNSRIQKFTSEGGYLTQWGSAGSTDGK 514

Query: 149 ADGPAQNASFSND 161
              P   A  S D
Sbjct: 515 FYSPRGIAVDSQD 527


>gi|290970758|ref|XP_002668244.1| predicted protein [Naegleria gruberi]
 gi|284081534|gb|EFC35500.1| predicted protein [Naegleria gruberi]
          Length = 735

 Score = 41.6 bits (96), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 95  DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSK 144
           +G+P  SA F  P S  V   G IYV+D   H IRKI N  +T IAG G++
Sbjct: 391 NGQPALSAHFKLPTSIFVTDDGTIYVSDTQLHQIRKIQNGIITAIAGTGTQ 441



 Score = 37.4 bits (85), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 38/74 (51%), Gaps = 11/74 (14%)

Query: 78  EESVVKRLAGDGVQGYSDG----EPGSA---RFD--KPKSFAVDMKGNIYVADKSNHVIR 128
           +  ++  +AG G Q  + G    + GS    R D   P    V   G +Y AD SNH +R
Sbjct: 428 QNGIITAIAGTGTQCTTAGSNTCDHGSTSPLRVDLYTPMGIYVTKSGEVYFADTSNHKVR 487

Query: 129 KITN--LGVTTIAG 140
           KI++  L +TT+AG
Sbjct: 488 KISSDGLTITTVAG 501


>gi|290982063|ref|XP_002673750.1| predicted protein [Naegleria gruberi]
 gi|284087336|gb|EFC41006.1| predicted protein [Naegleria gruberi]
          Length = 1759

 Score = 41.6 bits (96), Expect = 0.40,   Method: Composition-based stats.
 Identities = 31/106 (29%), Positives = 52/106 (49%), Gaps = 5/106 (4%)

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVD-MKGNIYVAD 121
           + S+   + F      ++   AGDG   YS DG   S A+ + P    +D +  N+ +AD
Sbjct: 52  AESSASVIRFVDKSTGIISTYAGDGSPAYSGDGVLASQAKLNYPNFVTIDNLSNNLLIAD 111

Query: 122 KSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
            +N V+R +  T   ++TIAG           P++ +SFS+ F +T
Sbjct: 112 TANFVVRYVNRTTRIISTIAGYNVNGNSGDGLPSKQSSFSSLFGIT 157


>gi|426366237|ref|XP_004050167.1| PREDICTED: NHL repeat-containing protein 2 [Gorilla gorilla
           gorilla]
          Length = 726

 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+ +   IYVAD  NH+IRKI      V+T+AG    G+ KE
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-INNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKE 325

Query: 147 GRADGPAQNASFSNDFELTF 166
           G A G  Q    S+ +++ F
Sbjct: 326 GGAKGEQQ--PISSPWDVVF 343


>gi|384248465|gb|EIE21949.1| hypothetical protein COCSUDRAFT_16964 [Coccomyxa subellipsoidea
           C-169]
          Length = 1021

 Score = 41.2 bits (95), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 3/73 (4%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           + D+IV DS       L  P ++   ++ +AG G  G +DG  G+AR  +P       +G
Sbjct: 773 NGDVIVADSYNHRLKLLD-PATDS--IRTVAGSGAAGLTDGTGGNARVSEPGGLCAGPRG 829

Query: 116 NIYVADKSNHVIR 128
            +++AD +N  IR
Sbjct: 830 TVFIADTNNSAIR 842


>gi|195641946|gb|ACG40441.1| hypothetical protein [Zea mays]
          Length = 79

 Score = 41.2 bits (95), Expect = 0.42,   Method: Composition-based stats.
 Identities = 18/34 (52%), Positives = 23/34 (67%)

Query: 134 GVTTIAGGGSKKEGRADGPAQNASFSNDFELTFV 167
           GVTTI  G S + G  DGP+  A FSNDFE+ ++
Sbjct: 43  GVTTIDWGKSGRGGHVDGPSGEAKFSNDFEVHYI 76


>gi|383100947|emb|CCD74491.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis halleri subsp. halleri]
          Length = 1028

 Score = 41.2 bits (95), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 43/103 (41%), Gaps = 2/103 (1%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           G +DG         P        G IY+ D  NH I+K+  +    +   G+ K G  DG
Sbjct: 764 GDNDGVGAEVLLQHPLGVLCAKDGQIYLTDSYNHKIKKLDPVTKRVVTVAGTGKAGFKDG 823

Query: 152 PAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL-KPEDC 193
             + A  S    L    +   L ++D  N LIR I+L K ED 
Sbjct: 824 KVKGAQLSEPAGLAITEN-GRLFVADTNNSLIRYIDLNKGEDA 865



 Score = 40.8 bits (94), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 1/56 (1%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITN 132
           E + + ++   G +G+ DG    A F++P+  A + K N +YVAD  NH +R + N
Sbjct: 600 EGNFIVQIGSTGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALRYLLN 655



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 3/55 (5%)

Query: 79  ESVVKRL---AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + V KR+   AG G  G+ DG+   A+  +P   A+   G ++VAD +N +IR I
Sbjct: 803 DPVTKRVVTVAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI 857


>gi|291240969|ref|XP_002740388.1| PREDICTED: tripartite motif protein-like [Saccoglossus kowalevskii]
          Length = 720

 Score = 41.2 bits (95), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 63/145 (43%), Gaps = 9/145 (6%)

Query: 49  SVIDRPGSS-DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD--- 104
            ++D  G   D I+LDS R     ++  L+   VV       V  YS+      +F    
Sbjct: 496 QILDSNGKYIDQILLDSERPI--GVAMALNSNIVVVEFDTKHVHIYSENREKIKQFTYKE 553

Query: 105 --KPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDF 162
             KP    V+ KG+I VAD   H I   T+ G   +  G  + EG+ + P   A+  ND 
Sbjct: 554 FVKPYGVTVNSKGHILVADPKAHCIFVFTSDGDFIMKIGKEEGEGKLNNPHFVATGRNDD 613

Query: 163 ELTFVPHICALLISDHGNQLIRQIN 187
            +    +    L + HG Q IR+I+
Sbjct: 614 VIVSDTYQGLALFTKHG-QFIRRIH 637


>gi|410477767|ref|YP_006765404.1| NHL repeat domain-containing protein [Leptospirillum ferriphilum
           ML-04]
 gi|406773019|gb|AFS52444.1| NHL repeat domain protein [Leptospirillum ferriphilum ML-04]
          Length = 494

 Score = 41.2 bits (95), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 63/131 (48%), Gaps = 10/131 (7%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKS--FAVDMKGNIYVADKSNHVIRKI--TNL 133
           E++ ++ + G G  G +DG    A+F  P+   F  D      VAD  NH++R++     
Sbjct: 210 EKAALREVIGQGTPGSADGSFDQAQFRDPQGIRFCPDDPDIAIVADTGNHLLRRVDFRRR 269

Query: 134 GVTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALL-ISDHGNQLIRQINLKPE 191
            VTTIAG G +     +  PA +A  ++ +++ F  H   +L ++  G   I  I L PE
Sbjct: 270 SVTTIAGTGVQGWAIFEPVPAMSAVLNSPWDILF--HRDGMLYVAQAGPHQI--IRLDPE 325

Query: 192 DCSKSSQSGSA 202
                  +GSA
Sbjct: 326 RQEIFPVAGSA 336


>gi|328783461|ref|XP_396242.3| PREDICTED: NHL repeat-containing protein 2 [Apis mellifera]
          Length = 699

 Score = 41.2 bits (95), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 3/63 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIA 139
           V+ + G     + DG+  +ARF+ P+   + +   IYVAD  NH IRKI  +   VTT+ 
Sbjct: 262 VQHVIGGSNPDFRDGDFENARFNAPQGVCI-LDTFIYVADNENHAIRKIDLIKKMVTTVV 320

Query: 140 GGG 142
           G G
Sbjct: 321 GTG 323


>gi|440683842|ref|YP_007158637.1| NHL repeat containing protein [Anabaena cylindrica PCC 7122]
 gi|428680961|gb|AFZ59727.1| NHL repeat containing protein [Anabaena cylindrica PCC 7122]
          Length = 506

 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 62/139 (44%), Gaps = 14/139 (10%)

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ---GYSDGEPGSARFDKPKSFAVDM 113
            +L + DS  S+   +   L E+  V+ + G G+    G  DGE    R        V+ 
Sbjct: 350 QELFIADSEVSSIRGVE--LDEQGKVRTICGSGMLFGFGDMDGEKLDVRLQH--CLGVEY 405

Query: 114 KGN-IYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
             N ++VAD  NH I+ +  T     T+ G G    G  DG  +N  FS    L+ +   
Sbjct: 406 FQNYLWVADTYNHKIKLVHPTTGNCQTVLGDGLP--GFQDGQGKNTRFSEPSGLSGID-- 461

Query: 171 CALLISDHGNQLIRQINLK 189
             L ISD  N  IR++NLK
Sbjct: 462 SYLYISDTNNHAIRRVNLK 480



 Score = 40.4 bits (93), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 18/56 (32%), Positives = 34/56 (60%), Gaps = 2/56 (3%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           + + GDG+ G+ DG+  + RF +P   +  +   +Y++D +NH IR++ NL   T+
Sbjct: 431 QTVLGDGLPGFQDGQGKNTRFSEPSGLS-GIDSYLYISDTNNHAIRRV-NLKTLTV 484



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 62/139 (44%), Gaps = 11/139 (7%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           SS L + DS         F    +  +  + G G  G +DG     +F  P+    D + 
Sbjct: 182 SSGLFIADSGHHRIVMTRF----DGKILYVIGTGKSGLTDGAFNQVQFSAPQGMTFDDES 237

Query: 116 NI-YVADKSNHVIRKI--TNLGVTTIAGGGSKKE--GRADGPAQNASFSNDFELTFVPHI 170
            I Y+AD  N+ +R+I      V TIAG G +        G A   + ++ ++L  V + 
Sbjct: 238 QILYIADTENYALRQIDLQRQVVETIAGTGQQNHIISPHGGVALETALNSPWDLQKVGN- 296

Query: 171 CALLISDHGNQLIRQINLK 189
             LLI+  G+  I +++L+
Sbjct: 297 -KLLIAMAGSHQIWEMDLE 314


>gi|422325157|ref|ZP_16406193.1| hypothetical protein HMPREF0737_01303 [Rothia mucilaginosa M508]
 gi|353343435|gb|EHB87752.1| hypothetical protein HMPREF0737_01303 [Rothia mucilaginosa M508]
          Length = 656

 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 50/110 (45%), Gaps = 5/110 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT----NLGVTT 137
           V   AG GV+G +DG    A F +P        G+++VA      +R IT     + VT+
Sbjct: 363 VSIFAGSGVEGLADGPAADAWFAQPSGIIEARDGSLWVACSETSGLRHITFTDNGVQVTS 422

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             G G    G  DG +  A   +   LT +P   ++ ++D  N  IR+ +
Sbjct: 423 AVGKGLFDFGFVDGDSDTARMQHPLGLTELPD-GSIAVADTYNGAIRRWD 471


>gi|326429396|gb|EGD74966.1| hypothetical protein PTSG_12546 [Salpingoeca sp. ATCC 50818]
          Length = 1448

 Score = 41.2 bits (95), Expect = 0.46,   Method: Composition-based stats.
 Identities = 23/83 (27%), Positives = 45/83 (54%), Gaps = 3/83 (3%)

Query: 80   SVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVT 136
            + +K +AG+G   ++   +P     ++ +  AVD  GN+YVA ++ H + +I   +  +T
Sbjct: 1097 NTIKTVAGNGKTAWTGSSQPTQTGLNQAECVAVDGSGNMYVALRTGHRVVRIARGDGAIT 1156

Query: 137  TIAGGGSKKEGRADGPAQNASFS 159
            T+AG G+  +    G A +A  +
Sbjct: 1157 TVAGTGTAGDSGVGGAAASAQLN 1179



 Score = 38.9 bits (89), Expect = 2.6,   Method: Composition-based stats.
 Identities = 25/115 (21%), Positives = 53/115 (46%), Gaps = 4/115 (3%)

Query: 79   ESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLG 134
            +  +  +AG G  G S   G   SA+ ++P   A+D  GN+Y++++ +H +  +  T+  
Sbjct: 1152 DGAITTVAGTGTAGDSGVGGAAASAQLNRPTCVALDAAGNVYISEEGSHRVSVVDATSGD 1211

Query: 135  VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            ++ + G G        G ++ A       L        L ++D  N ++  ++L+
Sbjct: 1212 LSVLVGTGHSGHRGMGGSSRAADIHTPRGLAVDNDAKVLYVADAENHVVYSVDLR 1266


>gi|290991059|ref|XP_002678153.1| serine/threonine protein kinase [Naegleria gruberi]
 gi|284091764|gb|EFC45409.1| serine/threonine protein kinase [Naegleria gruberi]
          Length = 1397

 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 33/74 (44%), Positives = 39/74 (52%), Gaps = 4/74 (5%)

Query: 117 IYVADKSNHVIRKI-TNLGV-TTIAGG-GSKKEGRADGP-AQNASFSNDFELTFVPHICA 172
           +YV D  NHVIRKI TN  V TTIAG   +      DG  A NASFS    +        
Sbjct: 181 LYVCDSGNHVIRKIDTNTNVITTIAGNVNTNNTFYGDGILALNASFSYPQGIYVSQSSGE 240

Query: 173 LLISDHGNQLIRQI 186
           + +SD GN  IR+I
Sbjct: 241 IFVSDSGNNRIRKI 254



 Score = 37.7 bits (86), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 36/142 (25%), Positives = 62/142 (43%), Gaps = 7/142 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P S++  P + +  +L+S R+   +L   LS+   + R  G   +   + +P S R    
Sbjct: 341 PRSIMRNPFTEEWYILES-RADRISL---LSQTGTISRFCGSPFEPSFNSDPFSIRL--T 394

Query: 107 KSFAVDMKGNI-YVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           K++ V    N+ Y+     H I +I N  V +   G +   G       N  F+N   + 
Sbjct: 395 KTYGVTKVQNVLYLTQSLTHSILQIKNSQVYSFGKGEAGFAGDGLVLDSNTQFNNPTAVA 454

Query: 166 FVPHICALLISDHGNQLIRQIN 187
           F     A+ I+D  N  IR+I+
Sbjct: 455 FSSGTKAIYIADTYNHRIRRID 476


>gi|295837701|ref|ZP_06824634.1| NHL repeat protein [Streptomyces sp. SPB74]
 gi|197695928|gb|EDY42861.1| NHL repeat protein [Streptomyces sp. SPB74]
          Length = 610

 Score = 41.2 bits (95), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 66/124 (53%), Gaps = 8/124 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG +  +V D++R A   L    ++   V+R  G G +G  DG+  +ARF++P
Sbjct: 183 PGKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGGGERGLLDGDAATARFNEP 238

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK-KEGR-ADGPAQNASFSNDF 162
           +  A+     + VAD  NH +R++   +  VTT+AG G +  +G    G A+  S S+ +
Sbjct: 239 QGLALLPDRAVVVADTVNHALRRVDLASGEVTTLAGTGRQWWQGEPVAGAAREVSLSSPW 298

Query: 163 ELTF 166
           ++ +
Sbjct: 299 DVAW 302


>gi|126273410|ref|XP_001377730.1| PREDICTED: NHL repeat-containing protein 2 [Monodelphis domestica]
          Length = 777

 Score = 41.2 bits (95), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 31/70 (44%), Positives = 40/70 (57%), Gaps = 10/70 (14%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI---TNLGVTTIAG---GGSK 144
           G  DG    + F+ P+   V +K NI YVAD  NH+IRKI   T + V+T+AG    G+ 
Sbjct: 317 GSKDGTFLESTFNSPQ--GVSIKNNIIYVADTENHLIRKIDLETEM-VSTVAGIGIQGTD 373

Query: 145 KEGRADGPAQ 154
           KEG A G  Q
Sbjct: 374 KEGGAKGEEQ 383


>gi|160883851|ref|ZP_02064854.1| hypothetical protein BACOVA_01824 [Bacteroides ovatus ATCC 8483]
 gi|156110581|gb|EDO12326.1| NHL repeat protein [Bacteroides ovatus ATCC 8483]
          Length = 494

 Score = 41.2 bits (95), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 51/124 (41%), Gaps = 20/124 (16%)

Query: 89  GVQGYSDGEPGSARFDKP-----------KSFAVDMKGNIYVADKSNHVIRKITNLGVT- 136
           G  GY D     AR  +P           K+   D   + Y  D+ NH IRK+T  G+T 
Sbjct: 356 GQAGYDDKIGAKARLSRPYQGVFVKNPEYKAAGKDEIYDFYFTDRDNHCIRKLTPDGITS 415

Query: 137 TIAGGGS-----KKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK- 189
           T AG GS        G  DG   + A F +   L +        + D  N  IR+I L+ 
Sbjct: 416 TFAGRGSVGMNIHANGYVDGALREEARFDSPAALAYDEVNNIFYVGDVNNHRIRKIALEK 475

Query: 190 -PED 192
            PED
Sbjct: 476 IPED 479


>gi|124004309|ref|ZP_01689155.1| cohesin domain protein [Microscilla marina ATCC 23134]
 gi|123990379|gb|EAY29878.1| cohesin domain protein [Microscilla marina ATCC 23134]
          Length = 849

 Score = 41.2 bits (95), Expect = 0.50,   Method: Composition-based stats.
 Identities = 38/114 (33%), Positives = 59/114 (51%), Gaps = 15/114 (13%)

Query: 81  VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VT 136
           V+  +AG GV G++   G    A+   PK+ A+D   N+Y+ D +   IRK+  +   +T
Sbjct: 86  VITPVAGTGVAGFAGDGGLATQAQLVGPKAIALDATNNLYIVDGTR--IRKVDAVTGIIT 143

Query: 137 TIAGGGSKKEGRADG-PAQNASFS--NDFELTFVPHICALLISDHGNQLIRQIN 187
           TIAG G     R DG PA  AS +  +D  L  + +I  + +  H    IR+I+
Sbjct: 144 TIAGSGRL---RTDGVPALEASLALIHDLALDAIGNIYVVEVLKH---RIRKID 191



 Score = 37.7 bits (86), Expect = 4.8,   Method: Composition-based stats.
 Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 6/77 (7%)

Query: 81  VVKRLAGDGVQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI---TNLGVT 136
           ++  +AG G +  +DG P   A        A+D  GNIYV +   H IRKI   TN+ +T
Sbjct: 141 IITTIAGSG-RLRTDGVPALEASLALIHDLALDAIGNIYVVEVLKHRIRKIDAQTNI-IT 198

Query: 137 TIAGGGSKKEGRADGPA 153
           T+AG   +      GPA
Sbjct: 199 TVAGSAQQTSNNPIGPA 215


>gi|317504749|ref|ZP_07962709.1| NHL repeat containing protein [Prevotella salivae DSM 15606]
 gi|315664131|gb|EFV03838.1| NHL repeat containing protein [Prevotella salivae DSM 15606]
          Length = 448

 Score = 41.2 bits (95), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 12/105 (11%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIA 139
           +++ AG    G  DG   SA F  P   A D+ GN+Y+ D     IRKI +    VTT+A
Sbjct: 339 IEQFAGFNGVGSMDGPRMSASFRNPTGMAFDVDGNMYITDSMGFTIRKIGHADGMVTTVA 398

Query: 140 GGGSKKE-GRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
           G  +K +  + +G     +F+  ++++         + D GN  I
Sbjct: 399 GIYNKSDNNKVEGLPSEVTFTYPYDIS---------VDDDGNFYI 434


>gi|443708985|gb|ELU03866.1| hypothetical protein CAPTEDRAFT_52631, partial [Capitella teleta]
          Length = 507

 Score = 40.8 bits (94), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 60/135 (44%), Gaps = 13/135 (9%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-------GYSDGEPGSARFDKPKSFAV 111
           L + DS  S+  +L   LS+ SV K + G  +        G  DG+   A+   P +  +
Sbjct: 356 LYIADSESSSIRSLD--LSDGSV-KSVVGADIDPKNLFAFGDVDGKGIEAKLQHPLAVTL 412

Query: 112 DMKGNIYVADKSNHVIRKITN--LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPH 169
              G ++VAD  NH I+K+      V T+ G G   +   D P Q A  +    L + PH
Sbjct: 413 AEDGQLFVADSYNHKIKKVVPKFREVRTVFGTGKMGDTLKDSPLQCA-LNEPGGLAYNPH 471

Query: 170 ICALLISDHGNQLIR 184
              L I+D  N  I+
Sbjct: 472 SRRLYIADTNNHYIK 486



 Score = 37.0 bits (84), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 36/77 (46%), Gaps = 2/77 (2%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-- 134
           S   +V+ + G    G +DG   S++F  P+         I VAD  NH IRKI  +   
Sbjct: 188 SMNGMVQHVIGGRGHGLTDGSFISSQFFGPQGVCFHGDDLIIVADTENHAIRKIDLVAGT 247

Query: 135 VTTIAGGGSKKEGRADG 151
           V TIAG G +    A G
Sbjct: 248 VRTIAGNGKQGTDLAGG 264


>gi|186683945|ref|YP_001867141.1| redoxin domain-containing protein [Nostoc punctiforme PCC 73102]
 gi|186466397|gb|ACC82198.1| Redoxin domain protein [Nostoc punctiforme PCC 73102]
          Length = 509

 Score = 40.8 bits (94), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 35/58 (60%), Gaps = 3/58 (5%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTI 138
           + + GDG  G  DG+  ++RF +P   +++    +Y+AD +NH IR++    L VTT+
Sbjct: 432 QTIQGDGTAGLQDGQGKNSRFFEPSGLSIN-GSYLYIADTNNHAIRRVDLNTLEVTTL 488



 Score = 40.8 bits (94), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 40/89 (44%), Gaps = 7/89 (7%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI- 117
           L + DS        SF    +  +  L G G  G +DG    A+F  P+  A D +  I 
Sbjct: 187 LFIADSGHHRLIMSSF----DGEILHLIGTGKSGLTDGAFNEAQFFAPQGMAYDAENQIL 242

Query: 118 YVADKSNHVIRKI--TNLGVTTIAGGGSK 144
           YVAD  NH +R++      V  IAG G +
Sbjct: 243 YVADTENHTLRRVDMKRQVVDAIAGTGEQ 271


>gi|309790072|ref|ZP_07684646.1| NHL repeat-containing protein [Oscillochloris trichoides DG-6]
 gi|308227927|gb|EFO81581.1| NHL repeat-containing protein [Oscillochloris trichoides DG6]
          Length = 1124

 Score = 40.8 bits (94), Expect = 0.54,   Method: Composition-based stats.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 2/37 (5%)

Query: 92   GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            GY+ GEPG+  F++P   AVD +GN+YVAD  N  ++
Sbjct: 980  GYAGGEPGA--FNEPVGVAVDAQGNVYVADTWNSRVQ 1014


>gi|91076402|ref|XP_969236.1| PREDICTED: similar to CG12547 CG12547-PA [Tribolium castaneum]
 gi|270002449|gb|EEZ98896.1| hypothetical protein TcasGA2_TC004511 [Tribolium castaneum]
          Length = 712

 Score = 40.8 bits (94), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 30/124 (24%), Positives = 46/124 (37%), Gaps = 22/124 (17%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNL 133
           L  +  +    G G  G+ DG      F+ P+      +  ++VAD  NH IRKI     
Sbjct: 245 LKSDGTILHQVGSGEIGFRDGSFTQCEFNAPQGLVFQNENTLFVADTENHAIRKIDLAKK 304

Query: 134 GVTTIAGGGSKKEGRADGPAQNASF-----------SNDFELTFVPHICALLISDHGNQL 182
            V ++ G G +   R  G                  + D +++F P          GN  
Sbjct: 305 TVESVVGSGVQGHDRVGGKQWTQQVISSPWDLCIFRTPDMDMSFYP---------EGNPP 355

Query: 183 IRQI 186
           IR+I
Sbjct: 356 IREI 359


>gi|325106982|ref|YP_004268050.1| redoxin [Planctomyces brasiliensis DSM 5305]
 gi|324967250|gb|ADY58028.1| Redoxin domain protein [Planctomyces brasiliensis DSM 5305]
          Length = 563

 Score = 40.8 bits (94), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
           ++ + G G  G  DG    A FD P+  A+ +   +YVAD  NH++R I   +  V T+A
Sbjct: 268 LQAVIGTGKIGRQDGSYEQASFDHPQGMAL-VGNTLYVADTENHLLRTIDLESKQVATLA 326

Query: 140 GGGSKKEGRADG 151
           G G +   RA G
Sbjct: 327 GTGEQARFRAGG 338


>gi|293372067|ref|ZP_06618462.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
 gi|292632971|gb|EFF51556.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
          Length = 484

 Score = 40.8 bits (94), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 47/100 (47%), Gaps = 10/100 (10%)

Query: 97  EPGSARF---DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEG 147
           +PG   F   ++ K+   D   + Y ADK NH IR +T  G V+T AG GS      K G
Sbjct: 362 KPGQGVFVLNEEYKAANKDDWYDFYFADKENHCIRILTPDGVVSTFAGRGSASASSYKWG 421

Query: 148 RADGPA-QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + +G   + A F+    L +        + D GN  IR+I
Sbjct: 422 KQNGEVRERARFNQPVALAYNEATKTFYVGDSGNYKIRKI 461


>gi|383110973|ref|ZP_09931791.1| hypothetical protein BSGG_2078 [Bacteroides sp. D2]
 gi|382949397|gb|EFS31378.2| hypothetical protein BSGG_2078 [Bacteroides sp. D2]
          Length = 499

 Score = 40.8 bits (94), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 47/100 (47%), Gaps = 10/100 (10%)

Query: 97  EPGSARF---DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEG 147
           +PG   F   ++ K+   D   + Y ADK NH IR +T  G V+T AG GS      K G
Sbjct: 377 KPGQGVFVLNEEYKAANKDDWYDFYFADKENHCIRILTPDGVVSTFAGRGSASASSYKWG 436

Query: 148 RADGPA-QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + +G   + A F+    L +        + D GN  IR+I
Sbjct: 437 KQNGEVRERARFNQPVALAYNEATKTFYVGDSGNYKIRKI 476


>gi|255513298|gb|EET89564.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Candidatus Micrarchaeum acidiphilum ARMAN-2]
          Length = 579

 Score = 40.8 bits (94), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 63/140 (45%), Gaps = 16/140 (11%)

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMK 114
           +++ V DS  SA  +++    ++     + G G+   G  DG   SAR   P   +    
Sbjct: 429 NEIYVADSEASAIRSINM---KDGYASTITGSGLFTFGDQDGSLASARLQHPIGVSAK-D 484

Query: 115 GNIYVADKSNHVIRKI---TNLGVTTIAGGGSKKEGRADGPAQNA---SFSNDFELTFVP 168
           G IYVAD  N  IR+I    N   T ++G G K   R   P  +       ND +    P
Sbjct: 485 GLIYVADTYNSAIREIDVKANRITTLVSGPGRKSACRFGDPKCDTLQLYEPNDVK----P 540

Query: 169 HICALLISDHGNQLIRQINL 188
              AL+I+D  N LIR+ ++
Sbjct: 541 FGRALIIADTNNNLIRRFDI 560


>gi|87311714|ref|ZP_01093830.1| NHL repeat protein [Blastopirellula marina DSM 3645]
 gi|87285608|gb|EAQ77526.1| NHL repeat protein [Blastopirellula marina DSM 3645]
          Length = 652

 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 5/68 (7%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGGSK 144
           G+G  G +DG+  +A F   +  A+D    +YVAD  NH++RK+      VTTIAG G  
Sbjct: 199 GNGAIGTTDGDYATASFHHLQGMALDGD-TLYVADTENHMLRKVDLKTKQVTTIAGKGV- 256

Query: 145 KEGRADGP 152
            +GR   P
Sbjct: 257 -QGRNSWP 263


>gi|57107627|ref|XP_544027.1| PREDICTED: NHL repeat-containing protein 2 isoform 1 [Canis lupus
           familiaris]
          Length = 726

 Score = 40.8 bits (94), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 10/91 (10%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTI 138
           ++ + G    G  DG    + F+ P+  A+  + NI YVAD  NH+IRKI      V+T+
Sbjct: 257 IQYIIGGPNPGRKDGIFSESTFNSPQGVAI--RNNIIYVADTENHLIRKIDLEAEMVSTV 314

Query: 139 AG---GGSKKEGRADGPAQNASFSNDFELTF 166
           AG    G+ KEG A  P +    S+ +++ F
Sbjct: 315 AGIGIQGTDKEGGA--PGEEQPISSPWDVVF 343


>gi|383111215|ref|ZP_09932032.1| hypothetical protein BSGG_2315 [Bacteroides sp. D2]
 gi|313694780|gb|EFS31615.1| hypothetical protein BSGG_2315 [Bacteroides sp. D2]
          Length = 593

 Score = 40.8 bits (94), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 2/74 (2%)

Query: 72  LSFPLSEES-VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LSF   + S  V+ +AG+     +DG+  +A F  P+  A+   G+I++A++ N+ IRK+
Sbjct: 281 LSFTYEDLSYTVETVAGNSATTSTDGKGTAASFKFPQGLALAPNGDIWIAERGNNTIRKM 340

Query: 131 -TNLGVTTIAGGGS 143
                V+T+A  G+
Sbjct: 341 DQEYNVSTVAKSGT 354



 Score = 37.7 bits (86), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 15/35 (42%), Positives = 21/35 (60%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTT 137
           F  P S   D  GN+Y+AD+ N  ++KIT+ G  T
Sbjct: 399 FKSPMSITFDANGNMYIADRDNKAVKKITSGGAVT 433



 Score = 37.7 bits (86), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 3/54 (5%)

Query: 79  ESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +K +AG  G +G  DG   +A F+ P S  V  K  +Y+AD+ NH+IR IT
Sbjct: 539 KGTLKTIAGIPGTKGKIDGSALTATFNCPASVLVSDK--VYIADEQNHLIRSIT 590


>gi|290984250|ref|XP_002674840.1| predicted protein [Naegleria gruberi]
 gi|284088433|gb|EFC42096.1| predicted protein [Naegleria gruberi]
          Length = 831

 Score = 40.8 bits (94), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 52/111 (46%), Gaps = 5/111 (4%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VT 136
           E SVV  +AG   QGYS G    ++   P    V +  ++   D  NH IRKI   G +T
Sbjct: 103 ENSVVSFIAGSSSQGYSTGYAIGSQLSSPNG-VVALGSSVVFTDTGNHCIRKINTDGTLT 161

Query: 137 TIAGGGSKKEGRAD-GPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           TIAG  +     +D   A NA  S+   +  V     L  +D  N +IR++
Sbjct: 162 TIAGVCTTSGYNSDQADATNAWLSSPSGIVEVNG--DLYFTDSMNGIIRKL 210


>gi|320105699|ref|YP_004181289.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
 gi|319924220|gb|ADV81295.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
          Length = 1347

 Score = 40.8 bits (94), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 67/138 (48%), Gaps = 14/138 (10%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARF---DKPKSFAVDMKG 115
           L++  + R AF   +  ++   +  + AG G   Y +G+ GSA F   + P + A+D  G
Sbjct: 19  LLLTTTGRGAF---AQAVTLVPIQTKFAGTGAGTY-NGDFGSADFVSLNGPAAVALDSSG 74

Query: 116 NIYVADKSNHVIRKI--TNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-- 171
           N +++D+ N+ +R+I      VTT+ G  S   G +   A NA+ S    L     I   
Sbjct: 75  NTFISDRGNNCVRRIDAATGSVTTLLGLVSSGTGDSCNTATNATPSAAQGLYQPGGIVID 134

Query: 172 ---ALLISDHGNQLIRQI 186
              +L I+D G+  +R +
Sbjct: 135 SNDSLYIADTGHNCVRML 152


>gi|167997357|ref|XP_001751385.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162697366|gb|EDQ83702.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 663

 Score = 40.8 bits (94), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 14/95 (14%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRK--ITNLGVTTIAGGGSK-- 144
           + G+ DG   SAR  +P S   D  K  +Y+AD  NH IR+  I++  V T+     +  
Sbjct: 218 IAGFEDGSFESARLRRPSSIVFDKEKKLLYIADCENHAIRRAHISSRMVDTLYPKPDECP 277

Query: 145 --------KEGRADGPAQNAS-FSNDFELTFVPHI 170
                   + G   G +Q  S  SND+E+T+  H+
Sbjct: 278 GFLQRWLHRLGILQGSSQKRSNVSNDYEMTYPWHL 312


>gi|290971428|ref|XP_002668505.1| predicted protein [Naegleria gruberi]
 gi|284081943|gb|EFC35761.1| predicted protein [Naegleria gruberi]
          Length = 355

 Score = 40.8 bits (94), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 55/125 (44%), Gaps = 26/125 (20%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +    ++  +AG+G++GY +G+  +A     + P S  V     IY +D  NHVIRKI  
Sbjct: 185 IDRNGIIHTIAGNGIEGY-NGDRRNATDTSLNNPDSLFV-HNDEIYFSDLDNHVIRKILP 242

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-----------ALLISDHGNQ 181
            G+     G   K+G          ++ DF+L     +C            +  +DH N 
Sbjct: 243 NGIIKNIAGIPGKKG----------YNGDFKLATRTKLCWPTGIFVDSKNQVYFADHYNH 292

Query: 182 LIRQI 186
            +R+I
Sbjct: 293 CVRKI 297



 Score = 37.7 bits (86), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 60/115 (52%), Gaps = 8/115 (6%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           +    +++ +AG+G+  Y+ D +  +    +  S     K  IY+AD  ++ IRKI   G
Sbjct: 130 IDRNGIIRTIAGNGIMTYNGDNQLATLASLRYPSSVFQYKDEIYIADSFSNRIRKIDRNG 189

Query: 135 VT-TIAGGGSKKEGRADGPAQNASFS--NDFELTFVPHICALLISDHGNQLIRQI 186
           +  TIAG G   EG  +G  +NA+ +  N+ +  FV H   +  SD  N +IR+I
Sbjct: 190 IIHTIAGNGI--EG-YNGDRRNATDTSLNNPDSLFV-HNDEIYFSDLDNHVIRKI 240


>gi|160882120|ref|ZP_02063123.1| hypothetical protein BACOVA_00060 [Bacteroides ovatus ATCC 8483]
 gi|156112488|gb|EDO14233.1| IPT/TIG domain protein [Bacteroides ovatus ATCC 8483]
          Length = 514

 Score = 40.8 bits (94), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 8/93 (8%)

Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEGRADGP-AQNASFSNDFELTFVP 168
           + Y  D  NH IRK+T  G V+T AG GS      K G+ +G   + A F+    L +  
Sbjct: 407 DFYFTDTQNHCIRKLTPDGVVSTFAGRGSASTSAYKWGKQNGEIRERARFNEPVALAYDE 466

Query: 169 HICALLISDHGNQLIRQINLKPEDCSKSSQSGS 201
                 + D GN  IR+I  K ++  +  + GS
Sbjct: 467 ETKTFYVGDTGNFKIRKI-AKEQEADEQGEGGS 498


>gi|283458725|ref|YP_003363360.1| thiol-disulfide isomerase [Rothia mucilaginosa DY-18]
 gi|283134775|dbj|BAI65540.1| thiol-disulfide isomerase [Rothia mucilaginosa DY-18]
          Length = 656

 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 50/110 (45%), Gaps = 5/110 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT----NLGVTT 137
           V   AG GV+G +DG    A F +P        G+++VA      +R +T     + VT+
Sbjct: 363 VSIFAGSGVEGLADGPAADAWFAQPSGIIEARDGSLWVACSETSGLRHVTFTEDGVQVTS 422

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             G G    G  DG +  A   +   LT +P   ++ ++D  N  IR+ +
Sbjct: 423 AVGKGLFDFGFVDGDSDTARMQHPLGLTELPD-GSIAVADTYNGAIRRWD 471


>gi|343780954|ref|NP_001230490.1| NHL repeat-containing protein 2 [Sus scrofa]
          Length = 725

 Score = 40.4 bits (93), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 45/81 (55%), Gaps = 10/81 (12%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAG---GGSKK 145
           G  DG    + F+ P+  A+  + NI YVAD  NH+IRKI      V+T+AG    G+ K
Sbjct: 267 GRKDGIFSESTFNSPQGVAI--RNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDK 324

Query: 146 EGRADGPAQNASFSNDFELTF 166
           EG A+G  Q    S+ +++ F
Sbjct: 325 EGGANGEEQ--PISSPWDVVF 343


>gi|392965340|ref|ZP_10330759.1| pre-peptidase C-terminal domain-containing protein [Fibrisoma limi
           BUZ 3]
 gi|387844404|emb|CCH52805.1| pre-peptidase C-terminal domain-containing protein [Fibrisoma limi
           BUZ 3]
          Length = 480

 Score = 40.4 bits (93), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 11/107 (10%)

Query: 54  PGSSDLIVLDSSRSAFYTLS---FPLSEESVVKRLAGDGVQGYSDGEPGSARFD-----K 105
           P  ++ ++  + +S F   +   + L+  S+ +     G  GY DG+  SARF      +
Sbjct: 330 PNDANTMLASNGKSMFVFANQTLWKLNGNSLQQVAGSPGQVGYQDGKGASARFQDGGYAR 389

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAG--GGSKKEGRA 149
             + A D   N+YVAD     IRKI   G VTT+AG    S + GR 
Sbjct: 390 GNAIAADGDDNVYVADYGCGCIRKIDKEGNVTTVAGNKNASNRVGRG 436


>gi|119509213|ref|ZP_01628363.1| NHL repeat protein [Nodularia spumigena CCY9414]
 gi|119466055|gb|EAW46942.1| NHL repeat protein [Nodularia spumigena CCY9414]
          Length = 502

 Score = 40.4 bits (93), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 7/118 (5%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI--TNL 133
           S +  +  L G G  G +DG    A+F  P+  A D +   +YVAD  NH +R++     
Sbjct: 201 SYDGEILHLIGTGKSGLTDGAFNEAQFFAPQGMAFDPENQMLYVADTENHTLRRVDLKRQ 260

Query: 134 GVTTIAGGGSKKEG--RADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
            V TIAG G +        G     + ++ +++  +     L I+  G+  I Q+NL+
Sbjct: 261 IVKTIAGTGEQSRNIYPHGGAGIETALNSPWDVVQLGQ--TLFIAMAGSHQIWQMNLE 316



 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 33/58 (56%), Gaps = 3/58 (5%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTI 138
           + + GDG  G  DG+  + RF +P   +V M   +Y++D +NH IR+  +    VTT+
Sbjct: 433 QTILGDGSAGLQDGQGQNTRFFEPSGLSV-MGDYLYISDTNNHAIRRGDLNKFEVTTL 489


>gi|383781882|ref|YP_005466449.1| hypothetical protein AMIS_67130 [Actinoplanes missouriensis 431]
 gi|381375115|dbj|BAL91933.1| hypothetical protein AMIS_67130 [Actinoplanes missouriensis 431]
          Length = 332

 Score = 40.4 bits (93), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 44/97 (45%), Gaps = 14/97 (14%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI-TNLGVTTIAGGGSKKEGRADGPAQNA----- 156
            + P   AV   G +YV+D  NHV+R I TN  + T+AG G    G  + PA  A     
Sbjct: 1   MNNPTGIAVGPDGTVYVSDSGNHVVRAISTNGTIRTVAGTGRDGAGAPE-PAVGAKATDV 59

Query: 157 --SFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
             +F ND     V     + I+D   + +R   L P+
Sbjct: 60  PPAFPNDLA---VGQDGTVFIAD--GRTVRVYALAPD 91


>gi|255327417|ref|ZP_05368491.1| NHL repeat containing protein [Rothia mucilaginosa ATCC 25296]
 gi|255295697|gb|EET75040.1| NHL repeat containing protein [Rothia mucilaginosa ATCC 25296]
          Length = 656

 Score = 40.4 bits (93), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 50/110 (45%), Gaps = 5/110 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT----NLGVTT 137
           V   AG GV+G +DG    A F +P        G+++VA      +R IT     + VT+
Sbjct: 363 VSIFAGSGVEGLADGPADDAWFAQPSGIIEARDGSLWVACSETSGLRHITFTEDGVQVTS 422

Query: 138 IAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             G G    G  DG +  A   +   LT +P   ++ ++D  N  IR+ +
Sbjct: 423 AVGKGLFDFGFVDGDSDTARMQHPLGLTELPD-GSIAVADTYNGAIRRWD 471


>gi|223940572|ref|ZP_03632418.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223890764|gb|EEF57279.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 791

 Score = 40.4 bits (93), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 2/41 (4%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V G   G PG   F  P S A D KGN+YVAD  NH ++K 
Sbjct: 736 VLGGVGGAPGE--FSNPWSIAFDSKGNLYVADSQNHRVQKF 774


>gi|428297525|ref|YP_007135831.1| NHL repeat containing protein [Calothrix sp. PCC 6303]
 gi|428234069|gb|AFY99858.1| NHL repeat containing protein [Calothrix sp. PCC 6303]
          Length = 511

 Score = 40.4 bits (93), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 37/65 (56%), Gaps = 5/65 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN--IYVADKSNHVIRKIT--NLGVTT 137
           V ++ G G  G+ DG+  +A F  P+    D +GN  +Y+AD  NH +R++   N  V T
Sbjct: 206 VLQIIGVGKAGFVDGDFEAATFSTPQGMTFD-EGNQILYIADTGNHALRQVDLGNQLVRT 264

Query: 138 IAGGG 142
           IAG G
Sbjct: 265 IAGTG 269


>gi|290995474|ref|XP_002680320.1| predicted protein [Naegleria gruberi]
 gi|284093940|gb|EFC47576.1| predicted protein [Naegleria gruberi]
          Length = 992

 Score = 40.4 bits (93), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 25/99 (25%), Positives = 47/99 (47%), Gaps = 5/99 (5%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG--YSDGEPGSAR 102
           ++P+ V+  P + D+ + D   +    +     +  +V  +AG G  G    +G    A+
Sbjct: 444 VDPYGVVVDPSNGDVFISDGYLNCVRKID---GKSGIVTTVAGTGEAGDVGDNGPSNKAQ 500

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
              P   ++   G++ +AD  N  IRK++N  +TTI  G
Sbjct: 501 LFSPSGLSLTSSGDLLIADNGNQAIRKVSNGIITTIVSG 539



 Score = 37.4 bits (85), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 5/75 (6%)

Query: 76  LSEESVVKRLAGD-GVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +S++ V+  +AG  G+ G S DG+  + ++ + P   AV  KGNI +AD  N  +R I N
Sbjct: 833 VSKDGVITTIAGQSGLIGLSGDGDKAANSKLNIPTGVAVTKKGNIIIADTINGRLRMINN 892

Query: 133 -LGV-TTIAGGGSKK 145
             GV TTIAG  + K
Sbjct: 893 DTGVITTIAGAENDK 907



 Score = 37.4 bits (85), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 44/84 (52%), Gaps = 4/84 (4%)

Query: 106 PKSFAVD-MKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQNASFSNDF 162
           P    VD   G+++++D   + +RKI      VTT+AG G   +   +GP+  A   +  
Sbjct: 446 PYGVVVDPSNGDVFISDGYLNCVRKIDGKSGIVTTVAGTGEAGDVGDNGPSNKAQLFSPS 505

Query: 163 ELTFVPHICALLISDHGNQLIRQI 186
            L+       LLI+D+GNQ IR++
Sbjct: 506 GLSLTSS-GDLLIADNGNQAIRKV 528


>gi|397781302|ref|YP_006545775.1| Tripartite motif-containing protein 71 [Methanoculleus bourgensis
           MS2]
 gi|396939804|emb|CCJ37059.1| Tripartite motif-containing protein 71 AltName: Full=Lin-41 homolog
           [Methanoculleus bourgensis MS2]
          Length = 786

 Score = 40.4 bits (93), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADG 151
           FD PK  AVD  GN+YVAD+ N+ I+K  + G   +A  GS   G+ DG
Sbjct: 78  FDTPKGVAVDAAGNVYVADRGNNRIQKFDSDG-NLLAKWGS--HGKGDG 123



 Score = 40.0 bits (92), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 21/37 (56%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           G   F  P   A+D  GN+YVADK NH I+K  + G 
Sbjct: 547 GDGEFRNPSGVAIDAAGNVYVADKENHRIQKFDSDGT 583


>gi|282879766|ref|ZP_06288496.1| NHL repeat protein [Prevotella timonensis CRIS 5C-B1]
 gi|281306435|gb|EFA98465.1| NHL repeat protein [Prevotella timonensis CRIS 5C-B1]
          Length = 486

 Score = 40.4 bits (93), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 1/58 (1%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           P ++ +V   +   G  G+ +G P  A+F  P+  AV+  G+I YVAD  N  IRK+T
Sbjct: 427 PDNKATVTTVIGLPGSAGFKEGGPDVAQFRYPRGVAVNADGSIVYVADTGNRCIRKLT 484


>gi|424868210|ref|ZP_18291970.1| NHL repeat domain protein [Leptospirillum sp. Group II 'C75']
 gi|124516337|gb|EAY57845.1| NHL repeat domain protein [Leptospirillum rubarum]
 gi|387221511|gb|EIJ76065.1| NHL repeat domain protein [Leptospirillum sp. Group II 'C75']
          Length = 494

 Score = 40.4 bits (93), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 63/131 (48%), Gaps = 10/131 (7%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKS--FAVDMKGNIYVADKSNHVIRKI--TNL 133
           E++ ++ + G G  G +DG    A+F  P+   F  D      VAD  NH++R++     
Sbjct: 210 EKAALREVIGQGTPGSADGSFDQAQFRDPQGIRFCPDDPDIAIVADTGNHLLRRVDFRRG 269

Query: 134 GVTTIAGGGSKKEGRADG-PAQNASFSNDFELTFVPHICALL-ISDHGNQLIRQINLKPE 191
            VTTIAG G +     +  PA +A  ++ +++ F  H   +L ++  G   I  I L PE
Sbjct: 270 SVTTIAGTGVQGWAIFEPVPAMSAVLNSPWDILF--HRDGMLYVAQAGPHQI--IRLDPE 325

Query: 192 DCSKSSQSGSA 202
                  +GSA
Sbjct: 326 RQEIFPVAGSA 336


>gi|350561579|ref|ZP_08930417.1| YD repeat protein [Thioalkalivibrio thiocyanoxidans ARh 4]
 gi|349780611|gb|EGZ34929.1| YD repeat protein [Thioalkalivibrio thiocyanoxidans ARh 4]
          Length = 2546

 Score = 40.4 bits (93), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 10/134 (7%)

Query: 60   IVLDSSRSAFYTLS------FPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAV 111
            + +D+  S F+T        F  + +  V  +AG+G   ++ DG P  +A F  P+  AV
Sbjct: 1276 VAVDAGGSVFFTRRNGGPRIFRANPDGTVHHVAGNGSSTFAGDGGPAVNASFIDPRGIAV 1335

Query: 112  DMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHI 170
               G++YV D   H +R+I   G + T+AG G    G   GPA  A     +++   P  
Sbjct: 1336 GADGSLYVGDFRTHRVRRIGPDGIIRTVAGNGGSGIGGIGGPAVQARIDRIYDVAVSPE- 1394

Query: 171  CALLISDHGNQLIR 184
              L I  H   + R
Sbjct: 1395 GDLYIDSHNYTVFR 1408


>gi|298386035|ref|ZP_06995592.1| hypothetical protein HMPREF9007_02755 [Bacteroides sp. 1_1_14]
 gi|298261263|gb|EFI04130.1| hypothetical protein HMPREF9007_02755 [Bacteroides sp. 1_1_14]
          Length = 462

 Score = 40.4 bits (93), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 84  RLAGDGVQGYSDGE-PGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
           R  GDG +G++DG+    ARF+ P S   D ++G + V D +NH IRKI
Sbjct: 410 RPNGDGTKGFNDGDLRTQARFNYPASIVYDEVRGCLLVGDSNNHRIRKI 458



 Score = 37.4 bits (85), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 38/156 (24%), Positives = 63/156 (40%), Gaps = 16/156 (10%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV-VKRLA--GDGVQGYSDGEPGSARFDKP 106
           V+  P  + + ++  +R   Y   +    ++     LA   DG     DG     R ++P
Sbjct: 306 VVIHPTGNYMYIMVRNRHTIYRADYNWETKTFGTPYLACGKDGTSACEDGVGSMVRLNQP 365

Query: 107 KSFAV----DMKG-----NIYVADKSNHVIRKITNLGVTTIAGG---GSKKEGRADGPAQ 154
           +        D +G     + Y  DK NH +R +T  G  T+  G   G   +G  DG  +
Sbjct: 366 QQGCFVKNPDYEGQADEYDFYFVDKQNHCVRVMTPAGKVTLYAGRPNGDGTKGFNDGDLR 425

Query: 155 -NASFSNDFELTFVPHICALLISDHGNQLIRQINLK 189
             A F+    + +      LL+ D  N  IR+I L+
Sbjct: 426 TQARFNYPASIVYDEVRGCLLVGDSNNHRIRKIALE 461


>gi|290976675|ref|XP_002671065.1| predicted protein [Naegleria gruberi]
 gi|284084630|gb|EFC38321.1| predicted protein [Naegleria gruberi]
          Length = 343

 Score = 40.4 bits (93), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 45/173 (26%), Positives = 74/173 (42%), Gaps = 26/173 (15%)

Query: 59  LIVLDSSRSAFYTLSFPLSEES-VVKRLAGDGVQGYSDGEPGS----ARFDKPKSFAVDM 113
           L + D    A   +S    +++ +++ + G+G  G S  E GS    ++ + P   A+  
Sbjct: 140 LYIADRDNHAIRKVSLNDDDDNGLIETIVGNGSSGES--ENGSLALNSQINCPYGIAISK 197

Query: 114 KGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHIC-- 171
              +Y ++ SNH IRKI   G      G   K G  +    + +F+N   L +  +IC  
Sbjct: 198 DDELYFSEFSNHTIRKIDKNGKLWKVAGIDGKNG--NNEYDHHAFNN--LLNYPAYICFD 253

Query: 172 ---ALLISDHGNQLIRQINLKPEDCSKSSQSGSALGAVSVWVLVSVLSCLVSL 221
               L I+D  N  IR+I+LK             +    V  L S L C V++
Sbjct: 254 NNNNLYIADRDNNSIRKIDLKT----------GLINTAVVKELNSPLGCFVTI 296



 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 42/92 (45%), Gaps = 6/92 (6%)

Query: 101 ARFDKPKSFAV----DMK--GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQ 154
           A+ ++P S  V    DM     +Y+AD  NH IRKI N  + TIAG G       +  A 
Sbjct: 5   AKLNRPFSVFVHSEQDMNQCDEVYIADTENHCIRKIINGNIITIAGTGEAGFNGDNIKAT 64

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            A+ +    +        +  SD GN  IR+I
Sbjct: 65  QATINKPVCVIVNHRNGNVYFSDLGNNRIRKI 96


>gi|291000670|ref|XP_002682902.1| predicted protein [Naegleria gruberi]
 gi|284096530|gb|EFC50158.1| predicted protein [Naegleria gruberi]
          Length = 726

 Score = 40.4 bits (93), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 58/129 (44%), Gaps = 18/129 (13%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
            +  +AG G  GY+ DG  G+ A+   P+   V+  G ++ AD  N  +RKI+  G+ + 
Sbjct: 217 TITTIAGIGSNGYNGDGILGTMAKIFSPRGITVNQLGEVFFADSVNFRVRKISTSGIIST 276

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPH------ICALLISDHGNQLIRQINLKPED 192
             G   +    DG   N++          PH      +  + I+D     IR+I   P +
Sbjct: 277 FAGTGNQLFNGDGLLGNST-------NLSPHGVSINQLGEIFIADTNYNYIRKIVPCPLN 329

Query: 193 CSKSSQSGS 201
              S+QSGS
Sbjct: 330 ---STQSGS 335


>gi|325917587|ref|ZP_08179787.1| NHL repeat protein [Xanthomonas vesicatoria ATCC 35937]
 gi|325536221|gb|EGD08017.1| NHL repeat protein [Xanthomonas vesicatoria ATCC 35937]
          Length = 472

 Score = 40.4 bits (93), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 3/67 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTIA 139
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I   N  V T+ 
Sbjct: 198 VLRQFGHGNADLIDGGTGEAAFRRPQGLALE-RDQLYVADTGNHALRRINLRNGQVDTLC 256

Query: 140 GGGSKKE 146
           G G   E
Sbjct: 257 GTGRSGE 263


>gi|58581916|ref|YP_200932.1| hypothetical protein XOO2293 [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|58426510|gb|AAW75547.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
           10331]
          Length = 472

 Score = 40.4 bits (93), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 42/81 (51%), Gaps = 6/81 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---GVTTI 138
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL    V T+
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQVDTL 255

Query: 139 AGGGSKKEGRADGPAQNASFS 159
            G G   E   +GP  +AS S
Sbjct: 256 CGTGRSGE-PVEGPLASASAS 275


>gi|86742862|ref|YP_483262.1| serine/threonine protein kinase [Frankia sp. CcI3]
 gi|86569724|gb|ABD13533.1| serine/threonine protein kinase [Frankia sp. CcI3]
          Length = 719

 Score = 40.4 bits (93), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 48/107 (44%), Gaps = 6/107 (5%)

Query: 85  LAGDGVQGYSDGEPGSARF---DKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGG 141
           LAG GV G S G+ G A+F    +P         N+Y+AD  NH++R +   G+ + A G
Sbjct: 570 LAGAGVPGLS-GDHGPAQFAQLRRPHDVTAPPGRNVYLADTDNHLLRAVDADGIISTAAG 628

Query: 142 GSKKEGRADG-PAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
                   DG PA+ A       L   P    LL++D     +R + 
Sbjct: 629 MFYGASPDDGAPARIADVGRPHSLAPTPS-GGLLVTDPDRGRVRLVT 674



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 37/86 (43%), Gaps = 2/86 (2%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRAD-GPAQNASFSN 160
           R D P S      G +YV+ +  H + +I   G T    G  K     D GPA NA   N
Sbjct: 419 RVDYPYSVVAAPDGAVYVSQRLRHRVLRIERDGRTVHVAGSGKSGPHGDGGPAVNAELDN 478

Query: 161 DFELTFVPHICALLISDHGNQLIRQI 186
              L   P   +L I+D  N  IR++
Sbjct: 479 PCGLALGPD-GSLFIADSFNNRIRRV 503



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 5/78 (6%)

Query: 85  LAGDGVQG-YSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG G  G + DG P  +A  D P   A+   G++++AD  N+ IR++   G + T+AG 
Sbjct: 456 VAGSGKSGPHGDGGPAVNAELDNPCGLALGPDGSLFIADSFNNRIRRVAPDGRIVTVAGS 515

Query: 142 GSKKEGRADGPAQNASFS 159
           G  + G   GPA   + S
Sbjct: 516 G--RHGPPAGPAARHAAS 531


>gi|380693132|ref|ZP_09857991.1| hypothetical protein BfaeM_04015 [Bacteroides faecis MAJ27]
          Length = 472

 Score = 40.4 bits (93), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 55/122 (45%), Gaps = 20/122 (16%)

Query: 88  DGVQGYSDGEPGSARFDKPK--SFAVDMK--------GNIY---VADKSNHVIRKITNLG 134
           +G  GYS     SARF +P   +F  + +        GNIY   + D+ NH I KIT  G
Sbjct: 351 EGEPGYSPSPGPSARFQEPYQGTFVKNEEYIKNPRPDGNIYDYYICDRGNHCIFKITPDG 410

Query: 135 -VTTIAGGGS-----KKEGRADGPAQ-NASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
            V+  AG GS     K  G  DG  +  A F N   + +        I+D  N  IR I+
Sbjct: 411 NVSLFAGRGSVSSDNKVHGNIDGELKTEARFDNPCGIAYDEETQTFYIADKENHSIRTIS 470

Query: 188 LK 189
           ++
Sbjct: 471 IE 472


>gi|308159172|gb|EFO61716.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia P15]
          Length = 967

 Score = 40.0 bits (92), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 6/93 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDK 105
           PHS++  P +  + + D+      TLS    +   VK LAG  GV GY DG   +  F++
Sbjct: 143 PHSIVSDPVNGIIYLTDTKNHCIRTLSLMTRK---VKTLAGTPGVFGYKDGL--NPLFNE 197

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           P    +   GN+ V D  N  +R I+     TI
Sbjct: 198 PLGLVLTEDGNLIVCDSKNGALRYISRENGETI 230


>gi|188576512|ref|YP_001913441.1| hypothetical protein PXO_00583 [Xanthomonas oryzae pv. oryzae
           PXO99A]
 gi|188520964|gb|ACD58909.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
           PXO99A]
          Length = 472

 Score = 40.0 bits (92), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 42/81 (51%), Gaps = 6/81 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---GVTTI 138
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL    V T+
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQVDTL 255

Query: 139 AGGGSKKEGRADGPAQNASFS 159
            G G   E   +GP  +AS S
Sbjct: 256 CGTGRSGE-PVEGPLASASAS 275


>gi|84623828|ref|YP_451200.1| hypothetical protein XOO_2171 [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|84367768|dbj|BAE68926.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 499

 Score = 40.0 bits (92), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 42/81 (51%), Gaps = 6/81 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---GVTTI 138
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL    V T+
Sbjct: 225 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQVDTL 282

Query: 139 AGGGSKKEGRADGPAQNASFS 159
            G G   E   +GP  +AS S
Sbjct: 283 CGTGRSGE-PVEGPLASASAS 302


>gi|389844275|ref|YP_006346355.1| NHL repeat protein [Mesotoga prima MesG1.Ag.4.2]
 gi|387859021|gb|AFK07112.1| NHL repeat protein [Mesotoga prima MesG1.Ag.4.2]
          Length = 496

 Score = 40.0 bits (92), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 59/141 (41%), Gaps = 17/141 (12%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR--KITNLGV 135
           E + +   AG GV+   DG    A F +P    +D + ++YVAD     IR   +    V
Sbjct: 315 ENNTISAFAGSGVENIGDGHLKDANFAQPSGLFLD-RNSLYVADSEVSAIRFVDLEAGTV 373

Query: 136 TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI--------- 186
            T+AG G    G  DG  + + F +   +        L ++D  N  IR+I         
Sbjct: 374 QTVAGSGLFSFGYVDGILKRSLFQHPIGIHGEGRF--LYVADTYNHAIRKIDLGIRRVET 431

Query: 187 ---NLKPEDCSKSSQSGSALG 204
              NL+   C+ +    S+LG
Sbjct: 432 VIKNLREGTCTLNGDKCSSLG 452



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 67/141 (47%), Gaps = 8/141 (5%)

Query: 63  DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK 122
           ++ R     LS P   +++ +   G G+ G  DG   +AR +KP+   V   G +YVAD 
Sbjct: 191 NNDRILLTELSTPFVAKTIDQ--IGSGLSGLEDGPFENARLNKPQGI-VYSNGRLYVADT 247

Query: 123 SNHVIR--KITNLGVTTIAGGGSK-KEGRADGPAQNASFSNDFELTFVPHICALLISDHG 179
            NH +R   +    ++T++G G +  +   +G A  A  ++ ++L        L I+  G
Sbjct: 248 ENHALRIADMNQRCLSTLSGDGFQDNDWNYNGDASKARLNSPWDLQTDGRF--LYIAMAG 305

Query: 180 NQLIRQINLKPEDCSKSSQSG 200
              I +++L+    S  + SG
Sbjct: 306 MHQIWRLDLENNTISAFAGSG 326


>gi|111220793|ref|YP_711587.1| hypothetical protein FRAAL1337 [Frankia alni ACN14a]
 gi|111148325|emb|CAJ59997.1| Hypothetical protein FRAAL1337 [Frankia alni ACN14a]
          Length = 722

 Score = 40.0 bits (92), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 53/115 (46%), Gaps = 10/115 (8%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           L  +   + +AG G  G+S G+ G    A    P++ AVD +G + VAD+ +  +R++  
Sbjct: 562 LDPDGTARVVAGSGTPGHS-GDGGLAIHASLRGPQAVAVDAQGRLLVADQEHRRVRRVDA 620

Query: 133 LG-VTTIAG---GGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLI 183
            G + TIAG   GG  +   A GPA+         L   P     +     NQ++
Sbjct: 621 AGRIETIAGTAYGG--RPATAGGPARATDIGAPTSLAVGPDGIVYVADSANNQVL 673


>gi|423290544|ref|ZP_17269393.1| hypothetical protein HMPREF1069_04436 [Bacteroides ovatus
           CL02T12C04]
 gi|392665931|gb|EIY59454.1| hypothetical protein HMPREF1069_04436 [Bacteroides ovatus
           CL02T12C04]
          Length = 499

 Score = 40.0 bits (92), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 7/78 (8%)

Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEGRADGPA-QNASFSNDFELTFVP 168
           + Y ADK NH IR +T  G V+T AG GS      K G+ +G   + A F+    L +  
Sbjct: 399 DFYFADKENHCIRVLTPDGVVSTFAGRGSASASSYKWGKQNGEVRERARFNQPVALAYDE 458

Query: 169 HICALLISDHGNQLIRQI 186
                 + D GN  IR+I
Sbjct: 459 ATKTFYVGDSGNYKIRKI 476


>gi|299147167|ref|ZP_07040234.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
 gi|298515052|gb|EFI38934.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
          Length = 499

 Score = 40.0 bits (92), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 7/78 (8%)

Query: 116 NIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEGRADGPA-QNASFSNDFELTFVP 168
           + Y ADK NH IR +T  G V+T AG GS      K G+ +G   + A F+    L +  
Sbjct: 399 DFYFADKENHCIRVLTPDGVVSTFAGRGSASASSYKWGKQNGEVRERARFNQPVALAYDE 458

Query: 169 HICALLISDHGNQLIRQI 186
                 + D GN  IR+I
Sbjct: 459 ATKTFYVGDSGNYKIRKI 476


>gi|147918859|ref|YP_687415.1| hypothetical protein RRC374 [Methanocella arvoryzae MRE50]
 gi|110622811|emb|CAJ38089.1| hypothetical protein RRC374 [Methanocella arvoryzae MRE50]
          Length = 514

 Score = 40.0 bits (92), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%), Gaps = 2/32 (6%)

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +PG   FD+P+  AVD +GNIYV+D  NH I+
Sbjct: 363 QPG--EFDRPRGIAVDGRGNIYVSDSYNHRIQ 392



 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 33/64 (51%), Gaps = 7/64 (10%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH---VIRKITNLGVTTIAGGGSKKEGR 148
           G  D +PG  +F  P+  AVD  GNIYVAD  N+   V  K TN    TI G    + G+
Sbjct: 170 GSPDDDPG--QFSSPRGIAVDGAGNIYVADTYNYRIQVWNKTTN--TWTIIGSDGTEPGQ 225

Query: 149 ADGP 152
            D P
Sbjct: 226 FDRP 229


>gi|386841434|ref|YP_006246492.1| serine/threonine protein kinase [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|374101734|gb|AEY90618.1| serine/threonine protein kinase [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|451794728|gb|AGF64777.1| serine/threonine protein kinase [Streptomyces hygroscopicus subsp.
           jinggangensis TL01]
          Length = 476

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 39/63 (61%), Gaps = 3/63 (4%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGG 141
           +AG+G++G++ DG     AR   P +  V   G ++VAD+ NH IR+I   G + T+AG 
Sbjct: 28  IAGNGLRGFAGDGADAVRARLGFPSALVVAPDGALFVADEMNHRIRRIDPSGAIGTVAGD 87

Query: 142 GSK 144
           GS+
Sbjct: 88  GSR 90



 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 2/74 (2%)

Query: 115 GNIYVADKSNHVIRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICAL 173
           G ++VAD +NH IR+I   G +TTIAG G +        A  A       L   P   AL
Sbjct: 4   GAVFVADHANHRIRRIDGTGTLTTIAGNGLRGFAGDGADAVRARLGFPSALVVAPD-GAL 62

Query: 174 LISDHGNQLIRQIN 187
            ++D  N  IR+I+
Sbjct: 63  FVADEMNHRIRRID 76


>gi|160885320|ref|ZP_02066323.1| hypothetical protein BACOVA_03319 [Bacteroides ovatus ATCC 8483]
 gi|156108942|gb|EDO10687.1| NHL repeat protein [Bacteroides ovatus ATCC 8483]
          Length = 198

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 47/100 (47%), Gaps = 10/100 (10%)

Query: 97  EPGSARF---DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEG 147
           +PG   F   ++ K+   D   + Y ADK NH IR +T  G V+T AG GS      K G
Sbjct: 76  KPGQGVFVFNEEYKAANKDDWYDFYFADKENHCIRILTPDGVVSTFAGRGSASASSYKWG 135

Query: 148 RADGPA-QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + +G   + A F+    L +        + D GN  IR+I
Sbjct: 136 KQNGEVRERARFNQPVALAYDEVTKTFYVGDSGNYKIRKI 175


>gi|303274116|ref|XP_003056382.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462466|gb|EEH59758.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 490

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 2/82 (2%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELT 165
           P   A+D  G I+VAD  N+ IR+I + GV T+   GS   G  DG    A F+N  +  
Sbjct: 4   PAGIALDGNGAIFVADSGNNRIRQIKSGGVVTVL-AGSGTAGFLDGSLLEAEFNNP-QSV 61

Query: 166 FVPHICALLISDHGNQLIRQIN 187
            V     + ++D  N  IR IN
Sbjct: 62  VVTSSGVVYVADTNNHRIRLIN 83



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 34/59 (57%), Gaps = 3/59 (5%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTIAG 140
           L G G  G+SD    SARF+ P    ++    + Y+AD+ N+ IR +  T L VT+IAG
Sbjct: 148 LNGAGHAGFSDSSGLSARFNSPSHMTLNTDETVLYIADRRNNRIRAVTLTTLAVTSIAG 206



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 51/108 (47%), Gaps = 4/108 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTI 138
           VV  LAG G  G+ DG    A F+ P+S  V   G +YVAD +NH IR I   N  VTT 
Sbjct: 33  VVTVLAGSGTAGFLDGSLLEAEFNNPQSVVVTSSGVVYVADTNNHRIRLINEENDVVTTY 92

Query: 139 AGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           AG G      A    Q A  S    L F      L +SD G+  IR I
Sbjct: 93  AGAGISDYRDAQVAIQ-AYISKPLGLAFRTLTKDLFVSD-GDSRIRII 138


>gi|262199596|ref|YP_003270805.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
 gi|262082943|gb|ACY18912.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
          Length = 344

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 37/76 (48%), Gaps = 6/76 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKIT-----NLGV 135
           V+ + G G  G+ DG+   A    P+  AV   G  +YVAD     IR++      +  V
Sbjct: 64  VRLVVGSGDPGWRDGDGADAALHAPRGIAVCASGRRLYVADSGTCTIRRVDLDAVGDATV 123

Query: 136 TTIAGGGSKKEGRADG 151
           TT+AG G + E  A G
Sbjct: 124 TTVAGTGERGEALAPG 139


>gi|428162946|gb|EKX32046.1| hypothetical protein GUITHDRAFT_61646, partial [Guillardia theta
           CCMP2712]
          Length = 254

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 65/139 (46%), Gaps = 15/139 (10%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR--KITN 132
           L+  +V   +  DG  G  +G   S RF  P         N + V D SN++I+  +I++
Sbjct: 25  LTTLAVTNFIGLDGTSGNVNGFGASVRFRNPAILLFSPSYNYLIVTDASNNLIKTIRISD 84

Query: 133 LGVTTIAGGGSKKE--------GRADGPAQNASFSNDF-ELTFVPHICALLISDHGNQLI 183
           L V+ + G G+                P ++ +FS DF +L F    C  L++D GN  I
Sbjct: 85  LYVSVLIGTGTATNVDNFNALSTTLTNP-KSLAFSEDFTKLYFSTVYC--LVADSGNHAI 141

Query: 184 RQINLKPEDCSKSSQSGSA 202
           R++++     S  + SG+A
Sbjct: 142 RKVDIVSGYMSTLAGSGTA 160



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 65/154 (42%), Gaps = 16/154 (10%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG-SARFD 104
           NP  ++  P  + LIV D+S +   T+      +  V  L G G     D     S    
Sbjct: 54  NPAILLFSPSYNYLIVTDASNNLIKTIRI---SDLYVSVLIGTGTATNVDNFNALSTTLT 110

Query: 105 KPKSFAVDMK------GNIY--VADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGPAQ 154
            PKS A            +Y  VAD  NH IRK+  +   ++T+AG G+   G  D    
Sbjct: 111 NPKSLAFSEDFTKLYFSTVYCLVADSGNHAIRKVDIVSGYMSTLAGSGT--AGYNDAVGV 168

Query: 155 NASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            A F+   ++T         +SD+ N  IR+I+L
Sbjct: 169 LAQFNKPVDVTLDWTESYAYVSDYNNNCIRRIDL 202


>gi|321469552|gb|EFX80532.1| hypothetical protein DAPPUDRAFT_304106 [Daphnia pulex]
          Length = 689

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 5/84 (5%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNL 133
           +S +  +K   G    G+++G   +A+F  P+  A     ++YVAD  NH IR I  +  
Sbjct: 220 ISLQGEIKVTIGGPTPGWNEGAFSTAKFRSPQGIAFLPPCHLYVADTGNHSIRLIDFSTK 279

Query: 134 GVTTIAG---GGSKKEGRADGPAQ 154
            V T+AG    G+ KEG   G  Q
Sbjct: 280 QVKTVAGTGLQGTDKEGGMPGIEQ 303



 Score = 37.4 bits (85), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 6/76 (7%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE-PGSARFDKPKSFAVDMKGN- 116
           L V DS      T+  PL++  +     G+G  G +DG      +F++P    +D +G  
Sbjct: 464 LYVADSYNHKIKTVD-PLTK--LCSTYLGNGKAGLADGSSEDEIQFNEPGGLCIDEEGGG 520

Query: 117 -IYVADKSNHVIRKIT 131
            +YVAD +NH IRK+ 
Sbjct: 521 ILYVADTNNHCIRKVN 536


>gi|219851493|ref|YP_002465925.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219545752|gb|ACL16202.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 390

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 24/37 (64%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
           G   FD+P   AVD  GN+YVAD  N+ ++K T+ G+
Sbjct: 180 GVGEFDEPFGIAVDGTGNVYVADTYNNRVQKFTSAGL 216



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 24/45 (53%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGS 143
           G  +FD P   AVD  GN+YVAD  N  I+K T+ G      G S
Sbjct: 39  GIGQFDHPGGVAVDSAGNVYVADTENDRIQKFTSSGTFVTVWGSS 83


>gi|340725457|ref|XP_003401086.1| PREDICTED: NHL repeat-containing protein 2-like [Bombus terrestris]
          Length = 692

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 37/144 (25%), Positives = 63/144 (43%), Gaps = 25/144 (17%)

Query: 42  QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA 101
           Q E   + V+   G + ++V+D++ +              V+ + G     + DG    A
Sbjct: 229 QTEQGENLVVADTGHNRILVMDTAGN--------------VQHIIGGPNPDFRDGNFEIA 274

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG--VTTIAGGGSKKEGRADGP-AQNASF 158
           +F+ P+  +  +   IYVAD  NH IRKI  +   V TIAG G+K      G    +   
Sbjct: 275 KFNGPQGMST-LDSVIYVADTKNHAIRKIDLIKKTVITIAGTGTKGNDYNGGKIGTDQVL 333

Query: 159 SNDFELTFVPHICALLISDHGNQL 182
           ++ ++L    H       +HGN +
Sbjct: 334 ASPWDLAIYKH-------EHGNNI 350


>gi|288923320|ref|ZP_06417454.1| serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288345336|gb|EFC79731.1| serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 748

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 56/119 (47%), Gaps = 8/119 (6%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSARF----DKPKSFAVDMKGNIYVADKSNHVIRKI 130
           L  +  V+ LAG G   ++ DG P +A        P S A+D  GN++  + S + +R++
Sbjct: 623 LEPDEPVRTLAGGGPAEFTGDGGPATAAGLTLGSGPSSLALDPVGNLFFPEPSVYRVRRV 682

Query: 131 TNLGV-TTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
              GV TT+AG G        GPA  A  ++   +  V     L I + G   IR++ L
Sbjct: 683 DTRGVITTVAGTGEYGSDGDGGPATAARLASPARVA-VDTAGVLYIGEVGTA-IRRVGL 739



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 43/88 (48%), Gaps = 9/88 (10%)

Query: 66  RSAFYTLS------FPLSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNI 117
           R  FY +         L  +  +  +AG GV G + DG P +A R       A    G++
Sbjct: 438 RGGFYVVDRAGLRVLRLDLDGTLSAVAGTGVAGSTGDGGPATAARLRGVSDVAAASDGSV 497

Query: 118 YVADKSNHVIRKITNLGV-TTIAGGGSK 144
           ++AD  N  IR+I   GV TT+AGGG +
Sbjct: 498 FLADSGNGRIRRIGPDGVITTVAGGGER 525


>gi|423294181|ref|ZP_17272308.1| hypothetical protein HMPREF1070_00973 [Bacteroides ovatus
           CL03T12C18]
 gi|392676438|gb|EIY69874.1| hypothetical protein HMPREF1070_00973 [Bacteroides ovatus
           CL03T12C18]
          Length = 499

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 47/100 (47%), Gaps = 10/100 (10%)

Query: 97  EPGSARF---DKPKSFAVDMKGNIYVADKSNHVIRKITNLG-VTTIAGGGSK-----KEG 147
           +PG   F   ++ K+   D   + Y ADK NH IR +T  G V+T AG GS      K G
Sbjct: 377 KPGQGVFVFNEEYKAANKDDWYDFYFADKENHCIRILTPDGVVSTFAGRGSASASSYKWG 436

Query: 148 RADGPA-QNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           + +G   + A F+    L +        + D GN  IR+I
Sbjct: 437 KQNGEVRERARFNQPVALAYDEVTKTFYVGDSGNYKIRKI 476


>gi|301755504|ref|XP_002913612.1| PREDICTED: NHL repeat-containing protein 2-like [Ailuropoda
           melanoleuca]
 gi|281347739|gb|EFB23323.1| hypothetical protein PANDA_001408 [Ailuropoda melanoleuca]
          Length = 726

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 44/81 (54%), Gaps = 10/81 (12%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAG---GGSKK 145
           G  DG    + F+ P+  A+  + NI YVAD  NH++RKI      V+T+AG    G+ K
Sbjct: 267 GRKDGVFSESTFNSPQGVAI--RNNIIYVADTENHLVRKIDLEAEMVSTVAGIGIQGTDK 324

Query: 146 EGRADGPAQNASFSNDFELTF 166
           EG A G  Q    S+ +++ F
Sbjct: 325 EGGAQGEEQ--PISSPWDVVF 343


>gi|392963930|ref|ZP_10329351.1| hypothetical protein BN8_00315 [Fibrisoma limi BUZ 3]
 gi|387846825|emb|CCH51395.1| hypothetical protein BN8_00315 [Fibrisoma limi BUZ 3]
          Length = 351

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 3/58 (5%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAV--DMKGNIYVADKSNHVIRKITNLG-VTTIA 139
           + G   +GY DG+  +A F+ P+  AV  D + NIY+ D  N  IR+++  G VTTIA
Sbjct: 290 MGGRSKEGYKDGKALAALFNFPQRCAVSIDSRQNIYILDGGNDCIRRLSADGMVTTIA 347



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 44/113 (38%), Gaps = 26/113 (23%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAG-----------G 141
           Y  G    A    P+   +D  G +  AD   + I K+ N  V T+AG           G
Sbjct: 232 YVQGTISKAELMAPEDILIDRTGAVIFADGRLNRIIKVANQTVATLAGNSEIQPNNVNMG 291

Query: 142 GSKKEGRADGPAQNASFSNDFELTFVPHICA--------LLISDHGNQLIRQI 186
           G  KEG  DG A  A F+        P  CA        + I D GN  IR++
Sbjct: 292 GRSKEGYKDGKALAALFN-------FPQRCAVSIDSRQNIYILDGGNDCIRRL 337


>gi|309813024|ref|ZP_07706752.1| putative septum site-determining protein MinC [Dermacoccus sp.
           Ellin185]
 gi|308433096|gb|EFP57000.1| putative septum site-determining protein MinC [Dermacoccus sp.
           Ellin185]
          Length = 651

 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 51/115 (44%), Gaps = 6/115 (5%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKIT----N 132
           E + V  LAG   +G  DG    A F +P   A    G +++VAD     +R ++     
Sbjct: 354 ELNTVAVLAGTSNEGIRDGAAHDAWFAQPSGLATSADGSHVWVADSETSALRSLSVSDEG 413

Query: 133 LGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
             VTT  G G    G  DG A  A   +   +T +P   ++ ++D  N  IR+ +
Sbjct: 414 FEVTTHVGQGLFDFGHVDGAADAALLQHPLGVTELPD-GSVAVADTYNGAIRRFD 467


>gi|110639199|ref|YP_679408.1| hypothetical protein CHU_2818 [Cytophaga hutchinsonii ATCC 33406]
 gi|110281880|gb|ABG60066.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
          Length = 392

 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 51/116 (43%), Gaps = 21/116 (18%)

Query: 96  GEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPA 153
           G  GSA  +FD   +  VD  GN+YVAD  N  I+K +N G T I   G++  G      
Sbjct: 31  GGTGSAAGKFDDIGALDVDADGNVYVADWRNFRIQKFSNTG-TYITSWGTEGTG------ 83

Query: 154 QNASFSNDFELTFVPHICA-----LLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
                  D +   V ++C      +L+SD G  ++++        SK   SG   G
Sbjct: 84  -------DNQFKRVNNLCVDNDGNILVSDAGASVVKKYTNTGVFISKFGSSGKQEG 132



 Score = 38.1 bits (87), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV--TTIAGGGSKK 145
            G+  F+      +D  GN+YVAD +N+ I+K TN GV   TI G G+ +
Sbjct: 176 TGAGSFNGNFDLTIDASGNVYVADVNNNRIQKFTNTGVYIKTIGGLGTTE 225


>gi|398784412|ref|ZP_10547676.1| redoxin domain-containing protein [Streptomyces auratus AGR0001]
 gi|396995335|gb|EJJ06353.1| redoxin domain-containing protein [Streptomyces auratus AGR0001]
          Length = 602

 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 58/112 (51%), Gaps = 12/112 (10%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
            +V D++R     L+     E V++R+ G G +G +   P S  F++P+  A+   G + 
Sbjct: 188 FLVSDTTRHQLVELA--ADGEQVLRRI-GSGERGLT---PDS--FNEPQGLALLPDGTVA 239

Query: 119 VADKSNHVIRKI--TNLGVTTIAGGGSK--KEGRADGPAQNASFSNDFELTF 166
           VAD  NH IR     +  + T+AG G++  +     GPA+  + S+ ++L +
Sbjct: 240 VADTVNHAIRVFDPESGALETVAGTGTQWWQGSPTSGPAREVALSSPWDLAW 291


>gi|42524282|ref|NP_969662.1| hemagglutinin/hemolysin-related protein [Bdellovibrio bacteriovorus
            HD100]
 gi|39576491|emb|CAE80655.1| putative hemagglutinin/hemolysin-related protein [Bdellovibrio
            bacteriovorus HD100]
          Length = 1637

 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/115 (25%), Positives = 55/115 (47%), Gaps = 5/115 (4%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGV 135
            L+ ++V        + G +DG  G++RF +P      + G IY  D+S   IR  TN GV
Sbjct: 1018 LANQTVTTIAGNSSLGGITDGPLGTSRFSQPWGV-YKVPGGIYSIDRSAATIRFTTNAGV 1076

Query: 136  TTIAGGGSKKEGRADGPAQN--ASFSNDFELTFVPHICALLISDHGNQLIRQINL 188
            T    G     G  +  +Q   + F+   +L  + +  ++ + D+ N  +R++++
Sbjct: 1077 TRTVAGTPNVHGFIEAASQGDISRFNGPTDLCAIGN--SIYLVDNDNARVRRLDV 1129


>gi|159113256|ref|XP_001706855.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia ATCC
           50803]
 gi|157434955|gb|EDO79181.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia ATCC
           50803]
          Length = 969

 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 44/93 (47%), Gaps = 6/93 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDK 105
           PHS++  P +  + + D+      TLS    +   VK LAG  GV GY DG   +  F++
Sbjct: 143 PHSIVSDPVNGIIYLTDAKNHCIRTLSLMTRK---VKTLAGTPGVFGYKDGL--NPLFNE 197

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLGVTTI 138
           P    +   GN+ V D  N  +R ++     TI
Sbjct: 198 PLGLVLTEDGNVIVCDSKNGALRYVSRENGETI 230


>gi|195030130|ref|XP_001987921.1| GH10881 [Drosophila grimshawi]
 gi|193903921|gb|EDW02788.1| GH10881 [Drosophila grimshawi]
          Length = 728

 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 36/122 (29%), Positives = 55/122 (45%), Gaps = 19/122 (15%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NLGVTTI 138
           +V+   G    G  DG+  +ARF+ P+  A   +  + VAD  NH +R+I+  N  V T+
Sbjct: 255 IVEHKIGGLDNGLVDGDLLTARFNNPQGVAFLDEHTLIVADTDNHALRQISLKNGIVETL 314

Query: 139 AGGGSKKEGRADG----------PAQNASF-SNDFELTF------VPHICALLISDHGNQ 181
           AG G +   R  G          P   A F + D +++F      VP    +LI+  G  
Sbjct: 315 AGTGYQGNERVGGKLGQLQPLSSPWDVAVFRTRDMDMSFHVDELNVPEKTIVLIAMAGTH 374

Query: 182 LI 183
            I
Sbjct: 375 QI 376


>gi|355707227|gb|AES02894.1| NHL repeat containing 2 [Mustela putorius furo]
          Length = 498

 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 38/69 (55%), Gaps = 8/69 (11%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLG--VTTIAG---GGSKK 145
           G  DG    + F+ P+  A+  + NI YVAD  NH+IRKI      V+T+AG    G+ K
Sbjct: 192 GRKDGIFSESTFNSPQGVAI--RNNIIYVADTENHLIRKIDLEAEMVSTVAGIGIQGTDK 249

Query: 146 EGRADGPAQ 154
           EG A G  Q
Sbjct: 250 EGGAQGEEQ 258


>gi|434391812|ref|YP_007126759.1| NHL repeat containing protein [Gloeocapsa sp. PCC 7428]
 gi|428263653|gb|AFZ29599.1| NHL repeat containing protein [Gloeocapsa sp. PCC 7428]
          Length = 509

 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 33/66 (50%), Gaps = 3/66 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI--TNLGVTTI 138
           ++ + G G  G  DG    A+F  P+    D +  + YVAD  NH IR+I      V TI
Sbjct: 205 LQYVIGTGKPGLRDGSFSEAQFFSPQGMCFDTEKQLLYVADTENHAIRRIDFQRQVVETI 264

Query: 139 AGGGSK 144
           AG G +
Sbjct: 265 AGTGDQ 270


>gi|380300595|ref|ZP_09850288.1| NHL repeat protein [Brachybacterium squillarum M-6-3]
          Length = 644

 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 58/126 (46%), Gaps = 5/126 (3%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
           LAG   +G  DG   ++ + +P        G + VAD  +  IR +    + V+T+ G G
Sbjct: 352 LAGTTQEGLVDGPAVTSWWAQPSGIDELPDGRLVVADSESSAIRVLDPQTMEVSTLVGQG 411

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSA 202
               G  DGPA+ A   +   +T +P    + ++D  N  IR ++   E+     ++G A
Sbjct: 412 LFDFGHVDGPAEVARLQHPLGVTVLPD-GRIAVADTYNGAIRIVDEHAEE--NPIRTGDA 468

Query: 203 LGAVSV 208
              V+V
Sbjct: 469 ATVVTV 474



 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 23/110 (20%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPK-------SFAVDMKGNIYVADKSNHVIRKI-- 130
           + V    GDG +G  DGE    RF +P          A ++  ++ VAD +NHV+R +  
Sbjct: 213 TTVDEEIGDGRRGLLDGESAGVRFAEPNGVLALPSDVAAEVGYDLLVADTANHVLRGVKV 272

Query: 131 ---------TNLGVTTIAGGGSK-KEG----RADGPAQNASFSNDFELTF 166
                    T   V T+AG G++  +G    R +G  ++ + S  ++L +
Sbjct: 273 GQDRFLRSRTATEVWTVAGSGAQWMQGDALPRGEGDPRSYALSTPWDLGW 322


>gi|78047814|ref|YP_363989.1| hypothetical protein XCV2258 [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
 gi|78036244|emb|CAJ23935.1| conserved hypothetical protein [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
          Length = 499

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 36/68 (52%), Gaps = 5/68 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL---GVTTI 138
           V R  G G   + DG  G A F +P+  A++ +  +YVAD  NH +R+I NL    V T+
Sbjct: 225 VLRQFGHGNADFIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQVDTL 282

Query: 139 AGGGSKKE 146
            G G   E
Sbjct: 283 CGTGRSGE 290


>gi|333381581|ref|ZP_08473261.1| hypothetical protein HMPREF9455_01427 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332829828|gb|EGK02470.1| hypothetical protein HMPREF9455_01427 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 455

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 52/114 (45%), Gaps = 13/114 (11%)

Query: 85  LAG-DGVQGYSDGEPGSARFDKPKSFAVDMKG----------NIYVADKSNHVIRKITNL 133
           +AG D   GY+DG    AR ++P+   V +K           + Y  DK+NHV+R +T  
Sbjct: 339 IAGKDQTAGYADGVGSYARVNEPQQ-GVFVKNPEYEGQEDEYDFYFCDKANHVVRIMTPQ 397

Query: 134 GVTTIAGGGSKKEGRADGPAQ-NASFSNDFELTFVPHICALLISDHGNQLIRQI 186
           G  +   G  +  G  DG  + +A F+    + +        I D  N+ IR++
Sbjct: 398 GRVSTFAGTPQASGYRDGDLRLDARFAYPAGIAYDEERQCFYIGDSNNRRIRKV 451


>gi|397781344|ref|YP_006545817.1| Internalin-A [Methanoculleus bourgensis MS2]
 gi|396939846|emb|CCJ37101.1| Internalin-A [Methanoculleus bourgensis MS2]
          Length = 2759

 Score = 39.7 bits (91), Expect = 1.4,   Method: Composition-based stats.
 Identities = 34/106 (32%), Positives = 51/106 (48%), Gaps = 9/106 (8%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
           G  +F  P S AVD  GN+YV+D +N  I+K  + G T +   GS   G  DG  +   F
Sbjct: 138 GDGQFSYPDSVAVDAAGNVYVSDTNNGRIQKFDSDG-TFLGKWGS--WGSGDGQLR---F 191

Query: 159 SNDFELTFVPHICALLISDHGNQLIRQINLKPEDCSKSSQSGSALG 204
           S D  +    +I    ++++GN  I++ +       K   SGS  G
Sbjct: 192 SQDLVVDAAGNI---YVAEYGNHRIQKFDSNGNFLWKKGSSGSGDG 234



 Score = 37.7 bits (86), Expect = 5.8,   Method: Composition-based stats.
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           G+ +F+ P+  AV+  GN+YVAD  NH I+K  + G
Sbjct: 91  GNGQFNGPQGIAVNAAGNVYVADTYNHRIQKFDSSG 126


>gi|29789158|ref|NP_080087.1| NHL repeat-containing protein 2 [Mus musculus]
 gi|81898236|sp|Q8BZW8.1|NHLC2_MOUSE RecName: Full=NHL repeat-containing protein 2
 gi|26329039|dbj|BAC28258.1| unnamed protein product [Mus musculus]
 gi|63146329|gb|AAH95956.1| NHL repeat containing 2 [Mus musculus]
 gi|148669816|gb|EDL01763.1| NHL repeat containing 2 [Mus musculus]
 gi|187951881|gb|AAI38191.1| NHL repeat containing 2 [Mus musculus]
 gi|223459878|gb|AAI38192.1| NHL repeat containing 2 [Mus musculus]
          Length = 725

 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 6/78 (7%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAG---GGSKKE 146
           G  DG    + F+ P+  A+     IYVAD  NH+IRKI      VTT+AG    G+  E
Sbjct: 267 GRKDGMFSESSFNSPQGVAI-ADNVIYVADTENHLIRKIDLEAEKVTTVAGVGIQGTDTE 325

Query: 147 GRADGPAQNASFSNDFEL 164
           G  +G  Q  S   D  L
Sbjct: 326 GGEEGDKQPISSPWDVAL 343


>gi|188501663|gb|ACD54780.1| NHL repeat containing protein-like protein [Philodina roseola]
          Length = 324

 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 47/106 (44%), Gaps = 12/106 (11%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNH--VIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
           F+ P   A+D  GNIYVAD  NH  V+  +     T  AG G    G  +       F+ 
Sbjct: 89  FNNPSDVAIDSVGNIYVADTLNHRVVLYSVGATNGTLFAGTGLPGSGIHE-------FNQ 141

Query: 161 DFELTFVPHICALLISDHGNQLIRQINLKPED---CSKSSQSGSAL 203
              L+F  +  AL I+DH N  I +  L   +    +  S SG A+
Sbjct: 142 PSGLSFDSNSNALYIADHENHRIMRYYLNSSNGTVVAGGSGSGRAI 187


>gi|148657095|ref|YP_001277300.1| PA14 domain-containing protein [Roseiflexus sp. RS-1]
 gi|148569205|gb|ABQ91350.1| PA14 domain protein [Roseiflexus sp. RS-1]
          Length = 1293

 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 5/72 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIA 139
           ++R + DG    + G  GS   +F  P+S AV   G +YVAD +NH I++ +  G T + 
Sbjct: 352 IQRFSADGTFLGAWGAEGSGDGQFVYPRSVAVASDGTVYVADSNNHRIQRFSATG-TFLG 410

Query: 140 GGGSKKEGRADG 151
             GS  EG +DG
Sbjct: 411 KWGS--EGSSDG 420



 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 2/60 (3%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLG 134
           S+   ++R +  G    + G PG+   +F  P+S AV   G +YVAD  NH I++ + +G
Sbjct: 76  SDNHRIQRFSAAGELLGAWGSPGTGDGQFSSPRSVAVAPDGTVYVADTGNHRIQRFSAIG 135


>gi|295132101|ref|YP_003582777.1| hypothetical protein ZPR_0220 [Zunongwangia profunda SM-A87]
 gi|294980116|gb|ADF50581.1| conserved hypothetical protein [Zunongwangia profunda SM-A87]
          Length = 466

 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 54/128 (42%), Gaps = 22/128 (17%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG------------NIYVADKSNHVIR 128
           V  +L+ DG  GY DG   S R +KP    V +K             + Y  ++ NH IR
Sbjct: 338 VCGKLSNDG--GYQDGVGSSVRLNKPYQ-GVFVKNEDYVEAGKTDEYDFYFTEQGNHDIR 394

Query: 129 KITNLG-VTTIAGGGSKK-----EGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQ 181
           K+T  G VTT AG GS        G  +G   + A F     + +     A  I D  N 
Sbjct: 395 KLTPEGSVTTFAGRGSSSINPDPWGYVNGDLREEARFDQPTGIAYNEEEKAFYIGDKSNH 454

Query: 182 LIRQINLK 189
             R+I L+
Sbjct: 455 RFRKIALE 462


>gi|403363814|gb|EJY81656.1| NHL repeat-containing protein 2 [Oxytricha trifallax]
          Length = 478

 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 12/135 (8%)

Query: 76  LSEESV-VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN---IYVADKSNHVIRKI- 130
           ++EE++  + + G+G  G  DG      F  P+      + N   IY+ D  NH IR+I 
Sbjct: 121 VNEETLECQGVIGNGKIGLVDGNYEEGSFHHPQGMCHIYRDNMHFIYLCDTKNHAIREIN 180

Query: 131 -TNLGVTTIAGGGSK---KEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQI 186
            T   V T+ G G K   +EG  +   Q    S+ +++  V     LLI+  G   I  +
Sbjct: 181 LTKKEVLTVIGTGEKGFDREGNKEPELQ--KLSSPWDIVAVNRD-TLLIAMAGVHQIWAL 237

Query: 187 NLKPEDCSKSSQSGS 201
           NLK   C   S SG 
Sbjct: 238 NLKTNRCFNFSGSGK 252


>gi|302834543|ref|XP_002948834.1| hypothetical protein VOLCADRAFT_80356 [Volvox carteri f.
           nagariensis]
 gi|300266025|gb|EFJ50214.1| hypothetical protein VOLCADRAFT_80356 [Volvox carteri f.
           nagariensis]
          Length = 774

 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 10/100 (10%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           +P +V+     S + V DS       L+ P + E V   LAG G  G+ DG   +A+F +
Sbjct: 498 HPLAVLSSADGSAIYVADSYNHRIKALN-PNTNEIVT--LAGSGAAGFRDGVGTAAQFSE 554

Query: 106 PKSFAVDMKGNIYVADKSNHVIR-------KITNLGVTTI 138
           P        G I +AD +N  +R       +++ LG+T +
Sbjct: 555 PAGLCRGPNGTILIADTNNSAVRILDPKTQRVSTLGLTGV 594



 Score = 37.4 bits (85), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 38/66 (57%), Gaps = 4/66 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR--KITNLGVTTI 138
           ++++ G+G     DG   +A F++P+      + N ++VAD  NH +R   +T+  V T+
Sbjct: 295 IEQIGGNG-PALRDGSFETAAFNRPQGLVFSPRRNQLFVADTENHAVRCCDLTSNTVITL 353

Query: 139 AGGGSK 144
           AG GSK
Sbjct: 354 AGNGSK 359


>gi|414163305|ref|ZP_11419552.1| hypothetical protein HMPREF9697_01453 [Afipia felis ATCC 53690]
 gi|410881085|gb|EKS28925.1| hypothetical protein HMPREF9697_01453 [Afipia felis ATCC 53690]
          Length = 357

 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 80  SVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           S V ++  DG    S GE GS   +F+ P S A D KGN+YVAD+ N  I+
Sbjct: 184 SRVAKVDKDGNWIKSWGEKGSEPGQFNTPHSIATDAKGNVYVADRGNARIQ 234


>gi|290981371|ref|XP_002673404.1| predicted protein [Naegleria gruberi]
 gi|284086987|gb|EFC40660.1| predicted protein [Naegleria gruberi]
          Length = 1204

 Score = 39.3 bits (90), Expect = 1.6,   Method: Composition-based stats.
 Identities = 19/42 (45%), Positives = 28/42 (66%), Gaps = 2/42 (4%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRK--ITNLGVTTIAGGG 142
           F+KP+   V   GNIY+AD  N++I+K  I+   ++ IAGGG
Sbjct: 69  FNKPQGVFVTKNGNIYIADTGNNLIKKYEISTQKLSIIAGGG 110


>gi|194767578|ref|XP_001965892.1| GF16360 [Drosophila ananassae]
 gi|190619368|gb|EDV34892.1| GF16360 [Drosophila ananassae]
          Length = 300

 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 2/80 (2%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT--NL 133
           +S   +V+ + GD   G  DG+   ARF  P+      +  + VAD  NH +RKI+  + 
Sbjct: 72  VSSRGLVEHIIGDHKAGLIDGKFTEARFKHPQGLTFLDEHTLIVADTENHALRKISLADG 131

Query: 134 GVTTIAGGGSKKEGRADGPA 153
            V T+AG G +   R  G A
Sbjct: 132 IVKTLAGTGVQGCDRIGGNA 151


>gi|423223182|ref|ZP_17209651.1| hypothetical protein HMPREF1062_01837 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392639433|gb|EIY33254.1| hypothetical protein HMPREF1062_01837 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 447

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 4/80 (5%)

Query: 83  KRLAGDGV--QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIA 139
           +R AG+G   Q  ++G     +F  P    VD +GNIYV +     IRKI+   G  TIA
Sbjct: 338 ERYAGNGWPNQTNTEGYREDIQFAWPNGVTVDNEGNIYVVNAGGQSIRKISYPDGYVTIA 397

Query: 140 GGGSKK-EGRADGPAQNASF 158
            GG+   E   DG     +F
Sbjct: 398 AGGTMSGETETDGLPLECTF 417


>gi|224537023|ref|ZP_03677562.1| hypothetical protein BACCELL_01899 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521331|gb|EEF90436.1| hypothetical protein BACCELL_01899 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 447

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 4/80 (5%)

Query: 83  KRLAGDGV--QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL-GVTTIA 139
           +R AG+G   Q  ++G     +F  P    VD +GNIYV +     IRKI+   G  TIA
Sbjct: 338 ERYAGNGWPNQTNTEGYREDIQFAWPNGVTVDNEGNIYVVNAGGQSIRKISYPDGYVTIA 397

Query: 140 GGGSKK-EGRADGPAQNASF 158
            GG+   E   DG     +F
Sbjct: 398 AGGTMSGETETDGLPLECTF 417


>gi|291232828|ref|XP_002736356.1| PREDICTED: NHL (ring finger b-box coiled coil) domain containing
           family member (nhl-1)-like [Saccoglossus kowalevskii]
          Length = 871

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 49/121 (40%), Gaps = 25/121 (20%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGR 148
           G +G  DG     + + P   A D   NIYV D+ NH I+      V T++G   +K GR
Sbjct: 695 GQEGSEDG-----KLNYPWGVACDRADNIYVCDRDNHRIQ------VFTVSGTFIRKFGR 743

Query: 149 ADGPAQNASFSNDFELTFVPHICAL-----LISDHGNQLIRQINLKPEDCSKSSQSGSAL 203
               A              PH  A+      +SD GN  ++  NL  +   K  Q GS  
Sbjct: 744 LGHSAGQLE---------SPHYVAIHDDKVFVSDSGNHCVQVFNLDGQFQWKFGQEGSNY 794

Query: 204 G 204
           G
Sbjct: 795 G 795


>gi|283779938|ref|YP_003370693.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
 gi|283438391|gb|ADB16833.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
          Length = 846

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 36/82 (43%), Gaps = 6/82 (7%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASF 158
           G   F  P     D +GN++  D  NHV+R+  + GV T   G   K G       N  F
Sbjct: 86  GRGMFKVPHGLRFDKQGNLWTTDNGNHVLRRFNSEGVQTKTIGEENKPGNG-----NLQF 140

Query: 159 SNDFELTFVPHICALLISDHGN 180
            +  +L F     ++ ++D GN
Sbjct: 141 RSPDDLVFGSD-GSIYVADAGN 161


>gi|428164545|gb|EKX33567.1| hypothetical protein GUITHDRAFT_120259 [Guillardia theta CCMP2712]
          Length = 396

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 32/52 (61%), Gaps = 3/52 (5%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIR--KITNLGVTTIAG 140
           GY DG+  SARF++P+ F +   G  I +AD +N+ +R   I+   VTT+ G
Sbjct: 100 GYVDGDGLSARFNQPRDFVMSTDGTKILIADTNNYCLRFFDISTRKVTTLTG 151


>gi|374855656|dbj|BAL58511.1| NHL repeat containing protein [uncultured candidate division OP1
           bacterium]
          Length = 419

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G  +FD+P   AVD +G IYVA++ NH I+KI
Sbjct: 385 GDGQFDQPVDLAVDPEGVIYVAEEGNHRIQKI 416



 Score = 37.7 bits (86), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 39/89 (43%), Gaps = 9/89 (10%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSN 160
            +F  P   AVD  G +YVAD  N  I+K ++ G   I  G     G  DG      F  
Sbjct: 340 GQFQAPLGIAVDTYGFVYVADTGNDRIQKFSSDGAFIIKWG---SLGAGDG-----QFDQ 391

Query: 161 DFELTFVPHICALLISDHGNQLIRQINLK 189
             +L   P    + +++ GN  I++I  K
Sbjct: 392 PVDLAVDPE-GVIYVAEEGNHRIQKIKFK 419


>gi|290979904|ref|XP_002672673.1| predicted protein [Naegleria gruberi]
 gi|284086251|gb|EFC39929.1| predicted protein [Naegleria gruberi]
          Length = 524

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 55/118 (46%), Gaps = 9/118 (7%)

Query: 76  LSEESVVKRLAGDG---VQGYSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LSE  +V  +AG+G   +    D  P  SA    P S A    G + +A++  HVIRK+ 
Sbjct: 110 LSEMGMVTTIAGNGSTTLTSLIDNVPATSAYLKAPASVAFLSNGEMIIAERQGHVIRKVD 169

Query: 132 NLGVTTIAGGGSKKEGRADGPAQNASFS--ND-FELTFVPHICALLISDHGNQLIRQI 186
             G  ++  G   + G  +G + NA  S  ND   LT +     +L  D GN  IR +
Sbjct: 170 LTGQISLIAGSFLQLG-LNGDSSNAKISLLNDPLSLTVLKGD-RILFVDRGNNRIRML 225


>gi|187250772|ref|YP_001875254.1| hypothetical protein Emin_0359 [Elusimicrobium minutum Pei191]
 gi|186970932|gb|ACC97917.1| hypothetical protein Emin_0359 [Elusimicrobium minutum Pei191]
          Length = 448

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 34/119 (28%), Positives = 61/119 (51%), Gaps = 5/119 (4%)

Query: 71  TLSFPLSEES-VVKRLAGDGVQGYS-DGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHV 126
           TLS  L + + +V+ +AG G+ G++ D + G+  +    P + AVD    +Y+ DK N  
Sbjct: 105 TLSMQLIKRTGIVRPVAGKGIAGFNGDDQLGALQSSLSNPCAIAVDNLSALYILDKGNKR 164

Query: 127 IRKITNLG-VTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIR 184
           IRK+   G +TT+AG G     +    A++  FS+  ++   P     +I     +L++
Sbjct: 165 IRKVFADGMITTLAGNGRSGMFQEGLVAEDFRFSDLQDIALSPEGTIYIIDSGFKRLLK 223


>gi|427796943|gb|JAA63923.1| Putative tlpa-like family, partial [Rhipicephalus pulchellus]
          Length = 719

 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 50/113 (44%), Gaps = 5/113 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIA 139
           V ++ G    GY DG    A F +P          I VAD  NH IR++   N  V+T+A
Sbjct: 279 VLQVVGGYEPGYQDGPLNGALFSEPNGVQWRDPHFILVADTGNHAIREVDLENGTVSTLA 338

Query: 140 GGGSKKEGRADGP-AQNASFSNDFELTFVPHICALLISDHGNQLIRQINLKPE 191
           G G + E    G   Q    ++ +++  V ++  L I+  G+  I     + E
Sbjct: 339 GTGKQGEDTEGGRLGQQQELNSPWDICLVDNV--LFIAMAGSHQIWAYFFRDE 389



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 6/84 (7%)

Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTF 166
           K FA +M+  + +AD  +H I  ITN     +   G  + G  DGP   A FS    + +
Sbjct: 252 KVFATEMR--LIIADTGHHRI-VITNHSGEVLQVVGGYEPGYQDGPLNGALFSEPNGVQW 308

Query: 167 V-PHICALLISDHGNQLIRQINLK 189
             PH   +L++D GN  IR+++L+
Sbjct: 309 RDPHF--ILVADTGNHAIREVDLE 330


>gi|428304361|ref|YP_007141186.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
 gi|428245896|gb|AFZ11676.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
          Length = 491

 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 58/134 (43%), Gaps = 8/134 (5%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ---GYSDGEPGSARFDKPKSFAVDMK 114
           +L + DS  S+   ++  L    VV+ + G G     G  DG   + R       A    
Sbjct: 362 ELFIADSETSSIRAVT--LGNFPVVRTICGSGQLFGFGDVDGIGENVRLQHCLGVAY-AS 418

Query: 115 GNIYVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALL 174
           G ++V D  NH ++++           GS K G  DG  ++  FS    L++  +   L 
Sbjct: 419 GYLWVTDTYNHKLKRVNPTTGECQTMSGSGKAGLQDGIGKDVYFSEPSGLSYACNY--LY 476

Query: 175 ISDHGNQLIRQINL 188
           I+D  N  IR+INL
Sbjct: 477 IADSNNHAIRRINL 490



 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 38/144 (26%), Positives = 60/144 (41%), Gaps = 7/144 (4%)

Query: 63  DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF-----AVDMKGNI 117
           +  R     L   L EES  K+   DG  G    EP       P +F     A     ++
Sbjct: 138 EGKREFLDNLVQQLIEESSGKKTV-DGQSGKFTLEPNQTLRLSPLAFPSKVIACQQSNSL 196

Query: 118 YVADKSNHVIRKITNLGVTTIAGGGSKKEGRADGPAQNASFSNDFELTFVPHICALLISD 177
           ++AD  +H +  I +L   T A  G+   G  DG  + A F     + F      + ++D
Sbjct: 197 FIADTGHHRL-VIASLDGETKAVIGTGSAGWVDGDLEIAQFCEPMGMVFNHEQQVIYVAD 255

Query: 178 HGNQLIRQINLKPEDCSKSSQSGS 201
             N L+R+I+LK    S  + +G+
Sbjct: 256 TVNHLLRKIDLKTRQVSTLAGTGT 279


>gi|322435783|ref|YP_004217995.1| cell surface receptor IPT/TIG [Granulicella tundricola MP5ACTX9]
 gi|321163510|gb|ADW69215.1| cell surface receptor IPT/TIG domain protein [Granulicella tundricola
            MP5ACTX9]
          Length = 1852

 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 14/28 (50%), Positives = 21/28 (75%)

Query: 103  FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            F++P S AVD  GN+YVAD  N+ +++I
Sbjct: 1159 FNQPYSMAVDAAGNVYVADFKNNAVKEI 1186


>gi|386773539|ref|ZP_10095917.1| NHL repeat protein [Brachybacterium paraconglomeratum LC44]
          Length = 634

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 54/110 (49%), Gaps = 23/110 (20%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPK-------SFAVDMKGNIYVADKSNHVIRKI-- 130
           + V  + G G +G++DG+  +A+F +P          A ++  ++ VAD + H +R +  
Sbjct: 213 ATVDAIIGTGERGHADGDEDTAQFAEPNGVLALPAEVAHEVGYDLLVADTAGHRLRGVKI 272

Query: 131 ---------TNLGVTTIAGGGSK-KEG----RADGPAQNASFSNDFELTF 166
                    T   VTT+AG G +  +G    R +G A+  S S  ++LT+
Sbjct: 273 GQDRLLRSRTTTEVTTLAGTGEQWMQGEPLPRGEGDARTYSLSTPWDLTW 322



 Score = 37.0 bits (84), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 47/105 (44%), Gaps = 3/105 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TNLGVTTIAGGG 142
           LAG   +G  DG   ++ + +P        G I +AD  +  +R +    + V+T+ G G
Sbjct: 352 LAGTTQEGLVDGPAVTSWWAQPSGLDEMPDGRIVIADSESSAVRLLDPQTMQVSTLVGKG 411

Query: 143 SKKEGRADGPAQNASFSNDFELTFVPHICALLISDHGNQLIRQIN 187
               G  DGP   A   +   +T +P    + I+D  N  IR ++
Sbjct: 412 LFDFGHVDGPLDRARLQHPLGVTALPD-GRIAIADTYNGAIRLLD 455


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.134    0.386 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,191,056,213
Number of Sequences: 23463169
Number of extensions: 167193491
Number of successful extensions: 449925
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 611
Number of HSP's successfully gapped in prelim test: 580
Number of HSP's that attempted gapping in prelim test: 444585
Number of HSP's gapped (non-prelim): 4447
length of query: 269
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 129
effective length of database: 9,074,351,707
effective search space: 1170591370203
effective search space used: 1170591370203
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)