BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029556
         (191 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255560013|ref|XP_002521025.1| conserved hypothetical protein [Ricinus communis]
 gi|223539862|gb|EEF41442.1| conserved hypothetical protein [Ricinus communis]
          Length = 408

 Score =  198 bits (504), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 106/195 (54%), Positives = 126/195 (64%), Gaps = 45/195 (23%)

Query: 14  LLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS 73
           LL L  + V  GL+LEDGYTVTT+IDGH+LEINPH+V+ RP SSDLI+LDSS S  YT+S
Sbjct: 18  LLHLPSNYVLGGLILEDGYTVTTIIDGHKLEINPHAVLSRPQSSDLILLDSSHSTIYTIS 77

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN- 132
           FP+S+ESVVKRL+GDGV G SDGEPGSARF+KP+SFAVD KGNIYVAD+ N  IRKITN 
Sbjct: 78  FPISQESVVKRLSGDGVAGLSDGEPGSARFNKPRSFAVDNKGNIYVADRLNGTIRKITNS 137

Query: 133 ------------------------------------------LDHGNQLIRQINLKPEDC 150
                                                      DHGNQL+R++ LKP+DC
Sbjct: 138 GVSTIAGGYSKGFGREDGPAQNATFSSDFEVAFVAEECALLISDHGNQLVRRLPLKPDDC 197

Query: 151 SKSSQSGSGM--FWV 163
           + +S S  G   FWV
Sbjct: 198 ATASHSALGAVSFWV 212


>gi|224079468|ref|XP_002305875.1| predicted protein [Populus trichocarpa]
 gi|222848839|gb|EEE86386.1| predicted protein [Populus trichocarpa]
          Length = 241

 Score =  175 bits (443), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 95/183 (51%), Positives = 115/183 (62%), Gaps = 48/183 (26%)

Query: 27  LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV-VKRL 85
           +LEDGY VTTV+DGH+L INPH+V  R  SS+++VLDSSRS FYTL FP+S+ SV VKRL
Sbjct: 1   MLEDGYMVTTVLDGHKLNINPHAVQLR--SSEIVVLDSSRSVFYTLPFPISQASVMVKRL 58

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
           +G+G  GY DGEPG ARF+KPKSFAVD++GN+YVAD+ NH +RKI+N             
Sbjct: 59  SGEGKTGYIDGEPGLARFNKPKSFAVDLRGNVYVADQQNHAVRKISNSGVTSTIVGNYSQ 118

Query: 133 ------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSGM-- 160
                                          DHGNQL+RQI+LKPEDC   SQS  G   
Sbjct: 119 TGRQDGPGKTATFSSDFEVLFVPQICALLISDHGNQLLRQIDLKPEDCVIGSQSALGAVK 178

Query: 161 FWV 163
           FWV
Sbjct: 179 FWV 181


>gi|297850862|ref|XP_002893312.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297339154|gb|EFH69571.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 398

 Score =  168 bits (425), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 90/199 (45%), Positives = 122/199 (61%), Gaps = 45/199 (22%)

Query: 7   YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
           +SS+++ L  ++F+LVS  ++LEDGY VTTV+DGH+  +NP+++   PGSS+LIVLDSS 
Sbjct: 5   FSSVSLFLF-VVFNLVSGKIVLEDGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63

Query: 67  SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           S FYT SFPLS +SV+ R AGDG  GY DG+ G++RF KP+ FA+D KGN+YVAD+SN  
Sbjct: 64  STFYTTSFPLSFDSVIHRFAGDGTSGYVDGKAGNSRFKKPRGFAIDAKGNVYVADRSNKA 123

Query: 127 IRKITN--------------------------------------------LDHGNQLIRQ 142
           IRKI++                                             DHGN+++RQ
Sbjct: 124 IRKISSSGYVTTIAGGISKEFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMVRQ 183

Query: 143 INLKPEDCSKSSQSGSGMF 161
           INLK EDC +SS S  G +
Sbjct: 184 INLKEEDCLESSHSNLGAY 202


>gi|225424715|ref|XP_002265272.1| PREDICTED: uncharacterized protein LOC100243227 [Vitis vinifera]
          Length = 438

 Score =  167 bits (423), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 97/213 (45%), Positives = 128/213 (60%), Gaps = 53/213 (24%)

Query: 1   MASHFVYSSLAITLLSLLFSLV----SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           MA++    SL++ ++  +F++     S+ L+LEDGYTV TV DG++LEINPHS++ R GS
Sbjct: 1   MATYL--QSLSLFIVFTIFTIAAIHGSADLVLEDGYTVRTVFDGNKLEINPHSILPRYGS 58

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           SD I+LDSS+S FYT+S PLS+ES +KRL+G    G+SDG+  SA F KP+SFAVD+KGN
Sbjct: 59  SDFIILDSSKSVFYTVSSPLSQESEIKRLSGSSA-GFSDGDSASATFSKPRSFAVDLKGN 117

Query: 117 IYVADKSNHVIRKITNL------------------------------------------- 133
           +YVAD+SN VIRKITN                                            
Sbjct: 118 VYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPEKCAVLV 177

Query: 134 -DHGNQLIRQINLKPEDCSKSSQS--GSGMFWV 163
            D G+QL+RQI+LK EDC +S QS  G    WV
Sbjct: 178 SDRGSQLVRQIDLKVEDCRRSPQSVLGGAFLWV 210


>gi|296086532|emb|CBI32121.3| unnamed protein product [Vitis vinifera]
          Length = 369

 Score =  167 bits (423), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 97/213 (45%), Positives = 128/213 (60%), Gaps = 53/213 (24%)

Query: 1   MASHFVYSSLAITLLSLLFSLV----SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           MA++    SL++ ++  +F++     S+ L+LEDGYTV TV DG++LEINPHS++ R GS
Sbjct: 1   MATYL--QSLSLFIVFTIFTIAAIHGSADLVLEDGYTVRTVFDGNKLEINPHSILPRYGS 58

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           SD I+LDSS+S FYT+S PLS+ES +KRL+G    G+SDG+  SA F KP+SFAVD+KGN
Sbjct: 59  SDFIILDSSKSVFYTVSSPLSQESEIKRLSGSSA-GFSDGDSASATFSKPRSFAVDLKGN 117

Query: 117 IYVADKSNHVIRKITNL------------------------------------------- 133
           +YVAD+SN VIRKITN                                            
Sbjct: 118 VYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPEKCAVLV 177

Query: 134 -DHGNQLIRQINLKPEDCSKSSQS--GSGMFWV 163
            D G+QL+RQI+LK EDC +S QS  G    WV
Sbjct: 178 SDRGSQLVRQIDLKVEDCRRSPQSVLGGAFLWV 210


>gi|224065156|ref|XP_002301692.1| predicted protein [Populus trichocarpa]
 gi|222843418|gb|EEE80965.1| predicted protein [Populus trichocarpa]
          Length = 264

 Score =  165 bits (418), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 90/184 (48%), Positives = 114/184 (61%), Gaps = 48/184 (26%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV-VKR 84
           ++LEDGY VTTV+DGH+L +NPH+V  R  SSDL+VLDSS+S FYTL FP+S++ V VKR
Sbjct: 23  IMLEDGYMVTTVMDGHKLNVNPHAVQLR--SSDLVVLDSSKSVFYTLPFPISQDGVMVKR 80

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------ 132
           L+G G +GY DGEPG ARF+KPKSF VD++GN+YVAD+ NH +RKI++            
Sbjct: 81  LSGSGDKGYIDGEPGLARFNKPKSFTVDLRGNVYVADQLNHAVRKISSSGMTTTIAGNYS 140

Query: 133 -------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSGM- 160
                                           DHGNQL+RQ++LK EDC   SQS  G  
Sbjct: 141 QIGRQDGPGETATFSTDFEVLFVPQICALLISDHGNQLLRQVDLKQEDCIIGSQSALGAV 200

Query: 161 -FWV 163
            FWV
Sbjct: 201 KFWV 204


>gi|18395326|ref|NP_564209.1| NHL domain-containing protein [Arabidopsis thaliana]
 gi|15724248|gb|AAL06517.1|AF412064_1 At1g23890/T23E23_13 [Arabidopsis thaliana]
 gi|16974539|gb|AAL31179.1| At1g23890/T23E23_13 [Arabidopsis thaliana]
 gi|110742589|dbj|BAE99208.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192326|gb|AEE30447.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 261

 Score =  165 bits (417), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 90/199 (45%), Positives = 122/199 (61%), Gaps = 45/199 (22%)

Query: 7   YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
           +SS+++ L  ++F+LVS  ++LE+GY VTTV+DGH+  +NP+++   PGSS+LIVLDSS 
Sbjct: 5   FSSVSLFLF-VVFNLVSGKIVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63

Query: 67  SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           S FYT SFPLS +SV+ R AGDG  G+ DG+ G++RF KP+ FAVD KGN+YVADKSN  
Sbjct: 64  STFYTTSFPLSVDSVINRFAGDGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKA 123

Query: 127 IRKITN--------------------------------------------LDHGNQLIRQ 142
           IRKI++                                             DHGN++IRQ
Sbjct: 124 IRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQ 183

Query: 143 INLKPEDCSKSSQSGSGMF 161
           INLK EDC ++S S  G +
Sbjct: 184 INLKEEDCLENSHSNLGTY 202


>gi|42571623|ref|NP_973902.1| NHL domain-containing protein [Arabidopsis thaliana]
 gi|332192325|gb|AEE30446.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 400

 Score =  165 bits (417), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/199 (45%), Positives = 122/199 (61%), Gaps = 45/199 (22%)

Query: 7   YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
           +SS+++ L  ++F+LVS  ++LE+GY VTTV+DGH+  +NP+++   PGSS+LIVLDSS 
Sbjct: 5   FSSVSLFLF-VVFNLVSGKIVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63

Query: 67  SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           S FYT SFPLS +SV+ R AGDG  G+ DG+ G++RF KP+ FAVD KGN+YVADKSN  
Sbjct: 64  STFYTTSFPLSVDSVINRFAGDGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKA 123

Query: 127 IRKITN--------------------------------------------LDHGNQLIRQ 142
           IRKI++                                             DHGN++IRQ
Sbjct: 124 IRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQ 183

Query: 143 INLKPEDCSKSSQSGSGMF 161
           INLK EDC ++S S  G +
Sbjct: 184 INLKEEDCLENSHSNLGTY 202


>gi|9369410|gb|AAF87158.1|AC002423_23 T23E23.6 [Arabidopsis thaliana]
          Length = 370

 Score =  163 bits (413), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 89/194 (45%), Positives = 120/194 (61%), Gaps = 45/194 (23%)

Query: 7   YSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSR 66
           +SS+++ L  ++F+LVS  ++LE+GY VTTV+DGH+  +NP+++   PGSS+LIVLDSS 
Sbjct: 5   FSSVSLFLF-VVFNLVSGKIVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSG 63

Query: 67  SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           S FYT SFPLS +SV+ R AGDG  G+ DG+ G++RF KP+ FAVD KGN+YVADKSN  
Sbjct: 64  STFYTTSFPLSVDSVINRFAGDGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKA 123

Query: 127 IRKITN--------------------------------------------LDHGNQLIRQ 142
           IRKI++                                             DHGN++IRQ
Sbjct: 124 IRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQ 183

Query: 143 INLKPEDCSKSSQS 156
           INLK EDC ++S S
Sbjct: 184 INLKEEDCLENSHS 197


>gi|147798774|emb|CAN69795.1| hypothetical protein VITISV_022545 [Vitis vinifera]
          Length = 459

 Score =  162 bits (411), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 95/213 (44%), Positives = 126/213 (59%), Gaps = 53/213 (24%)

Query: 1   MASHFVYSSLAITLLSLLFSLV----SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           MA++    SL++ ++  +F++     S+ L+LEDGYTV TV DG++LEINPHS++ R GS
Sbjct: 1   MATYL--QSLSLFIVFTIFTIAAIHGSADLVLEDGYTVRTVFDGNKLEINPHSILPRYGS 58

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           SD I+LDSS+S FYT+  PLS+ES + RL+G    G+SDG+  SA F KP+SFAVD+KGN
Sbjct: 59  SDFIILDSSKSVFYTVXSPLSQESEINRLSGSSA-GFSDGDSASAMFSKPRSFAVDLKGN 117

Query: 117 IYVADKSNHVIRKITNL------------------------------------------- 133
           +YVAD+SN VIRKITN                                            
Sbjct: 118 VYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPEKCAVLV 177

Query: 134 -DHGNQLIRQINLKPEDCSKSSQS--GSGMFWV 163
            D G+QL+RQI+LK EDC +S QS  G    WV
Sbjct: 178 SDRGSQLVRQIDLKVEDCRRSPQSVLGGAFLWV 210


>gi|388505270|gb|AFK40701.1| unknown [Medicago truncatula]
          Length = 384

 Score =  157 bits (398), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 83/179 (46%), Positives = 107/179 (59%), Gaps = 45/179 (25%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD 88
           E+GYT+TT++DGH+L INP S++ RP SSDLIVLDS+ S FYT+  P+S+ESV KR +G+
Sbjct: 31  EEGYTITTILDGHKLHINPFSILQRPISSDLIVLDSTNSTFYTVQLPISQESVFKRFSGN 90

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------------- 132
           G  GY DG+ G ARFDKP+SFAVD +GN+YVAD+ N VIRKI+                 
Sbjct: 91  GSPGYEDGDVGLARFDKPRSFAVDFRGNVYVADRVNKVIRKISTNGVTTIAGGSSEKSSI 150

Query: 133 ---------------------------LDHGNQLIRQINLKPEDCSKSSQS--GSGMFW 162
                                       DH +QL+ QINLK EDC+  S+S  G+ M W
Sbjct: 151 KDGPVQNASFSNDFELTFIPALCALLVSDHMHQLVHQINLKGEDCTLGSKSALGAVMTW 209


>gi|356526828|ref|XP_003532018.1| PREDICTED: uncharacterized protein LOC100816542 [Glycine max]
          Length = 400

 Score =  153 bits (386), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 85/184 (46%), Positives = 105/184 (57%), Gaps = 46/184 (25%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           ++ E+GYTVTTV DGH+  I P +V+ RP SSDLI+LDS  S FYT  FP++EESV  RL
Sbjct: 24  VITEEGYTVTTVFDGHKPHIFPFTVLQRPFSSDLILLDSVNSTFYTAQFPITEESVFTRL 83

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL------------ 133
           +GDG  GYSDG+ GSARF KP+SFA DM+GN+YVADKSN  IRKI+              
Sbjct: 84  SGDGSVGYSDGDVGSARFAKPRSFAFDMRGNVYVADKSNRAIRKISAKGVTTIAGGEFSE 143

Query: 134 --------------------------------DHGNQLIRQINLKPEDCSKSSQSGSG-- 159
                                           DH ++L+RQINL  EDC+  S+ G G  
Sbjct: 144 KSRTKDGPALNASFSNDFDLTFIPGLCALLVSDHMHRLVRQINLMEEDCTLGSKPGLGAV 203

Query: 160 MFWV 163
           M W 
Sbjct: 204 MTWT 207


>gi|148909539|gb|ABR17864.1| unknown [Picea sitchensis]
          Length = 264

 Score =  120 bits (300), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 77/211 (36%), Positives = 112/211 (53%), Gaps = 55/211 (26%)

Query: 14  LLSLLFSLVS---SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFY 70
           LL +LFSL++     L+ E+GY V TV+ G+++++ PHS++  P + D+++LDS  S  +
Sbjct: 8   LLIILFSLITFTEPSLIPEEGYRVETVLYGNKMDVYPHSIL--PFNGDILLLDSVNSTLF 65

Query: 71  TLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            +  PLS++S +K  AG  + V G++DG    A+F+ PKSF +D KGNIYVAD++NH IR
Sbjct: 66  RIGLPLSQDSTIKVFAGSRNTVPGFADGGFLDAQFNHPKSFTLDSKGNIYVADRANHAIR 125

Query: 129 KITN-------------------------------------------LDHGNQLIRQINL 145
           KI+                                             D GN++IRQI L
Sbjct: 126 KISKSGVSTIAGGTAGKTGHADGPSQEAMFSNDYDLTFIPSICALLVSDRGNRMIRQIKL 185

Query: 146 KPEDCSKSSQSGSGMFWVTVFSPYLISCKSF 176
            P DC +   SGSG  WVT   P + S  SF
Sbjct: 186 PPGDCVQ--HSGSGTHWVT---PVIASGVSF 211


>gi|115476416|ref|NP_001061804.1| Os08g0414600 [Oryza sativa Japonica Group]
 gi|37806208|dbj|BAC99711.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623773|dbj|BAF23718.1| Os08g0414600 [Oryza sativa Japonica Group]
 gi|215715379|dbj|BAG95130.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215715380|dbj|BAG95131.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 275

 Score =  108 bits (271), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 69/193 (35%), Positives = 95/193 (49%), Gaps = 50/193 (25%)

Query: 25  GLLLEDGYTVTTVID----GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
           GL+L+DGYTVTT  D    G     +P++++ RP + DL++LDS+ SA YTL+ PLS  +
Sbjct: 26  GLVLDDGYTVTTAADLNHPGAPAAPHPYALLPRPRAGDLVLLDSAASALYTLALPLSGGA 85

Query: 81  VVKRLAGDGVQGYSDG--EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------ 132
             + LAG G         EP  A FD+P+S AVD   N+YVAD+ N  +RKI        
Sbjct: 86  AARTLAGGGGGPAGFADGEPRDAAFDRPRSLAVDHADNVYVADRMNGAVRKIAPSGFTTT 145

Query: 133 --------------------------------------LDHGNQLIRQINLKPEDCSKSS 154
                                                  D GN+LIRQINLK EDC++ +
Sbjct: 146 IAGGRSKGPGRKDGPAQNATFSSDFELVYVPKMCALLVTDRGNRLIRQINLKQEDCARET 205

Query: 155 QSGSGMFWVTVFS 167
           Q G G   V++ +
Sbjct: 206 QPGLGTTSVSIIA 218


>gi|222640551|gb|EEE68683.1| hypothetical protein OsJ_27311 [Oryza sativa Japonica Group]
          Length = 420

 Score =  108 bits (270), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 69/193 (35%), Positives = 95/193 (49%), Gaps = 50/193 (25%)

Query: 25  GLLLEDGYTVTTVID----GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
           GL+L+DGYTVTT  D    G     +P++++ RP + DL++LDS+ SA YTL+ PLS  +
Sbjct: 26  GLVLDDGYTVTTAADLNHPGAPAAPHPYALLPRPRAGDLVLLDSAASALYTLALPLSGGA 85

Query: 81  VVKRLAGDGVQGYSDG--EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------ 132
             + LAG G         EP  A FD+P+S AVD   N+YVAD+ N  +RKI        
Sbjct: 86  AARTLAGGGGGPAGFADGEPRDAAFDRPRSLAVDHADNVYVADRMNGAVRKIAPSGFTTT 145

Query: 133 --------------------------------------LDHGNQLIRQINLKPEDCSKSS 154
                                                  D GN+LIRQINLK EDC++ +
Sbjct: 146 IAGGRSKGPGRKDGPAQNATFSSDFELVYVPKMCALLVTDRGNRLIRQINLKQEDCARET 205

Query: 155 QSGSGMFWVTVFS 167
           Q G G   V++ +
Sbjct: 206 QPGLGTTSVSIIA 218


>gi|414870561|tpg|DAA49118.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 280

 Score =  106 bits (265), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 91/188 (48%), Gaps = 47/188 (25%)

Query: 27  LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +LEDGYTVTT  D + L     +P++V+ R  + DL++LDS+ SA YT+S   S     +
Sbjct: 28  VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
              G    G++DG PG A FD+P+S AVD   N+YVAD+ +  +RK+             
Sbjct: 88  LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
                                             D GN+++RQINLKPEDC+   QSG G
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGLG 207

Query: 160 MFWVTVFS 167
              V+  +
Sbjct: 208 NTLVSFIA 215


>gi|226495023|ref|NP_001144050.1| uncharacterized protein LOC100276874 precursor [Zea mays]
 gi|195636116|gb|ACG37526.1| hypothetical protein [Zea mays]
 gi|414870559|tpg|DAA49116.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 272

 Score =  105 bits (263), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 91/188 (48%), Gaps = 47/188 (25%)

Query: 27  LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +LEDGYTVTT  D + L     +P++V+ R  + DL++LDS+ SA YT+S   S     +
Sbjct: 28  VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
              G    G++DG PG A FD+P+S AVD   N+YVAD+ +  +RK+             
Sbjct: 88  LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
                                             D GN+++RQINLKPEDC+   QSG G
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGLG 207

Query: 160 MFWVTVFS 167
              V+  +
Sbjct: 208 NTLVSFIA 215


>gi|224028415|gb|ACN33283.1| unknown [Zea mays]
 gi|414870560|tpg|DAA49117.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 418

 Score =  105 bits (263), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 64/185 (34%), Positives = 90/185 (48%), Gaps = 47/185 (25%)

Query: 27  LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +LEDGYTVTT  D + L     +P++V+ R  + DL++LDS+ SA YT+S   S     +
Sbjct: 28  VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
              G    G++DG PG A FD+P+S AVD   N+YVAD+ +  +RK+             
Sbjct: 88  LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
                                             D GN+++RQINLKPEDC+   QSG G
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGLG 207

Query: 160 MFWVT 164
              V+
Sbjct: 208 NTLVS 212


>gi|414870562|tpg|DAA49119.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 207

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/178 (34%), Positives = 87/178 (48%), Gaps = 47/178 (26%)

Query: 27  LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +LEDGYTVTT  D + L     +P++V+ R  + DL++LDS+ SA YT+S   S     +
Sbjct: 28  VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLSSSPGEPRR 87

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
              G    G++DG PG A FD+P+S AVD   N+YVAD+ +  +RK+             
Sbjct: 88  LAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGGF 147

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                             D GN+++RQINLKPEDC+   QSG
Sbjct: 148 SSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSG 205


>gi|242079119|ref|XP_002444328.1| hypothetical protein SORBIDRAFT_07g020240 [Sorghum bicolor]
 gi|241940678|gb|EES13823.1| hypothetical protein SORBIDRAFT_07g020240 [Sorghum bicolor]
          Length = 416

 Score =  102 bits (255), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 66/189 (34%), Positives = 95/189 (50%), Gaps = 52/189 (27%)

Query: 27  LLEDGYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +LEDGYTVTT  + + L     +P++++ RP + DL++LDS+ SA YTLS   S     +
Sbjct: 29  VLEDGYTVTTFANFNPLPASGPHPYAILPRPRAGDLLLLDSAGSALYTLSLS-SSPGEPR 87

Query: 84  RLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
           RLAG   + G+ DG+   A FD+P+S AVD   N+YVAD+ +  +RK+            
Sbjct: 88  RLAGGKRRSGFDDGD---AAFDRPRSVAVDAADNVYVADQRHGAVRKVAPSGYTTTVAGG 144

Query: 133 ----------------------------------LDHGNQLIRQINLKPEDCSKSSQSGS 158
                                              D GN+++RQINLKPEDC+   QSG 
Sbjct: 145 LSSGPGHRDGLAQNATFSADFELVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGL 204

Query: 159 GMFWVTVFS 167
           G   V+V +
Sbjct: 205 GTTSVSVIA 213


>gi|296087695|emb|CBI34951.3| unnamed protein product [Vitis vinifera]
          Length = 811

 Score = 96.3 bits (238), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 69/201 (34%), Positives = 94/201 (46%), Gaps = 56/201 (27%)

Query: 6   VYSSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           + S++   L+  L+SL S         S +  EDGYTV TV DG +L I P+S ++   S
Sbjct: 34  IVSNVVSALVKWLWSLKSTTNTAISSRSKMNFEDGYTVETVFDGSKLGIEPYS-LEVSTS 92

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +L+VLDS  S  Y +S PLS  S  K +AG  DG  G+ DG+   AR + PK   +D +
Sbjct: 93  GELLVLDSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDER 152

Query: 115 GNIYVADKSNHVIRKITN------------------------------------------ 132
           GNIY+AD  N  IRKI++                                          
Sbjct: 153 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSL 212

Query: 133 --LDHGNQLIRQINLKPEDCS 151
             +D GNQ IR+I L  EDC+
Sbjct: 213 LVIDRGNQAIREIQLHYEDCA 233


>gi|225452518|ref|XP_002274991.1| PREDICTED: uncharacterized protein LOC100266244 [Vitis vinifera]
          Length = 677

 Score = 96.3 bits (238), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 69/201 (34%), Positives = 94/201 (46%), Gaps = 56/201 (27%)

Query: 6   VYSSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           + S++   L+  L+SL S         S +  EDGYTV TV DG +L I P+S ++   S
Sbjct: 34  IVSNVVSALVKWLWSLKSTTNTAISSRSKMNFEDGYTVETVFDGSKLGIEPYS-LEVSTS 92

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +L+VLDS  S  Y +S PLS  S  K +AG  DG  G+ DG+   AR + PK   +D +
Sbjct: 93  GELLVLDSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDER 152

Query: 115 GNIYVADKSNHVIRKITN------------------------------------------ 132
           GNIY+AD  N  IRKI++                                          
Sbjct: 153 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSL 212

Query: 133 --LDHGNQLIRQINLKPEDCS 151
             +D GNQ IR+I L  EDC+
Sbjct: 213 LVIDRGNQAIREIQLHYEDCA 233


>gi|147860395|emb|CAN80443.1| hypothetical protein VITISV_043282 [Vitis vinifera]
          Length = 527

 Score = 95.9 bits (237), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 69/201 (34%), Positives = 94/201 (46%), Gaps = 56/201 (27%)

Query: 6   VYSSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           + S++   L+  L+SL S         S +  EDGYTV TV DG +L I P+S ++   S
Sbjct: 17  IVSNVVSALVKWLWSLKSTTNTAISSXSKMNFEDGYTVETVFDGSKLGIEPYS-LEVSTS 75

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +L+VLDS  S  Y +S PLS  S  K +AG  DG  G+ DG+   AR + PK   +D +
Sbjct: 76  GELLVLDSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDER 135

Query: 115 GNIYVADKSNHVIRKITN------------------------------------------ 132
           GNIY+AD  N  IRKI++                                          
Sbjct: 136 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSL 195

Query: 133 --LDHGNQLIRQINLKPEDCS 151
             +D GNQ IR+I L  EDC+
Sbjct: 196 LVIDRGNQAIREIQLHYEDCA 216


>gi|115448467|ref|NP_001048013.1| Os02g0730400 [Oryza sativa Japonica Group]
 gi|46390465|dbj|BAD15926.1| putative NHL repeat-containing protein [Oryza sativa Japonica
           Group]
 gi|46390861|dbj|BAD16365.1| putative NHL repeat-containing protein [Oryza sativa Japonica
           Group]
 gi|113537544|dbj|BAF09927.1| Os02g0730400 [Oryza sativa Japonica Group]
 gi|125541002|gb|EAY87397.1| hypothetical protein OsI_08804 [Oryza sativa Indica Group]
 gi|125583566|gb|EAZ24497.1| hypothetical protein OsJ_08258 [Oryza sativa Japonica Group]
 gi|215715228|dbj|BAG94979.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 508

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 95/204 (46%), Gaps = 55/204 (26%)

Query: 8   SSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
           S+ A +++  L+SL S         S +  E GYTV TV DG +L I P++V    G  +
Sbjct: 43  SNAASSVMKRLWSLKSTTKTGSGGKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQGG-E 101

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG P  AR + PK F VD +GN
Sbjct: 102 LLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGFPGHVDGRPREARMNHPKGFTVDGRGN 161

Query: 117 IYVADKSNHVIRKITN-------------------------------------------L 133
           IYVAD  N  IRKI++                                           +
Sbjct: 162 IYVADAMNMAIRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIGSSCSLLVI 221

Query: 134 DHGNQLIRQINLKPEDCSKSSQSG 157
           D GNQ IR+I L  +DC    ++G
Sbjct: 222 DRGNQAIREIQLNFDDCVYQYEAG 245


>gi|242095458|ref|XP_002438219.1| hypothetical protein SORBIDRAFT_10g009760 [Sorghum bicolor]
 gi|241916442|gb|EER89586.1| hypothetical protein SORBIDRAFT_10g009760 [Sorghum bicolor]
          Length = 507

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/174 (36%), Positives = 81/174 (46%), Gaps = 46/174 (26%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GY V TV DG +L I PH+V   P + DL+VLDS  S  Y +  PLS  S  K LAG 
Sbjct: 76  EGGYAVDTVFDGSKLGIEPHAVEITP-AGDLLVLDSINSNIYRVQLPLSPYSRPKLLAGS 134

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
            +G+ G+ DG    AR + P  F VD KGNIYVAD  N  IRKI++              
Sbjct: 135 PEGLSGHVDGRLREARMNHPNGFTVDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSIRG 194

Query: 133 -----------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                        +D GNQ IR+I L  +DC+   ++G
Sbjct: 195 GHIDGPSDDAKFSTDFEIQYISSSCSLLVIDRGNQAIREIPLNDDDCAYQYETG 248


>gi|255552640|ref|XP_002517363.1| conserved hypothetical protein [Ricinus communis]
 gi|223543374|gb|EEF44905.1| conserved hypothetical protein [Ricinus communis]
          Length = 494

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 68/194 (35%), Positives = 88/194 (45%), Gaps = 56/194 (28%)

Query: 13  TLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLD 63
            L+  L+SL S         S +  E GYTV TV DG +L I PH+V   P S +L+VLD
Sbjct: 41  ALVKWLWSLKSNTKTAASSRSMMKFEGGYTVETVFDGSKLGIEPHTVDVSP-SGELLVLD 99

Query: 64  SSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
           S  S  Y +S PLS  S  K +AG  +G  G+ DG    AR + PK   VD +GNIY+AD
Sbjct: 100 SENSNIYKISTPLSRYSRPKLIAGSPEGYSGHVDGRLREARMNHPKGLTVDNRGNIYIAD 159

Query: 122 KSNHVIRKITN--------------------------------------------LDHGN 137
             N  IRKI++                                            +D GN
Sbjct: 160 TMNMAIRKISDGGVTTIAGGKWTRAGGHVDGPSEDAKFSNDFDIAYIGSSCSLLVIDRGN 219

Query: 138 QLIRQINLKPEDCS 151
           Q IR+I L  +DC+
Sbjct: 220 QAIREIQLNDDDCN 233


>gi|9755656|emb|CAC01808.1| putative protein [Arabidopsis thaliana]
          Length = 733

 Score = 94.0 bits (232), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 89/192 (46%), Gaps = 48/192 (25%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GY + TV DG +L I P+++   P   +LIVLDS  S  + +S PLS     K L+G 
Sbjct: 54  ESGYNMETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYGKPKLLSGS 113

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
            +G  G+ DG+   AR ++P+  A+D +GNIYVAD  N  IRKI++              
Sbjct: 114 QEGYTGHVDGKLKEARMNRPRGLAMDDRGNIYVADTINMAIRKISDDGVSTIAAGGRWSG 173

Query: 133 --------------------------LDHGNQLIRQINLKPEDCSKSSQS------GSGM 160
                                     +D GNQLI++I L   DCS+          G+ +
Sbjct: 174 GSKEESMRFSDDFDLIYVSSSCSLLVIDRGNQLIKEIQLHDHDCSQPEPDTDSLHLGTAL 233

Query: 161 FWVTVFSPYLIS 172
               VF  Y+++
Sbjct: 234 LVAAVFFGYMLA 245


>gi|293332279|ref|NP_001170567.1| uncharacterized protein LOC100384592 precursor [Zea mays]
 gi|238006082|gb|ACR34076.1| unknown [Zea mays]
          Length = 505

 Score = 94.0 bits (232), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 64/177 (36%), Positives = 83/177 (46%), Gaps = 50/177 (28%)

Query: 29  EDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
           E GY V TV DG +L I PH+  I+R G  DL++LDS  S  Y +  PLS  S  K LAG
Sbjct: 76  EGGYAVDTVFDGSKLGIEPHAAQINRAG--DLLLLDSINSNIYRVQLPLSPYSRPKLLAG 133

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
             +G+ G+ DG    AR + PK F VD +GNIYVAD  N  IRKI++             
Sbjct: 134 SPEGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSIR 193

Query: 133 --------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                           +D GNQ IR+I L  +DC+   ++G
Sbjct: 194 GGYIDEPSVSDDAKFSTDFEVQYISSTCSLLVIDRGNQAIREIPLNDDDCAYQYEAG 250


>gi|30685100|ref|NP_196993.2| NHL domain-containing protein [Arabidopsis thaliana]
 gi|332004703|gb|AED92086.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 754

 Score = 94.0 bits (232), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 89/192 (46%), Gaps = 48/192 (25%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GY + TV DG +L I P+++   P   +LIVLDS  S  + +S PLS     K L+G 
Sbjct: 75  ESGYNMETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYGKPKLLSGS 134

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
            +G  G+ DG+   AR ++P+  A+D +GNIYVAD  N  IRKI++              
Sbjct: 135 QEGYTGHVDGKLKEARMNRPRGLAMDDRGNIYVADTINMAIRKISDDGVSTIAAGGRWSG 194

Query: 133 --------------------------LDHGNQLIRQINLKPEDCSKSSQS------GSGM 160
                                     +D GNQLI++I L   DCS+          G+ +
Sbjct: 195 GSKEESMRFSDDFDLIYVSSSCSLLVIDRGNQLIKEIQLHDHDCSQPEPDTDSLHLGTAL 254

Query: 161 FWVTVFSPYLIS 172
               VF  Y+++
Sbjct: 255 LVAAVFFGYMLA 266


>gi|413953662|gb|AFW86311.1| hypothetical protein ZEAMMB73_666053 [Zea mays]
          Length = 506

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 64/177 (36%), Positives = 83/177 (46%), Gaps = 50/177 (28%)

Query: 29  EDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
           E GY V TV DG +L I PH+  I+R G  DL++LDS  S  Y +  PLS  S  K LAG
Sbjct: 76  EGGYAVDTVFDGSKLGIEPHAAQINRAG--DLLLLDSINSNIYRVQLPLSPYSRPKLLAG 133

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
             +G+ G+ DG    AR + PK F VD +GNIYVAD  N  IRKI++             
Sbjct: 134 SPEGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSIR 193

Query: 133 --------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                           +D GNQ IR+I L  +DC+   ++G
Sbjct: 194 GGYIDEPSVSDDAKFSTDFEVQYISSTCSLLVIDRGNQAIREIPLNDDDCAYQYEAG 250


>gi|326503362|dbj|BAJ99306.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 452

 Score = 93.2 bits (230), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 61/188 (32%), Positives = 90/188 (47%), Gaps = 53/188 (28%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +GL+LEDGYTV+TV D +    +P++++ RP + DL++LDS+ S  YTL  P+S ++  +
Sbjct: 54  AGLVLEDGYTVSTVSDLNPSGTHPYALLPRPRAGDLVLLDSAGSTLYTLPLPVSADAGPR 113

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
           RLAG            +     P+S AVD   N+YVAD++N  IRK+             
Sbjct: 114 RLAGGAG---------ALGDGHPRSIAVDGADNVYVADRANGSIRKVAPSGYTTTIAGAY 164

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
                                             D GN+LIRQI LKPEDC+  +Q G G
Sbjct: 165 SAGTGHRDEPAQNATFSADFELIYVPQICALLVADRGNRLIRQIKLKPEDCAHENQKGLG 224

Query: 160 MFWVTVFS 167
               ++ +
Sbjct: 225 TTSTSIIA 232


>gi|195621162|gb|ACG32411.1| hypothetical protein [Zea mays]
 gi|219884599|gb|ACL52674.1| unknown [Zea mays]
 gi|413944257|gb|AFW76906.1| hypothetical protein ZEAMMB73_849037 [Zea mays]
          Length = 508

 Score = 92.4 bits (228), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 63/174 (36%), Positives = 81/174 (46%), Gaps = 46/174 (26%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GY V TV DG +L I PH+V   P + +L+VLDS  S  Y +  PLS  S  K LAG 
Sbjct: 73  EGGYAVDTVFDGSKLGIEPHAVEITP-AGNLLVLDSINSNIYRVQLPLSPYSRPKLLAGS 131

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
             G+ G+ DG    AR + PK F VD +GNIYVAD  N  IRKI++              
Sbjct: 132 PKGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADGMNMAIRKISDTGVTTIAGGKSIRG 191

Query: 133 -----------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                        +D GNQ IR+I L  +DC+   ++G
Sbjct: 192 GHIDGPSDDAKFSTDFEIKYISSSCSLLVIDRGNQAIREIPLNDDDCAYQYEAG 245


>gi|218197942|gb|EEC80369.1| hypothetical protein OsI_22479 [Oryza sativa Indica Group]
          Length = 517

 Score = 92.4 bits (228), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 69/207 (33%), Positives = 92/207 (44%), Gaps = 58/207 (28%)

Query: 8   SSLAITLLSLLFSLVSSG------------LLLEDGYTVTTVIDGHQLEINPHSVIDRPG 55
           SS A  ++  L+SL S+             +  E GY V TV DG +L I PH V   P 
Sbjct: 45  SSTASAVVKQLWSLKSTATRTGSAAAGRSMVKYEGGYAVETVFDGSKLGIEPHDVEVTP- 103

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDM 113
           S +L+VLDS  S  Y +  PLS  S  K +AG  +G+ G+ DG    A+ + PK F VD 
Sbjct: 104 SGELLVLDSMNSNVYRVQLPLSRYSRPKLVAGSPEGLSGHVDGRLREAKMNHPKGFTVDD 163

Query: 114 KGNIYVADKSNHVIRKITN----------------------------------------- 132
           +GNIYVAD  N  IRKI++                                         
Sbjct: 164 RGNIYVADAMNMAIRKISDTGVTTIAGGKSMRGGHLDGPSDDAKFSTDFEIRYISSSCSL 223

Query: 133 --LDHGNQLIRQINLKPEDCSKSSQSG 157
             +D GNQ IR+I L  +DC    ++G
Sbjct: 224 LVIDRGNQAIREIALHNDDCEYQYEAG 250


>gi|115467514|ref|NP_001057356.1| Os06g0269300 [Oryza sativa Japonica Group]
 gi|53793193|dbj|BAD54399.1| NHL repeat-containing protein-like [Oryza sativa Japonica Group]
 gi|113595396|dbj|BAF19270.1| Os06g0269300 [Oryza sativa Japonica Group]
 gi|215768552|dbj|BAH00781.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222635365|gb|EEE65497.1| hypothetical protein OsJ_20925 [Oryza sativa Japonica Group]
          Length = 517

 Score = 92.0 bits (227), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 69/207 (33%), Positives = 92/207 (44%), Gaps = 58/207 (28%)

Query: 8   SSLAITLLSLLFSLVSSG------------LLLEDGYTVTTVIDGHQLEINPHSVIDRPG 55
           SS A  ++  L+SL S+             +  E GY V TV DG +L I PH V   P 
Sbjct: 45  SSTASAVVKQLWSLKSTATRTGSAAAGRSMVKYEGGYAVETVFDGSKLGIEPHDVEVTP- 103

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDM 113
           S +L+VLDS  S  Y +  PLS  S  K +AG  +G+ G+ DG    A+ + PK F VD 
Sbjct: 104 SGELLVLDSMNSNVYRVQLPLSRYSRPKLVAGSPEGLSGHVDGRLREAKMNHPKGFTVDD 163

Query: 114 KGNIYVADKSNHVIRKITN----------------------------------------- 132
           +GNIYVAD  N  IRKI++                                         
Sbjct: 164 RGNIYVADAMNMAIRKISDTGVTTIAGGKSMRGGHLDGPSDDAKFSTDFEIRYISSSCSL 223

Query: 133 --LDHGNQLIRQINLKPEDCSKSSQSG 157
             +D GNQ IR+I L  +DC    ++G
Sbjct: 224 LVIDRGNQAIREIALHNDDCEYQYEAG 250


>gi|413923863|gb|AFW63795.1| hypothetical protein ZEAMMB73_706161 [Zea mays]
          Length = 515

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 70/204 (34%), Positives = 95/204 (46%), Gaps = 55/204 (26%)

Query: 8   SSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
           S+ A  ++  L+SL S         S +  E GYTV TV DG +L I P+SV +   S +
Sbjct: 44  SNAASAVVKRLWSLKSTTKTASGGKSMVKYEGGYTVETVFDGSKLGIEPYSV-EVTQSGE 102

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG+   AR + PK F VD KGN
Sbjct: 103 LLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDKGN 162

Query: 117 IYVADKSNHVIRKITN-------------------------------------------L 133
           IYVAD  N  IRKI++                                           +
Sbjct: 163 IYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLVI 222

Query: 134 DHGNQLIRQINLKPEDCSKSSQSG 157
           D GNQ IR+I L  +DC    ++G
Sbjct: 223 DRGNQAIREIQLHFDDCVYQYEAG 246


>gi|226530993|ref|NP_001143690.1| uncharacterized protein LOC100276420 precursor [Zea mays]
 gi|195624742|gb|ACG34201.1| hypothetical protein [Zea mays]
          Length = 515

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 63/174 (36%), Positives = 83/174 (47%), Gaps = 46/174 (26%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GYTV TV DG +L I P+SV +   S +L+V+DS  S  Y ++ PLS  S  K +AG 
Sbjct: 75  EGGYTVETVFDGSKLGIEPYSV-EVTQSGELLVMDSMNSNIYRMALPLSRYSRPKLVAGS 133

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
            +G  G+ DG+   AR + PK F VD KGNIYVAD  N  IRKI++              
Sbjct: 134 PEGFPGHVDGKLREARMNHPKGFTVDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSGRG 193

Query: 133 -----------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                        +D GNQ IR+I L  +DC    ++G
Sbjct: 194 GHVDGPSDEAKFSTDFEVRYIGSSCSLLVIDRGNQAIREIQLHFDDCVYQYEAG 247


>gi|413923861|gb|AFW63793.1| hypothetical protein ZEAMMB73_706161 [Zea mays]
          Length = 516

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 63/174 (36%), Positives = 83/174 (47%), Gaps = 46/174 (26%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GYTV TV DG +L I P+SV +   S +L+V+DS  S  Y ++ PLS  S  K +AG 
Sbjct: 75  EGGYTVETVFDGSKLGIEPYSV-EVTQSGELLVMDSMNSNIYRMALPLSRYSRPKLVAGS 133

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
            +G  G+ DG+   AR + PK F VD KGNIYVAD  N  IRKI++              
Sbjct: 134 PEGFPGHVDGKLREARMNHPKGFTVDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSGRG 193

Query: 133 -----------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                        +D GNQ IR+I L  +DC    ++G
Sbjct: 194 GHVDGPSDEAKFSTDFEVRYIGSSCSLLVIDRGNQAIREIQLHFDDCVYQYEAG 247


>gi|297807503|ref|XP_002871635.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317472|gb|EFH47894.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 704

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/165 (32%), Positives = 80/165 (48%), Gaps = 42/165 (25%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GY + TV DG +L I P+++   P   +LIVLDS  S  + +S PLS  +  K ++G 
Sbjct: 75  ESGYNIETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYAKPKLVSGS 134

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
            +G  G+ DG+   A+ ++P+  A+D  GNIYVAD +N  IRKI++              
Sbjct: 135 QEGYTGHVDGKLKEAKMNRPRGLAIDDSGNIYVADTNNMAIRKISDDGVSTITAGGRWSG 194

Query: 133 --------------------------LDHGNQLIRQINLKPEDCS 151
                                     +D GNQ+I++I L   DCS
Sbjct: 195 GSKEESMRFSDDFDLIYVSSSCSLLVIDRGNQMIKEIQLHDHDCS 239


>gi|326512916|dbj|BAK03365.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326529495|dbj|BAK04694.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326529987|dbj|BAK08273.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 503

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 94/204 (46%), Gaps = 55/204 (26%)

Query: 8   SSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
           S+ A  +L  L+SL S+          +  E GYTV TV DG +L I P++V    G  +
Sbjct: 40  SNAASAVLKRLWSLKSTAKTASGAKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQGG-E 98

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L+V+DS  S  Y ++ PLS  S  K +AG  +G+ G+ DG    A+ + PK F VD +GN
Sbjct: 99  LLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGIPGHIDGRLREAKMNHPKGFTVDGRGN 158

Query: 117 IYVADKSNHVIRKITN-------------------------------------------L 133
           IYVAD  N  +RKI++                                           +
Sbjct: 159 IYVADAMNMAVRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIASSCSLLVI 218

Query: 134 DHGNQLIRQINLKPEDCSKSSQSG 157
           D GNQ IR+I L  +DC    ++G
Sbjct: 219 DRGNQAIREIQLNFDDCVYQYEAG 242


>gi|293331401|ref|NP_001168100.1| uncharacterized protein LOC100381837 precursor [Zea mays]
 gi|223946007|gb|ACN27087.1| unknown [Zea mays]
 gi|413938717|gb|AFW73268.1| hypothetical protein ZEAMMB73_298159 [Zea mays]
          Length = 510

 Score = 90.1 bits (222), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 96/204 (47%), Gaps = 55/204 (26%)

Query: 8   SSLAITLLSLLFSL-----VSSG----LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
           S+ A  ++  L+SL     ++SG    +  E GYTV TV DG +L I P+SV    G  +
Sbjct: 44  SNAASAVVKRLWSLKSTTKIASGGKSMVKYEGGYTVETVFDGSKLGIEPYSVEVTQGG-E 102

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG+   AR + PK F VD +GN
Sbjct: 103 LLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDRGN 162

Query: 117 IYVADKSNHVIRKITN-------------------------------------------L 133
           IYVAD  N  IRKI++                                           +
Sbjct: 163 IYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLVI 222

Query: 134 DHGNQLIRQINLKPEDCSKSSQSG 157
           D GNQ IR+I L  +DC    ++G
Sbjct: 223 DRGNQAIREIQLHFDDCVYQYEAG 246


>gi|413944258|gb|AFW76907.1| hypothetical protein ZEAMMB73_849037 [Zea mays]
          Length = 187

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 52/106 (49%), Positives = 64/106 (60%), Gaps = 3/106 (2%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GY V TV DG +L I PH+V   P + +L+VLDS  S  Y +  PLS  S  K LAG 
Sbjct: 73  EGGYAVDTVFDGSKLGIEPHAVEITP-AGNLLVLDSINSNIYRVQLPLSPYSRPKLLAGS 131

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
             G+ G+ DG    AR + PK F VD +GNIYVAD  N  IRKI++
Sbjct: 132 PKGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADGMNMAIRKISD 177


>gi|413938718|gb|AFW73269.1| hypothetical protein ZEAMMB73_298159 [Zea mays]
          Length = 511

 Score = 89.4 bits (220), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 68/205 (33%), Positives = 94/205 (45%), Gaps = 56/205 (27%)

Query: 8   SSLAITLLSLLFSLVSSGLLL----------EDGYTVTTVIDGHQLEINPHSVIDRPGSS 57
           S+ A  ++  L+SL S+  +           E GYTV TV DG +L I P+SV    G  
Sbjct: 44  SNAASAVVKRLWSLKSTTKIAASGGKSMVKYEGGYTVETVFDGSKLGIEPYSVEVTQGG- 102

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           +L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG+   AR + PK F VD +G
Sbjct: 103 ELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDRG 162

Query: 116 NIYVADKSNHVIRKITN------------------------------------------- 132
           NIYVAD  N  IRKI++                                           
Sbjct: 163 NIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSSCSLLV 222

Query: 133 LDHGNQLIRQINLKPEDCSKSSQSG 157
           +D GNQ IR+I L  +DC    ++G
Sbjct: 223 IDRGNQAIREIQLHFDDCVYQYEAG 247


>gi|242066098|ref|XP_002454338.1| hypothetical protein SORBIDRAFT_04g029020 [Sorghum bicolor]
 gi|241934169|gb|EES07314.1| hypothetical protein SORBIDRAFT_04g029020 [Sorghum bicolor]
          Length = 518

 Score = 89.4 bits (220), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 62/174 (35%), Positives = 82/174 (47%), Gaps = 46/174 (26%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GYTV TV DG +L I P+SV    G  +L+V+DS  S  Y ++ PLS  S  K +AG 
Sbjct: 75  EGGYTVETVFDGSKLGIEPYSVEVTQGG-ELLVMDSMNSNIYRMALPLSRYSRPKLVAGS 133

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
            +G  G+ DG+   AR + PK F VD +GNIYVAD  N  IRKI++              
Sbjct: 134 PEGFPGHVDGKLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSGRG 193

Query: 133 -----------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                        +D GNQ IR+I L  +DC    ++G
Sbjct: 194 GHVDGPSDEAKFSTDFEVRYVGSSCSLLVIDRGNQAIREIQLHFDDCVYQYEAG 247


>gi|9369390|gb|AAF87138.1|AC002423_3 T23E23.5 [Arabidopsis thaliana]
          Length = 493

 Score = 89.0 bits (219), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 70/232 (30%), Positives = 102/232 (43%), Gaps = 60/232 (25%)

Query: 1   MASHFVYSSLAITLLSLLFSLVSSG-----------LLLEDGYTVTTVIDGHQLEINPHS 49
           M+ HF++  + I L S   +   S            +  E+GY+V TV+DG +L I P+S
Sbjct: 1   MSRHFLFLGIIILLFSAFVASAPSSTSPATVPTKSMVKFENGYSVETVLDGSKLGIEPYS 60

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPK 107
            I    + +L++LDS  S  Y +S  LS  S  + + G  +G  G+ DG    AR + PK
Sbjct: 61  -IQVLSNGELLILDSQNSNIYQISSSLSLYSRPRLVTGSPEGYPGHVDGRLRDARLNNPK 119

Query: 108 SFAVDMKGNIYVADKSNHVIRKITN----------------------------------- 132
              VD +GNIYVAD  N+ IRKI+                                    
Sbjct: 120 GLTVDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVRGGGHVDGPSEDAKFSNDFDVVY 179

Query: 133 ---------LDHGNQLIRQINLKPEDCSKSSQSG--SGMFWVTVFSPYLISC 173
                    +D GNQ IR+I L  +DC+    SG   G  ++  FS ++ SC
Sbjct: 180 LGSSCSLLVIDRGNQAIREIQLHFDDCADQYGSGFPLGNIYIPKFSVWVPSC 231


>gi|357137830|ref|XP_003570502.1| PREDICTED: uncharacterized protein LOC100839308 [Brachypodium
           distachyon]
          Length = 495

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 67/204 (32%), Positives = 93/204 (45%), Gaps = 55/204 (26%)

Query: 8   SSLAITLLSLLFSLVS---------SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD 58
           S+ A  +L  L+SL S         S +  E GYTV TV DG +L I P++V    G  +
Sbjct: 39  SNAASAVLKRLWSLKSTTKTATGAKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQGG-E 97

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L+V+DS  S  Y ++ PLS  S  K +AG  +G  G+ DG    A+ + PK F VD +GN
Sbjct: 98  LLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGYPGHVDGRLREAKMNHPKGFTVDGRGN 157

Query: 117 IYVADKSNHVIRKITN-------------------------------------------L 133
           IYVAD  N  +RKI++                                           +
Sbjct: 158 IYVADAMNMAVRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIASSCSLLVI 217

Query: 134 DHGNQLIRQINLKPEDCSKSSQSG 157
           D GNQ IR+I L  +DC    ++G
Sbjct: 218 DRGNQAIREIQLHFDDCVYQYEAG 241


>gi|356573394|ref|XP_003554846.1| PREDICTED: uncharacterized protein LOC100793916 [Glycine max]
          Length = 528

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 77/170 (45%), Gaps = 47/170 (27%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
            E GY+V T+ DG QL I PHSV   P + + +VLDS  S  Y +S  +S  S  K LAG
Sbjct: 70  FESGYSVETIFDGSQLGIEPHSVKISP-NGEFLVLDSENSNIYKVSGSMSRYSRPKLLAG 128

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
             +G  G+ DG P  AR + PK   VD +GNIY+AD  N  IRKI++             
Sbjct: 129 SAEGNIGHIDGRPREARMNHPKGLTVDDRGNIYIADTLNMAIRKISDEGVTTIAGGKRGY 188

Query: 133 -------------------------------LDHGNQLIRQINLKPEDCS 151
                                          +D GN  IR+I L  +DC+
Sbjct: 189 AGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVVDRGNHAIREIQLHQDDCT 238


>gi|449446849|ref|XP_004141183.1| PREDICTED: uncharacterized protein LOC101216407 [Cucumis sativus]
 gi|449489545|ref|XP_004158343.1| PREDICTED: uncharacterized protein LOC101227198 [Cucumis sativus]
          Length = 417

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 54/172 (31%), Positives = 81/172 (47%), Gaps = 46/172 (26%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           +  E GY V T++DG ++ I P+SV   P S +L++LD+  S  + +S P+S+    K  
Sbjct: 2   IKFERGYVVETLLDGSKMGIEPYSVGVSP-SGELLILDAENSNVHKISMPVSQFCRPKLF 60

Query: 86  AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
           AG  +G  G+ DG+   AR   P+   VD +GNIY+AD  N  IRKI++           
Sbjct: 61  AGSSEGYSGHVDGKLRDARMSHPRGLTVDQRGNIYIADTKNKAIRKISDAGVTTIAGGKW 120

Query: 133 --------------------------------LDHGNQLIRQINLKPEDCSK 152
                                           +D GNQ IR+I L+ EDC++
Sbjct: 121 RKSGHIDGPSEESKFSNDFDLVYVGSSCSLLVVDRGNQAIREIQLRAEDCTE 172


>gi|147798773|emb|CAN69794.1| hypothetical protein VITISV_022544 [Vitis vinifera]
          Length = 491

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 64/182 (35%), Positives = 86/182 (47%), Gaps = 49/182 (26%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E GYTV TV DG +L I P+S I+  GS +L++LDS+ S  Y +S  LS+ +  K
Sbjct: 62  SMMKFESGYTVETVFDGSKLGIEPYS-IEVLGSGELLILDSANSNLYKISSSLSQYTRPK 120

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
            + G  +G  G+ DG    AR + PK   VD +GNIYVAD  N  IRKI++         
Sbjct: 121 LVTGSAEGYSGHVDGRLREARMNHPKGLTVDDRGNIYVADTLNMAIRKISDAGVTTIAGG 180

Query: 133 -----------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                              +D GNQ IR+I L  +DC  + Q G
Sbjct: 181 KLGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHFDDC--AYQYG 238

Query: 158 SG 159
           SG
Sbjct: 239 SG 240


>gi|224102017|ref|XP_002312513.1| predicted protein [Populus trichocarpa]
 gi|222852333|gb|EEE89880.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 71/208 (34%), Positives = 99/208 (47%), Gaps = 57/208 (27%)

Query: 6   VYSSLAITLLSLLFSL------VSSG---LLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           ++S++   L+  L+SL      V SG   +  E GYTV TV DG +L I P+SV   P S
Sbjct: 38  LFSNVVSALMKWLWSLKATTKTVISGRPMMKFESGYTVETVFDGSKLGIEPYSVQVLP-S 96

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +L++LDS+ S  Y +S  LS  S  K +AG  +G  G+ DG+   A+ + PK   VD +
Sbjct: 97  GELLILDSANSNIYRISASLSLYSRPKLVAGSHEGYSGHVDGKLREAKMNHPKGLTVDDR 156

Query: 115 GNIYVADKSNHVIRKITN------------------------------------------ 132
           GNIYVAD  N  IRKI++                                          
Sbjct: 157 GNIYVADTMNMAIRKISDAGVTTIAGGKWGRGSHVDGASEDANFSNDFDVVYIGSSCSLL 216

Query: 133 -LDHGNQLIRQINLKPEDCSKSSQSGSG 159
            +D GN+ IR+I L  +DC  + Q GSG
Sbjct: 217 VIDRGNRAIREIQLHFDDC--AYQYGSG 242


>gi|225424717|ref|XP_002265467.1| PREDICTED: uncharacterized protein LOC100263777 [Vitis vinifera]
 gi|296086531|emb|CBI32120.3| unnamed protein product [Vitis vinifera]
          Length = 491

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 64/182 (35%), Positives = 86/182 (47%), Gaps = 49/182 (26%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E GYTV TV DG +L I P+S I+  GS +L++LDS+ S  Y +S  LS+ +  K
Sbjct: 62  SMMKFESGYTVETVFDGSKLGIEPYS-IEVLGSGELLILDSANSNLYKISSSLSQYTRPK 120

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
            + G  +G  G+ DG    AR + PK   VD +GNIYVAD  N  IRKI++         
Sbjct: 121 LVTGSAEGYSGHVDGRLREARMNHPKGLTVDDRGNIYVADTLNMAIRKISDAGVTTIAGG 180

Query: 133 -----------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                              +D GNQ IR+I L  +DC  + Q G
Sbjct: 181 KLGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHFDDC--AYQYG 238

Query: 158 SG 159
           SG
Sbjct: 239 SG 240


>gi|225461726|ref|XP_002285515.1| PREDICTED: uncharacterized protein LOC100245328 [Vitis vinifera]
 gi|302142876|emb|CBI20171.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 61/186 (32%), Positives = 89/186 (47%), Gaps = 50/186 (26%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  EDGY V TV++G++L + PHS+ +   G  +L  +D+ ++    ++ PLS+ S  + 
Sbjct: 57  LQFEDGYLVETVVEGNELGVVPHSIRVSEDG--ELFAVDAVKNNIVRITPPLSQYSRARL 114

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
           +AG   G  G+ DG+P  ARF+ PK   +D KGN+YVAD SN  IRKI +          
Sbjct: 115 VAGSFQGHTGHVDGKPSDARFNGPKGVTMDDKGNVYVADTSNLAIRKIGDSGVTTIAGGK 174

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDC--SKSSQSG 157
                                            +D GN  +RQI+L  EDC    SS S 
Sbjct: 175 SNVAGYRDGPSEDAKFSSDFDVVYVRPTCSLLVVDRGNAALRQISLNQEDCDYQNSSISA 234

Query: 158 SGMFWV 163
           + +F V
Sbjct: 235 TDIFMV 240


>gi|449492692|ref|XP_004159073.1| PREDICTED: uncharacterized protein LOC101226879 [Cucumis sativus]
          Length = 516

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 94/201 (46%), Gaps = 56/201 (27%)

Query: 6   VYSSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           V S++  +L+  L+S+ SS          +  E GY+V TV DG +L I+P+SV   P S
Sbjct: 34  VLSNVVSSLVKKLWSMKSSAKTAVSSRSMMKFESGYSVETVFDGSKLGIDPYSVEMSP-S 92

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +L++LD+  S  + +S PLS  S  K ++G  +G  G+ DG    AR + PK   +D +
Sbjct: 93  GELLILDAENSNIHKISMPLSRFSRPKLVSGSAEGYSGHVDGHSREARMNHPKGLTLDER 152

Query: 115 GNIYVADKSNHVIRKITN------------------------------------------ 132
           GNIY+AD  N  IRKI++                                          
Sbjct: 153 GNIYIADTMNMAIRKISDTGVTTIAGGRWNQGSGHIDGPSEDAKFSNDFDVVYVGSSCSL 212

Query: 133 --LDHGNQLIRQINLKPEDCS 151
             +D GN+ IR+I L  +DC+
Sbjct: 213 LVIDRGNKAIREIELNYDDCN 233


>gi|195609352|gb|ACG26506.1| hypothetical protein [Zea mays]
 gi|414870563|tpg|DAA49120.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 159

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 49/109 (44%), Positives = 70/109 (64%), Gaps = 5/109 (4%)

Query: 27  LLEDGYTVTTVIDGHQLEIN---PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           +LEDGYTVTT  D + L  +   P++V+ R  + DL++LDS+ SA YT+S   S     +
Sbjct: 28  VLEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISL-SSSPGEPR 86

Query: 84  RLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           RLAG     G++DG PG A FD+P+S AVD   N+YVAD+ +  +RK+ 
Sbjct: 87  RLAGGKRGSGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVA 135


>gi|224108107|ref|XP_002314724.1| predicted protein [Populus trichocarpa]
 gi|222863764|gb|EEF00895.1| predicted protein [Populus trichocarpa]
          Length = 487

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 68/208 (32%), Positives = 96/208 (46%), Gaps = 57/208 (27%)

Query: 6   VYSSLAITLLSLLFSLVSSG---------LLLEDGYTVTTVIDGHQLEINPHSVIDRPGS 56
           V+S++    +  L+S+ S+          +  E GYTV TV DG +L I P+SV   P S
Sbjct: 41  VFSNVVPAFMKWLWSMKSTTKTVISGRPMMKFESGYTVETVFDGSKLGIEPYSVQVLP-S 99

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +L++LDS  S  Y +S  LS  S  K +AG  +G  G+ DG+   A+ + PK   VD +
Sbjct: 100 GELLILDSVNSNIYRMSSSLSLYSRPKLVAGSHEGYSGHVDGKLREAKMNHPKGLTVDDR 159

Query: 115 GNIYVADKSNHVIRKITN------------------------------------------ 132
           GNIY+AD  N  IRKI++                                          
Sbjct: 160 GNIYIADTMNMAIRKISDAGVTTIAGGKWGRGSHVDGASEDAKFSNDFDVLYIGSSCSLL 219

Query: 133 -LDHGNQLIRQINLKPEDCSKSSQSGSG 159
            +D GNQ IR+I L  +DC+   Q GSG
Sbjct: 220 VIDRGNQAIREIQLHFDDCAY--QYGSG 245


>gi|357118152|ref|XP_003560822.1| PREDICTED: uncharacterized protein LOC100834977 [Brachypodium
           distachyon]
          Length = 544

 Score = 86.7 bits (213), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 59/174 (33%), Positives = 81/174 (46%), Gaps = 46/174 (26%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E  Y V TV DG +L I P++V   P + +L+VLDS  S  Y +  PLS  S  K ++G 
Sbjct: 79  EGWYEVETVFDGSKLGIEPYAVEVTP-AGELLVLDSMNSNIYRVQLPLSRYSRPKLVSGS 137

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
            +G+ G+ DG    AR + PK F VD +GNIYVAD  N  IRKI++              
Sbjct: 138 PEGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSVRG 197

Query: 133 -----------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                        +D GNQ IR+I L+ +DC    ++G
Sbjct: 198 GHTDGPSDDAKFSTDFEIRYISSSCSLLVIDRGNQAIREIPLQLDDCEYQHEAG 251


>gi|255560011|ref|XP_002521024.1| conserved hypothetical protein [Ricinus communis]
 gi|223539861|gb|EEF41441.1| conserved hypothetical protein [Ricinus communis]
          Length = 500

 Score = 85.9 bits (211), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 60/178 (33%), Positives = 83/178 (46%), Gaps = 47/178 (26%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           +  E GY V TV DG +L I P+SV   P + +L++LDS+ S  Y +S  LS  S  K +
Sbjct: 72  MKFEGGYNVETVFDGSKLGIEPYSVEVLP-NGELLILDSANSNIYRISSSLSLYSRPKLV 130

Query: 86  AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
           AG  DG  G+ DG+   AR + PK   VD +GNIY+AD  N  IRKI++           
Sbjct: 131 AGSPDGYSGHVDGKHREARMNHPKGLTVDDRGNIYIADAMNMAIRKISDAGVTTIAGGKW 190

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                            +D GN+ IR+I L  +DC+   +SG
Sbjct: 191 GRGGSHVDGASEDAKFSNDFDVVYIGSSCSLLVIDRGNRAIREIQLHFDDCAYQYESG 248


>gi|356497860|ref|XP_003517774.1| PREDICTED: uncharacterized protein LOC100799483 [Glycine max]
          Length = 508

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 84/180 (46%), Gaps = 47/180 (26%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E GYTV TV DG +L I P++V   P + +L++LDS+ S  Y +S  LS  S  K
Sbjct: 62  SMMKFESGYTVETVFDGSKLGIEPYAVEVLP-NGELLILDSANSNIYRISSSLSLNSRPK 120

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
            +AG  +G  G+ DG    AR + PK   VD +GNIYVAD  N  IRKI++         
Sbjct: 121 LVAGSAEGYSGHVDGRFREARMNHPKGITVDNRGNIYVADIMNMAIRKISDSGVTTIAGG 180

Query: 133 -----------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                              +D GNQ IR+I L  +DC+   ++G
Sbjct: 181 KWSRGGGHVDGPSEEAKFSNDFDVVYVGSSCSLLVIDRGNQAIREIQLHFDDCAYQYENG 240


>gi|356501873|ref|XP_003519748.1| PREDICTED: uncharacterized protein LOC100797570 [Glycine max]
          Length = 507

 Score = 84.7 bits (208), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/182 (32%), Positives = 85/182 (46%), Gaps = 47/182 (25%)

Query: 22  VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESV 81
             S +  E GYTV TV DG +L I P++V +   + +L++LDS+ S  Y +S  LS  + 
Sbjct: 60  AKSMMKFESGYTVETVFDGSKLGIEPYAV-EVLANGELLILDSANSNIYRISSSLSLNTR 118

Query: 82  VKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------- 132
            K +AG  +G  G+ DG    AR + PK  AVD +GNIY+AD  N  IRKI++       
Sbjct: 119 PKLVAGSAEGYSGHVDGRLREARMNHPKGIAVDNRGNIYIADIMNMAIRKISDSGVTTIA 178

Query: 133 -------------------------------------LDHGNQLIRQINLKPEDCSKSSQ 155
                                                +D GNQ IR+I L  +DC+   +
Sbjct: 179 GGKWSRGGGHIDGPSEEAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHFDDCAYQYE 238

Query: 156 SG 157
           +G
Sbjct: 239 NG 240


>gi|317015034|gb|ADU85604.1| NHL repeat-containing protein, partial [Wolffia arrhiza]
          Length = 239

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/107 (43%), Positives = 65/107 (60%), Gaps = 3/107 (2%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
            E GY V T+ DG++L + PH++   P   +LI +DS  S    ++ PLS  S  + +AG
Sbjct: 60  FESGYLVETIADGNRLGLTPHAIRVSP-DGELIAVDSVNSNIVRITPPLSAFSRGRLVAG 118

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
              G  G  DG+P  ARF++P+  AVD +GNIYVAD SN  IRKI +
Sbjct: 119 SFQGRSGLIDGKPSEARFNQPRGVAVDRRGNIYVADVSNLAIRKIGD 165


>gi|449449204|ref|XP_004142355.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101209861 [Cucumis sativus]
          Length = 454

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 82/174 (47%), Gaps = 47/174 (27%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E GY+V TV DG +L I+P+SV   P S +L++LD+  S  + +S PLS  S  K
Sbjct: 13  SMMKFESGYSVETVFDGSKLGIDPYSVEMSP-SGELLILDAENSNIHKISMPLSRFSRPK 71

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
            ++G  +G  G+ DG    AR + PK   +D +GNIY+AD  N  IRKI++         
Sbjct: 72  LVSGSAEGYSGHVDGHSREARMNHPKGLTLDERGNIYIADTMNMAIRKISDTGVTTIAGG 131

Query: 133 -----------------------------------LDHGNQLIRQINLKPEDCS 151
                                              +D GN+ IR+I L  +DC+
Sbjct: 132 RWNQGSGHIDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIELNYDDCN 185


>gi|297845436|ref|XP_002890599.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336441|gb|EFH66858.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 63/197 (31%), Positives = 90/197 (45%), Gaps = 49/197 (24%)

Query: 9   SLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSA 68
           SL+    ++      S +  E+GY+V TV+DG +L I P+S+   P + +L++LDS  S 
Sbjct: 49  SLSFKTTTITAVSTKSMVKFENGYSVETVLDGSKLGIEPYSLQVLP-NGELLILDSQNSN 107

Query: 69  FYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
            Y +S  LS  S  + + G  +G  G+ DG    AR + PK   VD +GNIYVAD  N+ 
Sbjct: 108 IYKISSSLSLYSRPRLITGSPEGYPGHVDGRLRDARLNNPKGLTVDDRGNIYVADTVNNA 167

Query: 127 IRKITN--------------------------------------------LDHGNQLIRQ 142
           IRKI+                                             +D GNQ IR+
Sbjct: 168 IRKISEAGVTTIAGGKMVHGGGHVDGPSEDAKFSNDFDVVYLGSSCSLLVIDRGNQAIRE 227

Query: 143 INLKPEDCSKSSQSGSG 159
           I L  +DC  + Q GSG
Sbjct: 228 IQLHFDDC--ADQYGSG 242


>gi|356549639|ref|XP_003543199.1| PREDICTED: uncharacterized protein LOC100797841 [Glycine max]
          Length = 487

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 57/191 (29%), Positives = 85/191 (44%), Gaps = 45/191 (23%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           L  E+GY V TV++G+++ + P+ +       +L  +D+  S    ++ PLS+ S  + +
Sbjct: 57  LQFENGYVVETVVEGNEIGVVPYRIRVSEEDGELFAVDAINSNIVRITPPLSQYSRGRLV 116

Query: 86  AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
           AG   G  G+ DG+P  ARF+ PK   VD KGN+YVAD  N  IRKI +           
Sbjct: 117 AGSFQGYTGHVDGKPSDARFNHPKGITVDDKGNVYVADTQNLAIRKIGDAGVTTIAGGKS 176

Query: 133 --------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSGM 160
                                           +D GN  +RQI+L  EDC   S S S  
Sbjct: 177 NVAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLDQEDCDYQSNSISST 236

Query: 161 FWVTVFSPYLI 171
             +TV    ++
Sbjct: 237 DILTVVGAVIV 247


>gi|22329761|ref|NP_173800.2| NHL domain-containing protein [Arabidopsis thaliana]
 gi|18700091|gb|AAL77657.1| At1g23880/T23E23_8 [Arabidopsis thaliana]
 gi|20856013|gb|AAM26643.1| At1g23880/T23E23_8 [Arabidopsis thaliana]
 gi|332192324|gb|AEE30445.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 545

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 60/178 (33%), Positives = 82/178 (46%), Gaps = 49/178 (27%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
            E+GY+V TV+DG +L I P+S I    + +L++LDS  S  Y +S  LS  S  + + G
Sbjct: 128 FENGYSVETVLDGSKLGIEPYS-IQVLSNGELLILDSQNSNIYQISSSLSLYSRPRLVTG 186

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
             +G  G+ DG    AR + PK   VD +GNIYVAD  N+ IRKI+              
Sbjct: 187 SPEGYPGHVDGRLRDARLNNPKGLTVDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVR 246

Query: 133 -------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
                                          +D GNQ IR+I L  +DC  + Q GSG
Sbjct: 247 GGGHVDGPSEDAKFSNDFDVVYLGSSCSLLVIDRGNQAIREIQLHFDDC--ADQYGSG 302


>gi|356565931|ref|XP_003551189.1| PREDICTED: uncharacterized protein LOC100781836 [Glycine max]
          Length = 493

 Score = 82.8 bits (203), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 62/182 (34%), Positives = 86/182 (47%), Gaps = 49/182 (26%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           S +  E GY V TV DG +L I P++V   P + +L++LDS+ S  Y +S  LS  S  K
Sbjct: 66  SMMKFESGYNVETVFDGSKLGIEPYAVEVLP-NGELLILDSANSNIYRISSSLSLYSRPK 124

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
            +AG  +G  G+ DG+   AR + PK   VD +GNIYVAD +N  IRKI++         
Sbjct: 125 LVAGSAEGYSGHVDGKLREARMNHPKGITVDDRGNIYVADTTNMAIRKISDSGVTTIAGG 184

Query: 133 -----------------------------------LDHGNQLIRQINLKPEDCSKSSQSG 157
                                              +D GN+ IR+I L  +DC  + Q G
Sbjct: 185 KWNRGGGHVDGPSEEAKFSDDLDVVYVGSSCSLLVIDRGNRAIREIQLHFDDC--AYQYG 242

Query: 158 SG 159
           SG
Sbjct: 243 SG 244


>gi|224117004|ref|XP_002317450.1| predicted protein [Populus trichocarpa]
 gi|222860515|gb|EEE98062.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 82.4 bits (202), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 63/207 (30%), Positives = 88/207 (42%), Gaps = 54/207 (26%)

Query: 3   SHFVYS-SLAITLLSLLFSLVSSG-------LLLEDGYTVTTVIDGHQLEINPHSVIDRP 54
           SHF Y+  LA+  L+  F   +         L  EDGY V TV+ G+ + + P+  I   
Sbjct: 4   SHFSYTLVLAVLSLACTFQFQAHAAPPDENVLQFEDGYLVETVVKGNAMGVVPYK-IRLS 62

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVD 112
              +L  +D   S    ++ PLS+ S  + +AG   G  G+ DG+P   RF+ P+   +D
Sbjct: 63  EDGELYAVDEVNSNVVKITPPLSQYSRARLVAGSFQGYTGHIDGKPNEVRFNHPRGLTMD 122

Query: 113 MKGNIYVADKSNHVIRKITN---------------------------------------- 132
            KGNIYVAD  NH IRKI +                                        
Sbjct: 123 DKGNIYVADSLNHAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVHSTCS 182

Query: 133 ---LDHGNQLIRQINLKPEDCSKSSQS 156
              +D GN  +RQI+L  EDC   S S
Sbjct: 183 LLVVDRGNAALRQISLNQEDCDYQSSS 209


>gi|413936635|gb|AFW71186.1| hypothetical protein ZEAMMB73_395034 [Zea mays]
          Length = 501

 Score = 82.4 bits (202), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 56/176 (31%), Positives = 80/176 (45%), Gaps = 46/176 (26%)

Query: 22  VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEES 80
           V   L  E GY V T+++G +L + PH+V   P    +L+ +DS+ S    ++ PLSE S
Sbjct: 67  VGVALQFESGYFVETLVEGDKLGVTPHTVRVSPVEGGELLAVDSAHSNIVRITPPLSEYS 126

Query: 81  VVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------ 132
             + +AG   G  G+ DG+P  ARF +P   AVD  GN+YVAD +N  IRKI        
Sbjct: 127 RGRLVAGSFQGHSGHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTI 186

Query: 133 -------------------------------------LDHGNQLIRQINLKPEDCS 151
                                                +D GN  +R+I+L  EDC+
Sbjct: 187 AGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCT 242


>gi|224117390|ref|XP_002331700.1| predicted protein [Populus trichocarpa]
 gi|222874306|gb|EEF11437.1| predicted protein [Populus trichocarpa]
          Length = 495

 Score = 82.0 bits (201), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 64/213 (30%), Positives = 92/213 (43%), Gaps = 53/213 (24%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  EDGY V TV++G+ + + P+ + +   G  +L  +D   S    ++ PLS+ S   R
Sbjct: 61  LQFEDGYLVETVVEGNAMGVVPYKIRVSEDG--ELYAVDEVNSNVVKITPPLSQYSRA-R 117

Query: 85  LAGDGVQGYS---DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
           LA    QGY+   DG+P  ARF+ P+   +D KGNIYVAD  N  IRKI +         
Sbjct: 118 LAAGSFQGYTGHIDGKPNEARFNHPRGLTMDDKGNIYVADTLNLAIRKIGDAGVTTIAGG 177

Query: 133 ----------------------------------LDHGNQLIRQINLKPEDCSKSSQSGS 158
                                             +D GN  +RQI+L  EDC   S+   
Sbjct: 178 KSNVAGFRDGPSEDAKFSNDFDVVYLHSTCSLLVVDRGNAALRQISLNQEDCDYQSKCTR 237

Query: 159 GMFWVTVFSPYLISCKSFWVHLYLYFCSFKGSI 191
                T+ S  L+S    W  L+ Y     G++
Sbjct: 238 K---TTIESTRLVSKTRCWGVLFDYVIMVVGAV 267


>gi|226504928|ref|NP_001143593.1| uncharacterized protein LOC100276297 precursor [Zea mays]
 gi|195623016|gb|ACG33338.1| hypothetical protein [Zea mays]
          Length = 501

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 55/176 (31%), Positives = 80/176 (45%), Gaps = 46/176 (26%)

Query: 22  VSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEES 80
           V   L  E GY V T+++G +L + PH++   P    +L+ +DS+ S    ++ PLSE S
Sbjct: 67  VGVALQFESGYFVETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYS 126

Query: 81  VVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------ 132
             + +AG   G  G+ DG+P  ARF +P   AVD  GN+YVAD +N  IRKI        
Sbjct: 127 RGRLVAGSFQGHSGHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTI 186

Query: 133 -------------------------------------LDHGNQLIRQINLKPEDCS 151
                                                +D GN  +R+I+L  EDC+
Sbjct: 187 AGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCT 242


>gi|242061092|ref|XP_002451835.1| hypothetical protein SORBIDRAFT_04g008420 [Sorghum bicolor]
 gi|241931666|gb|EES04811.1| hypothetical protein SORBIDRAFT_04g008420 [Sorghum bicolor]
          Length = 495

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 54/172 (31%), Positives = 79/172 (45%), Gaps = 46/172 (26%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  E GY V T+++G +L + PH++   P    +L+ +DS+ S    ++ PLSE S  + 
Sbjct: 70  LQFESGYFVETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYSRGRL 129

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
           +AG   G  G+ DG+P  ARF +P   AVD  GN+YVAD +N  IRKI            
Sbjct: 130 VAGSFQGHSGHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTIAGGK 189

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCS 151
                                            +D GN  +R+I+L  EDC+
Sbjct: 190 SNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCT 241


>gi|357140648|ref|XP_003571876.1| PREDICTED: uncharacterized protein LOC100824092 [Brachypodium
           distachyon]
          Length = 491

 Score = 80.5 bits (197), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 53/173 (30%), Positives = 79/173 (45%), Gaps = 46/173 (26%)

Query: 25  GLLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
            L  E GY V T+++G +L + PH++   P    +L+ +DS+ S    ++ PLSE S  +
Sbjct: 65  ALQFESGYFVETLVEGDKLGVTPHTIRVSPLEGGELLAVDSAHSNIVRITPPLSEYSRAR 124

Query: 84  RLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--------- 132
            +AG   G  G+ DG+P  ARF +P   AVD  GN+Y+AD +N  IRKI +         
Sbjct: 125 LVAGSFQGHAGHIDGKPIDARFKRPTGVAVDDTGNVYIADTANLAIRKIGDSGVTTIAGG 184

Query: 133 ----------------------------------LDHGNQLIRQINLKPEDCS 151
                                             +D GN  +R+I L  EDC+
Sbjct: 185 KSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKIALPQEDCT 237


>gi|115445197|ref|NP_001046378.1| Os02g0234200 [Oryza sativa Japonica Group]
 gi|50251216|dbj|BAD27660.1| putative NHL repeat-containing protein [Oryza sativa Japonica
           Group]
 gi|113535909|dbj|BAF08292.1| Os02g0234200 [Oryza sativa Japonica Group]
 gi|125538730|gb|EAY85125.1| hypothetical protein OsI_06475 [Oryza sativa Indica Group]
 gi|222622487|gb|EEE56619.1| hypothetical protein OsJ_06002 [Oryza sativa Japonica Group]
          Length = 493

 Score = 80.1 bits (196), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 54/172 (31%), Positives = 78/172 (45%), Gaps = 46/172 (26%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRP-GSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  E GY V T+++G +L + PH++   P    +L+ +DS+ S    ++ PLSE S  + 
Sbjct: 69  LQFESGYFVETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYSRGRL 128

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
           +AG   G  G+ DG+P  ARF +P   AVD  GN+YVAD +N  IRKI            
Sbjct: 129 VAGSFQGHSGHIDGKPSDARFKRPTGVAVDDTGNVYVADTANLAIRKIGESGVTTIAGGK 188

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCS 151
                                            +D GN  +R+I L  EDC+
Sbjct: 189 SNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKIALPQEDCT 240


>gi|449456567|ref|XP_004146020.1| PREDICTED: uncharacterized protein LOC101206392 [Cucumis sativus]
          Length = 477

 Score = 79.7 bits (195), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 52/173 (30%), Positives = 80/173 (46%), Gaps = 48/173 (27%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
           + +  E GY V T+++G+++ + P+ + +   G  +L  +DS  S    +S PLS  S  
Sbjct: 55  NAIQFESGYLVETIVEGNEIGMVPYKIRVSEDG--ELFAVDSVNSNVVKVSPPLSRYSRA 112

Query: 83  KRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------- 132
           + +AG   G +G+ DG+P  ARF++PK   +D KGN+YVAD  N  IRKI +        
Sbjct: 113 RLVAGSFQGYKGHVDGKPSDARFNQPKGITIDDKGNVYVADTLNLAIRKIVDAGVTTIAG 172

Query: 133 -----------------------------------LDHGNQLIRQINLKPEDC 150
                                              +D GN  +RQI+L  EDC
Sbjct: 173 GKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVVDRGNAALRQISLNKEDC 225


>gi|449518065|ref|XP_004166064.1| PREDICTED: uncharacterized protein LOC101228723 [Cucumis sativus]
          Length = 449

 Score = 79.3 bits (194), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 52/173 (30%), Positives = 80/173 (46%), Gaps = 48/173 (27%)

Query: 24  SGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
           + +  E GY V T+++G+++ + P+ + +   G  +L  +DS  S    +S PLS  S  
Sbjct: 27  NAIQFESGYLVETIVEGNEIGMVPYKIRVSEDG--ELFAVDSVNSNVVKVSPPLSRYSRA 84

Query: 83  KRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------- 132
           + +AG   G +G+ DG+P  ARF++PK   +D KGN+YVAD  N  IRKI +        
Sbjct: 85  RLVAGSFQGYKGHVDGKPSDARFNQPKGITIDDKGNVYVADTLNLAIRKIVDAGVTTIAG 144

Query: 133 -----------------------------------LDHGNQLIRQINLKPEDC 150
                                              +D GN  +RQI+L  EDC
Sbjct: 145 GKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVVDRGNAALRQISLNKEDC 197


>gi|224140207|ref|XP_002323476.1| predicted protein [Populus trichocarpa]
 gi|222868106|gb|EEF05237.1| predicted protein [Populus trichocarpa]
          Length = 453

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 48/108 (44%), Positives = 62/108 (57%), Gaps = 3/108 (2%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           +  E GYTV TV DG +L I P+SV   P S +L++LDS  S  Y +S   S+    K +
Sbjct: 9   MKFEGGYTVETVFDGSKLGIEPYSVEVSP-SGELLLLDSENSNIYKISTQFSKYGRPKLI 67

Query: 86  AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            G  +G  G+ DG+   AR + PK   VD  GNIYVAD  N  IRKI+
Sbjct: 68  TGSFEGYPGHVDGKLRDARMNHPKGLTVDEGGNIYVADTMNMAIRKIS 115


>gi|449438911|ref|XP_004137231.1| PREDICTED: uncharacterized protein LOC101210502 [Cucumis sativus]
          Length = 480

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 54/171 (31%), Positives = 80/171 (46%), Gaps = 48/171 (28%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  E+GY V TV++G+++ + PH + + + G  +L V+DS  S    ++ PLS+ +  + 
Sbjct: 59  LQFENGYLVGTVVEGNEIGVLPHKIHVSKDG--ELFVVDSVNSNIVKITPPLSKYTRARL 116

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
           +AG      G+ DG+P  ARF+ P+   VD KGN+YVAD  N  IRKI +          
Sbjct: 117 VAGSFQSHTGHIDGKPNDARFNHPRGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGK 176

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDC 150
                                            +D GN  IRQI+L  EDC
Sbjct: 177 SNVVGYRDGPGEDAKFSNDFDVMYVRSTCSLLVIDRGNAAIRQISLNQEDC 227


>gi|449530424|ref|XP_004172195.1| PREDICTED: uncharacterized LOC101210502, partial [Cucumis sativus]
          Length = 437

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 54/171 (31%), Positives = 80/171 (46%), Gaps = 48/171 (28%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  E+GY V TV++G+++ + PH + + + G  +L V+DS  S    ++ PLS+ +  + 
Sbjct: 16  LQFENGYLVGTVVEGNEIGVLPHKIHVSKDG--ELFVVDSVNSNIVKITPPLSKYTRARL 73

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
           +AG      G+ DG+P  ARF+ P+   VD KGN+YVAD  N  IRKI +          
Sbjct: 74  VAGSFQSHTGHIDGKPNDARFNHPRGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGK 133

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDC 150
                                            +D GN  IRQI+L  EDC
Sbjct: 134 SNVVGYRDGPGEDAKFSNDFDVMYVRSTCSLLVIDRGNAAIRQISLNQEDC 184


>gi|297841763|ref|XP_002888763.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334604|gb|EFH65022.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 503

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 58/178 (32%), Positives = 82/178 (46%), Gaps = 49/178 (27%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
            E+GY+V TV DG +L I P+S+   P + +L++LDS  S  Y +S  LS  S  + + G
Sbjct: 66  FENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILDSENSNIYKISSSLSLYSRPRLVTG 124

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
             +G  G+ DG    A+ + PK   VD +GNIYVAD  N+ IRKI+              
Sbjct: 125 SPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVR 184

Query: 133 -------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
                                          +D GN+ IR+I L  +DC+   Q GSG
Sbjct: 185 NGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDCAY--QYGSG 240


>gi|30698020|ref|NP_177185.3| NHL domain-containing protein [Arabidopsis thaliana]
 gi|332196920|gb|AEE35041.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 509

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 58/178 (32%), Positives = 82/178 (46%), Gaps = 49/178 (27%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
            E+GY+V TV DG +L I P+S+   P + +L++LDS  S  Y +S  LS  S  + + G
Sbjct: 66  FENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILDSENSNIYKISSSLSLYSRPRLVTG 124

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
             +G  G+ DG    A+ + PK   VD +GNIYVAD  N+ IRKI+              
Sbjct: 125 SPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVR 184

Query: 133 -------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
                                          +D GN+ IR+I L  +DC+   Q GSG
Sbjct: 185 NGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDCAY--QYGSG 240


>gi|3176691|gb|AAC18814.1| Contains homology to serine/threonine protein kinase gb|X99618 from
           Mycobacterium tuberculosis. ESTs gb|F14403, gb|F14404,
           and gb|N96730 come from this gene [Arabidopsis thaliana]
          Length = 493

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 58/178 (32%), Positives = 82/178 (46%), Gaps = 49/178 (27%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
            E+GY+V TV DG +L I P+S+   P + +L++LDS  S  Y +S  LS  S  + + G
Sbjct: 39  FENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILDSENSNIYKISSSLSLYSRPRLVTG 97

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
             +G  G+ DG    A+ + PK   VD +GNIYVAD  N+ IRKI+              
Sbjct: 98  SPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVR 157

Query: 133 -------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
                                          +D GN+ IR+I L  +DC  + Q GSG
Sbjct: 158 NGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDC--AYQYGSG 213


>gi|30698018|ref|NP_850974.1| NHL domain-containing protein [Arabidopsis thaliana]
 gi|17065224|gb|AAL32766.1| Unknown protein [Arabidopsis thaliana]
 gi|21387163|gb|AAM47985.1| unknown protein [Arabidopsis thaliana]
 gi|332196919|gb|AEE35040.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 447

 Score = 77.0 bits (188), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 58/178 (32%), Positives = 82/178 (46%), Gaps = 49/178 (27%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
            E+GY+V TV DG +L I P+S+   P + +L++LDS  S  Y +S  LS  S  + + G
Sbjct: 4   FENGYSVETVFDGSKLGIEPYSIEVLP-NGELLILDSENSNIYKISSSLSLYSRPRLVTG 62

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN------------- 132
             +G  G+ DG    A+ + PK   VD +GNIYVAD  N+ IRKI+              
Sbjct: 63  SPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVR 122

Query: 133 -------------------------------LDHGNQLIRQINLKPEDCSKSSQSGSG 159
                                          +D GN+ IR+I L  +DC  + Q GSG
Sbjct: 123 NGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDC--AYQYGSG 178


>gi|255567088|ref|XP_002524526.1| conserved hypothetical protein [Ricinus communis]
 gi|223536200|gb|EEF37853.1| conserved hypothetical protein [Ricinus communis]
          Length = 484

 Score = 76.6 bits (187), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 54/177 (30%), Positives = 80/177 (45%), Gaps = 48/177 (27%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  EDGY V TV++G+++ + P+ + +   G  +L  +D   S    ++ PLS+ S  + 
Sbjct: 60  LQFEDGYLVETVVEGNEIGVVPYKIRVSEDG--ELYAVDEVNSNIVKITPPLSQYSRARL 117

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
           +AG   G +G+ DG+   ARF+ PK   +D KGN+YVAD  N  IRKI +          
Sbjct: 118 VAGSFQGYKGHVDGKSNEARFNHPKGITMDDKGNVYVADTLNLAIRKIGDAGVTTIAGGK 177

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDCSKSSQS 156
                                            +D GN  +RQI+L  EDC   S S
Sbjct: 178 SNTAGYRDGPSEDAKFSTDFDVVYVHSTCSLLVVDRGNVALRQISLNQEDCDYQSSS 234


>gi|357452081|ref|XP_003596317.1| hypothetical protein MTR_2g075860 [Medicago truncatula]
 gi|355485365|gb|AES66568.1| hypothetical protein MTR_2g075860 [Medicago truncatula]
          Length = 493

 Score = 75.9 bits (185), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 42/108 (38%), Positives = 63/108 (58%), Gaps = 4/108 (3%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           L  E+GY V TV++G+++ + P+ +       +L  +D   S    ++ PLS+ S  + +
Sbjct: 63  LQFENGYVVETVVEGNEIGVIPYRIRVSEEDGELFAVDEINSNIVRITPPLSQYSRGRLV 122

Query: 86  AGDGVQGYSD---GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           AG   QGY+D   G+P  ARF+ PK   +D KGN+YVAD  N  IRKI
Sbjct: 123 AG-SFQGYTDHVDGKPSDARFNHPKGITMDDKGNVYVADTQNLAIRKI 169


>gi|168018878|ref|XP_001761972.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686689|gb|EDQ73076.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 386

 Score = 75.9 bits (185), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 46/106 (43%), Positives = 64/106 (60%), Gaps = 6/106 (5%)

Query: 31  GYTVTTVIDGHQLEINPHSV---IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
           GY VTTV++G+   ++ + +    DR   +  IVLDS++S    +  PLS++SVV+ +AG
Sbjct: 44  GYDVTTVLNGNLRGLSFYCIDEATDR-APAWAIVLDSTKSKVLRVQLPLSQDSVVEHIAG 102

Query: 88  --DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
             +G  GY DG  G A F+ PK   +D  GNIYVAD  N  IR IT
Sbjct: 103 SLEGKAGYQDGRGGDALFNHPKMLTLDSDGNIYVADVRNTAIRMIT 148


>gi|357485963|ref|XP_003613269.1| hypothetical protein MTR_5g034750 [Medicago truncatula]
 gi|355514604|gb|AES96227.1| hypothetical protein MTR_5g034750 [Medicago truncatula]
          Length = 589

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 88/208 (42%), Gaps = 65/208 (31%)

Query: 14  LLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS 73
           +L+L      S +  E GY V TV DG +L I P++V +   + +L++LDS  S  Y +S
Sbjct: 66  ILNLQTIAGKSMMKFESGYNVETVFDGSKLGIEPYAV-EVLSNGELLILDSENSNIYKIS 124

Query: 74  FPLS------------------EESVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDM 113
             LS                    S  K +AG  +G  G+ DG+   AR + PK   VD 
Sbjct: 125 SSLSLCKCANVITSPMMSLVAPTNSRPKLVAGSAEGYSGHVDGKLREARMNHPKGITVDD 184

Query: 114 KGNIYVADKSNHVIRKITN----------------------------------------- 132
           +GNIYVAD  N  IRKI++                                         
Sbjct: 185 RGNIYVADIMNMAIRKISDSGVTTIAGGKLSRGGGHVDGPSEEAKFSNDFDVVYVGSSCS 244

Query: 133 ---LDHGNQLIRQINLKPEDCSKSSQSG 157
              +D GNQ IR+I L+ +DC+   +SG
Sbjct: 245 LLVIDRGNQAIREIQLRFDDCAYQYESG 272


>gi|11994597|dbj|BAB02652.1| unnamed protein product [Arabidopsis thaliana]
          Length = 511

 Score = 73.9 bits (180), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/170 (30%), Positives = 76/170 (44%), Gaps = 46/170 (27%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           L  E+GY V TV++G+ + + P+  I      +L  +D   S    ++ PLS+ S  + +
Sbjct: 71  LQFENGYLVETVVEGNDIGVVPYK-IRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLV 129

Query: 86  AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
           AG   G  G++DG+P  ARF+ P+   +D KGN+YVAD  N  IRKI +           
Sbjct: 130 AGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKS 189

Query: 133 --------------------------------LDHGNQLIRQINLKPEDC 150
                                           +D GN  +RQI+L  EDC
Sbjct: 190 NIAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDC 239


>gi|22331093|ref|NP_188104.2| NHL domain-containing protein [Arabidopsis thaliana]
 gi|20260314|gb|AAM13055.1| unknown protein [Arabidopsis thaliana]
 gi|25083516|gb|AAN72090.1| unknown protein [Arabidopsis thaliana]
 gi|332642056|gb|AEE75577.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 492

 Score = 73.9 bits (180), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/170 (30%), Positives = 76/170 (44%), Gaps = 46/170 (27%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           L  E+GY V TV++G+ + + P+  I      +L  +D   S    ++ PLS+ S  + +
Sbjct: 71  LQFENGYLVETVVEGNDIGVVPYK-IRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLV 129

Query: 86  AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
           AG   G  G++DG+P  ARF+ P+   +D KGN+YVAD  N  IRKI +           
Sbjct: 130 AGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKS 189

Query: 133 --------------------------------LDHGNQLIRQINLKPEDC 150
                                           +D GN  +RQI+L  EDC
Sbjct: 190 NIAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDC 239


>gi|42572439|ref|NP_974315.1| NHL domain-containing protein [Arabidopsis thaliana]
 gi|332642057|gb|AEE75578.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 493

 Score = 73.9 bits (180), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/170 (30%), Positives = 76/170 (44%), Gaps = 46/170 (27%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           L  E+GY V TV++G+ + + P+  I      +L  +D   S    ++ PLS+ S  + +
Sbjct: 71  LQFENGYLVETVVEGNDIGVVPYK-IRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLV 129

Query: 86  AG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN----------- 132
           AG   G  G++DG+P  ARF+ P+   +D KGN+YVAD  N  IRKI +           
Sbjct: 130 AGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKS 189

Query: 133 --------------------------------LDHGNQLIRQINLKPEDC 150
                                           +D GN  +RQI+L  EDC
Sbjct: 190 NIAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDC 239


>gi|297834352|ref|XP_002885058.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297330898|gb|EFH61317.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score = 73.6 bits (179), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 50/171 (29%), Positives = 78/171 (45%), Gaps = 48/171 (28%)

Query: 26  LLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKR 84
           L  E+GY V TV++G+++ + P+ + +   G  +L  +D   S    ++ PLS+ S  + 
Sbjct: 67  LQFENGYLVETVVEGNEIGVVPYKIRVSHDG--ELYAVDELNSNIMKITPPLSQYSRGRL 124

Query: 85  LAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------- 132
           +AG   G  G++DG+P  ARF+ P+   +D KGN+YV D  N  IRKI +          
Sbjct: 125 VAGSFQGKTGHADGKPSEARFNHPRGVTMDDKGNVYVGDTLNLAIRKIGDSGVTTIAGGK 184

Query: 133 ---------------------------------LDHGNQLIRQINLKPEDC 150
                                            +D GN  +RQI+L  EDC
Sbjct: 185 SNIAGYRDGPSEDAKFSNDFDVVYVRSTCSLLVIDRGNAALRQISLSDEDC 235


>gi|302768697|ref|XP_002967768.1| hypothetical protein SELMODRAFT_67906 [Selaginella moellendorffii]
 gi|300164506|gb|EFJ31115.1| hypothetical protein SELMODRAFT_67906 [Selaginella moellendorffii]
          Length = 186

 Score = 68.6 bits (166), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 64/113 (56%), Gaps = 2/113 (1%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
           L+ + +L+E G+TV T+ D  +   N  ++   P +  ++ LDSS +  + L  PLS+ S
Sbjct: 7   LLKTFVLVEPGFTVKTLFDLGKHSANAFALYPDPRNKFVLALDSSGNRIWKLRLPLSQNS 66

Query: 81  VVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++  AG   G  GY DG    + F++P+S A+   G ++VAD  N  IRKI+
Sbjct: 67  SLEAFAGSFVGESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKIS 119


>gi|302821461|ref|XP_002992393.1| hypothetical protein SELMODRAFT_47902 [Selaginella moellendorffii]
 gi|300139809|gb|EFJ06543.1| hypothetical protein SELMODRAFT_47902 [Selaginella moellendorffii]
          Length = 183

 Score = 67.4 bits (163), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 63/113 (55%), Gaps = 2/113 (1%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES 80
           L+ + +L+E G+TV T+ D  +   N  ++   P    ++ LDSS +  + L  PLS+ S
Sbjct: 7   LLKTFVLVEPGFTVKTLFDLGKHSANAFALYPDPRKKFVLALDSSGNRIWKLRLPLSQNS 66

Query: 81  VVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++  AG   G  GY DG    + F++P+S ++   G ++VAD  N  IRKI+
Sbjct: 67  SLEAFAGSFVGESGYVDGPAAKSLFNRPQSLSICDNGAVFVADTRNLAIRKIS 119


>gi|414870491|tpg|DAA49048.1| TPA: hypothetical protein ZEAMMB73_516870 [Zea mays]
          Length = 271

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/165 (29%), Positives = 79/165 (47%), Gaps = 25/165 (15%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG- 87
           E GYTV  ++D  +L I P+S+    G S+++++DS+    Y+++ PLS +S  K +AG 
Sbjct: 105 ECGYTVEKILDYSKLGIEPYSMEVTQG-SEILIMDSN---IYSMALPLSRDSRPKLVAGS 160

Query: 88  -DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN-------------- 132
            +G  G+ DG     R   PK   VD KG     D  +   +  T+              
Sbjct: 161 LEGFPGHVDGNLREGRIYHPKGVTVDDKGRGGQVDGPSDEAKSSTDFEVCYYIGSSCSLL 220

Query: 133 -LDHGNQLIRQINLKPEDCSKSSQS----GSGMFWVTVFSPYLIS 172
            +D GNQ IR+I L  +DC    ++    G  +  V  F  Y+++
Sbjct: 221 VIDRGNQTIREIQLHFDDCVYQHEADFPLGVALLAVAAFLGYMLA 265


>gi|124002516|ref|ZP_01687369.1| hypothetical protein M23134_05219 [Microscilla marina ATCC 23134]
 gi|123992345|gb|EAY31713.1| hypothetical protein M23134_05219 [Microscilla marina ATCC 23134]
          Length = 2385

 Score = 63.9 bits (154), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 30/62 (48%), Positives = 40/62 (64%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           V  +AG GV G+ D     A+F+ P   A+DM GN+YVADK+NH IR ITN   G+ ++R
Sbjct: 631 VSIVAGSGVAGFKDDNGSLAQFNYPSGLALDMAGNLYVADKNNHAIRMITNPSGGSPVVR 690

Query: 142 QI 143
            I
Sbjct: 691 TI 692



 Score = 57.4 bits (137), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 32/82 (39%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-----GVQGYSDGEPGSARFDKPKSFA 110
           + +L V D +  A   ++ P     VV+ +AG+      V G   G    A+F++P   A
Sbjct: 663 AGNLYVADKNNHAIRMITNPSGGSPVVRTIAGNSSYPTAVSGNVTGALAVAKFNEPSGVA 722

Query: 111 VDMKGNIYVADKSNHVIRKITN 132
           VD  GNIYVADK+NH I+KI N
Sbjct: 723 VDAAGNIYVADKNNHRIKKIAN 744



 Score = 47.8 bits (112), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 86  AGD---GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           AGD   G+ G+ DG   SA+F  P    V+  G++Y+AD  N VIRKI+
Sbjct: 812 AGDVANGIAGHQDGNAASAKFRSPAGITVNAVGDVYIADTHNQVIRKIS 860



 Score = 41.6 bits (96), Expect = 0.16,   Method: Composition-based stats.
 Identities = 23/65 (35%), Positives = 38/65 (58%), Gaps = 9/65 (13%)

Query: 71  TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRK 129
           TL+ P+++ +         + G +DG   +ARF  P S A+D+ G  +YVADK N++IR+
Sbjct: 749 TLAGPMNDAA--------SIPGRTDGAADAARFFFPTSVALDITGAQLYVADKLNNIIRQ 800

Query: 130 ITNLD 134
           +   D
Sbjct: 801 VNTAD 805


>gi|325102830|ref|YP_004272484.1| cell surface receptor IPT/TIG domain protein [Pedobacter saltans
           DSM 12145]
 gi|324971678|gb|ADY50662.1| cell surface receptor IPT/TIG domain protein [Pedobacter saltans
           DSM 12145]
          Length = 491

 Score = 63.5 bits (153), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 46/84 (54%), Gaps = 4/84 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P+ V   P + DLIV D +  A   +S    +  +V ++AG G  G  DG+   A F+ P
Sbjct: 170 PNGVSFDPTTGDLIVADRTAQAIKRIS----KAGIVTKIAGTGTAGRVDGDISIASFNNP 225

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
              AVD  GNIYVAD  NH IRKI
Sbjct: 226 YKTAVDKYGNIYVADNGNHRIRKI 249



 Score = 49.7 bits (117), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 30/43 (69%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
            GY +G PG +RF  P   AVD +GN+++AD +N+VIRKI  +
Sbjct: 443 NGYVNGLPGISRFYNPTDVAVDEEGNVFIADMNNYVIRKIVKI 485



 Score = 45.1 bits (105), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 3/87 (3%)

Query: 46  NPH-SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           NP+ + +D+ G  ++ V D+       +    +  S +   AG    GY+DG   +   +
Sbjct: 224 NPYKTAVDKYG--NIYVADNGNHRIRKIDLSTNTVSTIAGGAGAATSGYTDGIGANGLLN 281

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
            P   AVD   N+YVAD +NH +RKIT
Sbjct: 282 TPTGLAVDDDLNVYVADAANHAVRKIT 308


>gi|223935789|ref|ZP_03627704.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223895390|gb|EEF61836.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 755

 Score = 62.4 bits (150), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 34/68 (50%), Positives = 43/68 (63%), Gaps = 3/68 (4%)

Query: 67  SAFYTLSF-PLSEES--VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
           + F +L+F PLS      V  LAG+  QG +DG   SARF+ P   AVD  GN+YVAD +
Sbjct: 9   TVFLSLAFVPLSHSQPLTVSTLAGNAGQGSADGNNSSARFNLPGGVAVDKTGNLYVADTA 68

Query: 124 NHVIRKIT 131
           NH IRKI+
Sbjct: 69  NHTIRKIS 76



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 28/56 (50%), Positives = 34/56 (60%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S    V  LAG    G +DG  GSA+F  PK  AVD  GN++V+D  NH IRKIT
Sbjct: 343 ISSGGAVTTLAGSASIGNADGTGGSAKFYWPKGTAVDASGNVFVSDTFNHTIRKIT 398



 Score = 53.5 bits (127), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 32/81 (39%), Positives = 47/81 (58%), Gaps = 8/81 (9%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSF 109
           +D+ G  +L V D++      +S       VV   AG  GV G ++G+  +ARF++P+  
Sbjct: 56  VDKTG--NLYVADTANHTIRKIS-----GGVVSTFAGLAGVSGSANGKGSAARFNQPQGV 108

Query: 110 AVDMKGNIYVADKSNHVIRKI 130
           AVD  G +YVAD  NH+IRKI
Sbjct: 109 AVDTNGIVYVADTGNHIIRKI 129



 Score = 50.1 bits (118), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 30/77 (38%), Positives = 45/77 (58%), Gaps = 6/77 (7%)

Query: 62  LDSSRSAFYTLSF-----PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKG 115
           +D+S + F + +F      ++    V  LAG  G  G ++G  G A+F  P+  AVD  G
Sbjct: 378 VDASGNVFVSDTFNHTIRKITAAGTVSTLAGTAGSSGTNNGVGGGAQFYAPQGIAVDTGG 437

Query: 116 NIYVADKSNHVIRKITN 132
           N YVAD +N+VIRK+T+
Sbjct: 438 NAYVADTANNVIRKVTS 454



 Score = 48.9 bits (115), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/51 (47%), Positives = 33/51 (64%), Gaps = 1/51 (1%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAG  GV+G  DG   +A+F  P++ A+D   N+YV+D  NH IRKI+
Sbjct: 458 VTTLAGTAGVEGQGDGTGSNAQFSGPQAVALDGAANVYVSDTGNHTIRKIS 508



 Score = 45.4 bits (106), Expect = 0.010,   Method: Composition-based stats.
 Identities = 20/32 (62%), Positives = 22/32 (68%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +ARF  P   AVD  GN+YVAD  NH IRKIT
Sbjct: 537 TARFYSPSGLAVDSSGNVYVADTGNHTIRKIT 568



 Score = 44.7 bits (104), Expect = 0.019,   Method: Composition-based stats.
 Identities = 26/85 (30%), Positives = 46/85 (54%), Gaps = 2/85 (2%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           ++ +  V  LAG  GV G +DG    ARF +P+  ++D +GN++V D  NH +R +    
Sbjct: 567 ITADGSVSTLAGLPGVWGNADGTNRDARFFQPEGISIDSQGNLFVMDSGNHTMRMLI-AS 625

Query: 135 HGNQLIRQINLKPEDCSKSSQSGSG 159
             N ++  I  +P+    +  +G+G
Sbjct: 626 GTNWIVTTIAGQPDLGGAADGTGNG 650



 Score = 40.4 bits (93), Expect = 0.33,   Method: Composition-based stats.
 Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 1/52 (1%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           V  LAG  G  G ++G   +A+F +P   AV   G +YVAD  NH IR +T+
Sbjct: 240 VTTLAGSPGNYGSTNGTGSAAQFYQPMGVAVAANGTVYVADNLNHTIRAVTS 291


>gi|323454387|gb|EGB10257.1| hypothetical protein AURANDRAFT_62887 [Aureococcus anophagefferens]
          Length = 434

 Score = 61.6 bits (148), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 45/79 (56%), Gaps = 13/79 (16%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRL----------AGDGVQGYSDGEPGSARFDKPKSF 109
           + LD++R+  Y      S    V+RL          AGDG +GY+DG+ G ARFD+P + 
Sbjct: 247 VCLDAARNCLYVAD---SGNHCVRRLDLASGAVTTVAGDGTRGYADGDAGGARFDEPTAV 303

Query: 110 AVDMKGNIYVADKSNHVIR 128
           A+D  G +YVAD+ N  +R
Sbjct: 304 ALDADGALYVADQENRRVR 322



 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 40/87 (45%), Gaps = 7/87 (8%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG-YSDGEPGSARFD 104
            P  V   P + D+ V     S ++     +     V  LAG G +G  +DG   +A F 
Sbjct: 188 GPWGVAAHPATGDVYV-----SEYFGCKIRVIRRGDVTTLAGSGPRGGRADGVGPAASFA 242

Query: 105 KPKSFAVDMKGN-IYVADKSNHVIRKI 130
            P+   +D   N +YVAD  NH +R++
Sbjct: 243 GPRGVCLDAARNCLYVADSGNHCVRRL 269


>gi|373955434|ref|ZP_09615394.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
 gi|373892034|gb|EHQ27931.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
          Length = 459

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/56 (50%), Positives = 33/56 (58%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S   VV  LAG G  GY+DG    A F  P   AVD  GNIYVAD+  + IRK+T
Sbjct: 243 ISAAGVVTTLAGKGSAGYADGTGADAVFKSPAGLAVDASGNIYVADQGTNTIRKVT 298



 Score = 57.4 bits (137), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 34/56 (60%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   V    AG G  GY DG   +A+F  PK  A D +GNIYVAD  N++IRKI+
Sbjct: 189 ITAAGVTTTYAGTGTLGYLDGPAATAQFYAPKGVAADAQGNIYVADMGNNMIRKIS 244



 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  LAG    G  D    + ARF  P    VD  GN+YVAD +NH IRK+T
Sbjct: 297 VTSAGVVTTLAGAAASGQVDATTNTDARFSSPSGVTVDASGNVYVADLANHAIRKVT 353



 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 69  FYTLSFPLSEESVVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           F T S   +  + V   AG+   G + +G   +A+F  P+    D +GN+YVAD  N+VI
Sbjct: 127 FKTNSATFAITATVSTFAGNANGGGFVNGTGVNAQFHNPQGICTDAQGNMYVADSYNNVI 186

Query: 128 RKIT 131
           RKIT
Sbjct: 187 RKIT 190



 Score = 43.9 bits (102), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 35/54 (64%), Gaps = 1/54 (1%)

Query: 78  EESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
             +++  LAG  G  G+++G   +A F+ P++  +D +GNIYV D  N++IRKI
Sbjct: 400 TTNIIYSLAGVAGTSGFANGTNINALFNGPQALTLDSQGNIYVVDYYNNMIRKI 453


>gi|255531263|ref|YP_003091635.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
 gi|255344247|gb|ACU03573.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
          Length = 646

 Score = 60.5 bits (145), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 27/52 (51%), Positives = 36/52 (69%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +V   AG G  G  +G   SARFD P+  A+D  GN+Y+ADK+N+VIRKIT+
Sbjct: 327 LVSNYAGTGTAGLVNGAKASARFDSPEGLAIDASGNMYIADKNNNVIRKITS 378



 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 27/56 (48%), Positives = 36/56 (64%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  +AG+G  GY++G   +A+F  P    VD  GNIYVAD+ NH IRKIT
Sbjct: 483 ITAAGVVTTIAGNGTAGYTNGTGTAAQFKNPSGVDVDASGNIYVADRLNHRIRKIT 538



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 34/56 (60%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++    V   AG GV G++DG    A+F+ P   AVD  GN+YVAD+ N  IRKIT
Sbjct: 376 ITSTGTVSTFAGTGVAGFADGAGSIAKFNGPWKVAVDATGNVYVADRDNFKIRKIT 431



 Score = 48.1 bits (113), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   +V  LAG    GY+DG   +A+F +P   A+D  GN+ VAD ++H IRKIT
Sbjct: 430 ITPAGIVSTLAGS-TAGYADGTGSAAKFMQPLDVAIDPSGNVIVADNTSHRIRKIT 484



 Score = 39.3 bits (90), Expect = 0.88,   Method: Composition-based stats.
 Identities = 26/57 (45%), Positives = 35/57 (61%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++   VV  LAG G  G +DG  GSA+F  P    VD+ GN+YVAD  +  IRKI++
Sbjct: 537 ITTSGVVSSLAGTGTSGTTDGAAGSAKFSDPYGVTVDVSGNVYVADLISSRIRKISS 593


>gi|325104746|ref|YP_004274400.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
 gi|324973594|gb|ADY52578.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
          Length = 1769

 Score = 60.1 bits (144), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 47/135 (34%), Positives = 67/135 (49%), Gaps = 15/135 (11%)

Query: 52   DRPGSSDL------IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGS 100
            D PG+S +      +V+D+S + + + S       ++    V  +AG GVQG  DG    
Sbjct: 1098 DGPGTSAMFNSPQAMVMDASDNIYVSESSNHRIRKITPAGEVSTVAGKGVQGNRDGTKEE 1157

Query: 101  ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN--LDHGNQLIRQINLKPEDCSKSSQSGS 158
            A+F  P   A+DM GNIYVA+ SNH IRKIT      G   I  + + P     S+  GS
Sbjct: 1158 AQFWGPVGIALDMSGNIYVAEWSNHRIRKITQGITLTGTPGITDVGINPVALKASNTVGS 1217

Query: 159  G--MFWVTVFSPYLI 171
                F +TV  P ++
Sbjct: 1218 TSQQFDITVSGPAIL 1232



 Score = 55.8 bits (133), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 35/56 (62%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++    V   AG+G  GY+DG   SA F+ P++  +D   NIYV++ SNH IRKIT
Sbjct: 1079 ITPNGTVSTFAGNGTAGYADGPGTSAMFNSPQAMVMDASDNIYVSESSNHRIRKIT 1134



 Score = 55.5 bits (132), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 36/56 (64%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++ + +V  +AG G  G S+G   +A F+KP   A+D  GNIY+AD  NH IRKIT
Sbjct: 972  ITPQGLVSTIAGSGTNGNSNGVGTAASFNKPTGIAIDPWGNIYIADDLNHSIRKIT 1027



 Score = 48.9 bits (115), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++    V   AG+G  GY+DG    ARF++P     D  GN+YV++ SN+ IRKIT
Sbjct: 1026 ITPNGTVSTFAGNGTAGYADGVGVLARFNRPTGIITDALGNVYVSESSNY-IRKIT 1080



 Score = 46.2 bits (108), Expect = 0.006,   Method: Composition-based stats.
 Identities = 22/60 (36%), Positives = 36/60 (60%), Gaps = 8/60 (13%)

Query: 80  SVVKRLAGDG--------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           S+++++  DG          GY+DG   +A+F+ P+  A+D  GNIY+A++    IRKIT
Sbjct: 914 SLIRKITADGQVSTFAGSTFGYADGIGTAAKFNGPQGMAIDASGNIYIAERGASRIRKIT 973


>gi|405345633|ref|ZP_11022426.1| hypothetical protein A176_0011 [Chondromyces apiculatus DSM 436]
 gi|397093682|gb|EJJ24379.1| hypothetical protein A176_0011 [Myxococcus sp. (contaminant ex DSM
           436)]
          Length = 870

 Score = 60.1 bits (144), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 36/87 (41%), Positives = 49/87 (56%), Gaps = 5/87 (5%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES-VVKRLAGDG-VQGYSDGEPGSARF 103
           +P +V+  P + +L V DS       L     EE   V+  AG G V G++DG P  ARF
Sbjct: 611 SPTAVVAGP-TGELYVADSGNHVIRRLD--RGEEGWQVRTWAGQGFVAGFADGGPARARF 667

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +P + AVD  GN+YVAD+ NH IR +
Sbjct: 668 SRPMALAVDAAGNVYVADQDNHRIRMV 694



 Score = 43.1 bits (100), Expect = 0.053,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 29/58 (50%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQL 139
           V  +AG G  G+ DG    A    P +      G +YVAD  NHVIR++   + G Q+
Sbjct: 588 VSTIAGSGELGHRDGAGSQALLRSPTAVVAGPTGELYVADSGNHVIRRLDRGEEGWQV 645


>gi|431798813|ref|YP_007225717.1| NHL repeat protein [Echinicola vietnamensis DSM 17526]
 gi|430789578|gb|AGA79707.1| NHL repeat protein [Echinicola vietnamensis DSM 17526]
          Length = 655

 Score = 60.1 bits (144), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 23/48 (47%), Positives = 37/48 (77%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +AG G+ G+ +G   +A+FD P+  A+D +GNI+VAD+ N+VIRKI++
Sbjct: 338 IAGTGIAGFQNGSSSTAKFDNPEGIAIDQQGNIFVADRDNNVIRKISS 385



 Score = 59.7 bits (143), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 27/56 (48%), Positives = 36/56 (64%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S    V   AG G+ G++DG  G A+F+ P   AVD +GN+ VAD+ NH IRKIT
Sbjct: 383 ISSSGDVSTFAGTGIAGFTDGVAGVAKFNSPWKVAVDNQGNVIVADRGNHSIRKIT 438



 Score = 49.3 bits (116), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 23/49 (46%), Positives = 28/49 (57%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  +AG G  GY DG    A+F  P     D  GN++VAD+ NH IRKI
Sbjct: 496 VSTIAGTGNGGYVDGAGSQAQFYYPSGIDTDPNGNLFVADRKNHAIRKI 544



 Score = 46.6 bits (109), Expect = 0.006,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++    V  LAG    GY DG    A+FD+P   AV   GNI +AD  NH IR I
Sbjct: 437 ITPNGTVSTLAGT-TNGYQDGSGNQAKFDQPTDVAVLPNGNIVIADNRNHCIRMI 490



 Score = 40.4 bits (93), Expect = 0.32,   Method: Composition-based stats.
 Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 5/101 (4%)

Query: 40  GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG 99
           G Q +    S ID   + +L V D    A   +    S  +V     G+G +G  +G   
Sbjct: 512 GSQAQFYYPSGIDTDPNGNLFVADRKNHAIRKID---SYHNVSTVAGGNG-EGIQNGGIA 567

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
            A+FD P   AV   G + VAD  N+VIR+I N D+ + +I
Sbjct: 568 VAKFDDPYGVAVGQNGKVLVADLDNNVIREI-NGDYVSTII 607


>gi|430809992|ref|ZP_19437107.1| NHL repeat containing protein [Cupriavidus sp. HMR-1]
 gi|429497580|gb|EKZ96110.1| NHL repeat containing protein [Cupriavidus sp. HMR-1]
          Length = 536

 Score = 59.3 bits (142), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 29/56 (51%), Positives = 35/56 (62%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S    V  LAG GV G +DG   +A F+ P   AVD  GN+YVAD  NH+IRKIT
Sbjct: 263 ISPYGAVTTLAGSGVAGSTDGTGTAASFNLPSGVAVDGAGNVYVADTYNHLIRKIT 318



 Score = 57.4 bits (137), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 35/56 (62%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  LAG G  G++DG   +A F++P   AVD  GNIYV+D  N  IRKI+
Sbjct: 371 VTPTGVVTTLAGTGTAGFADGPAQAAAFNRPAGIAVDAAGNIYVSDLGNARIRKIS 426



 Score = 43.1 bits (100), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 30/56 (53%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S  +VV  LAG   +G +DG   +A F      A D  GN+Y  D  ++ +RK+T
Sbjct: 425 ISPANVVTTLAGSTTRGSADGTGAAASFTSLLRIATDAAGNVYAVDAGSNAVRKVT 480



 Score = 36.6 bits (83), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 16/26 (61%), Positives = 18/26 (69%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
           P   AVD  GN+YVAD  NH IRKI+
Sbjct: 239 PIDVAVDGTGNVYVADYGNHRIRKIS 264



 Score = 35.8 bits (81), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 14/32 (43%), Positives = 21/32 (65%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +A F  P    VD+ GN++VAD  NH++R +T
Sbjct: 341 AASFYYPSDIVVDVAGNVFVADTYNHLVRMVT 372


>gi|302791603|ref|XP_002977568.1| hypothetical protein SELMODRAFT_417419 [Selaginella moellendorffii]
 gi|300154938|gb|EFJ21572.1| hypothetical protein SELMODRAFT_417419 [Selaginella moellendorffii]
          Length = 288

 Score = 58.9 bits (141), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 39/166 (23%)

Query: 31  GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
           G+TV T+ D  +  +N   +   P +  ++  DSS +  + L  PLS+ S ++  AG   
Sbjct: 38  GFTVKTLFDLGKHSVNAFGLYPDPRNKFMLAPDSSGNRIWKLRLPLSQNSSLEAFAG-WE 96

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL----------------- 133
            GY DG    + F++P+S A+   G I+VAD +   IRKI+ +                 
Sbjct: 97  SGYIDGPAAKSLFNRPQSLAMCGNGAIFVAD-TRMAIRKISKVTTIAGGSSRKPGIADGT 155

Query: 134 -------------------DHGNQLIRQINL-KPEDCSKSSQSGSG 159
                              D GN LIR+I +  P+ C  S  + SG
Sbjct: 156 ARFSSEFSLACSCGSLPIADRGNWLIREIQIDDPKSCDSSDSTVSG 201


>gi|255531264|ref|YP_003091636.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
 gi|255344248|gb|ACU03574.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
          Length = 439

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 38/68 (55%)

Query: 64  SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
           S+    +T     + E +V   AGDG  G  +G    A+F  P+  AVD +GN+ VAD+ 
Sbjct: 101 SAEGPVFTYKAAPASEYIVSTFAGDGTAGLFNGAAADAQFRNPEGLAVDAQGNVIVADRQ 160

Query: 124 NHVIRKIT 131
           NH IRKI+
Sbjct: 161 NHTIRKIS 168



 Score = 56.6 bits (135), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 28/56 (50%), Positives = 32/56 (57%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S    V  LAGDG  GY D     A+F  P    VD +GNI VAD+ NH IRKIT
Sbjct: 274 VSPSGTVTTLAGDGTAGYLDATGVQAKFRNPSGLTVDQQGNIIVADRLNHRIRKIT 329



 Score = 53.5 bits (127), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 46/86 (53%), Gaps = 7/86 (8%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           NP  + +D  G  ++IV D        +S P  +   V  +AGDG  GY+DG   +A+F 
Sbjct: 142 NPEGLAVDAQG--NVIVADRQNHTIRKIS-PAGQ---VTTIAGDGTAGYADGTGTAAKFS 195

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKI 130
            P   A+D  GNI VAD+ N  IRKI
Sbjct: 196 SPWKLAIDPLGNIIVADRDNFKIRKI 221



 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  V  LAG    G++DG   +A+F +P     D +GNIYVAD + H IRK++
Sbjct: 221 IAPDGAVTTLAGS-TAGFADGTGSAAKFMQPLDVVADAQGNIYVADNTAHRIRKVS 275



 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 55/104 (52%), Gaps = 11/104 (10%)

Query: 31  GYTVTTVIDGHQLEI---NPHSVIDRPGSSDLIV-LDSSRSAFYTLSFPLSEESVVKRLA 86
           G   T ++DG  L     +P+ V    G + L+  L ++R    T   P+ +   V  LA
Sbjct: 339 GAGTTGLLDGDALTAKFADPYGVAVDAGGNILVAELTNARIRKIT---PVGQ---VSTLA 392

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G    G++DG   +A+F++P    +D KGNIYVA+ +NH IR I
Sbjct: 393 GSSA-GFADGLSVNAKFNQPTDLDIDAKGNIYVAEVTNHRIRMI 435



 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 32/56 (57%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++    V  +AG G  G  DG+  +A+F  P   AVD  GNI VA+ +N  IRKIT
Sbjct: 328 ITPAGTVSTIAGAGTTGLLDGDALTAKFADPYGVAVDAGGNILVAELTNARIRKIT 383


>gi|402814801|ref|ZP_10864394.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
 gi|402507172|gb|EJW17694.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
          Length = 571

 Score = 58.5 bits (140), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 24/40 (60%), Positives = 28/40 (70%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           QG+ DGE    +FD+P   A D KGN+YVAD  NHVIRKI
Sbjct: 129 QGWEDGEQAQVQFDQPTGLAADKKGNLYVADAGNHVIRKI 168



 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/49 (53%), Positives = 33/49 (67%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  +AGDG+ G+ DG  G+ARF  P++ AV   G IYV D  NHVIR+I
Sbjct: 174 VTTVAGDGIPGWKDGAAGTARFHTPRAIAVADDGAIYVTDSLNHVIRRI 222



 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 50/117 (42%), Gaps = 13/117 (11%)

Query: 14  LLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS 73
           L   +F+  S   L+ DG          ++ +     ID    S   +  SS +A YTL 
Sbjct: 257 LTEAMFNEPSGIALMNDGKLAVADTGNQRIRL-----IDLKQGSVSTIAGSSSTAGYTLP 311

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
                  V + L   G  GY DG   ++ F+ P   A+  +  I VAD+ NHV+R I
Sbjct: 312 ------GVQRVLYAPG--GYRDGAASNSIFNSPTGIAITNENGIVVADRWNHVVRYI 360


>gi|455647406|gb|EMF26379.1| hypothetical protein H114_24527 [Streptomyces gancidicus BKS 13-15]
          Length = 627

 Score = 58.5 bits (140), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 18/116 (15%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+  P S   +V D++R     L+     E+V++R  G G +G  DG PG ARF +P
Sbjct: 186 PGKVLRLPSSGTFLVTDTTRHQVVELA--EDGETVLRRF-GSGGRGLEDGPPGRARFQEP 242

Query: 107 KSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFW 162
           +  A+   G + VAD  NH +R+               L PE    ++ +G+G  W
Sbjct: 243 QGLALLPDGAVVVADTVNHALRR---------------LDPESGEVTTLAGTGSPW 283


>gi|325104251|ref|YP_004273905.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
 gi|324973099|gb|ADY52083.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
          Length = 532

 Score = 58.5 bits (140), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 65/128 (50%), Gaps = 20/128 (15%)

Query: 15  LSLLFSLVSSGLLLEDGYTVTTV--------IDGHQLEI---NPHSV-IDRPGSSDLIVL 62
           L+L+  L +  +     YTVTTV        +DG+ L+    NP  + +D  G  D+I+ 
Sbjct: 6   LTLILMLGTYMVTKSQNYTVTTVAGSTTFGFVDGNGLDARFRNPDGILVDMNG--DIIIT 63

Query: 63  DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK 122
           D +  +       ++   VV  LAG GV GY++G+PG  +F+ P    VD  GNI V +K
Sbjct: 64  DRTNHSIR----KMTTAGVVSTLAGTGVSGYANGKPG--QFNTPWQSTVDAAGNIIVVEK 117

Query: 123 SNHVIRKI 130
               IRKI
Sbjct: 118 DGARIRKI 125



 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 6/106 (5%)

Query: 48  HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK 107
            S +D  G+  ++  D +R     L      +  V  +AG G  GYSDG    ARFD   
Sbjct: 103 QSTVDAAGNIIVVEKDGARIRKIAL------DGTVSTIAGTGSAGYSDGAVSVARFDNAL 156

Query: 108 SFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKS 153
              VD  GNI++AD++N  +RKIT    GN     ++    D + S
Sbjct: 157 DAVVDSDGNIFIADRNNRRVRKITPGAGGNWTTATVSTVAGDGTTS 202



 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 31/55 (56%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           S+   V  +AG    G+ DG    ARF  P    VDM G+I + D++NH IRK+T
Sbjct: 20  SQNYTVTTVAGSTTFGFVDGNGLDARFRNPDGILVDMNGDIIITDRTNHSIRKMT 74



 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 28/51 (54%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + V  +AG G  G  DG    A F++P     D  GNIYVAD  NH IRK+
Sbjct: 299 ATVTTIAGTGSTGRIDGLGNVATFNQPYDVVTDENGNIYVADNVNHSIRKL 349


>gi|255532869|ref|YP_003093241.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
 gi|255345853|gb|ACU05179.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
          Length = 2296

 Score = 58.2 bits (139), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 27/51 (52%), Positives = 33/51 (64%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           VV   AG GV+G+ DG   +A+F  P   A D  GN+YVAD+ NH IRKIT
Sbjct: 642 VVSTFAGSGVKGFKDGTGTAAQFAFPTGLAADAAGNVYVADRDNHRIRKIT 692



 Score = 55.8 bits (133), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 37/56 (66%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  LAG+G   ++DG   +ARF+ P   AVD  GN++VAD++N  IRK+T
Sbjct: 691 ITAAGVVTTLAGNGTADFADGTQANARFNMPVGVAVDANGNVFVADRNNFRIRKVT 746



 Score = 50.8 bits (120), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 1/51 (1%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           VVKR  G  + GY++G   SA F+ P   A+D  GN Y+AD+ NH IRKI+
Sbjct: 481 VVKRFVGKDL-GYANGTGTSAMFNSPAGIAMDAAGNTYLADRGNHRIRKIS 530



 Score = 40.4 bits (93), Expect = 0.34,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 35/56 (62%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S   VV  LAG G  G++DG   +A+F+ P   AVD  GN+YV+D  N  +RKIT
Sbjct: 529 ISPAGVVTTLAGSGTAGFADGAGAAAQFNDPSGVAVDAAGNVYVSDTRNFRVRKIT 584



 Score = 37.7 bits (86), Expect = 2.1,   Method: Composition-based stats.
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 3/57 (5%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           S+   V   AG GV GY +G+     +A+F        D   N+YV D  N+ IRKI
Sbjct: 33  SQAQTVSPYAGTGVDGYYEGDNNDAITAQFSYIGGITTDANNNLYVTDSRNYCIRKI 89



 Score = 36.2 bits (82), Expect = 6.1,   Method: Composition-based stats.
 Identities = 21/50 (42%), Positives = 28/50 (56%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V   AG G  GY++G   +A F      A+D  GN+YV D+  + IRKIT
Sbjct: 589 VSTFAGTGTYGYANGTTTTATFTYLLGIAIDNSGNLYVTDQDVNSIRKIT 638


>gi|421740691|ref|ZP_16178929.1| NHL repeat protein [Streptomyces sp. SM8]
 gi|406690906|gb|EKC94689.1| NHL repeat protein [Streptomyces sp. SM8]
          Length = 444

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 47/76 (61%), Gaps = 3/76 (3%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S + +V D++R     L+     ESVV+R+ G+G +G +DG P  ARF +P+  A+  +G
Sbjct: 16  SGNFLVSDTTRHQLVELAE--DGESVVRRIGGEGERGLADGGPAEARFSEPQGLALTPEG 73

Query: 116 N-IYVADKSNHVIRKI 130
           + + VAD  NH +R++
Sbjct: 74  DAVIVADTVNHALRRV 89


>gi|348676369|gb|EGZ16187.1| hypothetical protein PHYSODRAFT_506757 [Phytophthora sojae]
          Length = 1981

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 39/63 (61%), Gaps = 2/63 (3%)

Query: 70  YTLSFPLS--EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           Y+ +F  S   +  V  +AG+G  G+ DG   +ARF+ P+  AVD  G +YVAD +NH I
Sbjct: 190 YSAAFGASGRAQGRVTTVAGEGTAGFLDGPAATARFNHPRGVAVDSNGVVYVADTANHRI 249

Query: 128 RKI 130
           RKI
Sbjct: 250 RKI 252



 Score = 50.8 bits (120), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 21/39 (53%), Positives = 27/39 (69%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G +DG+P  ARFD P   AVD  G ++VAD  NH+IR+I
Sbjct: 344 GLADGDPLGARFDSPMGVAVDADGVVFVADTGNHLIRRI 382



 Score = 43.9 bits (102), Expect = 0.033,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 6/56 (10%)

Query: 81  VVKRLAGDGVQGYSDGEP-GSARFDKPKSFAV-DMKG----NIYVADKSNHVIRKI 130
           +V  LAGDG++G++DG    +ARF  P   AV +  G     ++VAD  NH IR+I
Sbjct: 258 MVSTLAGDGIEGFADGAALSAARFSYPSDVAVLETNGGATVTVFVADTGNHRIRQI 313



 Score = 40.0 bits (92), Expect = 0.43,   Method: Composition-based stats.
 Identities = 24/71 (33%), Positives = 34/71 (47%), Gaps = 5/71 (7%)

Query: 66  RSAFYTLSFPLSEESV-----VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           R   Y   F  + E+V     V  LAG  V G+ DGE   A F+ P   A    G +YVA
Sbjct: 450 RRVTYDGGFASTIETVTSTNRVVTLAGSNVPGHIDGEGNEATFNAPAGVAFAADGRVYVA 509

Query: 121 DKSNHVIRKIT 131
             ++  +R+I+
Sbjct: 510 SSTDCTVRQIS 520


>gi|344924046|ref|ZP_08777507.1| NHL repeat containing protein [Candidatus Odyssella
           thessalonicensis L13]
          Length = 368

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 42/78 (53%), Gaps = 5/78 (6%)

Query: 60  IVLDSSRSAFYTLSFP-----LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           + +DSS + + T ++      ++   V   LAG    GY++G   SARF  P   AVD  
Sbjct: 222 VAVDSSGTVYVTDTYTQRVRKITSGGVTSLLAGSNTIGYAEGTGASARFSSPYGIAVDSS 281

Query: 115 GNIYVADKSNHVIRKITN 132
           G  YVAD  NH IRKIT+
Sbjct: 282 GTAYVADSDNHRIRKITS 299



 Score = 53.1 bits (126), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++   V   LAG   QGY++G    ARFD+P S AVD  G +YVAD  N  IR+IT+
Sbjct: 136 ITSAGVTSLLAGS-AQGYAEGTGSGARFDRPYSVAVDSSGTVYVADFFNSRIRRITS 191



 Score = 49.7 bits (117), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 25/52 (48%), Positives = 32/52 (61%), Gaps = 1/52 (1%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +V  LAG    GY++G   SARF+ P   AV   G IYVAD +NH IR I++
Sbjct: 34  IVSTLAGS-TGGYAEGTGASARFNYPYGIAVHSSGTIYVADSANHRIRSISS 84



 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 52  DRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           DRP S   + +DSS + +    F      ++   V   LAG    GY +G  G+A+F  P
Sbjct: 163 DRPYS---VAVDSSGTVYVADFFNSRIRRITSAGVTSTLAGSSTGGYLEGTGGAAQFGTP 219

Query: 107 KSFAVDMKGNIYVADKSNHVIRKITN 132
              AVD  G +YV D     +RKIT+
Sbjct: 220 IDVAVDSSGTVYVTDTYTQRVRKITS 245



 Score = 40.4 bits (93), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 24/33 (72%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           SA+F+ P   AVD  G +YV++ +NH IRKIT+
Sbjct: 106 SAQFNNPYGVAVDSSGTLYVSEYTNHRIRKITS 138



 Score = 39.3 bits (90), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 22/32 (68%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +F+ P    VD  G++Y+AD SNH+IRKI  L
Sbjct: 323 QFNYPAGIDVDSSGSLYIADSSNHLIRKIDAL 354


>gi|302848538|ref|XP_002955801.1| hypothetical protein VOLCADRAFT_119185 [Volvox carteri f.
           nagariensis]
 gi|300258994|gb|EFJ43226.1| hypothetical protein VOLCADRAFT_119185 [Volvox carteri f.
           nagariensis]
          Length = 1149

 Score = 57.4 bits (137), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 29/71 (40%), Positives = 42/71 (59%), Gaps = 9/71 (12%)

Query: 73  SFPLSEESVVKRLAGDG---------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
           S  +S++  ++R+A DG           G++DG  GSARF   +  AVD  GNI+VAD S
Sbjct: 241 SLVVSDQHCLRRVASDGFVTTIAGGTTPGHADGPSGSARFYNLRGVAVDGDGNIFVADSS 300

Query: 124 NHVIRKITNLD 134
           NH +R++T  D
Sbjct: 301 NHCVRQLTAAD 311



 Score = 41.6 bits (96), Expect = 0.16,   Method: Composition-based stats.
 Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 5/78 (6%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           G  ++ V DSS      L+   + +S+V  L G  G  G+ DG    ARF  P   AV++
Sbjct: 290 GDGNIFVADSSNHCVRQLT---AADSIVSTLVGSPGNAGFRDGTGTDARFRNPCGAAVNL 346

Query: 114 K-GNIYVADKSNHVIRKI 130
           + G + VAD  N+ +RK+
Sbjct: 347 QDGTLVVADAGNNRLRKV 364


>gi|291452573|ref|ZP_06591963.1| NHL repeat containing protein [Streptomyces albus J1074]
 gi|291355522|gb|EFE82424.1| NHL repeat containing protein [Streptomyces albus J1074]
          Length = 613

 Score = 57.4 bits (137), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 28/76 (36%), Positives = 47/76 (61%), Gaps = 3/76 (3%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S + +V D++R     L+     ESVV+R+ G+G +G +DG P  ARF +P+  A+  +G
Sbjct: 185 SGNFLVSDTTRHQLVELA--EDGESVVRRIGGEGERGLADGGPAEARFSEPQGLALTPEG 242

Query: 116 N-IYVADKSNHVIRKI 130
           + + VAD  NH +R++
Sbjct: 243 DAVIVADTVNHALRRV 258



 Score = 37.4 bits (85), Expect = 3.2,   Method: Composition-based stats.
 Identities = 25/77 (32%), Positives = 36/77 (46%), Gaps = 6/77 (7%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVD 112
           G + L V DS  SA   +      E  V  + G G+   G+ DGE   A F  P +  V 
Sbjct: 354 GDARLWVADSETSALRWVD----PEGTVHTVVGTGLFDFGHRDGEAAQALFQHPLAVTVL 409

Query: 113 MKGNIYVADKSNHVIRK 129
             G++ V+D  NH +R+
Sbjct: 410 PDGSVAVSDTYNHALRR 426


>gi|409204160|ref|ZP_11232357.1| putative outer membrane adhesin-like protein [Pseudoalteromonas
           flavipulchra JG1]
          Length = 2380

 Score = 56.6 bits (135), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 39/102 (38%), Positives = 52/102 (50%), Gaps = 3/102 (2%)

Query: 32  YTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYT-LSFPLSEESVVKRLAGDG 89
           +T T V    +  +  +S+    P   D+ V  S+ +   + LS   S E+ V  LAG  
Sbjct: 489 FTSTAVTASTEDSVYKYSIATSDPDKDDVTVSVSASTPLPSWLSLNTSTEATVSTLAGQS 548

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
             GY+DG   +A F  P   A D   N+YVAD SNHVIRKIT
Sbjct: 549 -SGYADGTGTAASFKSPYDVATDSNNNVYVADYSNHVIRKIT 589



 Score = 56.2 bits (134), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 28/56 (50%), Positives = 36/56 (64%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ E VV  LAG G  G  +G   +A F+ PK+  VD  GN+YVAD SN+ IRKIT
Sbjct: 588 ITPEGVVTTLAGSGTAGSDEGTGSAASFNFPKAVTVDASGNVYVADSSNNKIRKIT 643



 Score = 49.3 bits (116), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 43/77 (55%), Gaps = 4/77 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           ++DL + D +  A   ++      SVV   AG G  G ++G    A F+KP   A+D   
Sbjct: 735 NNDLFIADRNNHAIRKVT----SASVVTTFAGTGSAGSANGTGTQASFNKPYDIALDSAD 790

Query: 116 NIYVADKSNHVIRKITN 132
           N+YV++++ H IRKIT+
Sbjct: 791 NLYVSEQTGHTIRKITS 807



 Score = 48.9 bits (115), Expect = 0.001,   Method: Composition-based stats.
 Identities = 27/57 (47%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV   AG  GV G +DG    ARF +P   AVD  G +YVAD  NH IRKI+
Sbjct: 805 ITSAGVVTTFAGSAGVSGSTDGLASVARFSQPFGIAVDSNGIVYVADTGNHRIRKIS 861



 Score = 45.8 bits (107), Expect = 0.009,   Method: Composition-based stats.
 Identities = 25/77 (32%), Positives = 39/77 (50%), Gaps = 4/77 (5%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
            S ++ V DSS +    ++       VV   AG G  G +DG   +A F  P    +D  
Sbjct: 625 ASGNVYVADSSNNKIRKIT----PAGVVTTFAGSGSPGSTDGTGTAATFAAPTGITIDSN 680

Query: 115 GNIYVADKSNHVIRKIT 131
           GN++V + + H++RKIT
Sbjct: 681 GNLFVVETNPHIVRKIT 697



 Score = 41.6 bits (96), Expect = 0.17,   Method: Composition-based stats.
 Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 6/79 (7%)

Query: 60  IVLDSSRSAFYTLSFP-----LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           I +DS+ + F   + P     ++   VV   AG  G  G++D    SA+F+ P +     
Sbjct: 675 ITIDSNGNLFVVETNPHIVRKITPAGVVTTFAGSKGSSGFTDATGTSAKFNFPYNGGSSS 734

Query: 114 KGNIYVADKSNHVIRKITN 132
             ++++AD++NH IRK+T+
Sbjct: 735 NNDLFIADRNNHAIRKVTS 753


>gi|392540748|ref|ZP_10287885.1| Ig family protein [Pseudoalteromonas piscicida JCM 20779]
          Length = 2384

 Score = 56.6 bits (135), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 38/102 (37%), Positives = 48/102 (47%), Gaps = 3/102 (2%)

Query: 32  YTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYT--LSFPLSEESVVKRLAGDG 89
           +T T V    +  +  +SV       D + +  S S      LS   S E+ V  LAG  
Sbjct: 493 FTSTAVTASTEDSVYKYSVTTSDPDKDAVTVSVSASTPLPSWLSLNTSTEATVSTLAGQS 552

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
             GY+DG   +A F  P     D  GN+YVAD  NHVIRKIT
Sbjct: 553 -SGYADGTGTAASFKSPYDLVTDSNGNVYVADYGNHVIRKIT 593



 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/56 (48%), Positives = 36/56 (64%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ E VV  LAG G  G  DG   +A F+ PK+  +D  GN+YVAD SN+ IRK+T
Sbjct: 592 ITPEGVVTTLAGSGSAGSDDGTGSAASFNFPKAVTLDSSGNVYVADSSNNNIRKVT 647



 Score = 47.8 bits (112), Expect = 0.002,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 4/77 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           ++DL + D +  A   ++      SVV   AG G  G ++G    A F+KP   A D   
Sbjct: 739 NNDLFIADRNNHAIRKVT----SASVVTTFAGTGSAGSTNGTGTQASFNKPYDVAADSAD 794

Query: 116 NIYVADKSNHVIRKITN 132
           N+YV +++ H IRKIT+
Sbjct: 795 NLYVTEQAAHTIRKITS 811



 Score = 46.2 bits (108), Expect = 0.006,   Method: Composition-based stats.
 Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 4/76 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S ++ V DSS +    ++       VV   AG G  G  DG   +A F  P    +D  G
Sbjct: 630 SGNVYVADSSNNNIRKVT----PAGVVTTFAGSGTYGSDDGTGTAATFAAPTGITIDSNG 685

Query: 116 NIYVADKSNHVIRKIT 131
           N+YV + + H++RKIT
Sbjct: 686 NLYVVETNPHIVRKIT 701



 Score = 43.9 bits (102), Expect = 0.032,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV   AG  G  G +DG    ARF +P   AVD    +YVAD  NH IRKI+
Sbjct: 809 ITSTGVVTTYAGSAGASGNTDGLVSVARFSQPYGIAVDSNDVVYVADTGNHRIRKIS 865



 Score = 37.4 bits (85), Expect = 3.3,   Method: Composition-based stats.
 Identities = 22/79 (27%), Positives = 42/79 (53%), Gaps = 6/79 (7%)

Query: 60  IVLDSSRSAFYTLSFP-----LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           I +DS+ + +   + P     ++   VV   AG     G++D    SA F+ P +   + 
Sbjct: 679 ITIDSNGNLYVVETNPHIVRKITPAGVVTTFAGSKNSSGFTDATGTSATFNFPYNGGSNS 738

Query: 114 KGNIYVADKSNHVIRKITN 132
             ++++AD++NH IRK+T+
Sbjct: 739 NNDLFIADRNNHAIRKVTS 757


>gi|340788705|ref|YP_004754170.1| hypothetical protein CFU_3523 [Collimonas fungivorans Ter331]
 gi|340553972|gb|AEK63347.1| hypothetical protein CFU_3523 [Collimonas fungivorans Ter331]
          Length = 394

 Score = 56.6 bits (135), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 43/77 (55%), Gaps = 5/77 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
           + ++ V DS      T+S       VV  LAG  G  G +DG    ARF++P+  A D  
Sbjct: 297 AGNIRVSDSGNQLIRTVSL----TGVVSTLAGAAGTAGSTDGSGNKARFNQPEGIAADAA 352

Query: 115 GNIYVADKSNHVIRKIT 131
            NIYVAD SN++IRK+T
Sbjct: 353 NNIYVADTSNNLIRKVT 369



 Score = 50.8 bits (120), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  +AG  G  G +DG  G ARF++P   A D  GN+YV D  N  +RKIT
Sbjct: 137 ITSAGVVTTIAGSAGQAGQNDGTAGDARFNQPWGVARDGAGNLYVTDTGNATVRKIT 193



 Score = 49.3 bits (116), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 21/40 (52%), Positives = 28/40 (70%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G SDG   + RF +P + AVD  GN++VAD +N+ IRKIT
Sbjct: 44  GSSDGSGSNVRFSQPGAIAVDAGGNLFVADTANNTIRKIT 83



 Score = 45.1 bits (105), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 22/31 (70%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ARF++P   AVD  GN+Y+AD  NH IR IT
Sbjct: 108 ARFNQPGGIAVDAGGNLYLADTQNHTIRMIT 138



 Score = 36.2 bits (82), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ARF++P   A+D  GNI V+D  N +IR ++
Sbjct: 284 ARFNQPYGIALDSAGNIRVSDSGNQLIRTVS 314


>gi|124008176|ref|ZP_01692873.1| hypothetical protein M23134_05970 [Microscilla marina ATCC 23134]
 gi|123986275|gb|EAY26097.1| hypothetical protein M23134_05970 [Microscilla marina ATCC 23134]
          Length = 2548

 Score = 56.6 bits (135), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 26/51 (50%), Positives = 34/51 (66%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + V  +AGDG  G + G   SARF +P   AVD  GN+YVADK+NH I+K+
Sbjct: 715 ATVTTVAGDGTSGDAIGASTSARFREPSDVAVDFSGNLYVADKNNHKIKKV 765



 Score = 50.1 bits (118), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 34/89 (38%), Positives = 45/89 (50%), Gaps = 9/89 (10%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG---YSDGEPGSAR 102
           P S+ +DR G  +L V D   +    ++   +        AGD  Q    Y DG   SAR
Sbjct: 801 PTSIALDRSG--NLFVADRHNNLIRQVA---TSSGATSTYAGDISQTNALYVDGAAASAR 855

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           F+ P    VDM G++YVAD  N VIRKI+
Sbjct: 856 FNHPTGITVDMVGDVYVADTRNQVIRKIS 884



 Score = 46.6 bits (109), Expect = 0.005,   Method: Composition-based stats.
 Identities = 27/58 (46%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 79  ESVVKRLAGD-GVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           +S V  +AG  G  G  +   G+ ARF+ P   A+D  G +YVADK NHVIR I N D
Sbjct: 655 DSTVSIVAGSVGASGLVNDPVGTNARFNHPSGLAIDDAGILYVADKDNHVIRAIANPD 712



 Score = 41.6 bits (96), Expect = 0.14,   Method: Composition-based stats.
 Identities = 18/39 (46%), Positives = 27/39 (69%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G +DG    ARF  P S A+D  GN++VAD+ N++IR++
Sbjct: 787 GATDGTASIARFFFPTSIALDRSGNLFVADRHNNLIRQV 825



 Score = 38.1 bits (87), Expect = 1.7,   Method: Composition-based stats.
 Identities = 15/39 (38%), Positives = 26/39 (66%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           +G ++G   +A+F+ P S   D++ NIYV DK N ++R+
Sbjct: 898 RGLTNGTAQAAKFNYPGSVYADLEQNIYVGDKVNQLVRR 936


>gi|223939820|ref|ZP_03631690.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223891508|gb|EEF57999.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 805

 Score = 55.8 bits (133), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 33/82 (40%), Positives = 49/82 (59%), Gaps = 4/82 (4%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSF 109
           ID+ G  +L+V D+  S    ++ P+    VV  LAG     G +DG    ARF+ P+  
Sbjct: 358 IDKAG--NLLVADTQNSEIRKIA-PVGTNYVVTTLAGFAQNVGGADGTNAVARFNSPRGI 414

Query: 110 AVDMKGNIYVADKSNHVIRKIT 131
           AVD  GN++VAD++N+ IRK+T
Sbjct: 415 AVDAAGNVFVADQNNNTIRKLT 436



 Score = 53.1 bits (126), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 2/78 (2%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDMK 114
           S  L V DSS ++   ++ P+    VV  +AG   V G +DG  G A F++P   AVD  
Sbjct: 187 SGHLFVGDSSNNSIRKMT-PVGTNWVVTTIAGSSPVSGSNDGTNGFAHFNQPCGLAVDAA 245

Query: 115 GNIYVADKSNHVIRKITN 132
           G+I+VAD  N  IRKIT+
Sbjct: 246 GSIFVADYFNSTIRKITS 263



 Score = 50.4 bits (119), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 2/78 (2%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
            + ++ V D + +    L+ P+     V  +AG  G+  Y DG   +A F+ P   AVD 
Sbjct: 418 AAGNVFVADQNNNTIRKLT-PVGTNWAVTTIAGQAGMAYYGDGNGTNAYFNYPAGIAVDA 476

Query: 114 KGNIYVADKSNHVIRKIT 131
            GN++V D  NHVIRK+T
Sbjct: 477 SGNLFVTDAGNHVIRKLT 494



 Score = 49.3 bits (116), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 24/46 (52%), Positives = 30/46 (65%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AG    G +DG    ARF+ P++ A D  GNIYVAD  NHVIRK+
Sbjct: 42  IAGVASIGSNDGTNRVARFNNPEATASDSAGNIYVADSGNHVIRKM 87



 Score = 46.2 bits (108), Expect = 0.006,   Method: Composition-based stats.
 Identities = 25/54 (46%), Positives = 34/54 (62%), Gaps = 1/54 (1%)

Query: 81  VVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           VV  +AG  G  G +DG   +A+F  P + AVD  GN++VAD +N+ IRKI  L
Sbjct: 95  VVTTIAGLPGNPGSADGTNSAAQFYYPAAVAVDNSGNVFVADSANYTIRKIAPL 148



 Score = 46.2 bits (108), Expect = 0.006,   Method: Composition-based stats.
 Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMK 114
           S ++ V DS+      ++ PL    VV  +AG  G  G SDG    A F  P+  A D  
Sbjct: 129 SGNVFVADSANYTIRKIA-PLGTNWVVTTIAGVPGYHGSSDGTNTDALFFYPEGLAFDSS 187

Query: 115 GNIYVADKSNHVIRKIT 131
           G+++V D SN+ IRK+T
Sbjct: 188 GHLFVGDSSNNSIRKMT 204



 Score = 43.5 bits (101), Expect = 0.039,   Method: Composition-based stats.
 Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 1/52 (1%)

Query: 81  VVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +V  +AG  GV   ++G   +A F+ P + AVD  GN++VAD  N+ IRK+T
Sbjct: 269 LVTTIAGKVGVADSAEGTGTNAVFNYPHAVAVDTNGNVFVADSENYTIRKLT 320



 Score = 42.0 bits (97), Expect = 0.13,   Method: Composition-based stats.
 Identities = 18/42 (42%), Positives = 24/42 (57%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           G+ DG    ARF  P   A+D    +YV D+ N+ IRK+T L
Sbjct: 630 GFMDGTNSDARFAYPTGIAIDTNDTLYVTDQGNNTIRKVTPL 671



 Score = 40.8 bits (94), Expect = 0.30,   Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 75  PLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           PL    +V  LAG     G +DG   +A F+ P   A+D  GN++VAD  N  IRK
Sbjct: 670 PLGTNWMVTTLAGIHASTGSADGAGSAALFNGPFGIAIDKTGNLFVADLQNSSIRK 725


>gi|373488690|ref|ZP_09579354.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
 gi|372005635|gb|EHP06271.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
          Length = 438

 Score = 55.8 bits (133), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 44/77 (57%), Gaps = 3/77 (3%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRL-AG-DGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L V D++R     L  P S+ +V + L AG DG+ G  D    SA F+ P+  A+D  GN
Sbjct: 132 LYVSDATRHTISKL-VPNSDGTVTQSLLAGSDGISGSEDKSGSSASFNSPEGLALDASGN 190

Query: 117 IYVADKSNHVIRKITNL 133
           +YVAD  NH IRKI  L
Sbjct: 191 LYVADYGNHTIRKIDTL 207



 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 1/51 (1%)

Query: 82  VKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  LAG  G  G ++G    ARF++P    +   G + VAD  NH IRK++
Sbjct: 320 ISTLAGYAGDTGTANGTGNGARFNQPVGITLHTSGYLLVADAYNHAIRKVS 370


>gi|153868997|ref|ZP_01998705.1| protein kinase-like protein [Beggiatoa sp. PS]
 gi|152074439|gb|EDN71293.1| protein kinase-like protein [Beggiatoa sp. PS]
          Length = 1432

 Score = 55.8 bits (133), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 27/58 (46%), Positives = 35/58 (60%), Gaps = 2/58 (3%)

Query: 75  PLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           PL+  S++  +AGDG  GYS  +G    AR   P S  VD + N+Y+AD  NH IRKI
Sbjct: 558 PLNANSLITTIAGDGRSGYSGDNGPAIQARLGNPSSLVVDNENNLYIADTDNHRIRKI 615



 Score = 49.7 bits (117), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +   AG G +GYS DG    +AR + P   AVD  GNI++AD++NH IRKI
Sbjct: 621 ITTFAGSGYKGYSGDGNLAITARLNMPTGLAVDGTGNIFIADQNNHRIRKI 671



 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/59 (38%), Positives = 33/59 (55%), Gaps = 2/59 (3%)

Query: 74  FPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           F L  E  +   AG G +G+   +G    A    P+  A+D +GN+Y+AD +NH IRKI
Sbjct: 436 FKLDPEGNLTVFAGTGTKGFEGDNGPAIEANLSNPEGLAIDAQGNLYIADTNNHRIRKI 494



 Score = 47.0 bits (110), Expect = 0.003,   Method: Composition-based stats.
 Identities = 24/59 (40%), Positives = 35/59 (59%), Gaps = 2/59 (3%)

Query: 79  ESVVKRLAGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E +++   G GV+G  +DG   S A  ++P   A+D  GN+Y+A+K NH IRKI   D 
Sbjct: 674 EGIIRTFTGTGVRGTATDGILASVAEINQPTDIALDQYGNLYLAEKGNHFIRKIGEKDQ 732



 Score = 38.1 bits (87), Expect = 2.0,   Method: Composition-based stats.
 Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 7/75 (9%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI--- 130
           +  + ++  + G G  GY+ D E   +A+  KP +   D  G+ Y+AD  N+ IRKI   
Sbjct: 494 IDSDGIITTVVGTGEAGYAGDNEFAIAAQLKKPTAIVFDHNGHFYIADSGNNSIRKINYQ 553

Query: 131 --TNLDHGNQLIRQI 143
             T+  + N LI  I
Sbjct: 554 PGTSPLNANSLITTI 568


>gi|111219919|ref|YP_710713.1| serine/threonine protein kinase [Frankia alni ACN14a]
 gi|111147451|emb|CAJ59101.1| Putative serine/threonine protein kinase [Frankia alni ACN14a]
          Length = 779

 Score = 55.8 bits (133), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 8/92 (8%)

Query: 42  QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS 100
           +L+    +V DR G  DL + DS  +    +        +V  +AG G +G+S DG P +
Sbjct: 622 RLQATTAAVADRSG--DLYIADSEANRIRRID----SRGIVTTVAGTGAKGFSGDGGPAT 675

Query: 101 -ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            A  + P   AVD  GN+Y+AD +NH IR++T
Sbjct: 676 RAMLNNPTGVAVDRAGNLYIADSANHRIRRVT 707



 Score = 40.4 bits (93), Expect = 0.34,   Method: Composition-based stats.
 Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 2/49 (4%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           V  +AG G  G S DG P   A F +  S AVD  GNIY++D+SN V R
Sbjct: 486 VSTVAGSGETGTSGDGGPALQAAFQRLGSVAVDAAGNIYISDESNRVRR 534


>gi|359151030|ref|ZP_09183771.1| Redoxin domain-containing protein [Streptomyces sp. S4]
          Length = 568

 Score = 55.8 bits (133), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 46/76 (60%), Gaps = 3/76 (3%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S + +V D++R     L+     ESVV+R+ G+G +G +DG P  ARF +P+   +  +G
Sbjct: 140 SGNFLVSDTTRHQLVELAE--DGESVVRRIGGEGERGLADGGPAEARFSEPQGLVLTPEG 197

Query: 116 N-IYVADKSNHVIRKI 130
           + + VAD  NH +R++
Sbjct: 198 DAVIVADTVNHALRRV 213


>gi|329908044|ref|ZP_08274804.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327546777|gb|EGF31711.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 642

 Score = 55.8 bits (133), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 8/88 (9%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARF 103
           +P S+ IDR G   L V DS       +S     + VV  LAG  G  G +DG   +ARF
Sbjct: 78  SPESIAIDRAGM--LYVADSVNHTIRKVS----PQGVVTTLAGRAGEPGSADGRGSAARF 131

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKIT 131
             PK  AVD+ GN+ V+D +NH IRKI+
Sbjct: 132 FDPKGVAVDVAGNVVVSDNANHTIRKIS 159



 Score = 49.7 bits (117), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 30/56 (53%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV +LAG G  G  DG    A F  PK    D  GN++VAD  N  IRKIT
Sbjct: 369 ITPAGVVSQLAGSGKWGSVDGTGAGASFSAPKGIVADPLGNVFVADTYNSTIRKIT 424



 Score = 46.6 bits (109), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 2/52 (3%)

Query: 82  VKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAG      G  DG   +ARF  P+S A+D  G +YVAD  NH IRK++
Sbjct: 53  VSTLAGASASTPGAVDGRGAAARFYSPESIAIDRAGMLYVADSVNHTIRKVS 104



 Score = 46.2 bits (108), Expect = 0.007,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  +AG     G +DG    ARF  P++ A+D + N+YV D  NH IRKI+
Sbjct: 423 ITPAGVVTTIAGAPSSTGSTDGPGNLARFSGPEAIAIDAQRNLYVGDTGNHTIRKIS 479



 Score = 43.1 bits (100), Expect = 0.058,   Method: Composition-based stats.
 Identities = 24/59 (40%), Positives = 33/59 (55%), Gaps = 1/59 (1%)

Query: 74  FPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + ++  S  KRLAG   +    +G    ARF+ P +   D  GN YVAD  N++IRKIT
Sbjct: 312 YVVTNGSTFKRLAGLAPELALVNGNGAQARFNLPWAVTSDATGNWYVADAGNYMIRKIT 370



 Score = 42.0 bits (97), Expect = 0.11,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S   VV  LAG  G  G  DG   +AR   P+S +VD  GN+YV   S   +RKIT
Sbjct: 478 ISTSGVVSTLAGSPGRYGSEDGTGAAARLASPRSMSVDQAGNVYVI--SYRAVRKIT 532



 Score = 38.9 bits (89), Expect = 0.98,   Method: Composition-based stats.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S +  V  LAG      + DG   +ARF  P++   D   N+YV D+ N ++RKIT
Sbjct: 158 ISPDGNVTTLAGSARNASALDGIGSAARFAFPQALVTDAARNVYVVDRGNGLLRKIT 214



 Score = 38.9 bits (89), Expect = 1.2,   Method: Composition-based stats.
 Identities = 19/42 (45%), Positives = 25/42 (59%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G  G  +G    ARF  P S  VD  GN Y+AD++N+ IRK+
Sbjct: 6   GGSGNLNGTGSLARFAYPGSMVVDSAGNRYIADQNNNQIRKV 47


>gi|325916326|ref|ZP_08178604.1| gluconolactonase [Xanthomonas vesicatoria ATCC 35937]
 gi|325537497|gb|EGD09215.1| gluconolactonase [Xanthomonas vesicatoria ATCC 35937]
          Length = 699

 Score = 55.8 bits (133), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 32/54 (59%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + + VV  LAGDG  G+ DG    ARF+ P   AVD +G +YVAD  N  IR I
Sbjct: 153 TPQGVVTTLAGDGTAGFGDGAASQARFNGPMGVAVDAQGRVYVADTYNDRIRVI 206



 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/50 (48%), Positives = 33/50 (66%), Gaps = 1/50 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V+ LAG G +G++DG   +A F+ P   A+D  GN+Y+AD  NH IRK T
Sbjct: 105 VQTLAG-GREGFADGIGAAAAFNTPSGIALDTAGNLYIADTGNHAIRKRT 153



 Score = 46.6 bits (109), Expect = 0.005,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 30/56 (53%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +  V+ LAG  + G +DG    AR D P    VD  G ++VAD  N  IR+IT
Sbjct: 206 IERDGQVRTLAGSALPGMADGVGEQARLDTPTDLKVDAHGVVWVADMRNDAIRRIT 261


>gi|182416423|ref|YP_001821489.1| NHL repeat-containing protein [Opitutus terrae PB90-1]
 gi|177843637|gb|ACB77889.1| NHL repeat containing protein [Opitutus terrae PB90-1]
          Length = 1026

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 29/56 (51%), Positives = 34/56 (60%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S +  V  LAG G  G +DG+  SARF      AVD  GN+YVAD  NH IRKIT
Sbjct: 187 VSPDGNVTILAGAGKAGSADGDSASARFSGSDGLAVDGTGNVYVADLFNHTIRKIT 242



 Score = 50.4 bits (119), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 29/57 (50%), Positives = 34/57 (59%), Gaps = 3/57 (5%)

Query: 78  EESVVKRLAG---DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           E  V  R AG   D   G +DG   SARF  P+S AVD  G +YVAD ++ VIRKIT
Sbjct: 22  ESYVFTRYAGNPNDTPYGSTDGFGDSARFAAPRSIAVDASGTLYVADAASSVIRKIT 78



 Score = 45.4 bits (106), Expect = 0.011,   Method: Composition-based stats.
 Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 5/94 (5%)

Query: 38  IDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE 97
           +D  +  +     + R G  D+ V DS  S    ++   +  +   R  G    GY++G 
Sbjct: 318 VDAARFSLPRGVAVSRTG--DIYVADSGNSTVRRIAVGGAVTTFAGRPGG---PGYANGS 372

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
             +A+F  P   A+D   N++VAD  N+VIRKIT
Sbjct: 373 SETAQFYFPTGIAIDQNRNVFVADSYNNVIRKIT 406



 Score = 45.4 bits (106), Expect = 0.011,   Method: Composition-based stats.
 Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 6/86 (6%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           P S+ +D  G+  L V D++ S    ++   +E  V   +   G +G +DG   +ARF  
Sbjct: 53  PRSIAVDASGT--LYVADAASSVIRKIT---AEGMVTTFVGTAGQRGSADGIGAAARFQG 107

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
               A+D +GN+Y  D ++H +RKIT
Sbjct: 108 IDGLAIDARGNLYAVDFTDHTVRKIT 133



 Score = 43.9 bits (102), Expect = 0.037,   Method: Composition-based stats.
 Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 11/65 (16%)

Query: 78  EESVVKRLAGDGVQGYSDGEPG-----------SARFDKPKSFAVDMKGNIYVADKSNHV 126
           +  V+ ++A DG   +  G PG            ARF+ P   AV   G IYVAD  NH 
Sbjct: 453 QTHVIAKIAPDGAVTFFAGSPGLSGSTDGNARTEARFNGPTGIAVGPSGTIYVADFDNHT 512

Query: 127 IRKIT 131
           IR+I+
Sbjct: 513 IRQIS 517



 Score = 43.5 bits (101), Expect = 0.038,   Method: Composition-based stats.
 Identities = 27/74 (36%), Positives = 36/74 (48%), Gaps = 7/74 (9%)

Query: 65  SRSAFYTLSF------PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           +R   Y + F       ++ E VV  LAG  G  G   G  G ARFD P + AVD   N+
Sbjct: 115 ARGNLYAVDFTDHTVRKITPEGVVTTLAGSAGDHGTQVGHGGEARFDSPMAVAVDRWDNL 174

Query: 118 YVADKSNHVIRKIT 131
           YV    +  IRK++
Sbjct: 175 YVGQMGDGAIRKVS 188



 Score = 43.5 bits (101), Expect = 0.045,   Method: Composition-based stats.
 Identities = 19/39 (48%), Positives = 25/39 (64%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           +G  DG  G ARF  P   AVD  G++YVAD  N+++RK
Sbjct: 635 EGKQDGRGGEARFLNPYGVAVDAAGHLYVADSGNNLVRK 673



 Score = 43.1 bits (100), Expect = 0.050,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 36/68 (52%), Gaps = 1/68 (1%)

Query: 76  LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           +S   +V  LAG   Q G +DG   +ARF  P +  VD  G IYVAD  +  +RKIT   
Sbjct: 516 ISPAGMVSTLAGAAGQPGTADGTGSAARFYAPAAVTVDRAGMIYVADSWSSAVRKITPDG 575

Query: 135 HGNQLIRQ 142
               ++RQ
Sbjct: 576 VVTTVVRQ 583



 Score = 42.0 bits (97), Expect = 0.14,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 32/56 (57%), Gaps = 2/56 (3%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S   VV  +AG  G+ G  DG   +ARF  P+  AV   G+IYVAD  N  +R+I
Sbjct: 296 VSPSGVVSTVAGKTGLSGSDDG-VDAARFSLPRGVAVSRTGDIYVADSGNSTVRRI 350



 Score = 40.8 bits (94), Expect = 0.29,   Method: Composition-based stats.
 Identities = 28/77 (36%), Positives = 43/77 (55%), Gaps = 6/77 (7%)

Query: 60  IVLDSSRSAFYTLSF-----PLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDM 113
           I +D +R+ F   S+      ++   VV  +AG  GV G ++G   +ARF  P + A+D 
Sbjct: 384 IAIDQNRNVFVADSYNNVIRKITPGGVVTTVAGLGGVFGSAEGSGAAARFGVPAAVAIDA 443

Query: 114 KGNIYVADKSNHVIRKI 130
             N+YVA++  HVI KI
Sbjct: 444 AANLYVANRQTHVIAKI 460


>gi|255589882|ref|XP_002535117.1| nhl repeat-containing protein, putative [Ricinus communis]
 gi|223523999|gb|EEF27264.1| nhl repeat-containing protein, putative [Ricinus communis]
          Length = 435

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 1/52 (1%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + V   LAG G +GY++G P +A F+ P   A+D  GN+YVAD  N+ IRKI
Sbjct: 96  DGVTTTLAG-GTEGYAEGAPTAAAFNTPSGLAIDAAGNLYVADTGNNAIRKI 146



 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 39/75 (52%), Gaps = 4/75 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           + +L V D+  +A   +      +  V  LAGDG+ G  DG+   A+F+ P   AVD  G
Sbjct: 130 AGNLYVADTGNNAIRKIG----PDGTVSTLAGDGLAGDKDGKGAGAQFNGPIGVAVDAAG 185

Query: 116 NIYVADKSNHVIRKI 130
            +YV D  N  IR+I
Sbjct: 186 VVYVTDTYNDRIRRI 200



 Score = 44.3 bits (103), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS-NHVIRKI 130
           P    + +  +AGDG+ G S+G     RF  P   A+D +GN+YVAD   N+ IRKI
Sbjct: 37  PAGWHARITTMAGDGLPGSSNGSGKRTRFADPFGVALDRQGNLYVADGGDNNSIRKI 93



 Score = 41.2 bits (95), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 28/55 (50%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +  V  +AG    G +DG    A FD P   A+   G +Y+AD  N  IRKI
Sbjct: 200 IAPDGTVTTVAGGKRAGMADGAGAQALFDTPTGIALGASGALYIADTGNSAIRKI 254


>gi|194364186|ref|YP_002026796.1| SMP-30/gluconolaconase/LRE domain-containing protein
           [Stenotrophomonas maltophilia R551-3]
 gi|194346990|gb|ACF50113.1| SMP-30/Gluconolaconase/LRE domain protein [Stenotrophomonas
           maltophilia R551-3]
          Length = 693

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 46/95 (48%), Gaps = 7/95 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           IDG  L+   N  S I      +L V D+   A   +      +  V  LAG G QGY+D
Sbjct: 114 IDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGYAD 168

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G    ARFD P   AVD +G +YVAD  N  IR I
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 203



 Score = 47.8 bits (112), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 1/49 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I
Sbjct: 103 IETVAGQG-EGRIDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRI 150



 Score = 41.2 bits (95), Expect = 0.21,   Method: Composition-based stats.
 Identities = 20/49 (40%), Positives = 29/49 (59%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+ LAG    G +DG   +ARFD P + A D  G + VAD  N+ +R++
Sbjct: 209 VRTLAGGERPGLADGLGVAARFDTPVALAFDAHGALLVADLFNNAVRRV 257



 Score = 38.5 bits (88), Expect = 1.5,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 1/56 (1%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK-SNHVIRK 129
           PL+  + ++ LAGDG  G  DG    ARF  P +      G+IY  D   N+ IR+
Sbjct: 41  PLAWTAQIEMLAGDGHPGDRDGASAQARFADPYALLRSADGSIYFTDAGDNNRIRR 96


>gi|431798812|ref|YP_007225716.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
 gi|430789577|gb|AGA79706.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
          Length = 428

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 51/97 (52%), Gaps = 8/97 (8%)

Query: 37  VIDGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           +IDG  ++    NP  V   P    LIV D + ++   ++     +  V  + G G  G+
Sbjct: 121 LIDGSGIQAAFRNPEGVTMHPDGY-LIVTDRANNSIRKVT----TDGAVSTVLGTGNSGF 175

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +G   SA  D P    VDM+GNIYVAD+ NH+IRKI
Sbjct: 176 QNGPVASALLDYPWKSCVDMEGNIYVADRDNHMIRKI 212



 Score = 49.7 bits (117), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 33/55 (60%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  + +V  +AG G  G++DG    A+FD+P   AV  +G +YV D  NH IRKI
Sbjct: 212 IDPQGMVSTVAGTGEAGFADGPAEEAQFDQPLDIAVTAEGVLYVTDNRNHRIRKI 266



 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 43/83 (51%), Gaps = 7/83 (8%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDC 150
           QG  DG    A F  P    VD  GNIYVAD+ NH+IRKI +L+ G     Q++    D 
Sbjct: 280 QGNQDGALEEATFRYPSGLDVDDMGNIYVADRINHLIRKI-DLNAG-----QVSTVAGDG 333

Query: 151 SKSSQSGSGMFWVTVFSPYLISC 173
           S+ ++ G  +      +PY IS 
Sbjct: 334 SQGTRDGQ-VMTAQFNNPYGISV 355



 Score = 44.3 bits (103), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 30/49 (61%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  +AGDG QG  DG+  +A+F+ P   +V   G + VAD SNH IR I
Sbjct: 326 VSTVAGDGSQGTRDGQVMTAQFNNPYGISVADNGQLVVADLSNHKIRLI 374


>gi|254522386|ref|ZP_05134441.1| NHL repeat containing protein [Stenotrophomonas sp. SKA14]
 gi|219719977|gb|EED38502.1| NHL repeat containing protein [Stenotrophomonas sp. SKA14]
          Length = 643

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 38/96 (39%), Positives = 47/96 (48%), Gaps = 7/96 (7%)

Query: 37  VIDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS 94
           +IDG  L+   N  S I      +L V D+   A   +      +  V  LAG GVQG+ 
Sbjct: 63  LIDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGI----DGQVTTLAG-GVQGHG 117

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           DG    ARFD P   AVD +G +YVAD  N  IR I
Sbjct: 118 DGPVAQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 153



 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 1/49 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I
Sbjct: 53  VETVAGQG-EGLIDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRI 100



 Score = 43.5 bits (101), Expect = 0.038,   Method: Composition-based stats.
 Identities = 20/49 (40%), Positives = 30/49 (61%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+ LAG    G +DG   +ARFD P + A D +G + VAD  N+ +R++
Sbjct: 159 VRTLAGGERPGLADGAGDAARFDTPVALAFDAQGALLVADLFNNAVRRV 207


>gi|408823744|ref|ZP_11208634.1| NHL repeat-containing protein [Pseudomonas geniculata N1]
          Length = 681

 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 37/95 (38%), Positives = 46/95 (48%), Gaps = 7/95 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           +DG  L+   N  S I      +L V D+   A   +      +  V  LAG G QGY+D
Sbjct: 102 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGYAD 156

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G    ARFD P   AVD +G +YVAD  N  IR I
Sbjct: 157 GPAAQARFDAPMGIAVDAQGQVYVADTYNDRIRVI 191



 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 1/49 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I
Sbjct: 91  VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRI 138



 Score = 44.3 bits (103), Expect = 0.025,   Method: Composition-based stats.
 Identities = 21/49 (42%), Positives = 30/49 (61%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+ LAG    G +DG   +ARFD P + A D +G + VAD  N+ +R+I
Sbjct: 197 VRTLAGGERPGMADGVGAAARFDTPVALAFDAQGALLVADLFNNAVRRI 245


>gi|374606289|ref|ZP_09679172.1| copper amine oxidase domain-containing protein [Paenibacillus
           dendritiformis C454]
 gi|374388103|gb|EHQ59542.1| copper amine oxidase domain-containing protein [Paenibacillus
           dendritiformis C454]
          Length = 564

 Score = 54.7 bits (130), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 36/55 (65%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           L +   V  +AG G+ G+ DG+   ARF++P+  AV   G++YVAD  NHVIR+I
Sbjct: 165 LDKSGRVTTVAGSGLAGWKDGKGAEARFNEPRDVAVAEDGSLYVADALNHVIRRI 219



 Score = 49.7 bits (117), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 29/38 (76%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++DG+   ARF++P   A D +GN+Y+AD +NHVIRK+
Sbjct: 128 WADGKGTEARFNEPMGMAEDRQGNLYIADAANHVIRKL 165



 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 31/61 (50%), Gaps = 8/61 (13%)

Query: 68  AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           A Y+  FP +      RL   G  GY DG+   A F+ P   A+  +G I VAD+ NH I
Sbjct: 303 ASYSYKFPDT------RLYAAG--GYRDGKASEALFNGPAGIALTGEGGIVVADRWNHAI 354

Query: 128 R 128
           R
Sbjct: 355 R 355



 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 28/63 (44%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINL 145
           AG GV G +DG    A+F  P        G + +AD  NH+IR+       + L  Q+  
Sbjct: 58  AGSGVSGSADGRAEQAQFRYPAGLLAGKNGALLIADTYNHLIRRADAAGQVSTLAGQVRF 117

Query: 146 KPE 148
             E
Sbjct: 118 AAE 120


>gi|455790313|gb|EMF42184.1| putative lipoprotein [Leptospira interrogans serovar Lora str. TE
           1992]
          Length = 294

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+ N++IRKI     GN      NL  +D S
Sbjct: 96  DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 38.9 bits (89), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
            V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259



 Score = 37.0 bits (84), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           + G  +G+  ++ FD P    +D + N+YV + SNH IRKI NL+ G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209


>gi|383636240|ref|ZP_09950646.1| redoxin domain-containing protein [Streptomyces chartreusis NRRL
           12338]
          Length = 605

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 33/99 (33%), Positives = 55/99 (55%), Gaps = 10/99 (10%)

Query: 39  DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
           DG  +   P   + R PG      S   +V D++R     L+     ESVV+R+ G G +
Sbjct: 165 DGPYVAPEPEPTVLRFPGKALLLPSGTFLVSDTTRHQLVELA--EDGESVVRRI-GSGTR 221

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G++DG  G+A F++P+  A+  +G++ VAD  NH +R++
Sbjct: 222 GFADGPTGTAAFNEPQGLALLDEGSVVVADTVNHALRRL 260


>gi|124006033|ref|ZP_01690870.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
 gi|123988440|gb|EAY28086.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
          Length = 789

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 45/127 (35%), Positives = 66/127 (51%), Gaps = 15/127 (11%)

Query: 12  ITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFY 70
           ITLL ++ SL +SG   ++ Y + TV+  +    N   V +D    +++ V D+ R+  Y
Sbjct: 9   ITLLCIV-SLEASG---QNQYKINTVLSNYD---NIWGVAVD--AQNNVYVTDAERNRIY 59

Query: 71  TLSFPLSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIR 128
            L       +++   AG G +GYS DG P +A   D P    VD  GN+Y AD SN V+R
Sbjct: 60  RLDAITGVRTII---AGTGTEGYSGDGGPATAAMLDYPTGITVDAAGNVYFADGSNDVVR 116

Query: 129 KITNLDH 135
           KI    H
Sbjct: 117 KIDATTH 123



 Score = 40.0 bits (92), Expect = 0.45,   Method: Composition-based stats.
 Identities = 19/52 (36%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+  +AG+G +G++   G+  +A+   P   A+D  GNIY+ D  N  IRK+
Sbjct: 124 VISTIAGNGNRGFAGDGGQATAAQLHFPSDVALDTAGNIYIVDHRNDRIRKV 175



 Score = 37.7 bits (86), Expect = 2.3,   Method: Composition-based stats.
 Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 2/47 (4%)

Query: 85  LAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            AGDG    +DG P   A    PK  A+D  GN+Y+AD  +  IRK+
Sbjct: 225 FAGDG-NALNDGGPAHLASLRNPKGLAIDAAGNVYIADVLDDRIRKV 270


>gi|386716891|ref|YP_006183217.1| NHL repeat protein [Stenotrophomonas maltophilia D457]
 gi|384076453|emb|CCH11034.1| NHL repeat protein [Stenotrophomonas maltophilia D457]
          Length = 652

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 37/95 (38%), Positives = 47/95 (49%), Gaps = 7/95 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           +DG  L+   N  S I      +L V D+   A   +S     +  V  LAG G QG++D
Sbjct: 73  VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIS----TDGQVTTLAG-GEQGHAD 127

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G    ARFD P   AVD +G +YVAD  N  IR I
Sbjct: 128 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 162



 Score = 48.9 bits (115), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 23/50 (46%), Positives = 31/50 (62%), Gaps = 1/50 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V+ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I+
Sbjct: 62  VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIS 110



 Score = 44.7 bits (104), Expect = 0.018,   Method: Composition-based stats.
 Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 5/102 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGD 88
           VTT+  G Q   +  +   R  +   I +D+    +   +F      +  +  V+ LAG 
Sbjct: 115 VTTLAGGEQGHADGPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGG 174

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
              G +DG   +ARFD P + A D +G + VAD  N+ +R++
Sbjct: 175 DRPGLADGVGAAARFDTPVALAFDAQGALLVADLFNNAVRRV 216


>gi|456988165|gb|EMG23306.1| hypothetical protein LEP1GSC150_2908 [Leptospira interrogans
           serovar Copenhageni str. LT2050]
          Length = 231

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+ N++IRKI     GN      NL  +D S
Sbjct: 96  DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124


>gi|427400920|ref|ZP_18892158.1| hypothetical protein HMPREF9710_01754 [Massilia timonae CCUG 45783]
 gi|425720099|gb|EKU83025.1| hypothetical protein HMPREF9710_01754 [Massilia timonae CCUG 45783]
          Length = 717

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 34/93 (36%), Positives = 47/93 (50%), Gaps = 13/93 (13%)

Query: 50  VIDRPGSSDLI-----VLDSSRSAFYTLSFP-------LSEESVVKRLAGDGVQGYSDGE 97
           VID PGS+        V   +R A Y            +  +  V  LAG G +G++DG 
Sbjct: 64  VIDGPGSASRFSDPYGVAIGARGAVYVADGGEANRIRLIQPDGAVSTLAG-GKEGFADGI 122

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
             +A F  P + A+D +GN+YVAD  NH IRK+
Sbjct: 123 GAAAAFHTPSALALDHEGNLYVADTGNHAIRKV 155



 Score = 47.8 bits (112), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/55 (40%), Positives = 31/55 (56%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ + +V  LAG+G  G  DG    A FD P + A    G +YVAD  NH +R+I
Sbjct: 209 IAPDGMVTTLAGNGKPGLLDGALLDAGFDTPSALAAGRDGTLYVADTGNHAVRRI 263



 Score = 45.1 bits (105), Expect = 0.014,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 32/55 (58%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +  V  +AG G  GY DG   +A+F+ P   AVD  G +YVAD  N  IR+I
Sbjct: 155 VAPDGTVTTVAGSGSPGYLDGIGRAAQFNGPVGIAVDDAGIVYVADTYNDRIRRI 209



 Score = 39.7 bits (91), Expect = 0.62,   Method: Composition-based stats.
 Identities = 22/41 (53%), Positives = 25/41 (60%), Gaps = 2/41 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVAD 121
           V  LAGDG  G  DG PGSA RF  P   A+  +G +YVAD
Sbjct: 53  VSLLAGDGRDGVIDG-PGSASRFSDPYGVAIGARGAVYVAD 92


>gi|290974765|ref|XP_002670115.1| predicted protein [Naegleria gruberi]
 gi|284083670|gb|EFC37371.1| predicted protein [Naegleria gruberi]
          Length = 784

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 48/91 (52%), Gaps = 5/91 (5%)

Query: 43  LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGS 100
           LE  P  V   P S+DL++ D        L+     +++V  + GD   GYS  +G    
Sbjct: 95  LEFQPGGVALYPNSNDLLISDPVGGVIVRLN-SKGIQTIVAGMKGD--LGYSGDNGLATR 151

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           AR + P   AV   G +Y+ADKSNHVIRKI+
Sbjct: 152 ARLNTPTGVAVASNGEVYIADKSNHVIRKIS 182



 Score = 43.1 bits (100), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG+G +G+S   G   +A+ + P   ++   G +Y+AD  NHVIRKI
Sbjct: 187 ISTIAGNGEEGFSGDGGNAKTAQLNSPIGLSISSTGELYIADSKNHVIRKI 237



 Score = 39.7 bits (91), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADK 122
           + SA++ +   +S   ++ + AG G  GYS DG     A+F++P   A    G+I++AD 
Sbjct: 405 AESAYHKIR-KISPNGIISQYAGTGDHGYSGDGSDAKLAKFNEPTRIACSKNGDIFIADL 463

Query: 123 SNHVIRKI 130
            N  IR+I
Sbjct: 464 FNGAIRRI 471



 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 6/99 (6%)

Query: 33  TVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
           TV TV+ G     +P +VI    S +L+V D S      +S       +   L   G  G
Sbjct: 477 TVDTVVSGLG---SPQAVILTE-SGELLVADRSSHVIRKISTNGVMSIIAGVLEDGGFNG 532

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
             DG     +F  P+  A+   G++Y+AD  N++IRK++
Sbjct: 533 --DGLATKTKFSGPQDIALAPNGDLYIADYDNYLIRKLS 569


>gi|409099442|ref|ZP_11219466.1| NHL repeat containing protein [Pedobacter agri PB92]
          Length = 754

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 28/56 (50%), Positives = 32/56 (57%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S    V  +AG G  G  DG   SA F+ P   AVD  GNIYVAD  N+ IRKIT
Sbjct: 193 ISSTGFVTTVAGSGATGSLDGNAASATFNNPAGVAVDRAGNIYVADAGNNKIRKIT 248



 Score = 45.1 bits (105), Expect = 0.013,   Method: Composition-based stats.
 Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 7/87 (8%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           NP  V +DR G  ++ V D+  +    ++       VV  LAG+G  G +DG   +AR  
Sbjct: 222 NPAGVAVDRAG--NIYVADAGNNKIRKIT----PAGVVSTLAGNGTAGSTDGTGTNARLY 275

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
            P    +D   N+Y+AD  N+ IRKI+
Sbjct: 276 YPTGITIDPLDNLYIADYRNNRIRKIS 302



 Score = 43.9 bits (102), Expect = 0.034,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 35/56 (62%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S   VV  LAG+G  G  +G   ++ F+ P   A D++GN+++ D  N++IRKI+
Sbjct: 301 ISPSGVVTTLAGNGSAGNLNGVGTNSSFNGPIMLAFDVQGNLFITDFQNNLIRKIS 356



 Score = 35.8 bits (81), Expect = 9.8,   Method: Composition-based stats.
 Identities = 19/50 (38%), Positives = 24/50 (48%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V   AG G     +G   S+ F+ P     D + NIYV D  +  IRKIT
Sbjct: 63  VSTFAGSGAASSYNGTGLSSGFNMPSGIGSDAQSNIYVCDYGSGAIRKIT 112


>gi|418714767|ref|ZP_13275259.1| putative lipoprotein [Leptospira interrogans str. UI 08452]
 gi|410788941|gb|EKR82646.1| putative lipoprotein [Leptospira interrogans str. UI 08452]
          Length = 358

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+ N++IRKI     GN      NL  +D S
Sbjct: 96  DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 38.9 bits (89), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
            V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259



 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           + G  +G+  ++ FD P    +D + N+YV + SNH IRKI NL+ G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209


>gi|418707795|ref|ZP_13268612.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. UI 08368]
 gi|410771897|gb|EKR47094.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. UI 08368]
 gi|456968671|gb|EMG09839.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. LT2186]
          Length = 358

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+ N++IRKI     GN      NL  +D S
Sbjct: 96  DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI
Sbjct: 217 GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259



 Score = 36.6 bits (83), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           G  +G+  ++ FD P    +D + N+YV + SNH IRKI NL+ G
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209


>gi|417772389|ref|ZP_12420278.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Pomona]
 gi|418684105|ref|ZP_13245295.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Kennewicki LC82-25]
 gi|400324210|gb|EJO76509.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Kennewicki LC82-25]
 gi|409945760|gb|EKN95775.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Pomona]
 gi|455670390|gb|EMF35375.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Fox 32256]
          Length = 358

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+ N++IRKI     GN      NL  +D S
Sbjct: 96  DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
            V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259



 Score = 37.0 bits (84), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           G  +G+  ++ FD P    +D + N+YV + SNH IRKI NL+ G
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209


>gi|418707100|ref|ZP_13267936.1| putative lipoprotein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|410763306|gb|EKR34037.1| putative lipoprotein [Leptospira interrogans serovar Hebdomadis
           str. R499]
          Length = 358

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+ N++IRKI     GN      NL  +D S
Sbjct: 96  DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 38.9 bits (89), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
            V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259



 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           + G  +G+  ++ FD P    +D + N+YV + SNH IRKI NL+ G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209


>gi|24215844|ref|NP_713325.1| hypothetical protein LA_3145 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45656856|ref|YP_000942.1| hypothetical protein LIC10968 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|386074980|ref|YP_005989298.1| hypothetical protein LIF_A2535 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|417765006|ref|ZP_12412972.1| putative lipoprotein [Leptospira interrogans serovar Bulgarica str.
           Mallika]
 gi|418666993|ref|ZP_13228411.1| putative lipoprotein [Leptospira interrogans serovar Pyrogenes str.
           2006006960]
 gi|418699338|ref|ZP_13260303.1| putative lipoprotein [Leptospira interrogans serovar Bataviae str.
           L1111]
 gi|418723579|ref|ZP_13282417.1| putative lipoprotein [Leptospira interrogans str. UI 12621]
 gi|418727930|ref|ZP_13286513.1| putative lipoprotein [Leptospira interrogans str. UI 12758]
 gi|421083673|ref|ZP_15544545.1| putative lipoprotein [Leptospira santarosai str. HAI1594]
 gi|421102269|ref|ZP_15562874.1| putative lipoprotein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|421122244|ref|ZP_15582528.1| putative lipoprotein [Leptospira interrogans str. Brem 329]
 gi|24197042|gb|AAN50343.1| hypothetical protein LA_3145 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45600092|gb|AAS69579.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|353458770|gb|AER03315.1| hypothetical protein LIF_A2535 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|400352606|gb|EJP04785.1| putative lipoprotein [Leptospira interrogans serovar Bulgarica str.
           Mallika]
 gi|409962948|gb|EKO26678.1| putative lipoprotein [Leptospira interrogans str. UI 12621]
 gi|410344796|gb|EKO95953.1| putative lipoprotein [Leptospira interrogans str. Brem 329]
 gi|410367934|gb|EKP23317.1| putative lipoprotein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410433778|gb|EKP78117.1| putative lipoprotein [Leptospira santarosai str. HAI1594]
 gi|410757315|gb|EKR18927.1| putative lipoprotein [Leptospira interrogans serovar Pyrogenes str.
           2006006960]
 gi|410761708|gb|EKR27881.1| putative lipoprotein [Leptospira interrogans serovar Bataviae str.
           L1111]
 gi|410777322|gb|EKR57287.1| putative lipoprotein [Leptospira interrogans str. UI 12758]
 gi|456822102|gb|EMF70597.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
           LT1962]
          Length = 358

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+ N++IRKI     GN      NL  +D S
Sbjct: 96  DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
            V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259



 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           + G  +G+  ++ FD P    +D + N+YV + SNH IRKI NL+ G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209


>gi|417783289|ref|ZP_12431009.1| putative lipoprotein [Leptospira interrogans str. C10069]
 gi|409953418|gb|EKO07917.1| putative lipoprotein [Leptospira interrogans str. C10069]
          Length = 358

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+ N++IRKI     GN      NL  +D S
Sbjct: 96  DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 39.3 bits (90), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           +V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI
Sbjct: 210 MVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259



 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           + G  +G+  ++ FD P    +D + N+YV + SNH IRKI NL+ G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209


>gi|421861496|ref|ZP_16293498.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
 gi|410828922|dbj|GAC43935.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
          Length = 575

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 35/55 (63%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           L +   V  +AG G+ G+ DG    ARF++P+  AV   G++YVAD  NHV+R+I
Sbjct: 176 LDKSGRVTTVAGSGLAGWRDGTGSEARFNEPRDVAVAEDGSLYVADALNHVLRRI 230



 Score = 48.9 bits (115), Expect = 0.001,   Method: Composition-based stats.
 Identities = 19/38 (50%), Positives = 28/38 (73%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++DG+   ARF++P   A D +GN+Y+AD  NHVIRK+
Sbjct: 139 WTDGKGTEARFNQPMGMAEDRQGNLYIADAGNHVIRKL 176



 Score = 42.0 bits (97), Expect = 0.12,   Method: Composition-based stats.
 Identities = 30/105 (28%), Positives = 43/105 (40%), Gaps = 19/105 (18%)

Query: 44  EINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-------------- 89
           + N  S +    S DL+V D+       ++    ++  V  LAG G              
Sbjct: 269 KFNEPSSLAFTSSGDLVVSDTGNQRLRLVNL---KQKYVTTLAGAGSVASYSCKFPDAQL 325

Query: 90  --VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
               GY D E   A F+ P   A+  +G I VAD+ NH IR + N
Sbjct: 326 YAAGGYRDSEASEALFNGPAGIAITAEGGIIVADRWNHAIRYLFN 370



 Score = 38.1 bits (87), Expect = 1.6,   Method: Composition-based stats.
 Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 13/66 (19%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSK 152
           Y+DG+ G ++F++P S A    G++ V+D              GNQ +R +NLK +  + 
Sbjct: 260 YADGKLGESKFNEPSSLAFTSSGDLVVSDT-------------GNQRLRLVNLKQKYVTT 306

Query: 153 SSQSGS 158
            + +GS
Sbjct: 307 LAGAGS 312



 Score = 37.7 bits (86), Expect = 2.4,   Method: Composition-based stats.
 Identities = 17/44 (38%), Positives = 21/44 (47%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           AG GV G  DG    A F  P        G + +AD  NH+IR+
Sbjct: 72  AGSGVSGSVDGRAERAEFRYPSGLLAGKNGELLIADTYNHLIRR 115


>gi|417763451|ref|ZP_12411429.1| putative lipoprotein [Leptospira interrogans str. 2002000624]
 gi|417776933|ref|ZP_12424763.1| putative lipoprotein [Leptospira interrogans str. 2002000621]
 gi|418671130|ref|ZP_13232484.1| putative lipoprotein [Leptospira interrogans str. 2002000623]
 gi|409940657|gb|EKN86296.1| putative lipoprotein [Leptospira interrogans str. 2002000624]
 gi|410573292|gb|EKQ36344.1| putative lipoprotein [Leptospira interrogans str. 2002000621]
 gi|410581799|gb|EKQ49606.1| putative lipoprotein [Leptospira interrogans str. 2002000623]
          Length = 358

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTTSTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+ N++IRKI     GN      NL  +D S
Sbjct: 96  DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
            V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259



 Score = 35.8 bits (81), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           + G  +G+  ++ FD P    +D + N+YV + +NH IRKI NL+ G
Sbjct: 164 LSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELNNHTIRKI-NLNSG 209


>gi|445496654|ref|ZP_21463509.1| NHL repeat containing protein [Janthinobacterium sp. HH01]
 gi|444786649|gb|ELX08197.1| NHL repeat containing protein [Janthinobacterium sp. HH01]
          Length = 741

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 33/80 (41%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
           ID  G  +L V D+  +A   ++     E VV  LAGDG+ G  DG   +A+F+ P   A
Sbjct: 165 IDAAG--NLYVADTGNNAIRKVT----PEGVVSTLAGDGLPGDKDGRGAAAQFNGPVGIA 218

Query: 111 VDMKGNIYVADKSNHVIRKI 130
           VD  G +YVAD  N  IR+I
Sbjct: 219 VDAAGVVYVADTYNDRIRRI 238



 Score = 53.1 bits (126), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 24/47 (51%), Positives = 33/47 (70%), Gaps = 1/47 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LAG GV+GY++G   +A F+ P   A+D  GN+YVAD  N+ IRK+T
Sbjct: 140 LAG-GVEGYAEGAGKAAAFNTPSGLAIDAAGNLYVADTGNNAIRKVT 185



 Score = 46.2 bits (108), Expect = 0.006,   Method: Composition-based stats.
 Identities = 20/49 (40%), Positives = 27/49 (55%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  +AG    G +DG    A FD P   A+   G++Y+AD  NH IRK+
Sbjct: 244 VTTIAGGSRAGKADGAAAQALFDTPTGLALSAAGDLYIADTGNHAIRKL 292



 Score = 43.9 bits (102), Expect = 0.032,   Method: Composition-based stats.
 Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD-KSNHVIRKI 130
           V  +AGDG+ G ++G   S RF  P    +D  GN+YVAD   N+ IRKI
Sbjct: 82  VTSIAGDGLPGANNGHGRSTRFADPFGVVIDGAGNLYVADGGDNNSIRKI 131


>gi|443625938|ref|ZP_21110373.1| putative NHL repeat protein [Streptomyces viridochromogenes Tue57]
 gi|443340614|gb|ELS54821.1| putative NHL repeat protein [Streptomyces viridochromogenes Tue57]
          Length = 632

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 25/132 (18%)

Query: 39  DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
           DG  +   P   + R PG      S D +V D++R     L+     E+VV+R+ G G +
Sbjct: 169 DGPYVAPEPEPTVLRFPGKVLLLPSGDFLVSDTTRHQLVELAG--DGETVVRRI-GQGSR 225

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           G +DG    ARF +P+  A+   G + VAD  NH++R+                 PE   
Sbjct: 226 GSTDGYADRARFQEPQGLALLPDGTVAVADTVNHLVRR---------------FDPETGR 270

Query: 152 KSSQSGSGMFWV 163
            ++ +G+G+ W+
Sbjct: 271 TTTLAGTGIQWM 282


>gi|225872818|ref|YP_002754275.1| hypothetical protein ACP_1177 [Acidobacterium capsulatum ATCC
           51196]
 gi|225794459|gb|ACO34549.1| hypothetical protein ACP_1177 [Acidobacterium capsulatum ATCC
           51196]
          Length = 855

 Score = 53.5 bits (127), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 33/86 (38%), Positives = 48/86 (55%), Gaps = 6/86 (6%)

Query: 49  SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKP 106
           S +   GS +L + D++ +    +S       VV  +AG+G +GYS DG    SA  D P
Sbjct: 82  SAVAYDGSGNLYIADTNNNVIREVS----TTGVVTTIAGNGEEGYSGDGAAATSAMLDTP 137

Query: 107 KSFAVDMKGNIYVADKSNHVIRKITN 132
              AVD  GNIY+AD  N+ IR+++N
Sbjct: 138 TGIAVDSNGNIYIADSHNNRIREVSN 163



 Score = 46.2 bits (108), Expect = 0.006,   Method: Composition-based stats.
 Identities = 24/52 (46%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  +AG+G  GYS DG    SA  D P + AVD  GNIY+AD  N  IR +
Sbjct: 165 IINTVAGNGTAGYSGDGAAATSAMLDDPTAVAVDASGNIYIADTGNQRIRNV 216



 Score = 43.5 bits (101), Expect = 0.041,   Method: Composition-based stats.
 Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 85  LAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +AG G  GYS   G   SA F  P + A D  GN+Y+AD +N+VIR+++
Sbjct: 58  VAGTGSAGYSGNGGAATSAAFASPSAVAYDGSGNLYIADTNNNVIREVS 106



 Score = 42.4 bits (98), Expect = 0.11,   Method: Composition-based stats.
 Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 2/52 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AG+G +GYS DG   ++   D P   AVD  GNIY+AD  N+ IR+++
Sbjct: 221 IHTVAGNGEEGYSGDGAAAASAELDTPTGIAVDSSGNIYIADSHNNRIREVS 272



 Score = 37.4 bits (85), Expect = 2.9,   Method: Composition-based stats.
 Identities = 21/53 (39%), Positives = 33/53 (62%), Gaps = 2/53 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V+  +AG+G QGYS DG    SA  + P++ +V+  G++ VAD  N  +R +T
Sbjct: 334 VIATVAGNGQQGYSGDGAAATSAALNDPRNASVNASGSVAVADTLNERVRGLT 386



 Score = 35.8 bits (81), Expect = 9.1,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 6/58 (10%)

Query: 81  VVKRLAGDGV---QGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           V+  +AG G     G   G+ GSA      KP   A+D  G++Y+AD +N  +R+I N
Sbjct: 275 VINTVAGSGAVTFPGSYSGDGGSATAATLAKPTGVALDAAGHVYIADTNNERLREIAN 332


>gi|330824371|ref|YP_004387674.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
 gi|329309743|gb|AEB84158.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
          Length = 362

 Score = 53.5 bits (127), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 36/55 (65%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           L     V  +AG G  G++DG   +ARF++P + AV+  G +YVAD++NH IRKI
Sbjct: 83  LRSGGTVATVAGTGKAGFADGPAHAARFNEPIAVAVERSGAVYVADRNNHRIRKI 137



 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 36/50 (72%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAG+G  G++DG+  +ARF++P +  +D +G ++V D+ NH +R++T
Sbjct: 199 VDTLAGNGAPGFADGQRATARFNQPYNVRIDGQGRLWVPDQLNHAVRRVT 248



 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 7/58 (12%)

Query: 71  TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           TL FP      V+ LAGDG  G +DG PG  R ++P   +    G++Y AD+ NH +R
Sbjct: 31  TLPFPE-----VRTLAGDGQPGAADGSPG--RVNRPHGLSYGNDGSLYFADRGNHQVR 81



 Score = 41.2 bits (95), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 29/50 (58%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  +AG G  GY+DG   + RFD P   A    G + VAD++N+ +R +T
Sbjct: 253 VTTVAGAGKAGYADGPAATVRFDNPTGVAPLPNGAVVVADRNNNRLRLVT 302



 Score = 40.0 bits (92), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
           +  +  V  LAG    G+ DG+  SARF++P   A+D  +  +YVAD  NH IR+I
Sbjct: 137 IRPDGTVITLAGGDSAGFVDGDLKSARFNQPYGVALDAAQTTLYVADYLNHAIRRI 192


>gi|182414082|ref|YP_001819148.1| NHL repeat-containing protein [Opitutus terrae PB90-1]
 gi|177841296|gb|ACB75548.1| NHL repeat containing protein [Opitutus terrae PB90-1]
          Length = 963

 Score = 53.5 bits (127), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 28/60 (46%), Positives = 37/60 (61%), Gaps = 1/60 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           ++   VV  LAGD  + G +DG  G ARF  P   AVD  GNI+VAD  N+ IR++T L+
Sbjct: 253 ITPTGVVLTLAGDPDLVGSADGTGGDARFSSPAGLAVDRDGNIFVADSLNNTIRRVTPLN 312



 Score = 51.6 bits (122), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 32/56 (57%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++    V  LAG+   G +DG    ARF+ P   AVD+ GN YVAD  N  IRKIT
Sbjct: 369 IAPSGAVTTLAGEASVGTADGPGPMARFNYPNGVAVDLAGNTYVADTFNATIRKIT 424



 Score = 49.7 bits (117), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 7/84 (8%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEES---VVKRLAGD-GVQGYSDGEPGSARFDKP 106
           +DR G  ++ V DS  +    ++ PL+  +   VV  +AG  GV G +DG    ARF+ P
Sbjct: 289 VDRDG--NIFVADSLNNTIRRVT-PLNGPAPLGVVTTVAGQAGVTGSADGVGSQARFNLP 345

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
              AVD  GNI+VAD  N  IRKI
Sbjct: 346 YGIAVDAAGNIFVADLGNTTIRKI 369



 Score = 49.3 bits (116), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 25/47 (53%), Positives = 30/47 (63%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LAG    G  DG+  +A F  P S AVD  GN++VAD +NH IRKIT
Sbjct: 44  LAGRPFFGGLDGQGRAAGFTTPSSAAVDQAGNLFVADTTNHTIRKIT 90



 Score = 48.1 bits (113), Expect = 0.002,   Method: Composition-based stats.
 Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 7/84 (8%)

Query: 49  SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPK 107
           + +D+ G  +L V D++      ++        V   AG G Q G  DG   +ARF  P 
Sbjct: 68  AAVDQAG--NLFVADTTNHTIRKIT----PSGTVSTFAGMGGQPGSVDGTGNAARFLSPH 121

Query: 108 SFAVDMKGNIYVADKSNHVIRKIT 131
             A+D  GN+YVAD  N+ IRKIT
Sbjct: 122 GVALDEAGNLYVADSGNNTIRKIT 145



 Score = 45.4 bits (106), Expect = 0.011,   Method: Composition-based stats.
 Identities = 22/57 (38%), Positives = 34/57 (59%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ + +V  LAG   Q G  DG   +ARF +P    +D  GN+YV+D+ +  +RKIT
Sbjct: 532 ITPDGMVSTLAGSPAQRGGIDGTGTAARFVQPAGLTIDAAGNLYVSDRGDFTVRKIT 588



 Score = 42.0 bits (97), Expect = 0.13,   Method: Composition-based stats.
 Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  +AG  G  G +DG   +ARF  P   AV   G +YVAD+ N  IR+IT
Sbjct: 477 ITPAGVVTTIAGVSGNFGSADGPGLAARFAFPNGLAVATDGTLYVADEENSTIRQIT 533



 Score = 39.3 bits (90), Expect = 0.82,   Method: Composition-based stats.
 Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  LAG   Q G +DG   +ARF+ P   AVD  GN+Y    S   +RKIT
Sbjct: 423 ITPAGVVSTLAGAAGQIGSADGTGSAARFEFPLGIAVDRAGNVYTTANSA-TVRKIT 478



 Score = 38.1 bits (87), Expect = 1.7,   Method: Composition-based stats.
 Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 2/51 (3%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  +AG  G+ G +DG   +A+F      A+D +G +YVAD SN+ IR+IT
Sbjct: 593 VTTVAGQHGIAGGADGTGSAAQFAYAGGIAIDRRGTLYVAD-SNNRIRQIT 642


>gi|42522312|ref|NP_967692.1| hypothetical protein Bd0727 [Bdellovibrio bacteriovorus HD100]
 gi|39574843|emb|CAE78685.1| hypothetical protein with NHL repeat [Bdellovibrio bacteriovorus
           HD100]
          Length = 709

 Score = 53.5 bits (127), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 30/78 (38%), Positives = 43/78 (55%), Gaps = 6/78 (7%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
            + D+ ++DS+ S    +S       VV   AG  GV  ++DG   SARFD P    +D 
Sbjct: 85  AAGDIFIVDSNASVIRKIS-----NGVVSTFAGKFGVFDHADGTGDSARFDYPTGITIDG 139

Query: 114 KGNIYVADKSNHVIRKIT 131
            GN++V + +NH IRKIT
Sbjct: 140 SGNLFVTEGNNHTIRKIT 157



 Score = 48.9 bits (115), Expect = 0.001,   Method: Composition-based stats.
 Identities = 30/77 (38%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
            S +L V DS  ++   ++       V   LAGD   G +DG   +A F  P+  A D  
Sbjct: 358 ASGNLFVTDSDNASIRKVTPARVVTLVAGSLAGDS-DGSADGTGTAASFHSPEGVAADPA 416

Query: 115 GNIYVADKSNHVIRKIT 131
           GN+YVAD  N  IRKIT
Sbjct: 417 GNLYVADTMNRTIRKIT 433



 Score = 47.0 bits (110), Expect = 0.004,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 41/78 (52%), Gaps = 5/78 (6%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           G+  L V DS     YT+   ++   VV  LAG  G+QG  DG    A F +     V  
Sbjct: 523 GAGSLYVSDSGN---YTVR-KVTLAGVVTTLAGQVGIQGSDDGTGTGATFSRVAGITVTP 578

Query: 114 KGNIYVADKSNHVIRKIT 131
            GNI+VAD  N+VIRKIT
Sbjct: 579 SGNIFVADTDNNVIRKIT 596



 Score = 46.2 bits (108), Expect = 0.007,   Method: Composition-based stats.
 Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 5/78 (6%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           GS +L V + +      ++      +VV  +AG  G  G +DG   +ARF+ P+   +  
Sbjct: 139 GSGNLFVTEGNNHTIRKIT----PAAVVTTVAGSPGNAGTADGTGSAARFNNPEDITLAA 194

Query: 114 KGNIYVADKSNHVIRKIT 131
            GN Y+ DK+N++IRK+T
Sbjct: 195 DGNFYITDKNNNMIRKMT 212



 Score = 45.4 bits (106), Expect = 0.011,   Method: Composition-based stats.
 Identities = 29/77 (37%), Positives = 38/77 (49%), Gaps = 5/77 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
           S +L V D S SA   ++      +VV   AG  G  G  DG   +ARF  P    +D  
Sbjct: 304 SDNLYVADYSNSAIRKVT----SSAVVSNFAGSYGDYGAVDGTGTAARFAGPAGVGIDAS 359

Query: 115 GNIYVADKSNHVIRKIT 131
           GN++V D  N  IRK+T
Sbjct: 360 GNLFVTDSDNASIRKVT 376



 Score = 43.1 bits (100), Expect = 0.054,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 29/56 (51%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV   AGDG  G +DG   +A F+ P     D  GN++V   S   IRKIT
Sbjct: 211 MTPAGVVTTFAGDGTYGCTDGTGAAAHFNYPTGIVGDSAGNLFVVCSSCSTIRKIT 266



 Score = 42.4 bits (98), Expect = 0.096,   Method: Composition-based stats.
 Identities = 27/75 (36%), Positives = 38/75 (50%), Gaps = 1/75 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           L+   VV  LAGD    G +DG   SARF      A D  G++YV+D  N+ +RK+T   
Sbjct: 485 LTPGGVVTSLAGDYDNSGSADGTGTSARFGGVAGIASDGAGSLYVSDSGNYTVRKVTLAG 544

Query: 135 HGNQLIRQINLKPED 149
               L  Q+ ++  D
Sbjct: 545 VVTTLAGQVGIQGSD 559



 Score = 42.0 bits (97), Expect = 0.11,   Method: Composition-based stats.
 Identities = 29/79 (36%), Positives = 38/79 (48%), Gaps = 6/79 (7%)

Query: 60  IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVDM 113
           IV DS+ + F   S       ++   VV   AG     G  DG   +ARF  P    +D 
Sbjct: 244 IVGDSAGNLFVVCSSCSTIRKITPAGVVTTFAGQAYTTGAVDGTGTAARFSWPVGITIDS 303

Query: 114 KGNIYVADKSNHVIRKITN 132
             N+YVAD SN  IRK+T+
Sbjct: 304 SDNLYVADYSNSAIRKVTS 322



 Score = 42.0 bits (97), Expect = 0.14,   Method: Composition-based stats.
 Identities = 25/53 (47%), Positives = 30/53 (56%), Gaps = 1/53 (1%)

Query: 81  VVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           VV   AG   QG +D   GS ARF +P   A D  GN+YVA+     IRKIT+
Sbjct: 600 VVTTFAGAAGQGGNDDGMGSNARFSQPHFVATDSSGNLYVAEWGEATIRKITS 652



 Score = 36.6 bits (83), Expect = 4.6,   Method: Composition-based stats.
 Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 3/51 (5%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  +AG  G  G +DG   +ARF  P    V   GNIYVAD+  + IRK+T
Sbjct: 438 VTTIAGSPGQIGSADGTGAAARFSYPTKLTVAEDGNIYVADE--YRIRKLT 486


>gi|421117009|ref|ZP_15577380.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
           Fiocruz LV133]
 gi|410011441|gb|EKO69561.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
           Fiocruz LV133]
          Length = 358

 Score = 53.5 bits (127), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTISTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+ N++IRKI     GN      NL  +D S
Sbjct: 96  DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
            V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259



 Score = 37.0 bits (84), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           G  +G+  ++ FD P    +D + N+YV + SNH IRKI NL+ G
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209


>gi|418688943|ref|ZP_13250072.1| putative lipoprotein [Leptospira interrogans str. FPW2026]
 gi|421128059|ref|ZP_15588277.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. 2006006986]
 gi|421134520|ref|ZP_15594655.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. Andaman]
 gi|400361935|gb|EJP17894.1| putative lipoprotein [Leptospira interrogans str. FPW2026]
 gi|410021314|gb|EKO88104.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. Andaman]
 gi|410434526|gb|EKP83664.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. 2006006986]
          Length = 358

 Score = 53.5 bits (127), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNIYV+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAASVDGTISTASFKTPFGLEVDTSGNIYVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+ N++IRKI     GN      NL  +D S
Sbjct: 96  DQINNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
            V  L+G G+ GY DG+  SARF  P   A + K N +  AD  +H IRKI
Sbjct: 210 TVSTLSG-GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKI 259



 Score = 36.6 bits (83), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           G  +G+  ++ FD P    +D + N+YV + SNH IRKI NL+ G
Sbjct: 166 GLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLNSG 209


>gi|110639161|ref|YP_679370.1| hypothetical protein CHU_2779 [Cytophaga hutchinsonii ATCC 33406]
 gi|110281842|gb|ABG60028.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
          Length = 1051

 Score = 53.5 bits (127), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 36/57 (63%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++   VV   AG G  G +DG   +A+F+ P   A+D  GN+YVAD  N++IRKIT+
Sbjct: 135 ITPAGVVITFAGSGTAGSNDGTGTAAQFNNPYGMAIDASGNLYVADSGNNLIRKITS 191



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 23/51 (45%), Positives = 31/51 (60%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +V  +AG+G  G  D    +ARF+ P +  +D   NIY+AD  NH IRKIT
Sbjct: 86  IVSTIAGNGTSGLIDATGSAARFNLPAAVVLDAAQNIYIADNGNHCIRKIT 136



 Score = 50.1 bits (118), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 32/88 (36%), Positives = 48/88 (54%), Gaps = 7/88 (7%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           NP+ + ID   S +L V DS  +    L   ++   VV  +AG+   GY DG   +ARF 
Sbjct: 164 NPYGMAID--ASGNLYVADSGNN----LIRKITSSGVVTTIAGNTTPGYVDGTGTAARFY 217

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITN 132
            P +  +D+ GN ++ D  NH IRK+T+
Sbjct: 218 LPVNITLDVSGNFFITDNRNHRIRKMTS 245



 Score = 49.7 bits (117), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 24/57 (42%), Positives = 35/57 (61%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++   VV  +AG G  GY +G   +A+F++P    VD   N+YV D +N VIRKIT+
Sbjct: 243 MTSAGVVSTVAGSGSAGYMNGTGVTAQFNRPYGIVVDAFSNLYVTDTNNGVIRKITS 299



 Score = 46.2 bits (108), Expect = 0.007,   Method: Composition-based stats.
 Identities = 23/75 (30%), Positives = 38/75 (50%), Gaps = 4/75 (5%)

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           S+L V D++      ++       VV   AG G  G++DG   +A+F  P    ++  G+
Sbjct: 282 SNLYVTDTNNGVIRKIT----SSGVVSTYAGTGTPGFADGPAANAQFQWPTGLTINASGD 337

Query: 117 IYVADKSNHVIRKIT 131
           +Y AD   H +RK+T
Sbjct: 338 LYEADNETHRVRKVT 352


>gi|426402696|ref|YP_007021667.1| hypothetical protein Bdt_0693 [Bdellovibrio bacteriovorus str.
           Tiberius]
 gi|425859364|gb|AFY00400.1| hypothetical protein Bdt_0693 [Bdellovibrio bacteriovorus str.
           Tiberius]
          Length = 692

 Score = 53.5 bits (127), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 30/78 (38%), Positives = 43/78 (55%), Gaps = 6/78 (7%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
            + D+ ++DS+ S    +S       VV   AG  GV  ++DG   SARFD P    +D 
Sbjct: 68  AAGDIFIVDSNASVIRKIS-----NGVVSTFAGKFGVFDHADGTGDSARFDYPTGITIDG 122

Query: 114 KGNIYVADKSNHVIRKIT 131
            GN++V + +NH IRKIT
Sbjct: 123 SGNLFVTEGNNHTIRKIT 140



 Score = 50.1 bits (118), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
            S DL V DS  ++   ++       V   LAGD   G +DG   +A F  P+  A D  
Sbjct: 341 ASGDLFVTDSDNASIRKITPARVVTLVAGSLAGDS-DGSADGTGTAASFFSPEGVAADPA 399

Query: 115 GNIYVADKSNHVIRKIT 131
           GN+YVAD  N  IRKIT
Sbjct: 400 GNLYVADTMNRTIRKIT 416



 Score = 46.6 bits (109), Expect = 0.005,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 41/78 (52%), Gaps = 5/78 (6%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           G+  L V DS     YT+   ++   VV  LAG  G+QG  DG    A F +     V  
Sbjct: 506 GAGSLYVSDSGN---YTVR-KVTLAGVVTTLAGQVGIQGSDDGTGTGATFSRVAGITVTP 561

Query: 114 KGNIYVADKSNHVIRKIT 131
            GNI+VAD  N+VIRKIT
Sbjct: 562 SGNIFVADTDNNVIRKIT 579



 Score = 45.8 bits (107), Expect = 0.008,   Method: Composition-based stats.
 Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 5/78 (6%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           GS +L V + +      ++      +VV  +AG  G  G +DG   +ARF+ P+   +  
Sbjct: 122 GSGNLFVTEGNNHTIRKIT----PAAVVTTVAGSPGNAGTADGTGSAARFNNPEDITLAA 177

Query: 114 KGNIYVADKSNHVIRKIT 131
            GN Y+ DK+N++IRK+T
Sbjct: 178 DGNFYITDKNNNMIRKMT 195



 Score = 43.1 bits (100), Expect = 0.060,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 29/56 (51%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV   AGDG  G +DG   +A F+ P     D  GN++V   S   IRKIT
Sbjct: 194 MTPAGVVTTFAGDGTYGCTDGTGAAAHFNYPTGIVGDSAGNLFVVCSSCSTIRKIT 249



 Score = 42.7 bits (99), Expect = 0.071,   Method: Composition-based stats.
 Identities = 28/77 (36%), Positives = 37/77 (48%), Gaps = 5/77 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
           S +L V D   SA   ++      +VV   AG  G  G  DG   +ARF  P    +D  
Sbjct: 287 SDNLYVADYGNSAIRKVT----SSAVVSNFAGSYGDYGAVDGTGTAARFAGPAGIGIDAS 342

Query: 115 GNIYVADKSNHVIRKIT 131
           G+++V D  N  IRKIT
Sbjct: 343 GDLFVTDSDNASIRKIT 359



 Score = 42.0 bits (97), Expect = 0.11,   Method: Composition-based stats.
 Identities = 27/75 (36%), Positives = 38/75 (50%), Gaps = 1/75 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           L+   VV  LAGD    G +DG   SARF      A D  G++YV+D  N+ +RK+T   
Sbjct: 468 LTPGGVVTSLAGDYDNSGSADGTGTSARFGGVAGIASDGAGSLYVSDSGNYTVRKVTLAG 527

Query: 135 HGNQLIRQINLKPED 149
               L  Q+ ++  D
Sbjct: 528 VVTTLAGQVGIQGSD 542



 Score = 41.6 bits (96), Expect = 0.18,   Method: Composition-based stats.
 Identities = 25/52 (48%), Positives = 29/52 (55%), Gaps = 1/52 (1%)

Query: 81  VVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           VV   AG   QG +D   GS ARF +P   A D  GN+YVA+     IRKIT
Sbjct: 583 VVTTFAGAAGQGGNDDGMGSNARFSQPHFVATDSSGNLYVAEWGEATIRKIT 634



 Score = 38.9 bits (89), Expect = 1.0,   Method: Composition-based stats.
 Identities = 28/79 (35%), Positives = 37/79 (46%), Gaps = 6/79 (7%)

Query: 60  IVLDSSRSAFYTLSF-----PLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDM 113
           IV DS+ + F   S       ++   VV   AG     G  DG   +ARF  P    +D 
Sbjct: 227 IVGDSAGNLFVVCSSCSTIRKITPAGVVTTFAGLANATGALDGTGTAARFSWPIGITIDS 286

Query: 114 KGNIYVADKSNHVIRKITN 132
             N+YVAD  N  IRK+T+
Sbjct: 287 SDNLYVADYGNSAIRKVTS 305


>gi|411003633|ref|ZP_11379962.1| hypothetical protein SgloC_12561 [Streptomyces globisporus C-1027]
          Length = 605

 Score = 53.1 bits (126), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 28/72 (38%), Positives = 39/72 (54%), Gaps = 3/72 (4%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
            +V D++R     L    ++   V+R  G G +G SDG PG ARF +P+  AV   G + 
Sbjct: 188 FLVSDTTRHRLVELD---ADGETVRRHFGTGERGLSDGGPGEARFSEPQGLAVLPDGRVA 244

Query: 119 VADKSNHVIRKI 130
           VAD  NH IR +
Sbjct: 245 VADTVNHAIRAL 256


>gi|182414669|ref|YP_001819735.1| SMP-30/gluconolaconase/LRE domain-containing protein [Opitutus
           terrae PB90-1]
 gi|177841883|gb|ACB76135.1| SMP-30/Gluconolaconase/LRE domain protein [Opitutus terrae PB90-1]
          Length = 1292

 Score = 52.8 bits (125), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 27/50 (54%), Positives = 32/50 (64%), Gaps = 1/50 (2%)

Query: 82  VKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  +AG  G+ G  DGE  +ARF+ P   AVD  GN+YVAD  NH IRKI
Sbjct: 456 VSTVAGAHGLDGSLDGEKTAARFNAPHGLAVDQHGNLYVADTFNHSIRKI 505



 Score = 52.0 bits (123), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 22/37 (59%), Positives = 26/37 (70%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           DG    ARF+ PK  AVD  G +YVAD SNH+IRK+T
Sbjct: 209 DGAGSDARFNGPKGIAVDANGTVYVADTSNHIIRKVT 245



 Score = 47.4 bits (111), Expect = 0.003,   Method: Composition-based stats.
 Identities = 33/81 (40%), Positives = 42/81 (51%), Gaps = 9/81 (11%)

Query: 52  DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFA 110
           DR GS  L V D  R         +S E VV  +AG     G  DG   +A F + K  A
Sbjct: 542 DRDGS--LFVADGHRVR------RISPEGVVTTVAGTANATGSIDGVGAAATFGEIKGLA 593

Query: 111 VDMKGNIYVADKSNHVIRKIT 131
           VD  GN++VA+ + HVIR+IT
Sbjct: 594 VDRAGNVFVAENTTHVIRRIT 614



 Score = 47.4 bits (111), Expect = 0.003,   Method: Composition-based stats.
 Identities = 21/40 (52%), Positives = 25/40 (62%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GY DGE   ARF +P   A D  GN+YV D   ++IRKIT
Sbjct: 43  GYVDGEGTQARFHEPAGLACDADGNLYVVDPGTNLIRKIT 82



 Score = 46.6 bits (109), Expect = 0.005,   Method: Composition-based stats.
 Identities = 27/57 (47%), Positives = 34/57 (59%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  LAG  G+ G SDG   +ARF+ P   AVD  G IYV D+S   +RKIT
Sbjct: 244 VTPSGVVTTLAGSPGISGNSDGAGDAARFNAPTDIAVDDAGTIYVVDQSGS-LRKIT 299



 Score = 40.0 bits (92), Expect = 0.50,   Method: Composition-based stats.
 Identities = 20/59 (33%), Positives = 32/59 (54%), Gaps = 10/59 (16%)

Query: 81  VVKRLAGDGVQ----------GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           V++R+  DG            G +DG   +ARF++P   A+D  G++Y+ D  N+ IRK
Sbjct: 609 VIRRITPDGTVVTIGGLAGSIGTADGVGSAARFNEPWGLALDRFGHLYIVDSGNNTIRK 667



 Score = 40.0 bits (92), Expect = 0.52,   Method: Composition-based stats.
 Identities = 19/40 (47%), Positives = 22/40 (55%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  DG   SARF  P+  AV   G +Y+AD  N  IR IT
Sbjct: 98  GLVDGPAASARFGLPQGVAVGADGTVYIADTGNAAIRIIT 137



 Score = 38.1 bits (87), Expect = 1.7,   Method: Composition-based stats.
 Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 2/43 (4%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GV+G  DG   +ARF  P + A D  G+++VAD   H +R+I+
Sbjct: 519 GVEGTVDGIGNAARFGSPTALAFDRDGSLFVAD--GHRVRRIS 559


>gi|373953102|ref|ZP_09613062.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
 gi|373889702|gb|EHQ25599.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
          Length = 3731

 Score = 52.8 bits (125), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 26/51 (50%), Positives = 32/51 (62%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           VV  LAG G  G  +G   +A F+ P S   DM+GN+YVAD  N +IRKIT
Sbjct: 168 VVTTLAGSGSAGSVNGTGTAASFNTPYSLTTDMQGNVYVADYGNQLIRKIT 218



 Score = 52.8 bits (125), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 27/56 (48%), Positives = 33/56 (58%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++   VV   AG G  G +DG   +A F+ P   A D +GNIYVAD  N VIRKIT
Sbjct: 1140 ITPSGVVTTFAGSGSSGAADGIGMAASFNSPYGLATDAQGNIYVADFGNQVIRKIT 1195



 Score = 50.8 bits (120), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 37/57 (64%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
            ++ ++VV  LAG+G  G ++G   +A F+ P     D  GN+YVAD  N++IRK+T+
Sbjct: 1690 ITPDAVVTTLAGNGAAGSANGTGNAASFNHPWGIVADAAGNLYVADTYNNLIRKVTS 1746



 Score = 50.8 bits (120), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 26/57 (45%), Positives = 36/57 (63%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  LAG  G  G+ +G   +A+F+ P+S A D  GN+YVAD+ N  IRKIT
Sbjct: 217 ITPAGVVTTLAGTVGSSGFVNGTGTAAKFNYPRSVATDAAGNVYVADQVNQAIRKIT 273



 Score = 50.4 bits (119), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 32/56 (57%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV   AG GV G  +G   +A F  P    +D +GN+YVAD  NH IRKIT
Sbjct: 272 ITPAGVVTTFAGSGVPGALNGTGTAATFYNPTGVTMDAQGNVYVADSQNHSIRKIT 327



 Score = 50.1 bits (118), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 31/56 (55%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  LAG G  G +DG   +A F  P     D +GN+YVAD  N  IRKIT
Sbjct: 657 ITPAGVVTTLAGSGDSGSADGTGTAASFHTPYDLTTDAQGNVYVADNFNQTIRKIT 712



 Score = 50.1 bits (118), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ E VV   AG  G  G+ +G   +A+F  P   A D +GN+YVAD  N  IRKIT
Sbjct: 711 ITREGVVNTFAGTSGSSGFVNGTAAAAKFKNPIGIATDTQGNVYVADNGNLAIRKIT 767



 Score = 49.7 bits (117), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 37/56 (66%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S   VV   AG GV G ++G   +A F+ P   A D++GN+YV+D ++++IRKIT
Sbjct: 109 ISPVGVVSTFAGSGVAGSANGTGTAASFNNPFGIATDVQGNLYVSDVNSNLIRKIT 164



 Score = 49.3 bits (116), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 27/51 (52%), Positives = 30/51 (58%)

Query: 81   VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            VV  LAG G  G +DG   SA F  P + A D  GNIYV D  N +IRKIT
Sbjct: 1308 VVTTLAGTGSIGSADGVGTSASFYNPNAVATDAVGNIYVVDTYNQLIRKIT 1358



 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 32/56 (57%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++   +V   AG G  G  +G   +A F+ P     D +GNIYV+D +N+ IRKIT
Sbjct: 1086 ITPSGLVTTFAGSGAAGSENGNGAAASFNNPFGLTTDAQGNIYVSDANNNTIRKIT 1141



 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 33/56 (58%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++   VV  LAG    G +DG   +A F+ P   AVD  GN+YVAD  N ++RKIT
Sbjct: 2130 ITAAGVVTTLAGKLTAGSADGVGAAASFNLPAGVAVDASGNVYVADLLNSMVRKIT 2185



 Score = 47.8 bits (112), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/50 (50%), Positives = 31/50 (62%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAGDG  G ++G    + F  P + A D  GNIYVAD  N++IRKIT
Sbjct: 813 VTILAGDGSAGSANGAGTVSNFYVPNALATDALGNIYVADAGNNLIRKIT 862



 Score = 47.8 bits (112), Expect = 0.003,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 33/55 (60%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++   VV  LAG G  G ++G   +A F  P +   D  GN+Y+AD +NH+IRKI
Sbjct: 326 ITPAGVVTTLAGTGSMGSANGAGTNASFYYPNAVVADALGNLYIADTNNHLIRKI 380



 Score = 45.1 bits (105), Expect = 0.013,   Method: Composition-based stats.
 Identities = 23/52 (44%), Positives = 31/52 (59%)

Query: 80   SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            +VV   AG G  G  +    SA F+ P   AVD +GN YV D+ N+++RKIT
Sbjct: 2026 AVVTTFAGSGAAGSVNSTGTSATFNGPLDVAVDAEGNTYVLDQLNNLVRKIT 2077



 Score = 43.5 bits (101), Expect = 0.039,   Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 33/56 (58%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++ +  V  LAG    G +DG   +A F  P +  VD +GNI VAD+ N+ IRKI+
Sbjct: 2184 ITPDGTVTTLAGSTSAGSADGTGAAAGFHYPTNLQVDDQGNIIVADQLNNKIRKIS 2239



 Score = 43.5 bits (101), Expect = 0.040,   Method: Composition-based stats.
 Identities = 22/55 (40%), Positives = 32/55 (58%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +S   VV  +AG G  G ++G   +A F    S A D  G++YVAD+  ++IRKI
Sbjct: 1798 ISPAGVVTTIAGSGASGIANGIGKAATFGNLYSIATDASGDVYVADQYKYIIRKI 1852



 Score = 43.1 bits (100), Expect = 0.060,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 30/56 (53%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++   +V   AG G  G  +G   +A F  P     D +GN+YVAD  N+ IRKIT
Sbjct: 1249 ITPAGLVTTFAGSGGIGALNGTGTAASFHNPTGITTDAQGNVYVADLYNNAIRKIT 1304



 Score = 42.7 bits (99), Expect = 0.067,   Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 30/50 (60%)

Query: 82   VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            V   AG G     DG   +A F+ P + ++D  GN+YVA+ + +VIRKI+
Sbjct: 1750 VTTFAGSGAASSVDGTGTAASFNYPSAISIDASGNLYVAELNGNVIRKIS 1799



 Score = 42.7 bits (99), Expect = 0.080,   Method: Composition-based stats.
 Identities = 19/32 (59%), Positives = 24/32 (75%)

Query: 100  SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            +A F+ P   AVD  GNIYVAD+ N++IRKIT
Sbjct: 2100 AATFNHPTGLAVDAAGNIYVADQGNNMIRKIT 2131



 Score = 42.4 bits (98), Expect = 0.090,   Method: Composition-based stats.
 Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V   AG+ G+ G ++    +A F  P   AVD  GN+YVAD  N++IRKI+
Sbjct: 60  VSTFAGNAGIAGNTNATGTAATFHSPFGVAVDASGNVYVADAGNNLIRKIS 110



 Score = 41.6 bits (96), Expect = 0.14,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 32/59 (54%)

Query: 73   SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            + P    S+V  + G+G  G  +G   +A  +       D+ GN++VAD  NH+IRKIT
Sbjct: 1524 AVPAKTYSLVSTIVGNGSSGAVNGTGTAASLNLCDGLVFDLLGNMFVADFGNHMIRKIT 1582



 Score = 41.6 bits (96), Expect = 0.14,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 33/56 (58%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++  +VV    G G  G ++G+  +A F  P   A+D  GN++VAD+  + IRKIT
Sbjct: 1581 ITPATVVSTFVGTGSPGSTNGKGTAASFYVPYGMAIDAAGNLFVADQFYNQIRKIT 1636



 Score = 41.2 bits (95), Expect = 0.19,   Method: Composition-based stats.
 Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  LAG  G  G ++G   +A F  P   A D  GNIYV+D +N++IRKI
Sbjct: 556 VSTLAGKAGSAGNANGTGTAATFSSPTGVATDPSGNIYVSDYNNNLIRKI 605



 Score = 41.2 bits (95), Expect = 0.19,   Method: Composition-based stats.
 Identities = 18/30 (60%), Positives = 23/30 (76%)

Query: 102  RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            +F+ P   AVD+ GN+YVAD+ N VIRKIT
Sbjct: 1221 KFNSPYDVAVDVTGNVYVADELNQVIRKIT 1250



 Score = 40.8 bits (94), Expect = 0.25,   Method: Composition-based stats.
 Identities = 23/65 (35%), Positives = 36/65 (55%), Gaps = 3/65 (4%)

Query: 69   FYTLSFPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
            FY     ++ + +V   AG   G  G +DG   +A F  P+  A+D  GN++V +  N++
Sbjct: 1628 FYNQIRKITPDGLVTTFAGSLTGAPGATDGTGAAATFRSPRGMAIDALGNLFVVE-DNYL 1686

Query: 127  IRKIT 131
            IRKIT
Sbjct: 1687 IRKIT 1691



 Score = 40.4 bits (93), Expect = 0.35,   Method: Composition-based stats.
 Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 82   VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            V  +AG  G+ G ++G   +A F  P     D  GN+Y+AD +N +IRKIT
Sbjct: 1037 VSTVAGSVGIAGKANGIGTAATFSGPSGVTTDASGNLYIADFNNRLIRKIT 1087


>gi|344205849|ref|YP_004790990.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
 gi|343777211|gb|AEM49764.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
          Length = 693

 Score = 52.8 bits (125), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 37/95 (38%), Positives = 47/95 (49%), Gaps = 7/95 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           +DG  L+   N  S I      +L V D+   A   +S     +  V  LAG G QG++D
Sbjct: 114 VDGPALQASFNTPSGIAADVQGNLYVADTGNHAIRRIS----TDGQVTTLAG-GEQGHAD 168

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G    ARFD P   AVD +G +YVAD  N  IR I
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 203



 Score = 49.7 bits (117), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 23/50 (46%), Positives = 32/50 (64%), Gaps = 1/50 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V+ +AG G +G  DG    A F+ P   A D++GN+YVAD  NH IR+I+
Sbjct: 103 VETVAGQG-EGRVDGPALQASFNTPSGIAADVQGNLYVADTGNHAIRRIS 151



 Score = 44.7 bits (104), Expect = 0.017,   Method: Composition-based stats.
 Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 5/102 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGD 88
           VTT+  G Q   +  +   R  +   I +D+    +   +F      +  +  V+ LAG 
Sbjct: 156 VTTLAGGEQGHADGPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGG 215

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
              G++DG   +ARFD P + A D  G + VAD  N+ +R++
Sbjct: 216 DRPGFADGVGTAARFDTPVALAFDAHGALLVADLFNNAVRRV 257


>gi|424666862|ref|ZP_18103887.1| hypothetical protein A1OC_00420 [Stenotrophomonas maltophilia
           Ab55555]
 gi|401069531|gb|EJP78052.1| hypothetical protein A1OC_00420 [Stenotrophomonas maltophilia
           Ab55555]
          Length = 693

 Score = 52.8 bits (125), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 36/95 (37%), Positives = 46/95 (48%), Gaps = 7/95 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           +DG  L+   N  S I      +L V D+   A   +      +  V  LAG G QG++D
Sbjct: 114 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIGI----DGQVTTLAG-GEQGHAD 168

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G    ARFD P   AVD +G +YVAD  N  IR I
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 203



 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 1/49 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I
Sbjct: 103 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRI 150



 Score = 44.3 bits (103), Expect = 0.026,   Method: Composition-based stats.
 Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 5/102 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGD 88
           VTT+  G Q   +  +   R  +   I +D+    +   +F      +  +  V+ LAG 
Sbjct: 156 VTTLAGGEQGHADGPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGG 215

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
              G +DG   +ARFD P + A D +G + VAD  N+ +R++
Sbjct: 216 DRPGLADGVGAAARFDTPVALAFDAQGALLVADLFNNAVRRV 257


>gi|223934698|ref|ZP_03626618.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223896653|gb|EEF63094.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 847

 Score = 52.8 bits (125), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 28/56 (50%), Positives = 35/56 (62%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  LAG   +G  +G   SARF  P++ AVD + NIYVAD  N VIRKIT
Sbjct: 358 MTAAGVVTTLAGSPSEGSINGVTSSARFYSPQNVAVDGQNNIYVADTQNSVIRKIT 413



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 43/78 (55%), Gaps = 5/78 (6%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDM 113
           G +++ V D+  S    ++ P     VV  LAG  GV G +DG   +A F  P+  AVD 
Sbjct: 395 GQNNIYVADTQNSVIRKIT-PFG---VVSVLAGTTGVFGSADGSGANALFSGPQGIAVDG 450

Query: 114 KGNIYVADKSNHVIRKIT 131
            GNIYVAD  N  IRKIT
Sbjct: 451 GGNIYVADTGNSTIRKIT 468



 Score = 48.1 bits (113), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/57 (42%), Positives = 35/57 (61%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   + + LAG  GVQG +DG    A F++P   A+D  GNIYV+D  +  IRK+T
Sbjct: 85  VTASGLCRTLAGTAGVQGSADGMGAQASFNQPSGIALDSDGNIYVSDYGSSTIRKVT 141



 Score = 46.2 bits (108), Expect = 0.007,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S   VV  LAG  G  G +DG   SARF  P+  AVD  G +++AD +N  IR +T
Sbjct: 303 ISPAGVVTTLAGLSGTAGSADGTNSSARFLGPQGVAVDSTGTVFIADTANSTIRVMT 359



 Score = 45.4 bits (106), Expect = 0.011,   Method: Composition-based stats.
 Identities = 22/51 (43%), Positives = 33/51 (64%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++  +AG   +G +DG  GSA+   P+  AVD  GN+YVAD  N++IR +T
Sbjct: 36  LIGTVAGYAGKGSADGVGGSAQLFGPQGVAVDGAGNVYVADTGNNIIRVVT 86



 Score = 45.1 bits (105), Expect = 0.015,   Method: Composition-based stats.
 Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 3/77 (3%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           G  ++ V D+  S    ++ P    S +   AG+   G +DG   +A+F +P+  AVD  
Sbjct: 450 GGGNIYVADTGNSTIRKIT-PSGSTSTLAGSAGN--PGNADGAGITAQFYQPQGVAVDSA 506

Query: 115 GNIYVADKSNHVIRKIT 131
            N+YVAD  NH +R +T
Sbjct: 507 NNVYVADTGNHTVRMVT 523



 Score = 43.9 bits (102), Expect = 0.029,   Method: Composition-based stats.
 Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 1/49 (2%)

Query: 85  LAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           LAG  G  G  DG    ARF+ P   AVD  GN+YV D +N  IRK+T+
Sbjct: 531 LAGLAGTFGTFDGTNAGARFNGPTGIAVDGAGNLYVTDYNNDTIRKVTS 579



 Score = 42.4 bits (98), Expect = 0.100,   Method: Composition-based stats.
 Identities = 33/93 (35%), Positives = 42/93 (45%), Gaps = 5/93 (5%)

Query: 40  GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEP 98
           G Q   N  S I      ++ V D   S    ++    +   V  LAG  GV G  +   
Sbjct: 108 GAQASFNQPSGIALDSDGNIYVSDYGSSTIRKVT----QSGQVTTLAGMTGVTGSVNNTG 163

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            +A F  P   AVD   N+YVAD  NH+IRKIT
Sbjct: 164 TNALFFHPMGLAVDNATNLYVADYGNHLIRKIT 196



 Score = 40.0 bits (92), Expect = 0.53,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 30/57 (52%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++    V  LAG  G+ G  DG   SA F  P   +VD  GN+YV D  N  +RK+T
Sbjct: 577 VTSAGAVTTLAGWTGMWGSIDGAGNSALFFGPSGISVDALGNLYVIDSGNSTLRKLT 633



 Score = 38.1 bits (87), Expect = 1.6,   Method: Composition-based stats.
 Identities = 24/71 (33%), Positives = 30/71 (42%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           +L V+DS  S    L+      +V       GV G  DG    A F  P    V   G +
Sbjct: 618 NLYVIDSGNSTLRKLTLSGGTWTVSTVAGMPGVNGGIDGSGAGAEFYYPAGVTVSAAGYV 677

Query: 118 YVADKSNHVIR 128
           YVAD  N+ IR
Sbjct: 678 YVADAGNNTIR 688



 Score = 36.6 bits (83), Expect = 5.2,   Method: Composition-based stats.
 Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAG  G  G +D    +A F +P    ++  GN+YVAD  N+ IR+I+
Sbjct: 254 VTTLAGAAGFVGSADASGTNALFHQPAGIGINSAGNLYVADYFNNTIRQIS 304


>gi|329908045|ref|ZP_08274805.1| hypothetical protein IMCC9480_3485 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327546778|gb|EGF31712.1| hypothetical protein IMCC9480_3485 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 685

 Score = 52.8 bits (125), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 32/87 (36%), Positives = 48/87 (55%), Gaps = 8/87 (9%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFD 104
           P S+ IDR G+  L V D++      ++     + VV  +AG  GV G +DG   +ARF 
Sbjct: 120 PQSIAIDRAGT--LYVADTNNQTIRKIT----PQGVVTTIAGRVGVDGSTDGRGNAARFS 173

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
            P+  AVD+ G + V+D  NH +R I+
Sbjct: 174 YPQGIAVDVAGTVLVSDTYNHTVRTIS 200



 Score = 50.4 bits (119), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 26/57 (45%), Positives = 35/57 (61%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S +  V  LAG     G +DG   +ARF+ P+S A+D  G +YVAD +N  IRKIT
Sbjct: 89  VSADGSVSTLAGAMSTPGAADGTGTAARFNYPQSIAIDRAGTLYVADTNNQTIRKIT 145



 Score = 50.4 bits (119), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 30/78 (38%), Positives = 43/78 (55%), Gaps = 6/78 (7%)

Query: 60  IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDM 113
           IV D+  + F   +F      ++   VV  +AG G   G +DG    ARF +P++ A+D 
Sbjct: 443 IVADALGNLFVADTFNSRIRKITPAGVVTTIAGAGSSSGSTDGPGNIARFFEPEAIAIDA 502

Query: 114 KGNIYVADKSNHVIRKIT 131
             N+YVAD  NH +RKIT
Sbjct: 503 SRNLYVADTGNHTVRKIT 520



 Score = 50.4 bits (119), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 37/116 (31%), Positives = 57/116 (49%), Gaps = 10/116 (8%)

Query: 25  GLLLEDGYTVTTVIDGHQLE---INPHSVIDRPGSSDLIVLDSSRSAFYTLSFP------ 75
           G+ +  G T    + G +LE   ++ +    R GS   +  D+S + +Y +  P      
Sbjct: 351 GVHVITGGTTLKTLAGKELERGMVDGNGAKARFGSLQGVTSDASGN-WYVVDTPHHMIRK 409

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV +LAG+G  G  DG    A F+ P+    D  GN++VAD  N  IRKIT
Sbjct: 410 ITPAGVVSQLAGNGKPGGVDGTGSGASFNYPRGIVADALGNLFVADTFNSRIRKIT 465



 Score = 47.0 bits (110), Expect = 0.004,   Method: Composition-based stats.
 Identities = 28/68 (41%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
           S+ A+ T+   ++   VV  LAG  ++ G  DG   +ARF  P+  A D  GN+YVAD  
Sbjct: 564 SQPAYGTIR-KITPGGVVTTLAGRVLETGALDGAGDAARFFNPQGLAADNAGNVYVADTG 622

Query: 124 NHVIRKIT 131
           N+ IR+IT
Sbjct: 623 NNTIRRIT 630



 Score = 45.1 bits (105), Expect = 0.013,   Method: Composition-based stats.
 Identities = 26/57 (45%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S   VV  LAG  G  G  DG   +ARF  P+    D   NIYVAD  N V+RK+T
Sbjct: 199 ISPGGVVGTLAGSAGNFGVLDGVRSAARFSNPQGLVTDAARNIYVADAGNGVLRKVT 255



 Score = 42.7 bits (99), Expect = 0.069,   Method: Composition-based stats.
 Identities = 32/93 (34%), Positives = 48/93 (51%), Gaps = 14/93 (15%)

Query: 52  DRPGS------SDLIVLDSSRSAF------YTLSFPLSEESVVKRLAGD-GVQGYSDGEP 98
           D PG+       + I +D+SR+ +      +T+   ++   VV  LAG  G  G  DG  
Sbjct: 484 DGPGNIARFFEPEAIAIDASRNLYVADTGNHTVR-KITVAGVVSTLAGSPGKYGSDDGTG 542

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            +ARF KP + AVD  GN+ ++  +   IRKIT
Sbjct: 543 AAARFLKPPAIAVDPSGNVVLSQPAYGTIRKIT 575



 Score = 40.4 bits (93), Expect = 0.34,   Method: Composition-based stats.
 Identities = 19/31 (61%), Positives = 22/31 (70%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ARF+ P   AVD  GN YVAD  NHVIRK++
Sbjct: 60  ARFNFPIGIAVDGVGNRYVADWQNHVIRKVS 90


>gi|290992224|ref|XP_002678734.1| predicted protein [Naegleria gruberi]
 gi|284092348|gb|EFC45990.1| predicted protein [Naegleria gruberi]
          Length = 1366

 Score = 52.4 bits (124), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 56/105 (53%), Gaps = 9/105 (8%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
           L   Y +TTV  G +   +P   I +  + DLIV D+   +   +S+      VV R+AG
Sbjct: 399 LPSSYYITTVARGLK---DPQK-ISKLSNGDLIVSDTGDHSIKKISY---STGVVTRIAG 451

Query: 88  DGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            GV G+S DG   + A+F+KP   AV +   IY+AD  NH IR I
Sbjct: 452 TGVAGFSGDGGLATLAQFNKPYGIAVTINDEIYIADSLNHRIRFI 496



 Score = 45.8 bits (107), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 37/131 (28%), Positives = 60/131 (45%), Gaps = 6/131 (4%)

Query: 2   ASHFVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIV 61
           +++ V  SLAI L+ +   ++  G  L       +   G   E      + R  + D++V
Sbjct: 9   SANIVGFSLAIVLIMMAMVVMVKGTSLNPYSVKISTFAGGNGEFKDPQKLARLSNGDILV 68

Query: 62  LDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYV 119
            D+   A       ++   V+  +AG GV G++  +G   +A+ +KP   AV     IY 
Sbjct: 69  TDALGHAIK----KINGTGVITTIAGTGVAGFAGDNGPAINAQVNKPYGIAVSSNDEIYF 124

Query: 120 ADKSNHVIRKI 130
           AD  NH IRKI
Sbjct: 125 ADSMNHRIRKI 135


>gi|456737758|gb|EMF62435.1| Hypothetical protein EPM1_0109 [Stenotrophomonas maltophilia EPM1]
          Length = 693

 Score = 52.4 bits (124), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 36/95 (37%), Positives = 46/95 (48%), Gaps = 7/95 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           +DG  L+   N  S I      +L V D+   A   +      +  V  LAG G QG++D
Sbjct: 114 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGHAD 168

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G    ARFD P   AVD +G +YVAD  N  IR I
Sbjct: 169 GPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 203



 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 1/49 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I
Sbjct: 103 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRI 150



 Score = 44.3 bits (103), Expect = 0.024,   Method: Composition-based stats.
 Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 5/102 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGD 88
           VTT+  G Q   +  +   R  +   I +D+    +   +F      +  +  V+ LAG 
Sbjct: 156 VTTLAGGEQGHADGPAAQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGG 215

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
              G +DG   +ARFD P + A D +G + VAD  N+ +R++
Sbjct: 216 DRPGLADGVGATARFDTPVALAFDAQGALLVADLFNNAVRRV 257


>gi|318059218|ref|ZP_07977941.1| hypothetical protein SSA3_14816 [Streptomyces sp. SA3_actG]
 gi|318079124|ref|ZP_07986456.1| hypothetical protein SSA3_21010 [Streptomyces sp. SA3_actF]
          Length = 610

 Score = 52.4 bits (124), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 4/84 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG +  +V D++R A   L    ++   V+R  G G +G  DG+   ARF++P
Sbjct: 183 PGKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGSGERGLRDGDAAGARFNEP 238

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
           +  A+   G + VAD  NH +R++
Sbjct: 239 QGLALLPDGAVVVADTVNHALRRV 262


>gi|302520584|ref|ZP_07272926.1| NHL repeat containing protein [Streptomyces sp. SPB78]
 gi|302429479|gb|EFL01295.1| NHL repeat containing protein [Streptomyces sp. SPB78]
          Length = 634

 Score = 52.4 bits (124), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 4/84 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG +  +V D++R A   L    ++   V+R  G G +G  DG+   ARF++P
Sbjct: 183 PGKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGSGERGLRDGDAAGARFNEP 238

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
           +  A+   G + VAD  NH +R++
Sbjct: 239 QGLALLPDGAVVVADTVNHALRRV 262


>gi|153806007|ref|ZP_01958675.1| hypothetical protein BACCAC_00252 [Bacteroides caccae ATCC 43185]
 gi|149130684|gb|EDM21890.1| IPT/TIG domain protein [Bacteroides caccae ATCC 43185]
          Length = 440

 Score = 52.4 bits (124), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 27/50 (54%), Positives = 31/50 (62%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           VV+ +AG    GY D   GSA F  P+  AVD  G IYV DK N+VIRKI
Sbjct: 388 VVRLVAGGTQAGYVDANAGSAAFSGPQDLAVDKNGVIYVYDKKNNVIRKI 437



 Score = 39.7 bits (91), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 43/92 (46%), Gaps = 15/92 (16%)

Query: 55  GSSDLIVLDSSRSAFYTLS-FPLSEESVVKRLAG--DGVQGYSDGEPGSARF--DKPK-- 107
           G  +L +  ++R    T+  F  S  +VV  LAG  +G  GY DG    A F  D  K  
Sbjct: 98  GDGELTITIANREPVRTIEKFTYSFSAVVTTLAGSANGEPGYQDGVGSEALFFFDAAKAE 157

Query: 108 --------SFAVDMKGNIYVADKSNHVIRKIT 131
                   S  VD  GN+YV D  N+ +RKIT
Sbjct: 158 PAEDWKKGSVCVDDDGNVYVGDCVNYCVRKIT 189



 Score = 36.6 bits (83), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 39/85 (45%), Gaps = 5/85 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA-----DKSNHVIRKITNLDHG 136
            + L G G  GY +G    A F  P   A+D  G+IYVA     D   ++ + I  LD  
Sbjct: 326 AENLIGKGTSGYLNGSFEDALFTFPSDLAIDSNGDIYVAGNGAWDGGENLDQSIRLLDMT 385

Query: 137 NQLIRQINLKPEDCSKSSQSGSGMF 161
           N+++R +    +     + +GS  F
Sbjct: 386 NRVVRLVAGGTQAGYVDANAGSAAF 410


>gi|375146530|ref|YP_005008971.1| NHL repeat containing protein [Niastella koreensis GR20-10]
 gi|361060576|gb|AEV99567.1| NHL repeat containing protein [Niastella koreensis GR20-10]
          Length = 434

 Score = 52.4 bits (124), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 36/56 (64%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  V  +AG G  GY++G    A+F+ P + A+D  GN+YVA+++   IRKIT
Sbjct: 174 IAPDGTVTTIAGTGENGYNEGPASGAKFNNPCATAMDANGNMYVAERNGRRIRKIT 229



 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 32/50 (64%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  +AG G  GY++G   S RF+ P   AVD  GN+Y+ D +N+ +RKIT
Sbjct: 383 VTTIAGAGDSGYAEGTGSSVRFNGPTGIAVDKNGNMYILDMANNRVRKIT 432



 Score = 43.1 bits (100), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 39/81 (48%), Gaps = 3/81 (3%)

Query: 53  RPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK---SF 109
           + G+  ++V     +   T  F      VV  LAG G  G++DG+  +A F+        
Sbjct: 94  KTGTGPVVVTIGKNTGASTTDFKYLFSYVVTTLAGSGNAGFADGKGANASFNFAGVRCQL 153

Query: 110 AVDMKGNIYVADKSNHVIRKI 130
           +VD  GN+YV D  N  IRKI
Sbjct: 154 SVDNIGNVYVPDGGNQRIRKI 174


>gi|375146499|ref|YP_005008940.1| cell surface receptor IPT/TIG domain-containing protein [Niastella
           koreensis GR20-10]
 gi|361060545|gb|AEV99536.1| cell surface receptor IPT/TIG domain protein [Niastella koreensis
           GR20-10]
          Length = 500

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 42/76 (55%), Gaps = 4/76 (5%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           G  ++ V D S +  Y +  P  + S++   AG    GY DG+P  ARF  P+   +D  
Sbjct: 426 GVGNIYVTDDSTNGIYRVD-PNGKLSLI---AGGVRSGYIDGKPQDARFSGPRGIVIDAS 481

Query: 115 GNIYVADKSNHVIRKI 130
           GN++VAD  N+ IRKI
Sbjct: 482 GNLFVADIGNNCIRKI 497



 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 4/75 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           + +LIV +   S    ++ PL    VV  +AG G  GY DG   +A+F  P+   +D  G
Sbjct: 232 AGNLIVSEIGNSKIRRIT-PLG---VVSTIAGSGTYGYLDGPGLTAQFHFPQGLTLDNAG 287

Query: 116 NIYVADKSNHVIRKI 130
           NIYVA+  N  IRKI
Sbjct: 288 NIYVAEYQNQSIRKI 302



 Score = 45.8 bits (107), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 3/75 (4%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           +S++ V D S +    +S    +  ++  L+G+G  G  DG+P  A+F +P   A+D   
Sbjct: 330 ASNVYVTDQSNNRICKISL---QTGLLSVLSGNGNWGMVDGDPQQAQFYQPAKMALDNNN 386

Query: 116 NIYVADKSNHVIRKI 130
           N+ +ADK N  +RK+
Sbjct: 387 NLIIADKINGRVRKV 401


>gi|423219560|ref|ZP_17206056.1| hypothetical protein HMPREF1061_02829 [Bacteroides caccae
           CL03T12C61]
 gi|392624765|gb|EIY18843.1| hypothetical protein HMPREF1061_02829 [Bacteroides caccae
           CL03T12C61]
          Length = 436

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/50 (54%), Positives = 31/50 (62%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           VV+ +AG    GY D   GSA F  P+  AVD  G IYV DK N+VIRKI
Sbjct: 384 VVRLVAGGTQAGYVDANAGSAAFSGPQDLAVDKNGVIYVYDKKNNVIRKI 433



 Score = 39.7 bits (91), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 43/92 (46%), Gaps = 15/92 (16%)

Query: 55  GSSDLIVLDSSRSAFYTL-SFPLSEESVVKRLAG--DGVQGYSDGEPGSARF--DKPK-- 107
           G  +L +  ++R    T+  F  S  +VV  LAG  +G  GY DG    A F  D  K  
Sbjct: 94  GDGELTITIANREPVRTIDKFTYSFSAVVTTLAGSANGEPGYQDGVGSEALFFFDAAKAE 153

Query: 108 --------SFAVDMKGNIYVADKSNHVIRKIT 131
                   S  VD  GN+YV D  N+ +RKIT
Sbjct: 154 PAEDWKKGSVCVDDDGNVYVGDCVNYCVRKIT 185



 Score = 36.6 bits (83), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 39/85 (45%), Gaps = 5/85 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA-----DKSNHVIRKITNLDHG 136
            + L G G  GY +G    A F  P   A+D  G+IYVA     D   ++ + I  LD  
Sbjct: 322 AENLIGKGTSGYLNGSFEDALFTFPSDLAIDSNGDIYVAGNGAWDGGENLDQSIRLLDMT 381

Query: 137 NQLIRQINLKPEDCSKSSQSGSGMF 161
           N+++R +    +     + +GS  F
Sbjct: 382 NRVVRLVAGGTQAGYVDANAGSAAF 406


>gi|351729452|ref|ZP_08947143.1| NHL repeat-containing protein [Acidovorax radicis N35]
          Length = 659

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 21/43 (48%), Positives = 32/43 (74%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            V  ++DG+  +ARF +PK+ AVD  GN+YVAD  NH++R+I+
Sbjct: 43  AVSNHADGDGPAARFYQPKAVAVDTAGNMYVADTENHLVRRIS 85



 Score = 42.0 bits (97), Expect = 0.12,   Method: Composition-based stats.
 Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 11/67 (16%)

Query: 76  LSEESVVKRLAGD--GVQGYSDGEPGSARF---------DKPKSFAVDMKGNIYVADKSN 124
           ++   +V  LAG+     GY+DG   +A F          +  + AVD +GN+Y+ D ++
Sbjct: 538 ITPSGMVSTLAGNPAAAPGYADGTGAAAVFTVDLGFRYPQQTAALAVDAQGNVYITDGAH 597

Query: 125 HVIRKIT 131
           HV+R+IT
Sbjct: 598 HVVRRIT 604



 Score = 39.3 bits (90), Expect = 0.76,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 28/56 (50%), Gaps = 2/56 (3%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           P    ++V  LA     G  DG+ GSARFD P + A    G +YV D  N  +R +
Sbjct: 321 PQGTSTLVAGLAS--APGADDGQGGSARFDAPGALAWSPSGTLYVRDDGNRSLRTV 374



 Score = 37.4 bits (85), Expect = 2.8,   Method: Composition-based stats.
 Identities = 21/47 (44%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
           +S    V  LAG  GV G  DG   +A F +P S AVD  G +YVA+
Sbjct: 84  ISPTGAVTTLAGKPGVCGNQDGTSDTATFCRPSSIAVDKAGTVYVAE 130



 Score = 37.4 bits (85), Expect = 2.9,   Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 1/50 (2%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  LAG  G  G++DG+  +A+F    + A D  GN+YV D   H + KI
Sbjct: 482 VTLLAGSLGTVGHADGQGAAAQFSALGNMATDAAGNVYVVDGLFHEVNKI 531


>gi|344924044|ref|ZP_08777505.1| NHL repeat containing protein [Candidatus Odyssella
           thessalonicensis L13]
          Length = 383

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/47 (51%), Positives = 30/47 (63%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LAG G  GY +G   +A+F  P S AVD  G +YV D +N+ IRKIT
Sbjct: 201 LAGSGATGYVEGTGSAAQFSTPTSVAVDSSGTVYVIDANNYRIRKIT 247



 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 49/88 (55%), Gaps = 8/88 (9%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           NP+++ ID  G+  L V D+     Y +   ++       LAG G  GY++G   SA+F+
Sbjct: 59  NPYAITIDSSGT--LYVADNGN---YRIR-KITSSGTTSLLAGSGTTGYAEGTGASAQFN 112

Query: 105 KPK-SFAVDMKGNIYVADKSNHVIRKIT 131
             +   A D  GN+YV+D +N+ +RKIT
Sbjct: 113 TFQWGIAADNSGNVYVSDTTNNRVRKIT 140



 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 27/41 (65%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +GY +G   +ARF+ P +  +D  G +YVAD  N+ IRKIT
Sbjct: 45  RGYQEGTGSAARFENPYAITIDSSGTLYVADNGNYRIRKIT 85



 Score = 40.4 bits (93), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 26/46 (56%), Gaps = 4/46 (8%)

Query: 90  VQGYSDGEPGSARFD----KPKSFAVDMKGNIYVADKSNHVIRKIT 131
            QGY++G   +ARF+     P    VD  G +YVAD  N+ IR IT
Sbjct: 259 TQGYAEGTGSAARFNFFNLIPSGITVDNAGTVYVADTFNYRIRTIT 304



 Score = 39.3 bits (90), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LAG    GY +G    AR   P+  AV+  G +YVA  ++  IR IT
Sbjct: 148 LAGS-TSGYQEGTGAGARLSSPRGLAVNSAGTVYVATATSERIRAIT 193


>gi|299473428|emb|CBN77825.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 2120

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 22/46 (47%), Positives = 29/46 (63%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            G G  GY+DG   +A F+ P+  AVD   N+YVAD  NH IR+I+
Sbjct: 205 GGSGEAGYADGSGTAALFNDPQDVAVDAHANVYVADTGNHRIRRIS 250



 Score = 44.7 bits (104), Expect = 0.018,   Method: Composition-based stats.
 Identities = 24/60 (40%), Positives = 34/60 (56%), Gaps = 4/60 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN----IYVADKSNHVIRKIT 131
           +S E VV  +AGDG +G  DG+   A F  P   A+    +    +YVAD +NH +RKI+
Sbjct: 249 ISPEGVVTTVAGDGEEGSDDGDAMEASFSFPGGIALYYDSSEGLVLYVADTNNHRLRKIS 308



 Score = 43.9 bits (102), Expect = 0.036,   Method: Composition-based stats.
 Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 2/49 (4%)

Query: 84  RLAGDGV--QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           R+A +     G++DG+   ARFD P   A    G ++VAD +NH+IR +
Sbjct: 334 RMAAEATPEAGFADGDGSYARFDGPSGLAAAEDGTLFVADTNNHLIRMV 382


>gi|182412380|ref|YP_001817446.1| immunoglobulin I-set domain-containing protein [Opitutus terrae
           PB90-1]
 gi|177839594|gb|ACB73846.1| Immunoglobulin I-set domain protein [Opitutus terrae PB90-1]
          Length = 1130

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 33/89 (37%), Positives = 49/89 (55%), Gaps = 6/89 (6%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFD 104
           NP +++  P S DL V DS+  A   ++       VV  LAG  ++ GY DG    A+F 
Sbjct: 126 NPSALVVGP-SGDLYVADSNGHAIRKVT----PAGVVTTLAGGPLRYGYMDGPGTEAQFS 180

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNL 133
            P+  AV+  G I+V+D+S H IR++  L
Sbjct: 181 YPRGIAVNATGVIFVSDRSAHTIRRVDQL 209



 Score = 50.1 bits (118), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 22/42 (52%), Positives = 27/42 (64%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G  G +DG    ARF  P+  A+D  GN+YVAD +NH IRKI
Sbjct: 220 GSAGSADGPGDQARFRDPEGLAIDAAGNVYVADINNHTIRKI 261



 Score = 44.7 bits (104), Expect = 0.018,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 6/77 (7%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMK 114
           S ++ V+D   S    +S     + +V   AG  G  G  DG   +ARF  P    +D  
Sbjct: 499 SGNIFVVDRGASTIRKIS-----QGIVTTFAGMPGETGQDDGAGAAARFRDPMGIVIDGA 553

Query: 115 GNIYVADKSNHVIRKIT 131
            N+YVAD +N  IRK+T
Sbjct: 554 DNLYVADTNNWKIRKVT 570



 Score = 44.3 bits (103), Expect = 0.022,   Method: Composition-based stats.
 Identities = 24/51 (47%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 82  VKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  +AG DG  G +DG   SARF      AVD  GNI+V D+    IRKI+
Sbjct: 466 VTTIAGADGEDGDTDGPAASARFRAVDGLAVDSSGNIFVVDRGASTIRKIS 516



 Score = 43.5 bits (101), Expect = 0.040,   Method: Composition-based stats.
 Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAG  G  G++DG   +ARF  P S A+D  G I+V D  N  IRKI+
Sbjct: 267 VTTLAGAAGESGFADGPAANARFFCPTSLAIDPAGAIWVNDAINRAIRKIS 317



 Score = 42.0 bits (97), Expect = 0.12,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  LAG   V G  DG   +A F  P +  V   G++YVAD + H IRK+T
Sbjct: 96  ITPAGVVTTLAGVANVHGCIDGVGSNALFHNPSALVVGPSGDLYVADSNGHAIRKVT 152



 Score = 38.9 bits (89), Expect = 1.0,   Method: Composition-based stats.
 Identities = 20/47 (42%), Positives = 26/47 (55%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LAG+  +G  DG    ARF+     A+   G +YVAD +N  IR IT
Sbjct: 51  LAGEWNKGSRDGAAAVARFNGANGVAIAPNGLVYVADLANSTIRAIT 97



 Score = 37.0 bits (84), Expect = 4.2,   Method: Composition-based stats.
 Identities = 31/101 (30%), Positives = 48/101 (47%), Gaps = 15/101 (14%)

Query: 33  TVTTVID--GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
           TVTTV D  G  + I+P+ V+    + D I    S S    ++ P   +S   R      
Sbjct: 321 TVTTVADTAGEGITIDPNGVL-YIAADDRIKRLESGSVLSVVAGP--TDSYTSR------ 371

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
               +G   +ARF +P   A+ + GN+YV D   + IR++T
Sbjct: 372 ----NGVGANARFVQPIGSALAVDGNLYVTDSGGYAIRRVT 408


>gi|357516309|ref|XP_003628443.1| hypothetical protein MTR_8g058300 [Medicago truncatula]
 gi|355522465|gb|AET02919.1| hypothetical protein MTR_8g058300 [Medicago truncatula]
          Length = 160

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 48/122 (39%), Gaps = 46/122 (37%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------------- 132
           G  G+ DG+   ARF  PK  A+D KGN+YVAD  N  IRKI +                
Sbjct: 24  GRTGHVDGKLSDARFHYPKGIALDDKGNVYVADTQNMAIRKIGDAGVTTIAGGKSNVAGY 83

Query: 133 ---------------------------LDHGNQLIRQINLKPEDCSKSSQSGS---GMFW 162
                                      +D GN  +RQI L  EDC+  S S S     FW
Sbjct: 84  RDGPGEDAKLSNDFDVVYIRPTCSLLVIDRGNAALRQIFLNQEDCNYQSSSISLTGSEFW 143

Query: 163 VT 164
           ++
Sbjct: 144 IS 145


>gi|301122083|ref|XP_002908768.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099530|gb|EEY57582.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1981

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 22/46 (47%), Positives = 31/46 (67%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AGDG  G+ DG   +++F+ P+  AVD  G +YVAD +NH IR I
Sbjct: 208 VAGDGSAGFQDGLAATSQFNHPRGVAVDSNGVVYVADTANHRIRII 253



 Score = 50.4 bits (119), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 21/39 (53%), Positives = 27/39 (69%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G +DG+P  ARFD P   AVD  G ++VAD  NH+IR+I
Sbjct: 344 GMADGDPLGARFDSPMGVAVDADGVVFVADTGNHLIRRI 382



 Score = 44.3 bits (103), Expect = 0.022,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 32/56 (57%), Gaps = 5/56 (8%)

Query: 82  VKRLAGDGVQGYSDGEP-GSARFDKPKSFAVDMKGN----IYVADKSNHVIRKITN 132
           V  LAGDGV+G+ DG    +ARF  P   AV    +    ++VAD  NH IR+I N
Sbjct: 260 VSTLAGDGVEGFVDGPALTAARFSYPSDVAVRESNSGTITVFVADTGNHRIRQIRN 315



 Score = 36.6 bits (83), Expect = 5.1,   Method: Composition-based stats.
 Identities = 17/50 (34%), Positives = 25/50 (50%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAG  V G+ DGE   A F+ P        G +Y A  ++  +R++T
Sbjct: 471 VVTLAGSNVPGHIDGEGNEATFNAPAGVTFAADGRVYAASSTDCSVRQVT 520


>gi|153868931|ref|ZP_01998653.1| receptor protein kinase [Beggiatoa sp. PS]
 gi|152074497|gb|EDN71345.1| receptor protein kinase [Beggiatoa sp. PS]
          Length = 3115

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/51 (47%), Positives = 36/51 (70%), Gaps = 2/51 (3%)

Query: 82   VKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +  +AG+G +GYS DG+P +A + + P    VD  GN+Y+ADK+NH IRK+
Sbjct: 2336 ITTVAGNGNKGYSGDGDPATAAQINTPTGLEVDSTGNLYIADKNNHRIRKV 2386



 Score = 49.7 bits (117), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 2/56 (3%)

Query: 77   SEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            S  S++  +AG+G  GY   +G    AR   P   AVD + N+Y+AD  NH IRK+
Sbjct: 2275 SANSIITTVAGNGRSGYQGDNGPATGARLSNPTGLAVDSQNNLYIADTDNHRIRKV 2330



 Score = 43.1 bits (100), Expect = 0.051,   Method: Composition-based stats.
 Identities = 20/48 (41%), Positives = 30/48 (62%), Gaps = 2/48 (4%)

Query: 85   LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +AG G +G +  DG    A+   P+  A+D +GN+Y+AD  NH IRK+
Sbjct: 2163 VAGTGAKGSTGDDGPAIEAKLKNPQGTAIDHEGNLYIADTLNHRIRKV 2210



 Score = 40.4 bits (93), Expect = 0.38,   Method: Composition-based stats.
 Identities = 23/58 (39%), Positives = 31/58 (53%), Gaps = 2/58 (3%)

Query: 79   ESVVKRLAGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
            E ++    G G  G  +DG   S A+  +P   A+D  GN+Y+ADK N  IRKI   D
Sbjct: 2389 EGIITTFTGTGKPGTATDGIIASVAQISQPTDVALDQYGNLYIADKGNDTIRKIGEKD 2446



 Score = 36.6 bits (83), Expect = 4.8,   Method: Composition-based stats.
 Identities = 17/53 (32%), Positives = 30/53 (56%), Gaps = 2/53 (3%)

Query: 81   VVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++  +AG G  G +  +G   +A+   P +   D  G++Y+AD  NH IRK++
Sbjct: 2215 IITTVAGIGKAGNTGDNGLATAAKLRNPTAIVFDNNGHLYIADSGNHRIRKVS 2267


>gi|86605024|ref|YP_473787.1| NHL repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86553566|gb|ABC98524.1| NHL repeat domain protein [Synechococcus sp. JA-3-3Ab]
          Length = 637

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 35/56 (62%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S +  V   AG  V GY DGE   A+F +P   A+ + G++YVAD++N  IR+IT
Sbjct: 408 ISRQRQVTTFAGTCVAGYRDGERDEAQFREPFGLALGLDGSLYVADRANRRIRRIT 463



 Score = 51.2 bits (121), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 24/58 (41%), Positives = 33/58 (56%)

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           F LS E  ++  AG G  G+ DG    A+FD P+    D KG ++VAD  NH +R I+
Sbjct: 352 FRLSPEGELEVWAGSGKAGHRDGAADQAQFDSPQGLLWDPKGGLWVADSGNHCLRHIS 409



 Score = 38.9 bits (89), Expect = 0.91,   Method: Composition-based stats.
 Identities = 18/50 (36%), Positives = 31/50 (62%), Gaps = 2/50 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V   AG G  G +DG    A+  +P + AVD +GN+++AD+  H +R+++
Sbjct: 468 VTTAAGTGQPGSADGPADQAQLLQPTALAVDREGNLWIADR--HRLRRLS 515



 Score = 38.5 bits (88), Expect = 1.3,   Method: Composition-based stats.
 Identities = 19/63 (30%), Positives = 32/63 (50%), Gaps = 8/63 (12%)

Query: 76  LSEESVVKRLAGDGV--------QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           +++   ++RL+ DG          GY DG    ARF      A D  G +++AD+ NH +
Sbjct: 505 IADRHRLRRLSADGQLTTLSRAEAGYRDGPLAEARFQTLAGLAFDSAGILWLADRDNHRL 564

Query: 128 RKI 130
           R++
Sbjct: 565 RRL 567


>gi|116624994|ref|YP_827150.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116228156|gb|ABJ86865.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 912

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 27/51 (52%), Positives = 36/51 (70%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  LAG GV GYS DG P  SA+ + P++ AVD  GN+YVAD +N+ +RKI
Sbjct: 511 ISTLAGTGVSGYSGDGGPATSAQLNGPQAVAVDGSGNVYVADTANNRVRKI 561



 Score = 41.6 bits (96), Expect = 0.15,   Method: Composition-based stats.
 Identities = 21/52 (40%), Positives = 33/52 (63%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+  +AG G  G++   G   SA+ + P S A+D  GN+Y+A+ SN+ IRK+
Sbjct: 454 VMSTVAGSGTSGFAGDGGAATSAQLNAPFSVALDAAGNLYIAEFSNNRIRKV 505



 Score = 41.6 bits (96), Expect = 0.17,   Method: Composition-based stats.
 Identities = 29/103 (28%), Positives = 50/103 (48%), Gaps = 4/103 (3%)

Query: 32  YTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS--FPLSEESVVKRLAGDG 89
           YT++TV  G        ++    G    I L      F + +  F +     +  +AG+ 
Sbjct: 11  YTISTVAGGAPPPTPVAALSTSIGQPRKIALSGGNMYFSSGNSVFKIDGSGTLTLVAGNS 70

Query: 90  VQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
             G+S DG P  +A+ + P+  A+D  GN+Y+AD  N+ +RK+
Sbjct: 71  RAGFSGDGGPAVNAQLNSPQGVALDSAGNLYIADSQNNRVRKV 113



 Score = 40.8 bits (94), Expect = 0.24,   Method: Composition-based stats.
 Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 2/47 (4%)

Query: 87  GDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GDG QG++   G P       P S  VD  GN+Y AD  N+ IRK++
Sbjct: 294 GDGTQGFAGDGGAPNKVEMSLPTSVQVDSSGNLYFADSLNNRIRKLS 340



 Score = 37.4 bits (85), Expect = 3.0,   Method: Composition-based stats.
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + + P+  AVD  GN+Y+AD  NH +RK++
Sbjct: 422 QLNNPQGLAVDSAGNLYIADTQNHRVRKVS 451



 Score = 36.2 bits (82), Expect = 6.2,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 36/69 (52%), Gaps = 10/69 (14%)

Query: 72  LSFPLSEESVVKRLAGDGVQGYSD----------GEPGSARFDKPKSFAVDMKGNIYVAD 121
           L F  S  + +++L+G  V  Y+           G   +A+ + P   AVD  GN+YV+D
Sbjct: 326 LYFADSLNNRIRKLSGGNVNTYAGNGIVARSGDGGAATNAQLNTPLGVAVDAAGNLYVSD 385

Query: 122 KSNHVIRKI 130
             N+++R++
Sbjct: 386 TLNNLVRRV 394


>gi|325104254|ref|YP_004273908.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
 gi|324973102|gb|ADY52086.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
          Length = 439

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 58/114 (50%), Gaps = 20/114 (17%)

Query: 29  EDGYTVTTV--------IDGHQLEI---NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPL 76
           E  Y V+T+        IDG+ L+    NP  + +D  G  D+I+ D +  +       +
Sbjct: 117 ESNYIVSTIAGSTTFGFIDGNGLDARFRNPDGILVDMNG--DIIITDRTNHSIR----KM 170

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +   VV  LAG GV GY++G+PG  +F+ P    +D  GNI V +K    IRKI
Sbjct: 171 TTAGVVSTLAGTGVSGYANGKPG--QFNTPWQSTMDAAGNIIVIEKDGGRIRKI 222



 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 33/52 (63%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ V  +AG+G  G  DG   +A F++P    +D KG+IYVAD  NH IRKI
Sbjct: 385 KATVSTIAGNGTAGRGDGLGHAATFNQPYDVVMDAKGDIYVADNVNHSIRKI 436



 Score = 50.1 bits (118), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 45/84 (53%), Gaps = 6/84 (7%)

Query: 48  HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK 107
            S +D  G+  +I  D  R         ++ +  V  +AG G  G++DG    ARF+   
Sbjct: 200 QSTMDAAGNIIVIEKDGGRIR------KIAPDGAVSLIAGTGSLGFTDGNVSVARFNHAL 253

Query: 108 SFAVDMKGNIYVADKSNHVIRKIT 131
              VD +GNI+VAD++N+ IRKIT
Sbjct: 254 DGVVDSEGNIFVADRNNYRIRKIT 277



 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 46/91 (50%), Gaps = 13/91 (14%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFP-----------LSEES--VVKRLAGDGVQGYSDGEPGS 100
           P   ++IV D++ +   T++             + +ES  +V  +AG    G+ DG    
Sbjct: 81  PNRLEVIVPDNATTGNVTIAVKGQQIYGPRFTVIKKESNYIVSTIAGSTTFGFIDGNGLD 140

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ARF  P    VDM G+I + D++NH IRK+T
Sbjct: 141 ARFRNPDGILVDMNGDIIITDRTNHSIRKMT 171



 Score = 35.8 bits (81), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 32/59 (54%), Gaps = 4/59 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +++  V+  + G   +GY DG PG   + +         D +GNI +AD SN+ IRKIT
Sbjct: 319 ITKSGVMTTIVGS-TKGYDDGTPGNPLTVKLGDIFGLNFDNEGNILLADASNNRIRKIT 376


>gi|387219837|gb|AFJ69627.1| nhl repeat containing 2 [Nannochloropsis gaditana CCMP526]
          Length = 295

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKI 130
           V RLAG G +GY DG    A FD+P+   VD K G  +V D +NH IR +
Sbjct: 229 VSRLAGSGKRGYRDGAGAEAEFDEPRGLCVDGKGGQAFVCDSNNHCIRTV 278


>gi|441163236|ref|ZP_20968263.1| hypothetical protein SRIM_29923 [Streptomyces rimosus subsp.
           rimosus ATCC 10970]
 gi|440616375|gb|ELQ79517.1| hypothetical protein SRIM_29923 [Streptomyces rimosus subsp.
           rimosus ATCC 10970]
          Length = 238

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 37/57 (64%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  + ++  +AG+G  GY SDG P +A + + P S AVD  GN+Y+AD  NH +RK+
Sbjct: 127 VDTKGIITTVAGNGTGGYVSDGGPATATQLNGPHSVAVDRDGNVYIADYHNHRVRKV 183



 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ + ++  +AG+G+ GY SDG P +A +   P   A D  GN+Y+AD  N+ +RK+
Sbjct: 71  VTPQGIITTVAGNGIAGYVSDGGPATATQLSSPCGVAADGAGNLYIADLGNNRVRKV 127



 Score = 41.2 bits (95), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 34/55 (61%), Gaps = 2/55 (3%)

Query: 79  ESVVKRLAGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +  +AG+G  GY SDG P +  + + P   A+D  GN+Y+  +SN+ +RK+T
Sbjct: 18  QGCIATVAGNGTAGYLSDGGPATLTQLNWPHDVALDEHGNLYIVCRSNNRVRKVT 72


>gi|347754243|ref|YP_004861807.1| gluconolactonase [Candidatus Chloracidobacterium thermophilum B]
 gi|347586761|gb|AEP11291.1| Gluconolactonase [Candidatus Chloracidobacterium thermophilum B]
          Length = 1222

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 35/56 (62%), Gaps = 2/56 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           L+ +  V  LAGDGV+G  DG    ARF  P++ AVD  G +YVAD +   +R+I+
Sbjct: 333 LTLDGAVTTLAGDGVEGTRDGVGKQARFKAPRAVAVDASGTVYVADDAR--VRRIS 386



 Score = 50.4 bits (119), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 24/52 (46%), Positives = 31/52 (59%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           E VV   AGD   G+ DG   SAR + P   A+D +G +Y+AD  NH IRK+
Sbjct: 616 ELVVTVTAGDSTPGFLDGVGTSARLNHPVGLALDAQGVLYIADHFNHAIRKL 667



 Score = 45.4 bits (106), Expect = 0.011,   Method: Composition-based stats.
 Identities = 29/75 (38%), Positives = 39/75 (52%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQIN 144
           LAG G +G+ DG    A+F+ P   AV   G +YVAD  N  IRK+T   + + L     
Sbjct: 676 LAGGGQRGFQDGYGPEAQFNGPLGLAVGRDGELYVADHLNMRIRKVTPDGYVSTLAGTGI 735

Query: 145 LKPEDCSKSSQSGSG 159
            K ED S ++ S  G
Sbjct: 736 SKIEDGSVATASFEG 750



 Score = 43.9 bits (102), Expect = 0.036,   Method: Composition-based stats.
 Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           SV    A +G  GY+DG    A+F +P   A D +GN+YVAD   H IR+I+
Sbjct: 232 SVSTLTASEG--GYADGPLTVAKFLRPNGIACDPQGNLYVADFGGHRIRQIS 281



 Score = 42.7 bits (99), Expect = 0.064,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  V  LAG G+    DG   +A F+ PK  AVDM G +YV D     +R IT
Sbjct: 721 VTPDGYVSTLAGTGISKIEDGSVATASFEGPKGVAVDMHGVVYVTDGV--TVRTIT 774



 Score = 40.4 bits (93), Expect = 0.33,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 45/80 (56%), Gaps = 14/80 (17%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPG-------SARFDKPKSFAV 111
           + +D+S + +      +++++ V+R++  G V   + GEPG       +ARFD     A+
Sbjct: 366 VAVDASGTVY------VADDARVRRISPGGMVVTIAGGEPGCVDGPAEAARFDTLSGLAL 419

Query: 112 DMKGNIYVADKSNHVIRKIT 131
           D  GN+Y+AD  N  +RK++
Sbjct: 420 DRVGNLYLADAGNRRLRKLS 439



 Score = 36.2 bits (82), Expect = 6.0,   Method: Composition-based stats.
 Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +  V+ LAG  V+G+ DG    A F    + AVD+ G  +V D +NH +R I
Sbjct: 773 ITPDGEVRTLAGQ-VRGFRDGIGTRAMFGWAYAIAVDVSGLCFVTDAANHAVRCI 826


>gi|408828763|ref|ZP_11213653.1| hypothetical protein SsomD4_16351 [Streptomyces somaliensis DSM
           40738]
          Length = 614

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 18/108 (16%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S +L+V D++R     ++ P   E+VV+R  GDG +G  DG   SARF +P+       G
Sbjct: 197 SGNLLVSDTTRHRLVEVA-P-DGETVVRRY-GDGRRGLVDGPAESARFSEPQGMCALPDG 253

Query: 116 NIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWV 163
            I VAD  NH +R                L PE  + S+ +G+G  W+
Sbjct: 254 RIVVADTVNHALRA---------------LDPETGAVSTLAGTGAQWM 286


>gi|386841352|ref|YP_006246410.1| hypothetical protein SHJG_5269 [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|374101653|gb|AEY90537.1| hypothetical protein SHJG_5269 [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|451794647|gb|AGF64696.1| hypothetical protein SHJGH_5033 [Streptomyces hygroscopicus subsp.
           jinggangensis TL01]
          Length = 611

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 3/75 (4%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S   +V D++R     L+     E+VV+R+ G G +G++DG P  A F +P+  A+   G
Sbjct: 189 SGTFLVSDTTRHQLVELA--EDAETVVRRI-GSGTRGFADGGPREASFSEPQGLALLDDG 245

Query: 116 NIYVADKSNHVIRKI 130
           ++ VAD  NH +R++
Sbjct: 246 SVVVADTVNHALRRL 260


>gi|393778566|ref|ZP_10366837.1| Ig family protein [Ralstonia sp. PBA]
 gi|392714468|gb|EIZ02071.1| Ig family protein [Ralstonia sp. PBA]
          Length = 342

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 34/56 (60%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   +V   AG G  G +DG   +A F+ P   A+D  GN+YVAD  N+++R+IT
Sbjct: 227 ITPGGLVTTFAGSGTTGSADGPGNTASFNSPGGVALDGDGNVYVADTGNNLLRRIT 282



 Score = 46.6 bits (109), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++   VV  LAG  GV G  +G   +ARF +P    VD  G +YVAD  N++IRK+
Sbjct: 281 ITRAGVVSTLAGQAGVTGAQNGIGSAARFKQPYGVVVDADGTVYVADTFNNLIRKV 336



 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 43/87 (49%), Gaps = 8/87 (9%)

Query: 47  PHSVI-DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFD 104
           P+ +I D+ G  +L V D        ++ P  E +    +AG G+  G  DG   +A FD
Sbjct: 148 PYGIIFDKNG--NLFVSDVQTGILRKIT-PAGETT---NIAGGGISSGPVDGNGAAAYFD 201

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
                A+D  GNIY  + S + IRKIT
Sbjct: 202 GLGGIALDAAGNIYATELSGNRIRKIT 228


>gi|223937279|ref|ZP_03629185.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223894064|gb|EEF60519.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 429

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 29/55 (52%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   +   AG    G  DG   SARF  P S A D  GNIYVAD  N  IRKIT
Sbjct: 25  AQSLTINTFAGHDAPGSKDGLSSSARFRHPNSIAADSAGNIYVADTENSTIRKIT 79



 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/57 (49%), Positives = 34/57 (59%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  LAG      S DG   +A F +P+S AVD  GN+YVAD  NH IRKIT
Sbjct: 133 ITPAGVVSTLAGSAGNINSFDGTGINANFYQPRSLAVDNGGNVYVADTWNHTIRKIT 189



 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   +V  LAG  G  G +DG    ARF++P   AVD   N++V D  NH IRKIT
Sbjct: 188 ITPAGLVSTLAGLAGNPGSADGTNSKARFNRPSGIAVDNATNLFVTDFHNHTIRKIT 244



 Score = 45.8 bits (107), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++    V  +AG  GV G +DG    ARF +P+    D  GN++VAD  N  IRKI+
Sbjct: 243 ITPGGTVTTIAGLPGVWGNADGTNNVARFFQPQGIVADNAGNLFVADSGNQTIRKIS 299



 Score = 43.1 bits (100), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKS 108
           V D  G  +L V DS       +S P     +V  +AG  G+ G ++G   +ARF  P  
Sbjct: 278 VADNAG--NLFVADSGNQTIRKIS-PSGTNWIVSTVAGLSGIAGNANGTNNTARFYFPAD 334

Query: 109 FAVDMKGNIYVADKSNHVIR 128
            A D+ G IYVAD  N+ IR
Sbjct: 335 VAQDIAGYIYVADLGNNAIR 354



 Score = 42.4 bits (98), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 56/139 (40%), Gaps = 31/139 (22%)

Query: 15  LSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSD------------LIVL 62
           LS     V  GLL         V     L IN  +  D PGS D             I  
Sbjct: 5   LSFKNLFVGGGLLAS-----MAVAHAQSLTINTFAGHDAPGSKDGLSSSARFRHPNSIAA 59

Query: 63  DSSRSAFYTLSFPLSEESVVKRLAGDG----------VQGYSDGEPGSARFDKPKSFAVD 112
           DS+ + +   +    E S ++++  +G            G +DG   +A F  P+  AVD
Sbjct: 60  DSAGNIYVADT----ENSTIRKITPNGSVSTFAGFAGTFGSADGVGTNALFYAPQGIAVD 115

Query: 113 MKGNIYVADKSNHVIRKIT 131
             G IYVAD +N  IRKIT
Sbjct: 116 SAGFIYVADTANATIRKIT 134


>gi|223993873|ref|XP_002286620.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977935|gb|EED96261.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1937

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 20/42 (47%), Positives = 29/42 (69%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           GY+DGE   ARFD P+   V   G ++VAD +NH++R+I+ L
Sbjct: 392 GYADGEKDEARFDSPQGITVASDGRVFVADTNNHLVREISRL 433



 Score = 49.3 bits (116), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 25/61 (40%), Positives = 35/61 (57%), Gaps = 2/61 (3%)

Query: 73  SFPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ L+ +  V  +AG   G +G+ DG    ARF  P+  AVD  G +YVAD  NH IR I
Sbjct: 218 NYTLNPQGFVFTMAGSTSGEEGFVDGVESDARFRHPEGVAVDHDGYVYVADTGNHAIRMI 277

Query: 131 T 131
           +
Sbjct: 278 S 278


>gi|444306537|ref|ZP_21142300.1| NHL repeat containing protein [Arthrobacter sp. SJCon]
 gi|443481173|gb|ELT44105.1| NHL repeat containing protein [Arthrobacter sp. SJCon]
          Length = 673

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 4/111 (3%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDL-IVLDSSRSAFYTLSF-PLSEESVVKRLA 86
           ED    T  + GH LE++  S  D   S  L  V+ +        SF P+S E  V  +A
Sbjct: 324 EDAAGFTGRLSGHPLEVSLSSPWDLVWSRKLNAVVIAMAGTHQIFSFDPVSGE--VDIIA 381

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           G+G++G  DG    A F +P   A D  GNI+VAD     +RK+   D G+
Sbjct: 382 GNGLEGLLDGPAHEAWFAQPSGLAEDGDGNIWVADSETSALRKLVIDDAGS 432


>gi|333025645|ref|ZP_08453709.1| hypothetical protein STTU_3149 [Streptomyces sp. Tu6071]
 gi|332745497|gb|EGJ75938.1| hypothetical protein STTU_3149 [Streptomyces sp. Tu6071]
          Length = 610

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 4/84 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG +  +V D++R A   L    ++   V+R  G G +G  DG+   ARF++P
Sbjct: 183 PAKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGSGERGLLDGDATGARFNEP 238

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
           +  A+   G + VAD  NH +R++
Sbjct: 239 QGLALLPDGAVVVADTVNHALRRV 262


>gi|190572588|ref|YP_001970433.1| NHL repeat-containing protein [Stenotrophomonas maltophilia K279a]
 gi|190010510|emb|CAQ44119.1| putative NHL repeat protein [Stenotrophomonas maltophilia K279a]
          Length = 1267

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 36/95 (37%), Positives = 46/95 (48%), Gaps = 7/95 (7%)

Query: 38  IDGHQLE--INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           +DG  L+   N  S I      +L V D+   A   +      +  V  LAG G QG++D
Sbjct: 688 VDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRIG----TDGQVTTLAG-GEQGHAD 742

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G    ARFD P   AVD +G +YVAD  N  IR I
Sbjct: 743 GPAVQARFDAPMGIAVDAQGQVYVADTFNDRIRVI 777



 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 1/49 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+ +AG G +G  DG    A F+ P   A D +GN+YVAD  NH IR+I
Sbjct: 677 VETVAGQG-EGRVDGPALQASFNTPSGIAADAQGNLYVADTGNHAIRRI 724



 Score = 47.4 bits (111), Expect = 0.003,   Method: Composition-based stats.
 Identities = 30/102 (29%), Positives = 51/102 (50%), Gaps = 5/102 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFP-----LSEESVVKRLAGD 88
           VTT+  G Q   +  +V  R  +   I +D+    +   +F      +  + +V+ LAG 
Sbjct: 730 VTTLAGGEQGHADGPAVQARFDAPMGIAVDAQGQVYVADTFNDRIRVIGTDGMVRTLAGG 789

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
              G +DG   +ARFD P + A D +G + VAD  N+ +R++
Sbjct: 790 DRPGLADGVGAAARFDTPVALAFDAQGALLVADLFNNAVRRV 831


>gi|111219921|ref|YP_710715.1| ascorbate-dependent monooxygenase [Frankia alni ACN14a]
 gi|111147453|emb|CAJ59103.1| Putative ascorbate-dependent monooxygenase [Frankia alni ACN14a]
          Length = 865

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 39/57 (68%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++   V+  +AG GV G+S DG P  +A+ D+P   A+D +G++Y+AD+ NH IR++
Sbjct: 558 INAAGVITTIAGTGVAGFSGDGGPAVAAQLDQPAGIALDNRGDLYIADRLNHRIRRV 614



 Score = 42.0 bits (97), Expect = 0.14,   Method: Composition-based stats.
 Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+  +AG+ ++G++ DG P   A    P+  AVD  GN+Y+ D+ N  IRK+
Sbjct: 742 VITTVAGNRLRGFAGDGGPAVKASLQDPRGIAVDAVGNLYITDRGNSRIRKV 793



 Score = 37.0 bits (84), Expect = 3.9,   Method: Composition-based stats.
 Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQG-YSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++   AG   +G + DG P + A  D P   A D  G++Y+AD  N  IRK+
Sbjct: 686 IISTFAGSSGRGSFGDGGPATDALLDVPFGVAADAAGDVYIADTDNSRIRKV 737


>gi|290990564|ref|XP_002677906.1| NHL domain-containing protein kinase [Naegleria gruberi]
 gi|284091516|gb|EFC45162.1| NHL domain-containing protein kinase [Naegleria gruberi]
          Length = 2733

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/53 (47%), Positives = 34/53 (64%), Gaps = 4/53 (7%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AG G  GYS G+ G   SA+F +P+  AV   G IY+AD  NHV+RKI+
Sbjct: 548 IATVAGSGASGYS-GDGGLLTSAKFQQPQGVAVSSNGEIYIADTENHVVRKIS 599



 Score = 45.1 bits (105), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 9/76 (11%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMK 114
           D+ + DS  +    +S     +  +  +AG+G+ GYS G+ G   SA+   P+S A+++ 
Sbjct: 272 DVYIADSGNNRIRKIS-----KGYISTVAGNGIAGYS-GDGGLSTSAQLATPQSVAINLN 325

Query: 115 GNIYVADKSNHVIRKI 130
           G I +AD +N+ IRKI
Sbjct: 326 GEIIIADSNNNRIRKI 341



 Score = 43.5 bits (101), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 40/75 (53%), Gaps = 5/75 (6%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKG 115
           D+ + D+  +    +SF      ++  +AG G+ G + DG    SA+ + P   AV   G
Sbjct: 471 DVYIADTYNNRIRKVSF---STGIISTVAGTGIAGQNGDGNLATSAQLNFPSGVAVSSNG 527

Query: 116 NIYVADKSNHVIRKI 130
           +IY+AD  NH IRKI
Sbjct: 528 DIYIADTENHRIRKI 542



 Score = 42.7 bits (99), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 4/53 (7%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AG G  GYS G+ G   +A+   P   AV   G +++AD+ NH IRK+T
Sbjct: 821 ISTIAGTGAMGYS-GDGGLAITAKLSSPVGVAVSSTGEVFIADRDNHRIRKVT 872



 Score = 41.6 bits (96), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 4/57 (7%)

Query: 81  VVKRLAGDGVQ----GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           V++R+    +     G  DG P +  + + +SF +   G IY+AD  NH IRK++ L
Sbjct: 762 VIRRVISGNISTIAGGVGDGGPATGGYIQAQSFDISSTGEIYIADTENHRIRKVSTL 818



 Score = 41.6 bits (96), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 34/54 (62%), Gaps = 3/54 (5%)

Query: 81  VVKRLAGDGVQGY-SDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++  +AG+G  G+ SDG E   A+ + P S  +   G IY+AD +NH IR+I+N
Sbjct: 876 IITTIAGNGTSGFNSDGIEAKFAQLNSPSSVTIS-GGEIYIADTNNHRIRQISN 928



 Score = 39.7 bits (91), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 4/58 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S  +V+  +AG G  GY +G+ G   +A+   P   A+   G +++AD +NH IRKI
Sbjct: 598 ISTSNVLSTIAGTGSYGY-NGDGGLAITAKLFSPIGVAISSSGEVFIADNNNHRIRKI 654



 Score = 39.3 bits (90), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 33/59 (55%), Gaps = 4/59 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   ++  +AG G  GY DG+     +A+   P    V+  G++Y+AD  N+ IRKI+
Sbjct: 230 IATNGIITTIAGTGTHGY-DGDGALAINAQLYSPTGVVVNSNGDVYIADSGNNRIRKIS 287



 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 16/57 (28%), Positives = 34/57 (59%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++++  +  + G G+ GY  DG+   +A+   P+   +   G +++AD++NH IRK+
Sbjct: 654 IAKDGYISTIVGTGLNGYDGDGDLATNAKLGNPQGVTLSPTGELFIADQNNHRIRKV 710



 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 4/58 (6%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S   ++K +AG+G  GY  DG  P +A+ + P   A+     IY+A+  N  IRK+T
Sbjct: 926 ISNTGIIKTIAGNGFGGYIGDGVLPPNAQLNSPSGVAISSTAEIYIAE--NKRIRKLT 981


>gi|29347802|ref|NP_811305.1| NHL repeat-containing protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383120108|ref|ZP_09940841.1| hypothetical protein BSIG_4606 [Bacteroides sp. 1_1_6]
 gi|29339703|gb|AAO77499.1| NHL repeat-containing protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251838367|gb|EES66454.1| hypothetical protein BSIG_4606 [Bacteroides sp. 1_1_6]
          Length = 440

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
            ID+ G+  L + D   +    +   L+   VV  +AG G  GY DG+P  A+F++P   
Sbjct: 362 AIDKEGN--LYIADVGNNRIRKVDTKLN---VVTTIAGSGAAGYKDGDPLEAQFNQPWGV 416

Query: 110 AVDMKGNIYVADKSNHVIRKIT 131
            +D    +Y+AD++NH IRK+ 
Sbjct: 417 YLDKNEFLYIADQNNHCIRKLA 438



 Score = 46.2 bits (108), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 8/73 (10%)

Query: 65  SRSAFYTLS-------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           S+SA Y          +P  +  + + + G G  G+ DG+   A+F +P+  A+D +GN+
Sbjct: 311 SKSAVYRFKLTRDADGWPALDGDIDEYIPGAGA-GFRDGDVQEAQFKEPRGIAIDKEGNL 369

Query: 118 YVADKSNHVIRKI 130
           Y+AD  N+ IRK+
Sbjct: 370 YIADVGNNRIRKV 382


>gi|223934991|ref|ZP_03626910.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223896444|gb|EEF62886.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 1064

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 28/58 (48%), Positives = 34/58 (58%), Gaps = 1/58 (1%)

Query: 75  PLSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           P     +V  +AG  G  G  DG   SARF +P+S +VD  G +YVAD  NH IRKIT
Sbjct: 593 PSGTNWIVNTVAGLAGFWGSIDGTNTSARFFQPRSLSVDASGALYVADSGNHAIRKIT 650



 Score = 47.8 bits (112), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/47 (44%), Positives = 32/47 (68%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LAG   +G ++G   S +F  P++ A+D+  N++VAD  NHVIRKI+
Sbjct: 52  LAGYAGRGSANGISSSVQFKAPQAIAIDISNNVFVADTENHVIRKIS 98



 Score = 45.1 bits (105), Expect = 0.016,   Method: Composition-based stats.
 Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S   ++  LAG  G  G  DG   +A F +P   AVD  GN+ VAD  N+ +RKIT
Sbjct: 97  ISCTGIITTLAGSLGTHGSRDGSGTNALFFRPAGIAVDASGNVLVADTGNNTVRKIT 153



 Score = 44.7 bits (104), Expect = 0.020,   Method: Composition-based stats.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 5/86 (5%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   I    + +L + D+  +   T++        V   AG    G +DG    ARF  
Sbjct: 347 NPQG-ITSDATGNLYIADTGNNTIRTIT----PGGSVTTFAGLPSIGSADGLSSDARFRF 401

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
           P++ AVD   N+YVAD +N  IRKI+
Sbjct: 402 PQAVAVDAATNVYVADTANQTIRKIS 427



 Score = 44.7 bits (104), Expect = 0.021,   Method: Composition-based stats.
 Identities = 21/43 (48%), Positives = 29/43 (67%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GV G ++G    A+F  P+  AVD  GN++VAD  N++IRKIT
Sbjct: 495 GVSGTANGTNTDAQFYAPQGLAVDGTGNVFVADTFNNLIRKIT 537



 Score = 44.3 bits (103), Expect = 0.027,   Method: Composition-based stats.
 Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 5/77 (6%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDM 113
           G+ ++ V D+    F  L   ++    V  LAG+    G SDG   +ARF  P   AVD 
Sbjct: 519 GTGNVFVADT----FNNLIRKITPGGAVTTLAGNFENFGSSDGTNSNARFYWPSGVAVDN 574

Query: 114 KGNIYVADKSNHVIRKI 130
            GN++VAD  NH IR++
Sbjct: 575 AGNVFVADYMNHTIREL 591



 Score = 43.5 bits (101), Expect = 0.048,   Method: Composition-based stats.
 Identities = 27/78 (34%), Positives = 40/78 (51%), Gaps = 5/78 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
           S +L V+D+       ++       VV   AG  G  G ++G   +A F  P+   +D+ 
Sbjct: 246 SGNLYVVDTGNGTIRKIT----SSGVVTTFAGSAGNYGATNGIGANALFYAPQGITIDLF 301

Query: 115 GNIYVADKSNHVIRKITN 132
           G +YVAD  NH IRKIT+
Sbjct: 302 GCVYVADTGNHTIRKITS 319



 Score = 42.4 bits (98), Expect = 0.092,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 33/63 (52%), Gaps = 13/63 (20%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS-------ARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +S   +V  LAG      S G PGS       A F  P+   VD  GNIYVAD  NH+IR
Sbjct: 426 ISPSGLVCTLAG------SIGHPGSVNNIGTNALFSGPQGITVDGVGNIYVADTLNHIIR 479

Query: 129 KIT 131
           +IT
Sbjct: 480 RIT 482



 Score = 41.6 bits (96), Expect = 0.16,   Method: Composition-based stats.
 Identities = 23/52 (44%), Positives = 28/52 (53%), Gaps = 1/52 (1%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           V  +AG  GV G  D    +A F  P+   VD  GN+YV D  N  IRKIT+
Sbjct: 213 VNTMAGSAGVYGNLDNSGANALFSGPQGLTVDSSGNLYVVDTGNGTIRKITS 264



 Score = 39.3 bits (90), Expect = 0.73,   Method: Composition-based stats.
 Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  V  LAG  G  G +D    SA F  P+    D  GN+Y+AD  N+ IR IT
Sbjct: 317 ITSDGTVTTLAGLAGNYGSADSVNSSASFWNPQGITSDATGNLYIADTGNNTIRTIT 373



 Score = 36.6 bits (83), Expect = 4.9,   Method: Composition-based stats.
 Identities = 22/51 (43%), Positives = 28/51 (54%), Gaps = 1/51 (1%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V   AG  G  G +D    +A F +P   A+D   NI+VAD  N+ IRKIT
Sbjct: 158 VTTFAGSAGNYGSTDNLGTNALFYRPTGIAIDNFNNIFVADTGNNTIRKIT 208


>gi|223937730|ref|ZP_03629631.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223893523|gb|EEF59983.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 1474

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 34/95 (35%), Positives = 54/95 (56%), Gaps = 9/95 (9%)

Query: 43   LEINPHSVIDRPGSSDLIVL----DSSRSAFYTLSFPL--SEESVVKRLAGDGVQGYSDG 96
            L  NP  +    G+S L+V     + + SA  T ++ L   E +VV+  AG GV G+++G
Sbjct: 1002 LYTNPFPI---SGNSTLVVKAWLSNYAPSALATATYSLLDYEPTVVQTYAGTGVPGFTNG 1058

Query: 97   EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
                A+F+ P+   VD  G ++V+D  N+VIRKI+
Sbjct: 1059 PNLKAQFNAPQGICVDGLGTLFVSDTGNNVIRKIS 1093



 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 31/56 (55%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            +S   VV   AG GV G  DG   +A F  P   A+D   N+YVAD  N +IRK+T
Sbjct: 1092 ISTNGVVTTFAGSGVAGTHDGVGTNASFLAPTGIALDSSNNLYVADSGNSLIRKVT 1147



 Score = 42.0 bits (97), Expect = 0.12,   Method: Composition-based stats.
 Identities = 23/50 (46%), Positives = 29/50 (58%)

Query: 82   VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            V  LAG G  G  +G    A F++  +  VD  GNIY AD +NH IRKI+
Sbjct: 1299 VTTLAGAGDLGLRNGSGLFAAFNQLGAVTVDSAGNIYAADAANHSIRKIS 1348


>gi|29349195|ref|NP_812698.1| hypothetical protein BT_3787 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29341103|gb|AAO78892.1| conserved hypothetical protein with NHL repeat [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 452

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           P  E S+   L G    G++DG P  ARF+KP+   V +  ++YVAD+ NH+IR++T
Sbjct: 396 PTGEVSLYAGLPGQA--GFTDGLPEKARFNKPECVTVYLDNSLYVADRDNHLIRRVT 450



 Score = 43.5 bits (101), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 25/40 (62%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GY +G   +ARFD P+  A D  GN++V +   H IRKIT
Sbjct: 356 GYVNGSGVTARFDNPRQPAFDQDGNMFVPEYGRHTIRKIT 395


>gi|325186506|emb|CCA21046.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1881

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 27/65 (41%), Positives = 36/65 (55%), Gaps = 6/65 (9%)

Query: 71  TLSFPLSE-----ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH 125
           T SFP S      +  V+  AG G +G++DGE  ++ FD P   A+D  G  YV D  NH
Sbjct: 214 TWSFPKSNSKTKPQGAVETFAG-GSRGHNDGEAVASLFDNPNDLAIDSTGVTYVVDTGNH 272

Query: 126 VIRKI 130
            IR+I
Sbjct: 273 CIRRI 277



 Score = 46.6 bits (109), Expect = 0.005,   Method: Composition-based stats.
 Identities = 19/39 (48%), Positives = 26/39 (66%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G+SDG    +RFD P   A+D+ GN+ VAD  N++IR I
Sbjct: 353 GFSDGNASESRFDTPLGLAIDVDGNLIVADSGNNLIRLI 391



 Score = 40.0 bits (92), Expect = 0.52,   Method: Composition-based stats.
 Identities = 32/107 (29%), Positives = 51/107 (47%), Gaps = 16/107 (14%)

Query: 30  DGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD 88
           DG  V ++ D      NP+ + ID  G +   V+D+       +     + S V   AG+
Sbjct: 242 DGEAVASLFD------NPNDLAIDSTGVT--YVVDTGNHCIRRID----QRSRVTTFAGN 289

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGN---IYVADKSNHVIRKITN 132
             +G+ DG    A+++ P   AV  + N   +YVAD  NH IR+I +
Sbjct: 290 RTRGFRDGPLDIAQYNFPSGIAVVNEQNRVTVYVADTGNHRIRRIRD 336


>gi|312200909|ref|YP_004020970.1| serine/threonine protein kinase [Frankia sp. EuI1c]
 gi|311232245|gb|ADP85100.1| serine/threonine protein kinase [Frankia sp. EuI1c]
          Length = 847

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 27/58 (46%), Positives = 37/58 (63%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S++  V  +AG+G  G+S DG P  SA  + P +  VD  GNIYV D +N+ IRKIT
Sbjct: 554 ISKDGTVTPIAGNGTAGFSGDGGPATSAELNGPGTAVVDKNGNIYVPDTANNRIRKIT 611



 Score = 42.0 bits (97), Expect = 0.13,   Method: Composition-based stats.
 Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ + V+  +AG G +G + DG P +A +   P S  VD  G +Y+AD  N  IR++
Sbjct: 777 VTPDGVITTIAGTGAEGNTGDGGPATAAQLKSPSSVVVDASGAVYIADNGNKEIRRV 833



 Score = 37.4 bits (85), Expect = 2.7,   Method: Composition-based stats.
 Identities = 20/58 (34%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  +  + G+G  G+S DG P + A  +  +  AV   G++Y+AD SN  IRK+T
Sbjct: 610 ITPDGKITTVVGNGTAGFSGDGGPATQAEINSVEGIAVGPDGSLYLADYSNERIRKVT 667


>gi|452992643|emb|CCQ95895.1| hypothetical protein CULT_320031 [Clostridium ultunense Esp]
          Length = 329

 Score = 50.4 bits (119), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 15/91 (16%)

Query: 53  RPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSAR---------- 102
           R    ++ +LDS   A   ++    ++  V+ +AG G  GY+ G+ G A+          
Sbjct: 182 RGPEGNIYILDSKNYAVRKINI---DKETVELIAGTGKPGYT-GDGGDAKDATFGGNKES 237

Query: 103 -FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
            FD P S ++D  GNIYV D  NHV+R ITN
Sbjct: 238 YFDGPWSLSLDEVGNIYVGDTQNHVVRMITN 268


>gi|357516379|ref|XP_003628478.1| hypothetical protein MTR_8g058630 [Medicago truncatula]
 gi|355522500|gb|AET02954.1| hypothetical protein MTR_8g058630 [Medicago truncatula]
          Length = 150

 Score = 50.4 bits (119), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 43/111 (38%), Gaps = 43/111 (38%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN---------------- 132
           G  G+ DG+   ARF  PK  A+D KGN+YVAD  N  IRKI +                
Sbjct: 24  GRTGHVDGKLSDARFHYPKGIALDDKGNVYVADTQNMAIRKIGDAGVTTIAGGKSNVAGY 83

Query: 133 ---------------------------LDHGNQLIRQINLKPEDCSKSSQS 156
                                      +D GN  +RQI L  EDC+  S S
Sbjct: 84  RDGPGEDAKLSNDFDVVYIRPTCSLLVIDRGNAALRQIFLNQEDCNYQSSS 134


>gi|298387997|ref|ZP_06997545.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
 gi|298259263|gb|EFI02139.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
          Length = 439

 Score = 50.4 bits (119), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 43/73 (58%), Gaps = 8/73 (10%)

Query: 65  SRSAFYTLS-------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           S+SA Y          +P  +  + + + G G  G+ DG+   A+F++P+  A+D +GN+
Sbjct: 311 SKSAVYRFKLTRDADGWPALDGDIEEYIPGAGA-GFRDGDVQEAQFNEPRGIAIDKEGNL 369

Query: 118 YVADKSNHVIRKI 130
           Y+AD +NH IRK+
Sbjct: 370 YIADVNNHRIRKV 382



 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 35/52 (67%), Gaps = 1/52 (1%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++V  +AG G +GY DG+P  A+FD+P    +D    +Y+AD++NH IRK+ 
Sbjct: 387 NIVTTIAGSG-KGYKDGDPLEAQFDQPWGVYLDKNEFLYIADQNNHCIRKLA 437


>gi|380692433|ref|ZP_09857292.1| NHL repeat-containing protein [Bacteroides faecis MAJ27]
          Length = 439

 Score = 50.4 bits (119), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 43/73 (58%), Gaps = 8/73 (10%)

Query: 65  SRSAFYTLS-------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           S+SA Y          +P  +  + + + G G  G+ DG+   A+F++P+  A+D +GN+
Sbjct: 311 SKSAVYRFKLTRDADGWPALDGDIEEYIPGAGA-GFRDGDVQEAQFNEPRGIAIDKEGNL 369

Query: 118 YVADKSNHVIRKI 130
           Y+AD +NH IRK+
Sbjct: 370 YIADVNNHRIRKV 382



 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 35/52 (67%), Gaps = 1/52 (1%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++V  +AG G +GY DG+P  A+FD+P    +D    +Y+AD++NH IRK+ 
Sbjct: 387 NIVTTIAGSG-KGYKDGDPLEAQFDQPWGVYLDKNEFLYIADQNNHCIRKLA 437


>gi|406836139|ref|ZP_11095733.1| NHL repeat containing protein [Schlesneria paludicola DSM 18645]
          Length = 365

 Score = 50.4 bits (119), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 5/78 (6%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKG 115
           D+I++    +A Y +     +E  V+ LAG G +GY+ DG  G  A+ + PK  A+D +G
Sbjct: 236 DVILVLREGNAVYRID---RKEKSVRHLAGTGKKGYAGDGGDGKLAQVNGPKGIAIDHQG 292

Query: 116 NIYVADKSNHVIRKITNL 133
           NI + D  NHVIR I  L
Sbjct: 293 NILLCDTENHVIRIIERL 310



 Score = 42.0 bits (97), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 82  VKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  L GDG  G   DG P   R ++P    V + G +Y+ D  NH IRK+T
Sbjct: 314 IDTLVGDGTIGDGPDGNPRHCRLNRPHGVFVALDGTVYIGDSGNHKIRKLT 364


>gi|29348037|ref|NP_811540.1| cell surface protein [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339939|gb|AAO77734.1| putative cell surface protein, have conserved domain [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 434

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 40/73 (54%), Gaps = 4/73 (5%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           L +  +++   YT +    E    K  AG  G  G++DG    A F+ P+  ++DM+GNI
Sbjct: 307 LYIAYAAKHVIYTYNIETGEH---KLFAGAFGQSGWNDGIATDAEFNSPRQMSLDMEGNI 363

Query: 118 YVADKSNHVIRKI 130
           Y+AD  NH IR I
Sbjct: 364 YIADSGNHCIRMI 376



 Score = 46.2 bits (108), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 30/51 (58%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V   +   G  GY+DG P  A  + P+  AV+ +G++Y+AD  N  IRK+T
Sbjct: 382 VTTPIGQPGEAGYADGSPDMALLNDPRGVAVNSEGDVYIADLGNRCIRKLT 432


>gi|29348305|ref|NP_811808.1| hypothetical protein BT_2896 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383121762|ref|ZP_09942466.1| hypothetical protein BSIG_1754 [Bacteroides sp. 1_1_6]
 gi|29340208|gb|AAO78002.1| NHL repeat protein [Bacteroides thetaiotaomicron VPI-5482]
 gi|251841348|gb|EES69429.1| hypothetical protein BSIG_1754 [Bacteroides sp. 1_1_6]
          Length = 452

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 52/106 (49%), Gaps = 13/106 (12%)

Query: 31  GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF-----PLSEESVVKRL 85
           G  V  +IDG  +     S +D+P   + I +D+  +   T  +      ++ E  V   
Sbjct: 351 GNGVWGIIDGQGVS----SRMDQP---NQIAVDAEGNLLVTTVYGRTVRKITPEGYVSTY 403

Query: 86  AGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           AG G Q GY DG    A+F+KP   A+D +GN+YV D  N  IR I
Sbjct: 404 AGIGYQTGYVDGLAAEAKFNKPYGIAIDAQGNVYVGDCENWRIRVI 449



 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 29/48 (60%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           R AG+GV G  DG+  S+R D+P   AVD +GN+ V       +RKIT
Sbjct: 348 RFAGNGVWGIIDGQGVSSRMDQPNQIAVDAEGNLLVTTVYGRTVRKIT 395


>gi|380692993|ref|ZP_09857852.1| cell surface protein [Bacteroides faecis MAJ27]
          Length = 457

 Score = 50.1 bits (118), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 77  SEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + E   +  AGD G  G++DG+   A+F+ P+   +D + N+Y+AD  NH IRK+T
Sbjct: 345 TREGNFRIYAGDKGRAGHADGQETEAQFNSPRQLVLDEEENLYIADSGNHCIRKVT 400



 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ E VV  + G+    GY DG P  A F +P   A+D +G IY+ DK N  +RK++
Sbjct: 399 VTPEGVVSTVIGNPNSSGYKDGTPEIALFTEPWGLAIDSEGTIYIGDKDNRCVRKLS 455


>gi|333383538|ref|ZP_08475196.1| hypothetical protein HMPREF9455_03362 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332827477|gb|EGK00223.1| hypothetical protein HMPREF9455_03362 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 434

 Score = 50.1 bits (118), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 30/51 (58%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V   +   G +GY DG P  A F KP+  AVD  G++Y+AD  N  +RK+T
Sbjct: 382 VTTPIGQPGQEGYMDGSPEIALFKKPRGVAVDKNGDVYIADYGNRCLRKLT 432



 Score = 49.3 bits (116), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 2/72 (2%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           L +  + R   YT +   +E  +       G  G+SDG    A F+ P+   +D++GN+Y
Sbjct: 307 LYIAYTQRHCIYTYNLATNEHLLFA--GAPGQPGWSDGRVSDAEFNLPRQMTLDLEGNLY 364

Query: 119 VADKSNHVIRKI 130
           +AD+ NH IR I
Sbjct: 365 IADEGNHCIRMI 376


>gi|118161380|gb|ABK64062.1| conserved hypothetical protein [Janthinobacterium lividum]
          Length = 779

 Score = 50.1 bits (118), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 31/85 (36%), Positives = 46/85 (54%), Gaps = 6/85 (7%)

Query: 53  RPGSSDLIVLDSSRSAFYTLSF-----PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKP 106
           R G    I  D+S + + T S       ++   VV  LAG  GV G +DG   +A F+ P
Sbjct: 642 RFGYLRAIAFDASGNLYLTDSLNHNVRKITPAGVVTTLAGTTGVAGDADGRGSAASFNGP 701

Query: 107 KSFAVDMKGNIYVADKSNHVIRKIT 131
              A+D  GN+YVAD  N+++R+I+
Sbjct: 702 HGIALDKAGNVYVADTENNLVRRIS 726



 Score = 48.9 bits (115), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 31/56 (55%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           L+   +V  LAG    G +DG    A F       +D +GN++VAD+ NH IRKIT
Sbjct: 331 LTPAGIVSTLAGGSAYGSADGAGAVATFSSLNGLTIDKQGNLFVADEGNHTIRKIT 386



 Score = 38.9 bits (89), Expect = 0.99,   Method: Composition-based stats.
 Identities = 17/38 (44%), Positives = 23/38 (60%)

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +DG    ARF   ++ A D  GN+Y+ D  NH +RKIT
Sbjct: 634 ADGSLAQARFGYLRAIAFDASGNLYLTDSLNHNVRKIT 671



 Score = 38.1 bits (87), Expect = 1.8,   Method: Composition-based stats.
 Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 7/63 (11%)

Query: 76  LSEESVVKRLAGDGVQ---GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV----IR 128
           ++   VV  LAG   Q     +DG   +ARF+ P   AVD  GN+YV+D +  +    +R
Sbjct: 385 ITPAGVVSTLAGTASQLRPAVADGIGAAARFNLPYGLAVDGAGNVYVSDSNPGLQLNGVR 444

Query: 129 KIT 131
           KIT
Sbjct: 445 KIT 447



 Score = 37.7 bits (86), Expect = 2.5,   Method: Composition-based stats.
 Identities = 15/40 (37%), Positives = 25/40 (62%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G +DG    ARF  P++ A+   G ++VAD  N +IR+++
Sbjct: 463 GVTDGLASEARFAGPQAIALHSDGTLFVADTGNQLIRRVS 502



 Score = 37.4 bits (85), Expect = 3.1,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 33/64 (51%), Gaps = 8/64 (12%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV-------I 127
           ++ +  V  LAG     G SDG+  SARF      AVD  G IYVAD  +++       I
Sbjct: 269 ITPDGTVTTLAGISRTYGTSDGDAKSARFGGISGIAVDTAGAIYVADAGSNINGIVGSRI 328

Query: 128 RKIT 131
           RK+T
Sbjct: 329 RKLT 332


>gi|329935738|ref|ZP_08285543.1| hypothetical protein SGM_1035 [Streptomyces griseoaurantiacus M045]
 gi|329304829|gb|EGG48702.1| hypothetical protein SGM_1035 [Streptomyces griseoaurantiacus M045]
          Length = 617

 Score = 50.1 bits (118), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 26/75 (34%), Positives = 45/75 (60%), Gaps = 3/75 (4%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S +L+V D++R     L+     ES V+R+ G G +G++DG   +A F +P+  A+   G
Sbjct: 198 SGNLLVSDTTRHQLVELA--ADGESEVRRI-GSGARGFADGTAEAAAFSEPQGLALLEDG 254

Query: 116 NIYVADKSNHVIRKI 130
           ++ VAD  NH +R++
Sbjct: 255 SVVVADTVNHALRRL 269


>gi|196230675|ref|ZP_03129536.1| Peptidase S53 propeptide [Chthoniobacter flavus Ellin428]
 gi|196225016|gb|EDY19525.1| Peptidase S53 propeptide [Chthoniobacter flavus Ellin428]
          Length = 1120

 Score = 50.1 bits (118), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 44/77 (57%), Gaps = 6/77 (7%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMK 114
           ++++ V DS  SA   ++       VV  LAG  G+ G +DG    A F+ P+  AVD  
Sbjct: 753 ANNIYVADSGNSAIRKIT----PGRVVSTLAGQPGLSGSADGTT-KALFNNPQGVAVDSA 807

Query: 115 GNIYVADKSNHVIRKIT 131
           GN+YVAD +N  IRKIT
Sbjct: 808 GNVYVADTTNETIRKIT 824



 Score = 50.1 bits (118), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 32/71 (45%), Positives = 41/71 (57%), Gaps = 5/71 (7%)

Query: 62  LDSSRSAFYTLSFPLSEESVVKRLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           L S+ S+   +  PL    V+  LAG  + QG +DG   +A+F  P   AVD  GNIYVA
Sbjct: 649 LTSAPSSVLIVDSPL----VISTLAGQVLTQGTADGTGTAAQFGYPSGVAVDSSGNIYVA 704

Query: 121 DKSNHVIRKIT 131
           D +N  IRKIT
Sbjct: 705 DFNNDTIRKIT 715



 Score = 43.5 bits (101), Expect = 0.037,   Method: Composition-based stats.
 Identities = 21/43 (48%), Positives = 26/43 (60%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GV G ++G   +A F+ P   A+D   NIYVAD  N  IRKIT
Sbjct: 728 GVIGATNGTGTNATFNTPNGVAIDSANNIYVADSGNSAIRKIT 770



 Score = 39.7 bits (91), Expect = 0.59,   Method: Composition-based stats.
 Identities = 18/29 (62%), Positives = 21/29 (72%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           F+ P S AVD  GN+YVAD  N V+RKIT
Sbjct: 851 FNGPSSVAVDSAGNVYVADLYNFVVRKIT 879



 Score = 37.0 bits (84), Expect = 4.2,   Method: Composition-based stats.
 Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 2/56 (3%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQIN 144
           G+ G  DG   +A F+ P   AVD   N+Y+ D  + +   +T+   GN L+R++N
Sbjct: 892 GMPGRLDGIGTAALFNAPIGVAVDANNNLYITD--SQIPPDLTSTSTGNNLVRRVN 945


>gi|116620673|ref|YP_822829.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116223835|gb|ABJ82544.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 912

 Score = 50.1 bits (118), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 24/60 (40%), Positives = 38/60 (63%), Gaps = 2/60 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ + V+  +AG G  G+S   G   +A+ ++P+  AVD +GN+YVAD  N+ IRKI  L
Sbjct: 524 VTTDGVIHTIAGFGTPGFSGDGGAATAAKLNRPRGVAVDAQGNVYVADTGNNRIRKIDPL 583



 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 40/58 (68%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   +++ +AG+G  GY  DG P + A+ + P + AVD +GN+++AD +N V+R++T
Sbjct: 124 IATTGIIQTVAGNGTPGYQGDGGPATTAQLNAPFAVAVDGQGNVFIADTNNVVVRRVT 181



 Score = 45.8 bits (107), Expect = 0.007,   Method: Composition-based stats.
 Identities = 30/115 (26%), Positives = 54/115 (46%), Gaps = 5/115 (4%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTL---SFPLS 77
           L+  G +    Y ++T+  G        +     G    + +D++ + +++     F + 
Sbjct: 10  LLFCGSVWAQQYVISTIAGGAAPPTPVAAAKASVGDPTRVAVDAAGNVYFSSLHSIFKVD 69

Query: 78  EESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
               + R AG+G  G S   G+  SA+   P   AVD  GNI+VAD+  +V+R+I
Sbjct: 70  SSGTMTRFAGNGRPGNSGDGGQASSAQLLFPMGLAVDAAGNIFVADRDANVVRRI 124



 Score = 43.1 bits (100), Expect = 0.055,   Method: Composition-based stats.
 Identities = 35/118 (29%), Positives = 55/118 (46%), Gaps = 17/118 (14%)

Query: 17  LLFSLVSSGL--LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF 74
           LL +L  +GL     DG   TT     Q +      ID  G  +++++DS       +S 
Sbjct: 362 LLTTLAGNGLASFGGDGAAATTA----QFDTPTGVAIDAGG--NVLIVDSQNQRLRKIS- 414

Query: 75  PLSEESVVKRLAGDGVQGYSDGE---PGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
                 V+  +AG G  G+ +GE   P +A+ + P+  A D  GN +VAD  N  +R+
Sbjct: 415 ----RGVITTIAGTGTAGF-NGEVVLPATAQLNTPRGVAADAYGNYFVADTGNRRVRE 467



 Score = 42.0 bits (97), Expect = 0.11,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 35/58 (60%), Gaps = 4/58 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +  +   AG G +G++ G+ G+AR   FD P+  AVD  G +Y+AD  N  IR++
Sbjct: 180 VTPDGTISTYAGSGARGFA-GDGGAARNAWFDGPEGVAVDANGVLYIADTFNGRIRRV 236



 Score = 40.0 bits (92), Expect = 0.44,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 2/55 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +S    +  +AG+G   Y+ DG  G+A + ++P   AVD  GNIYVAD  N+ IR
Sbjct: 636 VSTGGTITTIAGNGQCCYTGDGGLGTAAQLNQPWGIAVDSAGNIYVADSGNNAIR 690



 Score = 40.0 bits (92), Expect = 0.53,   Method: Composition-based stats.
 Identities = 19/44 (43%), Positives = 23/44 (52%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           GV G  +G P SA    P   AVD  GN Y+AD  N  +R + N
Sbjct: 252 GVFGGDNGPPASAALSLPTDVAVDRSGNPYIADFGNSRVRMVAN 295



 Score = 39.7 bits (91), Expect = 0.56,   Method: Composition-based stats.
 Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 2/49 (4%)

Query: 85  LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LAG+G+  +  DG   + A+FD P   A+D  GN+ + D  N  +RKI+
Sbjct: 366 LAGNGLASFGGDGAAATTAQFDTPTGVAIDAGGNVLIVDSQNQRLRKIS 414



 Score = 38.9 bits (89), Expect = 0.94,   Method: Composition-based stats.
 Identities = 20/48 (41%), Positives = 30/48 (62%), Gaps = 2/48 (4%)

Query: 86  AGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           AG+G   Y  DG P + A  ++P+  A D  GN Y+AD  ++V+RK+T
Sbjct: 478 AGNGNASYFGDGLPATQASVNQPEGVAGDAAGNTYIADTFDNVVRKVT 525


>gi|158315873|ref|YP_001508381.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158111278|gb|ABW13475.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 850

 Score = 50.1 bits (118), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 31/87 (35%), Positives = 49/87 (56%), Gaps = 9/87 (10%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARF 103
           P SV +DRPG   L + DS  +    +      + V++ +AG G+ GYS DG P + A  
Sbjct: 711 PISVAMDRPGR--LYIADSGNNRIRRIGL----DGVIETVAGTGLPGYSGDGGPATRATL 764

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI 130
             P+  AVD +G I++ D++N  IR++
Sbjct: 765 RSPRGVAVDARGAIFITDRTNRRIRRV 791



 Score = 42.7 bits (99), Expect = 0.076,   Method: Composition-based stats.
 Identities = 21/57 (36%), Positives = 34/57 (59%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +  +  +AG G  G+S DG P + AR  +P + A+D  GNI +AD  N  IR++
Sbjct: 567 ITADGTITTIAGTGEAGFSGDGGPATQARLRQPAAVALDSAGNILIADTFNQRIRRV 623



 Score = 41.2 bits (95), Expect = 0.20,   Method: Composition-based stats.
 Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 2/52 (3%)

Query: 82  VKRLAGDGVQGYS-DGEP-GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  +AG G   +S DG P GSA  + P     D  GNIYVAD  N+ +R+IT
Sbjct: 517 VVTVAGTGEAAFSGDGGPAGSAALNGPFGMVADWAGNIYVADFDNNRVRRIT 568



 Score = 38.1 bits (87), Expect = 1.6,   Method: Composition-based stats.
 Identities = 20/54 (37%), Positives = 32/54 (59%), Gaps = 2/54 (3%)

Query: 79  ESVVKRLAG-DGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + +++ +AG DG   + DG P + A    P S A+D  G +Y+AD  N+ IR+I
Sbjct: 682 DGIIQTVAGGDGEGAFGDGGPAADALLAFPISVAMDRPGRLYIADSGNNRIRRI 735


>gi|302552781|ref|ZP_07305123.1| NHL repeat containing protein [Streptomyces viridochromogenes DSM
           40736]
 gi|302470399|gb|EFL33492.1| NHL repeat containing protein [Streptomyces viridochromogenes DSM
           40736]
          Length = 605

 Score = 50.1 bits (118), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 31/99 (31%), Positives = 52/99 (52%), Gaps = 10/99 (10%)

Query: 39  DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
           DG  +   P   + R PG      S + +V D++R     L+     ESVV+R+ G G +
Sbjct: 165 DGPYVAPEPEPTVLRFPGKALLLPSGNFLVSDTTRHQLVELAG--DGESVVRRI-GSGAR 221

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G++DG    A F++P+   +   G++ VAD  NH +R++
Sbjct: 222 GFADGSADEAAFNEPQGLTLLDDGSVVVADTVNHALRRL 260


>gi|326778226|ref|ZP_08237491.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Streptomyces griseus XylebKG-1]
 gi|326658559|gb|EGE43405.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Streptomyces griseus XylebKG-1]
          Length = 615

 Score = 49.7 bits (117), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           L+V D++R     L    ++   V+R  G G +G  DG P  ARF +P+  AV   G + 
Sbjct: 198 LLVSDTTRHRLVELD---ADGETVRRHFGTGERGLRDGGPDEARFSEPQGLAVLPDGRVA 254

Query: 119 VADKSNHVIRKI 130
           VAD  NH IR +
Sbjct: 255 VADTVNHAIRAL 266


>gi|182437571|ref|YP_001825290.1| hypothetical protein SGR_3778 [Streptomyces griseus subsp. griseus
           NBRC 13350]
 gi|178466087|dbj|BAG20607.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
           NBRC 13350]
          Length = 605

 Score = 49.7 bits (117), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           L+V D++R     L    ++   V+R  G G +G  DG P  ARF +P+  AV   G + 
Sbjct: 188 LLVSDTTRHRLVELD---ADGETVRRHFGTGERGLRDGGPDEARFSEPQGLAVLPDGRVA 244

Query: 119 VADKSNHVIRKI 130
           VAD  NH IR +
Sbjct: 245 VADTVNHAIRAL 256


>gi|116619867|ref|YP_822023.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116223029|gb|ABJ81738.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 592

 Score = 49.7 bits (117), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 5/77 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDM 113
           S +L+++D        +    +  +++   AG+G QG+S   G+   A F+ P + AVD 
Sbjct: 166 SGNLLIVDQGAHKIRKVE---AASNIISTFAGNGSQGFSGDGGQAAQASFNNPTALAVDA 222

Query: 114 KGNIYVADKSNHVIRKI 130
            G +YV D+SN  IR+I
Sbjct: 223 AGTVYVTDQSNQRIRRI 239



 Score = 42.4 bits (98), Expect = 0.091,   Method: Composition-based stats.
 Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+  LAG G  G++ DG P + A+ + P    +D  GNIYV+D+ N  +RKI
Sbjct: 75  VITTLAGIGTAGFAGDGGPAAQAQLNGPLGLCIDGSGNIYVSDQGNKRVRKI 126



 Score = 41.2 bits (95), Expect = 0.21,   Method: Composition-based stats.
 Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 11/88 (12%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESV----VKRLAGDGVQGYSDGEPGSA---RF 103
           ++ PG    IVLDSS + +   S       V    +  +AG G  G+S G+ G+A   + 
Sbjct: 268 LNYPGG---IVLDSSGTLYIVDSVNQRVRKVSGTTISTVAGTGTAGFS-GDGGAALQAQL 323

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + P     D  GN+YV D SN+ +RKIT
Sbjct: 324 NNPFPITADGSGNLYVGDVSNNRVRKIT 351



 Score = 35.8 bits (81), Expect = 8.8,   Method: Composition-based stats.
 Identities = 17/68 (25%), Positives = 36/68 (52%), Gaps = 2/68 (2%)

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADK 122
           SR     L+  ++    +  +AG+G  G++   G+   A+ ++      D  GN+Y+A++
Sbjct: 3   SRFVLPLLAAAIASGQTITTVAGNGTAGFAGDTGQATQAQINRAVGLVTDANGNLYLAEE 62

Query: 123 SNHVIRKI 130
            N+ +RK+
Sbjct: 63  LNNRVRKV 70


>gi|380693812|ref|ZP_09858671.1| hypothetical protein BfaeM_07498 [Bacteroides faecis MAJ27]
          Length = 456

 Score = 49.7 bits (117), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 23/44 (52%), Positives = 27/44 (61%)

Query: 88  DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +G  GY DG  G A F+KP    VD +GNI + D  NH IRKIT
Sbjct: 355 EGKSGYLDGPIGQAMFNKPGQMCVDSEGNIILTDTENHCIRKIT 398


>gi|29349488|ref|NP_812991.1| hypothetical protein BT_4080 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|298384984|ref|ZP_06994543.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|383120427|ref|ZP_09941155.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
 gi|29341397|gb|AAO79185.1| putative cell surface protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251840523|gb|EES68605.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
 gi|298262128|gb|EFI04993.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 456

 Score = 49.7 bits (117), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 23/44 (52%), Positives = 27/44 (61%)

Query: 88  DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +G  GY DG  G A F+KP    VD +GNI + D  NH IRKIT
Sbjct: 355 EGKSGYLDGPIGQAMFNKPGQMCVDSEGNIILTDTENHCIRKIT 398


>gi|116749588|ref|YP_846275.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
 gi|116698652|gb|ABK17840.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
          Length = 1750

 Score = 49.7 bits (117), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 24/48 (50%), Positives = 31/48 (64%), Gaps = 2/48 (4%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AG GV GYS  DG    AR + P + AVD   +IY+AD +NH IRK+
Sbjct: 465 VAGKGVAGYSGDDGPAAEARLNNPSAIAVDGSESIYIADTNNHRIRKV 512



 Score = 47.8 bits (112), Expect = 0.002,   Method: Composition-based stats.
 Identities = 41/127 (32%), Positives = 56/127 (44%), Gaps = 15/127 (11%)

Query: 12  ITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPG------SSDLIVLDSS 65
           I L  LLF   SS   + D        DG  LE N    +D P       S ++ + D++
Sbjct: 2   IFLAILLFPGPSSAERVVDTVAGLQSDDGPALEAN----LDSPSGVAVDTSGNVYLSDTN 57

Query: 66  RSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKS 123
                 L     +   ++ LAG    GYS   G    A  ++P+  AVD  GN+Y AD +
Sbjct: 58  NHRIRKLDVAAGQ---IQTLAGGQSPGYSGDGGTAAKAGLNRPRGIAVDAAGNVYFADSN 114

Query: 124 NHVIRKI 130
           NH IRKI
Sbjct: 115 NHCIRKI 121



 Score = 44.3 bits (103), Expect = 0.023,   Method: Composition-based stats.
 Identities = 25/52 (48%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  +AG G  G + DG P  SAR   P   AVD  GNIYVAD  NH +R+I
Sbjct: 126 IITTVAGTGSAGSNGDGGPAASARLAYPFGIAVDPSGNIYVADLGNHKVRRI 177



 Score = 42.7 bits (99), Expect = 0.067,   Method: Composition-based stats.
 Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 6/91 (6%)

Query: 42  QLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS 100
           +  +N  S I   GS  + + D++      +         +  +AG+G  GYS DG   +
Sbjct: 482 EARLNNPSAIAVDGSESIYIADTNNHRIRKVD----GGGTITTVAGNGTPGYSGDGASAT 537

Query: 101 A-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           A   + P   AVD  GN+++AD SNH +R +
Sbjct: 538 AASLNFPNGVAVDADGNVFIADTSNHRVRMV 568



 Score = 42.0 bits (97), Expect = 0.11,   Method: Composition-based stats.
 Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 2/54 (3%)

Query: 79  ESVVKRLAGDGVQG-YSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + V+  +AG G  G   DG P + AR   P + AVD + NIY+AD  +H IR++
Sbjct: 291 DGVINTVAGTGYGGSLEDGIPATGARLKSPVALAVDSQNNIYIADTYSHRIRRV 344



 Score = 39.3 bits (90), Expect = 0.71,   Method: Composition-based stats.
 Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + +  V+  +AG+G  GYS   G   SA    P    VD  G +Y+AD  N+ +RK+
Sbjct: 567 MVDSGVITTVAGNGTPGYSGDGGAAVSASLKAPHGVWVDATGALYIADAHNYRVRKV 623


>gi|325965074|ref|YP_004242980.1| thiol-disulfide isomerase-like thioredoxin [Arthrobacter
           phenanthrenivorans Sphe3]
 gi|323471161|gb|ADX74846.1| thiol-disulfide isomerase-like thioredoxin [Arthrobacter
           phenanthrenivorans Sphe3]
          Length = 674

 Score = 49.7 bits (117), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 36/113 (31%), Positives = 57/113 (50%), Gaps = 10/113 (8%)

Query: 29  EDGYTVTTVIDGHQLEI---NPHSVI-DRPGSSDLIVLDSSRSAFYTLSF-PLSEESVVK 83
           ED    +  + GH L++   +P  V+  R  ++ +I +  +   F   SF PL+    + 
Sbjct: 325 EDAAGFSGQLVGHPLDVALSSPWDVVWSRKLNAVVIAMAGTHQIF---SFDPLTGSVAI- 380

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
            +AG+G++G  DG P  A F +P   A D  GNI+VAD     +RK+   D G
Sbjct: 381 -VAGNGLEGLLDGAPHEAWFAQPSGLAEDADGNIWVADSETSALRKLVIDDDG 432


>gi|451980854|ref|ZP_21929239.1| conserved hypothetical protein, contains NHL repeats [Nitrospina
           gracilis 3/211]
 gi|451761976|emb|CCQ90480.1| conserved hypothetical protein, contains NHL repeats [Nitrospina
           gracilis 3/211]
          Length = 729

 Score = 49.3 bits (116), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + +  ++  + G GV GYS DG P + A    P     D +GN+YVAD+ NHV+RK+
Sbjct: 68  IDQNGMLTNVVGTGVSGYSGDGGPATEATLKVPAGLTFDKEGNLYVADRENHVVRKV 124



 Score = 47.4 bits (111), Expect = 0.003,   Method: Composition-based stats.
 Identities = 21/52 (40%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++   AG G  GYS   G    A+ + P    VD KGN++++D+SN+VIRK+
Sbjct: 129 IITTFAGTGKAGYSGDKGPATQAKLNLPSDMTVDHKGNLFISDRSNNVIRKV 180



 Score = 47.0 bits (110), Expect = 0.004,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ E  +  +AG+G   Y+ DG P   A F  P+S  +D +GN+Y+ D +N+VIRKI
Sbjct: 528 VTPEGKIITIAGNGSSDYTGDGGPAKDASFRDPQSLTMDKEGNLYIGDTANNVIRKI 584



 Score = 43.9 bits (102), Expect = 0.035,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 5/57 (8%)

Query: 86  AGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
           AG G +GY+ D  P      DKP   AVD  GN+Y+AD+ N+ IRK+   D G+ L+
Sbjct: 190 AGTGNEGYNGDNMPALRTNLDKPFGLAVDKHGNLYIADRGNNRIRKV---DAGSGLM 243



 Score = 37.7 bits (86), Expect = 2.2,   Method: Composition-based stats.
 Identities = 18/39 (46%), Positives = 24/39 (61%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +G    A    P   AVD +GN+YVAD++N+ IRKI  L
Sbjct: 258 NGPAYRASIAGPTDVAVDDEGNVYVADRNNNRIRKINTL 296



 Score = 37.7 bits (86), Expect = 2.5,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 34/58 (58%), Gaps = 3/58 (5%)

Query: 76  LSEESVVKRLAG-DGVQG-YSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +  V+ +AG  GVQG + DG   + A   +P   A D KGN+Y+ D  N+ IRK+
Sbjct: 640 ITRDGKVELVAGRPGVQGLFGDGGKATEAMLKQPACIAFDSKGNMYITDMGNNRIRKV 697



 Score = 35.8 bits (81), Expect = 7.8,   Method: Composition-based stats.
 Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 11/78 (14%)

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQGYSD-----GEPGSARFDKP------KSFAVDM 113
           S S F +L   +    ++  L   G Q Y +     G  G +   +          A D 
Sbjct: 452 SPSGFVSLIRKVDNNGIISLLISTGDQDYQEAVMTAGYTGRSSHSEITIITQFSGLAFDK 511

Query: 114 KGNIYVADKSNHVIRKIT 131
           KGN+YV+D+ NH IRK+T
Sbjct: 512 KGNLYVSDRINHQIRKVT 529


>gi|428180908|gb|EKX49774.1| hypothetical protein GUITHDRAFT_67798, partial [Guillardia theta
           CCMP2712]
          Length = 247

 Score = 49.3 bits (116), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%), Gaps = 2/55 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           V  +AG G  G+ DGE   A F++P   A D +GNI+VAD  N  IR IT   HG
Sbjct: 182 VTTIAGTGECGHRDGEASQATFNEPCFIACDQQGNIFVADSRNDSIRCITR--HG 234



 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 30/50 (60%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           +AG G +G  DG   S R + P+    D  G++Y+AD SNH IR++ + D
Sbjct: 74  VAGCGRRGCQDGSHESGRLNGPRGLTFDPHGDLYIADSSNHRIRRLRSQD 123


>gi|158319004|ref|YP_001511512.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158114409|gb|ABW16606.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 892

 Score = 49.3 bits (116), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 34/90 (37%), Positives = 48/90 (53%), Gaps = 9/90 (10%)

Query: 45  INPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSA 101
           ++P+SV ID  G+  +  L S R    T +  +S+      LAG G  GYS  +G   +A
Sbjct: 576 LSPYSVAIDPQGTLFITSLSSDRIQKVTRTGEVSD------LAGTGADGYSGDNGPATAA 629

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + + P S   D  GNIY+ D  N+ IRKIT
Sbjct: 630 KLNGPGSAVPDKNGNIYIPDAQNYRIRKIT 659



 Score = 45.1 bits (105), Expect = 0.015,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 36/58 (62%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ + ++  +AG G  G+S DG P +A + +  +  A+   G+IY+AD  NH IRKIT
Sbjct: 658 ITPDGIITTIAGTGTAGFSGDGGPATAAQINSAEKVAIGPDGSIYIADYDNHRIRKIT 715



 Score = 42.4 bits (98), Expect = 0.096,   Method: Composition-based stats.
 Identities = 20/58 (34%), Positives = 36/58 (62%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ + ++  +AG G+QGYS DG P +A + D P    +   G +Y+A+  ++ I+KIT
Sbjct: 714 ITPDGIINTIAGTGLQGYSGDGGPATAAKLDGPNDVELGDDGTLYIANLGSNTIQKIT 771



 Score = 38.9 bits (89), Expect = 0.93,   Method: Composition-based stats.
 Identities = 19/56 (33%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++++ +V  +AG+G +G+S DG P +A      S ++   G IY+AD  N+ +RK+
Sbjct: 770 ITKDGIVTTVAGNGQKGFSGDGGPATAAQLSVPSVSLGNGGEIYIADYGNNRVRKV 825


>gi|153808835|ref|ZP_01961503.1| hypothetical protein BACCAC_03135 [Bacteroides caccae ATCC 43185]
 gi|149128661|gb|EDM19879.1| NHL repeat protein [Bacteroides caccae ATCC 43185]
          Length = 426

 Score = 49.3 bits (116), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 25/56 (44%), Positives = 31/56 (55%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  E  V  + G  G  GY DG P  A FD+P   AVD +G IY+AD  N  IRK+
Sbjct: 368 IDREGAVSTVIGVPGRAGYVDGTPDDALFDEPWGVAVDEEGTIYIADTKNKCIRKL 423



 Score = 43.9 bits (102), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 25/39 (64%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           GY DG+  +ARF+ P     D+ G +Y+AD SNH IR I
Sbjct: 330 GYEDGKRLNARFNFPSQICFDLDGIMYIADSSNHCIRSI 368


>gi|423220897|ref|ZP_17207391.1| hypothetical protein HMPREF1061_04164 [Bacteroides caccae
           CL03T12C61]
 gi|392622375|gb|EIY16503.1| hypothetical protein HMPREF1061_04164 [Bacteroides caccae
           CL03T12C61]
          Length = 426

 Score = 49.3 bits (116), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 25/56 (44%), Positives = 31/56 (55%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  E  V  + G  G  GY DG P  A FD+P   AVD +G IY+AD  N  IRK+
Sbjct: 368 IDREGAVSTVIGVPGRAGYVDGTPDDALFDEPWGVAVDEEGTIYIADTKNKCIRKL 423



 Score = 43.9 bits (102), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 25/39 (64%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           GY DG+  +ARF+ P     D+ G +Y+AD SNH IR I
Sbjct: 330 GYEDGKRLNARFNFPSQICFDLDGIMYIADSSNHCIRSI 368


>gi|428179226|gb|EKX48098.1| hypothetical protein GUITHDRAFT_69001 [Guillardia theta CCMP2712]
          Length = 334

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 50/117 (42%), Gaps = 13/117 (11%)

Query: 15  LSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF 74
           ++LL      GL  EDG        G Q   N    I      DL + D        ++ 
Sbjct: 26  ITLLAGCEEDGL--EDGV-------GDQARFNSPVGIVESKEGDLYLCDQENHCIRKVT- 75

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
               +  V  LAG+G +GY DG+   AR + P    +D +GNI  AD  N  IRK++
Sbjct: 76  ---RKGEVTTLAGNGEEGYHDGKGKDARLNIPTGLCMDAQGNIIFADSGNQRIRKVS 129



 Score = 48.9 bits (115), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 34/53 (64%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           E +V  +AG+G  G++DG+   ARF+ P+   +D +  +Y+AD  N  IRK++
Sbjct: 184 EGMVTTVAGNGKMGWADGKGAKARFNSPQGMCIDKEDTVYIADYGNQRIRKMS 236



 Score = 45.1 bits (105), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 34/55 (61%), Gaps = 1/55 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S +  V  +AG   +G+ DG  G A F+ P   AVD KG+I+V+D  NH IRKI
Sbjct: 128 VSPDGTVTTIAGS-KKGFKDGPAGKALFNYPAYVAVDSKGSIFVSDFGNHCIRKI 181



 Score = 42.7 bits (99), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 33/56 (58%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S+E  V  +AG G  G++ G    ARF  P+S +V   G +YV D+ N  +RKI+
Sbjct: 235 MSKEGEVVTVAGSGEPGFAHGHGQLARFRGPRSVSVSQDGIVYVGDRENFRVRKIS 290



 Score = 40.4 bits (93), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 30/56 (53%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + +E  +  LAG    G  DG    ARF+ P       +G++Y+ D+ NH IRK+T
Sbjct: 20  IDKEGWITLLAGCEEDGLEDGVGDQARFNSPVGIVESKEGDLYLCDQENHCIRKVT 75


>gi|344923600|ref|ZP_08777061.1| NHL repeat containing protein [Candidatus Odyssella
           thessalonicensis L13]
          Length = 1226

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 43/77 (55%), Gaps = 5/77 (6%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           GS  + V DS+ +    ++ P    SV   LAG GV  Y++G   +ARF  P+   VD+ 
Sbjct: 35  GSGTVYVADSNNNVIKKIT-PAGTVSV---LAGSGVSDYAEGTGTAARFSYPEGVGVDVA 90

Query: 115 G-NIYVADKSNHVIRKI 130
           G  +YVAD  N VIRKI
Sbjct: 91  GTTVYVADSQNGVIRKI 107



 Score = 41.6 bits (96), Expect = 0.16,   Method: Composition-based stats.
 Identities = 19/42 (45%), Positives = 24/42 (57%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +  + DG   +ARF  P S AVD   N+Y+ D  NH IRKI
Sbjct: 123 ALNAHVDGTYTTARFAYPTSVAVDSSSNLYIGDTLNHCIRKI 164


>gi|294630411|ref|ZP_06708971.1| NHL repeat protein [Streptomyces sp. e14]
 gi|292833744|gb|EFF92093.1| NHL repeat protein [Streptomyces sp. e14]
          Length = 609

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 26/74 (35%), Positives = 43/74 (58%), Gaps = 3/74 (4%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S + +V D++R     L+     ESVV+R+ G G +G++DG    A F++P+  A+   G
Sbjct: 193 SGNFLVSDTTRHQLVELA--ADGESVVRRI-GSGARGFADGAAEDAAFNEPQGLALLDDG 249

Query: 116 NIYVADKSNHVIRK 129
            + VAD  NH +R+
Sbjct: 250 AVVVADTVNHALRR 263


>gi|398339828|ref|ZP_10524531.1| hypothetical protein LkirsB1_10395 [Leptospira kirschneri serovar
           Bim str. 1051]
          Length = 358

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++++ VV   AG GV    DG   +A F  P    VD  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+  ++IRKI     GN      NL  +D S
Sbjct: 96  DQMTNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 30/47 (63%), Gaps = 1/47 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           + G  +G+  ++ FD P    +D++ N+YV + SNH IRKI NLD G
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSG 209



 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G+ GY DG+  SARF  P   A D K + +  AD  +H IRKI
Sbjct: 217 GISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKI 259


>gi|403382146|ref|ZP_10924203.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
           JC66]
          Length = 531

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 41/115 (35%), Positives = 52/115 (45%), Gaps = 12/115 (10%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEINPHS-----VIDRPGSSDLIVLDSSRSAFYTLSFP 75
           L  S  L EDG  V  + DG + E++  S      +D  G  +L V D    A   +   
Sbjct: 98  LAGSSFLQEDGQVVDALGDG-KGELSSFSEPAGLALDHNG--NLFVADKGNHAVRKVD-- 152

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
              E  V   AG GV G+ DG    + F  P+   V   G +YVAD  NHVIRKI
Sbjct: 153 --AEGNVTTYAGQGVLGHKDGTAEESLFYAPEDVVVASDGTVYVADTLNHVIRKI 205



 Score = 42.7 bits (99), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 4/44 (9%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           LAGD    Y DG    A+F++P   A+D  GN+Y++D  N VIR
Sbjct: 231 LAGD----YKDGPLQEAKFNEPTGLAIDHLGNLYISDTGNRVIR 270



 Score = 41.2 bits (95), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 42/89 (47%), Gaps = 4/89 (4%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH-GNQ 138
           S V+ +AG G  G  DG+   A F  P   A    G +YV+D  +H++R+   LDH G  
Sbjct: 40  SSVQTIAGTGSFGARDGDKAEASFRHPSGLAAAPDGTLYVSDTKSHLLRR---LDHSGVS 96

Query: 139 LIRQINLKPEDCSKSSQSGSGMFWVTVFS 167
           L+   +   ED       G G   ++ FS
Sbjct: 97  LLAGSSFLQEDGQVVDALGDGKGELSSFS 125


>gi|225420274|ref|ZP_03762577.1| hypothetical protein CLOSTASPAR_06617 [Clostridium asparagiforme
           DSM 15981]
 gi|225041091|gb|EEG51337.1| hypothetical protein CLOSTASPAR_06617 [Clostridium asparagiforme
           DSM 15981]
          Length = 336

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 28/48 (58%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + LAG G +GY D   G A F  P   A D  GN+Y+AD  N VIR++
Sbjct: 150 RTLAGRGREGYEDNRGGRALFSGPTGLAADDAGNLYIADTGNDVIRRL 197



 Score = 44.7 bits (104), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 26/44 (59%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           G +G +DG  G ARF  P   AV  +G +YV+D  N  +RKI N
Sbjct: 247 GAEGQADGGFGQARFSSPTYLAVSEQGTLYVSDTGNAAVRKIEN 290


>gi|298386946|ref|ZP_06996500.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
 gi|298260096|gb|EFI02966.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
          Length = 457

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 77  SEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + E   +  AGD G  G++DG+   A+F  P+   +D + N+Y+AD  NH IRK+T
Sbjct: 345 TREGNFRIYAGDKGRAGHADGQETEAQFSSPRQLVLDEEENLYIADSGNHCIRKVT 400



 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ E VV  + G+    GY DG P  A F +P   A+D +G IY+ DK N  +RK++
Sbjct: 399 VTPEGVVSTVIGNPNTSGYKDGTPEIALFTEPWGLAIDSEGTIYIGDKDNRCVRKLS 455


>gi|421091013|ref|ZP_15551797.1| putative lipoprotein [Leptospira kirschneri str. 200802841]
 gi|410000210|gb|EKO50873.1| putative lipoprotein [Leptospira kirschneri str. 200802841]
          Length = 358

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++++ VV   AG GV    DG   +A F  P    VD  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+  ++IRKI     GN      NL  +D S
Sbjct: 96  DQMTNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 30/47 (63%), Gaps = 1/47 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           + G  +G+  ++ FD P    +D++ N+YV + SNH IRKI NLD G
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSG 209



 Score = 38.1 bits (87), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G+ GY DG+  SARF  P   A D K + +  AD  +H IRKI
Sbjct: 217 GISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKI 259


>gi|410098158|ref|ZP_11293137.1| hypothetical protein HMPREF1076_02315 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409222431|gb|EKN15373.1| hypothetical protein HMPREF1076_02315 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 446

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 25/46 (54%), Positives = 29/46 (63%), Gaps = 3/46 (6%)

Query: 89  GVQ---GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GVQ   G+ DGE   A F++PK    D  G +YVAD  NHVIRKIT
Sbjct: 344 GVQNETGWQDGEREEALFNEPKQICFDQDGVMYVADAGNHVIRKIT 389



 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ + VV  + G  GV+GY DG P  A F  P   A+D +G IYV D  N+ +RK+
Sbjct: 388 ITRDGVVSTVIGIAGVKGYVDGSPEDALFQYPTGVAIDKEGTIYVGDARNNCVRKL 443



 Score = 36.2 bits (82), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 28/121 (23%), Positives = 54/121 (44%), Gaps = 10/121 (8%)

Query: 53  RPGSSDLIVLDSSRSAF-YTLSFPLSEESVVKRLAGDGVQGYS------DGEPGSARFDK 105
           +PG+   I +   + +  +  +F  + ++ V  +AG  V G        DG    A FD+
Sbjct: 106 QPGAECQISVAIGKDSLTFENTFSYTTQTTVSTIAGKPVGGDDGHGEMIDGTLAEASFDQ 165

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTV 165
           P    VD + NI+V+++  H +R+I   +    ++  +     +C   + S +     TV
Sbjct: 166 PWFLCVDAEKNIFVSERLGHAVRQI---NEEKNVVSTLVKGEGNCQNPNASATDAEGKTV 222

Query: 166 F 166
           F
Sbjct: 223 F 223


>gi|357485931|ref|XP_003613253.1| hypothetical protein MTR_5g034550 [Medicago truncatula]
 gi|355514588|gb|AES96211.1| hypothetical protein MTR_5g034550 [Medicago truncatula]
          Length = 154

 Score = 49.3 bits (116), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 44/107 (41%), Gaps = 36/107 (33%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT-------------- 131
           + +G  G+ D +   AR + PK   VD +GNIYVAD  N  IRKI+              
Sbjct: 11  SAEGYSGHVDEKLREARMNHPKGITVDDRGNIYVADIINMAIRKISLGNNMTYLSFLYEE 70

Query: 132 ----------------------NLDHGNQLIRQINLKPEDCSKSSQS 156
                                  +D G Q IR+I L+ +DC+   +S
Sbjct: 71  SLILFYLLLFDVIYVGSSYSLLVIDRGKQAIREIQLRFDDCAYQYES 117


>gi|429202583|ref|ZP_19193964.1| antioxidant, AhpC/TSA family [Streptomyces ipomoeae 91-03]
 gi|428661888|gb|EKX61363.1| antioxidant, AhpC/TSA family [Streptomyces ipomoeae 91-03]
          Length = 619

 Score = 48.9 bits (115), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 33/99 (33%), Positives = 52/99 (52%), Gaps = 10/99 (10%)

Query: 39  DGHQLEINPHSVIDR-PGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
           DG  +   P   + R PG +      D +V D++R     L+ P   E+VV+R  G G +
Sbjct: 170 DGPYVPPEPDPTVLRFPGKALLLPDGDFLVSDTTRHRLVRLA-P-DGETVVRRY-GTGER 226

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G+ DG    ARF +P+  A+   G++ VAD  NH +R++
Sbjct: 227 GFVDGPADHARFSEPQGLALLDSGDVVVADTVNHALRRL 265


>gi|365861024|ref|ZP_09400809.1| alkyl hydroperoxide reductase [Streptomyces sp. W007]
 gi|364009547|gb|EHM30502.1| alkyl hydroperoxide reductase [Streptomyces sp. W007]
          Length = 560

 Score = 48.9 bits (115), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
           +V D++R     L    ++   V+R  G G +G SDG P  ARF +P+  AV   G + V
Sbjct: 144 LVSDTTRHRLVELD---ADGETVRRHFGTGERGLSDGGPDEARFSEPQGLAVLPDGRVAV 200

Query: 120 ADKSNHVIRKI 130
           AD  NH +R +
Sbjct: 201 ADTVNHALRAL 211


>gi|296136309|ref|YP_003643551.1| NHL repeat containing protein [Thiomonas intermedia K12]
 gi|295796431|gb|ADG31221.1| NHL repeat containing protein [Thiomonas intermedia K12]
          Length = 366

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/57 (45%), Positives = 35/57 (61%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S + VV  LAG  G  G+ DG    ARF+ P    +D +G IYV++  N+VIRKIT
Sbjct: 196 ISAKGVVSTLAGSPGDTGWRDGRGAQARFNTPVGLTLDAQGQIYVSEYFNNVIRKIT 252



 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 34/57 (59%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S E  V  LAG   V+G++DG    ARF+ P    V+ KG +YVAD  N  +R+I+
Sbjct: 141 ISREGRVSTLAGKLEVEGFADGTGQQARFNHPVGLGVNAKGVVYVADAYNSTVRRIS 197



 Score = 43.5 bits (101), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 26/38 (68%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           Y+ G+   ARF  P+  A+D +GNI+VAD  N V+RK+
Sbjct: 49  YTGGDFAQARFHDPRGMALDAQGNIFVADYVNSVVRKL 86


>gi|358461043|ref|ZP_09171215.1| serine/threonine protein kinase [Frankia sp. CN3]
 gi|357074242|gb|EHI83734.1| serine/threonine protein kinase [Frankia sp. CN3]
          Length = 866

 Score = 48.9 bits (115), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/57 (42%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ + ++  +AG G +GY+ DG P  +A+   P S  VD  G IYVAD  N VIR+I
Sbjct: 797 VTSDGIITTIAGTGAEGYTGDGGPANAAQLSDPTSVVVDAGGAIYVADNGNKVIRRI 853



 Score = 40.0 bits (92), Expect = 0.42,   Method: Composition-based stats.
 Identities = 21/57 (36%), Positives = 34/57 (59%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++++ VV  +AG+   G+S DG P +A +   P   A D  GN+Y+ D  N+ +RKI
Sbjct: 574 IAKDGVVTPIAGNAQDGFSGDGGPATAAQLYGPGRVAWDKAGNLYIPDTQNYRVRKI 630


>gi|239989050|ref|ZP_04709714.1| hypothetical protein SrosN1_17240 [Streptomyces roseosporus NRRL
           11379]
 gi|291446053|ref|ZP_06585443.1| alkyl hydroperoxide reductase/Thiol specific antioxidant/Mal
           allergen [Streptomyces roseosporus NRRL 15998]
 gi|291349000|gb|EFE75904.1| alkyl hydroperoxide reductase/Thiol specific antioxidant/Mal
           allergen [Streptomyces roseosporus NRRL 15998]
          Length = 605

 Score = 48.9 bits (115), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
            +V D++R     L    ++   V+R  G G +G +DG P  ARF +P+  AV   G + 
Sbjct: 188 FLVSDTTRHRLVELD---ADGETVRRHFGTGERGLNDGGPDEARFSEPQGLAVLPDGRVA 244

Query: 119 VADKSNHVIRKI 130
           VAD  NH IR +
Sbjct: 245 VADTVNHAIRAL 256


>gi|223939666|ref|ZP_03631540.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223891717|gb|EEF58204.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 452

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 75  PLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           P     VV  +AG  G+ G+ DG    A F  P   A+D  GN+YVAD  N+ IRK+T L
Sbjct: 203 PDGTNWVVTTIAGAAGIHGFKDGTNNDALFYSPIGLALDSAGNMYVADTGNNAIRKLT-L 261

Query: 134 DHGNQLIRQI 143
           +  N ++  I
Sbjct: 262 EGTNWVVTTI 271



 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDM 113
           + ++ V D+  +A   L+       VV  +AG   Q  G  DG    A F  P+S AVD 
Sbjct: 243 AGNMYVADTGNNAIRKLTLE-GTNWVVTTIAGSTNQQNGSLDGTNNQALFTWPESPAVDS 301

Query: 114 KGNIYVADKSNHVIRKIT 131
            GN+YVAD  N+ IRK+T
Sbjct: 302 AGNVYVADSYNYTIRKVT 319



 Score = 42.0 bits (97), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + ++ VV  +AG +   G  DG    ARF  P   AVD  GN++VA+   + IRK++
Sbjct: 88  IGKDWVVTTIAGTNQAYGTKDGTNAEARFTGPTGLAVDASGNVFVAEGYANTIRKLS 144



 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 23/36 (63%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           DG   +A F  P+S A D  GN+YVAD +  VIRK+
Sbjct: 50  DGTNTAAHFYSPQSVACDASGNVYVADYNGRVIRKV 85


>gi|86610239|ref|YP_479001.1| NHL repeat-containing protein [Synechococcus sp. JA-2-3B'a(2-13)]
 gi|86558781|gb|ABD03738.1| NHL repeat protein [Synechococcus sp. JA-2-3B'a(2-13)]
          Length = 649

 Score = 48.9 bits (115), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/57 (42%), Positives = 31/57 (54%)

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           F LS E  ++  AG G  G  DG    A+FD P+    D KG ++VAD  NH +R I
Sbjct: 364 FRLSPEGELEVWAGTGQAGRRDGAANQAQFDSPQGLLWDPKGGLWVADSGNHCLRYI 420



 Score = 46.2 bits (108), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/50 (42%), Positives = 30/50 (60%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V   AG  + GY DG    A+F +P   A+   G++YVAD++N  IR+IT
Sbjct: 426 VSTFAGTCIAGYRDGGLDRAQFREPSGLALGSDGSLYVADRANRRIRRIT 475



 Score = 42.4 bits (98), Expect = 0.091,   Method: Composition-based stats.
 Identities = 18/39 (46%), Positives = 23/39 (58%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           GY DG    ARF      A D  GN++VAD+ NH IR++
Sbjct: 541 GYRDGPLAEARFQTLGGLAFDAAGNLWVADRDNHRIRRL 579



 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 20/55 (36%), Positives = 28/55 (50%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           V  LAG    G+ DG    ARFD+P   AV   G++ V D     +R+I+ +  G
Sbjct: 585 VSTLAGQNQPGWQDGPTSVARFDQPAGLAVLPDGSVVVVDAGLPGLRRISRVASG 639



 Score = 37.7 bits (86), Expect = 2.6,   Method: Composition-based stats.
 Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  +AG G  G  DG    A+  +P + AVD +GN+++AD   H +R+++
Sbjct: 480 VTTVAGTGQPGSVDGPAEQAQLLQPTALAVDKEGNLWIAD--GHRLRRLS 527


>gi|410938688|ref|ZP_11370531.1| putative lipoprotein [Leptospira noguchii str. 2006001870]
 gi|410786234|gb|EKR75182.1| putative lipoprotein [Leptospira noguchii str. 2006001870]
          Length = 358

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 43/91 (47%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + +V   AG G     DG   +A F  P    +D  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITNKPIVSLFAGTGAAVSVDGTTSTASFKTPFGLEIDTSGNIFVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D++ ++IRKI     GN      NL  ED S
Sbjct: 96  DQTANLIRKID--PSGNVKTLSTNLPLEDPS 124



 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           V  L+G G+ GY DG+  SARF  P   A + K + +  AD  +H IRKI
Sbjct: 211 VSTLSG-GISGYLDGDLASARFKSPLGIAYNHKTDSLLAADIQDHRIRKI 259


>gi|410694120|ref|YP_003624742.1| putative Soluble quinoprotein glucose dehydrogenase [Thiomonas sp.
           3As]
 gi|294340545|emb|CAZ88930.1| putative Soluble quinoprotein glucose dehydrogenase [Thiomonas sp.
           3As]
          Length = 366

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/57 (45%), Positives = 35/57 (61%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S + VV  LAG  G  G+ DG    ARF+ P    +D +G IYV++  N+VIRKIT
Sbjct: 196 ISAKGVVSTLAGSPGDTGWRDGRGAQARFNTPVGLTLDAQGQIYVSEYFNNVIRKIT 252



 Score = 43.1 bits (100), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 26/38 (68%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           Y+ G+   ARF  P+  A+D +GNI+VAD  N V+RK+
Sbjct: 49  YNGGDFAQARFHDPRGMALDAQGNIFVADYVNSVVRKL 86


>gi|421129432|ref|ZP_15589632.1| putative lipoprotein [Leptospira kirschneri str. 2008720114]
 gi|410358807|gb|EKP05916.1| putative lipoprotein [Leptospira kirschneri str. 2008720114]
          Length = 358

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++++ VV   AG GV    DG   +A F  P    VD  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+  ++IRKI     GN      NL  +D S
Sbjct: 96  DQMTNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 40.0 bits (92), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 30/47 (63%), Gaps = 1/47 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           + G  +G+  ++ FD P    +D++ N+YV + SNH IRKI NLD G
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSG 209



 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G+ GY DG+  SARF  P   A D K + +  AD  +H IRKI
Sbjct: 217 GISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKI 259


>gi|380692925|ref|ZP_09857784.1| hypothetical protein BfaeM_02978 [Bacteroides faecis MAJ27]
          Length = 454

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 44/76 (57%), Gaps = 3/76 (3%)

Query: 59  LIVLDSSRSAFYTLSFPLSEES--VVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKG 115
           L ++   +   Y +++  +  +  + +  AGD G  GY+ G+   ARF++P +  +D +G
Sbjct: 323 LYIIARKKHCIYKVAYNAATHTFGIPELFAGDYGESGYASGKGTGARFNQPSTPCLDPEG 382

Query: 116 NIYVADKSNHVIRKIT 131
           N+ + DK NH IRKIT
Sbjct: 383 NLLIPDKMNHCIRKIT 398


>gi|171913873|ref|ZP_02929343.1| NHL repeat containing protein [Verrucomicrobium spinosum DSM 4136]
          Length = 370

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSAR-FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +  ++ +AG+G +GY+ DG P +A  FD P     D  GN Y+AD SNH IRK+
Sbjct: 84  VTPDGKIETMAGNGKKGYTGDGGPATAASFDLPHEIRFDKAGNYYIADMSNHAIRKV 140



 Score = 40.4 bits (93), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 8/87 (9%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARF 103
           PH +  D+ G  +  + D S  A   +    ++  ++  LAG G  GY+  DG    A  
Sbjct: 116 PHEIRFDKAG--NYYIADMSNHAIRKVD---AKTGIITTLAGTGRGGYTGDDGPSEKAEL 170

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +P S      G++Y+ D  NHVIRKI
Sbjct: 171 KQPHSIQFGPDGDLYICDIGNHVIRKI 197



 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 2/55 (3%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  ++  +AG G  G++ +G P   A    PK  A+D +GN ++AD  +H +R I
Sbjct: 257 KAGIIHHIAGTGASGFTGNGGPAKLATLKGPKGIALDAEGNAWLADTESHTVRVI 311


>gi|196233467|ref|ZP_03132310.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
 gi|196222463|gb|EDY16990.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
          Length = 509

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 45/87 (51%), Gaps = 6/87 (6%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           NP  + ID  G+ D  V D +      +S P  E   +   AG+   G +DG   +ARF 
Sbjct: 224 NPSGLAIDEHGNVD--VADRTNQTLRRIS-PSGETETIAGTAGE--HGTADGAGKAARFW 278

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
            P+  A+D  GN+YVAD S   +RK+T
Sbjct: 279 YPQGLAIDEHGNVYVADTSAKSVRKVT 305



 Score = 42.0 bits (97), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 8/68 (11%)

Query: 71  TLSFPLSEESVVKRL-------AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKS 123
           T+    SE  VV  L       AG+  +G +DG   +A+F  P   A+D  GN+ VAD++
Sbjct: 184 TVKGASSEPRVVNELPCTFHVFAGE-TEGAADGAGVTAQFKNPSGLAIDEHGNVDVADRT 242

Query: 124 NHVIRKIT 131
           N  +R+I+
Sbjct: 243 NQTLRRIS 250


>gi|290972390|ref|XP_002668936.1| predicted protein [Naegleria gruberi]
 gi|284082474|gb|EFC36192.1| predicted protein [Naegleria gruberi]
          Length = 727

 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 40/126 (31%), Positives = 60/126 (47%), Gaps = 23/126 (18%)

Query: 27  LLEDGYTVTTVIDGHQ-------LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF----- 74
           +LE+G  VT   +G Q       L +N   V      S   V  SS++  Y   +     
Sbjct: 35  ILENGNIVTIAGNGQQGYSGDNGLAVNAQLV------SPSYVFVSSKNEVYIADYQNNRI 88

Query: 75  -PLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
             + E   +  +AG+G QGY+  +G   +A+   P S  V +K  +Y+AD  NH IRKI 
Sbjct: 89  RKILENGNIITIAGNGQQGYNGDNGLAVNAQLASPSSVCVSLKNEVYIADSGNHSIRKI- 147

Query: 132 NLDHGN 137
            L++GN
Sbjct: 148 -LENGN 152



 Score = 39.3 bits (90), Expect = 0.85,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 32/55 (58%), Gaps = 4/55 (7%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           +AG+G QGYS  +G   +A+   P    V  K  +Y+AD  N  IRKI  L++GN
Sbjct: 212 IAGNGQQGYSGDNGPAVNAKLSSPVDVFVSSKNEVYIADYQNTRIRKI--LENGN 264



 Score = 37.4 bits (85), Expect = 3.3,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 33/62 (53%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G QGY+  +G   +A    P S  V     +Y+AD  NH IRKI  L +
Sbjct: 149 ENGNIVTIAGNGQQGYNGDNGPAVNAPLFDPSSVFVTPNDEVYIADTGNHRIRKI--LRN 206

Query: 136 GN 137
           GN
Sbjct: 207 GN 208


>gi|375148602|ref|YP_005011043.1| NHL repeat containing protein [Niastella koreensis GR20-10]
 gi|361062648|gb|AEW01640.1| NHL repeat containing protein [Niastella koreensis GR20-10]
          Length = 716

 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 30/81 (37%), Positives = 43/81 (53%), Gaps = 8/81 (9%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFD--KPKSFAV 111
           S ++ + D S S    ++       V+   AG+G  GYS DG P   A+F    P+  AV
Sbjct: 229 SGNIYIADPSNSVVRKIN----PAGVITTFAGNGTAGYSGDGGPAIKAQFQMGSPQGLAV 284

Query: 112 DMKGNIYVADKSNHVIRKITN 132
           D  GN+Y +D  NH IRKI++
Sbjct: 285 DPAGNVYASDYQNHAIRKISS 305



 Score = 48.1 bits (113), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/57 (36%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++   ++   AG+G +G+S   G+   A+  +P + AVD  GN+Y++D SN VIRK+
Sbjct: 132 VTTSGIMSIFAGNGTEGFSGDGGQAAQAKLYRPTAVAVDKDGNLYISDASNKVIRKV 188



 Score = 46.2 bits (108), Expect = 0.007,   Method: Composition-based stats.
 Identities = 26/58 (44%), Positives = 37/58 (63%), Gaps = 3/58 (5%)

Query: 76  LSEESVVKRLAG-DGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++++ V+  +AG  G  GY+ DG P + A   +P   AVD  GNIY+AD SN V+RKI
Sbjct: 188 VNKQGVISTIAGVPGRAGYAGDGGPATKALLTQPAGIAVDYSGNIYIADPSNSVVRKI 245



 Score = 45.1 bits (105), Expect = 0.016,   Method: Composition-based stats.
 Identities = 24/73 (32%), Positives = 40/73 (54%), Gaps = 5/73 (6%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
           ++L +S S  Y+ +      + V  + G    G   G   +ARF++P++ A D   N+Y+
Sbjct: 11  LLLFASSSVAYSQTI-----TTVAGVVGSPGYGGDAGPANNARFNQPRAVATDNDNNLYI 65

Query: 120 ADKSNHVIRKITN 132
           AD  NHVIRK+ +
Sbjct: 66  ADMRNHVIRKVNS 78


>gi|290985802|ref|XP_002675614.1| predicted protein [Naegleria gruberi]
 gi|284089211|gb|EFC42870.1| predicted protein [Naegleria gruberi]
          Length = 848

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 55/102 (53%), Gaps = 8/102 (7%)

Query: 32  YTVTTVIDG-HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
           +T+ T+I   +Q++  P  +I  P S DL +  +  +  +  S   S E  V  +AG+G 
Sbjct: 521 FTIQTIIQAPNQVQFLPQQLIYGP-SCDLYIACTLENQIFKYS---SSEGFV-LIAGNGK 575

Query: 91  QGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            GY+   G    AR + PK+ AV+  G +Y+AD  N V+RKI
Sbjct: 576 VGYTGDYGPAIYARLNNPKAIAVNKFGEVYIADSFNKVVRKI 617


>gi|418677567|ref|ZP_13238841.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
           str. RM52]
 gi|418687786|ref|ZP_13248945.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
           str. Moskva]
 gi|418742122|ref|ZP_13298495.1| putative lipoprotein [Leptospira kirschneri serovar Valbuzzi str.
           200702274]
 gi|400320757|gb|EJO68617.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
           str. RM52]
 gi|410738110|gb|EKQ82849.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
           str. Moskva]
 gi|410750480|gb|EKR07460.1| putative lipoprotein [Leptospira kirschneri serovar Valbuzzi str.
           200702274]
          Length = 358

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++++ VV   AG GV    DG   +A F  P    VD  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITDKPVVSLFAGTGVAESIDGIISTASFKTPFGLEVDTSGNIFVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+  ++IRKI     GN      NL  +D S
Sbjct: 96  DQMTNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 30/47 (63%), Gaps = 1/47 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           + G  +G+  ++ FD P    +D++ N+YV + SNH IRKI NLD G
Sbjct: 164 LSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKI-NLDSG 209



 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G+ GY DG+  SARF  P   A D K + +  AD  +H IRKI
Sbjct: 217 GISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKI 259


>gi|336178839|ref|YP_004584214.1| serine/threonine protein kinase [Frankia symbiont of Datisca
           glomerata]
 gi|334859819|gb|AEH10293.1| serine/threonine protein kinase [Frankia symbiont of Datisca
           glomerata]
          Length = 776

 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/54 (40%), Positives = 35/54 (64%), Gaps = 2/54 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           V+  +AG G  GYS DG P +A +  +P   A D  GN+Y+AD +N+ IR++++
Sbjct: 608 VITTVAGSGTSGYSGDGGPATAAQLARPGGLAADTAGNLYIADNANNRIRRVSS 661



 Score = 43.9 bits (102), Expect = 0.036,   Method: Composition-based stats.
 Identities = 22/57 (38%), Positives = 34/57 (59%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S + V+  +AG G  GYS DG P + A+   P S AV   G +Y+AD  N+ +R++
Sbjct: 659 VSSDGVIITVAGSGTSGYSGDGGPATAAQLANPGSVAVTDDGRVYIADTDNNRVRRV 715



 Score = 43.1 bits (100), Expect = 0.055,   Method: Composition-based stats.
 Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  + ++  +AG G  G+S  +G    A F  P   A D  GN+Y+AD  N+ IR+I
Sbjct: 491 IGADGIITTVAGTGTYGFSGDNGPAAQAHFATPTDVARDRAGNLYIADTDNNRIRRI 547



 Score = 41.6 bits (96), Expect = 0.14,   Method: Composition-based stats.
 Identities = 22/45 (48%), Positives = 27/45 (60%), Gaps = 2/45 (4%)

Query: 92  GYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           GYS DG P +A +  KP S  VD  G +Y+AD  NH IR+I   D
Sbjct: 563 GYSGDGGPATAAQLAKPTSVLVDADGTLYIADTGNHRIRRIGTND 607


>gi|418693423|ref|ZP_13254475.1| putative lipoprotein [Leptospira kirschneri str. H1]
 gi|409958780|gb|EKO17669.1| putative lipoprotein [Leptospira kirschneri str. H1]
          Length = 358

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 43/91 (47%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+  ++IRKI     GN      NL  +D S
Sbjct: 96  DQITNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           + G  +G+   + FD P    +D + N+YV + SNH IRKI NLD G
Sbjct: 164 LSGLQNGDRLDSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLDSG 209



 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G+ GY DG+  SARF  P   A D K + +  AD  +H IRKI
Sbjct: 217 GISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKI 259


>gi|329927682|ref|ZP_08281814.1| copper amine oxidase N-terminal domain protein [Paenibacillus sp.
           HGF5]
 gi|328938318|gb|EGG34710.1| copper amine oxidase N-terminal domain protein [Paenibacillus sp.
           HGF5]
          Length = 1251

 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 30/87 (34%), Positives = 43/87 (49%), Gaps = 4/87 (4%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           +N  S I   G   + + DS   A   +     +   V  +AG+G  G  DG+   A F 
Sbjct: 111 LNRPSGIAVDGKGQVYIADSGNHAIRKID----QAGRVNTIAGNGRIGLKDGKAQDALFH 166

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +P+  AV   G +YVAD  NHVIR+I+
Sbjct: 167 EPQDVAVTEDGILYVADTLNHVIRRIS 193



 Score = 44.3 bits (103), Expect = 0.025,   Method: Composition-based stats.
 Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 4/46 (8%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LAGD    Y +G    A+F++P   A+D KGN+YV+D  N  IR I
Sbjct: 218 LAGD----YKNGSLAEAQFNEPAGLALDGKGNLYVSDSGNQAIRYI 259



 Score = 40.0 bits (92), Expect = 0.50,   Method: Composition-based stats.
 Identities = 16/30 (53%), Positives = 20/30 (66%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           A  ++P   AVD KG +Y+AD  NH IRKI
Sbjct: 109 AFLNRPSGIAVDGKGQVYIADSGNHAIRKI 138



 Score = 39.7 bits (91), Expect = 0.53,   Method: Composition-based stats.
 Identities = 25/90 (27%), Positives = 39/90 (43%), Gaps = 14/90 (15%)

Query: 55  GSSDLIVLDSSRSAFYTLSF------------PLSEESVVKRLAGDGVQGYSDGEPGSAR 102
           G  +L V DS   A   + F            P S+      L  D   GY DG   +AR
Sbjct: 242 GKGNLYVSDSGNQAIRYIDFGKGTVSTAAGSVPSSDGYAKDALYAD--PGYLDGAADAAR 299

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           F+ P+  A  ++  + +AD  NH +R++ +
Sbjct: 300 FNSPRGLAWSVEDGLLIADSHNHAVRQLKD 329


>gi|357400623|ref|YP_004912548.1| hypothetical protein SCAT_3039 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
 gi|386356676|ref|YP_006054922.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
 gi|337767032|emb|CCB75743.1| conserved protein of unknown function [Streptomyces cattleya NRRL
           8057 = DSM 46488]
 gi|365807185|gb|AEW95401.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
          Length = 613

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 29/76 (38%), Positives = 47/76 (61%), Gaps = 4/76 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           + +L+V DS+R A   L+ P   E+VV+R+ G G +G +DG    ARF +P+  A+   G
Sbjct: 192 TGNLLVSDSTRHALVELA-P-DGETVVRRI-GSGERGLADGPADRARFSEPQGLALLPDG 248

Query: 116 -NIYVADKSNHVIRKI 130
            ++ VAD  NH +R++
Sbjct: 249 ESVIVADTVNHALRRV 264


>gi|421106984|ref|ZP_15567544.1| putative lipoprotein [Leptospira kirschneri str. H2]
 gi|410007898|gb|EKO61577.1| putative lipoprotein [Leptospira kirschneri str. H2]
          Length = 358

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 43/91 (47%), Gaps = 2/91 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y  S  ++ + VV   AG GV    DG   +A F  P    VD  GNI+V+
Sbjct: 36  ILDCILKECYLCSLKITNKPVVSLFAGTGVAESIDGTISTASFKTPFGLEVDTSGNIFVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCS 151
           D+  ++IRKI     GN      NL  +D S
Sbjct: 96  DQITNLIRKID--PSGNVKTLSTNLPLQDPS 124



 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           + G  +G+   + FD P    +D + N+YV + SNH IRKI NLD G
Sbjct: 164 LSGLQNGDRLDSLFDSPFFMDIDPERNLYVGELSNHTIRKI-NLDSG 209



 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G+ GY DG+  SARF  P   A D K + +  AD  +H IRKI
Sbjct: 217 GISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKI 259


>gi|332027687|gb|EGI67755.1| NHL repeat-containing protein 2 [Acromyrmex echinatior]
          Length = 686

 Score = 48.1 bits (113), Expect = 0.002,   Method: Composition-based stats.
 Identities = 35/126 (27%), Positives = 59/126 (46%), Gaps = 5/126 (3%)

Query: 5   FVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDS 64
            +Y+ +AI     L  + + GL L+  + +        L     +V+     + L++ DS
Sbjct: 180 LLYTKIAIAYFKSLKQMSNHGLPLKPAHHLLPSSKDGLLFPGKLTVLQLEQETKLVISDS 239

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
             +     +    E   V+ + G   QG+ DG+  +ARF+ P+   V +   IYVAD +N
Sbjct: 240 GNNRIVITN----EHGRVEHVIGGCNQGFKDGDFKNARFNSPQGVCV-LNNIIYVADNNN 294

Query: 125 HVIRKI 130
           H IRKI
Sbjct: 295 HAIRKI 300


>gi|261404592|ref|YP_003240833.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
           Y412MC10]
 gi|261281055|gb|ACX63026.1| copper amine oxidase domain protein [Paenibacillus sp. Y412MC10]
          Length = 1280

 Score = 48.1 bits (113), Expect = 0.002,   Method: Composition-based stats.
 Identities = 30/87 (34%), Positives = 43/87 (49%), Gaps = 4/87 (4%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           +N  S I   G   + + DS   A   +     +   V  +AG+G  G  DG+   A F 
Sbjct: 140 LNRPSGIAVDGKGQVYIADSGNHAIRKID----QAGRVTTIAGNGRIGLKDGKAQDALFH 195

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +P+  AV   G +YVAD  NHVIR+I+
Sbjct: 196 EPQDVAVTEDGILYVADTLNHVIRRIS 222



 Score = 47.4 bits (111), Expect = 0.003,   Method: Composition-based stats.
 Identities = 22/46 (47%), Positives = 29/46 (63%), Gaps = 4/46 (8%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LAGD    Y +G    A+F++P   A+D KGN+YV+D  NH IR I
Sbjct: 247 LAGD----YKNGSLAEAQFNEPAGLALDGKGNLYVSDSGNHAIRYI 288



 Score = 40.0 bits (92), Expect = 0.49,   Method: Composition-based stats.
 Identities = 16/30 (53%), Positives = 20/30 (66%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           A  ++P   AVD KG +Y+AD  NH IRKI
Sbjct: 138 AFLNRPSGIAVDGKGQVYIADSGNHAIRKI 167



 Score = 39.7 bits (91), Expect = 0.64,   Method: Composition-based stats.
 Identities = 25/90 (27%), Positives = 39/90 (43%), Gaps = 14/90 (15%)

Query: 55  GSSDLIVLDSSRSAFYTLSF------------PLSEESVVKRLAGDGVQGYSDGEPGSAR 102
           G  +L V DS   A   + F            P S+      L  D   GY DG   +AR
Sbjct: 271 GKGNLYVSDSGNHAIRYIDFGKGTVSTAAGSVPSSDGYAKDALYAD--PGYLDGAADAAR 328

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           F+ P+  A  ++  + +AD  NH +R++ +
Sbjct: 329 FNSPRGLAWSVEDGLLIADSHNHAVRQLKD 358



 Score = 37.4 bits (85), Expect = 3.2,   Method: Composition-based stats.
 Identities = 23/71 (32%), Positives = 36/71 (50%), Gaps = 7/71 (9%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM-KGNI 117
           L++ DS   A   L     ++  V  +AG G +GY+DG     RF+ P   AV    G +
Sbjct: 343 LLIADSHNHAVRQL-----KDERVSTMAG-GTRGYADGIESEVRFNAPADIAVASDSGEL 396

Query: 118 YVADKSNHVIR 128
           ++AD+ N  +R
Sbjct: 397 FIADQRNGAVR 407


>gi|289667480|ref|ZP_06488555.1| putative NHL repeat protein [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 310

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           L  +  V  LAG G +G++DG   +A F+ P    +D  GN+Y+AD  NH IRK+T
Sbjct: 42  LGADGHVHTLAG-GREGFADGIGVAAAFNTPSGIMLDTAGNLYIADTGNHAIRKLT 96



 Score = 46.2 bits (108), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 29/52 (55%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           L+ +  V  LAGDGV G  +G     RF+ P   AVD +G +YVAD  N  I
Sbjct: 95  LTPQGKVTTLAGDGVAGDRNGAAAQVRFNGPVGVAVDAQGRVYVADTYNDRI 146


>gi|345000781|ref|YP_004803635.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Streptomyces sp. SirexAA-E]
 gi|344316407|gb|AEN11095.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Streptomyces sp. SirexAA-E]
          Length = 608

 Score = 48.1 bits (113), Expect = 0.002,   Method: Composition-based stats.
 Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
            +V D++R     L    ++   V+   G G +G +DG PG ARF +P+  A+   G I 
Sbjct: 193 FLVSDTTRHRLVELD---ADGETVRGHFGTGERGLTDGGPGEARFSEPQGLALLPDGRIA 249

Query: 119 VADKSNHVIRKI 130
           VAD  NH IR +
Sbjct: 250 VADTVNHAIRAL 261


>gi|427400231|ref|ZP_18891469.1| hypothetical protein HMPREF9710_01065 [Massilia timonae CCUG 45783]
 gi|425720971|gb|EKU83886.1| hypothetical protein HMPREF9710_01065 [Massilia timonae CCUG 45783]
          Length = 367

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/48 (52%), Positives = 31/48 (64%), Gaps = 1/48 (2%)

Query: 85  LAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LAGD  ++G  DG   +ARF  P+  A+D   N+YVAD  N VIRKIT
Sbjct: 55  LAGDPTIEGTMDGAGTAARFKWPQGIAIDADDNLYVADTGNFVIRKIT 102



 Score = 45.8 bits (107), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 26/57 (45%), Positives = 35/57 (61%), Gaps = 3/57 (5%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   VV  +AG  G  G+ DG+ G+ARF  P + AV+ +G IYVAD  N  IR IT
Sbjct: 101 ITPAGVVTTVAGAAGTSGFVDGDAGNARFGNPVAVAVNRRGTIYVAD--NLRIRSIT 155



 Score = 40.0 bits (92), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 16/41 (39%), Positives = 29/41 (70%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +G ++G   +ARF++  +  VD +GN+Y AD  N+++R+IT
Sbjct: 273 RGSNNGVGTAARFEQVVALTVDPRGNVYAADAINNLVRRIT 313


>gi|290982010|ref|XP_002673724.1| NHL repeat-containing protein [Naegleria gruberi]
 gi|284087309|gb|EFC40980.1| NHL repeat-containing protein [Naegleria gruberi]
          Length = 456

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 33/55 (60%), Gaps = 4/55 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNL 133
           +K + G+G+ GY  G+ G+A       P S AVD  GNIY+AD  NH IRK T L
Sbjct: 115 IKTIVGNGMPGYG-GDSGAATSTSLLFPTSIAVDSSGNIYIADTQNHRIRKFTAL 168



 Score = 40.4 bits (93), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 2/49 (4%)

Query: 85  LAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +AG   +GY+       +A+   P   A+D  GN+Y+AD  NH IRKIT
Sbjct: 175 IAGTEEEGYNGDNIIATTAKLSSPTGVALDSIGNVYIADSQNHRIRKIT 223


>gi|114568006|ref|YP_755160.1| leucine-rich repeat-containing protein [Syntrophomonas wolfei
           subsp. wolfei str. Goettingen]
 gi|114338941|gb|ABI69789.1| Leucine-rich repeat (LRR) protein-like protein [Syntrophomonas
           wolfei subsp. wolfei str. Goettingen]
          Length = 1351

 Score = 48.1 bits (113), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/54 (44%), Positives = 33/54 (61%), Gaps = 2/54 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++  +AG+G  GYS DG P  SA+   P   A D  GN+Y+ DK N+ IRK+ N
Sbjct: 506 IISTVAGNGTAGYSGDGGPATSAQLKDPVGVACDKNGNLYIVDKDNNRIRKVDN 559



 Score = 41.2 bits (95), Expect = 0.23,   Method: Composition-based stats.
 Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  +AG+G  GYS DG P  SA+ + P+    D   N+Y+ D  N  IRK+
Sbjct: 335 IISTVAGNGTAGYSGDGGPATSAQLNTPRGVVCDGADNLYIVDGDNQRIRKV 386



 Score = 40.4 bits (93), Expect = 0.32,   Method: Composition-based stats.
 Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 5/51 (9%)

Query: 85  LAGDGVQGYS--DGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AG   QG++  +G+ G   SAR + P   A D  GN+Y+AD+ NH IRK+
Sbjct: 451 VAGIVTQGWATYEGDGGLATSARLNYPFGVACDGSGNLYIADRGNHRIRKV 501



 Score = 39.3 bits (90), Expect = 0.78,   Method: Composition-based stats.
 Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG+G  GY+ DG P  SA+   P     D  GN+Y+AD +N  IRK+
Sbjct: 563 ISTVAGNGTGGYAGDGSPATSAQIWVPYGVTFDNNGNMYIADMNNKRIRKV 613


>gi|386849777|ref|YP_006267790.1| Teneurin-4 [Actinoplanes sp. SE50/110]
 gi|359837281|gb|AEV85722.1| Teneurin-4 [Actinoplanes sp. SE50/110]
          Length = 631

 Score = 48.1 bits (113), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 34/55 (61%), Gaps = 2/55 (3%)

Query: 80  SVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           + V  +AG G  G S DG   S A+ + P+   VD  G+IYVAD  NH++RKI+N
Sbjct: 172 AAVSVIAGTGTAGTSPDGTAASLAKLNAPRDVKVDASGDIYVADTGNHLVRKISN 226


>gi|351729451|ref|ZP_08947142.1| NHL repeat-containing protein [Acidovorax radicis N35]
          Length = 663

 Score = 48.1 bits (113), Expect = 0.002,   Method: Composition-based stats.
 Identities = 20/38 (52%), Positives = 26/38 (68%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           Y D    +ARF+ P++ AVD  GN YVAD  NH++RKI
Sbjct: 48  YLDAAGTAARFNAPQAVAVDTAGNTYVADTGNHLVRKI 85



 Score = 42.0 bits (97), Expect = 0.14,   Method: Composition-based stats.
 Identities = 17/39 (43%), Positives = 25/39 (64%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G++DG   +ARFD P + A+   G +YV D++N  IR I
Sbjct: 341 GWADGSGSTARFDGPGALAMGKSGTLYVRDRNNAAIRTI 379



 Score = 40.4 bits (93), Expect = 0.33,   Method: Composition-based stats.
 Identities = 29/89 (32%), Positives = 42/89 (47%), Gaps = 8/89 (8%)

Query: 67  SAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNH 125
           S+   + F ++    V   AG+   +G++DG  GSA F    S A+D  GN+YV D   H
Sbjct: 473 SSLQGVIFKINPTGAVSLFAGNPNARGHADGPAGSALFAGLGSMALDPAGNLYVVDGVYH 532

Query: 126 -------VIRKITNLDHGNQLIRQINLKP 147
                   IRKIT     + L  + N+ P
Sbjct: 533 FITGVGPTIRKITPAGMVSTLAGRANVPP 561


>gi|322435234|ref|YP_004217446.1| hypothetical protein AciX9_1613 [Granulicella tundricola MP5ACTX9]
 gi|321162961|gb|ADW68666.1| NHL repeat containing protein [Granulicella tundricola MP5ACTX9]
          Length = 781

 Score = 47.8 bits (112), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/57 (42%), Positives = 34/57 (59%), Gaps = 2/57 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           ++  LAG G QG+S D  P  +A+ D P   A+D   N+Y+AD  NH IR+I  + H
Sbjct: 168 LITTLAGTGTQGFSGDAGPALAAQIDTPTGLALDASNNLYLADTHNHRIRRIDAVTH 224



 Score = 44.7 bits (104), Expect = 0.017,   Method: Composition-based stats.
 Identities = 37/134 (27%), Positives = 60/134 (44%), Gaps = 12/134 (8%)

Query: 10  LAITLLSLLFSLVSSGLLL------EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLD 63
           LA T   L+  +  SG L         GY     +    L  +P ++   P + DL + D
Sbjct: 38  LAETAAHLILRVSPSGALTIVAGTGTQGYAGDGTLPTQALLDSPTALAITP-TGDLYLAD 96

Query: 64  SSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVAD 121
           +   A   +    +   ++  +AG G  G S DG   + A+ D P + A+D   N+Y+AD
Sbjct: 97  THNHAIRRID---AATQIITTVAGTGTPGRSPDGTLATKAQLDTPTAIALDSSQNLYIAD 153

Query: 122 KSNHVIRKITNLDH 135
             NH+IR++    H
Sbjct: 154 TRNHIIRRVDATTH 167


>gi|421614357|ref|ZP_16055418.1| NHL repeat containing protein [Rhodopirellula baltica SH28]
 gi|408494877|gb|EKJ99474.1| NHL repeat containing protein [Rhodopirellula baltica SH28]
          Length = 351

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 47/95 (49%), Gaps = 4/95 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           + T + G + ++     +D  G+  L+VL+ + +    L         +  LAG G +G 
Sbjct: 211 LVTTLAGRKSKLKDPRAVDLDGNGRLLVLERNGNRLRRLE----SNGDITTLAGSGKKGT 266

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +DG+   A F+ PK   V   G +Y+AD  NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301



 Score = 38.9 bits (89), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 16/49 (32%), Positives = 29/49 (59%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
           GY DG   SARF+K  +  +D +  +Y++D +NH +R++     G  ++
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHCVRRLIQTSDGEWMV 154


>gi|189462806|ref|ZP_03011591.1| hypothetical protein BACCOP_03504 [Bacteroides coprocola DSM 17136]
 gi|189430422|gb|EDU99406.1| NHL repeat protein [Bacteroides coprocola DSM 17136]
          Length = 500

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 36/69 (52%), Gaps = 9/69 (13%)

Query: 76  LSEESVVKRLAGDG-------VQGYSDGE-PGSARFDKPKSFAVDMKGNI-YVADKSNHV 126
           L+ E VV   AG G        +GY DG   G ARF+ P+  A D + N+ YV D  NH+
Sbjct: 412 LTPEGVVTTYAGRGSANLNNEARGYVDGTLRGEARFNNPRGLAYDEQNNVFYVGDMDNHI 471

Query: 127 IRKITNLDH 135
           IRKI   D 
Sbjct: 472 IRKIAMEDE 480


>gi|345008031|ref|YP_004810385.1| redoxin domain-containing protein [Streptomyces violaceusniger Tu
           4113]
 gi|344034380|gb|AEM80105.1| Redoxin domain protein [Streptomyces violaceusniger Tu 4113]
          Length = 615

 Score = 47.8 bits (112), Expect = 0.002,   Method: Composition-based stats.
 Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 4/83 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG +  +V D++R     L+     ESVV+R+ G G +G +DG    ARF +P
Sbjct: 188 PGKALLLPGGT-FLVSDTTRHQLVELA--ADGESVVRRI-GAGERGLTDGTGERARFSEP 243

Query: 107 KSFAVDMKGNIYVADKSNHVIRK 129
           +  A+   G + VAD  NH +R+
Sbjct: 244 QGLALLPDGTVAVADTVNHALRR 266


>gi|323343289|ref|ZP_08083516.1| hypothetical protein HMPREF0663_10051 [Prevotella oralis ATCC
           33269]
 gi|323095108|gb|EFZ37682.1| hypothetical protein HMPREF0663_10051 [Prevotella oralis ATCC
           33269]
          Length = 436

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 1/57 (1%)

Query: 79  ESVVKRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           E+  +RL+   + G + DG+   A+F  P     D  GNIYVAD++NH IR+I+  D
Sbjct: 326 ENTYRRLSSSNISGGHRDGKLEKAQFHNPTQIYCDADGNIYVADRNNHCIRRISPDD 382



 Score = 36.6 bits (83), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S + +V+ + G    +G+ DG    A F++P    +   G +YVAD  N  +RK+T
Sbjct: 378 ISPDDMVETVLGMPETKGWKDGAKSEALFNEPTGIGIGKDGAVYVADWGNGRVRKLT 434


>gi|222054715|ref|YP_002537077.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
 gi|221564004|gb|ACM19976.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
          Length = 2393

 Score = 47.8 bits (112), Expect = 0.003,   Method: Composition-based stats.
 Identities = 24/61 (39%), Positives = 41/61 (67%), Gaps = 4/61 (6%)

Query: 73   SFPLSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            S  L+E+ ++  +AG+GV G++ G+ G A     + P+  AVD+ GNIY+AD +N+ IR+
Sbjct: 1554 SITLAEQGIINTVAGNGVAGFT-GDGGIAVGSSLNYPEDAAVDLDGNIYIADSANNRIRR 1612

Query: 130  I 130
            +
Sbjct: 1613 V 1613



 Score = 38.5 bits (88), Expect = 1.2,   Method: Composition-based stats.
 Identities = 20/51 (39%), Positives = 33/51 (64%), Gaps = 2/51 (3%)

Query: 82   VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +  +AG+G+  +S DG P   A  ++P + A+D  GNIY++D  N+ +RKI
Sbjct: 1620 IATIAGNGLAEFSGDGGPAYMAGLNRPFAVALDTSGNIYISDNLNYRVRKI 1670


>gi|302543897|ref|ZP_07296239.1| putative NHL repeat protein [Streptomyces hygroscopicus ATCC 53653]
 gi|302461515|gb|EFL24608.1| putative NHL repeat protein [Streptomyces himastatinicus ATCC
           53653]
          Length = 608

 Score = 47.4 bits (111), Expect = 0.003,   Method: Composition-based stats.
 Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 4/83 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG +  +V D++R     L+     ESV++R+ G G +G +DG    ARF +P
Sbjct: 177 PGKALLLPGGT-FLVSDTTRHQLVELA--ADGESVLRRI-GTGERGLADGRADRARFSEP 232

Query: 107 KSFAVDMKGNIYVADKSNHVIRK 129
           +  A    G + VAD  NH +R+
Sbjct: 233 QGLAALPDGTVAVADTVNHALRR 255


>gi|418747486|ref|ZP_13303786.1| hypothetical protein LEP1GSC163_2583 [Leptospira santarosai str.
           CBC379]
 gi|410791609|gb|EKR89564.1| hypothetical protein LEP1GSC163_2583 [Leptospira santarosai str.
           CBC379]
          Length = 356

 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 56/129 (43%), Gaps = 10/129 (7%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I+ D      Y     ++   VV   AG G++   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTII-DCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTV 165
           P    VD  GNI+V+D+  ++IRKI     GN      NL  ED        SG+ +  +
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKIDR--SGNVTTLSTNLLLED-------PSGIKFDPI 129

Query: 166 FSPYLISCK 174
                +SCK
Sbjct: 130 TGDKYVSCK 138



 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207


>gi|260821256|ref|XP_002605949.1| hypothetical protein BRAFLDRAFT_132236 [Branchiostoma floridae]
 gi|229291286|gb|EEN61959.1| hypothetical protein BRAFLDRAFT_132236 [Branchiostoma floridae]
          Length = 446

 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 7/83 (8%)

Query: 60  IVLDSSRSAFYTLSFP------LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
           I  D++    Y + F       ++   +VK LAG    GY DG   +ARF  P    +D 
Sbjct: 308 IAFDTTHRVLYVVEFGNNCIRMVTPAGIVKTLAGGPEPGYKDGSGSAARFFHPTGLTLDP 367

Query: 114 KGN-IYVADKSNHVIRKITNLDH 135
               IYV D+ NH++R +T + H
Sbjct: 368 TNKVIYVTDQYNHLVRSVTAVGH 390


>gi|359777541|ref|ZP_09280822.1| hypothetical protein ARGLB_073_01730 [Arthrobacter globiformis NBRC
           12137]
 gi|359305319|dbj|GAB14651.1| hypothetical protein ARGLB_073_01730 [Arthrobacter globiformis NBRC
           12137]
          Length = 666

 Score = 47.4 bits (111), Expect = 0.003,   Method: Composition-based stats.
 Identities = 37/110 (33%), Positives = 52/110 (47%), Gaps = 4/110 (3%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDL-IVLDSSRSAFYTLSF-PLSEESVVKRLA 86
           ED    T  +  H L++   S  D   SS L  V+ +        S+ PL+ +  V  LA
Sbjct: 317 EDAAGFTGRLGDHPLDVALSSPWDVVWSSKLNAVVIAMAGVHQIFSYEPLTGD--VSILA 374

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           G+G++G  DG    A F +P   A D  GNI+VAD     +RK+   D G
Sbjct: 375 GNGLEGLLDGPAHEAWFAQPSGLAEDADGNIWVADSETSALRKLVISDDG 424


>gi|365121900|ref|ZP_09338810.1| hypothetical protein HMPREF1033_02156 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643613|gb|EHL82920.1| hypothetical protein HMPREF1033_02156 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 443

 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 63/134 (47%), Gaps = 22/134 (16%)

Query: 3   SHFVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGS------ 56
           SH +Y+   +T    LF+    G++   GY     +DG    I  +++ ++P        
Sbjct: 323 SHCIYTYDVLTKKHELFA----GIVNNAGY-----LDG----IGSYAMFNQPRQLILDED 369

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
            +L + D+       ++ P  + S V  +   GV GY DG+P  A FD+P    ++ +G 
Sbjct: 370 DNLYIADTENHVIRKIT-PQGQVSTV--IGQAGVAGYQDGDPEVALFDRPHGVCINKEGI 426

Query: 117 IYVADKSNHVIRKI 130
           IY+ D  N  IR++
Sbjct: 427 IYIGDYENQCIRRL 440



 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 26/40 (65%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GY DG    A F++P+   +D   N+Y+AD  NHVIRKIT
Sbjct: 347 GYLDGIGSYAMFNQPRQLILDEDDNLYIADTENHVIRKIT 386


>gi|284038675|ref|YP_003388605.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
 gi|283817968|gb|ADB39806.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
          Length = 359

 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 31/58 (53%)

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           F ++ +  +   AG G  GY DG    A+F  P   A D  GN+YVAD  N  IRKI+
Sbjct: 73  FKVTPQGNLSLYAGSGGTGYQDGSLDKAKFLWPYGLAFDRAGNLYVADSGNQAIRKIS 130



 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 9/60 (15%)

Query: 81  VVKRLAGDG---------VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +V+++A DG         + GY DG   +A F   +  A    G++YVAD  N  IRKIT
Sbjct: 298 IVRKIATDGQVSDVTGSRLGGYKDGPLQAAEFGSVEDLAFSPSGSLYVADNRNGAIRKIT 357


>gi|428186137|gb|EKX54988.1| hypothetical protein GUITHDRAFT_59145, partial [Guillardia theta
           CCMP2712]
          Length = 231

 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 29/49 (59%), Gaps = 1/49 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V   AG G+QG+ DG  G ARF +P    +  +  +YVAD +NH IR I
Sbjct: 20  VSYFAGSGIQGFRDGAAGDARFSEPLGI-IHFRNKLYVADSNNHAIRSI 67



 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 41/79 (51%), Gaps = 10/79 (12%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD-------KPKSFAV 111
           L V DS+  A  ++  P+S E  V  +AG+  +G  DG    ++ D       +P+    
Sbjct: 54  LYVADSNNHAIRSID-PISGE--VATVAGNQQRGNLDGAASQSQLDTPTSLLNRPEGVVF 110

Query: 112 DMKGNIYVADKSNHVIRKI 130
           D  G++YVAD  NHVI+ +
Sbjct: 111 DSHGHLYVADTGNHVIKML 129


>gi|289664052|ref|ZP_06485633.1| putative NHL repeat protein [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 263

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 31/47 (65%), Gaps = 1/47 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LAG G +G++DG   +A F+ P    +D  GN+Y+AD  NH IRK+T
Sbjct: 4   LAG-GREGFADGIGVAAAFNTPSGIMLDTAGNLYIADTGNHAIRKLT 49



 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 29/52 (55%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           L+ +  V  LAGDGV G  +G     RF+ P   AVD +G +YVAD  N  I
Sbjct: 48  LTPQGKVTTLAGDGVAGDRNGAAAQVRFNGPVGVAVDAQGRVYVADTYNDRI 99


>gi|290977087|ref|XP_002671270.1| predicted protein [Naegleria gruberi]
 gi|284084837|gb|EFC38526.1| predicted protein [Naegleria gruberi]
          Length = 1095

 Score = 47.0 bits (110), Expect = 0.003,   Method: Composition-based stats.
 Identities = 36/107 (33%), Positives = 51/107 (47%), Gaps = 10/107 (9%)

Query: 31  GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
           G+T  T ++   L  +P  +     +SDL + D        +S        +  +AG G 
Sbjct: 319 GFTDNTTVNLATLA-SPELMNFAKNTSDLYIADKGNHRIRKIS-----NGFITTIAGQGS 372

Query: 91  QGYSDGEPGSARFD---KPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
             Y  GE   +R     KPK  A+D  GNIY+AD SNH +RKI+ LD
Sbjct: 373 PSYC-GENVDSRLSALSKPKGAALDSLGNIYIADTSNHRVRKISYLD 418



 Score = 40.8 bits (94), Expect = 0.29,   Method: Composition-based stats.
 Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 2/54 (3%)

Query: 79  ESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +  +AG G  GY+ DG    SA+ +KP   A D  GNIY+AD  N+ IRKI
Sbjct: 418 DGTITTIAGTGSFGYNGDGILATSAQVNKPTGIAFDSIGNIYIADSGNNRIRKI 471


>gi|255530479|ref|YP_003090851.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
 gi|255343463|gb|ACU02789.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
          Length = 453

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 37/56 (66%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S +++V  +AG  G  GY DG P  + F++P   AV+ +G+IY+AD SN  IRK+
Sbjct: 395 VSADNIVTTVAGQPGKSGYKDGGPVESLFNQPWGVAVNEQGDIYIADWSNARIRKL 450



 Score = 42.7 bits (99), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 26/49 (53%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           KRL   G   + DG    A F+ PK    D  GN++VAD  NH IR ++
Sbjct: 348 KRLNAPGGSAFRDGPLADALFNDPKEIKFDNSGNMFVADYGNHCIRMVS 396


>gi|307182571|gb|EFN69764.1| NHL repeat-containing protein 2 [Camponotus floridanus]
          Length = 687

 Score = 47.0 bits (110), Expect = 0.004,   Method: Composition-based stats.
 Identities = 36/133 (27%), Positives = 58/133 (43%), Gaps = 6/133 (4%)

Query: 5   FVYSSLAITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDS 64
            +Y+ + I     L  + +  L L   Y + +      L     +++     + LI+ DS
Sbjct: 182 LLYARVMIAYFKSLNQISNHSLPLRPAYHLLSSFKNGLLFPGKLAILSSEHGTKLIISDS 241

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
             +     +    +   V+   G   QG+ DG   +ARF+ P+   V +   IYVAD +N
Sbjct: 242 GNNRIVIAT----KHGEVEHFIGGCNQGFKDGSFKNARFNSPQGVCV-LNNTIYVADNNN 296

Query: 125 HVIRKITNLDHGN 137
           H IRKI NL   N
Sbjct: 297 HAIRKI-NLTENN 308


>gi|302786076|ref|XP_002974809.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
 gi|300157704|gb|EFJ24329.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
          Length = 1049

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 32/48 (66%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            V+ +AG GV GY DG+   A+F +P   A+   G+++VAD +N+VIR
Sbjct: 835 TVRTVAGTGVAGYEDGKGVKAQFSEPAGLALGPNGSLFVADTNNNVIR 882



 Score = 41.2 bits (95), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
           G G +G +DG    A F++P+  A D + NI YVAD  NH +R++
Sbjct: 607 GSGAEGLTDGSFSLASFNRPQGVAYDTEKNILYVADTENHALREV 651


>gi|294817139|ref|ZP_06775781.1| Receptor protein kinase [Streptomyces clavuligerus ATCC 27064]
 gi|294321954|gb|EFG04089.1| Receptor protein kinase [Streptomyces clavuligerus ATCC 27064]
          Length = 510

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  +  +AG G  G    DG   SA+ ++P   AVD  G +Y+AD  NH IRKIT
Sbjct: 182 VAADGTISTVAGTGATGNRGDDGPAASAQLNRPYGVAVDGAGAVYIADTDNHRIRKIT 239



 Score = 42.7 bits (99), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  V  +AG G  GY  D  P  SA+ + P+  AVD  G +YV D  NH +RKIT
Sbjct: 70  ITTDGKVSTVAGIGSAGYRGDNGPAVSAQLNCPREVAVDGAGAVYVTDAENHRVRKIT 127



 Score = 41.2 bits (95), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 32/49 (65%), Gaps = 2/49 (4%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +AG GV G++  +G   +A+ ++P   AVD  G +Y +D +NH +RKIT
Sbjct: 23  VAGTGVAGFTGDNGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRKIT 71



 Score = 39.7 bits (91), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +  +  + G GV G+S DG P +A R ++P   AVD  G +YV + +N+ IRK+
Sbjct: 126 ITADGTISTVVGTGVAGFSGDGGPATAARLNRPFGVAVDSTGVLYVTEYNNNRIRKV 182



 Score = 39.7 bits (91), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +AR ++P    +D  G +YVAD  NH IRKIT
Sbjct: 264 AARLNRPMGVVIDSTGTLYVADYGNHRIRKIT 295


>gi|326445969|ref|ZP_08220703.1| hypothetical protein SclaA2_33112 [Streptomyces clavuligerus ATCC
           27064]
          Length = 508

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  +  +AG G  G    DG   SA+ ++P   AVD  G +Y+AD  NH IRKIT
Sbjct: 180 VAADGTISTVAGTGATGNRGDDGPAASAQLNRPYGVAVDGAGAVYIADTDNHRIRKIT 237



 Score = 42.7 bits (99), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  V  +AG G  GY  D  P  SA+ + P+  AVD  G +YV D  NH +RKIT
Sbjct: 68  ITTDGKVSTVAGIGSAGYRGDNGPAVSAQLNCPREVAVDGAGAVYVTDAENHRVRKIT 125



 Score = 40.8 bits (94), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 32/49 (65%), Gaps = 2/49 (4%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +AG GV G++  +G   +A+ ++P   AVD  G +Y +D +NH +RKIT
Sbjct: 21  VAGTGVAGFTGDNGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRKIT 69



 Score = 39.7 bits (91), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +  +  + G GV G+S DG P +A R ++P   AVD  G +YV + +N+ IRK+
Sbjct: 124 ITADGTISTVVGTGVAGFSGDGGPATAARLNRPFGVAVDSTGVLYVTEYNNNRIRKV 180



 Score = 39.7 bits (91), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +AR ++P    +D  G +YVAD  NH IRKIT
Sbjct: 262 AARLNRPMGVVIDSTGTLYVADYGNHRIRKIT 293


>gi|449137215|ref|ZP_21772546.1| NHL repeat containing protein [Rhodopirellula europaea 6C]
 gi|448884292|gb|EMB14794.1| NHL repeat containing protein [Rhodopirellula europaea 6C]
          Length = 351

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 47/95 (49%), Gaps = 4/95 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           + ++I G + ++     +D  G   L+VL+ + +    +         +  LAG G +G 
Sbjct: 211 MVSIIAGQKSKLKDPRAVDLDGERRLLVLERNGNRLRRVE----SNGDITTLAGSGKKGT 266

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +DG+   A F+ PK   V   G +Y+AD  NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301



 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 16/49 (32%), Positives = 29/49 (59%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
           GY DG   SARF+K  +  +D    +Y++D +NH +R++   + G  ++
Sbjct: 106 GYVDGPARSARFNKLHNLMIDSNDVLYLSDHANHSVRRLMESEDGKWVV 154


>gi|359685014|ref|ZP_09255015.1| hypothetical protein Lsan2_10264 [Leptospira santarosai str.
           2000030832]
          Length = 356

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 41/85 (48%), Gaps = 1/85 (1%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I LD      Y     ++   VV   AG G+Q   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTI-LDCILKECYLCKLKVTNNPVVSLFAGTGIQESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
           P    VD  GNI+V+D+  ++IRKI
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKI 103



 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207


>gi|254390149|ref|ZP_05005369.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
           27064]
 gi|197703856|gb|EDY49668.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
           27064]
          Length = 527

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  +  +AG G  G    DG   SA+ ++P   AVD  G +Y+AD  NH IRKIT
Sbjct: 199 VAADGTISTVAGTGATGNRGDDGPAASAQLNRPYGVAVDGAGAVYIADTDNHRIRKIT 256



 Score = 42.7 bits (99), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  V  +AG G  GY  D  P  SA+ + P+  AVD  G +YV D  NH +RKIT
Sbjct: 87  ITTDGKVSTVAGIGSAGYRGDNGPAVSAQLNCPREVAVDGAGAVYVTDAENHRVRKIT 144



 Score = 40.8 bits (94), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 32/49 (65%), Gaps = 2/49 (4%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +AG GV G++  +G   +A+ ++P   AVD  G +Y +D +NH +RKIT
Sbjct: 40  VAGTGVAGFTGDNGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRKIT 88



 Score = 39.7 bits (91), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +  +  + G GV G+S DG P +A R ++P   AVD  G +YV + +N+ IRK+
Sbjct: 143 ITADGTISTVVGTGVAGFSGDGGPATAARLNRPFGVAVDSTGVLYVTEYNNNRIRKV 199



 Score = 39.7 bits (91), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +AR ++P    +D  G +YVAD  NH IRKIT
Sbjct: 281 AARLNRPMGVVIDSTGTLYVADYGNHRIRKIT 312


>gi|256393018|ref|YP_003114582.1| NHL repeat containing protein [Catenulispora acidiphila DSM 44928]
 gi|256359244|gb|ACU72741.1| NHL repeat containing protein [Catenulispora acidiphila DSM 44928]
          Length = 1046

 Score = 47.0 bits (110), Expect = 0.004,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 38/57 (66%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           L+   ++ RL G G  GYS DG P   A+ ++P   A+D +G++Y+AD +N+VIR++
Sbjct: 849 LTPRGLLLRLIGTGRAGYSGDGRPSPLAQLNQPIGLALDAQGDLYIADSANNVIRRV 905


>gi|418721636|ref|ZP_13280812.1| hypothetical protein LEP1GSC101_0633 [Leptospira borgpetersenii
           str. UI 09149]
 gi|410741989|gb|EKQ90740.1| hypothetical protein LEP1GSC101_0633 [Leptospira borgpetersenii
           str. UI 09149]
          Length = 246

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 38/71 (53%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
           ++LD      +T    ++   VV   AG G++   DG   +A F  P S  +D  GNI+V
Sbjct: 35  MILDCILKECHTCKLKVTNNPVVSPFAGTGIRESIDGTTQTASFKTPFSLELDTFGNIFV 94

Query: 120 ADKSNHVIRKI 130
           +D  +++IRKI
Sbjct: 95  SDSDSNLIRKI 105


>gi|75909875|ref|YP_324171.1| NHL repeat-containing protein [Anabaena variabilis ATCC 29413]
 gi|75703600|gb|ABA23276.1| NHL repeat protein [Anabaena variabilis ATCC 29413]
          Length = 503

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 39/84 (46%), Gaps = 7/84 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+  PG   L V DS         F       +  L G+G  G +DG    A+F  P
Sbjct: 175 PGKVLATPGG--LFVADSGHHRIVVSDF----NGEILHLIGNGKSGLTDGNFQEAQFSAP 228

Query: 107 KSFAVDMKGNI-YVADKSNHVIRK 129
           +  A DM+  I YVAD  NHV+R+
Sbjct: 229 QGMAFDMENQILYVADTDNHVVRR 252



 Score = 37.0 bits (84), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 16/48 (33%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + + GDG  G  +G+  + RF +P   +  M   +Y++D +NHVIR++
Sbjct: 431 QTVLGDGSAGLQNGQGKNTRFFEPSGLSA-MDSYLYISDTNNHVIRRV 477


>gi|441163237|ref|ZP_20968264.1| NHL repeat-containing protein [Streptomyces rimosus subsp. rimosus
           ATCC 10970]
 gi|440616376|gb|ELQ79518.1| NHL repeat-containing protein [Streptomyces rimosus subsp. rimosus
           ATCC 10970]
          Length = 551

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ + ++  +AG+G  GY  DG P +A R   P+  A+D  GN+Y+AD  N+ +R +T
Sbjct: 78  VTTDGIITTVAGNGTAGYVDDGGPATATRLYGPRGVALDRAGNLYIADGDNNRVRGVT 135



 Score = 40.4 bits (93), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  +  +AG G  GY SDG P  S++ + P    +D  GN+Y+A+  +  IRK+T
Sbjct: 22  VTPQGTITTVAGTGTAGYTSDGGPATSSQLNTPAYVTLDDAGNLYIAESGSQRIRKVT 79


>gi|29347940|ref|NP_811443.1| hypothetical protein BT_2530 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383121619|ref|ZP_09942326.1| hypothetical protein BSIG_4948 [Bacteroides sp. 1_1_6]
 gi|29339842|gb|AAO77637.1| conserved hypothetical protein, with a conserved domain
           [Bacteroides thetaiotaomicron VPI-5482]
 gi|251837946|gb|EES66035.1| hypothetical protein BSIG_4948 [Bacteroides sp. 1_1_6]
          Length = 441

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S +++V  +AG  GV GY DG P  + F  P   AV+ +G+IY+AD  N  IRK+
Sbjct: 383 ISADNIVTTVAGQPGVAGYKDGGPVESLFKNPWGVAVNEQGDIYIADWGNARIRKL 438



 Score = 42.0 bits (97), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 26/49 (53%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           KRL   G   + DG    A F+ PK    D  GN++VAD  NH IR I+
Sbjct: 336 KRLNAPGGSAFRDGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMIS 384


>gi|442323754|ref|YP_007363775.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
 gi|441491396|gb|AGC48091.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
          Length = 906

 Score = 47.0 bits (110), Expect = 0.004,   Method: Composition-based stats.
 Identities = 24/50 (48%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 82  VKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  LAG   Q G+ DG    AR  +P   AVD  GNI+VAD  NH +R+I
Sbjct: 479 VSTLAGVAFQDGFVDGPVARARLRRPVGVAVDGLGNIFVADTGNHAVRRI 528



 Score = 38.1 bits (87), Expect = 1.6,   Method: Composition-based stats.
 Identities = 24/76 (31%), Positives = 37/76 (48%), Gaps = 2/76 (2%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           G  ++ V D+   A   ++ P +  +V   +AG G  G  +G   +     P+S AV   
Sbjct: 511 GLGNIFVADTGNHAVRRIA-PDAARTVTT-IAGLGTPGVGEGPGATTALRSPQSIAVAPD 568

Query: 115 GNIYVADKSNHVIRKI 130
           G +YVAD  NH I +I
Sbjct: 569 GTLYVADTGNHRIVRI 584


>gi|380693545|ref|ZP_09858404.1| hypothetical protein BfaeM_06134 [Bacteroides faecis MAJ27]
          Length = 454

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 28/40 (70%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GY +G   + R +KP+  A D  GN++V +++NH+IRKIT
Sbjct: 358 GYVNGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKIT 397



 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 28/40 (70%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G+ DG P  A+F++P+   V    +IYVAD+ NHVIR++T
Sbjct: 413 GFGDGLPEEAKFNQPECVTVYPDNSIYVADRDNHVIRRVT 452


>gi|298386893|ref|ZP_06996448.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
 gi|298260567|gb|EFI03436.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
          Length = 454

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 28/40 (70%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GY +G   + R +KP+  A D  GN++V +++NH+IRKIT
Sbjct: 358 GYVNGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKIT 397



 Score = 46.2 bits (108), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 28/40 (70%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G+ DG P  A+F++P+   V    +IYVAD+ NHVIR++T
Sbjct: 413 GFGDGLPEEAKFNQPECVTVYPDNSIYVADRDNHVIRRVT 452


>gi|359728310|ref|ZP_09267006.1| hypothetical protein Lwei2_15904 [Leptospira weilii str.
           2006001855]
 gi|417778551|ref|ZP_12426356.1| putative lipoprotein [Leptospira weilii str. 2006001853]
 gi|410781344|gb|EKR65918.1| putative lipoprotein [Leptospira weilii str. 2006001853]
          Length = 357

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/114 (25%), Positives = 53/114 (46%), Gaps = 9/114 (7%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      +     ++ + V+   AG G+    DG   +A F+ P    +D  GNI+V+
Sbjct: 36  ILDCILKECHLCKLKVTNKPVISLFAGTGINQSVDGTTQTASFNTPFGLELDTFGNIFVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFSPYLISCK 174
           D++ ++IRKI +  +   L   + L         Q+ SG+ +  +     +SCK
Sbjct: 96  DQAANLIRKIDHFGNVTTLFTSLTL---------QAPSGIKFDPITGDKYVSCK 140



 Score = 36.6 bits (83), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           V  L+G GV GY DG+  SA+F  P     D K + + VAD  +H IRKI
Sbjct: 210 VSTLSG-GVSGYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKI 258


>gi|312198348|ref|YP_004018409.1| serine/threonine protein kinase [Frankia sp. EuI1c]
 gi|311229684|gb|ADP82539.1| serine/threonine protein kinase [Frankia sp. EuI1c]
          Length = 782

 Score = 46.6 bits (109), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 8/79 (10%)

Query: 60  IVLDSSRSAFYTLSF------PLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAV 111
           I LD  R  FY   +       +    ++  +AG+G+ G++   G+  SA+ D P   AV
Sbjct: 400 ITLDLKRGYFYIADYYNERVRRVDHNGIITTVAGNGIAGFAGDGGQATSAQLDSPTDVAV 459

Query: 112 DMKGNIYVADKSNHVIRKI 130
              G+IY+AD +N+ +R +
Sbjct: 460 AADGSIYIADSANNRVRHV 478



 Score = 38.1 bits (87), Expect = 1.8,   Method: Composition-based stats.
 Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 85  LAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  DG+    DG P + A    P   AVD +GN+Y+AD  N  +R++
Sbjct: 502 IGDDGLTYAGDGVPATKAMLRSPNDLAVDGRGNLYIADGENERVRRV 548


>gi|418737091|ref|ZP_13293489.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
           str. 200801910]
 gi|410747250|gb|EKR00156.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
           str. 200801910]
          Length = 358

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 38/71 (53%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
           ++LD      +T    ++   VV   AG G++   DG   +A F  P S  +D  GNI+V
Sbjct: 35  MILDCILKECHTCKLKVTNNPVVSPFAGTGIRESIDGTTQTASFKTPFSLELDTFGNIFV 94

Query: 120 ADKSNHVIRKI 130
           +D  +++IRKI
Sbjct: 95  SDSDSNLIRKI 105



 Score = 38.1 bits (87), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
            V  L+G GV GY DG+  SA+F  P     D K + + VAD  NH IRKI
Sbjct: 210 TVSTLSG-GVSGYLDGDLTSAQFKFPSGIIYDQKTDSLLVADLQNHRIRKI 259



 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 31/51 (60%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           + S V  L G+G++   DG+  +A FD P   ++D  G ++V+D S++ IR
Sbjct: 262 KTSTVSTLLGNGIEASIDGKGLNASFDGPAFISLDNSGYMFVSDTSSNKIR 312


>gi|298387816|ref|ZP_06997366.1| hypothetical protein HMPREF9007_04616 [Bacteroides sp. 1_1_14]
 gi|298259421|gb|EFI02295.1| hypothetical protein HMPREF9007_04616 [Bacteroides sp. 1_1_14]
          Length = 440

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S +++V  +AG  GV GY DG P  + F  P   AV+ +G+IY+AD  N  IRK+
Sbjct: 382 VSADNIVTTVAGQPGVAGYKDGGPVESLFKNPWGVAVNEQGDIYIADWGNARIRKL 437



 Score = 41.6 bits (96), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 26/49 (53%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           KRL   G   + DG    A F+ PK    D  GN++VAD  NH IR ++
Sbjct: 335 KRLNAPGGSAFRDGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMVS 383


>gi|345851882|ref|ZP_08804843.1| hypothetical protein SZN_18986 [Streptomyces zinciresistens K42]
 gi|345636654|gb|EGX58200.1| hypothetical protein SZN_18986 [Streptomyces zinciresistens K42]
          Length = 468

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 36/58 (62%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  +  +AG G  G+  DG P  +A+ + P+  AVD  G +Y+AD +NH IRK+T
Sbjct: 28  ITTDGRITTVAGTGAAGFRGDGGPAVAAQLNGPREVAVDGAGAVYIADSNNHRIRKVT 85


>gi|421094122|ref|ZP_15554843.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
 gi|410363263|gb|EKP14295.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
 gi|456886734|gb|EMF97860.1| putative lipoprotein [Leptospira borgpetersenii str. 200701203]
          Length = 358

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 38/71 (53%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
           ++LD      +T    ++   VV   AG G++   DG   +A F  P S  +D  GNI+V
Sbjct: 35  MILDCILKECHTCKLKVTNNPVVSPFAGTGIRESIDGTTQTASFKTPFSLELDTFGNIFV 94

Query: 120 ADKSNHVIRKI 130
           +D  +++IRKI
Sbjct: 95  SDSDSNLIRKI 105



 Score = 39.3 bits (90), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
            V  L+G GV GY DG+  SA+F  P     D K + + VAD  NH IRKI
Sbjct: 210 TVSTLSG-GVSGYLDGDLTSAQFKSPSGIIYDQKTDSLLVADLQNHRIRKI 259



 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 31/51 (60%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           + S V  L G+G++   DG+  +A FD P   ++D  G ++V+D S++ IR
Sbjct: 262 KTSTVSTLLGNGIEASIDGKGLNASFDGPAFISLDNSGYMFVSDTSSNKIR 312


>gi|393782357|ref|ZP_10370541.1| hypothetical protein HMPREF1071_01409 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673185|gb|EIY66648.1| hypothetical protein HMPREF1071_01409 [Bacteroides salyersiae
           CL02T12C01]
          Length = 446

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 32/56 (57%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S++ +V  + G  G  G+ DG P  A FDKP   A+D  G IY+ D  N  +R++
Sbjct: 388 ISQDGIVSTVIGLAGQSGFMDGTPEEALFDKPFGVALDTDGTIYIGDSENQCVRRL 443



 Score = 42.4 bits (98), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 26/40 (65%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GY+DG    A F++P+   V     +Y+AD +NHVIRKI+
Sbjct: 350 GYADGPCEYAMFNEPRQMIVSADNELYLADTNNHVIRKIS 389


>gi|320106842|ref|YP_004182432.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
 gi|319925363|gb|ADV82438.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
          Length = 382

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKI 130
           L+  + V  LAG+G  GY+DG+  SARF+KP     +   G  Y+AD  N+ IR++
Sbjct: 261 LNGANQVTTLAGNGTAGYADGQGNSARFNKPTQVVYNSGDGAYYIADTFNNCIRRM 316



 Score = 40.4 bits (93), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 82  VKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V   AG G Q G  DG   SA+FDKP   A+   G +YVAD  N+ IR++
Sbjct: 322 VTTYAGIGGQSGLVDGASTSAKFDKPTGIAI-ANGYLYVADSGNNAIRRV 370


>gi|417302109|ref|ZP_12089222.1| NHL repeat containing protein [Rhodopirellula baltica WH47]
 gi|327541551|gb|EGF28082.1| NHL repeat containing protein [Rhodopirellula baltica WH47]
          Length = 351

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 47/95 (49%), Gaps = 4/95 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           + T + G + ++     +D  G+  L+VL+ + +    +         +  LAG G +G 
Sbjct: 211 LVTTLAGRKSKLKDPRAVDLDGNGRLLVLERNGNRLRRVE----SNGDITTLAGSGKKGT 266

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +DG+   A F+ PK   V   G +Y+AD  NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301



 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/49 (32%), Positives = 29/49 (59%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
           GY DG   SARF+K  +  +D +  +Y++D +NH +R++     G  ++
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTSDGEWMV 154


>gi|383124185|ref|ZP_09944852.1| hypothetical protein BSIG_3780 [Bacteroides sp. 1_1_6]
 gi|251839311|gb|EES67395.1| hypothetical protein BSIG_3780 [Bacteroides sp. 1_1_6]
          Length = 454

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 28/40 (70%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GY +G   + R +KP+  A D  GN++V +++NH+IRKIT
Sbjct: 358 GYINGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKIT 397



 Score = 45.8 bits (107), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 28/40 (70%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G+ DG P  A+F++P+   V    ++YVAD+ NHVIR++T
Sbjct: 413 GFGDGLPEEAKFNQPECVTVYPDNSVYVADRDNHVIRRVT 452


>gi|322434821|ref|YP_004217033.1| hypothetical protein AciX9_1190 [Granulicella tundricola MP5ACTX9]
 gi|321162548|gb|ADW68253.1| NHL repeat containing protein [Granulicella tundricola MP5ACTX9]
          Length = 1631

 Score = 46.6 bits (109), Expect = 0.005,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 33/55 (60%), Gaps = 3/55 (5%)

Query: 81  VVKRLAGDGVQ--GYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           V+   AG+GV   GY DG P +A   + P   AV  KG IY+AD+ N +IRK+ N
Sbjct: 313 VITTFAGNGVTTGGYGDGGPAAAAMLNAPWGIAVSSKGEIYIADQGNSLIRKVVN 367



 Score = 37.0 bits (84), Expect = 4.0,   Method: Composition-based stats.
 Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 3/55 (5%)

Query: 79  ESVVKRLAGD-GVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +  +AG  G  GY  DG P S A+ + P     D +GN+Y  D +N+V+R+I
Sbjct: 253 DGTISTIAGQMGKNGYVGDGGPASSAKLNGPNGLVFDAQGNLYFCDTNNNVVRRI 307



 Score = 36.2 bits (82), Expect = 6.7,   Method: Composition-based stats.
 Identities = 15/35 (42%), Positives = 22/35 (62%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           SA    P S A+D  GN++++D  N+VIRK+   D
Sbjct: 219 SATLSSPSSVALDGAGNVFISDTGNNVIRKVNVFD 253


>gi|430744164|ref|YP_007203293.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
 gi|430015884|gb|AGA27598.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
          Length = 358

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 5/68 (7%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           +  ++   AG G +GYS DG P +A  F+ PK FA+D  G++ + D  N  IR+I   D 
Sbjct: 245 QTGLITTRAGTGAKGYSGDGGPATAATFNGPKEFAIDRAGDLLIVDTENQAIRRI---DA 301

Query: 136 GNQLIRQI 143
              LIR +
Sbjct: 302 RTGLIRTL 309



 Score = 40.0 bits (92), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 29/110 (26%), Positives = 52/110 (47%), Gaps = 11/110 (10%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
            +  ++  +AG G +G+S G+ G    A+ D+P    +D +GN+Y AD+ N  +R++ + 
Sbjct: 72  GKSGIITTVAGSGTKGFS-GDGGVALKAKLDEPYGIVLDSRGNLYFADRLNRRVRRV-DA 129

Query: 134 DHGNQLIRQINLKPEDCSKS-SQSGSGMFWVTVFSPYLISCKSFWVHLYL 182
           D G      I     D SK+ S  G       +  P  ++  S    L++
Sbjct: 130 DSG-----MITTIAGDGSKTYSGDGGPGARAGLVEPNGVALDSQEARLFI 174



 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 7/83 (8%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG--SARFDKPK 107
            IDR G  DL+++D+   A   +    +   +++ LAG+G +G         SA  D+P 
Sbjct: 279 AIDRAG--DLLIVDTENQAIRRID---ARTGLIRTLAGNGQRGGEGDGGAATSALLDRPH 333

Query: 108 SFAVDMKGNIYVADKSNHVIRKI 130
             AV   G +Y+ D  NH IRK+
Sbjct: 334 GVAVGPDGAVYIGDTGNHRIRKV 356


>gi|374988208|ref|YP_004963703.1| hypothetical protein SBI_05452 [Streptomyces bingchenggensis BCW-1]
 gi|297158860|gb|ADI08572.1| hypothetical protein SBI_05452 [Streptomyces bingchenggensis BCW-1]
          Length = 609

 Score = 46.6 bits (109), Expect = 0.005,   Method: Composition-based stats.
 Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 4/84 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG +  +V D++R     L+     ESV++R+ G G +G  DG    ARF +P
Sbjct: 176 PGKALLLPGGT-FLVSDTTRHQLVELA--ADGESVLRRI-GSGERGLVDGPADRARFSEP 231

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
           +  A+   G + VAD  NH +R +
Sbjct: 232 QGLALLPDGTVIVADTVNHALRSL 255


>gi|290975887|ref|XP_002670673.1| predicted protein [Naegleria gruberi]
 gi|284084234|gb|EFC37929.1| predicted protein [Naegleria gruberi]
          Length = 1074

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 49/88 (55%), Gaps = 7/88 (7%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFD 104
           P+ +     S ++I++D+S +        +S    +  +AG G  GYS  +G+  +A+F+
Sbjct: 152 PYGLALNSSSGEVIIVDTSNNVIR----KVSSIGNITTIAGTGAAGYSGDNGQATNAKFN 207

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITN 132
            P++ A    G ++VAD  NH IRKI+N
Sbjct: 208 APRA-AFYSNGELFVADSRNHRIRKISN 234



 Score = 43.5 bits (101), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 29/52 (55%), Gaps = 2/52 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AG G  GYS DG    SA+ + PK   V   G IY +D  NH IRKI+
Sbjct: 492 ITTIAGTGTSGYSGDGSSATSAKLNNPKGVVVSSSGEIYFSDSENHRIRKIS 543



 Score = 40.0 bits (92), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           L+    +   AG+G+ G S DG  P SA+ + P   AV   G +Y++D SN+ IRK++
Sbjct: 934 LNVNGTISTSAGNGIAGLSGDGSLPTSAQLNFPTGLAVSSVGELYISDDSNNRIRKVS 991



 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 4/53 (7%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AG GV G+S G+ G   SA  + P   AV+  G + ++D +NH IRK++
Sbjct: 828 ISTVAGSGVAGFS-GDGGLATSATLNNPSYVAVNSNGELLISDTNNHRIRKVS 879



 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 23/86 (26%), Positives = 44/86 (51%), Gaps = 5/86 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFD 104
           P +V     ++D+ + DS       +S   S   ++  +AG G  G+S  +G   +A+ +
Sbjct: 634 PFAVAVNLTNNDIYIADSGNHRIRKVS---SSSGIITTVAGTGTSGFSGDNGLATNAKLN 690

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKI 130
            P S ++   G IY++D+ N  +RK+
Sbjct: 691 FPFSISIGNSGEIYISDQYNQRVRKV 716


>gi|429737840|ref|ZP_19271682.1| IPT/TIG domain protein [Prevotella saccharolytica F0055]
 gi|429161715|gb|EKY04093.1| IPT/TIG domain protein [Prevotella saccharolytica F0055]
          Length = 438

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 83  KRLAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           KRL+   + G + DG    A+F  P     D  GNIYVAD+ NH IR+IT
Sbjct: 332 KRLSSSNISGGHRDGPLSEAQFRNPSQIFCDNDGNIYVADRGNHCIRRIT 381



 Score = 41.2 bits (95), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 35/57 (61%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ E++V+ + G  G +G+ DG+   A F++P    +   G++YVAD  N  +RK+T
Sbjct: 380 ITPENMVETVLGMPGTKGWKDGKKEDALFNEPTGIGIAQDGSVYVADFKNGRVRKLT 436


>gi|375146761|ref|YP_005009202.1| NHL repeat containing protein [Niastella koreensis GR20-10]
 gi|361060807|gb|AEV99798.1| NHL repeat containing protein [Niastella koreensis GR20-10]
          Length = 439

 Score = 46.2 bits (108), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 28/49 (57%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V   AG G  GY+D    +A F  P   AVD  G +YV DK+N+ IRKI
Sbjct: 388 VSTFAGSGSAGYADAVGEAAAFSAPTGIAVDKNGTLYVLDKNNNRIRKI 436



 Score = 42.4 bits (98), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 52/124 (41%), Gaps = 14/124 (11%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEI---NPHSVI----DRPGSSDLIVLDSSRSAFYTLS 73
           L S    + D   +T  ++G  L++   N   ++     + GS   +V   + SA  T  
Sbjct: 55  LFSGSNYINDTNQITVTLNGKPLKVVGVNGSQIMAVVPKKAGSGHFVVKIGADSAVSTGI 114

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPGSARFD-------KPKSFAVDMKGNIYVADKSNHV 126
           F       V   AG G  G+++G    A FD       +     VD   N+YVAD  NH 
Sbjct: 115 FNYVYTRTVSTFAGSGKAGFANGMGTDAMFDFGGQSWYRSMGIIVDDNLNLYVADPGNHC 174

Query: 127 IRKI 130
           IRKI
Sbjct: 175 IRKI 178



 Score = 41.6 bits (96), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 4/73 (5%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPE 148
           GV GY +G    A+FD P   A+D  GN+YVA    +      N  + +Q IR I     
Sbjct: 331 GVPGYVNGAAAVAKFDHPWGLAIDTAGNLYVAGNGTY----DGNTGNSDQAIRYIEAASF 386

Query: 149 DCSKSSQSGSGMF 161
           + S  + SGS  +
Sbjct: 387 NVSTFAGSGSAGY 399



 Score = 40.0 bits (92), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 25/39 (64%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           GY+DG+  +A+F  P   A D +GNI+  D +N  IRKI
Sbjct: 195 GYADGKGTAAKFSLPYDVAFDAQGNIWCVDPANWDIRKI 233


>gi|227537797|ref|ZP_03967846.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227242411|gb|EEI92426.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 441

 Score = 46.2 bits (108), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 37/56 (66%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LS + +V  +AG   + GY DG P  ++F++P   AV+ +G+IY+AD +N  IRK+
Sbjct: 383 LSADGIVSTVAGQPTKAGYKDGGPVESQFNQPWGVAVNDQGDIYIADWNNARIRKL 438



 Score = 42.4 bits (98), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 27/49 (55%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           KR+   G  G+ DG    A F+ PK    D  GN+++AD  NH IR ++
Sbjct: 336 KRINAPGGSGFRDGPVADAVFNFPKDIKFDRAGNMFIADYGNHCIRMLS 384


>gi|116619658|ref|YP_821814.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116222820|gb|ABJ81529.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 585

 Score = 46.2 bits (108), Expect = 0.006,   Method: Composition-based stats.
 Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 7/88 (7%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG--SARF 103
           +P  +   P + +L + DS+      +S      S +  +AG+G  G++  +    SA  
Sbjct: 52  SPTGLALDPKTGNLYIADSANHRIRMIS-----GSTISTVAGNGTAGFAGDKAAATSANL 106

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + P   A+D  GN Y+AD  N VIRK+T
Sbjct: 107 NTPSGVALDSSGNFYIADSLNSVIRKVT 134



 Score = 37.4 bits (85), Expect = 3.0,   Method: Composition-based stats.
 Identities = 19/48 (39%), Positives = 30/48 (62%), Gaps = 2/48 (4%)

Query: 85  LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            AG+   G++ DG   + ++ +KP   AVD  GN+Y+AD +N  IRK+
Sbjct: 250 FAGNLTAGFAGDGNTATLSQLNKPVGIAVDSAGNVYIADSNNGRIRKV 297


>gi|300772471|ref|ZP_07082341.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300760774|gb|EFK57600.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 441

 Score = 46.2 bits (108), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 37/56 (66%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LS + +V  +AG   + GY DG P  ++F++P   AV+ +G+IY+AD +N  IRK+
Sbjct: 383 LSADGIVSTVAGQPTKAGYKDGGPVESQFNQPWGVAVNDQGDIYIADWNNARIRKL 438



 Score = 42.4 bits (98), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 27/49 (55%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           KR+   G  G+ DG    A F+ PK    D  GN+++AD  NH IR ++
Sbjct: 336 KRINAPGGSGFRDGPVADAVFNFPKDIKFDRAGNMFIADYGNHCIRMLS 384


>gi|456865078|gb|EMF83443.1| putative lipoprotein [Leptospira weilii serovar Topaz str. LT2116]
          Length = 357

 Score = 46.2 bits (108), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/114 (25%), Positives = 52/114 (45%), Gaps = 9/114 (7%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      +     ++   V+   AG G+    DG   +A F+ P    +D  GNI+V+
Sbjct: 36  ILDCILKECHLCKLKVTNNPVISLFAGTGINQSVDGTTQTASFNTPFGLELDTFGNIFVS 95

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFSPYLISCK 174
           D++ ++IRKI +  +   L   + L         Q+ SG+ +  +     +SCK
Sbjct: 96  DQAANLIRKIDHFGNVTTLFTSLTL---------QAPSGIKFDPITGDKYVSCK 140



 Score = 36.6 bits (83), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           V  L+G GV GY DG+  SA+F  P     D K + + VAD  +H IRKI
Sbjct: 210 VSTLSG-GVSGYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKI 258


>gi|322788140|gb|EFZ13922.1| hypothetical protein SINV_02735 [Solenopsis invicta]
          Length = 717

 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 5/83 (6%)

Query: 49  SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKS 108
           +V+     + L++ DS  +     +    E   V+ + G   QG+ DG+  +ARF+ P+ 
Sbjct: 255 TVLQSEQGTKLVISDSGNNRIVITN----EHGRVEHVIGGCSQGFKDGDFKNARFNSPQG 310

Query: 109 FAVDMKGNIYVADKSNHVIRKIT 131
               +   IY+AD +NH IRKI 
Sbjct: 311 VCA-LNNTIYIADNNNHAIRKIN 332


>gi|290999745|ref|XP_002682440.1| predicted protein [Naegleria gruberi]
 gi|284096067|gb|EFC49696.1| predicted protein [Naegleria gruberi]
          Length = 731

 Score = 46.2 bits (108), Expect = 0.007,   Method: Composition-based stats.
 Identities = 23/59 (38%), Positives = 35/59 (59%), Gaps = 2/59 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +S   ++  +AG GV GYS DG P  + + + P    +D   NI +AD++NH IR I+N
Sbjct: 228 VSNSGIISTVAGTGVSGYSGDGIPANTTKLNTPNGITIDSNDNIIIADRNNHRIRLISN 286



 Score = 44.3 bits (103), Expect = 0.022,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 32/57 (56%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDG--EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S   ++  +AG G  GY+        A+   P   +VD KGN+Y+ADK NH IRKI
Sbjct: 508 VSNNGIISTIAGTGSAGYNGDVIMATEAKLYLPHGVSVDNKGNVYIADKQNHRIRKI 564



 Score = 40.8 bits (94), Expect = 0.28,   Method: Composition-based stats.
 Identities = 22/55 (40%), Positives = 34/55 (61%), Gaps = 2/55 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +S++  VK +AG+G+ GY+ DG     A+ + P   A D  GNIY++D  NH +R
Sbjct: 396 ISKDGSVKTIAGNGIGGYNGDGMLAIDAQLNYPHGVASDSIGNIYISDSYNHRVR 450



 Score = 38.5 bits (88), Expect = 1.4,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 33/60 (55%), Gaps = 3/60 (5%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGN-IYVADKSNHVIRKITN 132
           L+   ++  +AG G  G++D    +  +R + P    VD  G  IY+AD +NH IR+I N
Sbjct: 565 LASTGMISTIAGTGQAGFNDDNMSALESRVNSPYDVTVDESGQVIYIADTNNHKIRRIQN 624


>gi|380692527|ref|ZP_09857386.1| hypothetical protein BfaeM_00899 [Bacteroides faecis MAJ27]
          Length = 441

 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S +++V  +AG  GV GY DG P  + F  P   AV+ +G+IY+AD  N  IRK+
Sbjct: 383 VSADNIVTTVAGQPGVAGYKDGGPLESLFKNPWGVAVNEQGDIYIADWGNARIRKL 438



 Score = 41.2 bits (95), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 26/49 (53%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           KRL   G   + DG    A F+ PK    D  GN++VAD  NH IR ++
Sbjct: 336 KRLNAPGGSAFRDGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMVS 384


>gi|374311754|ref|YP_005058184.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
 gi|358753764|gb|AEU37154.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
          Length = 1456

 Score = 46.2 bits (108), Expect = 0.007,   Method: Composition-based stats.
 Identities = 26/93 (27%), Positives = 51/93 (54%), Gaps = 7/93 (7%)

Query: 40  GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GE 97
           G + EI    ++DR G  D+ ++D   +    ++   +   ++  +AG+G +G+S   G 
Sbjct: 82  GAKFEIPFGILVDRAG--DIFIVDPGNNNVRKIA---ASTGIITTVAGNGTRGFSGDGGA 136

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
             SA  ++P++ A+D  GN+Y+ D  N  +RK+
Sbjct: 137 ATSAELNQPQAVALDAAGNMYITDTINSRVRKV 169



 Score = 45.8 bits (107), Expect = 0.008,   Method: Composition-based stats.
 Identities = 30/82 (36%), Positives = 49/82 (59%), Gaps = 9/82 (10%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVD 112
           GS ++ ++D+ R A + L+      SVV   AG+G +G+S   G   SA    P++ A+D
Sbjct: 258 GSGNVFIVDTDRYAVFKLTTGTGTVSVV---AGNGTRGFSGDGGAATSAELAYPQAVALD 314

Query: 113 MKGNIYVAD---KSNHVIRKIT 131
             GN+Y++D   +SN+ IRK+T
Sbjct: 315 TAGNLYISDGDYRSNN-IRKVT 335



 Score = 42.7 bits (99), Expect = 0.075,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 37/56 (66%), Gaps = 4/56 (7%)

Query: 81  VVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++  +AGDG   +S G+ G+A   R ++P   A+D  GN+Y+AD +N V+R+++ L
Sbjct: 340 IISTVAGDGTAEFS-GDGGAAVDARLNEPGGVALDTAGNLYIADVNNSVVREVSPL 394



 Score = 42.0 bits (97), Expect = 0.13,   Method: Composition-based stats.
 Identities = 21/52 (40%), Positives = 34/52 (65%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  +AG+G  GY+ DG P  SA   +P+  A+D  G++Y AD  +HV+R+I
Sbjct: 503 IITTIAGNGTSGYTGDGGPATSAGMYEPQGVALDNAGDVYFADFYHHVVRRI 554



 Score = 40.8 bits (94), Expect = 0.26,   Method: Composition-based stats.
 Identities = 17/52 (32%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  +AG+G +G+S   G   SA  + P+  ++D   N+Y+AD  N+ +RK+
Sbjct: 560 IITTVAGNGTEGFSGDGGSATSAELNYPEGISLDSAANLYIADSGNYRVRKV 611


>gi|290995346|ref|XP_002680256.1| predicted protein [Naegleria gruberi]
 gi|284093876|gb|EFC47512.1| predicted protein [Naegleria gruberi]
          Length = 1226

 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 37/72 (51%), Gaps = 4/72 (5%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           DL   DS+  A       +  + V+  L G+G  G+ DG+  SA F+ P S A+   G+ 
Sbjct: 398 DLYFCDSNNHAIR----KIDTQGVITTLTGNGTAGFQDGDAASALFNSPASIALLPNGDF 453

Query: 118 YVADKSNHVIRK 129
            VAD +N  IRK
Sbjct: 454 LVADFTNSRIRK 465


>gi|168701257|ref|ZP_02733534.1| NHL repeat containing protein [Gemmata obscuriglobus UQM 2246]
          Length = 358

 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++R AG G +GY+ DG     A FD PK   +D  GN++V D  N VIRKI
Sbjct: 248 IERFAGTGKKGYTGDGTKALDATFDGPKEIDIDKDGNVFVVDTENEVIRKI 298



 Score = 37.4 bits (85), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+    G G +G++ DG P + A  D+P   A D  GN+Y +D  NH++RK+
Sbjct: 18  VITSPVGSGKKGFAGDGGPAAKAELDQPFDVAFDKAGNLYFSDTFNHLVRKV 69


>gi|302760605|ref|XP_002963725.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
 gi|300168993|gb|EFJ35596.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
          Length = 1052

 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 32/48 (66%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            V+ +AG GV GY DG+   A+F +P   A+   G+++VAD +N++IR
Sbjct: 838 TVRTVAGTGVAGYEDGKSVKAQFSEPAGLALGPNGSLFVADTNNNLIR 885



 Score = 40.0 bits (92), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
           G G +G +DG    A F++P+  A D + NI YVAD  NH +R++
Sbjct: 610 GSGSEGLTDGSFSLASFNRPQGVAYDTEKNILYVADTENHALREV 654


>gi|428297525|ref|YP_007135831.1| NHL repeat containing protein [Calothrix sp. PCC 6303]
 gi|428234069|gb|AFY99858.1| NHL repeat containing protein [Calothrix sp. PCC 6303]
          Length = 511

 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 48/91 (52%), Gaps = 15/91 (16%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN--IYVADKSNHVIRKITNLDHGNQL 139
           V ++ G G  G+ DG+  +A F  P+    D +GN  +Y+AD  NH +R++   D GNQL
Sbjct: 206 VLQIIGVGKAGFVDGDFEAATFSTPQGMTFD-EGNQILYIADTGNHALRQV---DLGNQL 261

Query: 140 IRQI--------NLKPEDCSKSSQSGSGMFW 162
           +R I        N++P  C K+ ++     W
Sbjct: 262 VRTIAGTGIQSRNIRPH-CGKALETPLNSPW 291


>gi|159471860|ref|XP_001694074.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158277241|gb|EDP03010.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1591

 Score = 46.2 bits (108), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 33/59 (55%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
            ++ +  V  +AG  + G+ DG   +ARF   +  AVD +GN Y +D SNH +R +   D
Sbjct: 1233 VASDGFVTTIAGSSMPGHLDGPAATARFYNLRGVAVDGEGNCYCSDSSNHCVRLLHAAD 1291


>gi|116625316|ref|YP_827472.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116228478|gb|ABJ87187.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 981

 Score = 46.2 bits (108), Expect = 0.007,   Method: Composition-based stats.
 Identities = 24/57 (42%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ + ++  +AG+G +G+S DG P +A  F  P   AVD  GN+YV D SN  +RKI
Sbjct: 226 VTPDGIITTIAGNGTRGFSGDGGPATAATFRGPIGLAVDAFGNVYVTDNSNGRVRKI 282



 Score = 44.3 bits (103), Expect = 0.023,   Method: Composition-based stats.
 Identities = 20/58 (34%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S   ++   AG+G  G++   G   SA+ + P   A+D  GN+Y+ D SN+ +RK+T
Sbjct: 116 ISAAGIISTYAGNGTAGFAGDSGAATSAQLNGPTDVAIDGNGNLYICDSSNNRVRKVT 173


>gi|158315874|ref|YP_001508382.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158111279|gb|ABW13476.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 772

 Score = 45.8 bits (107), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/51 (45%), Positives = 32/51 (62%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG G  G+S DG P + A+ D+P S AV   G +YVAD  NH +R+I
Sbjct: 442 ITTVAGTGADGFSGDGGPATEAQLDEPTSVAVAPDGTLYVADTRNHRVRRI 492



 Score = 36.6 bits (83), Expect = 4.6,   Method: Composition-based stats.
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           SARF  P + A    G++YVAD+ NH +R+I
Sbjct: 585 SARFSYPSALARGPDGSLYVADQDNHRVRRI 615


>gi|413923862|gb|AFW63794.1| hypothetical protein ZEAMMB73_706161, partial [Zea mays]
          Length = 139

 Score = 45.8 bits (107), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 1/50 (2%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSE 78
           E GYTV TV DG +L I P+SV +   S +L+V+DS  S  Y ++ PLS 
Sbjct: 75  EGGYTVETVFDGSKLGIEPYSV-EVTQSGELLVMDSMNSNIYRMALPLSR 123


>gi|348690525|gb|EGZ30339.1| hypothetical protein PHYSODRAFT_295167 [Phytophthora sojae]
          Length = 383

 Score = 45.8 bits (107), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 60  IVLDSSRSAFYTLS---FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           IV DSS + + T +      +    +  LAG    G  DG   SARF  P + A+   G+
Sbjct: 261 IVTDSSDNIYVTDNNRVMKFTSGGAMTVLAGSTSSGLVDGTGTSARFYHPDALAIGSDGD 320

Query: 117 IYVADKSNHVIRKI 130
           +YVADK+N+ IR++
Sbjct: 321 LYVADKANYCIRRL 334



 Score = 42.0 bits (97), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 26/41 (63%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V GY+DG   +  F KP+S AV    ++YVAD  N+ IRK+
Sbjct: 11  VSGYADGYGDAVFFKKPRSLAVGSDADLYVADVGNYYIRKV 51


>gi|395768739|ref|ZP_10449254.1| hypothetical protein Saci8_03116 [Streptomyces acidiscabies 84-104]
          Length = 596

 Score = 45.8 bits (107), Expect = 0.008,   Method: Composition-based stats.
 Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 4/75 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S + +V D++R     L+     ESVV+R+ G G +G++DG    A F +P+  A+   G
Sbjct: 181 SGNFLVSDTTRHQLVELA--QDGESVVRRI-GTGTRGFADGT-AEAAFSEPQGLALLDDG 236

Query: 116 NIYVADKSNHVIRKI 130
            + VAD  NH +R++
Sbjct: 237 AVVVADTVNHALRRV 251


>gi|289774196|ref|ZP_06533574.1| NHL repeat containing protein [Streptomyces lividans TK24]
 gi|289704395|gb|EFD71824.1| NHL repeat containing protein [Streptomyces lividans TK24]
          Length = 605

 Score = 45.8 bits (107), Expect = 0.008,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 4/83 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  P S   +V D++R     L+     ES V+R+ G G +G++DG    A F +P
Sbjct: 181 PGKAVSLP-SGTFLVSDTTRHQLVELA--EDGESPVRRI-GSGRRGFTDGPADRAEFSEP 236

Query: 107 KSFAVDMKGNIYVADKSNHVIRK 129
           +  A+   G++ VAD  NH +R+
Sbjct: 237 QGLALLGDGSVVVADTVNHALRR 259


>gi|21218701|ref|NP_624480.1| hypothetical protein SCO0142 [Streptomyces coelicolor A3(2)]
 gi|6066645|emb|CAB58305.1| conserved hypothetical protein SCJ33.06c [Streptomyces coelicolor
           A3(2)]
          Length = 601

 Score = 45.8 bits (107), Expect = 0.008,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 4/83 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  P S   +V D++R     L+     ES V+R+ G G +G++DG    A F +P
Sbjct: 177 PGKAVSLP-SGTFLVSDTTRHQLVELA--EDGESPVRRI-GSGRRGFTDGPADRAEFSEP 232

Query: 107 KSFAVDMKGNIYVADKSNHVIRK 129
           +  A+   G++ VAD  NH +R+
Sbjct: 233 QGLALLGDGSVVVADTVNHALRR 255


>gi|262195555|ref|YP_003266764.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
 gi|262078902|gb|ACY14871.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
          Length = 404

 Score = 45.8 bits (107), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/40 (55%), Positives = 27/40 (67%), Gaps = 1/40 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G++DG    ARFD+P   AV +   IYVAD +NH IRKIT
Sbjct: 241 GFADGNGAEARFDRPLGVAV-VDDEIYVADSANHRIRKIT 279



 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 24/47 (51%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LAG    G +DG    A    P   AV   GN+Y+A+  N  IRKIT
Sbjct: 86  LAGGEEAGTTDGTRAVATLRNPAGLAVGPNGNVYIAEFDNDRIRKIT 132



 Score = 35.8 bits (81), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 26/50 (52%), Gaps = 1/50 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV-IRKI 130
           V  LAG GV   SDG    A FD P +   D  GN+YV +  +   +RKI
Sbjct: 284 VSTLAGTGVAASSDGSLAEATFDTPYAMTRDSDGNLYVTELGDSFRVRKI 333


>gi|288921012|ref|ZP_06415304.1| serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288347587|gb|EFC81872.1| serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 882

 Score = 45.8 bits (107), Expect = 0.009,   Method: Composition-based stats.
 Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  +  +AG GV GYS DG P +A +    +  AV   G+IY+AD  NH IRKI+
Sbjct: 648 IAADGTISTIAGTGVAGYSGDGGPATAAQLKSAEKLAVAPDGSIYIADYENHRIRKIS 705



 Score = 40.0 bits (92), Expect = 0.49,   Method: Composition-based stats.
 Identities = 34/112 (30%), Positives = 54/112 (48%), Gaps = 14/112 (12%)

Query: 30  DGYTVT---TVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSF------PLSEES 80
           DG   T   T +DG+  E  P +     G +D+ V  +S    Y  +        ++ + 
Sbjct: 707 DGIITTIAGTGVDGYSGEGGPATAATLNGPNDVEV--ASDGTLYIANLGSDTIQKINTDG 764

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +  +AG G QGYS DG P + A+   P S ++   G +Y+AD  N+ IRK+
Sbjct: 765 TIHTVAGTGEQGYSGDGGPATKAQLSIP-SVSLGRNGTLYIADYGNNRIRKV 815



 Score = 36.6 bits (83), Expect = 4.7,   Method: Composition-based stats.
 Identities = 16/31 (51%), Positives = 20/31 (64%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +A+ D P S   D KGNIY+ D  N+ IRKI
Sbjct: 618 AAKLDGPGSAVTDGKGNIYIPDARNYRIRKI 648



 Score = 35.8 bits (81), Expect = 8.4,   Method: Composition-based stats.
 Identities = 19/57 (33%), Positives = 32/57 (56%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S + ++  +AG GV GYS +G P + A  + P    V   G +Y+A+  +  I+KI
Sbjct: 704 ISTDGIITTIAGTGVDGYSGEGGPATAATLNGPNDVEVASDGTLYIANLGSDTIQKI 760


>gi|428183007|gb|EKX51866.1| hypothetical protein GUITHDRAFT_84912 [Guillardia theta CCMP2712]
          Length = 351

 Score = 45.8 bits (107), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 31/54 (57%), Gaps = 1/54 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKITNLD 134
            + LAG G Q +SDG P  + F+ P + AV   G  +YVAD +N  +RKI   D
Sbjct: 173 TRTLAGSGNQSFSDGPPSESSFNTPAAIAVSPDGLTVYVADLNNERVRKIAVRD 226



 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           LAG G  G++DG+  S+ F++P   +    G  +YVAD  N+ +RK+
Sbjct: 64  LAGTGTPGFADGDWNSSSFNQPCGVSTSPDGQWLYVADSGNNRVRKV 110


>gi|440714668|ref|ZP_20895245.1| NHL repeat containing protein [Rhodopirellula baltica SWK14]
 gi|436440433|gb|ELP33756.1| NHL repeat containing protein [Rhodopirellula baltica SWK14]
          Length = 283

 Score = 45.8 bits (107), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 4/95 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           + T + G   ++     +D  G+  L+VL+ + +    +         +  LAG G +G 
Sbjct: 143 LVTTLAGRTSKLKDPRAVDLDGNRRLLVLERNGNRLRRVE----SNGDITTLAGSGKKGT 198

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +DG+   A F+ PK   V   G +Y+AD  NH++R
Sbjct: 199 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 233



 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 16/49 (32%), Positives = 29/49 (59%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
           GY DG   SARF+K  +  +D +  +Y++D +NH +R++     G  ++
Sbjct: 38  GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTTDGEWMV 86


>gi|410448598|ref|ZP_11302672.1| hypothetical protein LEP1GSC068_3151 [Leptospira sp. Fiocruz
           LV3954]
 gi|410017668|gb|EKO79726.1| hypothetical protein LEP1GSC068_3151 [Leptospira sp. Fiocruz
           LV3954]
          Length = 356

 Score = 45.4 bits (106), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 41/85 (48%), Gaps = 1/85 (1%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I LD      Y     ++   VV   AG G++   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
           P    VD  GNI+V+D+  ++IRKI
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKI 103



 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207


>gi|32472743|ref|NP_865737.1| hypothetical protein RB3815 [Rhodopirellula baltica SH 1]
 gi|32443980|emb|CAD73422.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
          Length = 351

 Score = 45.4 bits (106), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 4/95 (4%)

Query: 34  VTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           + T + G   ++     +D  G+  L+VL+ + +    +         +  LAG G +G 
Sbjct: 211 LVTTLAGRTSKLKDPRAVDLDGNRRLLVLERNGNRLRRVK----SNGDITTLAGSGKKGK 266

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +DG+   A F+ PK   V   G +Y+AD  NH++R
Sbjct: 267 ADGDAKQASFNGPKHMDVAPDGRVYIADDVNHLVR 301



 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/49 (32%), Positives = 29/49 (59%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
           GY DG   SARF+K  +  +D +  +Y++D +NH +R++     G  ++
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTTDGEWMV 154


>gi|421111981|ref|ZP_15572448.1| hypothetical protein LEP1GSC071_4063 [Leptospira santarosai str.
           JET]
 gi|410802632|gb|EKS08783.1| hypothetical protein LEP1GSC071_4063 [Leptospira santarosai str.
           JET]
          Length = 356

 Score = 45.4 bits (106), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 41/85 (48%), Gaps = 1/85 (1%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I LD      Y     ++   VV   AG G++   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
           P    VD  GNI+V+D+  ++IRKI
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKI 103



 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207


>gi|451982091|ref|ZP_21930422.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
 gi|451760645|emb|CCQ91702.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
          Length = 724

 Score = 45.4 bits (106), Expect = 0.010,   Method: Composition-based stats.
 Identities = 18/57 (31%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + +  ++  +AG+G  G+S  +G    A  + P    +D+KGN+Y+AD++NH +R++
Sbjct: 81  IGKNGIITTVAGNGNAGFSGDEGPALEAALNFPAGLCLDLKGNLYIADRNNHRVRRV 137



 Score = 44.3 bits (103), Expect = 0.028,   Method: Composition-based stats.
 Identities = 20/57 (35%), Positives = 34/57 (59%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + +  ++  +AG GVQG+S DG P   A+   P   A D +GN+Y  D+ N+ +R++
Sbjct: 636 VDKNGIITTVAGTGVQGFSGDGGPAVQAQLKSPYRMAFDSEGNLYFTDRDNNRVRRV 692



 Score = 36.6 bits (83), Expect = 4.7,   Method: Composition-based stats.
 Identities = 30/95 (31%), Positives = 43/95 (45%), Gaps = 4/95 (4%)

Query: 39  DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--G 96
           +G   EI  H   D    S   V  S RS    L   ++ +  +  +AG G+ GY    G
Sbjct: 158 EGPAEEIPLHFPSDVACDSQGQVYISDRSNNRVLK--MNPQGQIVTVAGLGMAGYGGDFG 215

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
               A    P    VD  GN+Y+AD+ N+ +RK+T
Sbjct: 216 PAIDALLKYPFGIHVDESGNLYIADRGNNRVRKVT 250


>gi|456874551|gb|EMF89837.1| hypothetical protein LEP1GSC005_3835 [Leptospira santarosai str.
           ST188]
          Length = 356

 Score = 45.4 bits (106), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 41/85 (48%), Gaps = 1/85 (1%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I LD      Y     ++   VV   AG G++   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
           P    VD  GNI+V+D+  ++IRKI
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKI 103



 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207


>gi|422003150|ref|ZP_16350382.1| hypothetical protein LSS_06614 [Leptospira santarosai serovar
           Shermani str. LT 821]
 gi|417258114|gb|EKT87507.1| hypothetical protein LSS_06614 [Leptospira santarosai serovar
           Shermani str. LT 821]
          Length = 356

 Score = 45.4 bits (106), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 41/85 (48%), Gaps = 1/85 (1%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I LD      Y     ++   VV   AG G++   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTI-LDCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
           P    VD  GNI+V+D+  ++IRKI
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKI 103



 Score = 37.4 bits (85), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207


>gi|398331208|ref|ZP_10515913.1| hypothetical protein LalesM3_04029 [Leptospira alexanderi serovar
           Manhao 3 str. L 60]
          Length = 357

 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 36/70 (51%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      Y     ++   V+   AG G+    DG   +A F+ P    +D  GNI+V+
Sbjct: 36  ILDCILKECYLCKLKVTNNPVISLFAGTGINVSIDGTTQTASFNTPFGLELDTFGNIFVS 95

Query: 121 DKSNHVIRKI 130
           D+S ++IRKI
Sbjct: 96  DQSANLIRKI 105



 Score = 40.0 bits (92), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           V  L+G GV GY DG+  SA+F  P     D K + + VAD  NH IRKI
Sbjct: 210 VSTLSG-GVSGYLDGDLASAQFKSPLGITYDQKTDSLLVADLQNHRIRKI 258


>gi|290975027|ref|XP_002670245.1| predicted protein [Naegleria gruberi]
 gi|284083802|gb|EFC37501.1| predicted protein [Naegleria gruberi]
          Length = 1363

 Score = 45.4 bits (106), Expect = 0.011,   Method: Composition-based stats.
 Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 9/103 (8%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
           L   Y +TTV    +   +P   I +  + D+IV D+   +   +S+      V+ R+AG
Sbjct: 393 LPSKYYITTVATDFK---DPQK-IAKLSNGDIIVSDTGDHSIKKISY---STGVISRIAG 445

Query: 88  DGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            GV G+S   G    A+ +KP   A+     IY+AD  NH IR
Sbjct: 446 TGVAGFSGDGGLASQAQLNKPYGIAITANDEIYIADNLNHRIR 488



 Score = 38.5 bits (88), Expect = 1.5,   Method: Composition-based stats.
 Identities = 31/121 (25%), Positives = 57/121 (47%), Gaps = 6/121 (4%)

Query: 12  ITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYT 71
           + ++++L  +V+   L      ++T   G+    +P   + R  + D++V D+   A   
Sbjct: 13  VVMMAMLVYMVNGAPLNTYSVKISTFAGGNGEFKDPQK-LARLSNGDILVSDALGHAIKK 71

Query: 72  LSFPLSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           ++       V+  +AG+G  G+   +G   +A  +KP   AV     IY AD  N+ IRK
Sbjct: 72  INV---TSGVITTIAGNGTAGFGGDNGPAVNALVNKPFGIAVSQTDEIYFADSGNNRIRK 128

Query: 130 I 130
           I
Sbjct: 129 I 129


>gi|290969880|ref|XP_002667977.1| predicted protein [Naegleria gruberi]
 gi|284080923|gb|EFC35233.1| predicted protein [Naegleria gruberi]
          Length = 366

 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 29/43 (67%), Gaps = 1/43 (2%)

Query: 89  GVQGYSDGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G  G ++G  P +ARF+ P   A+   G ++VADK+NH+IR+I
Sbjct: 125 GTSGSTNGNTPTAARFNNPYGMALAADGTMFVADKANHIIRRI 167


>gi|393789630|ref|ZP_10377750.1| hypothetical protein HMPREF1068_04030 [Bacteroides nordii
           CL02T12C05]
 gi|392650346|gb|EIY44015.1| hypothetical protein HMPREF1068_04030 [Bacteroides nordii
           CL02T12C05]
          Length = 435

 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 51/122 (41%), Gaps = 22/122 (18%)

Query: 31  GYTVTTVIDGHQLEINPHS----VIDRPGSSD----LIVLDSSRSAFYTLSFPL----SE 78
            YT    I  + L+   H     +++ PG +D      + D+ R    T    L    S 
Sbjct: 311 AYTNAHCIYTYNLKTGAHKLFAGMVNTPGYADGPCEYALFDTPRQIILTAENDLFLADSN 370

Query: 79  ESVVKRLAGDGV----------QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
             V++++  DGV           G  DG P  A F++P   AVD  G IY+ D  N  IR
Sbjct: 371 NHVIRKITQDGVVSTVIGQAGMTGNQDGTPEVALFNEPFGVAVDTDGTIYIGDSKNQSIR 430

Query: 129 KI 130
           ++
Sbjct: 431 RL 432



 Score = 42.4 bits (98), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GY+DG    A FD P+   +  + ++++AD +NHVIRKIT
Sbjct: 339 GYADGPCEYALFDTPRQIILTAENDLFLADSNNHVIRKIT 378


>gi|441163235|ref|ZP_20968262.1| leucine-rich repeat-containing protein [Streptomyces rimosus subsp.
           rimosus ATCC 10970]
 gi|440616374|gb|ELQ79516.1| leucine-rich repeat-containing protein [Streptomyces rimosus subsp.
           rimosus ATCC 10970]
          Length = 553

 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 17/98 (17%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++ + ++  +AG+G  GY  DG P +  R   P   A+D  GN+Y+AD +NH +R +T  
Sbjct: 80  VTSDGIITTVAGNGTAGYVDDGGPAAGTRLYYPYGIALDRGGNLYIADCNNHRVRGVT-- 137

Query: 134 DHGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFSPYLI 171
                 + Q+   P   +          +  V SPY +
Sbjct: 138 -----AVAQMTPPPPPTAD--------LYGEVISPYRV 162



 Score = 44.3 bits (103), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   ++  +AG+G  GY SDG P    R   P   A+D  G++Y+ D  NH IRK+T
Sbjct: 24  VTPNGLITTVAGNGTAGYVSDGGPALGTRLHYPWGLALDEAGSLYIGDGHNHRIRKVT 81


>gi|116622401|ref|YP_824557.1| Ig domain-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116225563|gb|ABJ84272.1| Ig domain protein, group 1 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 2770

 Score = 45.4 bits (106), Expect = 0.011,   Method: Composition-based stats.
 Identities = 20/52 (38%), Positives = 35/52 (67%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  +AG G +GYS DG+P + A+ + P   A+D  GN+++AD  NH +R++
Sbjct: 141 IITTVAGTGERGYSGDGQPATLAKINSPYHIALDGHGNLFIADDGNHRVRRV 192



 Score = 42.4 bits (98), Expect = 0.11,   Method: Composition-based stats.
 Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG G+QG+S   G   +AR   P   A+   G++Y+AD  NH IRK+
Sbjct: 25  ISTVAGTGIQGFSGDSGPATAARLFDPTDVAIHPNGDLYIADTYNHRIRKV 75



 Score = 38.5 bits (88), Expect = 1.5,   Method: Composition-based stats.
 Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+  +AG G  GY+  D +   A    P+   +D  GN+Y+AD  NH +R +
Sbjct: 198 VITTVAGTGNAGYNGDDQQATHADLQNPRGVLIDASGNLYIADYGNHRVRVV 249


>gi|290982604|ref|XP_002674020.1| predicted protein [Naegleria gruberi]
 gi|284087607|gb|EFC41276.1| predicted protein [Naegleria gruberi]
          Length = 499

 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 43/90 (47%), Gaps = 13/90 (14%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYV 119
           +V+D++    Y   F  +   VV    GD ++ +     G  +FD PK    D+ GN+ V
Sbjct: 296 LVIDTTNEDIYICDFD-NHRIVVLSKNGDFLRDFGSKGSGDTQFDGPKDIDFDLDGNLVV 354

Query: 120 ADKSNH-----------VIRKITNLDHGNQ 138
           +D+ NH           +IRKI N  HGNQ
Sbjct: 355 SDRENHRILVLSKLNGSLIRKIEN-GHGNQ 383


>gi|290970060|ref|XP_002668023.1| predicted protein [Naegleria gruberi]
 gi|284081057|gb|EFC35279.1| predicted protein [Naegleria gruberi]
          Length = 318

 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 51/100 (51%), Gaps = 14/100 (14%)

Query: 79  ESVVKRLAGDGVQGYSDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           +S++K +AG   QG  DG    SA  + P S A++  G +Y+ADKS+H IRK+ N    N
Sbjct: 30  KSIIKTIAG---QGVCDGISATSASLNGPSSTAINSLGEVYIADKSHHRIRKVYN----N 82

Query: 138 QLIRQINLKPEDCSKSSQSGSGMFWVT--VFSPYLISCKS 175
             I  I         +  +G G+   T  +FSPY I   S
Sbjct: 83  GTIVTI----AGTGIAGYNGDGIDATTAQLFSPYGIGIDS 118



 Score = 36.6 bits (83), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 30/55 (54%), Gaps = 2/55 (3%)

Query: 78  EESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
               +  +AG G+ GY+ DG +  +A+   P    +D K  +Y +D SNH IRKI
Sbjct: 81  NNGTIVTIAGTGIAGYNGDGIDATTAQLFSPYGIGIDSKDLVYFSDSSNHRIRKI 135


>gi|223939922|ref|ZP_03631790.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223891413|gb|EEF57906.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 359

 Score = 45.4 bits (106), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 44/79 (55%), Gaps = 8/79 (10%)

Query: 60  IVLDSSRSAFY------TLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAV 111
           + LD + + F+      T+    ++  +V  +AG+G  G+S DG P + A+ ++P S   
Sbjct: 113 VRLDQAENLFWVERLSHTVRKCDAKTGIVTTIAGNGTAGFSGDGGPATKAQMNEPHSIGF 172

Query: 112 DMKGNIYVADKSNHVIRKI 130
           D  GN+Y+ D  NH IRK+
Sbjct: 173 DKAGNLYICDVRNHRIRKV 191



 Score = 40.4 bits (93), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 2/55 (3%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           E+  +  +AG G  G++ +G P  +A F  PK  +V   GN++VAD  NH IR I
Sbjct: 251 EKGRIFHVAGTGKNGFTGNGGPAKAATFKGPKGLSVASNGNVFVADTENHAIRMI 305


>gi|29830929|ref|NP_825563.1| hypothetical protein SAV_4386 [Streptomyces avermitilis MA-4680]
 gi|29608042|dbj|BAC72098.1| hypothetical protein SAV_4386 [Streptomyces avermitilis MA-4680]
          Length = 341

 Score = 45.4 bits (106), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 10/99 (10%)

Query: 39  DGHQLEINPHSVIDR-PG------SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ 91
           DG  +   P   + R PG      S + +V D++R     L+     ESVV+R+ G G++
Sbjct: 165 DGPYVAPEPEPTVLRFPGKALLLPSGNFLVSDTTRHQLVELA--ADGESVVRRI-GSGIR 221

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G++DG   +A F +P+   +  +  + VAD  NH +R++
Sbjct: 222 GFADGPADAASFSEPQGMTLLDEDVVVVADTVNHALRRL 260


>gi|302821467|ref|XP_002992396.1| hypothetical protein SELMODRAFT_135182 [Selaginella moellendorffii]
 gi|300139812|gb|EFJ06546.1| hypothetical protein SELMODRAFT_135182 [Selaginella moellendorffii]
          Length = 161

 Score = 45.4 bits (106), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 53/131 (40%), Gaps = 45/131 (34%)

Query: 74  FPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           FPL  +S ++  AG   G  GY DG    + F++P+S ++   G ++VAD  N  IRKI+
Sbjct: 14  FPL--DSSLEAFAGSFVGESGYVDGPAAKSLFNRPQSLSICDNGAVFVADTRNLAIRKIS 71

Query: 132 NL----------------------------------------DHGNQLIRQINL-KPEDC 150
                                                     D GN+LIR+I +  P+ C
Sbjct: 72  KFLPTVTTIAGGSSRKPGFAGDTARFSSEFSLACSCGSLLIADCGNRLIREIQIDDPKSC 131

Query: 151 SKSSQSGSGMF 161
             S  + SG F
Sbjct: 132 DSSDSAVSGDF 142


>gi|114566902|ref|YP_754056.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
           str. Goettingen]
 gi|114337837|gb|ABI68685.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
           str. Goettingen]
          Length = 1030

 Score = 45.4 bits (106), Expect = 0.012,   Method: Composition-based stats.
 Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++   AG+G  GYS DG P  SA+ + P   + D +GN+Y+AD  NH IR +
Sbjct: 656 IISTAAGNGTYGYSGDGGPATSAQLNNPNGLSFDNRGNMYIADTYNHRIRMV 707



 Score = 43.9 bits (102), Expect = 0.030,   Method: Composition-based stats.
 Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 8/61 (13%)

Query: 81  VVKRLAGDGV------QGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++  +AG+GV       GYS   G   SA+ + P   A D  GN+Y+AD +NH IRK+  
Sbjct: 594 IISTVAGNGVLTGSYKSGYSGDGGSATSAQLNYPYGVAFDASGNMYIADSNNHCIRKVDT 653

Query: 133 L 133
           L
Sbjct: 654 L 654



 Score = 43.9 bits (102), Expect = 0.036,   Method: Composition-based stats.
 Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 3/63 (4%)

Query: 71  TLSFPLSEES-VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           T    L++ + ++  +AG+G  GYS   G   SA  + P   A D  GN+Y+AD SN  +
Sbjct: 471 TFRLTLTQNTGIISTVAGNGTAGYSGDGGLAASALLNYPHGLAFDGNGNLYIADASNRRV 530

Query: 128 RKI 130
           RKI
Sbjct: 531 RKI 533



 Score = 42.4 bits (98), Expect = 0.086,   Method: Composition-based stats.
 Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 8/79 (10%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAV 111
           G+ +L + D+S      +        ++  +AG+G  GYS G+ GSA   +   P   A 
Sbjct: 516 GNGNLYIADASNRRVRKID----SAGIITTVAGNGTSGYS-GDGGSAIAAKITCPYGVAF 570

Query: 112 DMKGNIYVADKSNHVIRKI 130
           D  GN+Y+AD  NH IRK+
Sbjct: 571 DSNGNMYIADIFNHRIRKV 589



 Score = 40.4 bits (93), Expect = 0.40,   Method: Composition-based stats.
 Identities = 34/146 (23%), Positives = 67/146 (45%), Gaps = 14/146 (9%)

Query: 44  EINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG-YSDGEPG-SA 101
           ++N  + I    S ++ + DS+ +    +        ++   AG+G  G + DG P  SA
Sbjct: 743 QLNNPNGITFDSSGNMYIADSNNNCIRKVD----HSGMISTFAGNGTSGHFGDGGPATSA 798

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI--------RQINLKPEDCSKS 153
           +   P   A+D  GN+++AD  +H IRK+      N  I        ++I+ + +  +  
Sbjct: 799 QLRNPVGVALDNSGNLFIADYFDHSIRKVVLAAQQNSTISPTTGNFDKRISAQADVSTTM 858

Query: 154 SQSGSGMFWVTVFSPYLISCKSFWVH 179
           + +G+ +  +T     LI  K + V+
Sbjct: 859 TLNGNELLRITNGETALIQGKDYNVN 884



 Score = 37.4 bits (85), Expect = 3.1,   Method: Composition-based stats.
 Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 13/69 (18%)

Query: 81  VVKRLAGDGVQGYSDGEPG----------SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+  +AG+G  G   G  G          SA+ + P     D  GN+Y+AD +N+ IRK+
Sbjct: 712 VISTVAGNGNSGDRYGNDGGYSGDGGLATSAQLNNPNGITFDSSGNMYIADSNNNCIRKV 771

Query: 131 TNLDHGNQL 139
              DH   +
Sbjct: 772 ---DHSGMI 777


>gi|312198347|ref|YP_004018408.1| serine/threonine protein kinase [Frankia sp. EuI1c]
 gi|311229683|gb|ADP82538.1| serine/threonine protein kinase [Frankia sp. EuI1c]
          Length = 814

 Score = 45.1 bits (105), Expect = 0.013,   Method: Composition-based stats.
 Identities = 20/54 (37%), Positives = 32/54 (59%), Gaps = 2/54 (3%)

Query: 79  ESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + V++ +AG GV GYS   G+  +A    P+   VD  GN+Y+ D  N+ +R+I
Sbjct: 703 DGVIETIAGTGVAGYSGDGGKATAATLRDPRGVTVDAAGNVYITDSGNNRVRRI 756



 Score = 42.4 bits (98), Expect = 0.088,   Method: Composition-based stats.
 Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 2/54 (3%)

Query: 79  ESVVKRLAGD-GVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + ++  +AG  G   + DG+P + A   KP + A+D +G IY+AD  NH IR+I
Sbjct: 647 DGIITTVAGRFGYGSWGDGKPATQAMISKPFNVALDRQGRIYIADSYNHKIRRI 700



 Score = 38.5 bits (88), Expect = 1.4,   Method: Composition-based stats.
 Identities = 24/61 (39%), Positives = 35/61 (57%), Gaps = 8/61 (13%)

Query: 73  SFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           SFP      V  +AG G  G++ DG P + A+ D P   A+D  GN+Y AD  N+ +R+I
Sbjct: 477 SFP------VVAIAGTGQAGFAGDGGPAAQAQLDHPYGPAMDGFGNLYFADFDNNRVRRI 530

Query: 131 T 131
           +
Sbjct: 531 S 531



 Score = 37.0 bits (84), Expect = 3.8,   Method: Composition-based stats.
 Identities = 18/58 (31%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S +  +  +AG+G  G+S DG P +A   +KP + A+   G +Y+ D  N  +R+++
Sbjct: 530 ISPDGTITTVAGNGQPGFSGDGGPATAAMLNKPVAVAIGPGGTLYIVDTFNMRVRQVS 587



 Score = 36.6 bits (83), Expect = 4.7,   Method: Composition-based stats.
 Identities = 23/88 (26%), Positives = 47/88 (53%), Gaps = 9/88 (10%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS---DGEPGS-AR 102
           P +V   PG + L ++D+    F      +S + +++ +AG G + ++   DG P + A 
Sbjct: 562 PVAVAIGPGGT-LYIVDT----FNMRVRQVSPDGIIQTIAGSGERPWNPADDGGPATNAA 616

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
              P   A+D  G++++AD  N ++R++
Sbjct: 617 LWYPSGIAIDSAGDLFIADNGNDIVRRV 644


>gi|373955343|ref|ZP_09615303.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
 gi|373891943|gb|EHQ27840.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
          Length = 929

 Score = 45.1 bits (105), Expect = 0.014,   Method: Composition-based stats.
 Identities = 20/47 (42%), Positives = 31/47 (65%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           AG G  G ++G   ++ F  P   A D+ GN+YVAD+ N++IRKI++
Sbjct: 142 AGTGNSGSTNGSALTSTFYSPTRVAADLSGNLYVADRDNNLIRKISS 188



 Score = 43.1 bits (100), Expect = 0.052,   Method: Composition-based stats.
 Identities = 21/52 (40%), Positives = 31/52 (59%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           V  LAG G+  ++DG   +A F  P    +D  GN+Y+AD  N+ +RK+T L
Sbjct: 286 VTTLAGSGMAAFADGTGTAASFYGPCGGTLDAMGNLYIADGVNNRVRKVTPL 337



 Score = 42.7 bits (99), Expect = 0.075,   Method: Composition-based stats.
 Identities = 22/49 (44%), Positives = 31/49 (63%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  +AG+G  G ++G   +A F  P S  VD  GN+YV+D  N++IRKI
Sbjct: 232 VTVVAGNGSMGSNNGIGSAASFYYPFSVTVDGAGNLYVSDNGNNLIRKI 280



 Score = 40.8 bits (94), Expect = 0.26,   Method: Composition-based stats.
 Identities = 22/49 (44%), Positives = 27/49 (55%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  LAG  V G ++G   +A F +P     D  G IYV D  N+VIRKI
Sbjct: 394 VTILAGSNVAGSANGIGTAASFRRPNDVQADQSGFIYVTDYGNNVIRKI 442



 Score = 36.2 bits (82), Expect = 7.4,   Method: Composition-based stats.
 Identities = 14/29 (48%), Positives = 21/29 (72%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           F++P    VD+ GN+YVAD + + I+KIT
Sbjct: 199 FNQPNGVTVDLNGNVYVADAATNSIKKIT 227



 Score = 35.8 bits (81), Expect = 8.8,   Method: Composition-based stats.
 Identities = 17/50 (34%), Positives = 28/50 (56%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           VV  +AG+G +   +G   SA  + P    +D+ G +YVA+   + IRK+
Sbjct: 339 VVTTVAGNGTRATINGNGTSASLNTPTGATIDIAGIVYVAELDGNCIRKV 388


>gi|418754100|ref|ZP_13310334.1| hypothetical protein LEP1GSC179_2637 [Leptospira santarosai str.
           MOR084]
 gi|409965522|gb|EKO33385.1| hypothetical protein LEP1GSC179_2637 [Leptospira santarosai str.
           MOR084]
          Length = 356

 Score = 45.1 bits (105), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 1/85 (1%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP   + R  + + I+ D      Y     ++   VV   AG G++   DG   +A F  
Sbjct: 20  NPSIFMSRAWAENTII-DCILKECYLCKLKITNNPVVSLFAGTGIKESIDGTTQTASFKT 78

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
           P    VD  GNI+V+D+  ++IRKI
Sbjct: 79  PFGLEVDTFGNIFVSDQMANLIRKI 103



 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           GV G+ +G+  ++ F  P    +D + N+YV +  NH IRKI NL+ G
Sbjct: 161 GVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKI-NLNSG 207


>gi|332878080|ref|ZP_08445810.1| NHL repeat protein [Capnocytophaga sp. oral taxon 329 str. F0087]
 gi|357047673|ref|ZP_09109271.1| NHL repeat protein [Paraprevotella clara YIT 11840]
 gi|332684042|gb|EGJ56909.1| NHL repeat protein [Capnocytophaga sp. oral taxon 329 str. F0087]
 gi|355529361|gb|EHG98795.1| NHL repeat protein [Paraprevotella clara YIT 11840]
          Length = 461

 Score = 45.1 bits (105), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 46/101 (45%), Gaps = 26/101 (25%)

Query: 53  RPGSSD------------LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV---------- 90
           RPG +D             +VLD   +    L    SE + ++++  +GV          
Sbjct: 363 RPGHADGLASDAQFHSPRQLVLDEEEN----LYIADSENNCIRKITPEGVVSTVIGIPGK 418

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            GY DG P  A F +P   A+D +G IYV DK N  +R+++
Sbjct: 419 SGYKDGTPDVALFTQPWGLAIDSEGIIYVGDKDNLCVRQLS 459



 Score = 43.5 bits (101), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 33/63 (52%)

Query: 69  FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           F  ++ P  E +        G  G++DG    A+F  P+   +D + N+Y+AD  N+ IR
Sbjct: 342 FEDITDPTREGNFKIYAGMQGRPGHADGLASDAQFHSPRQLVLDEEENLYIADSENNCIR 401

Query: 129 KIT 131
           KIT
Sbjct: 402 KIT 404


>gi|320105416|ref|YP_004181006.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
 gi|319923937|gb|ADV81012.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
          Length = 822

 Score = 45.1 bits (105), Expect = 0.014,   Method: Composition-based stats.
 Identities = 25/53 (47%), Positives = 32/53 (60%), Gaps = 4/53 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AG GV G+S G+ G+A       P   AVD  GN+YVAD  NH IRKI+
Sbjct: 149 IATIAGTGVAGFS-GDGGAAVLATLSHPTGLAVDTGGNLYVADTDNHRIRKIS 200



 Score = 43.5 bits (101), Expect = 0.045,   Method: Composition-based stats.
 Identities = 22/54 (40%), Positives = 33/54 (61%), Gaps = 2/54 (3%)

Query: 80  SVVKRLAGDGVQGYS-DGEPGSARF-DKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + +  +AG G QG++ DG P +A + D P   AVD  GN+Y+AD  N  IR ++
Sbjct: 202 TTITTVAGSGEQGFAGDGGPATAAWLDSPDGVAVDATGNLYIADTHNQRIRVVS 255



 Score = 42.4 bits (98), Expect = 0.086,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S   ++  +AG G QG++   G   SA  D P   AVD  GN+Y++D  N  IR+++
Sbjct: 88  VSAAGIITTVAGTGEQGFAGDGGSATSAWLDSPVGVAVDRVGNLYISDSHNQRIRRVS 145



 Score = 40.8 bits (94), Expect = 0.28,   Method: Composition-based stats.
 Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 4/58 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S E  +  +AG+G + Y+ G+ GSA      +P+  +VD  GNIY AD  N+ IR I
Sbjct: 254 VSAEGTISTIAGNGSRAYA-GDGGSAVAASLARPRGLSVDALGNIYFADSDNNRIRLI 310



 Score = 36.6 bits (83), Expect = 5.7,   Method: Composition-based stats.
 Identities = 16/38 (42%), Positives = 23/38 (60%)

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           S+ E  +A    P   A D  GN+Y AD +NHV+RK++
Sbjct: 52  SNVEATTAPLASPAGIAYDASGNMYFADLNNHVVRKVS 89


>gi|383120609|ref|ZP_09941337.1| hypothetical protein BSIG_2379 [Bacteroides sp. 1_1_6]
 gi|382985045|gb|EES68418.2| hypothetical protein BSIG_2379 [Bacteroides sp. 1_1_6]
          Length = 483

 Score = 45.1 bits (105), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 40/89 (44%), Gaps = 4/89 (4%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARF 103
           NPH +        L + D   +    +    PL    V   +   G++GY DG P  A F
Sbjct: 394 NPHQICFTE-DGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMKGYKDGGPDIALF 452

Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           + P   AV   G I YVAD  N VIRK++
Sbjct: 453 NHPFGVAVSADGQIVYVADTGNKVIRKLS 481


>gi|357412450|ref|YP_004924186.1| alkyl hydroperoxide reductase [Streptomyces flavogriseus ATCC
           33331]
 gi|320009819|gb|ADW04669.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Streptomyces flavogriseus ATCC 33331]
          Length = 603

 Score = 45.1 bits (105), Expect = 0.015,   Method: Composition-based stats.
 Identities = 26/72 (36%), Positives = 37/72 (51%), Gaps = 3/72 (4%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
            +V DS+R     L    ++   V+   G G +G++DG    ARF +P+  AV   G I 
Sbjct: 188 FLVSDSTRHRLVELE---ADGETVRGHFGTGERGFADGGREEARFSEPQGLAVLPDGRIA 244

Query: 119 VADKSNHVIRKI 130
           VAD  NH IR +
Sbjct: 245 VADTVNHAIRAL 256


>gi|126179662|ref|YP_001047627.1| NHL repeat-containing protein [Methanoculleus marisnigri JR1]
 gi|125862456|gb|ABN57645.1| NHL repeat containing protein [Methanoculleus marisnigri JR1]
          Length = 487

 Score = 45.1 bits (105), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           AGDG +GY DG  G AR ++P    VD  G  Y+AD  NH++R    + H
Sbjct: 430 AGDGDRGYRDGLSGEARLNEPGGL-VDFGGLWYIADTGNHIVRVYDPVRH 478



 Score = 35.8 bits (81), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 1/46 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           G G  G +DG    A F  P+  A D +  I YVAD  NH IR+++
Sbjct: 210 GSGAAGNADGPFDEAAFYLPEGLAFDEEAGILYVADTGNHTIRRVS 255


>gi|88601128|gb|ABD46562.1| NHL repeat-containing protein [Vermamoeba vermiformis]
          Length = 309

 Score = 45.1 bits (105), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ + VV  LAG  G  G  DG    ARF+ P    VD  GN+YVAD  N+ +RK+T
Sbjct: 29  ITPDGVVVTLAGHPGHWGSRDGVGSKARFNGPSGLDVDTDGNVYVADYYNNTMRKVT 85



 Score = 44.7 bits (104), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 36/53 (67%), Gaps = 1/53 (1%)

Query: 82  VKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           V  +AG  +Q G +DG+  +ARF++P   +VD  GN++V+D  NH IRKI+ L
Sbjct: 202 VTTIAGRVMQPGSADGKLTAARFNQPSGISVDKIGNLFVSDYYNHTIRKISPL 254



 Score = 36.2 bits (82), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 38/60 (63%), Gaps = 1/60 (1%)

Query: 76  LSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           ++ E +V  +AG  G  G +DG    ARF+ P    +D +GNI ++D +N+ +RKI+N+D
Sbjct: 84  VTPEGIVTTIAGHVGQWGSTDGSGEKARFNGPSGVRIDTEGNIIISDNNNNTVRKISNID 143



 Score = 35.8 bits (81), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 21/40 (52%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  DG    ARF  P   AV   G I+V D+ NH IR I+
Sbjct: 158 GSEDGNGQQARFFGPSGIAVAPDGTIFVCDRYNHTIRSIS 197


>gi|255070387|ref|XP_002507275.1| predicted protein [Micromonas sp. RCC299]
 gi|226522550|gb|ACO68533.1| predicted protein [Micromonas sp. RCC299]
          Length = 131

 Score = 45.1 bits (105), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 32/54 (59%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           V  ++G G+ G+ DG+   + F  P   AV   G++YVAD  N+ IR++++  H
Sbjct: 6   VTSISGSGIPGFRDGQGEKSHFSSPAGVAVSSDGSVYVADSGNNRIRRVSSKGH 59



 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S +  V  +AG G+ G+SD +   A F++P+       G I+VAD  NH IR ++
Sbjct: 54  VSSKGHVTWIAGSGLAGFSDDKLLRAEFNRPQGIVTIPTGLIFVADTLNHRIRLVS 109


>gi|111219920|ref|YP_710714.1| serine/threonine protein kinase [Frankia alni ACN14a]
 gi|111147452|emb|CAJ59102.1| Putative serine/threonine protein kinase [Frankia alni ACN14a]
          Length = 725

 Score = 45.1 bits (105), Expect = 0.016,   Method: Composition-based stats.
 Identities = 20/57 (35%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  + VV  +AG+G++G++ DG P   A    P   AVD  G+IY++D+ N  +R++
Sbjct: 422 VDRDGVVTTVAGNGIRGFTGDGGPAIRAELADPAGIAVDAAGDIYISDQGNQRVRRV 478



 Score = 40.0 bits (92), Expect = 0.47,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 31/64 (48%), Gaps = 15/64 (23%)

Query: 81  VVKRLAGDGVQGYSDGEPG--------------SARFDKPKSFAVDMKGNIYVADKSNHV 126
           V+   AG GV G+S GE G               A  D+P +  VD  GN+Y+ D SN  
Sbjct: 483 VITTFAGTGVFGFS-GENGPKIGGFSGDGVLARQAMLDEPSALWVDRAGNVYICDGSNDR 541

Query: 127 IRKI 130
           IRK+
Sbjct: 542 IRKV 545



 Score = 37.4 bits (85), Expect = 2.7,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 27/46 (58%), Gaps = 2/46 (4%)

Query: 87  GDGVQGYSDGEPGSAR--FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G G  G+S     +A    +KP + AVD  GN+Y+ DKSN  +R++
Sbjct: 377 GTGNDGFSGDGGRAAGAGLNKPDAMAVDTAGNLYIVDKSNQRVRRV 422



 Score = 36.2 bits (82), Expect = 6.8,   Method: Composition-based stats.
 Identities = 16/37 (43%), Positives = 24/37 (64%), Gaps = 2/37 (5%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           +AR   P+S AVD+ G +YV D+ N+ +R+I    HG
Sbjct: 571 AARLQWPESLAVDVAGTMYVTDQGNNRVRRIDT--HG 605


>gi|357008076|ref|ZP_09073075.1| copper amine oxidase domain-containing protein [Paenibacillus elgii
           B69]
          Length = 533

 Score = 44.7 bits (104), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 29/50 (58%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  +AG+G+ G  DGE   A F +P   AV   G +YVAD  NH IR I+
Sbjct: 160 VSTVAGNGLLGRKDGEGKDALFYRPTDVAVAADGTLYVADSLNHAIRSIS 209



 Score = 41.6 bits (96), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 34/60 (56%), Gaps = 11/60 (18%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR-------KITNLDHGNQ 138
           AGD    ++DG+  SA+F++P +  +D KGN+ V+D  N  IR       K+T L  G Q
Sbjct: 235 AGD----FADGDLKSAKFNEPTALVLDAKGNLIVSDSGNQRIRYIDLQQGKVTTLAGGGQ 290



 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 41/89 (46%), Gaps = 15/89 (16%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG---------VQG-YSDGEPG 99
           V+D  G  +LIV DS       +     ++  V  LAG G         VQG ++DG   
Sbjct: 255 VLDAKG--NLIVSDSGNQRIRYIDL---QQGKVTTLAGGGQAATNKELHVQGGFADGSAS 309

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            ARF  P   A+  +G + +AD  NH IR
Sbjct: 310 DARFSFPMGLALTEEGGLVIADSQNHAIR 338



 Score = 37.0 bits (84), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           PL+E +    LAG+G  G +DG   SA F  P    V   G + V+D  N +IRK++ 
Sbjct: 40  PLTEMAT---LAGNGGLGSADGAGVSASFRMPGGLTVLKDGTVLVSDSRNQLIRKLSQ 94


>gi|399157177|ref|ZP_10757244.1| NHL repeat containing protein, partial [SAR324 cluster bacterium
           SCGC AAA001-C10]
          Length = 352

 Score = 44.7 bits (104), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + V  LAG    G +D    SA+F+ PK    D   N+YVAD SNH IRKI
Sbjct: 238 TAVTTLAGQSDNGSTDATGTSAQFNYPKGITTDGT-NLYVADYSNHRIRKI 287



 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           VV  LAG   QG +D    SA F  P     D   N+YVAD+ NH IRKI
Sbjct: 293 VVTTLAGSS-QGSTDATGTSASFYYPSGITTDGT-NLYVADRYNHRIRKI 340


>gi|441185716|ref|ZP_20970504.1| hypothetical protein SRIM_41417, partial [Streptomyces rimosus
           subsp. rimosus ATCC 10970]
 gi|440613960|gb|ELQ77297.1| hypothetical protein SRIM_41417, partial [Streptomyces rimosus
           subsp. rimosus ATCC 10970]
          Length = 108

 Score = 44.7 bits (104), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 2/55 (3%)

Query: 79  ESVVKRLAGDGVQGY-SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +  +AG+GV G+ SDG  G+  R   P+   VD  GN+++AD+ NH IRK+T
Sbjct: 18  DGSIATVAGNGVAGFISDGGHGALTRVYYPQGVTVDKNGNLFIADRYNHRIRKVT 72


>gi|423215200|ref|ZP_17201728.1| hypothetical protein HMPREF1074_03260 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692463|gb|EIY85701.1| hypothetical protein HMPREF1074_03260 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 451

 Score = 44.7 bits (104), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 28/43 (65%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  G+ DG P  A+F+ P+   V    ++YVAD+ NHVIR++T
Sbjct: 407 GQSGFGDGLPELAKFNSPECVTVYPDNSVYVADRENHVIRRVT 449



 Score = 43.5 bits (101), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GY +G   + R ++P+  A D  GN++V +KS H+IRKIT
Sbjct: 355 GYVNGSGATVRLNQPRQPAFDEDGNMFVPEKSAHIIRKIT 394


>gi|320107007|ref|YP_004182597.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
 gi|319925528|gb|ADV82603.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
          Length = 710

 Score = 44.7 bits (104), Expect = 0.019,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + E   +  +AG G QGYS  E  +  AR D P S A D  G +  AD+ NH IR++
Sbjct: 175 VDENGSISTVAGTGDQGYSGDEGPAILARIDSPSSLAFDAAGRLLFADRRNHRIRRV 231



 Score = 43.5 bits (101), Expect = 0.045,   Method: Composition-based stats.
 Identities = 23/58 (39%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           LS       +AGDG QG++ DG P  SA  D P   AV  +G +++AD  NH +R ++
Sbjct: 73  LSTTGTWTTVAGDGRQGFAGDGGPAISAELDTPMGIAVSAEGTLWIADAHNHRVRLVS 130


>gi|298245110|ref|ZP_06968916.1| NHL repeat containing protein [Ktedonobacter racemifer DSM 44963]
 gi|297552591|gb|EFH86456.1| NHL repeat containing protein [Ktedonobacter racemifer DSM 44963]
          Length = 496

 Score = 44.7 bits (104), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 57/115 (49%), Gaps = 17/115 (14%)

Query: 27  LLEDGYTVTTVIDGHQLEIN----PHSVIDRPG-------SSDLIVLDSSRSAFYTLSFP 75
           +LE+ +    ++D  QL  +    P S +  PG       +  LI+ DS+       +  
Sbjct: 148 MLEE-FEARGLLDHRQLRFSAATAPSSFLAFPGKLAVDAQADRLIISDSAHHRLVETNL- 205

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
              +  V+ + G GVQG +DG    A+F+ P+  A+ +   +YVAD  NH+IR++
Sbjct: 206 ---QGRVRTIIGSGVQGQADGSFAEAQFNHPQGVAL-VNDLLYVADTDNHLIRRV 256



 Score = 41.2 bits (95), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 6/67 (8%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQLI 140
           V+ LAG G  G  +G    A+F++P   A+   GN +Y+AD +NH IR    LD   + I
Sbjct: 435 VRTLAGTGEAGIHNGPAEQAQFNEPGDLAI--TGNTLYIADTNNHAIRL---LDMATRNI 489

Query: 141 RQINLKP 147
             ++++P
Sbjct: 490 TTLDIQP 496


>gi|153867703|ref|ZP_01997987.1| NHL repeat protein [Beggiatoa sp. SS]
 gi|152144978|gb|EDN72014.1| NHL repeat protein [Beggiatoa sp. SS]
          Length = 263

 Score = 44.7 bits (104), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 30/47 (63%), Gaps = 2/47 (4%)

Query: 86  AGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           AGDG  G++ DG P +A + + P    +D  GN+Y+AD  NH IRKI
Sbjct: 11  AGDGNLGFAGDGGPATAAQLNSPLGMGIDSAGNLYIADTENHRIRKI 57



 Score = 44.3 bits (103), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 34/51 (66%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AGDG +GYS DG    SA+ + P   ++D  GN+Y+AD  N+V+RK+
Sbjct: 63  ISTVAGDGNRGYSGDGAAAVSAKLNNPMWVSLDSAGNLYIADTGNNVVRKL 113



 Score = 43.9 bits (102), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 33/52 (63%), Gaps = 2/52 (3%)

Query: 82  VKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AGDG  G++ DG+   +A+ + P    VD  G +Y+AD SNH IRK++
Sbjct: 176 ITTIAGDGTPGFAGDGKIATAAQLNAPTQVMVDSTGQVYIADTSNHRIRKVS 227



 Score = 41.6 bits (96), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 6/53 (11%)

Query: 82  VKRLAGD---GVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AGD   GV G  DG P +A +   P    +D  GN+Y+AD SNH IRK+
Sbjct: 120 ISTVAGDSSVGVAG--DGGPATAAQLSYPTGIDIDTAGNLYIADSSNHRIRKV 170


>gi|283779991|ref|YP_003370746.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
 gi|283438444|gb|ADB16886.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
          Length = 364

 Score = 44.7 bits (104), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 49/96 (51%), Gaps = 9/96 (9%)

Query: 38  IDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
           IDG    + P +V ID+  +   I L    S  + L+   SE   +  +AG G +G+S D
Sbjct: 214 IDGTMSLLGPRAVCIDK--NVMWIALREGHS-VWKLNLATSE---LTHIAGTGKKGFSGD 267

Query: 96  GEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G P   A FD PK  AV   G + V D  NHVIRKI
Sbjct: 268 GGPAKDATFDGPKGVAVCPDGGVVVVDTENHVIRKI 303



 Score = 43.1 bits (100), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 33/56 (58%), Gaps = 2/56 (3%)

Query: 77  SEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  ++  +AG G +G++ DG P   A F  P S A+D   NIYV D  NH IRKI
Sbjct: 135 AKTGIISTIAGSGKEGFAGDGGPAKEAVFSIPHSIALDADDNIYVCDLGNHRIRKI 190


>gi|29349707|ref|NP_813210.1| hypothetical protein BT_4299 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|298383965|ref|ZP_06993526.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|383120644|ref|ZP_09941370.1| hypothetical protein BSIG_2353 [Bacteroides sp. 1_1_6]
 gi|29341617|gb|AAO79404.1| putative cell surface protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251840310|gb|EES68392.1| hypothetical protein BSIG_2353 [Bacteroides sp. 1_1_6]
 gi|298263569|gb|EFI06432.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 495

 Score = 44.7 bits (104), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 3/76 (3%)

Query: 59  LIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           + + DS  S    +  + P    +V   +   G +GY DG P  A+F  P   AV+  G 
Sbjct: 418 MYIADSGNSCIRVIDTTMPKERAAVTTPIGLPGAEGYKDGGPEIAKFHFPCGVAVNSDGT 477

Query: 117 I-YVADKSNHVIRKIT 131
           I YVAD  N VIRK++
Sbjct: 478 IVYVADTQNKVIRKLS 493


>gi|297568314|ref|YP_003689658.1| hypothetical protein [Desulfurivibrio alkaliphilus AHT2]
 gi|296924229|gb|ADH85039.1| YD repeat protein [Desulfurivibrio alkaliphilus AHT2]
          Length = 2439

 Score = 44.7 bits (104), Expect = 0.020,   Method: Composition-based stats.
 Identities = 19/50 (38%), Positives = 30/50 (60%)

Query: 81   VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +++ +AG G  G+S G   +AR  +P   ++D  G IY+AD  NH IR +
Sbjct: 1068 IIETIAGTGSWGHSGGLAVNARLYRPSCVSIDPDGTIYIADTGNHRIRSV 1117



 Score = 42.0 bits (97), Expect = 0.13,   Method: Composition-based stats.
 Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 2/54 (3%)

Query: 79   ESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            + ++  +AG G +G+S DG P   AR   P + A+   G+IY+AD  NH IR++
Sbjct: 1244 DGIITTIAGTGYRGFSGDGGPADEARLFGPIAVAIGPDGSIYIADTYNHRIRRV 1297



 Score = 41.2 bits (95), Expect = 0.22,   Method: Composition-based stats.
 Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 2/49 (4%)

Query: 84   RLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            R  G G  GYS  DG   +AR + P   A+   G+IY+AD  NH IR++
Sbjct: 1132 RAGGLGDSGYSGDDGPAVNARLNSPTDIALGPDGSIYIADSINHRIRRV 1180



 Score = 40.8 bits (94), Expect = 0.28,   Method: Composition-based stats.
 Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 79   ESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            + ++  +AG G +G+S DG P       P+  ++   G+IYVAD +NH IR I
Sbjct: 1354 DGIITTIAGTGYRGFSGDGGPADEARLGPRGVSLGPDGSIYVADSNNHRIRHI 1406



 Score = 37.0 bits (84), Expect = 3.8,   Method: Composition-based stats.
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 101  ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            AR D+P + A+   G+IY+AD  NH IR++
Sbjct: 1212 ARLDRPHAVAIGPDGSIYIADTYNHRIRRV 1241


>gi|380694052|ref|ZP_09858911.1| hypothetical protein BfaeM_08738 [Bacteroides faecis MAJ27]
          Length = 495

 Score = 44.7 bits (104), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 3/76 (3%)

Query: 59  LIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           + + DS  S    +  + P    +V   +   G +GY DG P  A+F  P   AV+  G 
Sbjct: 418 MYIADSGNSCIRVVDTTMPKERAAVTTPIGLPGAEGYKDGGPEIAKFHFPCGVAVNSDGT 477

Query: 117 I-YVADKSNHVIRKIT 131
           I YVAD  N VIRK++
Sbjct: 478 IVYVADTQNKVIRKLS 493


>gi|290983166|ref|XP_002674300.1| predicted protein [Naegleria gruberi]
 gi|284087889|gb|EFC41556.1| predicted protein [Naegleria gruberi]
          Length = 1435

 Score = 44.7 bits (104), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 53/115 (46%), Gaps = 16/115 (13%)

Query: 31  GYTVTTVIDGHQLEINPHSVIDRP----------GSSDLIVLDSSRSAFYTLSFPLSEES 80
           G ++T + +G+ L+    + ++RP             +L   D S S    L   +    
Sbjct: 140 GTSITGLYNGNNLK-GTLTALNRPIGLFYTEIFSSQGELYFADESNSLIRKL---VIANG 195

Query: 81  VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
            V  +AG+ V GYS   GE  SA    P S  V   G IY+AD  N VIRK++NL
Sbjct: 196 TVSNVAGNVVAGYSGDGGEATSASLRYPSSVYVSNSGEIYIADTFNCVIRKVSNL 250


>gi|298383938|ref|ZP_06993499.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|298263542|gb|EFI06405.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 483

 Score = 44.7 bits (104), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 40/89 (44%), Gaps = 4/89 (4%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTL--SFPLSEESVVKRLAGDGVQGYSDGEPGSARF 103
           NPH +        L + D   +    +    PL    V   +   G++GY DG P  A F
Sbjct: 394 NPHQICFTE-DGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMKGYKDGGPDIALF 452

Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           + P   AV   G I Y+AD  N VIRK++
Sbjct: 453 NHPFGVAVSADGQIVYIADTGNKVIRKLS 481


>gi|395226993|ref|ZP_10405365.1| hypothetical protein ThvES_00021150, partial [Thiovulum sp. ES]
 gi|394444658|gb|EJF05823.1| hypothetical protein ThvES_00021150, partial [Thiovulum sp. ES]
          Length = 240

 Score = 44.7 bits (104), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S    V+   G G+ GY DG   +++ +KP+    D KGN+ V D  N +IR+I+
Sbjct: 162 ISPSGEVQNFVGSGIAGYQDGVGFNSQLNKPRGMTFDEKGNLLVVDSYNEMIREIS 217


>gi|303248818|ref|ZP_07335068.1| YD repeat protein [Desulfovibrio fructosovorans JJ]
 gi|302489759|gb|EFL49690.1| YD repeat protein [Desulfovibrio fructosovorans JJ]
          Length = 2115

 Score = 44.3 bits (103), Expect = 0.025,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 37/62 (59%), Gaps = 2/62 (3%)

Query: 71  TLSFPLSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIR 128
           T  F L  +  ++ +AG+G   Y+ DG P +A     P   AVD +GN+Y+AD+ N+ IR
Sbjct: 846 TKVFRLDTDGTIRVVAGNGSTHYNGDGIPATAASLYHPNGLAVDAQGNLYIADQYNNRIR 905

Query: 129 KI 130
           K+
Sbjct: 906 KV 907


>gi|302551068|ref|ZP_07303410.1| NHL repeat-containing protein [Streptomyces viridochromogenes DSM
           40736]
 gi|302468686|gb|EFL31779.1| NHL repeat-containing protein [Streptomyces viridochromogenes DSM
           40736]
          Length = 498

 Score = 44.3 bits (103), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 35/51 (68%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG G+ G++ D +P  +A+ + P + AVD  GN+++AD SNH +R++
Sbjct: 244 ITTVAGTGIAGFNGDNQPAVNAQLNAPHTVAVDSGGNVFIADTSNHRVRRV 294



 Score = 40.0 bits (92), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 33/51 (64%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG+G  G++ D +P   +R   P+  AVD  GN+++AD +NH +R++
Sbjct: 73  ITTVAGNGTAGFNGDNQPAVYSRLHYPRGVAVDAGGNLFIADCNNHRVRRV 123



 Score = 39.3 bits (90), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 34/57 (59%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ E  +  +AG+G  G+S   G    A+ + P+  AVD  GN++ +D SNH +R++
Sbjct: 10  VTREPAIYIVAGNGTAGFSSDYGPAVCAQLNYPRKVAVDSSGNLFFSDYSNHRVRRV 66



 Score = 36.6 bits (83), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 48/90 (53%), Gaps = 8/90 (8%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARF 103
           PH+V +D  G  ++ + D+S      +     +   +  +AG G  G++ DG P    + 
Sbjct: 270 PHTVAVDSGG--NVFIADTSNHRVRRVDAATRQ---ITTVAGIGTAGFNGDGGPAIGVQL 324

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           + P   AVD  G +++AD SN+ +RK++++
Sbjct: 325 NSPVGVAVDSGGGLFIADASNYRVRKVSDV 354



 Score = 35.8 bits (81), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 16/51 (31%), Positives = 32/51 (62%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG G+ G++ D +P  +A+ + P   AVD  GN+++ D +N  +R++
Sbjct: 130 ITTVAGTGIAGFNGDNQPAVNAQLNSPIGVAVDAGGNLFITDFNNQRVRRV 180


>gi|221633398|ref|YP_002522623.1| hypothetical protein trd_1418 [Thermomicrobium roseum DSM 5159]
 gi|221156519|gb|ACM05646.1| conserved hypothetical protein [Thermomicrobium roseum DSM 5159]
          Length = 443

 Score = 44.3 bits (103), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 48/108 (44%), Gaps = 17/108 (15%)

Query: 52  DRPGSSDLIVLDSSRSAFYTLSFPLSEESV---VKRL----------AGDGVQGYSDGEP 98
           DR G+ D ++L       +T    L  ++    +KRL           G G  G+ DG P
Sbjct: 333 DRDGTGDTVLLQHPLDVAWTGEELLVADTYNHKIKRLDPVARRCSSWLGTGQPGHEDGPP 392

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLK 146
             ARF +P   A      +YVAD +NH +R I   D    L+R + L+
Sbjct: 393 ERARFWEPSGLATTFD-RVYVADTNNHAVRVI---DRTTGLVRTLELQ 436



 Score = 43.1 bits (100), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 43/177 (24%), Positives = 72/177 (40%), Gaps = 27/177 (15%)

Query: 21  LVSSGLLLEDGYTVTTVIDGHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEE 79
           L  +G+   + + + +V++  Q  +  P  +   P    L+V D+              +
Sbjct: 94  LARTGVTTSEPFALLSVLERPQGPLAFPGKLAVDPSRDRLVVSDTGHHRLVIARL----D 149

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQL 139
             V  + GDG  G  DG    ARF +P+  A+ +    +VAD+ NH IR+I +L  G   
Sbjct: 150 GTVTAVIGDGRPGLVDGTFAEARFREPQGIAL-VGETCFVADRGNHAIRRI-DLAAGTV- 206

Query: 140 IRQINLKPEDCSKSSQSGSGMF------WVTVFSPYLISCKSFWVHLYLYFCSFKGS 190
                   E  + + + G GM        V + SP+     + W    L F +  GS
Sbjct: 207 --------ETLAGTGRLGQGMLSAGPAHQVDLRSPW-----ALWHRRGLLFVAMAGS 250


>gi|125561542|gb|EAZ06990.1| hypothetical protein OsI_29235 [Oryza sativa Indica Group]
          Length = 498

 Score = 44.3 bits (103), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 25/34 (73%)

Query: 134 DHGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFS 167
           D GN+LIRQINLK EDC++ +Q G G   V++ +
Sbjct: 264 DRGNRLIRQINLKREDCARETQPGLGTTSVSIIA 297


>gi|116620348|ref|YP_822504.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116223510|gb|ABJ82219.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 915

 Score = 44.3 bits (103), Expect = 0.027,   Method: Composition-based stats.
 Identities = 22/79 (27%), Positives = 40/79 (50%), Gaps = 6/79 (7%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVD 112
           G  +L + D+   +   ++     + V+  + G+G  G S  +G   SAR   P+   VD
Sbjct: 549 GKGNLYIADTGNHSIRKVT----ADGVISTVVGNGTVGASGDEGAAASARLASPRGLTVD 604

Query: 113 MKGNIYVADKSNHVIRKIT 131
             GN+Y+ D  N+ +R++T
Sbjct: 605 DNGNLYIGDTGNNRVRQVT 623



 Score = 41.2 bits (95), Expect = 0.22,   Method: Composition-based stats.
 Identities = 23/52 (44%), Positives = 32/52 (61%), Gaps = 4/52 (7%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  +AG G+ G+S G+ G   +AR + P   A+D  GNIYVAD  N  +R+I
Sbjct: 82  VTTIAGTGIAGFS-GDGGLAVNARLNLPYGLALDDFGNIYVADLGNERVRRI 132



 Score = 40.0 bits (92), Expect = 0.53,   Method: Composition-based stats.
 Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 3/51 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AG G  G  DG P +A +F      A D  GN+YVAD +NH +RK++
Sbjct: 29  IDTVAGSGRIG--DGGPATAAQFSDMSGIAADRLGNLYVADTNNHRVRKVS 77



 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 8/83 (9%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS--ARFDKPKS 108
           +DR G+  L ++D+S          L+ +  V+ +AG+G  GY+     +  A+   P +
Sbjct: 491 LDRGGT--LYIVDTSNHRV----LRLAADGTVQTVAGNGSGGYAGDGGSARFAQLRVPGA 544

Query: 109 FAVDMKGNIYVADKSNHVIRKIT 131
            A D KGN+Y+AD  NH IRK+T
Sbjct: 545 CAFDGKGNLYIADTGNHSIRKVT 567



 Score = 38.1 bits (87), Expect = 1.7,   Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 28/56 (50%), Gaps = 3/56 (5%)

Query: 81  VVKRLAGDGVQGYSDGEP---GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           V+  LAG GV G   G+    G A  + P   AVD  GN+ VAD  NH +  I  L
Sbjct: 402 VMTTLAGTGVVGRDTGDAVPAGIAPLNSPMGLAVDGAGNVMVADAFNHRVVAINAL 457



 Score = 36.2 bits (82), Expect = 5.9,   Method: Composition-based stats.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LS +  +  +AG G+ G   DG     A+ + P   A D  G +Y+AD  N+V+RKI
Sbjct: 188 LSADGRLVTVAGTGIAGLGGDGFASVKAQVNYPAGLAFDRAGALYIADSGNNVVRKI 244


>gi|290995070|ref|XP_002680154.1| predicted protein [Naegleria gruberi]
 gi|284093774|gb|EFC47410.1| predicted protein [Naegleria gruberi]
          Length = 636

 Score = 44.3 bits (103), Expect = 0.028,   Method: Composition-based stats.
 Identities = 34/102 (33%), Positives = 51/102 (50%), Gaps = 8/102 (7%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   ++  +AG G QGYS DG    SA+   P   A+  +G IYVAD +N+ IRKIT  
Sbjct: 457 ITTSGIISTIAGTGTQGYSGDGSSATSAQLYNPYGVAISSRGEIYVADYNNNRIRKITT- 515

Query: 134 DHGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFSPYLISCKS 175
              + +I  I       S  S  GS      +++PY ++  S
Sbjct: 516 ---SGIISTI--AGTGTSGYSGDGSSAISAQLYNPYGVAISS 552



 Score = 42.7 bits (99), Expect = 0.076,   Method: Composition-based stats.
 Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   ++  +AG G  GYS DG    SA+ + P   A+   G IY+AD +N+ IRKIT
Sbjct: 401 ITTSGIISTIAGTGTSGYSGDGSSATSAQLNSPYGIAISSSGEIYIADTNNNRIRKIT 458



 Score = 42.4 bits (98), Expect = 0.086,   Method: Composition-based stats.
 Identities = 20/56 (35%), Positives = 30/56 (53%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   ++  +AG+G  GYS     SA+   P   A+      Y+AD +NH IRKIT
Sbjct: 129 ITTSGIISTIAGNGTAGYSGDVATSAKLYYPSGIAISSSDETYIADTNNHRIRKIT 184



 Score = 39.7 bits (91), Expect = 0.55,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   ++  +AG+G  GYS DG    SA+ + P   A+     IY+AD  N+ IRKIT
Sbjct: 239 ITTSGIISTIAGNGTAGYSGDGSSATSAQLNSPSGIAISSSDEIYIADMFNNRIRKIT 296



 Score = 39.3 bits (90), Expect = 0.71,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   ++  +AG+G  GYS DG    SA+   P   A+     IY+ D+SN+ IRKIT
Sbjct: 73  ITTSGIISTIAGNGTAGYSGDGSSAKSAQLYYPSGVAISSSDEIYIVDRSNNRIRKIT 130



 Score = 39.3 bits (90), Expect = 0.71,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++   ++  +AG+G  GYS DG    SA+   P   A+     IY+ D+SN+ IRKIT
Sbjct: 183 ITTSGIISTIAGNGTAGYSGDGSSAKSAQLYYPSGVAISSSDEIYIVDRSNNRIRKIT 240



 Score = 37.0 bits (84), Expect = 4.3,   Method: Composition-based stats.
 Identities = 25/85 (29%), Positives = 40/85 (47%), Gaps = 8/85 (9%)

Query: 64  SSRSAFYTLSF------PLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKG 115
           SSR   Y   +       ++   ++  +AG G  GYS DG    SA+   P   A+    
Sbjct: 495 SSRGEIYVADYNNNRIRKITTSGIISTIAGTGTSGYSGDGSSAISAQLYNPYGVAISSSD 554

Query: 116 NIYVADKSNHVIRKITNLDHGNQLI 140
            IY+ D +N+ IRK++    GN ++
Sbjct: 555 EIYITDTNNNRIRKLSPWCTGNAIL 579


>gi|374849371|dbj|BAL52388.1| PKD domain protein [uncultured candidate division OP1 bacterium]
 gi|374857216|dbj|BAL60069.1| hypothetical protein HGMM_OP4C705 [uncultured candidate division
           OP1 bacterium]
          Length = 635

 Score = 44.3 bits (103), Expect = 0.028,   Method: Composition-based stats.
 Identities = 31/85 (36%), Positives = 43/85 (50%), Gaps = 16/85 (18%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV----QGYSDG-EPGSARFD 104
           ++DR G+     +   RS F         E +  R   DG+    Q   DG EPG+  F 
Sbjct: 271 LVDRQGNRSEQSVLRQRSTF---------EFIAGRKVADGLFFLDQFGQDGSEPGA--FR 319

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRK 129
           +P+  AVD KG+IYVAD  NH I++
Sbjct: 320 RPQGIAVDSKGSIYVADTENHRIQR 344



 Score = 41.2 bits (95), Expect = 0.21,   Method: Composition-based stats.
 Identities = 16/32 (50%), Positives = 20/32 (62%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           PG  +F+ P   AVD  GN+YV D  NH I+K
Sbjct: 384 PGDGQFNGPTDIAVDAAGNVYVVDSGNHRIQK 415



 Score = 38.5 bits (88), Expect = 1.5,   Method: Composition-based stats.
 Identities = 15/33 (45%), Positives = 23/33 (69%)

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           + G  +F +P++ AVD  GN+YV+D  NH I+K
Sbjct: 502 QTGDGQFFEPQAIAVDGAGNVYVSDTGNHRIQK 534


>gi|291612976|ref|YP_003523133.1| NHL repeat-containing protein [Sideroxydans lithotrophicus ES-1]
 gi|291583088|gb|ADE10746.1| NHL repeat-containing protein [Sideroxydans lithotrophicus ES-1]
          Length = 385

 Score = 44.3 bits (103), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 2/46 (4%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LA  GV G SDG   +ARF++P     D   N+YVAD +NH IRKI
Sbjct: 285 LASSGV-GSSDGIGANARFNQPMGITTDGS-NLYVADTNNHTIRKI 328


>gi|296128041|ref|YP_003635291.1| alkyl hydroperoxide reductase [Cellulomonas flavigena DSM 20109]
 gi|296019856|gb|ADG73092.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Cellulomonas flavigena DSM 20109]
          Length = 639

 Score = 43.9 bits (102), Expect = 0.028,   Method: Composition-based stats.
 Identities = 33/131 (25%), Positives = 61/131 (46%), Gaps = 21/131 (16%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   ++ PG + L+    + +  +TL+   ++   + R  G G +G  DG P  ARF +P
Sbjct: 181 PAKAVELPGGTFLV----ADAGHHTLTEVAADGETLVRRIGSGERGLVDGGPDDARFSEP 236

Query: 107 KSFAV---DMKG----NIYVADKSNHVIRKITNLD--------HGNQLIRQI--NLKPED 149
               +   +++G    ++ VAD  NH +R +   D         G Q +     N++P D
Sbjct: 237 NGLGLVPDELRGRVGYDVLVADTVNHALRGVRLSDGHVTTLAGTGEQYMVGAVDNVRPAD 296

Query: 150 CSKSSQSGSGM 160
              ++ +GSG+
Sbjct: 297 GDTANPAGSGV 307



 Score = 40.8 bits (94), Expect = 0.30,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 15/80 (18%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           V  LAG   +G  DG P  A F +P   +VD  G +++AD     +R +           
Sbjct: 353 VTHLAGTMNEGLVDGTPQEAWFAQPSGLSVDAAGRLWLADAETSALRWV----------- 401

Query: 142 QINLKPEDCSKSSQSGSGMF 161
                P D S  S  G+G+F
Sbjct: 402 ----DPADASVHSVVGTGLF 417


>gi|423301816|ref|ZP_17279839.1| hypothetical protein HMPREF1057_02980 [Bacteroides finegoldii
           CL09T03C10]
 gi|408470907|gb|EKJ89439.1| hypothetical protein HMPREF1057_02980 [Bacteroides finegoldii
           CL09T03C10]
          Length = 469

 Score = 43.9 bits (102), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 24/41 (58%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +GY DG    A FD P   AVD  G IYVAD  N+ IR+I
Sbjct: 426 TEGYQDGLISEALFDDPMGIAVDKNGVIYVADSDNYAIRRI 466



 Score = 36.6 bits (83), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 95  DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           DG+ G +  + P    +D + NIYVAD+ N  IRKI+ LD G
Sbjct: 375 DGDLGGSTLNHPCQICMDSQNNIYVADRYNCCIRKIS-LDAG 415


>gi|399034109|ref|ZP_10732477.1| gluconolactonase [Flavobacterium sp. CF136]
 gi|398067543|gb|EJL59038.1| gluconolactonase [Flavobacterium sp. CF136]
          Length = 418

 Score = 43.9 bits (102), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 28/108 (25%), Positives = 48/108 (44%), Gaps = 15/108 (13%)

Query: 38  IDGHQLEINPHSVIDRP----------GSSDLIVLDSSRSAFYTLS-----FPLSEESVV 82
           ++G+  +  P  V+ RP          G++  + LDS  + + +         L+   ++
Sbjct: 212 VNGYIYKFTPSGVLTRPRNNEGALIRVGAASGMCLDSKNNLYISEDATSKIMKLTPTGIL 271

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
              AG    G+ +G    A+F KP     D   N+YV D+ N+ IRKI
Sbjct: 272 TIFAGANGTGFQNGNIKEAKFSKPSGIVFDKMDNLYVVDRYNNRIRKI 319



 Score = 41.6 bits (96), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 18/56 (32%), Positives = 31/56 (55%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  V  +AG G++G  DG    A F+ P+   +D   N+Y+ +  N+ IR IT
Sbjct: 319 IAVDGTVSTVAGSGIRGNKDGITSEATFNFPQEITIDKYNNLYIVEPDNNRIRMIT 374


>gi|116672456|ref|YP_833389.1| NHL repeat-containing protein [Arthrobacter sp. FB24]
 gi|116612565|gb|ABK05289.1| NHL repeat containing protein [Arthrobacter sp. FB24]
          Length = 672

 Score = 43.9 bits (102), Expect = 0.030,   Method: Composition-based stats.
 Identities = 35/110 (31%), Positives = 52/110 (47%), Gaps = 4/110 (3%)

Query: 29  EDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTL-SF-PLSEESVVKRLA 86
           ED    T  +  H L++   S  D   S+ L  +  + +  + + SF P+S    V  LA
Sbjct: 323 EDAAGFTGRLSDHPLDVALSSPWDVVWSAKLNAVVVAMAGVHQIFSFDPIS--GAVSILA 380

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           G+G++G  DG    A F +    A D  GNI+VAD     +RK+   D G
Sbjct: 381 GNGLEGLLDGAAHEAWFAQSSGLAEDADGNIWVADSETSALRKLVIDDAG 430


>gi|308178347|ref|YP_003917753.1| NHL repeat-containing protein [Arthrobacter arilaitensis Re117]
 gi|307745810|emb|CBT76782.1| NHL repeat-containing protein [Arthrobacter arilaitensis Re117]
          Length = 635

 Score = 43.9 bits (102), Expect = 0.031,   Method: Composition-based stats.
 Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 11/89 (12%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK-------PK 107
            + + +V DS       L+   S+ S V R  G GV+GY+DG+  +A+F++       P 
Sbjct: 189 ANGNFLVGDSGHHRLVELA---SDLSTVVRTIGAGVKGYADGDAQTAQFNELQGLTALPA 245

Query: 108 SFAVDMKGNIYVADKSNHVIRKITNLDHG 136
             A ++  ++ VAD  NH +R + NL+ G
Sbjct: 246 ELAAEVGYDVIVADTVNHRLRSV-NLETG 273



 Score = 37.7 bits (86), Expect = 2.6,   Method: Composition-based stats.
 Identities = 24/85 (28%), Positives = 40/85 (47%), Gaps = 5/85 (5%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           +P  V+  P    +I +  +   F   +F     S V   AG G++G +DG P  A F +
Sbjct: 312 SPWDVLYHPSGQVIIAMAGTHQIF---AFN-PRTSAVSVFAGTGLEGLADGAPDEAWFAQ 367

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
               A+D + N+++AD     +R I
Sbjct: 368 SSGLALDGQ-NVWIADSETSALRWI 391


>gi|384248465|gb|EIE21949.1| hypothetical protein COCSUDRAFT_16964 [Coccomyxa subellipsoidea
           C-169]
          Length = 1021

 Score = 43.9 bits (102), Expect = 0.032,   Method: Composition-based stats.
 Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 3/73 (4%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           + D+IV DS       L  P ++   ++ +AG G  G +DG  G+AR  +P       +G
Sbjct: 773 NGDVIVADSYNHRLKLLD-PATDS--IRTVAGSGAAGLTDGTGGNARVSEPGGLCAGPRG 829

Query: 116 NIYVADKSNHVIR 128
            +++AD +N  IR
Sbjct: 830 TVFIADTNNSAIR 842


>gi|290975165|ref|XP_002670314.1| predicted protein [Naegleria gruberi]
 gi|284083871|gb|EFC37570.1| predicted protein [Naegleria gruberi]
          Length = 1504

 Score = 43.9 bits (102), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 1/58 (1%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           ++++  V  +AG G+ GYS  G+P  +    P    V   G+IY++D  NH IR+I N
Sbjct: 471 ITQDGFVVNVAGTGISGYSLIGDPLLSYLSNPNGIVVLKNGDIYISDTDNHCIRQIKN 528


>gi|421097867|ref|ZP_15558546.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
           str. 200901122]
 gi|410799150|gb|EKS01231.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
           str. 200901122]
          Length = 357

 Score = 43.9 bits (102), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 30/114 (26%), Positives = 52/114 (45%), Gaps = 9/114 (7%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVA 120
           +LD      +     ++   V+   AG G++   DG   +A F+ P    +D  GNI+V+
Sbjct: 35  ILDCILKECHLCKLKVTNNPVISLFAGTGIKESIDGTTQTASFNTPFGLELDTFGNIFVS 94

Query: 121 DKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFSPYLISCK 174
           D++ ++IRKI          R  N+     S   Q+ SG+ +  +     +SCK
Sbjct: 95  DQTANLIRKID---------RFGNVTTLSTSLVFQNPSGIKFDPITGDKYVSCK 139



 Score = 39.3 bits (90), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 30/48 (62%), Gaps = 1/48 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           GV G+ +G+  ++ F  P    +D + N+YV + SNH IRKI NL+ G
Sbjct: 162 GVDGFQNGDRLNSLFKSPFFMDLDRERNLYVGELSNHAIRKI-NLNSG 208



 Score = 39.3 bits (90), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           V  L+G GV GY DG+  SA+F  P     D K + + VAD  NH IRKI
Sbjct: 210 VSTLSG-GVLGYLDGDLASAQFKSPLGITYDQKTDSLLVADIQNHNIRKI 258


>gi|332668794|ref|YP_004451801.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Cellulomonas fimi ATCC 484]
 gi|332337831|gb|AEE44414.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Cellulomonas fimi ATCC 484]
          Length = 639

 Score = 43.9 bits (102), Expect = 0.034,   Method: Composition-based stats.
 Identities = 28/91 (30%), Positives = 45/91 (49%), Gaps = 11/91 (12%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG + L+V D+       L+     E++V+R+ G G +G+ DG P  ARF +P
Sbjct: 181 PAKAVALPGGT-LLVADAGHHGLAELA--ADGETLVRRI-GSGERGFVDGGPDEARFSEP 236

Query: 107 KSFAVDMKG-------NIYVADKSNHVIRKI 130
              A+   G       ++ VAD  NH +R +
Sbjct: 237 NGLALVPDGLRATLGYDVLVADTVNHALRGV 267


>gi|449684055|ref|XP_002166330.2| PREDICTED: NHL repeat-containing protein 2-like [Hydra
           magnipapillata]
          Length = 193

 Score = 43.5 bits (101), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 60/126 (47%), Gaps = 20/126 (15%)

Query: 28  LEDGYTVTTVIDGHQLEINPHS-----VIDRPGSSDLIVLDSSRSAFYTLSFP------L 76
           L+ GY  T    G   EI  ++     +I  P   + +V D   +  Y   F       +
Sbjct: 16  LKTGYVTTLSSQGFGAEIGLNNQPVNLIIHYP---EGLVYDPDYNLLYVCEFENHNIRVV 72

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT---- 131
           S + +VK LAG+  +G  DG   +ARF  P   A D K   +YV+D+ NH++R +T    
Sbjct: 73  SMQGLVKTLAGE-FKGKKDGVGRTARFYYPSGLAFDRKNRMLYVSDQYNHLVRTVTAIGS 131

Query: 132 NLDHGN 137
           N+++ N
Sbjct: 132 NVEYSN 137


>gi|427719807|ref|YP_007067801.1| NHL repeat containing protein [Calothrix sp. PCC 7507]
 gi|427352243|gb|AFY34967.1| NHL repeat containing protein [Calothrix sp. PCC 7507]
          Length = 505

 Score = 43.5 bits (101), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 30/44 (68%), Gaps = 1/44 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           GDG  G  DGE  ++RF +P   +V +  ++YVAD +NHVIR++
Sbjct: 437 GDGSAGLQDGEGKNSRFFEPSGLSV-INSHLYVADTNNHVIRRV 479



 Score = 41.2 bits (95), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 44/89 (49%), Gaps = 8/89 (8%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           +S L + DS        SF    +  +  + G G  G+ DG+   A+F  P+  A D + 
Sbjct: 184 ASGLFIADSGHHRLVRSSF----DGEILHVIGTGKPGFIDGDFSEAQFFAPQGMAFDEEN 239

Query: 116 NI-YVADKSNHVIRKITNLDHGNQLIRQI 143
            I Y+AD  NH +R++   D  +QL++ I
Sbjct: 240 QILYIADTENHALRRV---DLRHQLVKTI 265


>gi|427731644|ref|YP_007077881.1| thiol-disulfide isomerase-like thioredoxin [Nostoc sp. PCC 7524]
 gi|427367563|gb|AFY50284.1| thiol-disulfide isomerase-like thioredoxin [Nostoc sp. PCC 7524]
          Length = 505

 Score = 43.5 bits (101), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + + GDG  G  DG+  SARF +P   +  M  N+Y+ D +NH IR++
Sbjct: 433 QTILGDGNAGLQDGQGKSARFFEPSGLSA-MDSNLYICDTNNHAIRRV 479



 Score = 40.0 bits (92), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 1/51 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           V  L G G  G++DG    A+F  P+    D    I YVAD  NH +R+I 
Sbjct: 206 VLHLIGTGKSGFTDGAFDEAQFSAPQGMTFDAANQILYVADTDNHAVRQIN 256


>gi|17228508|ref|NP_485056.1| hypothetical protein alr1013 [Nostoc sp. PCC 7120]
 gi|17130359|dbj|BAB72970.1| alr1013 [Nostoc sp. PCC 7120]
          Length = 503

 Score = 43.5 bits (101), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 39/85 (45%), Gaps = 7/85 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+  P  S L V DS         F       +  L G+G  G +DG    A+F  P
Sbjct: 175 PGKVLATP--SGLFVADSGHHRIVVSDF----NGEILHLIGNGKSGLTDGNFQEAQFSAP 228

Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI 130
           +  A DM+  I Y+AD  NH +R++
Sbjct: 229 QGMAFDMENQILYLADTENHALRRV 253



 Score = 36.6 bits (83), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 15/48 (31%), Positives = 30/48 (62%), Gaps = 1/48 (2%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + + GDG+ G  +G+  + RF +P   +  +   +Y++D +NHVIR++
Sbjct: 431 QTVLGDGLAGLQNGQGKNTRFFEPSGLSA-IDSYLYISDTNNHVIRRV 477


>gi|326432343|gb|EGD77913.1| NHL repeat protein [Salpingoeca sp. ATCC 50818]
          Length = 1505

 Score = 43.5 bits (101), Expect = 0.041,   Method: Composition-based stats.
 Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 81   VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            V+  +AG GV G+S   G P  AR D P+  AV   G++ +AD  NH +R+
Sbjct: 1391 VITTVAGTGVAGFSGDGGLPTDARLDTPRGVAVHSSGSLAIADSGNHRVRE 1441


>gi|222056335|ref|YP_002538697.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
 gi|221565624|gb|ACM21596.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
          Length = 372

 Score = 43.5 bits (101), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 2/56 (3%)

Query: 76  LSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           L+    V  LAG   Q G +DG   +ARF+ P     D   N+YVAD  N++IRK+
Sbjct: 44  LTLTGTVSVLAGQAPQMGTADGTGSAARFNAPSGITTDGT-NLYVADTGNNLIRKV 98



 Score = 39.3 bits (90), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 44/95 (46%), Gaps = 7/95 (7%)

Query: 40  GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ---GYSDG 96
           G     N  S I   G+ +L V D+  +    +       + +    G G     G +DG
Sbjct: 67  GSAARFNAPSGITTDGT-NLYVADTGNNLIRKVVITTGAVTTLAGTVGTGTAQTSGSTDG 125

Query: 97  EPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
              +A+F+ P  FA+   G N+YVAD +N+ IRK+
Sbjct: 126 TGSAAKFNAP--FAITTDGTNLYVADTNNNTIRKV 158


>gi|302821969|ref|XP_002992645.1| hypothetical protein SELMODRAFT_6806 [Selaginella moellendorffii]
 gi|300139609|gb|EFJ06347.1| hypothetical protein SELMODRAFT_6806 [Selaginella moellendorffii]
          Length = 79

 Score = 43.5 bits (101), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 28/43 (65%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  GY DG    +RF++P+S A+   G ++VAD +N  IRKI+
Sbjct: 15  GESGYVDGPAAKSRFNRPQSLAICDNGAVFVADTTNLAIRKIS 57


>gi|145487484|ref|XP_001429747.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124396841|emb|CAK62349.1| unnamed protein product [Paramecium tetraurelia]
          Length = 450

 Score = 43.5 bits (101), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 78  EESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           ++ + K +AG  G  G+ DG  G   F+ P    +D+ GNI+V D +NH IR IT   + 
Sbjct: 190 QKKITKTIAGICGQSGFKDGLLGMNLFNTPDQMGIDVLGNIFVNDFNNHFIRMITLDGYV 249

Query: 137 NQLI 140
           N LI
Sbjct: 250 NTLI 253


>gi|116625892|ref|YP_828048.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116229054|gb|ABJ87763.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 2384

 Score = 43.5 bits (101), Expect = 0.044,   Method: Composition-based stats.
 Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 2/55 (3%)

Query: 78  EESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  ++  +AG GVQG++   G  GSA+   P   A D  GN+Y+AD  N  IR+I
Sbjct: 138 QTGMISTVAGTGVQGFAGDGGLAGSAQLSHPAGIAFDSAGNLYIADMGNFRIRRI 192


>gi|290985160|ref|XP_002675294.1| predicted protein [Naegleria gruberi]
 gi|284088889|gb|EFC42550.1| predicted protein [Naegleria gruberi]
          Length = 1229

 Score = 43.5 bits (101), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKITN 132
           +V   +G  + GY DG   +ARF  P    +D   G++YV+D  N VIRKI+N
Sbjct: 608 IVSTFSGGTISGYQDGPIQTARFYFPNGLDIDTTNGDLYVSDTFNCVIRKISN 660


>gi|238062013|ref|ZP_04606722.1| LOW QUALITY PROTEIN: alkyl hydroperoxide reductase/thiol specific
           antioxidant/Mal allergen [Micromonospora sp. ATCC 39149]
 gi|237883824|gb|EEP72652.1| LOW QUALITY PROTEIN: alkyl hydroperoxide reductase/thiol specific
           antioxidant/Mal allergen [Micromonospora sp. ATCC 39149]
          Length = 582

 Score = 43.5 bits (101), Expect = 0.045,   Method: Composition-based stats.
 Identities = 30/97 (30%), Positives = 54/97 (55%), Gaps = 16/97 (16%)

Query: 47  PHSVIDRPGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS 100
           P + +  PG +      +L+V DS+R +   L+ P   E++V+R+ G GV+G +DG  G+
Sbjct: 139 PETALRFPGKAIALDGGNLLVSDSARHSLVELA-P-DGETLVRRI-GSGVRGRADGPAGA 195

Query: 101 ARFDKPKSF------AVDMKG-NIYVADKSNHVIRKI 130
           A F +P+          ++ G ++ VAD  NH++R +
Sbjct: 196 AAFAEPQGLCRLPEHVAEVAGYDVVVADTVNHLLRGV 232


>gi|389875076|ref|YP_006374432.1| NHL repeat-containing protein [Tistrella mobilis KA081020-065]
 gi|388532256|gb|AFK57450.1| NHL repeat-containing protein [Tistrella mobilis KA081020-065]
          Length = 487

 Score = 43.5 bits (101), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 34/72 (47%), Gaps = 5/72 (6%)

Query: 81  VVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQ 138
            V  L GDG+   G  DG   +ARF  P     DM GNI VAD  N  IR +   D    
Sbjct: 372 TVSTLVGDGLFEFGMVDGPARTARFQHPLGLCFDMNGNILVADAYNDAIRVV---DPETG 428

Query: 139 LIRQINLKPEDC 150
           ++  ++  P DC
Sbjct: 429 MVSSLDDPPYDC 440



 Score = 40.8 bits (94), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           R  G G  G  DG   SARF +P+   +   G I+VAD  NH +R+I
Sbjct: 210 RRVGSGEPGLVDGPAESARFQRPQGL-IAADGAIFVADTWNHAVRRI 255


>gi|407780651|ref|ZP_11127872.1| NHL repeat-containing protein [Oceanibaculum indicum P24]
 gi|407208878|gb|EKE78785.1| NHL repeat-containing protein [Oceanibaculum indicum P24]
          Length = 498

 Score = 43.5 bits (101), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G G  G++ G    ARF  P+  A D  G IYVAD  NH IR+I
Sbjct: 217 GSGEAGFASGPADRARFRDPQGLAADT-GAIYVADTGNHAIRRI 259


>gi|290983036|ref|XP_002674235.1| predicted protein [Naegleria gruberi]
 gi|284087824|gb|EFC41491.1| predicted protein [Naegleria gruberi]
          Length = 737

 Score = 43.5 bits (101), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 33/53 (62%), Gaps = 4/53 (7%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AG G  G+S G+ G   +A+   P++  V  +G I +AD SNHVIRKIT
Sbjct: 90  IHTIAGTGTSGHS-GDGGLAVNAKLFHPRTVTVTSEGEILIADTSNHVIRKIT 141


>gi|326435583|gb|EGD81153.1| hypothetical protein PTSG_13151 [Salpingoeca sp. ATCC 50818]
          Length = 1551

 Score = 43.5 bits (101), Expect = 0.048,   Method: Composition-based stats.
 Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 81   VVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            V+  +AG GV G+S   G P  AR D P+  AV   G++ +AD  NH +R+
Sbjct: 1437 VITTVAGTGVAGFSGDGGLPTDARLDTPRGVAVHSSGSLAIADSGNHRVRE 1487


>gi|433458786|ref|ZP_20416675.1| NHL repeat containing protein [Arthrobacter crystallopoietes
           BAB-32]
 gi|432192788|gb|ELK49609.1| NHL repeat containing protein [Arthrobacter crystallopoietes
           BAB-32]
          Length = 680

 Score = 43.1 bits (100), Expect = 0.048,   Method: Composition-based stats.
 Identities = 23/60 (38%), Positives = 33/60 (55%), Gaps = 2/60 (3%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           PL+ E  V  LAG G++G  DGE  +A F +    A D  G+I++AD     +R +T  D
Sbjct: 378 PLTGELAV--LAGTGLEGLQDGEANAAWFAQSSGLAQDANGSIWIADSETSALRVLTFSD 435


>gi|345482264|ref|XP_001607897.2| PREDICTED: NHL repeat-containing protein 2-like [Nasonia
           vitripennis]
          Length = 768

 Score = 43.1 bits (100), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + ++  V+ + G    G+ DG+  +ARF+ P+   V +K +++VAD  NH IRKI 
Sbjct: 329 MGKDGKVEHVVGGYSPGFKDGDFETARFNAPQGVCV-LKDDVFVADNENHAIRKIN 383


>gi|283781091|ref|YP_003371846.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
 gi|283439544|gb|ADB17986.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
          Length = 782

 Score = 43.1 bits (100), Expect = 0.048,   Method: Composition-based stats.
 Identities = 21/44 (47%), Positives = 28/44 (63%), Gaps = 1/44 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G GV G +DG    A FD P+  A+D +  +YVAD  NH++RKI
Sbjct: 332 GSGVIGKADGSFAEASFDHPQGCALDGE-TLYVADTENHLLRKI 374



 Score = 40.0 bits (92), Expect = 0.42,   Method: Composition-based stats.
 Identities = 23/49 (46%), Positives = 32/49 (65%), Gaps = 4/49 (8%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           VK +AG G  G SD +P  A FD+P   A+ +   +YVAD +NH+IRK+
Sbjct: 578 VKTIAGSGSPGTSD-DP--ATFDEPAGLAL-VGETLYVADTNNHLIRKV 622


>gi|434403812|ref|YP_007146697.1| thiol-disulfide isomerase-like thioredoxin [Cylindrospermum
           stagnale PCC 7417]
 gi|428258067|gb|AFZ24017.1| thiol-disulfide isomerase-like thioredoxin [Cylindrospermum
           stagnale PCC 7417]
          Length = 506

 Score = 43.1 bits (100), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 38/85 (44%), Gaps = 7/85 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+  P    L + DS        SF       V  + G G  G++DG    A+F  P
Sbjct: 177 PGKVLATPAG--LFIADSGHHRLVMSSF----NGEVFHIIGTGKSGFTDGNFSEAQFSAP 230

Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI 130
           +  A D +  I YVAD  NH +RK+
Sbjct: 231 QGMAYDAENQILYVADTENHALRKV 255


>gi|163841552|ref|YP_001625957.1| hypothetical protein RSal33209_2819 [Renibacterium salmoninarum
           ATCC 33209]
 gi|162955028|gb|ABY24543.1| conserved hypothetical protein [Renibacterium salmoninarum ATCC
           33209]
          Length = 425

 Score = 43.1 bits (100), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 17/112 (15%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKS-------FAVDMKGNIYVADKSNHVIRKIT 131
           E+VV+   G G +G++DG    ARF++P+         A  +  ++ VAD  NH +R I+
Sbjct: 88  ETVVRAF-GTGTKGFADGAAQMARFNEPQGLVLLPEVLAAQLGYDVVVADSVNHRLRGIS 146

Query: 132 NLD------HGNQLIRQINLKPEDCSKSSQSGSGMFWVTVFSPYLISCKSFW 177
             D       GN + R ++  P   +   +  SG   V    P  I+  S W
Sbjct: 147 LTDGSVRTLAGNGVQRLLDAGPNTVNDDGEGTSG---VLSGDPLTIALSSPW 195


>gi|307202136|gb|EFN81636.1| NHL repeat-containing protein 2 [Harpegnathos saltator]
          Length = 688

 Score = 43.1 bits (100), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 39/73 (53%), Gaps = 5/73 (6%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           LI+ DS  S     +    +   ++ + G   QG+ DG+  SA+F+ P+   + +   IY
Sbjct: 236 LIISDSGNSRILITN----QHGEIEHIIGGTNQGFEDGDFKSAKFNSPQGVCM-LDDVIY 290

Query: 119 VADKSNHVIRKIT 131
           VAD +NH IRKI 
Sbjct: 291 VADNNNHAIRKIN 303


>gi|428225769|ref|YP_007109866.1| NHL repeat containing protein [Geitlerinema sp. PCC 7407]
 gi|427985670|gb|AFY66814.1| NHL repeat containing protein [Geitlerinema sp. PCC 7407]
          Length = 498

 Score = 43.1 bits (100), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 1/53 (1%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           E  V+   G G  GY DG   +A+F  P+    D  G+ +YVAD  NH+IR +
Sbjct: 197 EGKVRATVGSGTPGYQDGPWETAQFRAPQGLTYDPAGDRLYVADTGNHLIRCV 249



 Score = 37.4 bits (85), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 38/75 (50%), Gaps = 4/75 (5%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           G  +L + D+       L+  L+E   V  + G G+ G+ DG    A F +P   A + +
Sbjct: 402 GPGNLWIADTYNHKIKRLN--LAEGHCVN-IVGSGLPGHQDGWGPEASFSEPSGLASEGQ 458

Query: 115 GNIYVADKSNHVIRK 129
             +Y+AD +NH IR+
Sbjct: 459 -TLYIADTNNHAIRR 472


>gi|340370782|ref|XP_003383925.1| PREDICTED: NHL repeat-containing protein 2-like [Amphimedon
           queenslandica]
          Length = 730

 Score = 43.1 bits (100), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 53  RPGSSDLIVL-DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV 111
           R G S+L+ + DSS      +    +   +VK++ G G  G+ DG    A F+ P+   +
Sbjct: 233 RGGVSELLCISDSSNHRVLVVD---AVTGLVKQVYGSGSPGFKDGRGKEAEFNCPQGLVI 289

Query: 112 DMKGNIYVADKSNHVIRKIT 131
             +  +YVAD  NH+IRKI+
Sbjct: 290 -CEECVYVADTENHLIRKIS 308



 Score = 40.0 bits (92), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 4/67 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQL 139
           V   LAG G  G  DG    A+F +P    +  +G+ +YVAD +NH IR    LD   + 
Sbjct: 504 VCSTLAGTGSPGLVDGSFEVAQFSEPAGLCMSQEGDTLYVADTNNHAIRI---LDLKEKK 560

Query: 140 IRQINLK 146
           + Q+ L+
Sbjct: 561 VSQVKLR 567


>gi|290989149|ref|XP_002677205.1| predicted protein [Naegleria gruberi]
 gi|284090811|gb|EFC44461.1| predicted protein [Naegleria gruberi]
          Length = 691

 Score = 43.1 bits (100), Expect = 0.054,   Method: Composition-based stats.
 Identities = 21/63 (33%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKITNLD 134
           ++ +  +  +AG   +GY DG    A+F+ P   A+D + G++YV D  N+ IRKI+   
Sbjct: 333 INRDRTLVTIAGTTEEGYQDGPVRMAKFNNPSDIAIDNETGDVYVVDGGNNYIRKISTTK 392

Query: 135 HGN 137
           + N
Sbjct: 393 YLN 395



 Score = 40.8 bits (94), Expect = 0.28,   Method: Composition-based stats.
 Identities = 22/54 (40%), Positives = 34/54 (62%), Gaps = 4/54 (7%)

Query: 81  VVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++  +AG G +GY DG+ G    AR   P   AV+ +G++Y +D  N+VIRKI+
Sbjct: 226 LIHTVAGFGEKGY-DGDGGFAFYARIRCPTGIAVNQRGDVYFSDSGNNVIRKIS 278


>gi|365864816|ref|ZP_09404493.1| NHL repeat-containing protein [Streptomyces sp. W007]
 gi|364005745|gb|EHM26808.1| NHL repeat-containing protein [Streptomyces sp. W007]
          Length = 292

 Score = 43.1 bits (100), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ +  +  +AG G  GYS   G   SAR + P    VD   N+Y++D +NH +RK+T
Sbjct: 20  VTADGTISTVAGTGTAGYSGDSGAATSARLNLPLGVVVDSADNLYISDYNNHRVRKVT 77


>gi|405960880|gb|EKC26754.1| NHL repeat-containing protein 2 [Crassostrea gigas]
          Length = 699

 Score = 43.1 bits (100), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 25/56 (44%), Positives = 33/56 (58%), Gaps = 3/56 (5%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           LS++ VV+   G   +G+ +G     RFD P+   V MK N IY+AD  NH IRKI
Sbjct: 240 LSKDGVVQHCIGGKERGFKNGTFTQCRFDSPQ--GVVMKENVIYLADTENHAIRKI 293


>gi|285018581|ref|YP_003376292.1| nhl repeat protein [Xanthomonas albilineans GPE PC73]
 gi|283473799|emb|CBA16302.1| hypothetical nhl repeat protein [Xanthomonas albilineans GPE PC73]
          Length = 473

 Score = 43.1 bits (100), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           R  G G   + DGE G A F +P+  A+  +G +YVAD  NH +R+I
Sbjct: 201 RQFGTGTADFIDGELGQAAFHRPQGLAL-QRGVLYVADTGNHALRRI 246


>gi|284097436|ref|ZP_06385541.1| NHL repeat containing protein [Candidatus Poribacteria sp. WGA-A3]
 gi|283831050|gb|EFC35055.1| NHL repeat containing protein [Candidatus Poribacteria sp. WGA-A3]
          Length = 387

 Score = 43.1 bits (100), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 28/44 (63%), Gaps = 9/44 (20%)

Query: 96  GEPG---------SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           GEPG         +A  ++PK+ A+D  GN+Y+AD  NH++RK+
Sbjct: 1   GEPGWGGDAGPATAAMLNEPKTVALDAAGNLYIADSENHLVRKV 44



 Score = 37.0 bits (84), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 35/77 (45%), Gaps = 5/77 (6%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVD 112
           G   L + D S +    L        V+  + G G  GY+ DG PG       P   AVD
Sbjct: 194 GPDRLYIADQSNNRVRKLDL---TSGVMTTVVGTGESGYNGDGAPGPETALAGPSGLAVD 250

Query: 113 MKGNIYVADKSNHVIRK 129
            +GN+Y+AD  +  IRK
Sbjct: 251 QEGNLYIADTFSGRIRK 267


>gi|74317309|ref|YP_315049.1| hypothetical protein Tbd_1291 [Thiobacillus denitrificans ATCC
           25259]
 gi|74056804|gb|AAZ97244.1| conserved hypothetical protein containing twin-arginine
           translocation pathway signal [Thiobacillus denitrificans
           ATCC 25259]
          Length = 510

 Score = 43.1 bits (100), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 26/47 (55%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + G     + DG    ARF+       D  GN+YVAD+ NHVIRKIT
Sbjct: 332 MVGQKQGAFVDGPANEARFNGCSQIDYDNAGNLYVADRFNHVIRKIT 378


>gi|269926393|ref|YP_003323016.1| alkyl hydroperoxide reductase [Thermobaculum terrenum ATCC BAA-798]
 gi|269790053|gb|ACZ42194.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Thermobaculum terrenum ATCC BAA-798]
          Length = 493

 Score = 43.1 bits (100), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 31/48 (64%), Gaps = 3/48 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR 128
           ++R+ G+G +G  DG    A+F+ P+  A+  +GN +YVAD  NH +R
Sbjct: 210 IRRIWGNGEEGLVDGSASEAKFNHPQGMAI--RGNELYVADTENHALR 255


>gi|347757619|ref|YP_004865181.1| NHL repeat family protein [Micavibrio aeruginosavorus ARL-13]
 gi|347590137|gb|AEP09179.1| NHL repeat family protein [Micavibrio aeruginosavorus ARL-13]
          Length = 495

 Score = 43.1 bits (100), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 46/105 (43%), Gaps = 24/105 (22%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFP----------------------LSEESVVKRLAGDGVQ 91
           P  +D+  L  +R+   TLSFP                      L     VK   G G  
Sbjct: 170 PPFTDISTLVPARTKTSTLSFPARIEDAGDFIFVADSGHNRILVLDRTGAVKMTIGSGEL 229

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           G+ DG    A+F+ P+   + +  N+Y+AD  NH++R + NLD G
Sbjct: 230 GHQDGSFVMAKFNHPRGMVL-IGRNLYIADTDNHMLR-VANLDSG 272


>gi|290987128|ref|XP_002676275.1| predicted protein [Naegleria gruberi]
 gi|284089876|gb|EFC43531.1| predicted protein [Naegleria gruberi]
          Length = 1182

 Score = 43.1 bits (100), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 50/102 (49%), Gaps = 11/102 (10%)

Query: 31  GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
           G  +TT+   +   I+P S +  P   +   +DSS     T+ + +S   +V   AG G 
Sbjct: 422 GGIITTITPSY---ISPRSPL-VPNVGEYYFVDSS-----TVKYLISSSGIVITKAGSGT 472

Query: 91  QGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            GY +D  P  SA  + PK  A    G++Y+ D  NH IRK+
Sbjct: 473 TGYNADNIPATSAHLNLPKGIARSTNGDLYITDSGNHRIRKV 514


>gi|320107313|ref|YP_004182903.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
 gi|319925834|gb|ADV82909.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
          Length = 1821

 Score = 42.7 bits (99), Expect = 0.066,   Method: Composition-based stats.
 Identities = 20/38 (52%), Positives = 27/38 (71%), Gaps = 1/38 (2%)

Query: 95  DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           DG P SA RF+ P   A+D  G++Y+AD++NH IR IT
Sbjct: 341 DGGPASAGRFNNPLGLAIDSGGSLYIADQANHRIRMIT 378



 Score = 40.8 bits (94), Expect = 0.26,   Method: Composition-based stats.
 Identities = 20/51 (39%), Positives = 36/51 (70%), Gaps = 4/51 (7%)

Query: 85  LAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +AG GV  Y +G+ G   +A+  +P + AVD+  N+Y+AD +NH++RK+++
Sbjct: 386 VAGTGVPRY-NGDGGLAVAAQLQQPAAVAVDVARNLYIADSNNHLVRKVSS 435



 Score = 36.6 bits (83), Expect = 4.7,   Method: Composition-based stats.
 Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 8/80 (10%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARF---DKPKSFA 110
           G+ DL + D+  +A   L+       ++  +AG  GV G S G+ GSA     + P   A
Sbjct: 244 GAGDLYIADTRNNAIRKLTLA---TGILSTIAGRLGVPG-SSGDGGSATLATLNAPGGLA 299

Query: 111 VDMKGNIYVADKSNHVIRKI 130
           +D  G +Y+AD  N  IR+I
Sbjct: 300 MDAAGFLYIADTGNDTIRRI 319


>gi|197118445|ref|YP_002138872.1| lipoprotein [Geobacter bemidjiensis Bem]
 gi|197087805|gb|ACH39076.1| repeat-containing lipoprotein, putative [Geobacter bemidjiensis
           Bem]
          Length = 652

 Score = 42.7 bits (99), Expect = 0.067,   Method: Composition-based stats.
 Identities = 30/85 (35%), Positives = 42/85 (49%), Gaps = 5/85 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDK 105
           P  +     +S L V D+  +   +++        V  LAG  GV G ++G  G+ARF  
Sbjct: 130 PSGIALSSDNSTLYVSDTGNNTIRSINVA---TGAVTTLAGTAGVVGSTNGTGGAARFSS 186

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKI 130
           P   A D   N+YVAD  NH +RKI
Sbjct: 187 PSGVATD-GANLYVADSLNHRVRKI 210



 Score = 39.7 bits (91), Expect = 0.57,   Method: Composition-based stats.
 Identities = 19/51 (37%), Positives = 32/51 (62%), Gaps = 1/51 (1%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + V  LAG G+Q ++DG   +A F+ P+  A D   ++Y+AD+ N  +R+I
Sbjct: 215 AAVTTLAGSGLQDFADGTGIAASFNSPRGIATD-GVSLYLADQGNSAVRRI 264



 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 23/50 (46%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 82  VKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  LAG  G  G +DG   +ARF  P     D   N+YVAD  N+ IRKI
Sbjct: 477 VSTLAGTAGATGSADGAGSAARFHSPNGLTTD-GTNLYVADSGNNAIRKI 525



 Score = 36.2 bits (82), Expect = 6.2,   Method: Composition-based stats.
 Identities = 28/93 (30%), Positives = 42/93 (45%), Gaps = 11/93 (11%)

Query: 45  INPHSVIDRPGS-----SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG--VQGYSDGE 97
           + P S I+ P       + + V D  R+    +         V  LAGDG    G +D  
Sbjct: 275 VQPASGIESPAGIATDGTSVFVTDLERNNLRKVDIA---TQAVSTLAGDGGGSPGSTDAV 331

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            G+ARF  P    ++  G +YVAD  N ++RK+
Sbjct: 332 GGAARFSAPAGLVLN-AGVLYVADTGNDLLRKV 363



 Score = 36.2 bits (82), Expect = 7.1,   Method: Composition-based stats.
 Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 3/75 (4%)

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN 116
           + L V+D+  S    +       S V  LAG  + G  DG    ARF+ P   A+    +
Sbjct: 83  TSLYVVDTGNSVIRKVVLATGATSTVAGLAG--ITGSGDGTGAVARFNYPSGIALSSDNS 140

Query: 117 -IYVADKSNHVIRKI 130
            +YV+D  N+ IR I
Sbjct: 141 TLYVSDTGNNTIRSI 155


>gi|290989086|ref|XP_002677176.1| predicted protein [Naegleria gruberi]
 gi|284090782|gb|EFC44432.1| predicted protein [Naegleria gruberi]
          Length = 2103

 Score = 42.7 bits (99), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 34/60 (56%), Gaps = 4/60 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSAR---FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +S   ++  +AG+G  G+  G+ G A     ++P   A+   G +Y+AD  NH IRK++N
Sbjct: 525 VSNNGIITTIAGNGTAGFC-GDGGLAVNTCLNRPNGIAISSSGELYIADYGNHRIRKVSN 583



 Score = 39.3 bits (90), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 45/98 (45%), Gaps = 7/98 (7%)

Query: 38  IDGHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
           ID  +  +N PH V   P S ++   D +          +     +  +AG+G   Y+ D
Sbjct: 105 IDATKATVNSPHGVAFHPTSGEVYFADYANHRIR----KIYSNGTITTIAGNGEPNYNGD 160

Query: 96  GEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
             P  SA+   P   AV   G +++AD  N+ IRKI+N
Sbjct: 161 NIPATSAQLGYPIGIAVSSGGEVFIADTLNNRIRKISN 198



 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 32/54 (59%), Gaps = 3/54 (5%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT 131
           ++  +AGDG  GY+ DG   + A  + P   A+D   N  Y+AD +N+ IRK++
Sbjct: 313 IISSIAGDGQSGYNYDGIDATIATLNNPVGIAIDSSNNEFYIADSNNNRIRKVS 366



 Score = 37.0 bits (84), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 37/75 (49%), Gaps = 6/75 (8%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S +LIV +++ S  + +   L+   +V  +AG    G  DG   S  F     F +   G
Sbjct: 790 SENLIVCETTSS--WKIRKILATTGMVSTIAG----GIGDGLNASNAFLVSTLFDISNSG 843

Query: 116 NIYVADKSNHVIRKI 130
            IY+AD  NH IRKI
Sbjct: 844 EIYIADTGNHRIRKI 858


>gi|430749689|ref|YP_007212597.1| copper amine oxidase family protein [Thermobacillus composti KWC4]
 gi|430733654|gb|AGA57599.1| copper amine oxidase family protein [Thermobacillus composti KWC4]
          Length = 187

 Score = 42.7 bits (99), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 26/48 (54%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +AG G+ G  DG      F  P   AV   G IYVAD  N+VIRKI N
Sbjct: 4   IAGTGILGLKDGSASQENFYYPMDVAVSPSGVIYVADTLNYVIRKIEN 51


>gi|269957976|ref|YP_003327765.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Xylanimonas cellulosilytica DSM 15894]
 gi|269306657|gb|ACZ32207.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Xylanimonas cellulosilytica DSM 15894]
          Length = 688

 Score = 42.7 bits (99), Expect = 0.072,   Method: Composition-based stats.
 Identities = 30/91 (32%), Positives = 48/91 (52%), Gaps = 11/91 (12%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   I  PG + L+V D+   +   L+ P   E++V+R+ G G +G  DG P  ARF +P
Sbjct: 178 PAKAITLPGGT-LLVADAGHHSLAELA-P-DGETLVRRI-GSGQRGLLDGGPDEARFAEP 233

Query: 107 KSFAV---DMKG----NIYVADKSNHVIRKI 130
               +   D++G    ++ VAD  NH +R +
Sbjct: 234 NGLCLVPEDLRGRLGYDVVVADTVNHALRGV 264


>gi|406901359|gb|EKD44040.1| NHL repeat containing protein, partial [uncultured bacterium]
          Length = 667

 Score = 42.7 bits (99), Expect = 0.074,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 2/72 (2%)

Query: 61  VLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGN-IY 118
           V D+       ++   S++   K +AG   + G+ DG   +A F+ P +  +D   N +Y
Sbjct: 531 VADTGNEVIRRVTLTGSKQGQTKLIAGSPTESGFKDGTKTAAEFNVPIALTIDSADNYLY 590

Query: 119 VADKSNHVIRKI 130
           VAD+ NH IRK+
Sbjct: 591 VADRDNHAIRKV 602



 Score = 41.2 bits (95), Expect = 0.23,   Method: Composition-based stats.
 Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 5/77 (6%)

Query: 60  IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           I+ D + + + T +F      +    +V  + G G  G  +G   +++F  P + AVD  
Sbjct: 51  IIADGAGNFYVTDTFNGVIRKIDANGIVSTVVGQGGYGDVNGSATTSKFAHPSAVAVDDS 110

Query: 115 GNIYVADKSNHVIRKIT 131
           GN+Y+AD  N  I+K +
Sbjct: 111 GNVYIADAGNGKIKKFS 127


>gi|290986743|ref|XP_002676083.1| predicted protein [Naegleria gruberi]
 gi|284089683|gb|EFC43339.1| predicted protein [Naegleria gruberi]
          Length = 733

 Score = 42.7 bits (99), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 34/55 (61%), Gaps = 4/55 (7%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           +AG+G  G+S  +G   +A+ + P    V     +Y+AD+SNH IRKI  L++GN
Sbjct: 158 IAGNGTSGFSGDNGPATNAQLNNPVGVFVSSNNEVYIADQSNHRIRKI--LENGN 210



 Score = 40.4 bits (93), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 4/58 (6%)

Query: 82  VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           +  +AG G  G+S  +G   +A+   P S  V     +Y+AD+SNH IRKI  L++GN
Sbjct: 43  IATIAGKGTCGFSGDNGPATNAQIYYPSSVFVSSNNEVYIADQSNHRIRKI--LENGN 98



 Score = 39.3 bits (90), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 35/55 (63%), Gaps = 5/55 (9%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           +AG+G  G+S  +G   +A+ ++P S  V     +Y+AD +NH IRKI  L++GN
Sbjct: 269 IAGNGTSGFSGDNGPATNAQLNRPSSVFVS-NNEVYIADLNNHKIRKI--LENGN 320


>gi|153864226|ref|ZP_01997193.1| NHL repeat protein [Beggiatoa sp. SS]
 gi|152146280|gb|EDN72808.1| NHL repeat protein [Beggiatoa sp. SS]
          Length = 313

 Score = 42.7 bits (99), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 50/96 (52%), Gaps = 8/96 (8%)

Query: 40  GHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGE 97
           G QL       ID  GS  L ++D++      +       +++  LAG GV G+S  DG+
Sbjct: 120 GAQLNSPSDIAIDDSGS--LYIIDTNNHRIRKVE---KATNIITTLAG-GVSGFSGDDGQ 173

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
             SA+F+ P   A+  +G++ +AD  NH +R IT +
Sbjct: 174 ALSAQFNTPDGIAISDEGHLLIADSENHRVRMITGV 209



 Score = 38.9 bits (89), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 30/52 (57%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  + G+GV   G   GE   A+ + P   A+D  G++Y+ D +NH IRK+
Sbjct: 99  IITTIVGNGVASFGGDGGEAPGAQLNSPSDIAIDDSGSLYIIDTNNHRIRKV 150



 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 3/50 (6%)

Query: 84  RLAGD-GVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           R+AG  GV G+S   G    A+ + PK   V   G +Y+ DK NH IRK+
Sbjct: 45  RIAGTLGVAGFSGDGGRAVEAKLNNPKRIVVADDGVVYITDKGNHRIRKV 94


>gi|184200016|ref|YP_001854223.1| hypothetical protein KRH_03700 [Kocuria rhizophila DC2201]
 gi|183580246|dbj|BAG28717.1| hypothetical protein [Kocuria rhizophila DC2201]
          Length = 690

 Score = 42.7 bits (99), Expect = 0.075,   Method: Composition-based stats.
 Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 14/104 (13%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+  P  S  +V D+       L    S+ + V+R  G G +G+SDG    ARF++P
Sbjct: 179 PGKVVALPDGS-FMVSDTGHHRLVQLE---SDLTTVRRAIGSGARGWSDGPRDVARFNEP 234

Query: 107 KSFAV-------DMKGNIYVADKSNHVIRKITNLDHGNQLIRQI 143
           +  AV       ++  ++ VAD  NH +R +   D G+  +R +
Sbjct: 235 QGIAVLPGRLRSEVGYDLVVADTVNHRLRGV---DLGSGSVRTL 275



 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 16/46 (34%), Positives = 26/46 (56%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LAG G +G  DG    + F +P   A D  G+++VAD  +  +R++
Sbjct: 345 LAGTGDEGLRDGSAHESWFAQPSGLAQDAHGDVWVADSESSALRRV 390


>gi|398335921|ref|ZP_10520626.1| hypothetical protein LkmesMB_09964 [Leptospira kmetyi serovar
           Malaysia str. Bejo-Iso9]
          Length = 947

 Score = 42.7 bits (99), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G  G +DG   +ARF+ PK  A D    +YVAD  NH IRKI
Sbjct: 722 GTSGNADGVGTAARFNAPKGIATDGT-YLYVADTGNHKIRKI 762


>gi|290975761|ref|XP_002670610.1| predicted protein [Naegleria gruberi]
 gi|284084171|gb|EFC37866.1| predicted protein [Naegleria gruberi]
          Length = 308

 Score = 42.7 bits (99), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+S  +G   +A+F+ P S  V  K  +Y+ D SNH IRKI  L++
Sbjct: 39  ENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITDYSNHRIRKI--LEN 96

Query: 136 GN 137
           GN
Sbjct: 97  GN 98



 Score = 42.7 bits (99), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+S  +G   +A+F+ P S  V  K  +Y+ D SNH IRKI  L++
Sbjct: 151 ENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITDYSNHRIRKI--LEN 208

Query: 136 GN 137
           GN
Sbjct: 209 GN 210



 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+S  +G   +A+   P S  V     +Y+AD  NH IRKI  L++
Sbjct: 95  ENGNIITIAGNGTVGFSGDNGPATNAQLYNPSSVFVSSNNEVYIADFCNHRIRKI--LEN 152

Query: 136 GN 137
           GN
Sbjct: 153 GN 154


>gi|170066897|ref|XP_001868267.1| NHL repeat containing 2 [Culex quinquefasciatus]
 gi|167863075|gb|EDS26458.1| NHL repeat containing 2 [Culex quinquefasciatus]
          Length = 734

 Score = 42.7 bits (99), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 1/54 (1%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           SE +V+ R+ G    G+ DG+   ARF+ P+  A      I+VAD  NH IR+I
Sbjct: 260 SEGTVLHRIGGKK-SGFVDGDFRKARFNAPQGLAFQNDDVIFVADNENHAIRRI 312


>gi|423216521|ref|ZP_17203044.1| hypothetical protein HMPREF1074_04576 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392690731|gb|EIY83987.1| hypothetical protein HMPREF1074_04576 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 589

 Score = 42.7 bits (99), Expect = 0.078,   Method: Composition-based stats.
 Identities = 20/54 (37%), Positives = 31/54 (57%), Gaps = 1/54 (1%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT 131
           ++ V  +AG G +G  DG+  +A+   P    +   GN IY +D  N++IRKIT
Sbjct: 531 DATVSTIAGSGTKGNQDGDGATAQLAMPHGITMTADGNTIYFSDLDNYIIRKIT 584


>gi|336403610|ref|ZP_08584322.1| hypothetical protein HMPREF0127_01635 [Bacteroides sp. 1_1_30]
 gi|335945536|gb|EGN07347.1| hypothetical protein HMPREF0127_01635 [Bacteroides sp. 1_1_30]
          Length = 419

 Score = 42.7 bits (99), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 41/140 (29%), Positives = 61/140 (43%), Gaps = 21/140 (15%)

Query: 56  SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
           S + IV D +++     S       +S +   K +AGDGV+G  Y DGEPG   SA+   
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGEPGNPLSAKVGA 329

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT---NLDHGNQLIRQINLKPEDCSKSSQSGSGMFW 162
               A    G +Y++D + + IRK+T   N D+    +  I        K+  S      
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETI----AGSGKAGFSDGKGLK 385

Query: 163 VTVFSPYLI----SCKSFWV 178
            T   PY I     CK+ +V
Sbjct: 386 ATFNQPYEIIITEDCKTMYV 405


>gi|406909052|gb|EKD49392.1| lipoprotein, partial [uncultured bacterium]
          Length = 552

 Score = 42.7 bits (99), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 5/83 (6%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
           V++R G+    V DS  +    ++    +    ++LAG    G+ +G    A F+ P   
Sbjct: 360 VVNRAGTK-AYVTDSGNNMLREINLSTGK---TRKLAGTRSAGFKNGFGPLATFNGPADL 415

Query: 110 AVDMKG-NIYVADKSNHVIRKIT 131
            +D  G N+YVADK NH IR+I 
Sbjct: 416 DLDAAGANLYVADKGNHAIRRIN 438


>gi|423214677|ref|ZP_17201205.1| hypothetical protein HMPREF1074_02737 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692583|gb|EIY85820.1| hypothetical protein HMPREF1074_02737 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 419

 Score = 42.7 bits (99), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 41/140 (29%), Positives = 61/140 (43%), Gaps = 21/140 (15%)

Query: 56  SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
           S + IV D +++     S       +S +   K +AGDGV+G  Y DGEPG   SA+   
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGEPGNPLSAKVGA 329

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT---NLDHGNQLIRQINLKPEDCSKSSQSGSGMFW 162
               A    G +Y++D + + IRK+T   N D+    +  I        K+  S      
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETI----AGSGKAGFSDGKGLK 385

Query: 163 VTVFSPYLI----SCKSFWV 178
            T   PY I     CK+ +V
Sbjct: 386 ATFNQPYEIIITEDCKTMYV 405


>gi|262405589|ref|ZP_06082139.1| NHL repeat containing protein [Bacteroides sp. 2_1_22]
 gi|294644531|ref|ZP_06722285.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
 gi|294805793|ref|ZP_06764670.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
 gi|345510784|ref|ZP_08790345.1| hypothetical protein BSAG_00303 [Bacteroides sp. D1]
 gi|262356464|gb|EEZ05554.1| NHL repeat containing protein [Bacteroides sp. 2_1_22]
 gi|292640084|gb|EFF58348.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
 gi|294447014|gb|EFG15604.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
 gi|345454355|gb|EEO48593.2| hypothetical protein BSAG_00303 [Bacteroides sp. D1]
          Length = 419

 Score = 42.7 bits (99), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 41/140 (29%), Positives = 61/140 (43%), Gaps = 21/140 (15%)

Query: 56  SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
           S + IV D +++     S       +S +   K +AGDGV+G  Y DGEPG   SA+   
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGEPGNPLSAKVGA 329

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT---NLDHGNQLIRQINLKPEDCSKSSQSGSGMFW 162
               A    G +Y++D + + IRK+T   N D+    +  I        K+  S      
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETI----AGSGKAGFSDGKGLK 385

Query: 163 VTVFSPYLI----SCKSFWV 178
            T   PY I     CK+ +V
Sbjct: 386 ATFNQPYEIIITEDCKTMYV 405


>gi|168024767|ref|XP_001764907.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683943|gb|EDQ70349.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1040

 Score = 42.7 bits (99), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 31/65 (47%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           V  LAG G  G+ DG     +  +P   A+   G +YVAD +N +IR +     G   I 
Sbjct: 818 VTTLAGTGTAGFKDGTAQEGQLSEPAGIALGPDGKLYVADTNNSLIRVLDLQAQGGPKIE 877

Query: 142 QINLK 146
            + LK
Sbjct: 878 TLELK 882


>gi|412985326|emb|CCO20351.1| predicted protein [Bathycoccus prasinos]
          Length = 897

 Score = 42.7 bits (99), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 42/98 (42%), Gaps = 9/98 (9%)

Query: 39  DGHQLEIN---PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           DG  LE     P  V+  P   D+ + D+       +     E   V  LAG G  G +D
Sbjct: 120 DGPSLEARFNKPFGVVASPDGRDIFIADTFNHRIRRIDV---EFGTVTTLAGTGASGSAD 176

Query: 96  G--EPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G  + G A F  P   AV   G  +YVAD  NH IR I
Sbjct: 177 GLAKRGDATFKFPSGVAVSPDGGFVYVADTKNHKIRVI 214



 Score = 36.2 bits (82), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 33/52 (63%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           VV+ +AG G+ GY D   G+ ARF++P    +   G+ ++V+D  N+ IRK+
Sbjct: 220 VVRTVAGSGLTGYHDDLVGTKARFNQPMDVVMHPDGHTLFVSDAMNNCIRKV 271


>gi|302036371|ref|YP_003796693.1| hypothetical protein NIDE1006 [Candidatus Nitrospira defluvii]
 gi|300604435|emb|CBK40767.1| conserved protein of unknown function, contains NHL repeats
           [Candidatus Nitrospira defluvii]
          Length = 404

 Score = 42.7 bits (99), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 14/28 (50%), Positives = 21/28 (75%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            ++PK   VD KGN+Y+AD  NHV+R++
Sbjct: 34  LNEPKGLCVDRKGNLYIADSENHVVRRV 61



 Score = 40.8 bits (94), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 24/37 (64%), Gaps = 1/37 (2%)

Query: 95  DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           DG P   AR + P + AVD  GN+Y+AD  NH +RK+
Sbjct: 137 DGGPARRARLNFPSAVAVDRAGNLYIADTMNHRVRKV 173


>gi|440789641|gb|ELR10946.1| NHL repeat containing 2 isoform 3, putative [Acanthamoeba
           castellanii str. Neff]
          Length = 934

 Score = 42.7 bits (99), Expect = 0.082,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 4/83 (4%)

Query: 47  PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           PH V +DR G   + V DS       LS   +  + +   + D   G  DG+  +ARF  
Sbjct: 298 PHGVAVDREGDR-MFVADSGHHRILVLSLDGAFRTAIG--SDDATTGLVDGDYATARFHS 354

Query: 106 PKSFAVDMKGNIYVADKSNHVIR 128
           P   +   +  +YVAD  NH IR
Sbjct: 355 PLGLSYAGEDKLYVADSENHCIR 377


>gi|29349674|ref|NP_813177.1| hypothetical protein BT_4266 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29341584|gb|AAO79371.1| conserved hypothetical protein with NHL repeat [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 490

 Score = 42.7 bits (99), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 42/89 (47%), Gaps = 4/89 (4%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLS--EESVVKRLAGDGVQGYSDGEPGSARF 103
           NPH +        L + D   +    +   LS  + +V   +   G++GY DG P  A F
Sbjct: 401 NPHQICFTE-DGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGMKGYKDGGPDIAMF 459

Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           + P   AV   G I YVAD  N VIRK++
Sbjct: 460 NHPFGVAVSADGQIVYVADTGNKVIRKLS 488


>gi|397781307|ref|YP_006545780.1| Tripartite motif-containing protein 71 [Methanoculleus bourgensis
            MS2]
 gi|396939809|emb|CCJ37064.1| Tripartite motif-containing protein 71 AltName: Full=Lin-41 homolog
            [Methanoculleus bourgensis MS2]
          Length = 1996

 Score = 42.7 bits (99), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 99   GSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            G+ +FDKP   AVD  G+IYVAD +NH I+K
Sbjct: 1676 GNGQFDKPWGIAVDTAGDIYVADYNNHRIQK 1706



 Score = 42.4 bits (98), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 99   GSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            G+ +FDKP   AVD  G+IYVAD +NH I+K
Sbjct: 1394 GNGQFDKPWGIAVDAAGDIYVADYNNHRIQK 1424



 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 100  SARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            + +FD+P   A D+ GN+YV+D  NH I+K
Sbjct: 1348 NGQFDRPTGIATDISGNVYVSDYYNHRIQK 1377



 Score = 35.8 bits (81), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 21/40 (52%), Gaps = 4/40 (10%)

Query: 89   GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            G  G SD E     F  P   A D  GN+YVAD SNH IR
Sbjct: 1294 GSGGSSDVE----SFSYPTGIATDATGNVYVADYSNHCIR 1329


>gi|290972869|ref|XP_002669173.1| predicted protein [Naegleria gruberi]
 gi|284082716|gb|EFC36429.1| predicted protein [Naegleria gruberi]
          Length = 1010

 Score = 42.7 bits (99), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 38/79 (48%), Gaps = 6/79 (7%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDM 113
           S DL + DSSR       F +     +   AG G  GYS DG     + F+ P   A+  
Sbjct: 75  SDDLFISDSSRHVI----FRMFSNGTISLFAGIGFAGYSKDGYSALDSLFNSPNGIAMSP 130

Query: 114 KGNIYVADKSNHVIRKITN 132
            GN+Y+AD  N  IR ++N
Sbjct: 131 NGNLYIADSQNDKIRIVSN 149


>gi|290972086|ref|XP_002668792.1| predicted protein [Naegleria gruberi]
 gi|284082314|gb|EFC36048.1| predicted protein [Naegleria gruberi]
          Length = 679

 Score = 42.7 bits (99), Expect = 0.082,   Method: Composition-based stats.
 Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+S  +G   +A+F+ P S  V  K  +Y+ D SNH IRKI  L++
Sbjct: 39  ENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITDYSNHSIRKI--LEN 96

Query: 136 GN 137
           GN
Sbjct: 97  GN 98



 Score = 38.5 bits (88), Expect = 1.5,   Method: Composition-based stats.
 Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+S  +G   +A+   P S  V     +Y+ D SNH IRKI  L++
Sbjct: 151 ENGNIITIAGNGTYGFSGDNGPATNAQLYNPYSVFVSSNNEVYITDYSNHRIRKI--LEN 208

Query: 136 GN 137
           GN
Sbjct: 209 GN 210



 Score = 35.8 bits (81), Expect = 9.9,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 5/62 (8%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+S  +G   +A+ ++P S  V     +Y++D+SN  IRKI  L++
Sbjct: 207 ENGNIVTIAGNGNYGFSGDNGPATNAQLNRPNSVFVS-NNEVYISDQSNQRIRKI--LEN 263

Query: 136 GN 137
           GN
Sbjct: 264 GN 265


>gi|428174200|gb|EKX43097.1| hypothetical protein GUITHDRAFT_153351 [Guillardia theta CCMP2712]
          Length = 392

 Score = 42.4 bits (98), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 6/85 (7%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P ++   P  + LIV D       T +    + S   +  G+G +GY DG   +A F+ P
Sbjct: 150 PSALALAPNQATLIVSDLELHQLLTFNMADGKTS---KFVGNGQRGYQDGA--TASFNGP 204

Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI 130
           +       G  I VAD  NHV+RKI
Sbjct: 205 RGLTFSPDGTYILVADTGNHVVRKI 229


>gi|428179990|gb|EKX48859.1| hypothetical protein GUITHDRAFT_105483 [Guillardia theta CCMP2712]
          Length = 388

 Score = 42.4 bits (98), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 30/53 (56%), Gaps = 1/53 (1%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKIT 131
           S V  LAG G  GY DG   SA+F  P   AV  ++  + V+D+ NH IR IT
Sbjct: 60  SQVTLLAGTGEAGYKDGIGSSAKFKSPSGLAVQSLRTGLCVSDRGNHAIRIIT 112



 Score = 38.9 bits (89), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 33/54 (61%), Gaps = 3/54 (5%)

Query: 80  SVVKRLAG--DGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           +VV  +AG  D   G++DGE  +A F+ P+  A+   G+ I VAD  NH IRK+
Sbjct: 120 AVVDTVAGSDDSGAGFADGEGPNALFNHPEGLAMSPDGSFILVADSGNHRIRKV 173


>gi|406947273|gb|EKD78225.1| lipoprotein [uncultured bacterium]
          Length = 883

 Score = 42.4 bits (98), Expect = 0.084,   Method: Composition-based stats.
 Identities = 25/79 (31%), Positives = 44/79 (55%), Gaps = 6/79 (7%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV--DM 113
           + D+ VLD+  +    +      +  ++R+AG G  GY +    +A+F +P+  AV  D 
Sbjct: 50  ADDIYVLDTINNRIEKVK----SDGSLERVAGSGEYGYKNDSALNAQFAEPQDIAVFGDN 105

Query: 114 KGNIYVADKSNHVIRKITN 132
              I++AD +N+VIRK+ N
Sbjct: 106 AEQIFIADTNNNVIRKLEN 124



 Score = 39.7 bits (91), Expect = 0.62,   Method: Composition-based stats.
 Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 3/52 (5%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKIT 131
           VV  L+G+G  GY +G   +A +  PK F  D + G +YVAD  N VIR I+
Sbjct: 833 VVMTLSGNGNVGYVNGTDTAAEWHNPKGF--DFRAGKLYVADTMNDVIRTIS 882


>gi|380694022|ref|ZP_09858881.1| hypothetical protein BfaeM_08588 [Bacteroides faecis MAJ27]
          Length = 490

 Score = 42.4 bits (98), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 42/89 (47%), Gaps = 4/89 (4%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLS--EESVVKRLAGDGVQGYSDGEPGSARF 103
           NPH +        L + D   +    +   LS  + +V   +   G++GY DG P  A F
Sbjct: 401 NPHQICFTE-DGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGMKGYKDGGPDIAMF 459

Query: 104 DKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           + P   AV   G I YVAD  N VIRK++
Sbjct: 460 NHPFGVAVSADGQIVYVADTGNKVIRKLS 488


>gi|428184600|gb|EKX53455.1| hypothetical protein GUITHDRAFT_101156 [Guillardia theta CCMP2712]
          Length = 367

 Score = 42.4 bits (98), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 31/55 (56%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           VV  +AG GV GY +G   + + + P   A+     IYV+D+ NH IR++  ++ 
Sbjct: 12  VVCSIAGTGVPGYYEGNRMTCKMNCPSGIAMGQGDCIYVSDRQNHCIRRLPGIEE 66


>gi|290975268|ref|XP_002670365.1| predicted protein [Naegleria gruberi]
 gi|284083923|gb|EFC37621.1| predicted protein [Naegleria gruberi]
          Length = 1034

 Score = 42.4 bits (98), Expect = 0.086,   Method: Composition-based stats.
 Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 9/78 (11%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVD 112
            ++D++ +D+ R     LS     + +V  LAG    G++ DG P  SA+ + P   ++ 
Sbjct: 15  ANNDILFIDNDR--LRKLS-----QGIVSTLAGTKEAGFNGDGIPAVSAQLNSPLGVSMS 67

Query: 113 MKGNIYVADKSNHVIRKI 130
             G IY+ADK NH IRKI
Sbjct: 68  KGGEIYIADKLNHRIRKI 85


>gi|290997788|ref|XP_002681463.1| predicted protein [Naegleria gruberi]
 gi|284095087|gb|EFC48719.1| predicted protein [Naegleria gruberi]
          Length = 2297

 Score = 42.4 bits (98), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 26/99 (26%), Positives = 46/99 (46%), Gaps = 10/99 (10%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY--SDGEPGSARFD 104
           P+ V   P + +L + D+  +    +      ++ +  + G GV GY    G+   A+  
Sbjct: 240 PNDVAISPTTGELFIADTGNNVIRKVGL----DNKIITVVGTGVSGYLGDGGQAKQAQLS 295

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQI 143
            P S A    G + ++D  N+VIRK+    + N +IR I
Sbjct: 296 SPTSIAFTSAGEMLISDSDNYVIRKV----YSNGIIRTI 330



 Score = 36.2 bits (82), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 78  EESVVKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           E   +    GDG +G S DG  P  A      +  V   G++Y+ADK+ H IRK++N
Sbjct: 42  ERFTITSYVGDGNEGNSGDGVSPSHASIGYVDAIEVGSNGDLYIADKTYHRIRKVSN 98


>gi|395646873|ref|ZP_10434733.1| PKD domain containing protein [Methanofollis liminatans DSM 4140]
 gi|395443613|gb|EJG08370.1| PKD domain containing protein [Methanofollis liminatans DSM 4140]
          Length = 2489

 Score = 42.4 bits (98), Expect = 0.088,   Method: Composition-based stats.
 Identities = 19/44 (43%), Positives = 26/44 (59%)

Query: 86   AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            +G  +  +    PG + F  PK  AVD  GN+YVAD +NH I+K
Sbjct: 1671 SGTFITTWGSEPPGDSEFSYPKGVAVDDAGNVYVADTNNHRIQK 1714



 Score = 38.5 bits (88), Expect = 1.5,   Method: Composition-based stats.
 Identities = 16/46 (34%), Positives = 25/46 (54%)

Query: 87   GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
            G  +  +  G  G   FD+P    VD  GN++V+D  N+ I+K T+
Sbjct: 1151 GTFITKWGSGGTGDGEFDRPHGIVVDADGNVFVSDAGNNNIQKFTS 1196



 Score = 37.4 bits (85), Expect = 2.9,   Method: Composition-based stats.
 Identities = 16/34 (47%), Positives = 24/34 (70%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           G A+F +P+  AVD  G++YVAD  N+ I+K +N
Sbjct: 42  GDAQFIRPEGIAVDPGGDVYVADPGNNRIQKFSN 75



 Score = 37.0 bits (84), Expect = 3.6,   Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           V++  GDG    + G  GS   +F+ P   AVD   NIYV D  NH ++K
Sbjct: 117 VQKFDGDGTYIATWGTKGSGEGQFESPGGIAVDNASNIYVTDTVNHRVQK 166



 Score = 36.2 bits (82), Expect = 6.6,   Method: Composition-based stats.
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 99   GSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            G  +F+ P+  AVD +GN++VAD  NH I+
Sbjct: 1210 GDGQFNVPRGIAVDSRGNVFVADSLNHRIQ 1239


>gi|119961614|ref|YP_949427.1| NHL repeat-containing protein [Arthrobacter aurescens TC1]
 gi|403528903|ref|YP_006663790.1| NHL repeat-containing protein [Arthrobacter sp. Rue61a]
 gi|119948473|gb|ABM07384.1| putative NHL repeat protein [Arthrobacter aurescens TC1]
 gi|403231330|gb|AFR30752.1| NHL repeat-containing protein [Arthrobacter sp. Rue61a]
          Length = 657

 Score = 42.4 bits (98), Expect = 0.089,   Method: Composition-based stats.
 Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 2/62 (3%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           PL+ E  V  LAG G++G  DG+   A F +    A+D + NI+VAD     +R++   D
Sbjct: 359 PLANE--VSILAGSGLEGLLDGKAEEAWFAQSSGLAIDGEDNIWVADSETSSLRRLVISD 416

Query: 135 HG 136
            G
Sbjct: 417 SG 418


>gi|428214156|ref|YP_007087300.1| thiol-disulfide isomerase-like thioredoxin [Oscillatoria acuminata
           PCC 6304]
 gi|428002537|gb|AFY83380.1| thiol-disulfide isomerase-like thioredoxin [Oscillatoria acuminata
           PCC 6304]
          Length = 506

 Score = 42.4 bits (98), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 5/85 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+    SS L + DS        +        V+ + G+G+ G +DG    A+F  P
Sbjct: 178 PGKVLADAASSRLFIADSGHHRIIVTTL----NGTVQSIIGNGIPGLTDGSFSEAQFFGP 233

Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI 130
           +  + D     +YVAD  NH IR+I
Sbjct: 234 QGMSWDANSQRLYVADTENHAIRQI 258



 Score = 38.9 bits (89), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 30/49 (61%), Gaps = 1/49 (2%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           K + GDG  G  DG+  ++RF +P   ++   G +Y+AD +NH IR++ 
Sbjct: 437 KTVLGDGTPGLLDGKNTTSRFFEPGGLSI-ASGILYIADTNNHRIRRVA 484


>gi|297848174|ref|XP_002891968.1| hypothetical protein ARALYDRAFT_892829 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337810|gb|EFH68227.1| hypothetical protein ARALYDRAFT_892829 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1058

 Score = 42.4 bits (98), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 38/67 (56%), Gaps = 4/67 (5%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHG 136
           E + + ++   G +G+ DG    A F++P+  A + K N +YVAD  NH +R+I   D  
Sbjct: 600 EGNFIVQIGSTGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREI---DFV 656

Query: 137 NQLIRQI 143
           N+ +R +
Sbjct: 657 NERVRTL 663



 Score = 39.3 bits (90), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 3/62 (4%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           P+++  V   +AG G  G+ DG+   A+  +P   A+   G ++VAD +N +IR I +L+
Sbjct: 834 PITKRVVT--VAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI-DLN 890

Query: 135 HG 136
            G
Sbjct: 891 KG 892


>gi|219852663|ref|YP_002467095.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219546922|gb|ACL17372.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 522

 Score = 42.4 bits (98), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 30/47 (63%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           L GD V   S    G+++F++P S AVD  G++YVAD  N+ I+K T
Sbjct: 216 LTGDLVATRSISNSGASQFNRPTSVAVDTGGSVYVADTGNNRIQKFT 262



 Score = 39.3 bits (90), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 22/33 (66%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G+ +F+ P+  AVD  GN+YVAD  N+ I K T
Sbjct: 134 GTNQFNSPEGIAVDGAGNVYVADTGNNRIEKFT 166



 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 13/83 (15%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYT-LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKS 108
            +D  G+  +  +++ R +F+T  S P    S+ +  +G G            +F  P  
Sbjct: 49  AVDGTGNVYVADMNNDRISFFTKASLPQMPSSIGRIGSGHG------------QFFYPHG 96

Query: 109 FAVDMKGNIYVADKSNHVIRKIT 131
            AVD  GN+YVAD  NH I+K T
Sbjct: 97  VAVDSTGNVYVADTGNHQIQKFT 119


>gi|381205317|ref|ZP_09912388.1| NHL repeat containing protein [SAR324 cluster bacterium JCVI-SC
           AAA005]
          Length = 458

 Score = 42.4 bits (98), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 31/46 (67%), Gaps = 1/46 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AG G  GY+DG   SA+F+ P +F  + K +++VAD  N+ IR+I
Sbjct: 271 IAGGGSIGYTDGSGISAQFNAPNAF-TNTKDSLFVADTCNNTIRRI 315


>gi|290991506|ref|XP_002678376.1| NHL repeat domain-containing protein [Naegleria gruberi]
 gi|284091988|gb|EFC45632.1| NHL repeat domain-containing protein [Naegleria gruberi]
          Length = 985

 Score = 42.0 bits (97), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 7/87 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE---PGSARF 103
           P +++  P S DL+++D+      ++S      S VK +AG G+ GY +GE       + 
Sbjct: 452 PSNLVFHPISGDLLIVDTQNHRIRSMS---KSTSFVKTIAGTGIAGY-NGEGMLSNMTKM 507

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + P   AV   G I  AD  N+++R I
Sbjct: 508 NSPSGIAVLSTGEIIFADTFNNLVRMI 534


>gi|383784367|ref|YP_005468936.1| hypothetical protein LFE_1113 [Leptospirillum ferrooxidans C2-3]
 gi|383083279|dbj|BAM06806.1| hypothetical protein LFE_1113 [Leptospirillum ferrooxidans C2-3]
          Length = 382

 Score = 42.0 bits (97), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 26/41 (63%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +G +DG   SARF+ P+       G+IYVAD  N +IRKIT
Sbjct: 53  RGDNDGNGASARFEFPQGIVAAPDGSIYVADTGNDMIRKIT 93



 Score = 40.0 bits (92), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 3/52 (5%)

Query: 82  VKRLAGDG--VQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
           V  LAG G  V G++DG    A F +P+  A+   G I Y+AD  NH+IRK+
Sbjct: 214 VTTLAGQGALVPGHADGVGTQATFLEPRGLAISPDGQILYIADTRNHLIRKL 265



 Score = 36.6 bits (83), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
           + DG   +ARF+ P+  A+   G  +YVAD  N+ IRKI
Sbjct: 113 FRDGAGAAARFNNPEGLAISPDGKTLYVADSRNNRIRKI 151


>gi|290974673|ref|XP_002670069.1| predicted protein [Naegleria gruberi]
 gi|284083624|gb|EFC37325.1| predicted protein [Naegleria gruberi]
          Length = 821

 Score = 42.0 bits (97), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 51/97 (52%), Gaps = 23/97 (23%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL-------------AGDGVQGYSDGE 97
           ++ PG   L+    + +  Y + FPLS +  V+R+             AG+G  G + GE
Sbjct: 101 VENPG---LLAFSPNETTLY-VCFPLSNK--VQRMDFLPNGTEIFSDFAGNGNSGNT-GE 153

Query: 98  PG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            G   +A+ ++P   AVD  G++Y++D  N +IRK++
Sbjct: 154 GGLAINAQLNQPSGVAVDSNGDVYISDSGNGLIRKVS 190



 Score = 41.6 bits (96), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 51/102 (50%), Gaps = 13/102 (12%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP  +   P + DL + DS  +    ++       ++  +AGD   GYS G+ G A+  K
Sbjct: 220 NPRGLNFGP-NGDLYIADSDNNVVRRIN----SSGIITTIAGDTTSGYS-GDGGDAKLAK 273

Query: 106 ---PKSFAVDMKGNIYVADKSNHVIRKITNLD----HGNQLI 140
              P +  V     I+++D  N+VIRK++N +     GN+L+
Sbjct: 274 MKNPINVIVSKTNEIFISDADNNVIRKVSNGNISTIAGNELV 315



 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 8/87 (9%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYS--DGEPGSARF 103
           P+SV  +  SSD IV+  + +      F   +   +  +AG  G QGYS  +G   S+  
Sbjct: 501 PNSV--KVSSSDEIVIADTSNNVIRKVF---KNGTIITIAGTIGSQGYSGDNGLAISSVL 555

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI 130
             P   A+   G +++AD  NHVIRKI
Sbjct: 556 SSPSDVAIAPNGEVFIADWGNHVIRKI 582


>gi|158520859|ref|YP_001528729.1| YD repeat-containing protein [Desulfococcus oleovorans Hxd3]
 gi|158509685|gb|ABW66652.1| YD repeat protein [Desulfococcus oleovorans Hxd3]
          Length = 2831

 Score = 42.0 bits (97), Expect = 0.11,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +S + +++  AG GV     G+ G    AR   P   AVD  GN+++AD  N  IRK+
Sbjct: 1629 ISPDGIIEAFAGMGVDAGYSGDGGLAVDARLQSPTGLAVDKTGNLFIADSGNFSIRKV 1686



 Score = 40.8 bits (94), Expect = 0.26,   Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 6/50 (12%)

Query: 82   VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            +K  AGDG      G    AR   P+   +D  GNIY+AD  NH +R+I+
Sbjct: 1587 MKVFAGDG------GPAIEARLYHPQGMEIDASGNIYIADTDNHCVRRIS 1630


>gi|406944252|gb|EKD76066.1| Serine/threonine protein kinase, partial [uncultured bacterium]
          Length = 212

 Score = 42.0 bits (97), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV--DMKGNIYVADKSNHVIRKI 130
           +  + ++  +AG G  GY DG    A F +P+  A+  D    +++AD +N+VIRKI
Sbjct: 100 IGTDGILTNVAGAGDYGYRDGSSDYALFAQPQDIAIYGDTASELFIADTNNNVIRKI 156


>gi|302768701|ref|XP_002967770.1| hypothetical protein SELMODRAFT_17931 [Selaginella moellendorffii]
 gi|300164508|gb|EFJ31117.1| hypothetical protein SELMODRAFT_17931 [Selaginella moellendorffii]
          Length = 128

 Score = 42.0 bits (97), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 2/64 (3%)

Query: 79  ESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           +S ++ LAG   G  GY DG    + F++P+S A+   G ++VAD  N  IRKI+    G
Sbjct: 3   DSSLEALAGSLVGESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKISKDGEG 62

Query: 137 NQLI 140
              I
Sbjct: 63  MTTI 66


>gi|156399714|ref|XP_001638646.1| predicted protein [Nematostella vectensis]
 gi|156225768|gb|EDO46583.1| predicted protein [Nematostella vectensis]
          Length = 286

 Score = 42.0 bits (97), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 1/51 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT 131
           V   AG+G +G  DGE   A+F  P+S ++D +G+ ++V D  NH IR I+
Sbjct: 131 VTTFAGNGSEGLVDGERTRAQFKGPQSLSLDDEGDRLFVGDTDNHAIRVIS 181


>gi|379719861|ref|YP_005311992.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus 3016]
 gi|378568533|gb|AFC28843.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus 3016]
          Length = 535

 Score = 42.0 bits (97), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 4/45 (8%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           AGD    ++DG    A+F++P   A+D KGN+YV+D  N  IR I
Sbjct: 235 AGD----FADGPLAQAKFNEPTGIALDAKGNLYVSDSGNQRIRYI 275



 Score = 40.0 bits (92), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 47/106 (44%), Gaps = 8/106 (7%)

Query: 29  EDGYTVTTVIDGHQ---LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           + GY V  ++DG     L   P  +   P  S L V D+   A   +      +  V  +
Sbjct: 109 DKGYPVGGLLDGAANASLFQEPLGLSAGPDGS-LYVADAGNHAIRRID----AKGNVTTV 163

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           AG G  G  DG+  +A F +P   A    G +YVAD   H IR+I+
Sbjct: 164 AGSGRLGAKDGKGAAAEFYRPGDVAAAPDGTLYVADTLGHTIRRIS 209


>gi|337745906|ref|YP_004640068.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus KNP414]
 gi|336297095|gb|AEI40198.1| copper amine oxidase domain protein [Paenibacillus mucilaginosus
           KNP414]
          Length = 537

 Score = 42.0 bits (97), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 4/45 (8%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           AGD    ++DG    A+F++P   A+D KGN+YV+D  N  IR I
Sbjct: 235 AGD----FADGPLAQAKFNEPTGIALDAKGNLYVSDSGNQRIRYI 275



 Score = 40.0 bits (92), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 47/106 (44%), Gaps = 8/106 (7%)

Query: 29  EDGYTVTTVIDGHQ---LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           + GY V  ++DG     L   P  +   P  S L V D+   A   +      +  V  +
Sbjct: 109 DKGYPVGGLLDGAANASLFQEPLGLSAGPDGS-LYVADAGNHAIRRID----AKGNVTTV 163

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           AG G  G  DG+  +A F +P   A    G +YVAD   H IR+I+
Sbjct: 164 AGSGRLGAKDGKGAAAEFYRPGDVAAAPDGTLYVADTLGHTIRRIS 209


>gi|315500945|ref|YP_004079832.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ mal
           allergen [Micromonospora sp. L5]
 gi|315407564|gb|ADU05681.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Micromonospora sp. L5]
          Length = 610

 Score = 42.0 bits (97), Expect = 0.12,   Method: Composition-based stats.
 Identities = 29/97 (29%), Positives = 53/97 (54%), Gaps = 16/97 (16%)

Query: 47  PHSVIDRPGSS------DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS 100
           P +V+  PG +      +L+V DS+R +   L+     E+VV+R+ G G +G +DG   +
Sbjct: 167 PDTVLRFPGKALPLDGGNLLVSDSARHSLVELA--ADGETVVRRI-GTGSRGRTDGPASA 223

Query: 101 ARFDKPKSFAV------DMKG-NIYVADKSNHVIRKI 130
           A F +P+   +      ++ G ++ VAD  NH++R +
Sbjct: 224 ATFAEPQGLCLLPPHVAEVAGYDVVVADTVNHLLRGV 260


>gi|290982388|ref|XP_002673912.1| predicted protein [Naegleria gruberi]
 gi|284087499|gb|EFC41168.1| predicted protein [Naegleria gruberi]
          Length = 2313

 Score = 42.0 bits (97), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 15/50 (30%), Positives = 31/50 (62%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           LAG  ++G++DG   +A  + P S ++ +   +Y +D+ NH IR ++ ++
Sbjct: 500 LAGGSLKGFADGVGSNANLNYPDSISIGLNNMLYFSDRDNHAIRSVSTIN 549



 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 33/59 (55%), Gaps = 2/59 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +S +S    +AG G  G+S   G   SA  + P +   D  GN+ ++D SN+ IRK+TN
Sbjct: 272 VSLQSGTPIVAGTGTSGFSGDGGVATSALLNGPSALTFDSSGNMLISDSSNNRIRKVTN 330



 Score = 40.0 bits (92), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 4/56 (7%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKIT 131
            ++V  ++G G+ GY+ G+ G A + K   P S  V + G+I   DK N  IRKIT
Sbjct: 549 NALVTTISGSGIAGYT-GDEGPAIYAKLNLPGSIEVALNGDIIFMDKGNQRIRKIT 603



 Score = 39.7 bits (91), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           E  ++  +AG G  GY+  G   SA  + P   AV     +Y AD  NHVIRKI+
Sbjct: 134 ENGIISTIAGTGSPGYTASGTATSALINTPLGLAV-YNNEVYFADSLNHVIRKIS 187



 Score = 36.6 bits (83), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 29/54 (53%), Gaps = 2/54 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           VV  +AG G  GYS DG     A+ + P+   +   G I V+D  NH IRKI N
Sbjct: 82  VVSTIAGTGNAGYSGDGANALFAQLNSPQGIGLLSGGAIIVSDTLNHRIRKIEN 135


>gi|296124331|ref|YP_003632109.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
 gi|296016671|gb|ADG69910.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
          Length = 581

 Score = 42.0 bits (97), Expect = 0.12,   Method: Composition-based stats.
 Identities = 26/85 (30%), Positives = 42/85 (49%), Gaps = 7/85 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  ++  P    + + DS+ +     S        + ++ G G  G  DG   SA+FD P
Sbjct: 250 PGKLLVDPAHERVFISDSNHNRIVVASLA----GQLLKVIGSGKIGAKDGPAESAQFDHP 305

Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI 130
           +  A+D  GN +YVAD  NH++R +
Sbjct: 306 QGMALD--GNTLYVADTENHLLRTV 328


>gi|255542920|ref|XP_002512523.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
 gi|223548484|gb|EEF49975.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
          Length = 1016

 Score = 42.0 bits (97), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 4/61 (6%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQ 142
           ++   G +G  DG    A F++P+  A + K N+ YVAD  NH +R+I   D  N+++R 
Sbjct: 565 QIGSTGEEGLRDGPFDEATFNRPQGLAYNAKKNLLYVADTENHALREI---DFVNEMVRT 621

Query: 143 I 143
           +
Sbjct: 622 L 622



 Score = 35.8 bits (81), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 1/65 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           V  +AG G  G+ DG+  +A+  +P        G + +AD +N +IR + +L+     +R
Sbjct: 798 VSTIAGTGKAGFKDGKALAAQLSEPSGIIEAENGRLIIADTNNSIIRYL-DLNKEEAELR 856

Query: 142 QINLK 146
            + LK
Sbjct: 857 TLELK 861


>gi|116621305|ref|YP_823461.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116224467|gb|ABJ83176.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 1042

 Score = 42.0 bits (97), Expect = 0.13,   Method: Composition-based stats.
 Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 3/61 (4%)

Query: 74  FPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           F L+    V  +AG+G  G   G+ G   +A F    + A+D  GN+Y+ D  N+ IRK+
Sbjct: 110 FKLTSAGKVTTVAGNGTSGTYTGDGGPAINAGFSGAYAIALDSTGNLYICDSGNNAIRKV 169

Query: 131 T 131
           T
Sbjct: 170 T 170



 Score = 38.9 bits (89), Expect = 0.96,   Method: Composition-based stats.
 Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 8/75 (10%)

Query: 65  SRSAFYTLSFPLSEESVVKRLAGD-------GVQGYSDGEPG-SARFDKPKSFAVDMKGN 116
           S+    TLS  L   +   +LAG        GV    DG P  SA  + P++   D  G 
Sbjct: 2   SKHFILTLSAALCLTTAHAQLAGSYVSRTIAGVFPLGDGGPATSAILETPQAVVADSSGT 61

Query: 117 IYVADKSNHVIRKIT 131
           IY+AD  N VIRK++
Sbjct: 62  IYIADAGNGVIRKVS 76


>gi|302036773|ref|YP_003797095.1| hypothetical protein NIDE1420 [Candidatus Nitrospira defluvii]
 gi|190343188|gb|ACE75576.1| putative NHL repeat protein [Candidatus Nitrospira defluvii]
 gi|300604837|emb|CBK41169.1| protein of unknown function, contains NHL repeats [Candidatus
           Nitrospira defluvii]
          Length = 989

 Score = 42.0 bits (97), Expect = 0.13,   Method: Composition-based stats.
 Identities = 21/41 (51%), Positives = 23/41 (56%), Gaps = 5/41 (12%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           G +G  DGE     F  P   AVD  GNIYVAD  NH I+K
Sbjct: 160 GQEGVEDGE-----FSSPTGIAVDKDGNIYVADTDNHSIQK 195



 Score = 36.2 bits (82), Expect = 7.3,   Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           +++  GDG      G  GS    F  P+  AVD  GN YVAD+ N+ I+K
Sbjct: 293 IQKFKGDGTPLLKWGRDGSFDGAFFFPRGVAVDFVGNTYVADEGNNRIQK 342


>gi|37522270|ref|NP_925647.1| hypothetical protein gll2701 [Gloeobacter violaceus PCC 7421]
 gi|35213270|dbj|BAC90642.1| gll2701 [Gloeobacter violaceus PCC 7421]
          Length = 351

 Score = 42.0 bits (97), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +  ++ +AG G   Y  DG P   ARF+ P   AVD  GN+ VAD  NH IR I
Sbjct: 124 VGTDGRIETIAGTGEADYGGDGGPAKQARFNDPAGVAVDALGNVLVADTYNHRIRTI 180



 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +  ++ +AG G  GYS DG P +A R D P   AV   G I +AD  N+ IR I
Sbjct: 180 IGPDGTIRTVAGTGQAGYSGDGGPATAARLDFPWGVAVAPDGRILIADTGNNRIRSI 236


>gi|290986699|ref|XP_002676061.1| predicted protein [Naegleria gruberi]
 gi|284089661|gb|EFC43317.1| predicted protein [Naegleria gruberi]
          Length = 819

 Score = 42.0 bits (97), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 31/48 (64%), Gaps = 2/48 (4%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AG+G QGY+ +G  G  +   +P S  +D  GNIY ADK N +IRKI
Sbjct: 207 IAGNGKQGYNGEGLSGLDSALKEPYSVDMDSDGNIYFADKGNSIIRKI 254


>gi|324997987|ref|ZP_08119099.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Pseudonocardia sp. P1]
          Length = 614

 Score = 42.0 bits (97), Expect = 0.13,   Method: Composition-based stats.
 Identities = 19/44 (43%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           Y+DG  GSARF +P+   V     + VAD  NH +R+++ LD G
Sbjct: 231 YTDGPAGSARFSEPQGLLVLDPSTVLVADTVNHAVRRVS-LDDG 273


>gi|292491957|ref|YP_003527396.1| hypothetical protein Nhal_1896 [Nitrosococcus halophilus Nc4]
 gi|291580552|gb|ADE15009.1| YD repeat protein [Nitrosococcus halophilus Nc4]
          Length = 2539

 Score = 42.0 bits (97), Expect = 0.14,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 4/58 (6%)

Query: 76   LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +S E ++  +AG+G+ GY DG+ G    A+ + P + A+    N+Y+AD  NH IR++
Sbjct: 1461 VSPEGIITTVAGNGILGY-DGDGGISTGAKLNLPIAVALSPNENLYIADYYNHRIRRV 1517



 Score = 41.6 bits (96), Expect = 0.15,   Method: Composition-based stats.
 Identities = 19/56 (33%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 76   LSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            +  + VV   AG+G+ G+S  G+  +    +P + A+D  GN+Y+A+  N+VIR++
Sbjct: 1294 IGTDGVVSIAAGNGIAGFSGTGDLATQAMMRPFAIALDSDGNLYIAESQNNVIRRV 1349



 Score = 39.7 bits (91), Expect = 0.64,   Method: Composition-based stats.
 Identities = 18/55 (32%), Positives = 32/55 (58%), Gaps = 2/55 (3%)

Query: 79   ESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            + ++  +AG+G +G+S   G    AR + P   A+   G++Y+AD  NH IR ++
Sbjct: 1352 DGLITLVAGNGTRGFSGDGGLATQARLNHPNDLAIGPDGSLYIADYDNHRIRLVS 1406


>gi|290987760|ref|XP_002676590.1| predicted protein [Naegleria gruberi]
 gi|284090193|gb|EFC43846.1| predicted protein [Naegleria gruberi]
          Length = 836

 Score = 42.0 bits (97), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 25/58 (43%), Positives = 35/58 (60%), Gaps = 5/58 (8%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMK-GNIYVADKSNHVIRKI 130
           +   +V+ +AG G QG+S G+ G   +A+ +KP S   D   GNIY+AD  NH IR I
Sbjct: 501 TSNEMVEVIAGIGFQGFS-GDGGLAINAQLNKPHSVHFDKNSGNIYIADSGNHRIRMI 557


>gi|301064161|ref|ZP_07204608.1| RHS repeat-associated core domain protein [delta proteobacterium
           NaphS2]
 gi|300441781|gb|EFK06099.1| RHS repeat-associated core domain protein [delta proteobacterium
           NaphS2]
          Length = 2050

 Score = 41.6 bits (96), Expect = 0.14,   Method: Composition-based stats.
 Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  +AG+GV G+S DG P   A        A+D  GN+Y+ D SNH +RK+
Sbjct: 917 IITTVAGNGVAGFSGDGGPAVEASLGWAMGIAIDSAGNLYILDGSNHRVRKV 968



 Score = 37.7 bits (86), Expect = 2.2,   Method: Composition-based stats.
 Identities = 19/52 (36%), Positives = 29/52 (55%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  +AG G  G+S DG P   A    P   A+D  GN+Y+ D  N+ +RK+
Sbjct: 804 IITTVAGSGDYGFSGDGGPAIEASLSFPMGIAIDSAGNLYILDSDNNRVRKV 855


>gi|290972323|ref|XP_002668903.1| predicted protein [Naegleria gruberi]
 gi|284082440|gb|EFC36159.1| predicted protein [Naegleria gruberi]
          Length = 296

 Score = 41.6 bits (96), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 25/35 (71%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           +A  + P   AV   G++Y+ADKSNHVIRK++ L+
Sbjct: 108 NAELNFPSGVAVHSNGDVYIADKSNHVIRKVSALN 142


>gi|414163305|ref|ZP_11419552.1| hypothetical protein HMPREF9697_01453 [Afipia felis ATCC 53690]
 gi|410881085|gb|EKS28925.1| hypothetical protein HMPREF9697_01453 [Afipia felis ATCC 53690]
          Length = 357

 Score = 41.6 bits (96), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 5/68 (7%)

Query: 80  SVVKRLAGDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           S V ++  DG    S GE GS   +F+ P S A D KGN+YVAD+ N    +I   D   
Sbjct: 184 SRVAKVDKDGNWIKSWGEKGSEPGQFNTPHSIATDAKGNVYVADRGNA---RIQVFDGDG 240

Query: 138 QLIRQINL 145
           + +RQ+ +
Sbjct: 241 KFLRQMKI 248


>gi|290973047|ref|XP_002669261.1| predicted protein [Naegleria gruberi]
 gi|284082806|gb|EFC36517.1| predicted protein [Naegleria gruberi]
          Length = 618

 Score = 41.6 bits (96), Expect = 0.15,   Method: Composition-based stats.
 Identities = 23/62 (37%), Positives = 36/62 (58%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+S  +G   +A+   P S  V +   +Y+AD+ NH IRKI  L++
Sbjct: 148 ENGNIITIAGNGTDGFSGDNGPATNAQLYYPSSVFVSLTNEVYIADQHNHRIRKI--LEN 205

Query: 136 GN 137
           GN
Sbjct: 206 GN 207



 Score = 38.1 bits (87), Expect = 1.7,   Method: Composition-based stats.
 Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 15/79 (18%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+S  +G   +A+F+ P S  V     + +AD  NH IRKI  L++
Sbjct: 39  ENGNIVTIAGNGTAGFSGDNGPATNAQFNYPSSVFVSSNNEVCIADLHNHRIRKI--LEN 96

Query: 136 G-----------NQLIRQI 143
           G           NQ IR+I
Sbjct: 97  GSIITIAGNGTHNQRIRKI 115


>gi|290979906|ref|XP_002672674.1| predicted protein [Naegleria gruberi]
 gi|284086252|gb|EFC39930.1| predicted protein [Naegleria gruberi]
          Length = 747

 Score = 41.6 bits (96), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           + DL + DS   A   +     E  ++  +AG+G+ G+ DG    A+F+     +V   G
Sbjct: 510 NGDLYICDSINHAIRKIDH---ETGIISTIAGNGIAGFKDGNASIAQFNSNLGLSVLPNG 566

Query: 116 NIYVADKSNHVIRK 129
           ++ +AD +N+ IRK
Sbjct: 567 DLLIADYNNNRIRK 580


>gi|167534306|ref|XP_001748831.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772793|gb|EDQ86441.1| predicted protein [Monosiga brevicollis MX1]
          Length = 5844

 Score = 41.6 bits (96), Expect = 0.16,   Method: Composition-based stats.
 Identities = 26/94 (27%), Positives = 43/94 (45%), Gaps = 10/94 (10%)

Query: 57  SDLIVLDSSR--------SAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKS 108
           SDL+VLD  R        ++   L     +   V  +AG    G++DG  GS + ++P+ 
Sbjct: 896 SDLLVLDEKRILISDEANNSIRLLQLQKDKNWTVTTVAGSRTAGFADGRRGSTQLNQPRG 955

Query: 109 --FAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
             F    +  + +AD  NH IR +   D   +L+
Sbjct: 956 MCFLNQRRTVVAIADAGNHCIRALNTSDWSLKLV 989


>gi|428177953|gb|EKX46830.1| hypothetical protein GUITHDRAFT_70199, partial [Guillardia theta
           CCMP2712]
          Length = 305

 Score = 41.6 bits (96), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 46/96 (47%), Gaps = 6/96 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPL----SEESVVKRLAGDGVQGYSDGEPGSAR 102
           P  V   P  S L+V DS       ++       S +  V  +AG+G  G +DG   +A+
Sbjct: 41  PSDVALSPDESFLVVADSRNHVIRRINKGAQGWSSNDVTVTTIAGNGQPGLADGSATTAQ 100

Query: 103 FDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGN 137
           F KP    +   G  + VAD +N+ IRK+T L  GN
Sbjct: 101 FQKPLGVDISPDGTWVAVADSANNRIRKVT-LATGN 135



 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 47/92 (51%), Gaps = 3/92 (3%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P SV   P  + L V D++ +    +S P S     K     G+ GY++G   +A ++ P
Sbjct: 159 PASVSFHPQGTGLAVADTNNNRIRWISLP-SYSVTTKSGRCLGIAGYANGLGKTAAYNSP 217

Query: 107 KSFAVDMKGN-IYVADKSNHVIRKITNLDHGN 137
           +   V   G  IY+AD  N+VIR+   +D+G+
Sbjct: 218 QYVRVSPDGVWIYIADGQNNVIRR-ARVDNGD 248


>gi|356571070|ref|XP_003553704.1| PREDICTED: NHL repeat-containing protein 2-like [Glycine max]
          Length = 1098

 Score = 41.6 bits (96), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 35/63 (55%), Gaps = 4/63 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLI 140
           + ++   G +G  DG    A F++P+  A + K NI YVAD  NH +R+I   D  N+ +
Sbjct: 644 IVQIGSSGEEGLQDGSFDDATFNRPQGLAYNAKKNILYVADTENHALREI---DFVNEKV 700

Query: 141 RQI 143
           R +
Sbjct: 701 RTL 703



 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 1/65 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           V  +AG G  G+ DG    A+  +P       KG +++AD +N +IR + +L+     +R
Sbjct: 879 VSTIAGTGKAGFKDGTAVKAQLSEPSGIVEGNKGRLFIADTNNSLIRYL-DLNINETELR 937

Query: 142 QINLK 146
            + LK
Sbjct: 938 TLELK 942


>gi|290978635|ref|XP_002672041.1| predicted protein [Naegleria gruberi]
 gi|284085614|gb|EFC39297.1| predicted protein [Naegleria gruberi]
          Length = 706

 Score = 41.6 bits (96), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 36/64 (56%), Gaps = 3/64 (4%)

Query: 75  PLSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKIT 131
           P   E  +  +AG+G++G+   +G    A+   P   AV+   N +Y+AD  NH IRK++
Sbjct: 5   PKGMEQTIWTIAGNGIKGFYGDNGLAIEAKLSSPYGVAVNPYSNDVYIADYRNHCIRKVS 64

Query: 132 NLDH 135
            LD+
Sbjct: 65  ALDN 68



 Score = 40.8 bits (94), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 48/89 (53%), Gaps = 8/89 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARF 103
           P+ V   P S+D+ + D        +S   + ++ +  +AG GV GYS G+ G    A+ 
Sbjct: 38  PYGVAVNPYSNDVYIADYRNHCIRKVS---ALDNKITTIAGTGVAGYS-GDGGLALHAQL 93

Query: 104 DKPKSFAVDMKGN-IYVADKSNHVIRKIT 131
           + P S ++  K   +Y+AD SNH IRKI+
Sbjct: 94  NCPLSVSIHPKNEELYIADYSNHRIRKIS 122



 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 58/128 (45%), Gaps = 12/128 (9%)

Query: 23  SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
           ++G   E G+     ++G      P  +   P + D+ + DS+      ++    +  ++
Sbjct: 194 TAGFCGEGGFATNAQLNG------PSGLDFNPTTGDVYIADSNNHRVRKVN---CKSGII 244

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQLIR 141
             LAG G  GYSDG    A+ + P   +   +G  IYV D+SN+ I  ++ +D     + 
Sbjct: 245 TTLAGTGKAGYSDGI--DAQLNYPYDVSFCTRGQIIYVTDRSNNRICTVSPIDGRITTVC 302

Query: 142 QINLKPED 149
            IN K  D
Sbjct: 303 GINEKGFD 310



 Score = 37.4 bits (85), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 23/88 (26%), Positives = 44/88 (50%), Gaps = 8/88 (9%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARF 103
           P  V   P + D+ ++D   +    +S   +   ++   AG+G  G+  GE G   +A+ 
Sbjct: 154 PQCVKIHPITFDIYIIDFVNNKIRRIS---NSSGIISTFAGNGTAGFC-GEGGFATNAQL 209

Query: 104 DKPKSFAVD-MKGNIYVADKSNHVIRKI 130
           + P     +   G++Y+AD +NH +RK+
Sbjct: 210 NGPSGLDFNPTTGDVYIADSNNHRVRKV 237


>gi|288923319|ref|ZP_06417453.1| serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288345335|gb|EFC79730.1| serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 752

 Score = 41.6 bits (96), Expect = 0.16,   Method: Composition-based stats.
 Identities = 32/113 (28%), Positives = 51/113 (45%), Gaps = 27/113 (23%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRL---------AGDGVQGYS-DGEPGSA-RFDKPKS 108
           + +D++R   Y      +E + ++R+         AG+G  G+  DG P +A    +P  
Sbjct: 423 LTIDTTRGVIYVAD---TENNRIRRIDRTGRITTVAGNGTAGFDGDGGPATAASLQEPSG 479

Query: 109 FAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMF 161
             V   G +++AD  NH IR+I      N +I  I         + QSGSG F
Sbjct: 480 VVVAPDGTLFIADNGNHRIRRIGT----NGVITTI---------AGQSGSGGF 519


>gi|30696124|ref|NP_564718.2| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
 gi|17529226|gb|AAL38840.1| unknown protein [Arabidopsis thaliana]
 gi|332195279|gb|AEE33400.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
          Length = 1055

 Score = 41.6 bits (96), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 25/95 (26%), Positives = 43/95 (45%), Gaps = 4/95 (4%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHG 136
           E + + ++   G +G+ DG    A F++P+  A + K N +YVAD  NH +R+I  ++  
Sbjct: 597 EGNFIVQIGSSGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREIDFVNER 656

Query: 137 NQLIRQINLKPEDCSKSSQSGSGMF---WVTVFSP 168
            Q +     K  D     +    +    W   F P
Sbjct: 657 VQTLAGNGTKGSDYQGGRKGTKQLLNSPWDVCFEP 691



 Score = 41.2 bits (95), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 35/61 (57%), Gaps = 4/61 (6%)

Query: 79  ESVVKR---LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           + V KR   LAG G  G+ DG+   A+  +P   A+   G ++VAD +N +IR I +L+ 
Sbjct: 830 DPVTKRVVTLAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI-DLNK 888

Query: 136 G 136
           G
Sbjct: 889 G 889


>gi|444912221|ref|ZP_21232386.1| putative hemolysin [Cystobacter fuscus DSM 2262]
 gi|444717129|gb|ELW57964.1| putative hemolysin [Cystobacter fuscus DSM 2262]
          Length = 844

 Score = 41.6 bits (96), Expect = 0.17,   Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 27/56 (48%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S    V  LAG G  G  DG    A F  P   AV   G +YVAD  N ++R IT
Sbjct: 788 ISPAGEVTTLAGSGRFGARDGRGADADFVVPGGLAVGTDGTLYVADSGNALLRAIT 843



 Score = 39.7 bits (91), Expect = 0.57,   Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 4/50 (8%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           VKR++ DG    S       R + P   A D +GN+YV+D  N+ IR+IT
Sbjct: 591 VKRVSPDG----SVSTLSHVRLNGPLGIAADAQGNVYVSDSDNYCIRRIT 636


>gi|168702582|ref|ZP_02734859.1| NHL repeat [Gemmata obscuriglobus UQM 2246]
          Length = 599

 Score = 41.6 bits (96), Expect = 0.17,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 8/82 (9%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
           V D PG   L + DS+           ++  V+    G G  G +DG    A+FD P+  
Sbjct: 269 VADEPGRR-LFIADSTHHRVVVTDLAGNKLHVI----GTGAPGKTDGPFDKAQFDDPQGM 323

Query: 110 AVDMKGN-IYVADKSNHVIRKI 130
           AV  +G+ ++VAD+ NH++R+I
Sbjct: 324 AV--RGDTLFVADRKNHLVREI 343


>gi|149176433|ref|ZP_01855047.1| serine/threonine protein kinase [Planctomyces maris DSM 8797]
 gi|148844785|gb|EDL59134.1| serine/threonine protein kinase [Planctomyces maris DSM 8797]
          Length = 368

 Score = 41.6 bits (96), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 31/48 (64%), Gaps = 2/48 (4%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AG G +G+S DG P  SA F +P S A+D   N+Y+ D  NH IR++
Sbjct: 141 VAGTGEKGFSGDGGPATSATFSRPHSIALDNDNNLYICDIGNHRIRRV 188



 Score = 40.8 bits (94), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 19/67 (28%), Positives = 33/67 (49%), Gaps = 1/67 (1%)

Query: 70  YTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +T+     ++  ++   GDG +G   DG+P   R  +P    V   GN+Y+ D   + +R
Sbjct: 301 HTIRVIRKKDGTIETAVGDGKKGDGPDGDPAHCRMARPHGVYVGPNGNVYIGDSETYRVR 360

Query: 129 KITNLDH 135
           K+T   H
Sbjct: 361 KLTTGTH 367



 Score = 36.6 bits (83), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 18/108 (16%)

Query: 39  DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL----------AGD 88
           DG ++E  P   ++ P + D     S + + Y     L E + V R+          AG 
Sbjct: 209 DGAKVEGTP---LNGPRALDFFTDGSGKGSLY---LALREGNKVYRIDLDDQTLHHIAGT 262

Query: 89  GVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           G +GY+   G    A    PK  +V   G++Y+AD  +H IR I   D
Sbjct: 263 GKKGYTGHGGPAKKATLSGPKGISVAPNGDVYLADTESHTIRVIRKKD 310


>gi|334116916|ref|ZP_08491008.1| NHL repeat containing protein [Microcoleus vaginatus FGP-2]
 gi|333461736|gb|EGK90341.1| NHL repeat containing protein [Microcoleus vaginatus FGP-2]
          Length = 546

 Score = 41.6 bits (96), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 59/143 (41%), Gaps = 9/143 (6%)

Query: 20  SLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEE 79
            L   G LL +       +D   L    +SVI     S L + DS  +     +     E
Sbjct: 191 QLHQQGALLGEMSQGNESVDETVLPGPANSVIQNLVGSCLFIADSGHNRIVVSTL----E 246

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQ 138
             V  + G G  G +DG+   A F  P+  A D +  I YVAD  NH +RKI   D   Q
Sbjct: 247 GEVLHVIGSGQPGLTDGDFEEAEFFAPQGMAFDAESQILYVADTENHALRKI---DFTTQ 303

Query: 139 LIRQINLKPEDCSK-SSQSGSGM 160
            +  +    E   + S +SG G+
Sbjct: 304 RVETVAGTGEQSHEISPRSGKGL 326


>gi|428185691|gb|EKX54543.1| hypothetical protein GUITHDRAFT_63359, partial [Guillardia theta
           CCMP2712]
          Length = 305

 Score = 41.6 bits (96), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 45/89 (50%), Gaps = 6/89 (6%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           ++P  +   P  S L+V DS      ++S+    E VV   AG G +G+ DG    A+FD
Sbjct: 57  LHPKGICPLP--SGLLVCDSGHHRIRSVSY--DGERVVA-FAGSGKRGHRDGPVQVAQFD 111

Query: 105 KPKSFAV-DMKGNIYVADKSNHVIRKITN 132
            P S  V     +I VAD  N+ +R+I N
Sbjct: 112 TPCSICVCPSDKSIIVADSGNNAVRRIAN 140



 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 9/86 (10%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG----DGVQGYSDGEPGSAR 102
           P S+   P    +IV DS  +A   ++       +V  LAG    D   G+ DGE   A+
Sbjct: 113 PCSICVCPSDKSIIVADSGNNAVRRIA-----NGMVTTLAGGSGPDRAGGFVDGESEGAK 167

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIR 128
           F +P     D +  + V D  NH +R
Sbjct: 168 FRRPTFVMFDKEETLLVIDSGNHCLR 193


>gi|386722458|ref|YP_006188784.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus K02]
 gi|384089583|gb|AFH61019.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus K02]
          Length = 537

 Score = 41.6 bits (96), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 4/45 (8%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           AGD    ++DG    A+F++P   A+D KGN+YV+D  N  +R I
Sbjct: 235 AGD----FADGPLAQAKFNEPTGIALDAKGNLYVSDSGNQRVRYI 275



 Score = 40.0 bits (92), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 47/106 (44%), Gaps = 8/106 (7%)

Query: 29  EDGYTVTTVIDGHQ---LEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           + GY V  ++DG     L   P  +   P  S L V D+   A   +      +  V  +
Sbjct: 109 DKGYPVGGLLDGAANASLFQEPLGLSAGPDGS-LYVADAGNHAIRRID----AKGNVTTV 163

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           AG G  G  DG+  +A F +P   A    G +YVAD   H IR+I+
Sbjct: 164 AGSGRLGVKDGKGAAAEFYRPGDVAAAPDGTLYVADTLGHTIRRIS 209


>gi|428317077|ref|YP_007114959.1| NHL repeat containing protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428240757|gb|AFZ06543.1| NHL repeat containing protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 546

 Score = 41.6 bits (96), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 51/115 (44%), Gaps = 9/115 (7%)

Query: 48  HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPK 107
           +SVI     S L + DS  +     +     E  V  + G G  G +DG+   A F  P+
Sbjct: 219 NSVIQNLVGSCLFIADSGHNRIVVSTL----EGEVLHVIGSGKPGLTDGDFEEAEFFAPQ 274

Query: 108 SFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSK-SSQSGSGM 160
             A D +  I YVAD  NH +RKI   D   Q +  +    E   + S +SG G+
Sbjct: 275 GMAFDAESQILYVADTENHALRKI---DFTTQRVETVAGTGEQSHEISPRSGKGL 326



 Score = 36.6 bits (83), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 17/48 (35%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + + GDG  G  D +   ARF +P   +  +  +++VAD +NHVIR++
Sbjct: 476 RTMIGDGTAGLVDAKGLKARFFEPSGLSA-IGPHLFVADTNNHVIRRV 522


>gi|290995422|ref|XP_002680294.1| predicted protein [Naegleria gruberi]
 gi|284093914|gb|EFC47550.1| predicted protein [Naegleria gruberi]
          Length = 1170

 Score = 41.6 bits (96), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 53/109 (48%), Gaps = 26/109 (23%)

Query: 48  HSVIDRPGSSDLIVLDS-----SRSAFYT-LSFPLSEESV---------VKRLAGD---- 88
           +S++DR   + +I LD+     + ++ YT +S   SE S          +K+L       
Sbjct: 281 NSLVDRVAGTGIIELDTDGKLGNLTSLYTPISVTQSESSSDIYIGTTIGIKKLTDSNKLI 340

Query: 89  ----GVQGYS--DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
               G Q YS  DG P +A R   P+  A+   G ++++DK +H IRKI
Sbjct: 341 SSFAGSQAYSSGDGYPATAARLSSPEGIAISTSGEVFISDKGSHTIRKI 389



 Score = 37.4 bits (85), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 17/56 (30%), Positives = 30/56 (53%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  + V+  +AG G  GY DG    A+ + P   A    G++ + + SN+ IRK++
Sbjct: 389 IDSKGVISTVAGTGSAGYVDGPALKAKLNGPGFLAFTPNGDLLITESSNNRIRKLS 444


>gi|390369862|ref|XP_798415.2| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
           purpuratus]
          Length = 322

 Score = 41.2 bits (95), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 41/86 (47%), Gaps = 9/86 (10%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V   P  + L V D+  +    ++     + VV+   G    G++DG    ARF  P
Sbjct: 173 PGKVTTNPEGTLLAVSDTGHNRVIIVAL----DGVVQHCIGGPETGFNDGLYQEARFHSP 228

Query: 107 KS--FAVDMKGNIYVADKSNHVIRKI 130
           +   +A D+   IYVAD  NH IRKI
Sbjct: 229 QGLCWAQDV---IYVADTENHAIRKI 251


>gi|430741385|ref|YP_007200514.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
 gi|430013105|gb|AGA24819.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
          Length = 361

 Score = 41.2 bits (95), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 44/86 (51%), Gaps = 7/86 (8%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SAR 102
           I+P +V    G  ++ +LD   +A   +         ++ L G G +G S DG PG  A+
Sbjct: 218 IDPRAVATD-GKGNVYILDRGGNALRVVD----RSGKIRTLVGTGEKGASGDGGPGRQAK 272

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIR 128
            + PK   V+ +G++ +AD  NH IR
Sbjct: 273 LNGPKHLCVNPQGDVIIADTENHRIR 298


>gi|290982061|ref|XP_002673749.1| predicted protein [Naegleria gruberi]
 gi|284087335|gb|EFC41005.1| predicted protein [Naegleria gruberi]
          Length = 185

 Score = 41.2 bits (95), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 2/59 (3%)

Query: 74  FPLSEESVVKRLAGDGV-QGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           F    + V + +AG+G+ Q   DG  G+ A  + P+  A+DM GN+Y+A++  H IR I
Sbjct: 16  FTSHAQYVTQTVAGNGISQSLGDGFLGTQASLNSPRDVALDMYGNMYIAEEKAHKIRFI 74



 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 36/65 (55%), Gaps = 5/65 (7%)

Query: 70  YTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKG-NIYVADKSNH 125
           + + F      ++  L G+G+ G S G+ G   +A+   P S A+D  G N++++D  NH
Sbjct: 69  HKIRFINKTTGIISTLVGNGIAG-SVGDGGLAINAQLHLPFSLAIDSTGQNLFISDSGNH 127

Query: 126 VIRKI 130
           VIR I
Sbjct: 128 VIRHI 132


>gi|390559793|ref|ZP_10244078.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
 gi|390173637|emb|CCF83377.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
          Length = 442

 Score = 41.2 bits (95), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+     + L + DS  +         +E  ++    G G  G +DG+  +A+F+ P
Sbjct: 128 PGKVLADAAGNRLFIADSGHNRIVVTRLDGAEPWMI----GSGQPGLADGDAATAQFNHP 183

Query: 107 KSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTV 165
           +  A  + G+I Y+AD  NHVIR++      +   RQ+    E  + + +   GM + + 
Sbjct: 184 QGMA--LAGDILYLADTRNHVIRRV------DLASRQV----ETIAGTGE--PGMSYASA 229

Query: 166 FSPYLISCKSFW 177
            S   I  +S W
Sbjct: 230 GSARTIDLRSPW 241



 Score = 36.2 bits (82), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 33/60 (55%), Gaps = 4/60 (6%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQLIRQINL 145
           GDG  G  DG   +ARF +P    V + G+ +Y+AD +NH IR +  L+ G     +I L
Sbjct: 386 GDGTPGDRDGVREAARFHEPG--GVSLAGDRLYIADTNNHAIR-VAELETGRVTTLRITL 442


>gi|242017281|ref|XP_002429120.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212513984|gb|EEB16382.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 791

 Score = 41.2 bits (95), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 26/50 (52%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           E VVK   G   QG+ DG    A FD P   A   +  +Y+AD +NH IR
Sbjct: 237 EGVVKYTIGGKEQGFKDGNFQEALFDTPHGLAFFNESILYIADTNNHAIR 286


>gi|395225280|ref|ZP_10403807.1| NHL repeat protein, partial [Thiovulum sp. ES]
 gi|394446527|gb|EJF07348.1| NHL repeat protein, partial [Thiovulum sp. ES]
          Length = 4349

 Score = 41.2 bits (95), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 25/40 (62%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  DG   + RF  P+  A D +GN++VAD +N  I+KIT
Sbjct: 795 GDEDGNLSTVRFYNPRGLAFDEEGNLFVADSNNSKIKKIT 834


>gi|12323029|gb|AAG51506.1|AC058785_9 hypothetical protein [Arabidopsis thaliana]
          Length = 1041

 Score = 41.2 bits (95), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 35/61 (57%), Gaps = 4/61 (6%)

Query: 79  ESVVKR---LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           + V KR   LAG G  G+ DG+   A+  +P   A+   G ++VAD +N +IR I +L+ 
Sbjct: 816 DPVTKRVVTLAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI-DLNK 874

Query: 136 G 136
           G
Sbjct: 875 G 875



 Score = 41.2 bits (95), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 32/54 (59%), Gaps = 1/54 (1%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           E + + ++   G +G+ DG    A F++P+  A + K N +YVAD  NH +R+I
Sbjct: 597 EGNFIVQIGSSGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREI 650


>gi|291239557|ref|XP_002739689.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 744

 Score = 41.2 bits (95), Expect = 0.21,   Method: Composition-based stats.
 Identities = 24/57 (42%), Positives = 30/57 (52%), Gaps = 4/57 (7%)

Query: 87  GDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           GDG   YS G+   GS R   P   + D  GN+YV D +NH I+K  +  HG  L R
Sbjct: 650 GDGEYLYSFGDCGNGSDRLQNPLGVSTDRDGNVYVCDHNNHRIQKYDS--HGKYLTR 704


>gi|147918859|ref|YP_687415.1| hypothetical protein RRC374 [Methanocella arvoryzae MRE50]
 gi|110622811|emb|CAJ38089.1| hypothetical protein RRC374 [Methanocella arvoryzae MRE50]
          Length = 514

 Score = 41.2 bits (95), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 27/39 (69%), Gaps = 3/39 (7%)

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           +PG   FD+P+  AVD +GNIYV+D  NH I ++ N+ H
Sbjct: 363 QPG--EFDRPRGIAVDGRGNIYVSDSYNHRI-QVWNVAH 398



 Score = 37.0 bits (84), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 5/44 (11%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR---KITN 132
           G  D +PG  +F  P+  AVD  GNIYVAD  N+ I+   K TN
Sbjct: 170 GSPDDDPG--QFSSPRGIAVDGAGNIYVADTYNYRIQVWNKTTN 211


>gi|406947628|gb|EKD78528.1| NHL repeat containing protein [uncultured bacterium]
          Length = 810

 Score = 41.2 bits (95), Expect = 0.22,   Method: Composition-based stats.
 Identities = 22/53 (41%), Positives = 34/53 (64%), Gaps = 3/53 (5%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
           +  VK +AGDG +GY++G+  + +FD P   A   KG N+ +AD  N +IRK+
Sbjct: 657 DEAVKLVAGDGNRGYNNGDRDNTQFDNPVGLA--RKGDNLLIADSLNDLIRKV 707



 Score = 38.5 bits (88), Expect = 1.4,   Method: Composition-based stats.
 Identities = 23/75 (30%), Positives = 36/75 (48%), Gaps = 5/75 (6%)

Query: 60  IVLDSSRSAFYTLSF-----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           IV D + S +   +F      ++   VV  + G G  G + G   S +F  P +  +D  
Sbjct: 51  IVSDGAGSFYLADTFNGVIRKINSNGVVSTVVGAGGYGDTTGSGSSTKFALPAAVGLDSS 110

Query: 115 GNIYVADKSNHVIRK 129
           GN+Y+AD  N  I+K
Sbjct: 111 GNVYIADTGNGKIKK 125



 Score = 35.8 bits (81), Expect = 9.3,   Method: Composition-based stats.
 Identities = 16/40 (40%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           GY D     A+F+ P +  +D   N +YVAD+ NH IR++
Sbjct: 559 GYQDATRKDAQFNVPIAVTIDNADNYLYVADRDNHTIRQV 598


>gi|344923601|ref|ZP_08777062.1| hypothetical protein COdytL_03000 [Candidatus Odyssella
           thessalonicensis L13]
          Length = 1165

 Score = 41.2 bits (95), Expect = 0.22,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 8/58 (13%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPED 149
           G  +G   +A FD P     D  G++Y+AD  NHVIRKI           +IN+ P +
Sbjct: 332 GLVNGSGATALFDGPSGMVQDADGSLYIADSYNHVIRKID--------ANKININPSN 381



 Score = 39.7 bits (91), Expect = 0.53,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 3/56 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V  +AG G  G+++G  GSA+F+ P+   V   G + VAD +N  IR +   D GN
Sbjct: 100 VATVAGTGTPGHTNGPGGSAQFNMPQGITVMNDGRLLVADTNNDRIRLV---DPGN 152


>gi|282879766|ref|ZP_06288496.1| NHL repeat protein [Prevotella timonensis CRIS 5C-B1]
 gi|281306435|gb|EFA98465.1| NHL repeat protein [Prevotella timonensis CRIS 5C-B1]
          Length = 486

 Score = 41.2 bits (95), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 1/58 (1%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           P ++ +V   +   G  G+ +G P  A+F  P+  AV+  G+I YVAD  N  IRK+T
Sbjct: 427 PDNKATVTTVIGLPGSAGFKEGGPDVAQFRYPRGVAVNADGSIVYVADTGNRCIRKLT 484


>gi|390957686|ref|YP_006421443.1| gluconolactonase [Terriglobus roseus DSM 18391]
 gi|390412604|gb|AFL88108.1| gluconolactonase [Terriglobus roseus DSM 18391]
          Length = 738

 Score = 41.2 bits (95), Expect = 0.23,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 2/59 (3%)

Query: 76  LSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +S   V+  +AG G QGY    G   +A  + P + AV   G++Y AD  NH IR+I +
Sbjct: 96  VSSGGVLAVVAGTGHQGYVGDGGAATAAELNAPTAVAVAPDGSVYFADSGNHCIRRIAS 154


>gi|380511998|ref|ZP_09855405.1| nhl repeat protein [Xanthomonas sacchari NCPPB 4393]
          Length = 473

 Score = 40.8 bits (94), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           S    V R  G G   + DG  G A F +P+  A++ +G +YVAD  NH +R+I  L
Sbjct: 194 STNGRVLRQFGIGTADFIDGAIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLL 249


>gi|348681654|gb|EGZ21470.1| hypothetical protein PHYSODRAFT_329412 [Phytophthora sojae]
          Length = 251

 Score = 40.8 bits (94), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 68/138 (49%), Gaps = 16/138 (11%)

Query: 6   VYSSLAITLLSLLFSLVSSGLLLE--DGYTVTTVIDGHQLEINPHSVIDRPGSSDL---- 59
           + S+  IT+ SL  SLV  G+ ++  +   V+ + +   ++I P   +    +S +    
Sbjct: 55  ITSTDIITIASLADSLVLQGITVDSSENVYVSRLTEFDVVKIAPDGSVSIFANSSMFTLA 114

Query: 60  --IVLDSSRSAFYT-----LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVD 112
             I +DSS + + T     L F L+ E  V  +AG  VQG+ +     ARF  P +  + 
Sbjct: 115 MGIAIDSSDNIYITDQHRILKFTLAGEMGV--VAGR-VQGFLNALGELARFSTPWALTIG 171

Query: 113 MKGNIYVADKSNHVIRKI 130
             GN+YV D  N+ IRK+
Sbjct: 172 SDGNLYVVDSDNNCIRKV 189


>gi|290986946|ref|XP_002676184.1| NHL repeat-containing protein [Naegleria gruberi]
 gi|284089785|gb|EFC43440.1| NHL repeat-containing protein [Naegleria gruberi]
          Length = 1290

 Score = 40.8 bits (94), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 33/62 (53%), Gaps = 2/62 (3%)

Query: 71  TLSFPLSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           T+ + +S   +V   AG G  GY +D  P  SA  + PK  A    G++++ D  NH IR
Sbjct: 450 TVKYLISSSGIVITKAGSGTTGYNADNIPATSAHLNFPKGIARSTNGDLFITDSGNHRIR 509

Query: 129 KI 130
           K+
Sbjct: 510 KV 511


>gi|268574858|ref|XP_002642408.1| C. briggsae CBR-NHL-1 protein [Caenorhabditis briggsae]
          Length = 979

 Score = 40.8 bits (94), Expect = 0.25,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
           L GD +  +     G+A+F+ P   AVD  G+IYVADK NH   ++   D   Q I
Sbjct: 792 LEGDHLFSFGGHGQGNAKFNNPWGVAVDDLGSIYVADKDNH---RVQVFDKNGQFI 844


>gi|341877854|gb|EGT33789.1| CBN-NHL-1 protein [Caenorhabditis brenneri]
          Length = 976

 Score = 40.8 bits (94), Expect = 0.25,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
           L GD +  +     G+A+F+ P   AVD  G+IYVADK NH   ++   D   Q I
Sbjct: 789 LEGDHLFSFGGHGQGNAKFNNPWGVAVDDLGSIYVADKDNH---RVQVFDKNGQFI 841


>gi|423298428|ref|ZP_17276486.1| hypothetical protein HMPREF1070_05151 [Bacteroides ovatus
           CL03T12C18]
 gi|392663340|gb|EIY56891.1| hypothetical protein HMPREF1070_05151 [Bacteroides ovatus
           CL03T12C18]
          Length = 507

 Score = 40.8 bits (94), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 23/41 (56%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           GY DG    ARF + +    D  GNIY+ D+ NH IR + N
Sbjct: 413 GYWDGIFAEARFLRGEQMCSDDAGNIYIVDQDNHAIRMVNN 453


>gi|290982352|ref|XP_002673894.1| predicted protein [Naegleria gruberi]
 gi|284087481|gb|EFC41150.1| predicted protein [Naegleria gruberi]
          Length = 2807

 Score = 40.8 bits (94), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG G  GYS   G   +A+   P++ AV   G IY+AD  NH IRKI
Sbjct: 572 ISTIAGTGSVGYSGDGGLATNAQLYYPQTVAVSSSGEIYIADAYNHRIRKI 622



 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG G  GY+ DG    SA+ + P   A+   G I +AD +NH IRKI
Sbjct: 902 ISTIAGTGTGGYNGDGVLATSAQLNYPNGIAIQENGEILIADNNNHRIRKI 952



 Score = 40.0 bits (92), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 25/44 (56%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           G+ DG+  +  F    SF   + G I +AD +NH IRKIT L +
Sbjct: 802 GFGDGDMATTSFISAYSFEFTLNGEIIIADSNNHRIRKITTLGY 845



 Score = 39.7 bits (91), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 82  VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AG G  GYS   G   SA+   P   ++   G IY+AD  NH IRK+T
Sbjct: 251 ISTVAGSGTAGYSGDGGLATSAQLYYPLGVSISSAGEIYIADYYNHRIRKVT 302



 Score = 39.3 bits (90), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 3/55 (5%)

Query: 80  SVVKRLAGDG-VQGYS-DGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           SV  ++AG G + GY  DG    SAR + P   A    G++Y+AD +N+ IRK+T
Sbjct: 457 SVASKVAGTGTLNGYGGDGNLATSARLNYPYGTAFSSNGDMYIADTNNNRIRKVT 511



 Score = 39.3 bits (90), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 28/51 (54%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG G  GYS   G   SA+   P S A+   G IY+AD  NH IRKI
Sbjct: 628 ISTIAGTGSVGYSGDGGLATSAQLYYPFSVAISSVGEIYIADTYNHRIRKI 678



 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 33/141 (23%), Positives = 63/141 (44%), Gaps = 26/141 (18%)

Query: 10  LAITLLSLLFSLVSSGLLLEDG----YTVTTVIDG------------HQLEINPHSVIDR 53
           + + LL++L  +V S   ++      Y +TT+  G             QL I     I  
Sbjct: 1   MKVVLLAILLCIVMSAKFIDTQSLLLYNITTIAGGGSVVGDGLLATNAQLAIPAGVAISS 60

Query: 54  PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFA 110
            G  ++I+ DS+ +        +S    +  +AG G   +  G+ G   +A+ + P + +
Sbjct: 61  IG--EIIIADSNNNRIR----KISTNGYISTIAGTGSASFG-GDGGLAINAQLNSPSAVS 113

Query: 111 VDMKGNIYVADKSNHVIRKIT 131
           +   G++Y++D  N  IRKI+
Sbjct: 114 ISSNGDMYISDTGNQRIRKIS 134



 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 16/31 (51%), Positives = 22/31 (70%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           SA+ + P+S A+   G IY+AD +NH IRKI
Sbjct: 536 SAQLNYPRSVAISSSGEIYIADFNNHRIRKI 566


>gi|320103301|ref|YP_004178892.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
 gi|319750583|gb|ADV62343.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
          Length = 676

 Score = 40.8 bits (94), Expect = 0.27,   Method: Composition-based stats.
 Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 5/72 (6%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           L V D+  +    + F  +    ++ L GD  +G +D  P   RFD+P   +V + G +Y
Sbjct: 441 LYVADTYNNKIKVIDFS-ARTKTIRTLVGDRRRGDTDDPP---RFDQPGGLSV-LGGQLY 495

Query: 119 VADKSNHVIRKI 130
           VAD +NH IR +
Sbjct: 496 VADTNNHKIRVV 507



 Score = 40.4 bits (93), Expect = 0.38,   Method: Composition-based stats.
 Identities = 19/50 (38%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
           +K + G+G  G  DG    A F++P+   +D ++  +YVAD  NH IR I
Sbjct: 236 LKAVVGNGKIGMRDGAYERASFNRPQGIRLDALRNRLYVADTENHAIRAI 285


>gi|453054414|gb|EMF01867.1| redoxin domain-containing protein [Streptomyces mobaraensis NBRC
           13819 = DSM 40847]
          Length = 608

 Score = 40.8 bits (94), Expect = 0.27,   Method: Composition-based stats.
 Identities = 28/82 (34%), Positives = 42/82 (51%), Gaps = 7/82 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG   L V D++R     L+     E+VV+R+ G G +G+ D     ARF +P
Sbjct: 183 PGKALALPGGGYL-VSDTTRHQLVELA--ADGETVVRRI-GTGERGFGDS---PARFSEP 235

Query: 107 KSFAVDMKGNIYVADKSNHVIR 128
           +  A+   G + VAD  NH +R
Sbjct: 236 QGLALLPDGRVAVADTVNHALR 257


>gi|421872066|ref|ZP_16303685.1| NHL repeat family protein [Brevibacillus laterosporus GI-9]
 gi|372458678|emb|CCF13234.1| NHL repeat family protein [Brevibacillus laterosporus GI-9]
          Length = 633

 Score = 40.8 bits (94), Expect = 0.27,   Method: Composition-based stats.
 Identities = 19/44 (43%), Positives = 27/44 (61%), Gaps = 5/44 (11%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           G +G  DG+     F++P   A+D K NIYV D++NH I+K  N
Sbjct: 487 GQKGTKDGQ-----FNQPMQLAIDSKDNIYVVDRNNHRIQKFDN 525


>gi|348681648|gb|EGZ21464.1| hypothetical protein PHYSODRAFT_329405 [Phytophthora sojae]
          Length = 145

 Score = 40.8 bits (94), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 5/76 (6%)

Query: 60  IVLDSSRSAFY-----TLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           I +DSS S +       L   ++ +  VK +AG G +G+ +G   SARF  P +      
Sbjct: 3   IAIDSSDSIYVLDMHRILKVTVTLDGDVKVVAGSGAKGFVNGFGESARFSTPWALTFGSD 62

Query: 115 GNIYVADKSNHVIRKI 130
           G +YV D  N  IRK+
Sbjct: 63  GYLYVPDLDNDCIRKV 78


>gi|298386035|ref|ZP_06995592.1| hypothetical protein HMPREF9007_02755 [Bacteroides sp. 1_1_14]
 gi|298261263|gb|EFI04130.1| hypothetical protein HMPREF9007_02755 [Bacteroides sp. 1_1_14]
          Length = 462

 Score = 40.8 bits (94), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 84  RLAGDGVQGYSDGE-PGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
           R  GDG +G++DG+    ARF+ P S   D ++G + V D +NH IRKI
Sbjct: 410 RPNGDGTKGFNDGDLRTQARFNYPASIVYDEVRGCLLVGDSNNHRIRKI 458


>gi|17553622|ref|NP_499028.1| Protein NHL-1 [Caenorhabditis elegans]
 gi|6226931|sp|Q03601.2|NHL1_CAEEL RecName: Full=RING finger protein nhl-1
 gi|3877606|emb|CAA79562.1| Protein NHL-1 [Caenorhabditis elegans]
          Length = 974

 Score = 40.8 bits (94), Expect = 0.27,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
           L GD +  +     G+A+F+ P   AVD  G+IYVADK NH   ++   D   Q I
Sbjct: 787 LEGDHLFSFGGHGQGNAKFNNPWGVAVDDLGSIYVADKDNH---RVQVFDKNGQFI 839


>gi|308799191|ref|XP_003074376.1| NHL repeat (ISS) [Ostreococcus tauri]
 gi|116000547|emb|CAL50227.1| NHL repeat (ISS) [Ostreococcus tauri]
          Length = 1783

 Score = 40.8 bits (94), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 41/87 (47%), Gaps = 11/87 (12%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGS--------ARFDKPK 107
           +S  +V DS  +    +    SE   V   AG+G +G +DG+  +        A F  P 
Sbjct: 201 ASTCVVGDSGNNVVRVIDLGKSE---VTNFAGNGTEGSTDGKTSANAELGETQALFKGPT 257

Query: 108 SFAVDMKGNIYVADKSNHVIRKITNLD 134
           S A+   G++ VAD  N  +R I+N D
Sbjct: 258 SVALLASGDVLVADTGNKAVRNISNGD 284



 Score = 40.0 bits (92), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 46/100 (46%), Gaps = 7/100 (7%)

Query: 34  VTTVIDGHQLEINP-HSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
           VTTV+    L   P   V+D     D+ VL  S+     +S  +S  +V          G
Sbjct: 285 VTTVV--ASLPFAPIELVVDESNGGDMYVLGQSQHGVMKIS--VSTLAVTTIAGSQTTSG 340

Query: 93  YSDGEPG-SARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
           + D   G SARF  P+  A+D +   +YVAD  NH +R I
Sbjct: 341 FVDHNTGTSARFTLPRGLALDSLNSKLYVADTGNHAVRMI 380


>gi|290976772|ref|XP_002671113.1| predicted protein [Naegleria gruberi]
 gi|284084679|gb|EFC38369.1| predicted protein [Naegleria gruberi]
          Length = 520

 Score = 40.8 bits (94), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 4/58 (6%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ + ++  +AG G  GYS G+ G   SA   KP   AVD  G IY  D +N+ +RK+
Sbjct: 305 ITPDGIITTIAGTGNFGYS-GDGGLATSADISKPTGIAVDSNGTIYFCDNNNNRVRKL 361



 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 29/53 (54%), Gaps = 4/53 (7%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AG G  GYS G+ G   +A    PKS A+   G I+  D  N VIR+IT
Sbjct: 255 ITTIAGTGAYGYS-GDGGLAINAVIRYPKSIAIGKHGEIFFTDSDNQVIRRIT 306


>gi|308501841|ref|XP_003113105.1| CRE-NHL-1 protein [Caenorhabditis remanei]
 gi|308265406|gb|EFP09359.1| CRE-NHL-1 protein [Caenorhabditis remanei]
          Length = 986

 Score = 40.8 bits (94), Expect = 0.28,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
           L GD +  +     G+A+F+ P   AVD  G+IYVADK NH   ++   D   Q I
Sbjct: 799 LEGDHLFSFGGHGQGNAKFNNPWGVAVDDLGSIYVADKDNH---RVQVFDKNGQFI 851


>gi|410096564|ref|ZP_11291551.1| hypothetical protein HMPREF1076_00729 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409226528|gb|EKN19437.1| hypothetical protein HMPREF1076_00729 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 457

 Score = 40.8 bits (94), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 25/43 (58%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  GY DG+P  A F+KP   A    G +YVAD  N++IR I 
Sbjct: 413 GESGYKDGKPQEALFNKPYDVAATPDGILYVADTYNYLIRCIA 455



 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 18/58 (31%), Positives = 33/58 (56%)

Query: 73  SFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           S P   ++ ++ + G G     +G   +A+  + +  A D +GN+Y+AD +NHVI K+
Sbjct: 342 STPWITQNEIEHIVGTGKGAAKEGNGKAAQLGEIEGMAADQEGNVYLADYTNHVIWKV 399


>gi|402571882|ref|YP_006621225.1| hypothetical protein Desmer_1347 [Desulfosporosinus meridiei DSM
            13257]
 gi|402253079|gb|AFQ43354.1| conserved repeat protein [Desulfosporosinus meridiei DSM 13257]
          Length = 2025

 Score = 40.8 bits (94), Expect = 0.29,   Method: Composition-based stats.
 Identities = 20/41 (48%), Positives = 26/41 (63%), Gaps = 5/41 (12%)

Query: 89   GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            G  G   GE     F++P+  AVD +GNIYVAD +NH I+K
Sbjct: 966  GSNGSDQGE-----FNRPQGIAVDSEGNIYVADSNNHRIQK 1001



 Score = 38.5 bits (88), Expect = 1.4,   Method: Composition-based stats.
 Identities = 18/43 (41%), Positives = 24/43 (55%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           G+ +  +     G  +F  PKS  VD  GNIYVAD +N  I+K
Sbjct: 770 GEYLMKFGSSGSGEGQFRNPKSVTVDSAGNIYVADTTNKRIQK 812



 Score = 37.0 bits (84), Expect = 4.5,   Method: Composition-based stats.
 Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 5/46 (10%)

Query: 85   LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            LA  G +G S G+     FD P   A D  GN+YVAD +N+ I+K+
Sbjct: 1199 LAKWGTRGNSAGQ-----FDNPYGIAADSAGNVYVADTNNNRIQKL 1239



 Score = 36.2 bits (82), Expect = 6.4,   Method: Composition-based stats.
 Identities = 17/43 (39%), Positives = 21/43 (48%)

Query: 87   GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            G   QG+         F KP   AVD  GNIYV +  NH ++K
Sbjct: 1053 GGNPQGFGSTGTDEGEFKKPSGVAVDSDGNIYVVEAMNHRMQK 1095



 Score = 35.8 bits (81), Expect = 8.7,   Method: Composition-based stats.
 Identities = 19/38 (50%), Positives = 24/38 (63%), Gaps = 2/38 (5%)

Query: 96   GEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            G PG+  ++F  P   AVD  GN+YVAD S + IRK T
Sbjct: 1156 GSPGAGDSQFLFPHGIAVDSAGNVYVADTSANWIRKYT 1193


>gi|309790072|ref|ZP_07684646.1| NHL repeat-containing protein [Oscillochloris trichoides DG-6]
 gi|308227927|gb|EFO81581.1| NHL repeat-containing protein [Oscillochloris trichoides DG6]
          Length = 1124

 Score = 40.8 bits (94), Expect = 0.29,   Method: Composition-based stats.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 2/37 (5%)

Query: 92   GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            GY+ GEPG+  F++P   AVD +GN+YVAD  N  ++
Sbjct: 980  GYAGGEPGA--FNEPVGVAVDAQGNVYVADTWNSRVQ 1014


>gi|290991466|ref|XP_002678356.1| predicted protein [Naegleria gruberi]
 gi|284091968|gb|EFC45612.1| predicted protein [Naegleria gruberi]
          Length = 1017

 Score = 40.8 bits (94), Expect = 0.29,   Method: Composition-based stats.
 Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 82  VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AG+G+ G+S  +G    A F+     AV   G++YVAD  NH IRKI+
Sbjct: 574 ISTIAGNGIVGFSGDNGLATKATFNSRNGIAVASNGDVYVADTRNHRIRKIS 625



 Score = 40.4 bits (93), Expect = 0.40,   Method: Composition-based stats.
 Identities = 17/35 (48%), Positives = 25/35 (71%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           +A  + P   AV   G++Y+ADKSNHVIRK++ L+
Sbjct: 108 NAELNFPSGVAVHSNGDVYIADKSNHVIRKVSALN 142



 Score = 39.3 bits (90), Expect = 0.74,   Method: Composition-based stats.
 Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 7/88 (7%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ--GYSDGEPGSAR 102
           +N +S    P S + I+ DS+ +    L   +S   ++  +AG G    G  +     A+
Sbjct: 820 LNSNSFTTTP-SGEFIIADSNNN----LIRKISTSGIISTIAGTGAATFGGDNANATIAK 874

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            + P + AV   G I++AD +NH IRKI
Sbjct: 875 LNNPLNVAVSSSGEIFIADTNNHRIRKI 902



 Score = 37.0 bits (84), Expect = 4.0,   Method: Composition-based stats.
 Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG+G  GYS DG +    + + P + AV   G I++ D  NH IRKI
Sbjct: 908 ITTIAGNGTAGYSGDGLDSTKCQLNYPSAVAVSSGGEIFIVDTHNHRIRKI 958



 Score = 37.0 bits (84), Expect = 4.1,   Method: Composition-based stats.
 Identities = 34/92 (36%), Positives = 46/92 (50%), Gaps = 6/92 (6%)

Query: 49  SVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDK 105
           SVI  P  S  +   S   A Y  SF  S  + +  +AG G  G   G+ G   SAR  K
Sbjct: 429 SVISAPLPS-AVGYKSKLHAMYLDSFLGSLATKITVVAGTGKLGGYAGDGGLATSARIQK 487

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           P S  ++ + N+Y+ D  NH IRK++ L  GN
Sbjct: 488 PTSVVLNDQ-NLYIVDTLNHRIRKVS-LTFGN 517


>gi|108761192|ref|YP_633033.1| lipoprotein [Myxococcus xanthus DK 1622]
 gi|108465072|gb|ABF90257.1| putative lipoprotein [Myxococcus xanthus DK 1622]
          Length = 868

 Score = 40.8 bits (94), Expect = 0.29,   Method: Composition-based stats.
 Identities = 21/50 (42%), Positives = 26/50 (52%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAG G  G+SDG   +A    P    V   G +YVAD  N ++R IT
Sbjct: 818 VTTLAGSGRYGHSDGPGDAADLVLPAGLTVGPDGRLYVADAGNSLVRAIT 867


>gi|290971625|ref|XP_002668588.1| predicted protein [Naegleria gruberi]
 gi|284082057|gb|EFC35844.1| predicted protein [Naegleria gruberi]
          Length = 731

 Score = 40.8 bits (94), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 30/54 (55%), Gaps = 2/54 (3%)

Query: 79  ESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            S +  +AG+G  GYS DG P  +A+ + P    V   G +Y+AD  N  +RK+
Sbjct: 177 NSQLSTVAGNGYAGYSGDGGPARNAKLNYPADVTVSSTGEVYIADWGNRAVRKV 230


>gi|224536973|ref|ZP_03677512.1| hypothetical protein BACCELL_01849 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521412|gb|EEF90517.1| hypothetical protein BACCELL_01849 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 446

 Score = 40.8 bits (94), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 27/40 (67%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +GY+DG    A+F++P+    D   N YV D++N++IRK+
Sbjct: 347 KGYTDGPGSKAQFNEPQQGCFDNDDNFYVCDQNNNLIRKV 386


>gi|395226792|ref|ZP_10405227.1| Cadherin domain-containing protein, partial [Thiovulum sp. ES]
 gi|394444937|gb|EJF05961.1| Cadherin domain-containing protein, partial [Thiovulum sp. ES]
          Length = 1326

 Score = 40.8 bits (94), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++++  + ++ G G  G++DG+   A+   P  F  D   NI+VAD +N  +RKIT
Sbjct: 205 INKDGEISKIVG-GSYGFADGDSSIAQLKSPSGFDFDKNWNIFVADTNNGRVRKIT 259


>gi|225450563|ref|XP_002277564.1| PREDICTED: NHL repeat-containing protein 2-like [Vitis vinifera]
          Length = 1078

 Score = 40.8 bits (94), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 24/89 (26%), Positives = 40/89 (44%), Gaps = 4/89 (4%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQ 142
           ++   G +G  DG    A F++P+  A + K N+ YVAD  NH +R+I  ++   Q +  
Sbjct: 627 QIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAG 686

Query: 143 INLKPEDCSKSSQSGSGMF---WVTVFSP 168
              K  D     +  + +    W   F P
Sbjct: 687 NGTKGSDYQGGGKGATQLLNSPWDVCFEP 715


>gi|158318845|ref|YP_001511353.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158114250|gb|ABW16447.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 770

 Score = 40.8 bits (94), Expect = 0.30,   Method: Composition-based stats.
 Identities = 38/113 (33%), Positives = 52/113 (46%), Gaps = 14/113 (12%)

Query: 30  DGYTVTTVIDGHQLEINPHSVIDRPGSSDLIV-LDSSRSAFYTLSFPLSEESVVKRLAGD 88
           DG T+TTV     L   P +V   PG   LI   DS R       + L      + +AG 
Sbjct: 565 DG-TITTVPTPAALR-RPRAVTTGPGGVLLIADTDSHRV------WRLGPGETARVIAGT 616

Query: 89  GVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQL 139
              GYS DG P + A   +P+S AVD  G + +AD     IR++   DH  ++
Sbjct: 617 AEPGYSGDGGPATRAAIGRPQSLAVDGAGRLLIADPDQRRIRRV---DHAGRI 666


>gi|291223251|ref|XP_002731624.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 434

 Score = 40.8 bits (94), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 41/87 (47%), Gaps = 6/87 (6%)

Query: 46  NPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFD 104
           NP +V +D        V D+   A   +S P  E   V  +AG G +G+ DG+   A F 
Sbjct: 183 NPQAVAVDHVYKDRFFVADTDNHAIREVSLPDGE---VTTIAG-GEKGFKDGKGTGATFY 238

Query: 105 KPKSFAVDMKGNI-YVADKSNHVIRKI 130
            P    +D   NI ++AD  NH IR I
Sbjct: 239 HPAGVTIDPIRNILFIADHYNHAIRMI 265



 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 47/108 (43%), Gaps = 12/108 (11%)

Query: 33  TVTTVIDGHQLEINPHSVIDRPG-------SSDLIVLDSSRSAFYTLSFPLSEESVVKRL 85
           T+   + G+Q  +   ++   P        +  L V DS+ S    ++    E   V   
Sbjct: 107 TLAGSLQGYQDGVGEKAMFHHPAGMSMHRKTRTLFVADSANSRIRAINVVTGE---VTTF 163

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVD--MKGNIYVADKSNHVIRKIT 131
           AG G +   DG    A F  P++ AVD   K   +VAD  NH IR+++
Sbjct: 164 AGSGKEELKDGLKTIASFFNPQAVAVDHVYKDRFFVADTDNHAIREVS 211


>gi|196229731|ref|ZP_03128595.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
 gi|196226057|gb|EDY20563.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
          Length = 360

 Score = 40.8 bits (94), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 32/57 (56%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +  +   AG G +GYS DG P   A F+ P     D  GN Y+AD +NH IR++
Sbjct: 74  VTPDGKIHTAAGIGQKGYSGDGGPALEATFNLPHEIRFDRAGNYYIADMANHAIRRV 130



 Score = 39.3 bits (90), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)

Query: 65  SRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADK 122
           +R     L F  ++  ++   AG G +G++ +G P   A    PK  AVD +GN+++AD 
Sbjct: 235 TREGNQVLKFD-AKTGIISIAAGTGKKGFTGNGGPALEATLSGPKGIAVDAQGNVWLADT 293

Query: 123 SNHVIRKI 130
            +H IR+I
Sbjct: 294 ESHSIREI 301


>gi|348688202|gb|EGZ28016.1| hypothetical protein PHYSODRAFT_321722 [Phytophthora sojae]
          Length = 490

 Score = 40.8 bits (94), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 30/59 (50%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQI 143
            AG G +G+ DG    A F+ P        G+I VAD  NH IR+I    +G  +++ I
Sbjct: 134 FAGSGKKGWRDGLALEANFNSPSGVCEYADGSIIVADTGNHCIRQIRRGANGKLVVKTI 192



 Score = 36.2 bits (82), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 46/96 (47%), Gaps = 5/96 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEES--VVKRLAGDGVQGYSDGEPGSARFD 104
           P +V+  P S +L+V D+  +    L  P    S   VK + G    G++DG    A FD
Sbjct: 230 PSTVLAGP-SGELLVADTMNNCIRGLLPPPDGTSPWRVKTVCGQTRPGHADGNCEVALFD 288

Query: 105 KPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQL 139
           +P S       N  +VAD+ N  IR++    +GN L
Sbjct: 289 QPLSLCWGEDSNTFFVADRGNACIRQV-GRSYGNGL 323


>gi|405375498|ref|ZP_11029528.1| putative hemolysin [Chondromyces apiculatus DSM 436]
 gi|397086225|gb|EJJ17355.1| putative hemolysin [Myxococcus sp. (contaminant ex DSM 436)]
          Length = 855

 Score = 40.8 bits (94), Expect = 0.31,   Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 26/50 (52%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAG G  G++DG    A+   P    V   G +YVAD  N ++R IT
Sbjct: 805 VTTLAGSGTYGHADGPGEDAQLVLPAGLVVGPDGRLYVADSGNALVRTIT 854


>gi|395218326|ref|ZP_10401983.1| alkyl hydroperoxide reductase [Pontibacter sp. BAB1700]
 gi|394454578|gb|EJF09204.1| alkyl hydroperoxide reductase [Pontibacter sp. BAB1700]
          Length = 474

 Score = 40.8 bits (94), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 35/60 (58%), Gaps = 2/60 (3%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           LS++  V  + G G +G+++G    A F++P   A+     +YVAD  NH IRK+ +L H
Sbjct: 197 LSQQGQVLEVIGSGERGFNNGGYADATFNEPHGLAL-HGSTLYVADAKNHAIRKV-DLQH 254


>gi|290983716|ref|XP_002674574.1| predicted protein [Naegleria gruberi]
 gi|284088165|gb|EFC41830.1| predicted protein [Naegleria gruberi]
          Length = 2271

 Score = 40.8 bits (94), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 58/121 (47%), Gaps = 15/121 (12%)

Query: 19  FSLVSSGLLLEDGYTVTTVIDGHQLEINPHSVIDRP-----GSSDLIVL-DSSRSAFYTL 72
           F + +S L    G ++   +DG    I  +++++ P     G  ++I   D   +A  ++
Sbjct: 500 FEINNSTLTTLVGRSLKGFLDG----IGSNALLNYPESLIIGPDNMIYFSDRDNNAIRSV 555

Query: 73  SFPLSEESVVKRLAGDGVQGY-SDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           S   +  ++V  LAGD   G+  DG P  S+R D P    + + G I   D+ N  IRKI
Sbjct: 556 S---TSSALVTTLAGDRFNGFLGDGGPAKSSRLDSPGPIQLTLGGEIIFMDRGNQRIRKI 612

Query: 131 T 131
           T
Sbjct: 613 T 613



 Score = 40.0 bits (92), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 1/54 (1%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           E  ++  LAG+G  G + G   SA+ + P +  V    +IY A+  NH+IR+I+
Sbjct: 136 ENGIISTLAGNGSPGLTVGSAISAQLNTPTALIV-ASNDIYFAESGNHLIRRIS 188



 Score = 39.7 bits (91), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
            AG G  GYS DG  G SA  + P + A D  GN+ ++D  N+ IRK+ N
Sbjct: 291 FAGSGTSGYSGDGGLGTSALLNGPSALAFDSSGNLLISDSFNNRIRKVAN 340



 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 37/79 (46%), Gaps = 6/79 (7%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDM 113
           +SD+I  D++      +        ++  +AG GV  YS   G   +A+ + P+   +  
Sbjct: 63  NSDIIFCDTNNHRIRKIDV----NGIMSTIAGTGVANYSGDGGAAVNAQLNSPQGIGILS 118

Query: 114 KGNIYVADKSNHVIRKITN 132
            G I  +D  NH IRKI N
Sbjct: 119 TGAIVFSDTLNHCIRKIEN 137


>gi|296089788|emb|CBI39607.3| unnamed protein product [Vitis vinifera]
          Length = 1002

 Score = 40.4 bits (93), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 24/89 (26%), Positives = 40/89 (44%), Gaps = 4/89 (4%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQ 142
           ++   G +G  DG    A F++P+  A + K N+ YVAD  NH +R+I  ++   Q +  
Sbjct: 551 QIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAG 610

Query: 143 INLKPEDCSKSSQSGSGMF---WVTVFSP 168
              K  D     +  + +    W   F P
Sbjct: 611 NGTKGSDYQGGGKGATQLLNSPWDVCFEP 639


>gi|296122042|ref|YP_003629820.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
 gi|296014382|gb|ADG67621.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
          Length = 425

 Score = 40.4 bits (93), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 49/105 (46%), Gaps = 15/105 (14%)

Query: 30  DGYTVTTVI--DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG 87
           DG   T  +  + H+L   P        + D+++ D      +T+     +  ++  LAG
Sbjct: 158 DGGPATKALMNEPHELRFTP--------AGDIVIADMR---THTIRKIDGKTGMISTLAG 206

Query: 88  DGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            G  G+S DG P   A+ + P S  +D  G++ + D  NH +RK+
Sbjct: 207 TGTAGFSGDGGPAEKAQLNMPHSIQIDPAGDLLICDTGNHRVRKV 251



 Score = 36.2 bits (82), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 38/94 (40%), Gaps = 16/94 (17%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV--------KRLAGDGVQGYSDG 96
           +N    ID     D+I+     +A Y   FP  E  ++          L GDG+      
Sbjct: 281 LNGPRSIDFTPEGDMILALREGNAVY--RFPKGEAKLIHIAGVGGKPSLVGDGIDARK-- 336

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
               A    PK  AVD  G+IY+AD   H IR I
Sbjct: 337 ----AILGAPKGAAVDANGDIYLADTETHTIRVI 366


>gi|290995436|ref|XP_002680301.1| predicted protein [Naegleria gruberi]
 gi|284093921|gb|EFC47557.1| predicted protein [Naegleria gruberi]
          Length = 699

 Score = 40.4 bits (93), Expect = 0.33,   Method: Composition-based stats.
 Identities = 23/62 (37%), Positives = 36/62 (58%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G+ G+S  +G   +A+   P S  V     +Y+AD+ NH IRKI  L++
Sbjct: 95  ENGNIITIAGNGIHGFSGDNGLATNAQLYTPCSVFVSSNNEVYIADQGNHRIRKI--LEN 152

Query: 136 GN 137
           GN
Sbjct: 153 GN 154



 Score = 39.7 bits (91), Expect = 0.54,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G +G S  +G   +A+ ++P S  V     +Y+AD+ N+ IRKI  L++
Sbjct: 39  ENGTIVTIAGNGTKGSSGDNGLATNAQLNRPYSVFVSSNNEVYIADQGNNRIRKI--LEN 96

Query: 136 GN 137
           GN
Sbjct: 97  GN 98



 Score = 38.5 bits (88), Expect = 1.5,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 4/63 (6%)

Query: 77  SEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           +E   +  +AG+G+ G++  +G   +AR + P S  V     +Y+AD  N+ IRKI  L+
Sbjct: 226 NENGNIITIAGNGIHGFNGDNGLATNARLNHPFSVFVSSNNEVYIADYYNNRIRKI--LE 283

Query: 135 HGN 137
           +GN
Sbjct: 284 NGN 286


>gi|256556932|gb|ACU83555.1| thiol-disulfide isomerase/thioredoxin [uncultured bacterium
           HF130_AEPn_1]
          Length = 497

 Score = 40.4 bits (93), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           VK + G G +G  DG+   ARF +P   A D K + +Y+AD  NH I+ +
Sbjct: 221 VKMVIGSGKEGNKDGDIKVARFRRPHGLAFDEKNDLLYIADTDNHSIKSL 270


>gi|436838118|ref|YP_007323334.1| NHL repeat-containing protein [Fibrella aestuarina BUZ 2]
 gi|384069531|emb|CCH02741.1| NHL repeat-containing protein [Fibrella aestuarina BUZ 2]
          Length = 375

 Score = 40.4 bits (93), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 6/63 (9%)

Query: 74  FPLSEESVVKRLAG----DGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           F ++    V RL+G    DG    GY DG    A+F  P S  +D  G +Y+ D  NH I
Sbjct: 141 FRINSAGQVTRLSGGPDPDGGYSFGYRDGPLARAKFSNPWSIIMDKAGMLYIGDGLNHCI 200

Query: 128 RKI 130
           RK+
Sbjct: 201 RKV 203


>gi|325110177|ref|YP_004271245.1| hypothetical protein Plabr_3626 [Planctomyces brasiliensis DSM
           5305]
 gi|324970445|gb|ADY61223.1| NHL repeat containing protein [Planctomyces brasiliensis DSM 5305]
          Length = 370

 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 94  SDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            DG P   AR   PK  +VD +GNI++AD  +H IR+I
Sbjct: 273 GDGRPARLARLAGPKGISVDKQGNIFLADTESHTIRRI 310



 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 21/66 (31%), Positives = 34/66 (51%), Gaps = 4/66 (6%)

Query: 69  FYTLSFP--LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSN 124
            + L  P  +  E  V  + GDG  GY+  +G    A+ ++P    +   G +YV + +N
Sbjct: 17  LFVLQLPATVQAEGKVITILGDGKAGYTGDNGSADEAQCEQPFGLVIGPDGALYVCETAN 76

Query: 125 HVIRKI 130
           HVIR+I
Sbjct: 77  HVIRRI 82



 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 34/56 (60%), Gaps = 2/56 (3%)

Query: 77  SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  V+  +AG G +G+S DG+  + A F++P S   D  G++Y+ D  NH +R +
Sbjct: 141 AKTGVISTVAGTGEKGFSGDGKAATEATFNRPHSICFDSAGHLYICDIGNHRVRVV 196



 Score = 35.8 bits (81), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 1/60 (1%)

Query: 70  YTLSFPLSEESVVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           +T+     +  ++  L G+G +G   DG P   + ++P    V   G +Y+ D SNH +R
Sbjct: 305 HTIRRIDGQTGIITTLVGNGKKGDGPDGNPLECKLNRPHGVFVAADGKVYIGDSSNHRVR 364


>gi|219851088|ref|YP_002465520.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219545347|gb|ACL15797.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 831

 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 15/34 (44%), Positives = 24/34 (70%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           + +F  P+  A+D  GN+YVAD+SNH + K T++
Sbjct: 173 NGQFSYPEGIAIDGTGNVYVADESNHRVEKFTSI 206



 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 15/34 (44%), Positives = 24/34 (70%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           + +F  P+  A+D  GN+YVAD+SNH + K T++
Sbjct: 473 NGQFSYPEGIAIDGTGNVYVADESNHRVEKFTSI 506



 Score = 39.7 bits (91), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GS +F  P   A+D  GN+YV D+ NH ++K T
Sbjct: 125 GSGQFSSPYGVAIDSAGNVYVVDRGNHRVQKFT 157



 Score = 39.7 bits (91), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GS +F  P   A+D  GN+YV D+ NH ++K T
Sbjct: 425 GSGQFSSPYGVAIDGAGNVYVVDRGNHRVQKFT 457



 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 38/76 (50%), Gaps = 15/76 (19%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSG 159
           + +F  P   AVD  GN+YVAD+ N+ ++K T+   G  L +           +S +GSG
Sbjct: 79  NGQFRTPTGIAVDSAGNVYVADRDNNRVQKFTST--GTYLAKW---------GTSGTGSG 127

Query: 160 MFWVTVFSPYLISCKS 175
            F     SPY ++  S
Sbjct: 128 QF----SSPYGVAIDS 139



 Score = 37.0 bits (84), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 14/32 (43%), Positives = 21/32 (65%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + +F  P   AVD  GN+YVAD+ N+ ++K T
Sbjct: 379 NGQFRTPTGIAVDSAGNVYVADRDNNRVQKFT 410



 Score = 35.8 bits (81), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 5/43 (11%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G +G  DG+     F  P   AVD  GN+YVAD  N+ ++K T
Sbjct: 214 GTKGSEDGQ-----FAYPDGVAVDSLGNVYVADSGNNRVQKFT 251



 Score = 35.8 bits (81), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 5/43 (11%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G +G  DG+     F  P   AVD  GN+YVAD  N+ ++K T
Sbjct: 514 GTKGSEDGQ-----FAYPDGVAVDSLGNVYVADSGNNRVQKFT 551


>gi|408405002|ref|YP_006862985.1| NHL repeat-containing protein [Candidatus Nitrososphaera gargensis
           Ga9.2]
 gi|408365598|gb|AFU59328.1| NHL repeat-containing protein [Candidatus Nitrososphaera gargensis
           Ga9.2]
          Length = 501

 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 40/92 (43%), Gaps = 8/92 (8%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFDKPKSFAVDMKGN 116
           L V DS  SA   +        VV+   G+G  V G+ DG    AR   P   A +    
Sbjct: 359 LFVADSEVSAVRRIDLG---RKVVQTAVGEGLFVFGHKDGPLEEARLQHPLGVACESSNK 415

Query: 117 IYVADKSNHVIRKITNLDHGNQLIRQINLKPE 148
           IYVAD  NH +R I   D   Q I  +  +PE
Sbjct: 416 IYVADTYNHAVRLI---DLAEQRISTLVGRPE 444


>gi|348689157|gb|EGZ28971.1| hypothetical protein PHYSODRAFT_474561 [Phytophthora sojae]
          Length = 98

 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 25/39 (64%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           V GY+DG   +  F  P+S AV   G++YVAD +N VI+
Sbjct: 11  VSGYADGYGNAVLFKNPRSLAVGSDGHLYVADDNNRVIK 49


>gi|254383375|ref|ZP_04998727.1| NHL repeat containing protein [Streptomyces sp. Mg1]
 gi|194342272|gb|EDX23238.1| NHL repeat containing protein [Streptomyces sp. Mg1]
          Length = 607

 Score = 40.4 bits (93), Expect = 0.37,   Method: Composition-based stats.
 Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 8/74 (10%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S + +V DS+R     L+     ESVV+R+ G G +G+     G   F +P+  A+   G
Sbjct: 191 SGNFLVSDSTRHQLVELA--ADAESVVRRI-GSGERGF-----GPDAFSEPQGLALLPDG 242

Query: 116 NIYVADKSNHVIRK 129
           ++ VAD  NH +R+
Sbjct: 243 SVVVADTVNHALRR 256


>gi|261855185|ref|YP_003262468.1| redoxin [Halothiobacillus neapolitanus c2]
 gi|261835654|gb|ACX95421.1| Redoxin domain protein [Halothiobacillus neapolitanus c2]
          Length = 498

 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 28/44 (63%), Gaps = 1/44 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           GDG  G++DG   +A F++P+  A  +  ++YVAD  N  IR+I
Sbjct: 234 GDGKAGFADGSFATAEFNRPQGIAW-LGNDLYVADTDNQRIRRI 276


>gi|424792718|ref|ZP_18218917.1| hypothetical protein XTG29_01651 [Xanthomonas translucens pv.
           graminis ART-Xtg29]
 gi|422796954|gb|EKU25368.1| hypothetical protein XTG29_01651 [Xanthomonas translucens pv.
           graminis ART-Xtg29]
          Length = 473

 Score = 40.4 bits (93), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 30/52 (57%), Gaps = 1/52 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           V R  G G   + DG  G A F +P+  A++ +G +YVAD  NH +R+I  L
Sbjct: 199 VLRQFGLGTADFIDGGIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLL 249


>gi|291235470|ref|XP_002737667.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 638

 Score = 40.0 bits (92), Expect = 0.41,   Method: Composition-based stats.
 Identities = 17/45 (37%), Positives = 25/45 (55%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  ++ +     G  RF  P    +D KGN+ VADK+NH I+ +T
Sbjct: 499 GGYIKSFGSRGDGDCRFKSPYGITIDSKGNVIVADKNNHCIQVLT 543


>gi|449017726|dbj|BAM81128.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 924

 Score = 40.0 bits (92), Expect = 0.41,   Method: Composition-based stats.
 Identities = 25/67 (37%), Positives = 32/67 (47%), Gaps = 4/67 (5%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLI 140
           V+ LAG G  G  DG    A F  P+  A D    + YVAD  NH IR++   D     +
Sbjct: 842 VRTLAGSGEPGLHDGASTQAAFFAPQGIAYDPTVRMAYVADTYNHCIRQV---DVATGTV 898

Query: 141 RQINLKP 147
           R + L P
Sbjct: 899 RTLTLTP 905


>gi|397781302|ref|YP_006545775.1| Tripartite motif-containing protein 71 [Methanoculleus bourgensis
           MS2]
 gi|396939804|emb|CCJ37059.1| Tripartite motif-containing protein 71 AltName: Full=Lin-41 homolog
           [Methanoculleus bourgensis MS2]
          Length = 786

 Score = 40.0 bits (92), Expect = 0.42,   Method: Composition-based stats.
 Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 5/41 (12%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           G  G  DGE     F  P   A+D  GN+YVADK NH I+K
Sbjct: 542 GSYGRGDGE-----FRNPSGVAIDAAGNVYVADKENHRIQK 577



 Score = 38.9 bits (89), Expect = 1.0,   Method: Composition-based stats.
 Identities = 18/41 (43%), Positives = 25/41 (60%), Gaps = 4/41 (9%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           G + Y++       FD PK  AVD  GN+YVAD+ N+ I+K
Sbjct: 68  GTKAYTE----DGAFDTPKGVAVDAAGNVYVADRGNNRIQK 104



 Score = 36.6 bits (83), Expect = 5.7,   Method: Composition-based stats.
 Identities = 29/78 (37%), Positives = 36/78 (46%), Gaps = 16/78 (20%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQIN 144
           LA  G  G  DGE     F +P S AVD  GN+YV D  N+ I+   +   GN L     
Sbjct: 112 LAKWGSHGKGDGE-----FRQPSSIAVDAAGNVYVTDVGNNRIQIFDSA--GNFL----- 159

Query: 145 LKPEDCSKSSQSGSGMFW 162
           +K E C     +G   FW
Sbjct: 160 MKWEPCG----TGDAEFW 173


>gi|428308867|ref|YP_007119844.1| thiol-disulfide isomerase-like thioredoxin [Microcoleus sp. PCC
           7113]
 gi|428250479|gb|AFZ16438.1| thiol-disulfide isomerase-like thioredoxin [Microcoleus sp. PCC
           7113]
          Length = 515

 Score = 40.0 bits (92), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 28/48 (58%), Gaps = 1/48 (2%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + + G G  G+ DG+   ARF +P   +  +  ++YVAD +NH IR I
Sbjct: 440 QTMIGQGTAGHQDGQSIKARFSEPSGLSA-IAAHLYVADTNNHAIRCI 486



 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 4/63 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQLI 140
           V+++ G G  G +DG    A+F  P+   +D +   +YVAD  NH +R++   D  NQ +
Sbjct: 213 VQQVIGTGKPGLTDGSFEEAQFFAPQGMTLDSENQLLYVADTENHCLRQV---DLKNQQV 269

Query: 141 RQI 143
           + I
Sbjct: 270 KTI 272


>gi|290975813|ref|XP_002670636.1| predicted protein [Naegleria gruberi]
 gi|284084197|gb|EFC37892.1| predicted protein [Naegleria gruberi]
          Length = 2442

 Score = 40.0 bits (92), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ + V+  +AG    G+   +G   +A  + P+S  V+ KG IYV+D  NH +RKI
Sbjct: 857 ITSDGVISLVAGSYYGGFGGDNGPAANAVLNSPESLFVNSKGEIYVSDTVNHRVRKI 913



 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 28/51 (54%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQG-YSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG G+ G Y DG   + A+   P    V   G+IY+AD  NH IRK+
Sbjct: 919 ITTVAGSGLAGFYGDGGLATNAKLSYPMGIYVSKNGDIYIADYGNHRIRKV 969


>gi|196228192|ref|ZP_03127059.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
 gi|196227595|gb|EDY22098.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
          Length = 357

 Score = 40.0 bits (92), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           +K + G G +G S D  PG  A  + PK  A+D  G++ +AD  NHVIR+
Sbjct: 248 IKTVVGTGAKGLSGDAGPGLQATMNGPKYIALDRDGSVLIADAENHVIRR 297


>gi|383100947|emb|CCD74491.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis halleri subsp. halleri]
          Length = 1028

 Score = 40.0 bits (92), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 1/56 (1%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITN 132
           E + + ++   G +G+ DG    A F++P+  A + K N +YVAD  NH +R + N
Sbjct: 600 EGNFIVQIGSTGEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALRYLLN 655



 Score = 40.0 bits (92), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 35/61 (57%), Gaps = 4/61 (6%)

Query: 79  ESVVKRL---AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           + V KR+   AG G  G+ DG+   A+  +P   A+   G ++VAD +N +IR I +L+ 
Sbjct: 803 DPVTKRVVTVAGTGKAGFKDGKVKGAQLSEPAGLAITENGRLFVADTNNSLIRYI-DLNK 861

Query: 136 G 136
           G
Sbjct: 862 G 862


>gi|302821973|ref|XP_002992647.1| hypothetical protein SELMODRAFT_7079 [Selaginella moellendorffii]
 gi|300139611|gb|EFJ06349.1| hypothetical protein SELMODRAFT_7079 [Selaginella moellendorffii]
          Length = 124

 Score = 40.0 bits (92), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 26/43 (60%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  GY DG    + F++P+S A+   G ++VAD  N  IRKI+
Sbjct: 15  GESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKIS 57


>gi|440804160|gb|ELR25038.1| Bbox zinc finger domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 690

 Score = 40.0 bits (92), Expect = 0.46,   Method: Composition-based stats.
 Identities = 15/42 (35%), Positives = 26/42 (61%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           G  V+ +  G  G  +F +P+  AVD +G+++VAD  NH ++
Sbjct: 514 GSFVRTFGSGGSGPGQFQRPRGVAVDKEGHVFVADSENHRVQ 555


>gi|372267905|ref|ZP_09503953.1| hypothetical protein AlS89_08394 [Alteromonas sp. S89]
          Length = 2227

 Score = 40.0 bits (92), Expect = 0.46,   Method: Composition-based stats.
 Identities = 18/52 (34%), Positives = 30/52 (57%), Gaps = 2/52 (3%)

Query: 81   VVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            ++  +AG G +G+S DG P + A    P    +D +GN+Y+ D  N  IR++
Sbjct: 1056 IISTVAGSGARGFSGDGGPATDASLSAPSGITLDPEGNLYIVDSGNRRIRRV 1107


>gi|219851969|ref|YP_002466401.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219546228|gb|ACL16678.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 676

 Score = 40.0 bits (92), Expect = 0.48,   Method: Composition-based stats.
 Identities = 15/35 (42%), Positives = 24/35 (68%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           PG  +F+ P+  AVD  GN+Y+AD+ N+ I+K  +
Sbjct: 86  PGDGQFNNPEGVAVDNNGNVYIADRDNNRIQKFNS 120



 Score = 38.1 bits (87), Expect = 1.6,   Method: Composition-based stats.
 Identities = 21/63 (33%), Positives = 33/63 (52%), Gaps = 2/63 (3%)

Query: 69  FYTLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
            YT   P   +S V++   +G  +  +     G  +F++P   AVD  GN+YVAD  N+ 
Sbjct: 537 LYTCGPPELIQSRVQKFTPNGTFITEWGSLGSGDGQFNRPSGVAVDSAGNVYVADSGNNR 596

Query: 127 IRK 129
           I+K
Sbjct: 597 IQK 599



 Score = 35.8 bits (81), Expect = 7.8,   Method: Composition-based stats.
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           G  +F++P   A+D  GN+YV DK N+ I+K  +
Sbjct: 134 GDGQFNQPSGVALDSAGNVYVTDKQNNRIQKFNS 167


>gi|229822379|ref|YP_002883905.1| alkyl hydroperoxide reductase [Beutenbergia cavernae DSM 12333]
 gi|229568292|gb|ACQ82143.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Beutenbergia cavernae DSM 12333]
          Length = 641

 Score = 40.0 bits (92), Expect = 0.49,   Method: Composition-based stats.
 Identities = 24/73 (32%), Positives = 33/73 (45%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S DL     + +  +TL     E+  + RLAG   +G  DG    A F +P   AV   G
Sbjct: 324 SPDLAAFVVAMAGNHTLWAFDDEQGSLTRLAGTEHEGLRDGPAAEAWFAQPSGLAVAQDG 383

Query: 116 NIYVADKSNHVIR 128
            I+VAD     +R
Sbjct: 384 RIWVADSETSALR 396


>gi|357417484|ref|YP_004930504.1| NHL repeat containing protein [Pseudoxanthomonas spadix BD-a59]
 gi|355335062|gb|AER56463.1| NHL repeat containing protein [Pseudoxanthomonas spadix BD-a59]
          Length = 474

 Score = 40.0 bits (92), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 2/55 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           V R  G+G     DGEP  A F +P   ++ ++  +YVAD  NH++R+I NL  G
Sbjct: 200 VLRRFGNGTADALDGEPEIASFCRPNGLSL-LREELYVADTGNHLLRRI-NLRSG 252


>gi|116620267|ref|YP_822423.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116223429|gb|ABJ82138.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 380

 Score = 40.0 bits (92), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 33/55 (60%), Gaps = 2/55 (3%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + +V+   AG G +GYS DG P   A  ++P     D  GN+++A+  NHV+R++
Sbjct: 98  KTNVISTAAGSGEKGYSGDGGPALGAALNEPYEVRFDRVGNMFIAEMQNHVVRRV 152



 Score = 36.6 bits (83), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 28/48 (58%), Gaps = 2/48 (4%)

Query: 85  LAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AG G  GYS DG P   A+   PK  A    G++Y+AD  +H IR++
Sbjct: 276 VAGTGESGYSGDGGPAKLAKLSGPKGIAWAPDGSLYLADTESHTIRRV 323


>gi|72094692|ref|XP_795849.1| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
           purpuratus]
          Length = 706

 Score = 40.0 bits (92), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 41/86 (47%), Gaps = 9/86 (10%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V   P  + L V D+  +    ++     + VV+   G    G++DG    ARF  P
Sbjct: 212 PGKVTTNPEGTLLAVSDTGHNRVIIVAL----DGVVQHCIGGPETGFNDGLYREARFHSP 267

Query: 107 KS--FAVDMKGNIYVADKSNHVIRKI 130
           +   +A D+   IYVAD  NH IRKI
Sbjct: 268 QGLCWAQDV---IYVADTENHAIRKI 290


>gi|308159172|gb|EFO61716.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia P15]
          Length = 967

 Score = 40.0 bits (92), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 42/86 (48%), Gaps = 6/86 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDK 105
           PHS++  P +  + + D+      TLS    +   VK LAG  GV GY DG   +  F++
Sbjct: 143 PHSIVSDPVNGIIYLTDTKNHCIRTLSLMTRK---VKTLAGTPGVFGYKDGL--NPLFNE 197

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
           P    +   GN+ V D  N  +R I+
Sbjct: 198 PLGLVLTEDGNLIVCDSKNGALRYIS 223


>gi|374855656|dbj|BAL58511.1| NHL repeat containing protein [uncultured candidate division OP1
           bacterium]
          Length = 419

 Score = 40.0 bits (92), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G  +FD+P   AVD +G IYVA++ NH I+KI
Sbjct: 385 GDGQFDQPVDLAVDPEGVIYVAEEGNHRIQKI 416


>gi|194877446|ref|XP_001973882.1| GG16549 [Drosophila erecta]
 gi|190657069|gb|EDV54282.1| GG16549 [Drosophila erecta]
          Length = 717

 Score = 40.0 bits (92), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 30/56 (53%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           L+   VV+   G    G+ DG   +ARF+ P+  A   +  + VAD  NH IR+I+
Sbjct: 252 LTGGGVVQHKIGGYQPGFVDGNSTAARFNNPQGIAFLNENTLIVADTKNHAIRQIS 307


>gi|219852041|ref|YP_002466473.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219546300|gb|ACL16750.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 930

 Score = 40.0 bits (92), Expect = 0.52,   Method: Composition-based stats.
 Identities = 26/80 (32%), Positives = 39/80 (48%), Gaps = 11/80 (13%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLK 146
           G  ++ +     G  +F  P   AVD  GN+YVAD  N+ I+K T++      I+Q    
Sbjct: 76  GTFIKKWGSSGSGDGQFSSPSGVAVDSAGNVYVADTGNNRIQKFTSM---GIFIKQWG-- 130

Query: 147 PEDCSKSSQSGSGMFWVTVF 166
                 SS SG+G F+ + F
Sbjct: 131 ------SSGSGNGQFFSSPF 144


>gi|430760419|ref|YP_007216276.1| YD repeat protein [Thioalkalivibrio nitratireducens DSM 14787]
 gi|430010043|gb|AGA32795.1| YD repeat protein [Thioalkalivibrio nitratireducens DSM 14787]
          Length = 2395

 Score = 40.0 bits (92), Expect = 0.53,   Method: Composition-based stats.
 Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 8/79 (10%)

Query: 60   IVLDSSRSAFYTLS------FPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAV 111
            + +D+  S F+T        F  + +  V  +AG+G + ++ DG P  +A F  P+  AV
Sbjct: 1125 VAVDAGGSVFFTRRNGGPRIFRANPDGTVHHVAGNGSRTFAGDGGPAVNASFIDPRGIAV 1184

Query: 112  DMKGNIYVADKSNHVIRKI 130
               G++YV D   H +R+I
Sbjct: 1185 GADGSLYVGDFRTHRVRRI 1203


>gi|398337816|ref|ZP_10522521.1| hypothetical protein LkmesMB_21054 [Leptospira kmetyi serovar
           Malaysia str. Bejo-Iso9]
          Length = 359

 Score = 40.0 bits (92), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 33/69 (47%)

Query: 62  LDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
           LD      Y     ++   VV   AG G     DG+  +A F  P    +D  GNIYV++
Sbjct: 37  LDCILKECYLCRMKVTGNPVVTLFAGTGTASSVDGDISTATFQNPFGIELDSFGNIYVSE 96

Query: 122 KSNHVIRKI 130
           ++ + IRKI
Sbjct: 97  QNVNKIRKI 105


>gi|290971645|ref|XP_002668597.1| predicted protein [Naegleria gruberi]
 gi|284082068|gb|EFC35853.1| predicted protein [Naegleria gruberi]
          Length = 223

 Score = 40.0 bits (92), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S+  +++ +AG+G +G+S  DG    A+ + P S  V+  G IY+ D +N  IRK+
Sbjct: 86  VSKSGIIETIAGNGNEGFSGDDGLATQAQLNCPMSTFVNSNGEIYITDSNNFRIRKV 142


>gi|290978083|ref|XP_002671766.1| predicted protein [Naegleria gruberi]
 gi|284085337|gb|EFC39022.1| predicted protein [Naegleria gruberi]
          Length = 258

 Score = 39.7 bits (91), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S+  +++ +AG+G +G+S  DG    A+ + P S  V+  G IY+ D +N  IRK+
Sbjct: 198 VSKSGIIETIAGNGNEGFSGDDGLATQAQLNCPMSTFVNSNGEIYITDSNNFRIRKV 254


>gi|425444428|ref|ZP_18824479.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9443]
 gi|389735830|emb|CCI00727.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9443]
          Length = 358

 Score = 39.7 bits (91), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 16/30 (53%), Positives = 20/30 (66%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           G  +FD P   AVD  GNIYVAD+ NH ++
Sbjct: 245 GDGQFDDPVGIAVDSGGNIYVADRDNHRVQ 274


>gi|338536093|ref|YP_004669427.1| putative lipoprotein [Myxococcus fulvus HW-1]
 gi|337262189|gb|AEI68349.1| putative lipoprotein [Myxococcus fulvus HW-1]
          Length = 855

 Score = 39.7 bits (91), Expect = 0.56,   Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 26/50 (52%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           V  LAG+G  G++DG    A    P    V   G +YVAD  N ++R IT
Sbjct: 805 VTTLAGNGRYGHADGPGDKAELVLPAGLTVGPDGRLYVADAGNGLVRAIT 854



 Score = 36.6 bits (83), Expect = 4.8,   Method: Composition-based stats.
 Identities = 36/104 (34%), Positives = 51/104 (49%), Gaps = 8/104 (7%)

Query: 27  LLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLA 86
           L+ +G+ VTT + G  L   P  V      S L V++S  S    +   L+  SVV   A
Sbjct: 699 LVAEGHPVTT-LQGDWL-YRPSGVAVSADGSTLYVVESGMSRVVRIRDGLT--SVV---A 751

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G    GY DG P SA+F      AV   G++ VAD  N+ +R++
Sbjct: 752 GT-TPGYRDGAPTSAQFLPYLGIAVLKDGSLAVADPGNYRVRRV 794


>gi|408792280|ref|ZP_11203890.1| concanavalin A-like lectin/glucanases family protein [Leptospira
           meyeri serovar Hardjo str. Went 5]
 gi|408463690|gb|EKJ87415.1| concanavalin A-like lectin/glucanases family protein [Leptospira
           meyeri serovar Hardjo str. Went 5]
          Length = 1234

 Score = 39.7 bits (91), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 32/55 (58%), Gaps = 1/55 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           L+  + V  +AGDGVQ  +DG   +A+F+ P     D   N+++++ S H IR +
Sbjct: 475 LTGTNAVDTIAGDGVQASTDGIGTAAQFNDPHGLTTDGT-NLFISEWSGHKIRHL 528


>gi|290971764|ref|XP_002668649.1| predicted protein [Naegleria gruberi]
 gi|284082135|gb|EFC35905.1| predicted protein [Naegleria gruberi]
          Length = 556

 Score = 39.7 bits (91), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 53/109 (48%), Gaps = 26/109 (23%)

Query: 48  HSVIDRPGSSDLIVLDS-----SRSAFYT-LSFPLSEESV---------VKRLAGD---- 88
           +S++DR   + +I LD+     + ++ YT +S   SE S          +K+L       
Sbjct: 338 NSLVDRVAGTGIIELDTDGKLGNLTSLYTPISVTQSESSSDIYIGTTIGIKKLTDSNKLI 397

Query: 89  ----GVQGYS--DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
               G Q YS  DG P +A R   P+  A+   G ++++DK +H IRKI
Sbjct: 398 SSFAGSQAYSSGDGYPATAARLSSPEGIAISTSGEVFISDKGSHTIRKI 446


>gi|290995458|ref|XP_002680312.1| predicted protein [Naegleria gruberi]
 gi|284093932|gb|EFC47568.1| predicted protein [Naegleria gruberi]
          Length = 744

 Score = 39.7 bits (91), Expect = 0.57,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 32/55 (58%), Gaps = 4/55 (7%)

Query: 85  LAGDGVQGYSDGE-PG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           +AG+G +GY D   P  +A  ++P S  V     +Y+AD  NH IRKI  L +GN
Sbjct: 177 IAGNGKEGYGDDNVPAVNAMLNRPSSVFVTSNNEVYIADTGNHRIRKI--LRNGN 229



 Score = 37.0 bits (84), Expect = 3.6,   Method: Composition-based stats.
 Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 4/55 (7%)

Query: 85  LAGDGVQGYSDGE--PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           +AG+G+ G+ +      +   + PKS  V   G +Y+AD  NH IRKI  L +GN
Sbjct: 233 IAGNGLVGFHEDNILATNTSLNFPKSVFVSSNGEVYIADTGNHRIRKI--LANGN 285


>gi|188501588|gb|ACD54714.1| NHL repeat containing hypothetical protein RRC374-like protein
           [Adineta vaga]
          Length = 493

 Score = 39.7 bits (91), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 50/108 (46%), Gaps = 14/108 (12%)

Query: 28  LEDGYTVTTVIDGHQLEINPHSVIDRP------GSSDLIVLDSSRSAFYTLSFPLSEESV 81
           L++G  V T I G  +  N    +  P       SS++ V+D+     Y +       +V
Sbjct: 236 LQEGSLVGTTIAGTGIAGNSAGQLKIPLDLFVDSSSNIYVVDNDN---YRVMLWRKNATV 292

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
             ++AG G QG +     ++ F +P    VD  GN+YV+D + H + K
Sbjct: 293 GIKVAGSGSQGST-----ASTFREPNGITVDSMGNVYVSDFTTHRVMK 335


>gi|223940572|ref|ZP_03632418.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223890764|gb|EEF57279.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 791

 Score = 39.7 bits (91), Expect = 0.58,   Method: Composition-based stats.
 Identities = 20/40 (50%), Positives = 23/40 (57%), Gaps = 2/40 (5%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           V G   G PG   F  P S A D KGN+YVAD  NH ++K
Sbjct: 736 VLGGVGGAPG--EFSNPWSIAFDSKGNLYVADSQNHRVQK 773


>gi|302535478|ref|ZP_07287820.1| NHL repeat containing protein [Streptomyces sp. C]
 gi|302444373|gb|EFL16189.1| NHL repeat containing protein [Streptomyces sp. C]
          Length = 612

 Score = 39.7 bits (91), Expect = 0.58,   Method: Composition-based stats.
 Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 23/107 (21%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S +L+V DS+R     L      E+VV+R+ G G +G+     G   F +P+  A+   G
Sbjct: 198 SGNLLVSDSTRHQLVELE--ADGETVVRRI-GSGERGF-----GPDAFSEPQGLALLPDG 249

Query: 116 NIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFW 162
            + VAD  NH +R                L PE  +  + +G+G  W
Sbjct: 250 RVVVADTVNHALRV---------------LDPETGAVETVAGTGRQW 281


>gi|224125014|ref|XP_002319481.1| predicted protein [Populus trichocarpa]
 gi|222857857|gb|EEE95404.1| predicted protein [Populus trichocarpa]
          Length = 1065

 Score = 39.7 bits (91), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
           + ++   G +G  DG    A F++P+  A + K NI YVAD  NH +R++
Sbjct: 610 IAQIGSSGEEGLRDGSFDDATFNRPQGLAYNSKKNILYVADTENHALREV 659



 Score = 36.6 bits (83), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 5/76 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           V  +AG G  G+ DG+  +A+  +P        G + +AD +N VIR + +L+ G   + 
Sbjct: 846 VTTIAGTGKAGFKDGKALTAQLSEPAGLIEAENGRLIIADTNNSVIRYL-DLNKGEAELL 904

Query: 142 QINLK----PEDCSKS 153
            + LK    P   SKS
Sbjct: 905 TLELKGVQPPASKSKS 920


>gi|162448821|ref|YP_001611188.1| NHL repeat-containing protein [Sorangium cellulosum So ce56]
 gi|161159403|emb|CAN90708.1| hypothetical protein with NHL repeat [Sorangium cellulosum So ce56]
          Length = 1889

 Score = 39.7 bits (91), Expect = 0.60,   Method: Composition-based stats.
 Identities = 16/38 (42%), Positives = 23/38 (60%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           G  DG   +ARF +P     D +GN+Y+AD  NH +R+
Sbjct: 949 GNVDGVGTTARFAEPSGIVADGRGNLYIADAFNHSLRR 986



 Score = 37.4 bits (85), Expect = 3.0,   Method: Composition-based stats.
 Identities = 14/42 (33%), Positives = 27/42 (64%)

Query: 80   SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVAD 121
            S+V  +AG+G  G +DG   +AR   P++ A+D  G++++ +
Sbjct: 992  SMVTTVAGNGTAGSADGVGTAARLTSPQALAIDGSGSVFIGE 1033


>gi|423214678|ref|ZP_17201206.1| hypothetical protein HMPREF1074_02738 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692584|gb|EIY85821.1| hypothetical protein HMPREF1074_02738 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 593

 Score = 39.7 bits (91), Expect = 0.61,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 36/60 (60%), Gaps = 1/60 (1%)

Query: 72  LSFPLSEES-VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LSF   + S  V  +AG+     +DG+  +A F  P+  A+   G+I++A++ N+VIRK+
Sbjct: 281 LSFTYEDLSYTVATVAGNSATTSTDGKGTAASFKFPQGLALAPNGDIWIAERGNNVIRKM 340



 Score = 37.7 bits (86), Expect = 2.2,   Method: Composition-based stats.
 Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 3/54 (5%)

Query: 79  ESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +K +AG  G +G  DG   +A F+ P S  V  K  +Y+AD+ NH+IR IT
Sbjct: 539 KGTLKTIAGIPGTKGKIDGSALTATFNCPASVLVSDK--VYIADEQNHLIRTIT 590


>gi|290976760|ref|XP_002671107.1| predicted protein [Naegleria gruberi]
 gi|284084673|gb|EFC38363.1| predicted protein [Naegleria gruberi]
          Length = 1349

 Score = 39.7 bits (91), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 30/54 (55%), Gaps = 2/54 (3%)

Query: 79  ESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            S +  +AG+G  GYS DG P  +A+ + P    V   G +Y+AD  N  +RK+
Sbjct: 553 NSQLSTVAGNGYAGYSGDGGPARNAKLNYPADVTVSSTGEVYIADWGNRAVRKV 606


>gi|294644532|ref|ZP_06722286.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
 gi|294805794|ref|ZP_06764671.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
 gi|336403611|ref|ZP_08584323.1| hypothetical protein HMPREF0127_01636 [Bacteroides sp. 1_1_30]
 gi|345510783|ref|ZP_08790344.1| hypothetical protein BSAG_00302 [Bacteroides sp. D1]
 gi|229442801|gb|EEO48592.1| hypothetical protein BSAG_00302 [Bacteroides sp. D1]
 gi|292640085|gb|EFF58349.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
 gi|294447015|gb|EFG15605.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
 gi|335945537|gb|EGN07348.1| hypothetical protein HMPREF0127_01636 [Bacteroides sp. 1_1_30]
          Length = 593

 Score = 39.7 bits (91), Expect = 0.61,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 36/60 (60%), Gaps = 1/60 (1%)

Query: 72  LSFPLSEES-VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LSF   + S  V  +AG+     +DG+  +A F  P+  A+   G+I++A++ N+VIRK+
Sbjct: 281 LSFTYEDLSYTVATVAGNSATTSTDGKGTAASFKFPQGLALAPNGDIWIAERGNNVIRKM 340



 Score = 37.7 bits (86), Expect = 2.2,   Method: Composition-based stats.
 Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 3/54 (5%)

Query: 79  ESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +K +AG  G +G  DG   +A F+ P S  V  K  +Y+AD+ NH+IR IT
Sbjct: 539 KGTLKTIAGIPGTKGKIDGSALTATFNCPASVLVSDK--VYIADEQNHLIRTIT 590


>gi|157112324|ref|XP_001657496.1| hypothetical protein AaeL_AAEL000965 [Aedes aegypti]
 gi|108883769|gb|EAT47994.1| AAEL000965-PA [Aedes aegypti]
          Length = 812

 Score = 39.7 bits (91), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 1/50 (2%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ R+ G    G+ DG+   ARF+ P+  A      ++VAD  NH IRKI
Sbjct: 345 ILYRIGGKRA-GFVDGDFRKARFNAPQGVAFQNDTTLFVADNENHAIRKI 393


>gi|402571883|ref|YP_006621226.1| hypothetical protein Desmer_1348 [Desulfosporosinus meridiei DSM
           13257]
 gi|402253080|gb|AFQ43355.1| conserved repeat protein [Desulfosporosinus meridiei DSM 13257]
          Length = 1977

 Score = 39.7 bits (91), Expect = 0.61,   Method: Composition-based stats.
 Identities = 25/66 (37%), Positives = 34/66 (51%), Gaps = 3/66 (4%)

Query: 77  SEESVVKRLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           S   + KR AG GV   +     G+ +F  P   AVD  GN+YVAD  N  I+K  +  +
Sbjct: 112 SNHRIQKRNAGTGVWTTWGSSGSGNKQFSSPSGVAVDADGNVYVADTFNSRIQKFDS--N 169

Query: 136 GNQLIR 141
           GN L +
Sbjct: 170 GNYLTQ 175



 Score = 38.9 bits (89), Expect = 1.0,   Method: Composition-based stats.
 Identities = 17/31 (54%), Positives = 21/31 (67%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           G+ +F  P   AVD  GN+YVAD SNH I+K
Sbjct: 88  GAGKFYSPAGVAVDGAGNVYVADTSNHRIQK 118



 Score = 38.5 bits (88), Expect = 1.2,   Method: Composition-based stats.
 Identities = 18/35 (51%), Positives = 22/35 (62%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           G  +F    S AVD   NIYVAD  NH I+++TNL
Sbjct: 556 GDGQFSGIYSLAVDRNDNIYVADSQNHRIQRLTNL 590


>gi|392945711|ref|ZP_10311353.1| serine/threonine protein kinase [Frankia sp. QA3]
 gi|392289005|gb|EIV95029.1| serine/threonine protein kinase [Frankia sp. QA3]
          Length = 859

 Score = 39.7 bits (91), Expect = 0.61,   Method: Composition-based stats.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S   V+  +AG G  GYS DG P +A + +  +       G++Y +D  NH IRKI
Sbjct: 622 ISPAGVISTIAGTGTAGYSGDGGPATAAQLNNAEQVTTGPDGSVYFSDYENHRIRKI 678


>gi|440733650|ref|ZP_20913345.1| nhl repeat protein [Xanthomonas translucens DAR61454]
 gi|440359672|gb|ELP96967.1| nhl repeat protein [Xanthomonas translucens DAR61454]
          Length = 473

 Score = 39.7 bits (91), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           G G   + DG  G A F +P+  A++ +G +YVAD  NH +R+I  L
Sbjct: 204 GLGTADFIDGGIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLL 249


>gi|167041535|gb|ABZ06284.1| putative NHL repeat protein [uncultured marine microorganism
           HF4000_008B14]
          Length = 426

 Score = 39.7 bits (91), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 2/53 (3%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           + G+G  G  DG    A+F +P   ++D  G IY+AD +NH IR + +L+ G 
Sbjct: 367 MLGNGQPGDRDGPGNQAQFSEPSGLSID-GGKIYIADTNNHAIR-VADLESGE 417



 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 5/84 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+    S+ L + DS+ +     +     +    ++ G G  G  DG   +A FD P
Sbjct: 111 PGKVVADADSNRLFIADSNHNRIIVTTL----DGDALKVVGSGEPGMVDGSFAAASFDHP 166

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
           +   + +   +YVAD  NH IRK+
Sbjct: 167 QGMVL-VGDLLYVADAENHAIRKV 189


>gi|290992324|ref|XP_002678784.1| predicted protein [Naegleria gruberi]
 gi|284092398|gb|EFC46040.1| predicted protein [Naegleria gruberi]
          Length = 2331

 Score = 39.7 bits (91), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 34/51 (66%), Gaps = 1/51 (1%)

Query: 82  VKRLAGDGVQGYSDG-EPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +  +AG+G  GYSD  +P  A F+   S  ++  G++Y++D +N+VIRK++
Sbjct: 509 ISSVAGNGYLGYSDNTQPLRASFNLAVSVLINNVGDMYISDYANNVIRKMS 559


>gi|302834543|ref|XP_002948834.1| hypothetical protein VOLCADRAFT_80356 [Volvox carteri f.
           nagariensis]
 gi|300266025|gb|EFJ50214.1| hypothetical protein VOLCADRAFT_80356 [Volvox carteri f.
           nagariensis]
          Length = 774

 Score = 39.7 bits (91), Expect = 0.63,   Method: Composition-based stats.
 Identities = 26/83 (31%), Positives = 39/83 (46%), Gaps = 3/83 (3%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           +P +V+     S + V DS       L+ P + E V   LAG G  G+ DG   +A+F +
Sbjct: 498 HPLAVLSSADGSAIYVADSYNHRIKALN-PNTNEIVT--LAGSGAAGFRDGVGTAAQFSE 554

Query: 106 PKSFAVDMKGNIYVADKSNHVIR 128
           P        G I +AD +N  +R
Sbjct: 555 PAGLCRGPNGTILIADTNNSAVR 577


>gi|206602867|gb|EDZ39348.1| Conserved protein of unknown function [Leptospirillum sp. Group II
           '5-way CG']
          Length = 374

 Score = 39.7 bits (91), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFD 104
           P  +   P    L V DS  +A   +       + V  LAG G  + G +DG    A F 
Sbjct: 174 PRGLAISPDGKTLYVADSGNNAIRKIDL---ATNTVTTLAGAGKLMSGSADGVGVQATFH 230

Query: 105 KPKSFAVDMKGNI-YVADKSNHVIRKI 130
           +P+S A+   G + Y+AD  N++IRK+
Sbjct: 231 EPRSLAISGDGQVLYIADTRNNLIRKM 257



 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
           Y DG   +ARF+ P+  A+   G  +Y+AD  N++IRKI
Sbjct: 105 YRDGVGANARFNNPEGMAISPDGKTLYIADSRNNMIRKI 143



 Score = 36.2 bits (82), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 35/86 (40%), Gaps = 4/86 (4%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP  +   P    L + DS  +    +         V  +AG       DG    A F+ 
Sbjct: 117 NPEGMAISPDGKTLYIADSRNNMIRKIDL---ATKTVSTIAGHSFPSSGDGVGKEAGFET 173

Query: 106 PKSFAVDMKG-NIYVADKSNHVIRKI 130
           P+  A+   G  +YVAD  N+ IRKI
Sbjct: 174 PRGLAISPDGKTLYVADSGNNAIRKI 199


>gi|433676068|ref|ZP_20508221.1| RING finger protein nhl-1 [Xanthomonas translucens pv. translucens
           DSM 18974]
 gi|430818813|emb|CCP38486.1| RING finger protein nhl-1 [Xanthomonas translucens pv. translucens
           DSM 18974]
          Length = 473

 Score = 39.7 bits (91), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           G G   + DG  G A F +P+  A++ +G +YVAD  NH +R+I  L
Sbjct: 204 GLGTADFIDGGIGEAAFHRPRGLALE-RGVLYVADTGNHALRRINLL 249


>gi|380692796|ref|ZP_09857655.1| hypothetical protein BfaeM_02286 [Bacteroides faecis MAJ27]
          Length = 465

 Score = 39.7 bits (91), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 37/79 (46%), Gaps = 2/79 (2%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE-PGSARFDKPKSFAVDMKGN 116
           D   +D       TL+     +    R  GDG +G++DG+    ARF+ P S   D K  
Sbjct: 387 DFYFVDKQNHCVRTLTPTGRVKIYAGRPNGDGTKGFNDGDLRKEARFNYPASIVWDEKRE 446

Query: 117 -IYVADKSNHVIRKITNLD 134
            + V D +NH IRKI   D
Sbjct: 447 CLLVGDSNNHRIRKIAMED 465


>gi|160883851|ref|ZP_02064854.1| hypothetical protein BACOVA_01824 [Bacteroides ovatus ATCC 8483]
 gi|156110581|gb|EDO12326.1| NHL repeat protein [Bacteroides ovatus ATCC 8483]
          Length = 494

 Score = 39.7 bits (91), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 39/96 (40%), Gaps = 18/96 (18%)

Query: 64  SSRSAFYTLSFPLSEESVVKRLAGDGV----------------QGYSDGE-PGSARFDKP 106
           + +   Y   F   +   +++L  DG+                 GY DG     ARFD P
Sbjct: 387 AGKDEIYDFYFTDRDNHCIRKLTPDGITSTFAGRGSVGMNIHANGYVDGALREEARFDSP 446

Query: 107 KSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIR 141
            + A D   NI YV D +NH IRKI        LIR
Sbjct: 447 AALAYDEVNNIFYVGDVNNHRIRKIALEKIPEDLIR 482


>gi|186683945|ref|YP_001867141.1| redoxin domain-containing protein [Nostoc punctiforme PCC 73102]
 gi|186466397|gb|ACC82198.1| Redoxin domain protein [Nostoc punctiforme PCC 73102]
          Length = 509

 Score = 39.7 bits (91), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 16/48 (33%), Positives = 30/48 (62%), Gaps = 1/48 (2%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + + GDG  G  DG+  ++RF +P   +++    +Y+AD +NH IR++
Sbjct: 432 QTIQGDGTAGLQDGQGKNSRFFEPSGLSIN-GSYLYIADTNNHAIRRV 478



 Score = 39.7 bits (91), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 35/76 (46%), Gaps = 5/76 (6%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
            + L + DS        SF    +  +  L G G  G +DG    A+F  P+  A D + 
Sbjct: 184 QTGLFIADSGHHRLIMSSF----DGEILHLIGTGKSGLTDGAFNEAQFFAPQGMAYDAEN 239

Query: 116 NI-YVADKSNHVIRKI 130
            I YVAD  NH +R++
Sbjct: 240 QILYVADTENHTLRRV 255


>gi|423072880|ref|ZP_17061629.1| repeat protein [Desulfitobacterium hafniense DP7]
 gi|361856375|gb|EHL08282.1| repeat protein [Desulfitobacterium hafniense DP7]
          Length = 2017

 Score = 39.7 bits (91), Expect = 0.67,   Method: Composition-based stats.
 Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 5/41 (12%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           G  G  +GE     F++P   AVD  GNIYVAD +NH I+K
Sbjct: 956 GSNGSDEGE-----FNRPYGIAVDSDGNIYVADSNNHRIQK 991



 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 18/31 (58%), Positives = 21/31 (67%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           G  +F  PKS AVD  GNIYVAD +N  I+K
Sbjct: 772 GDGQFRNPKSVAVDNDGNIYVADTTNKRIQK 802



 Score = 38.1 bits (87), Expect = 1.9,   Method: Composition-based stats.
 Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 5/45 (11%)

Query: 85   LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            LA  G +G S G+     FD P   AVD  GNIYVAD +N+ ++K
Sbjct: 1189 LAKWGTRGNSAGQ-----FDNPSGIAVDNDGNIYVADTNNNRVQK 1228



 Score = 37.7 bits (86), Expect = 2.2,   Method: Composition-based stats.
 Identities = 18/47 (38%), Positives = 26/47 (55%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           AG  + G+        +F  P+S AVD  GNIYV+D  N+ I+K  +
Sbjct: 254 AGSFIMGWGGAGIDDGQFATPRSIAVDSAGNIYVSDSMNNRIQKFAS 300



 Score = 35.8 bits (81), Expect = 8.2,   Method: Composition-based stats.
 Identities = 15/35 (42%), Positives = 22/35 (62%)

Query: 99   GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
            G  +F+ PK  AVD  GN+YVAD  N  ++K  ++
Sbjct: 1008 GLGQFNSPKGIAVDSNGNVYVADIENDRVQKFDSM 1042


>gi|288923318|ref|ZP_06417452.1| serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288345334|gb|EFC79729.1| serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 849

 Score = 39.7 bits (91), Expect = 0.67,   Method: Composition-based stats.
 Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAG-DGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  + ++  LAG DG   + DG P S A    P + A+D  G +Y+AD SN+ IR+I
Sbjct: 678 VGSDGIITTLAGQDGEGSFGDGGPASKALLAFPLAVALDRFGRLYIADTSNNRIRRI 734


>gi|290994196|ref|XP_002679718.1| predicted protein [Naegleria gruberi]
 gi|284093336|gb|EFC46974.1| predicted protein [Naegleria gruberi]
          Length = 762

 Score = 39.7 bits (91), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 32/55 (58%), Gaps = 2/55 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           E   +  +AG+G  G+S  DG   +A+ + P S  V  K  +++AD+ NH IRKI
Sbjct: 67  ESGRIVTIAGNGKGGFSGDDGLATNAQLNCPSSVFVSNKNEVFIADQYNHRIRKI 121



 Score = 39.3 bits (90), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 35/62 (56%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   V  +AG G QG+S DG P + A+   P    V +   IY AD +N+ IRK+  L++
Sbjct: 283 ENGNVVTVAGTGQQGFSGDGGPATRAQLACPTGLFVTLNNEIYFADSANNRIRKV--LEN 340

Query: 136 GN 137
           GN
Sbjct: 341 GN 342



 Score = 36.2 bits (82), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 4/61 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G +G+S  +G   SAR + P S  V     +Y  D +N+ +RKI  L++
Sbjct: 123 ESGRIVTIAGNGEEGFSGDNGPATSARLNCPMSIFVSNMNEVYFVDSNNNRVRKI--LEN 180

Query: 136 G 136
           G
Sbjct: 181 G 181


>gi|354612773|ref|ZP_09030715.1| NHL repeat containing protein [Saccharomonospora paurometabolica
           YIM 90007]
 gi|353222911|gb|EHB87206.1| NHL repeat containing protein [Saccharomonospora paurometabolica
           YIM 90007]
          Length = 436

 Score = 39.7 bits (91), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 2/61 (3%)

Query: 77  SEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           + +  +  +AG G +G S DG P + A    P   AVD  GN+++ D  +H +RK++  D
Sbjct: 208 ATDHTISTVAGTGREGDSGDGGPATDAELRFPNCVAVDGHGNVFLTDPRSHRVRKVSATD 267

Query: 135 H 135
           H
Sbjct: 268 H 268



 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 14/30 (46%), Positives = 19/30 (63%)

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           P S AVD  GN+Y  D  NH +R+++  DH
Sbjct: 296 PNSLAVDGAGNLYFGDTGNHRVRRVSAADH 325



 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 29/53 (54%), Gaps = 2/53 (3%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           +AG G  G S DG P   A    P + A+D  GN+Y+ D  +  +RK++  DH
Sbjct: 159 VAGTGQNGDSGDGGPAVDAELRHPTTVALDRHGNVYLTDPESRRVRKVSATDH 211


>gi|298388124|ref|ZP_06997668.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|298259116|gb|EFI01996.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 435

 Score = 39.7 bits (91), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 26/45 (57%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           AG+  +G  +G P  + F +P+       G IYVAD  NHVIRKI
Sbjct: 388 AGNRSEGLVNGLPLKSSFRRPEGLTRSKDGVIYVADHDNHVIRKI 432



 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 26/39 (66%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +++G    ARF++P     D  GN+YV+D+ N+ IRKIT
Sbjct: 341 FANGIGEEARFNQPCQGVFDEDGNLYVSDRDNNCIRKIT 379


>gi|383111216|ref|ZP_09932033.1| hypothetical protein BSGG_2316 [Bacteroides sp. D2]
 gi|313694781|gb|EFS31616.1| hypothetical protein BSGG_2316 [Bacteroides sp. D2]
          Length = 419

 Score = 39.7 bits (91), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 61/140 (43%), Gaps = 21/140 (15%)

Query: 56  SSDLIVLDSSRSAFYTLS-----FPLSEESVVKRLAGDGVQG--YSDGEPG---SARFDK 105
           S + IV D +++     S       +S +   K +AGDGV+G  Y DG+PG   SA+   
Sbjct: 270 SPNYIVFDKNKNMIVGTSNGYVLIQISPDGTQKTIAGDGVKGQEYYDGDPGNPLSAKVGA 329

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT---NLDHGNQLIRQINLKPEDCSKSSQSGSGMFW 162
               A    G +Y++D + + IRK+T   + D+    +  I        KS  S      
Sbjct: 330 TFGVAAGSDGCLYLSDNTYNCIRKLTPDASGDYSKGTLETI----AGSGKSGFSDGKGLK 385

Query: 163 VTVFSPYLI----SCKSFWV 178
            T   PY I     CK+ +V
Sbjct: 386 ATFNQPYEIIITKDCKTMYV 405


>gi|452910311|ref|ZP_21958992.1| hypothetical protein C884_01866 [Kocuria palustris PEL]
 gi|452834558|gb|EME37358.1| hypothetical protein C884_01866 [Kocuria palustris PEL]
          Length = 632

 Score = 39.3 bits (90), Expect = 0.70,   Method: Composition-based stats.
 Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 7/56 (12%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-------NIYVADKSNHVIRKI 130
           V+ + GDG +G +DG P  A F++P+  A+  +G       ++ VAD  NH +R +
Sbjct: 207 VRSVIGDGTRGPADGGPEEAHFNEPRGLALLPQGVREQVGYDVIVADTVNHRLRGV 262


>gi|29348891|ref|NP_812394.1| hypothetical protein BT_3482 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383125072|ref|ZP_09945731.1| hypothetical protein BSIG_5391 [Bacteroides sp. 1_1_6]
 gi|29340797|gb|AAO78588.1| conserved hypothetical protein containing NHL repeat [Bacteroides
           thetaiotaomicron VPI-5482]
 gi|382983525|gb|EES65521.2| hypothetical protein BSIG_5391 [Bacteroides sp. 1_1_6]
          Length = 513

 Score = 39.3 bits (90), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 13/83 (15%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-------GYSDGE-PGSARFDKPKSF 109
           D  + D    A   L+ PL +   V   AG G         GY +G+    ARF++PK+ 
Sbjct: 429 DFYIADKHNHAIRILT-PLGQ---VSTFAGRGSASLNSNPWGYVNGDLRKEARFERPKAI 484

Query: 110 AVDMK-GNIYVADKSNHVIRKIT 131
           A D K G +YV D  NH IRKI 
Sbjct: 485 AYDEKTGIVYVGDAYNHRIRKIA 507


>gi|290986797|ref|XP_002676110.1| predicted protein [Naegleria gruberi]
 gi|284089710|gb|EFC43366.1| predicted protein [Naegleria gruberi]
          Length = 1218

 Score = 39.3 bits (90), Expect = 0.72,   Method: Composition-based stats.
 Identities = 21/50 (42%), Positives = 32/50 (64%), Gaps = 4/50 (8%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           + G GV+GY+  D     A+ +KP S +VD  G+IY++D S   +RKI+N
Sbjct: 675 ICGTGVEGYNGDDILATDAQLNKPTSVSVDQNGDIYISDNSR--LRKISN 722


>gi|424869138|ref|ZP_18292858.1| uncharacterized protein [Leptospirillum sp. Group II 'C75']
 gi|124514239|gb|EAY55753.1| conserved protein of unknown function [Leptospirillum rubarum]
 gi|387220840|gb|EIJ75456.1| uncharacterized protein [Leptospirillum sp. Group II 'C75']
          Length = 389

 Score = 39.3 bits (90), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFD 104
           P  +   P    L V DS  +A   +       + V  LAG G  + G +DG    A F 
Sbjct: 189 PRGLAISPDGKTLYVADSGNNAIRKIDL---ATNTVTTLAGAGKLMSGSADGVGVQATFH 245

Query: 105 KPKSFAVDMKGNI-YVADKSNHVIRKI 130
           +P+S A+   G + Y+AD  N++IRK+
Sbjct: 246 EPRSLAISGDGQVLYIADTRNNLIRKM 272



 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
           Y DG   +ARF+ P+  A+   G  +YVAD  N++IRKI
Sbjct: 120 YRDGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKI 158



 Score = 36.6 bits (83), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 35/86 (40%), Gaps = 4/86 (4%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP  +   P    L V DS  +    +         V  +AG       DG    A F+ 
Sbjct: 132 NPEGMAISPDGKTLYVADSRNNMIRKIDL---ATKTVSTIAGHSFPSSGDGVGKEAGFET 188

Query: 106 PKSFAVDMKG-NIYVADKSNHVIRKI 130
           P+  A+   G  +YVAD  N+ IRKI
Sbjct: 189 PRGLAISPDGKTLYVADSGNNAIRKI 214



 Score = 35.8 bits (81), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 17/41 (41%), Positives = 23/41 (56%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +G  DG    ARF+ P+       G IY+AD  N +IRKI+
Sbjct: 47  RGAIDGPGREARFEFPQGILAAPDGTIYIADTGNDMIRKIS 87


>gi|350561579|ref|ZP_08930417.1| YD repeat protein [Thioalkalivibrio thiocyanoxidans ARh 4]
 gi|349780611|gb|EGZ34929.1| YD repeat protein [Thioalkalivibrio thiocyanoxidans ARh 4]
          Length = 2546

 Score = 39.3 bits (90), Expect = 0.73,   Method: Composition-based stats.
 Identities = 24/79 (30%), Positives = 40/79 (50%), Gaps = 8/79 (10%)

Query: 60   IVLDSSRSAFYTLS------FPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAV 111
            + +D+  S F+T        F  + +  V  +AG+G   ++ DG P  +A F  P+  AV
Sbjct: 1276 VAVDAGGSVFFTRRNGGPRIFRANPDGTVHHVAGNGSSTFAGDGGPAVNASFIDPRGIAV 1335

Query: 112  DMKGNIYVADKSNHVIRKI 130
               G++YV D   H +R+I
Sbjct: 1336 GADGSLYVGDFRTHRVRRI 1354


>gi|20092638|ref|NP_618713.1| cell surface protein [Methanosarcina acetivorans C2A]
 gi|19917919|gb|AAM07193.1| cell surface protein [Methanosarcina acetivorans C2A]
          Length = 526

 Score = 39.3 bits (90), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 17/41 (41%), Positives = 25/41 (60%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +  G  G+ +F  P   AVD  GNIYVAD +NH ++K+
Sbjct: 84  IDKWGSGGSGNGQFYTPNGVAVDSMGNIYVADYNNHRVQKL 124


>gi|290980328|ref|XP_002672884.1| predicted protein [Naegleria gruberi]
 gi|284086464|gb|EFC40140.1| predicted protein [Naegleria gruberi]
          Length = 1380

 Score = 39.3 bits (90), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 43/94 (45%), Gaps = 5/94 (5%)

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMK 114
           SD  +  S+ +    L    +    ++ + GDG  GYS D  P  S+    PK+F V   
Sbjct: 845 SDQTIYFSTNTKIRVLELAQNGSYYLRTIVGDGNGGYSGDNLPALSSSVQNPKAFHVKKN 904

Query: 115 GNIYVADKSNHVIRKITNLDHGNQLIRQINLKPE 148
           G+I   D+ N++IRK         L++ I  KP 
Sbjct: 905 GDIVFVDEGNYLIRKFV---KATGLLKNIAGKPR 935


>gi|219851898|ref|YP_002466330.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219546157|gb|ACL16607.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 387

 Score = 39.3 bits (90), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 16/29 (55%), Positives = 20/29 (68%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           FDKP   AVD  GN+YVAD  N+ ++K T
Sbjct: 233 FDKPSGVAVDSAGNVYVADWGNNWVQKFT 261



 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
              F+KP   AVD  GN+YVAD +N+ I+K T
Sbjct: 136 GGAFNKPGGVAVDSVGNVYVADSANNQIQKFT 167


>gi|410478931|ref|YP_006766568.1| hypothetical protein LFML04_1390 [Leptospirillum ferriphilum ML-04]
 gi|406774183|gb|AFS53608.1| hypothetical protein LFML04_1390 [Leptospirillum ferriphilum ML-04]
          Length = 427

 Score = 39.3 bits (90), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFD 104
           P  +   P    L V DS  +A   +       + V  LAG G  + G +DG    A F 
Sbjct: 227 PRGLAISPDGKTLYVADSGNNAIRKIDL---ATNTVTTLAGAGKLMSGSADGVGVQATFH 283

Query: 105 KPKSFAVDMKGNI-YVADKSNHVIRKI 130
           +P+S A+   G + Y+AD  N++IRK+
Sbjct: 284 EPRSLAISGDGQVLYIADTRNNLIRKM 310



 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
           Y DG   +ARF+ P+  A+   G  +YVAD  N++IRKI
Sbjct: 158 YRDGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKI 196



 Score = 36.6 bits (83), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 35/86 (40%), Gaps = 4/86 (4%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP  +   P    L V DS  +    +         V  +AG       DG    A F+ 
Sbjct: 170 NPEGMAISPDGKTLYVADSRNNMIRKIDL---ATKTVSTIAGHSFPSSGDGVGKEAGFET 226

Query: 106 PKSFAVDMKG-NIYVADKSNHVIRKI 130
           P+  A+   G  +YVAD  N+ IRKI
Sbjct: 227 PRGLAISPDGKTLYVADSGNNAIRKI 252



 Score = 35.8 bits (81), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 17/41 (41%), Positives = 23/41 (56%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +G  DG    ARF+ P+       G IY+AD  N +IRKI+
Sbjct: 85  RGAIDGPGREARFEFPQGILAAPDGTIYIADTGNDMIRKIS 125


>gi|290994368|ref|XP_002679804.1| predicted protein [Naegleria gruberi]
 gi|284093422|gb|EFC47060.1| predicted protein [Naegleria gruberi]
          Length = 1359

 Score = 39.3 bits (90), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            V  +AG  G++G+ DG   ++ F+ P     +    IYVAD +NHVIRKIT
Sbjct: 520 TVSLIAGQPGIKGHIDGIALNSTFNHPSGIHAEYSA-IYVADSNNHVIRKIT 570


>gi|290978007|ref|XP_002671728.1| predicted protein [Naegleria gruberi]
 gi|284085299|gb|EFC38984.1| predicted protein [Naegleria gruberi]
          Length = 2520

 Score = 39.3 bits (90), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 16/44 (36%), Positives = 26/44 (59%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G G  GY+DG   +A ++ P   A     ++Y+AD+ NHV+R +
Sbjct: 228 GVGSAGYADGSAFNALYNAPTGVAYGPNKDLYIADRGNHVVRVV 271


>gi|290976488|ref|XP_002670972.1| predicted protein [Naegleria gruberi]
 gi|284084536|gb|EFC38228.1| predicted protein [Naegleria gruberi]
          Length = 704

 Score = 39.3 bits (90), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 33/53 (62%), Gaps = 2/53 (3%)

Query: 81  VVKRLAGD-GVQGYSDGE-PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++  +AG  G  G++D     SA+ + P++ A+   G+IY+AD  N+ IRKIT
Sbjct: 211 IITTIAGTAGSSGFTDNVLSTSAKLNGPQAVAIMSNGDIYIADTQNNRIRKIT 263


>gi|380693318|ref|ZP_09858177.1| hypothetical protein BfaeM_04967 [Bacteroides faecis MAJ27]
          Length = 472

 Score = 39.3 bits (90), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 33/64 (51%), Gaps = 9/64 (14%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS--------ARFDKPKSFAVDMKGNI-YVADKSNHV 126
           L+ E +V   AG G  G +D   G+        ARFD+P + A D   N+ YV D  NH 
Sbjct: 391 LTPEGLVTTFAGRGSAGLNDKPYGNVNGDLREEARFDQPAAIAYDSINNVFYVGDIENHS 450

Query: 127 IRKI 130
           IR+I
Sbjct: 451 IRRI 454


>gi|440683842|ref|YP_007158637.1| NHL repeat containing protein [Anabaena cylindrica PCC 7122]
 gi|428680961|gb|AFZ59727.1| NHL repeat containing protein [Anabaena cylindrica PCC 7122]
          Length = 506

 Score = 39.3 bits (90), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 15/48 (31%), Positives = 30/48 (62%), Gaps = 1/48 (2%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + + GDG+ G+ DG+  + RF +P   +  +   +Y++D +NH IR++
Sbjct: 431 QTVLGDGLPGFQDGQGKNTRFSEPSGLS-GIDSYLYISDTNNHAIRRV 477


>gi|219853190|ref|YP_002467622.1| SMP-30/Gluconolaconase/LRE domain-containing protein
           [Methanosphaerula palustris E1-9c]
 gi|219547449|gb|ACL17899.1| SMP-30/Gluconolaconase/LRE domain protein [Methanosphaerula
           palustris E1-9c]
          Length = 343

 Score = 39.3 bits (90), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           PG  +F  P+  AVD  GN+YV D  N+ I+K T
Sbjct: 105 PGDGQFKSPEGIAVDNAGNVYVVDTVNNQIQKFT 138



 Score = 35.8 bits (81), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 49/103 (47%), Gaps = 7/103 (6%)

Query: 32  YTVTTVIDGHQLEINPHSVIDRPGS-SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV 90
           Y V TV +  Q   +  + I + G+ +  I +DS  + + T +      + V++   DG 
Sbjct: 125 YVVDTVNNQIQKFTSDGTFITKWGTRASGIAVDSDGNVYVTDA----TSNQVQKFTSDGT 180

Query: 91  QGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
              + G  GS   +F  P+  AVD  GN+YV D  N  ++K T
Sbjct: 181 HLLNWGTSGSENGQFSSPEGIAVDNAGNVYVNDVGNQRVQKFT 223


>gi|149279484|ref|ZP_01885614.1| hypothetical protein PBAL39_16831 [Pedobacter sp. BAL39]
 gi|149229777|gb|EDM35166.1| hypothetical protein PBAL39_16831 [Pedobacter sp. BAL39]
          Length = 742

 Score = 39.3 bits (90), Expect = 0.78,   Method: Composition-based stats.
 Identities = 16/52 (30%), Positives = 30/52 (57%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ +  ++    +G+ DG    A+F +  + A+D  GNIY+ D  N+ IRK+
Sbjct: 688 QNTISTVSAQTTRGFKDGGLNEAQFMQRDAMAMDKDGNIYIVDMGNNAIRKL 739



 Score = 38.1 bits (87), Expect = 1.8,   Method: Composition-based stats.
 Identities = 23/63 (36%), Positives = 30/63 (47%)

Query: 68  AFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
           + Y   F L       R AG+G  G +DG   SA    PK F  D KGN+Y +D  +   
Sbjct: 622 SLYIGRFDLVNSIYDLRWAGNGNFGNADGPIKSAMLVSPKGFVGDGKGNMYFSDGFSQNT 681

Query: 128 RKI 130
           RK+
Sbjct: 682 RKV 684


>gi|290985668|ref|XP_002675547.1| predicted protein [Naegleria gruberi]
 gi|284089144|gb|EFC42803.1| predicted protein [Naegleria gruberi]
          Length = 1037

 Score = 39.3 bits (90), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 2/53 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           V+ + G+G+ G+S DGE   +A+ ++P    VD  G +   D SN+ IR I+N
Sbjct: 28  VEVIGGNGISGFSGDGELAINAKLNRPTKIVVDSAGRVVFCDNSNNRIRMISN 80



 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 45/86 (52%), Gaps = 6/86 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG-SARFDK 105
           P+ V+    S D+ + D+S      +S   S   ++  +AG G  G+ D      A+ + 
Sbjct: 445 PYGVV-TSSSGDVFIADTSNCRIRKVS---SSTGIITTVAG-GTCGFGDNVLAVDAQLNT 499

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
           P   +V+ KG +++AD +NH IRK++
Sbjct: 500 PYGISVNSKGELFIADTNNHRIRKVS 525


>gi|339008739|ref|ZP_08641312.1| cell surface protein [Brevibacillus laterosporus LMG 15441]
 gi|338774539|gb|EGP34069.1| cell surface protein [Brevibacillus laterosporus LMG 15441]
          Length = 633

 Score = 39.3 bits (90), Expect = 0.82,   Method: Composition-based stats.
 Identities = 18/44 (40%), Positives = 27/44 (61%), Gaps = 5/44 (11%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           G +G  +G+     F++P   A+D K NIYV D++NH I+K  N
Sbjct: 487 GQKGTKEGQ-----FNQPMQLAIDSKDNIYVVDRNNHRIQKFDN 525


>gi|209522300|ref|ZP_03270926.1| NHL repeat containing protein [Burkholderia sp. H160]
 gi|209497260|gb|EDZ97489.1| NHL repeat containing protein [Burkholderia sp. H160]
          Length = 383

 Score = 39.3 bits (90), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINL 145
           G+ ++ + D   G   F  P S AVD   ++YVAD+SN   R+I   D   + +RQ  +
Sbjct: 220 GNWLKSWGDRGTGPGEFHTPHSIAVDANDHVYVADRSN---RRIQVFDTEGRFLRQFTI 275


>gi|219851408|ref|YP_002465840.1| carbohydrate binding family 6 [Methanosphaerula palustris E1-9c]
 gi|219545667|gb|ACL16117.1| Carbohydrate binding family 6 [Methanosphaerula palustris E1-9c]
          Length = 627

 Score = 39.3 bits (90), Expect = 0.84,   Method: Composition-based stats.
 Identities = 31/93 (33%), Positives = 40/93 (43%), Gaps = 20/93 (21%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPE 148
           G  G  DG+     F+ P+  AVD  GNIYVAD  N  ++K T+   G  +       PE
Sbjct: 39  GTNGSGDGQ-----FNYPQGIAVDGAGNIYVADNGNKQVQKFTS--SGTFVTAWGTAGPE 91

Query: 149 DCSKSSQSG-------------SGMFWVTVFSP 168
           D   S  SG             +G F V  F+P
Sbjct: 92  DDRFSEPSGIAVDSEGNVYVTDTGKFRVQKFTP 124



 Score = 35.8 bits (81), Expect = 8.7,   Method: Composition-based stats.
 Identities = 16/46 (34%), Positives = 23/46 (50%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +G  V  +    P   RF +P   AVD +GN+YV D     ++K T
Sbjct: 78  SGTFVTAWGTAGPEDDRFSEPSGIAVDSEGNVYVTDTGKFRVQKFT 123


>gi|378548897|ref|ZP_09824113.1| hypothetical protein CCH26_02380 [Citricoccus sp. CH26A]
          Length = 708

 Score = 39.3 bits (90), Expect = 0.84,   Method: Composition-based stats.
 Identities = 17/46 (36%), Positives = 26/46 (56%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           AG G++G  DG+P  A F +    A    G I+VAD  +  +R+I+
Sbjct: 396 AGTGLEGLEDGDPTVAWFAQSSGLAESADGTIWVADSESSALRRIS 441


>gi|115376708|ref|ZP_01463935.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
 gi|310822897|ref|YP_003955255.1| hypothetical protein STAUR_5665 [Stigmatella aurantiaca DW4/3-1]
 gi|115366257|gb|EAU65265.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
 gi|309395969|gb|ADO73428.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
          Length = 856

 Score = 39.3 bits (90), Expect = 0.86,   Method: Composition-based stats.
 Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 10/71 (14%)

Query: 77  SEESVVKRLAGDGVQ----------GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           S+   ++R+A DG            G  DG   +ARF++P   A    G + VAD  N V
Sbjct: 635 SDNHCIRRIAPDGTATVFAGAVMEPGQGDGTAKAARFNQPAGLAFGPGGELLVADLGNGV 694

Query: 127 IRKITNLDHGN 137
           IR++     GN
Sbjct: 695 IRRVDLSAPGN 705



 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 21/49 (42%), Positives = 25/49 (51%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V  LAG G  G  DGE  SA    P   AV   G +YVAD  N ++R +
Sbjct: 806 VTTLAGSGRFGSRDGEGQSADLVLPAGLAVGEDGTLYVADAGNALLRAV 854


>gi|290971380|ref|XP_002668485.1| predicted protein [Naegleria gruberi]
 gi|284081915|gb|EFC35741.1| predicted protein [Naegleria gruberi]
          Length = 277

 Score = 39.3 bits (90), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 8/89 (8%)

Query: 44  EINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARF 103
           +IN H +  R  SSD ++   + +   TL+ P+S  +  +    +G     DG   +   
Sbjct: 138 DINNHRI--RKVSSDGVISTIAGTGRTTLN-PISPNTYRQEGGYNG-----DGLATATDL 189

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           + PK   VD KG++Y AD  N +IRK++N
Sbjct: 190 NYPKYAQVDSKGDVYFADTYNGLIRKVSN 218


>gi|158318814|ref|YP_001511322.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158114219|gb|ABW16416.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 732

 Score = 39.3 bits (90), Expect = 0.88,   Method: Composition-based stats.
 Identities = 19/48 (39%), Positives = 30/48 (62%), Gaps = 2/48 (4%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AG G  GYS DG P   A   +P++ AVD +G + +AD+ +  IR++
Sbjct: 581 VAGSGTPGYSGDGGPAVHASLTRPQAVAVDAQGRLLIADQEHRRIRRV 628


>gi|159113256|ref|XP_001706855.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia ATCC
           50803]
 gi|157434955|gb|EDO79181.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia ATCC
           50803]
          Length = 969

 Score = 39.3 bits (90), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 42/86 (48%), Gaps = 6/86 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDK 105
           PHS++  P +  + + D+      TLS    +   VK LAG  GV GY DG   +  F++
Sbjct: 143 PHSIVSDPVNGIIYLTDAKNHCIRTLSLMTRK---VKTLAGTPGVFGYKDGL--NPLFNE 197

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
           P    +   GN+ V D  N  +R ++
Sbjct: 198 PLGLVLTEDGNVIVCDSKNGALRYVS 223


>gi|47219684|emb|CAG12606.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 805

 Score = 39.3 bits (90), Expect = 0.89,   Method: Composition-based stats.
 Identities = 40/137 (29%), Positives = 62/137 (45%), Gaps = 27/137 (19%)

Query: 16  SLLFSLVSSG-----LLLEDG----YTVTTVIDGHQLEIN------PHSVIDRPGSSDLI 60
           +LLFSLV  G     +L  D     Y    ++  H + +       P S++  PG    I
Sbjct: 168 NLLFSLVGEGHWDKQMLFTDAALRYYRERGLLRTHAVPLKLLRDSLPPSILSFPGK---I 224

Query: 61  VLD--SSRSAF----YTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +D  S R A     +     +S    + ++ G    G  DG+   A F+ P+  A+  K
Sbjct: 225 AIDDRSKRVAIADTGHHRVLLVSTTGQLLQVIGGPESGRKDGDLSEASFNSPQGVAI--K 282

Query: 115 GN-IYVADKSNHVIRKI 130
           G+ +YVAD  NH+IRK+
Sbjct: 283 GDTVYVADTENHLIRKV 299


>gi|260904571|ref|ZP_05912893.1| NHL repeat-containing protein [Brevibacterium linens BL2]
          Length = 647

 Score = 39.3 bits (90), Expect = 0.90,   Method: Composition-based stats.
 Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 12/95 (12%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V + P S +L+V DS   +   + +  S +++++R+ G G +G +DG+  SA F +P
Sbjct: 186 PGKVTELP-SGNLLVADSGHHSL--VEYDASGQNIIRRI-GTGERGANDGDFTSASFSEP 241

Query: 107 KSFAV---DMKG----NIYVADKSNHVIRKITNLD 134
               V   D+      ++ VAD  NH +R I NLD
Sbjct: 242 GGITVLPDDVAAKAGYHLVVADTVNHTLRGI-NLD 275



 Score = 37.7 bits (86), Expect = 2.1,   Method: Composition-based stats.
 Identities = 21/95 (22%), Positives = 47/95 (49%), Gaps = 6/95 (6%)

Query: 39  DGHQLEI---NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD 95
           DG  L++   +P  V+  P +++++V  +     ++      E   ++ L+G   +G +D
Sbjct: 308 DGPALDVKLSSPWDVLYIPATAEVVVAMAGNHTIWSFD---PETGSIRLLSGTMNEGLAD 364

Query: 96  GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           GE  +A F +     +   G++++AD     IR++
Sbjct: 365 GEAEAAWFAQTSGLDLSSDGDVFIADSETSAIRRL 399


>gi|392403177|ref|YP_006439789.1| NHL repeat containing protein [Turneriella parva DSM 21527]
 gi|390611131|gb|AFM12283.1| NHL repeat containing protein [Turneriella parva DSM 21527]
          Length = 392

 Score = 38.9 bits (89), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           AGD   G +DG    ARF+ P+    D   N++VAD  N  +RKI
Sbjct: 174 AGDTTSGDTDGTANDARFNNPQGIVTDGT-NLFVADSLNRKVRKI 217



 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 41/89 (46%), Gaps = 10/89 (11%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG--DGVQGY--SDGEPGSA 101
           NP  ++     ++L V DS       +        +V  LAG   GV G   +DG   +A
Sbjct: 193 NPQGIVTD--GTNLFVADSLNRKVRKIVI---ASGIVSTLAGPAQGVTGSGDTDGSANTA 247

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           RF  P +   D   N+Y+ D SNH IRKI
Sbjct: 248 RFGLPGAMTTDGV-NLYLCDSSNHKIRKI 275



 Score = 35.8 bits (81), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 30/54 (55%), Gaps = 5/54 (9%)

Query: 81  VVKRLAGDG----VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           VV  LAG G    + G  DG   +ARF   ++   D   N+YVAD SN+ IRKI
Sbjct: 107 VVSTLAGPGPGTSLSGDVDGIGEAARFSSMRAITTDGT-NLYVADNSNNKIRKI 159


>gi|290979059|ref|XP_002672252.1| predicted protein [Naegleria gruberi]
 gi|284085827|gb|EFC39508.1| predicted protein [Naegleria gruberi]
          Length = 1239

 Score = 38.9 bits (89), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 48/100 (48%), Gaps = 10/100 (10%)

Query: 39  DGHQLEINPHS-----VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY 93
           DG+ L  N  +     V++R G  +L  ++        LSF  +    +  +AG+G  GY
Sbjct: 42  DGNALVTNLKNPRGLFVVERNGG-ELYFVEEENHIIRKLSFVTNN---ITTIAGNGTAGY 97

Query: 94  SDGEPGSARFDKPKSFAV-DMKGNIYVADKSNHVIRKITN 132
           SDG    +  + P S  V +  G IY+AD  N  +R++ N
Sbjct: 98  SDGVATLSMLNYPHSVHVSNTTGEIYIADTFNCKLRRVFN 137


>gi|290973961|ref|XP_002669715.1| predicted protein [Naegleria gruberi]
 gi|284083266|gb|EFC36971.1| predicted protein [Naegleria gruberi]
          Length = 710

 Score = 38.9 bits (89), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 31/49 (63%), Gaps = 3/49 (6%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +AG+G +G+S  +G   +A+ + P    V     +Y+AD SNHVIRKI+
Sbjct: 144 IAGNGTKGFSGDNGPATNAQLNGPAGVFVS-NNEVYIADYSNHVIRKIS 191



 Score = 36.2 bits (82), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S+   +  +AG+G  G+S  +G   +A+   P    V     +Y++D  NHVIRKI
Sbjct: 190 ISQNGTIVTIAGNGKPGFSGDNGLATNAQLYNPSGTFVSSNNEVYISDCFNHVIRKI 246


>gi|145340877|ref|XP_001415544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575767|gb|ABO93836.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1675

 Score = 38.9 bits (89), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 38/81 (46%), Gaps = 10/81 (12%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGD-----GVQGYS----DGEPGSARFDKPKSFA 110
           +V D S  A Y L    +  S +   +G      G QG S    +  P SARF+ P+  A
Sbjct: 186 MVFDESDGAIYFLGSSQNHVSKIAYGSGTVSTYAGSQGTSGLVDNVTPSSARFNAPEGLA 245

Query: 111 VDMKGNI-YVADKSNHVIRKI 130
           +D    I YVAD  NH +R I
Sbjct: 246 LDGANRILYVADTGNHAVRAI 266


>gi|427430403|ref|ZP_18920257.1| hypothetical protein C882_1420 [Caenispirillum salinarum AK4]
 gi|425878863|gb|EKV27574.1| hypothetical protein C882_1420 [Caenispirillum salinarum AK4]
          Length = 498

 Score = 38.9 bits (89), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 41/90 (45%), Gaps = 5/90 (5%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTL---SFPLSEESVVKRLAGDGVQ--GYSDGEPGS 100
            P  +   P +S L   DS  SA   +    + L + + V+ L G G+   G+ DG   +
Sbjct: 344 QPSGLALAPDASALYFADSETSAVRKVILDGWRLGQTARVETLVGRGLFDFGHDDGPMTA 403

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           AR   P   AV   G + VAD  NH IR I
Sbjct: 404 ARMQHPLGVAVMPDGRVAVADSYNHAIRLI 433


>gi|325917587|ref|ZP_08179787.1| NHL repeat protein [Xanthomonas vesicatoria ATCC 35937]
 gi|325536221|gb|EGD08017.1| NHL repeat protein [Xanthomonas vesicatoria ATCC 35937]
          Length = 472

 Score = 38.9 bits (89), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL +G 
Sbjct: 198 VLRQFGHGNADLIDGGTGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRNGQ 251


>gi|254447301|ref|ZP_05060768.1| NHL repeat containing protein [gamma proteobacterium HTCC5015]
 gi|198263440|gb|EDY87718.1| NHL repeat containing protein [gamma proteobacterium HTCC5015]
          Length = 2515

 Score = 38.9 bits (89), Expect = 1.00,   Method: Composition-based stats.
 Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 76   LSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            ++ E  +++ AG   +GYS   G    A  + P+  AV   G +++AD  NH IRK+
Sbjct: 1213 ITPEGEIEKYAGKETRGYSGDGGAATEAELNLPRGMAVSSGGTLFIADWGNHRIRKV 1269


>gi|116007528|ref|NP_001036460.1| CG12547, isoform A [Drosophila melanogaster]
 gi|442622271|ref|NP_001260704.1| CG12547, isoform B [Drosophila melanogaster]
 gi|17945781|gb|AAL48938.1| RE33981p [Drosophila melanogaster]
 gi|30923523|gb|EAA46001.1| CG12547, isoform A [Drosophila melanogaster]
 gi|220948326|gb|ACL86706.1| CG12547-PA [synthetic construct]
 gi|440214080|gb|AGB93239.1| CG12547, isoform B [Drosophila melanogaster]
          Length = 717

 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 30/56 (53%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           L+   VV+   G    G+ DG+   ARF+ P+  A   +  + VAD  NH IR+I+
Sbjct: 252 LTAGGVVQHRIGGHQPGFVDGDLTVARFNNPQGIAFLNENVLIVADTKNHAIRQIS 307


>gi|299145011|ref|ZP_07038079.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
 gi|298515502|gb|EFI39383.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
          Length = 459

 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 26/40 (65%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G+++G+   A+ ++P    +D +GNIYVA +  H I KIT
Sbjct: 364 GFTEGQGNEAKLEEPCQMDLDEEGNIYVAVRKKHRIAKIT 403


>gi|423300918|ref|ZP_17278942.1| hypothetical protein HMPREF1057_02083 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472253|gb|EKJ90781.1| hypothetical protein HMPREF1057_02083 [Bacteroides finegoldii
           CL09T03C10]
          Length = 459

 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 25/43 (58%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  G+ +G+   AR  +P    +D +GNIYVA +  H I KIT
Sbjct: 361 GKTGFVEGQGNDARLQEPCQMDLDEEGNIYVAVRKKHRIAKIT 403


>gi|119509213|ref|ZP_01628363.1| NHL repeat protein [Nodularia spumigena CCY9414]
 gi|119466055|gb|EAW46942.1| NHL repeat protein [Nodularia spumigena CCY9414]
          Length = 502

 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 51/124 (41%), Gaps = 10/124 (8%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+  P  + L + DS        S+    +  +  L G G  G +DG    A+F  P
Sbjct: 177 PGKVLATP--AGLFIADSGHHRVVMSSY----DGEILHLIGTGKSGLTDGAFNEAQFFAP 230

Query: 107 KSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSGMFWVTV 165
           +  A D +   +YVAD  NH +R++   D   Q+++ I    E        G       +
Sbjct: 231 QGMAFDPENQMLYVADTENHTLRRV---DLKRQIVKTIAGTGEQSRNIYPHGGAGIETAL 287

Query: 166 FSPY 169
            SP+
Sbjct: 288 NSPW 291



 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           + + GDG  G  DG+  + RF +P   +V M   +Y++D +NH IR+
Sbjct: 433 QTILGDGSAGLQDGQGQNTRFFEPSGLSV-MGDYLYISDTNNHAIRR 478


>gi|317419409|emb|CBN81446.1| NHL repeat-containing protein 2 [Dicentrarchus labrax]
          Length = 719

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 41/141 (29%), Positives = 61/141 (43%), Gaps = 27/141 (19%)

Query: 16  SLLFSLVSSG-----LLLEDG----YTVTTVIDGHQLEIN------PHSVIDRPGSSDLI 60
           +LLFS+V  G     +L  D     Y    ++  H ++I       P S +  PG    +
Sbjct: 168 NLLFSIVGEGHRDRLMLFTDCALRYYGEQGLLKTHAVKIKLYRDSLPTSFLSFPGK---V 224

Query: 61  VLDSSRSAFYTLS------FPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
            +DSS                LS    +  + G    G  DG+   A F+ P+  A+  K
Sbjct: 225 AIDSSEKRLAIADTGHHRILLLSSTGQLLHVIGGPESGRRDGDLSEASFNSPQGVAI--K 282

Query: 115 GN-IYVADKSNHVIRKITNLD 134
           G+ +YVAD  NH+IRKI  L+
Sbjct: 283 GDTVYVADTENHLIRKIDLLE 303


>gi|428166914|gb|EKX35881.1| hypothetical protein GUITHDRAFT_79273, partial [Guillardia theta
           CCMP2712]
          Length = 318

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 44/93 (47%), Gaps = 5/93 (5%)

Query: 40  GHQLEIN-PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP 98
           G +  IN P+ V   P  + +IV D+  +    +     E      LAG  + G ++G  
Sbjct: 107 GTKARINTPYGVKVSPCGNYVIVSDTGNNMIRKVDI---ESGYTNTLAGQSLAGTANGVG 163

Query: 99  GSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
             A+F+ P    VD    + YV+D+ N+ IRKI
Sbjct: 164 TLAQFNMPVDVTVDWNETVAYVSDQGNNCIRKI 196


>gi|386875467|ref|ZP_10117636.1| NHL repeat protein, partial [Candidatus Nitrosopumilus salaria
           BD31]
 gi|386806721|gb|EIJ66171.1| NHL repeat protein, partial [Candidatus Nitrosopumilus salaria
           BD31]
          Length = 65

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G+  FD P   AVD  GNIYVAD +N+ ++K 
Sbjct: 23  GNGNFDAPSGVAVDSAGNIYVADTNNNRVQKF 54


>gi|78047814|ref|YP_363989.1| hypothetical protein XCV2258 [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
 gi|78036244|emb|CAJ23935.1| conserved hypothetical protein [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
          Length = 499

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G   + DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 225 VLRQFGHGNADFIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 278


>gi|449670570|ref|XP_002158620.2| PREDICTED: NHL repeat-containing protein 2-like [Hydra
           magnipapillata]
          Length = 629

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 29/44 (65%), Gaps = 1/44 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           LAG+GV G  +G+  SA F++P    + +   IY+AD +NH+IR
Sbjct: 496 LAGNGVVGIKNGDFLSAEFNEPGGI-ICLDDTIYIADTNNHLIR 538



 Score = 38.5 bits (88), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 2/51 (3%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           VVKR+ G G  G+ DGE   ++F  P+  +  ++  I+VAD  NH IR+I 
Sbjct: 244 VVKRI-GTGFPGFKDGEAHVSQFSSPQGLSW-IENVIFVADTENHSIREIN 292


>gi|219852087|ref|YP_002466519.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219546346|gb|ACL16796.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 668

 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 26/82 (31%), Positives = 38/82 (46%), Gaps = 5/82 (6%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
           +D+PGS    V   SR   Y   +  +   V     G  +Q  S G  G  +FD P   A
Sbjct: 139 LDQPGS----VAVDSRGQIYVADWGNNRVQVFNSTGGYLMQWGSSGS-GDGQFDGPNGIA 193

Query: 111 VDMKGNIYVADKSNHVIRKITN 132
           +D  GN+YV D  N+ I++  +
Sbjct: 194 IDSTGNVYVTDAYNNRIQEFNS 215



 Score = 36.2 bits (82), Expect = 6.8,   Method: Composition-based stats.
 Identities = 15/47 (31%), Positives = 26/47 (55%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           AG  +  +    PG+ +F  P   A+D  GN+Y+ D  N+ ++K T+
Sbjct: 450 AGGYLSQWGSHGPGAGQFSGPNGIALDSTGNVYITDWGNNRVQKFTS 496


>gi|383115566|ref|ZP_09936322.1| hypothetical protein BSGG_2564 [Bacteroides sp. D2]
 gi|313695029|gb|EFS31864.1| hypothetical protein BSGG_2564 [Bacteroides sp. D2]
          Length = 459

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 25/40 (62%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G+ +G+   AR ++P    +D +GNIYVA +  H I KIT
Sbjct: 364 GFVEGQGNDARLEEPCQMDLDEEGNIYVAVRKKHRIAKIT 403


>gi|291227073|ref|XP_002733512.1| PREDICTED: tripartite motif protein-like, partial [Saccoglossus
           kowalevskii]
          Length = 594

 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 3/67 (4%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSG 159
           +A F++P    VD KGN+ VAD  N   ++I  LD   + I QI L  E  +  + + +G
Sbjct: 429 NAEFNQPVGVTVDKKGNVLVADFGN---KRIQILDSNGKYIDQILLDSERPTGVAMAPNG 485

Query: 160 MFWVTVF 166
              V VF
Sbjct: 486 NIVVVVF 492


>gi|336399828|ref|ZP_08580628.1| NHL repeat containing protein [Prevotella multisaccharivorax DSM
           17128]
 gi|336069564|gb|EGN58198.1| NHL repeat containing protein [Prevotella multisaccharivorax DSM
           17128]
          Length = 494

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 25/50 (50%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
           +G+ DG   +A+F+ P+       G +Y+AD  NH IR I      N  +
Sbjct: 392 KGWEDGSLKNAKFNYPRQICFTKDGRLYIADSQNHCIRSIDTTQGANARV 441


>gi|383123946|ref|ZP_09944616.1| hypothetical protein BSIG_4035 [Bacteroides sp. 1_1_6]
 gi|251838822|gb|EES66907.1| hypothetical protein BSIG_4035 [Bacteroides sp. 1_1_6]
          Length = 451

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 24/108 (22%), Positives = 51/108 (47%), Gaps = 8/108 (7%)

Query: 30  DGYTVTTVIDGHQLEI-----NPHSVI-DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVK 83
           D  + T VI+G++ +      N  ++I D  G  ++ + D+       +   ++   V  
Sbjct: 344 DNTSTTQVINGYRTDCLFNWNNQMTIIVDESGQKNMYIADAGTQTIRKID--MNTGMVTT 401

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            +    V+G   G P  A F+ PK   +  +G++Y++D  +  +RK++
Sbjct: 402 VVGRQNVKGNQSGTPLEATFNWPKGVGLTAEGDLYISDCGSGCVRKLS 449


>gi|348175195|ref|ZP_08882089.1| NHL repeat-containing protein [Saccharopolyspora spinosa NRRL
           18395]
          Length = 630

 Score = 38.9 bits (89), Expect = 1.2,   Method: Composition-based stats.
 Identities = 30/95 (31%), Positives = 50/95 (52%), Gaps = 13/95 (13%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  VI  PG  +L+V DS+  +   + F    E+VV+R+ G G +G +DG   +A F +P
Sbjct: 194 PGKVIALPGG-NLLVSDSANHSL--VEFAADGETVVRRI-GTGQRGRADGV--AAEFAEP 247

Query: 107 KSFAV-------DMKGNIYVADKSNHVIRKITNLD 134
              A+        +  ++ VAD  NH++R +  +D
Sbjct: 248 AGLALLPQRVAEQVGYDVVVADTVNHLLRGVRLVD 282


>gi|219851493|ref|YP_002465925.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219545752|gb|ACL16202.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 390

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 20/33 (60%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  +FD P   AVD  GN+YVAD  N  I+K T
Sbjct: 39  GIGQFDHPGGVAVDSAGNVYVADTENDRIQKFT 71



 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G   FD+P   AVD  GN+YVAD  N+ ++K T
Sbjct: 180 GVGEFDEPFGIAVDGTGNVYVADTYNNRVQKFT 212


>gi|383454552|ref|YP_005368541.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
 gi|380728713|gb|AFE04715.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
          Length = 861

 Score = 38.5 bits (88), Expect = 1.2,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 37/70 (52%), Gaps = 9/70 (12%)

Query: 77  SEESVVKRLAGDG--------VQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVI 127
           +E +V++R++ +G          GY DG   SA F++P   +    G  + VAD +N VI
Sbjct: 642 TENAVIRRISPEGKAEVFAGTTWGYQDGPALSAAFNQPAGLSFTPDGTALLVADLNNSVI 701

Query: 128 RKITNLDHGN 137
           R+I  +  GN
Sbjct: 702 RRIDMVTPGN 711



 Score = 36.2 bits (82), Expect = 6.9,   Method: Composition-based stats.
 Identities = 24/73 (32%), Positives = 32/73 (43%), Gaps = 1/73 (1%)

Query: 59  LIVLDSSRSAFYTLSFPLS-EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           L + D        L+F  S E   +  LAG G  G  DG    A+   P   A+   G +
Sbjct: 787 LAISDPGNYRVRRLTFNASGEPEKLTTLAGSGRYGAEDGTGREAQLVLPAGLALGPDGTL 846

Query: 118 YVADKSNHVIRKI 130
           YVAD  N ++R I
Sbjct: 847 YVADAGNSLVRAI 859


>gi|290995104|ref|XP_002680171.1| predicted protein [Naegleria gruberi]
 gi|284093791|gb|EFC47427.1| predicted protein [Naegleria gruberi]
          Length = 928

 Score = 38.5 bits (88), Expect = 1.2,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+S  +G   +A+ + P S  V     +Y+ D +NH IRKI  L +
Sbjct: 95  ENGKIVTIAGNGTGGFSGDNGPATNAQLNNPYSVFVSSNNEVYIVDYNNHRIRKI--LKN 152

Query: 136 GN 137
           GN
Sbjct: 153 GN 154



 Score = 38.5 bits (88), Expect = 1.3,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+S  +G   +A+ + P    V     +Y+AD  NH IRKI  L++
Sbjct: 478 ENGNIVTIAGNGTGGFSGDNGPATNAQLNNPMGVFVSSNNEVYIADYYNHRIRKI--LEN 535

Query: 136 GN 137
           GN
Sbjct: 536 GN 537



 Score = 38.1 bits (87), Expect = 1.6,   Method: Composition-based stats.
 Identities = 21/55 (38%), Positives = 32/55 (58%), Gaps = 4/55 (7%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           +AG+G  G+S  +G   +A+ + P    V     +Y+AD  NH IRKI  L++GN
Sbjct: 158 IAGNGTGGFSGDNGPATNAQLNNPMGVFVSSNNEVYIADYYNHRIRKI--LENGN 210



 Score = 37.4 bits (85), Expect = 3.0,   Method: Composition-based stats.
 Identities = 23/71 (32%), Positives = 36/71 (50%), Gaps = 4/71 (5%)

Query: 69  FYTLSFPLSEESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           F  L   L++   +  +AG+G  G+   +G   +A+   P S  V     +Y+ D +NH 
Sbjct: 340 FIDLKRSLTKNGNIVTIAGNGTAGFRGDNGPATNAQLYNPYSVFVSSNNEVYIVDYNNHR 399

Query: 127 IRKITNLDHGN 137
           IRKI  L +GN
Sbjct: 400 IRKI--LKNGN 408



 Score = 36.6 bits (83), Expect = 5.6,   Method: Composition-based stats.
 Identities = 19/48 (39%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query: 85  LAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AG+G  G+   +G   +A+   P S  V     +Y+AD SNH IRKI
Sbjct: 46  IAGNGTAGFRGDNGPATNAQLYNPYSVFVSSNNEVYIADFSNHRIRKI 93


>gi|290995072|ref|XP_002680155.1| predicted protein [Naegleria gruberi]
 gi|284093775|gb|EFC47411.1| predicted protein [Naegleria gruberi]
          Length = 1951

 Score = 38.5 bits (88), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 2/67 (2%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++   ++  +AG G  GYS DG    SA+   P   A+     IY+ D +N+ IRK++  
Sbjct: 184 ITTSGIISTIAGTGTSGYSGDGSSAISAQLYNPYGVAISSSDEIYITDTNNNRIRKLSPW 243

Query: 134 DHGNQLI 140
             GN ++
Sbjct: 244 CTGNAIL 250


>gi|195164496|ref|XP_002023083.1| GL21149 [Drosophila persimilis]
 gi|194105168|gb|EDW27211.1| GL21149 [Drosophila persimilis]
          Length = 595

 Score = 38.5 bits (88), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 31/56 (55%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S + VV+   G    G+ DG+   +RF+ P+  A   +  + VAD  NH +RKI+
Sbjct: 252 VSADGVVEHKIGGLQAGFVDGKLALSRFNSPQGVAFLNESTLIVADTENHALRKIS 307


>gi|147818279|emb|CAN64724.1| hypothetical protein VITISV_026725 [Vitis vinifera]
          Length = 423

 Score = 38.5 bits (88), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 28/48 (58%), Gaps = 1/48 (2%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
           ++   G +G  DG    A F++P+  A + K N+ YVAD  NH +R+I
Sbjct: 54  QIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREI 101


>gi|197120448|ref|YP_002132399.1| fibronectin type III domain-containing protein [Anaeromyxobacter
           sp. K]
 gi|196170297|gb|ACG71270.1| Fibronectin type III domain protein [Anaeromyxobacter sp. K]
          Length = 460

 Score = 38.5 bits (88), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 35/77 (45%), Gaps = 6/77 (7%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGN 116
           L V D+  +A   L        VV  LAGD     GY+DG   +ARF  P   A+   G 
Sbjct: 281 LYVADTMNNAIRALDL---ATGVVSTLAGDPASYAGYADGRGTAARFSAPYGLAM-HGGE 336

Query: 117 IYVADKSNHVIRKITNL 133
           + VAD  N  IR +  L
Sbjct: 337 LLVADSGNSAIRAVDPL 353


>gi|290995833|ref|XP_002680487.1| serine/threonine kinase [Naegleria gruberi]
 gi|284094108|gb|EFC47743.1| serine/threonine kinase [Naegleria gruberi]
          Length = 1078

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 8/94 (8%)

Query: 39  DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEP 98
           D +  +IN H +  R  SSD ++   + +   TL+ P+S  +  +    +G     DG  
Sbjct: 133 DVYICDINNHRI--RKVSSDGVISTIAGTGRTTLN-PISPNTYRQEGGYNG-----DGLA 184

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
            +   + PK   VD KG++Y AD  N +IRK++N
Sbjct: 185 TATDLNYPKYAQVDSKGDVYFADTYNGLIRKVSN 218


>gi|295700390|ref|YP_003608283.1| NHL repeat containing protein [Burkholderia sp. CCGE1002]
 gi|295439603|gb|ADG18772.1| NHL repeat containing protein [Burkholderia sp. CCGE1002]
          Length = 383

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINL 145
           G+ ++ + D   G  +F  P S AVD   ++YVAD+SN   R+I   D     +RQ  +
Sbjct: 220 GNWLKSWGDRGTGPGQFHTPHSIAVDAHDHVYVADRSN---RRIQVFDTEGTFLRQFTI 275


>gi|391330742|ref|XP_003739813.1| PREDICTED: NHL repeat-containing protein 2-like [Metaseiulus
           occidentalis]
          Length = 685

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+ + G G  G+ DG  G  RF  P+       G I+VAD  NH IR++
Sbjct: 258 VQLIVGTGEAGFLDGAYGICRFSSPQGVCF-YDGGIFVADAGNHAIRRV 305


>gi|383124876|ref|ZP_09945537.1| hypothetical protein BSIG_1376 [Bacteroides sp. 1_1_6]
 gi|382983584|gb|EES69051.2| hypothetical protein BSIG_1376 [Bacteroides sp. 1_1_6]
          Length = 479

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 9/65 (13%)

Query: 76  LSEESVVKRLAGDG-------VQGYSDGE-PGSARFDKPKSFAVDMKGN-IYVADKSNHV 126
           ++ + ++   AG G       V GY DGE    ARFD P   A D +    Y+AD+ NH 
Sbjct: 413 ITPDGIISTYAGRGSVSSDGRVNGYIDGELRTEARFDSPCGIAYDEETQTFYIADRENHR 472

Query: 127 IRKIT 131
           IR I+
Sbjct: 473 IRTIS 477


>gi|307106270|gb|EFN54516.1| hypothetical protein CHLNCDRAFT_31604 [Chlorella variabilis]
          Length = 1072

 Score = 38.5 bits (88), Expect = 1.3,   Method: Composition-based stats.
 Identities = 24/83 (28%), Positives = 38/83 (45%), Gaps = 4/83 (4%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           +P +V+  P S  + V DS       L       + ++ LAG G  GY DG   +A+  +
Sbjct: 813 HPLAVLAAP-SGKVYVADSYNHRLKELD---PATNTIRTLAGSGAAGYRDGVGTTAQLSE 868

Query: 106 PKSFAVDMKGNIYVADKSNHVIR 128
           P   A    G + + D +N +IR
Sbjct: 869 PAGLAAGPDGTVIICDTNNSLIR 891


>gi|397779898|ref|YP_006544371.1| NHL repeat-containing protein 2 [Methanoculleus bourgensis MS2]
 gi|396938400|emb|CCJ35655.1| NHL repeat-containing protein 2 [Methanoculleus bourgensis MS2]
          Length = 483

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK-GNIYVADKSNHVIRKIT 131
           S +  +  + G G  G +DG    A F  P+  A D + G +YVAD  NH IR+++
Sbjct: 196 SRDGAILGVIGTGAPGNADGSSSEAAFYLPEGLAFDEEAGVLYVADAGNHTIRQVS 251



 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           G G +GY +G  G AR  +P    V++ G  Y+AD  NH +R    + H
Sbjct: 427 GSGDRGYQNGVSGDARLSEPGGL-VNLGGLWYIADTGNHAVRVYDPVSH 474


>gi|253748633|gb|EET02664.1| Haloacid dehalogenase-like hydrolase family [Giardia intestinalis
           ATCC 50581]
          Length = 963

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 6/86 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDK 105
           PHSV+  P +  + + D+      TLS    +   V+ LAG  GV GY DG   +  F++
Sbjct: 143 PHSVVSDPVNGIIYLTDTKNHCIRTLSLATRK---VRTLAGTPGVFGYRDGL--NPLFNE 197

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKIT 131
           P    +    NI V D  N  +R I+
Sbjct: 198 PLGLVLTEDSNIIVCDSKNGALRHIS 223


>gi|254417234|ref|ZP_05030979.1| Redoxin family [Coleofasciculus chthonoplastes PCC 7420]
 gi|196176040|gb|EDX71059.1| Redoxin family [Coleofasciculus chthonoplastes PCC 7420]
          Length = 496

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 34/63 (53%), Gaps = 4/63 (6%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQ 142
           + L G+G  G+ DG+   A F +P   +  +   +Y+AD +NHVIR +   D G   +  
Sbjct: 435 QTLLGNGKAGHRDGQSHEAEFAEPSGVSA-IAARLYIADTNNHVIRCV---DLGTLEVTT 490

Query: 143 INL 145
           +NL
Sbjct: 491 LNL 493


>gi|383111215|ref|ZP_09932032.1| hypothetical protein BSGG_2315 [Bacteroides sp. D2]
 gi|313694780|gb|EFS31615.1| hypothetical protein BSGG_2315 [Bacteroides sp. D2]
          Length = 593

 Score = 38.5 bits (88), Expect = 1.3,   Method: Composition-based stats.
 Identities = 35/109 (32%), Positives = 54/109 (49%), Gaps = 13/109 (11%)

Query: 33  TVTTVIDGHQLEINPHSVIDRPG-----SSDLIVLDSSRSAFYTLSFPLSE----ESVVK 83
           TV T  DG Q +++  ++    G        L + D +     TL+ P +E    +  +K
Sbjct: 485 TVATYSDGEQNDLSKATMGATFGIAFGPDEILYITDYTMHTIRTLT-PDAEGDYTKGTLK 543

Query: 84  RLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            +AG  G +G  DG   +A F+ P S  V  K  +Y+AD+ NH+IR IT
Sbjct: 544 TIAGIPGTKGKIDGSALTATFNCPASVLVSDK--VYIADEQNHLIRSIT 590



 Score = 38.5 bits (88), Expect = 1.3,   Method: Composition-based stats.
 Identities = 20/60 (33%), Positives = 36/60 (60%), Gaps = 1/60 (1%)

Query: 72  LSFPLSEES-VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LSF   + S  V+ +AG+     +DG+  +A F  P+  A+   G+I++A++ N+ IRK+
Sbjct: 281 LSFTYEDLSYTVETVAGNSATTSTDGKGTAASFKFPQGLALAPNGDIWIAERGNNTIRKM 340


>gi|73667964|ref|YP_303979.1| hypothetical protein Mbar_A0416 [Methanosarcina barkeri str.
           Fusaro]
 gi|72395126|gb|AAZ69399.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
          Length = 346

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           G+ +F+ P S AVD  GN+YVADK N  I+K  ++
Sbjct: 234 GNGQFEFPLSIAVDSSGNVYVADKYNQRIQKFNSI 268



 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G+ +F+ P+  AVD  GN+YVAD  N+ I+K  
Sbjct: 187 GNGQFNDPEGVAVDSSGNVYVADSGNNRIQKFN 219



 Score = 37.0 bits (84), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 19/33 (57%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           GS +F  P   AVD  GN+YV D  NH I+K  
Sbjct: 46  GSGQFACPTGVAVDSSGNVYVTDTGNHRIQKFN 78


>gi|423214448|ref|ZP_17200976.1| hypothetical protein HMPREF1074_02508 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692863|gb|EIY86099.1| hypothetical protein HMPREF1074_02508 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 464

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 26/40 (65%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G+++G+   A+ ++P    +D +GNIYVA +  H I KIT
Sbjct: 369 GFTEGQGNEAKLEEPCQMDLDEEGNIYVAVRRKHRIAKIT 408


>gi|428175994|gb|EKX44881.1| hypothetical protein GUITHDRAFT_60229, partial [Guillardia theta
           CCMP2712]
          Length = 181

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           LAG    GY DG+    RF+ P   A    G+ + VAD +NH IR+I
Sbjct: 1   LAGPSSAGYQDGQAKIVRFNSPLDIAFSKDGSWLAVADTNNHAIRRI 47



 Score = 36.2 bits (82), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 26/40 (65%), Gaps = 1/40 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G  DG   +A F+ P S A+D +   I+VAD SN++IR+I
Sbjct: 68  GSQDGPASAASFNSPTSIALDPQDKFIFVADTSNNMIRQI 107


>gi|29348707|ref|NP_812210.1| hypothetical protein BT_3298 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|298387071|ref|ZP_06996625.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
 gi|29340612|gb|AAO78404.1| conserved hypothetical protein, with a conserved domain
           [Bacteroides thetaiotaomicron VPI-5482]
 gi|298260221|gb|EFI03091.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
          Length = 470

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 9/65 (13%)

Query: 76  LSEESVVKRLAGDG-------VQGYSDGE-PGSARFDKPKSFAVDMKGN-IYVADKSNHV 126
           ++ + ++   AG G       V GY DGE    ARFD P   A D +    Y+AD+ NH 
Sbjct: 404 ITPDGIISTYAGRGSVSSDGRVNGYIDGELRTEARFDSPCGIAYDEETQTFYIADRENHR 463

Query: 127 IRKIT 131
           IR I+
Sbjct: 464 IRTIS 468


>gi|218190788|gb|EEC73215.1| hypothetical protein OsI_07299 [Oryza sativa Indica Group]
          Length = 1103

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 41/85 (48%), Gaps = 2/85 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQINL 145
           G   +G  DG   +A F++P+  A + K NI YVAD  NH +R+I  +    + +     
Sbjct: 622 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINFVSETVKTLAGNGT 681

Query: 146 KPEDCSKSSQ-SGSGMFWVTVFSPY 169
           K  D     Q +    F++ + SP+
Sbjct: 682 KGSDYRGGGQGTNQACFFMVLNSPW 706



 Score = 36.2 bits (82), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 31/64 (48%), Gaps = 1/64 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           V  +AG G  GY DG   SA+  +P        G + VAD +N  IR I  L+     +R
Sbjct: 857 VTTIAGTGRAGYKDGPALSAQLSEPAGLVEVGDGRLLVADTNNSTIRYIV-LNEKGAEVR 915

Query: 142 QINL 145
            ++L
Sbjct: 916 TLDL 919


>gi|108707787|gb|ABF95582.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
           protein, expressed [Oryza sativa Japonica Group]
          Length = 772

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           G   +G  DG   +A F++P+  A + K NI YVAD  NH +R+I 
Sbjct: 610 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREIN 655


>gi|290971766|ref|XP_002668650.1| predicted protein [Naegleria gruberi]
 gi|284082136|gb|EFC35906.1| predicted protein [Naegleria gruberi]
          Length = 728

 Score = 38.5 bits (88), Expect = 1.4,   Method: Composition-based stats.
 Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 4/65 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+S  +G    A+ + P    V     +Y+AD  NH IRKI  L++
Sbjct: 39  ENGNIVTIAGNGTAGFSGDNGIATKAQLNGPVGVFVSSNNEVYIADYDNHRIRKI--LEN 96

Query: 136 GNQLI 140
           GN +I
Sbjct: 97  GNIVI 101



 Score = 37.0 bits (84), Expect = 3.9,   Method: Composition-based stats.
 Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 6/63 (9%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           E   +  +AG G  G+S G+ G A  +K   P+   V     +Y+AD+ NH IRKI  L+
Sbjct: 207 ENGNIVTIAGKGTAGFS-GDNGLATKEKLNFPRCVFVSSNNEVYIADQINHRIRKI--LE 263

Query: 135 HGN 137
           +GN
Sbjct: 264 NGN 266



 Score = 37.0 bits (84), Expect = 3.9,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 33/62 (53%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+   +G   +A+ + P    V     IY+AD  NH IRKI  L++
Sbjct: 151 ENGNIVTIAGNGPYGFCGDNGLATNAQLNSPAGVFVSSNNEIYIADYDNHRIRKI--LEN 208

Query: 136 GN 137
           GN
Sbjct: 209 GN 210



 Score = 36.6 bits (83), Expect = 4.6,   Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 6/56 (10%)

Query: 85  LAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           +AG G  G+S G+ G A  +K   P+   V     +Y+AD+ NH IRKI  L++GN
Sbjct: 102 IAGKGTAGFS-GDNGLATKEKLNFPRCVFVSSNNEVYIADQINHRIRKI--LENGN 154


>gi|147920395|ref|YP_685830.1| hypothetical protein RCIX1195 [Methanocella arvoryzae MRE50]
 gi|110621226|emb|CAJ36504.1| hypothetical protein RCIX1195 [Methanocella arvoryzae MRE50]
          Length = 673

 Score = 38.5 bits (88), Expect = 1.4,   Method: Composition-based stats.
 Identities = 18/37 (48%), Positives = 24/37 (64%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           G   G  G  +FD P S  +D +GNIYVAD++N+ IR
Sbjct: 360 GTGIGGAGDDQFDDPMSVDLDSEGNIYVADRNNNRIR 396



 Score = 36.2 bits (82), Expect = 6.7,   Method: Composition-based stats.
 Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 3/45 (6%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQI 143
           G ++FD+P S AVD  G+IYVAD  N+   K+   D     +R I
Sbjct: 169 GDSQFDRPMSVAVDSAGSIYVADYMNN---KVKIFDGAGTYLRSI 210


>gi|336180276|ref|YP_004585651.1| NHL repeat containing protein [Frankia symbiont of Datisca
           glomerata]
 gi|334861256|gb|AEH11730.1| NHL repeat containing protein [Frankia symbiont of Datisca
           glomerata]
          Length = 398

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 101 ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           A  + P + AVD  GN+Y++D  NH IRKI  L
Sbjct: 133 AELNSPATTAVDSDGNVYISDSHNHRIRKIDPL 165



 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 32/55 (58%), Gaps = 2/55 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           ++  +AG G  G+S DG P +A + ++P   AV    +IY++D  N  IRKI  L
Sbjct: 167 IITTIAGTGTAGFSGDGGPATAAQLNEPYGLAVATDDSIYISDYENQRIRKIDPL 221


>gi|290988149|ref|XP_002676784.1| predicted protein [Naegleria gruberi]
 gi|284090388|gb|EFC44040.1| predicted protein [Naegleria gruberi]
          Length = 2167

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 21/38 (55%)

Query: 93  YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           Y DG    A+F+ P   A      IY+ADK NHVIR I
Sbjct: 231 YIDGNTTIAKFNSPTGVAYGPSKEIYIADKMNHVIRMI 268



 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 30/57 (52%), Gaps = 1/57 (1%)

Query: 77  SEESVVKRLAGDGVQGY-SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           S    +  +AG G  G  ++    SA+   P   AV + G + +AD SNH IR+++N
Sbjct: 69  SNSGAITIVAGTGAFGSDTNTNATSAKLYSPSGLAVTIDGKLLIADTSNHAIREVSN 125


>gi|219853189|ref|YP_002467621.1| PKD domain-containing protein [Methanosphaerula palustris E1-9c]
 gi|219547448|gb|ACL17898.1| PKD domain containing protein [Methanosphaerula palustris E1-9c]
          Length = 930

 Score = 38.5 bits (88), Expect = 1.5,   Method: Composition-based stats.
 Identities = 17/46 (36%), Positives = 26/46 (56%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           G+ +  +     G+ +F+ P   AVD  GNIYV D  NH ++K T+
Sbjct: 76  GEFITQWGSEGSGNGQFENPGGIAVDSAGNIYVTDTWNHRVQKFTS 121



 Score = 37.0 bits (84), Expect = 4.0,   Method: Composition-based stats.
 Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 11/76 (14%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           + ++ V+D S + F    F  + E + K     G +G  DGE     F  P+  AVD  G
Sbjct: 526 AGNVYVVDESNNRFQ--KFTSTGEFITKW----GSEGLGDGE-----FTYPRDVAVDSGG 574

Query: 116 NIYVADKSNHVIRKIT 131
           N+Y+ D+SN  I+K +
Sbjct: 575 NVYIVDESNSRIQKFS 590



 Score = 35.8 bits (81), Expect = 9.6,   Method: Composition-based stats.
 Identities = 30/91 (32%), Positives = 46/91 (50%), Gaps = 15/91 (16%)

Query: 44  EIN-PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSA 101
           E+N PH V +D  G+  +I   ++R   +T        S  + +A  G +G  DG+    
Sbjct: 326 ELNHPHGVALDDAGNVYIIDTWNNRVQIFT--------STGEFIAKWGSKGSEDGQ---- 373

Query: 102 RFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
            F  P S AVD  GN+YV D  N+ ++K T+
Sbjct: 374 -FIYPYSIAVDSAGNVYVVDTGNNRVQKFTS 403



 Score = 35.8 bits (81), Expect = 9.8,   Method: Composition-based stats.
 Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 13/82 (15%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
           +DR G+  ++   ++R   +T     S  + + +  G G +   DG+     F+ P   A
Sbjct: 476 VDRAGNVYVVDTGNNRVQIFT-----STGTFIAQWGGSGSR---DGQ-----FNYPGGIA 522

Query: 111 VDMKGNIYVADKSNHVIRKITN 132
           VD  GN+YV D+SN+  +K T+
Sbjct: 523 VDSAGNVYVVDESNNRFQKFTS 544


>gi|108707786|gb|ABF95581.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
           protein, expressed [Oryza sativa Japonica Group]
          Length = 1064

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKIT 131
           G   +G  DG   +A F++P+  A + K NI YVAD  NH +R +T
Sbjct: 610 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALRTLT 655



 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 32/64 (50%), Gaps = 1/64 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           V  +AG G  GY DG   SA+  +P        G + VAD +N  IR I   + G + +R
Sbjct: 842 VTTIAGTGRAGYKDGPALSAQLSEPAGLVEVGDGRLLVADTNNSTIRYIVLNEKGAE-VR 900

Query: 142 QINL 145
            ++L
Sbjct: 901 TLDL 904


>gi|348508723|ref|XP_003441903.1| PREDICTED: NHL repeat-containing protein 2-like [Oreochromis
           niloticus]
          Length = 719

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 40/142 (28%), Positives = 62/142 (43%), Gaps = 29/142 (20%)

Query: 16  SLLFSLVSSG-----LLLEDG----YTVTTVIDGHQLEIN------PHSVIDRPG----- 55
           +LLFSLV  G     +L  D     Y    ++  H + I       P S++  PG     
Sbjct: 168 NLLFSLVGEGHRDRLMLFTDCALRYYGGRGLLKAHTVGIKLYKDALPPSILSFPGKVAID 227

Query: 56  --SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDM 113
             +  L++ D+       +S        +  + G    G  DG+   A F+ P+  A+  
Sbjct: 228 NSTKRLVIADTGHHRILVVS----STGDLLHVIGGPESGRKDGDVSEASFNSPQGVAI-- 281

Query: 114 KGN-IYVADKSNHVIRKITNLD 134
           KG+ +YVAD  NH+IRKI  L+
Sbjct: 282 KGDTVYVADTENHLIRKIDLLE 303


>gi|251771416|gb|EES51995.1| conserved protein of unknown function [Leptospirillum
           ferrodiazotrophum]
          Length = 373

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 40/87 (45%), Gaps = 6/87 (6%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGV--QGYSDGEPGSARFD 104
           P ++   P    L + DS  +    L         V  +AG G    G  DG  G+A F 
Sbjct: 173 PRALALTPDGKTLYISDSGNNMIRKLDV---ATGTVTTVAGKGALAPGMDDGIGGAASFR 229

Query: 105 KPKSFAVDMKGNI-YVADKSNHVIRKI 130
            P+  A+   G++ YVAD  N++IRKI
Sbjct: 230 DPRGIALSSDGSVLYVADTRNNLIRKI 256



 Score = 36.2 bits (82), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
            V  LAG    G +DG  G+A F  P++ A+   G  +Y++D  N++IRK+
Sbjct: 148 AVTTLAGRAFPGSNDGTGGAAGFYGPRALALTPDGKTLYISDSGNNMIRKL 198


>gi|222624800|gb|EEE58932.1| hypothetical protein OsJ_10596 [Oryza sativa Japonica Group]
          Length = 1017

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 41/85 (48%), Gaps = 2/85 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQINL 145
           G   +G  DG   +A F++P+  A + K NI YVAD  NH +R+I  +    + +     
Sbjct: 536 GSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINFVSETVKTLAGNGT 595

Query: 146 KPEDCSKSSQ-SGSGMFWVTVFSPY 169
           K  D     Q +    F++ + SP+
Sbjct: 596 KGSDYRGGGQGTNQACFFMVLNSPW 620



 Score = 36.2 bits (82), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 31/64 (48%), Gaps = 1/64 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           V  +AG G  GY DG   SA+  +P        G + VAD +N  IR I  L+     +R
Sbjct: 771 VTTIAGTGRAGYKDGPALSAQLSEPAGLVEVGDGRLLVADTNNSTIRYIV-LNEKGAEVR 829

Query: 142 QINL 145
            ++L
Sbjct: 830 TLDL 833


>gi|307727531|ref|YP_003910744.1| NHL repeat containing protein [Burkholderia sp. CCGE1003]
 gi|307588056|gb|ADN61453.1| NHL repeat containing protein [Burkholderia sp. CCGE1003]
          Length = 396

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 2/55 (3%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +   V  +AG G  GYS DG P + A    P   A+D  GN+++A+   HV+RK+
Sbjct: 80  DNGTVTIIAGTGQAGYSGDGGPATQAMLHYPTGVALDRAGNVFIAEPFAHVVRKV 134


>gi|290971688|ref|XP_002668616.1| predicted protein [Naegleria gruberi]
 gi|284082092|gb|EFC35872.1| predicted protein [Naegleria gruberi]
          Length = 348

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 31/49 (63%), Gaps = 3/49 (6%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +AG+G +G+S  +G   +A+ + P    V     +Y+AD SNHVIRKI+
Sbjct: 143 IAGNGTKGFSGDNGPATNAQLNGPAGVFVS-NNEVYIADYSNHVIRKIS 190



 Score = 35.8 bits (81), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 32/58 (55%), Gaps = 2/58 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S+   +  +AG+G  G+S  +G   +A+   P    V     +Y++D  NHVIRKI+
Sbjct: 189 ISQNGTIVTIAGNGKPGFSGDNGLATNAQLYNPSGTFVSSNNEVYISDCFNHVIRKIS 246


>gi|326781271|ref|ZP_08240536.1| Ricin B lectin [Streptomyces griseus XylebKG-1]
 gi|326661604|gb|EGE46450.1| Ricin B lectin [Streptomyces griseus XylebKG-1]
          Length = 508

 Score = 38.1 bits (87), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  + V+  +AG GV G + DG P + A+ +KP   AVD +G +YVA+   H +R++
Sbjct: 124 VGADGVIVTVAGTGVAGSAGDGGPAAKAQLNKPFGVAVDGEGVLYVAEFGGHRVRRV 180



 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 2/49 (4%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +AG G  G+   DG   SA+   P   AV   G +YV+D SNH +R+IT
Sbjct: 21  IAGSGTAGFGGDDGAAVSAQLKHPYEMAVSGTGILYVSDYSNHRVRRIT 69


>gi|428304139|ref|YP_007140964.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
 gi|428245674|gb|AFZ11454.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
          Length = 537

 Score = 38.1 bits (87), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 5/85 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  VI    S+ L + D+        S     ++V+    G G   + DG+   A+F +P
Sbjct: 206 PSKVIACQQSNSLFIADTGHHRLVIASLNGETQAVI----GTGSPAWVDGDLEIAQFCEP 261

Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI 130
                D +   IYVAD  NH++RKI
Sbjct: 262 MGMVFDYEQQVIYVADTVNHLLRKI 286


>gi|290972172|ref|XP_002668832.1| predicted protein [Naegleria gruberi]
 gi|284082360|gb|EFC36088.1| predicted protein [Naegleria gruberi]
          Length = 2254

 Score = 38.1 bits (87), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 35/56 (62%), Gaps = 5/56 (8%)

Query: 82  VKRLAGDGVQGYSDGEPG---SARFDKPKS-FAVDMKGNIYVADKSNHVIRKITNL 133
           +  +AG   +G SDG+ G   +++ + P+  F  D+ G+++VAD +NH IRKI N 
Sbjct: 86  ISTIAGQMTKG-SDGDGGLATASQMNSPRGVFFDDVTGSLFVADTNNHKIRKIDNF 140


>gi|116751477|ref|YP_848164.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
 gi|116700541|gb|ABK19729.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
          Length = 1763

 Score = 38.1 bits (87), Expect = 1.6,   Method: Composition-based stats.
 Identities = 20/53 (37%), Positives = 33/53 (62%), Gaps = 4/53 (7%)

Query: 81  VVKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  +AG+G  GYS G+ G+A   +   P   AVD + N+Y++D  +H IRK+
Sbjct: 504 IITTVAGNGSCGYS-GDGGAAVEAQLGYPAFVAVDSRENLYISDTESHRIRKV 555



 Score = 37.0 bits (84), Expect = 3.6,   Method: Composition-based stats.
 Identities = 17/43 (39%), Positives = 24/43 (55%), Gaps = 4/43 (9%)

Query: 88  DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           DG QG +D     A+ + P   A D  GN+Y+AD  N  +RK+
Sbjct: 590 DGDQGPAD----QAKLNNPSGVATDASGNLYIADTGNRRLRKV 628


>gi|260798200|ref|XP_002594088.1| hypothetical protein BRAFLDRAFT_118791 [Branchiostoma floridae]
 gi|229279321|gb|EEN50099.1| hypothetical protein BRAFLDRAFT_118791 [Branchiostoma floridae]
          Length = 603

 Score = 38.1 bits (87), Expect = 1.6,   Method: Composition-based stats.
 Identities = 21/48 (43%), Positives = 29/48 (60%), Gaps = 4/48 (8%)

Query: 96  GEPGS--ARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           G  GS   +FD+P   AVD +G I VADK NH I+  T   HG+ +++
Sbjct: 379 GSAGSRNGQFDRPAGVAVDNQGRIIVADKDNHRIQIFTF--HGDFMLK 424


>gi|160942444|ref|ZP_02089750.1| hypothetical protein CLOBOL_07327 [Clostridium bolteae ATCC
           BAA-613]
 gi|158434643|gb|EDP12410.1| hypothetical protein CLOBOL_07327 [Clostridium bolteae ATCC
           BAA-613]
          Length = 329

 Score = 38.1 bits (87), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 24/45 (53%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           AG G  G++DG    A F  P+       G +YV+D  N V+RKI
Sbjct: 237 AGTGEDGFADGPVSQAMFSGPQRITAAEDGALYVSDTGNSVVRKI 281


>gi|158293254|ref|XP_314572.4| AGAP010610-PA [Anopheles gambiae str. PEST]
 gi|157016870|gb|EAA09865.4| AGAP010610-PA [Anopheles gambiae str. PEST]
          Length = 730

 Score = 38.1 bits (87), Expect = 1.6,   Method: Composition-based stats.
 Identities = 29/119 (24%), Positives = 50/119 (42%), Gaps = 18/119 (15%)

Query: 16  SLLFSLVSSGLLLED----GYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYT 71
           SL  +LVSSG +  +    G    +V  G          +   G+  ++++DS+ +  + 
Sbjct: 215 SLPINLVSSGAIASNMKYPGKVACSVPTGAGGSEEQLFAVSDSGNHRVLIVDSAGTVLHK 274

Query: 72  LSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +              G    G+ DG    ARF+ P+  A      ++VAD  NH +R+I
Sbjct: 275 V--------------GGKQSGFVDGNFTKARFNAPQGVAFQGTDVVFVADNENHAVRRI 319


>gi|242041143|ref|XP_002467966.1| hypothetical protein SORBIDRAFT_01g037250 [Sorghum bicolor]
 gi|241921820|gb|EER94964.1| hypothetical protein SORBIDRAFT_01g037250 [Sorghum bicolor]
          Length = 360

 Score = 38.1 bits (87), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 2/68 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQL-- 139
           V  +AG G  GY DG   +A+  +P        G + VAD +N+ IR IT  D G ++  
Sbjct: 137 VTTIAGTGRAGYKDGPGLAAQLSEPAGLVEVGDGRLLVADTNNNTIRYITLSDKGAEVKT 196

Query: 140 IRQINLKP 147
           +  I ++P
Sbjct: 197 LELIGVQP 204


>gi|195387367|ref|XP_002052367.1| GJ17513 [Drosophila virilis]
 gi|194148824|gb|EDW64522.1| GJ17513 [Drosophila virilis]
          Length = 736

 Score = 38.1 bits (87), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 28/51 (54%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           VV+   G    G+ DG   +ARF+ P+  A   K  + VAD  NH +R+I+
Sbjct: 255 VVEYKVGGLEAGFVDGNLNTARFNNPQGVAFLDKDTLIVADTDNHALRQIS 305


>gi|288921011|ref|ZP_06415303.1| serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288347586|gb|EFC81871.1| serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 801

 Score = 38.1 bits (87), Expect = 1.6,   Method: Composition-based stats.
 Identities = 25/73 (34%), Positives = 33/73 (45%), Gaps = 12/73 (16%)

Query: 71  TLSFPLSEESVVKRLAGDGVQGYSDGEPG------------SARFDKPKSFAVDMKGNIY 118
           TL    S    V+R++ DG      G PG            SA+   P S AV  +G +Y
Sbjct: 658 TLYVADSSAHTVRRISTDGKITLVAGTPGAYGYSGDGGAATSAKLYNPHSLAVGPEGELY 717

Query: 119 VADKSNHVIRKIT 131
           +AD  N VIR +T
Sbjct: 718 IADTYNDVIRVVT 730


>gi|312372811|gb|EFR20689.1| hypothetical protein AND_19679 [Anopheles darlingi]
          Length = 745

 Score = 38.1 bits (87), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 25/50 (50%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            V+   G    G+ DG    ARF+ P+  A      +YVAD  NH IR+I
Sbjct: 258 TVRYTIGGKSSGFVDGGFREARFNAPQGVAFRGGDELYVADNENHAIRRI 307


>gi|300771890|ref|ZP_07081761.1| NHL repeat containing protein [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300761276|gb|EFK58101.1| NHL repeat containing protein [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 658

 Score = 38.1 bits (87), Expect = 1.6,   Method: Composition-based stats.
 Identities = 26/85 (30%), Positives = 42/85 (49%), Gaps = 6/85 (7%)

Query: 50  VIDRPGSSDLIVLDSSRSA---FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           V+D  G  DL   D++ +       ++F  S    V  +AG    G +DG   +A F++P
Sbjct: 406 VVDNKG--DLYFTDNNATGGNRIRKITFDASGNGTVSTIAGQASTGNADGIGAAASFNRP 463

Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI 130
              A+D   NI YV++   + IR+I
Sbjct: 464 SGLAIDKNNNILYVSEWIGNRIRRI 488


>gi|428304361|ref|YP_007141186.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
 gi|428245896|gb|AFZ11676.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
          Length = 491

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 40/85 (47%), Gaps = 5/85 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  VI    S+ L + D+        S     ++V+    G G  G+ DG+   A+F +P
Sbjct: 184 PSKVIACQQSNSLFIADTGHHRLVIASLDGETKAVI----GTGSAGWVDGDLEIAQFCEP 239

Query: 107 KSFAVDMKGN-IYVADKSNHVIRKI 130
                + +   IYVAD  NH++RKI
Sbjct: 240 MGMVFNHEQQVIYVADTVNHLLRKI 264


>gi|290997205|ref|XP_002681172.1| predicted protein [Naegleria gruberi]
 gi|284094795|gb|EFC48428.1| predicted protein [Naegleria gruberi]
          Length = 2212

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 27/47 (57%), Gaps = 8/47 (17%)

Query: 85  LAGDGVQG-YSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           LAGDG QG YS       +  +P+   V   G+IY+ D  N+VIRKI
Sbjct: 480 LAGDGYQGIYS-------QLSQPRGVCVGPTGDIYIVDAGNYVIRKI 519


>gi|290970020|ref|XP_002668012.1| predicted protein [Naegleria gruberi]
 gi|284081028|gb|EFC35268.1| predicted protein [Naegleria gruberi]
          Length = 243

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 30/55 (54%), Gaps = 2/55 (3%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           E   +  +AG+G  GYS  +G   +A+  +P S  V     +Y+AD  NH IRKI
Sbjct: 115 ENGRIVTIAGNGQPGYSGDNGPATTAQLHRPYSVFVSANNEVYIADTFNHSIRKI 169



 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 2/48 (4%)

Query: 85  LAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AG+G QGY   +G   SA+ + P    V     +++ADK+N+ IRKI
Sbjct: 11  IAGNGKQGYGGDNGLATSAQLNYPSGVFVSTNNEVFIADKNNNRIRKI 58


>gi|336313754|ref|ZP_08568676.1| thiol-disulfide isomerase-like thioredoxin [Rheinheimera sp. A13L]
 gi|335881693|gb|EGM79570.1| thiol-disulfide isomerase-like thioredoxin [Rheinheimera sp. A13L]
          Length = 498

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +VKRL G G+    DG   S+ F  P+   +  K  +YVAD  NH IR+I
Sbjct: 223 LVKRL-GSGIAELKDGHSDSSAFSSPQGLVLTDKA-LYVADTGNHAIRRI 270


>gi|241997422|ref|XP_002433360.1| nhl repeat-containing protein, putative [Ixodes scapularis]
 gi|215490783|gb|EEC00424.1| nhl repeat-containing protein, putative [Ixodes scapularis]
          Length = 631

 Score = 38.1 bits (87), Expect = 1.7,   Method: Composition-based stats.
 Identities = 24/70 (34%), Positives = 36/70 (51%), Gaps = 10/70 (14%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQI--NLK 146
           G++G S G      F  P+S  +D  G IYV D  NH   +I  LDH    + +I  +++
Sbjct: 446 GMKGRSAGH-----FRSPESICIDNTGRIYVCDTCNH---RIQVLDHDGIFLHEIGTHVQ 497

Query: 147 PEDCSKSSQS 156
           PE+    S+S
Sbjct: 498 PENAFNRSRS 507


>gi|29349077|ref|NP_812580.1| hypothetical protein BT_3669 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340984|gb|AAO78774.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 196

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 30/54 (55%), Gaps = 1/54 (1%)

Query: 78  EESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           E  ++  +AG  G  GY+ G P  +R ++P    +  + +IYVAD  N VI K+
Sbjct: 140 ETKLMSTVAGIGGKSGYAAGNPTVSRMNRPYGICITPENDIYVADAGNKVIMKL 193


>gi|290985545|ref|XP_002675486.1| predicted protein [Naegleria gruberi]
 gi|284089082|gb|EFC42742.1| predicted protein [Naegleria gruberi]
          Length = 819

 Score = 38.1 bits (87), Expect = 1.7,   Method: Composition-based stats.
 Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 2/48 (4%)

Query: 85  LAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AG G +G+S DG P  SA+ D P    V     +Y+ D +NH IRKI
Sbjct: 567 IAGTGEKGFSGDGGPATSAQLDSPCGVFVSNNDEVYIVDYNNHRIRKI 614


>gi|281423977|ref|ZP_06254890.1| NHL repeat-containing domain protein [Prevotella oris F0302]
 gi|281401902|gb|EFB32733.1| NHL repeat-containing domain protein [Prevotella oris F0302]
          Length = 469

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 27/54 (50%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
            V++ AG    G  DG   SA F  P   A D+ GN+YV +     IRKI + D
Sbjct: 359 TVEQFAGFNGVGSQDGPRMSASFKNPTGMAFDVDGNMYVTESMGFTIRKIGHTD 412


>gi|182440597|ref|YP_001828316.1| hypothetical protein SGR_6804 [Streptomyces griseus subsp. griseus
           NBRC 13350]
 gi|178469113|dbj|BAG23633.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC
           13350]
          Length = 508

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 5/59 (8%)

Query: 78  EESVVKRL---AGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +E  V R+   AG G  G+   DG   SA+   P   AV   G +YV+D SNH +R+IT
Sbjct: 11  DEGFVPRISAIAGSGTAGFGGDDGAAVSAQLKHPYEMAVSGTGILYVSDYSNHRVRRIT 69



 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 76  LSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  + V+  +AG GV G + DG P + A+ +KP   AVD +G +YVA+   H +R++
Sbjct: 124 VGADGVMVTVAGTGVAGSAGDGGPAAKAQLNKPFGVAVDGEGVLYVAEFGGHRVRRV 180


>gi|220914451|ref|YP_002489760.1| NHL repeat containing protein [Arthrobacter chlorophenolicus A6]
 gi|219861329|gb|ACL41671.1| NHL repeat containing protein [Arthrobacter chlorophenolicus A6]
          Length = 674

 Score = 38.1 bits (87), Expect = 1.8,   Method: Composition-based stats.
 Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 2/67 (2%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           PL+    V  +AG+G++G  DG    + F +    A D  GNI+VAD     +R +   D
Sbjct: 373 PLT--GAVAIVAGNGLEGLLDGPAHESWFAQSSGLAEDADGNIWVADSETSALRTLVIDD 430

Query: 135 HGNQLIR 141
            G+  ++
Sbjct: 431 TGSLTVK 437



 Score = 37.0 bits (84), Expect = 4.5,   Method: Composition-based stats.
 Identities = 29/115 (25%), Positives = 48/115 (41%), Gaps = 4/115 (3%)

Query: 19  FSLVSSGLLLEDGYTVTTVIDG--HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPL 76
           F  ++  + +  G  +  ++DG  H+      S +      ++ V DS  SA  TL    
Sbjct: 371 FDPLTGAVAIVAGNGLEGLLDGPAHESWFAQSSGLAEDADGNIWVADSETSALRTLVIDD 430

Query: 77  SEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           +    VK   G G+   G+ DG    AR   P    V   G++ +AD  N  +R+
Sbjct: 431 TGSLTVKTAVGKGLFDFGFRDGTAAEARLQHPLGVTVLPDGSVAIADTYNGAVRR 485


>gi|116754580|ref|YP_843698.1| NHL repeat-containing protein [Methanosaeta thermophila PT]
 gi|116666031|gb|ABK15058.1| NHL repeat containing protein [Methanosaeta thermophila PT]
          Length = 504

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 27/105 (25%), Positives = 49/105 (46%), Gaps = 12/105 (11%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           G  ++ + DS+ +    +S     +  +  + G G +GYSDG+   AR  +P+  A+ + 
Sbjct: 202 GGDNIFIADSNNNRILIVS----PDGDLMDVIGSGERGYSDGDFSEARLFRPQGIAI-VG 256

Query: 115 GNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSGSG 159
             +Y+AD  NH++R          L R+  ++     KS   G G
Sbjct: 257 DVVYIADTGNHMVRAA-------DLRRRTLVRMAGTGKSRHPGLG 294


>gi|414866501|tpg|DAA45058.1| TPA: hypothetical protein ZEAMMB73_922990 [Zea mays]
          Length = 237

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 30/58 (51%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQL 139
           V  +AG G  GY DG   +A+  +P        G + VAD +N+ IR IT  D G ++
Sbjct: 14  VTTIAGTGSAGYRDGPGLTAQLSEPAGLVEVGDGRLLVADTNNNTIRYITLSDKGAEV 71


>gi|299142776|ref|ZP_07035905.1| NHL repeat-containing protein [Prevotella oris C735]
 gi|298575805|gb|EFI47682.1| NHL repeat-containing protein [Prevotella oris C735]
          Length = 448

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 27/55 (49%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
             V++ AG    G  DG   SA F  P   A D+ GN+YV +     IRKI + D
Sbjct: 337 QTVEQFAGFNGVGSQDGPRMSASFKNPTGMAFDVDGNMYVTESMGFTIRKIGHTD 391


>gi|91076402|ref|XP_969236.1| PREDICTED: similar to CG12547 CG12547-PA [Tribolium castaneum]
 gi|270002449|gb|EEZ98896.1| hypothetical protein TcasGA2_TC004511 [Tribolium castaneum]
          Length = 712

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 17/55 (30%), Positives = 25/55 (45%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           L  +  +    G G  G+ DG      F+ P+      +  ++VAD  NH IRKI
Sbjct: 245 LKSDGTILHQVGSGEIGFRDGSFTQCEFNAPQGLVFQNENTLFVADTENHAIRKI 299


>gi|144899324|emb|CAM76188.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense
           MSR-1]
          Length = 477

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 3/56 (5%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQ 142
           G G+ G  DG    A F+ P+  A+D +  IYVAD  NH+IR+I     G++  R+
Sbjct: 210 GSGLAGRDDGI--DASFNGPEGVAID-RAFIYVADTRNHLIRRIDRQSGGDRHHRR 262


>gi|290985345|ref|XP_002675386.1| predicted protein [Naegleria gruberi]
 gi|284088982|gb|EFC42642.1| predicted protein [Naegleria gruberi]
          Length = 788

 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++  +AG G +GYS DG P  +A+     +  V     +Y+AD +NH IRKI
Sbjct: 297 IINTIAGTGEEGYSGDGGPAINAQISNVYNIFVSQNDEVYIADTNNHRIRKI 348


>gi|227540165|ref|ZP_03970214.1| conserved protein of hypothetical function [Sphingobacterium
           spiritivorum ATCC 33300]
 gi|227240026|gb|EEI90041.1| conserved protein of hypothetical function [Sphingobacterium
           spiritivorum ATCC 33300]
          Length = 658

 Score = 38.1 bits (87), Expect = 1.9,   Method: Composition-based stats.
 Identities = 26/85 (30%), Positives = 42/85 (49%), Gaps = 6/85 (7%)

Query: 50  VIDRPGSSDLIVLDSSRSA---FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           V+D  G  DL   D++ +       ++F  S    V  +AG    G +DG   +A F++P
Sbjct: 406 VVDNKG--DLYFTDNNATGGNRIRKITFDASGNGTVSTIAGQASTGNADGIGAAASFNRP 463

Query: 107 KSFAVDMKGNI-YVADKSNHVIRKI 130
              A+D   NI YV++   + IR+I
Sbjct: 464 SGLAIDKNNNILYVSEWIGNRIRRI 488


>gi|167997357|ref|XP_001751385.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162697366|gb|EDQ83702.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 663

 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 33/62 (53%), Gaps = 4/62 (6%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPE 148
           + G+ DG   SAR  +P S   D  K  +Y+AD  NH IR+       ++++  +  KP+
Sbjct: 218 IAGFEDGSFESARLRRPSSIVFDKEKKLLYIADCENHAIRRA---HISSRMVDTLYPKPD 274

Query: 149 DC 150
           +C
Sbjct: 275 EC 276


>gi|383124309|ref|ZP_09944974.1| hypothetical protein BSIG_5606 [Bacteroides sp. 1_1_6]
 gi|382983730|gb|EIC72829.1| hypothetical protein BSIG_5606 [Bacteroides sp. 1_1_6]
          Length = 468

 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 30/54 (55%), Gaps = 1/54 (1%)

Query: 78  EESVVKRLAG-DGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           E  ++  +AG  G  GY+ G P  +R ++P    +  + +IYVAD  N VI K+
Sbjct: 412 ETKLMSTVAGIGGKSGYAAGNPTVSRMNRPYGICITPENDIYVADAGNKVIMKL 465


>gi|290978917|ref|XP_002672181.1| predicted protein [Naegleria gruberi]
 gi|284085756|gb|EFC39437.1| predicted protein [Naegleria gruberi]
          Length = 1457

 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 7/48 (14%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKIT 131
            AGDG      G    A F++P   A+D   G++YV D+ NH IRKIT
Sbjct: 125 FAGDG------GMAIKALFNRPVHIAIDHSNGDLYVTDQGNHAIRKIT 166


>gi|290975594|ref|XP_002670527.1| predicted protein [Naegleria gruberi]
 gi|284084087|gb|EFC37783.1| predicted protein [Naegleria gruberi]
          Length = 443

 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 35/62 (56%), Gaps = 4/62 (6%)

Query: 78  EESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDH 135
           E   +  +AG+G  G+S  +G   SA+   P    V     +Y++D +N++IRKI  L++
Sbjct: 95  ENGNIITIAGNGTVGFSGDNGPATSAQLYNPFGIFVSANNEVYISDSNNNIIRKI--LEN 152

Query: 136 GN 137
           GN
Sbjct: 153 GN 154


>gi|326431736|gb|EGD77306.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
          Length = 1384

 Score = 38.1 bits (87), Expect = 2.0,   Method: Composition-based stats.
 Identities = 22/65 (33%), Positives = 31/65 (47%), Gaps = 2/65 (3%)

Query: 81   VVKRLAGDGVQGY--SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQ 138
            V+  LAG GV G+    G P +AR + P+  A    G I +AD  N  IR +     G  
Sbjct: 1270 VITALAGTGVAGFRGDGGAPMAARLNTPRGIAAMPAGEIAIADTGNSRIRTLNVGGQGAG 1329

Query: 139  LIRQI 143
            +I  +
Sbjct: 1330 VIETV 1334


>gi|290978674|ref|XP_002672060.1| predicted protein [Naegleria gruberi]
 gi|284085634|gb|EFC39316.1| predicted protein [Naegleria gruberi]
          Length = 1219

 Score = 38.1 bits (87), Expect = 2.0,   Method: Composition-based stats.
 Identities = 19/52 (36%), Positives = 29/52 (55%), Gaps = 3/52 (5%)

Query: 82  VKRLAGDGVQGYS--DGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
           +  +AG G+ GY+  D    SA+ + P    +D   G I+++D  NH IRKI
Sbjct: 114 ITTIAGTGIAGYNGDDKSADSAQLNTPSGIVIDPNNGEIFISDSKNHRIRKI 165



 Score = 37.7 bits (86), Expect = 2.2,   Method: Composition-based stats.
 Identities = 22/49 (44%), Positives = 32/49 (65%), Gaps = 3/49 (6%)

Query: 85  LAGDGVQGYS-DG-EPGSARFDKPKSFAVDM-KGNIYVADKSNHVIRKI 130
           +AG G +GY+ DG E  SA+ + P   A+++  G IY+AD  N+ IRKI
Sbjct: 397 IAGTGTKGYNGDGIEATSAQLNTPHDVALNLATGEIYIADTENNRIRKI 445



 Score = 36.2 bits (82), Expect = 6.7,   Method: Composition-based stats.
 Identities = 21/49 (42%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 85  LAGDGVQGYS-DG-EPGSARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
           +AG+G  GY+ DG +  SA+ + P    VD     +Y+AD +NH IRKI
Sbjct: 60  IAGNGFAGYNGDGLDATSAQLNNPVGIYVDTNSREVYIADSNNHRIRKI 108


>gi|425734502|ref|ZP_18852820.1| NHL repeat-containing protein [Brevibacterium casei S18]
 gi|425481116|gb|EKU48277.1| NHL repeat-containing protein [Brevibacterium casei S18]
          Length = 666

 Score = 37.7 bits (86), Expect = 2.0,   Method: Composition-based stats.
 Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 13/95 (13%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV---D 112
           S +L+V DS   +   + +    ++V++R+ G G +G +DG+  SA+F +P    V   D
Sbjct: 208 SGNLLVADSGHHSL--VEYTPDGQTVLRRI-GTGERGLTDGDFASAQFSEPGGITVLPED 264

Query: 113 MKG----NIYVADKSNHVIRKITNLDHGNQLIRQI 143
           +       + VAD  NHV+R I   D   + +R I
Sbjct: 265 IAARAGYQLVVADTVNHVLRGI---DLDGETVRTI 296


>gi|224539142|ref|ZP_03679681.1| hypothetical protein BACCELL_04044 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224519246|gb|EEF88351.1| hypothetical protein BACCELL_04044 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 470

 Score = 37.7 bits (86), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 1/54 (1%)

Query: 78  EESVVKRLAGDGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           E +++  +AG G Q G+S G P  ++ + P    +   G+IY+AD  N VI K+
Sbjct: 414 ETNMMTTVAGIGKQKGFSTGSPKESKLNWPAGICLSPNGDIYIADAGNRVIMKL 467


>gi|111219555|ref|YP_710349.1| serine/threonine-protein kinase [Frankia alni ACN14a]
 gi|111147087|emb|CAJ58734.1| putative serine/threonine-protein kinase (partial match) [Frankia
           alni ACN14a]
          Length = 899

 Score = 37.7 bits (86), Expect = 2.0,   Method: Composition-based stats.
 Identities = 20/52 (38%), Positives = 29/52 (55%), Gaps = 2/52 (3%)

Query: 81  VVKRLAGDGVQGYS-DGEPGSA-RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+  +AG G  GYS DG P +A + +  +       G++Y +D  NH IRKI
Sbjct: 667 VISTVAGTGTAGYSGDGGPATAAQLNSAEKVTTGPDGSVYFSDYDNHRIRKI 718



 Score = 37.0 bits (84), Expect = 3.7,   Method: Composition-based stats.
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +A+ D P S A D  GNIY+ D  N+ IRK+T
Sbjct: 632 AAQLDGPGSTARDKAGNIYIGDAKNNRIRKVT 663


>gi|196009344|ref|XP_002114537.1| hypothetical protein TRIADDRAFT_11855 [Trichoplax adhaerens]
 gi|190582599|gb|EDV22671.1| hypothetical protein TRIADDRAFT_11855 [Trichoplax adhaerens]
          Length = 507

 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           +G+ DG    ARF+ P+       G  I+VAD +NH IRKI
Sbjct: 214 EGFVDGVFQKARFNNPQGITCSRNGKTIFVADTNNHAIRKI 254


>gi|377811375|ref|YP_005043815.1| NHL repeat containing protein [Burkholderia sp. YI23]
 gi|357940736|gb|AET94292.1| NHL repeat containing protein [Burkholderia sp. YI23]
          Length = 382

 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 20/59 (33%), Positives = 30/59 (50%), Gaps = 3/59 (5%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINL 145
           G+ ++ + +   G  +F  P S AVD  G +YVAD+SN   R+I   D      RQ  +
Sbjct: 219 GNWLKSWGERGKGPGQFHTPHSIAVDANGMVYVADRSN---RRIQVFDGEGNFQRQFTI 274


>gi|340724286|ref|XP_003400514.1| PREDICTED: tripartite motif-containing protein 71-like [Bombus
           terrestris]
          Length = 711

 Score = 37.7 bits (86), Expect = 2.1,   Method: Composition-based stats.
 Identities = 27/80 (33%), Positives = 37/80 (46%), Gaps = 12/80 (15%)

Query: 52  DRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAV 111
           DR G   +IV D S +      +   + S V+R    G         G  +FD+P   AV
Sbjct: 454 DREGH--MIVADRSNNRIQI--YRQEDGSFVRRFGSYGT--------GPGQFDRPAGVAV 501

Query: 112 DMKGNIYVADKSNHVIRKIT 131
           D +  I VADK NH I+ +T
Sbjct: 502 DARRRIIVADKDNHRIQILT 521


>gi|390341274|ref|XP_003725419.1| PREDICTED: tripartite motif-containing protein 71-like
           [Strongylocentrotus purpuratus]
          Length = 778

 Score = 37.7 bits (86), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 25/41 (60%), Gaps = 2/41 (4%)

Query: 87  GDGVQGYSDGEPGS--ARFDKPKSFAVDMKGNIYVADKSNH 125
            DG   +  G PG+   +FD+P   AV+ +GNI +ADK NH
Sbjct: 545 ADGSFSHKFGSPGTRNGQFDRPAGVAVNGEGNIIIADKDNH 585


>gi|406916662|gb|EKD55633.1| NHL repeat containing protein, partial [uncultured bacterium]
          Length = 188

 Score = 37.7 bits (86), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQS 156
           GS +F++P    +D K NIYV D  N   R+IT  D    ++ +I LK     KS+ +
Sbjct: 49  GSGKFNEPVDIGIDSKSNIYVLDAGN---RRITGFDEYGNILGEIPLKKIGGLKSAGA 103


>gi|380300595|ref|ZP_09850288.1| NHL repeat protein [Brachybacterium squillarum M-6-3]
          Length = 644

 Score = 37.7 bits (86), Expect = 2.2,   Method: Composition-based stats.
 Identities = 39/115 (33%), Positives = 49/115 (42%), Gaps = 9/115 (7%)

Query: 19  FSLVSSGLLLEDGYTVTTVIDGHQLE---INPHSVIDRPGSSDLIVLDSSRSAFYTLSFP 75
           F  V+  L++  G T   ++DG  +      P S ID      L+V DS  SA   L  P
Sbjct: 342 FDPVTGSLMVLAGTTQEGLVDGPAVTSWWAQP-SGIDELPDGRLVVADSESSAIRVLD-P 399

Query: 76  LSEESVVKRLAGDGVQ--GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
            + E  V  L G G+   G+ DG    AR   P    V   G I VAD  N  IR
Sbjct: 400 QTME--VSTLVGQGLFDFGHVDGPAEVARLQHPLGVTVLPDGRIAVADTYNGAIR 452


>gi|325923856|ref|ZP_08185461.1| NHL repeat protein [Xanthomonas gardneri ATCC 19865]
 gi|325545661|gb|EGD16910.1| NHL repeat protein [Xanthomonas gardneri ATCC 19865]
          Length = 472

 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251


>gi|290973007|ref|XP_002669241.1| predicted protein [Naegleria gruberi]
 gi|284082786|gb|EFC36497.1| predicted protein [Naegleria gruberi]
          Length = 747

 Score = 37.7 bits (86), Expect = 2.3,   Method: Composition-based stats.
 Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 82  VKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG GV G S DG+  + A+ + P+  A+   G IY AD SN  IR+I
Sbjct: 244 INTIAGTGVLGLSGDGDLATKAQLNTPRGIAISSTGEIYFADTSNQRIRRI 294



 Score = 37.4 bits (85), Expect = 3.2,   Method: Composition-based stats.
 Identities = 19/55 (34%), Positives = 31/55 (56%), Gaps = 2/55 (3%)

Query: 78  EESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           E  ++  +AG G   +  DG+  + A+ + P+  A+   G IY+AD  NH IR+I
Sbjct: 128 ENGIITTIAGTGDSRFGGDGDLATKAQLNSPRGIAISSTGEIYIADTYNHRIRRI 182


>gi|317504749|ref|ZP_07962709.1| NHL repeat containing protein [Prevotella salivae DSM 15606]
 gi|315664131|gb|EFV03838.1| NHL repeat containing protein [Prevotella salivae DSM 15606]
          Length = 448

 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 27/54 (50%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
            +++ AG    G  DG   SA F  P   A D+ GN+Y+ D     IRKI + D
Sbjct: 338 TIEQFAGFNGVGSMDGPRMSASFRNPTGMAFDVDGNMYITDSMGFTIRKIGHAD 391


>gi|320103824|ref|YP_004179415.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
 gi|319751106|gb|ADV62866.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
          Length = 417

 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 27/98 (27%), Positives = 48/98 (48%), Gaps = 13/98 (13%)

Query: 41  HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGY-SDGEPG 99
            +L+++P  ++         ++D+   A    +    +  VV  LAG+G +G   DG P 
Sbjct: 327 KELDVSPEGLV--------AIVDTENQAIRLYN---PKTRVVTTLAGNGQRGREGDGVPA 375

Query: 100 SA-RFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           +A R D+P   A    G I ++D +NH IR + ++  G
Sbjct: 376 TAARLDRPHGIAFGPGGVIAISDTNNHRIRLVKSVPVG 413


>gi|152964340|ref|YP_001360124.1| fibronectin type III domain-containing protein [Kineococcus
           radiotolerans SRS30216]
 gi|151358857|gb|ABS01860.1| Fibronectin type III domain protein [Kineococcus radiotolerans
           SRS30216]
          Length = 841

 Score = 37.7 bits (86), Expect = 2.4,   Method: Composition-based stats.
 Identities = 19/48 (39%), Positives = 25/48 (52%), Gaps = 1/48 (2%)

Query: 85  LAGDGVQGYSD-GEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            AG G  G    G+   +    P   A+D  GN++VAD  NHV+ KIT
Sbjct: 192 FAGTGFAGSPQAGDANKSPLASPTGVALDAAGNLHVADADNHVVEKIT 239


>gi|328783461|ref|XP_396242.3| PREDICTED: NHL repeat-containing protein 2 [Apis mellifera]
          Length = 699

 Score = 37.7 bits (86), Expect = 2.4,   Method: Composition-based stats.
 Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           V+ + G     + DG+  +ARF+ P+   + +   IYVAD  NH IRKI
Sbjct: 262 VQHVIGGSNPDFRDGDFENARFNAPQGVCI-LDTFIYVADNENHAIRKI 309


>gi|290976601|ref|XP_002671028.1| predicted protein [Naegleria gruberi]
 gi|284084593|gb|EFC38284.1| predicted protein [Naegleria gruberi]
          Length = 925

 Score = 37.7 bits (86), Expect = 2.4,   Method: Composition-based stats.
 Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 7/96 (7%)

Query: 38  IDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-D 95
           ++  Q+ I+P  V +DR  + ++ V D   S    +     + + +   AG+G+ GYS D
Sbjct: 50  VNAGQVSISPTKVKVDRK-TGNVYVSDRINSRIRKIDINTGKATTI---AGNGIIGYSGD 105

Query: 96  GEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G+  + A  + P    +    +IY AD  NH IRKI
Sbjct: 106 GQLATQANINNPGKVELSSTNDIYFADTENHRIRKI 141


>gi|452822384|gb|EME29404.1| haloacid dehalogenase-like hydrolase family protein [Galdieria
           sulphuraria]
          Length = 679

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIR 128
           + L GD   GY DG    A+F  P   A D +   +Y+ D+ NHVIR
Sbjct: 609 RWLCGDSQLGYVDGPKTYAKFHCPCDVAWDPISQRLYIVDRENHVIR 655


>gi|340378860|ref|XP_003387945.1| PREDICTED: hypothetical protein LOC100632455 [Amphimedon
           queenslandica]
          Length = 1157

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 14/32 (43%), Positives = 22/32 (68%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + RF  P+  A+D +G +YVAD+ NH I+K +
Sbjct: 150 NGRFQSPRDIAIDSQGLVYVADQCNHRIQKFS 181


>gi|334335815|ref|YP_004540967.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ Mal
           allergen [Isoptericola variabilis 225]
 gi|334106183|gb|AEG43073.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Isoptericola variabilis 225]
          Length = 668

 Score = 37.7 bits (86), Expect = 2.4,   Method: Composition-based stats.
 Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 4/71 (5%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA----VDMKGNIYVADKSNHVIRKI 130
           P  E   V+ LAG   +G  DG  G+A F +P   A     D  G++++AD     +R +
Sbjct: 358 PGPESGTVRLLAGTMNEGLEDGPGGTAWFAQPSGLAPVLGSDADGSLWLADAETSALRLV 417

Query: 131 TNLDHGNQLIR 141
           T    G   +R
Sbjct: 418 TPAADGQVQVR 428


>gi|427706390|ref|YP_007048767.1| alkyl hydroperoxide reductase [Nostoc sp. PCC 7107]
 gi|427358895|gb|AFY41617.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Nostoc sp. PCC 7107]
          Length = 505

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 4/63 (6%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLI 140
           V  + G G  G +DG   +A+F  P+    D +  I Y+AD  NH +R++   D   Q++
Sbjct: 206 VLHIIGTGKSGLTDGSFSTAQFSAPQGMVFDSENQIFYIADTENHALRRV---DLKQQIV 262

Query: 141 RQI 143
             I
Sbjct: 263 ETI 265



 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 15/48 (31%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 83  KRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + + GDG  G  DG+  ++RF +P   +  +   +Y++D +NH IR++
Sbjct: 433 QTILGDGFAGLQDGQGKNSRFFEPSGLSA-INSYLYISDTNNHAIRRV 479


>gi|290978943|ref|XP_002672194.1| predicted protein [Naegleria gruberi]
 gi|284085769|gb|EFC39450.1| predicted protein [Naegleria gruberi]
          Length = 1483

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 36/77 (46%), Gaps = 6/77 (7%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S+  IV+  S++    L F L EE  +  LAG    G+ D  P       P    +    
Sbjct: 733 STKYIVISDSKNN--RLRFYLKEEKTIYTLAG----GFGDNGPSENGIGTPFGVTISPNN 786

Query: 116 NIYVADKSNHVIRKITN 132
            IYV+D  N++IR+I N
Sbjct: 787 EIYVSDVLNNLIRRIKN 803


>gi|290994583|ref|XP_002679911.1| predicted protein [Naegleria gruberi]
 gi|284093530|gb|EFC47167.1| predicted protein [Naegleria gruberi]
          Length = 2449

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 44/89 (49%), Gaps = 9/89 (10%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGE-PGSARF 103
           N  SV   P + D + +D  R    + +  L     V  +AG       SDG+   SA+F
Sbjct: 87  NVVSVFKSPATGDCVYMDQKRVRMISKTTNL-----VTTIAGSLTSTNTSDGQFATSAKF 141

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKITN 132
             PK+F +D  GN+YV D  N  IRK++N
Sbjct: 142 -FPKAFTIDEYGNLYVVD-FNVRIRKVSN 168


>gi|58581916|ref|YP_200932.1| hypothetical protein XOO2293 [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|58426510|gb|AAW75547.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
           10331]
          Length = 472

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251


>gi|269837841|ref|YP_003320069.1| alkyl hydroperoxide reductase [Sphaerobacter thermophilus DSM
           20745]
 gi|269787104|gb|ACZ39247.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Sphaerobacter thermophilus DSM 20745]
          Length = 494

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 36/84 (42%), Gaps = 5/84 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+    +  LIV DS            S+  V+    G G  G  DG    A F  P
Sbjct: 180 PGKVLADAATDRLIVADSGHHRLIVARLDGSDAWVI----GSGEAGLRDGPGAIAAFHDP 235

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
           +  A+     +YVAD  NHV+R++
Sbjct: 236 QGLAL-YGDTLYVADTRNHVVRRV 258



 Score = 37.4 bits (85), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 38/74 (51%), Gaps = 5/74 (6%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGN 116
           L V DS  SA  T++FP  +E  V+ L G G+  + D  G     R   P + A    G 
Sbjct: 356 LYVADSETSAVRTVTFPPGDE--VRTLVGTGLFDFGDVDGIGDEVRLQHPLAVACG-DGV 412

Query: 117 IYVADKSNHVIRKI 130
           +YVAD  NH I+++
Sbjct: 413 LYVADSYNHKIKRL 426


>gi|184201467|ref|YP_001855674.1| hypothetical protein KRH_18210 [Kocuria rhizophila DC2201]
 gi|183581697|dbj|BAG30168.1| hypothetical protein [Kocuria rhizophila DC2201]
          Length = 246

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 36/84 (42%), Gaps = 5/84 (5%)

Query: 55  GSSDLIVLDSSRSAFYTLSF----PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
           G   L  +D   S+   L+     P  +  VV  +   G  GY DG    AR D P+   
Sbjct: 111 GDGRLWFVDRDSSSLRCLTIDTDRPDGDPHVVTVVGRHGA-GYQDGPATEARLDHPEDLQ 169

Query: 111 VDMKGNIYVADKSNHVIRKITNLD 134
           +   G++ VAD  NH +R +   D
Sbjct: 170 MLYDGSVVVADTGNHALRHVDVTD 193


>gi|188576512|ref|YP_001913441.1| hypothetical protein PXO_00583 [Xanthomonas oryzae pv. oryzae
           PXO99A]
 gi|188520964|gb|ACD58909.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
           PXO99A]
          Length = 472

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251


>gi|160882316|ref|ZP_02063319.1| hypothetical protein BACOVA_00264 [Bacteroides ovatus ATCC 8483]
 gi|156112324|gb|EDO14069.1| IPT/TIG domain protein [Bacteroides ovatus ATCC 8483]
          Length = 465

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 31/60 (51%), Gaps = 5/60 (8%)

Query: 76  LSEESVVKRLAGDGVQ---GYSDGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           L+ E  V   AG G     GY DGE  S A F+ P S A DMK    Y+ D  NH +RKI
Sbjct: 402 LTPEGRVTTYAGRGNSREWGYVDGELRSQALFNHPTSIAYDMKRKCFYIGDCDNHRVRKI 461


>gi|380016202|ref|XP_003692077.1| PREDICTED: membrane-associated protein Hem-like [Apis florea]
          Length = 1439

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           ++ + G     + DG+  +ARF+ P+   + +   +YVAD  NH IRKI
Sbjct: 256 IQHVIGGSNPDFRDGDFENARFNAPQGVCI-LDTFVYVADNENHAIRKI 303


>gi|346725095|ref|YP_004851764.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
           pv. citrumelo F1]
 gi|418518756|ref|ZP_13084891.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
           pv. malvacearum str. GSPB1386]
 gi|346649842|gb|AEO42466.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
           pv. citrumelo F1]
 gi|410702450|gb|EKQ60955.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
           pv. malvacearum str. GSPB1386]
          Length = 472

 Score = 37.4 bits (85), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251


>gi|384428019|ref|YP_005637378.1| hypothetical protein XCR_2380 [Xanthomonas campestris pv. raphani
           756C]
 gi|341937121|gb|AEL07260.1| conserved hypothetical protein [Xanthomonas campestris pv. raphani
           756C]
          Length = 472

 Score = 37.4 bits (85), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALERE-ELYVADTGNHALRRI-NLRSGQ 251


>gi|289665900|ref|ZP_06487481.1| hypothetical protein XcampvN_23187 [Xanthomonas campestris pv.
           vasculorum NCPPB 702]
          Length = 472

 Score = 37.4 bits (85), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251


>gi|118575759|ref|YP_875502.1| hypothetical protein CENSYa_0565 [Cenarchaeum symbiosum A]
 gi|118194280|gb|ABK77198.1| hypothetical protein CENSYa_0565 [Cenarchaeum symbiosum A]
          Length = 1566

 Score = 37.4 bits (85), Expect = 2.6,   Method: Composition-based stats.
 Identities = 21/47 (44%), Positives = 26/47 (55%), Gaps = 5/47 (10%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           L G G QG + GE     F  P+  A D  G+IYVAD  NH I++ T
Sbjct: 81  LFGWGTQGSNSGE-----FLLPRGIAADPGGDIYVADTGNHRIQRFT 122


>gi|449435944|ref|XP_004135754.1| PREDICTED: NHL repeat-containing protein 2-like [Cucumis sativus]
          Length = 1086

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 17/48 (35%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
           ++   G  G  DG    A F++P+  A + K N+ YVAD  NH +R++
Sbjct: 639 QIGSTGEDGLRDGNFDDATFNRPQGLAYNAKKNLLYVADTENHALREV 686


>gi|188991459|ref|YP_001903469.1| hypothetical protein xccb100_2064 [Xanthomonas campestris pv.
           campestris str. B100]
 gi|167733219|emb|CAP51417.1| Conserved hypothetical protein [Xanthomonas campestris pv.
           campestris]
          Length = 481

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 207 VLRQFGHGNADLIDGGVGEAAFRRPQGLALERE-ELYVADTGNHALRRI-NLRSGQ 260


>gi|418523208|ref|ZP_13089230.1| hypothetical protein WS7_19596 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410700170|gb|EKQ58738.1| hypothetical protein WS7_19596 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 472

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251


>gi|423303024|ref|ZP_17281045.1| hypothetical protein HMPREF1057_04186 [Bacteroides finegoldii
           CL09T03C10]
 gi|408470353|gb|EKJ88888.1| hypothetical protein HMPREF1057_04186 [Bacteroides finegoldii
           CL09T03C10]
          Length = 465

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 31/60 (51%), Gaps = 5/60 (8%)

Query: 76  LSEESVVKRLAGDGVQ---GYSDGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           L+ E  V   AG G     GY DGE  S A F+ P S A DMK    Y+ D  NH +RKI
Sbjct: 402 LTPEGRVTTYAGRGNSREWGYVDGELRSQALFNHPTSIAYDMKRKCFYIGDCDNHRVRKI 461


>gi|381173685|ref|ZP_09882761.1| NHL repeat family protein [Xanthomonas citri pv. mangiferaeindicae
           LMG 941]
 gi|380685855|emb|CCG39248.1| NHL repeat family protein [Xanthomonas citri pv. mangiferaeindicae
           LMG 941]
          Length = 472

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251


>gi|390991569|ref|ZP_10261830.1| NHL repeat family protein [Xanthomonas axonopodis pv. punicae str.
           LMG 859]
 gi|372553665|emb|CCF68805.1| NHL repeat family protein [Xanthomonas axonopodis pv. punicae str.
           LMG 859]
          Length = 472

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251


>gi|299144811|ref|ZP_07037879.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
 gi|298515302|gb|EFI39183.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
          Length = 459

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 12/59 (20%)

Query: 85  LAGD-GVQGYSDGEPGSARFDKP--------KSFAVDMKGNI---YVADKSNHVIRKIT 131
            AG+ G  GY D    SARFD P        + +  + + +I   Y+ D +NH IRKIT
Sbjct: 336 FAGEFGSYGYKDASQNSARFDNPMQGCFVLNEEYVAEQRADIYDFYLTDAANHCIRKIT 394


>gi|219852941|ref|YP_002467373.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
 gi|219547200|gb|ACL17650.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
          Length = 579

 Score = 37.4 bits (85), Expect = 2.7,   Method: Composition-based stats.
 Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 2/53 (3%)

Query: 82  VKRLAGDGV--QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           V+R    G+    +  G  G  +F  P   AVD  GN+YVAD  N+ ++K ++
Sbjct: 119 VQRFTSTGIFLARWGTGGTGDGQFKSPSGVAVDSAGNVYVADMYNYRVQKFSS 171



 Score = 36.6 bits (83), Expect = 5.4,   Method: Composition-based stats.
 Identities = 19/48 (39%), Positives = 27/48 (56%), Gaps = 5/48 (10%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           LA  G +G  DG+     FD P   AVD + N+YV D  N+ ++K T+
Sbjct: 176 LAKWGTEGGGDGQ-----FDYPTGIAVDSENNVYVVDSYNNRVQKFTS 218


>gi|340382512|ref|XP_003389763.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 24-like [Amphimedon
            queenslandica]
          Length = 1354

 Score = 37.4 bits (85), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 28/54 (51%), Gaps = 6/54 (11%)

Query: 78   EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            E   VK   G+G      G   + +F  P+  A+D +G +YVAD  NH I+K +
Sbjct: 1006 EGKKVKSFGGEG------GRSANGQFQYPRDIAIDSQGLVYVADSYNHRIQKFS 1053


>gi|290996889|ref|XP_002681014.1| predicted protein [Naegleria gruberi]
 gi|284094637|gb|EFC48270.1| predicted protein [Naegleria gruberi]
          Length = 1023

 Score = 37.4 bits (85), Expect = 2.8,   Method: Composition-based stats.
 Identities = 16/30 (53%), Positives = 22/30 (73%)

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           KPK   +D  GNIY+AD +N+ +RKI+ LD
Sbjct: 356 KPKGAVIDSLGNIYIADTNNNRVRKISYLD 385



 Score = 37.4 bits (85), Expect = 3.1,   Method: Composition-based stats.
 Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 3/54 (5%)

Query: 85  LAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           +AG G  G++ D     A+   P S  +D   N+Y+AD  NH IRKI  L +GN
Sbjct: 427 VAGVGSSGFNGDILATDAKLSNPVSVTIDSNDNVYIADTYNHRIRKI--LQNGN 478



 Score = 35.8 bits (81), Expect = 8.9,   Method: Composition-based stats.
 Identities = 22/60 (36%), Positives = 34/60 (56%), Gaps = 4/60 (6%)

Query: 72  LSFPLSEESVVKRLAGDGV-QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +S  +++ES V  +AG GV  GY   +   A    P S  +   G+IY+AD SNH +R++
Sbjct: 11  VSTTVAQESFVSTIAGGGVCDGYLATQ---ASLAYPGSPTIGPDGSIYIADSSNHRVRQV 67


>gi|237718531|ref|ZP_04549012.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229452238|gb|EEO58029.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
          Length = 457

 Score = 37.4 bits (85), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 31/60 (51%), Gaps = 5/60 (8%)

Query: 76  LSEESVVKRLAGDGVQ---GYSDGEPGS-ARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           L+ E  V   AG G     GY DGE  S A F+ P S A DMK    Y+ D  NH +RKI
Sbjct: 394 LTPEGRVTTYAGRGNSREWGYVDGELRSQALFNHPTSIAYDMKRKCFYIGDCDNHRVRKI 453


>gi|449524308|ref|XP_004169165.1| PREDICTED: NHL repeat-containing protein 2-like, partial [Cucumis
           sativus]
          Length = 892

 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 26/107 (24%), Positives = 42/107 (39%), Gaps = 11/107 (10%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKITNLDHGNQLIRQ 142
           ++   G  G  DG    A F++P+  A + K N+ YVAD  NH +R++  +    + +  
Sbjct: 639 QIGSTGEDGLRDGNFDDATFNRPQGLAYNAKKNLLYVADTENHALREVDFVKERVRTLAG 698

Query: 143 INLKPEDCSKSSQSGSGMF---WVTVFSPY-------LISCKSFWVH 179
              K  D     +  S +    W   F P        +      WVH
Sbjct: 699 DGSKGSDYQGGKEGTSQLLNSPWDVCFEPINEKVYIAMAGQHQIWVH 745


>gi|397781344|ref|YP_006545817.1| Internalin-A [Methanoculleus bourgensis MS2]
 gi|396939846|emb|CCJ37101.1| Internalin-A [Methanoculleus bourgensis MS2]
          Length = 2759

 Score = 37.4 bits (85), Expect = 2.9,   Method: Composition-based stats.
 Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 2/43 (4%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           G+ +F+ P+  AV+  GN+YVAD  NH I+K  +   GN L +
Sbjct: 91  GNGQFNGPQGIAVNAAGNVYVADTYNHRIQKFDS--SGNLLTK 131


>gi|325926967|ref|ZP_08188244.1| NHL repeat protein [Xanthomonas perforans 91-118]
 gi|325542663|gb|EGD14128.1| NHL repeat protein [Xanthomonas perforans 91-118]
          Length = 443

 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251


>gi|156406833|ref|XP_001641249.1| predicted protein [Nematostella vectensis]
 gi|156228387|gb|EDO49186.1| predicted protein [Nematostella vectensis]
          Length = 708

 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 27/61 (44%), Positives = 34/61 (55%), Gaps = 5/61 (8%)

Query: 77  SEESVVKRLAG--DGVQ-GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           S E VV  + G  DG   G+ DG    A+F  P+  A+ +   IYVAD  NH IR+I NL
Sbjct: 229 STEGVVHEVIGGEDGFSAGFQDGTFKEAKFHAPQGVAM-LGEVIYVADTENHAIREI-NL 286

Query: 134 D 134
           D
Sbjct: 287 D 287


>gi|302821385|ref|XP_002992355.1| hypothetical protein SELMODRAFT_6804 [Selaginella moellendorffii]
 gi|300139771|gb|EFJ06505.1| hypothetical protein SELMODRAFT_6804 [Selaginella moellendorffii]
          Length = 78

 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 1/43 (2%)

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  GY DG    +RF++P+S A+   G ++V D +N  IRKI+
Sbjct: 15  GESGYVDGPAAKSRFNRPQSLAICDNGAVFV-DTTNLAIRKIS 56


>gi|84623828|ref|YP_451200.1| hypothetical protein XOO_2171 [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|84367768|dbj|BAE68926.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 499

 Score = 37.4 bits (85), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 225 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 278


>gi|422325157|ref|ZP_16406193.1| hypothetical protein HMPREF0737_01303 [Rothia mucilaginosa M508]
 gi|353343435|gb|EHB87752.1| hypothetical protein HMPREF0737_01303 [Rothia mucilaginosa M508]
          Length = 656

 Score = 37.4 bits (85), Expect = 3.0,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 30/62 (48%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           V   AG GV+G +DG    A F +P        G+++VA      +R IT  D+G Q+  
Sbjct: 363 VSIFAGSGVEGLADGPAADAWFAQPSGIIEARDGSLWVACSETSGLRHITFTDNGVQVTS 422

Query: 142 QI 143
            +
Sbjct: 423 AV 424


>gi|293372451|ref|ZP_06618835.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
 gi|292632634|gb|EFF51228.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
          Length = 446

 Score = 37.4 bits (85), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 12/59 (20%)

Query: 85  LAGD-GVQGYSDGEPGSARFDKP--------KSFAVDMKGNI---YVADKSNHVIRKIT 131
            AG+ G  GY D    SARFD P        + +  + + +I   Y+ D +NH IRKIT
Sbjct: 323 FAGEFGSYGYKDASQNSARFDNPMQGCFVLNEEYVAEQRADIYDFYLTDAANHCIRKIT 381


>gi|195119117|ref|XP_002004078.1| GI18254 [Drosophila mojavensis]
 gi|193914653|gb|EDW13520.1| GI18254 [Drosophila mojavensis]
          Length = 734

 Score = 37.4 bits (85), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 1/56 (1%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHG 136
           VVK   G    G+ DG    +RF+ P+  A   K  + VAD  NH +R+I+ L++G
Sbjct: 255 VVKYKIGGLEAGFVDGSLNMSRFNNPQGVAFLDKNTLIVADTDNHALRQIS-LNNG 309


>gi|290975356|ref|XP_002670409.1| predicted protein [Naegleria gruberi]
 gi|284083967|gb|EFC37665.1| predicted protein [Naegleria gruberi]
          Length = 2200

 Score = 37.4 bits (85), Expect = 3.0,   Method: Composition-based stats.
 Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 6/56 (10%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +E S     +GDG      G    A+ + P   ++   G IY+AD  NH IRKI +
Sbjct: 158 NEHSDCDGFSGDG------GLASRAKLNSPNGLSISQNGEIYIADTLNHRIRKINS 207


>gi|21242830|ref|NP_642412.1| hypothetical protein XAC2091 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21108317|gb|AAM36948.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 499

 Score = 37.4 bits (85), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 225 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 278


>gi|321469552|gb|EFX80532.1| hypothetical protein DAPPUDRAFT_304106 [Daphnia pulex]
          Length = 689

 Score = 37.4 bits (85), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 6/76 (7%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE-PGSARFDKPKSFAVDMKGN- 116
           L V DS      T+  PL++  +     G+G  G +DG      +F++P    +D +G  
Sbjct: 464 LYVADSYNHKIKTVD-PLTK--LCSTYLGNGKAGLADGSSEDEIQFNEPGGLCIDEEGGG 520

Query: 117 -IYVADKSNHVIRKIT 131
            +YVAD +NH IRK+ 
Sbjct: 521 ILYVADTNNHCIRKVN 536


>gi|393912207|gb|EFO20548.2| RING finger protein nhl-1 [Loa loa]
          Length = 1144

 Score = 37.4 bits (85), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 24/48 (50%), Gaps = 8/48 (16%)

Query: 81   VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
             VK+  G G        P S RF+ P   AVD  G IYV DK NH ++
Sbjct: 962  CVKKFGGHG--------PSSGRFNNPWGVAVDDMGMIYVVDKDNHRVQ 1001


>gi|289669010|ref|ZP_06490085.1| hypothetical protein XcampmN_11072, partial [Xanthomonas campestris
           pv. musacearum NCPPB 4381]
          Length = 284

 Score = 37.4 bits (85), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALE-RDQLYVADTGNHALRRI-NLRSGQ 251


>gi|410896258|ref|XP_003961616.1| PREDICTED: NHL repeat-containing protein 2-like [Takifugu rubripes]
          Length = 716

 Score = 37.4 bits (85), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 3/48 (6%)

Query: 84  RLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           ++ G    G  DG+   A F+ P+  A+  KG+ +YVAD  NH+IRKI
Sbjct: 254 QVIGGPESGRKDGDLSEASFNSPQGVAI--KGDTVYVADTENHLIRKI 299


>gi|21231560|ref|NP_637477.1| hypothetical protein XCC2116 [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66768320|ref|YP_243082.1| hypothetical protein XC_1999 [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|21113245|gb|AAM41401.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
 gi|66573652|gb|AAY49062.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. 8004]
          Length = 499

 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++ +  +YVAD  NH +R+I NL  G 
Sbjct: 225 VLRQFGHGNADLIDGGVGEAAFRRPQGLALERE-ELYVADTGNHALRRI-NLRSGQ 278


>gi|326434843|gb|EGD80413.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
          Length = 1449

 Score = 37.4 bits (85), Expect = 3.3,   Method: Composition-based stats.
 Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 80   SVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
            S +  +AG   +G +   G   +A+ D+P   AVD  GN+Y+++K NH +
Sbjct: 1158 STITTIAGTKTRGSTGVGGLATAAQLDRPTCVAVDTMGNVYISEKGNHRV 1207


>gi|194767578|ref|XP_001965892.1| GF16360 [Drosophila ananassae]
 gi|190619368|gb|EDV34892.1| GF16360 [Drosophila ananassae]
          Length = 300

 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 20/59 (33%), Positives = 30/59 (50%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLD 134
           +S   +V+ + GD   G  DG+   ARF  P+      +  + VAD  NH +RKI+  D
Sbjct: 72  VSSRGLVEHIIGDHKAGLIDGKFTEARFKHPQGLTFLDEHTLIVADTENHALRKISLAD 130


>gi|260822773|ref|XP_002606776.1| hypothetical protein BRAFLDRAFT_226091 [Branchiostoma floridae]
 gi|229292120|gb|EEN62786.1| hypothetical protein BRAFLDRAFT_226091 [Branchiostoma floridae]
          Length = 238

 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 38/70 (54%), Gaps = 11/70 (15%)

Query: 59  LIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIY 118
           ++V D  + + +      ++ ++VK++ G G         G  +FD+P    VD + NI+
Sbjct: 76  VVVADPGKHSIFLFE---ADGTLVKQVGGQG--------QGEEQFDEPCFVTVDKEDNIF 124

Query: 119 VADKSNHVIR 128
           VAD++NH I+
Sbjct: 125 VADQNNHRIQ 134


>gi|385204242|ref|ZP_10031112.1| NHL repeat protein [Burkholderia sp. Ch1-1]
 gi|385184133|gb|EIF33407.1| NHL repeat protein [Burkholderia sp. Ch1-1]
          Length = 321

 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSAR--FDKPKSFAVDMKGNIYVADKSNHVIR 128
           + + + DG +  S GEPGS++  F+ P + + D  G +YVAD+ NH I+
Sbjct: 146 IHKYSPDGKRILSWGEPGSSKGQFNLPHNISCDEDGWVYVADRENHRIQ 194


>gi|91777978|ref|YP_553186.1| hypothetical protein Bxe_B2150 [Burkholderia xenovorans LB400]
 gi|91690638|gb|ABE33836.1| Conserved hypothetical protein [Burkholderia xenovorans LB400]
          Length = 321

 Score = 37.0 bits (84), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 82  VKRLAGDGVQGYSDGEPGSAR--FDKPKSFAVDMKGNIYVADKSNHVIR 128
           + + + DG +  S GEPGS++  F+ P + + D  G +YVAD+ NH I+
Sbjct: 146 IHKYSPDGKRILSWGEPGSSKGQFNLPHNISCDEDGWVYVADRENHRIQ 194


>gi|389844275|ref|YP_006346355.1| NHL repeat protein [Mesotoga prima MesG1.Ag.4.2]
 gi|387859021|gb|AFK07112.1| NHL repeat protein [Mesotoga prima MesG1.Ag.4.2]
          Length = 496

 Score = 37.0 bits (84), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 34/66 (51%), Gaps = 3/66 (4%)

Query: 63  DSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADK 122
           ++ R     LS P   +++ +   G G+ G  DG   +AR +KP+   V   G +YVAD 
Sbjct: 191 NNDRILLTELSTPFVAKTIDQ--IGSGLSGLEDGPFENARLNKPQGI-VYSNGRLYVADT 247

Query: 123 SNHVIR 128
            NH +R
Sbjct: 248 ENHALR 253



 Score = 35.8 bits (81), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 61/130 (46%), Gaps = 20/130 (15%)

Query: 31  GYTVTTVIDGHQLEIN---PHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLA 86
           G  V  + DGH  + N   P  + +DR   + L V DS  SA   + F   E   V+ +A
Sbjct: 324 GSGVENIGDGHLKDANFAQPSGLFLDR---NSLYVADSEVSA---IRFVDLEAGTVQTVA 377

Query: 87  GDGVQ--GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLDHGNQLIRQI 143
           G G+   GY DG    + F  P    +  +G  +YVAD  NH IRKI   D G + +  +
Sbjct: 378 GSGLFSFGYVDGILKRSLFQHP--IGIHGEGRFLYVADTYNHAIRKI---DLGIRRVETV 432

Query: 144 --NLKPEDCS 151
             NL+   C+
Sbjct: 433 IKNLREGTCT 442


>gi|402881549|ref|XP_003904331.1| PREDICTED: NHL repeat-containing protein 2 [Papio anubis]
          Length = 688

 Score = 37.0 bits (84), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           G  DG    + F+ P+  A+ M   IYVAD  NH+IRK+T  D G+++ R
Sbjct: 267 GRKDGIFSESTFNSPQGVAI-MNNIIYVADTENHLIRKLTKGD-GSEVQR 314


>gi|158521169|ref|YP_001529039.1| YD repeat-containing protein [Desulfococcus oleovorans Hxd3]
 gi|158509995|gb|ABW66962.1| YD repeat protein [Desulfococcus oleovorans Hxd3]
          Length = 2961

 Score = 37.0 bits (84), Expect = 3.5,   Method: Composition-based stats.
 Identities = 18/54 (33%), Positives = 35/54 (64%), Gaps = 2/54 (3%)

Query: 79   ESVVKRLAGDGVQGYS-DGEPGSAR-FDKPKSFAVDMKGNIYVADKSNHVIRKI 130
            + ++  +AG+G++G   DG    A   + P+S  VD+KGN+Y +++ ++ +RKI
Sbjct: 1769 DGIISTVAGNGLEGNGPDGISALATPLNDPESIVVDVKGNLYFSERDHYKVRKI 1822


>gi|408794504|ref|ZP_11206109.1| NHL repeat protein [Leptospira meyeri serovar Hardjo str. Went 5]
 gi|408461739|gb|EKJ85469.1| NHL repeat protein [Leptospira meyeri serovar Hardjo str. Went 5]
          Length = 677

 Score = 37.0 bits (84), Expect = 3.5,   Method: Composition-based stats.
 Identities = 23/82 (28%), Positives = 36/82 (43%), Gaps = 6/82 (7%)

Query: 51  IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFA 110
           +DRP     + L       Y   F   +E ++  L+G  +  +     G  +F  P S  
Sbjct: 182 LDRP-----VSLAYHNQKIYVADFA-RDEVLLFDLSGSFISSFGGSGKGQGQFRGPSSIY 235

Query: 111 VDMKGNIYVADKSNHVIRKITN 132
           +D  GNI+VAD  N  I+K  +
Sbjct: 236 IDSNGNIFVADSGNARIQKFNS 257


>gi|290975266|ref|XP_002670364.1| predicted protein [Naegleria gruberi]
 gi|284083922|gb|EFC37620.1| predicted protein [Naegleria gruberi]
          Length = 1584

 Score = 37.0 bits (84), Expect = 3.6,   Method: Composition-based stats.
 Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 6/56 (10%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           +E S     +GDG      G    A+ + P   ++   G IY+AD  NH IRKI +
Sbjct: 158 NEHSDCDGFSGDG------GLASRAKLNSPNGLSISQNGEIYIADTLNHRIRKINS 207


>gi|72392883|ref|XP_847242.1| flagellum-adhesion glycoprotein [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62358460|gb|AAX78922.1| flagellum-adhesion glycoprotein, putative [Trypanosoma brucei]
 gi|70803272|gb|AAZ13176.1| flagellum-adhesion glycoprotein, putative [Trypanosoma brucei
           brucei strain 927/4 GUTat10.1]
          Length = 590

 Score = 37.0 bits (84), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 1/59 (1%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPED 149
           +G  DG  G A F+ P S  V +  +IYVAD+ N+ IR+I    +  +   Q   KP+D
Sbjct: 108 RGNKDGPKGDALFNNPTS-VVSVNDDIYVADRDNNCIRRIDAEGNVTRYGPQDLNKPKD 165


>gi|383852866|ref|XP_003701946.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
           71-like [Megachile rotundata]
          Length = 722

 Score = 37.0 bits (84), Expect = 3.6,   Method: Composition-based stats.
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           G  +FD+P   AVD +  I VADK NH I+ +T
Sbjct: 500 GPGQFDRPAGVAVDARRRIVVADKDNHRIQVLT 532


>gi|327277470|ref|XP_003223487.1| PREDICTED: NHL repeat-containing protein 2-like [Anolis
           carolinensis]
          Length = 724

 Score = 37.0 bits (84), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 17/82 (20%)

Query: 50  VIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSF 109
           VI   G   ++V+  +    YT+  P S              G  DG    A F+ P+  
Sbjct: 235 VIADTGHHRILVVKKTGQILYTVGGPDS--------------GRKDGTYSEATFNSPQGV 280

Query: 110 AVDMKGNI-YVADKSNHVIRKI 130
           A+  K NI YVAD  NH+IRK+
Sbjct: 281 AI--KNNIIYVADTENHLIRKV 300


>gi|261330448|emb|CBH13432.1| flagellum-adhesion glycoprotein, putative [Trypanosoma brucei
           gambiense DAL972]
          Length = 588

 Score = 37.0 bits (84), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 1/59 (1%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPED 149
           +G  DG  G A F+ P S  V +  +IYVAD+ N+ IR+I    +  +   Q   KP+D
Sbjct: 108 RGNKDGPKGDALFNNPTS-VVSVNDDIYVADRDNNCIRRIDAEGNVTRYGPQDLNKPKD 165


>gi|148654423|ref|YP_001274628.1| NHL repeat-containing protein [Roseiflexus sp. RS-1]
 gi|148566533|gb|ABQ88678.1| NHL repeat containing protein [Roseiflexus sp. RS-1]
          Length = 1146

 Score = 37.0 bits (84), Expect = 3.7,   Method: Composition-based stats.
 Identities = 32/122 (26%), Positives = 51/122 (41%), Gaps = 23/122 (18%)

Query: 11  AITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHS--VIDRPGSSDLIVLDSSRSA 68
           A+     +  LV++ +  E G     ++    + I+P    ++    +  LIV D S + 
Sbjct: 818 AVAATGSIIRLVATDIFGEFGNAPGQLVQPRGIAIDPQGNIIVSDSANHRLIVFDPSGTP 877

Query: 69  FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
             T+                G  G  DG+     F +P+  AVD +GNIYVAD  N  I 
Sbjct: 878 IRTI----------------GSFGSGDGQ-----FYEPRGVAVDAQGNIYVADTWNARIV 916

Query: 129 KI 130
           K+
Sbjct: 917 KL 918


>gi|374312921|ref|YP_005059351.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
 gi|358754931|gb|AEU38321.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
          Length = 654

 Score = 37.0 bits (84), Expect = 3.7,   Method: Composition-based stats.
 Identities = 26/80 (32%), Positives = 45/80 (56%), Gaps = 7/80 (8%)

Query: 54  PGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDG-VQGYSDGEPGSARFDKPKSFAVD 112
           P ++  I +D+S +A+ T     S    V +L+G G V    +G  G A  D+P+S A+D
Sbjct: 374 PTAAGNIAIDTSGNAWVT-----SGNFSVTKLSGTGTVLSVVNGSSGGA-LDEPQSLAID 427

Query: 113 MKGNIYVADKSNHVIRKITN 132
             GN++V D  N+ + +I++
Sbjct: 428 GSGNVWVNDFENNSVTEISS 447


>gi|72392873|ref|XP_847237.1| flagellum-adhesion glycoprotein [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62358465|gb|AAX78927.1| flagellum-adhesion glycoprotein, putative [Trypanosoma brucei]
 gi|70803267|gb|AAZ13171.1| flagellum-adhesion glycoprotein, putative [Trypanosoma brucei
           brucei strain 927/4 GUTat10.1]
          Length = 590

 Score = 37.0 bits (84), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 1/59 (1%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPED 149
           +G  DG  G A F+ P S  V +  +IYVAD+ N+ IR+I    +  +   Q   KP+D
Sbjct: 108 RGNKDGPKGDALFNNPTS-VVSVNDDIYVADRDNNCIRRIDAEGNVTRYGPQDLNKPKD 165


>gi|290993799|ref|XP_002679520.1| predicted protein [Naegleria gruberi]
 gi|284093137|gb|EFC46776.1| predicted protein [Naegleria gruberi]
          Length = 1414

 Score = 37.0 bits (84), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 17/57 (29%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGS-ARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S+  ++  +AG G  GY D    + ++ + PK   V   G +++AD +N+ +RKI+
Sbjct: 850 VSKSGIISTIAGLGAAGYIDNVLATESQLNAPKGVVVAPSGEVFIADSNNNKVRKIS 906


>gi|290983130|ref|XP_002674282.1| predicted protein [Naegleria gruberi]
 gi|284087871|gb|EFC41538.1| predicted protein [Naegleria gruberi]
          Length = 1572

 Score = 37.0 bits (84), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 6/74 (8%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGD-GVQGYSDGEPGSARFDKPKSFAVDMKGN-I 117
           ++L  SRS+   L         +  +AG  GV GY DG+     F +  S AVD   N I
Sbjct: 590 LILSHSRSSIRMLK---KSSGFITNIAGTFGVSGYKDGKTSLFSF-RLNSLAVDRNDNSI 645

Query: 118 YVADKSNHVIRKIT 131
           Y++D  N  IRK+T
Sbjct: 646 YISDNGNLAIRKLT 659


>gi|390562751|ref|ZP_10244924.1| putative Peptidylamidoglycolate lyase [Nitrolancetus hollandicus
           Lb]
 gi|390172671|emb|CCF84237.1| putative Peptidylamidoglycolate lyase [Nitrolancetus hollandicus
           Lb]
          Length = 527

 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 45/105 (42%), Gaps = 26/105 (24%)

Query: 27  LLEDGYTVTTVIDG--HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLS---FPLSEESV 81
           L  DG +VT    G  H  ++     +D  G  ++ V DS  +    LS   F L+E S 
Sbjct: 437 LTSDGTSVTNEWTGFDHPTDV----AVDAEG--NIFVADSGNNRVVKLSSAGFQLAEWSN 490

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHV 126
           V                G+ R D+P+  AVD KG +YV D   H+
Sbjct: 491 VS---------------GNGRLDRPRGIAVDAKGMVYVTDLGGHI 520



 Score = 36.6 bits (83), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           F  P+  AVD  GNIYVAD  NH + K++
Sbjct: 280 FVSPQGLAVDSIGNIYVADSGNHTVAKLS 308


>gi|326435919|gb|EGD81489.1| hypothetical protein PTSG_02206 [Salpingoeca sp. ATCC 50818]
          Length = 6977

 Score = 37.0 bits (84), Expect = 3.9,   Method: Composition-based stats.
 Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 5/87 (5%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           +P  V+  P + D++V D   +    ++     E  V    G G +G+ DG     R   
Sbjct: 912 SPQDVLSLP-NGDIVVCDRDNARVRVIT----PEGKVSTALGVGKRGHQDGVGAGVRLAG 966

Query: 106 PKSFAVDMKGNIYVADKSNHVIRKITN 132
           P+       G+  V D  NH IR++++
Sbjct: 967 PRGMTQLASGSFVVTDAENHCIREVSS 993


>gi|295837701|ref|ZP_06824634.1| NHL repeat protein [Streptomyces sp. SPB74]
 gi|197695928|gb|EDY42861.1| NHL repeat protein [Streptomyces sp. SPB74]
          Length = 610

 Score = 37.0 bits (84), Expect = 3.9,   Method: Composition-based stats.
 Identities = 26/84 (30%), Positives = 45/84 (53%), Gaps = 4/84 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P   +  PG +  +V D++R A   L    ++   V+R  G G +G  DG+  +ARF++P
Sbjct: 183 PGKALRLPGGT-FLVSDTTRHALVELE---ADGETVRRRIGGGERGLLDGDAATARFNEP 238

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
           +  A+     + VAD  NH +R++
Sbjct: 239 QGLALLPDRAVVVADTVNHALRRV 262


>gi|373253064|ref|ZP_09541182.1| thiol-disulfide isomerase-like thioredoxin [Nesterenkonia sp. F]
          Length = 659

 Score = 37.0 bits (84), Expect = 3.9,   Method: Composition-based stats.
 Identities = 23/78 (29%), Positives = 38/78 (48%), Gaps = 3/78 (3%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESV-VKRLAGDGVQ--GYSDGEPGSARFDKPKSFAV 111
            + D+ + DS  S+   L      ++  V+ + G+G+   G+ DG+P  AR   P   A 
Sbjct: 391 AAGDVWIADSETSSLRVLRTAEDPKAPRVETIVGEGMFDFGFRDGDPAQARLQHPLGVAA 450

Query: 112 DMKGNIYVADKSNHVIRK 129
              G++ VAD  N  IR+
Sbjct: 451 LPDGSVLVADTYNGAIRR 468


>gi|340380228|ref|XP_003388625.1| PREDICTED: hypothetical protein LOC100634990 [Amphimedon
           queenslandica]
          Length = 616

 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 35/59 (59%), Gaps = 3/59 (5%)

Query: 76  LSEESVVKRLAGDGVQGYSDGEPGSARFDK---PKSFAVDMKGNIYVADKSNHVIRKIT 131
           +S++  +++++ DG    S G+   A  D    P+  A+D +G +YVAD+ NH I+K +
Sbjct: 286 VSDDHRIQKISIDGYLKASVGQVYIADRDNHRYPRDIAIDSQGLVYVADRGNHCIQKFS 344


>gi|405967973|gb|EKC33082.1| hypothetical protein CGI_10023982 [Crassostrea gigas]
          Length = 617

 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 34/62 (54%), Gaps = 5/62 (8%)

Query: 90  VQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPED 149
           +QG+      SA F+ P    VD +GN+YV  K++  + ++T  +     +R+I L P +
Sbjct: 530 LQGHVTFAFRSAEFENPAGLCVDYQGNVYVCGKTSRNVFQVTKENE----VREI-LSPSE 584

Query: 150 CS 151
           CS
Sbjct: 585 CS 586


>gi|290995663|ref|XP_002680402.1| predicted protein [Naegleria gruberi]
 gi|284094023|gb|EFC47658.1| predicted protein [Naegleria gruberi]
          Length = 974

 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 8/76 (10%)

Query: 58  DLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DGEPGS-ARFDKPKSFAVDMKG 115
           +LI+ D+ R     L+       ++  LAG  V G+S DG P + A F       +D   
Sbjct: 646 ELIISDAKRIRKIDLN------GIIVTLAGSNVAGFSGDGGPAADATFGTLGGIYLDSND 699

Query: 116 NIYVADKSNHVIRKIT 131
           +IYV+D  NH IRKI+
Sbjct: 700 DIYVSDPDNHRIRKIS 715



 Score = 35.8 bits (81), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 57/115 (49%), Gaps = 18/115 (15%)

Query: 30  DGYTVTTVIDGHQLEINP------HSVIDRPGSSDLIVLDS------SRSAFYTLSFPLS 77
           +G   T V  G + + +P      +++I  PG+   I+ DS      S  +F+ +   L+
Sbjct: 210 EGRIATVVGSGQKGQESPDGVLAVNAIIGAPGN---IIFDSIGNMYLSDRSFHKVRKVLT 266

Query: 78  EESVVKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
             ++V  +AG+G+  Y+ DG    SA   +P   A+   G +Y+A+   H IRK+
Sbjct: 267 NGTIVT-IAGNGMSAYNGDGILAVSASLFRPSGLALSSTGELYIAESYGHRIRKV 320


>gi|340053072|emb|CCC47357.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 434

 Score = 37.0 bits (84), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 54/122 (44%), Gaps = 11/122 (9%)

Query: 10  LAITLLSLLFSLVSSG-LLLED----GYTVTTVIDGHQLEINPHSVID-RPGSSDLIVLD 63
           LA++ +  +  LVS+G   L+D      T  +V  G  + + P  V    PG+   IV  
Sbjct: 51  LAVSRVQAIIELVSNGDAWLQDCGSTNGTFLSVNAGLGIRLKPQRVYQLTPGNH--IVFG 108

Query: 64  SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVA 120
             R+ F   SFPL  E V K     G       E G    A  D P    VD++G+++ +
Sbjct: 109 DVRTLFVDESFPLCREYVRKLEVTGGTASLQSTECGLSSGANDDAPTDTHVDLQGSLFSS 168

Query: 121 DK 122
           DK
Sbjct: 169 DK 170


>gi|290976432|ref|XP_002670944.1| predicted protein [Naegleria gruberi]
 gi|284084508|gb|EFC38200.1| predicted protein [Naegleria gruberi]
          Length = 1514

 Score = 37.0 bits (84), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 47/87 (54%), Gaps = 6/87 (6%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS-DG-EPGSARF 103
           NP  ++    + +L + D++ +    +   L+  +++  +AG G  GY+ DG +  SA+ 
Sbjct: 156 NPSGIVVDSKTGELFIADTANNRIRKI---LTNGTIIT-IAGTGETGYNGDGIDATSAQI 211

Query: 104 DKPKSFAVDMKGNIYVADKSNHVIRKI 130
                 A+D+   +Y+AD +N+ IRKI
Sbjct: 212 RGVYGIALDVNSELYIADANNYRIRKI 238


>gi|87311714|ref|ZP_01093830.1| NHL repeat protein [Blastopirellula marina DSM 3645]
 gi|87285608|gb|EAQ77526.1| NHL repeat protein [Blastopirellula marina DSM 3645]
          Length = 652

 Score = 37.0 bits (84), Expect = 4.1,   Method: Composition-based stats.
 Identities = 17/44 (38%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G+G  G +DG+  +A F   +  A+D    +YVAD  NH++RK+
Sbjct: 199 GNGAIGTTDGDYATASFHHLQGMALD-GDTLYVADTENHMLRKV 241


>gi|156401400|ref|XP_001639279.1| predicted protein [Nematostella vectensis]
 gi|156226406|gb|EDO47216.1| predicted protein [Nematostella vectensis]
          Length = 820

 Score = 37.0 bits (84), Expect = 4.2,   Method: Composition-based stats.
 Identities = 20/54 (37%), Positives = 29/54 (53%), Gaps = 5/54 (9%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           ++ VV R A  G +G  +GE     F  P+  A D  GN+ V D++NH + K T
Sbjct: 729 QDDVVTRRAVIGREGNKEGE-----FSYPRGIAFDSVGNLIVCDRNNHRLLKFT 777


>gi|290995474|ref|XP_002680320.1| predicted protein [Naegleria gruberi]
 gi|284093940|gb|EFC47576.1| predicted protein [Naegleria gruberi]
          Length = 992

 Score = 37.0 bits (84), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 21/90 (23%), Positives = 42/90 (46%), Gaps = 5/90 (5%)

Query: 45  INPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG--YSDGEPGSAR 102
           ++P+ V+  P + D+ + D   +    +     +  +V  +AG G  G    +G    A+
Sbjct: 444 VDPYGVVVDPSNGDVFISDGYLNCVRKID---GKSGIVTTVAGTGEAGDVGDNGPSNKAQ 500

Query: 103 FDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
              P   ++   G++ +AD  N  IRK++N
Sbjct: 501 LFSPSGLSLTSSGDLLIADNGNQAIRKVSN 530


>gi|431798072|ref|YP_007224976.1| NHL repeat protein,IPT/TIG domain-containing protein [Echinicola
           vietnamensis DSM 17526]
 gi|430788837|gb|AGA78966.1| NHL repeat protein,IPT/TIG domain-containing protein [Echinicola
           vietnamensis DSM 17526]
          Length = 492

 Score = 37.0 bits (84), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 35/87 (40%), Gaps = 13/87 (14%)

Query: 53  RPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQ-------GYSDGE-PGSARFD 104
           +P   D    D    A   LS     E  V   AG G         GY DG+    ARFD
Sbjct: 392 KPDEYDFYFTDQYNHAIRKLS----PEGSVTTFAGRGSSSINPDPYGYVDGDLREEARFD 447

Query: 105 KPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +P   A    G  Y+ D+ NH IRKI 
Sbjct: 448 RPSGIAYS-DGAFYIGDQMNHRIRKIA 473


>gi|195456364|ref|XP_002075106.1| GK23381 [Drosophila willistoni]
 gi|194171191|gb|EDW86092.1| GK23381 [Drosophila willistoni]
          Length = 721

 Score = 37.0 bits (84), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 17/53 (32%), Positives = 30/53 (56%)

Query: 79  ESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           + VV+   G    G+ DG+  ++RF+ P+  A   +  + VAD  NH +R+I+
Sbjct: 259 DGVVEHKIGGFESGFIDGDLTTSRFNNPQGIAFLDEDTLIVADTKNHALRQIS 311


>gi|430743901|ref|YP_007203030.1| thiol-disulfide isomerase-like thioredoxin [Singulisphaera
           acidiphila DSM 18658]
 gi|430015621|gb|AGA27335.1| thiol-disulfide isomerase-like thioredoxin [Singulisphaera
           acidiphila DSM 18658]
          Length = 696

 Score = 37.0 bits (84), Expect = 4.4,   Method: Composition-based stats.
 Identities = 24/84 (28%), Positives = 37/84 (44%), Gaps = 5/84 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+    S  L + D++ +          +  VV    G G  G  DG+   A F++P
Sbjct: 229 PGKVLADEKSKQLFIADTAHNRIVLTDLDGRKSVVV----GSGGIGMVDGDYAKAEFNRP 284

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
           +   + +   +YVAD  NH IR I
Sbjct: 285 QGLCL-VDDTLYVADTENHAIRAI 307


>gi|380693132|ref|ZP_09857991.1| hypothetical protein BfaeM_04015 [Bacteroides faecis MAJ27]
          Length = 472

 Score = 36.6 bits (83), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 32/67 (47%), Gaps = 9/67 (13%)

Query: 74  FPLSEESVVKRLAGDG-------VQGYSDGE-PGSARFDKPKSFAVDMKGN-IYVADKSN 124
           F ++ +  V   AG G       V G  DGE    ARFD P   A D +    Y+ADK N
Sbjct: 404 FKITPDGNVSLFAGRGSVSSDNKVHGNIDGELKTEARFDNPCGIAYDEETQTFYIADKEN 463

Query: 125 HVIRKIT 131
           H IR I+
Sbjct: 464 HSIRTIS 470


>gi|290973162|ref|XP_002669318.1| predicted protein [Naegleria gruberi]
 gi|284082864|gb|EFC36574.1| predicted protein [Naegleria gruberi]
          Length = 832

 Score = 36.6 bits (83), Expect = 4.6,   Method: Composition-based stats.
 Identities = 33/112 (29%), Positives = 57/112 (50%), Gaps = 14/112 (12%)

Query: 23  SSGLLLEDGYTVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVV 82
           S G   ++G     +IDG      P+S+   P S DL  +D +    Y +   +S + ++
Sbjct: 268 SKGYSGDEGNAADAMIDG------PYSLAFHPVSGDLTFVDINN---YRIR-KISNKGII 317

Query: 83  KRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
             +AG+G +G S G+ GSA   +     S +    G +Y+A++ N+ IRKI+
Sbjct: 318 STIAGNGEKG-SIGDGGSALNAQIYYSVSISFSPNGELYIANEWNNRIRKIS 368


>gi|326431538|gb|EGD77108.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
          Length = 1455

 Score = 36.6 bits (83), Expect = 4.6,   Method: Composition-based stats.
 Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 80   SVVKRLAGDGVQGYSD--GEPGSARFDKPKSFAVDMKGNIYVADKSNHVI 127
            S +  +AG   +G +   G   +A+ D+P   AVD  GN+Y+++K NH +
Sbjct: 1164 STITTIAGTKTRGSAGVGGLATAAQLDRPTCVAVDTIGNVYISEKGNHRV 1213


>gi|290976072|ref|XP_002670765.1| predicted protein [Naegleria gruberi]
 gi|284084327|gb|EFC38021.1| predicted protein [Naegleria gruberi]
          Length = 854

 Score = 36.6 bits (83), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 30/52 (57%), Gaps = 4/52 (7%)

Query: 82  VKRLAGDGVQGYSDGEPGSA---RFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +  +AG G  GY +G+ G A       P+  AV   G +Y++D +N++IRK+
Sbjct: 73  ISTIAGTGSAGY-NGDGGLAIETMLSSPQGVAVSESGEVYLSDSTNNIIRKV 123


>gi|156740315|ref|YP_001430444.1| NHL repeat-containing protein [Roseiflexus castenholzii DSM 13941]
 gi|156231643|gb|ABU56426.1| NHL repeat containing protein [Roseiflexus castenholzii DSM 13941]
          Length = 1163

 Score = 36.6 bits (83), Expect = 4.7,   Method: Composition-based stats.
 Identities = 33/141 (23%), Positives = 58/141 (41%), Gaps = 26/141 (18%)

Query: 11  AITLLSLLFSLVSSGLLLEDGYTVTTVIDGHQLEINPHS--VIDRPGSSDLIVLDSSRSA 68
           A+     +  LV++ +  E G     +I    + ++P    ++   G+  LIV D S + 
Sbjct: 835 AVAATGSVIKLVATDIFGEFGSEPGQLIQPRGVAVDPQGNVIVSDSGNHRLIVFDPSGTP 894

Query: 69  FYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
             T+                   G+ +G+    +F +P+  AVD  GN+YVAD  N    
Sbjct: 895 IRTIG------------------GFGNGD---GQFYEPRGVAVDAAGNMYVADTWN---A 930

Query: 129 KITNLDHGNQLIRQINLKPED 149
           +I  LD   + +    +  ED
Sbjct: 931 RIVKLDPQGRFLSSWGVGRED 951


>gi|153806006|ref|ZP_01958674.1| hypothetical protein BACCAC_00251 [Bacteroides caccae ATCC 43185]
 gi|149130683|gb|EDM21889.1| IPT/TIG domain protein [Bacteroides caccae ATCC 43185]
          Length = 530

 Score = 36.6 bits (83), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 38/86 (44%), Gaps = 14/86 (16%)

Query: 64  SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG--------------EPGSARFDKPKSF 109
           S+ SA  ++ +   +  +V   AG G  G +D               E   A F  P S 
Sbjct: 445 STVSADQSIRYIDQKTGMVTTFAGSGTSGNTDSSFEVLSYAGVNSALESLPAAFGAPSSV 504

Query: 110 AVDMKGNIYVADKSNHVIRKITNLDH 135
            VD  G +YV D+ N+ ++KIT ++ 
Sbjct: 505 CVDKDGTVYVLDRRNNCVKKITTVEK 530


>gi|163788357|ref|ZP_02182803.1| hypothetical protein FBALC1_08248 [Flavobacteriales bacterium
           ALC-1]
 gi|159876677|gb|EDP70735.1| hypothetical protein FBALC1_08248 [Flavobacteriales bacterium
           ALC-1]
          Length = 500

 Score = 36.6 bits (83), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 13/66 (19%)

Query: 70  YTLSFPLSEESVVKRLAGD--------GVQG----YSDGEPGSARFDKPKSFAVDMKGNI 117
           Y+L FPL++ ++ K+L G          +Q     YS    G+ R D+P++FA D+ GN 
Sbjct: 342 YSLHFPLTQNALQKKLNGGMDIFISKLNIQNSSIVYSTYLGGNKR-DRPRNFATDLNGNA 400

Query: 118 YVADKS 123
           Y+  K+
Sbjct: 401 YIIGKT 406


>gi|374855491|dbj|BAL58347.1| NHL repeat containing protein [uncultured candidate division OP1
           bacterium]
          Length = 325

 Score = 36.6 bits (83), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 14/32 (43%), Positives = 23/32 (71%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G+ +F+  ++ AVD +GN+YVAD  NH ++K 
Sbjct: 39  GNGQFNALEAVAVDAQGNVYVADTENHRVQKF 70


>gi|290990712|ref|XP_002677980.1| predicted protein [Naegleria gruberi]
 gi|284091590|gb|EFC45236.1| predicted protein [Naegleria gruberi]
          Length = 1555

 Score = 36.6 bits (83), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 81  VVKRLAGDGVQGYS-DGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +++ +AG GV+G++ DG   S   D P  F+++    IY AD  N+ IR +
Sbjct: 605 IIQTIAGSGVKGFNGDGPALSTHLDTPMGFSLNSDWIIYFADSGNNRIRYV 655


>gi|149177401|ref|ZP_01856005.1| hypothetical protein PM8797T_18906 [Planctomyces maris DSM 8797]
 gi|148843734|gb|EDL58093.1| hypothetical protein PM8797T_18906 [Planctomyces maris DSM 8797]
          Length = 683

 Score = 36.6 bits (83), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 5/84 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  ++  P    L + DS+ +     S     +  +  + G G  G  DG+  +A FD P
Sbjct: 223 PGKLLADPAQQRLFISDSNHNRIVIASL----DGKLIDVIGSGQIGNKDGDYKTASFDHP 278

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
           +  A+ +   +YVAD  NH IR +
Sbjct: 279 QGMAL-VGNTLYVADTENHSIRTV 301


>gi|148657095|ref|YP_001277300.1| PA14 domain-containing protein [Roseiflexus sp. RS-1]
 gi|148569205|gb|ABQ91350.1| PA14 domain protein [Roseiflexus sp. RS-1]
          Length = 1293

 Score = 36.6 bits (83), Expect = 4.9,   Method: Composition-based stats.
 Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 77  SEESVVKRLAGDGVQGYSDGEPGSA--RFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           S+   ++R +  G    + G PG+   +F  P+S AV   G +YVAD  NH I++ +
Sbjct: 76  SDNHRIQRFSAAGELLGAWGSPGTGDGQFSSPRSVAVAPDGTVYVADTGNHRIQRFS 132


>gi|37519956|ref|NP_923333.1| hypothetical protein gll0387 [Gloeobacter violaceus PCC 7421]
 gi|35210948|dbj|BAC88328.1| gll0387 [Gloeobacter violaceus PCC 7421]
          Length = 544

 Score = 36.6 bits (83), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G G+ G+ DG    A F  P+  A+   G  ++V D  NH++RKI
Sbjct: 251 GSGIPGWRDGNFEEAEFWAPQGLALSADGRTLFVCDTENHLLRKI 295


>gi|290996598|ref|XP_002680869.1| predicted protein [Naegleria gruberi]
 gi|284094491|gb|EFC48125.1| predicted protein [Naegleria gruberi]
          Length = 1407

 Score = 36.6 bits (83), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 56/131 (42%), Gaps = 18/131 (13%)

Query: 14  LLSLLFSLVSSGLLLED---GYTVTTV---------IDGHQLEINPHSVIDRPGSSDLIV 61
           L  L+ +   SGL++      Y +TT+         ID     I+ +S      S DL +
Sbjct: 11  LCLLILTYHGSGLVIGAISVKYNITTIAGYNSLSSGIDAKMANIDVNSCFKDLSSGDLFI 70

Query: 62  LDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYS--DGEPGSARFDKPKSFAVDMKGNIYV 119
            D+S      +    +    V  +AG GV GY+  + +  +A+ + P    V     IY+
Sbjct: 71  SDNSNHRIVRV---FAANGTVLTIAGTGVSGYNGDNIQATTAQLNTPTGVFV-FNSEIYI 126

Query: 120 ADKSNHVIRKI 130
           AD  N  +RKI
Sbjct: 127 ADSQNSRVRKI 137


>gi|290988662|ref|XP_002677016.1| predicted protein [Naegleria gruberi]
 gi|284090621|gb|EFC44272.1| predicted protein [Naegleria gruberi]
          Length = 450

 Score = 36.6 bits (83), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 15/37 (40%), Positives = 24/37 (64%)

Query: 94  SDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +DG P S     P   A+D +GNI+++D  N++IRK+
Sbjct: 188 TDGPPRSIPLYYPTGIAMDEEGNIFISDTRNNLIRKV 224


>gi|206603181|gb|EDZ39661.1| NHL repeat domain protein [Leptospirillum sp. Group II '5-way CG']
          Length = 494

 Score = 36.6 bits (83), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 17/55 (30%), Positives = 30/55 (54%), Gaps = 2/55 (3%)

Query: 78  EESVVKRLAGDGVQGYSDGEPGSARFDKPKS--FAVDMKGNIYVADKSNHVIRKI 130
           E++ ++ + G G+ G +DG    A+F  P+   F  D      VAD  NH++R++
Sbjct: 210 EKAALREVIGQGIPGSADGSFDQAQFRDPQGVRFCPDDPDTAIVADTGNHLLRRV 264


>gi|241584846|ref|XP_002403871.1| NHL repeat-containing protein [Ixodes scapularis]
 gi|215500274|gb|EEC09768.1| NHL repeat-containing protein [Ixodes scapularis]
          Length = 621

 Score = 36.6 bits (83), Expect = 5.4,   Method: Composition-based stats.
 Identities = 33/122 (27%), Positives = 56/122 (45%), Gaps = 5/122 (4%)

Query: 39  DGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGE- 97
           DG  L++     +    SSD  +L  + S  + +     ++ +   LAG GV G + G  
Sbjct: 469 DGSALDVRLQHPLGVCWSSDKQLLYVADSYNHKIRKVDVQKRLCTTLAGTGVAGDATGSF 528

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIRQINLKPEDCSKSSQSG 157
               +FD+P    V +   +YVAD +NH + K+ +LD G   + ++ L   D + S    
Sbjct: 529 SDEVQFDEPGGLCV-VGSRLYVADTNNHCV-KLVHLDLG--FVEKVTLTLPDATDSRDED 584

Query: 158 SG 159
           S 
Sbjct: 585 SA 586


>gi|147918860|ref|YP_687414.1| hypothetical protein RRC373 [Methanocella arvoryzae MRE50]
 gi|110622810|emb|CAJ38088.1| hypothetical protein RRC373 [Methanocella arvoryzae MRE50]
          Length = 539

 Score = 36.6 bits (83), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 39/87 (44%), Gaps = 8/87 (9%)

Query: 52  DRPGSSDL---IVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKS 108
           D PG  D    I +D + + + T +F    +   K        G    EPG  +FD+   
Sbjct: 82  DEPGQFDQTSGIAVDDTGNIYVTDTFNHRIQVWNKATNTWTTMGSHGDEPG--QFDQTSG 139

Query: 109 FAVDMKGNIYVADKSNHVIR---KITN 132
            AVD  GNIYV D  NH I+   K TN
Sbjct: 140 IAVDDTGNIYVTDTINHRIQVWNKATN 166


>gi|170573447|ref|XP_001892474.1| Hypothetical RING finger protein F54G8.4 in chromosome III,
           putative [Brugia malayi]
 gi|158601961|gb|EDP38697.1| Hypothetical RING finger protein F54G8.4 in chromosome III,
           putative [Brugia malayi]
          Length = 1024

 Score = 36.6 bits (83), Expect = 5.5,   Method: Composition-based stats.
 Identities = 17/42 (40%), Positives = 23/42 (54%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           G  V+ +    P + RF+ P   AVD  G IYV DK NH ++
Sbjct: 840 GHCVRKFGGHGPSNGRFNNPWGVAVDDMGMIYVVDKDNHRVQ 881


>gi|290976289|ref|XP_002670873.1| predicted protein [Naegleria gruberi]
 gi|284084436|gb|EFC38129.1| predicted protein [Naegleria gruberi]
          Length = 1395

 Score = 36.6 bits (83), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 49/100 (49%), Gaps = 9/100 (9%)

Query: 33  TVTTVIDGHQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQG 92
           T+TT+   + L   P S+   P + DL + +S  +    LS     +  +  +AG G +G
Sbjct: 558 TITTLSTSNLLNY-PTSLFYTP-NGDLYIANSGGNQILKLS-----KGTISVIAGTGTRG 610

Query: 93  -YSDGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
              DG+   SA+   P +  V   G I++AD  N+ IRKI
Sbjct: 611 NQGDGKQATSAQLSYPLAVTVTSNGVIFIADSGNNAIRKI 650


>gi|198421727|ref|XP_002130106.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 454

 Score = 36.6 bits (83), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 20/86 (23%), Positives = 38/86 (44%), Gaps = 1/86 (1%)

Query: 46  NPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDK 105
           NP  +     +  L + D+   A   ++   +   V     G  ++G+ DG    A +  
Sbjct: 176 NPQQLEYDDTNDRLFIADTDNHAIRIINVDSATPKVTTLTGGPRMEGFKDGSFAEAMWRH 235

Query: 106 PKSFAVDMKGNI-YVADKSNHVIRKI 130
           P   A D++ ++ YV+D  NH +R +
Sbjct: 236 PTGMAYDVRKDVLYVSDHYNHAVRML 261


>gi|326923951|ref|XP_003208196.1| PREDICTED: NHL repeat-containing protein 2-like [Meleagris
           gallopavo]
          Length = 727

 Score = 36.6 bits (83), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 24/40 (60%), Gaps = 3/40 (7%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G  DG    A F+ P+  A+  K N IYVAD  NH+IRKI
Sbjct: 262 GRKDGRFSEAAFNSPQGVAI--KNNVIYVADTENHLIRKI 299


>gi|255513298|gb|EET89564.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Candidatus Micrarchaeum acidiphilum ARMAN-2]
          Length = 579

 Score = 36.6 bits (83), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 33/50 (66%), Gaps = 4/50 (8%)

Query: 86  AGDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKITNLD 134
           AG+G +G SDG   ++ F +P+   ++  GN IY+AD  N+ +R+I NLD
Sbjct: 286 AGNGAKGLSDGSFETSEFYRPQ--GLEWVGNRIYIADTENNAVREI-NLD 332


>gi|57529823|ref|NP_001006504.1| NHL repeat-containing protein 2 [Gallus gallus]
 gi|82081010|sp|Q5ZI67.1|NHLC2_CHICK RecName: Full=NHL repeat-containing protein 2
 gi|53136494|emb|CAG32576.1| hypothetical protein RCJMB04_29n5 [Gallus gallus]
          Length = 727

 Score = 36.2 bits (82), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 24/40 (60%), Gaps = 3/40 (7%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G  DG    A F+ P+  A+  K N IYVAD  NH+IRKI
Sbjct: 262 GRKDGRFSEAAFNSPQGVAI--KNNVIYVADTENHLIRKI 299


>gi|428162946|gb|EKX32046.1| hypothetical protein GUITHDRAFT_61646, partial [Guillardia theta
           CCMP2712]
          Length = 254

 Score = 36.2 bits (82), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVD-MKGNIYVADKSNHVIRKI 130
           LAG G  GY+D     A+F+KP    +D  +   YV+D +N+ IR+I
Sbjct: 154 LAGSGTAGYNDAVGVLAQFNKPVDVTLDWTESYAYVSDYNNNCIRRI 200


>gi|124003658|ref|ZP_01688507.1| NHL repeat [Microscilla marina ATCC 23134]
 gi|123991227|gb|EAY30679.1| NHL repeat [Microscilla marina ATCC 23134]
          Length = 508

 Score = 36.2 bits (82), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 3/45 (6%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G G QG  DG+   A F +P   A  +KG+ +YVAD  N+VIR++
Sbjct: 245 GSGTQGLKDGDFDKATFYEPHGLA--LKGDFLYVADTKNNVIRRV 287


>gi|229029660|ref|ZP_04185736.1| Cell surface protein [Bacillus cereus AH1271]
 gi|228731664|gb|EEL82570.1| Cell surface protein [Bacillus cereus AH1271]
          Length = 617

 Score = 36.2 bits (82), Expect = 6.0,   Method: Composition-based stats.
 Identities = 19/43 (44%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           G A F +P   AVD K NIYV D+ N+ ++K  N   GN L +
Sbjct: 476 GEAEFFQPMQLAVDSKDNIYVVDRINNRVQKFDN--EGNFLTK 516



 Score = 36.2 bits (82), Expect = 6.9,   Method: Composition-based stats.
 Identities = 16/34 (47%), Positives = 22/34 (64%), Gaps = 2/34 (5%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           PG  +F  P   A+D KGNI++AD  NH I+K +
Sbjct: 430 PGFFQF--PSGIAIDSKGNIFIADSENHRIQKFS 461


>gi|423219561|ref|ZP_17206057.1| hypothetical protein HMPREF1061_02830 [Bacteroides caccae
           CL03T12C61]
 gi|392624766|gb|EIY18844.1| hypothetical protein HMPREF1061_02830 [Bacteroides caccae
           CL03T12C61]
          Length = 543

 Score = 36.2 bits (82), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 38/86 (44%), Gaps = 14/86 (16%)

Query: 64  SSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDG--------------EPGSARFDKPKSF 109
           S+ SA  ++ +   +  +V   AG G  G +D               E   A F  P S 
Sbjct: 458 STVSADQSIRYIDQKTGMVTTFAGSGTSGNTDSSFEVLSYAGVNSALESLLAAFGAPSSV 517

Query: 110 AVDMKGNIYVADKSNHVIRKITNLDH 135
            VD  G +YV D+ N+ ++KIT ++ 
Sbjct: 518 CVDKDGTVYVLDRRNNCVKKITTVEK 543


>gi|384419507|ref|YP_005628867.1| hypothetical protein XOC_2585 [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353462420|gb|AEQ96699.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 472

 Score = 36.2 bits (82), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 29/56 (51%), Gaps = 2/56 (3%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGN 137
           V R  G G     DG  G A F +P+  A++    +YVAD  NH +R+I NL  G 
Sbjct: 198 VLRQFGHGNADLIDGGVGEAAFRRPQGLALESD-QLYVADTGNHALRRI-NLRSGQ 251


>gi|290977893|ref|XP_002671671.1| predicted protein [Naegleria gruberi]
 gi|284085242|gb|EFC38927.1| predicted protein [Naegleria gruberi]
          Length = 3330

 Score = 36.2 bits (82), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 37/61 (60%), Gaps = 5/61 (8%)

Query: 74   FPLSEES-VVKRLAGD-GVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
            +  SEES  V+R+AG     GY+ DG+P   +   +P+S  V   G +Y+ D +NH+IR+
Sbjct: 2025 YSASEESGTVQRIAGSLSSSGYNGDGKPAIDSLLSQPQSVFVK-DGEVYICDTNNHLIRR 2083

Query: 130  I 130
            I
Sbjct: 2084 I 2084


>gi|374994067|ref|YP_004969566.1| gluconolactonase [Desulfosporosinus orientis DSM 765]
 gi|357212433|gb|AET67051.1| gluconolactonase [Desulfosporosinus orientis DSM 765]
          Length = 1526

 Score = 36.2 bits (82), Expect = 6.2,   Method: Composition-based stats.
 Identities = 32/102 (31%), Positives = 46/102 (45%), Gaps = 23/102 (22%)

Query: 30  DGYTVTTVIDGHQLEINPHSV-IDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD 88
           DG  + T+  G     NP+ V +D  G   + V DSS +A             +KR+  D
Sbjct: 198 DGTNIVTLGSGFS---NPNGVAVDSSGK--IYVADSSNNA-------------IKRMDAD 239

Query: 89  GVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           G    + G      F  P   AVD  G IYVAD +N+ I+++
Sbjct: 240 GTNIVTLG----TGFSTPNGVAVDSSGKIYVADTNNNAIKRM 277



 Score = 36.2 bits (82), Expect = 6.7,   Method: Composition-based stats.
 Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 4/51 (7%)

Query: 80  SVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           + +KR+  DG    + G    + F  P   AVD  G IYVAD SN+ I+++
Sbjct: 190 NAIKRMDADGTNIVTLG----SGFSNPNGVAVDSSGKIYVADSSNNAIKRM 236


>gi|86738767|ref|YP_479167.1| serine/threonine protein kinase [Frankia sp. CcI3]
 gi|86565629|gb|ABD09438.1| serine/threonine protein kinase [Frankia sp. CcI3]
          Length = 863

 Score = 36.2 bits (82), Expect = 6.2,   Method: Composition-based stats.
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
           +A+ D P S A D  GNIY+ D  N+ IRKI+
Sbjct: 596 AAQLDGPGSTARDKAGNIYIGDAKNNRIRKIS 627


>gi|340725457|ref|XP_003401086.1| PREDICTED: NHL repeat-containing protein 2-like [Bombus terrestris]
          Length = 692

 Score = 36.2 bits (82), Expect = 6.3,   Method: Composition-based stats.
 Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 3/55 (5%)

Query: 93  YSDGEP-GSARFDKPKSFAVDM-KGNIYVADKSNHVIRKITNLDHGNQLIRQINL 145
           Y  G P  S  FD+P   A++  KG IY+AD +NH + K+ N + GN     IN+
Sbjct: 509 YGVGIPHASFLFDEPSGIAINAEKGLIYIADTNNHAV-KMINTERGNITTLPINI 562


>gi|228985056|ref|ZP_04145224.1| Cell surface protein [Bacillus thuringiensis serovar tochigiensis
           BGSC 4Y1]
 gi|228774744|gb|EEM23142.1| Cell surface protein [Bacillus thuringiensis serovar tochigiensis
           BGSC 4Y1]
          Length = 598

 Score = 36.2 bits (82), Expect = 6.4,   Method: Composition-based stats.
 Identities = 16/33 (48%), Positives = 22/33 (66%), Gaps = 2/33 (6%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           PG  +F  P   A+D KGNI++AD  NH I+K+
Sbjct: 411 PGFFQF--PSGIAIDSKGNIFIADSENHRIQKL 441


>gi|229155542|ref|ZP_04283650.1| Cell surface protein [Bacillus cereus ATCC 4342]
 gi|228627860|gb|EEK84579.1| Cell surface protein [Bacillus cereus ATCC 4342]
          Length = 598

 Score = 36.2 bits (82), Expect = 6.4,   Method: Composition-based stats.
 Identities = 16/33 (48%), Positives = 22/33 (66%), Gaps = 2/33 (6%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           PG  +F  P   A+D KGNI++AD  NH I+K+
Sbjct: 411 PGFFQF--PSGIAIDSKGNIFIADSENHRIQKL 441


>gi|47568330|ref|ZP_00239032.1| cell surface protein [Bacillus cereus G9241]
 gi|47555023|gb|EAL13372.1| cell surface protein [Bacillus cereus G9241]
          Length = 617

 Score = 36.2 bits (82), Expect = 6.4,   Method: Composition-based stats.
 Identities = 16/33 (48%), Positives = 22/33 (66%), Gaps = 2/33 (6%)

Query: 98  PGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           PG  +F  P   A+D KGNI++AD  NH I+K+
Sbjct: 430 PGFFQF--PSGIAIDSKGNIFIADSENHRIQKL 460


>gi|402587633|gb|EJW81568.1| RING finger protein nhl-1 [Wuchereria bancrofti]
          Length = 794

 Score = 36.2 bits (82), Expect = 6.5,   Method: Composition-based stats.
 Identities = 17/42 (40%), Positives = 23/42 (54%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           G  V+ +    P + RF+ P   AVD  G IYV DK NH ++
Sbjct: 610 GHCVRKFGGHGPSNGRFNNPWGVAVDDMGMIYVVDKDNHRVQ 651


>gi|435849842|ref|YP_007301783.1| PDK repeat-containing protein [Methanomethylovorans hollandica DSM
           15978]
 gi|433663330|gb|AGB50755.1| PDK repeat-containing protein [Methanomethylovorans hollandica DSM
           15978]
          Length = 1881

 Score = 36.2 bits (82), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 14/33 (42%), Positives = 22/33 (66%)

Query: 100 SARFDKPKSFAVDMKGNIYVADKSNHVIRKITN 132
           + +F+ P +  VD  GN+YVAD +N+ I+K  N
Sbjct: 47  TGQFNNPSAVTVDSYGNVYVADANNYRIQKFDN 79


>gi|290972811|ref|XP_002669144.1| predicted protein [Naegleria gruberi]
 gi|284082687|gb|EFC36400.1| predicted protein [Naegleria gruberi]
          Length = 1362

 Score = 36.2 bits (82), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 2/47 (4%)

Query: 86  AGDGVQGYSDGEP--GSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           AG G+ GYS       S  F+ P   A + +  +++AD +NH IRK+
Sbjct: 429 AGTGLVGYSGNYTLRNSTLFNSPNKLAFNSRNELFIADSNNHRIRKV 475


>gi|288801999|ref|ZP_06407440.1| conserved hypothetical protein [Prevotella melaninogenica D18]
 gi|288335434|gb|EFC73868.1| conserved hypothetical protein [Prevotella melaninogenica D18]
          Length = 486

 Score = 36.2 bits (82), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 15/40 (37%), Positives = 22/40 (55%)

Query: 91  QGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +G+ DG   +A+F  P+       G +Y+AD  NH IR I
Sbjct: 384 KGWEDGALKNAKFCYPRQMTFTKDGKLYIADSGNHCIRMI 423


>gi|260798148|ref|XP_002594062.1| hypothetical protein BRAFLDRAFT_68498 [Branchiostoma floridae]
 gi|229279295|gb|EEN50073.1| hypothetical protein BRAFLDRAFT_68498 [Branchiostoma floridae]
          Length = 515

 Score = 36.2 bits (82), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 36/77 (46%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           G + LI++ +  + F  +  P+     V   +      + +G   + +FDKP   AV  +
Sbjct: 246 GKAPLIIVQTEPAVFLPIDTPVPVLGHVMLPSLPNTISFGEGGSRNGQFDKPSGVAVSDE 305

Query: 115 GNIYVADKSNHVIRKIT 131
           G I+V D  NH I+  T
Sbjct: 306 GEIFVTDLRNHRIQVFT 322


>gi|323320709|gb|ADX36386.1| putative RTX-toxin [Vibrio vulnificus]
          Length = 1838

 Score = 36.2 bits (82), Expect = 7.1,   Method: Composition-based stats.
 Identities = 28/103 (27%), Positives = 48/103 (46%), Gaps = 13/103 (12%)

Query: 70  YTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR- 128
           Y+   P +E+S+V   +G   + Y D    +    K   + +D+KGN +  D + H++R 
Sbjct: 161 YSFEAPHAEKSIVSFWSGKNFKQYRDILDNAQTDGKKVVYDIDVKGNAFAIDLNKHLMRW 220

Query: 129 --KITNLDHG--NQLIRQINLKPEDCSKSSQSGSGMFWVTVFS 167
                N D+   NQL   I     D +  S +G   FW +V++
Sbjct: 221 GGLFPNPDNAEQNQLKSSI-----DAATFSNTG---FWSSVYA 255


>gi|300743712|ref|ZP_07072732.1| putative NHL repeat protein [Rothia dentocariosa M567]
 gi|300380073|gb|EFJ76636.1| putative NHL repeat protein [Rothia dentocariosa M567]
          Length = 667

 Score = 36.2 bits (82), Expect = 7.2,   Method: Composition-based stats.
 Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 4/69 (5%)

Query: 75  PLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI--TN 132
           P+SE   +   AG GV+G  DG    A F +P        GN++VA      +R +  T 
Sbjct: 364 PVSE--TISIFAGAGVEGLQDGTAEDAWFAQPSGIIEARDGNLWVACSETSGLRHVTFTR 421

Query: 133 LDHGNQLIR 141
            +HG+Q ++
Sbjct: 422 DEHGHQSVQ 430


>gi|290974456|ref|XP_002669961.1| predicted protein [Naegleria gruberi]
 gi|284083515|gb|EFC37217.1| predicted protein [Naegleria gruberi]
          Length = 1106

 Score = 36.2 bits (82), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 4/88 (4%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAG-DGVQGYS-DGEPGSARFD 104
           P S+   PG   L+  D S    +  +    E +V++R+AG + V GY+ DGE  +    
Sbjct: 893 PQSLAVSPGGDALVFSDLSTIGIFHYNLN-DENAVIRRIAGFENVIGYNGDGEATTTLIS 951

Query: 105 KPKSFAVDMKG-NIYVADKSNHVIRKIT 131
              S A    G ++Y  D SN++IRK++
Sbjct: 952 HVPSLAFSNDGLSVYFVDSSNNLIRKLS 979


>gi|390960114|ref|YP_006423871.1| gluconolactonase [Terriglobus roseus DSM 18391]
 gi|390415032|gb|AFL90536.1| gluconolactonase [Terriglobus roseus DSM 18391]
          Length = 1836

 Score = 36.2 bits (82), Expect = 7.5,   Method: Composition-based stats.
 Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 1/76 (1%)

Query: 55  GSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMK 114
           G+ +L V DS  +    ++   +       +    +  Y+ G+  ++    P   A+D  
Sbjct: 190 GAGELFVSDSGNNRVRKVTPAPATAPAKSDITKGTIVTYAGGDVATS-LSSPAGLAIDGA 248

Query: 115 GNIYVADKSNHVIRKI 130
           GN+YVAD  N+VIR+I
Sbjct: 249 GNLYVADSGNNVIRRI 264


>gi|269796827|ref|YP_003316282.1| thiol-disulfide isomerase-like thioredoxin [Sanguibacter keddieii
           DSM 10542]
 gi|269099012|gb|ACZ23448.1| thiol-disulfide isomerase-like thioredoxin [Sanguibacter keddieii
           DSM 10542]
          Length = 634

 Score = 35.8 bits (81), Expect = 7.7,   Method: Composition-based stats.
 Identities = 17/63 (26%), Positives = 32/63 (50%)

Query: 81  VVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLI 140
           VV+++ G   +G  DG    A F +P   +V   G++++AD     +R++   D G+  I
Sbjct: 345 VVEQVGGTQNEGLLDGPLAQAWFAQPSGLSVGPDGSVWLADAETSALRRVDVADDGSATI 404

Query: 141 RQI 143
             +
Sbjct: 405 TSL 407


>gi|428164545|gb|EKX33567.1| hypothetical protein GUITHDRAFT_120259 [Guillardia theta CCMP2712]
          Length = 396

 Score = 35.8 bits (81), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 16/38 (42%), Positives = 25/38 (65%), Gaps = 1/38 (2%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIR 128
           GY DG+  SARF++P+ F +   G  I +AD +N+ +R
Sbjct: 100 GYVDGDGLSARFNQPRDFVMSTDGTKILIADTNNYCLR 137


>gi|432922359|ref|XP_004080313.1| PREDICTED: NHL repeat-containing protein 2-like [Oryzias latipes]
          Length = 720

 Score = 35.8 bits (81), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 3/45 (6%)

Query: 87  GDGVQGYSDGEPGSARFDKPKSFAVDMKGN-IYVADKSNHVIRKI 130
           G    G  DG    A F+ P+  A   KG+ +YVAD  NH+IRKI
Sbjct: 257 GGPASGRKDGSLSEASFNSPQGVA--FKGDAVYVADTENHLIRKI 299


>gi|219852010|ref|YP_002466442.1| periplasmic copper-binding protein [Methanosphaerula palustris
           E1-9c]
 gi|219546269|gb|ACL16719.1| periplasmic copper-binding [Methanosphaerula palustris E1-9c]
          Length = 709

 Score = 35.8 bits (81), Expect = 8.1,   Method: Composition-based stats.
 Identities = 20/75 (26%), Positives = 40/75 (53%), Gaps = 6/75 (8%)

Query: 60  IVLDSSRSAFYTLSFPLSEESVVKRLAGDG--VQGYSDGEPGSARFDKPKSFAVDMKGNI 117
           + +D + + F T ++     S +++ +  G  +  +  G  G  +F  P + A+D   N+
Sbjct: 232 VTVDDAGNVFVTDTY----NSRIQKFSSVGTFITKWGTGGTGEGQFGLPSATAIDTSENV 287

Query: 118 YVADKSNHVIRKITN 132
           YVAD +N+ I+K T+
Sbjct: 288 YVADTANNRIQKFTS 302


>gi|290996648|ref|XP_002680894.1| predicted protein [Naegleria gruberi]
 gi|284094516|gb|EFC48150.1| predicted protein [Naegleria gruberi]
          Length = 966

 Score = 35.8 bits (81), Expect = 8.2,   Method: Composition-based stats.
 Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 4/50 (8%)

Query: 85  LAGDGVQGYSDGEPG---SARFDKPKSFAVDMKGNIYVADKSNHVIRKIT 131
            AG+G  G+S G+ G   +A+ + P+  AV   G +Y+AD  N  +RK++
Sbjct: 436 FAGNGTSGFS-GDGGLAINAQLNSPRCVAVSGSGEVYIADSMNSRVRKVS 484


>gi|427739164|ref|YP_007058708.1| thiol-disulfide isomerase-like thioredoxin [Rivularia sp. PCC 7116]
 gi|427374205|gb|AFY58161.1| thiol-disulfide isomerase-like thioredoxin [Rivularia sp. PCC 7116]
          Length = 487

 Score = 35.8 bits (81), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 24/50 (48%), Gaps = 1/50 (2%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNI-YVADKSNHVIRKI 130
           V  L G G  G  DG      F  P+  A D   NI YVAD  N+ IRK+
Sbjct: 217 VLHLIGSGKAGLKDGSFEETEFFAPQGMAFDEVDNILYVADTENNAIRKV 266


>gi|428172619|gb|EKX41527.1| hypothetical protein GUITHDRAFT_153825 [Guillardia theta CCMP2712]
          Length = 285

 Score = 35.8 bits (81), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 35/74 (47%), Gaps = 2/74 (2%)

Query: 57  SDLIVLDSSRSAFYTLSFPLSEESVVKRLAGD--GVQGYSDGEPGSARFDKPKSFAVDMK 114
            ++ + D+   A   ++        V  +AG   G  G+ DGE  SARF+ P   AV  +
Sbjct: 133 ENIFIADTYNHAIRHITRDEDRNWTVTLIAGSRSGQSGFEDGEGESARFNCPTGLAVVNE 192

Query: 115 GNIYVADKSNHVIR 128
             I ++D SN  IR
Sbjct: 193 QEILISDFSNSAIR 206


>gi|224003203|ref|XP_002291273.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973049|gb|EED91380.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 264

 Score = 35.8 bits (81), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 61/129 (47%), Gaps = 21/129 (16%)

Query: 41  HQLEINPHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEE----SVVKRLAGDGVQGYSDG 96
           H+     H  ++RP +SD I          T +F +SE+    S   R+A D V+   + 
Sbjct: 38  HRQPAQVHHALERPDASDAI----------TDAFQMSEKYGTTSKEARIAWDVVEEIYET 87

Query: 97  EPGSARFDKPKSFAVDMKGNIYVADKS---NHVIRK-ITNLDHGNQLIRQI-NLKPEDCS 151
            P  + FD+  S AVD   + Y  D+    NH++ +  +NL    +L+ QI  L+ ED S
Sbjct: 88  SPPLSSFDEQTSTAVDAPQDYY--DRIHFLNHLLMESQSNLGQVKELVAQIKELELEDPS 145

Query: 152 KSSQSGSGM 160
            +  +  GM
Sbjct: 146 LARLADDGM 154


>gi|386811933|ref|ZP_10099158.1| conserved hypothetical protein [planctomycete KSU-1]
 gi|386404203|dbj|GAB62039.1| conserved hypothetical protein [planctomycete KSU-1]
          Length = 669

 Score = 35.8 bits (81), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 40/84 (47%), Gaps = 5/84 (5%)

Query: 47  PHSVIDRPGSSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKP 106
           P  V+    S+ L + DS+ +     +     +  V  + G+G  G  DG    A F+ P
Sbjct: 213 PGKVLADEISNRLFIADSNHNRIVIATL----DGKVLDVVGNGKIGKDDGTFTEASFNHP 268

Query: 107 KSFAVDMKGNIYVADKSNHVIRKI 130
           +  A   +  +YVAD  NH+IRK+
Sbjct: 269 QGMAFH-RDYLYVADTENHLIRKL 291


>gi|391334887|ref|XP_003741830.1| PREDICTED: tripartite motif-containing protein 71-like [Metaseiulus
           occidentalis]
          Length = 800

 Score = 35.8 bits (81), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 23/37 (62%), Gaps = 2/37 (5%)

Query: 92  GYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIR 128
           G +  +PG  +FD+P   A+D  G I VADK NH I+
Sbjct: 576 GSAGSQPG--QFDRPAGVAMDSTGRIIVADKDNHRIQ 610


>gi|290996232|ref|XP_002680686.1| predicted protein [Naegleria gruberi]
 gi|284094308|gb|EFC47942.1| predicted protein [Naegleria gruberi]
          Length = 655

 Score = 35.8 bits (81), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 39/75 (52%), Gaps = 4/75 (5%)

Query: 56  SSDLIVLDSSRSAFYTLSFPLSEESVVKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKG 115
           S D+   DSS +    ++   +  ++V  +AG G  G+S+G   S++ +KP    V   G
Sbjct: 113 SGDIYFADSSNNLIRVMN---AATNIVSTIAGTGSFGFSEGNATSSKLNKPTE-PVYWNG 168

Query: 116 NIYVADKSNHVIRKI 130
            +  AD  N+ IRK+
Sbjct: 169 QLLFADSLNYRIRKV 183


>gi|194365180|ref|YP_002027790.1| NHL repeat containing protein [Stenotrophomonas maltophilia R551-3]
 gi|194347984|gb|ACF51107.1| NHL repeat containing protein [Stenotrophomonas maltophilia R551-3]
          Length = 470

 Score = 35.8 bits (81), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 17/52 (32%), Positives = 29/52 (55%), Gaps = 1/52 (1%)

Query: 82  VKRLAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNL 133
           + R  G G   + DG    A F +P++  ++ + ++YVAD  NH +R+I  L
Sbjct: 197 ILRQFGLGTADFMDGNLAEAAFHRPQALVLE-RDSLYVADTGNHAVRRINLL 247


>gi|290979581|ref|XP_002672512.1| predicted protein [Naegleria gruberi]
 gi|284086089|gb|EFC39768.1| predicted protein [Naegleria gruberi]
          Length = 1118

 Score = 35.8 bits (81), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 82  VKRLAGDGVQGYS-DGEPG-SARFDKPKSFAVDMKGNIYVADKSNHVIRK 129
           +K + G G  GYS D  P  S++  + KSF V   G+I + D  N++IRK
Sbjct: 584 LKSIVGTGTDGYSGDNLPAVSSKLSQIKSFYVKENGDIIILDSGNNIIRK 633


>gi|156399563|ref|XP_001638571.1| predicted protein [Nematostella vectensis]
 gi|156225692|gb|EDO46508.1| predicted protein [Nematostella vectensis]
          Length = 1122

 Score = 35.8 bits (81), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 15/46 (32%), Positives = 29/46 (63%), Gaps = 1/46 (2%)

Query: 85  LAGDGVQGYSDGEPGSARFDKPKSFAVDMKGNIYVADKSNHVIRKI 130
           +AG G QG+++G    A+F+KP  F V+ +  +++ D+ N  +R +
Sbjct: 427 IAGSGEQGHANGGGRKAKFNKPSGF-VEYESKLFICDQGNSRLRVV 471


>gi|428162955|gb|EKX32055.1| hypothetical protein GUITHDRAFT_58597, partial [Guillardia theta
           CCMP2712]
          Length = 186

 Score = 35.8 bits (81), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query: 85  LAGDGVQGYSDGEPG-SARFDKPKSFAVDMKG-NIYVADKSNHVIRKI 130
           LAGDG  G  D   G +A+F  P   AVD  G ++ VAD  +H IR I
Sbjct: 83  LAGDGTNGLVDNVNGAAAKFYNPTGVAVDFAGQSLLVADSRDHTIRII 130


>gi|73671050|ref|YP_307065.1| surface layer protein [Methanosarcina barkeri str. Fusaro]
 gi|72398212|gb|AAZ72485.1| putative surface layer protein [Methanosarcina barkeri str. Fusaro]
          Length = 752

 Score = 35.8 bits (81), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 17/43 (39%), Positives = 27/43 (62%), Gaps = 2/43 (4%)

Query: 99  GSARFDKPKSFAVDMKGNIYVADKSNHVIRKITNLDHGNQLIR 141
           G+ +F   +   +D  GNIYVAD  N+ I+K+ N  +GN L++
Sbjct: 162 GNVQFKNARDICLDSSGNIYVADTGNNRIQKLDN--NGNYLVQ 202


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.136    0.400 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,968,844,411
Number of Sequences: 23463169
Number of extensions: 115418245
Number of successful extensions: 312288
Number of sequences better than 100.0: 936
Number of HSP's better than 100.0 without gapping: 530
Number of HSP's successfully gapped in prelim test: 406
Number of HSP's that attempted gapping in prelim test: 308829
Number of HSP's gapped (non-prelim): 3547
length of query: 191
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 57
effective length of database: 9,215,130,721
effective search space: 525262451097
effective search space used: 525262451097
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)