BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011435
(486 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255567088|ref|XP_002524526.1| conserved hypothetical protein [Ricinus communis]
gi|223536200|gb|EEF37853.1| conserved hypothetical protein [Ricinus communis]
Length = 484
Score = 764 bits (1973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/491 (76%), Positives = 410/491 (83%), Gaps = 12/491 (2%)
Query: 1 MKGTHLL----LTLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVL 56
M+ +HL L LTL +AF LQFQAHAAPAGPLIKH +S LKWT RSSSK PQ+DGNVL
Sbjct: 1 MRRSHLFYFAPLLLTLFLAFNLQFQAHAAPAGPLIKHFSSFLKWTARSSSKTPQSDGNVL 60
Query: 57 QFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG 116
QFE+GYLVETV+EGNEIGVVPYKIRVSEDGEL+AVDEVNSNIVKITPPLSQYSR RLVAG
Sbjct: 61 QFEDGYLVETVVEGNEIGVVPYKIRVSEDGELYAVDEVNSNIVKITPPLSQYSRARLVAG 120
Query: 117 SFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNV 176
SFQGY GHVDGK NEARFNHPKG+TMDDKGN+YVADTLNLAIRKIGDAGVTTIAGGKSN
Sbjct: 121 SFQGYKGHVDGKSNEARFNHPKGITMDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNT 180
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDI 236
AG+RDGPSEDAKFS DFDVVYV TCSLLV+DRGN ALRQISLNQ+DC+YQ +SI+ TD+
Sbjct: 181 AGYRDGPSEDAKFSTDFDVVYVHSTCSLLVVDRGNVALRQISLNQEDCDYQSSSITVTDL 240
Query: 237 LMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKE 296
LMVVGAV GYVTCMLQQGFGP FFS+TQ SES EI E + EKP PI SMKE
Sbjct: 241 LMVVGAVFTGYVTCMLQQGFGPSFFSKTQHFSES------EILEHQSMEKPTPITGSMKE 294
Query: 297 EPGWPSFGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPP 356
EP WPSFGQL+IDLSKL LEA+ MFL +P RS+G++ LTPL+DTLRMPEDE PP
Sbjct: 295 EPRWPSFGQLMIDLSKLTLEALADMFLYLIPSWLRSNGSRKGLTPLKDTLRMPEDEVEPP 354
Query: 357 VVQRQKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFY 416
V RQ +VPLSET QV P DKY +MKPPKIKSASFKD SLS+KHRSSKRQEYAEFY
Sbjct: 355 SVHRQSISVPLSETRQVHNPNASDKYSEMKPPKIKSASFKDPSLSSKHRSSKRQEYAEFY 414
Query: 417 GSGEVPPPYSKSKSQKERTRHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMR 476
GSGE+P +SKS KE+TRHRQRDKSGE+ +G EPKPV MK VDYDN KFDHYNMR
Sbjct: 415 GSGEMPSS-GRSKSHKEKTRHRQRDKSGEVAPAATGAEPKPVNMKHVDYDNPKFDHYNMR 473
Query: 477 SKYGD-DSYRF 486
SKYG +SY+F
Sbjct: 474 SKYGSGNSYQF 484
>gi|356549639|ref|XP_003543199.1| PREDICTED: uncharacterized protein LOC100797841 [Glycine max]
Length = 487
Score = 728 bits (1880), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/485 (72%), Positives = 412/485 (84%), Gaps = 12/485 (2%)
Query: 6 LLLTLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVE 65
L + + L F+L Q+HAAPAGPLIKH++SL+KWT RS+SK P +DGNVLQFENGY+VE
Sbjct: 8 LFVLIALAPTFSLHLQSHAAPAGPLIKHISSLIKWT-RSTSKTPHSDGNVLQFENGYVVE 66
Query: 66 TVIEGNEIGVVPYKIRVSE-DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGH 124
TV+EGNEIGVVPY+IRVSE DGELFAVD +NSNIV+ITPPLSQYSRGRLVAGSFQGYTGH
Sbjct: 67 TVVEGNEIGVVPYRIRVSEEDGELFAVDAINSNIVRITPPLSQYSRGRLVAGSFQGYTGH 126
Query: 125 VDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPS 184
VDGKP++ARFNHPKG+T+DDKGN+YVADT NLAIRKIGDAGVTTIAGGKSNVAG+RDGPS
Sbjct: 127 VDGKPSDARFNHPKGITVDDKGNVYVADTQNLAIRKIGDAGVTTIAGGKSNVAGYRDGPS 186
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVL 244
EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL+Q+DC+YQ NSIS TDIL VVGAV+
Sbjct: 187 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLDQEDCDYQSNSISSTDILTVVGAVI 246
Query: 245 VGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFG 304
VGY TCMLQQGFG FFS+TQQPS+ +F K L++ EK +PI+ES KEEPGWPSFG
Sbjct: 247 VGYATCMLQQGFGSSFFSKTQQPSQKQF------KGLASNEKHMPILESSKEEPGWPSFG 300
Query: 305 QLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSA 364
QLI+DLSKL+LEA+ S F+ F+P FR S +K LTPL+D L MPED+ PP+V RQ +
Sbjct: 301 QLIVDLSKLSLEALASTFIQFIPSHFRPSNSKRGLTPLKDRLVMPEDDVPPPLVNRQNAQ 360
Query: 365 --VPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVP 422
PL+E V TP +KY +MKPPKIKS+SFKD S+S+KHRSS+R EYAEFYGS E+
Sbjct: 361 GHTPLTENRMVHTPTIAEKYSEMKPPKIKSSSFKDPSMSSKHRSSRRPEYAEFYGSSEI- 419
Query: 423 PPYSKSKSQKERTRHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYG-D 481
PPY+KSKSQKER RHRQR+KSGE+V G G E KPVE + VD++N KFDHY+MR+KY +
Sbjct: 420 PPYTKSKSQKERPRHRQREKSGEVVMGAVGAEAKPVETRAVDHNNPKFDHYSMRTKYASE 479
Query: 482 DSYRF 486
+++RF
Sbjct: 480 ETFRF 484
>gi|225461726|ref|XP_002285515.1| PREDICTED: uncharacterized protein LOC100245328 [Vitis vinifera]
gi|302142876|emb|CBI20171.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 720 bits (1858), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/491 (72%), Positives = 414/491 (84%), Gaps = 15/491 (3%)
Query: 2 KGTHLLLTLTLLIAFTLQFQAHAAPA-GPLIKHLTSLLKWTTRSSSKAPQADGNVLQFEN 60
K L+ LTL+ F+LQF+A AAP GP ++H +SLLKWT RSSSKAP +DG+VLQFE+
Sbjct: 3 KSLLFLIILTLVFTFSLQFRAQAAPPLGPFVRHFSSLLKWT-RSSSKAPHSDGHVLQFED 61
Query: 61 GYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG 120
GYLVETV+EGNE+GVVP+ IRVSEDGELFAVD V +NIV+ITPPLSQYSR RLVAGSFQG
Sbjct: 62 GYLVETVVEGNELGVVPHSIRVSEDGELFAVDAVKNNIVRITPPLSQYSRARLVAGSFQG 121
Query: 121 YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR 180
+TGHVDGKP++ARFN PKGVTMDDKGN+YVADT NLAIRKIGD+GVTTIAGGKSNVAG+R
Sbjct: 122 HTGHVDGKPSDARFNGPKGVTMDDKGNVYVADTSNLAIRKIGDSGVTTIAGGKSNVAGYR 181
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVV 240
DGPSEDAKFS+DFDVVYVRPTCSLLV+DRGNAALRQISLNQ+DC+YQ +SIS TDI MV+
Sbjct: 182 DGPSEDAKFSSDFDVVYVRPTCSLLVVDRGNAALRQISLNQEDCDYQNSSISATDIFMVI 241
Query: 241 GAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP-- 298
GAV+VGY +C+LQ+GFGP FS+TQ SESEF ++ KEKP PIVES+KEEP
Sbjct: 242 GAVMVGYASCLLQKGFGPSAFSKTQH-SESEFEDQL------IKEKPTPIVESIKEEPDA 294
Query: 299 GWPSFGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVV 358
GWPSFGQLIIDLSK LEA+ +FL F+P RF + + LTPL+D L MPEDEA PP+
Sbjct: 295 GWPSFGQLIIDLSKFTLEALTGIFLYFIPSRFMPTRARKGLTPLKDHLIMPEDEADPPLA 354
Query: 359 QRQKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGS 418
Q+Q++ PLSET Q TP T +KY +MKPPKIKS SFKD SLS+KHRSSKRQEYAEFY S
Sbjct: 355 QKQRAPPPLSETRQAHTPNTSEKYSEMKPPKIKSYSFKDPSLSSKHRSSKRQEYAEFYHS 414
Query: 419 GEVPPPYS--KSKSQKERTRHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMR 476
GEVPPPY+ +SKSQKER+RHRQRDKSGE+ +G G E KPVE+K VDYD+ KFDHYN+R
Sbjct: 415 GEVPPPYTQVRSKSQKERSRHRQRDKSGEM-FGAVGAESKPVEIKAVDYDDPKFDHYNIR 473
Query: 477 SKYG-DDSYRF 486
SKYG DDS+RF
Sbjct: 474 SKYGSDDSFRF 484
>gi|357452081|ref|XP_003596317.1| hypothetical protein MTR_2g075860 [Medicago truncatula]
gi|355485365|gb|AES66568.1| hypothetical protein MTR_2g075860 [Medicago truncatula]
Length = 493
Score = 671 bits (1730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/494 (68%), Positives = 397/494 (80%), Gaps = 21/494 (4%)
Query: 4 THLLLTLTLLI---AFTLQFQ-AHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFE 59
TH LL TL++ F+L FQ HAAPAGPLIKHL+SL+KWT +++K P +DGNVLQFE
Sbjct: 7 THSLLFYTLIVLVSIFSLHFQPTHAAPAGPLIKHLSSLIKWTRSATTKTPHSDGNVLQFE 66
Query: 60 NGYLVETVIEGNEIGVVPYKIRVSE-DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF 118
NGY+VETV+EGNEIGV+PY+IRVSE DGELFAVDE+NSNIV+ITPPLSQYSRGRLVAGSF
Sbjct: 67 NGYVVETVVEGNEIGVIPYRIRVSEEDGELFAVDEINSNIVRITPPLSQYSRGRLVAGSF 126
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
QGYT HVDGKP++ARFNHPKG+TMDDKGN+YVADT NLAIRKIGDAGVTTIAGGKSNVAG
Sbjct: 127 QGYTDHVDGKPSDARFNHPKGITMDDKGNVYVADTQNLAIRKIGDAGVTTIAGGKSNVAG 186
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILM 238
+RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALR+I L+Q+DC+YQ +SIS TDIL+
Sbjct: 187 YRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRKIILDQEDCDYQSSSISSTDILI 246
Query: 239 VVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP 298
VVGAVLVGY TCMLQQGFG FFS+T+ + E K + +K +PI ES KE+P
Sbjct: 247 VVGAVLVGYATCMLQQGFGSSFFSKTRSSGQ-------EFKGRESNDKRMPIPESSKEDP 299
Query: 299 GWPSFGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVV 358
GWPSFGQLI DLSKL+LEA+ S F F+P F+ + K LTPL+D L MPEDE PP+V
Sbjct: 300 GWPSFGQLIADLSKLSLEALASAFTQFMPSHFKFNSRKTGLTPLKDRLVMPEDEVQPPLV 359
Query: 359 QRQKSAVPLSETH---QVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEF 415
+R+ + V ++E QV T +KY + KPPK+KS+SFKD S KHRSSKR EYAEF
Sbjct: 360 KRKTTPVTVTENRQMPQVHTATITEKYSEAKPPKVKSSSFKDPS---KHRSSKRSEYAEF 416
Query: 416 YGSGEVPPPYSKSKSQKERTRHRQRDKSGELVYGVSGMEPKPVEMK-PVDYDNLKFDHYN 474
YGSGEVP Y+KSKSQKER RHR R+KSGE+V+ +G E KPVE + VD+ N KFD Y+
Sbjct: 417 YGSGEVPSSYAKSKSQKERPRHRHREKSGEVVFPTNGAEAKPVEPRAAVDHSNSKFDRYS 476
Query: 475 MRSK--YGDDSYRF 486
MR+ +S+RF
Sbjct: 477 MRTGGYVPGESFRF 490
>gi|224117004|ref|XP_002317450.1| predicted protein [Populus trichocarpa]
gi|222860515|gb|EEE98062.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/491 (68%), Positives = 388/491 (79%), Gaps = 36/491 (7%)
Query: 1 MKGTHLLLTLTLLI---AFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQ 57
M+ +H TL L + A T QFQAHAAP D NVLQ
Sbjct: 1 MRRSHFSYTLVLAVLSLACTFQFQAHAAPP------------------------DENVLQ 36
Query: 58 FENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS 117
FE+GYLVETV++GN +GVVPYKIR+SEDGEL+AVDEVNSN+VKITPPLSQYSR RLVAGS
Sbjct: 37 FEDGYLVETVVKGNAMGVVPYKIRLSEDGELYAVDEVNSNVVKITPPLSQYSRARLVAGS 96
Query: 118 FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA 177
FQGYTGH+DGKPNE RFNHP+G+TMDDKGN+YVAD+LN AIRKIGDAGVTTIAGGKSNVA
Sbjct: 97 FQGYTGHIDGKPNEVRFNHPRGLTMDDKGNIYVADSLNHAIRKIGDAGVTTIAGGKSNVA 156
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPT-DI 236
GFRDGPSEDAKFSNDFDVVYV TCSLLV+DRGNAALRQISLNQ+DC+YQ +S + T D+
Sbjct: 157 GFRDGPSEDAKFSNDFDVVYVHSTCSLLVVDRGNAALRQISLNQEDCDYQSSSFTMTEDV 216
Query: 237 LMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKE 296
LMVVGAVL+GY TCMLQ GFG SR QQ SESE+ KE S+KEKPIPIV++MKE
Sbjct: 217 LMVVGAVLIGYATCMLQLGFGSSSSSRMQQSSESEY------KEKSSKEKPIPIVDNMKE 270
Query: 297 EPGWPSFGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPP 356
EP WPSFGQL IDLSKLALEA+ + L+FVP ++ G + LTPL+D+L MP+DE PP
Sbjct: 271 EPKWPSFGQLFIDLSKLALEALVGILLSFVPSWYKPGGARKGLTPLKDSLIMPDDEVEPP 330
Query: 357 VVQRQKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFY 416
VQRQ + P+SE QV+TP T DKY +MK PKIKSASFKD S +KHRSSK+QEYAEFY
Sbjct: 331 SVQRQSTPAPVSENRQVQTPTTSDKYSEMKLPKIKSASFKDPSPLSKHRSSKQQEYAEFY 390
Query: 417 GSGEVPPPYSKSKSQKERTRHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMR 476
GSGEV P + +SKS KE++RHRQRDKSGE+V+G G EPKP EM PVDY++ KF+HYN R
Sbjct: 391 GSGEV-PSHGRSKSHKEKSRHRQRDKSGEVVFGAVGAEPKPAEMNPVDYNSPKFNHYNNR 449
Query: 477 SKYGDDS-YRF 486
SKYG DS YRF
Sbjct: 450 SKYGSDSLYRF 460
>gi|449438911|ref|XP_004137231.1| PREDICTED: uncharacterized protein LOC101210502 [Cucumis sativus]
Length = 480
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/480 (66%), Positives = 390/480 (81%), Gaps = 18/480 (3%)
Query: 6 LLLTLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSK---APQADGNVLQFENGY 62
++++ ++ TLQ Q +A PAGPLIKHL+S +KWT RSS K AP +DGNVLQFENGY
Sbjct: 7 FVISIIFVLFLTLQIQVNATPAGPLIKHLSSFVKWT-RSSYKSLPAPPSDGNVLQFENGY 65
Query: 63 LVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT 122
LV TV+EGNEIGV+P+KI VS+DGELF VD VNSNIVKITPPLS+Y+R RLVAGSFQ +T
Sbjct: 66 LVGTVVEGNEIGVLPHKIHVSKDGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHT 125
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
GH+DGKPN+ARFNHP+GVT+DDKGN+YVADTLNLAIRKIGDAGVTTIAGGKSNV G+RDG
Sbjct: 126 GHIDGKPNDARFNHPRGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVVGYRDG 185
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGA 242
P EDAKFSNDFDV+YVR TCSLLVIDRGNAA+RQISLNQ+DCEYQ +SIS +D+LM++GA
Sbjct: 186 PGEDAKFSNDFDVMYVRSTCSLLVIDRGNAAIRQISLNQEDCEYQDSSISNSDVLMIIGA 245
Query: 243 VLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPS 302
VL GY T M+Q+GFG S+T P E+E+ E+ E S+ +++S+KE+PGWPS
Sbjct: 246 VLAGYATYMIQRGFGTSNVSQTNPPLETEYREKPYKPESSS------VMDSVKEDPGWPS 299
Query: 303 FGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR-Q 361
FG+LIIDLSKLALEA+ S+FL+ VP RFR+ T+ LTPL+D+LRMPEDE P VQ Q
Sbjct: 300 FGRLIIDLSKLALEAVASIFLSVVPARFRARNTRKGLTPLKDSLRMPEDEPEQPTVQMLQ 359
Query: 362 KSAVPLSETHQVRTPGTGDKYPD-MKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGE 420
++ VPL+ET Q D +P+ MKP K+ S+SFKD SL +KHRSSKRQE+A+FY SGE
Sbjct: 360 RTPVPLTETRQAHV-NARDPFPELMKPSKLNSSSFKDPSLQSKHRSSKRQEHADFYRSGE 418
Query: 421 VPPPYSKSKSQKERTRHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYG 480
+PPPYS+SKSQKER RHRQR+KS E+ YG G E+KP DYDN K++HYN+R+KYG
Sbjct: 419 IPPPYSRSKSQKERPRHRQREKSAEISYGAVGS-----ELKPADYDNPKYEHYNIRNKYG 473
>gi|224117390|ref|XP_002331700.1| predicted protein [Populus trichocarpa]
gi|222874306|gb|EEF11437.1| predicted protein [Populus trichocarpa]
Length = 495
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/520 (65%), Positives = 389/520 (74%), Gaps = 59/520 (11%)
Query: 1 MKGTHL----LLTLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTR-SSSKAPQADGNV 55
M+ +H +L L L +A T QFQAHAAP PLIKHL+SLLKWTT +SSK P +DGNV
Sbjct: 1 MRRSHFYSTFILALLLSVACTFQFQAHAAPPAPLIKHLSSLLKWTTTVASSKTPHSDGNV 60
Query: 56 LQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA 115
LQFE+GYLVETV+EGN +GVVPYKIRVSEDGEL+AVDEVNSN+VKITPPLSQYSR RL A
Sbjct: 61 LQFEDGYLVETVVEGNAMGVVPYKIRVSEDGELYAVDEVNSNVVKITPPLSQYSRARLAA 120
Query: 116 GSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 175
GSFQGYTGH+DGKPNEARFNHP+G+TMDDKGN+YVADTLNLAIRKIGDAGVTTIAGGKSN
Sbjct: 121 GSFQGYTGHIDGKPNEARFNHPRGLTMDDKGNIYVADTLNLAIRKIGDAGVTTIAGGKSN 180
Query: 176 VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD 235
VAGFRDGPSEDAKFSNDFDVVY+ TCSLLV+DRGNAALRQISLNQ+DC+YQ T
Sbjct: 181 VAGFRDGPSEDAKFSNDFDVVYLHSTCSLLVVDRGNAALRQISLNQEDCDYQSKCTRKTT 240
Query: 236 I-------------------LMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEET 276
I +MVVGAVL+GY TCMLQQGFG FFSR +Q + ++
Sbjct: 241 IESTRLVSKTRCWGVLFDYVIMVVGAVLIGYATCMLQQGFGSSFFSRMKQ------SSDS 294
Query: 277 EIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTK 336
E K+ S+KEKPIPI+E+MKEEP WPSFGQL+IDLSKLALEA+ + L FVP R +
Sbjct: 295 EFKKKSSKEKPIPIMENMKEEPKWPSFGQLLIDLSKLALEALVGILLCFVPSWNRPGEAR 354
Query: 337 GNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFK 396
LTPL+D+L +PED+ PP VQRQ + P+SE+ QV TP T DKY + KPPKIKSASFK
Sbjct: 355 TGLTPLKDSLTLPEDKVEPPSVQRQSTPAPVSESRQVHTPTTSDKYLEGKPPKIKSASFK 414
Query: 397 DSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQRDKSGELVYGVSGM--- 453
D SL +KH SSKRQEYA FYGSGEVP GE G++ +
Sbjct: 415 DPSLLSKHWSSKRQEYAGFYGSGEVP-------------------SHGEARTGLTPLKDS 455
Query: 454 ------EPKPVEMKPVDYDNLKFDHYNMRSKYG-DDSYRF 486
+PKP EMK VDY++ KF+HYN+RSKYG D SYRF
Sbjct: 456 LTLPEDKPKPAEMKHVDYESPKFEHYNIRSKYGPDSSYRF 495
>gi|42572439|ref|NP_974315.1| NHL domain-containing protein [Arabidopsis thaliana]
gi|332642057|gb|AEE75578.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 493
Score = 614 bits (1584), Expect = e-173, Method: Compositional matrix adjust.
Identities = 326/491 (66%), Positives = 379/491 (77%), Gaps = 33/491 (6%)
Query: 10 LTLLIAFTLQ---FQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVET 66
TL I FTL FQA AAP+G LIKH++S+LKWTT SSSK Q+D NVLQFENGYLVET
Sbjct: 22 FTLWIFFTLHSFPFQAQAAPSGSLIKHMSSVLKWTTGSSSKLSQSDTNVLQFENGYLVET 81
Query: 67 VIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD 126
V+EGN+IGVVPYKIRVS+DGEL+AVDE+NSNI+KITPPLSQYSRGRLVAGSFQG TGH D
Sbjct: 82 VVEGNDIGVVPYKIRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLVAGSFQGKTGHAD 141
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSED 186
GKP+EARFNHP+GVTMDDKGN+YVADTLNLAIRKIGD+GVTTIAGGKSN+AG+RDGPSED
Sbjct: 142 GKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKSNIAGYRDGPSED 201
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYN-SISPTDILMVVGAVLV 245
AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL+++DC+YQ + SIS TDIL+V+GAVL+
Sbjct: 202 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDCDYQDDSSISLTDILLVIGAVLI 261
Query: 246 GYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQ 305
GY TCMLQQGFG FFS+TQ SE+ + EE KE +P+ KEEPGWPSFGQ
Sbjct: 262 GYATCMLQQGFGNSFFSKTQVGSETSYEEEHPGKE--KLSRPVHEKTITKEEPGWPSFGQ 319
Query: 306 LIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAV 365
L+ DL KLALE + S + VP RF+ T NL PL+D L MPEDE PP VQR +
Sbjct: 320 LLTDLCKLALEFITS---HLVPARFQ---TNPNLRPLKDRLIMPEDEQEPPRVQRHTAPA 373
Query: 366 PLSETHQVRTPGTGDKYPDMKPPKIKSAS-FKDSSLSTK--HR-SSKRQEYAEFYGSGEV 421
P+SE+ P D YP+ K PK++S+S KD +LS+ HR SSKRQ+YA+FY SGEV
Sbjct: 374 PISESRHAHLPKADDSYPEHKTPKLRSSSVMKDPTLSSSKHHRTSSKRQDYAQFYASGEV 433
Query: 422 PPPYSKSKSQKERTRHRQRDKSGELVYGVSGMEPKPV---EMKPVDYDN-LKFDHYNMR- 476
P K KER+R R RDK+ E EPKP +KPV+Y N KFDHYNMR
Sbjct: 434 AQP----KIHKERSRRRHRDKTTE-------TEPKPTPSDTVKPVEYSNSSKFDHYNMRS 482
Query: 477 SKYGDDS-YRF 486
SKYG ++ +RF
Sbjct: 483 SKYGPETPFRF 493
>gi|297834352|ref|XP_002885058.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297330898|gb|EFH61317.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 614 bits (1584), Expect = e-173, Method: Compositional matrix adjust.
Identities = 324/488 (66%), Positives = 375/488 (76%), Gaps = 30/488 (6%)
Query: 10 LTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIE 69
TL I FT FQA AAP+G LIKH++ +LKWTT SSSK Q+D NVLQFENGYLVETV+E
Sbjct: 21 FTLWIVFTFPFQAQAAPSGSLIKHMSWVLKWTTGSSSKISQSDTNVLQFENGYLVETVVE 80
Query: 70 GNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP 129
GNEIGVVPYKIRVS DGEL+AVDE+NSNI+KITPPLSQYSRGRLVAGSFQG TGH DGKP
Sbjct: 81 GNEIGVVPYKIRVSHDGELYAVDELNSNIMKITPPLSQYSRGRLVAGSFQGKTGHADGKP 140
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
+EARFNHP+GVTMDDKGN+YV DTLNLAIRKIGD+GVTTIAGGKSN+AG+RDGPSEDAKF
Sbjct: 141 SEARFNHPRGVTMDDKGNVYVGDTLNLAIRKIGDSGVTTIAGGKSNIAGYRDGPSEDAKF 200
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYN-SISPTDILMVVGAVLVGYV 248
SNDFDVVYVR TCSLLVIDRGNAALRQISL+ +DC+YQ + SIS TDIL+V+GAVL+GY
Sbjct: 201 SNDFDVVYVRSTCSLLVIDRGNAALRQISLSDEDCDYQDDSSISLTDILLVIGAVLIGYA 260
Query: 249 TCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLII 308
TC+LQQGFG FFS+TQ SE+ F EE KE P+ + KEEPGWPSFGQLII
Sbjct: 261 TCLLQQGFGNSFFSKTQLESETSFEEEHPGKE--KLSLPVHETKVTKEEPGWPSFGQLII 318
Query: 309 DLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLS 368
DL KLAL+ + S + VP RF +S NL PL+D L+MPEDE PP VQR + P+S
Sbjct: 319 DLCKLALDFITS---HLVPTRFTTS---HNLRPLKDRLKMPEDEQEPPRVQRHTAPAPIS 372
Query: 369 ETHQVRTPGTGDKYPDMKPPKIKSAS-FKDSSLS-TKHR--SSKRQEYAEFYGSGEVPPP 424
E+ P D YP+ K PK++S+S KD +LS +KH SSKRQ+YA+FY SGEV PP
Sbjct: 373 ESRHAHLPKADDSYPEHKTPKLRSSSVMKDPALSASKHHRSSSKRQDYAQFYASGEVAPP 432
Query: 425 YSKSKSQKERTRHRQRDKSGELVYGVSGMEPKPV---EMKPVDYDN-LKFDHYNMR-SKY 479
K KER+R R RDK+ E EPK +KPV+Y N KFDHYNMR SKY
Sbjct: 433 ----KIHKERSRRRHRDKTTE-------TEPKLTPSDTVKPVEYSNSSKFDHYNMRSSKY 481
Query: 480 GDDS-YRF 486
G ++ +RF
Sbjct: 482 GPETPFRF 489
>gi|22331093|ref|NP_188104.2| NHL domain-containing protein [Arabidopsis thaliana]
gi|20260314|gb|AAM13055.1| unknown protein [Arabidopsis thaliana]
gi|25083516|gb|AAN72090.1| unknown protein [Arabidopsis thaliana]
gi|332642056|gb|AEE75577.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 492
Score = 607 bits (1564), Expect = e-171, Method: Compositional matrix adjust.
Identities = 325/491 (66%), Positives = 378/491 (76%), Gaps = 34/491 (6%)
Query: 10 LTLLIAFTLQ---FQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVET 66
TL I FTL FQA AAP+G LIKH++S+LKWTT SSSK Q+D NVLQFENGYLVET
Sbjct: 22 FTLWIFFTLHSFPFQAQAAPSGSLIKHMSSVLKWTTGSSSKLSQSDTNVLQFENGYLVET 81
Query: 67 VIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD 126
V+EGN+IGVVPYKIRVS+DGEL+AVDE+NSNI+KITPPLSQYSRGRLVAGSFQG TGH D
Sbjct: 82 VVEGNDIGVVPYKIRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLVAGSFQGKTGHAD 141
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSED 186
GKP+EARFNHP+GVTMDDKGN+YVADTLNLAIRKIGD+GVTTIAGGKSN+AG+RDGPSED
Sbjct: 142 GKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKSNIAGYRDGPSED 201
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYN-SISPTDILMVVGAVLV 245
AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL+++DC+YQ + SIS TDIL+V+GAVL+
Sbjct: 202 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDCDYQDDSSISLTDILLVIGAVLI 261
Query: 246 GYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQ 305
GY TCMLQQGFG FFS+T SE+ + EE KE +P+ KEEPGWPSFGQ
Sbjct: 262 GYATCMLQQGFGNSFFSKTVG-SETSYEEEHPGKE--KLSRPVHEKTITKEEPGWPSFGQ 318
Query: 306 LIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAV 365
L+ DL KLALE + S + VP RF+ T NL PL+D L MPEDE PP VQR +
Sbjct: 319 LLTDLCKLALEFITS---HLVPARFQ---TNPNLRPLKDRLIMPEDEQEPPRVQRHTAPA 372
Query: 366 PLSETHQVRTPGTGDKYPDMKPPKIKSAS-FKDSSLSTK--HR-SSKRQEYAEFYGSGEV 421
P+SE+ P D YP+ K PK++S+S KD +LS+ HR SSKRQ+YA+FY SGEV
Sbjct: 373 PISESRHAHLPKADDSYPEHKTPKLRSSSVMKDPTLSSSKHHRTSSKRQDYAQFYASGEV 432
Query: 422 PPPYSKSKSQKERTRHRQRDKSGELVYGVSGMEPKPV---EMKPVDYDN-LKFDHYNMR- 476
P K KER+R R RDK+ E EPKP +KPV+Y N KFDHYNMR
Sbjct: 433 AQP----KIHKERSRRRHRDKTTE-------TEPKPTPSDTVKPVEYSNSSKFDHYNMRS 481
Query: 477 SKYGDDS-YRF 486
SKYG ++ +RF
Sbjct: 482 SKYGPETPFRF 492
>gi|449530424|ref|XP_004172195.1| PREDICTED: uncharacterized LOC101210502, partial [Cucumis sativus]
Length = 437
Score = 607 bits (1564), Expect = e-171, Method: Compositional matrix adjust.
Identities = 297/432 (68%), Positives = 357/432 (82%), Gaps = 14/432 (3%)
Query: 51 ADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSR 110
ADGNVLQFENGYLV TV+EGNEIGV+P+KI VS+DGELF VD VNSNIVKITPPLS+Y+R
Sbjct: 11 ADGNVLQFENGYLVGTVVEGNEIGVLPHKIHVSKDGELFVVDSVNSNIVKITPPLSKYTR 70
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA 170
RLVAGSFQ +TGH+DGKPN+ARFNHP+GVT+DDKGN+YVADTLNLAIRKIGDAGVTTIA
Sbjct: 71 ARLVAGSFQSHTGHIDGKPNDARFNHPRGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIA 130
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNS 230
GGKSNV G+RDGP EDAKFSNDFDV+YVR TCSLLVIDRGNAA+RQISLNQ+DCEYQ +S
Sbjct: 131 GGKSNVVGYRDGPGEDAKFSNDFDVMYVRSTCSLLVIDRGNAAIRQISLNQEDCEYQDSS 190
Query: 231 ISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPI 290
IS +D+LM++GAVL GY T M+Q+GFG S+T P E+E+ E+ E S+ +
Sbjct: 191 ISNSDVLMIIGAVLAGYATYMIQRGFGTSNVSQTNPPLETEYREKPYKPESSS------V 244
Query: 291 VESMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPE 350
++S+KE+PGWPSFG+LIIDLSKLALEA+ S+FL+ VP RFR+ T+ LTPL+D+LRMPE
Sbjct: 245 MDSVKEDPGWPSFGRLIIDLSKLALEAVASIFLSVVPARFRARNTRKGLTPLKDSLRMPE 304
Query: 351 DEAAPPVVQR-QKSAVPLSETHQVRTPGTGDKYPD-MKPPKIKSASFKDSSLSTKHRSSK 408
DE P VQ Q++ VPL+ET Q D +P+ MKP K+ S+SFKD SL +KHRSSK
Sbjct: 305 DEPEQPTVQMLQRTPVPLTETRQAHV-NARDPFPELMKPSKLNSSSFKDPSLQSKHRSSK 363
Query: 409 RQEYAEFYGSGEVPPPYSKSKSQKERTRHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNL 468
RQE+A+FY SGE+PPPYS+SKSQKER RHRQR+KS E+ YG G E+KP DYDN
Sbjct: 364 RQEHADFYRSGEIPPPYSRSKSQKERPRHRQREKSAEISYGAVGS-----ELKPADYDNP 418
Query: 469 KFDHYNMRSKYG 480
K++HYN+R+KYG
Sbjct: 419 KYEHYNIRNKYG 430
>gi|11994597|dbj|BAB02652.1| unnamed protein product [Arabidopsis thaliana]
Length = 511
Score = 602 bits (1553), Expect = e-169, Method: Compositional matrix adjust.
Identities = 326/509 (64%), Positives = 379/509 (74%), Gaps = 51/509 (10%)
Query: 10 LTLLIAFTLQ---FQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVET 66
TL I FTL FQA AAP+G LIKH++S+LKWTT SSSK Q+D NVLQFENGYLVET
Sbjct: 22 FTLWIFFTLHSFPFQAQAAPSGSLIKHMSSVLKWTTGSSSKLSQSDTNVLQFENGYLVET 81
Query: 67 VIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD 126
V+EGN+IGVVPYKIRVS+DGEL+AVDE+NSNI+KITPPLSQYSRGRLVAGSFQG TGH D
Sbjct: 82 VVEGNDIGVVPYKIRVSDDGELYAVDELNSNIMKITPPLSQYSRGRLVAGSFQGKTGHAD 141
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSED 186
GKP+EARFNHP+GVTMDDKGN+YVADTLNLAIRKIGD+GVTTIAGGKSN+AG+RDGPSED
Sbjct: 142 GKPSEARFNHPRGVTMDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKSNIAGYRDGPSED 201
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYN-SISPT----------- 234
AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL+++DC+YQ + SIS T
Sbjct: 202 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLSEEDCDYQDDSSISLTGVDLRVLTEDH 261
Query: 235 -------DILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKP 287
DIL+V+GAVL+GY TCMLQQGFG FFS+TQ SE+ + EE KE +P
Sbjct: 262 CVLTFGADILLVIGAVLIGYATCMLQQGFGNSFFSKTQVGSETSYEEEHPGKE--KLSRP 319
Query: 288 IPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLR 347
+ KEEPGWPSFGQL+ DL KLALE + S + VP RF+ T NL PL+D L
Sbjct: 320 VHEKTITKEEPGWPSFGQLLTDLCKLALEFITS---HLVPARFQ---TNPNLRPLKDRLI 373
Query: 348 MPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSAS-FKDSSLSTK--H 404
MPEDE PP VQR + P+SE+ P D YP+ K PK++S+S KD +LS+ H
Sbjct: 374 MPEDEQEPPRVQRHTAPAPISESRHAHLPKADDSYPEHKTPKLRSSSVMKDPTLSSSKHH 433
Query: 405 R-SSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQRDKSGELVYGVSGMEPKPV---EM 460
R SSKRQ+YA+FY SGEV P K KER+R R RDK+ E EPKP +
Sbjct: 434 RTSSKRQDYAQFYASGEVAQP----KIHKERSRRRHRDKTTE-------TEPKPTPSDTV 482
Query: 461 KPVDYDN-LKFDHYNMR-SKYGDDS-YRF 486
KPV+Y N KFDHYNMR SKYG ++ +RF
Sbjct: 483 KPVEYSNSSKFDHYNMRSSKYGPETPFRF 511
>gi|449456567|ref|XP_004146020.1| PREDICTED: uncharacterized protein LOC101206392 [Cucumis sativus]
Length = 477
Score = 553 bits (1425), Expect = e-155, Method: Compositional matrix adjust.
Identities = 286/479 (59%), Positives = 354/479 (73%), Gaps = 19/479 (3%)
Query: 12 LLIAFTL---QFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
L +AF + FQ +A +GPL +HL+SLLKWT SSK PQ DGN +QFE+GYLVET++
Sbjct: 12 LFLAFIVPIVSFQVDSAASGPLARHLSSLLKWT--GSSKTPQPDGNAIQFESGYLVETIV 69
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
EGNEIG+VPYKIRVSEDGELFAVD VNSN+VK++PPLS+YSR RLVAGSFQGY GHVDGK
Sbjct: 70 EGNEIGMVPYKIRVSEDGELFAVDSVNSNVVKVSPPLSRYSRARLVAGSFQGYKGHVDGK 129
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAK 188
P++ARFN PKG+T+DDKGN+YVADTLNLAIRKI DAGVTTIAGGK+NV G+ DGP E+AK
Sbjct: 130 PSDARFNQPKGITIDDKGNVYVADTLNLAIRKIVDAGVTTIAGGKTNVPGYSDGPGEEAK 189
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYV 248
FSNDFDV+YVR TCSLLV+DRGNAALRQISLN++DC+YQY S+S +D+ M +GA+L+GY
Sbjct: 190 FSNDFDVIYVRRTCSLLVVDRGNAALRQISLNKEDCDYQYGSVSTSDVAMFIGALLIGYF 249
Query: 249 TCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLII 308
T MLQ GF FF+ Q + ETE KELS K K +V ++KEE W SFGQ++
Sbjct: 250 TYMLQHGFRLSFFTFMVQSE----HLETETKELS-KGKQTKLVSTIKEETWWESFGQVVA 304
Query: 309 DLSKLALEAMGSMFLNFVPFRFRSSGTK-GNLTPLRDTLRMPEDEAAPPVVQRQKSAVPL 367
+L K A+E + +F+ FRS K LTPL+D L+MPEDE V +QK+ PL
Sbjct: 305 ELYKQAIELLPGNLKSFLRPYFRSEDNKEKGLTPLKDALKMPEDEIKTNVSLKQKTVTPL 364
Query: 368 SETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSK 427
SET K+ ++KPPK+KS S K+ SL KH S QEYAEFYG+G V S+
Sbjct: 365 SETKHASI-----KHDELKPPKMKS-SIKNPSLLNKHSHSG-QEYAEFYGTGMVSSSLSR 417
Query: 428 SKSQKERTRHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDSYRF 486
SK QK+R+RHRQ++K +++ G G EPK EM+ DY+ KFD YN+R+KY DS F
Sbjct: 418 SKGQKDRSRHRQKEKGLDILTGTLGAEPKLAEMR-TDYNEPKFDQYNIRNKYRYDSSSF 475
>gi|449518065|ref|XP_004166064.1| PREDICTED: uncharacterized protein LOC101228723 [Cucumis sativus]
Length = 449
Score = 543 bits (1399), Expect = e-152, Method: Compositional matrix adjust.
Identities = 279/461 (60%), Positives = 343/461 (74%), Gaps = 16/461 (3%)
Query: 27 AGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDG 86
GPL +H +SLLKWT SSK PQ DGN +QFE+GYLVET++EGNEIG+VPYKIRVSEDG
Sbjct: 2 GGPLARHFSSLLKWT--GSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPYKIRVSEDG 59
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG 146
ELFAVD VNSN+VK++PPLS+YSR RLVAGSFQGY GHVDGKP++ARFN PKG+T+DDKG
Sbjct: 60 ELFAVDSVNSNVVKVSPPLSRYSRARLVAGSFQGYKGHVDGKPSDARFNQPKGITIDDKG 119
Query: 147 NLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 206
N+YVADTLNLAIRKI DAGVTTIAGGK+NV G+ DGP E+AKFSNDFDV+YVR TCSLLV
Sbjct: 120 NVYVADTLNLAIRKIVDAGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLV 179
Query: 207 IDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQ 266
+DRGNAALRQISLN++DC+YQY S+S +D+ M +GA+L+GY T MLQ GF FF+ Q
Sbjct: 180 VDRGNAALRQISLNKEDCDYQYGSVSTSDVAMFIGALLIGYFTYMLQHGFRLSFFTFMVQ 239
Query: 267 PSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNFV 326
+ ETE KELS K K +V ++KEE W SFGQ++ +L K A+E + +F+
Sbjct: 240 SE----HLETETKELS-KGKQTKLVSTIKEETWWESFGQVVAELYKQAIELLPGNLKSFL 294
Query: 327 PFRFRSSGTK-GNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDM 385
FRS K LTPL+D L+MPEDE V +QK+ PLSET K+ ++
Sbjct: 295 RPYFRSEDNKEKGLTPLKDALKMPEDEIKTNVSLKQKTVTPLSETKHASI-----KHDEL 349
Query: 386 KPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQRDKSGE 445
KPPK+KS S K+ SL KH S QEYAEFYG+G V S+SK QK+R+RHRQ++K +
Sbjct: 350 KPPKMKS-SIKNPSLLNKHSHSG-QEYAEFYGTGMVSSSLSRSKGQKDRSRHRQKEKGLD 407
Query: 446 LVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDSYRF 486
++ G G EPK EM+ DY+ KFD YN+R+KY DS F
Sbjct: 408 ILTGTLGAEPKLAEMR-TDYNEPKFDQYNIRNKYRYDSSSF 447
>gi|357140648|ref|XP_003571876.1| PREDICTED: uncharacterized protein LOC100824092 [Brachypodium
distachyon]
Length = 491
Score = 516 bits (1330), Expect = e-144, Method: Compositional matrix adjust.
Identities = 266/481 (55%), Positives = 343/481 (71%), Gaps = 24/481 (4%)
Query: 19 QFQAHAAPAGPLIKHLTSLLKWTTRSSSKAP------QADGNV-LQFENGYLVETVIEGN 71
+ A A+P +++ L+S++KW SS +P Q DGNV LQFE+GY VET++EG+
Sbjct: 22 EANAFASPTDNIVRQLSSVVKWPRGSSPHSPKQSSHSQYDGNVALQFESGYFVETLVEGD 81
Query: 72 EIGVVPYKIRVS--EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP 129
++GV P+ IRVS E GEL AVD +SNIV+ITPPLS+YSR RLVAGSFQG+ GH+DGKP
Sbjct: 82 KLGVTPHTIRVSPLEGGELLAVDSAHSNIVRITPPLSEYSRARLVAGSFQGHAGHIDGKP 141
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
+ARF P GV +DD GN+Y+ADT NLAIRKIGD+GVTTIAGGKSN+ G+RDGPSEDAKF
Sbjct: 142 IDARFKRPTGVAVDDTGNVYIADTANLAIRKIGDSGVTTIAGGKSNIPGYRDGPSEDAKF 201
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVT 249
S DFDVVYV+ CSLLVIDRGNAALR+I+L Q+DC YQ ++ +DI++V+GAV+ GY+
Sbjct: 202 STDFDVVYVKKMCSLLVIDRGNAALRKIALPQEDCTYQDAALLSSDIILVIGAVVAGYLF 261
Query: 250 CMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GWPSFGQLI 307
++Q GFGP + + P E E +E S KP +VES+KEEP GWPSFG L+
Sbjct: 262 SVVQHGFGPSSSEKIEAP-------EDEKQESSTVGKPPLVVESLKEEPSAGWPSFGTLV 314
Query: 308 IDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPL 367
DL KLA+E +GS+ N VP R + K LTPL+D L MPED P+ Q+ S+ P+
Sbjct: 315 ADLLKLAIEGVGSLLFNIVPSRLQRVKRKTGLTPLKDRLVMPEDREETPIAQKL-SSTPM 373
Query: 368 SETHQVRTPGTGDKYPDMKPPKIKSAS-FKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYS 426
T + P T ++ IKS+S F+DS+LS+KHRS+KR EYAEFYGS E P +
Sbjct: 374 -RTETLHAPNTVNETAPKAQKSIKSSSKFRDSTLSSKHRSTKRPEYAEFYGSSET--PQA 430
Query: 427 KSKSQKERTRHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDS-YR 485
+K K+R RHR R+KSGE+ YG EPKP EMKP DY++ K+DHYNMR+KYG D+ +R
Sbjct: 431 SAKVPKDRLRHRHREKSGEVAYGSGHPEPKPAEMKPADYNDPKYDHYNMRNKYGTDTGFR 490
Query: 486 F 486
+
Sbjct: 491 Y 491
>gi|115445197|ref|NP_001046378.1| Os02g0234200 [Oryza sativa Japonica Group]
gi|50251216|dbj|BAD27660.1| putative NHL repeat-containing protein [Oryza sativa Japonica
Group]
gi|113535909|dbj|BAF08292.1| Os02g0234200 [Oryza sativa Japonica Group]
gi|125538730|gb|EAY85125.1| hypothetical protein OsI_06475 [Oryza sativa Indica Group]
gi|222622487|gb|EEE56619.1| hypothetical protein OsJ_06002 [Oryza sativa Japonica Group]
Length = 493
Score = 502 bits (1293), Expect = e-139, Method: Compositional matrix adjust.
Identities = 264/482 (54%), Positives = 333/482 (69%), Gaps = 28/482 (5%)
Query: 22 AHAAPAGPLIKHLTSLLKWTT-----------RSSSKAPQADGNV-LQFENGYLVETVIE 69
A +PA +++ L+S++KW PQ DG V LQFE+GY VET++E
Sbjct: 23 AFPSPADSIVRQLSSVVKWPRVPSSSSSSSHGHKQPSHPQYDGGVALQFESGYFVETLVE 82
Query: 70 GNEIGVVPYKIRVS--EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG 127
G+++GV P+ IRVS E GEL AVD +SNIV+ITPPLS+YSRGRLVAGSFQG++GH+DG
Sbjct: 83 GDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYSRGRLVAGSFQGHSGHIDG 142
Query: 128 KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDA 187
KP++ARF P GV +DD GN+YVADT NLAIRKIG++GVTTIAGGKSN+ G+RDGPSEDA
Sbjct: 143 KPSDARFKRPTGVAVDDTGNVYVADTANLAIRKIGESGVTTIAGGKSNIPGYRDGPSEDA 202
Query: 188 KFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGY 247
KFS DFDVVYV+ CSLLVIDRGNAALR+I+L Q+DC YQ ++ +DI++V+GAV+ GY
Sbjct: 203 KFSTDFDVVYVKKMCSLLVIDRGNAALRKIALPQEDCTYQDATLLSSDIILVIGAVVAGY 262
Query: 248 VTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GWPSFGQ 305
+ ++Q GFG + + P E E +E S KP +VES+KEEP GWPS G
Sbjct: 263 IFSVVQHGFGSSTAEKIEAP-------EDEHQESSTVGKPPLVVESLKEEPSAGWPSLGT 315
Query: 306 LIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAV 365
LI DL KL +E +G+ L VP R + K +LTPL+D L MPED PV Q+ S
Sbjct: 316 LIGDLLKLVIEGVGNQLLRLVPSRLQHGKRKTDLTPLKDRLVMPEDTEETPVAQKLSSTP 375
Query: 366 PLSETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPY 425
ET + P ++ +KS+ F+DS+LS+KHRSSKRQEYAEFYGS E P
Sbjct: 376 MRPET--LHGPNPVNETAPKAQKSVKSSKFRDSTLSSKHRSSKRQEYAEFYGSSET--PQ 431
Query: 426 SKSKSQKERTRHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDS-Y 484
SK K+R RHR R+KSGE VYG S EPKP E+KP DY + K+D YN+RSKYG DS Y
Sbjct: 432 VSSKVPKDRLRHRHREKSGEAVYGTSHPEPKPAEVKPADYSDPKYDPYNIRSKYGADSGY 491
Query: 485 RF 486
R+
Sbjct: 492 RY 493
>gi|242061092|ref|XP_002451835.1| hypothetical protein SORBIDRAFT_04g008420 [Sorghum bicolor]
gi|241931666|gb|EES04811.1| hypothetical protein SORBIDRAFT_04g008420 [Sorghum bicolor]
Length = 495
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 261/480 (54%), Positives = 334/480 (69%), Gaps = 31/480 (6%)
Query: 26 PAGPLIKHLTSLLKWTTRS--SSKAPQ-------ADGNV---LQFENGYLVETVIEGNEI 73
P G ++K L+S+++W S+ P+ ADG+V LQFE+GY VET++EG+++
Sbjct: 28 PTGSIVKQLSSVVRWPRGGGPSTHGPKLPGHSQYADGHVGVSLQFESGYFVETLVEGDKL 87
Query: 74 GVVPYKIRVS--EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE 131
GV P+ IRVS E GEL AVD +SNIV+ITPPLS+YSRGRLVAGSFQG++GH+DGKP++
Sbjct: 88 GVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYSRGRLVAGSFQGHSGHIDGKPSD 147
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
ARF P GV +DD GN+YVADT NLAIRKIG++GVTTIAGGKSN+ G+RDGPSEDAKFS
Sbjct: 148 ARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTIAGGKSNIPGYRDGPSEDAKFST 207
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCM 251
DFDVVYV+ CSLLVIDRGNAALR+ISL Q+DC YQ +++ +D+++V+GAV+ GY+
Sbjct: 208 DFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCTYQDSALLSSDLILVIGAVVAGYIFSG 267
Query: 252 LQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GWPSFGQLIID 309
Q GFG F S + E+E +E + I KP +VES+KEEP GWPS G LI D
Sbjct: 268 FQHGFG-FSGSEKVEAPENEQHESSTIG------KPPLVVESLKEEPGAGWPSLGTLIAD 320
Query: 310 LSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSE 369
L KLA+E +G + L+ VP R + K +LTPLRD L MPED P Q+ S E
Sbjct: 321 LLKLAIEGVGKLLLSVVPQRMQHGKKKTDLTPLRDRLVMPEDREETPAAQKLSSTPMRPE 380
Query: 370 THQVRTPGTGDKYPDMKPPKIKSAS-FKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKS 428
T V P + IKS+S F+DS+LS+KHRSSKRQEYA+FYG+ E P P +
Sbjct: 381 T--VHAPNAVPETAVKAQKSIKSSSKFRDSTLSSKHRSSKRQEYADFYGTSE-PAPVG-A 436
Query: 429 KSQKERTRHR-QRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDS-YRF 486
K K+R RHR R+KSGE+ YG + + KP E KP DY + K+D Y MRSKY +S YR+
Sbjct: 437 KVPKDRLRHRHHREKSGEVAYGAAHPDLKPAEAKPADYSDPKYDPY-MRSKYAAESGYRY 495
>gi|226504928|ref|NP_001143593.1| uncharacterized protein LOC100276297 precursor [Zea mays]
gi|195623016|gb|ACG33338.1| hypothetical protein [Zea mays]
Length = 501
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 258/491 (52%), Positives = 335/491 (68%), Gaps = 29/491 (5%)
Query: 16 FTLQFQAHAAPAGPLIKHLTSLLKWTTRS-SSKAPQ-------ADGNV---LQFENGYLV 64
F A A P G ++K L+S+++W + S+ P+ ADG+V LQFE+GY V
Sbjct: 20 FRPSAAALAPPTGSIVKQLSSVVRWPRAAPSTHGPKQPGHPQYADGHVGVALQFESGYFV 79
Query: 65 ETVIEGNEIGVVPYKIRVS--EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT 122
ET++EG+++GV P+ IRVS E GEL AVD +SNIV+ITPPLS+YSRGRLVAGSFQG++
Sbjct: 80 ETLVEGDKLGVTPHTIRVSPVEGGELLAVDSAHSNIVRITPPLSEYSRGRLVAGSFQGHS 139
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
GH+DGKP++ARF P GV +DD GN+YVADT NLAIRKIG++GVTTIAGGKSN+ G+RDG
Sbjct: 140 GHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTIAGGKSNIPGYRDG 199
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGA 242
PSEDAKFS DFDVVYV+ CSLLVIDRGNAALR+ISL Q+DC YQ +++ +D+++V+GA
Sbjct: 200 PSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCTYQDSALLSSDLILVIGA 259
Query: 243 VLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GW 300
V+ GY+ Q GFG F S + E+E +E + I KP +VES+KEEP GW
Sbjct: 260 VVAGYIFSGFQHGFG-FSGSEKVEAPENEQHESSTIG------KPPLVVESLKEEPGAGW 312
Query: 301 PSFGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 360
PS G LI DL KLA+E +G + L+ VP R + K +LTPLRD L MPED +
Sbjct: 313 PSLGTLIADLLKLAIEGVGKLLLSVVPQRMQHGKRKTDLTPLRDRLVMPEDREETAAAAQ 372
Query: 361 QKSAVPLS-ETHQVRTPGTGDKYPDMKPPKIKSASFKDSS-LSTKHRSSKRQEYAEFYGS 418
+ S+ P+ ET P T P P K + +DSS LS+KHRSSKRQEYA+FYG+
Sbjct: 373 KLSSTPMRPETAHAPNPVTETAAPKA-PKSTKPSKLRDSSTLSSKHRSSKRQEYADFYGT 431
Query: 419 GEVPPPYSKSKSQKE--RTRHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMR 476
E P + +K K+ R RH R+KSGE+ YG + + KP E KP DY + +D Y +R
Sbjct: 432 SEPAPVGAAAKVPKDRLRHRHHHREKSGEVAYGAAHHDLKPAEAKPADYSDPSYDPY-VR 490
Query: 477 SKYGDDS-YRF 486
SKY +S YR+
Sbjct: 491 SKYAAESGYRY 501
>gi|413936635|gb|AFW71186.1| hypothetical protein ZEAMMB73_395034 [Zea mays]
Length = 501
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 257/491 (52%), Positives = 335/491 (68%), Gaps = 29/491 (5%)
Query: 16 FTLQFQAHAAPAGPLIKHLTSLLKWTTRS-SSKAPQ-------ADGNV---LQFENGYLV 64
F A A P G ++K L+S+++W + S+ P+ ADG+V LQFE+GY V
Sbjct: 20 FRPSAAALAPPTGSIVKQLSSVVRWPRAAPSTHGPKQPGHPQYADGHVGVALQFESGYFV 79
Query: 65 ETVIEGNEIGVVPYKIRVS--EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT 122
ET++EG+++GV P+ +RVS E GEL AVD +SNIV+ITPPLS+YSRGRLVAGSFQG++
Sbjct: 80 ETLVEGDKLGVTPHTVRVSPVEGGELLAVDSAHSNIVRITPPLSEYSRGRLVAGSFQGHS 139
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
GH+DGKP++ARF P GV +DD GN+YVADT NLAIRKIG++GVTTIAGGKSN+ G+RDG
Sbjct: 140 GHIDGKPSDARFKRPTGVAVDDMGNVYVADTANLAIRKIGESGVTTIAGGKSNIPGYRDG 199
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGA 242
PSEDAKFS DFDVVYV+ CSLLVIDRGNAALR+ISL Q+DC YQ +++ +D+++V+GA
Sbjct: 200 PSEDAKFSTDFDVVYVKKMCSLLVIDRGNAALRKISLPQEDCTYQDSALLSSDLILVIGA 259
Query: 243 VLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GW 300
V+ GY+ Q GFG F S + E+E +E + I KP +VES+KEEP GW
Sbjct: 260 VVAGYIFSGFQHGFG-FSGSEKVEAPENEQHESSTIG------KPPLVVESLKEEPGAGW 312
Query: 301 PSFGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 360
PS G LI DL KLA+E +G + L+ VP R + K +LTPLRD L MPED +
Sbjct: 313 PSLGTLIADLLKLAIEGVGKLLLSVVPQRMQHGKRKTDLTPLRDRLVMPEDREETAAAAQ 372
Query: 361 QKSAVPLS-ETHQVRTPGTGDKYPDMKPPKIKSASFKDSS-LSTKHRSSKRQEYAEFYGS 418
+ S+ P+ ET P T P P K + +DSS LS+KHRSSKRQEYA+FYG+
Sbjct: 373 KLSSTPMRPETAHAPNPVTETAAPKA-PKSTKPSKLRDSSTLSSKHRSSKRQEYADFYGT 431
Query: 419 GEVPPPYSKSKSQKE--RTRHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMR 476
E P + +K K+ R RH R+KSGE+ YG + + KP E KP DY + +D Y +R
Sbjct: 432 SEPAPVGAAAKVPKDRLRHRHHHREKSGEVAYGAAHHDLKPAEAKPADYSDPSYDPY-VR 490
Query: 477 SKYGDDS-YRF 486
SKY +S YR+
Sbjct: 491 SKYAAESGYRY 501
>gi|238009310|gb|ACR35690.1| unknown [Zea mays]
Length = 340
Score = 315 bits (806), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 181/348 (52%), Positives = 232/348 (66%), Gaps = 16/348 (4%)
Query: 146 GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 205
GN+YVADT NLAIRKIG++GVTTIAGGKSN+ G+RDGPSEDAKFS DFDVVYV+ CSLL
Sbjct: 2 GNVYVADTANLAIRKIGESGVTTIAGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLL 61
Query: 206 VIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQ 265
VIDRGNAALR+ISL Q+DC YQ +++ +D+++V+GAV+ GY+ Q GFG F S
Sbjct: 62 VIDRGNAALRKISLPQEDCTYQDSALLSSDLILVIGAVVAGYIFSGFQHGFG-FSGSEKV 120
Query: 266 QPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GWPSFGQLIIDLSKLALEAMGSMFL 323
+ E+E +E + I KP +VES+KEEP GWPS G LI DL KLA+E +G + L
Sbjct: 121 EAPENEQHESSTIG------KPPLVVESLKEEPGAGWPSLGTLIADLLKLAIEGVGKLLL 174
Query: 324 NFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLS-ETHQVRTPGTGDKY 382
+ VP R + K +LTPLRD L MPED ++ S+ P+ ET P T
Sbjct: 175 SVVPQRMQHGKRKTDLTPLRDRLVMPEDREETAAAAQKLSSTPMRPETAHAPNPVTETAA 234
Query: 383 PDMKPPKIKSASFKDSS-LSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKE--RTRHRQ 439
P P K + +DSS LS+KHRSSKRQEYA+FYG+ E P + +K K+ R RH
Sbjct: 235 PKA-PKSTKPSKLRDSSTLSSKHRSSKRQEYADFYGTSEPAPVGAAAKVPKDRLRHRHHH 293
Query: 440 RDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDS-YRF 486
R+KSGE+ YG + + KP E KP DY + +D Y +RSKY +S YR+
Sbjct: 294 REKSGEVAYGAAHHDLKPAEAKPADYSDPSYDPY-VRSKYAAESGYRY 340
>gi|317015034|gb|ADU85604.1| NHL repeat-containing protein, partial [Wolffia arrhiza]
Length = 239
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 134/204 (65%), Positives = 170/204 (83%)
Query: 34 LTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDE 93
L+SL KW ++S K PQ + ++++FE+GYLVET+ +GN +G+ P+ IRVS DGEL AVD
Sbjct: 36 LSSLFKWVSKSPPKNPQTEASLVRFESGYLVETIADGNRLGLTPHAIRVSPDGELIAVDS 95
Query: 94 VNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADT 153
VNSNIV+ITPPLS +SRGRLVAGSFQG +G +DGKP+EARFN P+GV +D +GN+YVAD
Sbjct: 96 VNSNIVRITPPLSAFSRGRLVAGSFQGRSGLIDGKPSEARFNQPRGVAVDRRGNIYVADV 155
Query: 154 LNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 213
NLAIRKIGD+GV+TIAGGK+ AGFRDGPSE+A+FS DFDVVYV+ CSLLV+DRGNAA
Sbjct: 156 SNLAIRKIGDSGVSTIAGGKAGAAGFRDGPSEEARFSADFDVVYVKSLCSLLVVDRGNAA 215
Query: 214 LRQISLNQDDCEYQYNSISPTDIL 237
LR+I L+ DDC ++S+ +DIL
Sbjct: 216 LRKIFLHDDDCTQDFSSVGFSDIL 239
>gi|147798773|emb|CAN69794.1| hypothetical protein VITISV_022544 [Vitis vinifera]
Length = 491
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 179/466 (38%), Positives = 254/466 (54%), Gaps = 47/466 (10%)
Query: 40 WTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIV 99
W+ ++++K + ++++FE+GY VETV +G+++G+ PY I V GEL +D NSN+
Sbjct: 48 WSLKATTKTAISSRSMMKFESGYTVETVFDGSKLGIEPYSIEVLGSGELLILDSANSNLY 107
Query: 100 KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR 159
KI+ LSQY+R +LV GS +GY+GHVDG+ EAR NHPKG+T+DD+GN+YVADTLN+AIR
Sbjct: 108 KISSSLSQYTRPKLVTGSAEGYSGHVDGRLREARMNHPKGLTVDDRGNIYVADTLNMAIR 167
Query: 160 KIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
KI DAGVTTIAGGK G DGPSEDAKFSNDFDVVY+ +CSLLVIDRGN A+R+I
Sbjct: 168 KISDAGVTTIAGGKLGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQ 227
Query: 219 LNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEI 278
L+ DDC YQY S P I +++ A GY+ +LQ+ G S Q + S T
Sbjct: 228 LHFDDCAYQYGSGFPLGIAVLIAAGFFGYMLALLQRRVGTIVSSENDQANPS-IAHSTYQ 286
Query: 279 KELSNKEKP-IPIVESM--KEEPGWPSFGQLIIDLSKLALEAMGSMF--LNFVPFRFRSS 333
K L + P IP + M +EE + S G+L + E G M L P ++
Sbjct: 287 KPLKSVRPPLIPTEDEMEKQEEGFFGSLGKLFVYAGACIAEIFGGMIPGLKKKPHSYQYQ 346
Query: 334 GT--------KGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDM 385
N PL+++ +P DE PP + RTP YP M
Sbjct: 347 NQQNYQQPQKHSNAWPLQESFVIP-DEDEPPSI-------------DTRTPTPRKTYPFM 392
Query: 386 KPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRH----RQRD 441
KD+ H+ + + + + T H + +
Sbjct: 393 S---------KDA--EKMHQIRQSRAFVSGWDGDFQQQQKQHHHRHYSSTPHTYYEQNCE 441
Query: 442 KSGELVYGV---SGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDSY 484
K+ E+V+G + +PV++KPV+Y + +DH+N+RS+ Y
Sbjct: 442 KTNEIVFGAVQEQQVRREPVDIKPVNYGDPIYDHHNIRSRIHSKGY 487
>gi|225424717|ref|XP_002265467.1| PREDICTED: uncharacterized protein LOC100263777 [Vitis vinifera]
gi|296086531|emb|CBI32120.3| unnamed protein product [Vitis vinifera]
Length = 491
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 179/466 (38%), Positives = 254/466 (54%), Gaps = 47/466 (10%)
Query: 40 WTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIV 99
W+ ++++K + ++++FE+GY VETV +G+++G+ PY I V GEL +D NSN+
Sbjct: 48 WSLKATTKTAISSRSMMKFESGYTVETVFDGSKLGIEPYSIEVLGSGELLILDSANSNLY 107
Query: 100 KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR 159
KI+ LSQY+R +LV GS +GY+GHVDG+ EAR NHPKG+T+DD+GN+YVADTLN+AIR
Sbjct: 108 KISSSLSQYTRPKLVTGSAEGYSGHVDGRLREARMNHPKGLTVDDRGNIYVADTLNMAIR 167
Query: 160 KIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
KI DAGVTTIAGGK G DGPSEDAKFSNDFDVVY+ +CSLLVIDRGN A+R+I
Sbjct: 168 KISDAGVTTIAGGKLGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQ 227
Query: 219 LNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEI 278
L+ DDC YQY S P I +++ A GY+ +LQ+ G S Q + S T
Sbjct: 228 LHFDDCAYQYGSGFPLGIAVLIAAGFFGYMLALLQRRVGTIVSSENDQANPS-IAHSTYQ 286
Query: 279 KELSNKEKP-IPIVESM--KEEPGWPSFGQLIIDLSKLALEAMGSMF--LNFVPFRFRSS 333
K L + P IP + M +EE + S G+L + E G M L P ++
Sbjct: 287 KPLKSVRPPLIPTEDEMERQEEGFFGSLGKLFVYAGACIAEIFGGMIPGLKKKPHSYQYQ 346
Query: 334 GT--------KGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDM 385
N PL+++ +P DE PP + RTP YP M
Sbjct: 347 NQQNYQQPQKHSNAWPLQESFVIP-DEDEPPSI-------------DTRTPTPRKTYPFM 392
Query: 386 KPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRH----RQRD 441
KD+ H+ + + + + T H + +
Sbjct: 393 S---------KDA--EKMHQIRQSRAFVSGWDGDFQQQQKQHHHRHYSSTPHTYYEQNCE 441
Query: 442 KSGELVYGV---SGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDSY 484
K+ E+V+G + +PV++KPV+Y + +DH+N+RS+ Y
Sbjct: 442 KTNEIVFGAVQEQQVRREPVDIKPVNYGDPIYDHHNIRSRIHSKGY 487
>gi|356565931|ref|XP_003551189.1| PREDICTED: uncharacterized protein LOC100781836 [Glycine max]
Length = 493
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 180/483 (37%), Positives = 266/483 (55%), Gaps = 79/483 (16%)
Query: 40 WTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIV 99
W+ ++++K + ++++FE+GY VETV +G+++G+ PY + V +GEL +D NSNI
Sbjct: 52 WSLKATTKTAVSSRSMMKFESGYNVETVFDGSKLGIEPYAVEVLPNGELLILDSANSNIY 111
Query: 100 KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR 159
+I+ LS YSR +LVAGS +GY+GHVDGK EAR NHPKG+T+DD+GN+YVADT N+AIR
Sbjct: 112 RISSSLSLYSRPKLVAGSAEGYSGHVDGKLREARMNHPKGITVDDRGNIYVADTTNMAIR 171
Query: 160 KIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
KI D+GVTTIAGGK N G DGPSE+AKFS+D DVVYV +CSLLVIDRGN A+R+I
Sbjct: 172 KISDSGVTTIAGGKWNRGGGHVDGPSEEAKFSDDLDVVYVGSSCSLLVIDRGNRAIREIQ 231
Query: 219 LNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEI 278
L+ DDC YQY S P I M+VGA GY+ +LQ+ G S+ E+
Sbjct: 232 LHFDDCAYQYGSGFPLGIAMLVGAGFFGYMLALLQRRLGTIVASQ-------------EV 278
Query: 279 KELSNKEKPI-----PIVES-----MKEEPGWPSFGQLIIDLSKLALEAMGSMFLNF--- 325
S +KP+ P++ S +EE + S G+L+ + +E +G +F +F
Sbjct: 279 IPPSPYQKPLKSVRPPLIPSEYEPDKQEEGFFGSLGKLLANTGASMVEIIGGLFPSFRRK 338
Query: 326 -VPFRFRSS------GTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGT 378
+ ++F+ + N P++++ +P DE PP + R P
Sbjct: 339 SLRYQFQRQPLIQQPQKQVNAWPVQESFVIP-DEDQPPSI-------------DTRAPTP 384
Query: 379 GDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHR 438
YP M KD+ + R S+ FY SG + + Q++ RH+
Sbjct: 385 RKTYPFMS---------KDAEKMQQLRQSR-----AFY-SGWDGDLQQQQQQQQKHHRHQ 429
Query: 439 QR------------DKSGELVYGVSGMEPKPVE---MKPVDYDNLKFDHYNMRSKYGDDS 483
R + + E+V+G + + E +KPVDY ++H N+RS+
Sbjct: 430 YRSSIPHTYYEQSHETTNEIVFGAVEEQDRKQESVVIKPVDYGESMYEH-NIRSRMSSMG 488
Query: 484 YRF 486
Y +
Sbjct: 489 YGY 491
>gi|115448467|ref|NP_001048013.1| Os02g0730400 [Oryza sativa Japonica Group]
gi|46390465|dbj|BAD15926.1| putative NHL repeat-containing protein [Oryza sativa Japonica
Group]
gi|46390861|dbj|BAD16365.1| putative NHL repeat-containing protein [Oryza sativa Japonica
Group]
gi|113537544|dbj|BAF09927.1| Os02g0730400 [Oryza sativa Japonica Group]
gi|125541002|gb|EAY87397.1| hypothetical protein OsI_08804 [Oryza sativa Indica Group]
gi|125583566|gb|EAZ24497.1| hypothetical protein OsJ_08258 [Oryza sativa Japonica Group]
gi|215715228|dbj|BAG94979.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 508
Score = 272 bits (695), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 145/349 (41%), Positives = 215/349 (61%), Gaps = 17/349 (4%)
Query: 27 AGPLIKHLTSLLK--WTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSE 84
+G L +S++K W+ +S++K +++++E GY VETV +G+++G+ PY + V++
Sbjct: 39 SGFLSNAASSVMKRLWSLKSTTKTGSGGKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQ 98
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
GEL +D +NSNI +I PLS+YSR +LVAGS +G+ GHVDG+P EAR NHPKG T+D
Sbjct: 99 GGELLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGFPGHVDGRPREARMNHPKGFTVDG 158
Query: 145 KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
+GN+YVAD +N+AIRKI D GVTTIAGGKS+ G DGPS+DAKFS DF+V Y+ +CSL
Sbjct: 159 RGNIYVADAMNMAIRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIGSSCSL 218
Query: 205 LVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRT 264
LVIDRGN A+R+I LN DDC YQY + P + +++ A GY+ +LQ+ T
Sbjct: 219 LVIDRGNQAIREIQLNFDDCVYQYEAGFPLGVAVLLAAAFFGYMLALLQRRV--LGMVST 276
Query: 265 QQPSESEFNEETEIKELSNKEKPI------PIV----ESMK---EEPGWPSFGQLIIDLS 311
+ E++ + I + +KP+ P++ ES K EE + S G+LI+
Sbjct: 277 EDEPETQTPLKASIASIPPYQKPLKQSVRPPLIPNEDESEKQEVEEGFFTSIGKLIVGAK 336
Query: 312 KLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 360
E +G+ F + P++++ +P DE PP+ R
Sbjct: 337 SSVAEIVGAAFSRKKRLNIHQQQARVRSWPVQESYAIPRDETPPPLDTR 385
>gi|224102017|ref|XP_002312513.1| predicted protein [Populus trichocarpa]
gi|222852333|gb|EEE89880.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 271 bits (693), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 181/500 (36%), Positives = 267/500 (53%), Gaps = 51/500 (10%)
Query: 6 LLLTLTLLIAF--TLQFQAHAAPA---GPLIKHLTSLLK---WTTRSSSKAPQADGNVLQ 57
L+L +L+ F +PA G L ++ S L W+ ++++K + +++
Sbjct: 9 LVLCFIMLLFFGGVTSVPTTTSPAKIVGGLFSNVVSALMKWLWSLKATTKTVISGRPMMK 68
Query: 58 FENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS 117
FE+GY VETV +G+++G+ PY ++V GEL +D NSNI +I+ LS YSR +LVAGS
Sbjct: 69 FESGYTVETVFDGSKLGIEPYSVQVLPSGELLILDSANSNIYRISASLSLYSRPKLVAGS 128
Query: 118 FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA 177
+GY+GHVDGK EA+ NHPKG+T+DD+GN+YVADT+N+AIRKI DAGVTTIAGGK
Sbjct: 129 HEGYSGHVDGKLREAKMNHPKGLTVDDRGNIYVADTMNMAIRKISDAGVTTIAGGKWGRG 188
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDIL 237
DG SEDA FSNDFDVVY+ +CSLLVIDRGN A+R+I L+ DDC YQY S P I
Sbjct: 189 SHVDGASEDANFSNDFDVVYIGSSCSLLVIDRGNRAIREIQLHFDDCAYQYGSGFPLGIA 248
Query: 238 MVVGAVLVGYVTCMLQQGFGPFFFSR--TQQPSESEFNEETEIKELSNKEKPIPIVESMK 295
++V A GY+ +LQ+ G + + + S + + IK + P +
Sbjct: 249 VLVAAGFFGYMLALLQRRVGMIVSPQNVSMKMSTTGIPYQKPIKSIRPPLIPTEDEQEKH 308
Query: 296 EEPGWPSFGQLIIDLSKLALEAMGSMFLNF-----------VPFRFRSSGTKGNLTPLRD 344
EE + S G+L I+ +E G + +F + + + P++D
Sbjct: 309 EEGLFGSLGKLFINTGASVMEIFGGIVPSFRKKPVSYQYQNYQQQQYQHQKQLSSWPVQD 368
Query: 345 TLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKH 404
+ +P DE PP ++ RTP YP M KD+ +
Sbjct: 369 SFVIP-DEDEPPSIES-------------RTPTPRKTYPFMS---------KDTEKMHQW 405
Query: 405 RSSKRQEYAEFYG--SGEVPPPYSKSKSQKERTRHRQR-DKSGELVYGVSGMEPKPVE-- 459
R R Y+ + G + + + +S T + Q +K+ E+V+G + E
Sbjct: 406 RQG-RSIYSGWDGDLQQQQHQHHHRYQSSTPHTYYEQSYEKTNEIVFGAVQEQDGKYETM 464
Query: 460 -MKPVDYDNLKFDHYNMRSK 478
KPVDY + K H+N+RS+
Sbjct: 465 VTKPVDYGDPKHYHHNIRSR 484
>gi|449492692|ref|XP_004159073.1| PREDICTED: uncharacterized protein LOC101226879 [Cucumis sativus]
Length = 516
Score = 270 bits (691), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 178/531 (33%), Positives = 277/531 (52%), Gaps = 76/531 (14%)
Query: 1 MKGTHLLLTLTLLIAFTLQFQAHAAPAGPLIKHL-----TSLLK--WTTRSSSKAPQADG 53
M + L L+ F A P +++ + +SL+K W+ +SS+K +
Sbjct: 1 MARIWVAFALVFLLLFGGVSSVSATPPAKIVRGVLSNVVSSLVKKLWSMKSSAKTAVSSR 60
Query: 54 NVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
++++FE+GY VETV +G+++G+ PY + +S GEL +D NSNI KI+ PLS++SR +L
Sbjct: 61 SMMKFESGYSVETVFDGSKLGIDPYSVEMSPSGELLILDAENSNIHKISMPLSRFSRPKL 120
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK 173
V+GS +GY+GHVDG EAR NHPKG+T+D++GN+Y+ADT+N+AIRKI D GVTTIAGG+
Sbjct: 121 VSGSAEGYSGHVDGHSREARMNHPKGLTLDERGNIYIADTMNMAIRKISDTGVTTIAGGR 180
Query: 174 SNV-AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSIS 232
N +G DGPSEDAKFSNDFDVVYV +CSLLVIDRGN A+R+I LN DDC QY
Sbjct: 181 WNQGSGHIDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIELNYDDCNTQYADSL 240
Query: 233 PTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVE 292
++++V A L GY+ +LQ+ F S+ Q E + +K P ++
Sbjct: 241 NLGVVLLVAAGLFGYLLALLQRRVQAMFSSQKDQ----EIRSQQMMKATPVAPYQRPPLK 296
Query: 293 SMK-------------EEPGWPSFGQLIIDLSKLALEAMGSMFLNF--------VPFRFR 331
S++ EE + S G+L ++ + G + F + +F+
Sbjct: 297 SVRPSLIPSEDEPEKLEEGFFGSLGRLFVNSGSSMADIFGGLLSGFRRKPLNHQIHQQFQ 356
Query: 332 SSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDMKPPKIK 391
N PL+++ +P DE PP + + +TP YP M +
Sbjct: 357 PVNRHPNAWPLQESFVIP-DEDEPPSI-------------ETKTPTIKKTYPFMTQDLDR 402
Query: 392 SASFKDSSLSTKHRSSKRQEYAEF--------------------YGSGEVPPPYSKSKSQ 431
S FK +RS EF + + + + S
Sbjct: 403 SHQFK------PNRSYFSGWDGEFHQQQQQQQIQHHHQQQHIQHHHHQQQQQYHHRQYSA 456
Query: 432 KERTRHRQRDKSGELVYGVSGMEP---KPVEMKPVDYDNLKFDHYNMRSKY 479
T + + ++ E+V+G + + + +K VDY + +++H+N+R++Y
Sbjct: 457 GPTTYYEKSCETNEIVFGAVQEQDGRREAMVIKAVDYGDPRYNHHNIRARY 507
>gi|255552640|ref|XP_002517363.1| conserved hypothetical protein [Ricinus communis]
gi|223543374|gb|EEF44905.1| conserved hypothetical protein [Ricinus communis]
Length = 494
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 172/460 (37%), Positives = 253/460 (55%), Gaps = 41/460 (8%)
Query: 40 WTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIV 99
W+ +S++K + ++++FE GY VETV +G+++G+ P+ + VS GEL +D NSNI
Sbjct: 47 WSLKSNTKTAASSRSMMKFEGGYTVETVFDGSKLGIEPHTVDVSPSGELLVLDSENSNIY 106
Query: 100 KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR 159
KI+ PLS+YSR +L+AGS +GY+GHVDG+ EAR NHPKG+T+D++GN+Y+ADT+N+AIR
Sbjct: 107 KISTPLSRYSRPKLIAGSPEGYSGHVDGRLREARMNHPKGLTVDNRGNIYIADTMNMAIR 166
Query: 160 KIGDAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
KI D GVTTIAGGK + G DGPSEDAKFSNDFD+ Y+ +CSLLVIDRGN A+R+I
Sbjct: 167 KISDGGVTTIAGGKWTRAGGHVDGPSEDAKFSNDFDIAYIGSSCSLLVIDRGNQAIREIQ 226
Query: 219 LNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEI 278
LN DDC QY+ I ++V A GY+ +LQ F R + T +
Sbjct: 227 LNDDDCNNQYDGTFHLGIAVLVAAAFFGYMLALLQWRVQTLFSPRND--PRTYMKNGTPM 284
Query: 279 KELSNKEKPI--PIVESMKEEPGWPSFGQLIIDLSKLALEAMGSMF----LNFVPFRFRS 332
KP+ P+V S +EP P G L KL L + S F L++ +
Sbjct: 285 APYQRPPKPVRPPLVPS-DDEPDKPDEG-FFGSLGKLVLGGLFSGFRRKPLHYQFQQQYQ 342
Query: 333 SGTK-GNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDMKPPKIK 391
K N P++++ +P DE PP + + RTP + YP M K
Sbjct: 343 QQLKHSNPWPMQESFVIP-DEDEPPSL-------------ETRTPTSKKAYPSMTNGVEK 388
Query: 392 SASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQRD---------K 442
FK SS + + +Y + K Q+ RH + +
Sbjct: 389 HYQFKQSS---GYYNGWDGDYHQQQQMQMQVQMQQHQKQQQHHHRHYTSNPKTYYEKSCE 445
Query: 443 SGELVYGVSGMEP---KPVEMKPVDYDNLKFDHYNMRSKY 479
+ E+V+G + + V +K VDY + +++H+N+R ++
Sbjct: 446 TNEIVFGAVQEQDGRREAVVIKAVDYADPRYNHHNIRPRF 485
>gi|356497860|ref|XP_003517774.1| PREDICTED: uncharacterized protein LOC100799483 [Glycine max]
Length = 508
Score = 267 bits (683), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 146/336 (43%), Positives = 211/336 (62%), Gaps = 18/336 (5%)
Query: 40 WTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIV 99
W+ ++ +K ++++FE+GY VETV +G+++G+ PY + V +GEL +D NSNI
Sbjct: 48 WSLKAPTKTAIGAKSMMKFESGYTVETVFDGSKLGIEPYAVEVLPNGELLILDSANSNIY 107
Query: 100 KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR 159
+I+ LS SR +LVAGS +GY+GHVDG+ EAR NHPKG+T+D++GN+YVAD +N+AIR
Sbjct: 108 RISSSLSLNSRPKLVAGSAEGYSGHVDGRFREARMNHPKGITVDNRGNIYVADIMNMAIR 167
Query: 160 KIGDAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
KI D+GVTTIAGGK S G DGPSE+AKFSNDFDVVYV +CSLLVIDRGN A+R+I
Sbjct: 168 KISDSGVTTIAGGKWSRGGGHVDGPSEEAKFSNDFDVVYVGSSCSLLVIDRGNQAIREIQ 227
Query: 219 LNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSR--TQQPSESEFNEET 276
L+ DDC YQY + P I +++GA GY+ +LQ+ G S+ + S S +
Sbjct: 228 LHFDDCAYQYENGFPLGIAVLIGAGFFGYMLALLQRRLGTIVASQDVSAHSSMSGISPSP 287
Query: 277 EIKELSNKEKPIPIVE---SMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNF-------- 325
K L++ P+ E +EE + S G+L+ + +E MG++F F
Sbjct: 288 YQKPLNSARPPLIPSEDESDKQEESFFGSIGKLLTNAGASVVEIMGALFPGFRKNPPQSY 347
Query: 326 ---VPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVV 358
P F+ + N P++++ +P DE PP +
Sbjct: 348 EFQSPPLFQQPQKQVNAWPVQESFVIP-DEDEPPSI 382
>gi|224108107|ref|XP_002314724.1| predicted protein [Populus trichocarpa]
gi|222863764|gb|EEF00895.1| predicted protein [Populus trichocarpa]
Length = 487
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 171/458 (37%), Positives = 253/458 (55%), Gaps = 48/458 (10%)
Query: 40 WTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIV 99
W+ +S++K + +++FE+GY VETV +G+++G+ PY ++V GEL +D VNSNI
Sbjct: 54 WSMKSTTKTVISGRPMMKFESGYTVETVFDGSKLGIEPYSVQVLPSGELLILDSVNSNIY 113
Query: 100 KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR 159
+++ LS YSR +LVAGS +GY+GHVDGK EA+ NHPKG+T+DD+GN+Y+ADT+N+AIR
Sbjct: 114 RMSSSLSLYSRPKLVAGSHEGYSGHVDGKLREAKMNHPKGLTVDDRGNIYIADTMNMAIR 173
Query: 160 KIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
KI DAGVTTIAGGK DG SEDAKFSNDFDV+Y+ +CSLLVIDRGN A+R+I L
Sbjct: 174 KISDAGVTTIAGGKWGRGSHVDGASEDAKFSNDFDVLYIGSSCSLLVIDRGNQAIREIQL 233
Query: 220 NQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIK 279
+ DDC YQY S P I +++ A GY+ +LQ+ G S N E
Sbjct: 234 HFDDCAYQYGSGFPLGIAVLLAAGFFGYMLALLQRRVGMIV---------SPQNVAMETS 284
Query: 280 ELSNKEKPI-----PIV----ESMKEEPG-WPSFGQLIIDLSKLALEAMGSMFLNFV--- 326
N +PI P++ E K E G + S G+L I+ E +G + +F
Sbjct: 285 TTGNPYQPIKSFRPPLIPTEDEQEKHEEGLFGSLGKLFINTWASIAEILGGIVPSFRKKP 344
Query: 327 PFRFRSSGTKGNLT-PLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDM 385
P + + + + P++D+ +P DE PP + + RTP YP M
Sbjct: 345 PNYQYQNYQQQSTSWPVQDSFVIP-DEDEPP-------------STETRTPTPRKTYPFM 390
Query: 386 KPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPY-SKSKSQKERTRHRQR-DKS 443
K ++ S RS +F + Y + +S T + Q +K+
Sbjct: 391 SKDTEKMHQWRQS------RSIYSGWDGDFQQQQQQKQQYHHRYQSSTPHTYYEQSYEKT 444
Query: 444 GELVYGV---SGMEPKPVEMKPVDYDNLKFDHYNMRSK 478
E+V+G ++ + PV+Y + + +N+RS+
Sbjct: 445 NEIVFGAVQEQDVKHGTTVILPVNYGDPIHNRHNIRSR 482
>gi|449449204|ref|XP_004142355.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101209861 [Cucumis sativus]
Length = 454
Score = 265 bits (676), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 164/458 (35%), Positives = 258/458 (56%), Gaps = 35/458 (7%)
Query: 43 RSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKIT 102
+SS+K + ++++FE+GY VETV +G+++G+ PY + +S GEL +D NSNI KI+
Sbjct: 2 KSSAKTAVSSRSMMKFESGYSVETVFDGSKLGIDPYSVEMSPSGELLILDAENSNIHKIS 61
Query: 103 PPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG 162
PLS++SR +LV+GS +GY+GHVDG EAR NHPKG+T+D++GN+Y+ADT+N+AIRKI
Sbjct: 62 MPLSRFSRPKLVSGSAEGYSGHVDGHSREARMNHPKGLTLDERGNIYIADTMNMAIRKIS 121
Query: 163 DAGVTTIAGGKSNV-AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
D GVTTIAGG+ N +G DGPSEDAKFSNDFDVVYV +CSLLVIDRGN A+R+I LN
Sbjct: 122 DTGVTTIAGGRWNQGSGHIDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIELNY 181
Query: 222 DDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFN-------E 274
DDC QY ++++V A L GY+ +LQ+ F S+ + S+ +
Sbjct: 182 DDCNTQYADSLNLGVVLLVAAGLFGYLLALLQRRVQAMFSSQKEIRSQQMMKATPVAPYQ 241
Query: 275 ETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNF--------V 326
+K + P EE + S G+L ++ + G + F +
Sbjct: 242 RPPLKSVRPSLIPSEDEPEKLEEGFFGSLGRLFVNSGSSMADIFGGLLSGFRRKPLNHQI 301
Query: 327 PFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDMK 386
+F+ N PL+++ +P DE PP + + +TP YP M
Sbjct: 302 HQQFQPVNRHPNAWPLQESFVIP-DEDEPPSI-------------ETKTPTIKKTYPFMT 347
Query: 387 PPKIKSASFKDSSLSTKH--RSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQRDKSG 444
+S FK ++ + + +Q++ + + + + + S T + + ++
Sbjct: 348 QDLDRSHQFKPNNSNNNSXIQHHHQQQHIQHHHHQQQQQYHHRQYSAGPTTYYEKSCETN 407
Query: 445 ELVYGVSGMEPKPVE---MKPVDYDNLKFDHYNMRSKY 479
E+V+G + E +K VDY + +++H+N+R++Y
Sbjct: 408 EIVFGAVQEQDGRREAMVIKAVDYGDPRYNHHNIRARY 445
>gi|255560011|ref|XP_002521024.1| conserved hypothetical protein [Ricinus communis]
gi|223539861|gb|EEF41441.1| conserved hypothetical protein [Ricinus communis]
Length = 500
Score = 264 bits (674), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 151/356 (42%), Positives = 215/356 (60%), Gaps = 24/356 (6%)
Query: 40 WTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIV 99
W+ +++SK + +++FE GY VETV +G+++G+ PY + V +GEL +D NSNI
Sbjct: 56 WSLKATSKTAISGRPMMKFEGGYNVETVFDGSKLGIEPYSVEVLPNGELLILDSANSNIY 115
Query: 100 KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR 159
+I+ LS YSR +LVAGS GY+GHVDGK EAR NHPKG+T+DD+GN+Y+AD +N+AIR
Sbjct: 116 RISSSLSLYSRPKLVAGSPDGYSGHVDGKHREARMNHPKGLTVDDRGNIYIADAMNMAIR 175
Query: 160 KIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
KI DAGVTTIAGGK G DG SEDAKFSNDFDVVY+ +CSLLVIDRGN A+R+I
Sbjct: 176 KISDAGVTTIAGGKWGRGGSHVDGASEDAKFSNDFDVVYIGSSCSLLVIDRGNRAIREIQ 235
Query: 219 LNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPS-ESEFNEETE 277
L+ DDC YQY S P + ++V A GY+ +LQ+ G S+ + + ++ +
Sbjct: 236 LHFDDCAYQYESGFPLGVAVLVAAGFFGYMLALLQRRVGKIVSSQNDRDAMKTSISGSPY 295
Query: 278 IKELSNKEKPIPIVESMK---EEPGWPSFGQLIIDLSKLALEAMGSMFLNF----VPFRF 330
K L + P+ E + EE + S G+L + +E +G + F + +++
Sbjct: 296 QKPLRSVRPPLIPTEDEQEKHEEGFFGSLGKLFANAGACVVEILGGIVPGFRKKPLNYQY 355
Query: 331 RSSGTKGNLT-PLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDM 385
S K + T P++D+ +P DE PP + + RTP YP M
Sbjct: 356 LSQQQKHSSTWPVQDSFVIP-DEDEPPSI-------------ETRTPTPKKTYPFM 397
>gi|296087695|emb|CBI34951.3| unnamed protein product [Vitis vinifera]
Length = 811
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 163/419 (38%), Positives = 234/419 (55%), Gaps = 35/419 (8%)
Query: 1 MKGTHLLLTLTLLIAFTLQFQAHAAPAGPLIKHLTS-----LLKW--TTRSSSKAPQADG 53
M+ + L L L++ + + P ++ + S L+KW + +S++ +
Sbjct: 1 MRNNWVGLFLILVVFGGFSSVSASPPVAKIVSGIVSNVVSALVKWLWSLKSTTNTAISSR 60
Query: 54 NVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
+ + FE+GY VETV +G+++G+ PY + VS GEL +D NSNI KI+ PLS+YSR +L
Sbjct: 61 SKMNFEDGYTVETVFDGSKLGIEPYSLEVSTSGELLVLDSENSNIYKISSPLSRYSRPKL 120
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK 173
VAGS GY+GHVDGK EAR NHPKG+TMD++GN+Y+ADT+N+AIRKI DAGVTTIAGGK
Sbjct: 121 VAGSPDGYSGHVDGKLREARMNHPKGLTMDERGNIYIADTMNMAIRKISDAGVTTIAGGK 180
Query: 174 SNVAGFR-DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSIS 232
G DGPSEDAKFSNDFDVVY+ +CSLLVIDRGN A+R+I L+ +DC YQYN
Sbjct: 181 WGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHYEDCAYQYNGSF 240
Query: 233 PTDILMVVGAVLVGYVTCMLQQGFGPFFFSR--TQQPSESEFNEETEIKELSNKEKPIPI 290
I ++V A GY+ +LQ+ F S+ + P + E+ + L + P+
Sbjct: 241 HLGIAVLVAAGFFGYMLALLQRRVAAMFSSQYDSSTPMKKGMPPESYQRPLKSVRAPLIP 300
Query: 291 VESMKE---EPGWPSFGQLIIDLSKLALEAMGSMFLN--------FVPFRFRSSGTKGNL 339
E E E + S G+L ++ E G +F + ++ N
Sbjct: 301 TEDEYEKADEGFFGSLGRLFLNTGSTLAEIFGGLFSGSRKKPPHQQIQQQYGQPNVHSNG 360
Query: 340 TPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDS 398
P++++ +P DE PP ++ R P YP M P K+ F+ S
Sbjct: 361 WPMQESYVIP-DEDEPPSIES-------------RAPTPKKTYPFMTPEMEKTHHFRQS 405
>gi|356501873|ref|XP_003519748.1| PREDICTED: uncharacterized protein LOC100797570 [Glycine max]
Length = 507
Score = 261 bits (666), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 169/471 (35%), Positives = 251/471 (53%), Gaps = 53/471 (11%)
Query: 40 WTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIV 99
W+ ++ +K ++++FE+GY VETV +G+++G+ PY + V +GEL +D NSNI
Sbjct: 48 WSLKAPTKTAIGAKSMMKFESGYTVETVFDGSKLGIEPYAVEVLANGELLILDSANSNIY 107
Query: 100 KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR 159
+I+ LS +R +LVAGS +GY+GHVDG+ EAR NHPKG+ +D++GN+Y+AD +N+AIR
Sbjct: 108 RISSSLSLNTRPKLVAGSAEGYSGHVDGRLREARMNHPKGIAVDNRGNIYIADIMNMAIR 167
Query: 160 KIGDAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
KI D+GVTTIAGGK S G DGPSE+AKFSNDFDVVY+ +CSLLVIDRGN A+R+I
Sbjct: 168 KISDSGVTTIAGGKWSRGGGHIDGPSEEAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQ 227
Query: 219 LNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEI 278
L+ DDC YQY + I M+VGA GY+ +LQ G S+ + S +
Sbjct: 228 LHFDDCAYQYENGLTLGIAMLVGAGFFGYMLALLQHRLGTIVASQ-DGAAMSGISPSPYQ 286
Query: 279 KELSNKEKPIPIVES---MKEEPGWPSFGQLIIDLSKLALEAMGSMFLNF------VPFR 329
K L + P+ E +EE S G+L+ + +E MG++F F F+
Sbjct: 287 KPLKSVRPPLIPSEDESDKQEESFIRSIGKLLTNAGASVVEIMGALFPGFRKKPQSYEFQ 346
Query: 330 ----FRSSGTKGNLTPLRDTLRMPEDEAAPPVV---------------QRQKSAVPLSET 370
F+ + N P++++ +P DE PP + Q + L ++
Sbjct: 347 SQPLFQQPQKQVNAWPVQESFVIP-DEDEPPSIDPRSPTPRKTYAFMSQDAEKMPQLWQS 405
Query: 371 HQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKS 430
H + GD + + + HR Y S VP Y +
Sbjct: 406 HAFYSGWGGDLQQQQQLKLQQQQQQQQQQQLKHHR--------HLYHS-SVPHTYYE--- 453
Query: 431 QKERTRHRQRDKSGELVYG-VSGMEPK--PVEMKPVDYDNLKFDHYNMRSK 478
+ ++ E+V+G V E K V +KPVDY +DH+ +R +
Sbjct: 454 -------QSHEEINEIVFGAVQEQEGKKETVVIKPVDYGQSLYDHHYIRPR 497
>gi|225452518|ref|XP_002274991.1| PREDICTED: uncharacterized protein LOC100266244 [Vitis vinifera]
Length = 677
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 162/417 (38%), Positives = 233/417 (55%), Gaps = 34/417 (8%)
Query: 1 MKGTHLLLTLTLLIAFTLQFQAHAAPAGPLIKHLTS-----LLKW--TTRSSSKAPQADG 53
M+ + L L L++ + + P ++ + S L+KW + +S++ +
Sbjct: 1 MRNNWVGLFLILVVFGGFSSVSASPPVAKIVSGIVSNVVSALVKWLWSLKSTTNTAISSR 60
Query: 54 NVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
+ + FE+GY VETV +G+++G+ PY + VS GEL +D NSNI KI+ PLS+YSR +L
Sbjct: 61 SKMNFEDGYTVETVFDGSKLGIEPYSLEVSTSGELLVLDSENSNIYKISSPLSRYSRPKL 120
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK 173
VAGS GY+GHVDGK EAR NHPKG+TMD++GN+Y+ADT+N+AIRKI DAGVTTIAGGK
Sbjct: 121 VAGSPDGYSGHVDGKLREARMNHPKGLTMDERGNIYIADTMNMAIRKISDAGVTTIAGGK 180
Query: 174 SNVAGFR-DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSIS 232
G DGPSEDAKFSNDFDVVY+ +CSLLVIDRGN A+R+I L+ +DC YQYN
Sbjct: 181 WGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRGNQAIREIQLHYEDCAYQYNGSF 240
Query: 233 PTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVE 292
I ++V A GY+ +LQ+ F S+ + F ++ + L + P+ E
Sbjct: 241 HLGIAVLVAAGFFGYMLALLQRRVAAMFSSQYVS-DPAFFTLQSYQRPLKSVRAPLIPTE 299
Query: 293 SMKE---EPGWPSFGQLIIDLSKLALEAMGSMFLN--------FVPFRFRSSGTKGNLTP 341
E E + S G+L ++ E G +F + ++ N P
Sbjct: 300 DEYEKADEGFFGSLGRLFLNTGSTLAEIFGGLFSGSRKKPPHQQIQQQYGQPNVHSNGWP 359
Query: 342 LRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDS 398
++++ +P DE PP ++ R P YP M P K+ F+ S
Sbjct: 360 MQESYVIP-DEDEPPSIES-------------RAPTPKKTYPFMTPEMEKTHHFRQS 402
>gi|22329761|ref|NP_173800.2| NHL domain-containing protein [Arabidopsis thaliana]
gi|18700091|gb|AAL77657.1| At1g23880/T23E23_8 [Arabidopsis thaliana]
gi|20856013|gb|AAM26643.1| At1g23880/T23E23_8 [Arabidopsis thaliana]
gi|332192324|gb|AEE30445.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 545
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 159/381 (41%), Positives = 222/381 (58%), Gaps = 27/381 (7%)
Query: 6 LLLTLTLLIAFTLQFQAHAAPA----GPLIKHLTSLLKW----TTRSSSKAPQADGNVLQ 57
L + + L AF + +PA + H TSLLKW + ++++K ++++
Sbjct: 68 LGIIILLFSAFVASAPSSTSPAKIVNSFISNHGTSLLKWLWSLSFKTTTKTAVPTKSMVK 127
Query: 58 FENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS 117
FENGY VETV++G+++G+ PY I+V +GEL +D NSNI +I+ LS YSR RLV GS
Sbjct: 128 FENGYSVETVLDGSKLGIEPYSIQVLSNGELLILDSQNSNIYQISSSLSLYSRPRLVTGS 187
Query: 118 FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS-NV 176
+GY GHVDG+ +AR N+PKG+T+DD+GN+YVADT+N AIRKI +AGVTTIAGGK
Sbjct: 188 PEGYPGHVDGRLRDARLNNPKGLTVDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVRG 247
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDI 236
G DGPSEDAKFSNDFDVVY+ +CSLLVIDRGN A+R+I L+ DDC QY S P I
Sbjct: 248 GGHVDGPSEDAKFSNDFDVVYLGSSCSLLVIDRGNQAIREIQLHFDDCADQYGSGFPLGI 307
Query: 237 LMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKE 296
++V AV GY+ +LQ+ T Q ++ IK + +P I+ ++
Sbjct: 308 AVLVAAVFFGYMLALLQRRLSSIVSYHTDQEVFEAVPDQDPIKPV----RPPLILTGDEQ 363
Query: 297 EPGWPSF-GQLIIDLSK---LALEAMGSMF----------LNFVPFRFRSSGTKGNLTPL 342
E SF G L I +S ++E MF LNF + S P+
Sbjct: 364 EKQEESFLGTLQIFISNAWVFSVELFSGMFPGLRKKQTVGLNFNHQETKHSAFSTTSWPI 423
Query: 343 RDTLRMPEDEAAPPVVQRQKS 363
+++ + + PPV R +
Sbjct: 424 QESFVIHNKDEPPPVESRNAT 444
>gi|218197942|gb|EEC80369.1| hypothetical protein OsI_22479 [Oryza sativa Indica Group]
Length = 517
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 160/469 (34%), Positives = 246/469 (52%), Gaps = 35/469 (7%)
Query: 27 AGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDG 86
A ++K L SL TR+ S A A +++++E GY VETV +G+++G+ P+ + V+ G
Sbjct: 48 ASAVVKQLWSLKSTATRTGSAA--AGRSMVKYEGGYAVETVFDGSKLGIEPHDVEVTPSG 105
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG 146
EL +D +NSN+ ++ PLS+YSR +LVAGS +G +GHVDG+ EA+ NHPKG T+DD+G
Sbjct: 106 ELLVLDSMNSNVYRVQLPLSRYSRPKLVAGSPEGLSGHVDGRLREAKMNHPKGFTVDDRG 165
Query: 147 NLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 206
N+YVAD +N+AIRKI D GVTTIAGGKS G DGPS+DAKFS DF++ Y+ +CSLLV
Sbjct: 166 NIYVADAMNMAIRKISDTGVTTIAGGKSMRGGHLDGPSDDAKFSTDFEIRYISSSCSLLV 225
Query: 207 IDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQ 266
IDRGN A+R+I+L+ DDCEYQY + P I ++ A GY+ +LQ+
Sbjct: 226 IDRGNQAIREIALHNDDCEYQYEAGFPLGIALLFAAGFFGYMLALLQRQ----VLGMVST 281
Query: 267 PSESEFNEETEIKELSNKEKPI------PIV-------ESMKEEPGWPSFGQLIIDLSKL 313
E + I + +KP+ P++ + EE + S G+LI
Sbjct: 282 ADEPQTPPRPSIASIPPYQKPLKPSLRPPLIPTEDQAGKHEAEEGFFTSIGKLIGGAKSS 341
Query: 314 ALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQV 373
A+E + + N P++++ +P DE PP+ R
Sbjct: 342 AVEIFSRKKRPTHQYHHHLQQQRANPWPVQESYAIPHDETPPPLDMR------------A 389
Query: 374 RTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKE 433
TP + +P K+ + Q+ E + + S
Sbjct: 390 ATPRKNYAFMTKEPEKVHHVCHGRPYFNGWDMQHAPQQQPEQQMYHQQHLQQHRQYSAGP 449
Query: 434 RTRHRQR-DKSGELVYGV---SGMEPKPVEMKPVDYDNLKFDHYNMRSK 478
+T + Q + + E+V+G + + VE+K V+Y + ++ Y MR +
Sbjct: 450 QTFYEQSCETTNEIVFGAVQEVDTKRRMVEIKAVNYGDTFYEQYGMRYR 498
>gi|115467514|ref|NP_001057356.1| Os06g0269300 [Oryza sativa Japonica Group]
gi|53793193|dbj|BAD54399.1| NHL repeat-containing protein-like [Oryza sativa Japonica Group]
gi|113595396|dbj|BAF19270.1| Os06g0269300 [Oryza sativa Japonica Group]
gi|215768552|dbj|BAH00781.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222635365|gb|EEE65497.1| hypothetical protein OsJ_20925 [Oryza sativa Japonica Group]
Length = 517
Score = 258 bits (659), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 160/469 (34%), Positives = 246/469 (52%), Gaps = 35/469 (7%)
Query: 27 AGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDG 86
A ++K L SL TR+ S A A +++++E GY VETV +G+++G+ P+ + V+ G
Sbjct: 48 ASAVVKQLWSLKSTATRTGSAA--AGRSMVKYEGGYAVETVFDGSKLGIEPHDVEVTPSG 105
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG 146
EL +D +NSN+ ++ PLS+YSR +LVAGS +G +GHVDG+ EA+ NHPKG T+DD+G
Sbjct: 106 ELLVLDSMNSNVYRVQLPLSRYSRPKLVAGSPEGLSGHVDGRLREAKMNHPKGFTVDDRG 165
Query: 147 NLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 206
N+YVAD +N+AIRKI D GVTTIAGGKS G DGPS+DAKFS DF++ Y+ +CSLLV
Sbjct: 166 NIYVADAMNMAIRKISDTGVTTIAGGKSMRGGHLDGPSDDAKFSTDFEIRYISSSCSLLV 225
Query: 207 IDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQ 266
IDRGN A+R+I+L+ DDCEYQY + P I ++ A GY+ +LQ+
Sbjct: 226 IDRGNQAIREIALHNDDCEYQYEAGFPLGIALLFAAGFFGYMLALLQRQ----VLGMVST 281
Query: 267 PSESEFNEETEIKELSNKEKPI------PIV-------ESMKEEPGWPSFGQLIIDLSKL 313
E + I + +KP+ P++ + EE + S G+LI
Sbjct: 282 ADEPQTPPRPSIASIPPYQKPLKPSLRPPLIPTEDQAGKHEAEEGFFTSIGKLIGGAKSS 341
Query: 314 ALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQV 373
A+E + + N P++++ +P DE PP+ R
Sbjct: 342 AVEIFSRKKRPTHQYHHHLQQQRANPWPVQESYAIPHDETPPPLDMR------------A 389
Query: 374 RTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKE 433
TP + +P K+ + Q+ E + + S
Sbjct: 390 ATPRKNYAFMTKEPEKVHHVRHGRPYFNGWDMQHAPQQQPEQQMYHQQHLQQHRQYSAGP 449
Query: 434 RTRHRQR-DKSGELVYGV---SGMEPKPVEMKPVDYDNLKFDHYNMRSK 478
+T + Q + + E+V+G + + VE+K V+Y + ++ Y MR +
Sbjct: 450 QTFYEQSCETTNEIVFGAVQEVDTKRRMVEIKAVNYGDTFYEQYGMRYR 498
>gi|297845436|ref|XP_002890599.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297336441|gb|EFH66858.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 165/436 (37%), Positives = 237/436 (54%), Gaps = 38/436 (8%)
Query: 33 HLTSLLKW----TTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGEL 88
H TSL+KW + ++++ + ++++FENGY VETV++G+++G+ PY ++V +GEL
Sbjct: 39 HGTSLMKWLWSLSFKTTTITAVSTKSMVKFENGYSVETVLDGSKLGIEPYSLQVLPNGEL 98
Query: 89 FAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNL 148
+D NSNI KI+ LS YSR RL+ GS +GY GHVDG+ +AR N+PKG+T+DD+GN+
Sbjct: 99 LILDSQNSNIYKISSSLSLYSRPRLITGSPEGYPGHVDGRLRDARLNNPKGLTVDDRGNI 158
Query: 149 YVADTLNLAIRKIGDAGVTTIAGGKS-NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVI 207
YVADT+N AIRKI +AGVTTIAGGK + G DGPSEDAKFSNDFDVVY+ +CSLLVI
Sbjct: 159 YVADTVNNAIRKISEAGVTTIAGGKMVHGGGHVDGPSEDAKFSNDFDVVYLGSSCSLLVI 218
Query: 208 DRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQP 267
DRGN A+R+I L+ DDC QY S P I ++V A+ GY+ +LQ+ T Q
Sbjct: 219 DRGNQAIREIQLHFDDCADQYGSGFPLGIAVLVAAIFFGYMLALLQRRLSSIISYHTDQE 278
Query: 268 SESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNF-- 325
++ IK + P + +EE + I ++ ++E MF
Sbjct: 279 VFEAVPDQDPIKPVRPPLIPTGDEQEKQEESFLGTLQIFISNVWVFSVELFSGMFPGLRK 338
Query: 326 ---VPFRFRSSGTKGNL-----TPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPG 377
V F F TK + P++++ + + PP+ + R P
Sbjct: 339 KQTVGFNFNHQETKHSAFSTTSWPIQESFVIHNKDEPPPI--------------ESRNPT 384
Query: 378 TGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRH 437
G YP M K A+ K L + R+ R AEF + + H
Sbjct: 385 PGKIYPFMS----KDAAEKMQQLR-QSRALYRSLDAEFLQEQQQQKHQQHHRRHHATIPH 439
Query: 438 ----RQRDKSGELVYG 449
R +KS E+V+G
Sbjct: 440 TVYERSSEKSNEIVFG 455
>gi|297841763|ref|XP_002888763.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297334604|gb|EFH65022.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 503
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 132/242 (54%), Positives = 172/242 (71%), Gaps = 3/242 (1%)
Query: 28 GPLIKHLTSLLKW--TTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSED 85
G + H +SL+KW + ++++K A ++++FENGY VETV +G+++G+ PY I V +
Sbjct: 34 GFISNHGSSLMKWLWSLKTTTKTTIATRSMVKFENGYSVETVFDGSKLGIEPYSIEVLPN 93
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
GEL +D NSNI KI+ LS YSR RLV GS +GY GHVDG+ +A+ NHPKG+T+DD+
Sbjct: 94 GELLILDSENSNIYKISSSLSLYSRPRLVTGSPEGYPGHVDGRLRDAKLNHPKGLTVDDR 153
Query: 146 GNLYVADTLNLAIRKIGDAGVTTIAGGKS-NVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
GN+YVADT+N AIRKI + GVTTIAGGK+ G DGPSEDAKFSNDFDVVYV +CSL
Sbjct: 154 GNIYVADTVNNAIRKISEGGVTTIAGGKTVRNGGHVDGPSEDAKFSNDFDVVYVGSSCSL 213
Query: 205 LVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRT 264
LVIDRGN A+R+I L+ DDC YQY S P I ++V A GY+ +LQ+ G S
Sbjct: 214 LVIDRGNKAIREIQLHFDDCAYQYGSGFPLGIAVLVAAGFFGYMLALLQRRVGSIVSSHN 273
Query: 265 QQ 266
Q
Sbjct: 274 DQ 275
>gi|30698020|ref|NP_177185.3| NHL domain-containing protein [Arabidopsis thaliana]
gi|332196920|gb|AEE35041.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 509
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 132/242 (54%), Positives = 172/242 (71%), Gaps = 3/242 (1%)
Query: 28 GPLIKHLTSLLKW--TTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSED 85
G + H +SL+KW + ++++K A ++++FENGY VETV +G+++G+ PY I V +
Sbjct: 34 GFISNHGSSLMKWLWSLKTTTKTTIATRSMVKFENGYSVETVFDGSKLGIEPYSIEVLPN 93
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
GEL +D NSNI KI+ LS YSR RLV GS +GY GHVDG+ +A+ NHPKG+T+DD+
Sbjct: 94 GELLILDSENSNIYKISSSLSLYSRPRLVTGSPEGYPGHVDGRLRDAKLNHPKGLTVDDR 153
Query: 146 GNLYVADTLNLAIRKIGDAGVTTIAGGKS-NVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
GN+YVADT+N AIRKI + GVTTIAGGK+ G DGPSEDAKFSNDFDVVYV +CSL
Sbjct: 154 GNIYVADTVNNAIRKISEGGVTTIAGGKTVRNGGHVDGPSEDAKFSNDFDVVYVGSSCSL 213
Query: 205 LVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRT 264
LVIDRGN A+R+I L+ DDC YQY S P I ++V A GY+ +LQ+ G S
Sbjct: 214 LVIDRGNKAIREIQLHFDDCAYQYGSGFPLGIAVLVAAGFFGYMLALLQRRVGSIVSSHN 273
Query: 265 QQ 266
Q
Sbjct: 274 DQ 275
>gi|413923863|gb|AFW63795.1| hypothetical protein ZEAMMB73_706161 [Zea mays]
Length = 515
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/355 (40%), Positives = 216/355 (60%), Gaps = 23/355 (6%)
Query: 20 FQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYK 79
F ++AA A ++K L W+ +S++K +++++E GY VETV +G+++G+ PY
Sbjct: 42 FLSNAASA--VVKRL-----WSLKSTTKTASGGKSMVKYEGGYTVETVFDGSKLGIEPYS 94
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ V++ GEL +D +NSNI ++ PLS+YSR +LVAGS +G+ GHVDGK EAR NHPKG
Sbjct: 95 VEVTQSGELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKG 154
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 199
T+DDKGN+YVAD +N+AIRKI D GVTTIAGGKS G DGPS++AKFS DF+V Y+
Sbjct: 155 FTVDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIG 214
Query: 200 PTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPF 259
+CSLLVIDRGN A+R+I L+ DDC YQY + P + +++ A GY+ +LQ+
Sbjct: 215 SSCSLLVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLGM 274
Query: 260 FFSRTQQPSESEFNEETEI------KELSNKEKPIPIVES------MKEEPGWPSFGQLI 307
+ + +Q ES + K L + +P P++ + +EE + S G+L+
Sbjct: 275 YSNGDEQEDESPVKAKLSSIPPPYQKPLKSSLRP-PLIPNDDEPVKQEEEGLFTSIGKLV 333
Query: 308 IDLSKLALEAMGSMF--LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 360
E +G+ F V G + P++++ +P DE PP+ R
Sbjct: 334 GVARSSVAEIVGATFSRKKRVNVHHHQLGRPTSW-PVQESYAIPRDETPPPLDTR 387
>gi|293331401|ref|NP_001168100.1| uncharacterized protein LOC100381837 precursor [Zea mays]
gi|223946007|gb|ACN27087.1| unknown [Zea mays]
gi|413938717|gb|AFW73268.1| hypothetical protein ZEAMMB73_298159 [Zea mays]
Length = 510
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 147/355 (41%), Positives = 215/355 (60%), Gaps = 22/355 (6%)
Query: 20 FQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYK 79
F ++AA A ++K L W+ +S++K +++++E GY VETV +G+++G+ PY
Sbjct: 42 FLSNAASA--VVKRL-----WSLKSTTKIASGGKSMVKYEGGYTVETVFDGSKLGIEPYS 94
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ V++ GEL +D +NSNI ++ PLS+YSR +LVAGS +G+ GHVDGK EAR NHPKG
Sbjct: 95 VEVTQGGELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKG 154
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 199
T+DD+GN+YVAD +N+AIRKI D GVTTIAGGKS G DGPS++AKFS DF+V Y+
Sbjct: 155 FTVDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIG 214
Query: 200 PTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPF 259
+CSLLVIDRGN A+R+I L+ DDC YQY + P + +++ A GY+ +LQ+
Sbjct: 215 SSCSLLVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLGM 274
Query: 260 FFSRTQQPSESEFNEE-TEI-----KELSNKEKPIPI------VESMKEEPGWPSFGQLI 307
+ + +Q ES + T I K L P+ I V+ +EE + S G+L+
Sbjct: 275 YSNGDEQDHESPVKAKLTSIPPPYQKPLKPYVHPLLIPNDDEPVKQEEEEGLFTSIGKLV 334
Query: 308 IDLSKLALEAMGSMF--LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 360
E +G+ F V G + P +++ +P DE PP+ R
Sbjct: 335 GGAKSSIAEIIGAAFSRKKRVNVHHHQLGRPTSW-PAQESYAIPRDETPPPLDTR 388
>gi|242066098|ref|XP_002454338.1| hypothetical protein SORBIDRAFT_04g029020 [Sorghum bicolor]
gi|241934169|gb|EES07314.1| hypothetical protein SORBIDRAFT_04g029020 [Sorghum bicolor]
Length = 518
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 207/336 (61%), Gaps = 16/336 (4%)
Query: 40 WTTRSSSKAPQADG-NVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNI 98
W+ +S++K + G +++++E GY VETV +G+++G+ PY + V++ GEL +D +NSNI
Sbjct: 55 WSLKSTTKTAASGGKSMVKYEGGYTVETVFDGSKLGIEPYSVEVTQGGELLVMDSMNSNI 114
Query: 99 VKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAI 158
++ PLS+YSR +LVAGS +G+ GHVDGK EAR NHPKG T+DD+GN+YVAD +N+AI
Sbjct: 115 YRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPKGFTVDDRGNIYVADAMNMAI 174
Query: 159 RKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
RKI D GVTTIAGGKS G DGPS++AKFS DF+V YV +CSLLVIDRGN A+R+I
Sbjct: 175 RKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYVGSSCSLLVIDRGNQAIREIQ 234
Query: 219 LNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEI 278
L+ DDC YQY + P + +++ A GY+ +LQ+ + + +Q ES +
Sbjct: 235 LHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLGMYSNGDEQEHESPVKAKLSS 294
Query: 279 ------KELSNKEKP--IPI----VESMKEEPGWPSFGQLIIDLSKLALEAMGSMF--LN 324
K L +P IP V+ +EE + S G+L+ E +G+ F
Sbjct: 295 IPPPYQKPLKPSLRPPLIPTDDEPVKQEEEEGLFTSIGKLVGGAKSSIAEIIGAAFSRKK 354
Query: 325 FVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 360
V G + P++++ +P DE PP+ R
Sbjct: 355 RVNIHHHQLGRPTSW-PVQESYAIPRDETPPPLDTR 389
>gi|413923861|gb|AFW63793.1| hypothetical protein ZEAMMB73_706161 [Zea mays]
Length = 516
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 146/356 (41%), Positives = 218/356 (61%), Gaps = 24/356 (6%)
Query: 20 FQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADG-NVLQFENGYLVETVIEGNEIGVVPY 78
F ++AA A ++K L W+ +S++K + G +++++E GY VETV +G+++G+ PY
Sbjct: 42 FLSNAASA--VVKRL-----WSLKSTTKTAASGGKSMVKYEGGYTVETVFDGSKLGIEPY 94
Query: 79 KIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPK 138
+ V++ GEL +D +NSNI ++ PLS+YSR +LVAGS +G+ GHVDGK EAR NHPK
Sbjct: 95 SVEVTQSGELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPK 154
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 198
G T+DDKGN+YVAD +N+AIRKI D GVTTIAGGKS G DGPS++AKFS DF+V Y+
Sbjct: 155 GFTVDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYI 214
Query: 199 RPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGP 258
+CSLLVIDRGN A+R+I L+ DDC YQY + P + +++ A GY+ +LQ+
Sbjct: 215 GSSCSLLVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLG 274
Query: 259 FFFSRTQQPSESEFNEETEI------KELSNKEKPIPIVES------MKEEPGWPSFGQL 306
+ + +Q ES + K L + +P P++ + +EE + S G+L
Sbjct: 275 MYSNGDEQEDESPVKAKLSSIPPPYQKPLKSSLRP-PLIPNDDEPVKQEEEGLFTSIGKL 333
Query: 307 IIDLSKLALEAMGSMF--LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 360
+ E +G+ F V G + P++++ +P DE PP+ R
Sbjct: 334 VGVARSSVAEIVGATFSRKKRVNVHHHQLGRPTSW-PVQESYAIPRDETPPPLDTR 388
>gi|413938718|gb|AFW73269.1| hypothetical protein ZEAMMB73_298159 [Zea mays]
Length = 511
Score = 251 bits (640), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 148/356 (41%), Positives = 217/356 (60%), Gaps = 23/356 (6%)
Query: 20 FQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADG-NVLQFENGYLVETVIEGNEIGVVPY 78
F ++AA A ++K L W+ +S++K + G +++++E GY VETV +G+++G+ PY
Sbjct: 42 FLSNAASA--VVKRL-----WSLKSTTKIAASGGKSMVKYEGGYTVETVFDGSKLGIEPY 94
Query: 79 KIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPK 138
+ V++ GEL +D +NSNI ++ PLS+YSR +LVAGS +G+ GHVDGK EAR NHPK
Sbjct: 95 SVEVTQGGELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPK 154
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 198
G T+DD+GN+YVAD +N+AIRKI D GVTTIAGGKS G DGPS++AKFS DF+V Y+
Sbjct: 155 GFTVDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYI 214
Query: 199 RPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGP 258
+CSLLVIDRGN A+R+I L+ DDC YQY + P + +++ A GY+ +LQ+
Sbjct: 215 GSSCSLLVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLG 274
Query: 259 FFFSRTQQPSESEFNEE-TEI-----KELSNKEKPIPI------VESMKEEPGWPSFGQL 306
+ + +Q ES + T I K L P+ I V+ +EE + S G+L
Sbjct: 275 MYSNGDEQDHESPVKAKLTSIPPPYQKPLKPYVHPLLIPNDDEPVKQEEEEGLFTSIGKL 334
Query: 307 IIDLSKLALEAMGSMF--LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 360
+ E +G+ F V G + P +++ +P DE PP+ R
Sbjct: 335 VGGAKSSIAEIIGAAFSRKKRVNVHHHQLGRPTSW-PAQESYAIPRDETPPPLDTR 389
>gi|326512916|dbj|BAK03365.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326529495|dbj|BAK04694.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326529987|dbj|BAK08273.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 503
Score = 251 bits (640), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 116/230 (50%), Positives = 167/230 (72%), Gaps = 2/230 (0%)
Query: 27 AGPLIKHLTSLLK--WTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSE 84
+G L +++LK W+ +S++K +++++E GY VETV +G+++G+ PY + V++
Sbjct: 36 SGFLSNAASAVLKRLWSLKSTAKTASGAKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQ 95
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
GEL +D +NSNI +I PLS+YSR +LVAGS +G GH+DG+ EA+ NHPKG T+D
Sbjct: 96 GGELLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGIPGHIDGRLREAKMNHPKGFTVDG 155
Query: 145 KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
+GN+YVAD +N+A+RKI D GVTTIAGGKS+ G DGPS+DAKFS DF+V Y+ +CSL
Sbjct: 156 RGNIYVADAMNMAVRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIASSCSL 215
Query: 205 LVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQ 254
LVIDRGN A+R+I LN DDC YQY + P + +++ A GY+ +LQ+
Sbjct: 216 LVIDRGNQAIREIQLNFDDCVYQYEAGFPLGVALLLAAGFFGYMLALLQR 265
>gi|357137830|ref|XP_003570502.1| PREDICTED: uncharacterized protein LOC100839308 [Brachypodium
distachyon]
Length = 495
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 139/356 (39%), Positives = 210/356 (58%), Gaps = 30/356 (8%)
Query: 27 AGPLIKHLTSLLK--WTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSE 84
+G L +++LK W+ +S++K +++++E GY VETV +G+++G+ PY + V++
Sbjct: 35 SGFLSNAASAVLKRLWSLKSTTKTATGAKSMVKYEGGYTVETVFDGSKLGIEPYAVEVTQ 94
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
GEL +D +NSNI +I PLS+YSR +LVAGS +GY GHVDG+ EA+ NHPKG T+D
Sbjct: 95 GGELLVMDSMNSNIYRIALPLSRYSRPKLVAGSPEGYPGHVDGRLREAKMNHPKGFTVDG 154
Query: 145 KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
+GN+YVAD +N+A+RKI D GVTTIAGGKS+ G DGPS+DAKFS DF+V Y+ +CSL
Sbjct: 155 RGNIYVADAMNMAVRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIASSCSL 214
Query: 205 LVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRT 264
LVIDRGN A+R+I L+ DDC YQY + P + +++ A GY+ +LQ+
Sbjct: 215 LVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQR----RVLGMV 270
Query: 265 QQPSESEFNEETEIKELSNKEKPIPIVESMK---------------EEPGWPSFGQLIID 309
E + + I + + P+ S++ EE + S G+LI
Sbjct: 271 STEDELQTPPKASIASIPPYQIQKPLKSSLRPPLIPNEDDSEKQEVEEGFFSSVGKLIGG 330
Query: 310 LSKLALEAMGSMF-----LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 360
E +G+ F LN ++ + N P++++ +P E PP+ R
Sbjct: 331 AKSSVAEIIGAAFSRKKRLNTHHYQ----QARANSWPVQESYAIPHGETPPPLDTR 382
>gi|357485963|ref|XP_003613269.1| hypothetical protein MTR_5g034750 [Medicago truncatula]
gi|355514604|gb|AES96227.1| hypothetical protein MTR_5g034750 [Medicago truncatula]
Length = 589
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 160/372 (43%), Positives = 210/372 (56%), Gaps = 57/372 (15%)
Query: 51 ADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLS---- 106
A ++++FE+GY VETV +G+++G+ PY + V +GEL +D NSNI KI+ LS
Sbjct: 73 AGKSMMKFESGYNVETVFDGSKLGIEPYAVEVLSNGELLILDSENSNIYKISSSLSLCKC 132
Query: 107 --------------QYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVAD 152
SR +LVAGS +GY+GHVDGK EAR NHPKG+T+DD+GN+YVAD
Sbjct: 133 ANVITSPMMSLVAPTNSRPKLVAGSAEGYSGHVDGKLREARMNHPKGITVDDRGNIYVAD 192
Query: 153 TLNLAIRKIGDAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGN 211
+N+AIRKI D+GVTTIAGGK S G DGPSE+AKFSNDFDVVYV +CSLLVIDRGN
Sbjct: 193 IMNMAIRKISDSGVTTIAGGKLSRGGGHVDGPSEEAKFSNDFDVVYVGSSCSLLVIDRGN 252
Query: 212 AALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSE-- 269
A+R+I L DDC YQY S P I M++GA GY+ +LQ+ S+ +E
Sbjct: 253 QAIREIQLRFDDCAYQYESGFPLGIAMLLGAGFFGYMLALLQRRLSTIVASQDMTLAESS 312
Query: 270 --SEFNEETEIKELSNKEKP-IPIV-ESMKEEPG-WPSFGQLIIDLSKLALEAMGSMFLN 324
S+F+ K L + P IP ES K+E G + S G+L+ + +E MG
Sbjct: 313 AMSDFSPSPYQKPLKSVRPPLIPSEDESYKQEEGLFASIGKLLTNAGASVVEIMG----- 367
Query: 325 FVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQV----------- 373
FR + + PL PE RQ +A P+ E+ +
Sbjct: 368 ---FRKKPQSYEFQSQPL---FHQPE---------RQINAWPVQESFVITNEDEPPSIDP 412
Query: 374 RTPGTGDKYPDM 385
RTP YP M
Sbjct: 413 RTPTPKKTYPFM 424
>gi|30698018|ref|NP_850974.1| NHL domain-containing protein [Arabidopsis thaliana]
gi|17065224|gb|AAL32766.1| Unknown protein [Arabidopsis thaliana]
gi|21387163|gb|AAM47985.1| unknown protein [Arabidopsis thaliana]
gi|332196919|gb|AEE35040.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 447
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 124/213 (58%), Positives = 155/213 (72%), Gaps = 1/213 (0%)
Query: 55 VLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV 114
+++FENGY VETV +G+++G+ PY I V +GEL +D NSNI KI+ LS YSR RLV
Sbjct: 1 MVKFENGYSVETVFDGSKLGIEPYSIEVLPNGELLILDSENSNIYKISSSLSLYSRPRLV 60
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS 174
GS +GY GHVDG+ +A+ NHPKG+T+DD+GN+YVADT+N AIRKI + GVTTIAGGK+
Sbjct: 61 TGSPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKT 120
Query: 175 -NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISP 233
G DGPSEDAKFSNDFDVVYV +CSLLVIDRGN A+R+I L+ DDC YQY S P
Sbjct: 121 VRNGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFDDCAYQYGSGFP 180
Query: 234 TDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQ 266
I ++V A GY+ +LQ+ G S Q
Sbjct: 181 LGIAVLVAAGFFGYMLALLQRRVGSIVSSHNDQ 213
>gi|357118152|ref|XP_003560822.1| PREDICTED: uncharacterized protein LOC100834977 [Brachypodium
distachyon]
Length = 544
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 122/243 (50%), Positives = 169/243 (69%), Gaps = 2/243 (0%)
Query: 27 AGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDG 86
A ++K L SL K TTR+ + A +++++E Y VETV +G+++G+ PY + V+ G
Sbjct: 48 ASAVVKQLWSL-KSTTRTVASTAAAGRSMVKYEGWYEVETVFDGSKLGIEPYAVEVTPAG 106
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG 146
EL +D +NSNI ++ PLS+YSR +LV+GS +G +GHVDG+ EAR NHPKG T+DD+G
Sbjct: 107 ELLVLDSMNSNIYRVQLPLSRYSRPKLVSGSPEGLSGHVDGRLREARMNHPKGFTVDDRG 166
Query: 147 NLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 206
N+YVAD +N+AIRKI D GVTTIAGGKS G DGPS+DAKFS DF++ Y+ +CSLLV
Sbjct: 167 NIYVADAMNMAIRKISDTGVTTIAGGKSVRGGHTDGPSDDAKFSTDFEIRYISSSCSLLV 226
Query: 207 IDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQG-FGPFFFSRTQ 265
IDRGN A+R+I L DDCEYQ+ + P + ++ A GY+ +LQ+ FG +
Sbjct: 227 IDRGNQAIREIPLQLDDCEYQHEAGFPLGVALLFAAGFFGYMLALLQRRVFGMVSTTEEH 286
Query: 266 QPS 268
PS
Sbjct: 287 HPS 289
>gi|226530993|ref|NP_001143690.1| uncharacterized protein LOC100276420 precursor [Zea mays]
gi|195624742|gb|ACG34201.1| hypothetical protein [Zea mays]
Length = 515
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 125/252 (49%), Positives = 178/252 (70%), Gaps = 8/252 (3%)
Query: 20 FQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADG-NVLQFENGYLVETVIEGNEIGVVPY 78
F ++AA A ++K L W+ +S++K + G +++++E GY VETV +G+++G+ PY
Sbjct: 42 FLSNAASA--VVKRL-----WSLKSTTKTAASGGKSMVKYEGGYTVETVFDGSKLGIEPY 94
Query: 79 KIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPK 138
+ V++ GEL +D +NSNI ++ PLS+YSR +LVAGS +G+ GHVDGK EAR NHPK
Sbjct: 95 SVEVTQSGELLVMDSMNSNIYRMALPLSRYSRPKLVAGSPEGFPGHVDGKLREARMNHPK 154
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 198
G T+DDKGN+YVAD +N+AIRKI D GVTTIAGGKS G DGPS++AKFS DF+V Y+
Sbjct: 155 GFTVDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYI 214
Query: 199 RPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGP 258
+CSLLVIDRGN A+R+I L+ DDC YQY + P + +++ A GY+ +LQ+
Sbjct: 215 GSSCSLLVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLG 274
Query: 259 FFFSRTQQPSES 270
+ + +Q ES
Sbjct: 275 MYSNGDEQEDES 286
>gi|356573394|ref|XP_003554846.1| PREDICTED: uncharacterized protein LOC100793916 [Glycine max]
Length = 528
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 147/386 (38%), Positives = 224/386 (58%), Gaps = 36/386 (9%)
Query: 35 TSLLKWTTRSSSKAPQADGNV------LQFENGYLVETVIEGNEIGVVPYKIRVSEDGEL 88
++L+KW S K+ G V ++FE+GY VET+ +G+++G+ P+ +++S +GE
Sbjct: 41 SALIKWLWSLSVKSTTKPGRVQHSRSMVKFESGYSVETIFDGSQLGIEPHSVKISPNGEF 100
Query: 89 FAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNL 148
+D NSNI K++ +S+YSR +L+AGS +G GH+DG+P EAR NHPKG+T+DD+GN+
Sbjct: 101 LVLDSENSNIYKVSGSMSRYSRPKLLAGSAEGNIGHIDGRPREARMNHPKGLTVDDRGNI 160
Query: 149 YVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRPTCSLLVI 207
Y+ADTLN+AIRKI D GVTTIAGGK AG DGPSEDAKFSNDFDVVYV +CSLLV+
Sbjct: 161 YIADTLNMAIRKISDEGVTTIAGGKRGYAGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVV 220
Query: 208 DRGNAALRQISLNQDDC---EYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFS-R 263
DRGN A+R+I L+QDDC + NS + I+++V A GY+ +LQ F S
Sbjct: 221 DRGNHAIREIQLHQDDCTSYDEDDNSFN-LGIVVLVAAAFFGYMLALLQWRVRAMFSSPD 279
Query: 264 TQQPSESEFNEETEIKELSNKEKPIPIVESM--KEEPGWPSFGQLIIDLSKLALEAMGSM 321
+ P ++ + S + IP + ++E + S G+L ++ E +G +
Sbjct: 280 VRAPFVAQQMQRPPPTTKSVRPPLIPNEDEFEKQDEGFFVSLGRLFLNSGTCMSEILGGL 339
Query: 322 F---------LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQ 372
F + +++ + N P++++ +P+++ PP + +
Sbjct: 340 FSGSKRKSLQYHQYQQQYQYANRYPNAWPMQESFVIPDEDEPPPSL-------------E 386
Query: 373 VRTPGTGDKYPDMKPPKIKSASFKDS 398
+TP + YP M K FK S
Sbjct: 387 TKTPTPRETYPIMTKELEKPQHFKPS 412
>gi|3176691|gb|AAC18814.1| Contains homology to serine/threonine protein kinase gb|X99618 from
Mycobacterium tuberculosis. ESTs gb|F14403, gb|F14404,
and gb|N96730 come from this gene [Arabidopsis thaliana]
Length = 493
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 127/235 (54%), Positives = 161/235 (68%), Gaps = 12/235 (5%)
Query: 44 SSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITP 103
++S A A ++++FENGY VETV +G+++G+ PY I V +GEL +D NSNI KI+
Sbjct: 25 ANSPATIATRSMVKFENGYSVETVFDGSKLGIEPYSIEVLPNGELLILDSENSNIYKISS 84
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD 163
LS YSR RLV GS +GY GHVDG+ +A+ NHPKG+T+DD+GN+YVADT+N AIRKI +
Sbjct: 85 SLSLYSRPRLVTGSPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISE 144
Query: 164 AGVTTIAGGKS-NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
GVTTIAGGK+ G DGPSEDAKFSNDFDVVYV +CSLLVIDRGN A+R+I L+ D
Sbjct: 145 GGVTTIAGGKTVRNGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIREIQLHFD 204
Query: 223 DCEYQYNSISPTD-----------ILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQ 266
DC YQY S P I ++V A GY+ +LQ+ G S Q
Sbjct: 205 DCAYQYGSGFPLGKLTKVPNFFLWIAVLVAAGFFGYMLALLQRRVGSIVSSHNDQ 259
>gi|224140207|ref|XP_002323476.1| predicted protein [Populus trichocarpa]
gi|222868106|gb|EEF05237.1| predicted protein [Populus trichocarpa]
Length = 453
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 158/468 (33%), Positives = 246/468 (52%), Gaps = 72/468 (15%)
Query: 54 NVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
++++FE GY VETV +G+++G+ PY + VS GEL +D NSNI KI+ S+Y R +L
Sbjct: 7 SMMKFEGGYTVETVFDGSKLGIEPYSVEVSPSGELLLLDSENSNIYKISTQFSKYGRPKL 66
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGG 172
+ GSF+GY GHVDGK +AR NHPKG+T+D+ GN+YVADT+N+AIRKI D GVTTIAGG
Sbjct: 67 ITGSFEGYPGHVDGKLRDARMNHPKGLTVDEGGNIYVADTMNMAIRKISTDGGVTTIAGG 126
Query: 173 K-SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNS- 230
K + G DGPSEDAKFSNDFDVVY +CSLL+IDRG+ A+R+I L+ DDC Y ++
Sbjct: 127 KWARGGGHVDGPSEDAKFSNDFDVVYSASSCSLLIIDRGSQAIREIQLHDDDCNYPHDDC 186
Query: 231 ---------ISPTDILMVVGAVLVGYVTCMLQQGFGPFFFS-RTQQPSESEFNE---ETE 277
I+ + ++V A GY+ +LQ+ F S R + P ++ +
Sbjct: 187 FHLDLDNILINIAGLAVLVAAGFFGYMLALLQRRVQILFSSTRGKGPPKAPYQSPPMSVR 246
Query: 278 IKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNF--VPFRFRSSGT 335
+ ++++P+ E + + S G+LI++ S E G +F F P ++
Sbjct: 247 PPFIPDEDEPVKSDEGL-----FGSLGRLILNTSSTVGEIFGGIFSGFRRKPIHYQFQQH 301
Query: 336 ------KGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDKYPDMKPPK 389
N P++D+ +P DE PP + + R+P + YP
Sbjct: 302 YQQPLKHSNTWPVQDSFVIP-DEDEPPSI-------------ETRSPTSQKTYP------ 341
Query: 390 IKSASFKDSSLSTKHRSSKRQEYAEFYGSG---------------EVPPPYSKSKSQKER 434
F + H + Q Y +G G + + + +
Sbjct: 342 -----FMTKDVEQNHHLEQNQGYYSNWGGGYHQQQQQQMHLQRYKQQQQQHHRHYMPNPK 396
Query: 435 TRHRQRDKSGELVYGV---SGMEPKPVEMKPVDYDNLKFDHYNMRSKY 479
T + + ++ E+V+G + V +K VDY + +++H+N+R ++
Sbjct: 397 TYYEKSCETNEIVFGAVQEQNGRREAVVIKAVDYGDPRYNHHNIRPRF 444
>gi|147860395|emb|CAN80443.1| hypothetical protein VITISV_043282 [Vitis vinifera]
Length = 527
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 119/199 (59%), Positives = 155/199 (77%), Gaps = 3/199 (1%)
Query: 34 LTSLLKW--TTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAV 91
+++L+KW + +S++ + + + FE+GY VETV +G+++G+ PY + VS GEL +
Sbjct: 22 VSALVKWLWSLKSTTNTAISSXSKMNFEDGYTVETVFDGSKLGIEPYSLEVSTSGELLVL 81
Query: 92 DEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVA 151
D NSNI KI+ PLS+YSR +LVAGS GY+GHVDGK EAR NHPKG+TMD++GN+Y+A
Sbjct: 82 DSENSNIYKISSPLSRYSRPKLVAGSPDGYSGHVDGKLREARMNHPKGLTMDERGNIYIA 141
Query: 152 DTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRPTCSLLVIDRG 210
DT+N+AIRKI DAGVTTIAGGK G DGPSEDAKFSNDFDVVY+ +CSLLVIDRG
Sbjct: 142 DTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGSSCSLLVIDRG 201
Query: 211 NAALRQISLNQDDCEYQYN 229
N A+R+I L+ +DC YQYN
Sbjct: 202 NQAIREIQLHYEDCAYQYN 220
>gi|293332279|ref|NP_001170567.1| uncharacterized protein LOC100384592 precursor [Zea mays]
gi|238006082|gb|ACR34076.1| unknown [Zea mays]
Length = 505
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 139/350 (39%), Positives = 204/350 (58%), Gaps = 24/350 (6%)
Query: 27 AGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDG 86
A + K L SL T++++ A A +++++E GY V+TV +G+++G+ P+ +++ G
Sbjct: 45 ASAVAKQLWSLKSAATKTAT-ASVAGRSMVRYEGGYAVDTVFDGSKLGIEPHAAQINRAG 103
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG 146
+L +D +NSNI ++ PLS YSR +L+AGS +G +GHVDG+ EAR NHPKG T+DD+G
Sbjct: 104 DLLLLDSINSNIYRVQLPLSPYSRPKLLAGSPEGLSGHVDGRLREARMNHPKGFTVDDRG 163
Query: 147 NLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGP--SEDAKFSNDFDVVYVRPTCSL 204
N+YVAD +N+AIRKI D GVTTIAGGKS G+ D P S+DAKFS DF+V Y+ TCSL
Sbjct: 164 NIYVADAMNMAIRKISDTGVTTIAGGKSIRGGYIDEPSVSDDAKFSTDFEVQYISSTCSL 223
Query: 205 LVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGF--GPFFFS 262
LVIDRGN A+R+I LN DDC YQY + P ++ A GY+ +LQ P +
Sbjct: 224 LVIDRGNQAIREIPLNDDDCAYQYEAGFPLGFALLCAAGFFGYMLALLQHRLLGMPSTIN 283
Query: 263 RTQQPSESEFNEETEIKELSNKEKP---------IPIVESMKEEPG---WPSFGQLIIDL 310
+ Q P I +S +KP IP E+ ++EP + S G+L+
Sbjct: 284 KPQAP------PRPSIASISPYQKPLNPYVRQPFIPREETARQEPEEGFFTSAGKLVGGA 337
Query: 311 SKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 360
E G + + + N P++++ +P DE P + R
Sbjct: 338 KSSVAEVFG-FKKKSLSNPYHQQQRRANPWPVQESYAIPHDEPPPALDTR 386
>gi|242095458|ref|XP_002438219.1| hypothetical protein SORBIDRAFT_10g009760 [Sorghum bicolor]
gi|241916442|gb|EER89586.1| hypothetical protein SORBIDRAFT_10g009760 [Sorghum bicolor]
Length = 507
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 130/317 (41%), Positives = 186/317 (58%), Gaps = 17/317 (5%)
Query: 56 LQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA 115
+++E GY V+TV +G+++G+ P+ + ++ G+L +D +NSNI ++ PLS YSR +L+A
Sbjct: 73 VRYEGGYAVDTVFDGSKLGIEPHAVEITPAGDLLVLDSINSNIYRVQLPLSPYSRPKLLA 132
Query: 116 GSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 175
GS +G +GHVDG+ EAR NHP G T+DDKGN+YVAD +N+AIRKI D GVTTIAGGKS
Sbjct: 133 GSPEGLSGHVDGRLREARMNHPNGFTVDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSI 192
Query: 176 VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD 235
G DGPS+DAKFS DF++ Y+ +CSLLVIDRGN A+R+I LN DDC YQY + P
Sbjct: 193 RGGHIDGPSDDAKFSTDFEIQYISSSCSLLVIDRGNQAIREIPLNDDDCAYQYETGFPLG 252
Query: 236 ILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKP-------- 287
++ A GY+ +LQ+ F E + I + +KP
Sbjct: 253 FALLCAAGFFGYMLALLQRR----LFGMASTTDEPQAPPRPSIASIPPYQKPLNPYVRQP 308
Query: 288 -IPIVESMK---EEPGWPSFGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLR 343
IP E+ K EE + S G+LI E G + + + + N P++
Sbjct: 309 FIPREETAKQETEEGFFTSAGKLIGGAKSSVAEIFG-LKKKRLSNPYHQQQRRANPWPVQ 367
Query: 344 DTLRMPEDEAAPPVVQR 360
++ +P DE P + R
Sbjct: 368 ESYAIPHDEHPPALDTR 384
>gi|413953662|gb|AFW86311.1| hypothetical protein ZEAMMB73_666053 [Zea mays]
Length = 506
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 136/344 (39%), Positives = 203/344 (59%), Gaps = 12/344 (3%)
Query: 27 AGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDG 86
A + K L SL T++++ A A +++++E GY V+TV +G+++G+ P+ +++ G
Sbjct: 45 ASAVAKQLWSLKSAATKTAT-ASVAGRSMVRYEGGYAVDTVFDGSKLGIEPHAAQINRAG 103
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG 146
+L +D +NSNI ++ PLS YSR +L+AGS +G +GHVDG+ EAR NHPKG T+DD+G
Sbjct: 104 DLLLLDSINSNIYRVQLPLSPYSRPKLLAGSPEGLSGHVDGRLREARMNHPKGFTVDDRG 163
Query: 147 NLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGP--SEDAKFSNDFDVVYVRPTCSL 204
N+YVAD +N+AIRKI D GVTTIAGGKS G+ D P S+DAKFS DF+V Y+ TCSL
Sbjct: 164 NIYVADAMNMAIRKISDTGVTTIAGGKSIRGGYIDEPSVSDDAKFSTDFEVQYISSTCSL 223
Query: 205 LVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGF--GPFFFS 262
LVIDRGN A+R+I LN DDC YQY + P ++ A GY+ +LQ P +
Sbjct: 224 LVIDRGNQAIREIPLNDDDCAYQYEAGFPLGFALLCAAGFFGYMLALLQHRLLGMPSTIN 283
Query: 263 RTQQPSESEFNEETEIKELSN---KEKPIPIVESMKEEPG---WPSFGQLIIDLSKLALE 316
+ Q P ++ N ++ IP E+ ++EP + S G+L+ E
Sbjct: 284 KPQAPPRPSIASIPPYQKPLNPYVRQPFIPREETARQEPEEGFFTSAGKLVGGAKSSVAE 343
Query: 317 AMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 360
G + + + N P++++ +P DE P + R
Sbjct: 344 VFG-FKKKSLSNPYHQQQRRANPWPVQESYAIPHDEPPPALDTR 386
>gi|195621162|gb|ACG32411.1| hypothetical protein [Zea mays]
gi|219884599|gb|ACL52674.1| unknown [Zea mays]
gi|413944257|gb|AFW76906.1| hypothetical protein ZEAMMB73_849037 [Zea mays]
Length = 508
Score = 237 bits (605), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 107/201 (53%), Positives = 148/201 (73%)
Query: 54 NVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
+++++E GY V+TV +G+++G+ P+ + ++ G L +D +NSNI ++ PLS YSR +L
Sbjct: 68 SMVRYEGGYAVDTVFDGSKLGIEPHAVEITPAGNLLVLDSINSNIYRVQLPLSPYSRPKL 127
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK 173
+AGS +G +GHVDG+ EAR NHPKG T+DD+GN+YVAD +N+AIRKI D GVTTIAGGK
Sbjct: 128 LAGSPKGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADGMNMAIRKISDTGVTTIAGGK 187
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISP 233
S G DGPS+DAKFS DF++ Y+ +CSLLVIDRGN A+R+I LN DDC YQY + P
Sbjct: 188 SIRGGHIDGPSDDAKFSTDFEIKYISSSCSLLVIDRGNQAIREIPLNDDDCAYQYEAGFP 247
Query: 234 TDILMVVGAVLVGYVTCMLQQ 254
++ A GY+ MLQ+
Sbjct: 248 LGFALLCAAGFFGYMLAMLQR 268
>gi|9369390|gb|AAF87138.1|AC002423_3 T23E23.5 [Arabidopsis thaliana]
Length = 493
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 151/393 (38%), Positives = 209/393 (53%), Gaps = 56/393 (14%)
Query: 23 HAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRV 82
H G +I ++ + S+S A ++++FENGY VETV++G+++G+ PY I+V
Sbjct: 4 HFLFLGIIILLFSAFVASAPSSTSPATVPTKSMVKFENGYSVETVLDGSKLGIEPYSIQV 63
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTM 142
+GEL +D NSNI +I+ LS YSR RLV GS +GY GHVDG+ +AR N+PKG+T+
Sbjct: 64 LSNGELLILDSQNSNIYQISSSLSLYSRPRLVTGSPEGYPGHVDGRLRDARLNNPKGLTV 123
Query: 143 DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS-NVAGFRDGPSEDAKFSNDFDVVYVRPT 201
DD+GN+YVADT+N AIRKI +AGVTTIAGGK G DGPSEDAKFSNDFDVVY+ +
Sbjct: 124 DDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVRGGGHVDGPSEDAKFSNDFDVVYLGSS 183
Query: 202 CSLLVIDRGNAALRQISLNQDDCEYQYNSISP---------------------------- 233
CSLLVIDRGN A+R+I L+ DDC QY S P
Sbjct: 184 CSLLVIDRGNQAIREIQLHFDDCADQYGSGFPLGNIYIPKFSVWVPSCILQSSLSQLLFN 243
Query: 234 ---------TDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNK 284
I ++V AV GY+ +LQ+ T Q ++ IK +
Sbjct: 244 VSLIQCFNIAGIAVLVAAVFFGYMLALLQRRLSSIVSYHTDQEVFEAVPDQDPIKPV--- 300
Query: 285 EKPIPIVESMKEEPGWPSF-GQLIIDLSK---LALEAMGSMF----------LNFVPFRF 330
+P I+ ++E SF G L I +S ++E MF LNF
Sbjct: 301 -RPPLILTGDEQEKQEESFLGTLQIFISNAWVFSVELFSGMFPGLRKKQTVGLNFNHQET 359
Query: 331 RSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKS 363
+ S P++++ + + PPV R +
Sbjct: 360 KHSAFSTTSWPIQESFVIHNKDEPPPVESRNAT 392
>gi|449446849|ref|XP_004141183.1| PREDICTED: uncharacterized protein LOC101216407 [Cucumis sativus]
gi|449489545|ref|XP_004158343.1| PREDICTED: uncharacterized protein LOC101227198 [Cucumis sativus]
Length = 417
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 123/277 (44%), Positives = 177/277 (63%), Gaps = 11/277 (3%)
Query: 55 VLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV 114
+++FE GY+VET+++G+++G+ PY + VS GEL +D NSN+ KI+ P+SQ+ R +L
Sbjct: 1 MIKFERGYVVETLLDGSKMGIEPYSVGVSPSGELLILDAENSNVHKISMPVSQFCRPKLF 60
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS 174
AGS +GY+GHVDGK +AR +HP+G+T+D +GN+Y+ADT N AIRKI DAGVTTIAGGK
Sbjct: 61 AGSSEGYSGHVDGKLRDARMSHPRGLTVDQRGNIYIADTKNKAIRKISDAGVTTIAGGKW 120
Query: 175 NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPT 234
+G DGPSE++KFSNDFD+VYV +CSLLV+DRGN A+R+I L +DC +Y+
Sbjct: 121 RKSGHIDGPSEESKFSNDFDLVYVGSSCSLLVVDRGNQAIREIQLRAEDCT-EYDGSFLL 179
Query: 235 DILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPI--PIVE 292
I ++ A+L+GY+ Q F S+ S + I EK I P++
Sbjct: 180 GIALLTAAMLLGYMLARFQFRVLATFSSKND--SRVDLRNIPSIPPYGRVEKSIRRPLIP 237
Query: 293 SMKEEPGWP------SFGQLIIDLSKLALEAMGSMFL 323
S + E P S G+L ++ A E ++ L
Sbjct: 238 SEEAEDNQPEENIICSLGKLFLNTGSSAAEIFVALLL 274
>gi|357516379|ref|XP_003628478.1| hypothetical protein MTR_8g058630 [Medicago truncatula]
gi|355522500|gb|AET02954.1| hypothetical protein MTR_8g058630 [Medicago truncatula]
Length = 150
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 102/128 (79%), Positives = 115/128 (89%)
Query: 109 SRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
SR RLVAGSF G TGHVDGK ++ARF++PKG+ +DDKGN+YVADT N+AIRKIGDAGVTT
Sbjct: 13 SRERLVAGSFLGRTGHVDGKLSDARFHYPKGIALDDKGNVYVADTQNMAIRKIGDAGVTT 72
Query: 169 IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQY 228
IAGGKSNVAG+RDGP EDAK SNDFDVVY+RPTCSLLVIDRGNAALRQI LNQ+DC YQ
Sbjct: 73 IAGGKSNVAGYRDGPGEDAKLSNDFDVVYIRPTCSLLVIDRGNAALRQIFLNQEDCNYQS 132
Query: 229 NSISPTDI 236
+SIS T +
Sbjct: 133 SSISLTGL 140
>gi|357516309|ref|XP_003628443.1| hypothetical protein MTR_8g058300 [Medicago truncatula]
gi|355522465|gb|AET02919.1| hypothetical protein MTR_8g058300 [Medicago truncatula]
Length = 160
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 102/126 (80%), Positives = 114/126 (90%)
Query: 109 SRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
SR RLVAGSF G TGHVDGK ++ARF++PKG+ +DDKGN+YVADT N+AIRKIGDAGVTT
Sbjct: 13 SRERLVAGSFLGRTGHVDGKLSDARFHYPKGIALDDKGNVYVADTQNMAIRKIGDAGVTT 72
Query: 169 IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQY 228
IAGGKSNVAG+RDGP EDAK SNDFDVVY+RPTCSLLVIDRGNAALRQI LNQ+DC YQ
Sbjct: 73 IAGGKSNVAGYRDGPGEDAKLSNDFDVVYIRPTCSLLVIDRGNAALRQIFLNQEDCNYQS 132
Query: 229 NSISPT 234
+SIS T
Sbjct: 133 SSISLT 138
>gi|30685100|ref|NP_196993.2| NHL domain-containing protein [Arabidopsis thaliana]
gi|332004703|gb|AED92086.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 754
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 186/321 (57%), Gaps = 19/321 (5%)
Query: 54 NVLQFENGYLVETVIEGNEIGVVPYKIRVSEDG-ELFAVDEVNSNIVKITPPLSQYSRGR 112
+++++E+GY +ETV +G+++G+ PY I VS +G EL +D NSNI KI+ PLS+Y + +
Sbjct: 70 SMVKYESGYNMETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYGKPK 129
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGG 172
L++GS +GYTGHVDGK EAR N P+G+ MDD+GN+YVADT+N+AIRKI D GV+TIA G
Sbjct: 130 LLSGSQEGYTGHVDGKLKEARMNRPRGLAMDDRGNIYVADTINMAIRKISDDGVSTIAAG 189
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC---EYQYN 229
G ++ E +FS+DFD++YV +CSLLVIDRGN +++I L+ DC E +
Sbjct: 190 GRWSGGSKE---ESMRFSDDFDLIYVSSSCSLLVIDRGNQLIKEIQLHDHDCSQPEPDTD 246
Query: 230 SISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIP 289
S+ L+ V AV GY+ +L + F S + + + P P
Sbjct: 247 SLHLGTALL-VAAVFFGYMLALLVRRVRSLFSSSSHDTKSKRHVATPSMTMAPYQRYPRP 305
Query: 290 IV--------ESMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNFVPFRFRS--SGTKGNL 339
+ ES KEE S G+L++ E M S N +P F + N
Sbjct: 306 VRQPLIPPQHESEKEEGFLGSLGKLVVKTGSSVSEMM-SGSRNVIPPNFHQYHHQQEPNQ 364
Query: 340 TPLRDTLRMPEDEAAPPVVQR 360
P++++ +PE++ P + R
Sbjct: 365 WPVQESFAIPEEDGPPALEPR 385
>gi|9755656|emb|CAC01808.1| putative protein [Arabidopsis thaliana]
Length = 733
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 186/321 (57%), Gaps = 19/321 (5%)
Query: 54 NVLQFENGYLVETVIEGNEIGVVPYKIRVSEDG-ELFAVDEVNSNIVKITPPLSQYSRGR 112
+++++E+GY +ETV +G+++G+ PY I VS +G EL +D NSNI KI+ PLS+Y + +
Sbjct: 49 SMVKYESGYNMETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYGKPK 108
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGG 172
L++GS +GYTGHVDGK EAR N P+G+ MDD+GN+YVADT+N+AIRKI D GV+TIA G
Sbjct: 109 LLSGSQEGYTGHVDGKLKEARMNRPRGLAMDDRGNIYVADTINMAIRKISDDGVSTIAAG 168
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC---EYQYN 229
G ++ E +FS+DFD++YV +CSLLVIDRGN +++I L+ DC E +
Sbjct: 169 GRWSGGSKE---ESMRFSDDFDLIYVSSSCSLLVIDRGNQLIKEIQLHDHDCSQPEPDTD 225
Query: 230 SISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIP 289
S+ L+ V AV GY+ +L + F S + + + P P
Sbjct: 226 SLHLGTALL-VAAVFFGYMLALLVRRVRSLFSSSSHDTKSKRHVATPSMTMAPYQRYPRP 284
Query: 290 IV--------ESMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNFVPFRFRS--SGTKGNL 339
+ ES KEE S G+L++ E M S N +P F + N
Sbjct: 285 VRQPLIPPQHESEKEEGFLGSLGKLVVKTGSSVSEMM-SGSRNVIPPNFHQYHHQQEPNQ 343
Query: 340 TPLRDTLRMPEDEAAPPVVQR 360
P++++ +PE++ P + R
Sbjct: 344 WPVQESFAIPEEDGPPALEPR 364
>gi|297807503|ref|XP_002871635.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317472|gb|EFH47894.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 704
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 131/380 (34%), Positives = 206/380 (54%), Gaps = 29/380 (7%)
Query: 6 LLLTLTLLIAFTLQFQAHAAP-----AGPLIKHLTSLL-KW--------TTRSSSKAPQA 51
L+L++ + F A P L+ ++ S+L KW TT +++K+ +
Sbjct: 8 LILSVIFMFFFCGLSSVSAKPPPVKIVSGLVTNVASILWKWLWSLQTSTTTTTTTKSGVS 67
Query: 52 DGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDG-ELFAVDEVNSNIVKITPPLSQYSR 110
+++++E+GY +ETV +G+++G+ PY I VS +G EL +D NSNI KI+ PLS+Y++
Sbjct: 68 SRSMVKYESGYNIETVFDGSKLGIEPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYAK 127
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA 170
+LV+GS +GYTGHVDGK EA+ N P+G+ +DD GN+YVADT N+AIRKI D GV+TI
Sbjct: 128 PKLVSGSQEGYTGHVDGKLKEAKMNRPRGLAIDDSGNIYVADTNNMAIRKISDDGVSTIT 187
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNS 230
G G ++ E +FS+DFD++YV +CSLLVIDRGN +++I L+ DC +Q
Sbjct: 188 AGGRWSGGSKE---ESMRFSDDFDLIYVSSSCSLLVIDRGNQMIKEIQLHDHDCSHQEPD 244
Query: 231 ISPTDI--LMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPI 288
+ ++V A GY+ +L + F S + +P+
Sbjct: 245 TDNLHLGTALLVAAAFFGYMFALLVRRVRSLFSSSGHDNKRHVAKPSMTMAPYQRYPRPV 304
Query: 289 ------PIVESMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNFVPFRFRS--SGTKGNLT 340
P E KEE S G+L++ E M S N +P F + N
Sbjct: 305 RQPLIPPQHEPEKEEGFLGSLGKLVVKTGSSVSEMM-SGSRNVIPPNFHQYHHQQEPNQW 363
Query: 341 PLRDTLRMPEDEAAPPVVQR 360
P++++ +PE++ P + R
Sbjct: 364 PVQESYAIPEEDGPPALEPR 383
>gi|357516307|ref|XP_003628442.1| hypothetical protein MTR_8g058280 [Medicago truncatula]
gi|355522464|gb|AET02918.1| hypothetical protein MTR_8g058280 [Medicago truncatula]
Length = 283
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 109/271 (40%), Positives = 152/271 (56%), Gaps = 37/271 (13%)
Query: 225 EYQYNS-ISPTDILMVVGAVLVGYVTCMLQQGF---GPFFFSRTQQPSESEFNEETEIKE 280
+Y YN + I + + +G + + + + G + QQPSE +F E
Sbjct: 38 KYPYNCWCCDSRICCIHASAGIGILFLLKNELYFILGVYIAYMLQQPSERDFKGE----- 92
Query: 281 LSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLT 340
++ +K +E KEE GWPSF Q ++DL + T+ LT
Sbjct: 93 -ASSDKSKSTLERTKEETGWPSFRQ-VVDL-------------------LKPDSTRRGLT 131
Query: 341 PLRDTLRMPEDEAAPPVVQRQKS-AVPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDSS 399
PL+D L MPEDE P +V+RQ + A PL+ET ++ +KY + K K+KS++ KD S
Sbjct: 132 PLKDRLLMPEDELEPLLVKRQSALAPPLTETRKIHLKSAAEKYSETKIAKVKSSTPKDPS 191
Query: 400 LSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQRDKSGELVYGVSGMEPKPVE 459
L +KH SSKRQEYAEFYGS E+P P +KSK QK+R+RHR+R+KS E V G G E KP+E
Sbjct: 192 LPSKHHSSKRQEYAEFYGSSEIPAP-TKSKIQKQRSRHRRREKSEE-VSGAVGTEQKPLE 249
Query: 460 MKPVDYDNLKFDHYN---MRSKY-GDDSYRF 486
M+ Y N K+D YN MR KY +D+ RF
Sbjct: 250 MRAAGYFNTKYDQYNYNMMRPKYVPEDTSRF 280
>gi|148909539|gb|ABR17864.1| unknown [Picea sitchensis]
Length = 264
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 82/166 (49%), Positives = 110/166 (66%), Gaps = 1/166 (0%)
Query: 59 ENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF 118
E GY VETV+ GN++ V P+ I + +G++ +D VNS + +I PLSQ S ++ AGS
Sbjct: 26 EEGYRVETVLYGNKMDVYPHSI-LPFNGDILLLDSVNSTLFRIGLPLSQDSTIKVFAGSR 84
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
G DG +A+FNHPK T+D KGN+YVAD N AIRKI +GV+TIAGG + G
Sbjct: 85 NTVPGFADGGFLDAQFNHPKSFTLDSKGNIYVADRANHAIRKISKSGVSTIAGGTAGKTG 144
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC 224
DGPS++A FSND+D+ ++ C+LLV DRGN +RQI L DC
Sbjct: 145 HADGPSQEAMFSNDYDLTFIPSICALLVSDRGNRMIRQIKLPPGDC 190
>gi|42571623|ref|NP_973902.1| NHL domain-containing protein [Arabidopsis thaliana]
gi|332192325|gb|AEE30446.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 400
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 87/204 (42%), Positives = 118/204 (57%), Gaps = 8/204 (3%)
Query: 56 LQFENGYLVETVIEGNEIGVVPYKIR-VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV 114
+ E GY V TV++G++ G+ PY I + L +D S + PLS S
Sbjct: 23 IVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSGSTFYTTSFPLSVDSVINRF 82
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGK 173
AG G +GHVDGK +RF+ P+G +D KGN+YVAD N AIRKI +G VTTIAGG
Sbjct: 83 AG--DGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKAIRKISSSGSVTTIAGGI 140
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISP 233
S G RDGP+++A FS+DF++ +V C LLV D GN +RQI+L ++DC +S
Sbjct: 141 SKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQINLKEEDCLENSHSNLG 200
Query: 234 TDILMVVGAVLVGYVTCMLQQGFG 257
T L +G VL +C+L G
Sbjct: 201 TYSLWSIGIVL----SCILGVAIG 220
>gi|18395326|ref|NP_564209.1| NHL domain-containing protein [Arabidopsis thaliana]
gi|15724248|gb|AAL06517.1|AF412064_1 At1g23890/T23E23_13 [Arabidopsis thaliana]
gi|16974539|gb|AAL31179.1| At1g23890/T23E23_13 [Arabidopsis thaliana]
gi|110742589|dbj|BAE99208.1| hypothetical protein [Arabidopsis thaliana]
gi|332192326|gb|AEE30447.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 261
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 87/204 (42%), Positives = 118/204 (57%), Gaps = 8/204 (3%)
Query: 56 LQFENGYLVETVIEGNEIGVVPYKIR-VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV 114
+ E GY V TV++G++ G+ PY I + L +D S + PLS S
Sbjct: 23 IVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSGSTFYTTSFPLSVDSVINRF 82
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGK 173
AG G +GHVDGK +RF+ P+G +D KGN+YVAD N AIRKI +G VTTIAGG
Sbjct: 83 AG--DGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKAIRKISSSGSVTTIAGGI 140
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISP 233
S G RDGP+++A FS+DF++ +V C LLV D GN +RQI+L ++DC +S
Sbjct: 141 SKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQINLKEEDCLENSHSNLG 200
Query: 234 TDILMVVGAVLVGYVTCMLQQGFG 257
T L +G VL +C+L G
Sbjct: 201 TYSLWSIGIVL----SCILGVAIG 220
>gi|255560013|ref|XP_002521025.1| conserved hypothetical protein [Ricinus communis]
gi|223539862|gb|EEF41442.1| conserved hypothetical protein [Ricinus communis]
Length = 408
Score = 145 bits (365), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 80/203 (39%), Positives = 123/203 (60%), Gaps = 5/203 (2%)
Query: 56 LQFENGYLVETVIEGNEIGVVPYKIRVS-EDGELFAVDEVNSNIVKITPPLSQYSRGRLV 114
L E+GY V T+I+G+++ + P+ + + +L +D +S I I+ P+SQ S + +
Sbjct: 30 LILEDGYTVTTIIDGHKLEINPHAVLSRPQSSDLILLDSSHSTIYTISFPISQESVVKRL 89
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS 174
+G G G DG+P ARFN P+ +D+KGN+YVAD LN IRKI ++GV+TIAGG S
Sbjct: 90 SG--DGVAGLSDGEPGSARFNKPRSFAVDNKGNIYVADRLNGTIRKITNSGVSTIAGGYS 147
Query: 175 NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPT 234
G DGP+++A FS+DF+V +V C+LL+ D GN +R++ L DDC +S
Sbjct: 148 KGFGREDGPAQNATFSSDFEVAFVAEECALLISDHGNQLVRRLPLKPDDCATASHSALGA 207
Query: 235 DILMVVGAVLVGYVTCMLQQGFG 257
V+G LV ++C++ G
Sbjct: 208 VSFWVLGLGLV--MSCLIGIAIG 228
>gi|414870491|tpg|DAA49048.1| TPA: hypothetical protein ZEAMMB73_516870 [Zea mays]
Length = 271
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 80/211 (37%), Positives = 123/211 (58%), Gaps = 33/211 (15%)
Query: 44 SSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITP 103
+++ A +++++E GY VE +++ +++G+ PY + V++ E+ +D SNI +
Sbjct: 90 NNAIAASGGKSMVKYECGYTVEKILDYSKLGIEPYSMEVTQGSEILIMD---SNIYSMAL 146
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD 163
PLS+ SR +LVAGS +G+ GHVDG E R HPKGVT+DDKG
Sbjct: 147 PLSRDSRPKLVAGSLEGFPGHVDGNLREGRIYHPKGVTVDDKGR---------------- 190
Query: 164 AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV-YVRPTCSLLVIDRGNAALRQISLNQD 222
GG+ DGPS++AK S DF+V Y+ +CSLLVIDRGN +R+I L+ D
Sbjct: 191 -------GGQV------DGPSDEAKSSTDFEVCYYIGSSCSLLVIDRGNQTIREIQLHFD 237
Query: 223 DCEYQYNSISPTDILMVVGAVLVGYVTCMLQ 253
DC YQ+ + P + ++ A +GY+ +LQ
Sbjct: 238 DCVYQHEADFPLGVALLAVAAFLGYMLALLQ 268
>gi|9369410|gb|AAF87158.1|AC002423_23 T23E23.6 [Arabidopsis thaliana]
Length = 370
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 81/179 (45%), Positives = 110/179 (61%), Gaps = 5/179 (2%)
Query: 56 LQFENGYLVETVIEGNEIGVVPYKIR-VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV 114
+ E GY V TV++G++ G+ PY I + L +D S + PLS S
Sbjct: 23 IVLEEGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSGSTFYTTSFPLSVDSVINRF 82
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGK 173
AG G +GHVDGK +RF+ P+G +D KGN+YVAD N AIRKI +G VTTIAGG
Sbjct: 83 AG--DGSSGHVDGKAGNSRFSKPRGFAVDAKGNVYVADKSNKAIRKISSSGSVTTIAGGI 140
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC-EYQYNSI 231
S G RDGP+++A FS+DF++ +V C LLV D GN +RQI+L ++DC E ++SI
Sbjct: 141 SKAFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMIRQINLKEEDCLENSHSSI 199
>gi|224065156|ref|XP_002301692.1| predicted protein [Populus trichocarpa]
gi|222843418|gb|EEE80965.1| predicted protein [Populus trichocarpa]
Length = 264
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 128/230 (55%), Gaps = 12/230 (5%)
Query: 53 GNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGR 112
G + E+GY+V TV++G+++ V P+ +++ +L +D S + P+SQ G
Sbjct: 20 GEQIMLEDGYMVTTVMDGHKLNVNPHAVQL-RSSDLVVLDSSKSVFYTLPFPISQ--DGV 76
Query: 113 LVAG-SFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
+V S G G++DG+P ARFN PK T+D +GN+YVAD LN A+RKI +G+TT
Sbjct: 77 MVKRLSGSGDKGYIDGEPGLARFNKPKSFTVDLRGNVYVADQLNHAVRKISSSGMTTTIA 136
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSI 231
G + G +DGP E A FS DF+V++V C+LL+ D GN LRQ+ L Q+DC S
Sbjct: 137 GNYSQIGRQDGPGETATFSTDFEVLFVPQICALLISDHGNQLLRQVDLKQEDCIIGSQSA 196
Query: 232 SPTDILMVVGAVLVGYVTCM--LQQGFG--PFFFSRTQQPSESEFNEETE 277
V+G VL +C+ L GF P+ T + F ++ E
Sbjct: 197 LGAVKFWVLGLVL----SCLFGLATGFAIRPYVIPHTGRRQTPSFQQDME 242
>gi|297850862|ref|XP_002893312.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297339154|gb|EFH69571.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 398
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 83/204 (40%), Positives = 117/204 (57%), Gaps = 8/204 (3%)
Query: 56 LQFENGYLVETVIEGNEIGVVPYKIR-VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV 114
+ E+GY V TV++G++ G+ PY I + L +D S + PLS S
Sbjct: 23 IVLEDGYEVTTVVDGHKSGLNPYTIHALPGSSNLIVLDSSGSTFYTTSFPLSFDSVIHRF 82
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGK 173
AG G +G+VDGK +RF P+G +D KGN+YVAD N AIRKI +G VTTIAGG
Sbjct: 83 AG--DGTSGYVDGKAGNSRFKKPRGFAIDAKGNVYVADRSNKAIRKISSSGYVTTIAGGI 140
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISP 233
S G RDGP+++A FS+DF++ +V C LLV D GN +RQI+L ++DC +S
Sbjct: 141 SKEFGHRDGPAQNATFSSDFEITFVPQRCCLLVSDHGNEMVRQINLKEEDCLESSHSNLG 200
Query: 234 TDILMVVGAVLVGYVTCMLQQGFG 257
L +G +++C+L G
Sbjct: 201 AYSLWSIGI----FLSCILGVAIG 220
>gi|224079468|ref|XP_002305875.1| predicted protein [Populus trichocarpa]
gi|222848839|gb|EEE86386.1| predicted protein [Populus trichocarpa]
Length = 241
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 78/222 (35%), Positives = 123/222 (55%), Gaps = 6/222 (2%)
Query: 58 FENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSR--GRLVA 115
E+GY+V TV++G+++ + P+ +++ E+ +D S + P+SQ S RL
Sbjct: 2 LEDGYMVTTVLDGHKLNINPHAVQL-RSSEIVVLDSSRSVFYTLPFPISQASVMVKRL-- 58
Query: 116 GSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 175
S +G TG++DG+P ARFN PK +D +GN+YVAD N A+RKI ++GVT+ G +
Sbjct: 59 -SGEGKTGYIDGEPGLARFNKPKSFAVDLRGNVYVADQQNHAVRKISNSGVTSTIVGNYS 117
Query: 176 VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD 235
G +DGP + A FS+DF+V++V C+LL+ D GN LRQI L +DC S
Sbjct: 118 QTGRQDGPGKTATFSSDFEVLFVPQICALLISDHGNQLLRQIDLKPEDCVIGSQSALGAV 177
Query: 236 ILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETE 277
V+G L + ++ P+ T + F ++ E
Sbjct: 178 KFWVLGLALSCLLGIVIGIATRPYVIPHTGRLQTPSFQQDME 219
>gi|296086532|emb|CBI32121.3| unnamed protein product [Vitis vinifera]
Length = 369
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 84/224 (37%), Positives = 124/224 (55%), Gaps = 16/224 (7%)
Query: 56 LQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAV-DEVNSNIVKITPPLSQYSRGRLV 114
L E+GY V TV +GN++ + P+ I F + D S ++ PLSQ S + +
Sbjct: 28 LVLEDGYTVRTVFDGNKLEINPHSILPRYGSSDFIILDSSKSVFYTVSSPLSQESEIKRL 87
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT-IAGGK 173
+GS G++ DG A F+ P+ +D KGN+YVAD N IRKI + GVTT IAGG
Sbjct: 88 SGSSAGFS---DGDSASATFSKPRSFAVDLKGNVYVADQSNGVIRKITNRGVTTTIAGGY 144
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISP 233
+ G DGP+++A FS DF++V+V C++LV DRG+ +RQI L +DC S+
Sbjct: 145 AQKTGKVDGPAQNASFSKDFELVFVPEKCAVLVSDRGSQLVRQIDLKVEDCRRSPQSVLG 204
Query: 234 TDILMVV----GAVLVGYVTCML-------QQGFGPFFFSRTQQ 266
L V+ + LVG++ ++ Q+ F P FFS T +
Sbjct: 205 GAFLWVLLGLGVSCLVGFIVGIISRPYVIPQEVFCPLFFSETWK 248
>gi|225424715|ref|XP_002265272.1| PREDICTED: uncharacterized protein LOC100243227 [Vitis vinifera]
Length = 438
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 84/222 (37%), Positives = 123/222 (55%), Gaps = 16/222 (7%)
Query: 56 LQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAV-DEVNSNIVKITPPLSQYSRGRLV 114
L E+GY V TV +GN++ + P+ I F + D S ++ PLSQ S + +
Sbjct: 28 LVLEDGYTVRTVFDGNKLEINPHSILPRYGSSDFIILDSSKSVFYTVSSPLSQESEIKRL 87
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT-IAGGK 173
+GS G++ DG A F+ P+ +D KGN+YVAD N IRKI + GVTT IAGG
Sbjct: 88 SGSSAGFS---DGDSASATFSKPRSFAVDLKGNVYVADQSNGVIRKITNRGVTTTIAGGY 144
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISP 233
+ G DGP+++A FS DF++V+V C++LV DRG+ +RQI L +DC S+
Sbjct: 145 AQKTGKVDGPAQNASFSKDFELVFVPEKCAVLVSDRGSQLVRQIDLKVEDCRRSPQSVLG 204
Query: 234 TDILMVV----GAVLVGYVTCML-------QQGFGPFFFSRT 264
L V+ + LVG++ ++ Q+ F P FFS T
Sbjct: 205 GAFLWVLLGLGVSCLVGFIVGIISRPYVIPQEVFCPLFFSET 246
>gi|413944258|gb|AFW76907.1| hypothetical protein ZEAMMB73_849037 [Zea mays]
Length = 187
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 57/112 (50%), Positives = 86/112 (76%)
Query: 54 NVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
+++++E GY V+TV +G+++G+ P+ + ++ G L +D +NSNI ++ PLS YSR +L
Sbjct: 68 SMVRYEGGYAVDTVFDGSKLGIEPHAVEITPAGNLLVLDSINSNIYRVQLPLSPYSRPKL 127
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
+AGS +G +GHVDG+ EAR NHPKG T+DD+GN+YVAD +N+AIRKI D G
Sbjct: 128 LAGSPKGLSGHVDGRLREARMNHPKGFTVDDRGNIYVADGMNMAIRKISDTG 179
>gi|168018878|ref|XP_001761972.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686689|gb|EDQ73076.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 386
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 82/203 (40%), Positives = 110/203 (54%), Gaps = 13/203 (6%)
Query: 29 PLIKHLTSLLKWTTRSSSKAPQADG----NVLQFEN-GYLVETVIEGNEIGVVPYKIRVS 83
P I L L + S + + Q D N L+ ++ GY V TV+ GN G+ Y I +
Sbjct: 7 PTILRLLLLATSSVVSLALSVQPDAAAMINALEVDSIGYDVTTVLNGNLRGLSFYCIDEA 66
Query: 84 ED---GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
D +D S ++++ PLSQ S +AGS +G G+ DG+ +A FNHPK +
Sbjct: 67 TDRAPAWAIVLDSTKSKVLRVQLPLSQDSVVEHIAGSLEGKAGYQDGRGGDALFNHPKML 126
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
T+D GN+YVAD N AIR I G VTTIAGG N G DG FSNDF V
Sbjct: 127 TLDSDGNIYVADVRNTAIRMITTQGKHSFVTTIAGGM-NRTGHNDGEGRVVTFSNDFGVT 185
Query: 197 YVRPTCSLLVIDRGNAALRQISL 219
Y+R C+LL++DRGN +R + L
Sbjct: 186 YLRKNCTLLIVDRGNRMVRAMKL 208
>gi|388505270|gb|AFK40701.1| unknown [Medicago truncatula]
Length = 384
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 91/287 (31%), Positives = 145/287 (50%), Gaps = 30/287 (10%)
Query: 59 ENGYLVETVIEGNEIGVVPYKIR---VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA 115
E GY + T+++G+++ + P+ I +S D L +D NS + P+SQ S + +
Sbjct: 31 EEGYTITTILDGHKLHINPFSILQRPISSD--LIVLDSTNSTFYTVQLPISQESVFKRFS 88
Query: 116 GSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 175
G+ G G+ DG ARF+ P+ +D +GN+YVAD +N IRKI GVTTIAGG S
Sbjct: 89 GN--GSPGYEDGDVGLARFDKPRSFAVDFRGNVYVADRVNKVIRKISTNGVTTIAGGSSE 146
Query: 176 VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD 235
+ +DGP ++A FSNDF++ ++ C+LLV D + + QI+L +DC S
Sbjct: 147 KSSIKDGPVQNASFSNDFELTFIPALCALLVSDHMHQLVHQINLKGEDCTLGSKS----- 201
Query: 236 ILMVVGAVLVGY----VTCMLQQGFG----PFFFSRTQQPSESEFNEETEIKELSNKEKP 287
+GAV+ ++C+L G P+ + S F T +N K
Sbjct: 202 ---ALGAVMTWTLGLGLSCILGLVIGIVIRPYIIPH-EHTSRCHFT-ATRKHCQTNLVKL 256
Query: 288 IPI----VESMKEEPGWPSFGQLIIDLSKLALEAMGSMF-LNFVPFR 329
+P ++S G S + + L +L+L + MF ++FV R
Sbjct: 257 VPTLYSGIKSAVASCGCSSVFTVAVRLWELSLSLLVLMFNIDFVSPR 303
>gi|357485931|ref|XP_003613253.1| hypothetical protein MTR_5g034550 [Medicago truncatula]
gi|355514588|gb|AES96211.1| hypothetical protein MTR_5g034550 [Medicago truncatula]
Length = 154
Score = 127 bits (319), Expect = 1e-26, Method: Composition-based stats.
Identities = 72/152 (47%), Positives = 88/152 (57%), Gaps = 18/152 (11%)
Query: 108 YSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT 167
Y R +LVAGS +GY+GHVD K EAR NHPKG+T+DD+GN+YVAD +N+AIRKI
Sbjct: 2 YGRPKLVAGSAEGYSGHVDEKLREARMNHPKGITVDDRGNIYVADIINMAIRKIS----- 56
Query: 168 TIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQ 227
G F S + FDV+YV + SLLVIDRG A+R+I L DDC YQ
Sbjct: 57 --LGNNMTYLSFLYEESLILFYLLLFDVIYVGSSYSLLVIDRGKQAIREIQLRFDDCAYQ 114
Query: 228 YNSISPTD-----------ILMVVGAVLVGYV 248
Y S P I M+VGA GY+
Sbjct: 115 YESRFPLGKLNKFKVCLYRIAMLVGAGFFGYM 146
>gi|226495023|ref|NP_001144050.1| uncharacterized protein LOC100276874 precursor [Zea mays]
gi|195636116|gb|ACG37526.1| hypothetical protein [Zea mays]
gi|414870559|tpg|DAA49116.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 272
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 85/227 (37%), Positives = 125/227 (55%), Gaps = 12/227 (5%)
Query: 58 FENGYLVETVIEGNEI---GVVPYKIR-VSEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
E+GY V T + N + G PY + S G+L +D S + I+ S R
Sbjct: 29 LEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLS-SSPGEPRR 87
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG 172
+AG +G +G DG P +A F+ P+ V +D N+YVAD L+ A+RK+ +G TTIAGG
Sbjct: 88 LAGGKRG-SGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGG 146
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSIS 232
S+ G RDGP+++A FS DF +VYV C+LLV DRGN +RQI+L +DC ++ S
Sbjct: 147 FSSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGL 206
Query: 233 PTDIL--MVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETE 277
++ + V L+G + L + F P +QP F ++TE
Sbjct: 207 GNTLVSFIAVLCTLLGSLIGFLARHFYPVNAGSLRQP---LFQQDTE 250
>gi|224028415|gb|ACN33283.1| unknown [Zea mays]
gi|414870560|tpg|DAA49117.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 418
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 84/223 (37%), Positives = 123/223 (55%), Gaps = 16/223 (7%)
Query: 58 FENGYLVETVIEGNEI---GVVPYKIR-VSEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
E+GY V T + N + G PY + S G+L +D S + I+ S R
Sbjct: 29 LEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLS-SSPGEPRR 87
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG 172
+AG +G +G DG P +A F+ P+ V +D N+YVAD L+ A+RK+ +G TTIAGG
Sbjct: 88 LAGGKRG-SGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGG 146
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSIS 232
S+ G RDGP+++A FS DF +VYV C+LLV DRGN +RQI+L +DC ++ S
Sbjct: 147 FSSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGL 206
Query: 233 PTDIL--MVVGAVLVGYVTCMLQQGFGPF-------FFSRTQQ 266
++ + V L+G + L + F P FFSR ++
Sbjct: 207 GNTLVSFIAVLCTLLGSLIGFLARHFYPVNEVSVNHFFSRIRK 249
>gi|147798774|emb|CAN69795.1| hypothetical protein VITISV_022545 [Vitis vinifera]
Length = 459
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/205 (37%), Positives = 114/205 (55%), Gaps = 9/205 (4%)
Query: 56 LQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAV-DEVNSNIVKITPPLSQYSRGRLV 114
L E+GY V TV +GN++ + P+ I F + D S + PLSQ S +
Sbjct: 28 LVLEDGYTVRTVFDGNKLEINPHSILPRYGSSDFIILDSSKSVFYTVXSPLSQESEINRL 87
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT-IAGGK 173
+GS G++ DG A F+ P+ +D KGN+YVAD N IRKI + GVTT IAGG
Sbjct: 88 SGSSAGFS---DGDSASAMFSKPRSFAVDLKGNVYVADQSNGVIRKITNRGVTTTIAGGY 144
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISP 233
+ G DGP+++A FS DF++V+V C++LV DRG+ +RQI L +DC S+
Sbjct: 145 AQKTGKVDGPAQNASFSKDFELVFVPEKCAVLVSDRGSQLVRQIDLKVEDCRRSPQSVLG 204
Query: 234 TDILMVV----GAVLVGYVTCMLQQ 254
L V+ + LVG++ ++ +
Sbjct: 205 GAFLWVLLGLGVSCLVGFIVGIISR 229
>gi|115476416|ref|NP_001061804.1| Os08g0414600 [Oryza sativa Japonica Group]
gi|37806208|dbj|BAC99711.1| unknown protein [Oryza sativa Japonica Group]
gi|113623773|dbj|BAF23718.1| Os08g0414600 [Oryza sativa Japonica Group]
gi|215715379|dbj|BAG95130.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215715380|dbj|BAG95131.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 275
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 83/220 (37%), Positives = 121/220 (55%), Gaps = 8/220 (3%)
Query: 56 LQFENGYLVETVIEGNEIGVV----PYKIRVS-EDGELFAVDEVNSNIVKITPPLSQYSR 110
L ++GY V T + N G PY + G+L +D S + + PLS +
Sbjct: 27 LVLDDGYTVTTAADLNHPGAPAAPHPYALLPRPRAGDLVLLDSAASALYTLALPLSGGAA 86
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTI 169
R +AG G G DG+P +A F+ P+ + +D N+YVAD +N A+RKI +G TTI
Sbjct: 87 ARTLAGGGGGPAGFADGEPRDAAFDRPRSLAVDHADNVYVADRMNGAVRKIAPSGFTTTI 146
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC--EYQ 227
AGG+S G +DGP+++A FS+DF++VYV C+LLV DRGN +RQI+L Q+DC E Q
Sbjct: 147 AGGRSKGPGRKDGPAQNATFSSDFELVYVPKMCALLVTDRGNRLIRQINLKQEDCARETQ 206
Query: 228 YNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQP 267
+ + ++ V L+G V + F P QP
Sbjct: 207 PGLGTTSVSIIAVLCALLGSVIGFSIRHFYPAHAGSLHQP 246
>gi|414870561|tpg|DAA49118.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 280
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 78/203 (38%), Positives = 115/203 (56%), Gaps = 13/203 (6%)
Query: 58 FENGYLVETVIEGNEI---GVVPYKIR-VSEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
E+GY V T + N + G PY + S G+L +D S + I+ S R
Sbjct: 29 LEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLS-SSPGEPRR 87
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG 172
+AG +G +G DG P +A F+ P+ V +D N+YVAD L+ A+RK+ +G TTIAGG
Sbjct: 88 LAGGKRG-SGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGG 146
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNS-- 230
S+ G RDGP+++A FS DF +VYV C+LLV DRGN +RQI+L +DC ++ S
Sbjct: 147 FSSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGL 206
Query: 231 ----ISPTDILMVVGAVLVGYVT 249
+S +L + L+G++
Sbjct: 207 GNTLVSFIAVLCTLLGSLIGFLA 229
>gi|359493626|ref|XP_003634638.1| PREDICTED: LOW QUALITY PROTEIN: protein argonaute 1B-like [Vitis
vinifera]
Length = 867
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 68/126 (53%), Positives = 79/126 (62%), Gaps = 35/126 (27%)
Query: 155 NLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
NLAIRKIGD+GVTTI GGKSNVAG+ +GP EDAKFS DFDVVYVR
Sbjct: 11 NLAIRKIGDSGVTTIVGGKSNVAGYGNGPCEDAKFSRDFDVVYVR--------------- 55
Query: 215 RQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNE 274
PT I MV+GAV+VGY +C+LQ+GFGP FS+TQ SESE +E
Sbjct: 56 ------------------PTHIFMVIGAVMVGYASCLLQKGFGPSVFSKTQH-SESE-SE 95
Query: 275 ETEIKE 280
+ IKE
Sbjct: 96 DQLIKE 101
>gi|414870562|tpg|DAA49119.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 207
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 74/178 (41%), Positives = 105/178 (58%), Gaps = 7/178 (3%)
Query: 58 FENGYLVETVIEGNEI---GVVPYKIRV-SEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
E+GY V T + N + G PY + S G+L +D S + I+ S R
Sbjct: 29 LEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISLS-SSPGEPRR 87
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG 172
+AG +G +G DG P +A F+ P+ V +D N+YVAD L+ A+RK+ +G TTIAGG
Sbjct: 88 LAGGKRG-SGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGYTTTIAGG 146
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNS 230
S+ G RDGP+++A FS DF +VYV C+LLV DRGN +RQI+L +DC ++ S
Sbjct: 147 FSSGPGHRDGPAQNATFSPDFALVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQS 204
>gi|222640551|gb|EEE68683.1| hypothetical protein OsJ_27311 [Oryza sativa Japonica Group]
Length = 420
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 85/226 (37%), Positives = 123/226 (54%), Gaps = 15/226 (6%)
Query: 56 LQFENGYLVETVIEGNEIGVV----PYKIRVS-EDGELFAVDEVNSNIVKITPPLSQYSR 110
L ++GY V T + N G PY + G+L +D S + + PLS +
Sbjct: 27 LVLDDGYTVTTAADLNHPGAPAAPHPYALLPRPRAGDLVLLDSAASALYTLALPLSGGAA 86
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTI 169
R +AG G G DG+P +A F+ P+ + +D N+YVAD +N A+RKI +G TTI
Sbjct: 87 ARTLAGGGGGPAGFADGEPRDAAFDRPRSLAVDHADNVYVADRMNGAVRKIAPSGFTTTI 146
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC--EYQ 227
AGG+S G +DGP+++A FS+DF++VYV C+LLV DRGN +RQI+L Q+DC E Q
Sbjct: 147 AGGRSKGPGRKDGPAQNATFSSDFELVYVPKMCALLVTDRGNRLIRQINLKQEDCARETQ 206
Query: 228 YNSISPTDILMVVGAVLVGYVTCMLQQGFGPF-------FFSRTQQ 266
+ + ++ V L+G V + F P FF R Q
Sbjct: 207 PGLGTTSVSIIAVLCALLGSVIGFSIRHFYPAHEVSINRFFRRMQM 252
>gi|356526828|ref|XP_003532018.1| PREDICTED: uncharacterized protein LOC100816542 [Glycine max]
Length = 400
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 67/168 (39%), Positives = 99/168 (58%), Gaps = 4/168 (2%)
Query: 59 ENGYLVETVIEGNEIGVVPYKI-RVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS 117
E GY V TV +G++ + P+ + + +L +D VNS P+++ S ++G
Sbjct: 27 EEGYTVTTVFDGHKPHIFPFTVLQRPFSSDLILLDSVNSTFYTAQFPITEESVFTRLSG- 85
Query: 118 FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK-SNV 176
G G+ DG ARF P+ D +GN+YVAD N AIRKI GVTTIAGG+ S
Sbjct: 86 -DGSVGYSDGDVGSARFAKPRSFAFDMRGNVYVADKSNRAIRKISAKGVTTIAGGEFSEK 144
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC 224
+ +DGP+ +A FSNDFD+ ++ C+LLV D + +RQI+L ++DC
Sbjct: 145 SRTKDGPALNASFSNDFDLTFIPGLCALLVSDHMHRLVRQINLMEEDC 192
>gi|302768697|ref|XP_002967768.1| hypothetical protein SELMODRAFT_67906 [Selaginella moellendorffii]
gi|300164506|gb|EFJ31115.1| hypothetical protein SELMODRAFT_67906 [Selaginella moellendorffii]
Length = 186
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/135 (45%), Positives = 86/135 (63%), Gaps = 5/135 (3%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
+ A+D + I K+ PLSQ S AGSF G +G+VDG ++ FN P+ + + D G
Sbjct: 45 VLALDSSGNRIWKLRLPLSQNSSLEAFAGSFVGESGYVDGPAAKSLFNRPQSLAICDNGA 104
Query: 148 LYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC-SLL 205
++VADT NLAIRKI D VTTIAGG S GF DGP + A+FS++F + +C SLL
Sbjct: 105 VFVADTRNLAIRKISKDGEVTTIAGGSSRKPGFADGPGDTARFSSEFSLAC---SCGSLL 161
Query: 206 VIDRGNAALRQISLN 220
+ DRGN +R+I ++
Sbjct: 162 IADRGNRLIREIQID 176
>gi|242079119|ref|XP_002444328.1| hypothetical protein SORBIDRAFT_07g020240 [Sorghum bicolor]
gi|241940678|gb|EES13823.1| hypothetical protein SORBIDRAFT_07g020240 [Sorghum bicolor]
Length = 416
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 79/223 (35%), Positives = 116/223 (52%), Gaps = 19/223 (8%)
Query: 58 FENGYLVETVIEGNEI---GVVPYKIRVS-EDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
E+GY V T N + G PY I G+L +D S + ++ S RL
Sbjct: 30 LEDGYTVTTFANFNPLPASGPHPYAILPRPRAGDLLLLDSAGSALYTLSLSSSPGEPRRL 89
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG 172
G + +G DG +A F+ P+ V +D N+YVAD + A+RK+ +G TT+AGG
Sbjct: 90 AGGKRR--SGFDDG---DAAFDRPRSVAVDAADNVYVADQRHGAVRKVAPSGYTTTVAGG 144
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSIS 232
S+ G RDG +++A FS DF++VYV C+LLV DRGN +RQI+L +DC ++ S
Sbjct: 145 LSSGPGHRDGLAQNATFSADFELVYVPKICALLVADRGNRMVRQINLKPEDCAHEKQSGL 204
Query: 233 PTDILMVVG--AVLVGYVTCMLQQGFGPF-------FFSRTQQ 266
T + V+ L+G + L + F P FF R Q+
Sbjct: 205 GTTSVSVIAILCALLGLIIGFLVRHFYPVNEVSINHFFIRIQK 247
>gi|326503362|dbj|BAJ99306.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 452
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 76/223 (34%), Positives = 111/223 (49%), Gaps = 24/223 (10%)
Query: 56 LQFENGYLVETVIEGNEIGVVPYKIRVS-EDGELFAVDEVNSNIVKITPPLSQYSRGRLV 114
L E+GY V TV + N G PY + G+L +D S + + P+S + R
Sbjct: 56 LVLEDGYTVSTVSDLNPSGTHPYALLPRPRAGDLVLLDSAGSTLYTLPLPVSADAGPR-- 113
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGK 173
+ G HP+ + +D N+YVAD N +IRK+ +G TTIAG
Sbjct: 114 ---------RLAGGAGALGDGHPRSIAVDGADNVYVADRANGSIRKVAPSGYTTTIAGAY 164
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYN---S 230
S G RD P+++A FS DF+++YV C+LLV DRGN +RQI L +DC ++
Sbjct: 165 SAGTGHRDEPAQNATFSADFELIYVPQICALLVADRGNRLIRQIKLKPEDCAHENQKGLG 224
Query: 231 ISPTDILMVVGAVLVGYVTCMLQQGFGPF-------FFSRTQQ 266
+ T I+ ++ A L G + L + F PF FF R Q+
Sbjct: 225 TTSTSIIAILAA-LFGSIIGFLVRHFYPFHEVSINRFFIRIQR 266
>gi|302821461|ref|XP_002992393.1| hypothetical protein SELMODRAFT_47902 [Selaginella moellendorffii]
gi|300139809|gb|EFJ06543.1| hypothetical protein SELMODRAFT_47902 [Selaginella moellendorffii]
Length = 183
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 58/135 (42%), Positives = 83/135 (61%), Gaps = 8/135 (5%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
+ A+D + I K+ PLSQ S AGSF G +G+VDG ++ FN P+ +++ D G
Sbjct: 45 VLALDSSGNRIWKLRLPLSQNSSLEAFAGSFVGESGYVDGPAAKSLFNRPQSLSICDNGA 104
Query: 148 LYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC-SLL 205
++VADT NLAIRKI D VTTIAGG S GF + A+FS++F + +C SLL
Sbjct: 105 VFVADTRNLAIRKISKDGEVTTIAGGSSRKPGF---AGDTARFSSEFSLAC---SCGSLL 158
Query: 206 VIDRGNAALRQISLN 220
+ D GN +R+I ++
Sbjct: 159 IADCGNRLIREIQID 173
>gi|302768701|ref|XP_002967770.1| hypothetical protein SELMODRAFT_17931 [Selaginella moellendorffii]
gi|300164508|gb|EFJ31117.1| hypothetical protein SELMODRAFT_17931 [Selaginella moellendorffii]
Length = 128
Score = 92.8 bits (229), Expect = 3e-16, Method: Composition-based stats.
Identities = 54/125 (43%), Positives = 78/125 (62%), Gaps = 11/125 (8%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD--AGVTTIAG 171
+AGS G +G+VDG ++ FN P+ + + D G ++VADT NLAIRKI G+TTIAG
Sbjct: 9 LAGSLVGESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKISKDGEGMTTIAG 68
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC---SLLVIDRGNAALRQISLNQ-DDCEYQ 227
G S GF DGP + A+FS++F R C SLL+ DRGN +R+I ++ C+
Sbjct: 69 GSSRKPGFADGPGDTARFSSEF-----RLACSCGSLLIADRGNRLIREIQIDDPKSCDSS 123
Query: 228 YNSIS 232
+++S
Sbjct: 124 DSAVS 128
>gi|421861496|ref|ZP_16293498.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
gi|410828922|dbj|GAC43935.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
Length = 575
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 75/145 (51%), Gaps = 4/145 (2%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + ++GEL D N I + + VA Q Y DGK EARFN
Sbjct: 92 PSGLLAGKNGELLIADTYNHLIRRADAAGQVSTLAGQVAKMRQQYGSWTDGKGTEARFNQ 151
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D +GNLY+AD N IRK+ +G VTT+AG S +AG+RDG +A+F+ DV
Sbjct: 152 PMGMAEDRQGNLYIADAGNHVIRKLDKSGRVTTVAG--SGLAGWRDGTGSEARFNEPRDV 209
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V SL V D N LR+I N
Sbjct: 210 A-VAEDGSLYVADALNHVLRRIDAN 233
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 73/157 (46%), Gaps = 19/157 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G L+ D N I K L + R VAGS G G DG +EARFN
Sbjct: 152 PMGMAEDRQGNLYIADAGNHVIRK----LDKSGRVTTVAGS--GLAGWRDGTGSEARFNE 205
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN----------VAG--FRDGPS 184
P+ V + + G+LYVAD LN +R+I G T +S AG + DG
Sbjct: 206 PRDVAVAEDGSLYVADALNHVLRRIDANGNVTTLNARSKRIVEYAPGAVAAGGDYADGKL 265
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
++KF+ + + + L+V D GN LR ++L Q
Sbjct: 266 GESKFNEPSSLAFT-SSGDLVVSDTGNQRLRLVNLKQ 301
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 55/108 (50%), Gaps = 8/108 (7%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG- 172
AGS G +G VDG+ A F +P G+ G L +ADT N IR+ AG V+T+AG
Sbjct: 72 AGS--GVSGSVDGRAERAEFRYPSGLLAGKNGELLIADTYNHLIRRADAAGQVSTLAGQV 129
Query: 173 ---KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+ + DG +A+F+ + R +L + D GN +R++
Sbjct: 130 AKMRQQYGSWTDGKGTEARFNQPMGMAEDR-QGNLYIADAGNHVIRKL 176
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 48/95 (50%), Gaps = 3/95 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G+ D + +EA FN P G+ + +G + VAD N AIR + + V T+ G ++G
Sbjct: 330 GYRDSEASEALFNGPAGIAITAEGGIIVADRWNHAIRYLFNGKVYTLG--GGGGTGHQNG 387
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+E A F +V V ++ V D N ++R I
Sbjct: 388 WAEQATFREPVNVA-VLSNGTIAVADGFNNSIRLI 421
>gi|302821973|ref|XP_002992647.1| hypothetical protein SELMODRAFT_7079 [Selaginella moellendorffii]
gi|300139611|gb|EFJ06349.1| hypothetical protein SELMODRAFT_7079 [Selaginella moellendorffii]
Length = 124
Score = 85.1 bits (209), Expect = 7e-14, Method: Composition-based stats.
Identities = 52/121 (42%), Positives = 76/121 (62%), Gaps = 9/121 (7%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGK 173
AGSF G +G+VDG ++ FN P+ + + D G ++VADT NLAIRKI D VTTIAGG
Sbjct: 10 AGSFVGESGYVDGPAAKSLFNRPQSLAICDNGAVFVADTRNLAIRKISKDGEVTTIAGGS 69
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTC-SLLVIDRGNAALRQISLNQ-DDCEYQYNSI 231
S GF + A+FS +F + +C SLL+ DRGN +R+I ++ + C+ +++
Sbjct: 70 SRKPGF---AGDTARFSGEFSLAC---SCGSLLIADRGNRLIREIQIDDPNSCDSSDSAV 123
Query: 232 S 232
S
Sbjct: 124 S 124
>gi|182414082|ref|YP_001819148.1| NHL repeat-containing protein [Opitutus terrae PB90-1]
gi|177841296|gb|ACB75548.1| NHL repeat containing protein [Opitutus terrae PB90-1]
Length = 963
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 56/147 (38%), Positives = 79/147 (53%), Gaps = 11/147 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITP-----PLSQYSRGRLVAGSFQGYTGHVDGKPNE 131
P + V DG +F D +N+ I ++TP PL + VAG G TG DG ++
Sbjct: 284 PAGLAVDRDGNIFVADSLNNTIRRVTPLNGPAPLGVVTT---VAGQ-AGVTGSADGVGSQ 339
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
ARFN P G+ +D GN++VAD N IRKI +G T G+++V G DGP A+F N
Sbjct: 340 ARFNLPYGIAVDAAGNIFVADLGNTTIRKIAPSGAVTTLAGEASV-GTADGPGPMARF-N 397
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ V V + V D NA +R+I+
Sbjct: 398 YPNGVAVDLAGNTYVADTFNATIRKIT 424
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/114 (39%), Positives = 56/114 (49%), Gaps = 6/114 (5%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+PY I V G +F D N+ I KI P G + + + G DG ARFN
Sbjct: 344 LPYGIAVDAAGNIFVADLGNTTIRKIAP------SGAVTTLAGEASVGTADGPGPMARFN 397
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
+P GV +D GN YVADT N IRKI AGV + G + G DG A+F
Sbjct: 398 YPNGVAVDLAGNTYVADTFNATIRKITPAGVVSTLAGAAGQIGSADGTGSAARF 451
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 73/142 (51%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V+ DG L+ DE NS I +ITP + +AGS G +DG ARF
Sbjct: 508 PNGLAVATDGTLYVADEENSTIRQITPDGMVST----LAGS-PAQRGGIDGTGTAARFVQ 562
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+T+D GNLYV+D + +RKI AG T G+ +AG DG A+F+ +
Sbjct: 563 PAGLTIDAAGNLYVSDRGDFTVRKITPAGEVTTVAGQHGIAGGADGTGSAAQFAYAGGIA 622
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
R +L V D N +RQI+
Sbjct: 623 IDR-RGTLYVADSNN-RIRQIT 642
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 40/94 (42%), Positives = 48/94 (51%), Gaps = 9/94 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
P V + G LF D N I KITP +S ++ G G VDG N ARF
Sbjct: 65 PSSAAVDQAGNLFVADTTNHTIRKITPSGTVSTFA-------GMGGQPGSVDGTGNAARF 117
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
P GV +D+ GNLYVAD+ N IRKI GV +
Sbjct: 118 LSPHGVALDEAGNLYVADSGNNTIRKITPTGVVS 151
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 82/170 (48%), Gaps = 26/170 (15%)
Query: 65 ETVIEGNEIGVV-------PYKIRVSEDGELFAVDEVNSNIVKITPP---LSQYSRGRLV 114
+T I GN G V P I V G L+ ++ ++ I ITP L+ LV
Sbjct: 211 KTGIRGNTNGTVDTALFALPRNIAVFR-GNLYVTEQESAAIRWITPTGVVLTLAGDPDLV 269
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI----GDA--GVTT 168
G DG +ARF+ P G+ +D GN++VAD+LN IR++ G A GV T
Sbjct: 270 --------GSADGTGGDARFSSPAGLAVDRDGNIFVADSLNNTIRRVTPLNGPAPLGVVT 321
Query: 169 IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G++ V G DG A+F+ + + V ++ V D GN +R+I+
Sbjct: 322 TVAGQAGVTGSADGVGSQARFNLPYGIA-VDAAGNIFVADLGNTTIRKIA 370
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 71/144 (49%), Gaps = 11/144 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G + D N+ I KITP + +AG+ G G DG + ARF
Sbjct: 399 PNGVAVDLAGNTYVADTFNATIRKITPAGVVST----LAGA-AGQIGSADGTGSAARFEF 453
Query: 137 PKGVTMDDKGNLYVADTLNLA-IRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF-D 194
P G+ +D GN+Y T N A +RKI AGV T G S G DGP A+F+ F +
Sbjct: 454 PLGIAVDRAGNVYT--TANSATVRKITPAGVVTTIAGVSGNFGSADGPGLAARFA--FPN 509
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ V +L V D N+ +RQI+
Sbjct: 510 GLAVATDGTLYVADEENSTIRQIT 533
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 68/142 (47%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + + E G L+ D N+ I KITP + + + ARFNH
Sbjct: 120 PHGVALDEAGNLYVADSGNNTIRKITPTGVVSTLAGQAGAAGSADG-----DGSAARFNH 174
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P GVT G L+VADT N IR I AG + GK+ + G +G + A F+ ++
Sbjct: 175 PTGVTAYPDGTLFVADTQNHVIRTITPAGRVSTFAGKTGIRGNTNGTVDTALFALPRNIA 234
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
R +L V ++ +AA+R I+
Sbjct: 235 VFR--GNLYVTEQESAAIRWIT 254
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 48/98 (48%), Gaps = 1/98 (1%)
Query: 121 YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR 180
+ G +DG+ A F P +D GNL+VADT N IRKI +G + G G
Sbjct: 49 FFGGLDGQGRAAGFTTPSSAAVDQAGNLFVADTTNHTIRKITPSGTVSTFAGMGGQPGSV 108
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
DG A+F + V + +L V D GN +R+I+
Sbjct: 109 DGTGNAARFLSPHGVA-LDEAGNLYVADSGNNTIRKIT 145
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 66/143 (46%), Gaps = 9/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G ++ NS V+ P + V+G+F G DG ARF
Sbjct: 454 PLGIAVDRAGNVYT--TANSATVRKITPAGVVTTIAGVSGNF----GSADGPGLAARFAF 507
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ + G LYVAD N IR+I D V+T+AG + G DG A+F +
Sbjct: 508 PNGLAVATDGTLYVADEENSTIRQITPDGMVSTLAGSPAQRGGI-DGTGTAARFVQPAGL 566
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ +L V DRG+ +R+I+
Sbjct: 567 T-IDAAGNLYVSDRGDFTVRKIT 588
>gi|302821467|ref|XP_002992396.1| hypothetical protein SELMODRAFT_135182 [Selaginella moellendorffii]
gi|300139812|gb|EFJ06546.1| hypothetical protein SELMODRAFT_135182 [Selaginella moellendorffii]
Length = 161
Score = 80.5 bits (197), Expect = 2e-12, Method: Composition-based stats.
Identities = 50/122 (40%), Positives = 75/122 (61%), Gaps = 10/122 (8%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD--AGVTTIAGG 172
AGSF G +G+VDG ++ FN P+ +++ D G ++VADT NLAIRKI VTTIAGG
Sbjct: 24 AGSFVGESGYVDGPAAKSLFNRPQSLSICDNGAVFVADTRNLAIRKISKFLPTVTTIAGG 83
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTC-SLLVIDRGNAALRQISLNQ-DDCEYQYNS 230
S GF + A+FS++F + +C SLL+ D GN +R+I ++ C+ ++
Sbjct: 84 SSRKPGF---AGDTARFSSEFSLAC---SCGSLLIADCGNRLIREIQIDDPKSCDSSDSA 137
Query: 231 IS 232
+S
Sbjct: 138 VS 139
>gi|302791603|ref|XP_002977568.1| hypothetical protein SELMODRAFT_417419 [Selaginella moellendorffii]
gi|300154938|gb|EFJ21572.1| hypothetical protein SELMODRAFT_417419 [Selaginella moellendorffii]
Length = 288
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 102/192 (53%), Gaps = 15/192 (7%)
Query: 58 FENGYLVETVIEGNEIGVVPYKIRVS-EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG 116
F +G+ V+T+ + + V + + + + A D + I K+ PLSQ S AG
Sbjct: 35 FCSGFTVKTLFDLGKHSVNAFGLYPDPRNKFMLAPDSSGNRIWKLRLPLSQNSSLEAFAG 94
Query: 117 SFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNV 176
GY +DG ++ FN P+ + M G ++VADT +AIRKI + VTTIAGG S
Sbjct: 95 WESGY---IDGPAAKSLFNRPQSLAMCGNGAIFVADT-RMAIRKI--SKVTTIAGGSSRK 148
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTC-SLLVIDRGNAALRQISLNQ-DDCEYQYNSISPT 234
G DG A+FS++F + +C SL + DRGN +R+I ++ C+ +++S +
Sbjct: 149 PGIADGT---ARFSSEFSLAC---SCGSLPIADRGNWLIREIQIDDPKSCDSSDSTVSGS 202
Query: 235 DILMVVGAVLVG 246
+V+ V +G
Sbjct: 203 QKCVVLLGVCLG 214
>gi|428179226|gb|EKX48098.1| hypothetical protein GUITHDRAFT_69001 [Guillardia theta CCMP2712]
Length = 334
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 77/142 (54%), Gaps = 11/142 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I S++G+L+ D+ N I K+T +G + + G G+ DGK +AR N
Sbjct: 50 PVGIVESKEGDLYLCDQENHCIRKVT------RKGEVTTLAGNGEEGYHDGKGKDARLNI 103
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ MD +GN+ AD+ N IRK+ D VTTIAG K GF+DGP+ A F N
Sbjct: 104 PTGLCMDAQGNIIFADSGNQRIRKVSPDGTVTTIAGSKK---GFKDGPAGKALF-NYPAY 159
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
V V S+ V D GN +R+I
Sbjct: 160 VAVDSKGSIFVSDFGNHCIRKI 181
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 76/144 (52%), Gaps = 11/144 (7%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P + + G + D N I K++P + + +AGS +G+ DG +A FN
Sbjct: 103 IPTGLCMDAQGNIIFADSGNQRIRKVSPDGTVTT----IAGSKKGFK---DGPAGKALFN 155
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
+P V +D KG+++V+D N IRKI G+ VTT+AG + G+ DG A+F N
Sbjct: 156 YPAYVAVDSKGSIFVSDFGNHCIRKIDGEGMVTTVAG--NGKMGWADGKGAKARF-NSPQ 212
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ + ++ + D GN +R++S
Sbjct: 213 GMCIDKEDTVYIADYGNQRIRKMS 236
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 67/143 (46%), Gaps = 10/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G +F D N I KI G + + G G DGK +ARFN
Sbjct: 157 PAYVAVDSKGSIFVSDFGNHCIRKID------GEGMVTTVAGNGKMGWADGKGAKARFNS 210
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+G+ +D + +Y+AD N IRK+ G V T+AG S GF G + A+F
Sbjct: 211 PQGMCIDKEDTVYIADYGNQRIRKMSKEGEVVTVAG--SGEPGFAHGHGQLARFRGPRS- 267
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V V + V DR N +R+IS
Sbjct: 268 VSVSQDGIVYVGDRENFRVRKIS 290
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 53/94 (56%), Gaps = 4/94 (4%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPS 184
DG ++ARFN P G+ +G+LY+ D N IRK+ G VTT+AG + G+ DG
Sbjct: 39 DGVGDQARFNSPVGIVESKEGDLYLCDQENHCIRKVTRKGEVTTLAG--NGEEGYHDGKG 96
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+DA+ N + + +++ D GN +R++S
Sbjct: 97 KDARL-NIPTGLCMDAQGNIIFADSGNQRIRKVS 129
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 46/110 (41%), Gaps = 6/110 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + ++ ++ D N I K++ G +V + G G G ARF
Sbjct: 211 PQGMCIDKEDTVYIADYGNQRIRKMSK------EGEVVTVAGSGEPGFAHGHGQLARFRG 264
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSED 186
P+ V++ G +YV D N +RKI + G G + G D ED
Sbjct: 265 PRSVSVSQDGIVYVGDRENFRVRKISEDGYVWTFAGMGKLGGCGDDACED 314
>gi|296136309|ref|YP_003643551.1| NHL repeat containing protein [Thiomonas intermedia K12]
gi|295796431|gb|ADG31221.1| NHL repeat containing protein [Thiomonas intermedia K12]
Length = 366
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 53/144 (36%), Positives = 80/144 (55%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P + V+ G ++ D NS + +I+ ++G + +AGS G TG DG+ +ARF
Sbjct: 172 PVGLGVNAKGVVYVADAYNSTVRRIS------AKGVVSTLAGS-PGDTGWRDGRGAQARF 224
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
N P G+T+D +G +YV++ N IRKI G T GK GF DG E+A+F +
Sbjct: 225 NTPVGLTLDAQGQIYVSEYFNNVIRKITPDGTVTTFAGKPGKGGFADGKVEEAQFLHPQT 284
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ + P SL+V D GN +R+IS
Sbjct: 285 LSFA-PDGSLIVADTGNNRVRRIS 307
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 61/115 (53%), Gaps = 5/115 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V+ DG LF V + SN V+ +S+ R +AG + G DG +ARFNH
Sbjct: 117 PECVAVATDGTLF-VSDSGSNTVRC---ISREGRVSTLAGKLE-VEGFADGTGQQARFNH 171
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
P G+ ++ KG +YVAD N +R+I GV + G G+RDG A+F+
Sbjct: 172 PVGLGVNAKGVVYVADAYNSTVRRISAKGVVSTLAGSPGDTGWRDGRGAQARFNT 226
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 69/125 (55%), Gaps = 8/125 (6%)
Query: 101 ITPPLSQYSRGRL-VAGSFQGYTGHVD------GKPNEARFNHPKGVTMDDKGNLYVADT 153
++P +SQ + + AG+ + + G + G +ARF+ P+G+ +D +GN++VAD
Sbjct: 19 LSPLISQTAFAAMQAAGTLEVFAGAIGPGMYTGGDFAQARFHDPRGMALDAQGNIFVADY 78
Query: 154 LNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 213
+N +RK+G G +I G+ R+GP+ A+F + + V V +L V D G+
Sbjct: 79 VNSVVRKLGTDGQVSIVAGQVEQRDARNGPALQARFYSP-ECVAVATDGTLFVSDSGSNT 137
Query: 214 LRQIS 218
+R IS
Sbjct: 138 VRCIS 142
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 67/142 (47%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G +F D VNS + K L + +VAG + +G +ARF
Sbjct: 62 PRGMALDAQGNIFVADYVNSVVRK----LGTDGQVSIVAGQVEQRDAR-NGPALQARFYS 116
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+ V + G L+V+D+ + +R I G + GK V GF DG + A+F++ +
Sbjct: 117 PECVAVATDGTLFVSDSGSNTVRCISREGRVSTLAGKLEVEGFADGTGQQARFNHPVG-L 175
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V + V D N+ +R+IS
Sbjct: 176 GVNAKGVVYVADAYNSTVRRIS 197
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 52/96 (54%), Gaps = 6/96 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G+++ + N+ I KITP + + AG G G DGK EA+F H
Sbjct: 227 PVGLTLDAQGQIYVSEYFNNVIRKITPDGTVTT----FAGK-PGKGGFADGKVEEAQFLH 281
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
P+ ++ G+L VADT N +R+I G V+T+AG
Sbjct: 282 PQTLSFAPDGSLIVADTGNNRVRRISPQGEVSTLAG 317
>gi|194364186|ref|YP_002026796.1| SMP-30/gluconolaconase/LRE domain-containing protein
[Stenotrophomonas maltophilia R551-3]
gi|194346990|gb|ACF50113.1| SMP-30/Gluconolaconase/LRE domain protein [Stenotrophomonas
maltophilia R551-3]
Length = 693
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 77/145 (53%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + S DG ++ D ++N ++ P GR+ + QG G +DG +A FN
Sbjct: 72 PYALLRSADGSIYFTDAGDNNRIRRRQP-----DGRIETVAGQG-EGRIDGPALQASFNT 125
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D +GNLYVADT N AIR+IG D VTT+AGG+ G+ DGP+ A+F +
Sbjct: 126 PSGIAADAQGNLYVADTGNHAIRRIGTDGQVTTLAGGEQ---GYADGPAAQARFDAPMGI 182
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V + V D N +R I +
Sbjct: 183 A-VDAQGQVYVADTFNDRIRVIGTD 206
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 73/147 (49%), Gaps = 15/147 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P I G L+ D N I +I + G++ +AG QGY DG +ARF
Sbjct: 126 PSGIAADAQGNLYVADTGNHAIRRIG------TDGQVTTLAGGEQGYA---DGPAAQARF 176
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ P G+ +D +G +YVADT N IR IG D V T+AGG+ G DG A+F
Sbjct: 177 DAPMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGGER--PGLADGLGVAARFDTPV 234
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
+ + +LLV D N A+R++ +
Sbjct: 235 ALAF-DAHGALLVADLFNNAVRRVGAD 260
Score = 41.6 bits (96), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 44/97 (45%), Gaps = 7/97 (7%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P I V G+++ D N I I R +AG + G DG ARF+
Sbjct: 178 APMGIAVDAQGQVYVADTFNDRIRVI----GTDGNVRTLAGGER--PGLADGLGVAARFD 231
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG 171
P + D G L VAD N A+R++G D V+T+ G
Sbjct: 232 TPVALAFDAHGALLVADLFNNAVRRVGADGMVSTLLG 268
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 57/120 (47%), Gaps = 10/120 (8%)
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL--NLAIRKI 161
PL+ ++ ++AG G+ G DG +ARF P + G++Y D N R+
Sbjct: 41 PLAWTAQIEMLAGD--GHPGDRDGASAQARFADPYALLRSADGSIYFTDAGDNNRIRRRQ 98
Query: 162 GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
D + T+AG G+ + DGP+ A F N + +L V D GN A+R+I +
Sbjct: 99 PDGRIETVAGQGEGRI----DGPALQASF-NTPSGIAADAQGNLYVADTGNHAIRRIGTD 153
>gi|329908044|ref|ZP_08274804.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
IMCC9480]
gi|327546777|gb|EGF31711.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
IMCC9480]
Length = 642
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/145 (38%), Positives = 74/145 (51%), Gaps = 11/145 (7%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
+P+ + G + D N I KITP +SQ +AGS G G VDG A
Sbjct: 344 LPWAVTSDATGNWYVADAGNYMIRKITPAGVVSQ------LAGS--GKWGSVDGTGAGAS 395
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
F+ PKG+ D GN++VADT N IRKI AGV T G + G DGP A+FS
Sbjct: 396 FSAPKGIVADPLGNVFVADTYNSTIRKITPAGVVTTIAGAPSSTGSTDGPGNLARFSGP- 454
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
+ + + +L V D GN +R+IS
Sbjct: 455 EAIAIDAQRNLYVGDTGNHTIRKIS 479
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 73/145 (50%), Gaps = 12/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITP-PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P I + G L+ D VN I K++P + GR G G DG+ + ARF
Sbjct: 79 PESIAIDRAGMLYVADSVNHTIRKVSPQGVVTTLAGR------AGEPGSADGRGSAARFF 132
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
PKGV +D GN+ V+D N IRKI D VTT+AG N + DG A+F+ F
Sbjct: 133 DPKGVAVDVAGNVVVSDNANHTIRKISPDGNVTTLAGSARNASAL-DGIGSAARFA--FP 189
Query: 195 VVYVRPTC-SLLVIDRGNAALRQIS 218
V ++ V+DRGN LR+I+
Sbjct: 190 QALVTDAARNVYVVDRGNGLLRKIT 214
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 72/144 (50%), Gaps = 9/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G + D+ N+ I K+ P S + +AG+ G VDG+ ARF
Sbjct: 23 PGSMVVDSAGNRYIADQNNNQIRKVRPDGSVST----LAGASASTPGAVDGRGAAARFYS 78
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD-- 194
P+ + +D G LYVAD++N IRK+ GV T G++ G DG A+F FD
Sbjct: 79 PESIAIDRAGMLYVADSVNHTIRKVSPQGVVTTLAGRAGEPGSADGRGSAARF---FDPK 135
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
V V +++V D N +R+IS
Sbjct: 136 GVAVDVAGNVVVSDNANHTIRKIS 159
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 56/115 (48%), Gaps = 5/115 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G +F D NS I KITP + +AG+ TG DG N ARF+
Sbjct: 399 PKGIVADPLGNVFVADTYNSTIRKITPAGVVTT----IAGAPS-STGSTDGPGNLARFSG 453
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
P+ + +D + NLYV DT N IRKI +GV + G G DG A+ ++
Sbjct: 454 PEAIAIDAQRNLYVGDTGNHTIRKISTSGVVSTLAGSPGRYGSEDGTGAAARLAS 508
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 49/94 (52%), Gaps = 2/94 (2%)
Query: 125 VDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPS 184
V+G +ARFN P VT D GN YVAD N IRKI AGV + G S G DG
Sbjct: 333 VNGNGAQARFNLPWAVTSDATGNWYVADAGNYMIRKITPAGVVSQLAG-SGKWGSVDGTG 391
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
A FS +V P ++ V D N+ +R+I+
Sbjct: 392 AGASFSAPKGIV-ADPLGNVFVADTYNSTIRKIT 424
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 2/100 (2%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAG 178
G +G+++G + ARF +P + +D GN Y+AD N IRK+ D V+T+AG ++ G
Sbjct: 6 GGSGNLNGTGSLARFAYPGSMVVDSAGNRYIADQNNNQIRKVRPDGSVSTLAGASASTPG 65
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
DG A+F + + R L V D N +R++S
Sbjct: 66 AVDGRGAAARFYSPESIAIDRAGM-LYVADSVNHTIRKVS 104
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 51/113 (45%), Gaps = 7/113 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + L+ D N I KI S +AGS G G DG AR
Sbjct: 454 PEAIAIDAQRNLYVGDTGNHTIRKI----STSGVVSTLAGS-PGRYGSEDGTGAAARLAS 508
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
P+ +++D GN+YV A+RKI AGV T G++ G D EDA+F
Sbjct: 509 PRSMSVDQAGNVYVISYR--AVRKITPAGVVTTWAGQALAYGNVDAVGEDARF 559
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 7/106 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G + D N I KI+P + + +AGS + + +DG + ARF
Sbjct: 134 PKGVAVDVAGNVVVSDNANHTIRKISPDGNVTT----LAGSARNAS-ALDGIGSAARFAF 188
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRD 181
P+ + D N+YV D N +RKI AG VTT+A G + V F+D
Sbjct: 189 PQALVTDAARNVYVVDRGNGLLRKITPAGIVTTLASGVNGV-NFKD 233
Score = 42.0 bits (97), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 70/143 (48%), Gaps = 18/143 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
P + V + G ++ + + KITP ++ + AG Y G+VD +ARF
Sbjct: 509 PRSMSVDQAGNVYVISY--RAVRKITPAGVVTTW------AGQALAY-GNVDAVGEDARF 559
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSND 192
+ +T D GN+YV+DT IRKI D VTT+AG ++ G R G + A F+
Sbjct: 560 GYLLALTADAAGNVYVSDTAATTIRKI-DPWRRVTTVAGSTGSI-GIRTG-ALPASFAFP 616
Query: 193 FDVVYVRPTCSLLVIDRGNAALR 215
+ P +L VID NA LR
Sbjct: 617 SALAVTGP-GTLAVIDE-NALLR 637
>gi|408823744|ref|ZP_11208634.1| NHL repeat-containing protein [Pseudomonas geniculata N1]
Length = 681
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 77/145 (53%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + S DG ++ D ++N ++ P GR+ + QG G VDG +A FN
Sbjct: 60 PYALLRSADGSVYFTDAGDNNRIRRRLP-----DGRVETVAGQG-EGRVDGPALQASFNT 113
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D +GNLYVADT N AIR+IG D VTT+AGG+ G+ DGP+ A+F +
Sbjct: 114 PSGIAADAQGNLYVADTGNHAIRRIGTDGQVTTLAGGEQ---GYADGPAAQARFDAPMGI 170
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V + V D N +R I +
Sbjct: 171 A-VDAQGQVYVADTYNDRIRVIGTD 194
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/147 (35%), Positives = 73/147 (49%), Gaps = 15/147 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P I G L+ D N I +I + G++ +AG QGY DG +ARF
Sbjct: 114 PSGIAADAQGNLYVADTGNHAIRRIG------TDGQVTTLAGGEQGYA---DGPAAQARF 164
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ P G+ +D +G +YVADT N IR IG D V T+AGG+ G DG A+F
Sbjct: 165 DAPMGIAVDAQGQVYVADTYNDRIRVIGTDGNVRTLAGGER--PGMADGVGAAARFDTPV 222
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
+ + +LLV D N A+R+I +
Sbjct: 223 ALAF-DAQGALLVADLFNNAVRRIGAD 248
Score = 45.1 bits (105), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 48/102 (47%), Gaps = 9/102 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G+++ D N I I R +AG + G DG ARF+
Sbjct: 167 PMGIAVDAQGQVYVADTYNDRIRVI----GTDGNVRTLAGGER--PGMADGVGAAARFDT 220
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTI--AGGKSN 175
P + D +G L VAD N A+R+IG D V+T+ AGG N
Sbjct: 221 PVALAFDAQGALLVADLFNNAVRRIGADGTVSTVVAAGGVIN 262
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 58/120 (48%), Gaps = 10/120 (8%)
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL--NLAIRKI 161
PL+ ++ +AG G+ G+ DG +ARF P + G++Y D N R++
Sbjct: 29 PLAWTAQIEPLAGD--GHPGNRDGAAAQARFADPYALLRSADGSVYFTDAGDNNRIRRRL 86
Query: 162 GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
D V T+AG G+ V DGP+ A F N + +L V D GN A+R+I +
Sbjct: 87 PDGRVETVAGQGEGRV----DGPALQASF-NTPSGIAADAQGNLYVADTGNHAIRRIGTD 141
>gi|182412380|ref|YP_001817446.1| immunoglobulin I-set domain-containing protein [Opitutus terrae
PB90-1]
gi|177839594|gb|ACB73846.1| Immunoglobulin I-set domain protein [Opitutus terrae PB90-1]
Length = 1130
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/139 (37%), Positives = 66/139 (47%), Gaps = 7/139 (5%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ V G +F VD S I KI+ + G G TG DG ARF P G
Sbjct: 494 LAVDSSGNIFVVDRGASTIRKISQGIVTTFAG------MPGETGQDDGAGAAARFRDPMG 547
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 199
+ +D NLYVADT N IRK+ AGV T G ++ G DGP A F N + + +
Sbjct: 548 IVIDGADNLYVADTNNWKIRKVTPAGVVTTFAGHTSTQGANDGPIGIASFFNPYGLA-IG 606
Query: 200 PTCSLLVIDRGNAALRQIS 218
P +L V+D LR IS
Sbjct: 607 PNGALYVVDLAGDTLRMIS 625
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 71/142 (50%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V+ G +F D I ++ Q AG G G DG ++ARF
Sbjct: 182 PRGIAVNATGVIFVSDRSAHTIRRV----DQLGNVSTWAG-HGGSAGSADGPGDQARFRD 236
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+G+ +D GN+YVAD N IRKI AG T G + +GF DGP+ +A+F +
Sbjct: 237 PEGLAIDAAGNVYVADINNHTIRKINPAGEVTTLAGAAGESGFADGPAANARFFCPTSLA 296
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+ P ++ V D N A+R+IS
Sbjct: 297 -IDPAGAIWVNDAINRAIRKIS 317
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 73/139 (52%), Gaps = 8/139 (5%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ ++ +G ++ D NS I ITP + +AG G +DG + A F++P
Sbjct: 75 VAIAPNGLVYVADLANSTIRAITPAGVVTT----LAG-VANVHGCIDGVGSNALFHNPSA 129
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 198
+ + G+LYVAD+ AIRK+ AGV TT+AGG G+ DGP +A+FS + V
Sbjct: 130 LVVGPSGDLYVADSNGHAIRKVTPAGVVTTLAGGPLRY-GYMDGPGTEAQFSYPRGIA-V 187
Query: 199 RPTCSLLVIDRGNAALRQI 217
T + V DR +R++
Sbjct: 188 NATGVIFVSDRSAHTIRRV 206
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 69/144 (47%), Gaps = 8/144 (5%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
V P ++ DG L+ D I ++T + +AG GY G DG A+F
Sbjct: 381 VQPIGSALAVDGNLYVTDSGGYAIRRVT----RSGEVSTLAG-LLGYPGFRDGSGYAAQF 435
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
+G+T D +GNL V D IRK+ AG T G G DGP+ A+F D
Sbjct: 436 RDLRGITPDKEGNLLVGD--GRTIRKVTLAGAVTTIAGADGEDGDTDGPAASARF-RAVD 492
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ V + ++ V+DRG + +R+IS
Sbjct: 493 GLAVDSSGNIFVVDRGASTIRKIS 516
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G DG ARFN GV + G +YVAD N IR I AGV T G +NV G DG
Sbjct: 58 GSRDGAAAVARFNGANGVAIAPNGLVYVADLANSTIRAITPAGVVTTLAGVANVHGCIDG 117
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+A F N +V V P+ L V D A+R+++
Sbjct: 118 VGSNALFHNPSALV-VGPSGDLYVADSNGHAIRKVT 152
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 70/142 (49%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G+L+ D I K+TP + L G + G++DG EA+F++
Sbjct: 127 PSALVVGPSGDLYVADSNGHAIRKVTPAGVVTT---LAGGPLR--YGYMDGPGTEAQFSY 181
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+G+ ++ G ++V+D IR++ G + G AG DGP + A+F D + +
Sbjct: 182 PRGIAVNATGVIFVSDRSAHTIRRVDQLGNVSTWAGHGGSAGSADGPGDQARF-RDPEGL 240
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+ ++ V D N +R+I+
Sbjct: 241 AIDAAGNVYVADINNHTIRKIN 262
Score = 45.8 bits (107), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G ++ D N I KI P + +AG+ G +G DG ARF
Sbjct: 237 PEGLAIDAAGNVYVADINNHTIRKINPAGEVTT----LAGA-AGESGFADGPAANARFFC 291
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIA 170
P + +D G ++V D +N AIRKI G VTT+A
Sbjct: 292 PTSLAIDPAGAIWVNDAINRAIRKISPEGTVTTVA 326
Score = 45.4 bits (106), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 3/105 (2%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK 173
+AG+ G G DG ARF G+ +D GN++V D IRKI VTT AG
Sbjct: 469 IAGA-DGEDGDTDGPAASARFRAVDGLAVDSSGNIFVVDRGASTIRKISQGIVTTFAGMP 527
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G DG A+F + +V + +L V D N +R+++
Sbjct: 528 GET-GQDDGAGAAARFRDPMGIV-IDGADNLYVADTNNWKIRKVT 570
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 38/77 (49%), Gaps = 1/77 (1%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGG 172
+VAG YT +G ARF P G + GNLYV D+ AIR++ +G + G
Sbjct: 360 VVAGPTDSYTSR-NGVGANARFVQPIGSALAVDGNLYVTDSGGYAIRRVTRSGEVSTLAG 418
Query: 173 KSNVAGFRDGPSEDAKF 189
GFRDG A+F
Sbjct: 419 LLGYPGFRDGSGYAAQF 435
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 49/117 (41%), Gaps = 12/117 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT---GHVDGKPNEAR 133
P I + L+ D N I K+TP V +F G+T G DG A
Sbjct: 545 PMGIVIDGADNLYVADTNNWKIRKVTPAG--------VVTTFAGHTSTQGANDGPIGIAS 596
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 189
F +P G+ + G LYV D +R I D VTT+ G ++ DG A+F
Sbjct: 597 FFNPYGLAIGPNGALYVVDLAGDTLRMISPDGFVTTLGGSSAHRGETADGIGTAARF 653
>gi|374606289|ref|ZP_09679172.1| copper amine oxidase domain-containing protein [Paenibacillus
dendritiformis C454]
gi|374388103|gb|EHQ59542.1| copper amine oxidase domain-containing protein [Paenibacillus
dendritiformis C454]
Length = 564
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 78/149 (52%), Gaps = 9/149 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYS----RGRLVAGSFQGYTGHVDGKPNEA 132
P + ++G L D N ++++ Q S + R A + + DGK EA
Sbjct: 78 PAGLLAGKNGALLIADTYN-HLIRRADAAGQVSTLAGQVRFAAETREPNGSWADGKGTEA 136
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSN 191
RFN P G+ D +GNLY+AD N IRK+ +G VTT+AG S +AG++DG +A+F+
Sbjct: 137 RFNEPMGMAEDRQGNLYIADAANHVIRKLDKSGRVTTVAG--SGLAGWKDGKGAEARFNE 194
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISLN 220
DV V SL V D N +R+I N
Sbjct: 195 PRDVA-VAEDGSLYVADALNHVIRRIDAN 222
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 75/157 (47%), Gaps = 19/157 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G L+ D N I K L + R VAGS G G DGK EARFN
Sbjct: 141 PMGMAEDRQGNLYIADAANHVIRK----LDKSGRVTTVAGS--GLAGWKDGKGAEARFNE 194
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTI----------AGGKSNVAG-FRDGPS 184
P+ V + + G+LYVAD LN IR+I + VTT+ A G AG + DG
Sbjct: 195 PRDVAVAEDGSLYVADALNHVIRRIDANGNVTTLNARSKRIVEYAPGAVTAAGDYADGKL 254
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
+KF+ + + P+ L+V D GN LR + L Q
Sbjct: 255 MVSKFNEPSSLAFT-PSGDLVVSDTGNQRLRLVDLKQ 290
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 52/95 (54%), Gaps = 3/95 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G+ DGK +EA FN P G+ + +G + VAD N AIR + D V T++GG G ++G
Sbjct: 319 GYRDGKASEALFNGPAGIALTGEGGIVVADRWNHAIRYLYDGKVYTLSGGGRT--GHQNG 376
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+E A F DV V ++ V D N ++R I
Sbjct: 377 WAEQATFREPMDVA-VLSNGTIAVADGFNNSIRLI 410
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 11/111 (9%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGK 173
AGS G +G DG+ +A+F +P G+ G L +ADT N IR+ AG V+T+AG
Sbjct: 58 AGS--GVSGSADGRAEQAQFRYPAGLLAGKNGALLIADTYNHLIRRADAAGQVSTLAGQV 115
Query: 174 SNVAGFR-------DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
A R DG +A+F+ + R +L + D N +R++
Sbjct: 116 RFAAETREPNGSWADGKGTEARFNEPMGMAEDR-QGNLYIADAANHVIRKL 165
>gi|426402696|ref|YP_007021667.1| hypothetical protein Bdt_0693 [Bdellovibrio bacteriovorus str.
Tiberius]
gi|425859364|gb|AFY00400.1| hypothetical protein Bdt_0693 [Bdellovibrio bacteriovorus str.
Tiberius]
Length = 692
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 63/116 (54%), Gaps = 7/116 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT-GHVDGKPNEARFN 135
P I + G+LF D N++I KITP LVAGS G + G DG A F
Sbjct: 334 PAGIGIDASGDLFVTDSDNASIRKITPARVV----TLVAGSLAGDSDGSADGTGTAASFF 389
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
P+GV D GNLYVADT+N IRKI +G VTTIAG + G DG A+FS
Sbjct: 390 SPEGVAADPAGNLYVADTMNRTIRKITPSGNVTTIAGSPGQI-GSADGTGAAARFS 444
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 71/142 (50%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I ++ DG + D+ N+ I K+TP G + + G G DG A FN+
Sbjct: 170 PEDITLAADGNFYITDKNNNMIRKMTPA------GVVTTFAGDGTYGCTDGTGAAAHFNY 223
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ D GNL+V + IRKI AGV T G +N G DG A+FS +
Sbjct: 224 PTGIVGDSAGNLFVVCSSCSTIRKITPAGVVTTFAGLANATGALDGTGTAARFSWPIGIT 283
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+ + +L V D GN+A+R+++
Sbjct: 284 -IDSSDNLYVADYGNSAIRKVT 304
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 73/143 (51%), Gaps = 8/143 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ V+EDG ++ DE I K+TP S +AG + +G DG ARF
Sbjct: 446 PTKLTVAEDGNIYIADEYR--IRKLTPGGVVTS----LAGDYD-NSGSADGTGTSARFGG 498
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
G+ D G+LYV+D+ N +RK+ AGV T G+ + G DG A FS +
Sbjct: 499 VAGIASDGAGSLYVSDSGNYTVRKVTLAGVVTTLAGQVGIQGSDDGTGTGATFSRVAGIT 558
Query: 197 YVRPTCSLLVIDRGNAALRQISL 219
V P+ ++ V D N +R+I++
Sbjct: 559 -VTPSGNIFVADTDNNVIRKITV 580
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 71/142 (50%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P V+ G++F VD S I KI+ + G+ G F H DG + ARF++
Sbjct: 61 PRDAVVNAAGDIFIVDSNASVIRKISNGVVSTFAGKF--GVFD----HADGTGDSARFDY 114
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+T+D GNL+V + N IRKI A V T G AG DG A+F+N D+
Sbjct: 115 PTGITIDGSGNLFVTEGNNHTIRKITPAAVVTTVAGSPGNAGTADGTGSAARFNNPEDIT 174
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+ + + D+ N +R+++
Sbjct: 175 -LAADGNFYITDKNNNMIRKMT 195
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 68/145 (46%), Gaps = 12/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY---TGHVDGKPNEAR 133
P I G LF V S I KITP V +F G TG +DG AR
Sbjct: 224 PTGIVGDSAGNLFVVCSSCSTIRKITPAG--------VVTTFAGLANATGALDGTGTAAR 275
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
F+ P G+T+D NLYVAD N AIRK+ + V + G G DG A+F+
Sbjct: 276 FSWPIGITIDSSDNLYVADYGNSAIRKVTSSAVVSNFAGSYGDYGAVDGTGTAARFAGPA 335
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
+ + + L V D NA++R+I+
Sbjct: 336 G-IGIDASGDLFVTDSDNASIRKIT 359
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 73/147 (49%), Gaps = 14/147 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
P I + L+ D NS I K+T +S + AGS+ Y G VDG ARF
Sbjct: 279 PIGITIDSSDNLYVADYGNSAIRKVTSSAVVSNF------AGSYGDY-GAVDGTGTAARF 331
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF- 193
P G+ +D G+L+V D+ N +IRKI A V T+ G ++AG DG ++ + F
Sbjct: 332 AGPAGIGIDASGDLFVTDSDNASIRKITPARVVTLVAG--SLAGDSDGSADGTGTAASFF 389
Query: 194 --DVVYVRPTCSLLVIDRGNAALRQIS 218
+ V P +L V D N +R+I+
Sbjct: 390 SPEGVAADPAGNLYVADTMNRTIRKIT 416
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 8/143 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G L+ D +N I KITP + + +AGS G G DG ARF++
Sbjct: 391 PEGVAADPAGNLYVADTMNRTIRKITPSGNVTT----IAGS-PGQIGSADGTGAAARFSY 445
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P +T+ + GN+Y+AD IRK+ GV T G + +G DG A+F +
Sbjct: 446 PTKLTVAEDGNIYIADEYR--IRKLTPGGVVTSLAGDYDNSGSADGTGTSARFGGVAGIA 503
Query: 197 YVRPTCSLLVIDRGNAALRQISL 219
SL V D GN +R+++L
Sbjct: 504 S-DGAGSLYVSDSGNYTVRKVTL 525
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 66/142 (46%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G LF + N I KITP + VAGS G G DG + ARFN+
Sbjct: 115 PTGITIDGSGNLFVTEGNNHTIRKITPAAVVTT----VAGS-PGNAGTADGTGSAARFNN 169
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+ +T+ GN Y+ D N IRK+ AGV T G G DG A F+ +V
Sbjct: 170 PEDITLAADGNFYITDKNNNMIRKMTPAGVVTTFAGDGTY-GCTDGTGAAAHFNYPTGIV 228
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+L V+ + +R+I+
Sbjct: 229 G-DSAGNLFVVCSSCSTIRKIT 249
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 64/134 (47%), Gaps = 8/134 (5%)
Query: 86 GELFAVDEVNSNIVKIT-PPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
G L+ D N + K+T + G++ G G DG A F+ G+T+
Sbjct: 508 GSLYVSDSGNYTVRKVTLAGVVTTLAGQV------GIQGSDDGTGTGATFSRVAGITVTP 561
Query: 145 KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
GN++VADT N IRKI AGV T G + G DG +A+FS V + +L
Sbjct: 562 SGNIFVADTDNNVIRKITVAGVVTTFAGAAGQGGNDDGMGSNARFSQPH-FVATDSSGNL 620
Query: 205 LVIDRGNAALRQIS 218
V + G A +R+I+
Sbjct: 621 YVAEWGEATIRKIT 634
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 48/99 (48%), Gaps = 18/99 (18%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG---SFQGYTGHV---DGKPNEAR 133
I V+ G +F D N+ I KIT VAG +F G G DG + AR
Sbjct: 557 ITVTPSGNIFVADTDNNVIRKIT-----------VAGVVTTFAGAAGQGGNDDGMGSNAR 605
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG 171
F+ P V D GNLYVA+ IRKI A VTTIAG
Sbjct: 606 FSQPHFVATDSSGNLYVAEWGEATIRKITPSAVVTTIAG 644
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 58/108 (53%), Gaps = 5/108 (4%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
+L+AG F + ++G + ARF P+ ++ G++++ D+ IRKI + V+T A
Sbjct: 37 QLLAGQFD-ASAMINGPLSIARFREPRDAVVNAAGDIFIVDSNASVIRKISNGVVSTFA- 94
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDV-VYVRPTCSLLVIDRGNAALRQIS 218
GK V DG + A+F D+ + + + +L V + N +R+I+
Sbjct: 95 GKFGVFDHADGTGDSARF--DYPTGITIDGSGNLFVTEGNNHTIRKIT 140
>gi|329908045|ref|ZP_08274805.1| hypothetical protein IMCC9480_3485 [Oxalobacteraceae bacterium
IMCC9480]
gi|327546778|gb|EGF31712.1| hypothetical protein IMCC9480_3485 [Oxalobacteraceae bacterium
IMCC9480]
Length = 685
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 73/136 (53%), Gaps = 11/136 (8%)
Query: 86 GELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMD 143
G + VD + I KITP +SQ + G G VDG + A FN+P+G+ D
Sbjct: 395 GNWYVVDTPHHMIRKITPAGVVSQLAG--------NGKPGGVDGTGSGASFNYPRGIVAD 446
Query: 144 DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 203
GNL+VADT N IRKI AGV T G + +G DGP A+F + + + + + +
Sbjct: 447 ALGNLFVADTFNSRIRKITPAGVVTTIAGAGSSSGSTDGPGNIARFF-EPEAIAIDASRN 505
Query: 204 LLVIDRGNAALRQISL 219
L V D GN +R+I++
Sbjct: 506 LYVADTGNHTVRKITV 521
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 72/144 (50%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITP-PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P I + G L+ D N I KITP + GR+ G G DG+ N ARF+
Sbjct: 120 PQSIAIDRAGTLYVADTNNQTIRKITPQGVVTTIAGRV------GVDGSTDGRGNAARFS 173
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVAGFRDGPSEDAKFSNDFD 194
+P+G+ +D G + V+DT N +R I GV T+AG N G DG A+FSN
Sbjct: 174 YPQGIAVDVAGTVLVSDTYNHTVRTISPGGVVGTLAGSAGNF-GVLDGVRSAARFSNPQG 232
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+V ++ V D GN LR+++
Sbjct: 233 LV-TDAARNIYVADAGNGVLRKVT 255
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 67/142 (47%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G + D N I K++ S + +AG+ G DG ARFN+
Sbjct: 65 PIGIAVDGVGNRYVADWQNHVIRKVSADGSVST----LAGAMS-TPGAADGTGTAARFNY 119
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+ + +D G LYVADT N IRKI GV T G+ V G DG A+FS +
Sbjct: 120 PQSIAIDRAGTLYVADTNNQTIRKITPQGVVTTIAGRVGVDGSTDGRGNAARFSYPQGIA 179
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V ++LV D N +R IS
Sbjct: 180 -VDVAGTVLVSDTYNHTVRTIS 200
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 54/113 (47%), Gaps = 5/113 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G LF D NS I KITP + +AG+ +G DG N ARF
Sbjct: 440 PRGIVADALGNLFVADTFNSRIRKITPAGVVTT----IAGAGS-SSGSTDGPGNIARFFE 494
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
P+ + +D NLYVADT N +RKI AGV + G G DG A+F
Sbjct: 495 PEAIAIDASRNLYVADTGNHTVRKITVAGVVSTLAGSPGKYGSDDGTGAAARF 547
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 58/117 (49%), Gaps = 13/117 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGH---VDGKPNEAR 133
P I V G + D N + I+P V G+ G G+ +DG + AR
Sbjct: 175 PQGIAVDVAGTVLVSDTYNHTVRTISP--------GGVVGTLAGSAGNFGVLDGVRSAAR 226
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 189
F++P+G+ D N+YVAD N +RK+ AG VTT+AG +N G +DG A F
Sbjct: 227 FSNPQGLVTDAARNIYVADAGNGVLRKVTPAGIVTTLAGSLANY-GLKDGTGAAAGF 282
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 49/90 (54%), Gaps = 3/90 (3%)
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAK 188
N+ARFN P G+ +D GN YVAD N IRK+ D V+T+AG S G DG A+
Sbjct: 58 NQARFNFPIGIAVDGVGNRYVADWQNHVIRKVSADGSVSTLAGAMST-PGAADGTGTAAR 116
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
F+ + R +L V D N +R+I+
Sbjct: 117 FNYPQSIAIDR-AGTLYVADTNNQTIRKIT 145
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 52/107 (48%), Gaps = 7/107 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP-LSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P I V G + I KITP + GR++ TG +DG + ARF
Sbjct: 550 PPAIAVDPSGNVVLSQPAYGTIRKITPGGVVTTLAGRVL------ETGALDGAGDAARFF 603
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
+P+G+ D+ GN+YVADT N IR+I +G + G + G R G
Sbjct: 604 NPQGLAADNAGNVYVADTGNNTIRRITPSGQVSTVAGVPGLVGIRTG 650
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 48/96 (50%), Gaps = 2/96 (2%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G VDG +ARF +GVT D GN YV DT + IRKI AGV + G G DG
Sbjct: 372 GMVDGNGAKARFGSLQGVTSDASGNWYVVDTPHHMIRKITPAGVVSQLAGNGKPGGV-DG 430
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
A F+ +V +L V D N+ +R+I+
Sbjct: 431 TGSGASFNYPRGIV-ADALGNLFVADTFNSRIRKIT 465
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 61/142 (42%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + L+ D N + KIT + +AGS G G DG ARF
Sbjct: 495 PEAIAIDASRNLYVADTGNHTVRKITVAGVVST----LAGS-PGKYGSDDGTGAAARFLK 549
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P + +D GN+ ++ IRKI GV T G+ G DG + A+F N +
Sbjct: 550 PPAIAVDPSGNVVLSQPAYGTIRKITPGGVVTTLAGRVLETGALDGAGDAARFFNPQGLA 609
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
++ V D GN +R+I+
Sbjct: 610 -ADNAGNVYVADTGNNTIRRIT 630
>gi|373953102|ref|ZP_09613062.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
gi|373889702|gb|EHQ25599.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
Length = 3731
Score = 76.3 bits (186), Expect = 3e-11, Method: Composition-based stats.
Identities = 50/142 (35%), Positives = 72/142 (50%), Gaps = 8/142 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + V G ++ DE+N I KITP G + + G G ++G A F++
Sbjct: 1225 PYDVAVDVTGNVYVADELNQVIRKITPA------GLVTTFAGSGGIGALNGTGTAASFHN 1278
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+T D +GN+YVAD N AIRKI GV T G ++ G DG A F N + V
Sbjct: 1279 PTGITTDAQGNVYVADLYNNAIRKITPGGVVTTLAGTGSI-GSADGVGTSASFYNP-NAV 1336
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
++ V+D N +R+I+
Sbjct: 1337 ATDAVGNIYVVDTYNQLIRKIT 1358
Score = 71.6 bits (174), Expect = 7e-10, Method: Composition-based stats.
Identities = 52/142 (36%), Positives = 73/142 (51%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + G ++ D N I KITP + +AG+ G +G V+G A+FN+
Sbjct: 193 PYSLTTDMQGNVYVADYGNQLIRKITPAGVVTT----LAGTV-GSSGFVNGTGTAAKFNY 247
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+ V D GN+YVAD +N AIRKI AGV T G S V G +G A F N V
Sbjct: 248 PRSVATDAAGNVYVADQVNQAIRKITPAGVVTTFAG-SGVPGALNGTGTAATFYNPTGVT 306
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+ ++ V D N ++R+I+
Sbjct: 307 -MDAQGNVYVADSQNHSIRKIT 327
Score = 71.2 bits (173), Expect = 1e-09, Method: Composition-based stats.
Identities = 54/143 (37%), Positives = 73/143 (51%), Gaps = 9/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIV-KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P+ I G L+ V +VNSN++ KITP G + + G G V+G A FN
Sbjct: 139 PFGIATDVQGNLY-VSDVNSNLIRKITPG------GVVTTLAGSGSAGSVNGTGTAASFN 191
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P +T D +GN+YVAD N IRKI AGV T G +GF +G AKF N
Sbjct: 192 TPYSLTTDMQGNVYVADYGNQLIRKITPAGVVTTLAGTVGSSGFVNGTGTAAKF-NYPRS 250
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V ++ V D+ N A+R+I+
Sbjct: 251 VATDAAGNVYVADQVNQAIRKIT 273
Score = 70.1 bits (170), Expect = 2e-09, Method: Composition-based stats.
Identities = 48/141 (34%), Positives = 70/141 (49%), Gaps = 8/141 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G ++ D+VN I KITP G + + G G ++G A F +
Sbjct: 248 PRSVATDAAGNVYVADQVNQAIRKITPA------GVVTTFAGSGVPGALNGTGTAATFYN 301
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P GVTMD +GN+YVAD+ N +IRKI AGV T G ++ G +G +A F + V
Sbjct: 302 PTGVTMDAQGNVYVADSQNHSIRKITPAGVVTTLAGTGSM-GSANGAGTNASFYYP-NAV 359
Query: 197 YVRPTCSLLVIDRGNAALRQI 217
+L + D N +R+I
Sbjct: 360 VADALGNLYIADTNNHLIRKI 380
Score = 68.9 bits (167), Expect = 5e-09, Method: Composition-based stats.
Identities = 46/141 (32%), Positives = 68/141 (48%), Gaps = 9/141 (6%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHP 137
Y++ ++ +D + I KITP G + + G +G DG A F+ P
Sbjct: 636 YRLTTDAQSNVYVID--GNMIRKITPA------GVVTTLAGSGDSGSADGTGTAASFHTP 687
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 197
+T D +GN+YVAD N IRKI GV G S +GF +G + AKF N +
Sbjct: 688 YDLTTDAQGNVYVADNFNQTIRKITREGVVNTFAGTSGSSGFVNGTAAAAKFKNPIGIA- 746
Query: 198 VRPTCSLLVIDRGNAALRQIS 218
++ V D GN A+R+I+
Sbjct: 747 TDTQGNVYVADNGNLAIRKIT 767
Score = 67.4 bits (163), Expect = 2e-08, Method: Composition-based stats.
Identities = 51/142 (35%), Positives = 73/142 (51%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + G ++ D N+ I KITP G + + G +G DG A FN
Sbjct: 1116 PFGLTTDAQGNIYVSDANNNTIRKITP------SGVVTTFAGSGSSGAADGIGMAASFNS 1169
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ D +GN+YVAD N IRKI GV T G + VAG +G + AKF++ +DV
Sbjct: 1170 PYGLATDAQGNIYVADFGNQVIRKITPDGVVTTFAGTTGVAGNVNGAAAAAKFNSPYDVA 1229
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V T ++ V D N +R+I+
Sbjct: 1230 -VDVTGNVYVADELNQVIRKIT 1250
Score = 65.9 bits (159), Expect = 4e-08, Method: Composition-based stats.
Identities = 55/143 (38%), Positives = 72/143 (50%), Gaps = 11/143 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G LF V++ N I KITP + +AG+ G G +G N A FNH
Sbjct: 1667 PRGMAIDALGNLFVVED-NYLIRKITPDAVVTT----LAGN--GAAGSANGTGNAASFNH 1719
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D GNLYVADT N IRK+ AG VTT AG S A DG A F N
Sbjct: 1720 PWGIVADAAGNLYVADTYNNLIRKVTSAGSVTTFAG--SGAASSVDGTGTAASF-NYPSA 1776
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ + + +L V + +R+IS
Sbjct: 1777 ISIDASGNLYVAELNGNVIRKIS 1799
Score = 65.9 bits (159), Expect = 5e-08, Method: Composition-based stats.
Identities = 41/105 (39%), Positives = 55/105 (52%), Gaps = 9/105 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + G ++ D N I KIT + AG+ G +G V+G A+F +
Sbjct: 687 PYDLTTDAQGNVYVADNFNQTIRKIT----REGVVNTFAGT-SGSSGFVNGTAAAAKFKN 741
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD 181
P G+ D +GN+YVAD NLAIRKI AGV T G +GF+D
Sbjct: 742 PIGIATDTQGNVYVADNGNLAIRKITPAGVVTTLAG----SGFKD 782
Score = 65.5 bits (158), Expect = 6e-08, Method: Composition-based stats.
Identities = 51/144 (35%), Positives = 72/144 (50%), Gaps = 9/144 (6%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
VPY + + G LF D+ + I KITP AGS G G DG A F
Sbjct: 1610 VPYGMAIDAAGNLFVADQFYNQIRKITPD----GLVTTFAGSLTGAPGATDGTGAAATFR 1665
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P+G+ +D GNL+V + N IRKI DA VTT+AG + AG +G A F++ +
Sbjct: 1666 SPRGMAIDALGNLFVVED-NYLIRKITPDAVVTTLAG--NGAAGSANGTGNAASFNHPWG 1722
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+V +L V D N +R+++
Sbjct: 1723 IV-ADAAGNLYVADTYNNLIRKVT 1745
Score = 65.5 bits (158), Expect = 6e-08, Method: Composition-based stats.
Identities = 54/144 (37%), Positives = 73/144 (50%), Gaps = 12/144 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP-LSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + V G ++ D+ N+ I KIT + G+L AGS DG A FN
Sbjct: 2106 PTGLAVDAAGNIYVADQGNNMIRKITAAGVVTTLAGKLTAGS-------ADGVGAAASFN 2158
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P GV +D GN+YVAD LN +RKI D VTT+AG S AG DG A F +
Sbjct: 2159 LPAGVAVDASGNVYVADLLNSMVRKITPDGTVTTLAGSTS--AGSADGTGAAAGFHYPTN 2216
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ V +++V D+ N +R+IS
Sbjct: 2217 -LQVDDQGNIIVADQLNNKIRKIS 2239
Score = 64.3 bits (155), Expect = 1e-07, Method: Composition-based stats.
Identities = 44/120 (36%), Positives = 60/120 (50%), Gaps = 9/120 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ I G L+ D N+ I K+T S G + + G VDG A FN+
Sbjct: 1720 PWGIVADAAGNLYVADTYNNLIRKVT------SAGSVTTFAGSGAASSVDGTGTAASFNY 1773
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P +++D GNLYVA+ IRKI AG VTTIAG S +G +G + A F N + +
Sbjct: 1774 PSAISIDASGNLYVAELNGNVIRKISPAGVVTTIAG--SGASGIANGIGKAATFGNLYSI 1831
Score = 63.9 bits (154), Expect = 2e-07, Method: Composition-based stats.
Identities = 47/142 (33%), Positives = 68/142 (47%), Gaps = 8/142 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G L+ D N I KITP G + + G G +G A FN+
Sbjct: 1062 PSGVTTDASGNLYIADFNNRLIRKITP------SGLVTTFAGSGAAGSENGNGAAASFNN 1115
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+T D +GN+YV+D N IRKI +GV T G S +G DG A F++ + +
Sbjct: 1116 PFGLTTDAQGNIYVSDANNNTIRKITPSGVVTTFAG-SGSSGAADGIGMAASFNSPYGLA 1174
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
++ V D GN +R+I+
Sbjct: 1175 -TDAQGNIYVADFGNQVIRKIT 1195
Score = 62.0 bits (149), Expect = 6e-07, Method: Composition-based stats.
Identities = 45/142 (31%), Positives = 68/142 (47%), Gaps = 8/142 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + V G ++ D N+ I KI+P G + + G G +G A FN+
Sbjct: 85 PFGVAVDASGNVYVADAGNNLIRKISPV------GVVSTFAGSGVAGSANGTGTAASFNN 138
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ D +GNLYV+D + IRKI GV T G S AG +G A F+ + +
Sbjct: 139 PFGIATDVQGNLYVSDVNSNLIRKITPGGVVTTLAG-SGSAGSVNGTGTAASFNTPYSLT 197
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
++ V D GN +R+I+
Sbjct: 198 -TDMQGNVYVADYGNQLIRKIT 218
Score = 61.6 bits (148), Expect = 8e-07, Method: Composition-based stats.
Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 20/143 (13%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P + V G ++ D +NS + KITP + + +AGS G DG A F+
Sbjct: 2159 LPAGVAVDASGNVYVADLLNSMVRKITPDGTVTT----LAGSTSA--GSADGTGAAAGFH 2212
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
+P + +DD+GN+ VAD LN IRKI AGV T G + F+N +DV
Sbjct: 2213 YPTNLQVDDQGNIIVADQLNNKIRKISPAGVVTTIAGPTG-------------FNNPYDV 2259
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ T + V D + +++ IS
Sbjct: 2260 A-ISKTGIIYVADYNSNSIKAIS 2281
Score = 61.2 bits (147), Expect = 1e-06, Method: Composition-based stats.
Identities = 50/142 (35%), Positives = 73/142 (51%), Gaps = 8/142 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V +G + +D++N+ + KITP G + + G +G +G A FNH
Sbjct: 2052 PLDVAVDAEGNTYVLDQLNNLVRKITPA------GVVSTLAGSGSSGSANGAATAATFNH 2105
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D GN+YVAD N IRKI AGV T GK AG DG A F+ V
Sbjct: 2106 PTGLAVDAAGNIYVADQGNNMIRKITAAGVVTTLAGKL-TAGSADGVGAAASFNLPAGVA 2164
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V + ++ V D N+ +R+I+
Sbjct: 2165 -VDASGNVYVADLLNSMVRKIT 2185
Score = 59.3 bits (142), Expect = 4e-06, Method: Composition-based stats.
Identities = 44/106 (41%), Positives = 56/106 (52%), Gaps = 5/106 (4%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG 172
VAGS G G +G A F+ P GVT D GNLY+AD N IRKI +G VTT AG
Sbjct: 1040 VAGSV-GIAGKANGIGTAATFSGPSGVTTDASGNLYIADFNNRLIRKITPSGLVTTFAG- 1097
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
S AG +G A F+N F + ++ V D N +R+I+
Sbjct: 1098 -SGAAGSENGNGAAASFNNPFGLT-TDAQGNIYVSDANNNTIRKIT 1141
Score = 54.3 bits (129), Expect = 1e-04, Method: Composition-based stats.
Identities = 48/145 (33%), Positives = 64/145 (44%), Gaps = 13/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEA---R 133
PY + G ++ D N I KITP V +F G TG A +
Sbjct: 1170 PYGLATDAQGNIYVADFGNQVIRKITPDG--------VVTTFAGTTGVAGNVNGAAAAAK 1221
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
FN P V +D GN+YVAD LN IRKI AG+ T G + G +G A F N
Sbjct: 1222 FNSPYDVAVDVTGNVYVADELNQVIRKITPAGLVTTFAGSGGI-GALNGTGTAASFHNPT 1280
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
+ ++ V D N A+R+I+
Sbjct: 1281 GIT-TDAQGNVYVADLYNNAIRKIT 1304
Score = 53.1 bits (126), Expect = 3e-04, Method: Composition-based stats.
Identities = 41/107 (38%), Positives = 48/107 (44%), Gaps = 11/107 (10%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF--QGYTGHVDGKPNEARFNHPKGVTMD 143
G +F D N I KITP V +F G G +GK A F P G+ +D
Sbjct: 1566 GNMFVADFGNHMIRKITPAT--------VVSTFVGTGSPGSTNGKGTAASFYVPYGMAID 1617
Query: 144 DKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 189
GNL+VAD IRKI G VTT AG + G DG A F
Sbjct: 1618 AAGNLFVADQFYNQIRKITPDGLVTTFAGSLTGAPGATDGTGAAATF 1664
Score = 52.8 bits (125), Expect = 4e-04, Method: Composition-based stats.
Identities = 36/99 (36%), Positives = 50/99 (50%), Gaps = 2/99 (2%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF 179
G G+ + A F+ P GV +D GN+YVAD N IRKI GV + G S VAG
Sbjct: 68 GIAGNTNATGTAATFHSPFGVAVDASGNVYVADAGNNLIRKISPVGVVSTFAG-SGVAGS 126
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+G A F+N F + +L V D + +R+I+
Sbjct: 127 ANGTGTAASFNNPFGIA-TDVQGNLYVSDVNSNLIRKIT 164
Score = 48.1 bits (113), Expect = 0.009, Method: Composition-based stats.
Identities = 32/91 (35%), Positives = 39/91 (42%), Gaps = 6/91 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G ++ D N+ I KITP G + + G G DG A F +
Sbjct: 1279 PTGITTDAQGNVYVADLYNNAIRKITPG------GVVTTLAGTGSIGSADGVGTSASFYN 1332
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVT 167
P V D GN+YV DT N IRKI T
Sbjct: 1333 PNAVATDAVGNIYVVDTYNQLIRKITTGNYT 1363
Score = 45.4 bits (106), Expect = 0.070, Method: Composition-based stats.
Identities = 24/58 (41%), Positives = 30/58 (51%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA 177
G G+ +G A F+ P GV D GN+YV+D N IRKI AGV + G A
Sbjct: 564 GSAGNANGTGTAATFSSPTGVATDPSGNIYVSDYNNNLIRKINLAGVVSTFAGSGTAA 621
Score = 45.1 bits (105), Expect = 0.093, Method: Composition-based stats.
Identities = 38/134 (28%), Positives = 63/134 (47%), Gaps = 4/134 (2%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF--QGYTGHVDGKPNEARFNHPKGVTMDD 144
+++ V+ + +V + + S G V +F G G V+ A FN P V +D
Sbjct: 2000 QIYMVNMAITPLVPVNSGTAVTSAGTAVVTTFAGSGAAGSVNSTGTSATFNGPLDVAVDA 2059
Query: 145 KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
+GN YV D LN +RKI AGV + G + + A F++ + V ++
Sbjct: 2060 EGNTYVLDQLNNLVRKITPAGVVSTLAGSGSSGSANGAATA-ATFNHPTGLA-VDAAGNI 2117
Query: 205 LVIDRGNAALRQIS 218
V D+GN +R+I+
Sbjct: 2118 YVADQGNNMIRKIT 2131
Score = 43.5 bits (101), Expect = 0.26, Method: Composition-based stats.
Identities = 39/145 (26%), Positives = 63/145 (43%), Gaps = 36/145 (24%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P ++V + G + D++N+ I KI+P V + G TG FN+
Sbjct: 2214 PTNLQVDDQGNIIVADQLNNKIRKISPAG--------VVTTIAGPTG----------FNN 2255
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P V + G +YVAD + +I+ I GVTT+A G +N G
Sbjct: 2256 PYDVAISKTGIIYVADYNSNSIKAISPSGGVTTLATGFANPGG----------------- 2298
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V + + V D G+ +R+I++N
Sbjct: 2299 VAIDSRGVIYVADYGHNTIRKITIN 2323
Score = 40.8 bits (94), Expect = 1.4, Method: Composition-based stats.
Identities = 41/142 (28%), Positives = 61/142 (42%), Gaps = 10/142 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G ++ D N+ I KI G + + G V+G A F H
Sbjct: 581 PTGVATDPSGNIYVSDYNNNLIRKINLA------GVVSTFAGSGTAASVNGTGVAASFLH 634
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
+T D + N+YV D N+ IRKI AGV T G + +G DG A F +D+
Sbjct: 635 AYRLTTDAQSNVYVIDG-NM-IRKITPAGVVTTLAGSGD-SGSADGTGTAASFHTPYDLT 691
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
++ V D N +R+I+
Sbjct: 692 -TDAQGNVYVADNFNQTIRKIT 712
Score = 39.3 bits (90), Expect = 4.8, Method: Composition-based stats.
Identities = 39/135 (28%), Positives = 59/135 (43%), Gaps = 11/135 (8%)
Query: 91 VDEVNSNIVKITP-------PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMD 143
VD V I ++P P YS + G+ G +G V+G A N G+ D
Sbjct: 1506 VDTVGVPITALSPVNKGGAVPAKTYSLVSTIVGN--GSSGAVNGTGTAASLNLCDGLVFD 1563
Query: 144 DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 203
GN++VAD N IRKI A V + G + G +G A F + + + +
Sbjct: 1564 LLGNMFVADFGNHMIRKITPATVVSTFVGTGS-PGSTNGKGTAASFYVPYGMA-IDAAGN 1621
Query: 204 LLVIDRGNAALRQIS 218
L V D+ +R+I+
Sbjct: 1622 LFVADQFYNQIRKIT 1636
Score = 38.5 bits (88), Expect = 7.0, Method: Composition-based stats.
Identities = 28/91 (30%), Positives = 43/91 (47%), Gaps = 6/91 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G L+ V E+N N+++ P + +AGS G +G +G A F +
Sbjct: 1774 PSAISIDASGNLY-VAELNGNVIRKISPAGVVTT---IAGS--GASGIANGIGKAATFGN 1827
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVT 167
+ D G++YVAD IRKI G +
Sbjct: 1828 LYSIATDASGDVYVADQYKYIIRKIVGTGYS 1858
>gi|456737758|gb|EMF62435.1| Hypothetical protein EPM1_0109 [Stenotrophomonas maltophilia EPM1]
Length = 693
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 76/145 (52%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + S DG ++ D ++N ++ P GR+ + QG G VDG +A FN
Sbjct: 72 PYALLRSADGSVYFTDAGDNNRIRRRLP-----DGRVETVAGQG-EGRVDGPALQASFNT 125
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D +GNLYVADT N AIR+IG D VTT+AGG+ G DGP+ A+F +
Sbjct: 126 PSGIAADAQGNLYVADTGNHAIRRIGTDGQVTTLAGGEQ---GHADGPAAQARFDAPMGI 182
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V + V D N +R I +
Sbjct: 183 A-VDAQGQVYVADTFNDRIRVIGTD 206
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 72/147 (48%), Gaps = 15/147 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P I G L+ D N I +I + G++ +AG QG H DG +ARF
Sbjct: 126 PSGIAADAQGNLYVADTGNHAIRRIG------TDGQVTTLAGGEQG---HADGPAAQARF 176
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ P G+ +D +G +YVADT N IR IG D V T+AGG G DG A+F
Sbjct: 177 DAPMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGGDR--PGLADGVGATARFDTPV 234
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
+ + +LLV D N A+R++ +
Sbjct: 235 ALAF-DAQGALLVADLFNNAVRRVGAD 260
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 48/103 (46%), Gaps = 9/103 (8%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P I V G+++ D N I I R +AG + G DG ARF+
Sbjct: 178 APMGIAVDAQGQVYVADTFNDRIRVI----GTDGNVRTLAGGDR--PGLADGVGATARFD 231
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTI--AGGKSN 175
P + D +G L VAD N A+R++G D V+T+ AGG N
Sbjct: 232 TPVALAFDAQGALLVADLFNNAVRRVGADGTVSTVVAAGGVIN 274
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 10/120 (8%)
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL--NLAIRKI 161
PL+ ++ +AG G+ G DG +ARF P + G++Y D N R++
Sbjct: 41 PLAWTAQIEPLAGD--GHPGDRDGASAQARFADPYALLRSADGSVYFTDAGDNNRIRRRL 98
Query: 162 GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
D V T+AG G+ V DGP+ A F N + +L V D GN A+R+I +
Sbjct: 99 PDGRVETVAGQGEGRV----DGPALQASF-NTPSGIAADAQGNLYVADTGNHAIRRIGTD 153
>gi|348676369|gb|EGZ16187.1| hypothetical protein PHYSODRAFT_506757 [Phytophthora sojae]
Length = 1981
Score = 76.3 bits (186), Expect = 4e-11, Method: Composition-based stats.
Identities = 48/119 (40%), Positives = 66/119 (55%), Gaps = 9/119 (7%)
Query: 106 SQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
S ++GR+ + +G G +DG ARFNHP+GV +D G +YVADT N IRKI
Sbjct: 197 SGRAQGRVTTVAGEGTAGFLDGPAATARFNHPRGVAVDSNGVVYVADTANHRIRKINPTT 256
Query: 166 --VTTIAGGKSNVAGFRDGPS-EDAKFSNDFDVVYVR----PTCSLLVIDRGNAALRQI 217
V+T+AG + GF DG + A+FS DV + T ++ V D GN +RQI
Sbjct: 257 KMVSTLAG--DGIEGFADGAALSAARFSYPSDVAVLETNGGATVTVFVADTGNHRIRQI 313
Score = 47.8 bits (112), Expect = 0.013, Method: Composition-based stats.
Identities = 28/93 (30%), Positives = 46/93 (49%)
Query: 109 SRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
S R+V + GH+DG+ NEA FN P GV G +YVA + + +R+I A + +
Sbjct: 467 STNRVVTLAGSNVPGHIDGEGNEATFNAPAGVAFAADGRVYVASSTDCTVRQISPASLVS 526
Query: 169 IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 201
+ S A PS A + D ++++ +
Sbjct: 527 RSVTCSTRATQVLRPSGCASYEQPVDELFLQTS 559
Score = 47.8 bits (112), Expect = 0.014, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 37/68 (54%), Gaps = 9/68 (13%)
Query: 121 YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVAGF 179
+ G DG P ARF+ P GV +D G ++VADT N IR+I G T T+AG
Sbjct: 342 HAGLADGDPLGARFDSPMGVAVDADGVVFVADTGNHLIRRIEPNGTTHTLAGSVV----- 396
Query: 180 RDGPSEDA 187
PSEDA
Sbjct: 397 ---PSEDA 401
>gi|424666862|ref|ZP_18103887.1| hypothetical protein A1OC_00420 [Stenotrophomonas maltophilia
Ab55555]
gi|401069531|gb|EJP78052.1| hypothetical protein A1OC_00420 [Stenotrophomonas maltophilia
Ab55555]
Length = 693
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 76/145 (52%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + S DG ++ D ++N ++ P GR+ + QG G VDG +A FN
Sbjct: 72 PYALLRSADGSVYFTDAGDNNRIRRRLP-----DGRVETVAGQG-EGRVDGPALQASFNT 125
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D +GNLYVADT N AIR+IG D VTT+AGG+ G DGP+ A+F +
Sbjct: 126 PSGIAADAQGNLYVADTGNHAIRRIGIDGQVTTLAGGEQ---GHADGPAAQARFDAPMGI 182
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V + V D N +R I +
Sbjct: 183 A-VDAQGQVYVADTFNDRIRVIGTD 206
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 69/145 (47%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G L+ D N I +I + +AG QG H DG +ARF+
Sbjct: 126 PSGIAADAQGNLYVADTGNHAIRRIGIDGQVTT----LAGGEQG---HADGPAAQARFDA 178
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D +G +YVADT N IR IG D V T+AGG G DG A+F +
Sbjct: 179 PMGIAVDAQGQVYVADTFNDRIRVIGTDGNVRTLAGGDR--PGLADGVGAAARFDTPVAL 236
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
+ +LLV D N A+R++ +
Sbjct: 237 AF-DAQGALLVADLFNNAVRRVGAD 260
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 48/102 (47%), Gaps = 9/102 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G+++ D N I I R +AG + G DG ARF+
Sbjct: 179 PMGIAVDAQGQVYVADTFNDRIRVI----GTDGNVRTLAGGDR--PGLADGVGAAARFDT 232
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTI--AGGKSN 175
P + D +G L VAD N A+R++G D V+T+ AGG N
Sbjct: 233 PVALAFDAQGALLVADLFNNAVRRVGADGTVSTVVAAGGVIN 274
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 58/120 (48%), Gaps = 10/120 (8%)
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL--NLAIRKI 161
PL+ ++ +AG G+ G DG +ARF P + G++Y D N R++
Sbjct: 41 PLAWTAQIEPLAGD--GHPGDRDGASAQARFADPYALLRSADGSVYFTDAGDNNRIRRRL 98
Query: 162 GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
D V T+AG G+ V DGP+ A F N + +L V D GN A+R+I ++
Sbjct: 99 PDGRVETVAGQGEGRV----DGPALQASF-NTPSGIAADAQGNLYVADTGNHAIRRIGID 153
>gi|125561542|gb|EAZ06990.1| hypothetical protein OsI_29235 [Oryza sativa Indica Group]
Length = 498
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 56/87 (64%), Gaps = 6/87 (6%)
Query: 167 TTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEY 226
TTIAGG+S G +DGP+++A FS DF++VYV C+LL+ DRGN +RQI+L ++DC
Sbjct: 223 TTIAGGRSKGPGRKDGPAQNATFSPDFELVYVPKMCALLITDRGNRLIRQINLKREDCAR 282
Query: 227 Q------YNSISPTDILMVVGAVLVGY 247
+ S+S +L + +VG+
Sbjct: 283 ETQPGLGTTSVSIIAVLCALLGSVVGF 309
>gi|42522312|ref|NP_967692.1| hypothetical protein Bd0727 [Bdellovibrio bacteriovorus HD100]
gi|39574843|emb|CAE78685.1| hypothetical protein with NHL repeat [Bdellovibrio bacteriovorus
HD100]
Length = 709
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 63/116 (54%), Gaps = 7/116 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT-GHVDGKPNEARFN 135
P + + G LF D N++I K+TP LVAGS G + G DG A F+
Sbjct: 351 PAGVGIDASGNLFVTDSDNASIRKVTPARVV----TLVAGSLAGDSDGSADGTGTAASFH 406
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
P+GV D GNLYVADT+N IRKI +G VTTIAG + G DG A+FS
Sbjct: 407 SPEGVAADPAGNLYVADTMNRTIRKITPSGNVTTIAGSPGQI-GSADGTGAAARFS 461
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 73/143 (51%), Gaps = 8/143 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ V+EDG ++ DE I K+TP S +AG + +G DG ARF
Sbjct: 463 PTKLTVAEDGNIYVADEYR--IRKLTPGGVVTS----LAGDYD-NSGSADGTGTSARFGG 515
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
G+ D G+LYV+D+ N +RK+ AGV T G+ + G DG A FS +
Sbjct: 516 VAGIASDGAGSLYVSDSGNYTVRKVTLAGVVTTLAGQVGIQGSDDGTGTGATFSRVAGIT 575
Query: 197 YVRPTCSLLVIDRGNAALRQISL 219
V P+ ++ V D N +R+I++
Sbjct: 576 -VTPSGNIFVADTDNNVIRKITV 597
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 71/142 (50%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P V+ G++F VD S I KI+ + G+ G F H DG + ARF++
Sbjct: 78 PRDAVVNAAGDIFIVDSNASVIRKISNGVVSTFAGKF--GVFD----HADGTGDSARFDY 131
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+T+D GNL+V + N IRKI A V T G AG DG A+F+N D+
Sbjct: 132 PTGITIDGSGNLFVTEGNNHTIRKITPAAVVTTVAGSPGNAGTADGTGSAARFNNPEDIT 191
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+ + + D+ N +R+++
Sbjct: 192 -LAADGNFYITDKNNNMIRKMT 212
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 70/143 (48%), Gaps = 8/143 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT-GHVDGKPNEARFN 135
P I G LF V S I KITP G + + Q YT G VDG ARF+
Sbjct: 241 PTGIVGDSAGNLFVVCSSCSTIRKITPA------GVVTTFAGQAYTTGAVDGTGTAARFS 294
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+T+D NLYVAD N AIRK+ + V + G G DG A+F+
Sbjct: 295 WPVGITIDSSDNLYVADYSNSAIRKVTSSAVVSNFAGSYGDYGAVDGTGTAARFAGPAG- 353
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V + + +L V D NA++R+++
Sbjct: 354 VGIDASGNLFVTDSDNASIRKVT 376
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 70/142 (49%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I ++ DG + D+ N+ I K+TP G + + G G DG A FN+
Sbjct: 187 PEDITLAADGNFYITDKNNNMIRKMTPA------GVVTTFAGDGTYGCTDGTGAAAHFNY 240
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ D GNL+V + IRKI AGV T G++ G DG A+FS +
Sbjct: 241 PTGIVGDSAGNLFVVCSSCSTIRKITPAGVVTTFAGQAYTTGAVDGTGTAARFSWPVGIT 300
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+ + +L V D N+A+R+++
Sbjct: 301 -IDSSDNLYVADYSNSAIRKVT 321
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 73/147 (49%), Gaps = 14/147 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
P I + L+ D NS I K+T +S + AGS+ Y G VDG ARF
Sbjct: 296 PVGITIDSSDNLYVADYSNSAIRKVTSSAVVSNF------AGSYGDY-GAVDGTGTAARF 348
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF- 193
P GV +D GNL+V D+ N +IRK+ A V T+ G ++AG DG ++ + F
Sbjct: 349 AGPAGVGIDASGNLFVTDSDNASIRKVTPARVVTLVAG--SLAGDSDGSADGTGTAASFH 406
Query: 194 --DVVYVRPTCSLLVIDRGNAALRQIS 218
+ V P +L V D N +R+I+
Sbjct: 407 SPEGVAADPAGNLYVADTMNRTIRKIT 433
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 69/143 (48%), Gaps = 8/143 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G L+ D +N I KITP + + +AGS G G DG ARF++
Sbjct: 408 PEGVAADPAGNLYVADTMNRTIRKITPSGNVTT----IAGS-PGQIGSADGTGAAARFSY 462
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P +T+ + GN+YVAD IRK+ GV T G + +G DG A+F +
Sbjct: 463 PTKLTVAEDGNIYVADEYR--IRKLTPGGVVTSLAGDYDNSGSADGTGTSARFGGVAGIA 520
Query: 197 YVRPTCSLLVIDRGNAALRQISL 219
SL V D GN +R+++L
Sbjct: 521 S-DGAGSLYVSDSGNYTVRKVTL 542
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 66/142 (46%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G LF + N I KITP + VAGS G G DG + ARFN+
Sbjct: 132 PTGITIDGSGNLFVTEGNNHTIRKITPAAVVTT----VAGS-PGNAGTADGTGSAARFNN 186
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+ +T+ GN Y+ D N IRK+ AGV T G G DG A F+ +V
Sbjct: 187 PEDITLAADGNFYITDKNNNMIRKMTPAGVVTTFAGDGTY-GCTDGTGAAAHFNYPTGIV 245
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+L V+ + +R+I+
Sbjct: 246 G-DSAGNLFVVCSSCSTIRKIT 266
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 64/134 (47%), Gaps = 8/134 (5%)
Query: 86 GELFAVDEVNSNIVKIT-PPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
G L+ D N + K+T + G++ G G DG A F+ G+T+
Sbjct: 525 GSLYVSDSGNYTVRKVTLAGVVTTLAGQV------GIQGSDDGTGTGATFSRVAGITVTP 578
Query: 145 KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
GN++VADT N IRKI AGV T G + G DG +A+FS V + +L
Sbjct: 579 SGNIFVADTDNNVIRKITVAGVVTTFAGAAGQGGNDDGMGSNARFSQPH-FVATDSSGNL 637
Query: 205 LVIDRGNAALRQIS 218
V + G A +R+I+
Sbjct: 638 YVAEWGEATIRKIT 651
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 48/99 (48%), Gaps = 18/99 (18%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG---SFQGYTGHV---DGKPNEAR 133
I V+ G +F D N+ I KIT VAG +F G G DG + AR
Sbjct: 574 ITVTPSGNIFVADTDNNVIRKIT-----------VAGVVTTFAGAAGQGGNDDGMGSNAR 622
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAG 171
F+ P V D GNLYVA+ IRKI A VTTIAG
Sbjct: 623 FSQPHFVATDSSGNLYVAEWGEATIRKITSGAVVTTIAG 661
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 58/108 (53%), Gaps = 5/108 (4%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
+L+AG F + ++G + ARF P+ ++ G++++ D+ IRKI + V+T A
Sbjct: 54 QLLAGQFD-ASAMINGPLSIARFREPRDAVVNAAGDIFIVDSNASVIRKISNGVVSTFA- 111
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDV-VYVRPTCSLLVIDRGNAALRQIS 218
GK V DG + A+F D+ + + + +L V + N +R+I+
Sbjct: 112 GKFGVFDHADGTGDSARF--DYPTGITIDGSGNLFVTEGNNHTIRKIT 157
>gi|373488690|ref|ZP_09579354.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
gi|372005635|gb|EHP06271.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
Length = 438
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 68/143 (47%), Gaps = 1/143 (0%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ I V L+ D I K+ P L+AGS G +G D + A FN
Sbjct: 121 PFGIAVDSAKTLYVSDATRHTISKLVPNSDGTVTQSLLAGS-DGISGSEDKSGSSASFNS 179
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+G+ +D GNLYVAD N IRKI G T G++ +G DG A+F++ +
Sbjct: 180 PEGLALDASGNLYVADYGNHTIRKIDTLGAVTTFAGQAEASGTVDGDRLSARFNHPIGLA 239
Query: 197 YVRPTCSLLVIDRGNAALRQISL 219
+ L V D GN +R I++
Sbjct: 240 FNASYSVLYVADSGNHTIRAINI 262
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 63/131 (48%), Gaps = 6/131 (4%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
L+ D +S I KIT S G + G TG +G N ARFN P G+T+ G
Sbjct: 301 LYVTDYGSSTIRKITS-----SGGISTLAGYAGDTGTANGTGNGARFNQPVGITLHTSGY 355
Query: 148 LYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVI 207
L VAD N AIRK+ +G + G+S V+G D +A FS ++ V + V
Sbjct: 356 LLVADAYNHAIRKVSTSGSVSTLAGESGVSGNEDESGSEAHFSRPSNIC-VDSSGIAYVT 414
Query: 208 DRGNAALRQIS 218
D N +R I+
Sbjct: 415 DYKNGLIRTIT 425
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 41/90 (45%), Gaps = 5/90 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G L D N I K++ S + +AG G +G+ D +EA F+
Sbjct: 345 PVGITLHTSGYLLVADAYNHAIRKVSTSGSVST----LAGE-SGVSGNEDESGSEAHFSR 399
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
P + +D G YV D N IR I AGV
Sbjct: 400 PSNICVDSSGIAYVTDYKNGLIRTITTAGV 429
>gi|86605024|ref|YP_473787.1| NHL repeat-containing protein [Synechococcus sp. JA-3-3Ab]
gi|86553566|gb|ABC98524.1| NHL repeat domain protein [Synechococcus sp. JA-3-3Ab]
Length = 637
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 74/140 (52%), Gaps = 10/140 (7%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I V +G + D I +++P G L + G GH DG ++A+F+ P+G
Sbjct: 333 IAVDPEGFFYLADPAQHRIFRLSP------EGELEVWAGSGKAGHRDGAADQAQFDSPQG 386
Query: 140 VTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 198
+ D KG L+VAD+ N +R I VTT AG + VAG+RDG ++A+F F + +
Sbjct: 387 LLWDPKGGLWVADSGNHCLRHISRQRQVTTFAG--TCVAGYRDGERDEAQFREPFGLA-L 443
Query: 199 RPTCSLLVIDRGNAALRQIS 218
SL V DR N +R+I+
Sbjct: 444 GLDGSLYVADRANRRIRRIT 463
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 89/188 (47%), Gaps = 34/188 (18%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + + DG L+ D N I +ITP G++ + G G DG ++A+
Sbjct: 438 PFGLALGLDGSLYVADRANRRIRRITP------TGKVTTAAGTGQPGSADGPADQAQLLQ 491
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSN----D 192
P + +D +GNL++AD L R D +TT++ + AG+RDGP +A+F
Sbjct: 492 PTALAVDREGNLWIADRHRLR-RLSADGQLTTLSRAE---AGYRDGPLAEARFQTLAGLA 547
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLN--------QDDCEYQYNSIS------PTDILM 238
FD + L + DR N LR++ N QD+ +Q S P D+L+
Sbjct: 548 FDSAGI-----LWLADRDNHRLRRLQPNGQVSTLAGQDEPGWQDGPASVARFEQPGDLLV 602
Query: 239 VV-GAVLV 245
+ G+V+V
Sbjct: 603 LPDGSVVV 610
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 70/143 (48%), Gaps = 13/143 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V +G L+ D + L+ SR G+ DG EARF
Sbjct: 492 PTALAVDREGNLWIADRHRLRRLSADGQLTTLSRAE---------AGYRDGPLAEARFQT 542
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
G+ D G L++AD N +R++ G V+T+AG + G++DGP+ A+F D+
Sbjct: 543 LAGLAFDSAGILWLADRDNHRLRRLQPNGQVSTLAG--QDEPGWQDGPASVARFEQPGDL 600
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ V P S++V+D G LR++S
Sbjct: 601 L-VLPDGSVVVVDAGLPGLRRLS 622
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 62/135 (45%), Gaps = 21/135 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG--YTGHVDGKPNEARF 134
P + G L+ D N + I SR R V +F G G+ DG+ +EA+F
Sbjct: 384 PQGLLWDPKGGLWVADSGNHCLRHI-------SRQRQVT-TFAGTCVAGYRDGERDEAQF 435
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P G+ + G+LYVAD N IR+I G T A G G DGP++ A+
Sbjct: 436 REPFGLALGLDGSLYVADRANRRIRRITPTGKVTTAAGTGQ-PGSADGPADQAQL----- 489
Query: 195 VVYVRPTCSLLVIDR 209
++PT L +DR
Sbjct: 490 ---LQPTA--LAVDR 499
Score = 42.0 bits (97), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 198
G+ +D +G Y+AD I ++ G + G S AG RDG ++ A+F + +++
Sbjct: 332 GIAVDPEGFFYLADPAQHRIFRLSPEGELEVWAG-SGKAGHRDGAADQAQFDSPQGLLW- 389
Query: 199 RPTCSLLVIDRGNAALRQIS 218
P L V D GN LR IS
Sbjct: 390 DPKGGLWVADSGNHCLRHIS 409
>gi|340788705|ref|YP_004754170.1| hypothetical protein CFU_3523 [Collimonas fungivorans Ter331]
gi|340553972|gb|AEK63347.1| hypothetical protein CFU_3523 [Collimonas fungivorans Ter331]
Length = 394
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 67/142 (47%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G LF D N+ I KIT + S + ARFN
Sbjct: 58 PGAIAVDAGGNLFVADTANNTIRKITAAGEASTLAGSAGNSGSSDG-----SGSRARFNQ 112
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D GNLY+ADT N IR I AGV T G + AG DG + DA+F+ + V
Sbjct: 113 PGGIAVDAGGNLYLADTQNHTIRMITSAGVVTTIAGSAGQAGQNDGTAGDARFNQPWGVA 172
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+L V D GNA +R+I+
Sbjct: 173 R-DGAGNLYVTDTGNATVRKIT 193
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/92 (43%), Positives = 45/92 (48%), Gaps = 5/92 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G L+ D N I IT + +AGS G G DG +ARFN
Sbjct: 113 PGGIAVDAGGNLYLADTQNHTIRMITSAGVVTT----IAGS-AGQAGQNDGTAGDARFNQ 167
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
P GV D GNLYV DT N +RKI AGV T
Sbjct: 168 PWGVARDGAGNLYVTDTGNATVRKITAAGVVT 199
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 1/88 (1%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
ARFN P G+ +D GN+ V+D+ N IR + GV + G + AG DG A+F
Sbjct: 283 RARFNQPYGIALDSAGNIRVSDSGNQLIRTVSLTGVVSTLAGAAGTAGSTDGSGNKARF- 341
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQIS 218
N + + ++ V D N +R+++
Sbjct: 342 NQPEGIAADAANNIYVADTSNNLIRKVT 369
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
G DG + RF+ P + +D GNL+VADT N IRKI AG
Sbjct: 44 GSSDGSGSNVRFSQPGAIAVDAGGNLFVADTANNTIRKITAAG 86
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 52/100 (52%), Gaps = 12/100 (12%)
Query: 131 EARFNHPKGVTMDDKGN-----------LYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF 179
A+FN P+G+T+D+ GN +Y+AD+ N IRK+ G + G + +G
Sbjct: 217 RAQFNLPRGITLDNAGNSTGTAVNLAVNIYIADSNNNTIRKLDQNGNVSTLAGTAGSSGS 276
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
DG + A+F+ + + + ++ V D GN +R +SL
Sbjct: 277 ADGSGQRARFNQPYGIA-LDSAGNIRVSDSGNQLIRTVSL 315
>gi|190572588|ref|YP_001970433.1| NHL repeat-containing protein [Stenotrophomonas maltophilia K279a]
gi|190010510|emb|CAQ44119.1| putative NHL repeat protein [Stenotrophomonas maltophilia K279a]
Length = 1267
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/114 (42%), Positives = 65/114 (57%), Gaps = 10/114 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + DG ++ D ++N ++ P GR+ + QG G VDG +A FN
Sbjct: 646 PYALLRGADGSVYFTDAGDNNRIRRRLP-----DGRVETVAGQG-EGRVDGPALQASFNT 699
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 189
P G+ D +GNLYVADT N AIR+IG D VTT+AGG+ G DGP+ A+F
Sbjct: 700 PSGIAADAQGNLYVADTGNHAIRRIGTDGQVTTLAGGEQ---GHADGPAVQARF 750
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 70/145 (48%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G L+ D N I +I + +AG G GH DG +ARF+
Sbjct: 700 PSGIAADAQGNLYVADTGNHAIRRIGTDGQVTT----LAG---GEQGHADGPAVQARFDA 752
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D +G +YVADT N IR IG D V T+AGG + G DG A+F +
Sbjct: 753 PMGIAVDAQGQVYVADTFNDRIRVIGTDGMVRTLAGG--DRPGLADGVGAAARFDTPVAL 810
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
+ +LLV D N A+R++ +
Sbjct: 811 AF-DAQGALLVADLFNNAVRRVGAD 834
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 48/102 (47%), Gaps = 9/102 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G+++ D N I I R +AG + G DG ARF+
Sbjct: 753 PMGIAVDAQGQVYVADTFNDRIRVI----GTDGMVRTLAGGDR--PGLADGVGAAARFDT 806
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTI--AGGKSN 175
P + D +G L VAD N A+R++G D V+T+ AGG N
Sbjct: 807 PVALAFDAQGALLVADLFNNAVRRVGADGTVSTVVAAGGVIN 848
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 49/104 (47%), Gaps = 8/104 (7%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL--NLAIRKIGDAGVTTIAG-GKSNV 176
G+ G DG +ARF P + G++Y D N R++ D V T+AG G+ V
Sbjct: 629 GHPGDRDGASAQARFADPYALLRGADGSVYFTDAGDNNRIRRRLPDGRVETVAGQGEGRV 688
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
DGP+ A F N + +L V D GN A+R+I +
Sbjct: 689 ----DGPALQASF-NTPSGIAADAQGNLYVADTGNHAIRRIGTD 727
>gi|410694120|ref|YP_003624742.1| putative Soluble quinoprotein glucose dehydrogenase [Thiomonas sp.
3As]
gi|294340545|emb|CAZ88930.1| putative Soluble quinoprotein glucose dehydrogenase [Thiomonas sp.
3As]
Length = 366
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 78/144 (54%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P + V+ G ++ D NS + +I+ ++G + +AGS G TG DG+ +ARF
Sbjct: 172 PVGLDVNAKGVVYVADAYNSTVRRIS------AKGVVSTLAGS-PGDTGWRDGRGAQARF 224
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
N P G+T+D +G +YV++ N IRKI G T GK GF DG +A F +
Sbjct: 225 NTPVGLTLDAQGQIYVSEYFNNVIRKITPDGTVTTFAGKPGKGGFADGNVGEALFLHPQT 284
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ + P SL+V D GN +R+IS
Sbjct: 285 LSFA-PDGSLIVADTGNNRVRRIS 307
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 60/114 (52%), Gaps = 5/114 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V+ DG LF D ++ + +I S+ R +AG + G DG +ARFNH
Sbjct: 117 PECVAVATDGTLFVSDSGSNTVRRI----SREGRVSTLAGKLE-VEGFADGTGQQARFNH 171
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
P G+ ++ KG +YVAD N +R+I GV + G G+RDG A+F+
Sbjct: 172 PVGLDVNAKGVVYVADAYNSTVRRISAKGVVSTLAGSPGDTGWRDGRGAQARFN 225
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 70/125 (56%), Gaps = 8/125 (6%)
Query: 101 ITPPLSQYSRGRL-VAGSFQGYTGHVD------GKPNEARFNHPKGVTMDDKGNLYVADT 153
++P +SQ + + AG+ + + G + G +ARF+ P+G+ +D +GN++VAD
Sbjct: 19 LSPLISQTAFAAMQAAGTLEVFAGAIGPGMYNGGDFAQARFHDPRGMALDAQGNIFVADY 78
Query: 154 LNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 213
+N +RK+G G +I G+ R+GP+ A+F + + V V +L V D G+
Sbjct: 79 VNSVVRKLGTDGQVSIVAGQVEQRDARNGPALQARFYSP-ECVAVATDGTLFVSDSGSNT 137
Query: 214 LRQIS 218
+R+IS
Sbjct: 138 VRRIS 142
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 68/142 (47%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G +F D VNS + K L + +VAG + +G +ARF
Sbjct: 62 PRGMALDAQGNIFVADYVNSVVRK----LGTDGQVSIVAGQVEQRDAR-NGPALQARFYS 116
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+ V + G L+V+D+ + +R+I G + GK V GF DG + A+F++ +
Sbjct: 117 PECVAVATDGTLFVSDSGSNTVRRISREGRVSTLAGKLEVEGFADGTGQQARFNHPVG-L 175
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V + V D N+ +R+IS
Sbjct: 176 DVNAKGVVYVADAYNSTVRRIS 197
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 6/96 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G+++ + N+ I KITP + + AG G G DG EA F H
Sbjct: 227 PVGLTLDAQGQIYVSEYFNNVIRKITPDGTVTT----FAGK-PGKGGFADGNVGEALFLH 281
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
P+ ++ G+L VADT N +R+I G V+T+AG
Sbjct: 282 PQTLSFAPDGSLIVADTGNNRVRRISPQGEVSTLAG 317
>gi|373955434|ref|ZP_09615394.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
gi|373892034|gb|EHQ27931.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
Length = 459
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 72/142 (50%), Gaps = 8/142 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G ++ D N+ I KIT + G + G G++DG A+F
Sbjct: 165 PQGICTDAQGNMYVADSYNNVIRKIT------AAGVTTTYAGTGTLGYLDGPAATAQFYA 218
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
PKGV D +GN+YVAD N IRKI AGV T GK + AG+ DG DA F + +
Sbjct: 219 PKGVAADAQGNIYVADMGNNMIRKISAAGVVTTLAGKGS-AGYADGTGADAVFKSPAGLA 277
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V + ++ V D+G +R+++
Sbjct: 278 -VDASGNIYVADQGTNTIRKVT 298
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 68/126 (53%), Gaps = 9/126 (7%)
Query: 93 EVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVAD 152
+ NS IT +S + AG+ G G V+G A+F++P+G+ D +GN+YVAD
Sbjct: 128 KTNSATFAITATVSTF------AGNANG-GGFVNGTGVNAQFHNPQGICTDAQGNMYVAD 180
Query: 153 TLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNA 212
+ N IRKI AGVTT G + G+ DGP+ A+F V ++ V D GN
Sbjct: 181 SYNNVIRKITAAGVTTTYAGTGTL-GYLDGPAATAQFYAPKGVA-ADAQGNIYVADMGNN 238
Query: 213 ALRQIS 218
+R+IS
Sbjct: 239 MIRKIS 244
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 51/93 (54%), Gaps = 7/93 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARFN 135
P + V G ++ D+ + I K+T S G + + +G VD N +ARF+
Sbjct: 273 PAGLAVDASGNIYVADQGTNTIRKVT------SAGVVTTLAGAAASGQVDATTNTDARFS 326
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
P GVT+D GN+YVAD N AIRK+ AGVTT
Sbjct: 327 SPSGVTVDASGNVYVADLANHAIRKVTSAGVTT 359
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 74/144 (51%), Gaps = 11/144 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G ++ D N+ I KI+ + G + + +G G+ DG +A F
Sbjct: 219 PKGVAADAQGNIYVADMGNNMIRKIS------AAGVVTTLAGKGSAGYADGTGADAVFKS 272
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSE-DAKFSNDFD 194
P G+ +D GN+YVAD IRK+ AG VTT+AG + +G D + DA+FS+
Sbjct: 273 PAGLAVDASGNIYVADQGTNTIRKVTSAGVVTTLAGAAA--SGQVDATTNTDARFSSPSG 330
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
V V + ++ V D N A+R+++
Sbjct: 331 VT-VDASGNVYVADLANHAIRKVT 353
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 51/110 (46%), Gaps = 25/110 (22%)
Query: 64 VETVIEGNEI--GVVPY--KIRVSEDGELFAVD--------EVNSNIVKITPPLSQYSRG 111
V T I GN I VVP I V G LF D V +NI+ YS
Sbjct: 357 VTTTIIGNPILSKVVPSPSGIYVDASGNLFITDASGQVMEINVTTNII--------YS-- 406
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
+AG G +G +G A FN P+ +T+D +GN+YV D N IRKI
Sbjct: 407 --LAG-VAGTSGFANGTNINALFNGPQALTLDSQGNIYVVDYYNNMIRKI 453
>gi|386716891|ref|YP_006183217.1| NHL repeat protein [Stenotrophomonas maltophilia D457]
gi|384076453|emb|CCH11034.1| NHL repeat protein [Stenotrophomonas maltophilia D457]
Length = 652
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 75/145 (51%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + S DG ++ D ++N ++ P GR+ + QG G VDG +A FN
Sbjct: 31 PYALLRSADGSVYFTDAGDNNRIRRRLP-----DGRVETVAGQG-EGRVDGPALQASFNT 84
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D +GNLYVADT N AIR+I D VTT+AGG+ G DGP+ A+F +
Sbjct: 85 PSGIAADAQGNLYVADTGNHAIRRISTDGQVTTLAGGEQ---GHADGPAAQARFDAPMGI 141
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V + V D N +R I +
Sbjct: 142 A-VDAQGQVYVADTFNDRIRVIGTD 165
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 70/145 (48%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G L+ D N I +I S + +AG QG H DG +ARF+
Sbjct: 85 PSGIAADAQGNLYVADTGNHAIRRI----STDGQVTTLAGGEQG---HADGPAAQARFDA 137
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D +G +YVADT N IR IG D V T+AGG G DG A+F +
Sbjct: 138 PMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGGDR--PGLADGVGAAARFDTPVAL 195
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
+ +LLV D N A+R++ +
Sbjct: 196 AF-DAQGALLVADLFNNAVRRVGAD 219
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 9/102 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G+++ D N I I S R +AG + G DG ARF+
Sbjct: 138 PMGIAVDAQGQVYVADTFNDRIRVIGTDGSV----RTLAGGDR--PGLADGVGAAARFDT 191
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTI--AGGKSN 175
P + D +G L VAD N A+R++G D V+T+ AGG N
Sbjct: 192 PVALAFDAQGALLVADLFNNAVRRVGADGTVSTVVAAGGVIN 233
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 50/104 (48%), Gaps = 8/104 (7%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL--NLAIRKIGDAGVTTIAG-GKSNV 176
G+ G DG +ARF P + G++Y D N R++ D V T+AG G+ V
Sbjct: 14 GHPGDRDGASAQARFADPYALLRSADGSVYFTDAGDNNRIRRRLPDGRVETVAGQGEGRV 73
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
DGP+ A F N + +L V D GN A+R+IS +
Sbjct: 74 ----DGPALQASF-NTPSGIAADAQGNLYVADTGNHAIRRISTD 112
>gi|393778566|ref|ZP_10366837.1| Ig family protein [Ralstonia sp. PBA]
gi|392714468|gb|EIZ02071.1| Ig family protein [Ralstonia sp. PBA]
Length = 342
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 53/141 (37%), Positives = 74/141 (52%), Gaps = 13/141 (9%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV---AGSFQGYTGHVDGKPNEARFNH 136
I + G ++A + + I KITP G LV AGS G TG DG N A FN
Sbjct: 206 IALDAAGNIYATELSGNRIRKITP-------GGLVTTFAGS--GTTGSADGPGNTASFNS 256
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P GV +D GN+YVADT N +R+I AGV + G++ V G ++G A+F + VV
Sbjct: 257 PGGVALDGDGNVYVADTGNNLLRRITRAGVVSTLAGQAGVTGAQNGIGSAARFKQPYGVV 316
Query: 197 YVRPTCSLLVIDRGNAALRQI 217
V ++ V D N +R++
Sbjct: 317 -VDADGTVYVADTFNNLIRKV 336
Score = 41.6 bits (96), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 41/83 (49%), Gaps = 8/83 (9%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT-IAG 171
L AGS G G G + A F+ P G+ D GNL+V+D +RKI AG TT IAG
Sbjct: 126 LFAGS--GAAGSATGIGSAASFSLPYGIIFDKNGNLFVSDVQTGILRKITPAGETTNIAG 183
Query: 172 GKSNVAGFRDGPSEDAKFSNDFD 194
G G GP + + FD
Sbjct: 184 G-----GISSGPVDGNGAAAYFD 201
>gi|392540748|ref|ZP_10287885.1| Ig family protein [Pseudoalteromonas piscicida JCM 20779]
Length = 2384
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 67/142 (47%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G ++ D N+NI K+TP G + + G G DG A F
Sbjct: 622 PKAVTLDSSGNVYVADSSNNNIRKVTP------AGVVTTFAGSGTYGSDDGTGTAATFAA 675
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+T+D GNLYV +T +RKI AGV T G N +GF D A F+ ++
Sbjct: 676 PTGITIDSNGNLYVVETNPHIVRKITPAGVVTTFAGSKNSSGFTDATGTSATFNFPYN-G 734
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
L + DR N A+R+++
Sbjct: 735 GSNSNNDLFIADRNNHAIRKVT 756
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 56/114 (49%), Gaps = 7/114 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + +G ++ D N I KITP G + + G G DG + A FN
Sbjct: 568 PYDLVTDSNGNVYVADYGNHVIRKITP------EGVVTTLAGSGSAGSDDGTGSAASFNF 621
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
PK VT+D GN+YVAD+ N IRK+ AGV T G S G DG A F+
Sbjct: 622 PKAVTLDSSGNVYVADSSNNNIRKVTPAGVVTTFAG-SGTYGSDDGTGTAATFA 674
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 63/149 (42%), Gaps = 21/149 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + + +LF D N I K+T S + + G G +G +A FN
Sbjct: 731 PYNGGSNSNNDLFIADRNNHAIRKVT------SASVVTTFAGTGSAGSTNGTGTQASFNK 784
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF--- 193
P V D NLYV + IRKI GV T G + +G DG A+FS +
Sbjct: 785 PYDVAADSADNLYVTEQAAHTIRKITSTGVVTTYAGSAGASGNTDGLVSVARFSQPYGIA 844
Query: 194 ----DVVYVRPTCSLLVIDRGNAALRQIS 218
DVVYV D GN +R+IS
Sbjct: 845 VDSNDVVYVA--------DTGNHRIRKIS 865
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 50/98 (51%), Gaps = 4/98 (4%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR 180
+G+ DG A F P + D GN+YVAD N IRKI G VTT+AG S AG
Sbjct: 553 SGYADGTGTAASFKSPYDLVTDSNGNVYVADYGNHVIRKITPEGVVTTLAGSGS--AGSD 610
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
DG A F N V + + ++ V D N +R+++
Sbjct: 611 DGTGSAASF-NFPKAVTLDSSGNVYVADSSNNNIRKVT 647
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 68/147 (46%), Gaps = 17/147 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIV-KITPPLSQYSRGRLVAGSFQGY---TGHVDGKPNEA 132
P I + +G L+ V E N +IV KITP V +F G +G D A
Sbjct: 676 PTGITIDSNGNLYVV-ETNPHIVRKITP--------AGVVTTFAGSKNSSGFTDATGTSA 726
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSN 191
FN P + +L++AD N AIRK+ A VTT AG S AG +G A F+
Sbjct: 727 TFNFPYNGGSNSNNDLFIADRNNHAIRKVTSASVVTTFAGTGS--AGSTNGTGTQASFNK 784
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQIS 218
+DV +L V ++ +R+I+
Sbjct: 785 PYDVA-ADSADNLYVTEQAAHTIRKIT 810
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQGYTGHVDGKPNEARF 134
PY + L+ ++ I KIT S G + AGS G +G+ DG + ARF
Sbjct: 785 PYDVAADSADNLYVTEQAAHTIRKIT------STGVVTTYAGS-AGASGNTDGLVSVARF 837
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 175
+ P G+ +D +YVADT N IRKI A TT+ G +N
Sbjct: 838 SQPYGIAVDSNDVVYVADTGNHRIRKISPA-ETTLTGTPTN 877
>gi|255589882|ref|XP_002535117.1| nhl repeat-containing protein, putative [Ricinus communis]
gi|223523999|gb|EEF27264.1| nhl repeat-containing protein, putative [Ricinus communis]
Length = 435
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 71/143 (49%), Gaps = 10/143 (6%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + + G L+ D N+ I KI P G + + G G DGK A+FN
Sbjct: 121 TPSGLAIDAAGNLYVADTGNNAIRKIGP------DGTVSTLAGDGLAGDKDGKGAGAQFN 174
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P GV +D G +YV DT N IR+I D VTT+AGGK AG DG A F
Sbjct: 175 GPIGVAVDAAGVVYVTDTYNDRIRRIAPDGTVTTVAGGKR--AGMADGAGAQALFDTPTG 232
Query: 195 VVYVRPTCSLLVIDRGNAALRQI 217
+ + + +L + D GN+A+R+I
Sbjct: 233 IA-LGASGALYIADTGNSAIRKI 254
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 74/143 (51%), Gaps = 10/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + + G L+ D ++N ++ + +AG +GY +G P A FN
Sbjct: 68 PFGVALDRQGNLYVADGGDNNSIR---KIDLDGVTTTLAGGTEGY---AEGAPTAAAFNT 121
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D GNLYVADT N AIRKIG D V+T+AG +AG +DG A+F+ V
Sbjct: 122 PSGLAIDAAGNLYVADTGNNAIRKIGPDGTVSTLAG--DGLAGDKDGKGAGAQFNGPIGV 179
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V + V D N +R+I+
Sbjct: 180 A-VDAAGVVYVTDTYNDRIRRIA 201
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 51/100 (51%), Gaps = 7/100 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N I +I P + + VAG + G DG +A F+
Sbjct: 176 PIGVAVDAAGVVYVTDTYNDRIRRIAPDGTVTT----VAGGKR--AGMADGAGAQALFDT 229
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSN 175
P G+ + G LY+ADT N AIRKIG D V+T+A K +
Sbjct: 230 PTGIALGASGALYIADTGNSAIRKIGKDGTVSTVAAAKED 269
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 60/116 (51%), Gaps = 8/116 (6%)
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL-NLAIRKIG 162
P ++R +AG G G +G RF P GV +D +GNLYVAD N +IRKI
Sbjct: 37 PAGWHARITTMAG--DGLPGSSNGSGKRTRFADPFGVALDRQGNLYVADGGDNNSIRKID 94
Query: 163 DAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
GV TT+AGG G+ +G A F N + + +L V D GN A+R+I
Sbjct: 95 LDGVTTTLAGGTE---GYAEGAPTAAAF-NTPSGLAIDAAGNLYVADTGNNAIRKI 146
>gi|255531263|ref|YP_003091635.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
gi|255344247|gb|ACU03573.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
Length = 646
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 76/143 (53%), Gaps = 11/143 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G ++ D+ N+ I KIT S G + + G G DG + A+FN
Sbjct: 352 PEGLAIDASGNMYIADKNNNVIRKIT------STGTVSTFAGTGVAGFADGAGSIAKFNG 405
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P V +D GN+YVAD N IRKI AG V+T+AG + AG+ DG AKF DV
Sbjct: 406 PWKVAVDATGNVYVADRDNFKIRKITPAGIVSTLAG---STAGYADGTGSAAKFMQPLDV 462
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ P+ +++V D + +R+I+
Sbjct: 463 A-IDPSGNVIVADNTSHRIRKIT 484
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 59/100 (59%), Gaps = 4/100 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAG 178
G G V+G ARF+ P+G+ +D GN+Y+AD N IRKI G V+T AG + VAG
Sbjct: 335 GTAGLVNGAKASARFDSPEGLAIDASGNMYIADKNNNVIRKITSTGTVSTFAG--TGVAG 392
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
F DG AKF+ + V V T ++ V DR N +R+I+
Sbjct: 393 FADGAGSIAKFNGPWKVA-VDATGNVYVADRDNFKIRKIT 431
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 74/143 (51%), Gaps = 11/143 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+K+ V G ++ D N I KITP +AGS GY DG + A+F
Sbjct: 406 PWKVAVDATGNVYVADRDNFKIRKITPA----GIVSTLAGSTAGY---ADGTGSAAKFMQ 458
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P V +D GN+ VAD + IRKI AG VTTIAG + AG+ +G A+F N
Sbjct: 459 PLDVAIDPSGNVIVADNTSHRIRKITAAGVVTTIAGNGT--AGYTNGTGTAAQFKNPSG- 515
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V V + ++ V DR N +R+I+
Sbjct: 516 VDVDASGNIYVADRLNHRIRKIT 538
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 76/143 (53%), Gaps = 11/143 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D +N I KIT + G + + + G +G DG A+F+
Sbjct: 513 PSGVDVDASGNIYVADRLNHRIRKIT------TSGVVSSLAGTGTSGTTDGAAGSAKFSD 566
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P GVT+D GN+YVAD ++ IRKI V+T+AG + G+++G S AKF+ D+V
Sbjct: 567 PYGVTVDVSGNVYVADLISSRIRKISSGQVSTLAG---TIPGYQNGTSTIAKFNQPTDLV 623
Query: 197 YVRPTCSLLVIDRGNAALRQISL 219
++ + D N ++R + L
Sbjct: 624 I--QGSNIYIADHSNNSIRLVKL 644
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 74/145 (51%), Gaps = 8/145 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G + D + I KIT + G + + G G+ +G A+F +
Sbjct: 459 PLDVAIDPSGNVIVADNTSHRIRKIT------AAGVVTTIAGNGTAGYTNGTGTAAQFKN 512
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P GV +D GN+YVAD LN IRKI +GV + + + +G DG + AKFS+ + V
Sbjct: 513 PSGVDVDASGNIYVADRLNHRIRKITTSGVVS-SLAGTGTSGTTDGAAGSAKFSDPYGVT 571
Query: 197 YVRPTCSLLVIDRGNAALRQISLNQ 221
V + ++ V D ++ +R+IS Q
Sbjct: 572 -VDVSGNVYVADLISSRIRKISSGQ 595
>gi|124008176|ref|ZP_01692873.1| hypothetical protein M23134_05970 [Microscilla marina ATCC 23134]
gi|123986275|gb|EAY26097.1| hypothetical protein M23134_05970 [Microscilla marina ATCC 23134]
Length = 2548
Score = 72.8 bits (177), Expect = 4e-10, Method: Composition-based stats.
Identities = 54/141 (38%), Positives = 73/141 (51%), Gaps = 6/141 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P I + G LF D N+ I ++ S AG Q +VDG ARFN
Sbjct: 801 PTSIALDRSGNLFVADRHNNLIRQVATSSGATST---YAGDISQTNALYVDGAAASARFN 857
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
HP G+T+D G++YVADT N IRKI + VTTIA G +N G +G ++ AKF N
Sbjct: 858 HPTGITVDMVGDVYVADTRNQVIRKISEGQVTTIA-GIANDRGLTNGTAQAAKF-NYPGS 915
Query: 196 VYVRPTCSLLVIDRGNAALRQ 216
VY ++ V D+ N +R+
Sbjct: 916 VYADLEQNIYVGDKVNQLVRR 936
Score = 67.4 bits (163), Expect = 2e-08, Method: Composition-based stats.
Identities = 64/160 (40%), Positives = 81/160 (50%), Gaps = 14/160 (8%)
Query: 70 GNEIGV-VPYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV-D 126
GNE P + S DG L+ DE N I KI S S +VAGS G +G V D
Sbjct: 618 GNETTSDAPAGMVKSPDGVFLYVSDEKNHVIKKIRVSDSTVS---IVAGSV-GASGLVND 673
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI----GDAGVTTIAGGKSNVAGFRDG 182
ARFNHP G+ +DD G LYVAD N IR I G A VTT+AG +G G
Sbjct: 674 PVGTNARFNHPSGLAIDDAGILYVADKDNHVIRAIANPDGAATVTTVAG--DGTSGDAIG 731
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
S A+F DV V + +L V D+ N ++++ LN +
Sbjct: 732 ASTSARFREPSDVA-VDFSGNLYVADKNNHKIKKVDLNTN 770
Score = 52.0 bits (123), Expect = 7e-04, Method: Composition-based stats.
Identities = 46/150 (30%), Positives = 68/150 (45%), Gaps = 8/150 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYS--RGRLVAGSFQGYTGHVDGKPNEARF 134
P + V G L+ D+ N I K+ + S G V F G DG + ARF
Sbjct: 741 PSDVAVDFSGNLYVADKNNHKIKKVDLNTNTVSILSGPAVGTVFPA--GATDGTASIARF 798
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDA--GVTTIAGGKSNV-AGFRDGPSEDAKFSN 191
P + +D GNL+VAD N IR++ + +T AG S A + DG + A+F++
Sbjct: 799 FFPTSIALDRSGNLFVADRHNNLIRQVATSSGATSTYAGDISQTNALYVDGAAASARFNH 858
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
+ V + V D N +R+IS Q
Sbjct: 859 PTGIT-VDMVGDVYVADTRNQVIRKISEGQ 887
>gi|344924046|ref|ZP_08777507.1| NHL repeat containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 368
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 73/143 (51%), Gaps = 11/143 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + V G L+ + N I KIT L+AGS QGY +G + ARF+
Sbjct: 112 PYGVAVDSSGTLYVSEYTNHRIRKITSAGVT----SLLAGSAQGY---AEGTGSGARFDR 164
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P V +D G +YVAD N IR+I AGVT T+AG S+ G+ +G A+F DV
Sbjct: 165 PYSVAVDSSGTVYVADFFNSRIRRITSAGVTSTLAG--SSTGGYLEGTGGAAQFGTPIDV 222
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V + ++ V D +R+I+
Sbjct: 223 A-VDSSGTVYVTDTYTQRVRKIT 244
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 71/142 (50%), Gaps = 8/142 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + V G ++ D NS I +IT +AGS G G+++G A+F
Sbjct: 165 PYSVAVDSSGTVYVADFFNSRIRRITSAGVT----STLAGSSTG--GYLEGTGGAAQFGT 218
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P V +D G +YV DT +RKI GVT++ G SN G+ +G A+FS+ + +
Sbjct: 219 PIDVAVDSSGTVYVTDTYTQRVRKITSGGVTSLLAG-SNTIGYAEGTGASARFSSPYGIA 277
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V + + V D N +R+I+
Sbjct: 278 -VDSSGTAYVADSDNHRIRKIT 298
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 71/142 (50%), Gaps = 9/142 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I V G ++ D N I I+ + + AGS G +G A+FN+
Sbjct: 58 PYGIAVHSSGTIYVADSANHRIRSISSAGTT----SVFAGSGT--AGTTEGTGASAQFNN 111
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P GV +D G LYV++ N IRKI AGVT++ G + G+ +G A+F + V
Sbjct: 112 PYGVAVDSSGTLYVSEYTNHRIRKITSAGVTSLLAGSAQ--GYAEGTGSGARFDRPYSVA 169
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V + ++ V D N+ +R+I+
Sbjct: 170 -VDSSGTVYVADFFNSRIRRIT 190
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 58/105 (55%), Gaps = 5/105 (4%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK 173
+AGS GY +G ARFN+P G+ + G +YVAD+ N IR I AG T++ G
Sbjct: 38 LAGSTGGY---AEGTGASARFNYPYGIAVHSSGTIYVADSANHRIRSISSAGTTSVFAG- 93
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
S AG +G A+F+N + V V + +L V + N +R+I+
Sbjct: 94 SGTAGTTEGTGASAQFNNPYGVA-VDSSGTLYVSEYTNHRIRKIT 137
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 44/93 (47%), Gaps = 6/93 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D + KIT L+AGS G+ +G ARF+
Sbjct: 219 PIDVAVDSSGTVYVTDTYTQRVRKITSGGVT----SLLAGSNT--IGYAEGTGASARFSS 272
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTI 169
P G+ +D G YVAD+ N IRKI G T++
Sbjct: 273 PYGIAVDSSGTAYVADSDNHRIRKITSGGTTSL 305
>gi|254522386|ref|ZP_05134441.1| NHL repeat containing protein [Stenotrophomonas sp. SKA14]
gi|219719977|gb|EED38502.1| NHL repeat containing protein [Stenotrophomonas sp. SKA14]
Length = 643
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 73/145 (50%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + DG ++ D ++N ++ P R VAG +G +DG +A FN
Sbjct: 22 PYALLRGADGSVYFTDAGDNNRIRRRLP---DGRVETVAGQGEGL---IDGPALQASFNT 75
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D +GNLYVADT N AIR+IG D VTT+AGG V G DGP A+F +
Sbjct: 76 PSGIAADAQGNLYVADTGNHAIRRIGIDGQVTTLAGG---VQGHGDGPVAQARFDAPMGI 132
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V + V D N +R I +
Sbjct: 133 A-VDAQGQVYVADTFNDRIRVIGTD 156
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 70/143 (48%), Gaps = 11/143 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G L+ D N I +I + +AG QG H DG +ARF+
Sbjct: 76 PSGIAADAQGNLYVADTGNHAIRRIGIDGQVTT----LAGGVQG---HGDGPVAQARFDA 128
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D +G +YVADT N IR IG D V T+AGG+ G DG + A+F +
Sbjct: 129 PMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGGER--PGLADGAGDAARFDTPVAL 186
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ +LLV D N A+R++
Sbjct: 187 AF-DAQGALLVADLFNNAVRRVG 208
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 50/102 (49%), Gaps = 9/102 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G+++ D N I I S R +AG + G DG + ARF+
Sbjct: 129 PMGIAVDAQGQVYVADTFNDRIRVIGTDGSV----RTLAGGER--PGLADGAGDAARFDT 182
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTI--AGGKSN 175
P + D +G L VAD N A+R++G D V+T+ AGG N
Sbjct: 183 PVALAFDAQGALLVADLFNNAVRRVGPDGTVSTVVAAGGVIN 224
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 48/103 (46%), Gaps = 6/103 (5%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL--NLAIRKIGDAGVTTIAGGKSNVA 177
G+ G DG +ARF P + G++Y D N R++ D V T+AG
Sbjct: 5 GHPGDRDGASAQARFADPYALLRGADGSVYFTDAGDNNRIRRRLPDGRVETVAGQGE--- 61
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
G DGP+ A F N + +L V D GN A+R+I ++
Sbjct: 62 GLIDGPALQASF-NTPSGIAADAQGNLYVADTGNHAIRRIGID 103
>gi|223934991|ref|ZP_03626910.1| NHL repeat containing protein [bacterium Ellin514]
gi|223896444|gb|EEF62886.1| NHL repeat containing protein [bacterium Ellin514]
Length = 1064
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 79/147 (53%), Gaps = 13/147 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G +F D N+ I KITP + + +AG+F+ + G DG + ARF
Sbjct: 512 PQGLAVDGTGNVFVADTFNNLIRKITPGGAVTT----LAGNFENF-GSSDGTNSNARFYW 566
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR---DGPSEDAKFSNDF 193
P GV +D+ GN++VAD +N IR++ +G I + +AGF DG + A+F F
Sbjct: 567 PSGVAVDNAGNVFVADYMNHTIRELIPSGTNWIVNTVAGLAGFWGSIDGTNTSARF---F 623
Query: 194 D--VVYVRPTCSLLVIDRGNAALRQIS 218
+ V + +L V D GN A+R+I+
Sbjct: 624 QPRSLSVDASGALYVADSGNHAIRKIT 650
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/142 (34%), Positives = 68/142 (47%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + +F D N+ I KITP +AGS G G++D A F+
Sbjct: 183 PTGIAIDNFNNIFVADTGNNTIRKITPS----GNVNTMAGS-AGVYGNLDNSGANALFSG 237
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+G+T+D GNLYV DT N IRKI +GV T G + G +G +A F +
Sbjct: 238 PQGLTVDSSGNLYVVDTGNGTIRKITSSGVVTTFAGSAGNYGATNGIGANALFYAPQGIT 297
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
C + V D GN +R+I+
Sbjct: 298 IDLFGC-VYVADTGNHTIRKIT 318
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 73/142 (51%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V ++ D N I KI+P + +AGS G+ G V+ A F+
Sbjct: 402 PQAVAVDAATNVYVADTANQTIRKISPSGLVCT----LAGSI-GHPGSVNNIGTNALFSG 456
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+G+T+D GN+YVADTLN IR+I G T G + V+G +G + DA+F +
Sbjct: 457 PQGITVDGVGNIYVADTLNHIIRRITPDGAATTFAGSAGVSGTANGTNTDAQFYAPQGLA 516
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V T ++ V D N +R+I+
Sbjct: 517 -VDGTGNVFVADTFNNLIRKIT 537
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 64/118 (54%), Gaps = 7/118 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G +F D +N I ++ P + + VAG G+ G +DG ARF
Sbjct: 567 PSGVAVDNAGNVFVADYMNHTIRELIPSGTNWIV-NTVAG-LAGFWGSIDGTNTSARFFQ 624
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFS 190
P+ +++D G LYVAD+ N AIRKI +G VTT+A G + AG DG +A+FS
Sbjct: 625 PRSLSVDASGALYVADSGNHAIRKITPSGTNWVVTTVA-GLAGAAGSVDGTGINAEFS 681
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 72/142 (50%), Gaps = 8/142 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G ++ D +N I +ITP AGS G +G +G +A+F
Sbjct: 457 PQGITVDGVGNIYVADTLNHIIRRITPD----GAATTFAGS-AGVSGTANGTNTDAQFYA 511
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+G+ +D GN++VADT N IRKI G VTT+AG N G DG + +A+F V
Sbjct: 512 PQGLAVDGTGNVFVADTFNNLIRKITPGGAVTTLAGNFENF-GSSDGTNSNARFYWPSGV 570
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
V ++ V D N +R++
Sbjct: 571 A-VDNAGNVFVADYMNHTIREL 591
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 74/153 (48%), Gaps = 11/153 (7%)
Query: 68 IEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG 127
I N + P I + G ++ D N I KIT + + L AG++ G D
Sbjct: 284 IGANALFYAPQGITIDLFGCVYVADTGNHTIRKITSDGTVTTLAGL-AGNY----GSADS 338
Query: 128 KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSED 186
+ A F +P+G+T D GNLY+ADT N IR I G VTT AG S G DG S D
Sbjct: 339 VNSSASFWNPQGITSDATGNLYIADTGNNTIRTITPGGSVTTFAGLPS--IGSADGLSSD 396
Query: 187 AKFSNDF-DVVYVRPTCSLLVIDRGNAALRQIS 218
A+F F V V ++ V D N +R+IS
Sbjct: 397 ARFR--FPQAVAVDAATNVYVADTANQTIRKIS 427
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 70/145 (48%), Gaps = 12/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQGYTGHVDGKPNEARF 134
P + V G L+ VD N I KIT S G + AGS Y G +G A F
Sbjct: 238 PQGLTVDSSGNLYVVDTGNGTIRKIT------SSGVVTTFAGSAGNY-GATNGIGANALF 290
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P+G+T+D G +YVADT N IRKI D VTT+AG N G D + A F N
Sbjct: 291 YAPQGITIDLFGCVYVADTGNHTIRKITSDGTVTTLAGLAGNY-GSADSVNSSASFWNPQ 349
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
+ T +L + D GN +R I+
Sbjct: 350 GIT-SDATGNLYIADTGNNTIRTIT 373
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 67/144 (46%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQGYTGHVDGKPNEARF 134
P I V G + D N+ + KIT + G + AGS Y G D A F
Sbjct: 128 PAGIAVDASGNVLVADTGNNTVRKIT------ATGDVTTFAGSAGNY-GSTDNLGTNALF 180
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P G+ +D+ N++VADT N IRKI +G G + V G D +A FS
Sbjct: 181 YRPTGIAIDNFNNIFVADTGNNTIRKITPSGNVNTMAGSAGVYGNLDNSGANALFSGPQG 240
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ V + +L V+D GN +R+I+
Sbjct: 241 LT-VDSSGNLYVVDTGNGTIRKIT 263
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 53/108 (49%), Gaps = 3/108 (2%)
Query: 117 SFQGYTGH--VDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS 174
+ GY G +G + +F P+ + +D N++VADT N IRKI G+ T G
Sbjct: 51 TLAGYAGRGSANGISSSVQFKAPQAIAIDISNNVFVADTENHVIRKISCTGIITTLAGSL 110
Query: 175 NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
G RDG +A F + V + ++LV D GN +R+I+ D
Sbjct: 111 GTHGSRDGSGTNALFFRPAGIA-VDASGNVLVADTGNNTVRKITATGD 157
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 48/96 (50%), Gaps = 7/96 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G L+ D N+ I ITP S + L + G DG ++ARF
Sbjct: 348 PQGITSDATGNLYIADTGNNTIRTITPGGSVTTFAGLPS------IGSADGLSSDARFRF 401
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
P+ V +D N+YVADT N IRKI +G V T+AG
Sbjct: 402 PQAVAVDAATNVYVADTANQTIRKISPSGLVCTLAG 437
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 42/83 (50%), Gaps = 2/83 (2%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G L+ D N I KITP + + VAG G G VDG A F+H
Sbjct: 625 PRSLSVDASGALYVADSGNHAIRKITPSGTNWVV-TTVAG-LAGAAGSVDGTGINAEFSH 682
Query: 137 PKGVTMDDKGNLYVADTLNLAIR 159
P G+++ G +YVAD+ N IR
Sbjct: 683 PAGISLTSAGIVYVADSDNNTIR 705
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 64/150 (42%), Gaps = 22/150 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + +F D N I KI S +AGS G G DG A F
Sbjct: 73 PQAIAIDISNNVFVADTENHVIRKI----SCTGIITTLAGSL-GTHGSRDGSGTNALFFR 127
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D GN+ VADT N +RKI G VTT AG N G D +A F
Sbjct: 128 PAGIAVDASGNVLVADTGNNTVRKITATGDVTTFAGSAGNY-GSTDNLGTNALF------ 180
Query: 196 VYVRPTC-------SLLVIDRGNAALRQIS 218
RPT ++ V D GN +R+I+
Sbjct: 181 --YRPTGIAIDNFNNIFVADTGNNTIRKIT 208
>gi|223935789|ref|ZP_03627704.1| NHL repeat containing protein [bacterium Ellin514]
gi|223895390|gb|EEF61836.1| NHL repeat containing protein [bacterium Ellin514]
Length = 755
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 49/142 (34%), Positives = 68/142 (47%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V DG ++ D N I ++ PP + +AGS Y G +G + A+F
Sbjct: 210 PQGIAVGSDGNIYVADTGNGTI-RVIPPGGSVTT---LAGSPGNY-GSTNGTGSAAQFYQ 264
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P GV + G +YVAD LN IR + GV T G + G +DG +A+F V
Sbjct: 265 PMGVAVAANGTVYVADNLNHTIRAVTSGGVVTTLAGLAGNYGSKDGTGSNARFYAPQGVA 324
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
T + V+D GN +RQIS
Sbjct: 325 VSGST--VFVVDTGNGTIRQIS 344
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 80/179 (44%), Gaps = 17/179 (9%)
Query: 51 ADGNVLQFENGYLVETVIEGNEIGVV-----------PYKIRVSEDGELFAVDEVNSNIV 99
+G + Q +G V T+ IG P V G +F D N I
Sbjct: 336 GNGTIRQISSGGAVTTLAGSASIGNADGTGGSAKFYWPKGTAVDASGNVFVSDTFNHTIR 395
Query: 100 KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR 159
KIT + + +AG+ G +G +G A+F P+G+ +D GN YVADT N IR
Sbjct: 396 KITAAGTVST----LAGT-AGSSGTNNGVGGGAQFYAPQGIAVDTGGNAYVADTANNVIR 450
Query: 160 KIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
K+ G T G + V G DG +A+FS V + ++ V D GN +R+IS
Sbjct: 451 KVTSGGTVTTLAGTAGVEGQGDGTGSNAQFSGP-QAVALDGAANVYVSDTGNHTIRKIS 508
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 74/145 (51%), Gaps = 10/145 (6%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P + V + G L+ D N I KI+ + G G +G +GK + ARFN
Sbjct: 50 LPGGVAVDKTGNLYVADTANHTIRKISGGVVSTFAG------LAGVSGSANGKGSAARFN 103
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P+GV +D G +YVADT N IRKI D V+T+AG N G + +A+F + +
Sbjct: 104 QPQGVAVDTNGIVYVADTGNHIIRKIALDGTVSTLAGAAGN-PGTLNATGTNAQFY-EPE 161
Query: 195 VVYVRPTCSLL-VIDRGNAALRQIS 218
V V SL+ V D N +RQ++
Sbjct: 162 AVAVNGNGSLIYVADTWNHEIRQVT 186
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/98 (43%), Positives = 55/98 (56%), Gaps = 2/98 (2%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G DG + ARFN P GV +D GNLYVADT N IRKI V+T A G + V+G +G
Sbjct: 37 GSADGNNSSARFNLPGGVAVDKTGNLYVADTANHTIRKISGGVVSTFA-GLAGVSGSANG 95
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
A+F N V V + V D GN +R+I+L+
Sbjct: 96 KGSAARF-NQPQGVAVDTNGIVYVADTGNHIIRKIALD 132
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 73/152 (48%), Gaps = 23/152 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGH--------VDGK 128
P + + ++ D N I KI+P G V +F G+ GH +
Sbjct: 483 PQAVALDGAANVYVSDTGNHTIRKISP-------GGAVT-TFAGFPGHPGNLDSNMDNNG 534
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDA 187
N ARF P G+ +D GN+YVADT N IRKI D V+T+A G V G DG + DA
Sbjct: 535 TNTARFYSPSGLAVDSSGNVYVADTGNHTIRKITADGSVSTLA-GLPGVWGNADGTNRDA 593
Query: 188 KFSNDF--DVVYVRPTCSLLVIDRGNAALRQI 217
+F F + + + +L V+D GN +R +
Sbjct: 594 RF---FQPEGISIDSQGNLFVMDSGNHTMRML 622
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 60/117 (51%), Gaps = 10/117 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N I KIT S + L G G+ DG +ARF
Sbjct: 543 PSGLAVDSSGNVYVADTGNHTIRKITADGSVSTLAGL-----PGVWGNADGTNRDARFFQ 597
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKF 189
P+G+++D +GNL+V D+ N +R + +G VTTIA G+ ++ G DG A+F
Sbjct: 598 PEGISIDSQGNLFVMDSGNHTMRMLIASGTNWIVTTIA-GQPDLGGAADGTGNGAQF 653
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 68/152 (44%), Gaps = 21/152 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G + D N+ I K+T + + +AG+ G G DG + A+F+
Sbjct: 428 PQGIAVDTGGNAYVADTANNVIRKVTSGGTVTT----LAGT-AGVEGQGDGTGSNAQFSG 482
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSE-DAKFSNDF-- 193
P+ V +D N+YV+DT N IRKI G T AGF P D+ N+
Sbjct: 483 PQAVALDGAANVYVSDTGNHTIRKISPGGAVTT------FAGFPGHPGNLDSNMDNNGTN 536
Query: 194 -------DVVYVRPTCSLLVIDRGNAALRQIS 218
+ V + ++ V D GN +R+I+
Sbjct: 537 TARFYSPSGLAVDSSGNVYVADTGNHTIRKIT 568
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 43/86 (50%), Gaps = 3/86 (3%)
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSND 192
F P+G+ + GN+YVADT N IR I G VTT+AG N G +G A+F
Sbjct: 207 FYQPQGIAVGSDGNIYVADTGNGTIRVIPPGGSVTTLAGSPGNY-GSTNGTGSAAQFYQP 265
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
V V ++ V D N +R ++
Sbjct: 266 MGVA-VAANGTVYVADNLNHTIRAVT 290
>gi|344205849|ref|YP_004790990.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
gi|343777211|gb|AEM49764.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
Length = 693
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 74/145 (51%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + DG ++ D ++N ++ P GR+ + QG G VDG +A FN
Sbjct: 72 PYALLRGADGSIYFTDAGDNNRIRRRLP-----DGRVETVAGQG-EGRVDGPALQASFNT 125
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D +GNLYVADT N AIR+I D VTT+AGG+ G DGP+ A+F +
Sbjct: 126 PSGIAADVQGNLYVADTGNHAIRRISTDGQVTTLAGGEQ---GHADGPAAQARFDAPMGI 182
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V + V D N +R I +
Sbjct: 183 A-VDAQGQVYVADTFNDRIRVIGTD 206
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 71/145 (48%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G L+ D N I +I S + +AG QG H DG +ARF+
Sbjct: 126 PSGIAADVQGNLYVADTGNHAIRRI----STDGQVTTLAGGEQG---HADGPAAQARFDA 178
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D +G +YVADT N IR IG D V T+AGG GF DG A+F +
Sbjct: 179 PMGIAVDAQGQVYVADTFNDRIRVIGTDGSVRTLAGGDR--PGFADGVGTAARFDTPVAL 236
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
+ +LLV D N A+R++ +
Sbjct: 237 AF-DAHGALLVADLFNNAVRRVGAD 260
Score = 45.1 bits (105), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 48/103 (46%), Gaps = 9/103 (8%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P I V G+++ D N I I S R +AG + G DG ARF+
Sbjct: 178 APMGIAVDAQGQVYVADTFNDRIRVIGTDGSV----RTLAGGDR--PGFADGVGTAARFD 231
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTI--AGGKSN 175
P + D G L VAD N A+R++G D V+T+ AGG N
Sbjct: 232 TPVALAFDAHGALLVADLFNNAVRRVGADGTVSTVVAAGGVIN 274
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 58/120 (48%), Gaps = 10/120 (8%)
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL--NLAIRKI 161
PL+ ++ +AG G+ G DG +ARF P + G++Y D N R++
Sbjct: 41 PLAWTAQIEPLAGD--GHPGDRDGGSAQARFADPYALLRGADGSIYFTDAGDNNRIRRRL 98
Query: 162 GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
D V T+AG G+ V DGP+ A F N + +L V D GN A+R+IS +
Sbjct: 99 PDGRVETVAGQGEGRV----DGPALQASF-NTPSGIAADVQGNLYVADTGNHAIRRISTD 153
>gi|110639161|ref|YP_679370.1| hypothetical protein CHU_2779 [Cytophaga hutchinsonii ATCC 33406]
gi|110281842|gb|ABG60028.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 1051
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 73/143 (51%), Gaps = 10/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + + G L+ D N+ I KIT S G + + G+VDG ARF
Sbjct: 165 PYGMAIDASGNLYVADSGNNLIRKIT------SSGVVTTIAGNTTPGYVDGTGTAARFYL 218
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P +T+D GN ++ D N IRK+ AG V+T+AG S AG+ +G A+F+ + +
Sbjct: 219 PVNITLDVSGNFFITDNRNHRIRKMTSAGVVSTVAGSGS--AGYMNGTGVTAQFNRPYGI 276
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V V +L V D N +R+I+
Sbjct: 277 V-VDAFSNLYVTDTNNGVIRKIT 298
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 61/116 (52%), Gaps = 11/116 (9%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEAR 133
+P I + G F D N I K+T S G + VAGS G G+++G A+
Sbjct: 218 LPVNITLDVSGNFFITDNRNHRIRKMT------SAGVVSTVAGS--GSAGYMNGTGVTAQ 269
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
FN P G+ +D NLYV DT N IRKI +GV + G + GF DGP+ +A+F
Sbjct: 270 FNRPYGIVVDAFSNLYVTDTNNGVIRKITSSGVVSTYAG-TGTPGFADGPAANAQF 324
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 60/115 (52%), Gaps = 9/115 (7%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P + + ++ D N I KITP G ++ + G G DG A+FN
Sbjct: 110 LPAAVVLDAAQNIYIADNGNHCIRKITPA------GVVITFAGSGTAGSNDGTGTAAQFN 163
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 189
+P G+ +D GNLYVAD+ N IRKI +G VTTIAG + G+ DG A+F
Sbjct: 164 NPYGMAIDASGNLYVADSGNNLIRKITSSGVVTTIAGNTT--PGYVDGTGTAARF 216
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 56/101 (55%), Gaps = 4/101 (3%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVA 177
G +G +D + ARFN P V +D N+Y+AD N IRKI AGV T AG S A
Sbjct: 93 NGTSGLIDATGSAARFNLPAAVVLDAAQNIYIADNGNHCIRKITPAGVVITFAG--SGTA 150
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G DG A+F+N + + + + +L V D GN +R+I+
Sbjct: 151 GSNDGTGTAAQFNNPYGMA-IDASGNLYVADSGNNLIRKIT 190
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 55/120 (45%), Gaps = 9/120 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I V L+ D N I KIT S G + + G G DG A+F
Sbjct: 273 PYGIVVDAFSNLYVTDTNNGVIRKIT------SSGVVSTYAGTGTPGFADGPAANAQFQW 326
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGP--SEDAKFSNDFD 194
P G+T++ G+LY AD +RK+ AG+ + G N AGF + SE A + FD
Sbjct: 327 PTGLTINASGDLYEADNETHRVRKVTPAGIVSTFAGNGN-AGFANTAALSEYAVSNGAFD 385
Score = 41.6 bits (96), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 48/90 (53%), Gaps = 4/90 (4%)
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAK 188
N A F G+ D+ GNL VA + +IRK+ +G V+TIAG + +G D A+
Sbjct: 50 NTAEFYGLSGLAFDNSGNLIVAGYQDHSIRKVNPSGIVSTIAG--NGTSGLIDATGSAAR 107
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
F+ VV + ++ + D GN +R+I+
Sbjct: 108 FNLPAAVV-LDAAQNIYIADNGNHCIRKIT 136
>gi|223939820|ref|ZP_03631690.1| NHL repeat containing protein [bacterium Ellin514]
gi|223891508|gb|EEF57999.1| NHL repeat containing protein [bacterium Ellin514]
Length = 805
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 73/143 (51%), Gaps = 8/143 (5%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I + + G L D NS I KI P + Y L F G DG ARFN P+G
Sbjct: 356 ISIDKAGNLLVADTQNSEIRKIAPVGTNYVVTTLAG--FAQNVGGADGTNAVARFNSPRG 413
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
+ +D GN++VAD N IRK+ G VTTIA G++ +A + DG +A F+ +
Sbjct: 414 IAVDAAGNVFVADQNNNTIRKLTPVGTNWAVTTIA-GQAGMAYYGDGNGTNAYFNYPAGI 472
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V + +L V D GN +R+++
Sbjct: 473 A-VDASGNLFVTDAGNHVIRKLT 494
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 73/146 (50%), Gaps = 8/146 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + V +G +F D N I K+TP + Y + + + DG N ARF
Sbjct: 295 PHAVAVDTNGNVFVADSENYTIRKLTPSGTNYVVSTPIGQTRA--SNSTDGTNNAARFWF 352
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSND 192
G+++D GNL VADT N IRKI G VTT+AG NV G DG + A+F N
Sbjct: 353 LLGISIDKAGNLLVADTQNSEIRKIAPVGTNYVVTTLAGFAQNVGG-ADGTNAVARF-NS 410
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
+ V ++ V D+ N +R+++
Sbjct: 411 PRGIAVDAAGNVFVADQNNNTIRKLT 436
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 73/146 (50%), Gaps = 7/146 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G +F D+ N+ I K+TP + ++ + + Y G DG A FN+
Sbjct: 411 PRGIAVDAAGNVFVADQNNNTIRKLTPVGTNWAVTTIAGQAGMAYYG--DGNGTNAYFNY 468
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSND 192
P G+ +D GNL+V D N IRK+ VTTIAG + AG DG + +A+F
Sbjct: 469 PAGIAVDASGNLFVTDAGNHVIRKLTPTATNYLVTTIAGSAAAQAGSTDGTNANARFF-I 527
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
D + V +L V D N +R+I+
Sbjct: 528 VDGITVDAAGNLFVADNNNCLIRKIA 553
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 74/146 (50%), Gaps = 8/146 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G +F D N I KI P + + +AG GY G DG +A F +
Sbjct: 121 PAAVAVDNSGNVFVADSANYTIRKIAPLGTNWVV-TTIAG-VPGYHGSSDGTNTDALFFY 178
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSND 192
P+G+ D G+L+V D+ N +IRK+ G VTTIA G S V+G DG + A F+
Sbjct: 179 PEGLAFDSSGHLFVGDSSNNSIRKMTPVGTNWVVTTIA-GSSPVSGSNDGTNGFAHFNQP 237
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
+ V S+ V D N+ +R+I+
Sbjct: 238 CGLA-VDAAGSIFVADYFNSTIRKIT 262
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 52/146 (35%), Positives = 73/146 (50%), Gaps = 8/146 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G LF D N++I K+TP + + +AGS +G DG A FN
Sbjct: 179 PEGLAFDSSGHLFVGDSSNNSIRKMTPVGTNWVV-TTIAGS-SPVSGSNDGTNGFAHFNQ 236
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSND 192
P G+ +D G+++VAD N IRKI AG VTTIA GK VA +G +A F N
Sbjct: 237 PCGLAVDAAGSIFVADYFNSTIRKITSAGTNWLVTTIA-GKVGVADSAEGTGTNAVF-NY 294
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
V V ++ V D N +R+++
Sbjct: 295 PHAVAVDTNGNVFVADSENYTIRKLT 320
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 55/117 (47%), Gaps = 4/117 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G LF D N I K+TP + Y +AGS G DG ARF
Sbjct: 469 PAGIAVDASGNLFVTDAGNHVIRKLTPTATNYLV-TTIAGSAAAQAGSTDGTNANARFFI 527
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRDGPSEDAKFS 190
G+T+D GNL+VAD N IRKI G +TT GK N F DG + F+
Sbjct: 528 VDGITVDAAGNLFVADNNNCLIRKIAPVGTNWITTTIAGKLNSYDFADGVGTNILFN 584
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 69/145 (47%), Gaps = 6/145 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G +F D NS I KIT + + +AG G +G A FN+
Sbjct: 237 PCGLAVDAAGSIFVADYFNSTIRKITSAGTNWLV-TTIAGKV-GVADSAEGTGTNAVFNY 294
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG---GKSNVAGFRDGPSEDAKFSNDF 193
P V +D GN++VAD+ N IRK+ +G + G++ + DG + A+F
Sbjct: 295 PHAVAVDTNGNVFVADSENYTIRKLTPSGTNYVVSTPIGQTRASNSTDGTNNAARFWFLL 354
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
+ + +LLV D N+ +R+I+
Sbjct: 355 G-ISIDKAGNLLVADTQNSEIRKIA 378
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 66/129 (51%), Gaps = 8/129 (6%)
Query: 92 DEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVA 151
D N+ + K+TP + Y + A F G +DG ++ARF +P G+ +D LYV
Sbjct: 601 DMGNNMVRKLTPNGTNYVSSTVAA--FPQAYGFMDGTNSDARFAYPTGIAIDTNDTLYVT 658
Query: 152 DTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVI 207
D N IRK+ G VTT+AG ++ G DG A F+ F + + T +L V
Sbjct: 659 DQGNNTIRKVTPLGTNWMVTTLAGIHAST-GSADGAGSAALFNGPFGIA-IDKTGNLFVA 716
Query: 208 DRGNAALRQ 216
D N+++R+
Sbjct: 717 DLQNSSIRK 725
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 2/84 (2%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + + L+ D+ N+ I K+TP + + L TG DG + A FN
Sbjct: 644 PTGIAIDTNDTLYVTDQGNNTIRKVTPLGTNWMVTTLAG--IHASTGSADGAGSAALFNG 701
Query: 137 PKGVTMDDKGNLYVADTLNLAIRK 160
P G+ +D GNL+VAD N +IRK
Sbjct: 702 PFGIAIDKTGNLFVADLQNSSIRK 725
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 52/100 (52%), Gaps = 6/100 (6%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAG 178
G DG ARFN+P+ D GN+YVAD+ N IRK+ G VTTIAG N G
Sbjct: 49 GSNDGTNRVARFNNPEATASDSAGNIYVADSGNHVIRKMAAVGTNWVVTTIAGLPGN-PG 107
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
DG + A+F V V + ++ V D N +R+I+
Sbjct: 108 SADGTNSAAQFYYP-AAVAVDNSGNVFVADSANYTIRKIA 146
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 13/111 (11%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G ++ D N I K+ + + +AG G G DG + A+F +P V +D+
Sbjct: 72 GNIYVADSGNHVIRKMAAVGTNWVV-TTIAG-LPGNPGSADGTNSAAQFYYPAAVAVDNS 129
Query: 146 GNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFR---DGPSEDAKF 189
GN++VAD+ N IRKI G VTTIAG V G+ DG + DA F
Sbjct: 130 GNVFVADSANYTIRKIAPLGTNWVVTTIAG----VPGYHGSSDGTNTDALF 176
Score = 45.8 bits (107), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 12/145 (8%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I V G LF D N I KI P + + +AG Y DG FN P G
Sbjct: 531 ITVDAAGNLFVADNNNCLIRKIAPVGTNWIT-TTIAGKLNSYD-FADGVGTNILFNQPTG 588
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA------GFRDGPSEDAKFSNDF 193
+ + G +YV D N +RK+ G ++ S VA GF DG + DA+F+
Sbjct: 589 IAVGKGGVVYVVDMGNNMVRKLTPNGTNYVS---STVAAFPQAYGFMDGTNSDARFAYPT 645
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
+ + +L V D+GN +R+++
Sbjct: 646 GIA-IDTNDTLYVTDQGNNTIRKVT 669
>gi|380692993|ref|ZP_09857852.1| cell surface protein [Bacteroides faecis MAJ27]
Length = 457
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 65/108 (60%), Gaps = 2/108 (1%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
R+ AG +G GH DG+ EA+FN P+ + +D++ NLY+AD+ N IRK+ GV +
Sbjct: 351 RIYAGD-KGRAGHADGQETEAQFNSPRQLVLDEEENLYIADSGNHCIRKVTPEGVVSTVI 409
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G N +G++DG E A F+ + + + ++ + D+ N +R++S+
Sbjct: 410 GNPNSSGYKDGTPEIALFTEPWGLA-IDSEGTIYIGDKDNRCVRKLSI 456
>gi|451980854|ref|ZP_21929239.1| conserved hypothetical protein, contains NHL repeats [Nitrospina
gracilis 3/211]
gi|451761976|emb|CCQ90480.1| conserved hypothetical protein, contains NHL repeats [Nitrospina
gracilis 3/211]
Length = 729
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 51/138 (36%), Positives = 74/138 (53%), Gaps = 14/138 (10%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG---YTGHVDGKP-NEARFNHPKGVT 141
G L+ D +N I K+TP G+++ + G YTG DG P +A F P+ +T
Sbjct: 513 GNLYVSDRINHQIRKVTP------EGKIITIAGNGSSDYTG--DGGPAKDASFRDPQSLT 564
Query: 142 MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKFSNDFDVVYVRP 200
MD +GNLY+ DT N IRKI G+ T G N D GP+ A F + D+V+ P
Sbjct: 565 MDKEGNLYIGDTANNVIRKIDKNGIVTTYAGNGNHEHSGDGGPALKAGFMSIGDIVF-SP 623
Query: 201 TCSLLVIDRGNAALRQIS 218
+ L V++ GN +R+I+
Sbjct: 624 SGELHVVEPGNHTVRKIT 641
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 60/108 (55%), Gaps = 7/108 (6%)
Query: 113 LVAGSFQGYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIA 170
+V GY+G DG P EA P G+T D +GNLYVAD N +RK+ G +TT A
Sbjct: 77 VVGTGVSGYSG--DGGPATEATLKVPAGLTFDKEGNLYVADRENHVVRKVDTNGIITTFA 134
Query: 171 G-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
G GK+ +G + GP+ AK + D+ V +L + DR N +R++
Sbjct: 135 GTGKAGYSGDK-GPATQAKLNLPSDMT-VDHKGNLFISDRSNNVIRKV 180
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 69/146 (47%), Gaps = 12/146 (8%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQ-GYTGHVDGKPNEA 132
VP + ++G L+ D N + K+ + G + AG+ + GY+G G +A
Sbjct: 99 VPAGLTFDKEGNLYVADRENHVVRKVD------TNGIITTFAGTGKAGYSGD-KGPATQA 151
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG-PSEDAKFSN 191
+ N P +T+D KGNL+++D N IRK+ +G T G N D P+
Sbjct: 152 KLNLPSDMTVDHKGNLFISDRSNNVIRKVDPSGTITTYAGTGNEGYNGDNMPALRTNLDK 211
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
F + V +L + DRGN +R++
Sbjct: 212 PFGLA-VDKHGNLYIADRGNNRIRKV 236
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 51/114 (44%), Gaps = 7/114 (6%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG--SFQGYTGHVDGKPNEARFNHP 137
I S GEL V+ N + KIT + + LVAG QG G GK EA P
Sbjct: 619 IVFSPSGELHVVEPGNHTVRKIT----RDGKVELVAGRPGVQGLFGD-GGKATEAMLKQP 673
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
+ D KGN+Y+ D N IRK+ G+ T G+ + +G + F N
Sbjct: 674 ACIAFDSKGNMYITDMGNNRIRKVDTNGIITTLAGRGSFGWGYEGEEVNIYFQN 727
Score = 45.1 bits (105), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 2/93 (2%)
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 197
G+ D KGNLYV+D +N IRK+ G + TIAG S+ GP++DA F D +
Sbjct: 506 GLAFDKKGNLYVSDRINHQIRKVTPEGKIITIAGNGSSDYTGDGGPAKDASFR-DPQSLT 564
Query: 198 VRPTCSLLVIDRGNAALRQISLNQDDCEYQYNS 230
+ +L + D N +R+I N Y N
Sbjct: 565 MDKEGNLYIGDTANNVIRKIDKNGIVTTYAGNG 597
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 70/145 (48%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFN 135
P + V ++G ++ D N+ I KI + R V G+ Q Y G + E +
Sbjct: 269 PTDVAVDDEGNVYVADRNNNRIRKI----NTLGMIRTVMGTGQQDYNGDAE-VARETNLH 323
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNVAGFRDGPSEDAKFSND 192
P +T+D+ G+L V D + IRK+ G V TIAG G N AG +GP+ A
Sbjct: 324 LPFAITLDNNGDLLVVDRSHYRIRKMHQKGSRVETIAGNGVKNFAG-DNGPATGANLEFP 382
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
+V V +++ D+G+ +RQI
Sbjct: 383 HGIV-VDKQDNVIFADKGHYRIRQI 406
Score = 38.9 bits (89), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 17/148 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + V + G L+ D N+ I K+ S + G + +G A
Sbjct: 212 PFGLAVDKHGNLYIADRGNNRIRKVDAGSGLMS---TIGGDGGFFFIGDNGPAYRASIAG 268
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGV--TTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P V +DD+GN+YVAD N IRKI G+ T + G+ + G DA+ + + +
Sbjct: 269 PTDVAVDDEGNVYVADRNNNRIRKINTLGMIRTVMGTGQQDYNG-------DAEVARETN 321
Query: 195 V-----VYVRPTCSLLVIDRGNAALRQI 217
+ + + LLV+DR + +R++
Sbjct: 322 LHLPFAITLDNNGDLLVVDRSHYRIRKM 349
>gi|223934698|ref|ZP_03626618.1| NHL repeat containing protein [bacterium Ellin514]
gi|223896653|gb|EEF63094.1| NHL repeat containing protein [bacterium Ellin514]
Length = 847
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 70/142 (49%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V + G ++ D N+ I I P S + +AG+ G+ G D A F+
Sbjct: 224 PEAVAVDQAGNVYVADTGNAAIRMIMPGGSVTT----LAGA-AGFVGSADASGTNALFHQ 278
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ ++ GNLYVAD N IR+I AGV T G S AG DG + A+F V
Sbjct: 279 PAGIGINSAGNLYVADYFNNTIRQISPAGVVTTLAGLSGTAGSADGTNSSARFLGPQGVA 338
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V T ++ + D N+ +R ++
Sbjct: 339 -VDSTGTVFIADTANSTIRVMT 359
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 67/143 (46%), Gaps = 6/143 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V ++ D N + +TP + L AG+F G DG ARFN
Sbjct: 498 PQGVAVDSANNVYVADTGNHTVRMVTPGGISSTLAGL-AGTF----GTFDGTNAGARFNG 552
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D GNLYV D N IRK+ AG T G + + G DG A F +
Sbjct: 553 PTGIAVDGAGNLYVTDYNNDTIRKVTSAGAVTTLAGWTGMWGSIDGAGNSALFFGPSG-I 611
Query: 197 YVRPTCSLLVIDRGNAALRQISL 219
V +L VID GN+ LR+++L
Sbjct: 612 SVDALGNLYVIDSGNSTLRKLTL 634
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 71/142 (50%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G ++ D NS I KITP S + +AGS G G+ DG A+F
Sbjct: 443 PQGIAVDGGGNIYVADTGNSTIRKITPSGSTST----LAGS-AGNPGNADGAGITAQFYQ 497
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+GV +D N+YVADT N +R + G+++ G + G DG + A+F N +
Sbjct: 498 PQGVAVDSANNVYVADTGNHTVRMVTPGGISSTLAGLAGTFGTFDGTNAGARF-NGPTGI 556
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V +L V D N +R+++
Sbjct: 557 AVDGAGNLYVTDYNNDTIRKVT 578
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 70/142 (49%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G +F D NS I +T + G + + G ++G + ARF
Sbjct: 334 PQGVAVDSTGTVFIADTANSTIRVMT------AAGVVTTLAGSPSEGSINGVTSSARFYS 387
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+ V +D + N+YVADT N IRKI GV ++ G + V G DG +A FS +
Sbjct: 388 PQNVAVDGQNNIYVADTQNSVIRKITPFGVVSVLAGTTGVFGSADGSGANALFSGPQGIA 447
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V ++ V D GN+ +R+I+
Sbjct: 448 -VDGGGNIYVADTGNSTIRKIT 468
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 69/142 (48%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V ++ D NS I KITP + ++AG+ G G DG A F+
Sbjct: 388 PQNVAVDGQNNIYVADTQNSVIRKITP----FGVVSVLAGT-TGVFGSADGSGANALFSG 442
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+G+ +D GN+YVADT N IRKI +G T+ G + G DG A+F V
Sbjct: 443 PQGIAVDGGGNIYVADTGNSTIRKITPSGSTSTLAGSAGNPGNADGAGITAQFYQPQGVA 502
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V ++ V D GN +R ++
Sbjct: 503 -VDSANNVYVADTGNHTVRMVT 523
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 71/141 (50%), Gaps = 8/141 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + DG ++ D +S I K+T Q + +AG G TG V+ A F H
Sbjct: 116 PSGIALDSDGNIYVSDYGSSTIRKVT----QSGQVTTLAG-MTGVTGSVNNTGTNALFFH 170
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D+ NLYVAD N IRKI + V + G + V G DG +F N+ + V
Sbjct: 171 PMGLAVDNATNLYVADYGNHLIRKITPSNVVSTLAGVTGVPGSADG--LGGQF-NEPEAV 227
Query: 197 YVRPTCSLLVIDRGNAALRQI 217
V ++ V D GNAA+R I
Sbjct: 228 AVDQAGNVYVADTGNAAIRMI 248
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 73/143 (51%), Gaps = 9/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I ++ G L+ D N+ I +I+P + L G G DG + ARF
Sbjct: 279 PAGIGINSAGNLYVADYFNNTIRQISPAGVVTTLAGL-----SGTAGSADGTNSSARFLG 333
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+GV +D G +++ADT N IR + AG VTT+AG S G +G + A+F + +V
Sbjct: 334 PQGVAVDSTGTVFIADTANSTIRVMTAAGVVTTLAGSPSE--GSINGVTSSARFYSPQNV 391
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V ++ V D N+ +R+I+
Sbjct: 392 A-VDGQNNIYVADTQNSVIRKIT 413
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 70/142 (49%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N NI+++ ++ R +AG+ G G DG +A FN
Sbjct: 61 PQGVAVDGAGNVYVADTGN-NIIRV---VTASGLCRTLAGT-AGVQGSADGMGAQASFNQ 115
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D GN+YV+D + IRK+ +G T G + V G + +A F + +
Sbjct: 116 PSGIALDSDGNIYVSDYGSSTIRKVTQSGQVTTLAGMTGVTGSVNNTGTNALFFHPMGLA 175
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V +L V D GN +R+I+
Sbjct: 176 -VDNATNLYVADYGNHLIRKIT 196
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/145 (33%), Positives = 66/145 (45%), Gaps = 13/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA-GSFQGYTGHVDGKPNEARFN 135
P I V G L+ D N I K+T S G + + G G +DG N A F
Sbjct: 553 PTGIAVDGAGNLYVTDYNNDTIRKVT------SAGAVTTLAGWTGMWGSIDGAGNSALFF 606
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSN 191
P G+++D GNLYV D+ N +RK+ +G V+T+A G V G DG A+F
Sbjct: 607 GPSGISVDALGNLYVIDSGNSTLRKLTLSGGTWTVSTVA-GMPGVNGGIDGSGAGAEFYY 665
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQ 216
V V + V D GN +R
Sbjct: 666 PAGVT-VSAAGYVYVADAGNNTIRS 689
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 62/144 (43%), Gaps = 12/144 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
P + V L+ D N I KITP +S + V GS G G +F
Sbjct: 171 PMGLAVDNATNLYVADYGNHLIRKITPSNVVSTLAGVTGVPGSADGLGG---------QF 221
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
N P+ V +D GN+YVADT N AIR I G T G + G D +A F
Sbjct: 222 NEPEAVAVDQAGNVYVADTGNAAIRMIMPGGSVTTLAGAAGFVGSADASGTNALFHQPAG 281
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ + +L V D N +RQIS
Sbjct: 282 -IGINSAGNLYVADYFNNTIRQIS 304
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 55/108 (50%), Gaps = 3/108 (2%)
Query: 113 LVAGSFQGYTG--HVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA 170
L+ G+ GY G DG A+ P+GV +D GN+YVADT N IR + +G+
Sbjct: 35 LLIGTVAGYAGKGSADGVGGSAQLFGPQGVAVDGAGNVYVADTGNNIIRVVTASGLCRTL 94
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G + V G DG A F N + + ++ V D G++ +R+++
Sbjct: 95 AGTAGVQGSADGMGAQASF-NQPSGIALDSDGNIYVSDYGSSTIRKVT 141
>gi|427400920|ref|ZP_18892158.1| hypothetical protein HMPREF9710_01754 [Massilia timonae CCUG 45783]
gi|425720099|gb|EKU83025.1| hypothetical protein HMPREF9710_01754 [Massilia timonae CCUG 45783]
Length = 717
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 74/142 (52%), Gaps = 10/142 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + +G L+ D N I K+ P + + VAGS G G++DG A+FN
Sbjct: 131 PSALALDHEGNLYVADTGNHAIRKVAPDGTVTT----VAGS--GSPGYLDGIGRAAQFNG 184
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +DD G +YVADT N IR+I D VTT+AG + G DG DA F +
Sbjct: 185 PVGIAVDDAGIVYVADTYNDRIRRIAPDGMVTTLAG--NGKPGLLDGALLDAGFDTPSAL 242
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
R +L V D GN A+R+I
Sbjct: 243 AAGR-DGTLYVADTGNHAVRRI 263
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 68/133 (51%), Gaps = 16/133 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + + G ++ D +N +++ P S +AG +G+ DG A F+
Sbjct: 77 PYGVAIGARGAVYVADGGEANRIRLIQPDGAVS---TLAGGKEGFA---DGIGAAAAFHT 130
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFS----- 190
P + +D +GNLYVADT N AIRK+ D VTT+AG S G+ DG A+F+
Sbjct: 131 PSALALDHEGNLYVADTGNHAIRKVAPDGTVTTVAG--SGSPGYLDGIGRAAQFNGPVGI 188
Query: 191 --NDFDVVYVRPT 201
+D +VYV T
Sbjct: 189 AVDDAGIVYVADT 201
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 66/125 (52%), Gaps = 13/125 (10%)
Query: 101 ITPPLSQY-----SRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLN 155
+TPP+++ +R L+AG G G +DG + +RF+ P GV + +G +YVAD
Sbjct: 38 LTPPVAKTTPFWPARVSLLAG--DGRDGVIDGPGSASRFSDPYGVAIGARGAVYVADGGE 95
Query: 156 L-AIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 213
IR I D V+T+AGGK GF DG A F + + +L V D GN A
Sbjct: 96 ANRIRLIQPDGAVSTLAGGKE---GFADGIGAAAAFHTP-SALALDHEGNLYVADTGNHA 151
Query: 214 LRQIS 218
+R+++
Sbjct: 152 IRKVA 156
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 45/95 (47%), Gaps = 7/95 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V + G ++ D N I +I P G + + G G +DG +A F+
Sbjct: 185 PVGIAVDDAGIVYVADTYNDRIRRIAP------DGMVTTLAGNGKPGLLDGALLDAGFDT 238
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIA 170
P + G LYVADT N A+R+I D V T+A
Sbjct: 239 PSALAAGRDGTLYVADTGNHAVRRIKPDGTVDTLA 273
>gi|302848538|ref|XP_002955801.1| hypothetical protein VOLCADRAFT_119185 [Volvox carteri f.
nagariensis]
gi|300258994|gb|EFJ43226.1| hypothetical protein VOLCADRAFT_119185 [Volvox carteri f.
nagariensis]
Length = 1149
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/97 (40%), Positives = 56/97 (57%), Gaps = 3/97 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFR 180
GH DG ARF + +GV +D GN++VAD+ N +R++ D+ V+T+ G N AGFR
Sbjct: 269 GHADGPSGSARFYNLRGVAVDGDGNIFVADSSNHCVRQLTAADSIVSTLVGSPGN-AGFR 327
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
DG DA+F N +L+V D GN LR++
Sbjct: 328 DGTGTDARFRNPCGAAVNLQDGTLVVADAGNNRLRKV 364
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 68/144 (47%), Gaps = 10/144 (6%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + V G + D + I++I + G V + G GH DG +A+F+
Sbjct: 174 APVGVVVDFGGNAYIADTGHCRILRI-----RLDSGEAVVLAGGGGYGHKDGPGRKAKFS 228
Query: 136 HPKGVTMDDK-GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P + +D + G+L V+D L R D VTTIAGG + G DGPS A+F N
Sbjct: 229 CPMYLALDHRDGSLVVSDQHCLR-RVASDGFVTTIAGGTT--PGHADGPSGSARFYN-LR 284
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
V V ++ V D N +RQ++
Sbjct: 285 GVAVDGDGNIFVADSSNHCVRQLT 308
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 52/106 (49%), Gaps = 7/106 (6%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ V DG +F D N + ++T S S + GS G G DG +ARF +P G
Sbjct: 286 VAVDGDGNIFVADSSNHCVRQLTAADSIVS---TLVGS-PGNAGFRDGTGTDARFRNPCG 341
Query: 140 VTMD-DKGNLYVADTLNLAIRKIG-DAGVTTIAG-GKSNVAGFRDG 182
++ G L VAD N +RK+ D TTIAG G + VA DG
Sbjct: 342 AAVNLQDGTLVVADAGNNRLRKVDRDRLTTTIAGNGTAGVAIDGDG 387
>gi|431798813|ref|YP_007225717.1| NHL repeat protein [Echinicola vietnamensis DSM 17526]
gi|430789578|gb|AGA79707.1| NHL repeat protein [Echinicola vietnamensis DSM 17526]
Length = 655
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 71/108 (65%), Gaps = 6/108 (5%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG 172
+AG+ G G +G + A+F++P+G+ +D +GN++VAD N IRKI +G V+T AG
Sbjct: 338 IAGT--GIAGFQNGSSSTAKFDNPEGIAIDQQGNIFVADRDNNVIRKISSSGDVSTFAG- 394
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ +AGF DG + AKF++ + V V +++V DRGN ++R+I+ N
Sbjct: 395 -TGIAGFTDGVAGVAKFNSPWKVA-VDNQGNVIVADRGNHSIRKITPN 440
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 73/146 (50%), Gaps = 19/146 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+K+ V G + D N +I KITP + + +AG+ GY DG N+A+F+
Sbjct: 413 PWKVAVDNQGNVIVADRGNHSIRKITPNGTVST----LAGTTNGYQ---DGSGNQAKFDQ 465
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDA-GVTTIAGGKSNVAGFRDGPSEDAKF----SN 191
P V + GN+ +AD N IR I + V+TIAG + G+ DG A+F
Sbjct: 466 PTDVAVLPNGNIVIADNRNHCIRMIDSSVQVSTIAG--TGNGGYVDGAGSQAQFYYPSGI 523
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
D D P +L V DR N A+R+I
Sbjct: 524 DTD-----PNGNLFVADRKNHAIRKI 544
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 74/143 (51%), Gaps = 10/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V +G + D N I I + + +AG+ G G+VDG ++A+F +
Sbjct: 466 PTDVAVLPNGNIVIADNRNHCIRMIDSSVQVST----IAGTGNG--GYVDGAGSQAQFYY 519
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGD-AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D GNL+VAD N AIRKI V+T+AGG N G ++G AKF + + V
Sbjct: 520 PSGIDTDPNGNLFVADRKNHAIRKIDSYHNVSTVAGG--NGEGIQNGGIAVAKFDDPYGV 577
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V +LV D N +R+I+
Sbjct: 578 A-VGQNGKVLVADLDNNVIREIN 599
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 68/141 (48%), Gaps = 10/141 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I +G LF D N I KI S ++ + G+ +G +G A+F+
Sbjct: 520 PSGIDTDPNGNLFVADRKNHAIRKID---SYHNVSTVAGGNGEGIQ---NGGIAVAKFDD 573
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P GV + G + VAD N IR+I V+TI G SN G+ DGPS +K ++ DV+
Sbjct: 574 PYGVAVGQNGKVLVADLDNNVIREINGDYVSTIIG--SNGEGYIDGPSTASKMNSPTDVL 631
Query: 197 YVRPTCSLLVIDRGNAALRQI 217
++ D GN +R++
Sbjct: 632 V--NGDEIIFADYGNHLVRKV 650
>gi|431798812|ref|YP_007225716.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
gi|430789577|gb|AGA79706.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
Length = 428
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 57/143 (39%), Positives = 77/143 (53%), Gaps = 12/143 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V+ +G L+ D N I KI + + VAGS QG + DG EA F +
Sbjct: 242 PLDIAVTAEGVLYVTDNRNHRIRKIEVDGTVST----VAGSEQG---NQDGALEEATFRY 294
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P G+ +DD GN+YVAD +N IRKI +AG V+T+AG S G RDG A+F+N +
Sbjct: 295 PSGLDVDDMGNIYVADRINHLIRKIDLNAGQVSTVAGDGSQ--GTRDGQVMTAQFNNPYG 352
Query: 195 VVYVRPTCSLLVIDRGNAALRQI 217
+ V L+V D N +R I
Sbjct: 353 -ISVADNGQLVVADLSNHKIRLI 374
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 54/141 (38%), Positives = 71/141 (50%), Gaps = 10/141 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V + G ++ D +N I KI Q S VAG G G DG+ A+FN+
Sbjct: 295 PSGLDVDDMGNIYVADRINHLIRKIDLNAGQVST---VAG--DGSQGTRDGQVMTAQFNN 349
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+++ D G L VAD N IR I V TIAG +VAGF DG ++F N DV
Sbjct: 350 PYGISVADNGQLVVADLSNHKIRLIQGENVITIAG---SVAGFLDGVGVTSQFYNPTDVT 406
Query: 197 YVRPTCSLLVIDRGNAALRQI 217
Y + V D GN +R+I
Sbjct: 407 Y--HDGVIYVADLGNHRVRKI 425
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 93/201 (46%), Gaps = 25/201 (12%)
Query: 26 PAGPLI---KHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRV 82
P G LI + S+ K TT + GN F+NG + +++ P+K V
Sbjct: 141 PDGYLIVTDRANNSIRKVTTDGAVSTVLGTGNS-GFQNGPVASALLD------YPWKSCV 193
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTM 142
+G ++ D N I KI P +G + + G G DG EA+F+ P + +
Sbjct: 194 DMEGNIYVADRDNHMIRKIDP------QGMVSTVAGTGEAGFADGPAEEAQFDQPLDIAV 247
Query: 143 DDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFS--NDFDVVYVR 199
+G LYV D N IRKI D V+T+AG + G +DG E+A F + D V
Sbjct: 248 TAEGVLYVTDNRNHRIRKIEVDGTVSTVAGSEQ---GNQDGALEEATFRYPSGLD---VD 301
Query: 200 PTCSLLVIDRGNAALRQISLN 220
++ V DR N +R+I LN
Sbjct: 302 DMGNIYVADRINHLIRKIDLN 322
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 49/100 (49%), Gaps = 12/100 (12%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G +DG +A F +P+GVTM G L V D N +IRK+ G + G N +GF++G
Sbjct: 120 GLIDGSGIQAAFRNPEGVTMHPDGYLIVTDRANNSIRKVTTDGAVSTVLGTGN-SGFQNG 178
Query: 183 PSEDAKFSNDFDVVYVRPTC-----SLLVIDRGNAALRQI 217
P A Y +C ++ V DR N +R+I
Sbjct: 179 PVASALLD------YPWKSCVDMEGNIYVADRDNHMIRKI 212
>gi|409204160|ref|ZP_11232357.1| putative outer membrane adhesin-like protein [Pseudoalteromonas
flavipulchra JG1]
Length = 2380
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 66/142 (46%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N+ I KITP G + + G G DG A F
Sbjct: 618 PKAVTVDASGNVYVADSSNNKIRKITP------AGVVTTFAGSGSPGSTDGTGTAATFAA 671
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+T+D GNL+V +T +RKI AGV T G +GF D AKF+ ++
Sbjct: 672 PTGITIDSNGNLFVVETNPHIVRKITPAGVVTTFAGSKGSSGFTDATGTSAKFNFPYN-G 730
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
L + DR N A+R+++
Sbjct: 731 GSSSNNDLFIADRNNHAIRKVT 752
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 67/142 (47%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY S + +LF D N I K+T S + + G G +G +A FN
Sbjct: 727 PYNGGSSSNNDLFIADRNNHAIRKVT------SASVVTTFAGTGSAGSANGTGTQASFNK 780
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P + +D NLYV++ IRKI AGV T G + V+G DG + A+FS F +
Sbjct: 781 PYDIALDSADNLYVSEQTGHTIRKITSAGVVTTFAGSAGVSGSTDGLASVARFSQPFGIA 840
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V + V D GN +R+IS
Sbjct: 841 -VDSNGIVYVADTGNHRIRKIS 861
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 57/115 (49%), Gaps = 9/115 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + + ++ D N I KITP G + + G G +G + A FN
Sbjct: 564 PYDVATDSNNNVYVADYSNHVIRKITP------EGVVTTLAGSGTAGSDEGTGSAASFNF 617
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
PK VT+D GN+YVAD+ N IRKI AG VTT AG S G DG A F+
Sbjct: 618 PKAVTVDASGNVYVADSSNNKIRKITPAGVVTTFAGSGS--PGSTDGTGTAATFA 670
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 70/144 (48%), Gaps = 11/144 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIV-KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P I + +G LF V E N +IV KITP AGS +G +G D A+FN
Sbjct: 672 PTGITIDSNGNLFVV-ETNPHIVRKITPA----GVVTTFAGS-KGSSGFTDATGTSAKFN 725
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P +L++AD N AIRK+ A VTT AG S AG +G A F+ +D
Sbjct: 726 FPYNGGSSSNNDLFIADRNNHAIRKVTSASVVTTFAGTGS--AGSANGTGTQASFNKPYD 783
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ + +L V ++ +R+I+
Sbjct: 784 IA-LDSADNLYVSEQTGHTIRKIT 806
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 52/101 (51%), Gaps = 10/101 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQGYTGHVDGKPNEARF 134
PY I + L+ ++ I KIT S G + AGS G +G DG + ARF
Sbjct: 781 PYDIALDSADNLYVSEQTGHTIRKIT------SAGVVTTFAGS-AGVSGSTDGLASVARF 833
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 175
+ P G+ +D G +YVADT N IRKI A TT+ G SN
Sbjct: 834 SQPFGIAVDSNGIVYVADTGNHRIRKISPA-ETTLTGTPSN 873
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 49/98 (50%), Gaps = 4/98 (4%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR 180
+G+ DG A F P V D N+YVAD N IRKI G VTT+AG S AG
Sbjct: 549 SGYADGTGTAASFKSPYDVATDSNNNVYVADYSNHVIRKITPEGVVTTLAG--SGTAGSD 606
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+G A F N V V + ++ V D N +R+I+
Sbjct: 607 EGTGSAASF-NFPKAVTVDASGNVYVADSSNNKIRKIT 643
>gi|255531264|ref|YP_003091636.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
gi|255344248|gb|ACU03574.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
Length = 439
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 76/143 (53%), Gaps = 10/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G ++ D I K++P G + + G G++D +A+F +
Sbjct: 250 PLDVVADAQGNIYVADNTAHRIRKVSP------SGTVTTLAGDGTAGYLDATGVQAKFRN 303
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+T+D +GN+ VAD LN IRKI AG V+TIAG + G DG + AKF++ + V
Sbjct: 304 PSGLTVDQQGNIIVADRLNHRIRKITPAGTVSTIAG--AGTTGLLDGDALTAKFADPYGV 361
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V ++LV + NA +R+I+
Sbjct: 362 A-VDAGGNILVAELTNARIRKIT 383
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/122 (39%), Positives = 66/122 (54%), Gaps = 12/122 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V + G + D +N I KITP + + +AG+ G TG +DG A+F
Sbjct: 304 PSGLTVDQQGNIIVADRLNHRIRKITPAGTVST----IAGA--GTTGLLDGDALTAKFAD 357
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS--NDF 193
P GV +D GN+ VA+ N IRKI G V+T+AG AGF DG S +AKF+ D
Sbjct: 358 PYGVAVDAGGNILVAELTNARIRKITPVGQVSTLAGSS---AGFADGLSVNAKFNQPTDL 414
Query: 194 DV 195
D+
Sbjct: 415 DI 416
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 41/100 (41%), Positives = 61/100 (61%), Gaps = 4/100 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAG 178
G G +G +A+F +P+G+ +D +GN+ VAD N IRKI AG VTTIAG AG
Sbjct: 126 GTAGLFNGAAADAQFRNPEGLAVDAQGNVIVADRQNHTIRKISPAGQVTTIAG--DGTAG 183
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ DG AKFS+ + + + P +++V DR N +R+I+
Sbjct: 184 YADGTGTAAKFSSPWKLA-IDPLGNIIVADRDNFKIRKIA 222
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 59/121 (48%), Gaps = 10/121 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G + D N I KI+P G++ + G G+ DG A+F+
Sbjct: 143 PEGLAVDAQGNVIVADRQNHTIRKISPA------GQVTTIAGDGTAGYADGTGTAAKFSS 196
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P + +D GN+ VAD N IRKI D VTT+AG + AGF DG AKF DV
Sbjct: 197 PWKLAIDPLGNIIVADRDNFKIRKIAPDGAVTTLAG---STAGFADGTGSAAKFMQPLDV 253
Query: 196 V 196
V
Sbjct: 254 V 254
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 72/143 (50%), Gaps = 11/143 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+K+ + G + D N I KI P + + +AGS G+ DG + A+F
Sbjct: 197 PWKLAIDPLGNIIVADRDNFKIRKIAPDGAVTT----LAGSTAGF---ADGTGSAAKFMQ 249
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P V D +GN+YVAD IRK+ +G VTT+AG AG+ D AKF N +
Sbjct: 250 PLDVVADAQGNIYVADNTAHRIRKVSPSGTVTTLAG--DGTAGYLDATGVQAKFRNPSGL 307
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V +++V DR N +R+I+
Sbjct: 308 T-VDQQGNIIVADRLNHRIRKIT 329
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 45/85 (52%), Gaps = 7/85 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + V G + + N+ I KITP + Q S +AGS G+ DG A+FN
Sbjct: 358 PYGVAVDAGGNILVAELTNARIRKITP-VGQVST---LAGSSAGF---ADGLSVNAKFNQ 410
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P + +D KGN+YVA+ N IR I
Sbjct: 411 PTDLDIDAKGNIYVAEVTNHRIRMI 435
>gi|223937279|ref|ZP_03629185.1| NHL repeat containing protein [bacterium Ellin514]
gi|223894064|gb|EEF60519.1| NHL repeat containing protein [bacterium Ellin514]
Length = 429
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/142 (34%), Positives = 65/142 (45%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N I KITP + L G G DG ++ARFN
Sbjct: 164 PRSLAVDNGGNVYVADTWNHTIRKITPAGLVSTLAGLA-----GNPGSADGTNSKARFNR 218
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D+ NL+V D N IRKI G T G V G DG + A+F +V
Sbjct: 219 PSGIAVDNATNLFVTDFHNHTIRKITPGGTVTTIAGLPGVWGNADGTNNVARFFQPQGIV 278
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+L V D GN +R+IS
Sbjct: 279 -ADNAGNLFVADSGNQTIRKIS 299
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 73/143 (51%), Gaps = 11/143 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V LF D N I KITP + + +AG G G+ DG N ARF
Sbjct: 219 PSGIAVDNATNLFVTDFHNHTIRKITPGGTVTT----IAG-LPGVWGNADGTNNVARFFQ 273
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSND 192
P+G+ D+ GNL+VAD+ N IRKI +G V+T+A G S +AG +G + A+F
Sbjct: 274 PQGIVADNAGNLFVADSGNQTIRKISPSGTNWIVSTVA-GLSGIAGNANGTNNTARFYFP 332
Query: 193 FDVVYVRPTCSLLVIDRGNAALR 215
DV + V D GN A+R
Sbjct: 333 ADVAQ-DIAGYIYVADLGNNAIR 354
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 71/151 (47%), Gaps = 12/151 (7%)
Query: 71 NEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV---DG 127
N + P I V G ++ D N+ I KITP V + G G++ DG
Sbjct: 103 NALFYAPQGIAVDSAGFIYVADTANATIRKITPAG--------VVSTLAGSAGNINSFDG 154
Query: 128 KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDA 187
A F P+ + +D+ GN+YVADT N IRKI AG+ + G + G DG + A
Sbjct: 155 TGINANFYQPRSLAVDNGGNVYVADTWNHTIRKITPAGLVSTLAGLAGNPGSADGTNSKA 214
Query: 188 KFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+F N + V +L V D N +R+I+
Sbjct: 215 RF-NRPSGIAVDNATNLFVTDFHNHTIRKIT 244
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 70/143 (48%), Gaps = 8/143 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G ++ D NS I KITP S + AG F G G DG A F
Sbjct: 54 PNSIAADSAGNIYVADTENSTIRKITPNGSVST----FAG-FAGTFGSADGVGTNALFYA 108
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+G+ +D G +YVADT N IRKI AG V+T+AG N+ F DG +A F +
Sbjct: 109 PQGIAVDSAGFIYVADTANATIRKITPAGVVSTLAGSAGNINSF-DGTGINANFYQPRSL 167
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V ++ V D N +R+I+
Sbjct: 168 A-VDNGGNVYVADTWNHTIRKIT 189
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 3/108 (2%)
Query: 113 LVAGSFQGYT--GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA 170
L +F G+ G DG + ARF HP + D GN+YVADT N IRKI G +
Sbjct: 28 LTINTFAGHDAPGSKDGLSSSARFRHPNSIAADSAGNIYVADTENSTIRKITPNGSVSTF 87
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G + G DG +A F + V + V D NA +R+I+
Sbjct: 88 AGFAGTFGSADGVGTNALFYAPQGIA-VDSAGFIYVADTANATIRKIT 134
>gi|330824371|ref|YP_004387674.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
gi|329309743|gb|AEB84158.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
Length = 362
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 75/158 (47%), Gaps = 9/158 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + DG L+ D N + + S G + + G G DG + ARFN
Sbjct: 59 PHGLSYGNDGSLYFADRGNHQVRVLR------SGGTVATVAGTGKAGFADGPAHAARFNE 112
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P V ++ G +YVAD N IRKI D V T+AGG S AGF DG + A+F+ + V
Sbjct: 113 PIAVAVERSGAVYVADRNNHRIRKIRPDGTVITLAGGDS--AGFVDGDLKSARFNQPYGV 170
Query: 196 VYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISP 233
+L V D N A+R+I L D + + +P
Sbjct: 171 ALDAAQTTLYVADYLNHAIRRIDLVLDKVDTLAGNGAP 208
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 78/144 (54%), Gaps = 10/144 (6%)
Query: 77 PYKIRV-SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
PY + + + L+ D +N I +I L + +AG+ G G DG+ ARFN
Sbjct: 167 PYGVALDAAQTTLYVADYLNHAIRRIDLVLDKVD---TLAGN--GAPGFADGQRATARFN 221
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P V +D +G L+V D LN A+R++ AG VTT+AG + AG+ DGP+ +F N
Sbjct: 222 QPYNVRIDGQGRLWVPDQLNHAVRRVTPAGEVTTVAG--AGKAGYADGPAATVRFDNPTG 279
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
V + P +++V DR N LR ++
Sbjct: 280 VAPL-PNGAVVVADRNNNRLRLVT 302
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 9/131 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY +R+ G L+ D++N + ++TP G + + G G+ DG RF++
Sbjct: 223 PYNVRIDGQGRLWVPDQLNHAVRRVTPA------GEVTTVAGAGKAGYADGPAATVRFDN 276
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV G + VAD N +R + D V T+AG + AGF DG + A+F+ DV
Sbjct: 277 PTGVAPLPNGAVVVADRNNNRLRLVTPDGAVATLAG--AGEAGFADGTAASARFNQPLDV 334
Query: 196 VYVRPTCSLLV 206
+ +LV
Sbjct: 335 EFDDSMSRVLV 345
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 58/107 (54%), Gaps = 8/107 (7%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIA 170
R +AG G G DG P R N P G++ + G+LY AD N +R + G V T+A
Sbjct: 38 RTLAG--DGQPGAADGSPG--RVNRPHGLSYGNDGSLYFADRGNHQVRVLRSGGTVATVA 93
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
G + AGF DGP+ A+F N+ V V + ++ V DR N +R+I
Sbjct: 94 G--TGKAGFADGPAHAARF-NEPIAVAVERSGAVYVADRNNHRIRKI 137
>gi|298386946|ref|ZP_06996500.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
gi|298260096|gb|EFI02966.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
Length = 457
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 65/108 (60%), Gaps = 2/108 (1%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
R+ AG +G GH DG+ EA+F+ P+ + +D++ NLY+AD+ N IRK+ GV +
Sbjct: 351 RIYAGD-KGRAGHADGQETEAQFSSPRQLVLDEEENLYIADSGNHCIRKVTPEGVVSTVI 409
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G N +G++DG E A F+ + + + ++ + D+ N +R++S+
Sbjct: 410 GNPNTSGYKDGTPEIALFTEPWGLA-IDSEGTIYIGDKDNRCVRKLSI 456
>gi|357008076|ref|ZP_09073075.1| copper amine oxidase domain-containing protein [Paenibacillus elgii
B69]
Length = 533
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 76/146 (52%), Gaps = 6/146 (4%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY--TGHVDGKPNEAR 133
+P + V +DG + D N I K++ G +G+ +DGK + +
Sbjct: 67 MPGGLTVLKDGTVLVSDSRNQLIRKLSQGTVSTFAGAAYKQDSKGFPVGALLDGKSDASL 126
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSND 192
FN P+G+ D GN+YVAD+ N AIRKI AG V+T+AG + + G +DG +DA F
Sbjct: 127 FNEPQGLAADANGNVYVADSGNHAIRKIDTAGQVSTVAG--NGLLGRKDGEGKDALFYRP 184
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
DV V +L V D N A+R IS
Sbjct: 185 TDVA-VAADGTLYVADSLNHAIRSIS 209
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 68/151 (45%), Gaps = 12/151 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT--------GHVDGK 128
P + + G L D N I I Q + +AG Q T G DG
Sbjct: 251 PTALVLDAKGNLIVSDSGNQRIRYIDL---QQGKVTTLAGGGQAATNKELHVQGGFADGS 307
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAK 188
++ARF+ P G+ + ++G L +AD+ N AIR + D V+TIAG + G DG A
Sbjct: 308 ASDARFSFPMGLALTEEGGLVIADSQNHAIRYLLDGQVSTIAGAADRITGNADGIEGSAA 367
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
DV V S+L D N LR++SL
Sbjct: 368 LHRPMDVA-VLADGSILAADTYNNKLREVSL 397
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 69/156 (44%), Gaps = 18/156 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV-------DGKP 129
P + V+ DG L+ D +N I I+P + L + + G V DG
Sbjct: 184 PTDVAVAADGTLYVADSLNHAIRSISPSGEVKTLNALSPRVVELFPGQVSPAGDFADGDL 243
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNV--------AGF 179
A+FN P + +D KGNL V+D+ N IR I VTT+AGG GF
Sbjct: 244 KSAKFNEPTALVLDAKGNLIVSDSGNQRIRYIDLQQGKVTTLAGGGQAATNKELHVQGGF 303
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALR 215
DG + DA+FS + L++ D N A+R
Sbjct: 304 ADGSASDARFSFPMGLALTEE-GGLVIADSQNHAIR 338
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 58/125 (46%), Gaps = 13/125 (10%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGG--KSNVA 177
G G DG A F P G+T+ G + V+D+ N IRK+ V+T AG K +
Sbjct: 51 GGLGSADGAGVSASFRMPGGLTVLKDGTVLVSDSRNQLIRKLSQGTVSTFAGAAYKQDSK 110
Query: 178 GF-----RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSIS 232
GF DG S DA N+ + ++ V D GN A+R+I D Q ++++
Sbjct: 111 GFPVGALLDGKS-DASLFNEPQGLAADANGNVYVADSGNHAIRKI-----DTAGQVSTVA 164
Query: 233 PTDIL 237
+L
Sbjct: 165 GNGLL 169
>gi|255532869|ref|YP_003093241.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
gi|255345853|gb|ACU05179.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
Length = 2296
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 7/135 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G ++ D N I KIT + G + + G DG ARFN
Sbjct: 667 PTGLAADAAGNVYVADRDNHRIRKIT------AAGVVTTLAGNGTADFADGTQANARFNM 720
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P GV +D GN++VAD N IRK+ AGV + GK +GF +G S DA+F F +
Sbjct: 721 PVGVAVDANGNVFVADRNNFRIRKVTPAGVVSTYAGKG-TSGFAEGLSGDAQFKLVFGLA 779
Query: 197 YVRPTCSLLVIDRGN 211
+ +L + D+GN
Sbjct: 780 MSSISGNLYISDQGN 794
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 74/143 (51%), Gaps = 16/143 (11%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV---AGSFQGYTGHVDGKPNEARFNH 136
I + G L+ D+ ++I KITP G +V AGS G G DG A+F
Sbjct: 616 IAIDNSGNLYVTDQDVNSIRKITP-------GGVVSTFAGS--GVKGFKDGTGTAAQFAF 666
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D GN+YVAD N IRKI AG VTT+AG + A F DG +A+F+ V
Sbjct: 667 PTGLAADAAGNVYVADRDNHRIRKITAAGVVTTLAG--NGTADFADGTQANARFNMPVGV 724
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V ++ V DR N +R+++
Sbjct: 725 A-VDANGNVFVADRNNFRIRKVT 746
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 10/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N + KITP G++ + G G+ +G A F +
Sbjct: 559 PSGVAVDAAGNVYVSDTRNFRVRKITPA------GQVSTFAGTGTYGYANGTTTTATFTY 612
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
G+ +D+ GNLYV D +IRKI G V+T AG S V GF+DG A+F+ +
Sbjct: 613 LLGIAIDNSGNLYVTDQDVNSIRKITPGGVVSTFAG--SGVKGFKDGTGTAAQFAFPTGL 670
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
++ V DR N +R+I+
Sbjct: 671 A-ADAAGNVYVADRDNHRIRKIT 692
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 48/96 (50%), Gaps = 7/96 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G + D N I KI+P G + + G G DG A+FN
Sbjct: 505 PAGIAMDAAGNTYLADRGNHRIRKISPA------GVVTTLAGSGTAGFADGAGAAAQFND 558
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
P GV +D GN+YV+DT N +RKI AG V+T AG
Sbjct: 559 PSGVAVDAAGNVYVSDTRNFRVRKITPAGQVSTFAG 594
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 56/132 (42%), Gaps = 18/132 (13%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
+P + V +G +F D N I K+TP +S Y+ +G +G +G +A+
Sbjct: 720 MPVGVAVDANGNVFVADRNNFRIRKVTPAGVVSTYAG--------KGTSGFAEGLSGDAQ 771
Query: 134 FNHPKGVTMDD-KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR----DGPSEDAK 188
F G+ M GNLY++D N + I D G I G VAG + +
Sbjct: 772 FKLVFGLAMSSISGNLYISDQGNFNVSFITDGG---ILSGTPTVAGVNAIKLKATNTNGN 828
Query: 189 FSNDFDVVYVRP 200
DFD+ P
Sbjct: 829 QDQDFDITVTGP 840
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 33/62 (53%), Gaps = 3/62 (4%)
Query: 120 GYTGHVDGKPNEA---RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNV 176
G G+ +G N+A +F++ G+T D NLYV D+ N IRKI G ++ G
Sbjct: 45 GVDGYYEGDNNDAITAQFSYIGGITTDANNNLYVTDSRNYCIRKIDQNGKVSVIAGIPTT 104
Query: 177 AG 178
AG
Sbjct: 105 AG 106
>gi|182414669|ref|YP_001819735.1| SMP-30/gluconolaconase/LRE domain-containing protein [Opitutus
terrae PB90-1]
gi|177841883|gb|ACB76135.1| SMP-30/Gluconolaconase/LRE domain protein [Opitutus terrae PB90-1]
Length = 1292
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 54/132 (40%), Positives = 66/132 (50%), Gaps = 7/132 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G LF D + I KI + Y VAG+ G G +DG+ ARFN
Sbjct: 427 PTGIAVDAAGNLFVAD-FKATIRKIDA--TGYVS--TVAGA-HGLDGSLDGEKTAARFNA 480
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D GNLYVADT N +IRKI AG T G S V G DG A+F + +
Sbjct: 481 PHGLAVDQHGNLYVADTFNHSIRKIDAAGQVTTPYGVSGVEGTVDGIGNAARFGSPTALA 540
Query: 197 YVRPTCSLLVID 208
+ R SL V D
Sbjct: 541 FDR-DGSLFVAD 551
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 58/94 (61%), Gaps = 2/94 (2%)
Query: 125 VDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPS 184
VDG ++ARFN PKG+ +D G +YVADT N IRK+ +GV T G ++G DG
Sbjct: 208 VDGAGSDARFNGPKGIAVDANGTVYVADTSNHIIRKVTPSGVVTTLAGSPGISGNSDGAG 267
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ A+F+ D+ V ++ V+D+ + +LR+I+
Sbjct: 268 DAARFNAPTDIA-VDDAGTIYVVDQ-SGSLRKIT 299
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 52/141 (36%), Positives = 67/141 (47%), Gaps = 20/141 (14%)
Query: 62 YLVETVIEGNEIGVV-----------PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSR 110
Y T + N IG V P + DG L+ VD + I KITP S +
Sbjct: 31 YTFTTYVGRNNIGYVDGEGTQARFHEPAGLACDADGNLYVVDPGTNLIRKITPAASVST- 89
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTI 169
AG+ G+ G VDG ARF P+GV + G +Y+ADT N AIR I D V +
Sbjct: 90 ---FAGTPTGW-GLVDGPAASARFGLPQGVAVGADGTVYIADTGNAAIRIITPDGSVRIL 145
Query: 170 AGGKSNVAGFRDGPSEDAKFS 190
AGG+S G +DG A F+
Sbjct: 146 AGGRS---GSQDGYGTGATFN 163
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 50/96 (52%), Gaps = 1/96 (1%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G+VDG+ +ARF+ P G+ D GNLYV D IRKI A + G G DG
Sbjct: 43 GYVDGEGTQARFHEPAGLACDADGNLYVVDPGTNLIRKITPAASVSTFAGTPTGWGLVDG 102
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
P+ A+F V V ++ + D GNAA+R I+
Sbjct: 103 PAASARFGLPQGVA-VGADGTVYIADTGNAAIRIIT 137
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 68/142 (47%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYS-RGRLVAGSFQGYTGHVDGKPNEARFN 135
P + V+ DG +F D I K+TP L + G LV +Q VDG+ + A F
Sbjct: 370 PRGLTVAADGTVFVADSNAYVIRKVTPGLLVTTWAGSLVRPIYQ----TVDGQGSNAGFG 425
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
+P G+ +D GNL+VAD IRKI G + G + G DG A+F N
Sbjct: 426 NPTGIAVDAAGNLFVAD-FKATIRKIDATGYVSTVAGAHGLDGSLDGEKTAARF-NAPHG 483
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
+ V +L V D N ++R+I
Sbjct: 484 LAVDQHGNLYVADTFNHSIRKI 505
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 55/209 (26%), Positives = 86/209 (41%), Gaps = 12/209 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + DG LF D + +I+P + VAG+ TG +DG A F
Sbjct: 536 PTALAFDRDGSLFVAD--GHRVRRISPEGVVTT----VAGTANA-TGSIDGVGAAATFGE 588
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
KG+ +D GN++VA+ IR+I G GG + G DG A+F+ + +
Sbjct: 589 IKGLAVDRAGNVFVAENTTHVIRRITPDGTVVTIGGLAGSIGTADGVGSAARFNEPWGLA 648
Query: 197 YVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGF 256
R L ++D GN +R+ + +I+ + GA T
Sbjct: 649 LDR-FGHLYIVDSGNNTIRKGVI----VGATAPAITKDPVAKAAGAGRQASFTLTATGLP 703
Query: 257 GPFFFSRTQQPSESEFNEETEIKELSNKE 285
P F + ++ ES+F +EI E S E
Sbjct: 704 APQFQWQRKRAGESDFVALSEIAEFSGVE 732
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 66/144 (45%), Gaps = 21/144 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V +G ++ D N I K+TP + +AGS G +G+ DG + ARFN
Sbjct: 220 PKGIAVDANGTVYVADTSNHIIRKVTPSGVVTT----LAGS-PGISGNSDGAGDAARFNA 274
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P + +DD G +YV D ++RKI GV T S + FS V
Sbjct: 275 PTDIAVDDAGTIYVVDQSG-SLRKITPEGVVT---------------SLASGFSYPRGVA 318
Query: 197 YVRPTCSLLVIDRGNAALRQISLN 220
+ R + V D G+ +++++ N
Sbjct: 319 FDRSSGVFFVADTGHHVIKRVTPN 342
Score = 45.4 bits (106), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 74/143 (51%), Gaps = 8/143 (5%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P + V DG ++ D N+ I ITP S R++AG G +G DG A FN
Sbjct: 111 LPQGVAVGADGTVYIADTGNAAIRIITPDGSV----RILAG---GRSGSQDGYGTGATFN 163
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+ V ++ G +YVAD+ N +R+I + VTT+AG AG DG DA+F N
Sbjct: 164 LPEAVAVNAAGVVYVADSGNNTVRRIEEGNVTTLAGLAGASAGAVDGAGSDARF-NGPKG 222
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ V ++ V D N +R+++
Sbjct: 223 IAVDANGTVYVADTSNHIIRKVT 245
>gi|325916326|ref|ZP_08178604.1| gluconolactonase [Xanthomonas vesicatoria ATCC 35937]
gi|325537497|gb|EGD09215.1| gluconolactonase [Xanthomonas vesicatoria ATCC 35937]
Length = 699
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 73/143 (51%), Gaps = 18/143 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + + G L+ D ++N +++ P + +AG +G+ DG A FN
Sbjct: 74 PYGLAIDTHGALYIADAGDNNRIRVMLPNGSV---QTLAGGREGFA---DGIGAAAAFNT 127
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D GNLY+ADT N AIRK G VTT+AG AGF DG + A+F+ V
Sbjct: 128 PSGIALDTAGNLYIADTGNHAIRKRTPQGVVTTLAG--DGTAGFGDGAASQARFNGPMGV 185
Query: 196 -------VYVRPTCS--LLVIDR 209
VYV T + + VI+R
Sbjct: 186 AVDAQGRVYVADTYNDRIRVIER 208
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 70/143 (48%), Gaps = 10/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G L+ D N I K TP +G + + G G DG ++ARFN
Sbjct: 128 PSGIALDTAGNLYIADTGNHAIRKRTP------QGVVTTLAGDGTAGFGDGAASQARFNG 181
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV +D +G +YVADT N IR I D V T+AG S + G DG E A+ D
Sbjct: 182 PMGVAVDAQGRVYVADTYNDRIRVIERDGQVRTLAG--SALPGMADGVGEQARLDTPTD- 238
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ V + V D N A+R+I+
Sbjct: 239 LKVDAHGVVWVADMRNDAIRRIT 261
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 7/98 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N I I + + R +AGS G DG +AR +
Sbjct: 182 PMGVAVDAQGRVYVADTYNDRIRVI----ERDGQVRTLAGS--ALPGMADGVGEQARLDT 235
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGK 173
P + +D G ++VAD N AIR+I D V T+ GG+
Sbjct: 236 PTDLKVDAHGVVWVADMRNDAIRRITPDGSVATLVGGE 273
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 60/118 (50%), Gaps = 8/118 (6%)
Query: 101 ITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL-NLAIR 159
+ PL ++ + VAG+ G G D + +A+F P G+ +D G LY+AD N IR
Sbjct: 40 VATPLGWAAQLQWVAGN--GVRGAQDARAADAQFAEPYGLAIDTHGALYIADAGDNNRIR 97
Query: 160 -KIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
+ + V T+AGG+ GF DG A F N + + +L + D GN A+R+
Sbjct: 98 VMLPNGSVQTLAGGRE---GFADGIGAAAAF-NTPSGIALDTAGNLYIADTGNHAIRK 151
>gi|409099442|ref|ZP_11219466.1| NHL repeat containing protein [Pedobacter agri PB92]
Length = 754
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 72/134 (53%), Gaps = 10/134 (7%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G+L+ D+ N+ I KI+ S G + + G TG +DG A FN+P GV +D
Sbjct: 178 GDLYLADQKNNKIKKIS------STGFVTTVAGSGATGSLDGNAASATFNNPAGVAVDRA 231
Query: 146 GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
GN+YVAD N IRKI AG V+T+AG + AG DG +A+ + + P +L
Sbjct: 232 GNIYVADAGNNKIRKITPAGVVSTLAGNGT--AGSTDGTGTNARLYYPTGIT-IDPLDNL 288
Query: 205 LVIDRGNAALRQIS 218
+ D N +R+IS
Sbjct: 289 YIADYRNNRIRKIS 302
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/148 (35%), Positives = 72/148 (48%), Gaps = 18/148 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N+ I KITP G + + G G DG AR +
Sbjct: 223 PAGVAVDRAGNIYVADAGNNKIRKITPA------GVVSTLAGNGTAGSTDGTGTNARLYY 276
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSN---- 191
P G+T+D NLY+AD N IRKI +G VTT+AG S AG +G ++ F+
Sbjct: 277 PTGITIDPLDNLYIADYRNNRIRKISPSGVVTTLAGNGS--AGNLNGVGTNSSFNGPIML 334
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISL 219
FDV +L + D N +R+ISL
Sbjct: 335 AFDV-----QGNLFITDFQNNLIRKISL 357
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 47/89 (52%), Gaps = 4/89 (4%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 189
F+ P G+ D G+LY+AD N I+KI G VTT+AG S G DG + A F
Sbjct: 163 SGNFSSPGGLNFDLAGDLYLADQKNNKIKKISSTGFVTTVAG--SGATGSLDGNAASATF 220
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+N V R ++ V D GN +R+I+
Sbjct: 221 NNPAGVAVDRAG-NIYVADAGNNKIRKIT 248
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 42/91 (46%), Gaps = 6/91 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + L+ D N+ I KI+P G + + G G+++G + FN
Sbjct: 277 PTGITIDPLDNLYIADYRNNRIRKISP------SGVVTTLAGNGSAGNLNGVGTNSSFNG 330
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVT 167
P + D +GNL++ D N IRKI G T
Sbjct: 331 PIMLAFDVQGNLFITDFQNNLIRKISLTGYT 361
>gi|124002516|ref|ZP_01687369.1| hypothetical protein M23134_05219 [Microscilla marina ATCC 23134]
gi|123992345|gb|EAY31713.1| hypothetical protein M23134_05219 [Microscilla marina ATCC 23134]
Length = 2385
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/148 (33%), Positives = 73/148 (49%), Gaps = 6/148 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ---GYTGHVDGKPNEAR 133
P + + G L+ D+ N I IT P R +AG+ +G+V G A+
Sbjct: 655 PSGLALDMAGNLYVADKNNHAIRMITNPSGGSPVVRTIAGNSSYPTAVSGNVTGALAVAK 714
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA---GFRDGPSEDAKFS 190
FN P GV +D GN+YVAD N I+KI + VTT+AG ++ A G DG ++ A+F
Sbjct: 715 FNEPSGVAVDAAGNIYVADKNNHRIKKIANGMVTTLAGPMNDAASIPGRTDGAADAARFF 774
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQIS 218
V L V D+ N +RQ++
Sbjct: 775 FPTSVALDITGAQLYVADKLNNIIRQVN 802
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 54/168 (32%), Positives = 76/168 (45%), Gaps = 16/168 (9%)
Query: 66 TVIEGNEIGVV-------PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV-AGS 117
T + GN G + P + V G ++ D+ N I KI + G + A S
Sbjct: 700 TAVSGNVTGALAVAKFNEPSGVAVDAAGNIYVADKNNHRIKKIANGMVTTLAGPMNDAAS 759
Query: 118 FQGYTGHVDGKPNEARFNHPKGVTMDDKG-NLYVADTLNLAIRKI--GDAGVTTIAGGKS 174
G T DG + ARF P V +D G LYVAD LN IR++ D T AG +
Sbjct: 760 IPGRT---DGAADAARFFFPTSVALDITGAQLYVADKLNNIIRQVNTADGHTLTYAGDVA 816
Query: 175 N-VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
N +AG +DG + AKF + + V + + D N +R+IS Q
Sbjct: 817 NGIAGHQDGNAASAKFRSPAGIT-VNAVGDVYIADTHNQVIRKISQGQ 863
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 19/180 (10%)
Query: 48 APQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGE-LFAVDEVNSNIVKITPPLS 106
AP A + G+ +T IE + G + +S DG+ L+ D+ N I K+
Sbjct: 574 APTA--KISTLAKGFKFDTTIEPEKDGSA--GMVISLDGKYLYVADQRNQVIKKVDLVTK 629
Query: 107 QYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI----- 161
S +VAGS G G D + A+FN+P G+ +D GNLYVAD N AIR I
Sbjct: 630 TVS---IVAGS--GVAGFKDDNGSLAQFNYPSGLALDMAGNLYVADKNNHAIRMITNPSG 684
Query: 162 GDAGVTTIAGGKS---NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G V TIAG S V+G G AKF N+ V V ++ V D+ N +++I+
Sbjct: 685 GSPVVRTIAGNSSYPTAVSGNVTGALAVAKF-NEPSGVAVDAAGNIYVADKNNHRIKKIA 743
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 38/71 (53%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
G GH DG A+F P G+T++ G++Y+ADT N IRKI V TIAG
Sbjct: 817 NGIAGHQDGNAASAKFRSPAGITVNAVGDVYIADTHNQVIRKISQGQVITIAGEVEVADD 876
Query: 179 FRDGPSEDAKF 189
D + DAKF
Sbjct: 877 DSDVLARDAKF 887
Score = 42.0 bits (97), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 57/119 (47%), Gaps = 18/119 (15%)
Query: 51 ADGNVLQFENGYLVETVIEGNEIGVV-------PYKIRVSEDGELFAVDEVNSNIVKITP 103
ADG+ L + V I G++ G P I V+ G+++ D N I KI
Sbjct: 804 ADGHTLTYAGD--VANGIAGHQDGNAASAKFRSPAGITVNAVGDVYIADTHNQVIRKI-- 859
Query: 104 PLSQYSRGRLV--AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
S+G+++ AG + D +A+F +P G+ D + NLY++D LN ++RK
Sbjct: 860 -----SQGQVITIAGEVEVADDDSDVLARDAKFRYPSGIFADLEQNLYISDKLNFSVRK 913
>gi|302821969|ref|XP_002992645.1| hypothetical protein SELMODRAFT_6806 [Selaginella moellendorffii]
gi|300139609|gb|EFJ06347.1| hypothetical protein SELMODRAFT_6806 [Selaginella moellendorffii]
Length = 79
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 46/70 (65%), Gaps = 1/70 (1%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGK 173
AGS G +G+VDG ++RFN P+ + + D G ++VADT NLAIRKI D VTTIAGG
Sbjct: 10 AGSHVGESGYVDGPAAKSRFNRPQSLAICDNGAVFVADTTNLAIRKISKDGEVTTIAGGS 69
Query: 174 SNVAGFRDGP 183
S G D P
Sbjct: 70 SRRPGIADSP 79
>gi|403382146|ref|ZP_10924203.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
JC66]
Length = 531
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 68/228 (29%), Positives = 107/228 (46%), Gaps = 24/228 (10%)
Query: 24 AAPAGPL-IKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEG-NEIGVV--PYK 79
AAP G L + S L S + A + LQ E+G +V+ + +G E+ P
Sbjct: 71 AAPDGTLYVSDTKSHLLRRLDHSGVSLLAGSSFLQ-EDGQVVDALGDGKGELSSFSEPAG 129
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ + +G LF D+ N + K+ + G + + QG GH DG E+ F P+
Sbjct: 130 LALDHNGNLFVADKGNHAVRKVD------AEGNVTTYAGQGVLGHKDGTAEESLFYAPED 183
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG----------GKSNVAG-FRDGPSEDA 187
V + G +YVADTLN IRKI G V+T+ G++ +AG ++DGP ++A
Sbjct: 184 VVVASDGTVYVADTLNHVIRKIDPEGKVSTLNALPQRYIEVFPGEAVLAGDYKDGPLQEA 243
Query: 188 KFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD 235
KF N+ + + +L + D GN +R + L D S+ D
Sbjct: 244 KF-NEPTGLAIDHLGNLYISDTGNRVIRYMDLANDRVSTVAGSVQLYD 290
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 110 RGRLVAGSFQGYT-----------GHVDGKPNE-ARFNHPKGVTMDDKGNLYVADTLNLA 157
R VAGS Q Y G DG E A F P+G+ + ++G L +AD+LN A
Sbjct: 278 RVSTVAGSVQLYDEANSSSLYASGGFSDGHATEEALFMAPRGIAITEEGGLVIADSLNHA 337
Query: 158 IRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
IR + + V T+AGG G +DG + + ++ D V V S+++ D N LR
Sbjct: 338 IRYLFEGRVITLAGGHEAEHGQQDGINGYNRLNHPQD-VQVAADGSIIIADAYNNQLRAF 396
Query: 218 SLNQ 221
L Q
Sbjct: 397 QLYQ 400
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 58/118 (49%), Gaps = 9/118 (7%)
Query: 106 SQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
S S + +AG+ G G DG EA F HP G+ G LYV+DT + +R++ +G
Sbjct: 37 SWMSSVQTIAGT--GSFGARDGDKAEASFRHPSGLAAAPDGTLYVSDTKSHLLRRLDHSG 94
Query: 166 VTTIAGGK------SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
V+ +AG V DG E + FS + + +L V D+GN A+R++
Sbjct: 95 VSLLAGSSFLQEDGQVVDALGDGKGELSSFSEPAGLA-LDHNGNLFVADKGNHAVRKV 151
>gi|29348037|ref|NP_811540.1| cell surface protein [Bacteroides thetaiotaomicron VPI-5482]
gi|29339939|gb|AAO77734.1| putative cell surface protein, have conserved domain [Bacteroides
thetaiotaomicron VPI-5482]
Length = 434
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 61/108 (56%), Gaps = 2/108 (1%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
+L AG+F G +G DG +A FN P+ +++D +GN+Y+AD+ N IR I G+ T
Sbjct: 328 KLFAGAF-GQSGWNDGIATDAEFNSPRQMSLDMEGNIYIADSGNHCIRMIDKNGIVTTPI 386
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G+ AG+ DG S D ND V V + + D GN +R+++L
Sbjct: 387 GQPGEAGYADG-SPDMALLNDPRGVAVNSEGDVYIADLGNRCIRKLTL 433
>gi|445496654|ref|ZP_21463509.1| NHL repeat containing protein [Janthinobacterium sp. HH01]
gi|444786649|gb|ELX08197.1| NHL repeat containing protein [Janthinobacterium sp. HH01]
Length = 741
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 70/143 (48%), Gaps = 10/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G L+ D N+ I K+TP G + + G G DG+ A+FN
Sbjct: 160 PSGLAIDAAGNLYVADTGNNAIRKVTP------EGVVSTLAGDGLPGDKDGRGAAAQFNG 213
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D G +YVADT N IR+I G VTTIAGG AG DG + A F +
Sbjct: 214 PVGIAVDAAGVVYVADTYNDRIRRIAPNGDVTTIAGGSR--AGKADGAAAQALFDTPTGL 271
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ L + D GN A+R++
Sbjct: 272 A-LSAAGDLYIADTGNHAIRKLG 293
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 74/147 (50%), Gaps = 10/147 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + + G L+ D ++N ++ ++ +AG +GY +G A FN
Sbjct: 106 PFGVVIDGAGNLYVADGGDNNSIR---KIALDGATTTLAGGVEGYA---EGAGKAAAFNT 159
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D GNLYVADT N AIRK+ G V+T+AG + G +DG A+F+ +
Sbjct: 160 PSGLAIDAAGNLYVADTGNNAIRKVTPEGVVSTLAG--DGLPGDKDGRGAAAQFNGPVGI 217
Query: 196 VYVRPTCSLLVIDRGNAALRQISLNQD 222
V + V D N +R+I+ N D
Sbjct: 218 A-VDAAGVVYVADTYNDRIRRIAPNGD 243
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 58/109 (53%), Gaps = 6/109 (5%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL-NLAIRKIG-DAGVTTI 169
R+ + + G G +G RF P GV +D GNLYVAD N +IRKI D TT+
Sbjct: 81 RVTSIAGDGLPGANNGHGRSTRFADPFGVVIDGAGNLYVADGGDNNSIRKIALDGATTTL 140
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
AGG V G+ +G + A F N + + +L V D GN A+R+++
Sbjct: 141 AGG---VEGYAEGAGKAAAF-NTPSGLAIDAAGNLYVADTGNNAIRKVT 185
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 47/95 (49%), Gaps = 7/95 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G ++ D N I +I P G + + G DG +A F+
Sbjct: 214 PVGIAVDAAGVVYVADTYNDRIRRIAP------NGDVTTIAGGSRAGKADGAAAQALFDT 267
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIA 170
P G+ + G+LY+ADT N AIRK+G D V+TIA
Sbjct: 268 PTGLALSAAGDLYIADTGNHAIRKLGKDGKVSTIA 302
>gi|301122083|ref|XP_002908768.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099530|gb|EEY57582.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1981
Score = 67.8 bits (164), Expect = 1e-08, Method: Composition-based stats.
Identities = 47/116 (40%), Positives = 64/116 (55%), Gaps = 10/116 (8%)
Query: 109 SRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--V 166
++G ++ + G G DG ++FNHP+GV +D G +YVADT N IR I V
Sbjct: 201 AQGHVITVAGDGSAGFQDGLAATSQFNHPRGVAVDSNGVVYVADTANHRIRIIDPTTKRV 260
Query: 167 TTIAGGKSNVAGFRDGPS-EDAKFSNDFDVVYVRP----TCSLLVIDRGNAALRQI 217
+T+AG V GF DGP+ A+FS DV VR T ++ V D GN +RQI
Sbjct: 261 STLAG--DGVEGFVDGPALTAARFSYPSDVA-VRESNSGTITVFVADTGNHRIRQI 313
Score = 50.4 bits (119), Expect = 0.002, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 41/77 (53%), Gaps = 9/77 (11%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIA 170
RL + + G DG P ARF+ P GV +D G ++VADT N IR+I G T T+A
Sbjct: 333 RLSQQPEKPHAGMADGDPLGARFDSPMGVAVDADGVVFVADTGNHLIRRIDLDGTTHTVA 392
Query: 171 GGKSNVAGFRDGPSEDA 187
GG PSEDA
Sbjct: 393 GGVV--------PSEDA 401
Score = 45.8 bits (107), Expect = 0.045, Method: Composition-based stats.
Identities = 26/90 (28%), Positives = 45/90 (50%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
R+V + GH+DG+ NEA FN P GVT G +Y A + + ++R++ A + +
Sbjct: 470 RVVTLAGSNVPGHIDGEGNEATFNAPAGVTFAADGRVYAASSTDCSVRQVTPASLVSRTV 529
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 201
S A PS A + D ++++ +
Sbjct: 530 TCSTRATEVLRPSGCASYEQPVDELFLQTS 559
>gi|86610239|ref|YP_479001.1| NHL repeat-containing protein [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86558781|gb|ABD03738.1| NHL repeat protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 649
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 71/140 (50%), Gaps = 10/140 (7%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I V +G + D I +++P G L + G G DG N+A+F+ P+G
Sbjct: 345 IAVDPEGAFYLADRAQHRIFRLSP------EGELEVWAGTGQAGRRDGAANQAQFDSPQG 398
Query: 140 VTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 198
+ D KG L+VAD+ N +R I V+T AG + +AG+RDG + A+F + +
Sbjct: 399 LLWDPKGGLWVADSGNHCLRYINLQRQVSTFAG--TCIAGYRDGGLDRAQFREPSGLA-L 455
Query: 199 RPTCSLLVIDRGNAALRQIS 218
SL V DR N +R+I+
Sbjct: 456 GSDGSLYVADRANRRIRRIT 475
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 71/142 (50%), Gaps = 13/142 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + DG L+ D N I +ITP G + + G G VDG +A+
Sbjct: 450 PSGLALGSDGSLYVADRANRRIRRITP------AGEVTTVAGTGQPGSVDGPAEQAQLLQ 503
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P + +D +GNL++AD L R++ G VTT++G + G+RDGP +A+F +
Sbjct: 504 PTALAVDKEGNLWIADGHRL--RRLSAGGRVTTLSGAEP---GYRDGPLAEARFQTLGGL 558
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
+ +L V DR N +R++
Sbjct: 559 AF-DAAGNLWVADRDNHRIRRL 579
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 74/143 (51%), Gaps = 13/143 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V ++G L+ D + + LS R ++G+ GY DG EARF
Sbjct: 504 PTALAVDKEGNLWIAD--GHRLRR----LSAGGRVTTLSGAEPGYR---DGPLAEARFQT 554
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
G+ D GNL+VAD N IR++ G V+T+AG N G++DGP+ A+F +
Sbjct: 555 LGGLAFDAAGNLWVADRDNHRIRRLQPGGQVSTLAG--QNQPGWQDGPTSVARFDQPAGL 612
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V P S++V+D G LR+IS
Sbjct: 613 A-VLPDGSVVVVDAGLPGLRRIS 634
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 41/68 (60%), Gaps = 3/68 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRD 181
G+ DG + A+F P G+ + G+LYVAD N IR+I AG VTT+AG + G D
Sbjct: 436 GYRDGGLDRAQFREPSGLALGSDGSLYVADRANRRIRRITPAGEVTTVAG--TGQPGSVD 493
Query: 182 GPSEDAKF 189
GP+E A+
Sbjct: 494 GPAEQAQL 501
>gi|290992324|ref|XP_002678784.1| predicted protein [Naegleria gruberi]
gi|284092398|gb|EFC46040.1| predicted protein [Naegleria gruberi]
Length = 2331
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 73/146 (50%), Gaps = 8/146 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG-KPNEARFN 135
PY I + DG ++ D NS I K L + VAG GY G+ D +P A FN
Sbjct: 479 PYGIHLHTDGSIYIADTYNSKIRK----LDSSKKISSVAG--NGYLGYSDNTQPLRASFN 532
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG-PSEDAKFSNDFD 194
V +++ G++Y++D N IRK+ +G T G++ ++GF D ++ A + F
Sbjct: 533 LAVSVLINNVGDMYISDYANNVIRKMSSSGAITTIAGQAKISGFADSLNAKTALLNGPFG 592
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLN 220
+ Y T +L D N +R+++ N
Sbjct: 593 LYYQESTQTLFFADSLNGRIRKLTKN 618
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 80/169 (47%), Gaps = 15/169 (8%)
Query: 57 QFENGYLVETVIEGNEIGVV-----PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRG 111
Q+E GY VI N + P + V ++GEL+ D NS + KIT LS
Sbjct: 399 QYEQGY-DPIVISDNVLATTTSVNHPRSLYV-KNGELYFADTYNSLVRKIT--LSTGVLS 454
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA-GVTTIA 170
+ + + G G A N+P G+ + G++Y+ADT N IRK+ + ++++A
Sbjct: 455 TIAGNGGERFYGD-GGLGTSASLNYPYGIHLHTDGSIYIADTYNSKIRKLDSSKKISSVA 513
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDV-VYVRPTCSLLVIDRGNAALRQIS 218
G + G+ D ++ + S + V V + + + D N +R++S
Sbjct: 514 G--NGYLGYSDN-TQPLRASFNLAVSVLINNVGDMYISDYANNVIRKMS 559
>gi|413923862|gb|AFW63794.1| hypothetical protein ZEAMMB73_706161, partial [Zea mays]
Length = 139
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 60/89 (67%), Gaps = 8/89 (8%)
Query: 20 FQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADG-NVLQFENGYLVETVIEGNEIGVVPY 78
F ++AA A ++K L W+ +S++K + G +++++E GY VETV +G+++G+ PY
Sbjct: 42 FLSNAASA--VVKRL-----WSLKSTTKTAASGGKSMVKYEGGYTVETVFDGSKLGIEPY 94
Query: 79 KIRVSEDGELFAVDEVNSNIVKITPPLSQ 107
+ V++ GEL +D +NSNI ++ PLS+
Sbjct: 95 SVEVTQSGELLVMDSMNSNIYRMALPLSR 123
>gi|380693545|ref|ZP_09858404.1| hypothetical protein BfaeM_06134 [Bacteroides faecis MAJ27]
Length = 454
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 57/98 (58%), Gaps = 1/98 (1%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD 181
+G+V+G R N P+ + D+ GN++V + N IRKI AG T+ G+ +GF D
Sbjct: 357 SGYVNGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKITPAGSATLYAGRPEQSGFGD 416
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G E+AKF N + V V P S+ V DR N +R++++
Sbjct: 417 GLPEEAKF-NQPECVTVYPDNSIYVADRDNHVIRRVTV 453
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 46/85 (54%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P ++ EDG +F + N I KITP S L AG + +G DG P EA+FN
Sbjct: 372 PEQMAFDEDGNMFVPERNNHIIRKITPAGS----ATLYAGRPE-QSGFGDGLPEEAKFNQ 426
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P+ VT+ ++YVAD N IR++
Sbjct: 427 PECVTVYPDNSIYVADRDNHVIRRV 451
>gi|427400231|ref|ZP_18891469.1| hypothetical protein HMPREF9710_01065 [Massilia timonae CCUG 45783]
gi|425720971|gb|EKU83886.1| hypothetical protein HMPREF9710_01065 [Massilia timonae CCUG 45783]
Length = 367
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 44/77 (57%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G +DG ARF P+G+ +D NLYVADT N IRKI AGV T G + +GF DG
Sbjct: 63 GTMDGAGTAARFKWPQGIAIDADDNLYVADTGNFVIRKITPAGVVTTVAGAAGTSGFVDG 122
Query: 183 PSEDAKFSNDFDVVYVR 199
+ +A+F N V R
Sbjct: 123 DAGNARFGNPVAVAVNR 139
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 50/95 (52%), Gaps = 8/95 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + D L+ D N I KITP + VAG+ G +G VDG ARF +
Sbjct: 77 PQGIAIDADDNLYVADTGNFVIRKITPAGVVTT----VAGA-AGTSGFVDGDAGNARFGN 131
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIA 170
P V ++ +G +YVAD NL IR I AG V+T A
Sbjct: 132 PVAVAVNRRGTIYVAD--NLRIRSITSAGRVSTTA 164
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 30/60 (50%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G +G ARF +T+D +GN+Y AD +N +R+I GV T G + R G
Sbjct: 274 GSNNGVGTAARFEQVVALTVDPRGNVYAADAINNLVRRITPEGVVTTVAGSTRSTTLRTG 333
>gi|298386893|ref|ZP_06996448.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
gi|298260567|gb|EFI03436.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
Length = 454
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 57/98 (58%), Gaps = 1/98 (1%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD 181
+G+V+G R N P+ + D+ GN++V + N IRKI AG T+ G+ +GF D
Sbjct: 357 SGYVNGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKITPAGSATLYAGRPEQSGFGD 416
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G E+AKF N + V V P S+ V DR N +R++++
Sbjct: 417 GLPEEAKF-NQPECVTVYPDNSIYVADRDNHVIRRVTV 453
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 46/85 (54%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P ++ EDG +F + N I KITP S L AG + +G DG P EA+FN
Sbjct: 372 PEQMAFDEDGNMFVPERNNHIIRKITPAGS----ATLYAGRPE-QSGFGDGLPEEAKFNQ 426
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P+ VT+ ++YVAD N IR++
Sbjct: 427 PECVTVYPDNSIYVADRDNHVIRRV 451
>gi|116749588|ref|YP_846275.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
gi|116698652|gb|ABK17840.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
Length = 1750
Score = 67.0 bits (162), Expect = 2e-08, Method: Composition-based stats.
Identities = 47/142 (33%), Positives = 71/142 (50%), Gaps = 5/142 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N I K+ Q L G GY+G G +A N
Sbjct: 40 PSGVAVDTSGNVYLSDTNNHRIRKLDVAAGQIQT--LAGGQSPGYSGD-GGTAAKAGLNR 96
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+G+ +D GN+Y AD+ N IRKI +G +TT+AG S + GP+ A+ + F +
Sbjct: 97 PRGIAVDAAGNVYFADSNNHCIRKIDTSGIITTVAGTGSAGSNGDGGPAASARLAYPFGI 156
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
V P+ ++ V D GN +R+I
Sbjct: 157 A-VDPSGNIYVADLGNHKVRRI 177
Score = 50.8 bits (120), Expect = 0.002, Method: Composition-based stats.
Identities = 32/98 (32%), Positives = 46/98 (46%), Gaps = 1/98 (1%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF 179
GY+G DG EAR N+P + +D ++Y+ADT N IRK+ G T G
Sbjct: 472 GYSGD-DGPAAEARLNNPSAIAVDGSESIYIADTNNHRIRKVDGGGTITTVAGNGTPGYS 530
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
DG S A N + V V ++ + D N +R +
Sbjct: 531 GDGASATAASLNFPNGVAVDADGNVFIADTSNHRVRMV 568
Score = 49.7 bits (117), Expect = 0.004, Method: Composition-based stats.
Identities = 37/142 (26%), Positives = 61/142 (42%), Gaps = 7/142 (4%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFNH 136
Y + + + +L+ D N I K+T + G GS + DG P AR
Sbjct: 266 YGVALGPNNDLYIADTYNQRIRKVTDGVINTVAGTGYGGSLE------DGIPATGARLKS 319
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P + +D + N+Y+ADT + IR++ AG GK DG A +
Sbjct: 320 PVALAVDSQNNIYIADTYSHRIRRVDAAGNIVTVAGKGVPGNAGDGGQAVAAILKSPHGL 379
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+ P SL + DR + +R+++
Sbjct: 380 ALGPDNSLYIADRTDHRVRKVT 401
Score = 48.1 bits (113), Expect = 0.010, Method: Composition-based stats.
Identities = 44/146 (30%), Positives = 68/146 (46%), Gaps = 10/146 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI-TPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARF 134
P I V G ++ D N I KI T + G AGS + DG P AR
Sbjct: 97 PRGIAVDAAGNVYFADSNNHCIRKIDTSGIITTVAGTGSAGS------NGDGGPAASARL 150
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKFSNDF 193
+P G+ +D GN+YVAD N +R+I AG + G ++ D GP+ +A ++
Sbjct: 151 AYPFGIAVDPSGNIYVADLGNHKVRRIDAAGNISTVAGTGLLSRLGDGGPATEAGLTSPT 210
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISL 219
V V + +L + D G +R++ +
Sbjct: 211 GVA-VDGSGNLFISDSGRHVIRRVDV 235
Score = 45.1 bits (105), Expect = 0.086, Method: Composition-based stats.
Identities = 38/141 (26%), Positives = 60/141 (42%), Gaps = 6/141 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V DG +F D N + + + G GY+G G A
Sbjct: 544 PNGVAVDADGNVFIADTSNHRVRMVDSGVITTVAG----NGTPGYSGD-GGAAVSASLKA 598
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P GV +D G LY+AD N +RK+ + T+AG + G + A F + +V
Sbjct: 599 PHGVWVDATGALYIADAHNYRVRKVAGGNIVTVAGTGTPGYSGDGGLAAAADFRSVHGLV 658
Query: 197 YVRPTCSLLVIDRGNAALRQI 217
V + +L V D N+ +R++
Sbjct: 659 -VDGSGNLFVADMENSRVRKV 678
Score = 44.7 bits (104), Expect = 0.12, Method: Composition-based stats.
Identities = 33/101 (32%), Positives = 52/101 (51%), Gaps = 6/101 (5%)
Query: 120 GYTGHVDG-KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG-GKSNVA 177
GY+G DG A N P GV +D GN+++ADT N +R + +TT+AG G +
Sbjct: 528 GYSG--DGASATAASLNFPNGVAVDADGNVFIADTSNHRVRMVDSGVITTVAGNGTPGYS 585
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G DG + + V+V T +L + D N +R+++
Sbjct: 586 G--DGGAAVSASLKAPHGVWVDATGALYIADAHNYRVRKVA 624
Score = 43.1 bits (100), Expect = 0.35, Method: Composition-based stats.
Identities = 37/145 (25%), Positives = 63/145 (43%), Gaps = 10/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV-DGKP-NEARF 134
P+ I V G ++ D N + +I + G + + G + DG P EA
Sbjct: 153 PFGIAVDPSGNIYVADLGNHKVRRID------AAGNISTVAGTGLLSRLGDGGPATEAGL 206
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P GV +D GNL+++D+ IR++ G + +AG GP+ A N +
Sbjct: 207 TSPTGVAVDGSGNLFISDSGRHVIRRVDVGGTIDRVAGDYEQRYHGDGGPALSAGLMNAY 266
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
V + P L + D N +R+++
Sbjct: 267 GVA-LGPNNDLYIADTYNQRIRKVT 290
Score = 40.8 bits (94), Expect = 1.7, Method: Composition-based stats.
Identities = 35/144 (24%), Positives = 69/144 (47%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + + D L+ D + + K+T + +AG+ + A +
Sbjct: 376 PHGLALGPDNSLYIADRTDHRVRKVTAAGVIST----LAGTGEEGLSADGAAAAFANLDG 431
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDF 193
P V + G++Y +D+ + +RKIG D ++T+AG VAG+ DGP+ +A+ +N
Sbjct: 432 PCAVAVGPSGSVYFSDSGSNRVRKIGLDGNLSTVAG--KGVAGYSGDDGPAAEARLNNP- 488
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
+ V + S+ + D N +R++
Sbjct: 489 SAIAVDGSESIYIADTNNHRIRKV 512
>gi|118161380|gb|ABK64062.1| conserved hypothetical protein [Janthinobacterium lividum]
Length = 779
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 62/122 (50%), Gaps = 20/122 (16%)
Query: 51 ADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSR 110
ADG++ Q GYL I G L+ D +N N+ KITP +
Sbjct: 634 ADGSLAQARFGYL--------------RAIAFDASGNLYLTDSLNHNVRKITPAGVVTT- 678
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTI 169
+AG+ G G DG+ + A FN P G+ +D GN+YVADT N +R+I +G VTT+
Sbjct: 679 ---LAGT-TGVAGDADGRGSAASFNGPHGIALDKAGNVYVADTENNLVRRISTSGEVTTV 734
Query: 170 AG 171
AG
Sbjct: 735 AG 736
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 74/145 (51%), Gaps = 12/145 (8%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ + + G LF DE N I KITP + L + Q DG ARFN P G
Sbjct: 364 LTIDKQGNLFVADEGNHTIRKITPAGVVST---LAGTASQLRPAVADGIGAAARFNLPYG 420
Query: 140 VTMDDKGNLYVADT-----LNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ +D GN+YV+D+ LN +RKI AG VTTI G + + G DG + +A+F+
Sbjct: 421 LAVDGAGNVYVSDSNPGLQLN-GVRKITPAGEVTTITGSNAGI-GVTDGLASEARFAGP- 477
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
+ + +L V D GN +R++S
Sbjct: 478 QAIALHSDGTLFVADTGNQLIRRVS 502
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 1/93 (1%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSE 185
DG +ARF + + + D GNLY+ D+LN +RKI AGV T G + VAG DG
Sbjct: 635 DGSLAQARFGYLRAIAFDASGNLYLTDSLNHNVRKITPAGVVTTLAGTTGVAGDADGRGS 694
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
A F+ + + ++ V D N +R+IS
Sbjct: 695 AASFNGPHGIALDKA-GNVYVADTENNLVRRIS 726
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 62/129 (48%), Gaps = 14/129 (10%)
Query: 76 VPYKIRVSEDGELFAVD-----EVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN 130
+PY + V G ++ D ++N + KITP + + GS G G DG +
Sbjct: 417 LPYGLAVDGAGNVYVSDSNPGLQLN-GVRKITPAGEVTT----ITGSNAGI-GVTDGLAS 470
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAG--GKSNVAGFRDGPSEDA 187
EARF P+ + + G L+VADT N IR++ GVT T+AG G + DG A
Sbjct: 471 EARFAGPQAIALHSDGTLFVADTGNQLIRRVSAVGVTSTLAGNPGHGSFLSSGDGTGAQA 530
Query: 188 KFSNDFDVV 196
F+ +V
Sbjct: 531 TFAQPAGIV 539
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 67/153 (43%), Gaps = 18/153 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV------DGKPN 130
P I + DG LF D N I +++ V + G GH DG
Sbjct: 477 PQAIALHSDGTLFVADTGNQLIRRVSAVG--------VTSTLAGNPGHGSFLSSGDGTGA 528
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 189
+A F P G+ + G Y+AD IR++G+ G VTT+AG N + DG A F
Sbjct: 529 QATFAQPAGIVVGSDGVAYIADAFRNTIRRVGNDGVVTTLAGSYENRSQPTDGQGAKAGF 588
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
S+ + + + +L V D A+R++ N +
Sbjct: 589 SST-NGIAIDGNATLYVADY--QAVRKVDANGN 618
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 66/131 (50%), Gaps = 16/131 (12%)
Query: 96 SNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADT-- 153
++I KITP + + +AG + Y G DG ARF G+ +D G +YVAD
Sbjct: 264 ASIRKITPDGTVTT----LAGISRTY-GTSDGDAKSARFGGISGIAVDTAGAIYVADAGS 318
Query: 154 -----LNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVI 207
+ IRK+ AG V+T+AGG + G DG A FS+ + + + +L V
Sbjct: 319 NINGIVGSRIRKLTPAGIVSTLAGGSAY--GSADGAGAVATFSS-LNGLTIDKQGNLFVA 375
Query: 208 DRGNAALRQIS 218
D GN +R+I+
Sbjct: 376 DEGNHTIRKIT 386
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 56/119 (47%), Gaps = 15/119 (12%)
Query: 80 IRVSEDGELFAVDE-------VNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEA 132
I V G ++ D V S I K+TP + L GS G DG A
Sbjct: 303 IAVDTAGAIYVADAGSNINGIVGSRIRKLTPAGIVST---LAGGSAYG---SADGAGAVA 356
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVA-GFRDGPSEDAKF 189
F+ G+T+D +GNL+VAD N IRKI AG V+T+AG S + DG A+F
Sbjct: 357 TFSSLNGLTIDKQGNLFVADEGNHTIRKITPAGVVSTLAGTASQLRPAVADGIGAAARF 415
Score = 42.7 bits (99), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 68/152 (44%), Gaps = 14/152 (9%)
Query: 63 LVETVIEGNEIGVVPYKIRVSED--GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG 120
L T G+ + + ++ D G L+A D+ I KI+P G L +
Sbjct: 178 LTLTAAAGDATAQLSQVVSLTRDAMGNLYATDQ-GCVIHKISPA------GLLTTLAGSS 230
Query: 121 YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGF 179
G DGK A F+ +T+D G++YV +IRKI D VTT+A G S G
Sbjct: 231 SCGAADGKGTAASFDSMLAITVDASGSVYVGG--GASIRKITPDGTVTTLA-GISRTYGT 287
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGN 211
DG ++ A+F + V ++ V D G+
Sbjct: 288 SDGDAKSARFGG-ISGIAVDTAGAIYVADAGS 318
>gi|153868931|ref|ZP_01998653.1| receptor protein kinase [Beggiatoa sp. PS]
gi|152074497|gb|EDN71345.1| receptor protein kinase [Beggiatoa sp. PS]
Length = 3115
Score = 66.6 bits (161), Expect = 3e-08, Method: Composition-based stats.
Identities = 66/197 (33%), Positives = 96/197 (48%), Gaps = 24/197 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKPNEAR 133
P + G + D +N I+K + ++G L VAG+ +G TG DG EA+
Sbjct: 2130 PKSMSFDASGNAYIADSLNHRILK------RDTQGNLTVVAGTGAKGSTGD-DGPAIEAK 2182
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSN 191
+P+G +D +GNLY+ADTLN IRK+ G +TT+AG GK+ G +G + AK N
Sbjct: 2183 LKNPQGTAIDHEGNLYIADTLNHRIRKVDSNGIITTVAGIGKAGNTG-DNGLATAAKLRN 2241
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCM 251
+V+ L + D GN +R++S Q + NSI I V G GY
Sbjct: 2242 PTAIVF-DNNGHLYIADSGNHRIRKVS-GQRTRKPSANSI----ITTVAGNGRSGY---- 2291
Query: 252 LQQGFGPFFFSRTQQPS 268
Q GP +R P+
Sbjct: 2292 -QGDNGPATGARLSNPT 2307
Score = 58.9 bits (141), Expect = 6e-06, Method: Composition-based stats.
Identities = 47/146 (32%), Positives = 67/146 (45%), Gaps = 6/146 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI----TPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNE 131
P I +G L+ D N I K+ T S S VAG+ + GY G +G
Sbjct: 2242 PTAIVFDNNGHLYIADSGNHRIRKVSGQRTRKPSANSIITTVAGNGRSGYQGD-NGPATG 2300
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
AR ++P G+ +D + NLY+ADT N IRK+ G T G N DG A N
Sbjct: 2301 ARLSNPTGLAVDSQNNLYIADTDNHRIRKVDLTGTITTVAGNGNKGYSGDGDPATAAQIN 2360
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
+ V T +L + D+ N +R++
Sbjct: 2361 TPTGLEVDSTGNLYIADKNNHRIRKV 2386
Score = 52.8 bits (125), Expect = 4e-04, Method: Composition-based stats.
Identities = 46/144 (31%), Positives = 68/144 (47%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKP-NEARF 134
P + V L+ D N I K+ + + VAG+ +GY+G DG P A+
Sbjct: 2306 PTGLAVDSQNNLYIADTDNHRIRKVDLTGTITT----VAGNGNKGYSG--DGDPATAAQI 2359
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG-PSEDAKFSNDF 193
N P G+ +D GNLY+AD N IRK+ G+ T G DG + A+ S
Sbjct: 2360 NTPTGLEVDSTGNLYIADKNNHRIRKVDTEGIITTFTGTGKPGTATDGIIASVAQISQPT 2419
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
DV + +L + D+GN +R+I
Sbjct: 2420 DVA-LDQYGNLYIADKGNDTIRKI 2442
Score = 43.5 bits (101), Expect = 0.24, Method: Composition-based stats.
Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
A N PK ++ D GN Y+AD+LN I K G +T +AG + + DGP+ +AK
Sbjct: 2125 ANLNSPKSMSFDASGNAYIADSLNHRILKRDTQGNLTVVAGTGAKGSTGDDGPAIEAKLK 2184
Query: 191 N 191
N
Sbjct: 2185 N 2185
Score = 40.4 bits (93), Expect = 2.2, Method: Composition-based stats.
Identities = 30/96 (31%), Positives = 40/96 (41%), Gaps = 22/96 (22%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEA---- 132
P + V G L+ D+ N I K+ G +TG GKP A
Sbjct: 2362 PTGLEVDSTGNLYIADKNNHRIRKVD-----------TEGIITTFTG--TGKPGTATDGI 2408
Query: 133 -----RFNHPKGVTMDDKGNLYVADTLNLAIRKIGD 163
+ + P V +D GNLY+AD N IRKIG+
Sbjct: 2409 IASVAQISQPTDVALDQYGNLYIADKGNDTIRKIGE 2444
>gi|344924044|ref|ZP_08777505.1| NHL repeat containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 383
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 67/135 (49%), Gaps = 16/135 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I + G L+ D N I KIT + L+AGS G TG+ +G A+FN
Sbjct: 60 PYAITIDSSGTLYVADNGNYRIRKITSSGTT----SLLAGS--GTTGYAEGTGASAQFNT 113
Query: 137 PK-GVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS----- 190
+ G+ D+ GN+YV+DT N +RKI G T++ G + +G+++G A+ S
Sbjct: 114 FQWGIAADNSGNVYVSDTTNNRVRKITSGGTTSLLAGST--SGYQEGTGAGARLSSPRGL 171
Query: 191 --NDFDVVYVRPTCS 203
N VYV S
Sbjct: 172 AVNSAGTVYVATATS 186
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 68/141 (48%), Gaps = 9/141 (6%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHP 137
+ I G ++ D N+ + KIT + L+AGS GY +G AR + P
Sbjct: 116 WGIAADNSGNVYVSDTTNNRVRKITSGGTT----SLLAGSTSGYQ---EGTGAGARLSSP 168
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 197
+G+ ++ G +YVA + IR I G T++ G S G+ +G A+FS V
Sbjct: 169 RGLAVNSAGTVYVATATSERIRAITSGGTTSLLAG-SGATGYVEGTGSAAQFSTPTSVA- 226
Query: 198 VRPTCSLLVIDRGNAALRQIS 218
V + ++ VID N +R+I+
Sbjct: 227 VDSSGTVYVIDANNYRIRKIT 247
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 82/172 (47%), Gaps = 19/172 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V+ G ++ + I IT + L+AGS G TG+V+G + A+F+
Sbjct: 168 PRGLAVNSAGTVYVATATSERIRAITSGGTT----SLLAGS--GATGYVEGTGSAAQFST 221
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P V +D G +YV D N IRKI G T++ G + G+ +G A+F N F+++
Sbjct: 222 PTSVAVDSSGTVYVIDANNYRIRKITSGGTTSLFAGSTQ--GYAEGTGSAARF-NFFNLI 278
Query: 197 YVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYV 248
P S + +D NA ++ D Y+ +I+P + + G+V
Sbjct: 279 ---P--SGITVD--NAGTVYVA---DTFNYRIRTITPGGVTSTLAGTTQGFV 320
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 53/97 (54%), Gaps = 3/97 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G+ +G + ARF +P +T+D G LYVAD N IRKI +G T++ G S G+ +G
Sbjct: 46 GYQEGTGSAARFENPYAITIDSSGTLYVADNGNYRIRKITSSGTTSLLAG-SGTTGYAEG 104
Query: 183 PSEDAKFSNDFDV-VYVRPTCSLLVIDRGNAALRQIS 218
A+F N F + + ++ V D N +R+I+
Sbjct: 105 TGASAQF-NTFQWGIAADNSGNVYVSDTTNNRVRKIT 140
>gi|88601128|gb|ABD46562.1| NHL repeat-containing protein [Vermamoeba vermiformis]
Length = 309
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 74/145 (51%), Gaps = 8/145 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V DG ++ D N+ + K+TP + +AG G G DG +ARFN
Sbjct: 60 PSGLDVDTDGNVYVADYYNNTMRKVTPEGIVTT----IAGHV-GQWGSTDGSGEKARFNG 114
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P GV +D +GN+ ++D N +RKI D V+TIAG AG DG + A+F
Sbjct: 115 PSGVRIDTEGNIIISDNNNNTVRKISNIDGNVSTIAGSAGKSAGSEDGNGQQARFFGPSG 174
Query: 195 VVYVRPTCSLLVIDRGNAALRQISL 219
+ V P ++ V DR N +R IS+
Sbjct: 175 IA-VAPDGTIFVCDRYNHTIRSISI 198
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 68/145 (46%), Gaps = 14/145 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V+ DG +F D N I I S +AG G DGK ARFN
Sbjct: 172 PSGIAVAPDGTIFVCDRYNHTIRSI----SIVGEVTTIAGRVM-QPGSADGKLTAARFNQ 226
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIA---GGKSNVAGFRDGPSEDAKFSND 192
P G+++D GNL+V+D N IRKI G VTTIA G + V GF D + F N
Sbjct: 227 PSGISVDKIGNLFVSDYYNHTIRKISPLGEVTTIAGMFGHQGAVEGFGDHIRLNHPFRNT 286
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
D +L + D N+ +R++
Sbjct: 287 ID-----SAGNLYICDEYNSIVRKL 306
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 59/117 (50%), Gaps = 11/117 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV---DGKPNEAR 133
P I + + ++ + N+ I KITP G +V + G+ GH DG ++AR
Sbjct: 5 PRGIAIDKHDNIYICEWNNNTIRKITPD------GVVV--TLAGHPGHWGSRDGVGSKAR 56
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
FN P G+ +D GN+YVAD N +RK+ G+ T G G DG E A+F+
Sbjct: 57 FNGPSGLDVDTDGNVYVADYYNNTMRKVTPEGIVTTIAGHVGQWGSTDGSGEKARFN 113
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 62/142 (43%), Gaps = 4/142 (2%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P +R+ +G + D N+ + KI+ S +AGS G DG +ARF
Sbjct: 115 PSGVRIDTEGNIIISDNNNNTVRKISNIDGNVS---TIAGSAGKSAGSEDGNGQQARFFG 171
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ + G ++V D N IR I G T G+ G DG A+F N +
Sbjct: 172 PSGIAVAPDGTIFVCDRYNHTIRSISIVGEVTTIAGRVMQPGSADGKLTAARF-NQPSGI 230
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V +L V D N +R+IS
Sbjct: 231 SVDKIGNLFVSDYYNHTIRKIS 252
>gi|223993873|ref|XP_002286620.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977935|gb|EED96261.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1937
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/129 (37%), Positives = 68/129 (52%), Gaps = 26/129 (20%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG 172
+AGS G G VDG ++ARF HP+GV +D G +YVADT N AIR I +G V+T+AG
Sbjct: 230 MAGSTSGEEGFVDGVESDARFRHPEGVAVDHDGYVYVADTGNHAIRMISPSGRVSTLAG- 288
Query: 173 KSNVAGFRDG-PSEDAKFSNDFDVVYVRP----------------------TCSLLVIDR 209
+ V G++DG + A+ S+ D+ R T +LLV D
Sbjct: 289 -NGVPGYKDGLANSGAQLSSPTDIAVWRDWAWWPYENPVDPDSFIYKNGNGTLALLVADT 347
Query: 210 GNAALRQIS 218
N +R+I+
Sbjct: 348 DNHRVRKIT 356
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 33/55 (60%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA 177
G+ DG+ +EARF+ P+G+T+ G ++VADT N +R+I G G VA
Sbjct: 392 GYADGEKDEARFDSPQGITVASDGRVFVADTNNHLVREISRLGTVVTVAGSLTVA 446
>gi|290975813|ref|XP_002670636.1| predicted protein [Naegleria gruberi]
gi|284084197|gb|EFC37892.1| predicted protein [Naegleria gruberi]
Length = 2442
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 77/146 (52%), Gaps = 14/146 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRG--RLVAGSFQGYTGHVDGKPNEARF 134
P I V+ +G ++ D +N I IT S G LVAGS+ G G +G A
Sbjct: 833 PTDIHVTNNGTIYLTDSLNHCIRMIT------SDGVISLVAGSYYGGFGGDNGPAANAVL 886
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSN 191
N P+ + ++ KG +YV+DT+N +RKI G +TT+AG S +AGF G + +AK S
Sbjct: 887 NSPESLFVNSKGEIYVSDTVNHRVRKIDLNGTITTVAG--SGLAGFYGDGGLATNAKLSY 944
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
+YV + + D GN +R++
Sbjct: 945 PMG-IYVSKNGDIYIADYGNHRIRKV 969
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 77/160 (48%), Gaps = 22/160 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKPNEAR 133
P + V+ GE++ D VN + KI G + VAGS G+ G G A+
Sbjct: 889 PESLFVNSKGEIYVSDTVNHRVRKIDL------NGTITTVAGSGLAGFYGD-GGLATNAK 941
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFS- 190
++P G+ + G++Y+AD N IRK+ G +TTIAG G N +G ++A +
Sbjct: 942 LSYPMGIYVSKNGDIYIADYGNHRIRKVFSNGTITTIAGTGSIN----YNGDIQEATLTT 997
Query: 191 -NDFDVVYVRPT-CSLLVIDRGNAALRQISLNQDDCEYQY 228
N VY+ P + + D GN +R+I L DC Y
Sbjct: 998 LNSPRGVYMSPNEDEIYIADSGNNLIRKIKL---DCPSDY 1034
Score = 42.4 bits (98), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 67/137 (48%), Gaps = 11/137 (8%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAG--SFQGYTGHVDGKPNEARFNHPKGVTM 142
+GE + D N ++K L R +AG G++G G A G+++
Sbjct: 513 NGEFYWNDSGNCRVLK----LGTDGIVRRIAGITGICGFSGD-GGLATNALMGVSWGLSV 567
Query: 143 DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRPT 201
G++YVA+++N IRKI + +TT+AG ++VAG+ D + + N V+ T
Sbjct: 568 SPSGDIYVAESINSRIRKISNGIITTVAG--TSVAGYNGDDIAATSASLNGPPAVFATST 625
Query: 202 CSLLVIDRGNAALRQIS 218
L D GN +R++S
Sbjct: 626 AYYLA-DAGNRRVRKVS 641
Score = 42.0 bits (97), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 77/175 (44%), Gaps = 28/175 (16%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS D ++ D N +T + G V G Y+G G A F
Sbjct: 675 PRALWVSNDDSVYYADVSNGKTRVVTNGIVSTIAGTGVLG----YSGD-GGLATNANFRW 729
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG-GKSNVAGFRDG-PSEDAKFSNDFD 194
G+ M +LY+AD+ N IRK+ + VTT+AG G +N G DG P+ A N +
Sbjct: 730 IYGLWMVGS-DLYIADSTNNVIRKLSNNIVTTVAGTGNANFNG--DGIPALSANI-NTAN 785
Query: 195 VVYVRPTCSLLVI---DRGNAALRQISLN----------QDDCEYQYNSISPTDI 236
V + P S + I DR +R+++ N D + Y I+PTDI
Sbjct: 786 QVIIHPQTSEMFILSSDR----IRKVNTNGTISTVVGGIGDGAQAIYALINPTDI 836
>gi|325104746|ref|YP_004274400.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
gi|324973594|gb|ADY52578.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
Length = 1769
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 74/143 (51%), Gaps = 11/143 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G ++ D++N +I KITP G + + G G+ DG ARFN
Sbjct: 1002 PTGIAIDPWGNIYIADDLNHSIRKITP------NGTVSTFAGNGTAGYADGVGVLARFNR 1055
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D GN+YV+++ N IRKI G V+T AG + AG+ DGP A F N
Sbjct: 1056 PTGIITDALGNVYVSESSNY-IRKITPNGTVSTFAG--NGTAGYADGPGTSAMF-NSPQA 1111
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ + + ++ V + N +R+I+
Sbjct: 1112 MVMDASDNIYVSESSNHRIRKIT 1134
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 60/115 (52%), Gaps = 9/115 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G ++ + S I KITP +G + + G G+ +G A FN
Sbjct: 948 PQGMAIDASGNIYIAERGASRIRKITP------QGLVSTIAGSGTNGNSNGVGTAASFNK 1001
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
P G+ +D GN+Y+AD LN +IRKI G V+T AG + AG+ DG A+F+
Sbjct: 1002 PTGIAIDPWGNIYIADDLNHSIRKITPNGTVSTFAG--NGTAGYADGVGVLARFN 1054
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 68/129 (52%), Gaps = 10/129 (7%)
Query: 91 VDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYV 150
V E ++ I KITP G + + G G+ DG A FN P+ + MD N+YV
Sbjct: 1069 VSESSNYIRKITP------NGTVSTFAGNGTAGYADGPGTSAMFNSPQAMVMDASDNIYV 1122
Query: 151 ADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDR 209
+++ N IRKI AG V+T+AG V G RDG E+A+F + + + ++ V +
Sbjct: 1123 SESSNHRIRKITPAGEVSTVAG--KGVQGNRDGTKEEAQFWGPVGIA-LDMSGNIYVAEW 1179
Query: 210 GNAALRQIS 218
N +R+I+
Sbjct: 1180 SNHRIRKIT 1188
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 76/146 (52%), Gaps = 13/146 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT-GHVDGKPNEARFN 135
P + +G ++ + NS I KIT + G++ +F G T G+ DG A+FN
Sbjct: 895 PAGVVADAEGNIYVAEYGNSLIRKIT------ADGQV--STFAGSTFGYADGIGTAAKFN 946
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P+G+ +D GN+Y+A+ IRKI G V+TIAG +N G +G A F N
Sbjct: 947 GPQGMAIDASGNIYIAERGASRIRKITPQGLVSTIAGSGTN--GNSNGVGTAASF-NKPT 1003
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLN 220
+ + P ++ + D N ++R+I+ N
Sbjct: 1004 GIAIDPWGNIYIADDLNHSIRKITPN 1029
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 62/106 (58%), Gaps = 8/106 (7%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGG 172
+AGS +G V+GK EA+F+ P GV D +GN+YVA+ N IRKI D V+T AG
Sbjct: 875 LAGSTKGV---VNGKGIEAQFDTPAGVVADAEGNIYVAEYGNSLIRKITADGQVSTFAG- 930
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ G+ DG AKF N + + + ++ + +RG + +R+I+
Sbjct: 931 --STFGYADGIGTAAKF-NGPQGMAIDASGNIYIAERGASRIRKIT 973
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 62/239 (25%), Positives = 94/239 (39%), Gaps = 42/239 (17%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + ++ + N I KITP G + + +G G+ DG EA+F
Sbjct: 1109 PQAMVMDASDNIYVSESSNHRIRKITPA------GEVSTVAGKGVQGNRDGTKEEAQFWG 1162
Query: 137 PKGVTMDDKGNLYVADTLNLAIRK------------IGDAGVTTIAGGKSNVAGFRDGPS 184
P G+ +D GN+YVA+ N IRK I D G+ +A SN G
Sbjct: 1163 PVGIALDMSGNIYVAEWSNHRIRKITQGITLTGTPGITDVGINPVALKASNTVG------ 1216
Query: 185 EDAKFSNDFDVVYVRPTC--SLLVIDR--GNAA--LRQISLNQDDCEYQYNSISPTDILM 238
S FD+ P + I++ G+A L Q N D + Y S +P
Sbjct: 1217 ---STSQQFDITVSGPAILKNFANINKTYGDAVFTLSQPGSNNTDGTFTYVSNNPAV--- 1270
Query: 239 VVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEE 297
A + G +L+ G +Q + S F E T L+ + P+ I KE+
Sbjct: 1271 ---ASVNGNKVTILKSGIVKI---TAKQAAHSYFTETTIESTLTVAKAPLTITVDNKEK 1323
>gi|261404592|ref|YP_003240833.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
Y412MC10]
gi|261281055|gb|ACX63026.1| copper amine oxidase domain protein [Paenibacillus sp. Y412MC10]
Length = 1280
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 88/191 (46%), Gaps = 31/191 (16%)
Query: 45 SSKAPQADGNVL----QFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVK 100
S++A Q V+ ++NG L E NE P + + G L+ D N I
Sbjct: 234 STRAVQVRAGVIALAGDYKNGSLAEAQF--NE----PAGLALDGKGNLYVSDSGNHAIRY 287
Query: 101 ITPPLSQYSRGRL--VAGSF---QGYT--------GHVDGKPNEARFNHPKGVTMDDKGN 147
I + +G + AGS GY G++DG + ARFN P+G+ +
Sbjct: 288 I-----DFGKGTVSTAAGSVPSSDGYAKDALYADPGYLDGAADAARFNSPRGLAWSVEDG 342
Query: 148 LYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVI 207
L +AD+ N A+R++ D V+T+AGG G+ DG + +F+ D+ + L +
Sbjct: 343 LLIADSHNHAVRQLKDERVSTMAGGTR---GYADGIESEVRFNAPADIAVASDSGELFIA 399
Query: 208 DRGNAALRQIS 218
D+ N A+R S
Sbjct: 400 DQRNGAVRAWS 410
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 81/171 (47%), Gaps = 19/171 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G+++ D N I KI Q R +AG+ G G DGK +A F+
Sbjct: 143 PSGIAVDGKGQVYIADSGNHAIRKI----DQAGRVTTIAGN--GRIGLKDGKAQDALFHE 196
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIA----------GGKSNVAG-FRDGPS 184
P+ V + + G LYVADTLN IR+I D VTTI G +AG +++G
Sbjct: 197 PQDVAVTEDGILYVADTLNHVIRRISPDGEVTTIGSPSTRAVQVRAGVIALAGDYKNGSL 256
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD 235
+A+F N+ + + +L V D GN A+R I + S+ +D
Sbjct: 257 AEAQF-NEPAGLALDGKGNLYVSDSGNHAIRYIDFGKGTVSTAAGSVPSSD 306
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 51/88 (57%), Gaps = 4/88 (4%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
A N P G+ +D KG +Y+AD+ N AIRKI AG VTTIAG + G +DG ++DA F
Sbjct: 138 AFLNRPSGIAVDGKGQVYIADSGNHAIRKIDQAGRVTTIAG--NGRIGLKDGKAQDALFH 195
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQIS 218
DV V L V D N +R+IS
Sbjct: 196 EPQDVA-VTEDGILYVADTLNHVIRRIS 222
>gi|196230675|ref|ZP_03129536.1| Peptidase S53 propeptide [Chthoniobacter flavus Ellin428]
gi|196225016|gb|EDY19525.1| Peptidase S53 propeptide [Chthoniobacter flavus Ellin428]
Length = 1120
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 52/96 (54%), Gaps = 1/96 (1%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G DG A+F +P GV +D GN+YVAD N IRKI G T G+ V G +G
Sbjct: 676 GTADGTGTAAQFGYPSGVAVDSSGNIYVADFNNDTIRKITPGGTVTTPYGQPGVIGATNG 735
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+A F N + V + ++ V D GN+A+R+I+
Sbjct: 736 TGTNATF-NTPNGVAIDSANNIYVADSGNSAIRKIT 770
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 56/103 (54%), Gaps = 11/103 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA--GSFQGYTGHVDGKPNEARF 134
P + + ++ D NS I KITP GR+V+ G +G DG +A F
Sbjct: 745 PNGVAIDSANNIYVADSGNSAIRKITP-------GRVVSTLAGQPGLSGSADGT-TKALF 796
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNV 176
N+P+GV +D GN+YVADT N IRKI AG V+T+AG V
Sbjct: 797 NNPQGVAVDSAGNVYVADTTNETIRKITPAGVVSTLAGTAGTV 839
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 68/144 (47%), Gaps = 11/144 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ--GYTGHVDGKPNEARF 134
P + V G ++ D N I KITP G V + G G +G A F
Sbjct: 690 PSGVAVDSSGNIYVADFNNDTIRKITP-------GGTVTTPYGQPGVIGATNGTGTNATF 742
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
N P GV +D N+YVAD+ N AIRKI V + G+ ++G DG ++ A F+N
Sbjct: 743 NTPNGVAIDSANNIYVADSGNSAIRKITPGRVVSTLAGQPGLSGSADGTTK-ALFNNPQG 801
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
V V ++ V D N +R+I+
Sbjct: 802 VA-VDSAGNVYVADTTNETIRKIT 824
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 50/122 (40%), Gaps = 11/122 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEAR--- 133
P + V G ++ D N I KITP V + G G V
Sbjct: 799 PQGVAVDSAGNVYVADTTNETIRKITP--------AGVVSTLAGTAGTVGYADGAGAAAI 850
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
FN P V +D GN+YVAD N +RKI GV T G++ + G DG A F+
Sbjct: 851 FNGPSSVAVDSAGNVYVADLYNFVVRKITPGGVVTTPYGQAGMPGRLDGIGTAALFNAPI 910
Query: 194 DV 195
V
Sbjct: 911 GV 912
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 62/146 (42%), Gaps = 30/146 (20%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF--QGYTGHVDGKPNEARF 134
P + V G ++ D N + KITP G +V + G G +DG A F
Sbjct: 854 PSSVAVDSAGNVYVADLYNFVVRKITP-------GGVVTTPYGQAGMPGRLDGIGTAALF 906
Query: 135 NHPKGVTMDDKGNLYVAD-----------TLNLAIRKIGDAG-VTTIAGGKSNVAGFRDG 182
N P GV +D NLY+ D T N +R++ AG V+TIAG S G DG
Sbjct: 907 NAPIGVAVDANNNLYITDSQIPPDLTSTSTGNNLVRRVNAAGVVSTIAGAGST--GSADG 964
Query: 183 PSEDAKFS-------NDFDVVYVRPT 201
A+F N VVY+ T
Sbjct: 965 TGNVAQFYSLQAAAINSAKVVYLADT 990
>gi|428185691|gb|EKX54543.1| hypothetical protein GUITHDRAFT_63359, partial [Guillardia theta
CCMP2712]
Length = 305
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 65/116 (56%), Gaps = 6/116 (5%)
Query: 107 QYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTM--DDKGNLYVADTLNLAIRKIGDA 164
Y R+VA + G GH DG A+F+ P + + DK ++ VAD+ N A+R+I +
Sbjct: 83 SYDGERVVAFAGSGKRGHRDGPVQVAQFDTPCSICVCPSDK-SIIVADSGNNAVRRIANG 141
Query: 165 GVTTIAGGKS--NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
VTT+AGG GF DG SE AKF V++ + +LLVID GN LR +S
Sbjct: 142 MVTTLAGGSGPDRAGGFVDGESEGAKFRRPTFVMFDKEE-TLLVIDSGNHCLRVMS 196
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 43/87 (49%), Gaps = 1/87 (1%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
A + HPKG+ G L V D+ + IR + G +A S G RDGP + A+F
Sbjct: 54 ASWLHPKGICPLPSG-LLVCDSGHHRIRSVSYDGERVVAFAGSGKRGHRDGPVQVAQFDT 112
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ S++V D GN A+R+I+
Sbjct: 113 PCSICVCPSDKSIIVADSGNNAVRRIA 139
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 31/131 (23%), Positives = 61/131 (46%), Gaps = 8/131 (6%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
L +D N + ++P + R +AG + G DG + NHP+ + + G+
Sbjct: 182 LLVIDSGNHCLRVMSPDWKEV---RTLAGGPK--AGGTDGAVDTCELNHPEASCLLEDGS 236
Query: 148 LYVADTLNLAIRKIGDAGVTTIAGGKSN-VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 206
+ +AD N IR++ D + +++ N G DG E++ F+ V + ++++
Sbjct: 237 ILIADRENNKIRRL-DGDLRSLSSWAGNGCWGATDGLIEESTFNKPCGVCCLEDG-TIVI 294
Query: 207 IDRGNAALRQI 217
D GN +R +
Sbjct: 295 SDSGNNCIRLV 305
>gi|373955343|ref|ZP_09615303.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
gi|373891943|gb|EHQ27840.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
Length = 929
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 55/141 (39%), Positives = 74/141 (52%), Gaps = 15/141 (10%)
Query: 52 DGNVL-QFENGYLVETVIEG-NEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYS 109
D N++ + +G LV T G N+ P + V +G ++ D ++I KITP S
Sbjct: 179 DNNLIRKISSGGLVTTFASGFNQ----PNGVTVDLNGNVYVADAATNSIKKITPTGSV-- 232
Query: 110 RGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTT 168
+VAG G G +G + A F +P VT+D GNLYV+D N IRKI AG VTT
Sbjct: 233 --TVVAG--NGSMGSNNGIGSAASFYYPFSVTVDGAGNLYVSDNGNNLIRKIDLAGAVTT 288
Query: 169 IAGGKSNVAGFRDGPSEDAKF 189
+AG S +A F DG A F
Sbjct: 289 LAG--SGMAAFADGTGTAASF 307
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 62/132 (46%), Gaps = 8/132 (6%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G L+ D VN+ + K+TP G + + G ++G A N P G T+D
Sbjct: 319 GNLYIADGVNNRVRKVTP------LGVVTTVAGNGTRATINGNGTSASLNTPTGATIDIA 372
Query: 146 GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 205
G +YVA+ IRK+ +G TI G SNVAG +G A F D V + +
Sbjct: 373 GIVYVAELDGNCIRKVDPSGNVTILAG-SNVAGSANGIGTAASFRRPND-VQADQSGFIY 430
Query: 206 VIDRGNAALRQI 217
V D GN +R+I
Sbjct: 431 VTDYGNNVIRKI 442
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 46/96 (47%), Gaps = 7/96 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + V G L+ D N+ I KI G + + G DG A F
Sbjct: 256 PFSVTVDGAGNLYVSDNGNNLIRKIDL------AGAVTTLAGSGMAAFADGTGTAASFYG 309
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
P G T+D GNLY+AD +N +RK+ G VTT+AG
Sbjct: 310 PCGGTLDAMGNLYIADGVNNRVRKVTPLGVVTTVAG 345
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD 163
P + Y + AG+ G +G +G + F P V D GNLYVAD N IRKI
Sbjct: 131 PATIYGQTSTYAGT--GNSGSTNGSALTSTFYSPTRVAADLSGNLYVADRDNNLIRKISS 188
Query: 164 AG-VTTIAGG 172
G VTT A G
Sbjct: 189 GGLVTTFASG 198
>gi|383124185|ref|ZP_09944852.1| hypothetical protein BSIG_3780 [Bacteroides sp. 1_1_6]
gi|251839311|gb|EES67395.1| hypothetical protein BSIG_3780 [Bacteroides sp. 1_1_6]
Length = 454
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 57/98 (58%), Gaps = 1/98 (1%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD 181
+G+++G R N P+ + D+ GN++V + N IRKI AG ++ G+ +GF D
Sbjct: 357 SGYINGTGATVRLNKPEQMAFDEDGNMFVPERNNHIIRKITPAGSASLYAGQPEQSGFGD 416
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G E+AKF N + V V P S+ V DR N +R++++
Sbjct: 417 GLPEEAKF-NQPECVTVYPDNSVYVADRDNHVIRRVTV 453
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 46/85 (54%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P ++ EDG +F + N I KITP S L AG + +G DG P EA+FN
Sbjct: 372 PEQMAFDEDGNMFVPERNNHIIRKITPAGS----ASLYAGQPE-QSGFGDGLPEEAKFNQ 426
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P+ VT+ ++YVAD N IR++
Sbjct: 427 PECVTVYPDNSVYVADRDNHVIRRV 451
>gi|428172619|gb|EKX41527.1| hypothetical protein GUITHDRAFT_153825 [Guillardia theta CCMP2712]
Length = 285
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/107 (40%), Positives = 58/107 (54%), Gaps = 7/107 (6%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG-NLYVADTLNLAIRKIG-----DAGVTT 168
AGS G DG EA FNHP G++ D G N+++ADT N AIR I + VT
Sbjct: 101 AGSNLSKPGLRDGSIKEALFNHPGGISADQSGENIFIADTYNHAIRHITRDEDRNWTVTL 160
Query: 169 IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALR 215
IAG +S +GF DG E A+F+ + V +L+ D N+A+R
Sbjct: 161 IAGSRSGQSGFEDGEGESARFNCPTGLAVVNEQ-EILISDFSNSAIR 206
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 74/154 (48%), Gaps = 13/154 (8%)
Query: 77 PYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P I + GE +F D N I IT + L+AGS G +G DG+ ARFN
Sbjct: 123 PGGISADQSGENIFIADTYNHAIRHITRDEDRNWTVTLIAGSRSGQSGFEDGEGESARFN 182
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI----GDAGVTTIAGGKSN--------VAGFRDGP 183
P G+ + ++ + ++D N AIR + GD VTT+ G S +G+ DGP
Sbjct: 183 CPTGLAVVNEQEILISDFSNSAIRLLRRVRGDWLVTTVVGKTSKDSEGRMLPESGYVDGP 242
Query: 184 SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+ A+ + V + + S+L+ D N +R++
Sbjct: 243 VDQARLNRPHGVAWDEASRSVLIADCMNHRIRRV 276
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 69/153 (45%), Gaps = 18/153 (11%)
Query: 87 ELFAVDEVNSNIVKITP--------PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPK 138
ELF D N + +++ + +V + G GHVDG E +F +P
Sbjct: 2 ELFVADAYNQAVRRVSSIRINVEAKSMEDLVHADVVTIAGHGAKGHVDGPVGECKFCYPH 61
Query: 139 GVT---MDDKGNLYVADTLNLAIRKI------GDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
V MD +L VAD N IRK+ G+ V T AG + G RDG ++A F
Sbjct: 62 DVLFQDMDGAEHLLVADFDNNCIRKVFITNGVGEH-VVTWAGSNLSKPGLRDGSIKEALF 120
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
++ + + ++ + D N A+R I+ ++D
Sbjct: 121 NHPGGISADQSGENIFIADTYNHAIRHITRDED 153
>gi|291612976|ref|YP_003523133.1| NHL repeat-containing protein [Sideroxydans lithotrophicus ES-1]
gi|291583088|gb|ADE10746.1| NHL repeat-containing protein [Sideroxydans lithotrophicus ES-1]
Length = 385
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 74/147 (50%), Gaps = 17/147 (11%)
Query: 82 VSEDGE--LFAVDEVNSNIVKITPPLSQYSRG--RLVAGS----FQGYTGHVDGKPNEAR 133
++ DG L+ D +N+++ +I S G LVAG G DG AR
Sbjct: 247 IATDGASFLYLTDTLNNDVRRI-----DVSTGATTLVAGGNSTLASSGVGSSDGIGANAR 301
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDA-GVTTIAGGKSNVAGFRDGPSEDAKFSND 192
FN P G+T D NLYVADT N IRKI A G T G + V G DG A+F++
Sbjct: 302 FNQPMGITTDGS-NLYVADTNNHTIRKIVIATGNVTTPAGAALVPGTADGAGPTARFNHP 360
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISL 219
F ++Y+ +L V D N ++R++ L
Sbjct: 361 FGIIYIN--GALYVADYTNGSIRKVQL 385
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 49/111 (44%), Gaps = 9/111 (8%)
Query: 117 SFQGYTGH---VDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAG 171
+F G +G DG ARF P + D NLYV D IRKI VTT AG
Sbjct: 46 TFAGTSGSHAITDGIGATARFQIPAYIASDGT-NLYVTDMYANNIRKIVIATGQVTTFAG 104
Query: 172 GKSNVAGFRD---GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+ +G D GP A F+N + +V L V D N +RQI +
Sbjct: 105 SITGASGVTDAPSGPGTSALFNNPYGIVIDATNTKLYVADGYNHTIRQIVI 155
Score = 42.0 bits (97), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 67/163 (41%), Gaps = 38/163 (23%)
Query: 78 YKIRVSEDG-ELFAVDEVNSNIVKI--------------TPPLSQYSRGRLVAGSFQGYT 122
Y + +S G L+ D NS I +I TPP + Y F G T
Sbjct: 185 YPMGLSRIGTNLYVADSSNSTIRQIDLASASAAVTTLAGTPPSTGY---------FNGLT 235
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTT-IAGGKSNVA--- 177
G + F P + D LY+ DTLN +R+I G TT +AGG S +A
Sbjct: 236 GA------SSVFYSPTSIATDGASFLYLTDTLNNDVRRIDVSTGATTLVAGGNSTLASSG 289
Query: 178 -GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G DG +A+F+ + +L V D N +R+I +
Sbjct: 290 VGSSDGIGANARFNQPMGI--TTDGSNLYVADTNNHTIRKIVI 330
>gi|329927682|ref|ZP_08281814.1| copper amine oxidase N-terminal domain protein [Paenibacillus sp.
HGF5]
gi|328938318|gb|EGG34710.1| copper amine oxidase N-terminal domain protein [Paenibacillus sp.
HGF5]
Length = 1251
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 81/171 (47%), Gaps = 19/171 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G+++ D N I KI Q R +AG+ G G DGK +A F+
Sbjct: 114 PSGIAVDGKGQVYIADSGNHAIRKI----DQAGRVNTIAGN--GRIGLKDGKAQDALFHE 167
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIA----------GGKSNVAG-FRDGPS 184
P+ V + + G LYVADTLN IR+I D VTTI G +AG +++G
Sbjct: 168 PQDVAVTEDGILYVADTLNHVIRRISPDGEVTTIGSPSTRAVQVRAGVIALAGDYKNGSL 227
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD 235
+A+F N+ + + +L V D GN A+R I + S+ +D
Sbjct: 228 AEAQF-NEPAGLALDGKGNLYVSDSGNQAIRYIDFGKGTVSTAAGSVPSSD 277
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 87/191 (45%), Gaps = 31/191 (16%)
Query: 45 SSKAPQADGNVL----QFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVK 100
S++A Q V+ ++NG L E NE P + + G L+ D N I
Sbjct: 205 STRAVQVRAGVIALAGDYKNGSLAEAQF--NE----PAGLALDGKGNLYVSDSGNQAIRY 258
Query: 101 ITPPLSQYSRGRL--VAGSF---QGYT--------GHVDGKPNEARFNHPKGVTMDDKGN 147
I + +G + AGS GY G++DG + ARFN P+G+ +
Sbjct: 259 I-----DFGKGTVSTAAGSVPSSDGYAKDALYADPGYLDGAADAARFNSPRGLAWSVEDG 313
Query: 148 LYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVI 207
L +AD+ N A+R++ D VTT+ GG G+ DG + +F+ D+ + L +
Sbjct: 314 LLIADSHNHAVRQLKDERVTTMVGGTR---GYADGIESEVRFNAPADIGVASDSGELFIA 370
Query: 208 DRGNAALRQIS 218
D+ N A+R S
Sbjct: 371 DQRNGAVRAWS 381
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 50/88 (56%), Gaps = 4/88 (4%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
A N P G+ +D KG +Y+AD+ N AIRKI AG V TIAG + G +DG ++DA F
Sbjct: 109 AFLNRPSGIAVDGKGQVYIADSGNHAIRKIDQAGRVNTIAG--NGRIGLKDGKAQDALFH 166
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQIS 218
DV V L V D N +R+IS
Sbjct: 167 EPQDVA-VTEDGILYVADTLNHVIRRIS 193
>gi|423301816|ref|ZP_17279839.1| hypothetical protein HMPREF1057_02980 [Bacteroides finegoldii
CL09T03C10]
gi|408470907|gb|EKJ89439.1| hypothetical protein HMPREF1057_02980 [Bacteroides finegoldii
CL09T03C10]
Length = 469
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 54/109 (49%), Gaps = 2/109 (1%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIA 170
RL + +DG + NHP + MD + N+YVAD N IRKI DAG T
Sbjct: 361 RLAGSDYGNLPDLIDGDLGGSTLNHPCQICMDSQNNIYVADRYNCCIRKISLDAGTITTF 420
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
GK+ G++DG +A F + + V + V D N A+R+I+L
Sbjct: 421 AGKAQTEGYQDGLISEALFDDPMGIA-VDKNGVIYVADSDNYAIRRIAL 468
Score = 45.8 bits (107), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 43/85 (50%), Gaps = 4/85 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P +I + ++ D N I KI+ + AG Q G+ DG +EA F+
Sbjct: 386 PCQICMDSQNNIYVADRYNCCIRKISLDAGTITT---FAGKAQ-TEGYQDGLISEALFDD 441
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ +D G +YVAD+ N AIR+I
Sbjct: 442 PMGIAVDKNGVIYVADSDNYAIRRI 466
>gi|374311754|ref|YP_005058184.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
gi|358753764|gb|AEU37154.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
Length = 1456
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 76/148 (51%), Gaps = 12/148 (8%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKPNEA 132
+P+ I V G++F VD N+N+ KI S G + VAG+ +G++G G A
Sbjct: 87 IPFGILVDRAGDIFIVDPGNNNVRKI-----AASTGIITTVAGNGTRGFSGD-GGAATSA 140
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
N P+ V +D GN+Y+ DT+N +RK+ G +TT+AG S + GP+ +A
Sbjct: 141 ELNQPQAVALDAAGNMYITDTINSRVRKVEATTGIITTVAGNGSVTSSGDGGPATNAGLV 200
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ + ++ V D G +R+IS
Sbjct: 201 QPAGIA-LDSANNIFVTDSGEPGVREIS 227
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 60/115 (52%), Gaps = 18/115 (15%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIA----- 170
+G++G G A N+P+G+++D NLY+AD+ N +RK+ DA +TT+A
Sbjct: 570 EGFSGD-GGSATSAELNYPEGISLDSAANLYIADSGNYRVRKV-DAATGIITTVAGDGAK 627
Query: 171 -----GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
GG + AGFR D KF + D +Y+ + S+ +D + + I+ N
Sbjct: 628 TDSGDGGLAVNAGFR--AISDVKF-DSHDNMYIADSLSIRRVDVVSGTITTIAGN 679
Score = 43.1 bits (100), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 64/121 (52%), Gaps = 11/121 (9%)
Query: 104 PLSQYSRGRL--VAGS-FQGYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIR 159
PL+ + G + +AG+ GYTG DG P A P+GV +D+ G++Y AD + +R
Sbjct: 495 PLAALTPGIITTIAGNGTSGYTG--DGGPATSAGMYEPQGVALDNAGDVYFADFYHHVVR 552
Query: 160 KIGDAG--VTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
+I A +TT+AG + GF DG S + N + + + +L + D GN +R+
Sbjct: 553 RIDAASGIITTVAGNGTE--GFSGDGGSATSAELNYPEGISLDSAANLYIADSGNYRVRK 610
Query: 217 I 217
+
Sbjct: 611 V 611
Score = 42.0 bits (97), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 40/87 (45%), Gaps = 5/87 (5%)
Query: 77 PYKIRVSEDGELFAVDE--VNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
P + + G L+ D ++NI K+T S VAG G +AR
Sbjct: 308 PQAVALDTAGNLYISDGDYRSNNIRKVTAATGIIST---VAGDGTAEFSGDGGAAVDARL 364
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI 161
N P GV +D GNLY+AD N +R++
Sbjct: 365 NEPGGVALDTAGNLYIADVNNSVVREV 391
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 73/145 (50%), Gaps = 12/145 (8%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFNHPK 138
I + G +F VD + K+T S +VAG+ +G++G G A +P+
Sbjct: 254 IAIDGSGNVFIVDTDRYAVFKLTTGTGTVS---VVAGNGTRGFSGD-GGAATSAELAYPQ 309
Query: 139 GVTMDDKGNLYVADT--LNLAIRKIGDAG--VTTIAG-GKSNVAGFRDGPSEDAKFSNDF 193
V +D GNLY++D + IRK+ A ++T+AG G + +G G + DA+ N+
Sbjct: 310 AVALDTAGNLYISDGDYRSNNIRKVTAATGIISTVAGDGTAEFSG-DGGAAVDARL-NEP 367
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
V + +L + D N+ +R++S
Sbjct: 368 GGVALDTAGNLYIADVNNSVVREVS 392
>gi|325104251|ref|YP_004273905.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
gi|324973099|gb|ADY52083.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
Length = 532
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 72/143 (50%), Gaps = 12/143 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V +G++ D N +I K+T G + + G +G+ +GKP +FN
Sbjct: 49 PDGILVDMNGDIIITDRTNHSIRKMTTA------GVVSTLAGTGVSGYANGKP--GQFNT 100
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P T+D GN+ V + IRKI D V+TIAG S AG+ DG A+F N D
Sbjct: 101 PWQSTVDAAGNIIVVEKDGARIRKIALDGTVSTIAGTGS--AGYSDGAVSVARFDNALDA 158
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V V ++ + DR N +R+I+
Sbjct: 159 V-VDSDGNIFIADRNNRRVRKIT 180
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 45/85 (52%), Gaps = 1/85 (1%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLS-QYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
+ + DG + D + I KITP + ++ + + G TG +DG N A FN
Sbjct: 265 FGLNFDNDGNIIFADASFNRIRKITPGENGDWTTATVTTIAGTGSTGRIDGLGNVATFNQ 324
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P V D+ GN+YVAD +N +IRK+
Sbjct: 325 PYDVVTDENGNIYVADNVNHSIRKL 349
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 61/99 (61%), Gaps = 6/99 (6%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRD 181
G VDG +ARF +P G+ +D G++ + D N +IRK+ AG V+T+AG + V+G+ +
Sbjct: 35 GFVDGNGLDARFRNPDGILVDMNGDIIITDRTNHSIRKMTTAGVVSTLAG--TGVSGYAN 92
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
G + +F+ + V +++V+++ A +R+I+L+
Sbjct: 93 G--KPGQFNTPWQST-VDAAGNIIVVEKDGARIRKIALD 128
Score = 42.0 bits (97), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 70/162 (43%), Gaps = 23/162 (14%)
Query: 69 EGNEIGVVPYKIRVSEDG--ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD 126
+G G V + I ++ D LF D ++ T +S + G Q T D
Sbjct: 198 DGTTSGTVVWPISLAIDAADNLFVSDSRTIRMITKTGTIST------IVG-LQASTNFTD 250
Query: 127 G---KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-----GD---AGVTTIAGGKSN 175
G KP AR G+ D+ GN+ AD IRKI GD A VTTIAG S
Sbjct: 251 GESGKPLTARLGDVFGLNFDNDGNIIFADASFNRIRKITPGENGDWTTATVTTIAGTGS- 309
Query: 176 VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
G DG A F+ +DVV ++ V D N ++R++
Sbjct: 310 -TGRIDGLGNVATFNQPYDVV-TDENGNIYVADNVNHSIRKL 349
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 52/103 (50%), Gaps = 14/103 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P++ V G + V++ + I KI + + +AG+ G G+ DG + ARF++
Sbjct: 101 PWQSTVDAAGNIIVVEKDGARIRKIALDGTVST----IAGT--GSAGYSDGAVSVARFDN 154
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI--GDAG------VTTIAG 171
+D GN+++AD N +RKI G G V+T+AG
Sbjct: 155 ALDAVVDSDGNIFIADRNNRRVRKITPGAGGNWTTATVSTVAG 197
>gi|29347802|ref|NP_811305.1| NHL repeat-containing protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|383120108|ref|ZP_09940841.1| hypothetical protein BSIG_4606 [Bacteroides sp. 1_1_6]
gi|29339703|gb|AAO77499.1| NHL repeat-containing protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251838367|gb|EES66454.1| hypothetical protein BSIG_4606 [Bacteroides sp. 1_1_6]
Length = 440
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 59/100 (59%), Gaps = 5/100 (5%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA--GVTTIAGGKSNVAGF 179
G DG EA+F P+G+ +D +GNLY+AD N IRK+ VTTIAG S AG+
Sbjct: 343 AGFRDGDVQEAQFKEPRGIAIDKEGNLYIADVGNNRIRKVDTKLNVVTTIAG--SGAAGY 400
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+DG +A+F+ + VY+ L + D+ N +R++++
Sbjct: 401 KDGDPLEAQFNQPWG-VYLDKNEFLYIADQNNHCIRKLAI 439
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 62/128 (48%), Gaps = 19/128 (14%)
Query: 42 TRSSSKAPQADGNVLQ--------FENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDE 93
TR + P DG++ + F +G + E + P I + ++G L+ D
Sbjct: 321 TRDADGWPALDGDIDEYIPGAGAGFRDGDVQEAQFKE------PRGIAIDKEGNLYIADV 374
Query: 94 VNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADT 153
N+ I K+ L+ + +AGS G G+ DG P EA+FN P GV +D LY+AD
Sbjct: 375 GNNRIRKVDTKLNVVT---TIAGS--GAAGYKDGDPLEAQFNQPWGVYLDKNEFLYIADQ 429
Query: 154 LNLAIRKI 161
N IRK+
Sbjct: 430 NNHCIRKL 437
>gi|29347940|ref|NP_811443.1| hypothetical protein BT_2530 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383121619|ref|ZP_09942326.1| hypothetical protein BSIG_4948 [Bacteroides sp. 1_1_6]
gi|29339842|gb|AAO77637.1| conserved hypothetical protein, with a conserved domain
[Bacteroides thetaiotaomicron VPI-5482]
gi|251837946|gb|EES66035.1| hypothetical protein BSIG_4948 [Bacteroides sp. 1_1_6]
Length = 441
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 52/94 (55%), Gaps = 1/94 (1%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSE 185
DG +A FN+PK + D+ GN++VAD N IR I + T G+ VAG++DG
Sbjct: 348 DGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMISADNIVTTVAGQPGVAGYKDGGPV 407
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
++ F N + V V + + D GNA +R++ +
Sbjct: 408 ESLFKNPWGVA-VNEQGDIYIADWGNARIRKLVI 440
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I+ DG +F D N I I S + VAG G G+ DG P E+ F +
Sbjct: 359 PKDIKFDNDGNMFVADYGNHCIRMI----SADNIVTTVAGQ-PGVAGYKDGGPVESLFKN 413
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P GV ++++G++Y+AD N IRK+
Sbjct: 414 PWGVAVNEQGDIYIADWGNARIRKL 438
>gi|380692925|ref|ZP_09857784.1| hypothetical protein BfaeM_02978 [Bacteroides faecis MAJ27]
Length = 454
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 66/134 (49%), Gaps = 5/134 (3%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRG--RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
L+ + I K+ + ++ G L AG + G +G+ GK ARFN P +D +
Sbjct: 323 LYIIARKKHCIYKVAYNAATHTFGIPELFAGDY-GESGYASGKGTGARFNQPSTPCLDPE 381
Query: 146 GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 205
GNL + D +N IRKI G T+ G+ +G DG + AKF V + +L+
Sbjct: 382 GNLLIPDKMNHCIRKITPEGEVTLYAGQPQTSGHTDGLPDKAKFYEPEAVTF--SGNALI 439
Query: 206 VIDRGNAALRQISL 219
V DRGN +R + +
Sbjct: 440 VADRGNHCVRNVVI 453
>gi|332878080|ref|ZP_08445810.1| NHL repeat protein [Capnocytophaga sp. oral taxon 329 str. F0087]
gi|357047673|ref|ZP_09109271.1| NHL repeat protein [Paraprevotella clara YIT 11840]
gi|332684042|gb|EGJ56909.1| NHL repeat protein [Capnocytophaga sp. oral taxon 329 str. F0087]
gi|355529361|gb|EHG98795.1| NHL repeat protein [Paraprevotella clara YIT 11840]
Length = 461
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 63/108 (58%), Gaps = 2/108 (1%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
++ AG QG GH DG ++A+F+ P+ + +D++ NLY+AD+ N IRKI GV +
Sbjct: 355 KIYAG-MQGRPGHADGLASDAQFHSPRQLVLDEEENLYIADSENNCIRKITPEGVVSTVI 413
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G +G++DG + A F+ + + + + V D+ N +RQ+S+
Sbjct: 414 GIPGKSGYKDGTPDVALFTQPWGLA-IDSEGIIYVGDKDNLCVRQLSI 460
>gi|430809992|ref|ZP_19437107.1| NHL repeat containing protein [Cupriavidus sp. HMR-1]
gi|429497580|gb|EKZ96110.1| NHL repeat containing protein [Cupriavidus sp. HMR-1]
Length = 536
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/90 (43%), Positives = 46/90 (51%), Gaps = 6/90 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N I KI+P Y +AGS G G DG A FN
Sbjct: 239 PIDVAVDGTGNVYVADYGNHRIRKISP----YGAVTTLAGS--GVAGSTDGTGTAASFNL 292
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
P GV +D GN+YVADT N IRKI AGV
Sbjct: 293 PSGVAVDGAGNVYVADTYNHLIRKITSAGV 322
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 67/143 (46%), Gaps = 10/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G +F D N + +TP G + + G G DG A FN
Sbjct: 347 PSDIVVDVAGNVFVADTYNHLVRMVTP------TGVVTTLAGTGTAGFADGPAQAAAFNR 400
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D GN+YV+D N IRKI A VTT+AG S G DG A F++ +
Sbjct: 401 PAGIAVDAAGNIYVSDLGNARIRKISPANVVTTLAG--STTRGSADGTGAAASFTSLLRI 458
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
++ +D G+ A+R+++
Sbjct: 459 A-TDAAGNVYAVDAGSNAVRKVT 480
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 75/144 (52%), Gaps = 10/144 (6%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P + V G ++ D N I KIT S G + + G +G DG + A F
Sbjct: 292 LPSGVAVDGAGNVYVADTYNHLIRKIT------SAGVVSTLAGSGSSGSADGTGSAASFY 345
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
+P + +D GN++VADT N +R + G VTT+AG + AGF DGP++ A F+
Sbjct: 346 YPSDIVVDVAGNVFVADTYNHLVRMVTPTGVVTTLAG--TGTAGFADGPAQAAAFNRPAG 403
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ V ++ V D GNA +R+IS
Sbjct: 404 IA-VDAAGNIYVSDLGNARIRKIS 426
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 45/84 (53%), Gaps = 4/84 (4%)
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
HP V +D GN+YVAD N IRKI G VTT+AG S VAG DG A F N
Sbjct: 238 HPIDVAVDGTGNVYVADYGNHRIRKISPYGAVTTLAG--SGVAGSTDGTGTAASF-NLPS 294
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
V V ++ V D N +R+I+
Sbjct: 295 GVAVDGAGNVYVADTYNHLIRKIT 318
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 40/94 (42%), Gaps = 10/94 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT--GHVDGKPNEARF 134
P I V G ++ D N+ I KI+P V + G T G DG A F
Sbjct: 401 PAGIAVDAAGNIYVSDLGNARIRKISP--------ANVVTTLAGSTTRGSADGTGAAASF 452
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
+ D GN+Y D + A+RK+ AGV T
Sbjct: 453 TSLLRIATDAAGNVYAVDAGSNAVRKVTPAGVVT 486
>gi|390960114|ref|YP_006423871.1| gluconolactonase [Terriglobus roseus DSM 18391]
gi|390415032|gb|AFL90536.1| gluconolactonase [Terriglobus roseus DSM 18391]
Length = 1836
Score = 63.2 bits (152), Expect = 3e-07, Method: Composition-based stats.
Identities = 47/145 (32%), Positives = 74/145 (51%), Gaps = 7/145 (4%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITP-PLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
+P + ++ GELF D N+ + K+TP P + ++ + G+ Y G
Sbjct: 182 LPSGVAMNGAGELFVSDSGNNRVRKVTPAPATAPAKSDITKGTIVTYAG----GDVATSL 237
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFR-DGPSEDAKFSND 192
+ P G+ +D GNLYVAD+ N IR+I GV TT+AG +N + + DG A N
Sbjct: 238 SSPAGLAIDGAGNLYVADSGNNVIRRIDTNGVMTTVAGQVANSSPYAGDGLLATAATLNA 297
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
V + P +L + D GN A+R++
Sbjct: 298 PWSVALDPDGNLFIADTGNNAVRRV 322
Score = 43.9 bits (102), Expect = 0.19, Method: Composition-based stats.
Identities = 42/153 (27%), Positives = 66/153 (43%), Gaps = 18/153 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITP--PLSQY-----SRGRLVAGSFQGYTGHVDGKP 129
P + G L+ D ++ ++++ S Y RG GSF G G
Sbjct: 339 PRAVVFDPSGVLYVADSADNTVLRVNADGSFSTYVGISTPRG----GSFDGD----GGAA 390
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDA 187
A N P + D GNLY+AD+ N +R++ +TT+AG V S+ A
Sbjct: 391 TAAHLNGPAALAFDPAGNLYIADSANNRVRRVDALSKLITTVAGNGGTVMDDTVMNSDIA 450
Query: 188 KFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
FS + +++ +L V D + +RQIS N
Sbjct: 451 SFSGPY-ALFLDNAANLYVGDLFHNRVRQISSN 482
>gi|347754243|ref|YP_004861807.1| gluconolactonase [Candidatus Chloracidobacterium thermophilum B]
gi|347586761|gb|AEP11291.1| Gluconolactonase [Candidatus Chloracidobacterium thermophilum B]
Length = 1222
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/169 (34%), Positives = 75/169 (44%), Gaps = 21/169 (12%)
Query: 51 ADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSR 110
A G F++GY E G P + V DGEL+ D +N I K+TP
Sbjct: 677 AGGGQRGFQDGYGPEAQFNG------PLGLAVGRDGELYVADHLNMRIRKVTP------D 724
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTI 169
G + + G + DG A F PKGV +D G +YV D + +R I D V T+
Sbjct: 725 GYVSTLAGTGISKIEDGSVATASFEGPKGVAVDMHGVVYVTD--GVTVRTITPDGEVRTL 782
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDV-VYVRPTCSLLVIDRGNAALRQI 217
AG V GFRDG A F + + V V C V D N A+R I
Sbjct: 783 AG---QVRGFRDGIGTRAMFGWAYAIAVDVSGLC--FVTDAANHAVRCI 826
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 54/97 (55%), Gaps = 4/97 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK-IGDAGVTTIAGGKSNVAGFRD 181
G +DG AR NHP G+ +D +G LY+AD N AIRK + D + T+AGG GF+D
Sbjct: 629 GFLDGVGTSARLNHPVGLALDAQGVLYIADHFNHAIRKLLPDGRLVTLAGGGQR--GFQD 686
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G +A+F+ + R L V D N +R+++
Sbjct: 687 GYGPEAQFNGPLGLAVGR-DGELYVADHLNMRIRKVT 722
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 57/114 (50%), Gaps = 9/114 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G L+ D N I K+ P GRLV + G G DG EA+FN
Sbjct: 643 PVGLALDAQGVLYIADHFNHAIRKLLP------DGRLVTLAGGGQRGFQDGYGPEAQFNG 696
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 189
P G+ + G LYVAD LN+ IRK+ D V+T+AG + ++ DG A F
Sbjct: 697 PLGLAVGRDGELYVADHLNMRIRKVTPDGYVSTLAG--TGISKIEDGSVATASF 748
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 68/134 (50%), Gaps = 13/134 (9%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G L+ D N+++ ++T G + + G G DG +ARF P+ V +D
Sbjct: 318 GYLYVADLNNASVRRLT------LDGAVTTLAGDGVEGTRDGVGKQARFKAPRAVAVDAS 371
Query: 146 GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
G +YVAD + +R+I G V TIAGG+ G DGP+E A+F + R +L
Sbjct: 372 GTVYVAD--DARVRRISPGGMVVTIAGGEP---GCVDGPAEAARFDTLSGLALDR-VGNL 425
Query: 205 LVIDRGNAALRQIS 218
+ D GN LR++S
Sbjct: 426 YLADAGNRRLRKLS 439
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 52/99 (52%), Gaps = 5/99 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRD 181
G+ DG A+F P G+ D +GNLYVAD IR+I D V T+AG S G RD
Sbjct: 242 GYADGPLTVAKFLRPNGIACDPQGNLYVADFGGHRIRQISVDGLVRTLAG--SGQPGNRD 299
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
A+F+ + Y L V D NA++R+++L+
Sbjct: 300 DLGLLAEFNGPRGIAYA--AGYLYVADLNNASVRRLTLD 336
Score = 45.4 bits (106), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 52/110 (47%), Gaps = 17/110 (15%)
Query: 68 IEGNEIGV-------VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG 120
+EG GV P + V G ++ D+ + + +I+P G +V G
Sbjct: 347 VEGTRDGVGKQARFKAPRAVAVDASGTVYVADD--ARVRRISP-------GGMVVTIAGG 397
Query: 121 YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTI 169
G VDG ARF+ G+ +D GNLY+AD N +RK+ D V+T+
Sbjct: 398 EPGCVDGPAEAARFDTLSGLALDRVGNLYLADAGNRRLRKLSRDGQVSTL 447
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 49/103 (47%), Gaps = 6/103 (5%)
Query: 64 VETVIEGNEIGVV--PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY 121
V+TV G E + P + V G L+ D N I+++TP + Y+ LV G +G
Sbjct: 832 VKTVFGGGEARQLNFPNGLAVDVFGHLYVADTNNHRILRLTPNGNGYT-ASLVCGVRRGR 890
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA 164
G EA + P+G+ + NLYVAD+ I +G A
Sbjct: 891 Q---TGSALEAELDSPRGIVVGFHNNLYVADSNANRILWVGPA 930
>gi|114566902|ref|YP_754056.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
gi|114337837|gb|ABI68685.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
Length = 1030
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 61/120 (50%), Gaps = 5/120 (4%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQ-GYTGHVDGKPNEA 132
PY + +G ++ D N I K+ P +S + ++ GS++ GY+G G A
Sbjct: 564 CPYGVAFDSNGNMYIADIFNHRIRKVDPAGIISTVAGNGVLTGSYKSGYSGD-GGSATSA 622
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKFSN 191
+ N+P GV D GN+Y+AD+ N IRK+ G+ + A G D GP+ A+ +N
Sbjct: 623 QLNYPYGVAFDASGNMYIADSNNHCIRKVDTLGIISTAAGNGTYGYSGDGGPATSAQLNN 682
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 63/120 (52%), Gaps = 7/120 (5%)
Query: 116 GSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 175
G+ GY+G G A+ N+P G+T D GN+Y+AD+ N IRK+ +G+ + G
Sbjct: 727 GNDGGYSGD-GGLATSAQLNNPNGITFDSSGNMYIADSNNNCIRKVDHSGMISTFAGNGT 785
Query: 176 VAGFRD-GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPT 234
F D GP+ A+ N V + + +L + D + ++R++ L Q ++ISPT
Sbjct: 786 SGHFGDGGPATSAQLRNPVGVA-LDNSGNLFIADYFDHSIRKVVL----AAQQNSTISPT 840
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 61/118 (51%), Gaps = 11/118 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI-TPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARF 134
PY + G ++ D N I K+ T + + G G++ GY+G DG P A+
Sbjct: 627 PYGVAFDASGNMYIADSNNHCIRKVDTLGIISTAAGN---GTY-GYSG--DGGPATSAQL 680
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
N+P G++ D++GN+Y+ADT N IR + GV + G N +G R G D +S D
Sbjct: 681 NNPNGLSFDNRGNMYIADTYNHRIRMVDPNGVISTVAGNGN-SGDRYG--NDGGYSGD 735
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 68/150 (45%), Gaps = 16/150 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKPNEAR 133
P+ + +G L+ D N + KI S G + VAG+ GY+G G A+
Sbjct: 509 PHGLAFDGNGNLYIADASNRRVRKID------SAGIITTVAGNGTSGYSGD-GGSAIAAK 561
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSND 192
P GV D GN+Y+AD N IRK+ AG ++T+AG ++ G S D +
Sbjct: 562 ITCPYGVAFDSNGNMYIADIFNHRIRKVDPAGIISTVAGNGVLTGSYKSGYSGDGGSATS 621
Query: 193 FDVVY-----VRPTCSLLVIDRGNAALRQI 217
+ Y + ++ + D N +R++
Sbjct: 622 AQLNYPYGVAFDASGNMYIADSNNHCIRKV 651
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 32/70 (45%), Positives = 40/70 (57%), Gaps = 5/70 (7%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVA 177
GY+G G A N+P G+ D GNLY+AD N +RKI AG +TT+AG G S +
Sbjct: 493 GYSGD-GGLAASALLNYPHGLAFDGNGNLYIADASNRRVRKIDSAGIITTVAGNGTSGYS 551
Query: 178 GFRDGPSEDA 187
G DG S A
Sbjct: 552 G--DGGSAIA 559
>gi|298387816|ref|ZP_06997366.1| hypothetical protein HMPREF9007_04616 [Bacteroides sp. 1_1_14]
gi|298259421|gb|EFI02295.1| hypothetical protein HMPREF9007_04616 [Bacteroides sp. 1_1_14]
Length = 440
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 52/94 (55%), Gaps = 1/94 (1%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSE 185
DG +A FN+PK + D+ GN++VAD N IR + + T G+ VAG++DG
Sbjct: 347 DGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMVSADNIVTTVAGQPGVAGYKDGGPV 406
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
++ F N + V V + + D GNA +R++ +
Sbjct: 407 ESLFKNPWGVA-VNEQGDIYIADWGNARIRKLVI 439
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I+ DG +F D N I + S + VAG G G+ DG P E+ F +
Sbjct: 358 PKDIKFDNDGNMFVADYGNHCIRMV----SADNIVTTVAGQ-PGVAGYKDGGPVESLFKN 412
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P GV ++++G++Y+AD N IRK+
Sbjct: 413 PWGVAVNEQGDIYIADWGNARIRKL 437
>gi|380692527|ref|ZP_09857386.1| hypothetical protein BfaeM_00899 [Bacteroides faecis MAJ27]
Length = 441
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 52/94 (55%), Gaps = 1/94 (1%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSE 185
DG +A FN+PK + D+ GN++VAD N IR + + T G+ VAG++DG
Sbjct: 348 DGPIEDALFNYPKDIKFDNDGNMFVADYGNHCIRMVSADNIVTTVAGQPGVAGYKDGGPL 407
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
++ F N + V V + + D GNA +R++ +
Sbjct: 408 ESLFKNPWGVA-VNEQGDIYIADWGNARIRKLVI 440
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I+ DG +F D N I ++ + VAG G G+ DG P E+ F +
Sbjct: 359 PKDIKFDNDGNMFVADYGNHCIRMVSAD----NIVTTVAGQ-PGVAGYKDGGPLESLFKN 413
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P GV ++++G++Y+AD N IRK+
Sbjct: 414 PWGVAVNEQGDIYIADWGNARIRKL 438
>gi|398335921|ref|ZP_10520626.1| hypothetical protein LkmesMB_09964 [Leptospira kmetyi serovar
Malaysia str. Bejo-Iso9]
Length = 947
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 61/118 (51%), Gaps = 9/118 (7%)
Query: 82 VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVT 141
++ DG + N +IVK + YS VAG+ +G+ DG ARFN PKG+
Sbjct: 688 ITTDGIFLYITSQNHHIVKYNLMNNTYSTIAGVAGT----SGNADGVGTAARFNAPKGIA 743
Query: 142 MDDKGNLYVADTLNLAIR--KIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 197
D LYVADT N IR KI D VTTIAG S AG DG AKF+ +VY
Sbjct: 744 TDGT-YLYVADTGNHKIRKIKISDNTVTTIAG--SGTAGTLDGLGTAAKFNQPSHLVY 798
>gi|325965074|ref|YP_004242980.1| thiol-disulfide isomerase-like thioredoxin [Arthrobacter
phenanthrenivorans Sphe3]
gi|323471161|gb|ADX74846.1| thiol-disulfide isomerase-like thioredoxin [Arthrobacter
phenanthrenivorans Sphe3]
Length = 674
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 59/109 (54%), Gaps = 8/109 (7%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-----GDAGVT 167
+VAG+ G G +DG P+EA F P G+ D GN++VAD+ A+RK+ G V
Sbjct: 380 IVAGN--GLEGLLDGAPHEAWFAQPSGLAEDADGNIWVADSETSALRKLVIDDDGGVAVE 437
Query: 168 TIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
+ G GFRDGP+ +A+ + V V P S+ + D N A+R+
Sbjct: 438 SAIGKGLFDFGFRDGPAAEARLQHPLGVT-VLPDGSVAIADTYNGAVRR 485
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGG 172
G DG EAR HP GVT+ G++ +ADT N A+R+ A V+T+A G
Sbjct: 448 GFRDGPAAEARLQHPLGVTVLPDGSVAIADTYNGAVRRYDPASGTVSTLARG 499
>gi|444912221|ref|ZP_21232386.1| putative hemolysin [Cystobacter fuscus DSM 2262]
gi|444717129|gb|ELW57964.1| putative hemolysin [Cystobacter fuscus DSM 2262]
Length = 844
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 49/90 (54%), Gaps = 1/90 (1%)
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
+ R N P G+ D +GN+YV+D+ N IR+I G TT+ G G DG + A+F
Sbjct: 604 SHVRLNGPLGIAADAQGNVYVSDSDNYCIRRITPDGTTTVFAGAEMQPGGMDGSALQARF 663
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
N ++V P LLV D GN +R+I L
Sbjct: 664 -NQPAGLFVTPAQELLVADLGNGVIRRIDL 692
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 64/157 (40%), Gaps = 28/157 (17%)
Query: 25 APAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYL--------VETVIEGNEIG-- 74
AP P+ +L W R S+ A DG V E G + +V+ G G
Sbjct: 694 APGNPVSTLPANL--WMYRPSAVAQGPDGTVYVVETGMMRVLKLSNGTVSVLAGAPPGGF 751
Query: 75 ---------VVPY-KIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGH 124
++PY I V DG + D N + +I+P G + + G G
Sbjct: 752 ADASGEDARMLPYVGIAVMPDGSVAFSDPGNYRVRRISP------AGEVTTLAGSGRFGA 805
Query: 125 VDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
DG+ +A F P G+ + G LYVAD+ N +R I
Sbjct: 806 RDGRGADADFVVPGGLAVGTDGTLYVADSGNALLRAI 842
Score = 42.0 bits (97), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 48/175 (27%), Positives = 78/175 (44%), Gaps = 21/175 (12%)
Query: 55 VLQFENGYL--VETVIEGNEIGVVPYKIRV--------SEDGELFAVDEVNSNIVKITPP 104
V NG + ++ + GN + +P + + DG ++ V+ ++K++
Sbjct: 679 VADLGNGVIRRIDLLAPGNPVSTLPANLWMYRPSAVAQGPDGTVYVVETGMMRVLKLS-- 736
Query: 105 LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA 164
++AG+ G G D +AR G+ + G++ +D N +R+I A
Sbjct: 737 ---NGTVSVLAGAPPG--GFADASGEDARMLPYVGIAVMPDGSVAFSDPGNYRVRRISPA 791
Query: 165 G-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G VTT+AG S G RDG DA F + V +L V D GNA LR I+
Sbjct: 792 GEVTTLAG--SGRFGARDGRGADADFVVPGGLA-VGTDGTLYVADSGNALLRAIT 843
>gi|302821385|ref|XP_002992355.1| hypothetical protein SELMODRAFT_6804 [Selaginella moellendorffii]
gi|300139771|gb|EFJ06505.1| hypothetical protein SELMODRAFT_6804 [Selaginella moellendorffii]
Length = 78
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/70 (51%), Positives = 45/70 (64%), Gaps = 2/70 (2%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGK 173
AGS G +G+VDG ++RFN P+ + + D G ++V DT NLAIRKI G VTTIAGG
Sbjct: 10 AGSHVGESGYVDGPAAKSRFNRPQSLAICDNGAVFV-DTTNLAIRKISKNGEVTTIAGGS 68
Query: 174 SNVAGFRDGP 183
S G D P
Sbjct: 69 SRRPGIADSP 78
>gi|298387997|ref|ZP_06997545.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
gi|298259263|gb|EFI02139.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
Length = 439
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 8/101 (7%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA--GVTTIAG-GKSNVAG 178
G DG EA+FN P+G+ +D +GNLY+AD N IRK+ VTTIAG GK G
Sbjct: 343 AGFRDGDVQEAQFNEPRGIAIDKEGNLYIADVNNHRIRKVDTKLNIVTTIAGSGK----G 398
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
++DG +A+F + VY+ L + D+ N +R++++
Sbjct: 399 YKDGDPLEAQFDQPWG-VYLDKNEFLYIADQNNHCIRKLAI 438
>gi|429737840|ref|ZP_19271682.1| IPT/TIG domain protein [Prevotella saccharolytica F0055]
gi|429161715|gb|EKY04093.1| IPT/TIG domain protein [Prevotella saccharolytica F0055]
Length = 438
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 64/122 (52%), Gaps = 5/122 (4%)
Query: 99 VKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAI 158
+ +T P + Y R L + + G GH DG +EA+F +P + D+ GN+YVAD N I
Sbjct: 322 IDVTNPATTYKR--LSSSNISG--GHRDGPLSEAQFRNPSQIFCDNDGNIYVADRGNHCI 377
Query: 159 RKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
R+I + G G++DG EDA F N+ + + S+ V D N +R+++
Sbjct: 378 RRITPENMVETVLGMPGTKGWKDGKKEDALF-NEPTGIGIAQDGSVYVADFKNGRVRKLT 436
Query: 219 LN 220
+N
Sbjct: 437 IN 438
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 42/85 (49%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P +I DG ++ D N I +ITP + V G G G DGK +A FN
Sbjct: 356 PSQIFCDNDGNIYVADRGNHCIRRITPE----NMVETVLG-MPGTKGWKDGKKEDALFNE 410
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ + G++YVAD N +RK+
Sbjct: 411 PTGIGIAQDGSVYVADFKNGRVRKL 435
>gi|320105416|ref|YP_004181006.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319923937|gb|ADV81012.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 822
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 68/143 (47%), Gaps = 8/143 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G L+ D N I +++ G VAG F G G A +H
Sbjct: 120 PVGVAVDRVGNLYISDSHNQRIRRVSGGTIATIAGTGVAG-FSGD----GGAAVLATLSH 174
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D GNLYVADT N IRKI +TT+AG G+ AG DG A + + D
Sbjct: 175 PTGLAVDTGGNLYVADTDNHRIRKISGTTITTVAGSGEQGFAG--DGGPATAAWLDSPDG 232
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V V T +L + D N +R +S
Sbjct: 233 VAVDATGNLYIADTHNQRIRVVS 255
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 52/101 (51%), Gaps = 10/101 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARF- 134
P + V G L+ D N I KI+ + VAGS QG+ G DG P A +
Sbjct: 175 PTGLAVDTGGNLYVADTDNHRIRKISG-----TTITTVAGSGEQGFAG--DGGPATAAWL 227
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKS 174
+ P GV +D GNLY+ADT N IR + G ++TIAG S
Sbjct: 228 DSPDGVAVDATGNLYIADTHNQRIRVVSAEGTISTIAGNGS 268
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 71/145 (48%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P I G ++ D +N+++V+ +S VAG+ QG+ G G A +
Sbjct: 64 PAGIAYDASGNMYFAD-LNNHVVR---KVSAAGIITTVAGTGEQGFAGD-GGSATSAWLD 118
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDF 193
P GV +D GNLY++D+ N IR++ + TIAG + VAGF G + A S+
Sbjct: 119 SPVGVAVDRVGNLYISDSHNQRIRRVSGGTIATIAG--TGVAGFSGDGGAAVLATLSHPT 176
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
+ V +L V D N +R+IS
Sbjct: 177 GLA-VDTGGNLYVADTDNHRIRKIS 200
>gi|333383538|ref|ZP_08475196.1| hypothetical protein HMPREF9455_03362 [Dysgonomonas gadei ATCC
BAA-286]
gi|332827477|gb|EGK00223.1| hypothetical protein HMPREF9455_03362 [Dysgonomonas gadei ATCC
BAA-286]
Length = 434
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 57/107 (53%), Gaps = 2/107 (1%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGG 172
L AG+ G G DG+ ++A FN P+ +T+D +GNLY+AD N IR I G+ T G
Sbjct: 329 LFAGA-PGQPGWSDGRVSDAEFNLPRQMTLDLEGNLYIADEGNHCIRMIDKDGIVTTPIG 387
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+ G+ DG E A F V V + + D GN LR+++L
Sbjct: 388 QPGQEGYMDGSPEIALFKKPRGVA-VDKNGDVYIADYGNRCLRKLTL 433
>gi|380692433|ref|ZP_09857292.1| NHL repeat-containing protein [Bacteroides faecis MAJ27]
Length = 439
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 8/101 (7%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA--GVTTIAG-GKSNVAG 178
G DG EA+FN P+G+ +D +GNLY+AD N IRK+ VTTIAG GK G
Sbjct: 343 AGFRDGDVQEAQFNEPRGIAIDKEGNLYIADVNNHRIRKVDTKLNIVTTIAGSGK----G 398
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
++DG +A+F + VY+ L + D+ N +R++++
Sbjct: 399 YKDGDPLEAQFDQPWG-VYLDKNEFLYIADQNNHCIRKLAI 438
>gi|289667480|ref|ZP_06488555.1| putative NHL repeat protein [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 310
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 77/176 (43%), Gaps = 18/176 (10%)
Query: 51 ADGNVLQFENGYLVETVIEGNEIGVV-----PYKIRVSEDGELFAVDEVNSNIVKITPPL 105
ADG+V G E +G IGV P I + G L+ D N I K+TP
Sbjct: 44 ADGHVHTLAGGR--EGFADG--IGVAAAFNTPSGIMLDTAGNLYIADTGNHAIRKLTP-- 97
Query: 106 SQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDA 164
+G++ + G G +G + RFN P GV +D +G +YVADT N I I D
Sbjct: 98 ----QGKVTTLAGDGVAGDRNGAAAQVRFNGPVGVAVDAQGRVYVADTYNDRIGVIETDG 153
Query: 165 GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
V T+AGG + G DG A F + R T L +RQ++ N
Sbjct: 154 QVRTLAGG--GLPGMADGIGTQAWFDAPSAALNARSTVVKLSCASATFTVRQLNAN 207
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 59/121 (48%), Gaps = 9/121 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + G L+ D + N + L +AG +G+ DG A FN
Sbjct: 17 PYGLATDAHGTLYIADGGDDNRIH---GLGADGHVHTLAGGREGF---ADGIGVAAAFNT 70
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D GNLY+ADT N AIRK+ G VTT+AG VAG R+G + +F+ V
Sbjct: 71 PSGIMLDTAGNLYIADTGNHAIRKLTPQGKVTTLAG--DGVAGDRNGAAAQVRFNGPVGV 128
Query: 196 V 196
Sbjct: 129 A 129
Score = 46.6 bits (109), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 52/98 (53%), Gaps = 6/98 (6%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTL-NLAIRKIG-DAGVTTIAGGKSNVAGFR 180
G DG+ +A+F+ P G+ D G LY+AD + I +G D V T+AGG+ GF
Sbjct: 3 GVADGRAADAQFDDPYGLATDAHGTLYIADGGDDNRIHGLGADGHVHTLAGGRE---GFA 59
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
DG A F N + + +L + D GN A+R+++
Sbjct: 60 DGIGVAAAF-NTPSGIMLDTAGNLYIADTGNHAIRKLT 96
>gi|290975266|ref|XP_002670364.1| predicted protein [Naegleria gruberi]
gi|284083922|gb|EFC37620.1| predicted protein [Naegleria gruberi]
Length = 1584
Score = 62.0 bits (149), Expect = 7e-07, Method: Composition-based stats.
Identities = 43/145 (29%), Positives = 74/145 (51%), Gaps = 6/145 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +S++GE++ D +N I KI + Y +AG+ + G G ++ N
Sbjct: 181 PNGLSISQNGEIYIADTLNHRIRKI----NSYGVISTIAGTGRASFGGDGGFAVLSQLNS 236
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV + G +Y+ADTLN IRKI GV +T+AG G G + +++ ++ +
Sbjct: 237 PNGVHISQNGEIYIADTLNHRIRKINSYGVISTVAGTGRASFGGDGGLAINSQLNSPYG- 295
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V+V + + D N +R+I +N
Sbjct: 296 VHVSQNGEIYIADTLNHRIRKIFVN 320
Score = 62.0 bits (149), Expect = 7e-07, Method: Composition-based stats.
Identities = 43/145 (29%), Positives = 74/145 (51%), Gaps = 6/145 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +S++GE++ D +N I KI + Y +AG+ + G G ++ N
Sbjct: 576 PNGLSISQNGEIYIADTLNHRIRKI----NSYGVISTIAGTGRASFGGDGGFAVLSQLNS 631
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV + G +Y+ADTLN IRKI GV +T+AG G G + +++ ++ +
Sbjct: 632 PNGVHISQNGEIYIADTLNHRIRKINSYGVISTVAGTGRASFGGDGGLAINSQLNSPYG- 690
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V+V + + D N +R+I +N
Sbjct: 691 VHVSQNGEIYIADTLNHRIRKIFVN 715
Score = 58.9 bits (141), Expect = 6e-06, Method: Composition-based stats.
Identities = 49/143 (34%), Positives = 73/143 (51%), Gaps = 6/143 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +S++GE++ D +N I KI + Y VAG+ + G G ++ N
Sbjct: 237 PNGVHISQNGEIYIADTLNHRIRKI----NSYGVISTVAGTGRASFGGDGGLAINSQLNS 292
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV + G +Y+ADTLN IRKI G +TTIAG S+ + DG A N
Sbjct: 293 PYGVHVSQNGEIYIADTLNHRIRKIFVNGTITTIAGSSSDGSFGGDGGLSIASRLNSPKG 352
Query: 196 VYVRPTCS-LLVIDRGNAALRQI 217
V+V P + +L+ D N +R+I
Sbjct: 353 VFVSPNNNEILIADTSNNRIRKI 375
Score = 58.9 bits (141), Expect = 6e-06, Method: Composition-based stats.
Identities = 49/143 (34%), Positives = 73/143 (51%), Gaps = 6/143 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +S++GE++ D +N I KI + Y VAG+ + G G ++ N
Sbjct: 632 PNGVHISQNGEIYIADTLNHRIRKI----NSYGVISTVAGTGRASFGGDGGLAINSQLNS 687
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV + G +Y+ADTLN IRKI G +TTIAG S+ + DG A N
Sbjct: 688 PYGVHVSQNGEIYIADTLNHRIRKIFVNGTITTIAGSSSDGSFGGDGGLSIASRLNSPKG 747
Query: 196 VYVRPTCS-LLVIDRGNAALRQI 217
V+V P + +L+ D N +R+I
Sbjct: 748 VFVSPNNNEILIADTSNNRIRKI 770
Score = 55.8 bits (133), Expect = 4e-05, Method: Composition-based stats.
Identities = 46/142 (32%), Positives = 72/142 (50%), Gaps = 16/142 (11%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD--------GKPNEARFNHPK 138
EL+ D +N I+ ++ LS Y +V+G Q GH D G + A+ N P
Sbjct: 522 ELYMTDVLNHRILSVS--LSSYLV-TIVSGK-QNCNGHSDCDGFSGDGGLASRAKLNSPN 577
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAG-GKSNVAGFRDGPSEDAKFSNDFDVV 196
G+++ G +Y+ADTLN IRKI GV +TIAG G+++ G DG N + V
Sbjct: 578 GLSISQNGEIYIADTLNHRIRKINSYGVISTIAGTGRASFGG--DGGFAVLSQLNSPNGV 635
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
++ + + D N +R+I+
Sbjct: 636 HISQNGEIYIADTLNHRIRKIN 657
Score = 52.4 bits (124), Expect = 5e-04, Method: Composition-based stats.
Identities = 45/142 (31%), Positives = 71/142 (50%), Gaps = 16/142 (11%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD--------GKPNEARFNHPK 138
EL+ D +N I+ + LS+Y +V+G Q H D G + A+ N P
Sbjct: 127 ELYMTDVLNHRILSVC--LSRYLV-TIVSGK-QNCNEHSDCDGFSGDGGLASRAKLNSPN 182
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAG-GKSNVAGFRDGPSEDAKFSNDFDVV 196
G+++ G +Y+ADTLN IRKI GV +TIAG G+++ G DG N + V
Sbjct: 183 GLSISQNGEIYIADTLNHRIRKINSYGVISTIAGTGRASFGG--DGGFAVLSQLNSPNGV 240
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
++ + + D N +R+I+
Sbjct: 241 HISQNGEIYIADTLNHRIRKIN 262
Score = 40.0 bits (92), Expect = 2.8, Method: Composition-based stats.
Identities = 31/95 (32%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN---EAR 133
PY + VS++GE++ D +N I KI + G + + G G +R
Sbjct: 293 PYGVHVSQNGEIYIADTLNHRIRKI------FVNGTITTIAGSSSDGSFGGDGGLSIASR 346
Query: 134 FNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAGVT 167
N PKGV + N + +ADT N IRKI + G T
Sbjct: 347 LNSPKGVFVSPNNNEILIADTSNNRIRKIIECGTT 381
>gi|337745906|ref|YP_004640068.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus KNP414]
gi|336297095|gb|AEI40198.1| copper amine oxidase domain protein [Paenibacillus mucilaginosus
KNP414]
Length = 537
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 74/159 (46%), Gaps = 20/159 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITP-----PLSQYSRGRLVA---GSFQGYTGHVDGK 128
P + + DG L+ D + I +I+P L+ SR R+V G DG
Sbjct: 184 PGDVAAAPDGTLYVADTLGHTIRRISPQGEVTTLTAPSR-RVVEATPGQVAAAGDFADGP 242
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNV--------AG 178
+A+FN P G+ +D KGNLYV+D+ N IR I VTT+AGG + G
Sbjct: 243 LAQAKFNEPTGIALDAKGNLYVSDSGNQRIRYIDLAKGTVTTVAGGGTAAELKDMYVPGG 302
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
F +G + A+ + + V LL+ D N A+R +
Sbjct: 303 FSNGAALQARLNYPMGIA-VTEEGGLLIADSQNHAVRYL 340
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 16/151 (10%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-----QGY--TGHVDGK 128
VP + V DG D N I K+T R ++AG F +GY G +DG
Sbjct: 67 VPAGLAVLPDGTAAVSDSRNGVIRKLTG-----GRVDVLAGVFYRKDDKGYPVGGLLDGA 121
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDA 187
N + F P G++ G+LYVAD N AIR+I G VTT+AG S G +DG A
Sbjct: 122 ANASLFQEPLGLSAGPDGSLYVADAGNHAIRRIDAKGNVTTVAG--SGRLGAKDGKGAAA 179
Query: 188 KFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+F DV P +L V D +R+IS
Sbjct: 180 EFYRPGDVAAA-PDGTLYVADTLGHTIRRIS 209
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 68/157 (43%), Gaps = 19/157 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + DG L+ D N I +I ++G + + G G DGK A F
Sbjct: 130 PLGLSAGPDGSLYVADAGNHAIRRID------AKGNVTTVAGSGRLGAKDGKGAAAEFYR 183
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG----------GKSNVAG-FRDGPS 184
P V G LYVADTL IR+I G VTT+ G+ AG F DGP
Sbjct: 184 PGDVAAAPDGTLYVADTLGHTIRRISPQGEVTTLTAPSRRVVEATPGQVAAAGDFADGPL 243
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
AKF N+ + + +L V D GN +R I L +
Sbjct: 244 AQAKF-NEPTGIALDAKGNLYVSDSGNQRIRYIDLAK 279
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 50/96 (52%), Gaps = 1/96 (1%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G +G +AR N+P G+ + ++G L +AD+ N A+R + ++T+AG G DG
Sbjct: 302 GFSNGAALQARLNYPMGIAVTEEGGLLIADSQNHAVRYLFGGQLSTLAGAGEQKMGLLDG 361
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
A + DV V S+LV D N LR+++
Sbjct: 362 MEGKAGLNRPADVA-VLGDGSVLVADSFNNRLRRLT 396
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 49/112 (43%), Gaps = 8/112 (7%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG- 171
L + G G +G EA F P G+ + G V+D+ N IRK+ V +AG
Sbjct: 44 LTTAAGTGRLGSANGTGLEASFRVPAGLAVLPDGTAAVSDSRNGVIRKLTGGRVDVLAGV 103
Query: 172 --GKSN----VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
K + V G DG + + F + P SL V D GN A+R+I
Sbjct: 104 FYRKDDKGYPVGGLLDGAANASLFQEPLG-LSAGPDGSLYVADAGNHAIRRI 154
>gi|159471860|ref|XP_001694074.1| predicted protein [Chlamydomonas reinhardtii]
gi|158277241|gb|EDP03010.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1591
Score = 61.6 bits (148), Expect = 8e-07, Method: Composition-based stats.
Identities = 41/106 (38%), Positives = 56/106 (52%), Gaps = 5/106 (4%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAG 171
+AGS GH+DG ARF + +GV +D +GN Y +D+ N +R + D V+T AG
Sbjct: 1242 IAGS--SMPGHLDGPAATARFYNLRGVAVDGEGNCYCSDSSNHCVRLLHAADGMVSTFAG 1299
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
AGFRDG +A+F N + SL V D N LR+I
Sbjct: 1300 SPGQ-AGFRDGAGTEARFRNPCGIAINLQDGSLAVADAENNRLRRI 1344
Score = 38.5 bits (88), Expect = 7.8, Method: Composition-based stats.
Identities = 33/119 (27%), Positives = 53/119 (44%), Gaps = 7/119 (5%)
Query: 73 IGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEA 132
+ P + V G + D + I+++ + G V + G GH DG +A
Sbjct: 1151 LAAAPVGVAVDFQGFAYVADTGHCRILRV-----RLDTGEAVVLAGGGGYGHRDGPGRKA 1205
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
+F P + +D + + V + R D VTTIAG S++ G DGP+ A+F N
Sbjct: 1206 KFACPMYLAVDARDSSLVVSDQHCLRRVASDGFVTTIAG--SSMPGHLDGPAATARFYN 1262
>gi|410098158|ref|ZP_11293137.1| hypothetical protein HMPREF1076_02315 [Parabacteroides goldsteinii
CL02T12C30]
gi|409222431|gb|EKN15373.1| hypothetical protein HMPREF1076_02315 [Parabacteroides goldsteinii
CL02T12C30]
Length = 446
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 55/108 (50%), Gaps = 2/108 (1%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
L AG Q TG DG+ EA FN PK + D G +YVAD N IRKI GV +
Sbjct: 340 ELYAG-VQNETGWQDGEREEALFNEPKQICFDQDGVMYVADAGNHVIRKITRDGVVSTVI 398
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G + V G+ DG EDA F V + ++ V D N +R++++
Sbjct: 399 GIAGVKGYVDGSPEDALFQYPTGVA-IDKEGTIYVGDARNNCVRKLAI 445
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 9/88 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG--SFQGYTGHVDGKPNEARF 134
P +I +DG ++ D N I KIT R +V+ G G+VDG P +A F
Sbjct: 364 PKQICFDQDGVMYVADAGNHVIRKIT-------RDGVVSTVIGIAGVKGYVDGSPEDALF 416
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG 162
+P GV +D +G +YV D N +RK+
Sbjct: 417 QYPTGVAIDKEGTIYVGDARNNCVRKLA 444
>gi|402814801|ref|ZP_10864394.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
gi|402507172|gb|EJW17694.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
Length = 571
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 54/96 (56%), Gaps = 4/96 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRD 181
G DG+ + +F+ P G+ D KGNLYVAD N IRKI AG VTT+AG + G++D
Sbjct: 130 GWEDGEQAQVQFDQPTGLAADKKGNLYVADAGNHVIRKIDGAGKVTTVAG--DGIPGWKD 187
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
G + A+F + V ++ V D N +R+I
Sbjct: 188 GAAGTARFHTP-RAIAVADDGAIYVTDSLNHVIRRI 222
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 70/157 (44%), Gaps = 19/157 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G L+ D N I KI G++ + G G DG ARF+
Sbjct: 144 PTGLAADKKGNLYVADAGNHVIRKID------GAGKVTTVAGDGIPGWKDGAAGTARFHT 197
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNV----------AG-FRDGPS 184
P+ + + D G +YV D+LN IR+I G VTT+ S + AG F+DG
Sbjct: 198 PRAIAVADDGAIYVTDSLNHVIRRIDAMGMVTTLTARSSRIVEYSPGSVTGAGDFKDGNL 257
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
+A F N+ + + L V D GN +R I L Q
Sbjct: 258 TEAMF-NEPSGIALMNDGKLAVADTGNQRIRLIDLKQ 293
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 88/197 (44%), Gaps = 23/197 (11%)
Query: 34 LTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDE 93
+T+L ++R +P + F++G L E + NE P I + DG+L D
Sbjct: 228 VTTLTARSSRIVEYSPGSVTGAGDFKDGNLTEAMF--NE----PSGIALMNDGKLAVADT 281
Query: 94 VNSNIVKITPPLSQYSRGRLV-AGSFQGYT------------GHVDGKPNEARFNHPKGV 140
N I I L Q S + + S GYT G+ DG + + FN P G+
Sbjct: 282 GNQRIRLI--DLKQGSVSTIAGSSSTAGYTLPGVQRVLYAPGGYRDGAASNSIFNSPTGI 339
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 200
+ ++ + VAD N +R I V T++ G++ +G DG + AK ++ V V
Sbjct: 340 AITNENGIVVADRWNHVVRYIYKGEVVTLS-GQAGKSGDLDGITSYAKL-HEPTAVAVLA 397
Query: 201 TCSLLVIDRGNAALRQI 217
S+ V + N A+R I
Sbjct: 398 NGSIAVAEGFNNAIRLI 414
>gi|289664052|ref|ZP_06485633.1| putative NHL repeat protein [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 263
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 65/145 (44%), Gaps = 9/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G L+ D N I K+TP +G++ + G G +G + RFN
Sbjct: 24 PSGIMLDTAGNLYIADTGNHAIRKLTP------QGKVTTLAGDGVAGDRNGAAAQVRFNG 77
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV +D +G +YVADT N I I D V T+AGG + G DG A F
Sbjct: 78 PVGVAVDAQGRVYVADTYNDRIGVIETDGQVRTLAGG--GLPGMADGIGTQAWFDAPSAA 135
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
+ R T L +RQ++ N
Sbjct: 136 LNARSTVVKLSCASATFTVRQLNAN 160
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAG 178
G G DG A FN P G+ +D GNLY+ADT N AIRK+ G VTT+AG VAG
Sbjct: 7 GREGFADGIGVAAAFNTPSGIMLDTAGNLYIADTGNHAIRKLTPQGKVTTLAG--DGVAG 64
Query: 179 FRDGPSEDAKFSNDFDVV 196
R+G + +F+ V
Sbjct: 65 DRNGAAAQVRFNGPVGVA 82
>gi|351729452|ref|ZP_08947143.1| NHL repeat-containing protein [Acidovorax radicis N35]
Length = 659
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 37/66 (56%)
Query: 124 HVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGP 183
H DG ARF PK V +D GN+YVADT N +R+I G T GK V G +DG
Sbjct: 47 HADGDGPAARFYQPKAVAVDTAGNMYVADTENHLVRRISPTGAVTTLAGKPGVCGNQDGT 106
Query: 184 SEDAKF 189
S+ A F
Sbjct: 107 SDTATF 112
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 54/107 (50%), Gaps = 9/107 (8%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL-------NLAIRKIGDAG 165
L+AGS G GH DG+ A+F+ + D GN+YV D L IRKI +G
Sbjct: 484 LLAGSL-GTVGHADGQGAAAQFSALGNMATDAAGNVYVVDGLFHEVNKIGPTIRKITPSG 542
Query: 166 -VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGN 211
V+T+AG + G+ DG A F+ D Y + T +L V +GN
Sbjct: 543 MVSTLAGNPAAAPGYADGTGAAAVFTVDLGFRYPQQTAALAVDAQGN 589
>gi|325186506|emb|CCA21046.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1881
Score = 61.6 bits (148), Expect = 1e-06, Method: Composition-based stats.
Identities = 40/108 (37%), Positives = 58/108 (53%), Gaps = 5/108 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD-AGVTTIAGGKSNVAG 178
G GH DG+ + F++P + +D G YV DT N IR+I + VTT AG ++ G
Sbjct: 236 GSRGHNDGEAVASLFDNPNDLAIDSTGVTYVVDTGNHCIRRIDQRSRVTTFAGNRTR--G 293
Query: 179 FRDGPSEDAK--FSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC 224
FRDGP + A+ F + VV + ++ V D GN +R+I Q C
Sbjct: 294 FRDGPLDIAQYNFPSGIAVVNEQNRVTVYVADTGNHRIRRIRDGQVAC 341
Score = 56.6 bits (135), Expect = 3e-05, Method: Composition-based stats.
Identities = 53/186 (28%), Positives = 84/186 (45%), Gaps = 17/186 (9%)
Query: 40 WTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVV---PYKIRVSEDGELFAVDEVNS 96
W+ S+ + G V F G +G + + P + + G + VD N
Sbjct: 215 WSFPKSNSKTKPQGAVETFAGGSRGHN--DGEAVASLFDNPNDLAIDSTGVTYVVDTGNH 272
Query: 97 NIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN---LYVADT 153
I +I Q SR AG+ G DG + A++N P G+ + ++ N +YVADT
Sbjct: 273 CIRRI----DQRSRVTTFAGNRT--RGFRDGPLDIAQYNFPSGIAVVNEQNRVTVYVADT 326
Query: 154 LNLAIRKIGDAGVTTIAGGKSNV--AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGN 211
N IR+I D V IAG ++ GF DG + +++F + + +L+V D GN
Sbjct: 327 GNHRIRRIRDGQVACIAGRCDSIPHPGFSDGNASESRFDTPLGLA-IDVDGNLIVADSGN 385
Query: 212 AALRQI 217
+R I
Sbjct: 386 NLIRLI 391
Score = 44.3 bits (103), Expect = 0.13, Method: Composition-based stats.
Identities = 31/81 (38%), Positives = 44/81 (54%), Gaps = 6/81 (7%)
Query: 121 YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAGG-----KS 174
+ G DG +E+RF+ P G+ +D GNL VAD+ N IR I G+T T+AG ++
Sbjct: 351 HPGFSDGNASESRFDTPLGLAIDVDGNLIVADSGNNLIRLIDAVGLTRTLAGTIVTSRET 410
Query: 175 NVAGFRDGPSEDAKFSNDFDV 195
+AG E K S D D+
Sbjct: 411 AIAGCVPPCLEGVKGSEDGDL 431
>gi|153864226|ref|ZP_01997193.1| NHL repeat protein [Beggiatoa sp. SS]
gi|152146280|gb|EDN72808.1| NHL repeat protein [Beggiatoa sp. SS]
Length = 313
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 76/145 (52%), Gaps = 10/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI-TPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P +I V++DG ++ D+ N I K+ T + G VA SF G G G A+ N
Sbjct: 70 PKRIVVADDGVVYITDKGNHRIRKVDTSGIITTIVGNGVA-SFGGDGGEAPG----AQLN 124
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P + +DD G+LY+ DT N IRK+ A +TT+AGG S +G DG + A+F N
Sbjct: 125 SPSDIAIDDSGSLYIIDTNNHRIRKVEKATNIITTLAGGVSGFSG-DDGQALSAQF-NTP 182
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
D + + LL+ D N +R I+
Sbjct: 183 DGIAISDEGHLLIADSENHRVRMIT 207
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 73/146 (50%), Gaps = 8/146 (5%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF--QGYTGHVDGKPNEA 132
++P + + + G L+ + N I K+ R +AG+ G++G G+ EA
Sbjct: 10 ILPNGVAIDKAGNLYISEAGNHVIRKVDANTQSIFR---IAGTLGVAGFSGD-GGRAVEA 65
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSN 191
+ N+PK + + D G +Y+ D N IRK+ +G +TTI G G G + A+ ++
Sbjct: 66 KLNNPKRIVVADDGVVYITDKGNHRIRKVDTSGIITTIVGNGVASFGGDGGEAPGAQLNS 125
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
D+ + + SL +ID N +R++
Sbjct: 126 PSDIA-IDDSGSLYIIDTNNHRIRKV 150
>gi|158315873|ref|YP_001508381.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158111278|gb|ABW13475.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 850
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 72/147 (48%), Gaps = 10/147 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG-SFQGYTGHVDGKP-NEARF 134
P + + G + D N I ++ P + + VAG +G++ DG P EA
Sbjct: 599 PAAVALDSAGNILIADTFNQRIRRVDPSGTITT----VAGKDDRGFS--EDGVPATEATL 652
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVAGFRDGPSEDAKFSNDF 193
+P GV D GN+Y+AD+ N IR++G G+ T+AGG A GP+ DA +
Sbjct: 653 WYPGGVVADPTGNIYIADSGNNRIRRVGTDGIIQTVAGGDGEGAFGDGGPAADALLAFPI 712
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
V RP L + D GN +R+I L+
Sbjct: 713 SVAMDRP-GRLYIADSGNNRIRRIGLD 738
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 73/147 (49%), Gaps = 10/147 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKP-NEARF 134
P+ + G ++ D N+ + +IT + + +AG+ + G++G DG P +AR
Sbjct: 543 PFGMVADWAGNIYVADFDNNRVRRITADGTITT----IAGTGEAGFSG--DGGPATQARL 596
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG-PSEDAKFSNDF 193
P V +D GN+ +ADT N IR++ +G T GK + DG P+ +A
Sbjct: 597 RQPAAVALDSAGNILIADTFNQRIRRVDPSGTITTVAGKDDRGFSEDGVPATEATLWYPG 656
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
VV PT ++ + D GN +R++ +
Sbjct: 657 GVV-ADPTGNIYIADSGNNRIRRVGTD 682
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 63/135 (46%), Gaps = 11/135 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKP-NEARF 134
P + + G L+ D N+ I +I L VAG+ GY+G DG P A
Sbjct: 711 PISVAMDRPGRLYIADSGNNRIRRIG--LDGVIE--TVAGTGLPGYSG--DGGPATRATL 764
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P+GV +D +G +++ D N IR++ +G+ T G ++ RD + + S D
Sbjct: 765 RSPRGVAVDARGAIFITDRTNRRIRRVDPSGIITTVAGTAHPG--RDESVDPDEMSPDGP 822
Query: 195 VVYVRPTCSLLVIDR 209
V + PT + V DR
Sbjct: 823 VA-LDPTGDVFVADR 836
Score = 41.6 bits (96), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 7/96 (7%)
Query: 126 DGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFR--D 181
DG P A N P G+ D GN+YVAD N +R+I D +TTIAG + AGF
Sbjct: 531 DGGPAGSAALNGPFGMVADWAGNIYVADFDNNRVRRITADGTITTIAG--TGEAGFSGDG 588
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
GP+ A+ V + ++L+ D N +R++
Sbjct: 589 GPATQARLRQPA-AVALDSAGNILIADTFNQRIRRV 623
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 67/146 (45%), Gaps = 14/146 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI-TPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARF 134
P + G ++ D N+ I ++ T + Q G G+F DG P +A
Sbjct: 655 PGGVVADPTGNIYIADSGNNRIRRVGTDGIIQTVAGGDGEGAFG------DGGPAADALL 708
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR--DGPSEDAKFSN 191
P V MD G LY+AD+ N IR+IG D + T+AG + + G+ GP+ A +
Sbjct: 709 AFPISVAMDRPGRLYIADSGNNRIRRIGLDGVIETVAG--TGLPGYSGDGGPATRATLRS 766
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
V V ++ + DR N +R++
Sbjct: 767 PRGVA-VDARGAIFITDRTNRRIRRV 791
>gi|444306537|ref|ZP_21142300.1| NHL repeat containing protein [Arthrobacter sp. SJCon]
gi|443481173|gb|ELT44105.1| NHL repeat containing protein [Arthrobacter sp. SJCon]
Length = 673
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 62/109 (56%), Gaps = 8/109 (7%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTI- 169
++AG+ G G +DG +EA F P G+ D GN++VAD+ A+RK I DAG T+
Sbjct: 379 IIAGN--GLEGLLDGPAHEAWFAQPSGLAEDGDGNIWVADSETSALRKLVIDDAGSITVE 436
Query: 170 -AGGKSNVA-GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
A GK GFRDGP+ +A+ + V V P S+ + D N A+R+
Sbjct: 437 SALGKGLFDFGFRDGPAVEARLQHPLGVT-VLPDGSVAIADTYNGAVRR 484
>gi|296128041|ref|YP_003635291.1| alkyl hydroperoxide reductase [Cellulomonas flavigena DSM 20109]
gi|296019856|gb|ADG73092.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Cellulomonas flavigena DSM 20109]
Length = 639
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 63/124 (50%), Gaps = 4/124 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFR 180
G VDG P EA F P G+++D G L++AD A+R + DA V ++ G G R
Sbjct: 363 GLVDGTPQEAWFAQPSGLSVDAAGRLWLADAETSALRWVDPADASVHSVVGTGLFDFGHR 422
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVV 240
DG ++ A F + V + P S+LV D N ALR+ + D +++ TD+
Sbjct: 423 DGAADQALFQHPLGVAAL-PDGSVLVADTYNGALRRWQPGEGDAPGTVTTLA-TDLAEPS 480
Query: 241 GAVL 244
G VL
Sbjct: 481 GLVL 484
Score = 42.7 bits (99), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 47/100 (47%), Gaps = 9/100 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G L+ D S + + P + V G+ GH DG ++A F H
Sbjct: 377 PSGLSVDAAGRLWLADAETSALRWVDPADASVHS---VVGTGLFDFGHRDGAADQALFQH 433
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI----GDA--GVTTIA 170
P GV G++ VADT N A+R+ GDA VTT+A
Sbjct: 434 PLGVAALPDGSVLVADTYNGALRRWQPGEGDAPGTVTTLA 473
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 58/125 (46%), Gaps = 30/125 (24%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTM---DDKG----NLYVADTLNLAIR--KIGDAGVTTIA 170
G G VDG P++ARF+ P G+ + + +G ++ VADT+N A+R ++ D VTT+A
Sbjct: 219 GERGLVDGGPDDARFSEPNGLGLVPDELRGRVGYDVLVADTVNHALRGVRLSDGHVTTLA 278
Query: 171 G-GKSNVAGFRDG--------------------PSEDAKFSNDFDVVYVRPTCSLLVIDR 209
G G+ + G D + K S+ +DV + + +V
Sbjct: 279 GTGEQYMVGAVDNVRPADGDTANPAGSGVGDRWAARQVKLSSPWDVAWSTELAAFVVAMA 338
Query: 210 GNAAL 214
GN L
Sbjct: 339 GNHTL 343
>gi|197118445|ref|YP_002138872.1| lipoprotein [Geobacter bemidjiensis Bem]
gi|197087805|gb|ACH39076.1| repeat-containing lipoprotein, putative [Geobacter bemidjiensis
Bem]
Length = 652
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 72/144 (50%), Gaps = 12/144 (8%)
Query: 77 PYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P I +S D L+ D N+ I I + +AG+ G G +G ARF+
Sbjct: 130 PSGIALSSDNSTLYVSDTGNNTIRSINVATGAVTT---LAGT-AGVVGSTNGTGGAARFS 185
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P GV D NLYVAD+LN +RKI A VTT+AG S + F DG A F++
Sbjct: 186 SPSGVATDGA-NLYVADSLNHRVRKIVLASAAVTTLAG--SGLQDFADGTGIAASFNSPR 242
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
+ SL + D+GN+A+R+I
Sbjct: 243 GI--ATDGVSLYLADQGNSAVRRI 264
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 2/104 (1%)
Query: 117 SFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA-GVTTIAGGKSN 175
SF G G DG ++ARF+ P V D +LYV DT N IRK+ A G T+ G +
Sbjct: 55 SFAGSAGSTDGAASQARFSIPGDVASDGT-SLYVVDTGNSVIRKVVLATGATSTVAGLAG 113
Query: 176 VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+ G DG A+F+ + +L V D GN +R I++
Sbjct: 114 ITGSGDGTGAVARFNYPSGIALSSDNSTLYVSDTGNNTIRSINV 157
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 45/81 (55%), Gaps = 6/81 (7%)
Query: 82 VSEDG-ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
++ DG LF D N+ + KI S +AG+ G TG DG + ARF+ P G+
Sbjct: 450 LTTDGTNLFVSDTGNNTVRKIVIATGAVST---LAGT-AGATGSADGAGSAARFHSPNGL 505
Query: 141 TMDDKGNLYVADTLNLAIRKI 161
T D NLYVAD+ N AIRKI
Sbjct: 506 TTDGT-NLYVADSGNNAIRKI 525
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 58/134 (43%), Gaps = 13/134 (9%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G L+ D N + K+ S + G G DG A F P +T D +
Sbjct: 348 GVLYVADTGNDLLRKVAVATGASS-------TLAGTVGSADGVGVAAAFTSPYDLTTDGR 400
Query: 146 GNLYVADTLNLAIRKIGDA--GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 203
N+YVADT N +R+I A VTT+A G + AG DG A F F +
Sbjct: 401 -NVYVADTNNHTVRQISIATGAVTTLA-GTPDRAGSGDGTGAAASFR--FPSGLTTDGTN 456
Query: 204 LLVIDRGNAALRQI 217
L V D GN +R+I
Sbjct: 457 LFVSDTGNNTVRKI 470
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 67/141 (47%), Gaps = 13/141 (9%)
Query: 82 VSEDG-ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
++ DG +F D +N+ K+ S +AG G G D ARF+ P G+
Sbjct: 287 IATDGTSVFVTDLERNNLRKVDIATQAVST---LAGDGGGSPGSTDAVGGAARFSAPAGL 343
Query: 141 TMDDKGNLYVADTLNLAIRKIGDA--GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 198
+ + G LYVADT N +RK+ A +T+AG G DG A F++ +D+
Sbjct: 344 VL-NAGVLYVADTGNDLLRKVAVATGASSTLAG----TVGSADGVGVAAAFTSPYDL--T 396
Query: 199 RPTCSLLVIDRGNAALRQISL 219
++ V D N +RQIS+
Sbjct: 397 TDGRNVYVADTNNHTVRQISI 417
>gi|320106842|ref|YP_004182432.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319925363|gb|ADV82438.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 382
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 57/119 (47%), Gaps = 10/119 (8%)
Query: 112 RLVAGSFQ-------GYTGHVDGKPNEARFNHPKGVTMDD-KGNLYVADTLNLAIRKIGD 163
RL+ G+ Q G G+ DG+ N ARFN P V + G Y+ADT N IR++
Sbjct: 259 RLLNGANQVTTLAGNGTAGYADGQGNSARFNKPTQVVYNSGDGAYYIADTFNNCIRRMDS 318
Query: 164 AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
AG T G +G DG S AKF + L V D GN A+R++ +N +
Sbjct: 319 AGNVTTYAGIGGQSGLVDGASTSAKFDKPTGIAIAN--GYLYVADSGNNAIRRVDMNAN 375
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 51/102 (50%), Gaps = 12/102 (11%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI---GDAGVTTIAGGKSNV 176
G G DG + +RFN P G D L + DT N +IR + +AG T IA S V
Sbjct: 170 GSLGDADGNGSASRFNSPTGFATDG---LLIGDTGNNSIRTMDANNNAG-TLIA---SQV 222
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
GF DG A+FS + V + L++D GN A+R ++
Sbjct: 223 PGFIDGNLSVARFS--YPTVALATPIGTLIVDSGNHAIRLLN 262
Score = 42.4 bits (98), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 54/108 (50%), Gaps = 7/108 (6%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTI 169
G L+A G+ +DG + ARF++P G L V D+ N AIR + A VTT+
Sbjct: 215 GTLIASQVPGF---IDGNLSVARFSYPTVALATPIGTLIV-DSGNHAIRLLNGANQVTTL 270
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
AG + AG+ DG A+F+ VVY + + D N +R++
Sbjct: 271 AG--NGTAGYADGQGNSARFNKPTQVVYNSGDGAYYIADTFNNCIRRM 316
>gi|325104254|ref|YP_004273908.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
gi|324973102|gb|ADY52086.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
Length = 439
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 84/176 (47%), Gaps = 23/176 (13%)
Query: 55 VLQFENGYLVETVIEGNEIGVV-----------PYKIRVSEDGELFAVDEVNSNIVKITP 103
V++ E+ Y+V T+ G + P I V +G++ D N +I K+T
Sbjct: 113 VIKKESNYIVSTIAGSTTFGFIDGNGLDARFRNPDGILVDMNGDIIITDRTNHSIRKMT- 171
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG- 162
+ G + + G +G+ +GKP + FN P TMD GN+ V + IRKI
Sbjct: 172 -----TAGVVSTLAGTGVSGYANGKPGQ--FNTPWQSTMDAAGNIIVIEKDGGRIRKIAP 224
Query: 163 DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
D V+ IAG S GF DG A+F++ D V V ++ V DR N +R+I+
Sbjct: 225 DGAVSLIAGTGS--LGFTDGNVSVARFNHALDGV-VDSEGNIFVADRNNYRIRKIT 277
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 47/85 (55%), Gaps = 1/85 (1%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPL-SQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
+ + +G + D N+ I KITP + + +++ + + G G DG + A FN
Sbjct: 352 FGLNFDNEGNILLADASNNRIRKITPGVGNDWTKATVSTIAGNGTAGRGDGLGHAATFNQ 411
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P V MD KG++YVAD +N +IRKI
Sbjct: 412 PYDVVMDAKGDIYVADNVNHSIRKI 436
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 50/96 (52%), Gaps = 7/96 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P++ + G + +++ I KI P + L+AG+ G G DG + ARFNH
Sbjct: 198 PWQSTMDAAGNIIVIEKDGGRIRKIAPDGAV----SLIAGT--GSLGFTDGNVSVARFNH 251
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
+D +GN++VAD N IRKI AG V+T G
Sbjct: 252 ALDGVVDSEGNIFVADRNNYRIRKITPAGMVSTFMG 287
Score = 45.1 bits (105), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 54/112 (48%), Gaps = 11/112 (9%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--------GDAG 165
+ GS +GY G P + G+ D++GN+ +AD N IRKI A
Sbjct: 328 IVGSTKGYDDGTPGNPLTVKLGDIFGLNFDNEGNILLADASNNRIRKITPGVGNDWTKAT 387
Query: 166 VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
V+TIAG + AG DG A F+ +DVV + + V D N ++R+I
Sbjct: 388 VSTIAG--NGTAGRGDGLGHAATFNQPYDVV-MDAKGDIYVADNVNHSIRKI 436
>gi|423215200|ref|ZP_17201728.1| hypothetical protein HMPREF1074_03260 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692463|gb|EIY85701.1| hypothetical protein HMPREF1074_03260 [Bacteroides xylanisolvens
CL03T12C04]
Length = 451
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 54/98 (55%), Gaps = 1/98 (1%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD 181
+G+V+G R N P+ D+ GN++V + IRKI AGV ++ G +GF D
Sbjct: 354 SGYVNGSGATVRLNQPRQPAFDEDGNMFVPEKSAHIIRKITPAGVASLYAGIPGQSGFGD 413
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G E AKF N + V V P S+ V DR N +R++++
Sbjct: 414 GLPELAKF-NSPECVTVYPDNSVYVADRENHVIRRVTV 450
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 41/78 (52%), Gaps = 5/78 (6%)
Query: 84 EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMD 143
EDG +F ++ I KITP L AG G +G DG P A+FN P+ VT+
Sbjct: 376 EDGNMFVPEKSAHIIRKITPA----GVASLYAG-IPGQSGFGDGLPELAKFNSPECVTVY 430
Query: 144 DKGNLYVADTLNLAIRKI 161
++YVAD N IR++
Sbjct: 431 PDNSVYVADRENHVIRRV 448
>gi|197120448|ref|YP_002132399.1| fibronectin type III domain-containing protein [Anaeromyxobacter
sp. K]
gi|196170297|gb|ACG71270.1| Fibronectin type III domain protein [Anaeromyxobacter sp. K]
Length = 460
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 70/138 (50%), Gaps = 8/138 (5%)
Query: 82 VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVT 141
++ DG V + +N+V+ L+ + L + Q G DG A F P+GV
Sbjct: 219 ITSDGTYLYVADTANNLVRRID-LATAAVTTLAGSTTQ--AGAADGPGASATFRWPRGVA 275
Query: 142 MDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 199
D LYVADT+N AIR + A V+T+AG ++ AG+ DG A+FS + +
Sbjct: 276 TDGT-TLYVADTMNNAIRALDLATGVVSTLAGDPASYAGYADGRGTAARFSAPYGL--AM 332
Query: 200 PTCSLLVIDRGNAALRQI 217
LLV D GN+A+R +
Sbjct: 333 HGGELLVADSGNSAIRAV 350
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 64/135 (47%), Gaps = 9/135 (6%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG 146
L+ D N I ++ Q + +AG + Y GH DG A FN P+G+T D
Sbjct: 169 SLYVCDTTNMLIRRLDLSTRQVTT---LAGDPRTYAGHQDGIGTAASFNSPRGITSDGT- 224
Query: 147 NLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
LYVADT N +R+I A VTT+AG + AG DGP A F V +L
Sbjct: 225 YLYVADTANNLVRRIDLATAAVTTLAGSTTQ-AGAADGPGASATFRWPRGV--ATDGTTL 281
Query: 205 LVIDRGNAALRQISL 219
V D N A+R + L
Sbjct: 282 YVADTMNNAIRALDL 296
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 57/115 (49%), Gaps = 14/115 (12%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAG 171
+AGS Y DG ARFN P G+ + +LYV DT N+ IR+ + VTT+AG
Sbjct: 140 IAGSSMDY---ADGDAIHARFNTPTGI-VGVGTSLYVCDTTNMLIRRLDLSTRQVTTLAG 195
Query: 172 GKSNVAGFRDGPSEDAKF------SNDFDVVYVRPTCSLLV--IDRGNAALRQIS 218
AG +DG A F ++D +YV T + LV ID AA+ ++
Sbjct: 196 DPRTYAGHQDGIGTAASFNSPRGITSDGTYLYVADTANNLVRRIDLATAAVTTLA 250
>gi|116620673|ref|YP_822829.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223835|gb|ABJ82544.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 912
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 75/147 (51%), Gaps = 13/147 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITP--PLSQYSRGRLVAGS-FQGYTGHVDGKPNEAR 133
P+ + V G +F D N + ++TP +S Y AGS +G+ G G A
Sbjct: 156 PFAVAVDGQGNVFIADTNNVVVRRVTPDGTISTY------AGSGARGFAGD-GGAARNAW 208
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSN 191
F+ P+GV +D G LY+ADT N IR++ D +TT AG G + V G +GP A S
Sbjct: 209 FDGPEGVAVDANGVLYIADTFNGRIRRVAADGTITTAAGVGSTGVFGGDNGPPASAALSL 268
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQIS 218
DV R + + + D GN+ +R ++
Sbjct: 269 PTDVAVDR-SGNPYIADFGNSRVRMVA 294
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/203 (28%), Positives = 90/203 (44%), Gaps = 21/203 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P + V G ++ D N+ I KI PL S VAG G T + DG +
Sbjct: 556 PRGVAVDAQGNVYVADTGNNRIRKI-DPLGNIS---TVAG--DGSTEFIPGDGIATQQGL 609
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSND 192
P+GV +D GN+YVA+T + +R++ G +TTIAG G+ G DG A N
Sbjct: 610 TDPRGVAVDRAGNIYVAETGHNRVRRVSTGGTITTIAGNGQCCYTG--DGGLGTAAQLNQ 667
Query: 193 FDVVYVRPTCSLLVIDRGNAALR---------QISLNQDDCEYQYNSISPTDILMVVGAV 243
+ V ++ V D GN A+R Q+ + ++P +++++ GA
Sbjct: 668 PWGIAVDSAGNIYVADSGNNAIRLLAPVSANIQVGAVVNAASNLPGPVAPGELVVLYGAG 727
Query: 244 LVGYVTCMLQQGFGPFFFSRTQQ 266
L G + + GP ++ Q
Sbjct: 728 LAGVQSVLFNGVAGPLVYTSAGQ 750
Score = 45.1 bits (105), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 72/147 (48%), Gaps = 14/147 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVK--ITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEAR 133
P + V G +F D ++N+V+ T + Q G G +QG DG P A+
Sbjct: 100 PMGLAVDAAGNIFVADR-DANVVRRIATTGIIQTVAGNGTPG-YQG-----DGGPATTAQ 152
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR-DGPSEDAKFSN 191
N P V +D +GN+++ADT N+ +R++ D ++T AG S GF DG + + +
Sbjct: 153 LNAPFAVAVDGQGNVFIADTNNVVVRRVTPDGTISTYAG--SGARGFAGDGGAARNAWFD 210
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ V V L + D N +R+++
Sbjct: 211 GPEGVAVDANGVLYIADTFNGRIRRVA 237
Score = 45.1 bits (105), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 35/53 (66%), Gaps = 2/53 (3%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
G++G G A+ N P+GV +D +GN+YVADT N IRKI G ++T+AG
Sbjct: 540 GFSGD-GGAATAAKLNRPRGVAVDAQGNVYVADTGNNRIRKIDPLGNISTVAG 591
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/95 (38%), Positives = 49/95 (51%), Gaps = 5/95 (5%)
Query: 126 DGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAG-GKSNVAGFRDG 182
DG P +A N P+GV D GN Y+ADT + +RK+ GV TIAG G +G DG
Sbjct: 488 DGLPATQASVNQPEGVAGDAAGNTYIADTFDNVVRKVTTDGVIHTIAGFGTPGFSG--DG 545
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+ A N V V ++ V D GN +R+I
Sbjct: 546 GAATAAKLNRPRGVAVDAQGNVYVADTGNNRIRKI 580
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 45/98 (45%), Gaps = 9/98 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV---DGKPNEAR 133
P + V +G L+ D N I ++ + G + + G TG +G P A
Sbjct: 212 PEGVAVDANGVLYIADTFNGRIRRVA------ADGTITTAAGVGSTGVFGGDNGPPASAA 265
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
+ P V +D GN Y+AD N +R + + +TT+AG
Sbjct: 266 LSLPTDVAVDRSGNPYIADFGNSRVRMVANGVITTVAG 303
Score = 38.9 bits (89), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF 179
A+F+ P GV +D GN+ + D+ N +RKI +TTIAG + AGF
Sbjct: 384 AQFDTPTGVAIDAGGNVLIVDSQNQRLRKISRGVITTIAG--TGTAGF 429
>gi|344923600|ref|ZP_08777061.1| NHL repeat containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 1226
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 63/135 (46%), Gaps = 13/135 (9%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
++ D N I KI S LVAGS HVDG ARF +P V +D N
Sbjct: 94 VYVADSQNGVIRKIDTSTRVTS---LVAGSPGALNAHVDGTYTTARFAYPTSVAVDSSSN 150
Query: 148 LYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD----VVYVRPTC 202
LY+ DTLN IRKI VTT+AGG ++ + + S DF+ V
Sbjct: 151 LYIGDTLNHCIRKIAPGNVVTTLAGGGGSLP-----NTTSSNLSADFNGPNGVAVDAGGT 205
Query: 203 SLLVIDRGNAALRQI 217
++ V D N+ LR+I
Sbjct: 206 TVYVADTSNSMLRKI 220
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 72/142 (50%), Gaps = 10/142 (7%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ V G ++ D N+ I KITP + ++AGS G + + +G ARF++P+G
Sbjct: 31 VAVDGSGTVYVADSNNNVIKKITPAGTV----SVLAGS--GVSDYAEGTGTAARFSYPEG 84
Query: 140 VTMDDKG-NLYVADTLNLAIRKIGDAGVTT--IAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
V +D G +YVAD+ N IRKI + T +AG + DG A+F+ V
Sbjct: 85 VGVDVAGTTVYVADSQNGVIRKIDTSTRVTSLVAGSPGALNAHVDGTYTTARFAYPTSVA 144
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V + +L + D N +R+I+
Sbjct: 145 -VDSSSNLYIGDTLNHCIRKIA 165
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
FN GV +D G +YVAD+ N I+KI AG ++ G S V+ + +G A+FS
Sbjct: 25 FNFASGVAVDGSGTVYVADSNNNVIKKITPAGTVSVLAG-SGVSDYAEGTGTAARFSYPE 83
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
V ++ V D N +R+I
Sbjct: 84 GVGVDVAGTTVYVADSQNGVIRKI 107
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS--NVAGFRDGPSEDAKFSNDF 193
+P GV +D G Y+ DT + I K G T + N GF +G A+F+ +
Sbjct: 241 NPYGVAVDSLGVAYIGDTGHNCIHKTYPDGTTVLLASIDVPNSPGFVNGVGAAARFNAPW 300
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLNQ 221
+ T SL + DR N A+R+I Q
Sbjct: 301 QIATDPTTGSLYIADRDNGAIRKIDTYQ 328
>gi|284038675|ref|YP_003388605.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
gi|283817968|gb|ADB39806.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
Length = 359
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 62/120 (51%), Gaps = 12/120 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + L+ D + + I K+TP +G L + G TG+ DG ++A+F
Sbjct: 52 PVSVAFDKANNLYFADGI-ARIFKVTP------QGNLSLYAGSGGTGYQDGSLDKAKFLW 104
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR----DGPSEDAKFSN 191
P G+ D GNLYVAD+ N AIRKI D VTT AG +V DG ++A+F N
Sbjct: 105 PYGLAFDRAGNLYVADSGNQAIRKISPDGQVTTFAGQPYDVTSITNVSVDGIGKEARFYN 164
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 70/146 (47%), Gaps = 10/146 (6%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
+ P + V+ GELF + +S + + +G + Y +G N AR+
Sbjct: 222 ISPRGLAVNSKGELFI------GCPGVIYKVSTSGQTTVYSGVREQYGSSPNGPINSARY 275
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ D NLY+ +T +RKI D V+ + G S + G++DGP + A+F +
Sbjct: 276 GLITSLRFDSNDNLYIGETGAGIVRKIATDGQVSDVTG--SRLGGYKDGPLQAAEFGSVE 333
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISL 219
D+ + P+ SL V D N A+R+I+
Sbjct: 334 DLAF-SPSGSLYVADNRNGAIRKITF 358
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 44/90 (48%), Gaps = 3/90 (3%)
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAK 188
P++ + P V D NLY AD + + ++ AG S G++DG + AK
Sbjct: 44 PSQLNNSRPVSVAFDKANNLYFADGIARIFKVTPQGNLSLYAG--SGGTGYQDGSLDKAK 101
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
F + + + R +L V D GN A+R+IS
Sbjct: 102 FLWPYGLAFDR-AGNLYVADSGNQAIRKIS 130
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 49/107 (45%), Gaps = 18/107 (16%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY------TGHVDGKPN 130
PY + G L+ D N I KI+P G++ + Q Y VDG
Sbjct: 105 PYGLAFDRAGNLYVADSGNQAIRKISP------DGQVTTFAGQPYDVTSITNVSVDGIGK 158
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLA-----IRKIG-DAGVTTIAG 171
EARF +P +T+D NL+V + + + IRKI A VT+ AG
Sbjct: 159 EARFYNPLVLTIDRSDNLFVGEYADASAYVAVIRKITPQANVTSYAG 205
>gi|290995070|ref|XP_002680154.1| predicted protein [Naegleria gruberi]
gi|284093774|gb|EFC47410.1| predicted protein [Naegleria gruberi]
Length = 636
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 83/165 (50%), Gaps = 26/165 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGS-FQGYTGHVDGK-PNEA 132
PY I +S GE++ D N+ I KIT +S +AG+ QGY+G DG A
Sbjct: 433 PYGIAISSSGEIYIADTNNNRIRKITTSGIIST------IAGTGTQGYSG--DGSSATSA 484
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSE-DAKF 189
+ +P GV + +G +YVAD N IRKI +G ++TIAG G S +G DG S A+
Sbjct: 485 QLYNPYGVAISSRGEIYVADYNNNRIRKITTSGIISTIAGTGTSGYSG--DGSSAISAQL 542
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQIS--------LNQDDCEY 226
N + V + + + + D N +R++S L CEY
Sbjct: 543 YNPYGVA-ISSSDEIYITDTNNNRIRKLSPWCTGNAILLDGSCEY 586
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 71/147 (48%), Gaps = 16/147 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
P + +S E++ VD N+ I KIT +S + GY+G V A+
Sbjct: 105 PSGVAISSSDEIYIVDRSNNRIRKITTSGIIST-----IAGNGTAGYSGDV---ATSAKL 156
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+P G+ + Y+ADT N IRKI +G ++TIAG + AG+ G AK + +
Sbjct: 157 YYPSGIAISSSDETYIADTNNHRIRKITTSGIISTIAG--NGTAGY-SGDGSSAKSAQLY 213
Query: 194 --DVVYVRPTCSLLVIDRGNAALRQIS 218
V + + + ++DR N +R+I+
Sbjct: 214 YPSGVAISSSDEIYIVDRSNNRIRKIT 240
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 71/143 (49%), Gaps = 18/143 (12%)
Query: 83 SEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGS-FQGYTGHVDGK-PNEARFNHPK 138
S GE++ D N I KIT +S +AG+ GY+G DG A+ N P
Sbjct: 383 SSSGEIYIADTNNHRIRKITTSGIIST------IAGTGTSGYSG--DGSSATSAQLNSPY 434
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSE-DAKFSNDFDV 195
G+ + G +Y+ADT N IRKI +G ++TIAG G +G DG S A+ N + V
Sbjct: 435 GIAISSSGEIYIADTNNNRIRKITTSGIISTIAGTGTQGYSG--DGSSATSAQLYNPYGV 492
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ + V D N +R+I+
Sbjct: 493 A-ISSRGEIYVADYNNNRIRKIT 514
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 56/113 (49%), Gaps = 14/113 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGK-PNEAR 133
P + +S E++ VD N+ I KIT +S + GY+G DG A+
Sbjct: 215 PSGVAISSSDEIYIVDRSNNRIRKITTSGIIST-----IAGNGTAGYSG--DGSSATSAQ 267
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPS 184
N P G+ + +Y+AD N IRKI +G ++TIAG G S +G DG S
Sbjct: 268 LNSPSGIAISSSDEIYIADMFNNRIRKITTSGIISTIAGTGTSGYSG--DGSS 318
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 66/147 (44%), Gaps = 20/147 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGS-FQGYTGHVDGK-PNEA 132
P I +S E++ D N+ I KIT +S +AG+ GY+G DG
Sbjct: 271 PSGIAISSSDEIYIADMFNNRIRKITTSGIIST------IAGTGTSGYSG--DGSSATSI 322
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSN 191
+ P GV + +Y+AD N IRKI +G ++TIAG G DG S + N
Sbjct: 323 QLYFPYGVAVSLSDEIYIADMFNNRIRKITTSGIISTIAG------GIGDGLSATTAYIN 376
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ + + + D N +R+I+
Sbjct: 377 AITFEF-SSSGEIYIADTNNHRIRKIT 402
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 66/147 (44%), Gaps = 14/147 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGK-PNEAR 133
P I +S E + D N I KIT +S + GY+G DG A+
Sbjct: 159 PSGIAISSSDETYIADTNNHRIRKITTSGIIST-----IAGNGTAGYSG--DGSSAKSAQ 211
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR-DGPSEDAKFSN 191
+P GV + +Y+ D N IRKI +G ++TIAG + AG+ DG S + N
Sbjct: 212 LYYPSGVAISSSDEIYIVDRSNNRIRKITTSGIISTIAG--NGTAGYSGDGSSATSAQLN 269
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ + + + + D N +R+I+
Sbjct: 270 SPSGIAISSSDEIYIADMFNNRIRKIT 296
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 48/91 (52%), Gaps = 6/91 (6%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 189
A+ +P G+ + Y+ADT N IRKI +G ++TIAG + AG+ G AK
Sbjct: 43 SAKLYYPSGIAISSSDETYIADTNNHRIRKITTSGIISTIAG--NGTAGY-SGDGSSAKS 99
Query: 190 SNDF--DVVYVRPTCSLLVIDRGNAALRQIS 218
+ + V + + + ++DR N +R+I+
Sbjct: 100 AQLYYPSGVAISSSDEIYIVDRSNNRIRKIT 130
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 52/117 (44%), Gaps = 13/117 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGK-PNEAR 133
P I +S E + D N I KIT +S + GY+G DG A+
Sbjct: 49 PSGIAISSSDETYIADTNNHRIRKITTSGIIST-----IAGNGTAGYSG--DGSSAKSAQ 101
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 189
+P GV + +Y+ D N IRKI +G ++TIAG + AG+ + AK
Sbjct: 102 LYYPSGVAISSSDEIYIVDRSNNRIRKITTSGIISTIAG--NGTAGYSGDVATSAKL 156
>gi|386722458|ref|YP_006188784.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus K02]
gi|384089583|gb|AFH61019.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus K02]
Length = 537
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 16/151 (10%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-----QGY--TGHVDGK 128
VP + V DG D N I K+T R ++AG F +GY G +DG
Sbjct: 67 VPAGLAVLPDGTAAVSDSRNGVIRKLTG-----GRVDVLAGVFYRKDDKGYPVGGLLDGA 121
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDA 187
N + F P G++ G+LYVAD N AIR+I G VTT+AG S G +DG A
Sbjct: 122 ANASLFQEPLGLSAGPDGSLYVADAGNHAIRRIDAKGNVTTVAG--SGRLGVKDGKGAAA 179
Query: 188 KFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+F DV P +L V D +R+IS
Sbjct: 180 EFYRPGDVAAA-PDGTLYVADTLGHTIRRIS 209
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 74/159 (46%), Gaps = 20/159 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITP-----PLSQYSRGRLVA---GSFQGYTGHVDGK 128
P + + DG L+ D + I +I+P L+ S+ R+V G DG
Sbjct: 184 PGDVAAAPDGTLYVADTLGHTIRRISPQGEVTTLTAPSK-RVVEATPGQVAAAGDFADGP 242
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNV--------AG 178
+A+FN P G+ +D KGNLYV+D+ N +R I VTT+AGG + G
Sbjct: 243 LAQAKFNEPTGIALDAKGNLYVSDSGNQRVRYIDLAKGTVTTVAGGGTAAELKDMYVPGG 302
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
F +G + A+ + + V LL+ D N A+R +
Sbjct: 303 FSNGAALQARLNYPMGIA-VTEEGGLLIADSQNHAVRYL 340
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 68/157 (43%), Gaps = 19/157 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + DG L+ D N I +I ++G + + G G DGK A F
Sbjct: 130 PLGLSAGPDGSLYVADAGNHAIRRID------AKGNVTTVAGSGRLGVKDGKGAAAEFYR 183
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG----------GKSNVAG-FRDGPS 184
P V G LYVADTL IR+I G VTT+ G+ AG F DGP
Sbjct: 184 PGDVAAAPDGTLYVADTLGHTIRRISPQGEVTTLTAPSKRVVEATPGQVAAAGDFADGPL 243
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
AKF N+ + + +L V D GN +R I L +
Sbjct: 244 AQAKF-NEPTGIALDAKGNLYVSDSGNQRVRYIDLAK 279
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 50/96 (52%), Gaps = 1/96 (1%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G +G +AR N+P G+ + ++G L +AD+ N A+R + +TT+AG G DG
Sbjct: 302 GFSNGAALQARLNYPMGIAVTEEGGLLIADSQNHAVRYLFGGQLTTLAGAGEQKTGLLDG 361
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
A + DV V S+LV D N LR+++
Sbjct: 362 MEGKAGLNRPADVA-VLGDGSVLVADSFNNRLRRLT 396
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 49/112 (43%), Gaps = 8/112 (7%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG- 171
L + G G +G EA F P G+ + G V+D+ N IRK+ V +AG
Sbjct: 44 LTTAAGTGRLGSANGTGLEASFRVPAGLAVLPDGTAAVSDSRNGVIRKLTGGRVDVLAGV 103
Query: 172 --GKSN----VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
K + V G DG + + F + P SL V D GN A+R+I
Sbjct: 104 FYRKDDKGYPVGGLLDGAANASLFQEPLG-LSAGPDGSLYVADAGNHAIRRI 154
>gi|114568006|ref|YP_755160.1| leucine-rich repeat-containing protein [Syntrophomonas wolfei
subsp. wolfei str. Goettingen]
gi|114338941|gb|ABI69789.1| Leucine-rich repeat (LRR) protein-like protein [Syntrophomonas
wolfei subsp. wolfei str. Goettingen]
Length = 1351
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 73/146 (50%), Gaps = 10/146 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG-YTGHVDGKP-NEARF 134
P + ++G L+ VD+ N+ I K+ + VAG+ G Y G DG P A+
Sbjct: 533 PVGVACDKNGNLYIVDKDNNRIRKV----DNTGKISTVAGNGTGGYAG--DGSPATSAQI 586
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P GVT D+ GN+Y+AD N IRK+ +G +TT+AG S GP+E A N
Sbjct: 587 WVPYGVTFDNNGNMYIADMNNKRIRKVDPSGIITTVAGNGSWKYSGDGGPAEAAGLCNAV 646
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISL 219
V + +L + D + +R++ L
Sbjct: 647 GVA-CDSSGNLYIADSHSNCIRKVVL 671
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 70/145 (48%), Gaps = 9/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSR-GRLVAGSFQGYTGHVDGKPNEARFN 135
P + G L+ + E NSN ++ + S +V + Y G G AR N
Sbjct: 418 PIGVACDSSGNLY-IAESNSNCIRKVDSSGKISVVAGIVTQGWATYEGD-GGLATSARLN 475
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSND 192
+P GV D GNLY+AD N IRK+ +G ++T+AG + AG+ GP+ A+ +
Sbjct: 476 YPFGVACDGSGNLYIADRGNHRIRKVDTSGIISTVAG--NGTAGYSGDGGPATSAQLKDP 533
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
V +L ++D+ N +R++
Sbjct: 534 VGVA-CDKNGNLYIVDKDNNRIRKV 557
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 68/146 (46%), Gaps = 14/146 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI-TPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARF 134
P+ + G L+ D N I K+ T + G AG Y+G DG P A+
Sbjct: 477 PFGVACDGSGNLYIADRGNHRIRKVDTSGIISTVAGNGTAG----YSG--DGGPATSAQL 530
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDG-PSEDAKFSN 191
P GV D GNLY+ D N IRK+ + G ++T+AG G AG DG P+ A+
Sbjct: 531 KDPVGVACDKNGNLYIVDKDNNRIRKVDNTGKISTVAGNGTGGYAG--DGSPATSAQIWV 588
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
+ V + ++ + D N +R++
Sbjct: 589 PYGVTF-DNNGNMYIADMNNKRIRKV 613
Score = 42.4 bits (98), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 64/139 (46%), Gaps = 19/139 (13%)
Query: 114 VAGS-FQGYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIA 170
VAG+ GY+G DG P A+ N P+GV D NLY+ D N IRK+ +G ++T+A
Sbjct: 339 VAGNGTAGYSG--DGGPATSAQLNTPRGVVCDGADNLYIVDGDNQRIRKVDTSGKISTVA 396
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNS 230
G ++ GP+ A + V + +L + + + +R++
Sbjct: 397 GNGTHWFEGDGGPATAAGLYDPIGVA-CDSSGNLYIAESNSNCIRKVD------------ 443
Query: 231 ISPTDILMVVGAVLVGYVT 249
S I +V G V G+ T
Sbjct: 444 -SSGKISVVAGIVTQGWAT 461
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 69/143 (48%), Gaps = 25/143 (17%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS----FQGYTGHVDGKP-NEARFNHPKG 139
L+ VD N I K+ + G++ VAG+ F+G DG P A P G
Sbjct: 372 NLYIVDGDNQRIRKVD------TSGKISTVAGNGTHWFEG-----DGGPATAAGLYDPIG 420
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG----GKSNVAGFRDGPSEDAKFSNDFD 194
V D GNLY+A++ + IRK+ +G ++ +AG G + G G + A+ + F
Sbjct: 421 VACDSSGNLYIAESNSNCIRKVDSSGKISVVAGIVTQGWATYEG-DGGLATSARLNYPFG 479
Query: 195 VVYVRPTCSLLVIDRGNAALRQI 217
V + +L + DRGN +R++
Sbjct: 480 VA-CDGSGNLYIADRGNHRIRKV 501
>gi|392945712|ref|ZP_10311354.1| protein kinase family protein,NHL repeat protein [Frankia sp. QA3]
gi|392289006|gb|EIV95030.1| protein kinase family protein,NHL repeat protein [Frankia sp. QA3]
Length = 770
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 73/145 (50%), Gaps = 8/145 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P + + +DG L+ D N + +I P R LVAG Y DGK A +
Sbjct: 617 PSGLALGDDGSLYVADTGNETVRRIDP----SGRITLVAGRPGTYGHRGDGKAATAALLD 672
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGP-SEDAKFSNDF 193
P+G+ + G+LY+AD+ N IR++ G +TT+AG S A RDG + + SN
Sbjct: 673 DPRGLALGPDGSLYIADSGNDVIRRVDRHGIITTVAGTGSYSADDRDGALATQTRLSNPA 732
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
V+ V PT ++ + G+ +R++
Sbjct: 733 GVL-VDPTGAIYIACSGDGTVRRVG 756
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 68/145 (46%), Gaps = 13/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPN-EAR 133
P + V +DG L+ + + + ++GR+ VAG DG P A
Sbjct: 563 PSAVAVDDDGVLYVAEGYR---------VRRIAQGRITTVAGKSTESGSAGDGGPAINAT 613
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSND 192
P G+ + D G+LYVADT N +R+I +G T+ G+ G R DG + A +D
Sbjct: 614 LYQPSGLALGDDGSLYVADTGNETVRRIDPSGRITLVAGRPGTYGHRGDGKAATAALLDD 673
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
+ + P SL + D GN +R++
Sbjct: 674 PRGLALGPDGSLYIADSGNDVIRRV 698
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 48/100 (48%), Gaps = 5/100 (5%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
G+T V G +A + P V +DD G LYVA+ +R+I +TT+AG +
Sbjct: 546 DGFTADV-GDATKAHLDDPSAVAVDDDGVLYVAEGYR--VRRIAQGRITTVAGKSTESGS 602
Query: 179 FRD-GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
D GP+ +A + + SL V D GN +R+I
Sbjct: 603 AGDGGPAINATLYQPSGLA-LGDDGSLYVADTGNETVRRI 641
>gi|290983166|ref|XP_002674300.1| predicted protein [Naegleria gruberi]
gi|284087889|gb|EFC41556.1| predicted protein [Naegleria gruberi]
Length = 1435
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 65/139 (46%), Gaps = 8/139 (5%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSR--GRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
S GEL+ DE NS I K+ S G +VAG Y+G G+ A +P V
Sbjct: 172 SSQGELYFADESNSLIRKLVIANGTVSNVAGNVVAG----YSGD-GGEATSASLRYPSSV 226
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV-VYVR 199
+ + G +Y+ADT N IRK+ + G+ G G+ + +F + V
Sbjct: 227 YVSNSGEIYIADTFNCVIRKVSNLGIIGTVAGVGGQCGYNQDSINATESKLNFPTGITVS 286
Query: 200 PTCSLLVIDRGNAALRQIS 218
+ L ++D+GN +R++S
Sbjct: 287 DSGDLYIVDKGNHRIRKVS 305
Score = 41.6 bits (96), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 54/101 (53%), Gaps = 8/101 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGH-VDGKPNEARFN 135
P I VS+ G+L+ VD+ N I K++ +V G F TG+ + G A+ N
Sbjct: 280 PTGITVSDSGDLYIVDKGNHRIRKVSATTGLIET--IVGGGFN--TGYGISGT--SAQLN 333
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNV 176
+P + ++++ L+++DT + + K + G+ T+ G SNV
Sbjct: 334 NPTQILLNNQNELFISDTNHHRLLKYSN-GIVTLVAGYSNV 373
>gi|171913873|ref|ZP_02929343.1| NHL repeat containing protein [Verrucomicrobium spinosum DSM 4136]
Length = 370
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 72/148 (48%), Gaps = 13/148 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P+ + DG ++ + I ++TP + + +GYTG DG P A F+
Sbjct: 60 PFGVVRGPDGAIWYTEYTGQRIRRVTPDGKIET---MAGNGKKGYTG--DGGPATAASFD 114
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAG-GKSNVAGFRDGPSEDAKFSN 191
P + D GN Y+AD N AIRK+ DA +TT+AG G+ G DGPSE A+
Sbjct: 115 LPHEIRFDKAGNYYIADMSNHAIRKV-DAKTGIITTLAGTGRGGYTG-DDGPSEKAELKQ 172
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISL 219
+ + P L + D GN +R+I +
Sbjct: 173 PHSIQF-GPDGDLYICDIGNHVIRKIDM 199
>gi|224536973|ref|ZP_03677512.1| hypothetical protein BACCELL_01849 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521412|gb|EEF90517.1| hypothetical protein BACCELL_01849 [Bacteroides cellulosilyticus
DSM 14838]
Length = 446
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 59/117 (50%), Gaps = 7/117 (5%)
Query: 108 YSRGRL-----VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG 162
Y+ GRL + GS + G+ DG ++A+FN P+ D+ N YV D N IRK+
Sbjct: 329 YATGRLMQPYILCGS-KNNKGYTDGPGSKAQFNEPQQGCFDNDDNFYVCDQNNNLIRKVE 387
Query: 163 DAGVTTIAGGKSNVAGFRDGP-SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+G + G+ G+ DG ++A+F F + Y R T + D+ N +R I+
Sbjct: 388 PSGQVSTFAGRREEWGWADGDLRKEARFDRPFGIAYNRNTSEFYIADKNNKRIRIIT 444
>gi|405375498|ref|ZP_11029528.1| putative hemolysin [Chondromyces apiculatus DSM 436]
gi|397086225|gb|EJJ17355.1| putative hemolysin [Myxococcus sp. (contaminant ex DSM 436)]
Length = 855
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 87/180 (48%), Gaps = 27/180 (15%)
Query: 55 VLQFENGYL--VETVIEGNEIGVV-------PYKIRVSEDG-ELFAVDEVNSNIVKITPP 104
V NG + ++ V EG+ + + P + VSEDG L+ V+ S +V+I
Sbjct: 686 VADLNNGVIRRIDLVAEGHPVTTLQGDWLYRPSGVTVSEDGGTLYVVESGMSRVVRIRDG 745
Query: 105 LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA 164
++ +VAG+ G+ DG P ++F G+ + G+L V+D N +R+I
Sbjct: 746 VTS-----VVAGTTPGFR---DGAPESSQFLPYLGIAVLKDGSLAVSDPGNYRVRRIHLD 797
Query: 165 G------VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G VTT+AG S G DGP EDA+ +V V P L V D GNA +R I+
Sbjct: 798 GNGQARKVTTLAG--SGTYGHADGPGEDAQLVLPAGLV-VGPDGRLYVADSGNALVRTIT 854
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 2/86 (2%)
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
N P G+ D GN+YV+DT + IR+I G + G + G +DGP++ A F+
Sbjct: 616 LNGPMGIATDAAGNVYVSDTDHYVIRRIDPEGKVELFAGST--PGLQDGPAKQAAFNQPA 673
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISL 219
+ +LLV D N +R+I L
Sbjct: 674 GLTVTPDGSALLVADLNNGVIRRIDL 699
>gi|323343289|ref|ZP_08083516.1| hypothetical protein HMPREF0663_10051 [Prevotella oralis ATCC
33269]
gi|323095108|gb|EFZ37682.1| hypothetical protein HMPREF0663_10051 [Prevotella oralis ATCC
33269]
Length = 436
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 64/122 (52%), Gaps = 5/122 (4%)
Query: 99 VKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAI 158
+ +T P + Y R L + + G GH DGK +A+F++P + D GN+YVAD N I
Sbjct: 320 IDVTDPENTYRR--LSSSNISG--GHRDGKLEKAQFHNPTQIYCDADGNIYVADRNNHCI 375
Query: 159 RKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
R+I + G G++DG +A F N+ + + ++ V D GN +R+++
Sbjct: 376 RRISPDDMVETVLGMPETKGWKDGAKSEALF-NEPTGIGIGKDGAVYVADWGNGRVRKLT 434
Query: 219 LN 220
+N
Sbjct: 435 IN 436
Score = 41.6 bits (96), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P +I DG ++ D N I +I+P L +G+ DG +EA FN
Sbjct: 354 PTQIYCDADGNIYVADRNNHCIRRISP--DDMVETVLGMPETKGWK---DGAKSEALFNE 408
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ + G +YVAD N +RK+
Sbjct: 409 PTGIGIGKDGAVYVADWGNGRVRKL 433
>gi|223937730|ref|ZP_03629631.1| NHL repeat containing protein [bacterium Ellin514]
gi|223893523|gb|EEF59983.1| NHL repeat containing protein [bacterium Ellin514]
Length = 1474
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 55/99 (55%), Gaps = 2/99 (2%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF 179
G G +G +A+FN P+G+ +D G L+V+DT N IRKI GV T G S VAG
Sbjct: 1051 GVPGFTNGPNLKAQFNAPQGICVDGLGTLFVSDTGNNVIRKISTNGVVTTFAG-SGVAGT 1109
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
DG +A F + + + +L V D GN+ +R+++
Sbjct: 1110 HDGVGTNASFLAPTGIA-LDSSNNLYVADSGNSLIRKVT 1147
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 43/92 (46%), Gaps = 6/92 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G LF D N+ I KI+ + G + + G G DG A F
Sbjct: 1068 PQGICVDGLGTLFVSDTGNNVIRKIS------TNGVVTTFAGSGVAGTHDGVGTNASFLA 1121
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
P G+ +D NLYVAD+ N IRK+ G+ T
Sbjct: 1122 PTGIALDSSNNLYVADSGNSLIRKVTPDGLVT 1153
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 66/145 (45%), Gaps = 10/145 (6%)
Query: 80 IRVSEDGELF-AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPK 138
+ + DG LF +++ S ++ S L AG GY +DG ARF PK
Sbjct: 1216 LTLGLDGTLFGSMNTTASAVLGRVLRFSSTGGYELYAGDLSGY---MDGPRLLARFQRPK 1272
Query: 139 GVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 197
+ G++ V+D IRKI D VTT+AG G R+G A F N V
Sbjct: 1273 ALATASDGSIIVSDWTR--IRKIHSDGRVTTLAGAGD--LGLRNGSGLFAAF-NQLGAVT 1327
Query: 198 VRPTCSLLVIDRGNAALRQISLNQD 222
V ++ D N ++R+IS++ D
Sbjct: 1328 VDSAGNIYAADAANHSIRKISVDMD 1352
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 40/85 (47%), Gaps = 8/85 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + DG + D + I KI +S GR+ + G G +G A FN
Sbjct: 1271 PKALATASDGSIIVSDW--TRIRKI------HSDGRVTTLAGAGDLGLRNGSGLFAAFNQ 1322
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
VT+D GN+Y AD N +IRKI
Sbjct: 1323 LGAVTVDSAGNIYAADAANHSIRKI 1347
>gi|379719861|ref|YP_005311992.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus 3016]
gi|378568533|gb|AFC28843.1| copper amine oxidase domain-containing protein [Paenibacillus
mucilaginosus 3016]
Length = 535
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 16/151 (10%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-----QGY--TGHVDGK 128
VP + V DG D N I K+T R ++AG F +GY G +DG
Sbjct: 67 VPAGLAVLPDGTAAVSDSRNGVIRKLTG-----GRVDVLAGVFYRKDDKGYPVGGLLDGA 121
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDA 187
N + F P G++ G+LYVAD N AIR+I G VTT+AG S G +DG A
Sbjct: 122 ANASLFQEPLGLSAGPDGSLYVADAGNHAIRRIDAKGNVTTVAG--SGRLGAKDGKGAAA 179
Query: 188 KFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+F DV P +L V D +R+IS
Sbjct: 180 EFYRPGDVAAA-PDGTLYVADTLGHTIRRIS 209
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 73/159 (45%), Gaps = 20/159 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITP-----PLSQYSRGRLVA---GSFQGYTGHVDGK 128
P + + DG L+ D + I +I+P L+ SR R+V G DG
Sbjct: 184 PGDVAAAPDGTLYVADTLGHTIRRISPQGEVTTLTAPSR-RVVEATPGQVAAAGDFADGP 242
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNV--------AG 178
+A+FN P G+ +D KGNLYV+D+ N IR I VTT+AGG + G
Sbjct: 243 LAQAKFNEPTGIALDAKGNLYVSDSGNQRIRYIDLAKGTVTTVAGGGTAAELKDMYVPGG 302
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
F +G + A+ + + V LL+ D N +R +
Sbjct: 303 FSNGAALQARLNYPMGIA-VTEEGGLLIADSQNHDVRYL 340
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 68/157 (43%), Gaps = 19/157 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + DG L+ D N I +I ++G + + G G DGK A F
Sbjct: 130 PLGLSAGPDGSLYVADAGNHAIRRID------AKGNVTTVAGSGRLGAKDGKGAAAEFYR 183
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG----------GKSNVAG-FRDGPS 184
P V G LYVADTL IR+I G VTT+ G+ AG F DGP
Sbjct: 184 PGDVAAAPDGTLYVADTLGHTIRRISPQGEVTTLTAPSRRVVEATPGQVAAAGDFADGPL 243
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
AKF N+ + + +L V D GN +R I L +
Sbjct: 244 AQAKF-NEPTGIALDAKGNLYVSDSGNQRIRYIDLAK 279
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G +G +AR N+P G+ + ++G L +AD+ N +R + ++T+AG G DG
Sbjct: 302 GFSNGAALQARLNYPMGIAVTEEGGLLIADSQNHDVRYLFGGQLSTLAGAGEQKMGLLDG 361
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
A + DV V S+LV D N LR+++
Sbjct: 362 MEGKAGLNRPADVA-VLGDGSVLVADSFNNRLRRLT 396
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 49/112 (43%), Gaps = 8/112 (7%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG- 171
L + G G +G EA F P G+ + G V+D+ N IRK+ V +AG
Sbjct: 44 LTTAAGTGRLGSANGTGLEASFRVPAGLAVLPDGTAAVSDSRNGVIRKLTGGRVDVLAGV 103
Query: 172 --GKSN----VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
K + V G DG + + F + P SL V D GN A+R+I
Sbjct: 104 FYRKDDKGYPVGGLLDGAANASLFQEPLG-LSAGPDGSLYVADAGNHAIRRI 154
>gi|290975356|ref|XP_002670409.1| predicted protein [Naegleria gruberi]
gi|284083967|gb|EFC37665.1| predicted protein [Naegleria gruberi]
Length = 2200
Score = 59.7 bits (143), Expect = 3e-06, Method: Composition-based stats.
Identities = 43/145 (29%), Positives = 74/145 (51%), Gaps = 6/145 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +S++GE++ D +N I KI + Y +AG+ + G G + N
Sbjct: 181 PNGLSISQNGEIYIADTLNHRIRKI----NSYGVISTIAGTGRASFGGDGGFAVLSPLNS 236
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV + G +Y+ADTLN IRKI G +TT+AG G G + +++ ++ +
Sbjct: 237 PNGVHISQNGEIYIADTLNHRIRKIFVNGLITTVAGTGRASFGGDGGLAINSQLNSPYG- 295
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V+V + + + D N +R+I +N
Sbjct: 296 VHVSQSGEIYIADTLNHRIRKIFVN 320
Score = 55.8 bits (133), Expect = 5e-05, Method: Composition-based stats.
Identities = 49/145 (33%), Positives = 74/145 (51%), Gaps = 10/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P + +S++GE++ D +N I KI + G + VAG+ + G G ++
Sbjct: 237 PNGVHISQNGEIYIADTLNHRIRKI------FVNGLITTVAGTGRASFGGDGGLAINSQL 290
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
N P GV + G +Y+ADTLN IRKI G +TTIAG S+ + DG A N
Sbjct: 291 NSPYGVHVSQSGEIYIADTLNHRIRKIFVNGTITTIAGSSSDGSFGGDGGLSIASRLNSP 350
Query: 194 DVVYVRPTCS-LLVIDRGNAALRQI 217
V+V P + +L+ D N +R+I
Sbjct: 351 KGVFVSPNNNEILIADTSNNRIRKI 375
Score = 53.9 bits (128), Expect = 2e-04, Method: Composition-based stats.
Identities = 46/144 (31%), Positives = 72/144 (50%), Gaps = 16/144 (11%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD--------GKPNEARFNHPK 138
EL+ D +N I+ ++ LS Y +V+G Q H D G + A+ N P
Sbjct: 127 ELYMTDVLNHRILSVS--LSSYLV-TIVSGK-QNCNEHSDCDGFSGDGGLASRAKLNSPN 182
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAG-GKSNVAGFRDGPSEDAKFSNDFDVV 196
G+++ G +Y+ADTLN IRKI GV +TIAG G+++ G DG N + V
Sbjct: 183 GLSISQNGEIYIADTLNHRIRKINSYGVISTIAGTGRASFGG--DGGFAVLSPLNSPNGV 240
Query: 197 YVRPTCSLLVIDRGNAALRQISLN 220
++ + + D N +R+I +N
Sbjct: 241 HISQNGEIYIADTLNHRIRKIFVN 264
Score = 39.3 bits (90), Expect = 4.9, Method: Composition-based stats.
Identities = 31/95 (32%), Positives = 44/95 (46%), Gaps = 10/95 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN---EAR 133
PY + VS+ GE++ D +N I KI + G + + G G +R
Sbjct: 293 PYGVHVSQSGEIYIADTLNHRIRKI------FVNGTITTIAGSSSDGSFGGDGGLSIASR 346
Query: 134 FNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAGVT 167
N PKGV + N + +ADT N IRKI + G T
Sbjct: 347 LNSPKGVFVSPNNNEILIADTSNNRIRKIIECGTT 381
>gi|223939666|ref|ZP_03631540.1| NHL repeat containing protein [bacterium Ellin514]
gi|223891717|gb|EEF58204.1| NHL repeat containing protein [bacterium Ellin514]
Length = 452
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 73/162 (45%), Gaps = 15/162 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G ++ D N+ I K+T + + +AGS G +DG N+A F
Sbjct: 235 PIGLALDSAGNMYVADTGNNAIRKLTLEGTNWVV-TTIAGSTNQQNGSLDGTNNQALFTW 293
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P+ +D GN+YVAD+ N IRK+ G V + GG++ + DG S A F N
Sbjct: 294 PESPAVDSAGNVYVADSYNYTIRKVTVVGTDYVVSTVGGRAGFSYPLDGVSTKAAFVNPV 353
Query: 194 DVVYVRPTCSLLVIDR-GNAALRQISLNQDDCEYQYNSISPT 234
V +D+ GN + S N C + + SPT
Sbjct: 354 SV----------ALDKNGNLYVADHSYNTVRCGWVESMASPT 385
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 74/147 (50%), Gaps = 8/147 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G +F + + I K++P + + +AG G +G DG ++ARF
Sbjct: 119 PTGLAVDASGNVFVAEGYANTIRKLSPIGTNWIV-TTIAG-LAGSSGSADGTNSDARFYL 176
Query: 137 PKG-VTMDDKGNLYVADTLNLAIRKI---GDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
P G + D GNL+V D + IR++ G V T G + + GF+DG + DA F +
Sbjct: 177 PYGNMACDTNGNLFVTDGYD-TIRQLRPDGTNWVVTTIAGAAGIHGFKDGTNNDALFYSP 235
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISL 219
+ + ++ V D GN A+R+++L
Sbjct: 236 IGLA-LDSAGNMYVADTGNNAIRKLTL 261
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 63/120 (52%), Gaps = 8/120 (6%)
Query: 76 VPY-KIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
+PY + +G LF D ++ I ++ P + + +AG+ G G DG N+A F
Sbjct: 176 LPYGNMACDTNGNLFVTDGYDT-IRQLRPDGTNWVV-TTIAGA-AGIHGFKDGTNNDALF 232
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFS 190
P G+ +D GN+YVADT N AIRK+ G VTTIAG + G DG + A F+
Sbjct: 233 YSPIGLALDSAGNMYVADTGNNAIRKLTLEGTNWVVTTIAGSTNQQNGSLDGTNNQALFT 292
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 69/145 (47%), Gaps = 8/145 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G ++ D N +++ + + +AG+ Q Y G DG EARF
Sbjct: 61 PQSVACDASGNVYVADY-NGRVIRKVEVIGKDWVVTTIAGTNQAY-GTKDGTNAEARFTG 118
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSND 192
P G+ +D GN++VA+ IRK+ G VTTIA G + +G DG + DA+F
Sbjct: 119 PTGLAVDASGNVFVAEGYANTIRKLSPIGTNWIVTTIA-GLAGSSGSADGTNSDARFYLP 177
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
+ + +L V D G +RQ+
Sbjct: 178 YGNMACDTNGNLFVTD-GYDTIRQL 201
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 3/68 (4%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK---IGDAGVTTIAGGKSNVAGFRDG 182
DG A F P+ V D GN+YVAD IRK IG V T G + G +DG
Sbjct: 50 DGTNTAAHFYSPQSVACDASGNVYVADYNGRVIRKVEVIGKDWVVTTIAGTNQAYGTKDG 109
Query: 183 PSEDAKFS 190
+ +A+F+
Sbjct: 110 TNAEARFT 117
>gi|365121900|ref|ZP_09338810.1| hypothetical protein HMPREF1033_02156 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643613|gb|EHL82920.1| hypothetical protein HMPREF1033_02156 [Tannerella sp.
6_1_58FAA_CT1]
Length = 443
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 58/110 (52%), Gaps = 2/110 (1%)
Query: 110 RGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTI 169
+ L AG G++DG + A FN P+ + +D+ NLY+ADT N IRKI G +
Sbjct: 335 KHELFAGIVNN-AGYLDGIGSYAMFNQPRQLILDEDDNLYIADTENHVIRKITPQGQVST 393
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G++ VAG++DG E A F V + + + D N +R++++
Sbjct: 394 VIGQAGVAGYQDGDPEVALFDRPHGVC-INKEGIIYIGDYENQCIRRLAI 442
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 46/88 (52%), Gaps = 9/88 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P ++ + ED L+ D N I KITP +G++ V G G G+ DG P A F
Sbjct: 361 PRQLILDEDDNLYIADTENHVIRKITP------QGQVSTVIGQ-AGVAGYQDGDPEVALF 413
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG 162
+ P GV ++ +G +Y+ D N IR++
Sbjct: 414 DRPHGVCINKEGIIYIGDYENQCIRRLA 441
>gi|375146499|ref|YP_005008940.1| cell surface receptor IPT/TIG domain-containing protein [Niastella
koreensis GR20-10]
gi|361060545|gb|AEV99536.1| cell surface receptor IPT/TIG domain protein [Niastella koreensis
GR20-10]
Length = 500
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 74/144 (51%), Gaps = 20/144 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G L + NS I +ITP L S +AGS G G++DG A+F+
Sbjct: 224 PDDIALDAAGNLIVSEIGNSKIRRITP-LGVVST---IAGS--GTYGYLDGPGLTAQFHF 277
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+G+T+D+ GN+YVA+ N +IRKI AG VTT F +GP + ND
Sbjct: 278 PQGLTLDNAGNIYVAEYQNQSIRKIDPAGNVTTF---------FINGPQGYVAYPNDV-- 326
Query: 196 VYVRPTCSLLVIDRGNAALRQISL 219
V ++ V D+ N + +ISL
Sbjct: 327 --VVDASNVYVTDQSNNRICKISL 348
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 47/76 (61%), Gaps = 6/76 (7%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G ++ D+ + I ++ P + L+AG + +G++DGKP +ARF+ P+G+ +D
Sbjct: 428 GNIYVTDDSTNGIYRVDP----NGKLSLIAGGVR--SGYIDGKPQDARFSGPRGIVIDAS 481
Query: 146 GNLYVADTLNLAIRKI 161
GNL+VAD N IRKI
Sbjct: 482 GNLFVADIGNNCIRKI 497
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 78/190 (41%), Gaps = 52/190 (27%)
Query: 74 GVVPYKIRVSEDG-ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEA 132
G V Y V D ++ D+ N+ I KI+ G L S G G VDG P +A
Sbjct: 318 GYVAYPNDVVVDASNVYVTDQSNNRICKIS-----LQTGLLSVLSGNGNWGMVDGDPQQA 372
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRK--------------------------IGDAGV 166
+F P + +D+ NL +AD +N +RK +G+ V
Sbjct: 373 QFYQPAKMALDNNNNLIIADKINGRVRKVIKASGYTSSVTLNVFSTPAGLVMDGVGNIYV 432
Query: 167 T-----------------TIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDR 209
T IAGG +G+ DG +DA+FS +V + + +L V D
Sbjct: 433 TDDSTNGIYRVDPNGKLSLIAGGVR--SGYIDGKPQDARFSGPRGIV-IDASGNLFVADI 489
Query: 210 GNAALRQISL 219
GN +R+I +
Sbjct: 490 GNNCIRKIIM 499
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 37/68 (54%), Gaps = 3/68 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRD 181
G VD A+F P + +D GNL V++ N IR+I G V+TIAG S G+ D
Sbjct: 210 GAVDASGINAQFWSPDDIALDAAGNLIVSEIGNSKIRRITPLGVVSTIAG--SGTYGYLD 267
Query: 182 GPSEDAKF 189
GP A+F
Sbjct: 268 GPGLTAQF 275
>gi|116672456|ref|YP_833389.1| NHL repeat-containing protein [Arthrobacter sp. FB24]
gi|116612565|gb|ABK05289.1| NHL repeat containing protein [Arthrobacter sp. FB24]
Length = 672
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 56/103 (54%), Gaps = 6/103 (5%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTI--AGGKS 174
G G +DG +EA F G+ D GN++VAD+ A+RK I DAG T+ A GK
Sbjct: 382 NGLEGLLDGAAHEAWFAQSSGLAEDADGNIWVADSETSALRKLVIDDAGTVTVESAVGKG 441
Query: 175 NVA-GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
GFRDGP+ +A+ + V V P S+ + D N A+R+
Sbjct: 442 LFDFGFRDGPAAEARLQHPLGVT-VLPDGSVAIADTYNGAVRR 483
Score = 38.9 bits (89), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA--GVTTIAGGKS 174
G DG EAR HP GVT+ G++ +ADT N A+R+ A V+T+A G S
Sbjct: 446 GFRDGPAAEARLQHPLGVTVLPDGSVAIADTYNGAVRRYDPATGTVSTLARGLS 499
>gi|37522270|ref|NP_925647.1| hypothetical protein gll2701 [Gloeobacter violaceus PCC 7421]
gi|35213270|dbj|BAC90642.1| gll2701 [Gloeobacter violaceus PCC 7421]
Length = 351
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 46/137 (33%), Positives = 70/137 (51%), Gaps = 12/137 (8%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARFNHPKGVTMD 143
G L+ D + + ++ + GR+ +AG+ + G G +ARFN P GV +D
Sbjct: 109 GNLYIADHNHHRVRRVG------TDGRIETIAGTGEADYGGDGGPAKQARFNDPAGVAVD 162
Query: 144 DKGNLYVADTLNLAIRKIG-DAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 201
GN+ VADT N IR IG D + T+AG G++ +G GP+ A+ + V V P
Sbjct: 163 ALGNVLVADTYNHRIRTIGPDGTIRTVAGTGQAGYSG-DGGPATAARLDFPWGVA-VAPD 220
Query: 202 CSLLVIDRGNAALRQIS 218
+L+ D GN +R I
Sbjct: 221 GRILIADTGNNRIRSIG 237
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 54/167 (32%), Positives = 80/167 (47%), Gaps = 20/167 (11%)
Query: 62 YLVETVIEGNEIGVV-----PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--- 113
+ + +V+ GN I V P + + GELF V+ + I+K+ P GRL
Sbjct: 25 FALRSVVAGNHIKVAEEPLYPVFVLPAPAGELFVVEHSRNQILKLVP-------GRLPTV 77
Query: 114 VAGSFQG-YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG 171
+AG+ Y+G DG P G+ D GNLY+AD + +R++G D + TIAG
Sbjct: 78 IAGNGTSDYSG--DGGPATRAGLFMMGIARDRAGNLYIADHNHHRVRRVGTDGRIETIAG 135
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G GP++ A+F ND V V ++LV D N +R I
Sbjct: 136 TGEADYGGDGGPAKQARF-NDPAGVAVDALGNVLVADTYNHRIRTIG 181
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 65/136 (47%), Gaps = 15/136 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + V+ DG + D N+ I I P + R VAG+ Q G G +AR
Sbjct: 212 PWGVAVAPDGRILIADTGNNRIRSIGPDGTI----RTVAGTGQAGFGGDGGPAVKARLER 267
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+ D +GNL+VADT N +R+I D ++T+AGG+ P A+ ++ F
Sbjct: 268 PQLAVADHRGNLFVADTNNNRVRRIAPDGTISTVAGGE---------PPAAARLNDPF-A 317
Query: 196 VYVRPTCSLLVIDRGN 211
V V L + D GN
Sbjct: 318 VGVDERGHLYIADTGN 333
Score = 45.8 bits (107), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKP-NEARF 134
P + V G + D N I I P + R VAG+ Q GY+G DG P AR
Sbjct: 156 PAGVAVDALGNVLVADTYNHRIRTIGPDGTI----RTVAGTGQAGYSG--DGGPATAARL 209
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ P GV + G + +ADT N IR IG D + T+AG G GP+ A+
Sbjct: 210 DFPWGVAVAPDGRILIADTGNNRIRSIGPDGTIRTVAGTGQAGFGGDGGPAVKARLERP- 268
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
+ +L V D N +R+I+
Sbjct: 269 QLAVADHRGNLFVADTNNNRVRRIA 293
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 40/82 (48%), Gaps = 13/82 (15%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G LF D N+ + +I P + + VAG P AR N P V +D++
Sbjct: 277 GNLFVADTNNNRVRRIAPDGTIST----VAGGE---------PPAAARLNDPFAVGVDER 323
Query: 146 GNLYVADTLNLAIRKIGDAGVT 167
G+LY+ADT N + KI +G T
Sbjct: 324 GHLYIADTGNFRVLKIDGSGRT 345
>gi|158319004|ref|YP_001511512.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158114409|gb|ABW16606.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 892
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 66/147 (44%), Gaps = 8/147 (5%)
Query: 74 GVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
G+ PY + + G LF + I K+T L GY+G +G A+
Sbjct: 575 GLSPYSVAIDPQGTLFITSLSSDRIQKVT---RTGEVSDLAGTGADGYSGD-NGPATAAK 630
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR-DGPSEDAKFSN 191
N P D GN+Y+ D N IRKI D +TTIAG + AGF DG A N
Sbjct: 631 LNGPGSAVPDKNGNIYIPDAQNYRIRKITPDGIITTIAG--TGTAGFSGDGGPATAAQIN 688
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ V + P S+ + D N +R+I+
Sbjct: 689 SAEKVAIGPDGSIYIADYDNHRIRKIT 715
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 74/146 (50%), Gaps = 13/146 (8%)
Query: 79 KIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKP-NEARFNH 136
K+ + DG ++ D N I KITP + +AG+ QGY+G DG P A+ +
Sbjct: 692 KVAIGPDGSIYIADYDNHRIRKITPDGIINT----IAGTGLQGYSG--DGGPATAAKLDG 745
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFD 194
P V + D G LY+A+ + I+KI G VTT+AG G+ +G GP+ A+ S
Sbjct: 746 PNDVELGDDGTLYIANLGSNTIQKITKDGIVTTVAGNGQKGFSG-DGGPATAAQLS--VP 802
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLN 220
V + + + D GN +R++ N
Sbjct: 803 SVSLGNGGEIYIADYGNNRVRKVDPN 828
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 73/144 (50%), Gaps = 9/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFN 135
P + + +DG L+ + ++ I KIT + VAG+ Q G++G DG P A
Sbjct: 746 PNDVELGDDGTLYIANLGSNTIQKIT----KDGIVTTVAGNGQKGFSG--DGGPATAAQL 799
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
V++ + G +Y+AD N +RK+ G +TTIAG + +G G + A+F N+
Sbjct: 800 SVPSVSLGNGGEIYIADYGNNRVRKVDPNGTITTIAGTGAEGSGGDGGQATAAQF-NEPS 858
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
V +L + D GN LR+I+
Sbjct: 859 SVAEDADGALYIADSGNNRLRRIA 882
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 5/92 (5%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ + GE++ D N+ + K+ P + + +AG+ +G G+ A+FN P
Sbjct: 804 VSLGNGGEIYIADYGNNRVRKVDPNGTITT----IAGTGAEGSGGDGGQATAAQFNEPSS 859
Query: 140 VTMDDKGNLYVADTLNLAIRKIG-DAGVTTIA 170
V D G LY+AD+ N +R+I D +TT+A
Sbjct: 860 VAEDADGALYIADSGNNRLRRIAPDGTITTVA 891
Score = 38.1 bits (87), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 59/117 (50%), Gaps = 13/117 (11%)
Query: 84 EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKP-NEARFNHPKGVT 141
++G ++ D N I KITP + +AG+ G++G DG P A+ N + V
Sbjct: 641 KNGNIYIPDAQNYRIRKITPDGIITT----IAGTGTAGFSG--DGGPATAAQINSAEKVA 694
Query: 142 MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDV 195
+ G++Y+AD N IRKI D + TIAG + + G+ GP+ AK DV
Sbjct: 695 IGPDGSIYIADYDNHRIRKITPDGIINTIAG--TGLQGYSGDGGPATAAKLDGPNDV 749
>gi|290977429|ref|XP_002671440.1| predicted protein [Naegleria gruberi]
gi|284085009|gb|EFC38696.1| predicted protein [Naegleria gruberi]
Length = 1494
Score = 59.3 bits (142), Expect = 5e-06, Method: Composition-based stats.
Identities = 42/138 (30%), Positives = 70/138 (50%), Gaps = 12/138 (8%)
Query: 114 VAGSFQGYTGHVDGKPN-EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
+AG+ YTG DG +A+ N P GV + G +Y++D+ N IRKI G ++T+ G
Sbjct: 477 IAGTISKYTG--DGSSAIQAQLNGPTGVAISPNGEVYISDSSNDVIRKIDTNGAISTVVG 534
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSI 231
S GF D + A+ +N + ++ P L++ D N +R+ S++ + +I
Sbjct: 535 SSS---GFLDASARRAQLANPMGIAFL-PNGDLIISDAKNNRIRKFSISSGNV----TTI 586
Query: 232 SPTDILMVVGAVLVGYVT 249
+ T + G LVG T
Sbjct: 587 AGTGLTTYNGEGLVGVAT 604
>gi|302821389|ref|XP_002992357.1| hypothetical protein SELMODRAFT_430570 [Selaginella moellendorffii]
gi|300139773|gb|EFJ06507.1| hypothetical protein SELMODRAFT_430570 [Selaginella moellendorffii]
Length = 220
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 8/103 (7%)
Query: 163 DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC-SLLVIDRGNAALRQISLNQ 221
DA VTTI GG S GF DGP + A+FS++ + +C SLL+ DRGN +R+I ++
Sbjct: 20 DASVTTIVGGSSRKPGFADGPGDTARFSSESSLAC---SCGSLLIADRGNRLIREIQIDD 76
Query: 222 -DDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSR 263
C+ +++S + ++ VL+G L GPF +
Sbjct: 77 PKSCDSSDSAVSGSQKWALIPVVLLGVC---LGMPLGPFIIWK 116
>gi|224125014|ref|XP_002319481.1| predicted protein [Populus trichocarpa]
gi|222857857|gb|EEE95404.1| predicted protein [Populus trichocarpa]
Length = 1065
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 79/158 (50%), Gaps = 16/158 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRG-RLVAGSFQGY------TGHVDGKP 129
P + +S D EL+ D +S+I LS ++G RL+AG + G DG
Sbjct: 752 PSGVSLSPDFELYVADSESSSI----RVLSLRTKGTRLLAGGDPIFPDNLFKFGDHDGIG 807
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDA 187
+E HP GV G +Y+AD+ N I+K+ A VTTIAG + AGF+DG + A
Sbjct: 808 SEVLLQHPLGVLHAKDGLIYIADSYNHKIKKLDLATKRVTTIAG--TGKAGFKDGKALTA 865
Query: 188 KFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCE 225
+ S ++ L++ D N+ +R + LN+ + E
Sbjct: 866 QLSEPAGLIEAE-NGRLIIADTNNSVIRYLDLNKGEAE 902
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 49/96 (51%), Gaps = 18/96 (18%)
Query: 89 FAVDEVNSNIVKITPPLSQYSRGRLVA----GSF------QGYTGHVDGKPNEARFNHPK 138
A+D +N+ + +S + R+V G+F G G DG ++A FN P+
Sbjct: 581 LAIDVLNNRLF-----ISDSNHNRIVVTDLDGNFIAQIGSSGEEGLRDGSFDDATFNRPQ 635
Query: 139 GVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
G+ + K N LYVADT N A+R++ V T+AG
Sbjct: 636 GLAYNSKKNILYVADTENHALREVDFVSEKVRTLAG 671
>gi|375146761|ref|YP_005009202.1| NHL repeat containing protein [Niastella koreensis GR20-10]
gi|361060807|gb|AEV99798.1| NHL repeat containing protein [Niastella koreensis GR20-10]
Length = 439
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 54/155 (34%), Positives = 76/155 (49%), Gaps = 24/155 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG---HVDGKPNEAR 133
P IR + G LF V +N + V + +Y+ G + G TG +V+G A+
Sbjct: 292 PAGIRFDKSGNLFVV--LNGDHV-----IKKYTAGAWAGSTIAGQTGVPGYVNGAAAVAK 344
Query: 134 FNHPKGVTMDDKGNLYVA-------DTLN--LAIRKIGDA--GVTTIAGGKSNVAGFRDG 182
F+HP G+ +D GNLYVA +T N AIR I A V+T AG S AG+ D
Sbjct: 345 FDHPWGLAIDTAGNLYVAGNGTYDGNTGNSDQAIRYIEAASFNVSTFAG--SGSAGYADA 402
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
E A FS + V +L V+D+ N +R+I
Sbjct: 403 VGEAAAFSAPTGIA-VDKNGTLYVLDKNNNRIRKI 436
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 61/126 (48%), Gaps = 10/126 (7%)
Query: 119 QGYTGHVDGKPNEARFN-------HPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIA 170
G G +G +A F+ G+ +DD NLYVAD N IRKI +A VTT
Sbjct: 129 SGKAGFANGMGTDAMFDFGGQSWYRSMGIIVDDNLNLYVADPGNHCIRKIDSNANVTTFC 188
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNS 230
G S AG+ DG AKFS +DV + ++ +D N +R+I+ + + + S
Sbjct: 189 GSPSG-AGYADGKGTAAKFSLPYDVAF-DAQGNIWCVDPANWDIRKIAPDGTATTWAWGS 246
Query: 231 ISPTDI 236
SP +
Sbjct: 247 QSPWSV 252
>gi|301064161|ref|ZP_07204608.1| RHS repeat-associated core domain protein [delta proteobacterium
NaphS2]
gi|300441781|gb|EFK06099.1| RHS repeat-associated core domain protein [delta proteobacterium
NaphS2]
Length = 2050
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 13/148 (8%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQ-GYTGHVDGKPN-EAR 133
Y I G L D N+ I K+ + G + VAGS G++G DG P EA
Sbjct: 776 YGIATDSAGNLHIADWGNNRIRKVD------TNGIITTVAGSGDYGFSG--DGGPAIEAS 827
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKFSND 192
+ P G+ +D GNLY+ D+ N +RK+ G+ T G N + D GP+ +A S+
Sbjct: 828 LSFPMGIAIDSAGNLYILDSDNNRVRKVDTNGIITTVAGNGNWSYNGDGGPAVEASLSSA 887
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ + +L + D GN +R++ N
Sbjct: 888 ASGIAIDSAGNLYISDTGNYCIRKVDTN 915
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 74/150 (49%), Gaps = 15/150 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG-YTGHVDGKPN-EARF 134
P I + G L+ +D N+ + K+ + G + + G ++ + DG P EA
Sbjct: 831 PMGIAIDSAGNLYILDSDNNRVRKVD------TNGIITTVAGNGNWSYNGDGGPAVEASL 884
Query: 135 NHP-KGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFS 190
+ G+ +D GNLY++DT N IRK+ G +TT+AG + VAGF GP+ +A
Sbjct: 885 SSAASGIAIDSAGNLYISDTGNYCIRKVDTNGIITTVAG--NGVAGFSGDGGPAVEASLG 942
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ + +L ++D N +R++ N
Sbjct: 943 WAMGIA-IDSAGNLYILDGSNHRVRKVDTN 971
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 65/143 (45%), Gaps = 14/143 (9%)
Query: 80 IRVSEDGELFAVDEVNSNIVKI-TPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARFNHP 137
I + G L+ D N I K+ T + G VAG F G DG P EA
Sbjct: 891 IAIDSAGNLYISDTGNYCIRKVDTNGIITTVAGNGVAG-FSG-----DGGPAVEASLGWA 944
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSNDFD 194
G+ +D GNLY+ D N +RK+ G +TT+AG S+ GF GP+ +A
Sbjct: 945 MGIAIDSAGNLYILDGSNHRVRKVDTNGIITTVAG--SDDYGFSGDGGPAIEASLGYAVG 1002
Query: 195 VVYVRPTCSLLVIDRGNAALRQI 217
+ + +L + D N +R++
Sbjct: 1003 IA-IDSAENLYISDSSNHCIRRV 1024
Score = 42.0 bits (97), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 45/97 (46%), Gaps = 3/97 (3%)
Query: 126 DGKPN-EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GP 183
DG P EA G+ D GNL++AD N IRK+ G+ T G + D GP
Sbjct: 763 DGGPAIEASLRSSYGIATDSAGNLHIADWGNNRIRKVDTNGIITTVAGSGDYGFSGDGGP 822
Query: 184 SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ +A S + + +L ++D N +R++ N
Sbjct: 823 AIEASLSFPMGIA-IDSAGNLYILDSDNNRVRKVDTN 858
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 12/107 (11%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQ-GYTGHVDGKPN-EARFN 135
I + G L+ +D N + K+ + G + VAGS G++G DG P EA
Sbjct: 947 IAIDSAGNLYILDGSNHRVRKVD------TNGIITTVAGSDDYGFSG--DGGPAIEASLG 998
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
+ G+ +D NLY++D+ N IR++ G+ G DG
Sbjct: 999 YAVGIAIDSAENLYISDSSNHCIRRVDTGGIIATVAGNGIYGSIEDG 1045
>gi|421094122|ref|ZP_15554843.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
gi|410363263|gb|EKP14295.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
gi|456886734|gb|EMF97860.1| putative lipoprotein [Leptospira borgpetersenii str. 200701203]
Length = 358
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 72/136 (52%), Gaps = 9/136 (6%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G+ F + + I+KI P+ Q+S L AG+ G G +G + F P + +D +
Sbjct: 133 GDKFISCQDTAQILKI-DPMDQFS---LYAGNSSGIGGFQNGDRLNSLFKGPLFMDLDRE 188
Query: 146 GNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 203
NLYV + N IRKI V+T++GG V+G+ DG A+F + ++Y + T S
Sbjct: 189 RNLYVGELGNHTIRKINLNSETVSTLSGG---VSGYLDGDLTSAQFKSPSGIIYDQKTDS 245
Query: 204 LLVIDRGNAALRQISL 219
LLV D N +R+I L
Sbjct: 246 LLVADLQNHRIRKIDL 261
>gi|372267905|ref|ZP_09503953.1| hypothetical protein AlS89_08394 [Alteromonas sp. S89]
Length = 2227
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 92/198 (46%), Gaps = 25/198 (12%)
Query: 44 SSSKAPQADGNVLQFEN-GYLVETVIEGNEIGVVPY----------------KIRVSEDG 86
S + + +G+V+ + G +++TV G E+G ++ V+ DG
Sbjct: 861 SGQRIQRGNGSVVDISSLGGVIDTVAGGGEVGGTSILGDGGLAVDASLRYLGRVAVAPDG 920
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAG-SFQGYTGHVDGK-PNEARFNHPKGVTMDD 144
+ D+ V + +S + G L + S +G++G DG A +HP G+ D
Sbjct: 921 SFYFTDDHLIRKVDVDGYISTIA-GTLASNVSVRGFSG--DGAIATAATMDHPYGLDFCD 977
Query: 145 KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVRPTC 202
G++YVADT N +R+I GV T G + F GP+ DA + +DV+ P
Sbjct: 978 DGSIYVADTGNDRVRRIDRRGVITTIAGSEVIDTFAGDGGPATDASLNAPYDVI-CGPHG 1036
Query: 203 SLLVIDRGNAALRQISLN 220
S+ + D N +R++ +N
Sbjct: 1037 SIYIADSRNHRIRRVDVN 1054
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 71/146 (48%), Gaps = 14/146 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNI--VKITPPLSQYSRGRLVAGS-FQGYTGHVDGKP-NEA 132
PY + G ++ D N I V + +S VAGS +G++G DG P +A
Sbjct: 1027 PYDVICGPHGSIYIADSRNHRIRRVDVNGIIST------VAGSGARGFSG--DGGPATDA 1078
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
+ P G+T+D +GNLY+ D+ N IR++G D +TTIAG G + +A F +
Sbjct: 1079 SLSAPSGITLDPEGNLYIVDSGNRRIRRVGVDGRITTIAGNGGYKNTGDGGSALEAGFDD 1138
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
+ Y + + D G +R+I
Sbjct: 1139 PLGIAYA-ADGGIYISDSGEGGVRRI 1163
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 10/146 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARFN 135
P I + +G L+ VD N I ++ R +AG+ GY DG EA F+
Sbjct: 1083 PSGITLDPEGNLYIVDSGNRRIRRVGVD----GRITTIAGN-GGYKNTGDGGSALEAGFD 1137
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGF--RDGPSEDAKFSND 192
P G+ G +Y++D+ +R+IG D + T+AG KS GF GP+ A +
Sbjct: 1138 DPLGIAYAADGGIYISDSGEGGVRRIGTDGTIVTVAGNKSPYWGFGGDGGPAIQADMTGV 1197
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
DV + P SL ++D N +R++S
Sbjct: 1198 TDVA-IGPEGSLYLVDAYNFRIRKVS 1222
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 74/150 (49%), Gaps = 15/150 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKP-NEAR 133
PY + +DG ++ D N + +I RG + +AGS T DG P +A
Sbjct: 970 PYGLDFCDDGSIYVADTGNDRVRRID------RRGVITTIAGSEVIDTFAGDGGPATDAS 1023
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFS 190
N P V G++Y+AD+ N IR++ G ++T+AG S GF GP+ DA S
Sbjct: 1024 LNAPYDVICGPHGSIYIADSRNHRIRRVDVNGIISTVAG--SGARGFSGDGGPATDASLS 1081
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ + P +L ++D GN +R++ ++
Sbjct: 1082 APSGIT-LDPEGNLYIVDSGNRRIRRVGVD 1110
>gi|116619658|ref|YP_821814.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116222820|gb|ABJ81529.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 585
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 6/135 (4%)
Query: 84 EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMD 143
+ G L+ D N I I+ G AG F G A N P GV +D
Sbjct: 61 KTGNLYIADSANHRIRMISGSTISTVAGNGTAG-FAGD----KAAATSANLNTPSGVALD 115
Query: 144 DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRPTC 202
GN Y+AD+LN IRK+ +TT+AG + G + DG + N+ V V P
Sbjct: 116 SSGNFYIADSLNSVIRKVTGGTITTVAGDYTQFPGDQGDGGQANVAVLNNPTSVMVDPAG 175
Query: 203 SLLVIDRGNAALRQI 217
+ + D GN +R++
Sbjct: 176 NYYIADSGNNRIRKV 190
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 14/129 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVD--GKPNEA 132
P + + G + D +NS I K+T G + VAG + + G G+ N A
Sbjct: 109 PSGVALDSSGNFYIADSLNSVIRKVT-------GGTITTVAGDYTQFPGDQGDGGQANVA 161
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
N+P V +D GN Y+AD+ N IRK+ G G G P A F N
Sbjct: 162 VLNNPTSVMVDPAGNYYIADSGNNRIRKVDTTGTINAYLGTLATGGRLRNPYALALFGN- 220
Query: 193 FDVVYVRPT 201
V+Y+ T
Sbjct: 221 --VLYIADT 227
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 49/99 (49%), Gaps = 8/99 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG-YTGHVDGKPNEA--- 132
PY + + + L+ D N+ I K P + L +F G T G N A
Sbjct: 212 PYALALFGN-VLYIADTSNNRIAKYAPYTANNVAADLT--NFAGNLTAGFAGDGNTATLS 268
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIA 170
+ N P G+ +D GN+Y+AD+ N IRK+G G +TTIA
Sbjct: 269 QLNKPVGIAVDSAGNVYIADSNNGRIRKVGTDGIITTIA 307
Score = 42.4 bits (98), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 51/92 (55%), Gaps = 4/92 (4%)
Query: 129 PNEARFNHPKGVTMDDK-GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSED 186
P A+F+ P G+ +D K GNLY+AD+ N IR I + ++T+AG + AGF D +
Sbjct: 45 PKVAQFSSPTGLALDPKTGNLYIADSANHRIRMISGSTISTVAGNGT--AGFAGDKAAAT 102
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ N V + + + + D N+ +R+++
Sbjct: 103 SANLNTPSGVALDSSGNFYIADSLNSVIRKVT 134
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 39/83 (46%), Gaps = 3/83 (3%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF--RDGPSEDAK 188
A + P+G+ + G +Y+ADT N IR + T +GG N A F R P E A
Sbjct: 323 SAVLSFPRGIAVAANGTVYIADTNNHVIRALVPTVPTINSGGVVNAASFTARISPGELAS 382
Query: 189 -FSNDFDVVYVRPTCSLLVIDRG 210
F N F + V+P L I G
Sbjct: 383 VFGNGFGISTVQPDVPLPTIAAG 405
>gi|290974673|ref|XP_002670069.1| predicted protein [Naegleria gruberi]
gi|284083624|gb|EFC37325.1| predicted protein [Naegleria gruberi]
Length = 821
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 75/143 (52%), Gaps = 6/143 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V +G+++ D N I K++ ++ AG+ QG+ G G A+ ++
Sbjct: 165 PSGVAVDSNGDVYISDSGNGLIRKVSISSGIITK---FAGTSQGFAGD-GGLAKNAKLSN 220
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+G+ G+LY+AD+ N +R+I +G +TTIAG ++ G ++ AK N +V
Sbjct: 221 PRGLNFGPNGDLYIADSDNNVVRRINSSGIITTIAGDTTSGYSGDGGDAKLAKMKNPINV 280
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ V T + + D N +R++S
Sbjct: 281 I-VSKTNEIFISDADNNVIRKVS 302
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 51/101 (50%), Gaps = 3/101 (2%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVA 177
G TG G A+ N P GV +D G++Y++D+ N IRK+ +G+ T G S
Sbjct: 148 SGNTGE-GGLAINAQLNQPSGVAVDSNGDVYISDSGNGLIRKVSISSGIITKFAGTSQGF 206
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G +++AK SN + + P L + D N +R+I+
Sbjct: 207 AGDGGLAKNAKLSNPRGLNF-GPNGDLYIADSDNNVVRRIN 246
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 67/145 (46%), Gaps = 9/145 (6%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P I+ +G+++ D SN++++ + VAG+ G G A+ N
Sbjct: 444 MPSLIQCQPNGDVYVADYF-SNLIRLVSTNGSITN---VAGTGVGGYSGDGGNAKLAKLN 499
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG--GKSNVAGFRDGPSEDAKFSND 192
P V + + +ADT N IRK+ G + TIAG G +G +G + + S+
Sbjct: 500 APNSVKVSSSDEIVIADTSNNVIRKVFKNGTIITIAGTIGSQGYSG-DNGLAISSVLSSP 558
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
DV + P + + D GN +R+I
Sbjct: 559 SDVA-IAPNGEVFIADWGNHVIRKI 582
>gi|116619867|ref|YP_822023.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223029|gb|ABJ81738.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 592
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 66/143 (46%), Gaps = 3/143 (2%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P + V + G L VD+ I K+ + S QG++G G+ +A FN
Sbjct: 157 IPIRCAVDKSGNLLIVDQGAHKIRKVEAASNIIST--FAGNGSQGFSGD-GGQAAQASFN 213
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
+P + +D G +YV D N IR+I GV T G N A DG S A N
Sbjct: 214 NPTALAVDAAGTVYVTDQSNQRIRRIDTGGVITTVAGNGNAAFSGDGGSATAASLNYPGG 273
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ + + +L ++D N +R++S
Sbjct: 274 IVLDSSGTLYIVDSVNQRVRKVS 296
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 69/146 (47%), Gaps = 13/146 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P + V G ++ D+ N I +I + G + VAG+ G A
Sbjct: 215 PTALAVDAAGTVYVTDQSNQRIRRID------TGGVITTVAGNGNAAFSGDGGSATAASL 268
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSND 192
N+P G+ +D G LY+ D++N +RK+ ++T+AG + AGF G + A+ +N
Sbjct: 269 NYPGGIVLDSSGTLYIVDSVNQRVRKVSGTTISTVAG--TGTAGFSGDGGAALQAQLNNP 326
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
F + + +L V D N +R+I+
Sbjct: 327 FPIT-ADGSGNLYVGDVSNNRVRKIT 351
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 51/110 (46%), Gaps = 9/110 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G L+ VD VN + K++ G AG F G G +A+ N+
Sbjct: 271 PGGIVLDSSGTLYIVDSVNQRVRKVSGTTISTVAGTGTAG-FSGD----GGAALQAQLNN 325
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDA----GVTTIAGGKSNVAGFRDG 182
P +T D GNLYV D N +RKI A G T + G +N A F+ G
Sbjct: 326 PFPITADGSGNLYVGDVSNNRVRKITGASTRPGPTITSAGVTNAASFQTG 375
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 57/101 (56%), Gaps = 5/101 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAG-GKSNVA 177
G+ G G+ +A+ N G+ D GNLY+A+ LN +RK+ GV TT+AG G + A
Sbjct: 30 GFAGDT-GQATQAQINRAVGLVTDANGNLYLAEELNNRVRKVDTGGVITTLAGIGTAGFA 88
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G GP+ A+ + + + + ++ V D+GN +R+I+
Sbjct: 89 G-DGGPAAQAQLNGPLGLC-IDGSGNIYVSDQGNKRVRKIA 127
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 43/86 (50%), Gaps = 6/86 (6%)
Query: 84 EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARFNHPKGVTM 142
+G L+ +E+N+ + K+ + + F G DG P +A+ N P G+ +
Sbjct: 53 ANGNLYLAEELNNRVRKVDTGGVITTLAGIGTAGFAG-----DGGPAAQAQLNGPLGLCI 107
Query: 143 DDKGNLYVADTLNLAIRKIGDAGVTT 168
D GN+YV+D N +RKI +G T
Sbjct: 108 DGSGNIYVSDQGNKRVRKIAPSGTIT 133
>gi|222056335|ref|YP_002538697.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
gi|221565624|gb|ACM21596.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
Length = 372
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 52/101 (51%), Gaps = 9/101 (8%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAG----GKSNV 176
G DG + ARFN P G+T D NLYVADT N IRK I VTT+AG G +
Sbjct: 61 GTADGTGSAARFNAPSGITTDGT-NLYVADTGNNLIRKVVITTGAVTTLAGTVGTGTAQT 119
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+G DG AKF+ F + +L V D N +R++
Sbjct: 120 SGSTDGTGSAAKFNAPFAI--TTDGTNLYVADTNNNTIRKV 158
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 67/141 (47%), Gaps = 10/141 (7%)
Query: 82 VSEDGELFAVDEVNSNIVK---ITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPK 138
++ DG V + +N+++ IT G + G+ Q +G DG + A+FN P
Sbjct: 78 ITTDGTNLYVADTGNNLIRKVVITTGAVTTLAGTVGTGTAQ-TSGSTDGTGSAAKFNAPF 136
Query: 139 GVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
+T D NLYVADT N IRK I VTT+A G + G DG F++ +
Sbjct: 137 AITTDGT-NLYVADTNNNTIRKVVIATGTVTTLA-GSVGIPGSADGIGPAGLFNSPGGI- 193
Query: 197 YVRPTCSLLVIDRGNAALRQI 217
+L V D GN +R++
Sbjct: 194 -TTDGTNLYVSDTGNRTIRKV 213
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 70/157 (44%), Gaps = 9/157 (5%)
Query: 82 VSEDG-ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
++ DG L+ D N I K+ + +AGS G G DG A F G+
Sbjct: 193 ITTDGTNLYVSDTGNRTIRKVVIATGAVTT---LAGS-AGTPGSTDGVGPSALFGTVFGI 248
Query: 141 TMDDKGNLYVADTLNLAIRKIGDA-GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 199
T D +L+VADT N IRKI A G+ T G + V+G DG AKF+ F +
Sbjct: 249 TTDGT-SLFVADTDNSTIRKIVIATGMVTTLAGSAGVSGIADGTGSTAKFNAPFGI--TT 305
Query: 200 PTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDI 236
+L V D ++R++++ N S T I
Sbjct: 306 DGTNLYVTDSRQGSIRKVAIATGVVTTLVNGSSLTTI 342
>gi|156399714|ref|XP_001638646.1| predicted protein [Nematostella vectensis]
gi|156225768|gb|EDO46583.1| predicted protein [Nematostella vectensis]
Length = 286
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 71/137 (51%), Gaps = 11/137 (8%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
++ D NS I K+ ++ + AG+ G G VDG+ A+F P+ +++DD+G+
Sbjct: 111 MYLADVGNSRIRKVDMATAEVT---TFAGN--GSEGLVDGERTRAQFKGPQSLSLDDEGD 165
Query: 148 -LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
L+V DT N AIR I D V T+ GG GF+DG +KF + + Y R L
Sbjct: 166 RLFVGDTDNHAIRVISLKDGSVQTLVGGS---LGFKDGVGLKSKFYHPTGIAYDRENDIL 222
Query: 205 LVIDRGNAALRQISLNQ 221
V D N +R + +++
Sbjct: 223 YVSDHYNHVIRAVKVSE 239
>gi|357516311|ref|XP_003628444.1| hypothetical protein MTR_8g058310 [Medicago truncatula]
gi|355522466|gb|AET02920.1| hypothetical protein MTR_8g058310 [Medicago truncatula]
Length = 66
Score = 58.2 bits (139), Expect = 9e-06, Method: Composition-based stats.
Identities = 34/67 (50%), Positives = 43/67 (64%), Gaps = 5/67 (7%)
Query: 1 MKGTHLLLTLTLLIAFTLQFQAHAAP-AGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFE 59
M+ L L LL +L F AHA P G LI HL+SLL S++K ++DGNV+QFE
Sbjct: 1 MRNFVLFYALALL---SLHFSAHATPLGGTLINHLSSLL-IRKLSNTKTSKSDGNVVQFE 56
Query: 60 NGYLVET 66
NGY+VET
Sbjct: 57 NGYVVET 63
>gi|444916089|ref|ZP_21236213.1| hypothetical protein D187_08495 [Cystobacter fuscus DSM 2262]
gi|444712768|gb|ELW53683.1| hypothetical protein D187_08495 [Cystobacter fuscus DSM 2262]
Length = 2336
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 72/151 (47%), Gaps = 17/151 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA-------GSFQGYTGHVDGKP 129
P+ + V + G+++ D N + +I S GR+ GS +G + G
Sbjct: 1144 PWDVAVGKAGDVYVSDSANHRVRRIG------SNGRITTVAGTGDDGSLEGISIGDGGPA 1197
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG--PSEDA 187
+A + PKG+ +D +GNLY+AD + +R++ G+ T G+ +GF P+
Sbjct: 1198 QQALLSAPKGLALDSEGNLYIADHFS-RVRRVDANGIITTYAGQLEASGFSGNGTPALQG 1256
Query: 188 KFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
KF + + V P S V D N ++R++S
Sbjct: 1257 KFDSPTGLA-VGPDGSCYVSDEWNHSVRRVS 1286
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 46/91 (50%), Gaps = 3/91 (3%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF--RDGPSEDAKF 189
AR ++P+G+ + +GNLY+AD N +R + G+ GK N GF +G + A
Sbjct: 1082 ARLSYPRGIALGKEGNLYIADFDNDRVRYVTPEGIIHTLAGKPNARGFCGDNGLASAACL 1141
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ +DV V + V D N +R+I N
Sbjct: 1142 NGPWDVA-VGKAGDVYVSDSANHRVRRIGSN 1171
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 53/98 (54%), Gaps = 8/98 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG--SFQGYTGHVDGKPNEARF 134
P I + ++G L+ D N + +TP ++ +AG + +G+ G +G + A
Sbjct: 1087 PRGIALGKEGNLYIADFDNDRVRYVTPEGIIHT----LAGKPNARGFCGD-NGLASAACL 1141
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
N P V + G++YV+D+ N +R+IG G +TT+AG
Sbjct: 1142 NGPWDVAVGKAGDVYVSDSANHRVRRIGSNGRITTVAG 1179
>gi|423220897|ref|ZP_17207391.1| hypothetical protein HMPREF1061_04164 [Bacteroides caccae
CL03T12C61]
gi|392622375|gb|EIY16503.1| hypothetical protein HMPREF1061_04164 [Bacteroides caccae
CL03T12C61]
Length = 426
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 56/116 (48%), Gaps = 7/116 (6%)
Query: 110 RGRLVAGSFQGYTG------HVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD 163
R L G+++ Y G + DGK ARFN P + D G +Y+AD+ N IR I
Sbjct: 311 RYELSTGNYELYAGAREDPGYEDGKRLNARFNFPSQICFDLDGIMYIADSSNHCIRSIDR 370
Query: 164 AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G + G AG+ DG +DA F + V V ++ + D N +R++++
Sbjct: 371 EGAVSTVIGVPGRAGYVDGTPDDALFDEPWGVA-VDEEGTIYIADTKNKCIRKLAI 425
>gi|255070387|ref|XP_002507275.1| predicted protein [Micromonas sp. RCC299]
gi|226522550|gb|ACO68533.1| predicted protein [Micromonas sp. RCC299]
Length = 131
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 63/112 (56%), Gaps = 4/112 (3%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTI 169
G++ + S G G DG+ ++ F+ P GV + G++YVAD+ N IR++ G VT I
Sbjct: 4 GKVTSISGSGIPGFRDGQGEKSHFSSPAGVAVSSDGSVYVADSGNNRIRRVSSKGHVTWI 63
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
AG S +AGF D A+F+ +V + PT + V D N +R +SL++
Sbjct: 64 AG--SGLAGFSDDKLLRAEFNRPQGIVTI-PTGLIFVADTLNHRIRLVSLHE 112
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 6/85 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS DG ++ D N+ I +++ S+G + + G G D K A FN
Sbjct: 30 PAGVAVSSDGSVYVADSGNNRIRRVS------SKGHVTWIAGSGLAGFSDDKLLRAEFNR 83
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P+G+ G ++VADTLN IR +
Sbjct: 84 PQGIVTIPTGLIFVADTLNHRIRLV 108
>gi|251771416|gb|EES51995.1| conserved protein of unknown function [Leptospirillum
ferrodiazotrophum]
Length = 373
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 61/112 (54%), Gaps = 6/112 (5%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI----GDAGVTTI 169
+AG F G VDG ++ARF + +G+ G LYVADT N IRK+ G A VTTI
Sbjct: 37 IAGEFH-ERGAVDGTGSQARFEYVQGIVAAPDGTLYVADTGNDLIRKVTVSGGTATVTTI 95
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
A G ++ A FRDG A+F+N + +L + D N +R++ L+
Sbjct: 96 A-GVNHHARFRDGNGTAARFNNPEGLAISSDGKTLYIADSRNNRIRKMDLSS 146
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 65/147 (44%), Gaps = 9/147 (6%)
Query: 77 PYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + +S DG+ L+ D N+ I K+ S G + + + + G DG A F
Sbjct: 117 PEGLAISSDGKTLYIADSRNNRIRKM-----DLSSGAVTTLAGRAFPGSNDGTGGAAGFY 171
Query: 136 HPKGVTMD-DKGNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
P+ + + D LY++D+ N IRK + VTT+AG + G DG A F +
Sbjct: 172 GPRALALTPDGKTLYISDSGNNMIRKLDVATGTVTTVAGKGALAPGMDDGIGGAASFRDP 231
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISL 219
+ L V D N +R+I L
Sbjct: 232 RGIALSSDGSVLYVADTRNNLIRKIVL 258
>gi|414881061|tpg|DAA58192.1| TPA: hypothetical protein ZEAMMB73_458241 [Zea mays]
Length = 528
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 49/88 (55%), Gaps = 20/88 (22%)
Query: 61 GYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQY------------ 108
G +ETV + + +G+ PY V+++GEL +D +NSNI ++ PLS+
Sbjct: 200 GKSMETVFDSSMLGIEPYSREVTQNGELLVMDSMNSNIYRMALPLSRSDNLSHIRFISSS 259
Query: 109 --------SRGRLVAGSFQGYTGHVDGK 128
SR +LVAGS +G+ GH+DGK
Sbjct: 260 EIGLNNVDSRPKLVAGSPEGFPGHIDGK 287
>gi|390956314|ref|YP_006420071.1| hypothetical protein Terro_0387 [Terriglobus roseus DSM 18391]
gi|390411232|gb|AFL86736.1| hypothetical protein Terro_0387 [Terriglobus roseus DSM 18391]
Length = 1224
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 90/195 (46%), Gaps = 16/195 (8%)
Query: 60 NGYLVETVIEG---NEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG 116
G+ IEG ++G+ P ++ ++ DG L+ + V+ + P + R R AG
Sbjct: 189 TGFFNPVSIEGLSATQVGIQPKEVALAPDGTLYIANGVSDGPILAVDPTT--GRIRTYAG 246
Query: 117 SFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKS 174
S G TG + +A P G+++D G+LY D N +RK+ + +TTIAG S
Sbjct: 247 SASGCTGCDNVPAIQANIGWPSGMSVDANGDLYFVDISNHVVRKVTKSTGLITTIAGAGS 306
Query: 175 NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA--LRQISLNQD------DCEY 226
N ++D +F + D++ + P+ LLV D G A + +I ++ C
Sbjct: 307 NFPYNTVQLAKDTEFISPEDLL-ILPSGDLLVDDSGVGAGQIFRIEMSTGRLYPIAGCSG 365
Query: 227 QYNSISPTDILMVVG 241
N+ PT + G
Sbjct: 366 LVNTCDPTSTVATRG 380
>gi|292491957|ref|YP_003527396.1| hypothetical protein Nhal_1896 [Nitrosococcus halophilus Nc4]
gi|291580552|gb|ADE15009.1| YD repeat protein [Nitrosococcus halophilus Nc4]
Length = 2539
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 72/144 (50%), Gaps = 8/144 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFN 135
P+ I + DG L+ + N+ I ++ P LVAG+ +G++G G +AR N
Sbjct: 1325 PFAIALDSDGNLYIAESQNNVIRRVGPD----GLITLVAGNGTRGFSGD-GGLATQARLN 1379
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
HP + + G+LY+AD N IR + G +TTIAG S G GP+ A N
Sbjct: 1380 HPNDLAIGPDGSLYIADYDNHRIRLVSPNGIITTIAGNGSRGYGGDGGPATQAMLRNPQG 1439
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ V SL + DR N +R++S
Sbjct: 1440 IT-VASDSSLYIADRRNHRIRKVS 1462
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 71/144 (49%), Gaps = 11/144 (7%)
Query: 80 IRVSEDGELFAVDEVNSNIVKI-TPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPK 138
+ V +G L+ VD N +++I T + + G +AG F G TG + + P
Sbjct: 1273 VAVGPNGSLYVVDVGNYRVLRIGTDGVVSIAAGNGIAG-FSG-TGDLATQA----MMRPF 1326
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVV 196
+ +D GNLY+A++ N IR++G G+ T+ G GF G + A+ ++ D+
Sbjct: 1327 AIALDSDGNLYIAESQNNVIRRVGPDGLITLVAGNG-TRGFSGDGGLATQARLNHPNDLA 1385
Query: 197 YVRPTCSLLVIDRGNAALRQISLN 220
+ P SL + D N +R +S N
Sbjct: 1386 -IGPDGSLYIADYDNHRIRLVSPN 1408
Score = 42.4 bits (98), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 61/143 (42%), Gaps = 8/143 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P + + DG L+ D N I ++P G + +AG+ G G +A
Sbjct: 1381 PNDLAIGPDGSLYIADYDNHRIRLVSP------NGIITTIAGNGSRGYGGDGGPATQAML 1434
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
+P+G+T+ +LY+AD N IRK+ G+ T G + DG N
Sbjct: 1435 RNPQGITVASDSSLYIADRRNHRIRKVSPEGIITTVAGNGILGYDGDGGISTGAKLNLPI 1494
Query: 195 VVYVRPTCSLLVIDRGNAALRQI 217
V + P +L + D N +R++
Sbjct: 1495 AVALSPNENLYIADYYNHRIRRV 1517
>gi|290990564|ref|XP_002677906.1| NHL domain-containing protein kinase [Naegleria gruberi]
gi|284091516|gb|EFC45162.1| NHL domain-containing protein kinase [Naegleria gruberi]
Length = 2733
Score = 57.8 bits (138), Expect = 1e-05, Method: Composition-based stats.
Identities = 45/146 (30%), Positives = 69/146 (47%), Gaps = 19/146 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQ-GYTGHVDGKPNEAR 133
P + +S GELF D+ N I K+ S G + VAG+ G++G D N A
Sbjct: 686 PQGVTLSPTGELFIADQNNHRIRKVA------SNGYISTVAGNGNFGFSGDGDLATN-AE 738
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF-SND 192
N P G+ G +Y+AD LN IR++ ++TIAGG + GP+ +
Sbjct: 739 LNSPSGIAFSSIGTMYIADRLNRVIRRVISGNISTIAGGVGD-----GGPATGGYIQAQS 793
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
FD + T + + D N +R++S
Sbjct: 794 FD---ISSTGEIYIADTENHRIRKVS 816
Score = 55.5 bits (132), Expect = 5e-05, Method: Composition-based stats.
Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 12/146 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKPNEAR 133
P + VS +G+++ D N I KI + G + VAGS GY+G G A+
Sbjct: 518 PSGVAVSSNGDIYIADTENHRIRKIE------TNGYIATVAGSGASGYSGD-GGLLTSAK 570
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSND 192
F P+GV + G +Y+ADT N +RKI + V +TIAG S G + AK +
Sbjct: 571 FQQPQGVAVSSNGEIYIADTENHVVRKISTSNVLSTIAGTGSYGYNGDGGLAITAKLFSP 630
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
V + + + + D N +R+I+
Sbjct: 631 IGVA-ISSSGEVFIADNNNHRIRKIA 655
Score = 55.1 bits (131), Expect = 7e-05, Method: Composition-based stats.
Identities = 39/141 (27%), Positives = 58/141 (41%), Gaps = 4/141 (2%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +S GE+F D N I KI +V GY G D N A+ +
Sbjct: 630 PIGVAISSSGEVFIADNNNHRIRKIA---KDGYISTIVGTGLNGYDGDGDLATN-AKLGN 685
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+GVT+ G L++AD N IRK+ G + G N DG N +
Sbjct: 686 PQGVTLSPTGELFIADQNNHRIRKVASNGYISTVAGNGNFGFSGDGDLATNAELNSPSGI 745
Query: 197 YVRPTCSLLVIDRGNAALRQI 217
++ + DR N +R++
Sbjct: 746 AFSSIGTMYIADRLNRVIRRV 766
Score = 52.4 bits (124), Expect = 6e-04, Method: Composition-based stats.
Identities = 43/143 (30%), Positives = 68/143 (47%), Gaps = 11/143 (7%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN---EARFNH 136
I V+ +G+++ D N+ I K++ +S G + + G G +G N A+ N
Sbjct: 464 IAVNSNGDVYIADTYNNRIRKVS-----FSTGIISTVAGTGIAGQ-NGDGNLATSAQLNF 517
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV + G++Y+ADT N IRKI G + T+AG ++ G AKF V
Sbjct: 518 PSGVAVSSNGDIYIADTENHRIRKIETNGYIATVAGSGASGYSGDGGLLTSAKFQQPQGV 577
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V + + D N +R+IS
Sbjct: 578 A-VSSNGEIYIADTENHVVRKIS 599
Score = 51.2 bits (121), Expect = 0.001, Method: Composition-based stats.
Identities = 44/145 (30%), Positives = 67/145 (46%), Gaps = 6/145 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS +GE++ D N + KI+ S GS+ GY G G A+
Sbjct: 574 PQGVAVSSNGEIYIADTENHVVRKIST--SNVLSTIAGTGSY-GYNGD-GGLAITAKLFS 629
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV + G +++AD N IRKI G ++TI G N + +AK N V
Sbjct: 630 PIGVAISSSGEVFIADNNNHRIRKIAKDGYISTIVGTGLNGYDGDGDLATNAKLGNPQGV 689
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
+ PT L + D+ N +R+++ N
Sbjct: 690 T-LSPTGELFIADQNNHRIRKVASN 713
Score = 51.2 bits (121), Expect = 0.001, Method: Composition-based stats.
Identities = 44/150 (29%), Positives = 74/150 (49%), Gaps = 17/150 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGK-PNEA 132
P I + +GE + D +N I KI + G + +AG+ GY G DG A
Sbjct: 206 PACIAIDSNGEFYISDSMNHRIRKIA------TNGIITTIAGTGTHGYDG--DGALAINA 257
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFS 190
+ P GV ++ G++Y+AD+ N IRKI ++T+AG + +AG+ G S A+ +
Sbjct: 258 QLYSPTGVVVNSNGDVYIADSGNNRIRKISKGYISTVAG--NGIAGYSGDGGLSTSAQLA 315
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
V + +++ D N +R+I+ N
Sbjct: 316 TPQSVA-INLNGEIIIADSNNNRIRKIATN 344
Score = 48.9 bits (115), Expect = 0.006, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 76/172 (44%), Gaps = 21/172 (12%)
Query: 60 NGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNS---------NIVKITPPLSQYSR 110
NGY+ GN S DG+L E+NS + I L++ R
Sbjct: 713 NGYISTVAGNGN--------FGFSGDGDLATNAELNSPSGIAFSSIGTMYIADRLNRVIR 764
Query: 111 GRLVAGSFQGYTGHV-DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTT 168
R+++G+ G V DG P + + + G +Y+ADT N IRK+ G ++T
Sbjct: 765 -RVISGNISTIAGGVGDGGPATGGYIQAQSFDISSTGEIYIADTENHRIRKVSTLGKIST 823
Query: 169 IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
IAG + G + AK S+ V V T + + DR N +R+++L+
Sbjct: 824 IAGTGAMGYSGDGGLAITAKLSSPVGVA-VSSTGEVFIADRDNHRIRKVTLS 874
Score = 46.6 bits (109), Expect = 0.029, Method: Composition-based stats.
Identities = 45/141 (31%), Positives = 69/141 (48%), Gaps = 13/141 (9%)
Query: 82 VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFNHPKGV 140
+S GE++ D N I K+ S + +AG+ GY+G G A+ + P GV
Sbjct: 796 ISSTGEIYIADTENHRIRKV----STLGKISTIAGTGAMGYSGD-GGLAITAKLSSPVGV 850
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS--NDFDVVY 197
+ G +++AD N IRK+ +G +TTIAG + +GF E AKF+ N V
Sbjct: 851 AVSSTGEVFIADRDNHRIRKVTLSGIITTIAG--NGTSGFNSDGIE-AKFAQLNSPSSVT 907
Query: 198 VRPTCSLLVIDRGNAALRQIS 218
+ + + D N +RQIS
Sbjct: 908 ISGG-EIYIADTNNHRIRQIS 927
Score = 44.3 bits (103), Expect = 0.14, Method: Composition-based stats.
Identities = 37/98 (37%), Positives = 52/98 (53%), Gaps = 10/98 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG-HVDG-KPNEARF 134
P + VS GE+F D N I K+T LS G + + G +G + DG + A+
Sbjct: 847 PVGVAVSSTGEVFIADRDNHRIRKVT--LS----GIITTIAGNGTSGFNSDGIEAKFAQL 900
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
N P VT+ G +Y+ADT N IR+I + G + TIAG
Sbjct: 901 NSPSSVTISG-GEIYIADTNNHRIRQISNTGIIKTIAG 937
Score = 43.9 bits (102), Expect = 0.18, Method: Composition-based stats.
Identities = 44/176 (25%), Positives = 71/176 (40%), Gaps = 35/176 (19%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI------TPPLSQYSRG------------------- 111
PY + +S GE++ D N+ I KI T +RG
Sbjct: 113 PYGVALSLGGEIYIADSNNNRIRKIATNGIITTIAGTGTRGYDGDGALATSAKLNFPSSI 172
Query: 112 --------RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD 163
+AG+ G G + A+ P + +D G Y++D++N IRKI
Sbjct: 173 AITTNGNISTIAGTGTGSYGGDNAIATGAQLYAPACIAIDSNGEFYISDSMNHRIRKIAT 232
Query: 164 AG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G +TTIAG ++ + +A+ + VV V + + D GN +R+IS
Sbjct: 233 NGIITTIAGTGTHGYDGDGALAINAQLYSPTGVV-VNSNGDVYIADSGNNRIRKIS 287
Score = 43.1 bits (100), Expect = 0.30, Method: Composition-based stats.
Identities = 34/102 (33%), Positives = 54/102 (52%), Gaps = 12/102 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKPNEAR 133
P + V+ +G+++ D N+ I KI S+G + VAG+ GY+G G A+
Sbjct: 262 PTGVVVNSNGDVYIADSGNNRIRKI-------SKGYISTVAGNGIAGYSGD-GGLSTSAQ 313
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKS 174
P+ V ++ G + +AD+ N IRKI G ++TIAG S
Sbjct: 314 LATPQSVAINLNGEIIIADSNNNRIRKIATNGKISTIAGTSS 355
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 45/135 (33%), Positives = 64/135 (47%), Gaps = 12/135 (8%)
Query: 46 SKAPQADGNVLQFE------NGYLVETVIEGNE-IGVVPYKIRVSE-DGELFAVDEVNSN 97
S A Q D LQ+ G +V + NE I P ++ +S G+LF + V++
Sbjct: 17 SLASQIDSQSLQYNISTVAGGGSIVGDGLLANEAITSYPCQLAISSISGDLFFGEVVSNR 76
Query: 98 IVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLA 157
I KIT S +AG+ G G A+ +P GV + G +Y+AD+ N
Sbjct: 77 IRKITSSTGVIST---IAGTGTGAYGGDGSMATAAQLFYPYGVALSLGGEIYIADSNNNR 133
Query: 158 IRKIGDAG-VTTIAG 171
IRKI G +TTIAG
Sbjct: 134 IRKIATNGIITTIAG 148
Score = 38.9 bits (89), Expect = 6.0, Method: Composition-based stats.
Identities = 33/119 (27%), Positives = 52/119 (43%), Gaps = 6/119 (5%)
Query: 103 PPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG 162
P ++ SR V GS +GY G D A N+ + + ++ G++Y+ADT N IRK+
Sbjct: 432 PKINIVSR---VVGS-EGYNGD-DIAATSALLNYARSIAVNSNGDVYIADTYNNRIRKVS 486
Query: 163 -DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
G+ + G DG + N V V + + D N +R+I N
Sbjct: 487 FSTGIISTVAGTGIAGQNGDGNLATSAQLNFPSGVAVSSNGDIYIADTENHRIRKIETN 545
>gi|421097867|ref|ZP_15558546.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
str. 200901122]
gi|410799150|gb|EKS01231.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
str. 200901122]
Length = 357
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/124 (37%), Positives = 65/124 (52%), Gaps = 9/124 (7%)
Query: 98 IVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLA 157
I+KI L Q+S L AGS G G +G + F P + +D + NLYV + N A
Sbjct: 144 ILKI-DHLDQFS---LYAGSSSGVDGFQNGDRLNSLFKSPFFMDLDRERNLYVGELSNHA 199
Query: 158 IRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALR 215
IRKI V+T++GG V G+ DG A+F + + Y + T SLLV D N +R
Sbjct: 200 IRKINLNSGTVSTLSGG---VLGYLDGDLASAQFKSPLGITYDQKTDSLLVADIQNHNIR 256
Query: 216 QISL 219
+I L
Sbjct: 257 KIDL 260
>gi|158520859|ref|YP_001528729.1| YD repeat-containing protein [Desulfococcus oleovorans Hxd3]
gi|158509685|gb|ABW66652.1| YD repeat protein [Desulfococcus oleovorans Hxd3]
Length = 2831
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 83/178 (46%), Gaps = 29/178 (16%)
Query: 52 DGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVD---------EVNSNIVKIT 102
DGN+ F G ++ ++ GN I V+ DG L+ + ++N + +
Sbjct: 1526 DGNITTFFEGPIIRSI--GN------LDIEVAPDGMLYILSRSENQLRRVDLNGIVSIVA 1577
Query: 103 PPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
+ Y G V F G DG P EAR HP+G+ +D GN+Y+ADT N +R+I
Sbjct: 1578 GIATSYPPGMKV---FAG-----DGGPAIEARLYHPQGMEIDASGNIYIADTDNHCVRRI 1629
Query: 162 GDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
G+ G AG+ G + DA+ + + V T +L + D GN ++R++
Sbjct: 1630 SPDGIIEAFAGMGVDAGYSGDGGLAVDARLQSPTGLA-VDKTGNLFIADSGNFSIRKV 1686
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 83/192 (43%), Gaps = 25/192 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARFN 135
P + V + G LF D N +I K+ P + + G+ GY+G DG P +A+
Sbjct: 1662 PTGLAVDKTGNLFIADSGNFSIRKVDP---KGVITTIAGGNGPGYSG--DGWPAVDAQLQ 1716
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
+T+D GNLY+ T IRKI G+ TTIAGG + GP+ A+ +
Sbjct: 1717 SISEITLDSSGNLYL--TGYDHIRKINQDGIITTIAGGNGSGHSGDGGPAIYAQLGLGLN 1774
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQ 254
+ P +L ++D +R+ + P + V G+V VG V + +
Sbjct: 1775 DIIADPRGNLYILDTSYCGVRK--------------VGPAPV--VAGSVGVGEVIFVEEN 1818
Query: 255 GFGPFFFSRTQQ 266
G G S +
Sbjct: 1819 GLGHIMLSNGRH 1830
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 62/120 (51%), Gaps = 16/120 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA----GSFQGYTGHVDGKPNEA 132
P + + G ++ D N + +I+P G + A G GY+G G +A
Sbjct: 1605 PQGMEIDASGNIYIADTDNHCVRRISP------DGIIEAFAGMGVDAGYSGD-GGLAVDA 1657
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFR-DG-PSEDAKF 189
R P G+ +D GNL++AD+ N +IRK+ GV TTIAGG N G+ DG P+ DA+
Sbjct: 1658 RLQSPTGLAVDKTGNLFIADSGNFSIRKVDPKGVITTIAGG--NGPGYSGDGWPAVDAQL 1715
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 90/230 (39%), Gaps = 35/230 (15%)
Query: 131 EARFNHPKGVTMDDKGNLYVAD-TLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
+A+ +P VTMD +GNLY+AD T + I + N+ F +GP +
Sbjct: 1483 DAQLAYPTSVTMDMRGNLYIADSTYEIVGGAISIKDIIRKVDKDGNITTFFEGPIIRSIG 1542
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVT 249
+ D +V P L ++ R LR++ LN I+ +V + Y
Sbjct: 1543 NLDIEVA---PDGMLYILSRSENQLRRVDLNG--------------IVSIVAGIATSYPP 1585
Query: 250 CM--LQQGFGPFFFSRTQQPSESEFNEETEI------KELSNKEKPIPIVES---MKEEP 298
M GP +R P E + I + P I+E+ M +
Sbjct: 1586 GMKVFAGDGGPAIEARLYHPQGMEIDASGNIYIADTDNHCVRRISPDGIIEAFAGMGVDA 1645
Query: 299 GWPSFGQLIIDL-----SKLALEAMGSMFL-NFVPFRFRSSGTKGNLTPL 342
G+ G L +D + LA++ G++F+ + F R KG +T +
Sbjct: 1646 GYSGDGGLAVDARLQSPTGLAVDKTGNLFIADSGNFSIRKVDPKGVITTI 1695
>gi|182416423|ref|YP_001821489.1| NHL repeat-containing protein [Opitutus terrae PB90-1]
gi|177843637|gb|ACB77889.1| NHL repeat containing protein [Opitutus terrae PB90-1]
Length = 1026
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 71/140 (50%), Gaps = 9/140 (6%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ + G L+AVD + + KITP + +AGS G G G EARF+ P
Sbjct: 111 LAIDARGNLYAVDFTDHTVRKITPEGVVTT----LAGS-AGDHGTQVGHGGEARFDSPMA 165
Query: 140 VTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 198
V +D NLYV + AIRK+ D VT +AG + AG DG S A+FS D + V
Sbjct: 166 VAVDRWDNLYVGQMGDGAIRKVSPDGNVTILAG--AGKAGSADGDSASARFSGS-DGLAV 222
Query: 199 RPTCSLLVIDRGNAALRQIS 218
T ++ V D N +R+I+
Sbjct: 223 DGTGNVYVADLFNHTIRKIT 242
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 59/113 (52%), Gaps = 6/113 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P ++ + G + DE N I K++P + VAG G +G DG + ARF+
Sbjct: 272 PRELSIDAYGNILVADEGNCAIRKVSPSGVVST----VAGK-TGLSGSDDGV-DAARFSL 325
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
P+GV + G++YVAD+ N +R+I G T G+ G+ +G SE A+F
Sbjct: 326 PRGVAVSRTGDIYVADSGNSTVRRIAVGGAVTTFAGRPGGPGYANGSSETAQF 378
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 50/96 (52%), Gaps = 9/96 (9%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P+ I DG L+ D NS I +I P S + G G Q G DG+ EARF
Sbjct: 594 LPFGIAAGHDGSLYIADTGNSTIRQIRPDGSMVTIG---GGMRQ--EGKQDGRGGEARFL 648
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
+P GV +D G+LYVAD+ N +RK GV AG
Sbjct: 649 NPYGVAVDAAGHLYVADSGNNLVRK----GVKVAAG 680
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 65/133 (48%), Gaps = 15/133 (11%)
Query: 65 ETVIEGNEIGV------VPYKIRVSEDGELFAVDEVNSNI--VKITPPLSQYSRGRLVAG 116
+T + G++ GV +P + VS G+++ D NS + + + ++ ++ GR
Sbjct: 308 KTGLSGSDDGVDAARFSLPRGVAVSRTGDIYVADSGNSTVRRIAVGGAVTTFA-GR---- 362
Query: 117 SFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNV 176
G G+ +G A+F P G+ +D N++VAD+ N IRKI GV T G V
Sbjct: 363 --PGGPGYANGSSETAQFYFPTGIAIDQNRNVFVADSYNNVIRKITPGGVVTTVAGLGGV 420
Query: 177 AGFRDGPSEDAKF 189
G +G A+F
Sbjct: 421 FGSAEGSGAAARF 433
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 53/88 (60%), Gaps = 2/88 (2%)
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
RF +P+ +++D GN+ VAD N AIRK+ +GV + GK+ ++G DG + A+FS
Sbjct: 268 RFYYPRELSIDAYGNILVADEGNCAIRKVSPSGVVSTVAGKTGLSGSDDG-VDAARFSLP 326
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLN 220
V R T + V D GN+ +R+I++
Sbjct: 327 RGVAVSR-TGDIYVADSGNSTVRRIAVG 353
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 66/145 (45%), Gaps = 12/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGH---VDGKPNEAR 133
P I V G L+ D +S I KIT + G + +F G G DG AR
Sbjct: 53 PRSIAVDASGTLYVADAASSVIRKIT------AEGMVT--TFVGTAGQRGSADGIGAAAR 104
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
F G+ +D +GNLY D + +RKI GV T G + G + G +A+F +
Sbjct: 105 FQGIDGLAIDARGNLYAVDFTDHTVRKITPEGVVTTLAGSAGDHGTQVGHGGEARFDSPM 164
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
V R +L V G+ A+R++S
Sbjct: 165 AVAVDRWD-NLYVGQMGDGAIRKVS 188
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 50/100 (50%), Gaps = 2/100 (2%)
Query: 120 GYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
G +G DG EARFN P G+ + G +YVAD N IR+I AG+ + G + G
Sbjct: 474 GLSGSTDGNARTEARFNGPTGIAVGPSGTIYVADFDNHTIRQISPAGMVSTLAGAAGQPG 533
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
DG A+F V R + V D ++A+R+I+
Sbjct: 534 TADGTGSAARFYAPAAVTVDRAGM-IYVADSWSSAVRKIT 572
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 67/143 (46%), Gaps = 7/143 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + ++ +F D N+ I KITP + VAG G G +G ARF
Sbjct: 381 PTGIAIDQNRNVFVADSYNNVIRKITPGGVVTT----VAG-LGGVFGSAEGSGAAARFGV 435
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSE-DAKFSNDFDV 195
P V +D NLYVA+ I KI G T G ++G DG + +A+F N
Sbjct: 436 PAAVAIDAAANLYVANRQTHVIAKIAPDGAVTFFAGSPGLSGSTDGNARTEARF-NGPTG 494
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ V P+ ++ V D N +RQIS
Sbjct: 495 IAVGPSGTIYVADFDNHTIRQIS 517
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 58/106 (54%), Gaps = 3/106 (2%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGG 172
++AG+ G G DG ARF+ G+ +D GN+YVAD N IRKI GV T G
Sbjct: 195 ILAGA--GKAGSADGDSASARFSGSDGLAVDGTGNVYVADLFNHTIRKITPDGVVTTLAG 252
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ +GF DG A+F + + + ++LV D GN A+R++S
Sbjct: 253 VARESGFADGAGAAARFYYPRE-LSIDAYGNILVADEGNCAIRKVS 297
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 65/141 (46%), Gaps = 15/141 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D +S + KITP G + Q Y DG+P R
Sbjct: 547 PAAVTVDRAGMIYVADSWSSAVRKITPD------GVVTTVVRQPY----DGEPE--RLYL 594
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ G+LY+ADT N IR+I D + TI GG G +DG +A+F N + V
Sbjct: 595 PFGIAAGHDGSLYIADTGNSTIRQIRPDGSMVTIGGGMRQ-EGKQDGRGGEARFLNPYGV 653
Query: 196 VYVRPTCSLLVIDRGNAALRQ 216
V L V D GN +R+
Sbjct: 654 A-VDAAGHLYVADSGNNLVRK 673
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 65/141 (46%), Gaps = 13/141 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G ++ D N I +I+P + +AG+ G G DG + ARF
Sbjct: 492 PTGIAVGPSGTIYVADFDNHTIRQISPAGMVST----LAGA-AGQPGTADGTGSAARFYA 546
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P VT+D G +YVAD+ + A+RKI GV T V DG E + F +
Sbjct: 547 PAAVTVDRAGMIYVADSWSSAVRKITPDGVVTTV-----VRQPYDGEPE--RLYLPFGIA 599
Query: 197 YVRPTCSLLVIDRGNAALRQI 217
SL + D GN+ +RQI
Sbjct: 600 -AGHDGSLYIADTGNSTIRQI 619
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 1/96 (1%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G DG + ARF P+ + +D G LYVAD + IRKI G+ T G + G DG
Sbjct: 39 GSTDGFGDSARFAAPRSIAVDASGTLYVADAASSVIRKITAEGMVTTFVGTAGQRGSADG 98
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
A+F D + + +L +D + +R+I+
Sbjct: 99 IGAAARFQG-IDGLAIDARGNLYAVDFTDHTVRKIT 133
>gi|418737091|ref|ZP_13293489.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
str. 200801910]
gi|410747250|gb|EKR00156.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
str. 200801910]
Length = 358
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 71/136 (52%), Gaps = 9/136 (6%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G+ F + + I+KI P+ Q+S L AG+ G G +G + F P + +D +
Sbjct: 133 GDKFISCQDTAQILKI-DPMDQFS---LYAGNSSGIGGFQNGDRLNSLFKGPLFMDLDRE 188
Query: 146 GNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 203
NLYV + N IRKI V+T++GG V+G+ DG A+F ++Y + T S
Sbjct: 189 RNLYVGELGNHTIRKINLNSETVSTLSGG---VSGYLDGDLTSAQFKFPSGIIYDQKTDS 245
Query: 204 LLVIDRGNAALRQISL 219
LLV D N +R+I L
Sbjct: 246 LLVADLQNHRIRKIDL 261
>gi|312198347|ref|YP_004018408.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311229683|gb|ADP82538.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 814
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 70/144 (48%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P I + G+LF D N + ++ VAG F GY DGKP +A +
Sbjct: 620 PSGIAIDSAGDLFIADNGNDIVRRV----GVDGIITTVAGRF-GYGSWGDGKPATQAMIS 674
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDF 193
P V +D +G +Y+AD+ N IR+IG D + TIAG + VAG+ DG A D
Sbjct: 675 KPFNVALDRQGRIYIADSYNHKIRRIGLDGVIETIAG--TGVAGYSGDGGKATAATLRDP 732
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
V V ++ + D GN +R+I
Sbjct: 733 RGVTVDAAGNVYITDSGNNRVRRI 756
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 60/109 (55%), Gaps = 8/109 (7%)
Query: 114 VAGSFQ-GYTGHVDGKPN-EARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIA 170
+AG+ Q G+ G DG P +A+ +HP G MD GNLY AD N +R+I D +TT+A
Sbjct: 483 IAGTGQAGFAG--DGGPAAQAQLDHPYGPAMDGFGNLYFADFDNNRVRRISPDGTITTVA 540
Query: 171 G-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G G+ +G DG A N V + P +L ++D N +RQ+S
Sbjct: 541 GNGQPGFSG--DGGPATAAMLNKPVAVAIGPGGTLYIVDTFNMRVRQVS 587
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 70/147 (47%), Gaps = 8/147 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKP-NEARF 134
P + + G L+ VD N + +++P + +AGS + + DG P A
Sbjct: 562 PVAVAIGPGGTLYIVDTFNMRVRQVSP----DGIIQTIAGSGERPWNPADDGGPATNAAL 617
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG-PSEDAKFSNDF 193
+P G+ +D G+L++AD N +R++G G+ T G+ + DG P+ A S F
Sbjct: 618 WYPSGIAIDSAGDLFIADNGNDIVRRVGVDGIITTVAGRFGYGSWGDGKPATQAMISKPF 677
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
+V R + + D N +R+I L+
Sbjct: 678 NVALDR-QGRIYIADSYNHKIRRIGLD 703
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 50/97 (51%), Gaps = 7/97 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFN 135
P+ + + G ++ D N I +I +AG+ GY+G GK A
Sbjct: 676 PFNVALDRQGRIYIADSYNHKIRRI----GLDGVIETIAGTGVAGYSGD-GGKATAATLR 730
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
P+GVT+D GN+Y+ D+ N +R+I AG +TT+AG
Sbjct: 731 DPRGVTVDAAGNVYITDSGNNRVRRIDTAGIITTVAG 767
Score = 45.8 bits (107), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 74/153 (48%), Gaps = 20/153 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKP-NEARF 134
PY + G L+ D N+ + +I+P + + VAG+ Q G++G DG P A
Sbjct: 506 PYGPAMDGFGNLYFADFDNNRVRRISPDGTITT----VAGNGQPGFSG--DGGPATAAML 559
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVAGFRD-GPSEDAKFSND 192
N P V + G LY+ DT N+ +R++ G+ TIAG +G R P++D + +
Sbjct: 560 NKPVAVAIGPGGTLYIVDTFNMRVRQVSPDGIIQTIAG-----SGERPWNPADDGGPATN 614
Query: 193 FDVVY-----VRPTCSLLVIDRGNAALRQISLN 220
+ Y + L + D GN +R++ ++
Sbjct: 615 AALWYPSGIAIDSAGDLFIADNGNDIVRRVGVD 647
>gi|322434821|ref|YP_004217033.1| hypothetical protein AciX9_1190 [Granulicella tundricola MP5ACTX9]
gi|321162548|gb|ADW68253.1| NHL repeat containing protein [Granulicella tundricola MP5ACTX9]
Length = 1631
Score = 57.4 bits (137), Expect = 2e-05, Method: Composition-based stats.
Identities = 47/147 (31%), Positives = 69/147 (46%), Gaps = 12/147 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF--QGYTGHVDGKP-NEAR 133
P + + G +F D N+ I K+ S +AG GY G DG P + A+
Sbjct: 225 PSSVALDGAGNVFISDTGNNVIRKVNVFDGTIST---IAGQMGKNGYVG--DGGPASSAK 279
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRDGPSEDAKFS 190
N P G+ D +GNLY DT N +R+I DAG +TT AG G+ DG A
Sbjct: 280 LNGPNGLVFDAQGNLYFCDTNNNVVRRI-DAGTGVITTFAGNGVTTGGYGDGGPAAAAML 338
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQI 217
N + V + + D+GN+ +R++
Sbjct: 339 NAPWGIAVSSKGEIYIADQGNSLIRKV 365
Score = 47.8 bits (112), Expect = 0.014, Method: Composition-based stats.
Identities = 41/149 (27%), Positives = 69/149 (46%), Gaps = 13/149 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG----YTGHVDGKPNEA 132
P + G L+ D N+N+V+ + G V +F G G+ DG P A
Sbjct: 283 PNGLVFDAQGNLYFCD-TNNNVVR------RIDAGTGVITTFAGNGVTTGGYGDGGPAAA 335
Query: 133 -RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
N P G+ + KG +Y+AD N IRK+ + ++T+AG + P+ + ++
Sbjct: 336 AMLNAPWGIAVSSKGEIYIADQGNSLIRKVVNGTISTVAGTHDAADTGSNPPAIHTQLNS 395
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISLN 220
VV V +L + D GN +R+++ N
Sbjct: 396 PAGVV-VDVAGNLYISDSGNNLIRKVNTN 423
Score = 42.4 bits (98), Expect = 0.49, Method: Composition-based stats.
Identities = 34/97 (35%), Positives = 53/97 (54%), Gaps = 5/97 (5%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA--GVT-TIAGGKSNVAGFRDG 182
+G ++ F P GVT+D GNLY+ DT N IRK+ DA G+T TIAG + G
Sbjct: 157 EGVATQSPFFLPFGVTLDAAGNLYITDTSNTRIRKV-DAVTGMTSTIAGNGTIGGTGDGG 215
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
P+ A S+ V + ++ + D GN +R++++
Sbjct: 216 PATSATLSSPSSVA-LDGAGNVFISDTGNNVIRKVNV 251
>gi|299473428|emb|CBN77825.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 2120
Score = 57.4 bits (137), Expect = 2e-05, Method: Composition-based stats.
Identities = 41/109 (37%), Positives = 52/109 (47%), Gaps = 6/109 (5%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGG 172
L GS G G+ DG A FN P+ V +D N+YVADT N IR+I GV T G
Sbjct: 203 LAGGS--GEAGYADGSGTAALFNDPQDVAVDAHANVYVADTGNHRIRRISPEGVVTTVAG 260
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVI---DRGNAALRQIS 218
G DG + +A FS + + LV+ D N LR+IS
Sbjct: 261 DGE-EGSDDGDAMEASFSFPGGIALYYDSSEGLVLYVADTNNHRLRKIS 308
Score = 53.1 bits (126), Expect = 3e-04, Method: Composition-based stats.
Identities = 46/141 (32%), Positives = 60/141 (42%), Gaps = 31/141 (21%)
Query: 77 PYKIRVSEDGELFAVDE---------VNSNIVKITPPLSQY---SRGRLVAGS--FQGYT 122
P + +EDG LF D NS + +T L + G V S +G
Sbjct: 358 PSGLAAAEDGTLFVADTNNHLIRMVLANSTVFTLTGGLEGAEVEAGGEEVCPSPCLRGVA 417
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-------------DAGVTTI 169
GH DG ARFN+P V++ G L+VAD +L R+I D VTT
Sbjct: 418 GHTDGNLTAARFNYPADVSLGTNGTLFVADLHSL--RRISMPENPTIVLGVGFDGRVTTA 475
Query: 170 AGGKSNVAGFRDGPSEDAKFS 190
AGG G DG +A+FS
Sbjct: 476 AGGAE--PGEADGTGPEARFS 494
Score = 43.9 bits (102), Expect = 0.21, Method: Composition-based stats.
Identities = 46/168 (27%), Positives = 65/168 (38%), Gaps = 31/168 (18%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V ++ D N I +I+P G + + G G DG EA F+
Sbjct: 225 PQDVAVDAHANVYVADTGNHRIRRISP------EGVVTTVAGDGEEGSDDGDAMEASFSF 278
Query: 137 PKGVTMDDKGN----LYVADTLNLAIRKI-GD----AGVTTIAGGK-------------- 173
P G+ + + LYVADT N +RKI GD AG T G+
Sbjct: 279 PGGIALYYDSSEGLVLYVADTNNHRLRKISGDIANGAGTVTCHAGRCGNGTESATRMAAE 338
Query: 174 -SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ AGF DG A+F + +L V D N +R + N
Sbjct: 339 ATPEAGFADGDGSYARFDGPSGLAAAEDG-TLFVADTNNHLIRMVLAN 385
Score = 42.0 bits (97), Expect = 0.62, Method: Composition-based stats.
Identities = 38/119 (31%), Positives = 54/119 (45%), Gaps = 22/119 (18%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-----------GDAGVTTIAG 171
G DG + ARF+ P G+ + G L+VADT N IR + G G AG
Sbjct: 344 GFADGDGSYARFDGPSGLAAAEDGTLFVADTNNHLIRMVLANSTVFTLTGGLEGAEVEAG 403
Query: 172 GKS--------NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
G+ VAG DG A+F+ DV + +L V D +LR+IS+ ++
Sbjct: 404 GEEVCPSPCLRGVAGHTDGNLTAARFNYPADVS-LGTNGTLFVADL--HSLRRISMPEN 459
>gi|108761192|ref|YP_633033.1| lipoprotein [Myxococcus xanthus DK 1622]
gi|108465072|gb|ABF90257.1| putative lipoprotein [Myxococcus xanthus DK 1622]
Length = 868
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 66/153 (43%), Gaps = 25/153 (16%)
Query: 68 IEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG 127
+ GN +G P + + DG L D + + + ++TP G V G
Sbjct: 584 VAGNSLGA-PTAVALLPDGGLVVADGMGNAVKRVTP-------GGEVTTVASG------- 628
Query: 128 KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSED 186
N P G+ D GN+YVADT + IR+I G V AGG G DGP++
Sbjct: 629 ------LNGPMGIAADAAGNVYVADTDHYVIRRIDPEGKVEVFAGGTP---GLMDGPAKQ 679
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
A F+ + +LLV D N +R+I L
Sbjct: 680 AAFNQPTGLAVTPDGTALLVADMNNGVIRRIDL 712
Score = 42.4 bits (98), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 60/144 (41%), Gaps = 25/144 (17%)
Query: 40 WTTRSSSKAPQADGN-----------VLQFENGYLVETVIEGNEIGV----------VPY 78
W R S A ADGN V++ +G + +V+ G G +PY
Sbjct: 726 WLYRPSGVAVSADGNTLFVVESGMSRVVRIRDG--LTSVVAGTTPGFRDGAPESSQFLPY 783
Query: 79 -KIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHP 137
I V +DG L D N + ++ +R ++ + G GH DG + A P
Sbjct: 784 LGIAVLKDGSLAVSDPGNYRVRRVVLNADGNAR-KVTTLAGSGRYGHSDGPGDAADLVLP 842
Query: 138 KGVTMDDKGNLYVADTLNLAIRKI 161
G+T+ G LYVAD N +R I
Sbjct: 843 AGLTVGPDGRLYVADAGNSLVRAI 866
>gi|336322170|ref|YP_004602138.1| alkyl hydroperoxide reductase [[Cellvibrio] gilvus ATCC 13127]
gi|336105751|gb|AEI13570.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [[Cellvibrio] gilvus ATCC 13127]
Length = 638
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 70/143 (48%), Gaps = 15/143 (10%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-------GDA 164
R VAG+ G +DG +A F P G+ +DD+G +++AD+ A+R + GD
Sbjct: 347 RHVAGTMN--EGLLDGPLADAWFAQPSGLAVDDEGGIWLADSETSALRVVEPVHGSAGDG 404
Query: 165 GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC 224
V T+ G G RDG +++A+ + V V P S+ V D N A+R+ D
Sbjct: 405 SVRTVVGAGLFEFGHRDGAADEARLQHPLGVA-VLPDGSVAVADTYNGAVRRFVAGTDGA 463
Query: 225 EYQYNSIS-----PTDILMVVGA 242
+I+ P+ ++++ GA
Sbjct: 464 GGAVTTIATGLAEPSGLVVLDGA 486
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 57/120 (47%), Gaps = 13/120 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRG--RLVAGSFQGYTGHVDGKPNEARF 134
P + V ++G ++ D S + + P G R V G+ GH DG +EAR
Sbjct: 370 PSGLAVDDEGGIWLADSETSALRVVEPVHGSAGDGSVRTVVGAGLFEFGHRDGAADEARL 429
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI-----GDAG-VTTIAGGKSNVAGF-----RDGP 183
HP GV + G++ VADT N A+R+ G G VTTIA G + +G DGP
Sbjct: 430 QHPLGVAVLPDGSVAVADTYNGAVRRFVAGTDGAGGAVTTIATGLAEPSGLVVLDGADGP 489
Score = 45.4 bits (106), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 57/252 (22%), Positives = 102/252 (40%), Gaps = 55/252 (21%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K DG L D + ++ ++ P R R+ +G G VDG P++ARF+
Sbjct: 177 PAKAVALPDGNLLVADAGHHSLAELAPDGQTLVR-RIGSGE----RGLVDGGPDDARFSE 231
Query: 137 PKGVTM--DD-----KGNLYVADTLNLAIR--KIGDAGVTTIAG-GKSNVAGFRD----- 181
P G+ + D+ ++ VADT+N A+R ++ D V+T+AG G+ + G D
Sbjct: 232 PNGLCLVPDELRPWVGYDVLVADTVNHALRGVRLADGHVSTVAGTGEQFMVGAADNVLPG 291
Query: 182 ------------GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYN 229
G + S+ +DV + + +V GN L + ++
Sbjct: 292 WGGDDRSGRAYAGAPLAVRLSSPWDVTWSAAWGAFVVAMAGNHTL-----------WAFD 340
Query: 230 SISPTDILMVVGAVLVGYVTCMLQQGF--GPFFFSRTQQPSESEFNEETEIKELSNKEKP 287
P+ + +V + +G GP + QPS ++E I ++
Sbjct: 341 PKEPS----------LRHVAGTMNEGLLDGPLADAWFAQPSGLAVDDEGGIWLADSETSA 390
Query: 288 IPIVESMKEEPG 299
+ +VE + G
Sbjct: 391 LRVVEPVHGSAG 402
>gi|195609352|gb|ACG26506.1| hypothetical protein [Zea mays]
gi|414870563|tpg|DAA49120.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
Length = 159
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 61/117 (52%), Gaps = 6/117 (5%)
Query: 58 FENGYLVETVIEGNEI---GVVPYKIRV-SEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
E+GY V T + N + G PY + S G+L +D S + I+ S R
Sbjct: 29 LEDGYTVTTFADLNPLPASGPHPYAVLPRSRAGDLLLLDSAGSALYTISL-SSSPGEPRR 87
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA 170
+AG +G +G DG P +A F+ P+ V +D N+YVAD L+ A+RK+ +G T A
Sbjct: 88 LAGGKRG-SGFNDGGPGDAAFDRPRSVAVDAADNVYVADRLHGAVRKVAPSGERTRA 143
>gi|29349195|ref|NP_812698.1| hypothetical protein BT_3787 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341103|gb|AAO78892.1| conserved hypothetical protein with NHL repeat [Bacteroides
thetaiotaomicron VPI-5482]
Length = 452
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 57/107 (53%), Gaps = 2/107 (1%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGG 172
L AG + +G+V+G ARF++P+ D GN++V + IRKI G ++ G
Sbjct: 347 LFAGGWD-ESGYVNGSGVTARFDNPRQPAFDQDGNMFVPEYGRHTIRKITPTGEVSLYAG 405
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
AGF DG E A+F N + V V SL V DR N +R++++
Sbjct: 406 LPGQAGFTDGLPEKARF-NKPECVTVYLDNSLYVADRDNHLIRRVTV 451
>gi|383784367|ref|YP_005468936.1| hypothetical protein LFE_1113 [Leptospirillum ferrooxidans C2-3]
gi|383083279|dbj|BAM06806.1| hypothetical protein LFE_1113 [Leptospirillum ferrooxidans C2-3]
Length = 382
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 58/112 (51%), Gaps = 6/112 (5%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI----GDAGVT 167
+ +AGSF G DG ARF P+G+ G++YVADT N IRKI G V
Sbjct: 44 KTIAGSFH-ERGDNDGNGASARFEFPQGIVAAPDGSIYVADTGNDMIRKITMSSGTGTVE 102
Query: 168 TIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
TIA G ++ A FRDG A+F+N + +L V D N +R+I L
Sbjct: 103 TIA-GVNHHARFRDGAGAAARFNNPEGLAISPDGKTLYVADSRNNRIRKIDL 153
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/151 (34%), Positives = 72/151 (47%), Gaps = 12/151 (7%)
Query: 77 PYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + +S DG+ L+ D N+ I K+ Q + +AG GH DG +A F
Sbjct: 182 PRGLAISPDGKTLYIGDSGNNMIRKMDLTTLQVTT---LAGQGALVPGHADGVGTQATFL 238
Query: 136 HPKGVTMDDKGN-LYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
P+G+ + G LY+ADT N IRK + V+T+A G G +GP DA F
Sbjct: 239 EPRGLAISPDGQILYIADTRNHLIRKLVLATNSVSTLA-GHPGFPGTLNGPGPDAYF--- 294
Query: 193 FD-VVYVRPTCSLLVIDRGNAALRQISLNQD 222
+D ++ L V D NA LR I LN D
Sbjct: 295 YDPMMLAIDGNKLYVCDAANADLRLIDLNSD 325
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 68/149 (45%), Gaps = 13/149 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV---DGKPNEAR 133
P I + DG ++ D N I KIT S G + G H DG AR
Sbjct: 68 PQGIVAAPDGSIYVADTGNDMIRKIT-----MSSGTGTVETIAGVNHHARFRDGAGAAAR 122
Query: 134 FNHPKGVTMDDKG-NLYVADTLNLAIRKIGDA--GVTTIAGGKSNVAGFRDGPSEDAKFS 190
FN+P+G+ + G LYVAD+ N IRKI A V+T+AG A DG + A F+
Sbjct: 123 FNNPEGLAISPDGKTLYVADSRNNRIRKIDLATKTVSTLAG--HAFASGNDGIGDHAGFA 180
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+ +L + D GN +R++ L
Sbjct: 181 QPRGLAISPDGKTLYIGDSGNNMIRKMDL 209
>gi|115376708|ref|ZP_01463935.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
gi|310822897|ref|YP_003955255.1| hypothetical protein STAUR_5665 [Stigmatella aurantiaca DW4/3-1]
gi|115366257|gb|EAU65265.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
gi|309395969|gb|ADO73428.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
Length = 856
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 46/87 (52%), Gaps = 1/87 (1%)
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
N P G+ D GN+YVAD+ N IR+I G T+ G G DG ++ A+F+
Sbjct: 616 LNGPMGIAADASGNIYVADSDNHCIRRIAPDGTATVFAGAVMEPGQGDGTAKAARFNQPA 675
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
+ + P LLV D GN +R++ L+
Sbjct: 676 GLAF-GPGGELLVADLGNGVIRRVDLS 701
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 62/145 (42%), Gaps = 26/145 (17%)
Query: 39 KWTTRSSSKAPQADGN----------VLQFENGYLVETVIEGNEIG----------VVPY 78
KW R S+ A ADG VL+ NG V ++I G+ G +PY
Sbjct: 714 KWMYRPSAVAVAADGTLYVVETGMARVLEVRNG--VVSIIAGSSPGYADGTPTSSQFLPY 771
Query: 79 -KIRVSEDGELFAVDEVNSNIVKIT-PPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
+ V +DG L D N I +I + + +AGS G G DG+ A
Sbjct: 772 LGLAVLKDGSLAVADPGNYRIRRILFSAQGKATEVTTLAGS--GRFGSRDGEGQSADLVL 829
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ + + G LYVAD N +R +
Sbjct: 830 PAGLAVGEDGTLYVADAGNALLRAV 854
Score = 42.0 bits (97), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 35/85 (41%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G ++ D N I +I P + AG+ G DG ARFN
Sbjct: 619 PMGIAADASGNIYVADSDNHCIRRIAPD----GTATVFAGAVM-EPGQGDGTAKAARFNQ 673
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ G L VAD N IR++
Sbjct: 674 PAGLAFGPGGELLVADLGNGVIRRV 698
>gi|452822384|gb|EME29404.1| haloacid dehalogenase-like hydrolase family protein [Galdieria
sulphuraria]
Length = 679
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 68/153 (44%), Gaps = 7/153 (4%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS--FQGYT---GHVDGKP 129
P I VS +G L+ VD +S+I++ +++ G LV G F G G +G
Sbjct: 512 CAPSGITVSPNGTLYVVDS-DSSIIRWLS-MTKNQVGTLVGGDAIFTGNLSAFGDRNGVS 569
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
+ R P G+ DK L VADT N ++ I + G+ DGP AKF
Sbjct: 570 SSVRLQRPMGICYMDKDQLIVADTFNHKLKCIHTIQRDCRWLCGDSQLGYVDGPKTYAKF 629
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
DV + + L ++DR N +R + N +
Sbjct: 630 HCPCDVAWDPISQRLYIVDRENHVIRWMDWNSN 662
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/183 (27%), Positives = 81/183 (44%), Gaps = 28/183 (15%)
Query: 66 TVIEGNEIGVV------------PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
TV+ IGV+ P + + E+ +FAV ++ I + P +L
Sbjct: 434 TVLHSKSIGVLSFSGTSECTFRYPMDVTIYEN-TIFAVAAGSNEIWSLNPS----GEVKL 488
Query: 114 VAGSFQGYTGHVD--GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTI 169
V G+ + +GH+D G + AR P G+T+ G LYV D+ + IR + V T+
Sbjct: 489 VCGNGKYPSGHIDTEGDLSRARLCAPSGITVSPNGTLYVVDSDSSIIRWLSMTKNQVGTL 548
Query: 170 AGGKS----NVAGF--RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDD 223
GG + N++ F R+G S + + Y+ L+V D N L+ I Q D
Sbjct: 549 VGGDAIFTGNLSAFGDRNGVSSSVRLQRPMGICYMDKD-QLIVADTFNHKLKCIHTIQRD 607
Query: 224 CEY 226
C +
Sbjct: 608 CRW 610
>gi|290978674|ref|XP_002672060.1| predicted protein [Naegleria gruberi]
gi|284085634|gb|EFC39316.1| predicted protein [Naegleria gruberi]
Length = 1219
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 71/139 (51%), Gaps = 11/139 (7%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG-KPNEARFNHPKGVTMDD 144
GE++ D N+ I KI L+ + + GY G DG +A N P GV +D
Sbjct: 430 GEIYIADTENNRIRKI---LTNGTITTIAGTGDYGYNG--DGIMAVDAWLNEPSGVEIDS 484
Query: 145 -KGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 201
G ++ +DT N IR++ ++G +TTIAG GKS G DG + D + ++ YV T
Sbjct: 485 TSGEVFFSDTENFRIRRVSNSGIITTIAGTGKSKFNG--DGMATDTNLATPTEIQYVPST 542
Query: 202 CSLLVIDRGNAALRQISLN 220
+ D GN +R+ ++N
Sbjct: 543 TEIYFADSGNNRIRKFTIN 561
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 85/199 (42%), Gaps = 20/199 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + ++E GE+F D N + KIT + +V F+ Y D N F +
Sbjct: 309 PNSVTLNELGEVFISDSQNYRVRKITTESGIITT--IVGNGFEKYC--TDLASNTPLF-Y 363
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR-DG-PSEDAKFSNDF 193
P+G+ ++ G + AD+ N IRKI G + TIAG + G+ DG + A+ +
Sbjct: 364 PRGIISNENGEFFYADSSNHCIRKILTNGTILTIAG--TGTKGYNGDGIEATSAQLNTPH 421
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN------QDDCEYQYNSISPTDILMVVGAVLVGY 247
DV T + + D N +R+I N +Y YN D +M V A L
Sbjct: 422 DVALNLATGEIYIADTENNRIRKILTNGTITTIAGTGDYGYNG----DGIMAVDAWLNEP 477
Query: 248 VTCMLQQGFGPFFFSRTQQ 266
+ G FFS T+
Sbjct: 478 SGVEIDSTSGEVFFSDTEN 496
Score = 44.7 bits (104), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 18/141 (12%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGS-FQGYTGHVDG-KPNEARFNHPKGV 140
D E+ D N I K+ ++ G +V AG+ F GY G DG A+ N+P G+
Sbjct: 35 DEEMLIADATNHRIRKV------FTNGTIVTIAGNGFAGYNG--DGLDATSAQLNNPVGI 86
Query: 141 TMD-DKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVV 196
+D + +Y+AD+ N IRKI G +TTIAG + +AG+ D ++ A+ + +V
Sbjct: 87 YVDTNSREVYIADSNNHRIRKILQNGKITTIAG--TGIAGYNGDDKSADSAQLNTPSGIV 144
Query: 197 YVRPTCSLLVIDRGNAALRQI 217
+ + D N +R+I
Sbjct: 145 IDPNNGEIFISDSKNHRIRKI 165
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 46/91 (50%), Gaps = 14/91 (15%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRL--VAG-SFQGYTGHVDGKP-NEARFNHPKGVTM 142
E++ D N+ I K T G + +AG S GY G D P A N P GVT
Sbjct: 544 EIYFADSGNNRIRKFTI------NGMMTTIAGTSTSGYNG--DNMPATRAWLNFPVGVTY 595
Query: 143 DDKGN-LYVADTLNLAIRKIGDAG-VTTIAG 171
D K N +Y+AD N IRKI G +TTIAG
Sbjct: 596 DPKTNQVYIADLSNHRIRKILTNGTITTIAG 626
>gi|392945711|ref|ZP_10311353.1| serine/threonine protein kinase [Frankia sp. QA3]
gi|392289005|gb|EIV95029.1| serine/threonine protein kinase [Frankia sp. QA3]
Length = 859
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 77/147 (52%), Gaps = 10/147 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKP-NEARF 134
P++I +++DG L+ D + I K+TP S VAG+ + GY G DG P A+
Sbjct: 710 PHEITMTDDGTLYFADLRSETIQKVTPDGIISS----VAGTGEAGYAG--DGGPARSAKL 763
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDF 193
N P D LY+AD N IRKI +G+ TT+AG + +G GP+ A+ N
Sbjct: 764 NGPSLSIGPDGRTLYLADYHNNRIRKIDPSGIITTVAGIGTAGSGGDGGPATAAQLKNPT 823
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
VV V +L + D GNA +R+I N
Sbjct: 824 SVV-VDGAGALYIADNGNARVRRIDPN 849
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 65/238 (27%), Positives = 91/238 (38%), Gaps = 50/238 (21%)
Query: 74 GVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN--- 130
G+ PY + + DG L I KITP AG+ + G G
Sbjct: 539 GLSPYSLELDLDGSLLVSSLATDRIQKITP-----------AGAANDFAGTGAGGSGGDG 587
Query: 131 ----EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFR--DGP 183
A+ N P T D GN+Y+ D N IRKI AGV +TIAG + AG+ GP
Sbjct: 588 GPAIAAQLNGPGSTTRDKAGNIYIGDAQNNRIRKISPAGVISTIAG--TGTAGYSGDGGP 645
Query: 184 SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMV-VGA 242
+ A+ N+ + V P S+ D N +R+ I P I+ VG
Sbjct: 646 ATAAQL-NNAEQVTTGPDGSVYFSDYENHRIRK--------------IDPAGIITTYVGT 690
Query: 243 VLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEI------KELSNKEKPIPIVESM 294
+ GY GP +R P E ++ + E K P I+ S+
Sbjct: 691 GVAGYTGA-----GGPATQARIDGPHEITMTDDGTLYFADLRSETIQKVTPDGIISSV 743
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 58/117 (49%), Gaps = 11/117 (9%)
Query: 79 KIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
++ DG ++ D N I KI P ++ Y V GYTG G +AR +
Sbjct: 656 QVTTGPDGSVYFSDYENHRIRKIDPAGIITTY-----VGTGVAGYTG-AGGPATQARIDG 709
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG-GKSNVAGFRDGPSEDAKFSN 191
P +TM D G LY AD + I+K+ D ++++AG G++ AG GP+ AK +
Sbjct: 710 PHEITMTDDGTLYFADLRSETIQKVTPDGIISSVAGTGEAGYAG-DGGPARSAKLNG 765
>gi|358458536|ref|ZP_09168745.1| NHL repeat containing protein [Frankia sp. CN3]
gi|357078265|gb|EHI87715.1| NHL repeat containing protein [Frankia sp. CN3]
Length = 674
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 69/146 (47%), Gaps = 6/146 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V++DG ++ D N I KI P + + + G G + +A N
Sbjct: 406 PNGVAVTDDGTVYIDDNHNDRIRKIDPSGTIRTIAGIGTGDGHGTFSGDNSAATKAGLNS 465
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRD---GPSEDAKFSN 191
P+G+ + G +Y+ADT N +RKI +TT+AG ++ D G + A S
Sbjct: 466 PQGIAVTSDGTVYIADTANNRVRKIDPSSGTITTVAGTGASTGSVSDDDGGLATQADLSA 525
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
DV V P +L ++D G+ +R++
Sbjct: 526 PADVA-VGPGGALYIVDTGHDRIRKV 550
Score = 42.0 bits (97), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 10/108 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD----GKPNEA 132
P I V+ DG ++ D N+ + KI P + VAG+ TG V G +A
Sbjct: 466 PQGIAVTSDGTVYIADTANNRVRKIDPSSGTIT---TVAGTGA-STGSVSDDDGGLATQA 521
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAG 178
+ P V + G LY+ DT + IRK+ G +TT+AG G+ +AG
Sbjct: 522 DLSAPADVAVGPGGALYIVDTGHDRIRKVDAQGRITTVAGTGEPGLAG 569
>gi|153808835|ref|ZP_01961503.1| hypothetical protein BACCAC_03135 [Bacteroides caccae ATCC 43185]
gi|149128661|gb|EDM19879.1| NHL repeat protein [Bacteroides caccae ATCC 43185]
Length = 426
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 49/97 (50%), Gaps = 1/97 (1%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G+ DGK ARFN P + D G +Y+AD+ N IR I G + G AG+ DG
Sbjct: 330 GYEDGKRLNARFNFPSQICFDLDGIMYIADSSNHCIRSIDREGAVSTVIGVPGRAGYVDG 389
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+DA F + V V ++ + D N +R++++
Sbjct: 390 TPDDALFDEPWGVA-VDEEGTIYIADTKNKCIRKLAI 425
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P +I DG ++ D N I I + V G G G+VDG P++A F+
Sbjct: 344 PSQICFDLDGIMYIADSSNHCIRSI----DREGAVSTVIG-VPGRAGYVDGTPDDALFDE 398
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG 162
P GV +D++G +Y+ADT N IRK+
Sbjct: 399 PWGVAVDEEGTIYIADTKNKCIRKLA 424
>gi|325102830|ref|YP_004272484.1| cell surface receptor IPT/TIG domain protein [Pedobacter saltans
DSM 12145]
gi|324971678|gb|ADY50662.1| cell surface receptor IPT/TIG domain protein [Pedobacter saltans
DSM 12145]
Length = 491
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 56/107 (52%), Gaps = 5/107 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PYK V + G ++ D N I KI LS + + G+ +G+ DG N
Sbjct: 225 PYKTAVDKYGNIYVADNGNHRIRKI--DLSTNTVSTIAGGAGAATSGYTDGIGANGLLNT 282
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDG 182
P G+ +DD N+YVAD N A+RKI D V+T+AG + +AG DG
Sbjct: 283 PTGLAVDDDLNVYVADAANHAVRKITPDGRVSTLAG--NGIAGIADG 327
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 71/147 (48%), Gaps = 7/147 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + +DG L++ D I KI + G+ G GHVDG+ +ARFN+
Sbjct: 337 PTAVCMGKDGFLYSADRYGQRIRKIDVRTGKTVTIAGSGGNAAGTGGHVDGEVLKARFNN 396
Query: 137 PKGVTMDDKGNLYVADTLNLA-----IRKIGDAGVTTIAGGKS-NVAGFRDGPSEDAKFS 190
G+ +D G +YV++ A IR I + V+TI G + + G+ +G ++F
Sbjct: 397 IWGMDIDKDGTIYVSELEGTAGKSHTIRMIKNGQVSTIGGPDAFDNNGYVNGLPGISRFY 456
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQI 217
N DV V ++ + D N +R+I
Sbjct: 457 NPTDVA-VDEEGNVFIADMNNYVIRKI 482
Score = 46.2 bits (108), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 49/120 (40%), Gaps = 11/120 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V +D ++ D N + KITP GR+ + G G DG N
Sbjct: 283 PTGLAVDDDLNVYVADAANHAVRKITPD------GRVSTLAGNGIAGIADGIWPNVTVNR 336
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVA---GFRDGPSEDAKFSN 191
P V M G LY AD IRKI TIAG N A G DG A+F+N
Sbjct: 337 PTAVCMGKDGFLYSADRYGQRIRKIDVRTGKTVTIAGSGGNAAGTGGHVDGEVLKARFNN 396
Score = 43.1 bits (100), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 55/109 (50%), Gaps = 8/109 (7%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA--- 170
+AG+ G G VDG + A FN+P +D GN+YVAD N IRKI D T++
Sbjct: 204 IAGT--GTAGRVDGDISIASFNNPYKTAVDKYGNIYVADNGNHRIRKI-DLSTNTVSTIA 260
Query: 171 -GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G + +G+ DG + N + V ++ V D N A+R+I+
Sbjct: 261 GGAGAATSGYTDGIGANGLL-NTPTGLAVDDDLNVYVADAANHAVRKIT 308
>gi|359777541|ref|ZP_09280822.1| hypothetical protein ARGLB_073_01730 [Arthrobacter globiformis NBRC
12137]
gi|359305319|dbj|GAB14651.1| hypothetical protein ARGLB_073_01730 [Arthrobacter globiformis NBRC
12137]
Length = 666
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 55/103 (53%), Gaps = 6/103 (5%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTI--AGGKS 174
G G +DG +EA F P G+ D GN++VAD+ A+RK I D G T+ A GK
Sbjct: 376 NGLEGLLDGPAHEAWFAQPSGLAEDADGNIWVADSETSALRKLVISDDGAVTVESAVGKG 435
Query: 175 NVA-GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
GFRDG + +A+ + V V P S+ + D N A+R+
Sbjct: 436 LFDFGFRDGEASEARLQHPLGVT-VLPDGSVAIADTYNGAVRR 477
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGG 172
G DG+ +EAR HP GVT+ G++ +ADT N A+R+ A V+T+A G
Sbjct: 440 GFRDGEASEARLQHPLGVTVLPDGSVAIADTYNGAVRRYDPAAGTVSTLARG 491
>gi|399034109|ref|ZP_10732477.1| gluconolactonase [Flavobacterium sp. CF136]
gi|398067543|gb|EJL59038.1| gluconolactonase [Flavobacterium sp. CF136]
Length = 418
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 9/105 (8%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG 146
L+ ++ S I+K+TP G L + TG +G EA+F+ P G+ D
Sbjct: 251 NLYISEDATSKIMKLTPT------GILTIFAGANGTGFQNGNIKEAKFSKPSGIVFDKMD 304
Query: 147 NLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
NLYV D N IRKI D V+T+AG S + G +DG + +A F+
Sbjct: 305 NLYVVDRYNNRIRKIAVDGTVSTVAG--SGIRGNKDGITSEATFN 347
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 51/88 (57%), Gaps = 2/88 (2%)
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
R G+ +D K NLY+++ I K+ G+ TI G +N GF++G ++AKFS
Sbjct: 237 RVGAASGMCLDSKNNLYISEDATSKIMKLTPTGILTIFAG-ANGTGFQNGNIKEAKFSKP 295
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLN 220
+V+ + +L V+DR N +R+I+++
Sbjct: 296 SGIVFDKMD-NLYVVDRYNNRIRKIAVD 322
>gi|421111981|ref|ZP_15572448.1| hypothetical protein LEP1GSC071_4063 [Leptospira santarosai str.
JET]
gi|410802632|gb|EKS08783.1| hypothetical protein LEP1GSC071_4063 [Leptospira santarosai str.
JET]
Length = 356
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 62/118 (52%), Gaps = 8/118 (6%)
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-- 161
PL Q+S L AG+ G G +G + F P + +D + NLYV + N AIRKI
Sbjct: 148 PLDQFS---LYAGNSSGVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHAIRKINL 204
Query: 162 GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
V+T++GG + G+ DG A+F + + Y + T SLLV D + +R+I L
Sbjct: 205 NSGSVSTLSGG---ILGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDL 259
>gi|392403177|ref|YP_006439789.1| NHL repeat containing protein [Turneriella parva DSM 21527]
gi|390611131|gb|AFM12283.1| NHL repeat containing protein [Turneriella parva DSM 21527]
Length = 392
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 62/115 (53%), Gaps = 11/115 (9%)
Query: 82 VSEDG-ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT--GHVDGKPNEARFNHPK 138
++ DG L+ D N+ I KI + ++AG G T G DG N+ARFN+P+
Sbjct: 139 ITTDGTNLYVADNSNNKIRKIVIASRAVT---VLAGPAAGDTTSGDTDGTANDARFNNPQ 195
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGF--RDGPSEDAKF 189
G+ D NL+VAD+LN +RKI A V+T+AG V G DG + A+F
Sbjct: 196 GIVTDGT-NLFVADSLNRKVRKIVIASGIVSTLAGPAQGVTGSGDTDGSANTARF 249
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 38/66 (57%), Gaps = 5/66 (7%)
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAG--GKSNVAGFRDGPSE 185
N ARFN+P G+T D +LYVADT N IRK I VTT+AG + +G +G
Sbjct: 303 NSARFNNPLGITTDGT-SLYVADTSNQKIRKIVIATGAVTTVAGPAQGATTSGDTEGVGT 361
Query: 186 DAKFSN 191
A+F N
Sbjct: 362 SARFFN 367
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 58/113 (51%), Gaps = 11/113 (9%)
Query: 114 VAGSFQGY--TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTI 169
+AG QG +G DG + ARFN GVT D + N+Y D IRK I V+T+
Sbjct: 53 LAGPSQGTAASGDTDGTGSAARFNAVTGVTTDGQ-NIYSTDYFGHKIRKTVISSGVVSTL 111
Query: 170 AG---GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
AG G S ++G DG E A+FS+ + T +L V D N +R+I +
Sbjct: 112 AGPGPGTS-LSGDVDGIGEAARFSS-MRAITTDGT-NLYVADNSNNKIRKIVI 161
>gi|405345633|ref|ZP_11022426.1| hypothetical protein A176_0011 [Chondromyces apiculatus DSM 436]
gi|397093682|gb|EJJ24379.1| hypothetical protein A176_0011 [Myxococcus sp. (contaminant ex DSM
436)]
Length = 870
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 67/146 (45%), Gaps = 11/146 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG-KPNEARFN 135
P + V G ++ D+ N I + + +V + G G D + +A F
Sbjct: 670 PMALAVDAAGNVYVADQDNHRIRMVRAGTRE-----VVTLAGTGTLGTADAVRGRDASFA 724
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI---GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSN 191
P + + G LYV DT++ +R++ G V T+AG G GF+DGP DA+F
Sbjct: 725 APSALALGGVGTLYVLDTVSQRLRRVSLQGSRAVVTLAGTGAGTPFGFQDGPGSDARFRA 784
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
+V + P LL+ D N LR+I
Sbjct: 785 QLGMV-MGPQGELLLADTANLRLRKI 809
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 58/128 (45%), Gaps = 7/128 (5%)
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD 163
P+ Q++ G +G DG + A F P G+ + G + VADT N IR I
Sbjct: 522 PVGQWAASVEAYAGRVGASGWQDGPADTAMFQSPTGLAVTHAGEVVVADTRNNRIRLIQQ 581
Query: 164 AG----VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G V+TIAG S G RDG A + VV PT L V D GN +R++
Sbjct: 582 EGAGRTVSTIAG--SGELGHRDGAGSQALLRSPTAVV-AGPTGELYVADSGNHVIRRLDR 638
Query: 220 NQDDCEYQ 227
++ + +
Sbjct: 639 GEEGWQVR 646
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 94/222 (42%), Gaps = 31/222 (13%)
Query: 15 AFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFEN-GYLVETVIEGNEI 73
A T FQ+ P G + H ++ TR++ ++Q E G V T+ E+
Sbjct: 547 ADTAMFQS---PTGLAVTHAGEVVVADTRNNRI------RLIQQEGAGRTVSTIAGSGEL 597
Query: 74 G-----------VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY- 121
G P + GEL+ D N I ++ + R AG QG+
Sbjct: 598 GHRDGAGSQALLRSPTAVVAGPTGELYVADSGNHVIRRLDRGEEGWQV-RTWAG--QGFV 654
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGF 179
G DG P ARF+ P + +D GN+YVAD N IR + G V T+AG + G
Sbjct: 655 AGFADGGPARARFSRPMALAVDAAGNVYVADQDNHRIRMVRAGTREVVTLAG--TGTLGT 712
Query: 180 RDG-PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
D DA F+ + + +L V+D + LR++SL
Sbjct: 713 ADAVRGRDASFAAP-SALALGGVGTLYVLDTVSQRLRRVSLQ 753
>gi|196233467|ref|ZP_03132310.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
gi|196222463|gb|EDY16990.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
Length = 509
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 53/97 (54%), Gaps = 6/97 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + E G + D N + +I+P +AG+ G G DG ARF +
Sbjct: 225 PSGLAIDEHGNVDVADRTNQTLRRISPS----GETETIAGT-AGEHGTADGAGKAARFWY 279
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG 172
P+G+ +D+ GN+YVADT ++RK+ G V+T+AGG
Sbjct: 280 PQGLAIDEHGNVYVADTSAKSVRKVTPNGQVSTLAGG 316
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 67/143 (46%), Gaps = 10/143 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V+ +G++F D N I KI + + ++AG G +G +G EARF+
Sbjct: 320 PTAVAVNHEGDVFVTDSSNHVIQKI----AHDGKVSVLAGK-PGESGAKNGHGEEARFHF 374
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ + G LYV + + +IRKI G VTT AG G DG A+F
Sbjct: 375 PNGIAIGADGALYVVE--HSSIRKIDGKGNVTTFAGSLEE-EGRNDGTGAAARFWG-LTA 430
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ +L V D LR+IS
Sbjct: 431 IAADARGNLYVADHELKNLRRIS 453
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 1/73 (1%)
Query: 118 FQGYT-GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNV 176
F G T G DG A+F +P G+ +D+ GN+ VAD N +R+I +G T G +
Sbjct: 205 FAGETEGAADGAGVTAQFKNPSGLAIDEHGNVDVADRTNQTLRRISPSGETETIAGTAGE 264
Query: 177 AGFRDGPSEDAKF 189
G DG + A+F
Sbjct: 265 HGTADGAGKAARF 277
>gi|302760605|ref|XP_002963725.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
gi|300168993|gb|EFJ35596.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
Length = 1052
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR--KIGDAGVTTIAGGKSNVAGFR 180
G DG ++A F HP G+ G +YVAD+ N I+ + V T+AG + VAG+
Sbjct: 794 GDKDGPASQALFQHPLGILSSANGAIYVADSYNHKIKLMDLASKTVRTVAG--TGVAGYE 851
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSI 231
DG S A+FS + + P SL V D N +R + DD Q ++
Sbjct: 852 DGKSVKAQFSEPAGLA-LGPNGSLFVADTNNNLIRLLKPGADDNAIQVTTL 901
Score = 45.4 bits (106), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 5/83 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I S +G ++ D N I K+ S+ R VAG+ G G+ DGK +A+F+
Sbjct: 808 PLGILSSANGAIYVADSYNHKI-KLMDLASKTVR--TVAGT--GVAGYEDGKSVKAQFSE 862
Query: 137 PKGVTMDDKGNLYVADTLNLAIR 159
P G+ + G+L+VADT N IR
Sbjct: 863 PAGLALGPNGSLFVADTNNNLIR 885
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 49/101 (48%), Gaps = 9/101 (8%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
+G LF D N N + +T Y L GS G G DG + A FN P+GV D
Sbjct: 583 NGRLFISDS-NHNRIVVTDLNGNYI---LQVGS--GSEGLTDGSFSLASFNRPQGVAYDT 636
Query: 145 KGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDG 182
+ N LYVADT N A+R++ V T+AG +R G
Sbjct: 637 EKNILYVADTENHALREVDFVKETVRTLAGTGERGFDYRGG 677
>gi|326435919|gb|EGD81489.1| hypothetical protein PTSG_02206 [Salpingoeca sp. ATCC 50818]
Length = 6977
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 61/136 (44%), Gaps = 10/136 (7%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
+G++ D N+ + ITP G++ G GH DG R P+G+T
Sbjct: 921 NGDIVVCDRDNARVRVITP------EGKVSTALGVGKRGHQDGVGAGVRLAGPRGMTQLA 974
Query: 145 KGNLYVADTLNLAIRKIGD--AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 202
G+ V D N IR++ A V TIAG AG RDG + A+F + +
Sbjct: 975 SGSFVVTDAENHCIREVSSALAKVETIAG--CTRAGVRDGDAAAAEFRYPTHALEIPRQK 1032
Query: 203 SLLVIDRGNAALRQIS 218
+L+ D GN +R IS
Sbjct: 1033 LILITDTGNHTIRAIS 1048
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 45/105 (42%), Gaps = 10/105 (9%)
Query: 70 GNEIGV-------VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT 122
G +GV P I VS G++F D N +V +P + +VAG F
Sbjct: 1125 GKRVGVGVHALLRAPSGIVVSPGGDIFVSDTGNDRVVMYSPVTQEVI---VVAGGFAERE 1181
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT 167
VDG A P+G+T D+ G L VA+ +R+I T
Sbjct: 1182 DMVDGHGRAATLQGPRGLTFDNGGCLIVAEGAGHRVRRIATPAAT 1226
>gi|74317309|ref|YP_315049.1| hypothetical protein Tbd_1291 [Thiobacillus denitrificans ATCC
25259]
gi|74056804|gb|AAZ97244.1| conserved hypothetical protein containing twin-arginine
translocation pathway signal [Thiobacillus denitrificans
ATCC 25259]
Length = 510
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 71/149 (47%), Gaps = 15/149 (10%)
Query: 80 IRVSEDGELFAVDEVNSNIVK----ITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
I +S D + E ++NIV+ + +S Y+ G L+ G QG VDG NEARFN
Sbjct: 295 IVLSPDQRYLYIAERDNNIVRCIDLMNNVVSTYA-GVLMVGQKQG--AFVDGPANEARFN 351
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-------VTTIAGGKSNVAGFRDGPSEDAK 188
+ D+ GNLYVAD N IRKI + V+T AG +G GP+ AK
Sbjct: 352 GCSQIDYDNAGNLYVADRFNHVIRKITPSADPMVGKMVSTYAGVPMQ-SGRISGPASKAK 410
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
F + + R + + D N+ + I
Sbjct: 411 FYEPWGLSVDRKNNLIFIGDTANSRVAVI 439
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 57/119 (47%), Gaps = 13/119 (10%)
Query: 121 YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR 180
+ G +G A F G+ ++D G++ VAD N IR I + G TI G F+
Sbjct: 85 FPGMRNGPALSAFFYPATGMYIEDNGSIKVADARNSLIRNISNLGDVTIFSGSMQQYPFK 144
Query: 181 DGPSEDAKFS--NDFDVVYVRPTCSLLVIDRGNAALRQISLN--------QDDCEYQYN 229
DG + D F+ ND D + + +V DR N A+R++ + Q +C+ +YN
Sbjct: 145 DGSAVDTNFNSPNDIDKL---SDGTYVVGDRENNAIRRVFADGSVKTIAGQGNCKNKYN 200
Score = 42.4 bits (98), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 58/115 (50%), Gaps = 9/115 (7%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMD-DKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
G+ Y G VDG A+F G+ + D+ LY+A+ N +R I + V+T AG
Sbjct: 270 CGALTYYPGRVDGPTATAKFRGACGIVLSPDQRYLYIAERDNNIVRCIDLMNNVVSTYAG 329
Query: 172 ----GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
G+ A F DGP+ +A+F+ + Y +L V DR N +R+I+ + D
Sbjct: 330 VLMVGQKQGA-FVDGPANEARFNGCSQIDYDN-AGNLYVADRFNHVIRKITPSAD 382
>gi|227537797|ref|ZP_03967846.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
gi|227242411|gb|EEI92426.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
Length = 441
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 53/100 (53%), Gaps = 1/100 (1%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF 179
G +G DG +A FN PK + D GN+++AD N IR + G+ + G+ AG+
Sbjct: 342 GGSGFRDGPVADAVFNFPKDIKFDRAGNMFIADYGNHCIRMLSADGIVSTVAGQPTKAGY 401
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+DG +++F+ + V V + + D NA +R++ +
Sbjct: 402 KDGGPVESQFNQPWGVA-VNDQGDIYIADWNNARIRKLVI 440
>gi|300772471|ref|ZP_07082341.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
gi|300760774|gb|EFK57600.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
Length = 441
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 53/100 (53%), Gaps = 1/100 (1%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF 179
G +G DG +A FN PK + D GN+++AD N IR + G+ + G+ AG+
Sbjct: 342 GGSGFRDGPVADAVFNFPKDIKFDRAGNMFIADYGNHCIRMLSADGIVSTVAGQPTKAGY 401
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+DG +++F+ + V V + + D NA +R++ +
Sbjct: 402 KDGGPVESQFNQPWGVA-VNDQGDIYIADWNNARIRKLVI 440
>gi|254390149|ref|ZP_05005369.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
27064]
gi|197703856|gb|EDY49668.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
27064]
Length = 527
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 75/147 (51%), Gaps = 10/147 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P+ + V G L+ V E N+N ++ S VAG+ G TG+ DG A+
Sbjct: 175 PFGVAVDSTGVLY-VTEYNNNRIRKVAADGTIS---TVAGT--GATGNRGDDGPAASAQL 228
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
N P GV +D G +Y+ADT N IRKI D ++T+AG + GP+ A+ +
Sbjct: 229 NRPYGVAVDGAGAVYIADTDNHRIRKITADGTISTVAGTGTGGFDGDGGPATAARLNRPM 288
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
VV + T +L V D GN +R+I+ +
Sbjct: 289 GVV-IDSTGTLYVADYGNHRIRKITTD 314
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 71/146 (48%), Gaps = 8/146 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P ++ V G ++ D N + KIT + + +V G++G DG P AR N
Sbjct: 119 PREVAVDGAGAVYVTDAENHRVRKITADGTIST---VVGTGVAGFSG--DGGPATAARLN 173
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P GV +D G LYV + N IRK+ D ++T+AG + DGP+ A+ + +
Sbjct: 174 RPFGVAVDSTGVLYVTEYNNNRIRKVAADGTISTVAGTGATGNRGDDGPAASAQLNRPYG 233
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLN 220
V V ++ + D N +R+I+ +
Sbjct: 234 VA-VDGAGAVYIADTDNHRIRKITAD 258
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/148 (33%), Positives = 71/148 (47%), Gaps = 12/148 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG-SFQGYTGHVDGKPNEARFN 135
PY I V G L+ D N + KIT + VAG GY G +G A+ N
Sbjct: 63 PYGIAVDSAGTLYFSDYNNHRVRKITTD----GKVSTVAGIGSAGYRGD-NGPAVSAQLN 117
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFR--DGPSEDAKFSND 192
P+ V +D G +YV D N +RKI D ++T+ G + VAGF GP+ A+ +
Sbjct: 118 CPREVAVDGAGAVYVTDAENHRVRKITADGTISTVVG--TGVAGFSGDGGPATAARLNRP 175
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLN 220
F V V T L V + N +R+++ +
Sbjct: 176 FGVA-VDSTGVLYVTEYNNNRIRKVAAD 202
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 66/123 (53%), Gaps = 14/123 (11%)
Query: 102 TPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
TPP+S VAG+ G+TG +G A+ N P G+ +D G LY +D N +RK
Sbjct: 34 TPPIS------TVAGTGVAGFTGD-NGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRK 86
Query: 161 I-GDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
I D V+T+AG S AG+R +GP+ A+ + +V V ++ V D N +R+I
Sbjct: 87 ITTDGKVSTVAGIGS--AGYRGDNGPAVSAQLNCPREVA-VDGAGAVYVTDAENHRVRKI 143
Query: 218 SLN 220
+ +
Sbjct: 144 TAD 146
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 67/143 (46%), Gaps = 6/143 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + V G ++ D N I KIT + + VAG+ G G AR N
Sbjct: 231 PYGVAVDGAGAVYIADTDNHRIRKITADGTIST----VAGTGTGGFDGDGGPATAARLNR 286
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV +D G LYVAD N IRKI D + T+AG + G GP A+ +N V
Sbjct: 287 PMGVVIDSTGTLYVADYGNHRIRKITTDGKIGTVAGTGTGGFGGDGGPPASAQVNNSLGV 346
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V +L + D GN +R+++
Sbjct: 347 A-VDCVDTLYIADYGNHRVRKVA 368
>gi|365864816|ref|ZP_09404493.1| NHL repeat-containing protein [Streptomyces sp. W007]
gi|364005745|gb|EHM26808.1| NHL repeat-containing protein [Streptomyces sp. W007]
Length = 292
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/139 (35%), Positives = 72/139 (51%), Gaps = 8/139 (5%)
Query: 82 VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFNHPKGV 140
++++G LF D N + K+T + + VAG+ GY+G G AR N P GV
Sbjct: 1 MNDEGVLFITDGNNHRVRKVTADGTIST----VAGTGTAGYSGD-SGAATSARLNLPLGV 55
Query: 141 TMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 199
+D NLY++D N +RK+ D +TTIAG S G GPS A+ SN F + V
Sbjct: 56 VVDSADNLYISDYNNHRVRKVTPDGEITTIAGTGSAGFGGDGGPSPVAQLSNPFGLA-VD 114
Query: 200 PTCSLLVIDRGNAALRQIS 218
+L + D N +R+I+
Sbjct: 115 CVDTLYIADHLNNRIRKIA 133
>gi|294817139|ref|ZP_06775781.1| Receptor protein kinase [Streptomyces clavuligerus ATCC 27064]
gi|294321954|gb|EFG04089.1| Receptor protein kinase [Streptomyces clavuligerus ATCC 27064]
Length = 510
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 75/147 (51%), Gaps = 10/147 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P+ + V G L+ V E N+N ++ S VAG+ G TG+ DG A+
Sbjct: 158 PFGVAVDSTGVLY-VTEYNNNRIRKVAADGTIS---TVAGT--GATGNRGDDGPAASAQL 211
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
N P GV +D G +Y+ADT N IRKI D ++T+AG + GP+ A+ +
Sbjct: 212 NRPYGVAVDGAGAVYIADTDNHRIRKITADGTISTVAGTGTGGFDGDGGPATAARLNRPM 271
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
VV + T +L V D GN +R+I+ +
Sbjct: 272 GVV-IDSTGTLYVADYGNHRIRKITTD 297
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 71/146 (48%), Gaps = 8/146 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P ++ V G ++ D N + KIT + + +V G++G DG P AR N
Sbjct: 102 PREVAVDGAGAVYVTDAENHRVRKITADGTIST---VVGTGVAGFSG--DGGPATAARLN 156
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P GV +D G LYV + N IRK+ D ++T+AG + DGP+ A+ + +
Sbjct: 157 RPFGVAVDSTGVLYVTEYNNNRIRKVAADGTISTVAGTGATGNRGDDGPAASAQLNRPYG 216
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLN 220
V V ++ + D N +R+I+ +
Sbjct: 217 VA-VDGAGAVYIADTDNHRIRKITAD 241
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 74/150 (49%), Gaps = 16/150 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAG-SFQGYTGHVDGKPNEAR 133
PY I V G L+ D N + KIT + G++ VAG GY G +G A+
Sbjct: 46 PYGIAVDSAGTLYFSDYNNHRVRKIT------TDGKVSTVAGIGSAGYRGD-NGPAVSAQ 98
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFR--DGPSEDAKFS 190
N P+ V +D G +YV D N +RKI D ++T+ G + VAGF GP+ A+ +
Sbjct: 99 LNCPREVAVDGAGAVYVTDAENHRVRKITADGTISTVVG--TGVAGFSGDGGPATAARLN 156
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
F V V T L V + N +R+++ +
Sbjct: 157 RPFGVA-VDSTGVLYVTEYNNNRIRKVAAD 185
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 66/123 (53%), Gaps = 14/123 (11%)
Query: 102 TPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
TPP+S VAG+ G+TG +G A+ N P G+ +D G LY +D N +RK
Sbjct: 17 TPPIST------VAGTGVAGFTGD-NGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRK 69
Query: 161 I-GDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
I D V+T+AG S AG+R +GP+ A+ + +V V ++ V D N +R+I
Sbjct: 70 ITTDGKVSTVAGIGS--AGYRGDNGPAVSAQLNCPREVA-VDGAGAVYVTDAENHRVRKI 126
Query: 218 SLN 220
+ +
Sbjct: 127 TAD 129
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 67/143 (46%), Gaps = 6/143 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + V G ++ D N I KIT + + VAG+ G G AR N
Sbjct: 214 PYGVAVDGAGAVYIADTDNHRIRKITADGTIST----VAGTGTGGFDGDGGPATAARLNR 269
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV +D G LYVAD N IRKI D + T+AG + G GP A+ +N V
Sbjct: 270 PMGVVIDSTGTLYVADYGNHRIRKITTDGKIGTVAGTGTGGFGGDGGPPASAQVNNSLGV 329
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V +L + D GN +R+++
Sbjct: 330 A-VDCVDTLYIADYGNHRVRKVA 351
>gi|326445969|ref|ZP_08220703.1| hypothetical protein SclaA2_33112 [Streptomyces clavuligerus ATCC
27064]
Length = 508
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 75/147 (51%), Gaps = 10/147 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P+ + V G L+ V E N+N ++ S VAG+ G TG+ DG A+
Sbjct: 156 PFGVAVDSTGVLY-VTEYNNNRIRKVAADGTIS---TVAGT--GATGNRGDDGPAASAQL 209
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
N P GV +D G +Y+ADT N IRKI D ++T+AG + GP+ A+ +
Sbjct: 210 NRPYGVAVDGAGAVYIADTDNHRIRKITADGTISTVAGTGTGGFDGDGGPATAARLNRPM 269
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
VV + T +L V D GN +R+I+ +
Sbjct: 270 GVV-IDSTGTLYVADYGNHRIRKITTD 295
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 71/146 (48%), Gaps = 8/146 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P ++ V G ++ D N + KIT + + +V G++G DG P AR N
Sbjct: 100 PREVAVDGAGAVYVTDAENHRVRKITADGTIST---VVGTGVAGFSG--DGGPATAARLN 154
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P GV +D G LYV + N IRK+ D ++T+AG + DGP+ A+ + +
Sbjct: 155 RPFGVAVDSTGVLYVTEYNNNRIRKVAADGTISTVAGTGATGNRGDDGPAASAQLNRPYG 214
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLN 220
V V ++ + D N +R+I+ +
Sbjct: 215 VA-VDGAGAVYIADTDNHRIRKITAD 239
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/148 (33%), Positives = 71/148 (47%), Gaps = 12/148 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG-SFQGYTGHVDGKPNEARFN 135
PY I V G L+ D N + KIT + VAG GY G +G A+ N
Sbjct: 44 PYGIAVDSAGTLYFSDYNNHRVRKITTD----GKVSTVAGIGSAGYRGD-NGPAVSAQLN 98
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFR--DGPSEDAKFSND 192
P+ V +D G +YV D N +RKI D ++T+ G + VAGF GP+ A+ +
Sbjct: 99 CPREVAVDGAGAVYVTDAENHRVRKITADGTISTVVG--TGVAGFSGDGGPATAARLNRP 156
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLN 220
F V V T L V + N +R+++ +
Sbjct: 157 FGVA-VDSTGVLYVTEYNNNRIRKVAAD 183
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 66/123 (53%), Gaps = 14/123 (11%)
Query: 102 TPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
TPP+S VAG+ G+TG +G A+ N P G+ +D G LY +D N +RK
Sbjct: 15 TPPIST------VAGTGVAGFTGD-NGPAVAAQLNRPYGIAVDSAGTLYFSDYNNHRVRK 67
Query: 161 I-GDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
I D V+T+AG S AG+R +GP+ A+ + +V V ++ V D N +R+I
Sbjct: 68 ITTDGKVSTVAGIGS--AGYRGDNGPAVSAQLNCPREVA-VDGAGAVYVTDAENHRVRKI 124
Query: 218 SLN 220
+ +
Sbjct: 125 TAD 127
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 67/143 (46%), Gaps = 6/143 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + V G ++ D N I KIT + + VAG+ G G AR N
Sbjct: 212 PYGVAVDGAGAVYIADTDNHRIRKITADGTIST----VAGTGTGGFDGDGGPATAARLNR 267
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV +D G LYVAD N IRKI D + T+AG + G GP A+ +N V
Sbjct: 268 PMGVVIDSTGTLYVADYGNHRIRKITTDGKIGTVAGTGTGGFGGDGGPPASAQVNNSLGV 327
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V +L + D GN +R+++
Sbjct: 328 A-VDCVDTLYIADYGNHRVRKVA 349
>gi|290975887|ref|XP_002670673.1| predicted protein [Naegleria gruberi]
gi|284084234|gb|EFC37929.1| predicted protein [Naegleria gruberi]
Length = 1074
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 70/150 (46%), Gaps = 18/150 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQ-GYTGHVDG-KPNEA 132
P+ I + GE++ D+ N + K+ + G + +AGS G+ G DG
Sbjct: 692 PFSISIGNSGEIYISDQYNQRVRKVA------ANGYISTIAGSGAIGFNG--DGLAATST 743
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
FN+P GV+ + G++++ D+ N IRK+ ++T+AG G DG S + N
Sbjct: 744 CFNYPSGVSSNSNGDVFIIDSFNSRIRKLSSGKISTVAG------GLGDGSSAVNSYLNS 797
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
+ + + D N +R+I+ N D
Sbjct: 798 QSFAISSKSGEIFIADSNNHRIRKIATNGD 827
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 73/150 (48%), Gaps = 19/150 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAG--SFQGYTGHVDGKPNEA 132
PY + V+ GE F D N I K++ + G++ +AG S GY+G G A
Sbjct: 95 PYGVIVNSLGETFIADTSNHVIRKVS------TNGKISTIAGTASSYGYSGD-GGLATNA 147
Query: 133 RFNHPKGVTMDDK-GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAK 188
N P G+ ++ G + + DT N IRK+ G +TTIAG + AG+ +G + +AK
Sbjct: 148 LLNSPYGLALNSSSGEVIIVDTSNNVIRKVSSIGNITTIAG--TGAAGYSGDNGQATNAK 205
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
F+ Y L V D N +R+IS
Sbjct: 206 FNAPRAAFY--SNGELFVADSRNHRIRKIS 233
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 76/155 (49%), Gaps = 11/155 (7%)
Query: 68 IEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPP-LSQYSRGRLVAGSFQGYTGHVD 126
+ GN + PY I + + E+ D N I K++ +S ++ + SF G D
Sbjct: 31 MTGNALLRYPYGIALGLNSEILIADTFNQRIRKVSSSDVSTFAG--VGTSSFSG-----D 83
Query: 127 GK-PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGP- 183
G ++ N P GV ++ G ++ADT N IRK+ G ++TIAG S+ DG
Sbjct: 84 GALATQSEINFPYGVIVNSLGETFIADTSNHVIRKVSTNGKISTIAGTASSYGYSGDGGL 143
Query: 184 SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ +A ++ + + + ++++D N +R++S
Sbjct: 144 ATNALLNSPYGLALNSSSGEVIIVDTSNNVIRKVS 178
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 74/152 (48%), Gaps = 19/152 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL---VAGSFQ-GYTGHVDGKPNEA 132
P + VS GE++ D N I KI S G + VAGS + G++G G A
Sbjct: 518 PKGVVVSSSGEIYFSDSENHRIRKI-------STGGIISTVAGSGESGFSGD-GGLAISA 569
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG-----KSNVAGFRDGPSED 186
+ ++P G+ M+ G L DT N IRK+ +G ++TIAG K+ +G +G + +
Sbjct: 570 KIHYPNGIAMNSNGELIFTDTRNNRIRKVSTSGYISTIAGNGTVAYKATFSG-DNGLAIN 628
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
A+ F V + + D GN +R++S
Sbjct: 629 AQLFVPFAVAVNLTNNDIYIADSGNHRIRKVS 660
Score = 45.1 bits (105), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 72/147 (48%), Gaps = 22/147 (14%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKPNEARFNHPKG 139
S+ GE+F D N I KI + G + VAGS G++G G A N+P
Sbjct: 804 SKSGEIFIADSNNHRIRKIA------TNGDISTVAGSGVAGFSGD-GGLATSATLNNPSY 856
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY- 197
V ++ G L ++DT N IRK+ G +TTIAG + AG+ +G +A + + Y
Sbjct: 857 VAVNSNGELLISDTNNHRIRKVSLNGIITTIAG--NGTAGY-NGEGNNASL---YQLNYP 910
Query: 198 ----VRPTCSLLVIDRGNAALRQISLN 220
V T L + D N +R++++N
Sbjct: 911 SGLVVSSTGDLFIADSYNHRIRKLNVN 937
Score = 45.1 bits (105), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 58/108 (53%), Gaps = 15/108 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGK-PNEA 132
P I +S +G+ + D N I K+ S G + +AG+ GY+G DG A
Sbjct: 463 PNSIFISSNGDSYFSDSSNHKIRKL-------SNGYITTIAGTGTSGYSG--DGSSATSA 513
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAG 178
+ N+PKGV + G +Y +D+ N IRKI G ++T+AG G+S +G
Sbjct: 514 KLNNPKGVVVSSSGEIYFSDSENHRIRKISTGGIISTVAGSGESGFSG 561
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 42/159 (26%), Positives = 71/159 (44%), Gaps = 12/159 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEA---R 133
P + V+ +GEL D N I K++ G + + G G+ +G+ N A +
Sbjct: 854 PSYVAVNSNGELLISDTNNHRIRKVSL------NGIITTIAGNGTAGY-NGEGNNASLYQ 906
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSND 192
N+P G+ + G+L++AD+ N IRK+ G + + G +AG DG + N
Sbjct: 907 LNYPSGLVVSSTGDLFIADSYNHRIRKLNVNGTISTSAGNG-IAGLSGDGSLPTSAQLNF 965
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSI 231
+ V L + D N +R++SL E N +
Sbjct: 966 PTGLAVSSVGELYISDDSNNRIRKVSLTCSQGESLVNGL 1004
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 73/147 (49%), Gaps = 10/147 (6%)
Query: 76 VPYKIRVS-EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEAR 133
VP+ + V+ + +++ D N I K++ + VAG+ G++G +G A+
Sbjct: 633 VPFAVAVNLTNNDIYIADSGNHRIRKVSSSSGIITT---VAGTGTSGFSGD-NGLATNAK 688
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR-DGPSEDAKFSN 191
N P +++ + G +Y++D N +RK+ G ++TIAG S GF DG + + N
Sbjct: 689 LNFPFSISIGNSGEIYISDQYNQRVRKVAANGYISTIAG--SGAIGFNGDGLAATSTCFN 746
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQIS 218
V + +ID N+ +R++S
Sbjct: 747 YPSGVSSNSNGDVFIIDSFNSRIRKLS 773
Score = 42.4 bits (98), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 8/100 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD--GKPNEARF 134
P I V+ +GE+F D VN+ I KI + G ++ + G G G A+
Sbjct: 264 PSGISVNSNGEIFISDSVNNRIRKI------LTNGTIITIAGTGTVGLSGDGGLAVNAQL 317
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS 174
P G+ ++ G ++++D+ N IRKI +GV + G S
Sbjct: 318 WLPSGIVVNSVGEIFISDSYNHRIRKISASGVISTFAGTS 357
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 48/90 (53%), Gaps = 7/90 (7%)
Query: 84 EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFNHPKGVTM 142
+GELF D N I KI S VAG+ G+ G N A+ N+P G+++
Sbjct: 215 SNGELFVADSRNHRIRKI----SNSGIVTTVAGTGTAGFNGDSILAKN-AQLNYPSGISV 269
Query: 143 DDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
+ G ++++D++N IRKI G + TIAG
Sbjct: 270 NSNGEIFISDSVNNRIRKILTNGTIITIAG 299
>gi|302552781|ref|ZP_07305123.1| NHL repeat containing protein [Streptomyces viridochromogenes DSM
40736]
gi|302470399|gb|EFL33492.1| NHL repeat containing protein [Streptomyces viridochromogenes DSM
40736]
Length = 605
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 50/82 (60%), Gaps = 4/82 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNV 176
G G DG +EA FN P+G+T+ D G++ VADT+N A+R++ A VTT+AG G+
Sbjct: 219 GARGFADGSADEAAFNEPQGLTLLDDGSVVVADTVNHALRRLDLATGEVTTLAGTGRQWW 278
Query: 177 AGF-RDGPSEDAKFSNDFDVVY 197
G GP+ + S+ +DV +
Sbjct: 279 QGSPTSGPAREVDLSSPWDVAW 300
Score = 38.5 bits (88), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
GH DG +A HP GVT G++ V+DT N A+R+ A VTT+A
Sbjct: 390 GHRDGAAGQALLQHPLGVTALPDGSVAVSDTYNHALRRYDPATGEVTTLA 439
>gi|428179990|gb|EKX48859.1| hypothetical protein GUITHDRAFT_105483 [Guillardia theta CCMP2712]
Length = 388
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/118 (36%), Positives = 63/118 (53%), Gaps = 12/118 (10%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKI-----GDAGVT 167
VAGS G DG+ A FNHP+G+ M G+ + VAD+ N IRK+ V+
Sbjct: 125 VAGSDDSGAGFADGEGPNALFNHPEGLAMSPDGSFILVADSGNHRIRKVLLSKEDKNLVS 184
Query: 168 TIAGGKSN-----VAGFRDGPSE-DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
TIAGG N GF D + ++ F+ DV ++ +LV+D GN A+R+++L
Sbjct: 185 TIAGGNQNRNVTYSQGFNDADNGVNSSFNRPTDVAFLPNGEEVLVVDAGNHAIRKLTL 242
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 64/119 (53%), Gaps = 11/119 (9%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD-KGNLYVADTLNLAIRKIGD------AG 165
L+AG+ G G+ DG + A+F P G+ + + L V+D N AIR I A
Sbjct: 64 LLAGT--GEAGYKDGIGSSAKFKSPSGLAVQSLRTGLCVSDRGNHAIRIITRESPDKPAV 121
Query: 166 VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS-LLVIDRGNAALRQISLNQDD 223
V T+AG + AGF DG +A F N + + + P S +LV D GN +R++ L+++D
Sbjct: 122 VDTVAGSDDSGAGFADGEGPNALF-NHPEGLAMSPDGSFILVADSGNHRIRKVLLSKED 179
>gi|421106984|ref|ZP_15567544.1| putative lipoprotein [Leptospira kirschneri str. H2]
gi|410007898|gb|EKO61577.1| putative lipoprotein [Leptospira kirschneri str. H2]
Length = 358
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 68/127 (53%), Gaps = 8/127 (6%)
Query: 95 NSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL 154
+SN + P Q+S L AGS +G +G ++ F+ P + +D + NLYV +
Sbjct: 141 DSNQIYKIDPTEQFS---LFAGSSSDLSGLQNGDRLDSLFDSPFFMDIDPERNLYVGELS 197
Query: 155 NLAIRKIG-DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNA 212
N IRKI D+G V+T +GG ++G+ DG A+F + + Y R T SLL D +
Sbjct: 198 NHTIRKINLDSGTVSTFSGG---ISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDH 254
Query: 213 ALRQISL 219
+R+I L
Sbjct: 255 RIRKIDL 261
>gi|418693423|ref|ZP_13254475.1| putative lipoprotein [Leptospira kirschneri str. H1]
gi|409958780|gb|EKO17669.1| putative lipoprotein [Leptospira kirschneri str. H1]
Length = 358
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 68/127 (53%), Gaps = 8/127 (6%)
Query: 95 NSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTL 154
+SN + P Q+S L AGS +G +G ++ F+ P + +D + NLYV +
Sbjct: 141 DSNQIYKIDPTEQFS---LFAGSSSDLSGLQNGDRLDSLFDSPFFMDIDPERNLYVGELS 197
Query: 155 NLAIRKIG-DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNA 212
N IRKI D+G V+T +GG ++G+ DG A+F + + Y R T SLL D +
Sbjct: 198 NHTIRKINLDSGTVSTFSGG---ISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDH 254
Query: 213 ALRQISL 219
+R+I L
Sbjct: 255 RIRKIDL 261
>gi|290982352|ref|XP_002673894.1| predicted protein [Naegleria gruberi]
gi|284087481|gb|EFC41150.1| predicted protein [Naegleria gruberi]
Length = 2807
Score = 56.2 bits (134), Expect = 4e-05, Method: Composition-based stats.
Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 7/104 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFN 135
P+ + +S GE++ D N I K+T Y VAGS GY+G G A+
Sbjct: 221 PFSVSISSTGEIYIADYYNQRIRKVT----TYGFISTVAGSGTAGYSGD-GGLATSAQLY 275
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAG 178
+P GV++ G +Y+AD N IRK+ +G ++TIAG +G
Sbjct: 276 YPLGVSISSAGEIYIADYYNHRIRKVTTSGYISTIAGTTYGFSG 319
Score = 55.8 bits (133), Expect = 4e-05, Method: Composition-based stats.
Identities = 35/99 (35%), Positives = 53/99 (53%), Gaps = 6/99 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +S GE++ D N I K+T S Y +AG+ G++G G A+ +
Sbjct: 277 PLGVSISSAGEIYIADYYNHRIRKVTT--SGYIS--TIAGTTYGFSGD-GGLATSAQLYY 331
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKS 174
P GV+ G +++ADT N IRKI +G ++TIAG S
Sbjct: 332 PNGVSTGSNGEIFIADTYNYRIRKINTSGYISTIAGTSS 370
Score = 52.4 bits (124), Expect = 5e-04, Method: Composition-based stats.
Identities = 43/143 (30%), Positives = 66/143 (46%), Gaps = 12/143 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + VS GE++ VD N I KI S Y +AG+ G A+ N+
Sbjct: 710 PFSVAVSSVGEIYIVDTNNYRIRKINT--SGYIS--TIAGTGTGGYNGDSILATSAQLNY 765
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+T+ + VAD N IRKI +G ++TIAG GF DG F + +
Sbjct: 766 PYGLTISSTSEIIVADYYNHRIRKINTSGYISTIAG------GFGDGDMATTSFISAYSF 819
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ +++ D N +R+I+
Sbjct: 820 EFTL-NGEIIIADSNNHRIRKIT 841
Score = 51.2 bits (121), Expect = 0.001, Method: Composition-based stats.
Identities = 43/143 (30%), Positives = 66/143 (46%), Gaps = 6/143 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +S GE++ D N I KI +S Y GS GY+G G A+ +
Sbjct: 542 PRSVAISSSGEIYIADFNNHRIRKIN--ISGYISTIAGTGSV-GYSGD-GGLATNAQLYY 597
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+ V + G +Y+AD N IRKI +G ++TIAG S G + A+ F V
Sbjct: 598 PQTVAVSSSGEIYIADAYNHRIRKINTSGYISTIAGTGSVGYSGDGGLATSAQLYYPFSV 657
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
+ + + D N +R+I+
Sbjct: 658 A-ISSVGEIYIADTYNHRIRKIN 679
Score = 50.4 bits (119), Expect = 0.002, Method: Composition-based stats.
Identities = 52/179 (29%), Positives = 80/179 (44%), Gaps = 21/179 (11%)
Query: 61 GYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG 120
GY + ++ N P I +S + E++ D N I K+ S Y +AG+ G
Sbjct: 856 GYNGDEILATNSQLNNPNGIALSSNSEIYIADTNNHRIRKVNA--SGYIS--TIAGTGTG 911
Query: 121 -YTGHVDGK-PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVA 177
Y G DG A+ N+P G+ + + G + +AD N IRKI G ++TIAG S +
Sbjct: 912 GYNG--DGVLATSAQLNYPNGIAIQENGEILIADNNNHRIRKIRTNGYISTIAG--SGIG 967
Query: 178 GFRDGP-SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI--------SLNQDDCEYQ 227
GF D + A+ N V + + + D N +R++ L CEYQ
Sbjct: 968 GFTDNTVATSAQLENPLGVA-IGSNKEIFLTDSTNKRIRKLIPWCSENGILINGTCEYQ 1025
Score = 50.4 bits (119), Expect = 0.002, Method: Composition-based stats.
Identities = 45/143 (31%), Positives = 67/143 (46%), Gaps = 6/143 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS GE++ D N I KI S Y GS GY+G G A+ +
Sbjct: 598 PQTVAVSSSGEIYIADAYNHRIRKINT--SGYISTIAGTGSV-GYSGD-GGLATSAQLYY 653
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P V + G +Y+ADT N IRKI +G ++TI+G S G + A+ + F V
Sbjct: 654 PFSVAISSVGEIYIADTYNHRIRKINTSGYISTISGTGSGGYSGDGGLATSAQLNYPFSV 713
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V + ++D N +R+I+
Sbjct: 714 A-VSSVGEIYIVDTNNYRIRKIN 735
Score = 49.3 bits (116), Expect = 0.005, Method: Composition-based stats.
Identities = 39/150 (26%), Positives = 69/150 (46%), Gaps = 12/150 (8%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK--PNEA 132
+ Y + +GE+ D N I KIT + G + S G G+ + +
Sbjct: 814 ISAYSFEFTLNGEIIIADSNNHRIRKIT------TLGYISTISGTGTAGYNGDEILATNS 867
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR-DGPSEDAKFS 190
+ N+P G+ + +Y+ADT N IRK+ +G ++TIAG + G+ DG +
Sbjct: 868 QLNNPNGIALSSNSEIYIADTNNHRIRKVNASGYISTIAG--TGTGGYNGDGVLATSAQL 925
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
N + + ++ +L+ D N +R+I N
Sbjct: 926 NYPNGIAIQENGEILIADNNNHRIRKIRTN 955
Score = 48.1 bits (113), Expect = 0.011, Method: Composition-based stats.
Identities = 40/148 (27%), Positives = 70/148 (47%), Gaps = 10/148 (6%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEAR 133
+P + +S GE+ D N+ I KI+ + G + +AG+ G G A+
Sbjct: 52 IPAGVAISSIGEIIIADSNNNRIRKIS------TNGYISTIAGTGSASFGGDGGLAINAQ 105
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSND 192
N P V++ G++Y++DT N IRKI G ++TIAG + G + A+
Sbjct: 106 LNSPSAVSISSNGDMYISDTGNQRIRKISTNGYISTIAGSGTGGYSGDGGLATSAQLYYP 165
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLN 220
V + P+ + + D N +R+++ N
Sbjct: 166 KGVA-ISPSNEIYIADTYNHRIRKVNTN 192
Score = 45.8 bits (107), Expect = 0.048, Method: Composition-based stats.
Identities = 45/148 (30%), Positives = 69/148 (46%), Gaps = 16/148 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY S +G+++ D N+ I K+T S Y VAGS G G A+ N+
Sbjct: 486 PYGTAFSSNGDMYIADTNNNRIRKVTT--SGYIS--TVAGSGTGGYSGDGGLATSAQLNY 541
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+ V + G +Y+AD N IRKI +G ++TIAG S G S D + + +
Sbjct: 542 PRSVAISSSGEIYIADFNNHRIRKINISGYISTIAGTGSV------GYSGDGGLATNAQL 595
Query: 196 VY-----VRPTCSLLVIDRGNAALRQIS 218
Y V + + + D N +R+I+
Sbjct: 596 YYPQTVAVSSSGEIYIADAYNHRIRKIN 623
Score = 43.1 bits (100), Expect = 0.32, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 33/57 (57%), Gaps = 4/57 (7%)
Query: 116 GSFQGYTGHVDGK-PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIA 170
G+ GY G DG AR N+P G G++Y+ADT N IRK+ +G ++T+A
Sbjct: 466 GTLNGYGG--DGNLATSARLNYPYGTAFSSNGDMYIADTNNNRIRKVTTSGYISTVA 520
Score = 43.1 bits (100), Expect = 0.34, Method: Composition-based stats.
Identities = 41/148 (27%), Positives = 68/148 (45%), Gaps = 16/148 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + +S GE++ D N I KI S Y ++G+ G G A+ N+
Sbjct: 654 PFSVAISSVGEIYIADTYNHRIRKINT--SGYIS--TISGTGSGGYSGDGGLATSAQLNY 709
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P V + G +Y+ DT N IRKI +G ++TIAG + G + D+ + +
Sbjct: 710 PFSVAVSSVGEIYIVDTNNYRIRKINTSGYISTIAGTGTG------GYNGDSILATSAQL 763
Query: 196 VY-----VRPTCSLLVIDRGNAALRQIS 218
Y + T ++V D N +R+I+
Sbjct: 764 NYPYGLTISSTSEIIVADYYNHRIRKIN 791
Score = 41.2 bits (95), Expect = 1.1, Method: Composition-based stats.
Identities = 40/145 (27%), Positives = 70/145 (48%), Gaps = 10/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P + +S +G+++ D N I KI+ + G + +AGS G G A+
Sbjct: 109 PSAVSISSNGDMYISDTGNQRIRKIS------TNGYISTIAGSGTGGYSGDGGLATSAQL 162
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+PKGV + +Y+ADT N IRK+ G ++T+AG + G G + A+ + F
Sbjct: 163 YYPKGVAISPSNEIYIADTYNHRIRKVNTNGYISTVAGSGTGGYGGDGGLATSAQVNYPF 222
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
V + T + + D N +R+++
Sbjct: 223 SVS-ISSTGEIYIADYYNQRIRKVT 246
Score = 38.5 bits (88), Expect = 7.9, Method: Composition-based stats.
Identities = 34/106 (32%), Positives = 52/106 (49%), Gaps = 11/106 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P + +S E++ D N I K+ + G + VAGS G G G A+
Sbjct: 165 PKGVAISPSNEIYIADTYNHRIRKVN------TNGYISTVAGSGTGGYGGDGGLATSAQV 218
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGF 179
N+P V++ G +Y+AD N IRK+ G ++T+AG S AG+
Sbjct: 219 NYPFSVSISSTGEIYIADYYNQRIRKVTTYGFISTVAG--SGTAGY 262
>gi|302786076|ref|XP_002974809.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
gi|300157704|gb|EFJ24329.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
Length = 1049
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 5/111 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR--KIGDAGVTTIAGGKSNVAGFR 180
G DG ++A F HP G+ G +YVADT N I+ + V T+AG + VAG+
Sbjct: 791 GDKDGPASQALFQHPLGILSSANGAVYVADTYNHKIKLMDLASKTVRTVAG--TGVAGYE 848
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSI 231
DG A+FS + + P SL V D N +R + DD Q ++
Sbjct: 849 DGKGVKAQFSEPAGLA-LGPNGSLFVADTNNNVIRLLKPRPDDNAIQVTTL 898
Score = 44.7 bits (104), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 5/83 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I S +G ++ D N I K+ S+ R VAG+ G G+ DGK +A+F+
Sbjct: 805 PLGILSSANGAVYVADTYNHKI-KLMDLASKTVR--TVAGT--GVAGYEDGKGVKAQFSE 859
Query: 137 PKGVTMDDKGNLYVADTLNLAIR 159
P G+ + G+L+VADT N IR
Sbjct: 860 PAGLALGPNGSLFVADTNNNVIR 882
Score = 38.9 bits (89), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 35/66 (53%), Gaps = 3/66 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNV 176
G G DG + A FN P+GV D + N LYVADT N A+R++ V T+AG
Sbjct: 609 GAEGLTDGSFSLASFNRPQGVAYDTEKNILYVADTENHALREVDFVKETVRTLAGTGERG 668
Query: 177 AGFRDG 182
+R G
Sbjct: 669 FDYRGG 674
>gi|255542920|ref|XP_002512523.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
gi|223548484|gb|EEF49975.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
Length = 1016
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR 180
G DG +E HP GV G +YVAD+ N I+K+ A V+TIAG + AGF+
Sbjct: 753 GDHDGIGSEVLLQHPLGVLCAKNGQIYVADSYNHKIKKLDPATKRVSTIAG--TGKAGFK 810
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQ 227
DG + A+ S ++ L++ D N+ +R + LN+++ E +
Sbjct: 811 DGKALAAQLSEPSGIIEAE-NGRLIIADTNNSIIRYLDLNKEEAELR 856
Score = 45.1 bits (105), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 5/83 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +++G+++ D N I K+ P + S +AG+ G G DGK A+ +
Sbjct: 767 PLGVLCAKNGQIYVADSYNHKIKKLDPATKRVST---IAGT--GKAGFKDGKALAAQLSE 821
Query: 137 PKGVTMDDKGNLYVADTLNLAIR 159
P G+ + G L +ADT N IR
Sbjct: 822 PSGIIEAENGRLIIADTNNSIIR 844
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 3/55 (5%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
G G DG +EA FN P+G+ + K N LYVADT N A+R+I + V T+AG
Sbjct: 570 GEEGLRDGPFDEATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNEMVRTLAG 624
>gi|456874551|gb|EMF89837.1| hypothetical protein LEP1GSC005_3835 [Leptospira santarosai str.
ST188]
Length = 356
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 65/126 (51%), Gaps = 9/126 (7%)
Query: 96 SNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLN 155
+ I KI PL Q+S L AG+ G G +G + F P + +D + NLYV + N
Sbjct: 141 AQIFKI-DPLDQFS---LYAGNSSGVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGN 196
Query: 156 LAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 213
AIRKI V+T++GG G+ DG A+F + + Y + T SLLV D +
Sbjct: 197 HAIRKINLNSGSVSTLSGG---TLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHR 253
Query: 214 LRQISL 219
+R+I L
Sbjct: 254 IRKIDL 259
>gi|410448598|ref|ZP_11302672.1| hypothetical protein LEP1GSC068_3151 [Leptospira sp. Fiocruz
LV3954]
gi|410017668|gb|EKO79726.1| hypothetical protein LEP1GSC068_3151 [Leptospira sp. Fiocruz
LV3954]
Length = 356
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 64/124 (51%), Gaps = 9/124 (7%)
Query: 98 IVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLA 157
I KI PL Q+S L AG+ G G +G + F P + +D + NLYV + N A
Sbjct: 143 IFKI-DPLDQFS---LYAGNSSGVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGNHA 198
Query: 158 IRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALR 215
IRKI V+T++GG G+ DG A+F + + Y + T SLLV D + +R
Sbjct: 199 IRKINLNSGSVSTLSGG---TLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHRIR 255
Query: 216 QISL 219
+I L
Sbjct: 256 KIDL 259
>gi|290979906|ref|XP_002672674.1| predicted protein [Naegleria gruberi]
gi|284086252|gb|EFC39930.1| predicted protein [Naegleria gruberi]
Length = 747
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 77/151 (50%), Gaps = 13/151 (8%)
Query: 70 GNEIGV-VPYKIRVSE-DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG 127
GNE + PY + SE G++F KI + R VAGS G +
Sbjct: 439 GNETALNYPYSVTQSEISGDVFI-----GTTFKILKVSKKDKRVSTVAGSVSTLAGD-NF 492
Query: 128 KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAG-VTTIAGGKSNVAGFRDGPSE 185
++ N P G+ G+LY+ D++N AIRKI + G ++TIAG + +AGF+DG +
Sbjct: 493 NSIASQLNEPAGLAFAINGDLYICDSINHAIRKIDHETGIISTIAG--NGIAGFKDGNAS 550
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
A+F+++ + V P LL+ D N +R+
Sbjct: 551 IAQFNSNLGLS-VLPNGDLLIADYNNNRIRK 580
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 71/145 (48%), Gaps = 10/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + +G+L+ D +N I KI + G + + G G DG + A+FN
Sbjct: 502 PAGLAFAINGDLYICDSINHAIRKI-----DHETGIISTIAGNGIAGFKDGNASIAQFNS 556
Query: 137 PKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGP-SEDAKFSNDF 193
G+++ G+L +AD N IRK V TIAGG + +G DG + A+ ++
Sbjct: 557 NLGLSVLPNGDLLIADYNNNRIRKYLASSKQVLTIAGGLAGYSG--DGQLATSAQLNHPT 614
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
DV Y T + + D GN +R+IS
Sbjct: 615 DVAYNASTGDVFIADFGNKVIRKIS 639
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 82/168 (48%), Gaps = 15/168 (8%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK-PNEARFNHPK 138
+ V +G+L D N+ I K L+ + +AG GY+G DG+ A+ NHP
Sbjct: 560 LSVLPNGDLLIADYNNNRIRKY---LASSKQVLTIAGGLAGYSG--DGQLATSAQLNHPT 614
Query: 139 GVTMD-DKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSND--F 193
V + G++++AD N IRKI ++ ++TIAG S ++ + +G + A SN +
Sbjct: 615 DVAYNASTGDVFIADFGNKVIRKISNSTGIISTIAG--SGLSEY-NGDAMPASISNISPY 671
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVG 241
+ T L + D N +R+I++N + Y ++ T I G
Sbjct: 672 GLSIHPITGELFISDSVNYLIRKINVN-NQIIYNVAGVNGTSIWSTTG 718
>gi|418754100|ref|ZP_13310334.1| hypothetical protein LEP1GSC179_2637 [Leptospira santarosai str.
MOR084]
gi|409965522|gb|EKO33385.1| hypothetical protein LEP1GSC179_2637 [Leptospira santarosai str.
MOR084]
Length = 356
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 65/126 (51%), Gaps = 9/126 (7%)
Query: 96 SNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLN 155
+ I KI PL Q+S L AG+ G G +G + F P + +D + NLYV + N
Sbjct: 141 AQIFKI-DPLDQFS---LYAGNSSGVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGN 196
Query: 156 LAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 213
AIRKI V+T++GG G+ DG A+F + + Y + T SLLV D +
Sbjct: 197 HAIRKINLNSGSVSTLSGG---TLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHR 253
Query: 214 LRQISL 219
+R+I L
Sbjct: 254 IRKIDL 259
>gi|418747486|ref|ZP_13303786.1| hypothetical protein LEP1GSC163_2583 [Leptospira santarosai str.
CBC379]
gi|410791609|gb|EKR89564.1| hypothetical protein LEP1GSC163_2583 [Leptospira santarosai str.
CBC379]
Length = 356
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 65/126 (51%), Gaps = 9/126 (7%)
Query: 96 SNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLN 155
+ I KI PL Q+S L AG+ G G +G + F P + +D + NLYV + N
Sbjct: 141 AQIFKI-DPLDQFS---LYAGNSSGVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGN 196
Query: 156 LAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 213
AIRKI V+T++GG G+ DG A+F + + Y + T SLLV D +
Sbjct: 197 HAIRKINLNSGSVSTLSGG---TLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHR 253
Query: 214 LRQISL 219
+R+I L
Sbjct: 254 IRKIDL 259
>gi|393782357|ref|ZP_10370541.1| hypothetical protein HMPREF1071_01409 [Bacteroides salyersiae
CL02T12C01]
gi|392673185|gb|EIY66648.1| hypothetical protein HMPREF1071_01409 [Bacteroides salyersiae
CL02T12C01]
Length = 446
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 55/108 (50%), Gaps = 2/108 (1%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
+L AG +G+ DG A FN P+ + + LY+ADT N IRKI G+ +
Sbjct: 340 KLYAG-MTNRSGYADGPCEYAMFNEPRQMIVSADNELYLADTNNHVIRKISQDGIVSTVI 398
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G + +GF DG E+A F F V + ++ + D N +R++++
Sbjct: 399 GLAGQSGFMDGTPEEALFDKPFGVA-LDTDGTIYIGDSENQCVRRLAI 445
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P ++ VS D EL+ D N I KI SQ V G G +G +DG P EA F+
Sbjct: 364 PRQMIVSADNELYLADTNNHVIRKI----SQDGIVSTVIG-LAGQSGFMDGTPEEALFDK 418
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG 162
P GV +D G +Y+ D+ N +R++
Sbjct: 419 PFGVALDTDGTIYIGDSENQCVRRLA 444
>gi|441163236|ref|ZP_20968263.1| hypothetical protein SRIM_29923 [Streptomyces rimosus subsp.
rimosus ATCC 10970]
gi|440616375|gb|ELQ79517.1| hypothetical protein SRIM_29923 [Streptomyces rimosus subsp.
rimosus ATCC 10970]
Length = 238
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 65/143 (45%), Gaps = 8/143 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV-DGKPNEA-RF 134
P+ + + E G L+ V N+ + K+TP +G + + G G+V DG P A +
Sbjct: 47 PHDVALDEHGNLYIVCRSNNRVRKVTP------QGIITTVAGNGIAGYVSDGGPATATQL 100
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
+ P GV D GNLY+AD N +RK+ G+ T G DG A N
Sbjct: 101 SSPCGVAADGAGNLYIADLGNNRVRKVDTKGIITTVAGNGTGGYVSDGGPATATQLNGPH 160
Query: 195 VVYVRPTCSLLVIDRGNAALRQI 217
V V ++ + D N +R++
Sbjct: 161 SVAVDRDGNVYIADYHNHRVRKV 183
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 60/120 (50%), Gaps = 8/120 (6%)
Query: 103 PPLSQYSRGRLVAGSFQGYTGHV-DGKPNE-ARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
P + ++G + + G G++ DG P + N P V +D+ GNLY+ N +RK
Sbjct: 11 PQPAGIAQGCIATVAGNGTAGYLSDGGPATLTQLNWPHDVALDEHGNLYIVCRSNNRVRK 70
Query: 161 IGDAG-VTTIAGGKSNVAGF--RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+ G +TT+AG + +AG+ GP+ + S+ V +L + D GN +R++
Sbjct: 71 VTPQGIITTVAG--NGIAGYVSDGGPATATQLSSPCGVA-ADGAGNLYIADLGNNRVRKV 127
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 44/92 (47%), Gaps = 8/92 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV-DGKPNEA-RF 134
P + G L+ D N+ + K+ ++G + + G G+V DG P A +
Sbjct: 103 PCGVAADGAGNLYIADLGNNRVRKVD------TKGIITTVAGNGTGGYVSDGGPATATQL 156
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
N P V +D GN+Y+AD N +RK+ G+
Sbjct: 157 NGPHSVAVDRDGNVYIADYHNHRVRKVDSKGL 188
>gi|428304139|ref|YP_007140964.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
gi|428245674|gb|AFZ11454.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
Length = 537
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 74/157 (47%), Gaps = 13/157 (8%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT-GHVDGKPNEARFNHPKGVTMDDK 145
ELF D +S+I +T L + R + GS Q + G VDG R H G+T
Sbjct: 384 ELFIADSESSSIRAVT--LGDFPIVRTICGSGQLFGFGDVDGIGENVRLQHCLGITYG-A 440
Query: 146 GNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 203
G L+V DT N ++++ TI G S AG +DG D FS + + C+
Sbjct: 441 GYLWVTDTYNHKLKRVNPTTGECQTICG--SGKAGLQDGFGTDVYFSEPSGLAF---ACN 495
Query: 204 LLVI-DRGNAALRQISLN-QDDCEYQYNSISPTDILM 238
L I D N A+R+I+LN Q+ +Q+ + + +
Sbjct: 496 YLYIADSNNHAIRRINLNSQEVTTFQFTMLCSPSVCI 532
>gi|422003150|ref|ZP_16350382.1| hypothetical protein LSS_06614 [Leptospira santarosai serovar
Shermani str. LT 821]
gi|417258114|gb|EKT87507.1| hypothetical protein LSS_06614 [Leptospira santarosai serovar
Shermani str. LT 821]
Length = 356
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 65/126 (51%), Gaps = 9/126 (7%)
Query: 96 SNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLN 155
+ I KI PL Q+S L AG+ G G +G + F P + +D + NLYV + N
Sbjct: 141 AQIFKI-DPLDQFS---LYAGNSSGVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGN 196
Query: 156 LAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 213
AIRKI V+T++GG G+ DG A+F + + Y + T SLLV D +
Sbjct: 197 HAIRKINLNSGSVSTLSGG---TLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHR 253
Query: 214 LRQISL 219
+R+I L
Sbjct: 254 IRKIDL 259
>gi|111219921|ref|YP_710715.1| ascorbate-dependent monooxygenase [Frankia alni ACN14a]
gi|111147453|emb|CAJ59103.1| Putative ascorbate-dependent monooxygenase [Frankia alni ACN14a]
Length = 865
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 70/146 (47%), Gaps = 14/146 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV---DGKP-NEA 132
P + + G+LF D+ + I K+ R + +F G +G DG P +A
Sbjct: 657 PTAVATDDAGDLFIADQGENRIRKVD--------ARGIISTFAGSSGRGSFGDGGPATDA 708
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSN 191
+ P GV D G++Y+ADT N IRK+ GV TT+AG + GP+ A
Sbjct: 709 LLDVPFGVAADAAGDVYIADTDNSRIRKVDTHGVITTVAGNRLRGFAGDGGPAVKASL-Q 767
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
D + V +L + DRGN+ +R++
Sbjct: 768 DPRGIAVDAVGNLYITDRGNSRIRKV 793
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 71/146 (48%), Gaps = 11/146 (7%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPN-EAR 133
VP+ + G+++ D NS I K+ + VAG+ +G+ G DG P +A
Sbjct: 712 VPFGVAADAAGDVYIADTDNSRIRKV----DTHGVITTVAGNRLRGFAG--DGGPAVKAS 765
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSN 191
P+G+ +D GNLY+ D N IRK+ G +TT+AG G+ AG G + +A+
Sbjct: 766 LQDPRGIAVDAVGNLYITDRGNSRIRKVDTHGIITTLAGSGRPGSAG-DGGLAGNAELGR 824
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
V V ++ DR + +R +
Sbjct: 825 PDGAVGVDHEGNVFFSDRASGRIRVV 850
Score = 45.1 bits (105), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 55/98 (56%), Gaps = 9/98 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPN-EARF 134
P+ + G L+ D ++ +++I + +AG+ G++G DG P A+
Sbjct: 534 PFGLTADAFGNLYFADFGSNRVMRINAAGVITT----IAGTGVAGFSG--DGGPAVAAQL 587
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
+ P G+ +D++G+LY+AD LN IR++ G +TT+AG
Sbjct: 588 DQPAGIALDNRGDLYIADRLNHRIRRVDPRGIITTVAG 625
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 39/157 (24%), Positives = 62/157 (39%), Gaps = 25/157 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGH-VDGKP------ 129
P I + G+L+ D +N I ++ P R + + G H + G P
Sbjct: 590 PAGIALDNRGDLYIADRLNHRIRRVDP--------RGIITTVAGINDHFIVGDPVGYSGA 641
Query: 130 --------NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD 181
A + P V DD G+L++AD IRK+ G+ + G S F D
Sbjct: 642 GAGDGGPALAAPLSFPTAVATDDAGDLFIADQGENRIRKVDARGIISTFAGSSGRGSFGD 701
Query: 182 -GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
GP+ DA F V + + D N+ +R++
Sbjct: 702 GGPATDALLDVPFGVA-ADAAGDVYIADTDNSRIRKV 737
>gi|242041143|ref|XP_002467966.1| hypothetical protein SORBIDRAFT_01g037250 [Sorghum bicolor]
gi|241921820|gb|EER94964.1| hypothetical protein SORBIDRAFT_01g037250 [Sorghum bicolor]
Length = 360
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 72/153 (47%), Gaps = 14/153 (9%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT------GHVDGKPNEARFNH 136
SE ELF VD +S+I + + RL+AG + G DG ++ H
Sbjct: 49 SEMQELFVVDSESSSIRVVNL---KTGGSRLLAGGDPVFPENLFRFGDYDGTGSDVLLQH 105
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P GV +Y+AD+ N I+K+ VTTIAG + AG++DGP A+ S
Sbjct: 106 PLGVAYASDNQVYIADSYNHKIKKLDPVTRKVTTIAG--TGRAGYKDGPGLAAQLSEPAG 163
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQ 227
+V V LLV D N +R I+L+ E +
Sbjct: 164 LVEV-GDGRLLVADTNNNTIRYITLSDKGAEVK 195
>gi|359685014|ref|ZP_09255015.1| hypothetical protein Lsan2_10264 [Leptospira santarosai str.
2000030832]
Length = 356
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 65/126 (51%), Gaps = 9/126 (7%)
Query: 96 SNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLN 155
+ I KI PL Q+S L AG+ G G +G + F P + +D + NLYV + N
Sbjct: 141 AQIFKI-DPLDQFS---LYAGNSSGVDGFQNGDRLNSSFKGPFFMDLDRERNLYVGELGN 196
Query: 156 LAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 213
AIRKI V+T++GG G+ DG A+F + + Y + T SLLV D +
Sbjct: 197 HAIRKINLNSGSVSTLSGG---TLGYLDGDLTSAQFKSPLGITYDQKTDSLLVADLQDHR 253
Query: 214 LRQISL 219
+R+I L
Sbjct: 254 IRKIDL 259
>gi|119961614|ref|YP_949427.1| NHL repeat-containing protein [Arthrobacter aurescens TC1]
gi|403528903|ref|YP_006663790.1| NHL repeat-containing protein [Arthrobacter sp. Rue61a]
gi|119948473|gb|ABM07384.1| putative NHL repeat protein [Arthrobacter aurescens TC1]
gi|403231330|gb|AFR30752.1| NHL repeat-containing protein [Arthrobacter sp. Rue61a]
Length = 657
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 61/109 (55%), Gaps = 9/109 (8%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGV---T 167
++AGS G G +DGK EA F G+ +D + N++VAD+ ++R+ I D+GV T
Sbjct: 366 ILAGS--GLEGLLDGKAEEAWFAQSSGLAIDGEDNIWVADSETSSLRRLVISDSGVSVET 423
Query: 168 TIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
I G + GFRDG + +A+ + V V P S+ + D N A+R+
Sbjct: 424 AIGKGLFDF-GFRDGEASEARLQHPLGVT-VLPDNSVAIADTYNGAVRR 470
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 49/101 (48%), Gaps = 17/101 (16%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
+G VD + +V++ P L R V GS G G++DG+ A+FN P+GVT+
Sbjct: 188 NGNYLVVDSGHHRLVELRPDLETVER---VIGS--GTKGYLDGQSEIAQFNEPQGVTLLP 242
Query: 145 KGNLY-------VADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
+ VADT+N +R GVT +G +AG
Sbjct: 243 SELAWKLGYDAVVADTVNHRLR-----GVTLSSGYVQTLAG 278
Score = 38.5 bits (88), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA--GVTTIAGG 172
G DG+ +EAR HP GVT+ ++ +ADT N A+R+ A V+T+A G
Sbjct: 433 GFRDGEASEARLQHPLGVTVLPDNSVAIADTYNGAVRRYDPATKSVSTLARG 484
>gi|256393018|ref|YP_003114582.1| NHL repeat containing protein [Catenulispora acidiphila DSM 44928]
gi|256359244|gb|ACU72741.1| NHL repeat containing protein [Catenulispora acidiphila DSM 44928]
Length = 1046
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 51/89 (57%), Gaps = 12/89 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRG---RLVAGSFQGYTGHVDGKPNE-A 132
P + V+ G++F D +V++TP RG RL+ GY+G DG+P+ A
Sbjct: 825 PEAVAVNAAGDVFIADTYADRVVELTP------RGLLLRLIGTGRAGYSG--DGRPSPLA 876
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
+ N P G+ +D +G+LY+AD+ N IR++
Sbjct: 877 QLNQPIGLALDAQGDLYIADSANNVIRRV 905
Score = 47.0 bits (110), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 47/89 (52%), Gaps = 2/89 (2%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 189
A+ N P+GV +D G+L+VADT + AIR++ D ++T+ + G G + A
Sbjct: 940 SAQLNDPQGVAVDGAGDLFVADTFDNAIREVTPDGTISTVVNSSAAPGGESSGAAPTASH 999
Query: 190 SNDFDVVYVRPTCSLLVI-DRGNAALRQI 217
N V V P+ LL I D N+ + Q+
Sbjct: 1000 LNTPYAVTVDPSTDLLYIADTRNSVIAQV 1028
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 36/79 (45%), Gaps = 6/79 (7%)
Query: 107 QYSRGRLVAGSFQGYTGHVDG-KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
Q ++AGS + Y H DG K A P G D G+L+VAD + +R+I G
Sbjct: 738 QNGSTSVIAGSLEAYGEHGDGGKATSASLYQPSGSATDAAGDLFVADAGDNVVREIAANG 797
Query: 166 VT-----TIAGGKSNVAGF 179
T+ GG + AG
Sbjct: 798 TISRFAGTVPGGSWSGAGL 816
>gi|290982388|ref|XP_002673912.1| predicted protein [Naegleria gruberi]
gi|284087499|gb|EFC41168.1| predicted protein [Naegleria gruberi]
Length = 2313
Score = 55.5 bits (132), Expect = 7e-05, Method: Composition-based stats.
Identities = 46/153 (30%), Positives = 75/153 (49%), Gaps = 12/153 (7%)
Query: 70 GNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP 129
GN+ + +S DG ++ D N I K + S L GS +G+ DG
Sbjct: 459 GNQDYFLSKSFDISSDGIIYIADYYNHRIAKFV--IGGTSLTTLAGGSLKGFA---DGVG 513
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFR--DGPSE 185
+ A N+P +++ LY +D N AIR + +A VTTI+G S +AG+ +GP+
Sbjct: 514 SNANLNYPDSISIGLNNMLYFSDRDNHAIRSVSTINALVTTISG--SGIAGYTGDEGPAI 571
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
AK N + V ++ +D+GN +R+I+
Sbjct: 572 YAKL-NLPGSIEVALNGDIIFMDKGNQRIRKIT 603
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 32/110 (29%), Positives = 53/110 (48%), Gaps = 8/110 (7%)
Query: 113 LVAGS-FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
+VAG+ G++G G A N P +T D GN+ ++D+ N IRK+ + ++T+AG
Sbjct: 280 IVAGTGTSGFSGD-GGVATSALLNGPSALTFDSSGNMLISDSSNNRIRKVTNGIISTLAG 338
Query: 172 GKSNVAGFRDGP-SEDAKFSNDFDVVYV---RPTCSLLVIDRGNAALRQI 217
+ F +G S+ V Y T +L+ D N LR++
Sbjct: 339 TSNR--NFGNGAVGTLVSLSSPNSVYYAGNDDSTGGILIADMNNHVLRRL 386
>gi|451982091|ref|ZP_21930422.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
gi|451760645|emb|CCQ91702.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
Length = 724
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 69/147 (46%), Gaps = 8/147 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ I V E G L+ D N+ + K+TP + VAG + G +
Sbjct: 225 PFGIHVDESGNLYIADRGNNRVRKVTPDGIITT----VAGEGTHFFSGDFGPATRCSLAY 280
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFD 194
P V DD+GNLY+AD N +RK+ G +TT+ G GK+ G + SE F
Sbjct: 281 PTDVVTDDRGNLYIADRNNNRVRKVDTNGIITTVMGTGKNEYNGDNEIASE-TSLHLPFA 339
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLNQ 221
+ + P LL++DR + +R + L Q
Sbjct: 340 LAFT-PDQHLLIVDRNHHRVRSMHLVQ 365
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 73/148 (49%), Gaps = 12/148 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AG-SFQGYTGHVDGKPNEAR 133
P + G+++ D N+ ++K+ P +G++V AG GY G G +A
Sbjct: 169 PSDVACDSQGQVYISDRSNNRVLKMNP------QGQIVTVAGLGMAGYGGDF-GPAIDAL 221
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
+P G+ +D+ GNLY+AD N +RK+ D +TT+AG ++ GP+ +
Sbjct: 222 LKYPFGIHVDESGNLYIADRGNNRVRKVTPDGIITTVAGEGTHFFSGDFGPATRCSLAYP 281
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLN 220
DVV +L + DR N +R++ N
Sbjct: 282 TDVV-TDDRGNLYIADRNNNRVRKVDTN 308
Score = 46.2 bits (108), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 45/75 (60%), Gaps = 3/75 (4%)
Query: 114 VAGSFQ-GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
VAG+ G++G +G EA N P G+ +D KGNLY+AD N +R++ G +TT+AG
Sbjct: 90 VAGNGNAGFSGD-EGPALEAALNFPAGLCLDLKGNLYIADRNNHRVRRVDTKGIITTVAG 148
Query: 172 GKSNVAGFRDGPSED 186
G +GP+E+
Sbjct: 149 TGEPDFGMEEGPAEE 163
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 68/145 (46%), Gaps = 10/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P + + G L+ D N + ++ ++G + VAG+ + G +G E
Sbjct: 113 PAGLCLDLKGNLYIADRNNHRVRRVD------TKGIITTVAGTGEPDFGMEEGPAEEIPL 166
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ P V D +G +Y++D N + K+ G + T+AG G GP+ DA F
Sbjct: 167 HFPSDVACDSQGQVYISDRSNNRVLKMNPQGQIVTVAGLGMAGYGGDFGPAIDALLKYPF 226
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
++V + +L + DRGN +R+++
Sbjct: 227 G-IHVDESGNLYIADRGNNRVRKVT 250
>gi|225872818|ref|YP_002754275.1| hypothetical protein ACP_1177 [Acidobacterium capsulatum ATCC
51196]
gi|225794459|gb|ACO34549.1| hypothetical protein ACP_1177 [Acidobacterium capsulatum ATCC
51196]
Length = 855
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 74/145 (51%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKP-NEARF 134
P + G L+ D N+N+++ +S +AG+ + GY+G DG A
Sbjct: 81 PSAVAYDGSGNLYIADT-NNNVIR---EVSTTGVVTTIAGNGEEGYSG--DGAAATSAML 134
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDF 193
+ P G+ +D GN+Y+AD+ N IR++ + + T+AG + AG+ DG + + +D
Sbjct: 135 DTPTGIAVDSNGNIYIADSHNNRIREVSNGIINTVAG--NGTAGYSGDGAAATSAMLDDP 192
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
V V + ++ + D GN +R ++
Sbjct: 193 TAVAVDASGNIYIADTGNQRIRNVA 217
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 64/124 (51%), Gaps = 10/124 (8%)
Query: 98 IVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNL 156
I+++TP S + VAG+ GY+G+ G A F P V D GNLY+ADT N
Sbjct: 46 ILRVTPGTSTAT----VAGTGSAGYSGN-GGAATSAAFASPSAVAYDGSGNLYIADTNNN 100
Query: 157 AIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
IR++ G VTTIAG G+ +G DG + + + + V ++ + D N +
Sbjct: 101 VIREVSTTGVVTTIAGNGEEGYSG--DGAAATSAMLDTPTGIAVDSNGNIYIADSHNNRI 158
Query: 215 RQIS 218
R++S
Sbjct: 159 REVS 162
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 69/144 (47%), Gaps = 6/144 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG-YTGHVDGKPNEARFN 135
P I V G ++ D N+ I +++ + G A +F G Y+G G A
Sbjct: 247 PTGIAVDSSGNIYIADSHNNRIREVSGGVINTVAGS-GAVTFPGSYSGD-GGSATAATLA 304
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFD 194
P GV +D G++Y+ADT N +R+I + + T+AG G+ +G DG + + ND
Sbjct: 305 KPTGVALDAAGHVYIADTNNERLREIANGVIATVAGNGQQGYSG--DGAAATSAALNDPR 362
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
V + S+ V D N +R ++
Sbjct: 363 NASVNASGSVAVADTLNERVRGLT 386
Score = 46.2 bits (108), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 46/96 (47%), Gaps = 7/96 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P I V +G ++ D N+ I +++ + G AG Y+G DG A +
Sbjct: 137 PTGIAVDSNGNIYIADSHNNRIREVSNGIINTVAGNGTAG----YSG--DGAAATSAMLD 190
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
P V +D GN+Y+ADT N IR + + T+AG
Sbjct: 191 DPTAVAVDASGNIYIADTGNQRIRNVAAGTIHTVAG 226
>gi|116751477|ref|YP_848164.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
gi|116700541|gb|ABK19729.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
Length = 1763
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 77/161 (47%), Gaps = 25/161 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFN 135
P + V G+L+ D N + K+ Y VAG+ GY+G G EA+
Sbjct: 475 PTGVAVDAKGDLYIADSGNCRVRKV----DSYGIITTVAGNGSCGYSGD-GGAAVEAQLG 529
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGG-------KSNVAGFRD------ 181
+P V +D + NLY++DT + IRK+G D +TT+AG + G+++
Sbjct: 530 YPAFVAVDSRENLYISDTESHRIRKVGPDGVITTVAGSGLCHLKVEDGYGGYQEYDAPCF 589
Query: 182 ----GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
GP++ AK +N V + +L + D GN LR+++
Sbjct: 590 DGDQGPADQAKLNNPSGVA-TDASGNLYIADTGNRRLRKVA 629
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 31/56 (55%)
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G ++A+ N+P GV D GNLY+ADT N +RK+ +G + G + DG
Sbjct: 594 GPADQAKLNNPSGVATDASGNLYIADTGNRRLRKVAPSGTISTVAGDGELGSGADG 649
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 68/153 (44%), Gaps = 14/153 (9%)
Query: 72 EIGVVPYKIRVSEDGELFAVD----EVNSNIVKITPPLSQYSRGRLVAG-SFQGYTGHVD 126
++ + P + G LF D E N I KI P + VAG ++G G D
Sbjct: 654 QLHLSPVGVVADSRGNLFVSDLSHGEGNGRIRKIDPTGFVTT----VAGHDWEGTLG--D 707
Query: 127 GKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPS 184
G P +A P+G+ +D GNLY AD +R + G +TT+AG G G +
Sbjct: 708 GGPATQAYIGSPRGLAVDRGGNLYFADPWINRVRAVNTMGIITTVAGSDDGTFGEDGGLA 767
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
A+ + DV V P L ++ N +R++
Sbjct: 768 LRARLAFPMDVA-VDPRGDLYATEQWNQRVRKV 799
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%)
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
N N P GV +D KG+LY+AD+ N +RK+ G+ T G + DG
Sbjct: 468 NHTEVNWPTGVAVDAKGDLYIADSGNCRVRKVDSYGIITTVAGNGSCGYSGDG 520
Score = 38.9 bits (89), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 47/100 (47%), Gaps = 10/100 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G L+ D N + K+ P + + VAG + +G DG P+
Sbjct: 604 PSGVATDASGNLYIADTGNRRLRKVAPSGTIST----VAGDGELGSG-ADGWPSTQLHLS 658
Query: 137 PKGVTMDDKGNLYVADTL----NLAIRKIGDAG-VTTIAG 171
P GV D +GNL+V+D N IRKI G VTT+AG
Sbjct: 659 PVGVVADSRGNLFVSDLSHGEGNGRIRKIDPTGFVTTVAG 698
>gi|290985160|ref|XP_002675294.1| predicted protein [Naegleria gruberi]
gi|284088889|gb|EFC42550.1| predicted protein [Naegleria gruberi]
Length = 1229
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 51/94 (54%), Gaps = 10/94 (10%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT--GHVDGKPNEARFNHPKGVTM 142
DG+L+ VD N I ++ SQ + +F G T G+ DG ARF P G+ +
Sbjct: 586 DGQLYFVDAYNHRI-RVYNFTSQ------IVSTFSGGTISGYQDGPIQTARFYFPNGLDI 638
Query: 143 DD-KGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 175
D G+LYV+DT N IRKI + V+T+AG N
Sbjct: 639 DTTNGDLYVSDTFNCVIRKISNGVVSTVAGQYQN 672
>gi|116624994|ref|YP_827150.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116228156|gb|ABJ86865.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 912
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 58/117 (49%), Gaps = 7/117 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P+ + + G L+ + N+ I K+ + + L GY+G DG P A+ N
Sbjct: 481 PFSVALDAAGNLYIAEFSNNRIRKVA---TNGNISTLAGTGVSGYSG--DGGPATSAQLN 535
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSN 191
P+ V +D GN+YVADT N +RKIG G +TT+AG GP+ A+ N
Sbjct: 536 GPQAVAVDGSGNVYVADTANNRVRKIGPTGLITTVAGNGIGGFSGDGGPATSAQVGN 592
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 54/98 (55%), Gaps = 8/98 (8%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPP-LSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
+P ++V G L+ D +N+ I K++ ++ Y+ +VA S G G A+
Sbjct: 314 LPTSVQVDSSGNLYFADSLNNRIRKLSGGNVNTYAGNGIVARSGDG------GAATNAQL 367
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAG 171
N P GV +D GNLYV+DTLN +R++ GV TT AG
Sbjct: 368 NTPLGVAVDAAGNLYVSDTLNNLVRRVDTKGVITTFAG 405
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 70/145 (48%), Gaps = 13/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFN 135
P + V G L+ D N + K++ + VAGS G+ G G A+ N
Sbjct: 426 PQGLAVDSAGNLYIADTQNHRVRKVSGGVMS-----TVAGSGTSGFAGD-GGAATSAQLN 479
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSND 192
P V +D GNLY+A+ N IRK+ G ++T+AG + V+G+ GP+ A+ N
Sbjct: 480 APFSVALDAAGNLYIAEFSNNRIRKVATNGNISTLAG--TGVSGYSGDGGPATSAQL-NG 536
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
V V + ++ V D N +R+I
Sbjct: 537 PQAVAVDGSGNVYVADTANNRVRKI 561
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 47/83 (56%), Gaps = 8/83 (9%)
Query: 113 LVAG-SFQGYTGHVDGKPN-EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA 170
LVAG S G++G DG P A+ N P+GV +D GNLY+AD+ N +RK+ G+ +
Sbjct: 65 LVAGNSRAGFSG--DGGPAVNAQLNSPQGVALDSAGNLYIADSQNNRVRKVNPQGIISTF 122
Query: 171 GGKSNVA--GF--RDGPSEDAKF 189
G NV+ GF G + DA
Sbjct: 123 AGNGNVSVPGFWGDSGAATDASI 145
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 52/91 (57%), Gaps = 3/91 (3%)
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG-GKSNVAGFRDGPSEDAKFSN 191
+ N+P+G+ +D GNLY+ADT N +RK+ ++T+AG G S AG G + A+ +
Sbjct: 422 QLNNPQGLAVDSAGNLYIADTQNHRVRKVSGGVMSTVAGSGTSGFAG-DGGAATSAQLNA 480
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
F V + +L + + N +R+++ N +
Sbjct: 481 PFSVA-LDAAGNLYIAEFSNNRIRKVATNGN 510
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 64/143 (44%), Gaps = 5/143 (3%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
V P+ + V G ++ V E+ SN ++ + + QG+ G G PN+
Sbjct: 257 VSPFGVAVDSAGVIY-VAELGSNRIRKIDTAGNITT--AIGDGTQGFAGD-GGAPNKVEM 312
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
+ P V +D GNLY AD+LN IRK+ V T AG G + +A+ +
Sbjct: 313 SLPTSVQVDSSGNLYFADSLNNRIRKLSGGNVNTYAGNGIVARSGDGGAATNAQLNTPLG 372
Query: 195 VVYVRPTCSLLVIDRGNAALRQI 217
V V +L V D N +R++
Sbjct: 373 VA-VDAAGNLYVSDTLNNLVRRV 394
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 87/187 (46%), Gaps = 22/187 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P + V G ++ D N+ + KI P G + VAG+ G G A+
Sbjct: 537 PQAVAVDGSGNVYVADTANNRVRKIGP------TGLITTVAGNGIGGFSGDGGPATSAQV 590
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR-DGPSEDAKFSND 192
+P G+ +D GN+++ D + +RK+ +G ++TIAGG + G+ DG + A N
Sbjct: 591 GNPNGLALDSVGNVFITDG-SARVRKLFISGIISTIAGGGNR--GYSGDGGNAFAAQLNG 647
Query: 193 FDVVYVRPTCSLLVIDRGNAALR--QISLNQDDCEYQYNS-------ISPTDILMVVGAV 243
+ + T +L V D N A+R QIS + N ++P +I+++ G+
Sbjct: 648 PSGLAINSTGALFVADALNNAVRMLQISASGISVNAVTNGATNLSGPVAPGEIVVIYGSG 707
Query: 244 LVGYVTC 250
L G +T
Sbjct: 708 LGGALTT 714
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 38/157 (24%), Positives = 66/157 (42%), Gaps = 33/157 (21%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
P + + G L+ D N+ + K+ P +S ++ V S G+ G G +A
Sbjct: 89 PQGVALDSAGNLYIADSQNNRVRKVNPQGIISTFAGNGNV--SVPGFWGD-SGAATDASI 145
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD------------- 181
+ P + +D N+Y+A + + +R++ G+ I G AG++
Sbjct: 146 HLPVAIAVDSSNNVYIAASADNTVRRVTTDGIINIFAG----AGYKGYYGDAGAAGLAGL 201
Query: 182 -GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
GP +D F P LL+ D GNA +RQ+
Sbjct: 202 TGP-QDITFG---------PKGVLLIADTGNAVIRQV 228
>gi|312198348|ref|YP_004018409.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311229684|gb|ADP82539.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 782
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 54/95 (56%), Gaps = 3/95 (3%)
Query: 126 DGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGP 183
DG P +A P + +D +GNLY+AD N +R++ G +TT+AG G DGP
Sbjct: 512 DGVPATKAMLRSPNDLAVDGRGNLYIADGENERVRRVDTHGIITTVAGSGVEAGGDGDGP 571
Query: 184 SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ DA F + + + P SL V D+GN+ +R+I+
Sbjct: 572 ATDAFFKYT-EGLTIGPDGSLYVGDQGNSRIRRIT 605
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 54/123 (43%), Gaps = 5/123 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G L+ D N + ++ + VAGS G DG +A F +
Sbjct: 524 PNDLAVDGRGNLYIADGENERVRRV----DTHGIITTVAGSGVEAGGDGDGPATDAFFKY 579
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
+G+T+ G+LYV D N IR+I G + TIAG + GP+ A+ +
Sbjct: 580 TEGLTIGPDGSLYVGDQGNSRIRRITPQGRIITIAGTGTRGYSGDGGPATKAQIDSTGTA 639
Query: 196 VYV 198
+ V
Sbjct: 640 IAV 642
>gi|29349488|ref|NP_812991.1| hypothetical protein BT_4080 [Bacteroides thetaiotaomicron
VPI-5482]
gi|298384984|ref|ZP_06994543.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|383120427|ref|ZP_09941155.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
gi|29341397|gb|AAO79185.1| putative cell surface protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251840523|gb|EES68605.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
gi|298262128|gb|EFI04993.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 456
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 18/110 (16%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVA 177
+G +G++DG +A FN P + +D +GN+ + DT N IRKI G + GK +
Sbjct: 355 EGKSGYLDGPIGQAMFNKPGQMCVDSEGNIILTDTENHCIRKITMSTGYVSTLAGKPQNS 414
Query: 178 GFRDGPSEDAKF--------SNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G+ +G +EDA+F ND DV+Y+ D N A+R++++
Sbjct: 415 GYVNGSAEDAQFKKPLGICIDND-DVMYIG--------DSENRAIRRLAV 455
>gi|260821256|ref|XP_002605949.1| hypothetical protein BRAFLDRAFT_132236 [Branchiostoma floridae]
gi|229291286|gb|EEN61959.1| hypothetical protein BRAFLDRAFT_132236 [Branchiostoma floridae]
Length = 446
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 54/97 (55%), Gaps = 4/97 (4%)
Query: 126 DGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDG 182
DGK EARF HP+G+ +D K N ++VADT N IR++ DA V T+AG G +G
Sbjct: 179 DGKGLEARFYHPQGLYLDAKQNRMFVADTDNHVIREMSMPDAVVKTVAGTPKE-KGLVNG 237
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
AKF + + Y T L V D N A+R I+L
Sbjct: 238 QGRAAKFYHPTQMAYDPYTDILYVSDHFNHAIRTINL 274
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 57/122 (46%), Gaps = 13/122 (10%)
Query: 125 VDGKPNEARFNHPKGVTMDDKGN-LYVADT-------LNLAIRKIG--DAGVTTIAGGKS 174
+DG +AR N+P G+ D N +YVAD N IR+I D+ +TT+AG
Sbjct: 57 MDGFFLDARLNYPWGLVFDKSDNSIYVADCGCPNSPHTNERIRRISLSDSWITTLAGSSQ 116
Query: 175 NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPT 234
G +DG +DA F + +V L V D N +RQ+S D + + P
Sbjct: 117 ---GNQDGLGKDAHFHHTAGMVVDEDNAMLYVCDSSNNVIRQVSTRTGDVKAFGGNPDPR 173
Query: 235 DI 236
D+
Sbjct: 174 DV 175
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 63/133 (47%), Gaps = 7/133 (5%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
L+ D N I I L Y ++AGS +G G DGK AR N+P+G+ D
Sbjct: 259 LYVSDHFNHAIRTIN--LQGYEV-DILAGSPKGEPGFKDGKGTAARLNYPEGIAFDTTHR 315
Query: 148 -LYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 205
LYV + N IR + AG V T+AGG G++DG A+F + + +
Sbjct: 316 VLYVVEFGNNCIRMVTPAGIVKTLAGGPE--PGYKDGSGSAARFFHPTGLTLDPTNKVIY 373
Query: 206 VIDRGNAALRQIS 218
V D+ N +R ++
Sbjct: 374 VTDQYNHLVRSVT 386
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 66/139 (47%), Gaps = 7/139 (5%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTM 142
++ +F D N I +++ P + + VAG+ + G V+G+ A+F HP +
Sbjct: 197 AKQNRMFVADTDNHVIREMSMPDAVV---KTVAGTPK-EKGLVNGQGRAAKFYHPTQMAY 252
Query: 143 DDKGN-LYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 199
D + LYV+D N AIR I G V +AG GF+DG A+ + + +
Sbjct: 253 DPYTDILYVSDHFNHAIRTINLQGYEVDILAGSPKGEPGFKDGKGTAARLNYPEGIAFDT 312
Query: 200 PTCSLLVIDRGNAALRQIS 218
L V++ GN +R ++
Sbjct: 313 THRVLYVVEFGNNCIRMVT 331
>gi|153868997|ref|ZP_01998705.1| protein kinase-like protein [Beggiatoa sp. PS]
gi|152074439|gb|EDN71293.1| protein kinase-like protein [Beggiatoa sp. PS]
Length = 1432
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 77/153 (50%), Gaps = 19/153 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI-----TPPLSQYSRGRLVAGSFQ-GYTGHVDGKPN 130
P I +G + D N++I KI T PL+ S +AG + GY+G +G
Sbjct: 526 PTAIVFDHNGHFYIADSGNNSIRKINYQPGTSPLNANSLITTIAGDGRSGYSGD-NGPAI 584
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 189
+AR +P + +D++ NLY+ADT N IRKI G +TT AG +G++ G S D
Sbjct: 585 QARLGNPSSLVVDNENNLYIADTDNHRIRKIDIRGNITTFAG-----SGYK-GYSGDGNL 638
Query: 190 S-----NDFDVVYVRPTCSLLVIDRGNAALRQI 217
+ N + V T ++ + D+ N +R+I
Sbjct: 639 AITARLNMPTGLAVDGTGNIFIADQNNHRIRKI 671
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 77/157 (49%), Gaps = 32/157 (20%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGS-FQGYTGHVDGKPNEA 132
VP + G L+ D +N I K+ P G L AG+ +G+ G +G EA
Sbjct: 413 VPKDLIFDATGNLYIADILNHRIFKLDP------EGNLTVFAGTGTKGFEGD-NGPAIEA 465
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFS 190
++P+G+ +D +GNLY+ADT N IRKI G +TT+ G G++ AG
Sbjct: 466 NLSNPEGLAIDAQGNLYIADTNNHRIRKIDSDGIITTVVGTGEAGYAG-----------D 514
Query: 191 NDFDVV--YVRPTCSLL-------VIDRGNAALRQIS 218
N+F + +PT + + D GN ++R+I+
Sbjct: 515 NEFAIAAQLKKPTAIVFDHNGHFYIADSGNNSIRKIN 551
Score = 47.0 bits (110), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 64/145 (44%), Gaps = 12/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGS-FQGYTGHVDGK-PNEA 132
P + V + L+ D N I KI RG + AGS ++GY+G DG A
Sbjct: 591 PSSLVVDNENNLYIADTDNHRIRKID------IRGNITTFAGSGYKGYSG--DGNLAITA 642
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
R N P G+ +D GN+++AD N IRKI G+ G DG N
Sbjct: 643 RLNMPTGLAVDGTGNIFIADQNNHRIRKIDGEGIIRTFTGTGVRGTATDGILASVAEINQ 702
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
+ + +L + ++GN +R+I
Sbjct: 703 PTDIALDQYGNLYLAEKGNHFIRKI 727
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 4/88 (4%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKF 189
A PK + D GNLY+AD LN I K+ G T+ G + GF +GP+ +A
Sbjct: 409 AELKVPKDLIFDATGNLYIADILNHRIFKLDPEGNLTVFAG-TGTKGFEGDNGPAIEANL 467
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQI 217
SN + + + +L + D N +R+I
Sbjct: 468 SNP-EGLAIDAQGNLYIADTNNHRIRKI 494
>gi|338536093|ref|YP_004669427.1| putative lipoprotein [Myxococcus fulvus HW-1]
gi|337262189|gb|AEI68349.1| putative lipoprotein [Myxococcus fulvus HW-1]
Length = 855
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSND 192
N P G+ +D GN+YVADT + IR+I G V AGG G +DGP++ A F+
Sbjct: 616 LNGPMGIAVDAAGNVYVADTDHYVIRRIDVEGKVEVFAGGTP---GLQDGPAKQAAFNQP 672
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISL 219
+ +LLV D N +R+I L
Sbjct: 673 TGMTVTPDGTALLVADMNNGVIRRIDL 699
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 60/147 (40%), Gaps = 31/147 (21%)
Query: 40 WTTRSSSKAPQADGNVLQFENGYLVET--------------VIEGNEIG----------V 75
W R S A ADG+ L Y+VE+ V+ G G
Sbjct: 713 WLYRPSGVAVSADGSTL-----YVVESGMSRVVRIRDGLTSVVAGTTPGYRDGAPTSAQF 767
Query: 76 VPY-KIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
+PY I V +DG L D N + ++ +R ++ + G GH DG ++A
Sbjct: 768 LPYLGIAVLKDGSLAVADPGNYRVRRVVLEADGTAR-KVTTLAGNGRYGHADGPGDKAEL 826
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI 161
P G+T+ G LYVAD N +R I
Sbjct: 827 VLPAGLTVGPDGRLYVADAGNGLVRAI 853
>gi|429202583|ref|ZP_19193964.1| antioxidant, AhpC/TSA family [Streptomyces ipomoeae 91-03]
gi|428661888|gb|EKX61363.1| antioxidant, AhpC/TSA family [Streptomyces ipomoeae 91-03]
Length = 619
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 63/124 (50%), Gaps = 11/124 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P K + DG+ D +V++ P G V + G G VDG + ARF+
Sbjct: 186 PGKALLLPDGDFLVSDTTRHRLVRLAP------DGETVVRRYGTGERGFVDGPADHARFS 239
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAG-VTTIAG-GKSNVAGF-RDGPSEDAKFSN 191
P+G+ + D G++ VADT+N A+R++ D G V T+AG G+ G GP+ S+
Sbjct: 240 EPQGLALLDSGDVVVADTVNHALRRLDPDTGHVATLAGTGRQWWQGSPTSGPAWQIDLSS 299
Query: 192 DFDV 195
+DV
Sbjct: 300 PWDV 303
>gi|441163235|ref|ZP_20968262.1| leucine-rich repeat-containing protein [Streptomyces rimosus subsp.
rimosus ATCC 10970]
gi|440616374|gb|ELQ79516.1| leucine-rich repeat-containing protein [Streptomyces rimosus subsp.
rimosus ATCC 10970]
Length = 553
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 70/142 (49%), Gaps = 14/142 (9%)
Query: 82 VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV-DGKPN-EARFNHPKG 139
V +G L+ D N + K+TP G + + G G+V DG P R ++P G
Sbjct: 5 VDAEGSLYIADRYNHRVRKVTP------NGLITTVAGNGTAGYVSDGGPALGTRLHYPWG 58
Query: 140 VTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRD--GPSEDAKFSNDFDVV 196
+ +D+ G+LY+ D N IRK+ D +TT+AG + AG+ D GP+ + + +
Sbjct: 59 LALDEAGSLYIGDGHNHRIRKVTSDGIITTVAG--NGTAGYVDDGGPAAGTRLYYPYGIA 116
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
R +L + D N +R ++
Sbjct: 117 LDR-GGNLYIADCNNHRVRGVT 137
>gi|290994196|ref|XP_002679718.1| predicted protein [Naegleria gruberi]
gi|284093336|gb|EFC46974.1| predicted protein [Naegleria gruberi]
Length = 762
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 73/145 (50%), Gaps = 12/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQGYTGHVDGKPNEARF 134
P+ + VS E++ D+ N + KI GR+V AG+ +G DG A+
Sbjct: 41 PFGVFVSPTNEIYIADQYNHRVRKILES------GRIVTIAGNGKGGFSGDDGLATNAQL 94
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSND 192
N P V + +K +++AD N IRKI ++G + TIAG G+ +G +GP+ A+ +
Sbjct: 95 NCPSSVFVSNKNEVFIADQYNHRIRKILESGRIVTIAGNGEEGFSG-DNGPATSARLNCP 153
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
++V + +D N +R+I
Sbjct: 154 MS-IFVSNMNEVYFVDSNNNRVRKI 177
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 51/102 (50%), Gaps = 11/102 (10%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQ-GYTGHVDGKPNEA 132
P + VS E+F D+ N I KI GR+V AG+ + G++G +G A
Sbjct: 96 CPSSVFVSNKNEVFIADQYNHRIRKILES------GRIVTIAGNGEEGFSGD-NGPATSA 148
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGK 173
R N P + + + +Y D+ N +RKI + G + TIAG +
Sbjct: 149 RLNCPMSIFVSNMNEVYFVDSNNNRVRKILENGIIVTIAGNE 190
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 49/89 (55%), Gaps = 4/89 (4%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAK 188
+A+ +P GV + +Y+AD N +RKI ++G + TIAG GK +G DG + +A+
Sbjct: 35 DAKLCYPFGVFVSPTNEIYIADQYNHRVRKILESGRIVTIAGNGKGGFSG-DDGLATNAQ 93
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
N V+V + + D+ N +R+I
Sbjct: 94 L-NCPSSVFVSNKNEVFIADQYNHRIRKI 121
>gi|162448821|ref|YP_001611188.1| NHL repeat-containing protein [Sorangium cellulosum So ce56]
gi|161159403|emb|CAN90708.1| hypothetical protein with NHL repeat [Sorangium cellulosum So ce56]
Length = 1889
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 43/72 (59%), Gaps = 4/72 (5%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGF 179
+G+VDG ARF P G+ D +GNLY+AD N ++R+ I + VTT+AG + AG
Sbjct: 948 SGNVDGVGTTARFAEPSGIVADGRGNLYIADAFNHSLRRFEIATSMVTTVAG--NGTAGS 1005
Query: 180 RDGPSEDAKFSN 191
DG A+ ++
Sbjct: 1006 ADGVGTAARLTS 1017
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 52/106 (49%), Gaps = 9/106 (8%)
Query: 118 FQGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKS 174
G +G +ARF P + D LYVAD A+R+I VTTIAG S
Sbjct: 793 LAGARALTNGSYAQARFIQPGDMAYDASTRMLYVAD--RTAVRRIDLTSGAVTTIAG--S 848
Query: 175 NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
V+G DG A+F N + + +L + DRG+ +R+++L+
Sbjct: 849 TVSGTTDGVGTAARF-NTISGITLHDG-ALYIADRGSRTIRKLTLS 892
>gi|375148602|ref|YP_005011043.1| NHL repeat containing protein [Niastella koreensis GR20-10]
gi|361062648|gb|AEW01640.1| NHL repeat containing protein [Niastella koreensis GR20-10]
Length = 716
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 73/149 (48%), Gaps = 15/149 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG--SFQGYTGHVDGKP-NEAR 133
P + V +DG L+ D N I K+ ++ +AG GY G DG P +A
Sbjct: 164 PTAVAVDKDGNLYISDASNKVIRKV----NKQGVISTIAGVPGRAGYAG--DGGPATKAL 217
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFR--DGPSEDAKFS 190
P G+ +D GN+Y+AD N +RKI AGV TT AG + AG+ GP+ A+F
Sbjct: 218 LTQPAGIAVDYSGNIYIADPSNSVVRKINPAGVITTFAG--NGTAGYSGDGGPAIKAQFQ 275
Query: 191 -NDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ V P ++ D N A+R+IS
Sbjct: 276 MGSPQGLAVDPAGNVYASDYQNHAIRKIS 304
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 77/146 (52%), Gaps = 11/146 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFN 135
P + + DG ++ D NS+++K ++ + AG+ +G++G G+ +A+
Sbjct: 108 PTGMTIDNDGNIYIAD-FNSSVIK---KVTTSGIMSIFAGNGTEGFSGD-GGQAAQAKLY 162
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAG--GKSNVAGFRDGPSEDAKFSND 192
P V +D GNLY++D N IRK+ GV +TIAG G++ AG GP+ A +
Sbjct: 163 RPTAVAVDKDGNLYISDASNKVIRKVNKQGVISTIAGVPGRAGYAG-DGGPATKALLTQP 221
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
+ V + ++ + D N+ +R+I+
Sbjct: 222 AGIA-VDYSGNIYIADPSNSVVRKIN 246
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 32/53 (60%), Gaps = 2/53 (3%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
GY G G N ARFN P+ V D+ NLY+AD N IRK+ G ++T+AG
Sbjct: 36 GYGGDA-GPANNARFNQPRAVATDNDNNLYIADMRNHVIRKVNSNGIISTVAG 87
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 62/120 (51%), Gaps = 15/120 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPN-EAR 133
P I V G ++ D NS + KI P ++ ++ GY+G DG P +A+
Sbjct: 221 PAGIAVDYSGNIYIADPSNSVVRKINPAGVITTFA-----GNGTAGYSG--DGGPAIKAQ 273
Query: 134 F--NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKF 189
F P+G+ +D GN+Y +D N AIRKI G +TTIAG G + AG GP+ AK
Sbjct: 274 FQMGSPQGLAVDPAGNVYASDYQNHAIRKISSKGIITTIAGTGAPDYAG-DGGPAILAKI 332
>gi|380693812|ref|ZP_09858671.1| hypothetical protein BfaeM_07498 [Bacteroides faecis MAJ27]
Length = 456
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 18/110 (16%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVA 177
+G +G++DG +A FN P + +D +GN+ + DT N IRKI G + GK +
Sbjct: 355 EGKSGYLDGPIGQAMFNKPGQMCVDSEGNIILTDTENHCIRKITMSTGYVSTLAGKPQNS 414
Query: 178 GFRDGPSEDAKF--------SNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G+ +G +EDA+F ND DV+Y+ D N A+R++++
Sbjct: 415 GYVNGSAEDAQFKKPLGICIDND-DVMYIG--------DSENRAIRRLAV 455
>gi|390957686|ref|YP_006421443.1| gluconolactonase [Terriglobus roseus DSM 18391]
gi|390412604|gb|AFL88108.1| gluconolactonase [Terriglobus roseus DSM 18391]
Length = 738
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 59/131 (45%), Gaps = 4/131 (3%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
LF+V V + V + P S+ G L G T V G + P+G+ D +GN
Sbjct: 27 LFSVLSVKAQAVLVLDPGSRV--GTLAGSGRDGRT--VAGTAQDVALGWPRGLAYDHEGN 82
Query: 148 LYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVI 207
LYVAD+ N I ++ GV + G + DG + A N V V P S+
Sbjct: 83 LYVADSRNHQIDRVSSGGVLAVVAGTGHQGYVGDGGAATAAELNAPTAVAVAPDGSVYFA 142
Query: 208 DRGNAALRQIS 218
D GN +R+I+
Sbjct: 143 DSGNHCIRRIA 153
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 61/135 (45%), Gaps = 9/135 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V+ DG ++ D N I +I + VAG+ G G ARF
Sbjct: 128 PTAVAVAPDGSVYFADSGNHCIRRIASGVITT-----VAGNGAPGFGGDGGPAMVARFRS 182
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFD 194
P G+ G+LYVADT N +RKI G V+TIAG G + AG DG A
Sbjct: 183 PGGLAFAADGSLYVADTGNRRVRKIPPGGSVSTIAGTGTEDDAG--DGGVATAASFRSPG 240
Query: 195 VVYVRPTCSLLVIDR 209
+ V P LL+ DR
Sbjct: 241 ALQVLPDGRLLIADR 255
>gi|29348305|ref|NP_811808.1| hypothetical protein BT_2896 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383121762|ref|ZP_09942466.1| hypothetical protein BSIG_1754 [Bacteroides sp. 1_1_6]
gi|29340208|gb|AAO78002.1| NHL repeat protein [Bacteroides thetaiotaomicron VPI-5482]
gi|251841348|gb|EES69429.1| hypothetical protein BSIG_1754 [Bacteroides sp. 1_1_6]
Length = 452
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 44/86 (51%), Gaps = 7/86 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY-TGHVDGKPNEARFN 135
P +I V +G L + KITP G + + GY TG+VDG EA+FN
Sbjct: 370 PNQIAVDAEGNLLVTTVYGRTVRKITP------EGYVSTYAGIGYQTGYVDGLAAEAKFN 423
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ +D +GN+YV D N IR I
Sbjct: 424 KPYGIAIDAQGNVYVGDCENWRIRVI 449
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 46/99 (46%), Gaps = 1/99 (1%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
G G +DG+ +R + P + +D +GNL V +RKI G + G G
Sbjct: 352 NGVWGIIDGQGVSSRMDQPNQIAVDAEGNLLVTTVYGRTVRKITPEGYVSTYAGIGYQTG 411
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+ DG + +AKF+ + + + ++ V D N +R I
Sbjct: 412 YVDGLAAEAKFNKPYGIA-IDAQGNVYVGDCENWRIRVI 449
>gi|297568314|ref|YP_003689658.1| hypothetical protein [Desulfurivibrio alkaliphilus AHT2]
gi|296924229|gb|ADH85039.1| YD repeat protein [Desulfurivibrio alkaliphilus AHT2]
Length = 2439
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 71/135 (52%), Gaps = 14/135 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKP-NEA 132
P+ + + DG ++ D N I ++ + G + +AG+ ++G++G DG P +EA
Sbjct: 1217 PHAVAIGPDGSIYIADTYNHRIRRVG------TDGIITTIAGTGYRGFSG--DGGPADEA 1268
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSN 191
R P V + G++Y+ADT N IR++G G +TT+AG S GP+ +A FS
Sbjct: 1269 RLFGPIAVAIGPDGSIYIADTYNHRIRRVGTDGIITTVAGTGSLGYSGDGGPATEASFST 1328
Query: 192 DFDVVYVRPTCSLLV 206
+ + P SL +
Sbjct: 1329 PSGITF-GPDGSLYI 1342
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 70/145 (48%), Gaps = 7/145 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI-TPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P I + DG ++ D +N I ++ T + G G + G G EAR +
Sbjct: 1156 PTDIALGPDGSIYIADSINHRIRRVGTDGIITTVAGTGPTGWWSGGFSGDGGSAVEARLD 1215
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSND 192
P V + G++Y+ADT N IR++G G +TTIAG + GF GP+++A+
Sbjct: 1216 RPHAVAIGPDGSIYIADTYNHRIRRVGTDGIITTIAG--TGYRGFSGDGGPADEARLFGP 1273
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
V + P S+ + D N +R++
Sbjct: 1274 I-AVAIGPDGSIYIADTYNHRIRRV 1297
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/148 (33%), Positives = 74/148 (50%), Gaps = 21/148 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQ-GYTGHVDGKP-NEA 132
P + + DG ++ D N I ++ + G + VAG+ GY+G DG P EA
Sbjct: 1273 PIAVAIGPDGSIYIADTYNHRIRRVG------TDGIITTVAGTGSLGYSG--DGGPATEA 1324
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKF 189
F+ P G+T G+LY+A N IR++G G +TTIAG + GF GP+++A+
Sbjct: 1325 SFSTPSGITFGPDGSLYIA--ANHRIRRVGTDGIITTIAG--TGYRGFSGDGGPADEARL 1380
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQI 217
V + P S+ V D N +R I
Sbjct: 1381 GPRG--VSLGPDGSIYVADSNNHRIRHI 1406
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 57/116 (49%), Gaps = 19/116 (16%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK 173
+AG+ G GH G AR P V++D G +Y+ADT N IR +G G+ T G
Sbjct: 1072 IAGT--GSWGHSGGLAVNARLYRPSCVSIDPDGTIYIADTGNHRIRSVGTDGIITTHAG- 1128
Query: 174 SNVAGFR------------DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+GFR DGP+ +A+ ++ D+ + P S+ + D N +R++
Sbjct: 1129 ---SGFRAGGLGDSGYSGDDGPAVNARLNSPTDIA-LGPDGSIYIADSINHRIRRV 1180
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 51/100 (51%), Gaps = 6/100 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNI--VKITPPLSQYSRGRLVAGSF--QGYTGHVDGKPNEA 132
P + + DG ++ D N I V ++ ++ AG GY+G DG A
Sbjct: 1093 PSCVSIDPDGTIYIADTGNHRIRSVGTDGIITTHAGSGFRAGGLGDSGYSGD-DGPAVNA 1151
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
R N P + + G++Y+AD++N IR++G G +TT+AG
Sbjct: 1152 RLNSPTDIALGPDGSIYIADSINHRIRRVGTDGIITTVAG 1191
>gi|206602867|gb|EDZ39348.1| Conserved protein of unknown function [Leptospirillum sp. Group II
'5-way CG']
Length = 374
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 70/146 (47%), Gaps = 9/146 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + DG ++ D N I K+ P VAGS + DG ARFN+
Sbjct: 62 PQGILAAPDGTIYIADTGNDMIRKMNPSTKSVEN---VAGSDH-RARYRDGVGANARFNN 117
Query: 137 PKGVTMD-DKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P+G+ + D LY+AD+ N IRKI A V+TIAG +G DG ++A F
Sbjct: 118 PEGMAISPDGKTLYIADSRNNMIRKIDLATKTVSTIAGHSFPSSG--DGVGKEAGFETPR 175
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISL 219
+ +L V D GN A+R+I L
Sbjct: 176 GLAISPDGKTLYVADSGNNAIRKIDL 201
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 56/108 (51%), Gaps = 4/108 (3%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA--GVTTIAG 171
+AGSF G +DG ARF P+G+ G +Y+ADT N IRK+ + V +A
Sbjct: 40 IAGSFH-ERGAIDGAGTNARFEFPQGILAAPDGTIYIADTGNDMIRKMNPSTKSVENVA- 97
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G + A +RDG +A+F+N + +L + D N +R+I L
Sbjct: 98 GSDHRARYRDGVGANARFNNPEGMAISPDGKTLYIADSRNNMIRKIDL 145
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 74/156 (47%), Gaps = 11/156 (7%)
Query: 70 GNEIGV-VPYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG 127
G E G P + +S DG+ L+ D N+ I KI + + +AG+ + +G DG
Sbjct: 166 GKEAGFETPRGLAISPDGKTLYVADSGNNAIRKIDLATNTVTT---LAGAGKLMSGSADG 222
Query: 128 KPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGPS 184
+A F+ P+ + + G LY+ADT N IRK + V+T+A G G +GP
Sbjct: 223 VGVQATFHEPRSLAISGDGQVLYIADTRNNLIRKMVLATNSVSTLA-GHPGFPGTLNGPG 281
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
DA F + V L V D NA +R + L+
Sbjct: 282 PDAYFYHPVSVTI--DGNKLYVADGANADIRMVDLS 315
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 12/120 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +S DG++ + + +N+++ L+ S L G+ G ++G +A F H
Sbjct: 232 PRSLAISGDGQVLYIADTRNNLIR-KMVLATNSVSTLAG--HPGFPGTLNGPGPDAYFYH 288
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRKI--GDAGVTTIAGGKSN----VAGFRDGPSEDAKF 189
P VT+D GN LYVAD N IR + V+T+AG N +AG DG + F
Sbjct: 289 PVSVTID--GNKLYVADGANADIRMVDLSTGVVSTVAGATLNGGVPIAGREDGSGVEGHF 346
>gi|108707786|gb|ABF95581.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
protein, expressed [Oryza sativa Japonica Group]
Length = 1064
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 55/110 (50%), Gaps = 5/110 (4%)
Query: 118 FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSN 175
Q G DG ++ HP GV +YVAD+ N I+++ VTTIAG +
Sbjct: 792 LQYLFGDHDGTGSDVLLQHPLGVVYASDNQIYVADSYNHKIKRLDPVTRKVTTIAG--TG 849
Query: 176 VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCE 225
AG++DGP+ A+ S +V V LLV D N+ +R I LN+ E
Sbjct: 850 RAGYKDGPALSAQLSEPAGLVEVGDG-RLLVADTNNSTIRYIVLNEKGAE 898
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + D +++ D N I ++ P + + +AG+ G G+ DG A+ +
Sbjct: 811 PLGVVYASDNQIYVADSYNHKIKRLDPVTRKVTT---IAGT--GRAGYKDGPALSAQLSE 865
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ G L VADT N IR I
Sbjct: 866 PAGLVEVGDGRLLVADTNNSTIRYI 890
>gi|455647406|gb|EMF26379.1| hypothetical protein H114_24527 [Streptomyces gancidicus BKS 13-15]
Length = 627
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 52/93 (55%), Gaps = 3/93 (3%)
Query: 125 VDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGP 183
VDG EA F P G+ + +G L+VAD+ + A+R++G D V T G G++DGP
Sbjct: 343 VDGSAKEALFAQPSGLAVTAEG-LWVADSESSALRRLGPDGTVRTAVGAGLFAFGYQDGP 401
Query: 184 SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
++ A F + V + P SL V D N ALR+
Sbjct: 402 ADGALFQHPLGVTAL-PDGSLAVSDTYNHALRR 433
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 57/121 (47%), Gaps = 11/121 (9%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFNHPK 138
+R+ G D +V++ G V F G G DG P ARF P+
Sbjct: 190 LRLPSSGTFLVTDTTRHQVVELA------EDGETVLRRFGSGGRGLEDGPPGRARFQEPQ 243
Query: 139 GVTMDDKGNLYVADTLNLAIRKIG-DAG-VTTIAGGKS--NVAGFRDGPSEDAKFSNDFD 194
G+ + G + VADT+N A+R++ ++G VTT+AG S DGP+ S+ +D
Sbjct: 244 GLALLPDGAVVVADTVNHALRRLDPESGEVTTLAGTGSPWRPGEATDGPARAVNLSSPWD 303
Query: 195 V 195
V
Sbjct: 304 V 304
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 50/96 (52%), Gaps = 7/96 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V+ +G L+ D +S + ++ P R + AG F G+ DG + A F H
Sbjct: 355 PSGLAVTAEG-LWVADSESSALRRLGP--DGTVRTAVGAGLFA--FGYQDGPADGALFQH 409
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
P GVT G+L V+DT N A+R+ A VTT+A
Sbjct: 410 PLGVTALPDGSLAVSDTYNHALRRYDPATGEVTTLA 445
>gi|442323754|ref|YP_007363775.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
gi|441491396|gb|AGC48091.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
Length = 906
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/99 (39%), Positives = 51/99 (51%), Gaps = 6/99 (6%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAG--VTTIAGGKSNVAGF 179
G VDG AR P GV +D GN++VADT N A+R+I DA VTTIAG G
Sbjct: 490 GFVDGPVARARLRRPVGVAVDGLGNIFVADTGNHAVRRIAPDAARTVTTIAG--LGTPGV 547
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+GP + + V P +L V D GN + +I+
Sbjct: 548 GEGPGATTALRSPQSIA-VAPDGTLYVADTGNHRIVRIA 585
Score = 45.4 bits (106), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 74/190 (38%), Gaps = 61/190 (32%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG 146
+L+ D N + +ITP +R V GS +G +G ++AR + P V D G
Sbjct: 627 DLYVTDTFNHRLARITP----QARVSTVIGSRG--SGSTNGPASQARLHRPTAVAFGD-G 679
Query: 147 NLYVADTLNLAIRKIG-DAGVTT------------------------------------- 168
L+V DT N IR++ DA TT
Sbjct: 680 ALWVVDTGNRLIRRVAMDASFTTTTVAGSAPGGFADGAGGAAQFLPMSGAVHVDGRLLLT 739
Query: 169 ---------IAGGK------SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 213
+ GG+ S G RDG +E A FS + + P ++LV+D+G +
Sbjct: 740 DTGNERIRMLVGGRVRTYAGSGAHGARDGTAEQATFSLPTGIAAL-PNGNMLVVDQGAST 798
Query: 214 LRQISLNQDD 223
LR++ + D
Sbjct: 799 LRELRMPAAD 808
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 60/145 (41%), Gaps = 9/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G +F D N + +I P ++ +AG G G +G
Sbjct: 504 PVGVAVDGLGNIFVADTGNHAVRRIAPDAARTVT--TIAG--LGTPGVGEGPGATTALRS 559
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P+ + + G LYVADT N I +I G V+T AG + G DG A+F
Sbjct: 560 PQSIAVAPDGTLYVADTGNHRIVRIARDGRWTVSTFAGSREGRQGRADGTGPAARFQTPT 619
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
+V+ L V D N L +I+
Sbjct: 620 SLVFA--GTDLYVTDTFNHRLARIT 642
>gi|196229731|ref|ZP_03128595.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
gi|196226057|gb|EDY20563.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
Length = 360
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 74/149 (49%), Gaps = 15/149 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPN-EARF 134
P+ + DG ++ + + ++TP ++ AG Q GY+G DG P EA F
Sbjct: 50 PFGLVRGPDGAIWFCEYGGQRLRRVTPDGKIHT----AAGIGQKGYSG--DGGPALEATF 103
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAG-GKSNVAGFRDGPSEDAKFS 190
N P + D GN Y+AD N AIR++ DA +TT AG GK +G GP+ A+
Sbjct: 104 NLPHEIRFDRAGNYYIADMANHAIRRV-DAKTKIITTFAGTGKPGYSG-DGGPAAQAQLK 161
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+ + P SL + D GN +R++ +
Sbjct: 162 QPHSIQF-GPDGSLYICDVGNNCIRKVDM 189
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 48/86 (55%), Gaps = 5/86 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFN 135
P + ++G L+ V + ++K S + AG+ + G+TG+ G EA +
Sbjct: 220 PRSMDFDKEGNLWLVTREGNQVLKFDAKTGIIS---IAAGTGKKGFTGN-GGPALEATLS 275
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI 161
PKG+ +D +GN+++ADT + +IR+I
Sbjct: 276 GPKGIAVDAQGNVWLADTESHSIREI 301
>gi|428180908|gb|EKX49774.1| hypothetical protein GUITHDRAFT_67798, partial [Guillardia theta
CCMP2712]
Length = 247
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 76/147 (51%), Gaps = 14/147 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRG----RLVAGSFQGYTGHVDGKPNEA 132
P + ++ GE+F V S++V LS+ S G +VAG G G DG
Sbjct: 37 PRGLTLNASGEVF-VSYCKSHVV---AALSRGSSGLWDISVVAGC--GRRGCQDGSHESG 90
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
R N P+G+T D G+LY+AD+ N IR++ D + T+AG S RDG +A FS
Sbjct: 91 RLNGPRGLTFDPHGDLYIADSSNHRIRRLRSQDMFLETVAG--SGEGESRDGTLLNASFS 148
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQI 217
+ + + T +L+V ++ + +R++
Sbjct: 149 FPYGLAWDALTNTLIVSEQESHKIRRV 175
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 37/59 (62%), Gaps = 3/59 (5%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
+AG+ G GH DG+ ++A FN P + D +GN++VAD+ N +IR I G V T+ G
Sbjct: 185 IAGT--GECGHRDGEASQATFNEPCFIACDQQGNIFVADSRNDSIRCITRHGFVHTLVG 241
>gi|307110397|gb|EFN58633.1| hypothetical protein CHLNCDRAFT_140870 [Chlorella variabilis]
Length = 415
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 65/143 (45%), Gaps = 12/143 (8%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEAR----FNHPKGVTMD 143
++AVD + +I + VAG G G+ DG A FN P +
Sbjct: 61 VYAVDTSRHTVGRIALDTGTW---ETVAG-LDGTPGYRDGPEGPAGQPALFNRPSAICQM 116
Query: 144 DKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 201
G+L VADT N IR+I A V+T+A G+ G DGP+ +A+F + +
Sbjct: 117 PHGHLAVADTGNACIRQIDAATKQVSTLA-GRCGEPGAADGPAAEAQFGSSIKSIACA-N 174
Query: 202 CSLLVIDRGNAALRQISLNQDDC 224
CS+ V D LR + ++ +C
Sbjct: 175 CSVFVGDVSTGRLRLVRVDDAEC 197
>gi|452910311|ref|ZP_21958992.1| hypothetical protein C884_01866 [Kocuria palustris PEL]
gi|452834558|gb|EME37358.1| hypothetical protein C884_01866 [Kocuria palustris PEL]
Length = 632
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 65/138 (47%), Gaps = 15/138 (10%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA-----G 165
G L + G G VDG P+++ F G+ G ++ AD+ + A+R I A G
Sbjct: 338 GELAVAAGTGDEGLVDGDPSQSWFAQTSGLVEAPDGTIWAADSESSALRSIAVADGALSG 397
Query: 166 VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCE 225
V T AG GFRDG S A+ + V V P S+ V D N A+R+ D E
Sbjct: 398 VATAAGLGLYDFGFRDGDSTQARLQHPLGVA-VLPDGSIAVADTYNGAIRR----WDPVE 452
Query: 226 YQYNSI-----SPTDILM 238
Q +++ P+D+L+
Sbjct: 453 GQLSTLERDFDEPSDLLV 470
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 51/104 (49%), Gaps = 14/104 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K EDG + D + +V++ L R V G G G DG P EA FN
Sbjct: 176 PEKAVALEDGSVLVADTGHHRLVRMAADLQTV---RSVIG--DGTRGPADGGPEEAHFNE 230
Query: 137 PKGVTMDDKG-------NLYVADTLNLAIR--KIGDAGVTTIAG 171
P+G+ + +G ++ VADT+N +R ++ D VTT+AG
Sbjct: 231 PRGLALLPQGVREQVGYDVIVADTVNHRLRGVRLSDGEVTTLAG 274
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 36/76 (47%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
DG ++A D +S + I S AG G DG +AR HP GV +
Sbjct: 372 DGTIWAADSESSALRSIAVADGALSGVATAAGLGLYDFGFRDGDSTQARLQHPLGVAVLP 431
Query: 145 KGNLYVADTLNLAIRK 160
G++ VADT N AIR+
Sbjct: 432 DGSIAVADTYNGAIRR 447
>gi|418677567|ref|ZP_13238841.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
str. RM52]
gi|418687786|ref|ZP_13248945.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
str. Moskva]
gi|418742122|ref|ZP_13298495.1| putative lipoprotein [Leptospira kirschneri serovar Valbuzzi str.
200702274]
gi|400320757|gb|EJO68617.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
str. RM52]
gi|410738110|gb|EKQ82849.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
str. Moskva]
gi|410750480|gb|EKR07460.1| putative lipoprotein [Leptospira kirschneri serovar Valbuzzi str.
200702274]
Length = 358
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 64/118 (54%), Gaps = 8/118 (6%)
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG- 162
P Q+S L AGS +G +G + F+ P + +D + NLYV + N IRKI
Sbjct: 150 PTEQFS---LFAGSSSNLSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKINL 206
Query: 163 DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
D+G V+T +GG ++G+ DG A+F + + Y R T SLL D + +R+I+L
Sbjct: 207 DSGMVSTFSGG---ISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKINL 261
>gi|312200909|ref|YP_004020970.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311232245|gb|ADP85100.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 847
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 76/149 (51%), Gaps = 14/149 (9%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKP-NEA 132
+ P + + DG ++ V +++NIV +S+ +AG+ G++G DG P A
Sbjct: 528 LAPLHLFMGTDGSVY-VSSLDTNIVHR---ISKDGTVTPIAGNGTAGFSG--DGGPATSA 581
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR--DGPSEDAKF 189
N P +D GN+YV DT N IRKI D +TT+ G + AGF GP+ A+
Sbjct: 582 ELNGPGTAVVDKNGNIYVPDTANNRIRKITPDGKITTVVG--NGTAGFSGDGGPATQAEI 639
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
N + + V P SL + D N +R+++
Sbjct: 640 -NSVEGIAVGPDGSLYLADYSNERIRKVT 667
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 60/122 (49%), Gaps = 7/122 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P V ++G ++ D N+ I KITP + +V G++G DG P +A N
Sbjct: 586 PGTAVVDKNGNIYVPDTANNRIRKITPDGKITT---VVGNGTAGFSG--DGGPATQAEIN 640
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
+G+ + G+LY+AD N IRK+ D ++TIAG + P+ A+ S+
Sbjct: 641 SVEGIAVGPDGSLYLADYSNERIRKVTPDGIISTIAGTGTKGYTSTPTPALSAQISDPNS 700
Query: 195 VV 196
VV
Sbjct: 701 VV 702
>gi|423298428|ref|ZP_17276486.1| hypothetical protein HMPREF1070_05151 [Bacteroides ovatus
CL03T12C18]
gi|392663340|gb|EIY56891.1| hypothetical protein HMPREF1070_05151 [Bacteroides ovatus
CL03T12C18]
Length = 507
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 41/69 (59%), Gaps = 2/69 (2%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD 181
TG+ DG EARF + + DD GN+Y+ D N AIR + + GV+T+ GG N +G +
Sbjct: 412 TGYWDGIFAEARFLRGEQMCSDDAGNIYIVDQDNHAIRMVNNTGVSTVIGG--NGSGAVN 469
Query: 182 GPSEDAKFS 190
G +DAK
Sbjct: 470 GVGKDAKLC 478
>gi|441163237|ref|ZP_20968264.1| NHL repeat-containing protein [Streptomyces rimosus subsp. rimosus
ATCC 10970]
gi|440616376|gb|ELQ79518.1| NHL repeat-containing protein [Streptomyces rimosus subsp. rimosus
ATCC 10970]
Length = 551
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 71/144 (49%), Gaps = 14/144 (9%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV-DGKP-NEARFNHP 137
+ V G L+ D N + K+TP +G + + G G+ DG P ++ N P
Sbjct: 1 MAVDAAGNLYIADSANQRVRKVTP------QGTITTVAGTGTAGYTSDGGPATSSQLNTP 54
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRD--GPSEDAKFSNDFD 194
VT+DD GNLY+A++ + IRK+ G +TT+AG + AG+ D GP+ +
Sbjct: 55 AYVTLDDAGNLYIAESGSQRIRKVTTDGIITTVAG--NGTAGYVDDGGPATATRLYGPRG 112
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
V R +L + D N +R ++
Sbjct: 113 VALDR-AGNLYIADGDNNRVRGVT 135
>gi|357412450|ref|YP_004924186.1| alkyl hydroperoxide reductase [Streptomyces flavogriseus ATCC
33331]
gi|320009819|gb|ADW04669.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces flavogriseus ATCC 33331]
Length = 603
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 57/126 (45%), Gaps = 11/126 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P K V DG D +V++ + G V G F G G DG EARF+
Sbjct: 177 PGKALVLPDGGFLVSDSTRHRLVELE------ADGETVRGHFGTGERGFADGGREEARFS 230
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSN---VAGFRDGPSEDAKFSN 191
P+G+ + G + VADT+N AIR + GVTT G DGP+ + S+
Sbjct: 231 EPQGLAVLPDGRIAVADTVNHAIRALDLTTGVTTTLAGTGRQWWQGSATDGPATEVDLSS 290
Query: 192 DFDVVY 197
+D+ +
Sbjct: 291 PWDIAW 296
>gi|421129432|ref|ZP_15589632.1| putative lipoprotein [Leptospira kirschneri str. 2008720114]
gi|410358807|gb|EKP05916.1| putative lipoprotein [Leptospira kirschneri str. 2008720114]
Length = 358
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 66/124 (53%), Gaps = 9/124 (7%)
Query: 98 IVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLA 157
I KI P Q+S L AGS +G +G + F+ P + +D + NLYV + N
Sbjct: 145 IYKIDPT-EQFS---LFAGSSSNLSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHT 200
Query: 158 IRKIG-DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALR 215
IRKI D+G V+T +GG ++G+ DG A+F + + Y R T SLL D + +R
Sbjct: 201 IRKINLDSGMVSTFSGG---ISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIR 257
Query: 216 QISL 219
+I L
Sbjct: 258 KIDL 261
>gi|386841352|ref|YP_006246410.1| hypothetical protein SHJG_5269 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374101653|gb|AEY90537.1| hypothetical protein SHJG_5269 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451794647|gb|AGF64696.1| hypothetical protein SHJGH_5033 [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 611
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 48/80 (60%), Gaps = 4/80 (5%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNV 176
G G DG P EA F+ P+G+ + D G++ VADT+N A+R++ A VTT+AG GK
Sbjct: 219 GTRGFADGGPREASFSEPQGLALLDDGSVVVADTVNHALRRLDPATGEVTTLAGTGKQWW 278
Query: 177 AG-FRDGPSEDAKFSNDFDV 195
G GP+ + S+ +DV
Sbjct: 279 QGSATSGPAREVDLSSPWDV 298
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
GH DG ++A HP GVT G++ VADT N A+R+ A VTT+A
Sbjct: 390 GHRDGAADQALLQHPLGVTALPDGSVAVADTYNHALRRYDPATGEVTTLA 439
>gi|398339828|ref|ZP_10524531.1| hypothetical protein LkirsB1_10395 [Leptospira kirschneri serovar
Bim str. 1051]
Length = 358
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 63/118 (53%), Gaps = 8/118 (6%)
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG- 162
P Q+S L AGS +G +G + F+ P + +D + NLYV + N IRKI
Sbjct: 150 PTEQFS---LFAGSSSNLSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKINL 206
Query: 163 DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
D+G V+T +GG ++G+ DG A+F + + Y R T SLL D + +R+I L
Sbjct: 207 DSGMVSTFSGG---ISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKIDL 261
>gi|218190788|gb|EEC73215.1| hypothetical protein OsI_07299 [Oryza sativa Indica Group]
Length = 1103
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 55/107 (51%), Gaps = 5/107 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR 180
G DG ++ HP GV +YVAD+ N I+++ VTTIAG + AG++
Sbjct: 812 GDHDGTGSDVLLQHPLGVVYASDNQIYVADSYNHKIKRLDPVTRKVTTIAG--TGRAGYK 869
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQ 227
DGP+ A+ S +V V LLV D N+ +R I LN+ E +
Sbjct: 870 DGPALSAQLSEPAGLVEVGDG-RLLVADTNNSTIRYIVLNEKGAEVR 915
Score = 42.7 bits (99), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 53/106 (50%), Gaps = 17/106 (16%)
Query: 89 FAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY---------TGHVDGKPNEARFNHPKG 139
A+D +N+ + +S + R+V + +G G +DG + A FN P+G
Sbjct: 589 LAIDVLNNRLF-----ISDSNHNRIVVTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQG 643
Query: 140 VTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDG 182
+ + K N LYVADT N A+R+I V T+AG + + +R G
Sbjct: 644 LAYNSKKNILYVADTENHALREINFVSETVKTLAGNGTKGSDYRGG 689
Score = 38.5 bits (88), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + D +++ D N I ++ P + + +AG+ G G+ DG A+ +
Sbjct: 826 PLGVVYASDNQIYVADSYNHKIKRLDPVTRKVTT---IAGT--GRAGYKDGPALSAQLSE 880
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ G L VADT N IR I
Sbjct: 881 PAGLVEVGDGRLLVADTNNSTIRYI 905
>gi|29349674|ref|NP_813177.1| hypothetical protein BT_4266 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341584|gb|AAO79371.1| conserved hypothetical protein with NHL repeat [Bacteroides
thetaiotaomicron VPI-5482]
Length = 490
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 67/146 (45%), Gaps = 7/146 (4%)
Query: 80 IRVSEDGELFAVDEVN-SNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPK 138
I ++E +++ VD N K T P Y+ +V G G G DG A+FN+P
Sbjct: 345 IAMAEGNQIWRVDVGNLEGKDKKTYPGEGYAGKAIVEGQIAG-KGWEDGLLRNAKFNNPH 403
Query: 139 GVTMDDKGNLYVADTLNLAIRKIG-----DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ + G LY+AD N IR I D + G + G++DG + A F++ F
Sbjct: 404 QICFTEDGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGMKGYKDGGPDIAMFNHPF 463
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISL 219
V + V D GN +R++S+
Sbjct: 464 GVAVSADGQIVYVADTGNKVIRKLSI 489
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 45/86 (52%), Gaps = 1/86 (1%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P++I +EDG+L+ D N+ I I LS G G+ DG P+ A FNH
Sbjct: 402 PHQICFTEDGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGMKGYKDGGPDIAMFNH 461
Query: 137 PKGVTMDDKGNL-YVADTLNLAIRKI 161
P GV + G + YVADT N IRK+
Sbjct: 462 PFGVAVSADGQIVYVADTGNKVIRKL 487
>gi|421091013|ref|ZP_15551797.1| putative lipoprotein [Leptospira kirschneri str. 200802841]
gi|410000210|gb|EKO50873.1| putative lipoprotein [Leptospira kirschneri str. 200802841]
Length = 358
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 63/118 (53%), Gaps = 8/118 (6%)
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG- 162
P Q+S L AGS +G +G + F+ P + +D + NLYV + N IRKI
Sbjct: 150 PTEQFS---LFAGSSSNLSGLQNGDRLNSLFDGPFFMDIDLERNLYVGELSNHTIRKINL 206
Query: 163 DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
D+G V+T +GG ++G+ DG A+F + + Y R T SLL D + +R+I L
Sbjct: 207 DSGMVSTFSGG---ISGYLDGDLTSARFKSPLGIAYDRKTDSLLAADIQDHRIRKIDL 261
>gi|320103301|ref|YP_004178892.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
gi|319750583|gb|ADV62343.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
Length = 676
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 57/116 (49%), Gaps = 4/116 (3%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVT 167
G L A G G DG A FN P+G+ +D N LYVADT N AIR I VT
Sbjct: 234 GSLKAVVGNGKIGMRDGAYERASFNRPQGIRLDALRNRLYVADTENHAIRAIDLTTRSVT 293
Query: 168 TIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
T+AG G+ G GP+ ++ +D+V + T LV G + +I +Q+
Sbjct: 294 TVAGTGEMVYPGLPGGPARRFGLNSPWDLVQIPETNQFLVAMAGTHQIYKIDFDQN 349
>gi|222624800|gb|EEE58932.1| hypothetical protein OsJ_10596 [Oryza sativa Japonica Group]
Length = 1017
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 55/107 (51%), Gaps = 5/107 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR 180
G DG ++ HP GV +YVAD+ N I+++ VTTIAG + AG++
Sbjct: 726 GDHDGTGSDVLLQHPLGVVYASDNQIYVADSYNHKIKRLDPVTRKVTTIAG--TGRAGYK 783
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQ 227
DGP+ A+ S +V V LLV D N+ +R I LN+ E +
Sbjct: 784 DGPALSAQLSEPAGLVEVGDG-RLLVADTNNSTIRYIVLNEKGAEVR 829
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 53/106 (50%), Gaps = 17/106 (16%)
Query: 89 FAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY---------TGHVDGKPNEARFNHPKG 139
A+D +N+ + +S + R+V + +G G +DG + A FN P+G
Sbjct: 503 LAIDVLNNRLF-----ISDSNHNRIVVTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQG 557
Query: 140 VTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDG 182
+ + K N LYVADT N A+R+I V T+AG + + +R G
Sbjct: 558 LAYNSKKNILYVADTENHALREINFVSETVKTLAGNGTKGSDYRGG 603
Score = 38.5 bits (88), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + D +++ D N I ++ P + + +AG+ G G+ DG A+ +
Sbjct: 740 PLGVVYASDNQIYVADSYNHKIKRLDPVTRKVTT---IAGT--GRAGYKDGPALSAQLSE 794
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ G L VADT N IR I
Sbjct: 795 PAGLVEVGDGRLLVADTNNSTIRYI 819
>gi|425734502|ref|ZP_18852820.1| NHL repeat-containing protein [Brevibacterium casei S18]
gi|425481116|gb|EKU48277.1| NHL repeat-containing protein [Brevibacterium casei S18]
Length = 666
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 64/131 (48%), Gaps = 6/131 (4%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTI 169
RL++G+ G V+G A F G+ + G + VAD+ AIR++ A TT+
Sbjct: 366 RLLSGTMN--EGLVNGDAESAWFAQSSGLDLHPDGGVIVADSETSAIRRLDPASGEATTL 423
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYN 229
G GFRDGP+ +A+ + V + P SL + D N A+R+ ++
Sbjct: 424 VGTGLFDFGFRDGPAAEARLQHPLGVRTL-PDGSLAIADTYNGAIRRYDFTTNEVSTLAR 482
Query: 230 SI-SPTDILMV 239
+ P+DIL+V
Sbjct: 483 GLREPSDILVV 493
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 73/158 (46%), Gaps = 28/158 (17%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ V G L D + ++V+ TP R R+ G G DG A+F+
Sbjct: 200 PGKVTVLPSGNLLVADSGHHSLVEYTPDGQTVLR-RIGTGE----RGLTDGDFASAQFSE 254
Query: 137 PKGVTM--DDKG-----NLYVADTLNLAIRKIGDAG--VTTIAG-GKSNVAGFRD----- 181
P G+T+ +D L VADT+N +R I G V TIAG G+ ++ G D
Sbjct: 255 PGGITVLPEDIAARAGYQLVVADTVNHVLRGIDLDGETVRTIAGTGEQHMVGAIDNVRGT 314
Query: 182 --------GPSEDAKFSNDFDVVYVRPTCSLLVIDRGN 211
GP+ D K S+ +DV++V T ++V GN
Sbjct: 315 HGALGRYSGPALDVKLSSPWDVLFVPATGEVVVAMAGN 352
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 36/76 (47%), Gaps = 3/76 (3%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
DG + D S I ++ P + + LV + G DG EAR HP GV
Sbjct: 397 DGGVIVADSETSAIRRLDPASGEATT--LVGTGLFDF-GFRDGPAAEARLQHPLGVRTLP 453
Query: 145 KGNLYVADTLNLAIRK 160
G+L +ADT N AIR+
Sbjct: 454 DGSLAIADTYNGAIRR 469
>gi|194767578|ref|XP_001965892.1| GF16360 [Drosophila ananassae]
gi|190619368|gb|EDV34892.1| GF16360 [Drosophila ananassae]
Length = 300
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/97 (41%), Positives = 51/97 (52%), Gaps = 8/97 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P KI S G FA+ + +N V + SRG + G +DGK EARF H
Sbjct: 48 PAKIARSPAGR-FAISDAGNNRVLVVS-----SRGLVEHIIGDHKAGLIDGKFTEARFKH 101
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAG 171
P+G+T D+ L VADT N A+RKI D V T+AG
Sbjct: 102 PQGLTFLDEHTLIVADTENHALRKISLADGIVKTLAG 138
>gi|428175994|gb|EKX44881.1| hypothetical protein GUITHDRAFT_60229, partial [Guillardia theta
CCMP2712]
Length = 181
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 76/170 (44%), Gaps = 22/170 (12%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAG---VTTIAGG--KSNV 176
G+ DG+ RFN P + G+ L VADT N AIR+I DA VTTIAG S+
Sbjct: 8 GYQDGQAKIVRFNSPLDIAFSKDGSWLAVADTNNHAIRRI-DAMTGVVTTIAGCPRSSSC 66
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDI 236
G +DGP+ A F++ + + V D N +RQI L + +
Sbjct: 67 LGSQDGPASAASFNSPTSIALDPQDKFIFVADTSNNMIRQIDL------------TAMTV 114
Query: 237 LMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEK 286
+ GA GY +L F++ T + +F E ++ +++ +
Sbjct: 115 TTIAGATRSGYADGLLA---ASLFYNPTGITAHPDFTNENKLVMVADTQN 161
>gi|290973007|ref|XP_002669241.1| predicted protein [Naegleria gruberi]
gi|284082786|gb|EFC36497.1| predicted protein [Naegleria gruberi]
Length = 747
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 71/149 (47%), Gaps = 12/149 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNI--VKITPPLSQYSRGRLVAGSFQGYTGHVD--GKPNEA 132
P I +S GE++ D N I + + +S + AGS GY+G+V G +A
Sbjct: 326 PRGIAISSTGEIYIADTYNQRIRRIALNTNISTF------AGSGFGYSGYVGDGGLSTDA 379
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
N P V G +Y+ADT N IRKI + +TTIAG + G + +A+ S+
Sbjct: 380 LLNTPLSVACSSNGEIYIADTYNHRIRKISLNNTITTIAGTGDSGFSGDGGLAINARLSS 439
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISLN 220
D+V V + D N +R+I+ N
Sbjct: 440 PADIV-VNSNGVIYFSDYDNNRIRKIASN 467
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 65/146 (44%), Gaps = 8/146 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I +S GE++ D N I +I + VAG+ G +A+ N+
Sbjct: 158 PRGIAISSTGEIYIADTYNHRIRRI----ALNGTINTVAGTGDSRFGGDGDLATKAQLNY 213
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR-DGPSEDAKFSNDFD 194
P G+ + G +Y+ADT N IR+I G + TIAG + V G DG N
Sbjct: 214 PMGIAISSTGEIYIADTFNERIRRIALNGTINTIAG--TGVLGLSGDGDLATKAQLNTPR 271
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLN 220
+ + T + D N +R+I+LN
Sbjct: 272 GIAISSTGEIYFADTSNQRIRRIALN 297
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 71/163 (43%), Gaps = 30/163 (18%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFN 135
P I +S GE++ D N I +I + +AG+ G +G D +A+ N
Sbjct: 214 PMGIAISSTGEIYIADTFNERIRRI----ALNGTINTIAGTGVLGLSGDGD-LATKAQLN 268
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P+G+ + G +Y ADT N IR+I G+ TIAG + D +F D D
Sbjct: 269 TPRGIAISSTGEIYFADTSNQRIRRIALNGIIDTIAG------------TGDPRFGGDGD 316
Query: 195 V-----------VYVRPTCSLLVIDRGNAALRQISLNQDDCEY 226
+ + + T + + D N +R+I+LN + +
Sbjct: 317 LATKAQLNSPRGIAISSTGEIYIADTYNQRIRRIALNTNISTF 359
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 75/177 (42%), Gaps = 36/177 (20%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V GE+ VD I KI + +AG+ G +A+ N
Sbjct: 105 PTGIVVGTRGEILFVDS--DRIRKIENGIIT-----TIAGTGDSRFGGDGDLATKAQLNS 157
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+G+ + G +Y+ADT N IR+I G + T+AG + D++F D D+
Sbjct: 158 PRGIAISSTGEIYIADTYNHRIRRIALNGTINTVAG------------TGDSRFGGDGDL 205
Query: 196 -----------VYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVG 241
+ + T + + D N +R+I+LN N+I+ T +L + G
Sbjct: 206 ATKAQLNYPMGIAISSTGEIYIADTFNERIRRIALNG-----TINTIAGTGVLGLSG 257
>gi|111219554|ref|YP_710348.1| protein serine/threonine kinase [Frankia alni ACN14a]
gi|111147086|emb|CAJ58733.1| putative Protein serine/threonine kinase [Frankia alni ACN14a]
Length = 765
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 66/145 (45%), Gaps = 13/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPN-EAR 133
P + V +DG L+ + V+ +GR+ VAG Y DG P A
Sbjct: 560 PSAVAVDDDGVLYVAEGYRVRRVE---------KGRITTVAGKATEYGSAGDGGPAVNAT 610
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSND 192
P G+ + D G+LYVAD +R+I +G T+ G+ G R DG + A +D
Sbjct: 611 LYQPSGLALGDDGSLYVADRGEDTVRRIDRSGRITLVAGRPGTYGNRGDGRAATAALLDD 670
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
+ + P SL + D GN +R++
Sbjct: 671 PTGLALGPDGSLYIADAGNDVIRRV 695
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 60/130 (46%), Gaps = 6/130 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P + + +DG L+ D + +I + R LVAG Y DG+ A +
Sbjct: 614 PSGLALGDDGSLYVADRGEDTVRRI----DRSGRITLVAGRPGTYGNRGDGRAATAALLD 669
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P G+ + G+LY+AD N IR++ G +TT+AG S AG DG +D
Sbjct: 670 DPTGLALGPDGSLYIADAGNDVIRRVDGHGIITTVAGTASYSAGDHDGALATQTRLSDPA 729
Query: 195 VVYVRPTCSL 204
V V PT ++
Sbjct: 730 GVAVDPTGTV 739
>gi|380694022|ref|ZP_09858881.1| hypothetical protein BfaeM_08588 [Bacteroides faecis MAJ27]
Length = 490
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 67/146 (45%), Gaps = 7/146 (4%)
Query: 80 IRVSEDGELFAVDEVN-SNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPK 138
I ++E +++ VD N K T P Y+ +V G G G DG A+FN+P
Sbjct: 345 IAMAEGNQIWRVDVGNLEGKDKNTYPGEGYAGKAIVEGQVAG-KGWEDGLLRNAKFNNPH 403
Query: 139 GVTMDDKGNLYVADTLNLAIRKIG-----DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ + G LY+AD N IR I D + G + G++DG + A F++ F
Sbjct: 404 QICFTEDGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGMKGYKDGGPDIAMFNHPF 463
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISL 219
V + V D GN +R++S+
Sbjct: 464 GVAVSADGQIVYVADTGNKVIRKLSI 489
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 45/86 (52%), Gaps = 1/86 (1%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P++I +EDG+L+ D N+ I I LS G G+ DG P+ A FNH
Sbjct: 402 PHQICFTEDGKLYIADCGNNCIRVIDTRLSIDKATVSTPIGLPGMKGYKDGGPDIAMFNH 461
Query: 137 PKGVTMDDKGNL-YVADTLNLAIRKI 161
P GV + G + YVADT N IRK+
Sbjct: 462 PFGVAVSADGQIVYVADTGNKVIRKL 487
>gi|297582981|ref|YP_003698761.1| NHL repeat containing protein [Bacillus selenitireducens MLS10]
gi|297141438|gb|ADH98195.1| NHL repeat containing protein [Bacillus selenitireducens MLS10]
Length = 414
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 66/131 (50%), Gaps = 9/131 (6%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN------- 175
G+ +G P++ +FNHP G+ D G L + D+ N +R + +G + G+ +
Sbjct: 92 GYQNGNPDDTKFNHPAGIVPFDDGFL-ITDSGNHTLRYVSMSGQSETFAGRYDGYDEYGE 150
Query: 176 -VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPT 234
F G E+A F +VY + L V D GN A+R+++ + + + PT
Sbjct: 151 PEGRFDHGTGEEAGFDTPLGLVYDEDSGLLYVADAGNGAIRRVTEDGEVSTVAEDLDYPT 210
Query: 235 DILMVVGAVLV 245
D++++ G+++V
Sbjct: 211 DLILLDGSLIV 221
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 49/97 (50%), Gaps = 4/97 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G DG +EA F+ P+GV + G + VADT N +R I + V +AG V G +G
Sbjct: 320 GSDDGPVSEAGFHFPRGVAVLSSGAILVADTYNHRLRLITEDEVLPVAG--HGVHGMVNG 377
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
P EDA F + V +LV D N +R++ L
Sbjct: 378 PVEDALFDGPYHVAVFGE--RILVTDHWNHMIREVEL 412
>gi|290985668|ref|XP_002675547.1| predicted protein [Naegleria gruberi]
gi|284089144|gb|EFC42803.1| predicted protein [Naegleria gruberi]
Length = 1037
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 53/96 (55%), Gaps = 6/96 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + S G++F D N I K++ + VAG G+ +V +A+ N
Sbjct: 445 PYGVVTSSSGDVFIADTSNCRIRKVSSSTGIITT---VAGGTCGFGDNV--LAVDAQLNT 499
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
P G++++ KG L++ADT N IRK+ +G ++TIAG
Sbjct: 500 PYGISVNSKGELFIADTNNHRIRKVSSSGFISTIAG 535
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 64/149 (42%), Gaps = 13/149 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P KI V G + D N+ I I S +AGS G+ G A N
Sbjct: 54 PTKIVVDSAGRVVFCDNSNNRIRMI----SNNGTISTIAGSGTGFVLGDGGLATNAILNM 109
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DG---PSEDAKFSND 192
P G+T++ G + +ADT N IRKI G+ T G N GF DG S F D
Sbjct: 110 PTGLTINSIGEILIADTSNFKIRKIDLNGIITTIAGSGN-QGFEGDGGLATSAPLNFPAD 168
Query: 193 FDVVYVRPTCS-LLVIDRGNAALRQISLN 220
V V PT + + + D N +R+I N
Sbjct: 169 ---VSVHPTTNEVFIADSNNHCVRKILTN 194
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 66/154 (42%), Gaps = 28/154 (18%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--------DGK 128
PY + + GE F +S Y G++ S G V DGK
Sbjct: 281 PYNVAFNSAGEAF---------------ISDYGSGKIRKVSLNGIITTVVGEYLQSQDGK 325
Query: 129 -PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSED 186
+ N PKGV + G +Y AD+ AIRKI +G + TIAG S + GF
Sbjct: 326 LATDVVLNTPKGVAISPSGEVYFADSDKYAIRKITLSGTIITIAG--SGLYGFAGDNGYS 383
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
++ SN +D +++ + D N +R+I +N
Sbjct: 384 SQLSNSYD-IFINSLGEAFIADTYNHRIRKIDVN 416
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 64/144 (44%), Gaps = 11/144 (7%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHP 137
Y I ++ GE F D N I KI + +V F G G +A P
Sbjct: 390 YDIFINSLGEAFIADTYNHRIRKIDVNGIIMTIAGIVTSGFSGD----GGLATKAELFEP 445
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGP-SEDAKFSNDFD 194
GV G++++ADT N IRK+ + +TT+AGG GF D + DA+ + +
Sbjct: 446 YGVVTSSSGDVFIADTSNCRIRKVSSSTGIITTVAGG---TCGFGDNVLAVDAQLNTPYG 502
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ V L + D N +R++S
Sbjct: 503 -ISVNSKGELFIADTNNHRIRKVS 525
Score = 45.8 bits (107), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 66/263 (25%), Positives = 112/263 (42%), Gaps = 52/263 (19%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
PY I V+ GELF D N I K++ S G + +AG+ G G A
Sbjct: 500 PYGISVNSKGELFIADTNNHRIRKVS------SSGFISTIAGNGVGGFSGDGGLATNANL 553
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSND 192
P V ++ G +++AD+ IRKI G + TIAG G S G + + +++ +
Sbjct: 554 FKPSKVVVNSIGEIFIADSSTNRIRKILTNGTIITIAGNGNSGFNG-DEADATNSQLGSP 612
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI--------SLNQDDCEY-QYNSISPTD-------- 235
+ + + T + + D+GN +R++ +L C Y YN+ S D
Sbjct: 613 YGIA-LSSTGEIYISDQGNNRIRKLVPYCVNQGTLVNGSCFYTDYNNNSTKDNSNVSNNN 671
Query: 236 --------------ILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEET-EIKE 280
+V G V+V + C++ F RT++ +E E E
Sbjct: 672 QILAVVLPSVLVPVFCIVAGIVVVLLICCVI-------FSKRTRKEKSTEKVESVPRDLE 724
Query: 281 LSNKEKPIPIVESMKEEP-GWPS 302
++ + + + +ES+K EP +PS
Sbjct: 725 MAIESQTLQTMESVKSEPTDFPS 747
Score = 41.6 bits (96), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 66/138 (47%), Gaps = 10/138 (7%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG 146
E+F D N + KI + ++ + Q G G +R N+P GV + G
Sbjct: 177 EVFIADSNNHCVRKILTNGTIFTVAGICGAQGQSSDG---GLAINSRLNYPIGVAISSTG 233
Query: 147 NLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR-DGPSE-DAKFSNDFDVVYVRPTC 202
++++A+ N +RK+ + ++T AG + V GF DG + +A N ++V +
Sbjct: 234 DIFIAERDNSKVRKVSSSTGIISTFAG--NGVFGFMGDGKAAVNASIYNPYNVAF-NSAG 290
Query: 203 SLLVIDRGNAALRQISLN 220
+ D G+ +R++SLN
Sbjct: 291 EAFISDYGSGKIRKVSLN 308
>gi|168701257|ref|ZP_02733534.1| NHL repeat containing protein [Gemmata obscuriglobus UQM 2246]
Length = 358
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/147 (35%), Positives = 76/147 (51%), Gaps = 12/147 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG-KPNEARFN 135
P + V +G L+ V E N + V++ L++ R +GYTG DG K +A F+
Sbjct: 217 PRAVAVGPNGRLYVV-ERNGHCVRVID-LAKGRIERFAGTGKKGYTG--DGTKALDATFD 272
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAG-GKSNVAGFRD-GPSEDAKFS 190
PK + +D GN++V DT N IRKI DA VTTIAG G++ G D GP+ A
Sbjct: 273 GPKEIDIDKDGNVFVVDTENEVIRKI-DAKSGVVTTIAGKGRTKTPGLGDNGPATGATLG 331
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQI 217
V V P +L + D + +R++
Sbjct: 332 RPHGVA-VGPDGALYIGDTNSHRIRKV 357
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 68/152 (44%), Gaps = 7/152 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
PY I + DG L+ VD +N + K+ + S VAG+ DG P N+A
Sbjct: 102 PYGIELDADGNLYIVDRLNFCVRKVDGKTAVIS---TVAGTGGKVGYGGDGGPANKALLV 158
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P G+ +D KG LY+AD +R + ++T+ G V GP +DA N
Sbjct: 159 EPNGLCLDGKGGLYIADVAGHRVRAVDLATGTISTLLGNGKGVTAGDGGPIKDATL-NGP 217
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLNQDDCE 225
V V P L V++R +R I L + E
Sbjct: 218 RAVAVGPNGRLYVVERNGHCVRVIDLAKGRIE 249
Score = 44.7 bits (104), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 3/85 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + + G L+ D N + K+ + VAG+ + G GK EA N
Sbjct: 45 PFDVAFDKAGNLYFSDTFNHLVRKVDAKTGTIT---TVAGNGRKGFGGDGGKATEASLNE 101
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ +D GNLY+ D LN +RK+
Sbjct: 102 PYGIELDADGNLYIVDRLNFCVRKV 126
>gi|111219555|ref|YP_710349.1| serine/threonine-protein kinase [Frankia alni ACN14a]
gi|111147087|emb|CAJ58734.1| putative serine/threonine-protein kinase (partial match) [Frankia
alni ACN14a]
Length = 899
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 75/144 (52%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKP-NEARF 134
P I +++DG L+ VD + I K+TP + +AG+ + GYTG DG P A+
Sbjct: 750 PNDITMTDDGTLYFVDLTSETIQKVTPDGIIST----IAGTGEAGYTG--DGGPARSAKL 803
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
N P D Y+AD N +RKI G +TTIAG + +G GP+ A+F N
Sbjct: 804 NKPSLAIGPDGETFYIADYNNNRVRKIDPNGIITTIAGTGTEGSGGDGGPATAAQFKNPS 863
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
VV V + ++ V D GN +R+I
Sbjct: 864 SVV-VDGSGAVYVADNGNDRVRRI 886
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 56/144 (38%), Gaps = 4/144 (2%)
Query: 74 GVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
G+ PY + + DG L I KITP + AG+ G G A+
Sbjct: 579 GLSPYSLALEPDGSLLVSSLATDRIQKITPTGAADD----FAGTGAGGIAGDGGPATAAQ 634
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ P D GN+Y+ D N IRK+ AGV + G DG A N
Sbjct: 635 LDGPGSTARDKAGNIYIGDAKNNRIRKVTPAGVISTVAGTGTAGYSGDGGPATAAQLNSA 694
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
+ V P S+ D N +R+I
Sbjct: 695 EKVTTGPDGSVYFSDYDNHRIRKI 718
>gi|290989149|ref|XP_002677205.1| predicted protein [Naegleria gruberi]
gi|284090811|gb|EFC44461.1| predicted protein [Naegleria gruberi]
Length = 691
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 72/146 (49%), Gaps = 11/146 (7%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTG-HVDGKPNEAR 133
P I V++ G+++ D N+ I KI S +AG+ + G+ G +D +
Sbjct: 252 CPTGIAVNQRGDVYFSDSGNNVIRKI----STCGIITTIAGTGEKGFNGDQMDAL--DVM 305
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
N P G+ + +G + D N I KI D + TIAG + G++DGP AKF+N
Sbjct: 306 LNGPSGIAISQEGEILFTDIYNNRICKINRDRTLVTIAG--TTEEGYQDGPVRMAKFNNP 363
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
D+ T + V+D GN +R+IS
Sbjct: 364 SDIAIDNETGDVYVVDGGNNYIRKIS 389
>gi|72094692|ref|XP_795849.1| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
purpuratus]
Length = 706
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 66/135 (48%), Gaps = 11/135 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG-YTGHVDGKPNEARFN 135
P K+ + +G L AV + N V I + +V G TG DG EARF+
Sbjct: 212 PGKVTTNPEGTLLAVSDTGHNRVIIV------ALDGVVQHCIGGPETGFNDGLYREARFH 265
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGFRDGPSEDAKFSND 192
P+G+ + +YVADT N AIRKI + VTTIAG G+ V + G + S+
Sbjct: 266 SPQGLCWA-QDVIYVADTENHAIRKIDLKEQCVTTIAGTGEQGVDWYGAGRGTEQVISSP 324
Query: 193 FDVVYVRPTCSLLVI 207
+DVV P +L I
Sbjct: 325 WDVVLGPPNEDVLFI 339
Score = 42.0 bits (97), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 42/99 (42%), Gaps = 2/99 (2%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFR 180
G DGK EA+ HP GV D L+VAD+ N I+ I + T AG G
Sbjct: 444 GDSDGKGLEAKLQHPLGVAWDHSKLLFVADSYNHKIKMIDPEERYCATYAGTGEPGKGGD 503
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
D A+F+ + C + V D N +R I +
Sbjct: 504 DEHILKAQFNEPGGLAISPCGCKIYVADTNNHTIRCIDI 542
>gi|412985326|emb|CCO20351.1| predicted protein [Bathycoccus prasinos]
Length = 897
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 72/150 (48%), Gaps = 14/150 (9%)
Query: 77 PYKIRVSEDG-ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG--KPNEAR 133
P+ + S DG ++F D N I +I + +AG+ G +G DG K +A
Sbjct: 131 PFGVVASPDGRDIFIADTFNHRIRRIDVEFGTVT---TLAGT--GASGSADGLAKRGDAT 185
Query: 134 FNHPKGVTMD-DKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDG-PSEDAKF 189
F P GV + D G +YVADT N IR I G V T+AG S + G+ D A+F
Sbjct: 186 FKFPSGVAVSPDGGFVYVADTKNHKIRVIILGSGVVRTVAG--SGLTGYHDDLVGTKARF 243
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+ DVV +L V D N +R++ +
Sbjct: 244 NQPMDVVMHPDGHTLFVSDAMNNCIRKVDV 273
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 55/108 (50%), Gaps = 17/108 (15%)
Query: 126 DGKPNEARFNHPKGVTMDDKG-NLYVADTLNLAIRKIGD--AGVTTIAGGKSNVAGFRDG 182
DG EARFN P GV G ++++ADT N IR+I VTT+AG + +G DG
Sbjct: 120 DGPSLEARFNKPFGVVASPDGRDIFIADTFNHRIRRIDVEFGTVTTLAG--TGASGSADG 177
Query: 183 PSE--DAKF--------SNDFDVVYVRPTCS--LLVIDRGNAALRQIS 218
++ DA F S D VYV T + + VI G+ +R ++
Sbjct: 178 LAKRGDATFKFPSGVAVSPDGGFVYVADTKNHKIRVIILGSGVVRTVA 225
>gi|406901359|gb|EKD44040.1| NHL repeat containing protein, partial [uncultured bacterium]
Length = 667
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/114 (39%), Positives = 59/114 (51%), Gaps = 5/114 (4%)
Query: 89 FAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN- 147
+ D N I ++T S+ + +L+AGS +G DG A FN P +T+D N
Sbjct: 530 YVADTGNEVIRRVTLTGSKQGQTKLIAGS-PTESGFKDGTKTAAEFNVPIALTIDSADNY 588
Query: 148 LYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 199
LYVAD N AIRK I D VTT+ G S G+ DG EDA + +V Y R
Sbjct: 589 LYVADRDNHAIRKVRISDGKVTTVTGNPS-TPGYLDGRLEDAYLNYPVEVYYNR 641
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 42/93 (45%), Gaps = 6/93 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I G + D N I KI + G + QG G V+G ++F H
Sbjct: 48 PEDIIADGAGNFYVTDTFNGVIRKID------ANGIVSTVVGQGGYGDVNGSATTSKFAH 101
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTI 169
P V +DD GN+Y+AD N I+K VTT+
Sbjct: 102 PSAVAVDDSGNVYIADAGNGKIKKFSGGRVTTL 134
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 48/98 (48%), Gaps = 5/98 (5%)
Query: 127 GKPNEARFNHPKGVTMDDKGNL-YVADTLNLAIRKI----GDAGVTTIAGGKSNVAGFRD 181
G+ N A F+ P G+ +D G YVADT N IR++ G T + G +GF+D
Sbjct: 507 GRKNCAYFSRPGGIVVDGSGKYAYVADTGNEVIRRVTLTGSKQGQTKLIAGSPTESGFKD 566
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G A+F+ + L V DR N A+R++ +
Sbjct: 567 GTKTAAEFNVPIALTIDSADNYLYVADRDNHAIRKVRI 604
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 47/90 (52%), Gaps = 6/90 (6%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD--GPSEDAK 188
+A F+ P+ + D GN YV DT N IRKI G+ + G+ G+ D G + +K
Sbjct: 42 DAYFDFPEDIIADGAGNFYVTDTFNGVIRKIDANGIVSTVVGQ---GGYGDVNGSATTSK 98
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
F++ V V + ++ + D GN +++ S
Sbjct: 99 FAHP-SAVAVDDSGNVYIADAGNGKIKKFS 127
>gi|290999745|ref|XP_002682440.1| predicted protein [Naegleria gruberi]
gi|284096067|gb|EFC49696.1| predicted protein [Naegleria gruberi]
Length = 731
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 74/146 (50%), Gaps = 12/146 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFN 135
P+ I ++ + ELF D N + K+ S +AG+ GY G V EA+
Sbjct: 484 PFGIALNGNDELFISDRSNHRVRKV----SNNGIISTIAGTGSAGYNGDVI-MATEAKLY 538
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDG--PSEDAKFSN 191
P GV++D+KGN+Y+AD N IRKI ++TIAG + AGF D + +++ ++
Sbjct: 539 LPHGVSVDNKGNVYIADKQNHRIRKILASTGMISTIAG--TGQAGFNDDNMSALESRVNS 596
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
+DV + + D N +R+I
Sbjct: 597 PYDVTVDESGQVIYIADTNNHKIRRI 622
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 50/97 (51%), Gaps = 4/97 (4%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P+ + V G ++ D+ N I KI S +AG+ Q + E+R N
Sbjct: 539 LPHGVSVDNKGNVYIADKQNHRIRKILASTGMIST---IAGTGQAGFNDDNMSALESRVN 595
Query: 136 HPKGVTMDDKGN-LYVADTLNLAIRKIGDAGVTTIAG 171
P VT+D+ G +Y+ADT N IR+I + +TTIAG
Sbjct: 596 SPYDVTVDESGQVIYIADTNNHKIRRIQNGNLTTIAG 632
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 68/140 (48%), Gaps = 10/140 (7%)
Query: 84 EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGK-PNEARFNHPKGVT 141
+DGEL D N + +I+ S + +AG+ GY G DG +A+ N+P GV
Sbjct: 379 KDGELIFCDRSNHRVRRISKDGSV----KTIAGNGIGGYNG--DGMLAIDAQLNYPHGVA 432
Query: 142 MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG-PSEDAKFSNDFDVVYVRP 200
D GN+Y++D+ N +R + G + G N +DG + ++ + F + +
Sbjct: 433 SDSIGNIYISDSYNHRVRIVFTNGTISTIAGNGNSGFNKDGIQATSSQLNYPFGIA-LNG 491
Query: 201 TCSLLVIDRGNAALRQISLN 220
L + DR N +R++S N
Sbjct: 492 NDELFISDRSNHRVRKVSNN 511
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 72/151 (47%), Gaps = 15/151 (9%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKP-NEAR 133
+P + ++ E+F D N I K+ S VAG+ GY+G DG P N +
Sbjct: 203 LPTTVALNSLNEVFIADSQNHRIRKV----SNSGIISTVAGTGVSGYSG--DGIPANTTK 256
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGP--SEDAKF 189
N P G+T+D N+ +AD N IR I ++ ++T+AG + G RD + AK
Sbjct: 257 LNTPNGITIDSNDNIIIADRNNHRIRLISNSSGIISTLAG--NGTTGSRDEEVLATSAKL 314
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
S DV + L++ D N +R + LN
Sbjct: 315 SRPADVT-IGYDGELIITDTDNFVIRIVKLN 344
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 45/88 (51%), Gaps = 7/88 (7%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG-KPNEARFNHPKGVTMDD 144
GE F D N + KI L+ + + F GY G DG ++A +P V ++D
Sbjct: 55 GEYFISDTYNHRVRKI---LANGTMTTIAGTGFAGYNG--DGILSSQAHLYYPYDVAVND 109
Query: 145 KGNLYVADTLNLAIRKIGDAG-VTTIAG 171
G +Y+ADT N IRKI G + T+AG
Sbjct: 110 LGEVYIADTYNHRIRKILLNGTIITVAG 137
>gi|288921012|ref|ZP_06415304.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288347587|gb|EFC81872.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 882
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 79/145 (54%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKP-NEARF 134
P + V+ DG L+ + + I KI + ++ VAG+ QGY+G DG P +A+
Sbjct: 736 PNDVEVASDGTLYIANLGSDTIQKINTDGTIHT----VAGTGEQGYSG--DGGPATKAQL 789
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ P V++ G LY+AD N IRK+ G +TTIAG S + GP+ +AKF++
Sbjct: 790 SIPS-VSLGRNGTLYIADYGNNRIRKVDANGTITTIAGTGSEGSSGDGGPATEAKFTDPS 848
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
VV T +L + D GN +R+I+
Sbjct: 849 SVVE-DSTGALYIADSGNNRIRRIA 872
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 70/150 (46%), Gaps = 10/150 (6%)
Query: 74 GVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
G+ PY + + + G L + KI+ S +AG+ G G A+
Sbjct: 565 GLSPYSLSIDDQGALLITSLATDRLQKISATGSVSD----LAGTGSGGFSGDGGPATAAK 620
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFR--DGPSEDAKFS 190
+ P D KGN+Y+ D N IRKI D ++TIAG + VAG+ GP+ A+
Sbjct: 621 LDGPGSAVTDGKGNIYIPDARNYRIRKIAADGTISTIAG--TGVAGYSGDGGPATAAQLK 678
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ + + V P S+ + D N +R+IS +
Sbjct: 679 S-AEKLAVAPDGSIYIADYENHRIRKISTD 707
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 64/144 (44%), Gaps = 9/144 (6%)
Query: 79 KIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFNHP 137
K+ V+ DG ++ D N I KI S +AG+ GY+G G A N P
Sbjct: 682 KLAVAPDGSIYIADYENHRIRKI----STDGIITTIAGTGVDGYSGE-GGPATAATLNGP 736
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
V + G LY+A+ + I+KI D + T+AG GP+ A+ S V
Sbjct: 737 NDVEVASDGTLYIANLGSDTIQKINTDGTIHTVAGTGEQGYSGDGGPATKAQLS--IPSV 794
Query: 197 YVRPTCSLLVIDRGNAALRQISLN 220
+ +L + D GN +R++ N
Sbjct: 795 SLGRNGTLYIADYGNNRIRKVDAN 818
>gi|116622401|ref|YP_824557.1| Ig domain-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116225563|gb|ABJ84272.1| Ig domain protein, group 1 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 2770
Score = 53.1 bits (126), Expect = 3e-04, Method: Composition-based stats.
Identities = 42/152 (27%), Positives = 69/152 (45%), Gaps = 15/152 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE----- 131
P + + +G+L+ D N I K+ G + + G +V G N+
Sbjct: 51 PTDVAIHPNGDLYIADTYNHRIRKVD------KNGVITTVAGTGQATNVGGDANDNILAV 104
Query: 132 -ARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDG-PSEDAK 188
A NHP G+ D GNLY+ADT + IR++ G G+ T G DG P+ AK
Sbjct: 105 SAELNHPSGIAFDTAGNLYIADTGHDRIRRVDGVTGIITTVAGTGERGYSGDGQPATLAK 164
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
++ + + + +L + D GN +R++ N
Sbjct: 165 INSPYHIA-LDGHGNLFIADDGNHRVRRVDGN 195
Score = 43.5 bits (101), Expect = 0.25, Method: Composition-based stats.
Identities = 36/105 (34%), Positives = 51/105 (48%), Gaps = 8/105 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFN 135
PY I + G LF D+ N + ++ + VAG+ GY G D + A
Sbjct: 168 PYHIALDGHGNLFIADDGNHRVRRVDGNSGVITT---VAGTGNAGYNGD-DQQATHADLQ 223
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVAGF 179
+P+GV +D GNLY+AD N +R + GV T AG + V GF
Sbjct: 224 NPRGVLIDASGNLYIADYGNHRVRVVDATGVIHTFAG--TGVYGF 266
Score = 42.0 bits (97), Expect = 0.69, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 39/69 (56%), Gaps = 5/69 (7%)
Query: 114 VAGS-FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAG 171
VAG+ QG++G G AR P V + G+LY+ADT N IRK+ GV TT+AG
Sbjct: 28 VAGTGIQGFSGD-SGPATAARLFDPTDVAIHPNGDLYIADTYNHRIRKVDKNGVITTVAG 86
Query: 172 G--KSNVAG 178
+NV G
Sbjct: 87 TGQATNVGG 95
>gi|294630411|ref|ZP_06708971.1| NHL repeat protein [Streptomyces sp. e14]
gi|292833744|gb|EFF92093.1| NHL repeat protein [Streptomyces sp. e14]
Length = 609
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%), Gaps = 4/82 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNV 176
G G DG +A FN P+G+ + D G + VADT+N A+R+ A +TT+AG G+ +
Sbjct: 223 GARGFADGAAEDAAFNEPQGLALLDDGAVVVADTVNHALRRFDPATGRITTLAGTGRQWM 282
Query: 177 AG-FRDGPSEDAKFSNDFDVVY 197
G +GP+ + S+ +DV +
Sbjct: 283 QGEATEGPAREVNLSSPWDVAW 304
Score = 38.1 bits (87), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
GH DG +A HP GVT G++ V+DT N A+R+ A VTT+A
Sbjct: 394 GHRDGAAGQALLQHPLGVTALPDGSVAVSDTYNHALRRYDPATGEVTTLA 443
>gi|283781091|ref|YP_003371846.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
gi|283439544|gb|ADB17986.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
Length = 782
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 48/97 (49%), Gaps = 8/97 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P KI G+ + + N N + IT G+L+ G G DG EA F+H
Sbjct: 296 PGKILADAAGQRLFISDSNHNRIVIT-----SLDGKLIETIGSGVIGKADGSFAEASFDH 350
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
P+G +D + LYVADT N +RKI VTTIAG
Sbjct: 351 PQGCALDGE-TLYVADTENHLLRKIDLTKKTVTTIAG 386
Score = 39.3 bits (90), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 72/175 (41%), Gaps = 32/175 (18%)
Query: 65 ETVIEGNEIGVVPYK---------IRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLV 114
E +++G I VPY +S DGE LF D S+I + P R +
Sbjct: 462 EDIVDGPLIPKVPYTEGFSSFAQPSGLSSDGEWLFVADSEGSSIRAV--PFDPTKEVRTI 519
Query: 115 AGSFQGYTGH------VDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR----KIGDA 164
G+ + G VDG A+ H V + G +YVADT N I+ K G+
Sbjct: 520 VGTSELPGGRLFDFGDVDGPRERAKLQHALEVVYSE-GKIYVADTYNNKIKLVDAKTGE- 577
Query: 165 GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
V TIAG S G D P A F + V T L V D N +R++ +
Sbjct: 578 -VKTIAG--SGSPGTSDDP---ATFDEPAGLALVGET--LYVADTNNHLIRKVDV 624
>gi|383454552|ref|YP_005368541.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
gi|380728713|gb|AFE04715.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
Length = 861
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 81/204 (39%), Gaps = 42/204 (20%)
Query: 18 LQFQ--AHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGV 75
++FQ A A P GP +T++ K QA V E G
Sbjct: 543 VKFQPPAAATPMGPFAASVTTVAG----QPGKRCQAGAKVAPDELG-------------- 584
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + V DG D + + + +++P + R V G
Sbjct: 585 APLAVAVLPDGGWAVADSLGNTVKRVSPD----GKIRTVLTGLYG--------------- 625
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ D GN+YV+DT N IR+I G + G + G++DGP+ A F+ +
Sbjct: 626 -PMGIAADALGNVYVSDTENAVIRRISPEGKAEVFAGTT--WGYQDGPALSAAFNQPAGL 682
Query: 196 VYVRPTCSLLVIDRGNAALRQISL 219
+ +LLV D N+ +R+I +
Sbjct: 683 SFTPDGTALLVADLNNSVIRRIDM 706
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 56/141 (39%), Gaps = 20/141 (14%)
Query: 40 WTTRSSSKAPQADGNVLQFENGYL--------VETVIEGNEIG----------VVPY-KI 80
W R S+ DG E+G V TV+ G+ G ++PY +
Sbjct: 720 WLYRPSAVVQAVDGTTYVVESGMARVVRVRDGVTTVVAGSTPGFRDGDPKEGQMLPYLGL 779
Query: 81 RVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
+ DG L D N + ++T S +L + G G DG EA+ P G+
Sbjct: 780 ALLPDGSLAISDPGNYRVRRLTFNASGEPE-KLTTLAGSGRYGAEDGTGREAQLVLPAGL 838
Query: 141 TMDDKGNLYVADTLNLAIRKI 161
+ G LYVAD N +R I
Sbjct: 839 ALGPDGTLYVADAGNSLVRAI 859
>gi|255530479|ref|YP_003090851.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
gi|255343463|gb|ACU02789.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
Length = 453
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 1/94 (1%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSE 185
DG +A FN PK + D+ GN++VAD N IR + + T G+ +G++DG
Sbjct: 360 DGPLADALFNDPKEIKFDNSGNMFVADYGNHCIRMVSADNIVTTVAGQPGKSGYKDGGPV 419
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
++ F+ + V V + + D NA +R++ +
Sbjct: 420 ESLFNQPWGVA-VNEQGDIYIADWSNARIRKLVI 452
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 46/85 (54%), Gaps = 5/85 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P +I+ G +F D N I + S + VAG G +G+ DG P E+ FN
Sbjct: 371 PKEIKFDNSGNMFVADYGNHCIRMV----SADNIVTTVAGQ-PGKSGYKDGGPVESLFNQ 425
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P GV ++++G++Y+AD N IRK+
Sbjct: 426 PWGVAVNEQGDIYIADWSNARIRKL 450
>gi|225450563|ref|XP_002277564.1| PREDICTED: NHL repeat-containing protein 2-like [Vitis vinifera]
Length = 1078
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 62/122 (50%), Gaps = 11/122 (9%)
Query: 112 RLVAGSFQGYT------GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
RL+AG ++ G DG +E HP GV+ G +YVAD+ N I+K+ A
Sbjct: 798 RLLAGGDTVFSDNLFRFGDHDGVGSEVLLQHPLGVSCGKDGQIYVADSYNHKIKKLDPAT 857
Query: 166 --VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDD 223
V+T+AG + AGF+DG + A+ S +V V L + D N+ +R + L + +
Sbjct: 858 GRVSTLAG--TGKAGFKDGRALAAQLSEPSGIVEVE-NGVLFIADTNNSVIRYLDLKKKE 914
Query: 224 CE 225
+
Sbjct: 915 AD 916
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 60/125 (48%), Gaps = 8/125 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ + + + N N + +T Y L GS G G DG ++A FN
Sbjct: 593 PGKLAIDVINNRLFISDSNHNRIVVTDLNGNYI---LQIGS-TGEEGLRDGSFDDATFNR 648
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSND- 192
P+G+ + K N LYVADT N A+R+I + V T+AG + + ++ G + N
Sbjct: 649 PQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAGNGTKGSDYQGGGKGATQLLNSP 708
Query: 193 FDVVY 197
+DV +
Sbjct: 709 WDVCF 713
>gi|392404857|ref|YP_006441469.1| NHL repeat containing protein [Turneriella parva DSM 21527]
gi|390612811|gb|AFM13963.1| NHL repeat containing protein [Turneriella parva DSM 21527]
Length = 697
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 55/120 (45%), Gaps = 20/120 (16%)
Query: 82 VSEDG-ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG--YTGHVDGKPNEARFNHPK 138
++ DG L+ D NS I KI + +AG QG TG D N ARFN P
Sbjct: 496 ITTDGTNLYVADSTNSKIRKIVISTRVVTT---IAGPAQGSAATGDTDATGNAARFNKPT 552
Query: 139 GVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
G+T D NL++AD N IRK I VTTIA GPS+ S D D V
Sbjct: 553 GITYDGT-NLFIADGNNNKIRKLVISTGVVTTIA-----------GPSQGTITSGDTDAV 600
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 32/69 (46%), Positives = 35/69 (50%), Gaps = 3/69 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFR 180
G +D ARF KGVT D NLYV DT N IRK I VT +AGG S G
Sbjct: 361 GDLDATGTSARFRQIKGVTTDGT-NLYVVDTGNNNIRKIVISTGAVTKLAGGTSTEFGDA 419
Query: 181 DGPSEDAKF 189
DG A+F
Sbjct: 420 DGTGSTARF 428
Score = 45.8 bits (107), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 73/149 (48%), Gaps = 19/149 (12%)
Query: 82 VSEDG-ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
V+ DG L+ VD N+NI KI +S + +L G+ + G DG + ARF P G+
Sbjct: 378 VTTDGTNLYVVDTGNNNIRKIV--ISTGAVTKLAGGTSTEF-GDADGTGSTARFRQPSGI 434
Query: 141 TMDDKGNLYVADTLNLA-IRK--IGDAGVTTIAGGKSNVA-------GFRDGPSEDAKFS 190
T D NLYV D N A IRK I VTT+ G + + G DG A+F
Sbjct: 435 TTDGT-NLYVID--NQAKIRKIVISTGAVTTLVGPAAGCSATPPCPRGDTDGTGTAARF- 490
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISL 219
N + + T +L V D N+ +R+I +
Sbjct: 491 NVPEGITTDGT-NLYVADSTNSKIRKIVI 518
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 48/98 (48%), Gaps = 12/98 (12%)
Query: 82 VSEDG-ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY--TGHVDGKPNEARFNHPK 138
++ DG LF D N+ I K+ + +AG QG +G D N ARF P
Sbjct: 554 ITYDGTNLFIADGNNNKIRKLVISTGVVTT---IAGPSQGTITSGDTDAVGNAARFYSPV 610
Query: 139 GVTMDDKGNLYVADTL---NLAIRK--IGDAGVTTIAG 171
G+T D+ NL+VAD N IRK I VTTIAG
Sbjct: 611 GITT-DRTNLFVADGTGNRNNKIRKILISTGAVTTIAG 647
>gi|410478931|ref|YP_006766568.1| hypothetical protein LFML04_1390 [Leptospirillum ferriphilum ML-04]
gi|406774183|gb|AFS53608.1| hypothetical protein LFML04_1390 [Leptospirillum ferriphilum ML-04]
Length = 427
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 56/122 (45%), Gaps = 17/122 (13%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA--- 170
+AGSF G +DG EARF P+G+ G +Y+ADT N IRKI G++
Sbjct: 78 IAGSFH-ERGAIDGPGREARFEFPQGILAAPDGTIYIADTGNDMIRKISSNGLSGATPTP 136
Query: 171 -------------GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
G + A +RDG +A+F+N + +L V D N +R+I
Sbjct: 137 AELHHLSVDVEDIAGSDHRARYRDGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKI 196
Query: 218 SL 219
L
Sbjct: 197 DL 198
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 74/158 (46%), Gaps = 18/158 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI--------TPPLSQYSRGRL----VAGSFQGYTGH 124
P I + DG ++ D N I KI TP ++ + +AGS +
Sbjct: 100 PQGILAAPDGTIYIADTGNDMIRKISSNGLSGATPTPAELHHLSVDVEDIAGSDH-RARY 158
Query: 125 VDGKPNEARFNHPKGVTMDDKG-NLYVADTLNLAIRKIGDA--GVTTIAGGKSNVAGFRD 181
DG ARFN+P+G+ + G LYVAD+ N IRKI A V+TIAG +G D
Sbjct: 159 RDGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKIDLATKTVSTIAGHSFPSSG--D 216
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G ++A F + +L V D GN A+R+I L
Sbjct: 217 GVGKEAGFETPRGLAISPDGKTLYVADSGNNAIRKIDL 254
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 74/156 (47%), Gaps = 11/156 (7%)
Query: 70 GNEIGV-VPYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG 127
G E G P + +S DG+ L+ D N+ I KI + + +AG+ + +G DG
Sbjct: 219 GKEAGFETPRGLAISPDGKTLYVADSGNNAIRKIDLATNTVTT---LAGAGKLMSGSADG 275
Query: 128 KPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGPS 184
+A F+ P+ + + G LY+ADT N IRK + V+T+A G G +GP
Sbjct: 276 VGVQATFHEPRSLAISGDGQVLYIADTRNNLIRKMVLATNSVSTLA-GHPGFPGTLNGPG 334
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
DA F + V L V D NA +R + L+
Sbjct: 335 PDAYFYHPVSVTI--DGNKLYVADGANADIRMVDLS 368
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 12/120 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +S DG++ + + +N+++ L+ S L G+ G ++G +A F H
Sbjct: 285 PRSLAISGDGQVLYIADTRNNLIR-KMVLATNSVSTLAG--HPGFPGTLNGPGPDAYFYH 341
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRKI--GDAGVTTIAGGKSN----VAGFRDGPSEDAKF 189
P VT+D GN LYVAD N IR + V+T+AG N +AG DG + F
Sbjct: 342 PVSVTID--GNKLYVADGANADIRMVDLSTGVVSTVAGATLNGGVPIAGREDGSGVEGHF 399
>gi|358461043|ref|ZP_09171215.1| serine/threonine protein kinase [Frankia sp. CN3]
gi|357074242|gb|EHI83734.1| serine/threonine protein kinase [Frankia sp. CN3]
Length = 866
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 79/155 (50%), Gaps = 26/155 (16%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVK-------ITPPLSQYSRGRLVAGSFQ-GYTGHVD 126
+ PY + + +DG L+ V ++++IV +TP +AG+ Q G++G D
Sbjct: 548 LAPYDLSIGDDGSLY-VSNLDTHIVHKIAKDGVVTP----------IAGNAQDGFSG--D 594
Query: 127 GKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGP 183
G P A+ P V D GNLY+ DT N +RKI +G +TT+ G G + +G GP
Sbjct: 595 GGPATAAQLYGPGRVAWDKAGNLYIPDTQNYRVRKIDPSGKITTVVGIGTAGYSG-DGGP 653
Query: 184 SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ A+ N + + V +L + D N +R+++
Sbjct: 654 ATQAQI-NGVEGIAVTADGTLYLADYDNQRIRKVT 687
Score = 42.7 bits (99), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 72/142 (50%), Gaps = 11/142 (7%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFNHPK 138
I V+ DG L+ D N I K+TP + +AG+ + GY+G +A+ + P
Sbjct: 665 IAVTADGTLYLADYDNQRIRKVTPDGIITT----IAGTGEKGYSG-TPTTATQAKLDGPN 719
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVV 196
+++ D G +Y A+ + ++KI AG +TT AG GK+ G GP+ A S V
Sbjct: 720 SISLADDGTIYFANLGSDTVQKIDKAGMLTTFAGNGKTGRTG-DGGPATSATLS--IPDV 776
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
++ ++ + G+ +R+++
Sbjct: 777 FLGHDGTVYICAYGSETIRKVT 798
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 50/94 (53%), Gaps = 13/94 (13%)
Query: 82 VSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKP-NEARFNHP 137
+ DG ++ + I K+T S G + +AG+ +GYTG DG P N A+ + P
Sbjct: 778 LGHDGTVYICAYGSETIRKVT------SDGIITTIAGTGAEGYTG--DGGPANAAQLSDP 829
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIA 170
V +D G +YVAD N IR+I G +TTIA
Sbjct: 830 TSVVVDAGGAIYVADNGNKVIRRIDPNGTITTIA 863
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 49/97 (50%), Gaps = 7/97 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P ++ + G L+ D N + KI P + +V GY+G DG P +A+ N
Sbjct: 606 PGRVAWDKAGNLYIPDTQNYRVRKIDPSGKITT---VVGIGTAGYSG--DGGPATQAQIN 660
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG 171
+G+ + G LY+AD N IRK+ D +TTIAG
Sbjct: 661 GVEGIAVTADGTLYLADYDNQRIRKVTPDGIITTIAG 697
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 64/146 (43%), Gaps = 9/146 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV-DGKPNEARFN 135
P I +++DG ++ + + + KI G L + G TG DG P +
Sbjct: 718 PNSISLADDGTIYFANLGSDTVQKID------KAGMLTTFAGNGKTGRTGDGGPATSATL 771
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
V + G +Y+ + IRK+ D +TTIAG + GP+ A+ S+
Sbjct: 772 SIPDVFLGHDGTVYICAYGSETIRKVTSDGIITTIAGTGAEGYTGDGGPANAAQLSDPTS 831
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLN 220
VV V ++ V D GN +R+I N
Sbjct: 832 VV-VDAGGAIYVADNGNKVIRRIDPN 856
>gi|345851882|ref|ZP_08804843.1| hypothetical protein SZN_18986 [Streptomyces zinciresistens K42]
gi|345636654|gb|EGX58200.1| hypothetical protein SZN_18986 [Streptomyces zinciresistens K42]
Length = 468
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 73/149 (48%), Gaps = 14/149 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P ++ V G ++ D N I K+TP G++ VAG+ G G A+
Sbjct: 60 PREVAVDGAGAVYIADSNNHRIRKVTPD------GKISTVAGTGAGGFRGDGGPATAAQL 113
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFR--DGPSEDAKFSN 191
N P GVT+D G L+VAD N IR+I D ++T+AG + AGFR GP+ A+ +
Sbjct: 114 NLPLGVTVDGAGVLHVADYYNHRIRRITADGVISTVAG--TGAAGFRGDGGPAATAQLNG 171
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISLN 220
V + L + D N +R+++ +
Sbjct: 172 PHGVA-LNAAGDLCIADLQNHRVRKVTAD 199
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 72/149 (48%), Gaps = 14/149 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKPN-EA 132
PY I V G ++ D + +IT + GR+ VAG+ G+ G DG P A
Sbjct: 4 PYGIAVDSTGTVYVADFSGHRVRRIT------TDGRITTVAGTGAAGFRG--DGGPAVAA 55
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
+ N P+ V +D G +Y+AD+ N IRK+ D ++T+AG + GP+ A+ +
Sbjct: 56 QLNGPREVAVDGAGAVYIADSNNHRIRKVTPDGKISTVAGTGAGGFRGDGGPATAAQLNL 115
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISLN 220
V V L V D N +R+I+ +
Sbjct: 116 PLGVT-VDGAGVLHVADYYNHRIRRITAD 143
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 6/88 (6%)
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFR--DGPSEDAKFS 190
N P G+ +D G +YVAD +R+I D +TT+AG + AGFR GP+ A+ +
Sbjct: 1 MNRPYGIAVDSTGTVYVADFSGHRVRRITTDGRITTVAG--TGAAGFRGDGGPAVAAQLN 58
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+V V ++ + D N +R+++
Sbjct: 59 GPREVA-VDGAGAVYIADSNNHRIRKVT 85
Score = 38.5 bits (88), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 42/94 (44%), Gaps = 6/94 (6%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE-ARF 134
+P + V G L D N I +IT + A F+G DG P A+
Sbjct: 115 LPLGVTVDGAGVLHVADYYNHRIRRITADGVISTVAGTGAAGFRG-----DGGPAATAQL 169
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
N P GV ++ G+L +AD N +RK+ GV +
Sbjct: 170 NGPHGVALNAAGDLCIADLQNHRVRKVTADGVIS 203
>gi|393789630|ref|ZP_10377750.1| hypothetical protein HMPREF1068_04030 [Bacteroides nordii
CL02T12C05]
gi|392650346|gb|EIY44015.1| hypothetical protein HMPREF1068_04030 [Bacteroides nordii
CL02T12C05]
Length = 435
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 90/190 (47%), Gaps = 7/190 (3%)
Query: 32 KHLTSLLKWTTRSSSKAPQADGNVLQFE--NGYLVETVIEGNEIGVVPYKIRVSEDGELF 89
KH S+ K+ ++A +G +L+F+ GY + V G G Y + + GE
Sbjct: 250 KHSFSMCKYDGFMYTRA--KNGTLLKFDPKTGY-AQVVATGLMNGSDSYILFSPQQGEEH 306
Query: 90 AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLY 149
+ +N I + +L AG G+ DG A F+ P+ + + + +L+
Sbjct: 307 ILYLAYTNAHCIYTYNLKTGAHKLFAGMVNT-PGYADGPCEYALFDTPRQIILTAENDLF 365
Query: 150 VADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDR 209
+AD+ N IRKI GV + G++ + G +DG E A F+ F V V ++ + D
Sbjct: 366 LADSNNHVIRKITQDGVVSTVIGQAGMTGNQDGTPEVALFNEPFGVA-VDTDGTIYIGDS 424
Query: 210 GNAALRQISL 219
N ++R++++
Sbjct: 425 KNQSIRRLAI 434
>gi|291452573|ref|ZP_06591963.1| NHL repeat containing protein [Streptomyces albus J1074]
gi|291355522|gb|EFE82424.1| NHL repeat containing protein [Streptomyces albus J1074]
Length = 613
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 70/149 (46%), Gaps = 9/149 (6%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRD 181
G VDG EA F P G+ D L+VAD+ A+R + G V T+ G G RD
Sbjct: 333 GLVDGPGAEAWFAQPSGLAADGDARLWVADSETSALRWVDPEGTVHTVVGTGLFDFGHRD 392
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVG 241
G + A F + V V P S+ V D N ALR+ D + ++++ TD+ G
Sbjct: 393 GEAAQALFQHPL-AVTVLPDGSVAVSDTYNHALRRY----DPASGEVSTLA-TDLREPSG 446
Query: 242 AVLVGYVTCMLQQGFGPFFFSRTQQPSES 270
AVLVG +++ +R + P E+
Sbjct: 447 AVLVGEEIVVVESAR--HRLTRLRLPEEA 473
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 50/84 (59%), Gaps = 5/84 (5%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAG--VTTIAG-GKS 174
+G G DG P EARF+ P+G+ + +G+ + VADT+N A+R++ A VTT AG G+
Sbjct: 215 EGERGLADGGPAEARFSEPQGLALTPEGDAVIVADTVNHALRRVELASGVVTTPAGTGRQ 274
Query: 175 NVAGF-RDGPSEDAKFSNDFDVVY 197
G GP+ + S+ +DV +
Sbjct: 275 WWQGSPTSGPAREVDLSSPWDVAW 298
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 41/84 (48%), Gaps = 4/84 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + D L+ D S + + P + ++ V G+ GH DG+ +A F H
Sbjct: 347 PSGLAADGDARLWVADSETSALRWVDPEGTVHT----VVGTGLFDFGHRDGEAAQALFQH 402
Query: 137 PKGVTMDDKGNLYVADTLNLAIRK 160
P VT+ G++ V+DT N A+R+
Sbjct: 403 PLAVTVLPDGSVAVSDTYNHALRR 426
>gi|153806007|ref|ZP_01958675.1| hypothetical protein BACCAC_00252 [Bacteroides caccae ATCC 43185]
gi|149130684|gb|EDM21890.1| IPT/TIG domain protein [Bacteroides caccae ATCC 43185]
Length = 440
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 46/92 (50%), Gaps = 5/92 (5%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ V +DG ++ D VN + KITP + + L G G +DG +ARFN G
Sbjct: 167 VCVDDDGNVYVGDCVNYCVRKITPDGTVTTLAGLA-----GNKGCIDGTGVQARFNGLYG 221
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
+ D +GN+ + D IRKI GVTT G
Sbjct: 222 MDCDAEGNIILTDVFEWKIRKITPEGVTTTLG 253
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 64/136 (47%), Gaps = 22/136 (16%)
Query: 71 NEIGVVPYKIRVSEDGEL---FAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG 127
N + +VP K+ DGEL A E I K T S +AGS G G+ DG
Sbjct: 88 NILAIVPPKLG---DGELTITIANREPVRTIEKFTYSFSAVVT--TLAGSANGEPGYQDG 142
Query: 128 KPNEARF-----------NHPKG-VTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKS 174
+EA F + KG V +DD GN+YV D +N +RKI D VTT+AG
Sbjct: 143 VGSEALFFFDAAKAEPAEDWKKGSVCVDDDGNVYVGDCVNYCVRKITPDGTVTTLAGLAG 202
Query: 175 NVAGFRDGPSEDAKFS 190
N G DG A+F+
Sbjct: 203 N-KGCIDGTGVQARFN 217
Score = 45.8 bits (107), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 72/148 (48%), Gaps = 18/148 (12%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I V ++G L+A D++ + IVK + L+ +G +G+++G +A F P
Sbjct: 298 IVVDKEGNLYAADQILNGIVKFKA--GTWEAENLIG---KGTSGYLNGSFEDALFTFPSD 352
Query: 140 VTMDDKGNLYVA--------DTLNLAIR--KIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
+ +D G++YVA + L+ +IR + + V +AGG AG+ D + A F
Sbjct: 353 LAIDSNGDIYVAGNGAWDGGENLDQSIRLLDMTNRVVRLVAGGTQ--AGYVDANAGSAAF 410
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQI 217
S D+ V + V D+ N +R+I
Sbjct: 411 SGPQDLA-VDKNGVIYVYDKKNNVIRKI 437
>gi|424869138|ref|ZP_18292858.1| uncharacterized protein [Leptospirillum sp. Group II 'C75']
gi|124514239|gb|EAY55753.1| conserved protein of unknown function [Leptospirillum rubarum]
gi|387220840|gb|EIJ75456.1| uncharacterized protein [Leptospirillum sp. Group II 'C75']
Length = 389
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 56/122 (45%), Gaps = 17/122 (13%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA--- 170
+AGSF G +DG EARF P+G+ G +Y+ADT N IRKI G++
Sbjct: 40 IAGSFH-ERGAIDGPGREARFEFPQGILAAPDGTIYIADTGNDMIRKISSNGLSGATPTP 98
Query: 171 -------------GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
G + A +RDG +A+F+N + +L V D N +R+I
Sbjct: 99 AELHHLSVDVEDIAGSDHRARYRDGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKI 158
Query: 218 SL 219
L
Sbjct: 159 DL 160
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 74/158 (46%), Gaps = 18/158 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI--------TPPLSQYSRGRL----VAGSFQGYTGH 124
P I + DG ++ D N I KI TP ++ + +AGS +
Sbjct: 62 PQGILAAPDGTIYIADTGNDMIRKISSNGLSGATPTPAELHHLSVDVEDIAGSDH-RARY 120
Query: 125 VDGKPNEARFNHPKGVTMDDKG-NLYVADTLNLAIRKIGDA--GVTTIAGGKSNVAGFRD 181
DG ARFN+P+G+ + G LYVAD+ N IRKI A V+TIAG +G D
Sbjct: 121 RDGVGANARFNNPEGMAISPDGKTLYVADSRNNMIRKIDLATKTVSTIAGHSFPSSG--D 178
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G ++A F + +L V D GN A+R+I L
Sbjct: 179 GVGKEAGFETPRGLAISPDGKTLYVADSGNNAIRKIDL 216
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 74/156 (47%), Gaps = 11/156 (7%)
Query: 70 GNEIGV-VPYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG 127
G E G P + +S DG+ L+ D N+ I KI + + +AG+ + +G DG
Sbjct: 181 GKEAGFETPRGLAISPDGKTLYVADSGNNAIRKIDLATNTVTT---LAGAGKLMSGSADG 237
Query: 128 KPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGPS 184
+A F+ P+ + + G LY+ADT N IRK + V+T+A G G +GP
Sbjct: 238 VGVQATFHEPRSLAISGDGQVLYIADTRNNLIRKMVLATNSVSTLA-GHPGFPGTLNGPG 296
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
DA F + V L V D NA +R + L+
Sbjct: 297 PDAYFYHPVSVTI--DGNKLYVADGANADIRMVDLS 330
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 12/120 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +S DG++ + + +N+++ L+ S L G+ G ++G +A F H
Sbjct: 247 PRSLAISGDGQVLYIADTRNNLIR-KMVLATNSVSTLAG--HPGFPGTLNGPGPDAYFYH 303
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRKI--GDAGVTTIAGGKSN----VAGFRDGPSEDAKF 189
P VT+D GN LYVAD N IR + V+T+AG N +AG DG + F
Sbjct: 304 PVSVTID--GNKLYVADGANADIRMVDLSTGVVSTVAGATLNGGVPIAGREDGSGVEGHF 361
>gi|195119117|ref|XP_002004078.1| GI18254 [Drosophila mojavensis]
gi|193914653|gb|EDW13520.1| GI18254 [Drosophila mojavensis]
Length = 734
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 54/101 (53%), Gaps = 11/101 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP-LSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P KI + ED A D N+ I+ +T + +Y G L AG VDG N +RFN
Sbjct: 227 PAKIAMLEDQYAIA-DAGNNRIIVVTNCGVVKYKIGGLEAG-------FVDGSLNMSRFN 278
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKS 174
+P+GV DK L VADT N A+R+I + V T+AG S
Sbjct: 279 NPQGVAFLDKNTLIVADTDNHALRQISLNNGVVETLAGTGS 319
>gi|423219560|ref|ZP_17206056.1| hypothetical protein HMPREF1061_02829 [Bacteroides caccae
CL03T12C61]
gi|392624765|gb|EIY18843.1| hypothetical protein HMPREF1061_02829 [Bacteroides caccae
CL03T12C61]
Length = 436
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 46/92 (50%), Gaps = 5/92 (5%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ V +DG ++ D VN + KITP + + L G G +DG +ARFN G
Sbjct: 163 VCVDDDGNVYVGDCVNYCVRKITPDGTVTTLAGLA-----GNKGCIDGTGVQARFNGLYG 217
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
+ D +GN+ + D IRKI GVTT G
Sbjct: 218 MDCDAEGNIILTDVFEWKIRKITPEGVTTTLG 249
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 66/136 (48%), Gaps = 22/136 (16%)
Query: 71 NEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV---AGSFQGYTGHVDG 127
N + +VP K+ DGEL + N V+ + YS +V AGS G G+ DG
Sbjct: 84 NILAIVPPKLG---DGEL-TITIANREPVRTIDKFT-YSFSAVVTTLAGSANGEPGYQDG 138
Query: 128 KPNEARF-----------NHPKG-VTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKS 174
+EA F + KG V +DD GN+YV D +N +RKI D VTT+AG
Sbjct: 139 VGSEALFFFDAAKAEPAEDWKKGSVCVDDDGNVYVGDCVNYCVRKITPDGTVTTLAGLAG 198
Query: 175 NVAGFRDGPSEDAKFS 190
N G DG A+F+
Sbjct: 199 N-KGCIDGTGVQARFN 213
Score = 45.8 bits (107), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 72/148 (48%), Gaps = 18/148 (12%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I V ++G L+A D++ + IVK + L+ +G +G+++G +A F P
Sbjct: 294 IVVDKEGNLYAADQILNGIVKFKA--GTWEAENLIG---KGTSGYLNGSFEDALFTFPSD 348
Query: 140 VTMDDKGNLYVA--------DTLNLAIR--KIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
+ +D G++YVA + L+ +IR + + V +AGG AG+ D + A F
Sbjct: 349 LAIDSNGDIYVAGNGAWDGGENLDQSIRLLDMTNRVVRLVAGGTQ--AGYVDANAGSAAF 406
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQI 217
S D+ V + V D+ N +R+I
Sbjct: 407 SGPQDLA-VDKNGVIYVYDKKNNVIRKI 433
>gi|195387367|ref|XP_002052367.1| GJ17513 [Drosophila virilis]
gi|194148824|gb|EDW64522.1| GJ17513 [Drosophila virilis]
Length = 736
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/97 (40%), Positives = 48/97 (49%), Gaps = 9/97 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P KI + D A N +V T + +Y G L AG VDG N ARFN+
Sbjct: 227 PAKIAMFNDYYAIADAGNNRVLVVTTSGVVEYKVGGLEAG-------FVDGNLNTARFNN 279
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
P+GV DK L VADT N A+R+I V T+AG
Sbjct: 280 PQGVAFLDKDTLIVADTDNHALRQISLKSGAVETLAG 316
>gi|408828763|ref|ZP_11213653.1| hypothetical protein SsomD4_16351 [Streptomyces somaliensis DSM
40738]
Length = 614
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 60/131 (45%), Gaps = 17/131 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP----LSQYSRGRLVAGSFQGYTGHVDGKPNEA 132
P K E G L D +V++ P + +Y GR G VDG A
Sbjct: 189 PGKALALESGNLLVSDTTRHRLVEVAPDGETVVRRYGDGR---------RGLVDGPAESA 239
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGF-RDGPSEDAK 188
RF+ P+G+ G + VADT+N A+R + V+T+AG G + G GP+ +
Sbjct: 240 RFSEPQGMCALPDGRIVVADTVNHALRALDPETGAVSTLAGTGAQWMQGSPTSGPAREVA 299
Query: 189 FSNDFDVVYVR 199
S+ +DV + R
Sbjct: 300 LSSPWDVAWWR 310
Score = 41.6 bits (96), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
GH DG ++A F HP GVT G++ V DT N A+R+ A VTT+A
Sbjct: 398 GHRDGAADQALFQHPLGVTALPDGSVAVCDTYNHALRRYDPAAGEVTTLA 447
>gi|296089788|emb|CBI39607.3| unnamed protein product [Vitis vinifera]
Length = 1002
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 62/122 (50%), Gaps = 11/122 (9%)
Query: 112 RLVAGSFQGYT------GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
RL+AG ++ G DG +E HP GV+ G +YVAD+ N I+K+ A
Sbjct: 722 RLLAGGDTVFSDNLFRFGDHDGVGSEVLLQHPLGVSCGKDGQIYVADSYNHKIKKLDPAT 781
Query: 166 --VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDD 223
V+T+AG + AGF+DG + A+ S +V V L + D N+ +R + L + +
Sbjct: 782 GRVSTLAG--TGKAGFKDGRALAAQLSEPSGIVEVENGV-LFIADTNNSVIRYLDLKKKE 838
Query: 224 CE 225
+
Sbjct: 839 AD 840
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 60/125 (48%), Gaps = 8/125 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ + + + N N + +T Y L GS G G DG ++A FN
Sbjct: 517 PGKLAIDVINNRLFISDSNHNRIVVTDLNGNYI---LQIGS-TGEEGLRDGSFDDATFNR 572
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSND- 192
P+G+ + K N LYVADT N A+R+I + V T+AG + + ++ G + N
Sbjct: 573 PQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAGNGTKGSDYQGGGKGATQLLNSP 632
Query: 193 FDVVY 197
+DV +
Sbjct: 633 WDVCF 637
>gi|29349087|ref|NP_812590.1| hypothetical protein BT_3679 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383124299|ref|ZP_09944964.1| hypothetical protein BSIG_3673 [Bacteroides sp. 1_1_6]
gi|29340994|gb|AAO78784.1| conserved hypothetical protein, with a conserved domain
[Bacteroides thetaiotaomicron VPI-5482]
gi|251839204|gb|EES67288.1| hypothetical protein BSIG_3673 [Bacteroides sp. 1_1_6]
Length = 430
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 11/125 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP-LSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
PY I D + D+ S++ KITP ++ G + Q DG EA F
Sbjct: 291 PYLIYYFVDSNFYMSDQNLSSVYKITPDGECEWFCGSATQKTVQ------DGLREEALFA 344
Query: 136 HPKGVTMDDKGNLYVADTL-NLAIRK--IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
P G+T+D+ GN Y+ D +RK I D V+T+A G+ +VA DG +A F+
Sbjct: 345 QPNGMTVDEDGNFYIVDGFKGYCLRKLDILDGYVSTVA-GQVDVASQIDGTPLEATFNYP 403
Query: 193 FDVVY 197
+D+ Y
Sbjct: 404 YDICY 408
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 38/84 (45%), Gaps = 3/84 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V EDG + VD ++ L Y VAG +DG P EA FN+
Sbjct: 346 PNGMTVDEDGNFYIVDGFKGYCLRKLDILDGYVS--TVAGQVD-VASQIDGTPLEATFNY 402
Query: 137 PKGVTMDDKGNLYVADTLNLAIRK 160
P + D +G ++A+ AIRK
Sbjct: 403 PYDICYDGEGGYWIAEAWGKAIRK 426
>gi|260904571|ref|ZP_05912893.1| NHL repeat-containing protein [Brevibacterium linens BL2]
Length = 647
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 66/134 (49%), Gaps = 6/134 (4%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTI 169
RL++G+ G DG+ A F G+ + G++++AD+ AIR++ V+T+
Sbjct: 352 RLLSGTMN--EGLADGEAEAAWFAQTSGLDLSSDGDVFIADSETSAIRRLDPSTGAVSTL 409
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYN 229
G GFRDGP+ +A+ + V + P S+ + D N A+R+ ++
Sbjct: 410 IGEGLFDFGFRDGPAAEARLQHPLGVRSL-PDGSIAIADTYNGAIRRYDFTTNEVSTLAR 468
Query: 230 SI-SPTDILMVVGA 242
+ P+DI ++ A
Sbjct: 469 GLREPSDIFVLEAA 482
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 55/115 (47%), Gaps = 23/115 (20%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTM--DDKG-----NLYVADTLNLAIRKIG--DAGVTTIA 170
G G DG A F+ P G+T+ DD +L VADT+N +R I VTT+A
Sbjct: 224 GERGANDGDFTSASFSEPGGITVLPDDVAAKAGYHLVVADTVNHTLRGINLDTETVTTVA 283
Query: 171 GGKS--------NVAGFR------DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGN 211
G S NV G DGP+ D K S+ +DV+Y+ T ++V GN
Sbjct: 284 GTGSQHMVGAIDNVVGTHGELGRYDGPALDVKLSSPWDVLYIPATAEVVVAMAGN 338
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 3/81 (3%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ +S DG++F D S I ++ P S L+ + G DG EAR HP G
Sbjct: 378 LDLSSDGDVFIADSETSAIRRLDPSTGAVST--LIGEGLFDF-GFRDGPAAEARLQHPLG 434
Query: 140 VTMDDKGNLYVADTLNLAIRK 160
V G++ +ADT N AIR+
Sbjct: 435 VRSLPDGSIAIADTYNGAIRR 455
>gi|147818279|emb|CAN64724.1| hypothetical protein VITISV_026725 [Vitis vinifera]
Length = 423
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 62/122 (50%), Gaps = 11/122 (9%)
Query: 112 RLVAGSFQGYT------GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
RL+AG ++ G DG +E HP GV+ G +YVAD+ N I+K+ A
Sbjct: 143 RLLAGGDTVFSDNLFRFGDHDGVGSEVLLQHPLGVSCGKDGQIYVADSYNHKIKKLDPAT 202
Query: 166 --VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDD 223
V+T+AG + AGF+DG + A+ S +V V L + D N+ +R + L + +
Sbjct: 203 GRVSTLAG--TGKAGFKDGRALAAQLSEPSGIVEVE-NGVLFIADTNNSVIRYLDLKKKE 259
Query: 224 CE 225
+
Sbjct: 260 AD 261
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 5/83 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +DG+++ D N I K+ P + GR+ + G G DG+ A+ +
Sbjct: 174 PLGVSCGKDGQIYVADSYNHKIKKLDP-----ATGRVSTLAGTGKAGFKDGRALAAQLSE 228
Query: 137 PKGVTMDDKGNLYVADTLNLAIR 159
P G+ + G L++ADT N IR
Sbjct: 229 PSGIVEVENGVLFIADTNNSVIR 251
Score = 38.9 bits (89), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 34/55 (61%), Gaps = 3/55 (5%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
G G DG ++A FN P+G+ + K N LYVADT N A+R+I + V T+AG
Sbjct: 59 GEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVQTLAG 113
>gi|288923318|ref|ZP_06417452.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288345334|gb|EFC79729.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 849
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 67/147 (45%), Gaps = 10/147 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKP-NEARF 134
P + + DG+L D N I ++ VAG+ + ++G DG P A
Sbjct: 598 PSAVTLDRDGDLLIADTFNQRIRRV----DAAGIITTVAGNGEHAFSG--DGGPATAAAL 651
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKFSNDF 193
+P GV +D G +++ADT N IR++G G+ T G+ F D GP+ A +
Sbjct: 652 WYPGGVAVDGNGTVFIADTANNRIRRVGSDGIITTLAGQDGEGSFGDGGPASKALLAFPL 711
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
V R L + D N +R+I L+
Sbjct: 712 AVALDR-FGRLYIADTSNNRIRRIGLD 737
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 48/220 (21%), Positives = 88/220 (40%), Gaps = 19/220 (8%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P+ + G ++ D N+ + +I+P + + VAG+ G G +A+
Sbjct: 541 APFGLVTDRAGNVYVADSGNNRVRRISPDGTIVT----VAGTGVKGFGGDGGPAVDAQLY 596
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P VT+D G+L +ADT N IR++ AG +TT+AG + GP+ A
Sbjct: 597 QPSAVTLDRDGDLLIADTFNQRIRRVDAAGIITTVAGNGEHAFSGDGGPATAAALWYPGG 656
Query: 195 VVYVRPTCSLLVIDRGNAALRQI-------SLNQDDCEYQYNSISPTDILMVVGAVLVGY 247
V V ++ + D N +R++ +L D E + P L+ +
Sbjct: 657 VA-VDGNGTVFIADTANNRIRRVGSDGIITTLAGQDGEGSFGDGGPAS------KALLAF 709
Query: 248 VTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKP 287
+ FG + + T + I+ ++ +P
Sbjct: 710 PLAVALDRFGRLYIADTSNNRIRRIGLDGRIETVAGNGRP 749
Score = 45.8 bits (107), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 64/135 (47%), Gaps = 11/135 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKP-NEARF 134
P + + G L+ D N+ I +I R VAG+ + G +G DG P +A
Sbjct: 710 PLAVALDRFGRLYIADTSNNRIRRI----GLDGRIETVAGNGRPGLSG--DGGPATKATL 763
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P+GVT+D G +Y+ D N +R++ AGV G + G +G + A S D
Sbjct: 764 RSPRGVTVDAAGTIYITDRTNRRVRRVDAAGVIMTVAGTAR-PGRVEG-VDPAALSPDGQ 821
Query: 195 VVYVRPTCSLLVIDR 209
V + P+ LLV DR
Sbjct: 822 VA-LDPSGDLLVSDR 835
Score = 45.8 bits (107), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 71/145 (48%), Gaps = 12/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI-TPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARF 134
P + V +G +F D N+ I ++ + + G+ GSF DG P ++A
Sbjct: 654 PGGVAVDGNGTVFIADTANNRIRRVGSDGIITTLAGQDGEGSFG------DGGPASKALL 707
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG-GKSNVAGFRDGPSEDAKFSND 192
P V +D G LY+ADT N IR+IG D + T+AG G+ ++G GP+ A +
Sbjct: 708 AFPLAVALDRFGRLYIADTSNNRIRRIGLDGRIETVAGNGRPGLSG-DGGPATKATLRSP 766
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
V V ++ + DR N +R++
Sbjct: 767 RGVT-VDAAGTIYITDRTNRRVRRV 790
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 43/87 (49%), Gaps = 2/87 (2%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
A N P G+ D GN+YVAD+ N +R+I D + T+AG G GP+ DA+
Sbjct: 537 ASVNAPFGLVTDRAGNVYVADSGNNRVRRISPDGTIVTVAGTGVKGFGGDGGPAVDAQLY 596
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQI 217
V R LL+ D N +R++
Sbjct: 597 QPSAVTLDR-DGDLLIADTFNQRIRRV 622
>gi|302768699|ref|XP_002967769.1| hypothetical protein SELMODRAFT_409048 [Selaginella moellendorffii]
gi|300164507|gb|EFJ31116.1| hypothetical protein SELMODRAFT_409048 [Selaginella moellendorffii]
Length = 199
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 38/56 (67%), Gaps = 4/56 (7%)
Query: 166 VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC-SLLVIDRGNAALRQISLN 220
VTTIAGG S GF DGP + A+FS++F + +C SLL+ DRGN +R+I ++
Sbjct: 68 VTTIAGGSSRKPGFADGPGDTARFSSEFSLAC---SCGSLLIADRGNRLIREIQID 120
>gi|356571070|ref|XP_003553704.1| PREDICTED: NHL repeat-containing protein 2-like [Glycine max]
Length = 1098
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 55/107 (51%), Gaps = 5/107 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR 180
G DG +E HP GV + G +Y+AD+ N I+K+ V+TIAG + AGF+
Sbjct: 834 GDQDGIGSEVLLQHPLGVVCGNDGEIYIADSYNHKIKKLDPTSKRVSTIAG--TGKAGFK 891
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQ 227
DG + A+ S +V L + D N+ +R + LN ++ E +
Sbjct: 892 DGTAVKAQLSEPSGIVEGNKG-RLFIADTNNSLIRYLDLNINETELR 937
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 62/123 (50%), Gaps = 19/123 (15%)
Query: 89 FAVDEVNSNIVKITPPLSQYSRGRLVA----GSF------QGYTGHVDGKPNEARFNHPK 138
A+D +N+ + +S + R+V G+F G G DG ++A FN P+
Sbjct: 615 LAIDVLNNRLF-----ISDSNHNRIVVTDLDGNFIVQIGSSGEEGLQDGSFDDATFNRPQ 669
Query: 139 GVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSND-FD 194
G+ + K N LYVADT N A+R+I + V T+AG + + + G D++ N +D
Sbjct: 670 GLAYNAKKNILYVADTENHALREIDFVNEKVRTLAGNGTKGSDYVGGGKGDSQLLNSPWD 729
Query: 195 VVY 197
V +
Sbjct: 730 VCF 732
>gi|312200910|ref|YP_004020971.1| serine/threonine protein kinase [Frankia sp. EuI1c]
gi|311232246|gb|ADP85101.1| serine/threonine protein kinase [Frankia sp. EuI1c]
Length = 804
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 75/156 (48%), Gaps = 22/156 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF--QGYTGHVDGKPN-EAR 133
P + V+ DG ++ D+ I +ITP + R VAG+ G TG DG P A
Sbjct: 650 PSGLAVTADGSIYVADDYLDTIREITPDGTI----RTVAGTSGKDGETG--DGGPAARAL 703
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSE-DAKFSN 191
+ P G+ + G+LY+ DT N IR+I GV TT AG S V+ DG S +A +
Sbjct: 704 LSGPTGLALGPDGSLYITDTHNAKIRRIDPKGVMTTFAGTDSGVSTGLDGTSAGEAALTG 763
Query: 192 DFDVVYVRPTCS----------LLVIDRGNAALRQI 217
+ VV V P+ + LL ID N +R +
Sbjct: 764 LYGVV-VDPSGAVYATLNQVGTLLRIDPANHIVRTL 798
>gi|145340877|ref|XP_001415544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575767|gb|ABO93836.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1675
Score = 52.0 bits (123), Expect = 6e-04, Method: Composition-based stats.
Identities = 61/182 (33%), Positives = 89/182 (48%), Gaps = 24/182 (13%)
Query: 54 NVLQFENGYLVETVIEGNEIGVVPYKIRVSE-DGELFAVDEVNSNIVKITPPLSQYSRGR 112
N ++ +G V T+I G + P + E DG ++ + +++ KI Y G
Sbjct: 162 NAIRRVSGDDVSTLITG--LAFAPTAMVFDESDGAIYFLGSSQNHVSKIA-----YGSGT 214
Query: 113 L--VAGSFQGYTGHVDG-KPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKI--GDAGV 166
+ AGS QG +G VD P+ ARFN P+G+ +D LYVADT N A+R I V
Sbjct: 215 VSTYAGS-QGTSGLVDNVTPSSARFNAPEGLALDGANRILYVADTGNHAVRAIDLSTGAV 273
Query: 167 TTIAG-GKSNVAGF---RDGP-SEDAKFSNDFDVVY----VRPTCSLLVIDRGNAALRQI 217
TT+ G G + +G DG S A+F+ + Y + LLV DRG +R+I
Sbjct: 274 TTVLGDGTISASGAVLDSDGILSTPARFNYPAGIAYNLDSSLSSGVLLVTDRGTHQIRKI 333
Query: 218 SL 219
L
Sbjct: 334 IL 335
Score = 44.7 bits (104), Expect = 0.12, Method: Composition-based stats.
Identities = 34/102 (33%), Positives = 49/102 (48%), Gaps = 15/102 (14%)
Query: 132 ARFNHPKGVTMD-----DKGNLYVADTLNLAIRKI--------GDAGVTTIAGGKSNVAG 178
ARFN+P G+ + G L V D IRKI A V T+AG + +G
Sbjct: 299 ARFNYPAGIAYNLDSSLSSGVLLVTDRGTHQIRKIILTDATASNAATVVTVAGSYTGTSG 358
Query: 179 FRDGPSEDAKFSND-FDVVYVRPTCSLLVIDRGNAALRQISL 219
FRD A N + YV + + +V+D+ N A+R++SL
Sbjct: 359 FRDSTVGSAALLNKPVSISYVSGS-TYVVVDQDNHAIRKVSL 399
>gi|290976760|ref|XP_002671107.1| predicted protein [Naegleria gruberi]
gi|284084673|gb|EFC38363.1| predicted protein [Naegleria gruberi]
Length = 1349
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 64/126 (50%), Gaps = 16/126 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKP-NEA 132
P I + +G+L+ + N I+K YS +L VAG+ + GY+G DG P A
Sbjct: 527 PNGIMYTPNGDLYIASQGNHKIMK-------YSNSQLSTVAGNGYAGYSG--DGGPARNA 577
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKI---GDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
+ N+P VT+ G +Y+AD N A+RK+ G ++GG S++ G P+ D +
Sbjct: 578 KLNYPADVTVSSTGEVYIADWGNRAVRKVFRNGTIITLIVSGGLSDLNGVTLTPNGDLLY 637
Query: 190 SNDFDV 195
S V
Sbjct: 638 STQLWV 643
Score = 45.1 bits (105), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 48/92 (52%), Gaps = 5/92 (5%)
Query: 128 KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSE 185
PN N P G+ G+LY+A N I K ++ ++T+AG + AG+ GP+
Sbjct: 518 NPNNDGINGPNGIMYTPNGDLYIASQGNHKIMKYSNSQLSTVAG--NGYAGYSGDGGPAR 575
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+AK + DV V T + + D GN A+R++
Sbjct: 576 NAKLNYPADVT-VSSTGEVYIADWGNRAVRKV 606
>gi|194877446|ref|XP_001973882.1| GG16549 [Drosophila erecta]
gi|190657069|gb|EDV54282.1| GG16549 [Drosophila erecta]
Length = 717
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 63/130 (48%), Gaps = 17/130 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT-GHVDGKPNEARFN 135
P KI S +G D N+ ++ +T G +V GY G VDG ARFN
Sbjct: 228 PAKIVRSPNGRYAIADTGNNRVLVLTG-------GGVVQHKIGGYQPGFVDGNSTAARFN 280
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG----GKSNVAGFRDGPSEDAKF 189
+P+G+ ++ L VADT N AIR+I + V T+AG G + G R GP +
Sbjct: 281 NPQGIAFLNENTLIVADTKNHAIRQISLTNGMVETLAGTGHQGNERIGG-RLGPLQ--PL 337
Query: 190 SNDFDVVYVR 199
S+ +DV R
Sbjct: 338 SSPWDVAIFR 347
>gi|302551068|ref|ZP_07303410.1| NHL repeat-containing protein [Streptomyces viridochromogenes DSM
40736]
gi|302468686|gb|EFL31779.1| NHL repeat-containing protein [Streptomyces viridochromogenes DSM
40736]
Length = 498
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 84/165 (50%), Gaps = 20/165 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNE-ARF 134
P K+ V G LF D N + ++ Q + VAG+ G+ G D +P +R
Sbjct: 42 PRKVAVDSSGNLFFSDYSNHRVRRVDAATQQIT---TVAGNGTAGFNG--DNQPAVYSRL 96
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFR--DGPSEDAKF 189
++P+GV +D GNL++AD N +R++ DA +TT+AG + +AGF + P+ +A+
Sbjct: 97 HYPRGVAVDAGGNLFIADCNNHRVRRV-DAATRQITTVAG--TGIAGFNGDNQPAVNAQL 153
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPT 234
++ V V +L + D N +R++ D Q +++ T
Sbjct: 154 NSPIGVA-VDAGGNLFITDFNNQRVRRV----DAATRQITTVAGT 193
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 75/169 (44%), Gaps = 16/169 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPN-EARF 134
P + G LF D N + ++ Q + VAG+ G+ G D +P A+
Sbjct: 213 PVGVAADAGGNLFIGDYANHRVRRVDAATRQIT---TVAGTGIAGFNG--DNQPAVNAQL 267
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFR--DGPSEDAKF 189
N P V +D GN+++ADT N +R++ DA +TT+AG AGF GP+ +
Sbjct: 268 NAPHTVAVDSGGNVFIADTSNHRVRRV-DAATRQITTVAG--IGTAGFNGDGGPAIGVQL 324
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILM 238
++ V V L + D N +R++S + P D+ +
Sbjct: 325 NSPVGVA-VDSGGGLFIADASNYRVRKVSDVASTARFSVAPGGPPDVRL 372
Score = 45.4 bits (106), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 76/326 (23%), Positives = 131/326 (40%), Gaps = 48/326 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPN-EARF 134
P + V G LF D N + ++ Q + VAG+ G+ G D +P A+
Sbjct: 99 PRGVAVDAGGNLFIADCNNHRVRRVDAATRQIT---TVAGTGIAGFNG--DNQPAVNAQL 153
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRDGPSEDAKFSN 191
N P GV +D GNL++ D N +R++ DA +TT+AG + AGF +G ++ A +
Sbjct: 154 NSPIGVAVDAGGNLFITDFNNQRVRRV-DAATRQITTVAG--TGTAGF-NGDNQSAVNAQ 209
Query: 192 DFDVVYVRPTC--SLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILM-------VVGA 242
+ V V +L + D N +R++ D Q +++ T I V A
Sbjct: 210 LYGPVGVAADAGGNLFIGDYANHRVRRV----DAATRQITTVAGTGIAGFNGDNQPAVNA 265
Query: 243 VLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEE----P 298
L T + G G F + T ++ + + I V +
Sbjct: 266 QLNAPHTVAVDSG-GNVFIADT---------SNHRVRRVDAATRQITTVAGIGTAGFNGD 315
Query: 299 GWPSFGQLIIDLSKLALEAMGSMFL-NFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPV 357
G P+ G + +A+++ G +F+ + +R R ++ + T R PP
Sbjct: 316 GGPAIGVQLNSPVGVAVDSGGGLFIADASNYRVRK------VSDVASTARFSVAPGGPPD 369
Query: 358 VQRQKSAVPLSETHQVRTPGTGDKYP 383
V+ +++ QVR G P
Sbjct: 370 VRLERAGNTGYPGVQVRAEDDGTIAP 395
>gi|219667746|ref|YP_002458181.1| hypothetical protein Dhaf_1699 [Desulfitobacterium hafniense DCB-2]
gi|219538006|gb|ACL19745.1| protein of unknown function DUF1533 [Desulfitobacterium hafniense
DCB-2]
Length = 1916
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 57/124 (45%), Gaps = 15/124 (12%)
Query: 67 VIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD 126
+I G IG++P + +E+ + +N + VAG+ GY+G D
Sbjct: 234 IINGARIGIIPARDYGAEEALYGIATAMEAN------------KLYTVAGTSTGYSG--D 279
Query: 127 GKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSE 185
G P A+ P + D GN+Y++D+ N IRK+ G+ T GK GP+E
Sbjct: 280 GGPATAAQLKTPNALAFDGAGNMYISDSGNRRIRKVDPQGIITTIAGKETNGDPTPGPAE 339
Query: 186 DAKF 189
D+K
Sbjct: 340 DSKI 343
>gi|158315874|ref|YP_001508382.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158111279|gb|ABW13476.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 772
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 76/160 (47%), Gaps = 29/160 (18%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGR-----LVAGSFQ-GYTGHV----- 125
P + V+ DG L+ D N + +I GR +AG + G+ G V
Sbjct: 468 PTSVAVAPDGTLYVADTRNHRVRRI---------GRDGIITTIAGQDEFGFAGEVSEDGL 518
Query: 126 ----DGKPN-EARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGF 179
DG P A+ N+P V M+ G+L +AD N +R+IG D +TTIAG + G
Sbjct: 519 AYSGDGLPAVNAKLNYPNTVLMETDGSLLIADGENNRVRRIGLDGIITTIAGTGAEGFGG 578
Query: 180 RDGPSEDAKFSNDFDVVYVR-PTCSLLVIDRGNAALRQIS 218
GP+ A+FS + R P SL V D+ N +R+I+
Sbjct: 579 DGGPATSARFS--YPSALARGPDGSLYVADQDNHRVRRIA 616
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 59/121 (48%), Gaps = 17/121 (14%)
Query: 114 VAGS-FQGYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
VAG+ G++G DG P EA+ + P V + G LYVADT N +R+IG G+ T
Sbjct: 445 VAGTGADGFSG--DGGPATEAQLDEPTSVAVAPDGTLYVADTRNHRVRRIGRDGIITTIA 502
Query: 172 GKSNVAGFRDGPSEDA-KFSNDF-----------DVVYVRPTCSLLVIDRGNAALRQISL 219
G+ GF SED +S D + V + SLL+ D N +R+I L
Sbjct: 503 GQDEF-GFAGEVSEDGLAYSGDGLPAVNAKLNYPNTVLMETDGSLLIADGENNRVRRIGL 561
Query: 220 N 220
+
Sbjct: 562 D 562
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 71/144 (49%), Gaps = 7/144 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + DG L D N+ + +I + +AG+ G G ARF++
Sbjct: 535 PNTVLMETDGSLLIADGENNRVRRIGLDGIITT----IAGTGAEGFGGDGGPATSARFSY 590
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFD 194
P + G+LYVAD N +R+I GD ++T+AG GK+ +G GP++ A+ +
Sbjct: 591 PSALARGPDGSLYVADQDNHRVRRIAGDGTISTLAGTGKTGYSG-DGGPADQAQINAVGA 649
Query: 195 VVYVRPTCSLLVIDRGNAALRQIS 218
+ V ++ + D G+ +R+I+
Sbjct: 650 DLVVDAAGNVYLSDPGSNRVRRIA 673
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 3/89 (3%)
Query: 131 EARFNHPKGVTMDD-KGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAK 188
+A F+ +T+D +G +YVADT N IR++ AG +TT+AG ++ GP+ +A+
Sbjct: 405 QAEFDRASDLTIDAPRGYVYVADTDNHRIRRVDRAGTITTVAGTGADGFSGDGGPATEAQ 464
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
V V P +L V D N +R+I
Sbjct: 465 LDEPTSVA-VAPDGTLYVADTRNHRVRRI 492
>gi|116625892|ref|YP_828048.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116229054|gb|ABJ87763.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 2384
Score = 51.6 bits (122), Expect = 8e-04, Method: Composition-based stats.
Identities = 43/111 (38%), Positives = 57/111 (51%), Gaps = 9/111 (8%)
Query: 114 VAGS-FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGV-TTIA 170
VAG+ QG+ G G A+ +HP G+ D GNLY+AD N IR+I G GV TTIA
Sbjct: 145 VAGTGVQGFAGD-GGLAGSAQLSHPAGIAFDSAGNLYIADMGNFRIRRIDGQTGVITTIA 203
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS--LLVIDRGNAALRQISL 219
G SN + A S ++V P S LLV + G +R++ L
Sbjct: 204 GDGSNTTSPDGVLAAGAGLSQP---IWVAPDRSGGLLVSEMGAMRIRRVDL 251
Score = 42.0 bits (97), Expect = 0.69, Method: Composition-based stats.
Identities = 45/176 (25%), Positives = 78/176 (44%), Gaps = 18/176 (10%)
Query: 52 DGNVLQF----ENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQ 107
DG ++ + G L ++ + V P I V +F D N+ +V + Q
Sbjct: 25 DGKIISIVGAGQGGSLANSIPAASAQLVAPGGIWVDASNNVFIADSGNNRVVVV-----Q 79
Query: 108 YSRGRL--VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--D 163
Y G L +AG+ + G ++ N P G+ D GNLY+++ IR+I
Sbjct: 80 YPSGILYQIAGNGTATSSGDGGSALQSSVNRPMGLAADFNGNLYISEFQGNRIRRIDMQT 139
Query: 164 AGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
++T+AG + V GF G + A+ S+ + + +L + D GN +R+I
Sbjct: 140 GMISTVAG--TGVQGFAGDGGLAGSAQLSHPAGIAF-DSAGNLYIADMGNFRIRRI 192
>gi|336178839|ref|YP_004584214.1| serine/threonine protein kinase [Frankia symbiont of Datisca
glomerata]
gi|334859819|gb|AEH10293.1| serine/threonine protein kinase [Frankia symbiont of Datisca
glomerata]
Length = 776
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 56/118 (47%), Gaps = 9/118 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKP-NEARF 134
P + V DG L+ D N I +I VAGS GY+G DG P A+
Sbjct: 579 PTSVLVDADGTLYIADTGNHRIRRI----GTNDVITTVAGSGTSGYSG--DGGPATAAQL 632
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSN 191
P G+ D GNLY+AD N IR++ GV T+AG ++ GP+ A+ +N
Sbjct: 633 ARPGGLAADTAGNLYIADNANNRIRRVSSDGVIITVAGSGTSGYSGDGGPATAAQLAN 690
Score = 45.1 bits (105), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 17/121 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKP-NEA 132
P + G L+ D N+ I +++ S G + VAGS GY+G DG P A
Sbjct: 635 PGGLAADTAGNLYIADNANNRIRRVS------SDGVIITVAGSGTSGYSG--DGGPATAA 686
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFR--DGPSEDAKF 189
+ +P V + D G +Y+ADT N +R++ D +TT+AG S+ AG+ GP+ A+
Sbjct: 687 QLANPGSVAVTDDGRVYIADTDNNRVRRVDADGVITTVAG--SDEAGYSGDGGPATAARL 744
Query: 190 S 190
Sbjct: 745 C 745
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 64/146 (43%), Gaps = 8/146 (5%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEAR 133
P + G L+ D N+ I +I + + GY+G DG P A+
Sbjct: 521 ATPTDVARDRAGNLYIADTDNNRIRRIN---VVGTVTTVAGTGTPGYSG--DGGPATAAQ 575
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSND 192
P V +D G LY+ADT N IR+IG V TT+AG ++ GP+ A+ +
Sbjct: 576 LAKPTSVLVDADGTLYIADTGNHRIRRIGTNDVITTVAGSGTSGYSGDGGPATAAQLARP 635
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
+ +L + D N +R++S
Sbjct: 636 GGLA-ADTAGNLYIADNANNRIRRVS 660
Score = 42.0 bits (97), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 57/109 (52%), Gaps = 8/109 (7%)
Query: 114 VAGS-FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG 171
VAG+ G++G +G A + P +D+ G +Y DT N IR+IG D +TT+AG
Sbjct: 444 VAGTGVAGFSGD-NGPATAADLSKPDDPLVDNTGAVYFTDTGNNRIRRIGADGIITTVAG 502
Query: 172 GKSNVAGFR--DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ GF +GP+ A F+ DV R +L + D N +R+I+
Sbjct: 503 --TGTYGFSGDNGPAAQAHFATPTDVARDR-AGNLYIADTDNNRIRRIN 548
>gi|326778226|ref|ZP_08237491.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces griseus XylebKG-1]
gi|326658559|gb|EGE43405.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces griseus XylebKG-1]
Length = 615
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 63/126 (50%), Gaps = 11/126 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P K + DG L D +V++ + G V F G G DG P+EARF+
Sbjct: 187 PGKALLLPDGGLLVSDTTRHRLVELD------ADGETVRRHFGTGERGLRDGGPDEARFS 240
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVT-TIAG-GKSNVAGF-RDGPSEDAKFSN 191
P+G+ + G + VADT+N AIR + GVT T+AG G+ G GP+ + S+
Sbjct: 241 EPQGLAVLPDGRVAVADTVNHAIRALDLTTGVTSTLAGTGRQWWQGTPTSGPAREVDLSS 300
Query: 192 DFDVVY 197
+DV +
Sbjct: 301 PWDVAW 306
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 5/96 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS DGE V + ++ ++ ++ + G F GH DG ++A H
Sbjct: 355 PSGLAVSADGERLWVADSETSALRWVD-RDEHVHTAVGTGLFD--FGHRDGAADQALLQH 411
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
P GVT G++ ++DT N A+R+ A VTT+A
Sbjct: 412 PIGVTALPDGSVAISDTYNHALRRYDPASGEVTTLA 447
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 46/96 (47%), Gaps = 3/96 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR 180
G VDG EA F P G+ + G L+VAD+ A+R + D V T G G R
Sbjct: 341 GLVDGPAAEAWFAQPSGLAVSADGERLWVADSETSALRWVDRDEHVHTAVGTGLFDFGHR 400
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
DG ++ A + V + P S+ + D N ALR+
Sbjct: 401 DGAADQALLQHPIGVTAL-PDGSVAISDTYNHALRR 435
>gi|290972390|ref|XP_002668936.1| predicted protein [Naegleria gruberi]
gi|284082474|gb|EFC36192.1| predicted protein [Naegleria gruberi]
Length = 727
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 70/149 (46%), Gaps = 10/149 (6%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
V P + VS E++ D N+ I KI L + + QGY G +G A+
Sbjct: 65 VSPSYVFVSSKNEVYIADYQNNRIRKI---LENGNIITIAGNGQQGYNGD-NGLAVNAQL 120
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P V + K +Y+AD+ N +IRKI + G + TIAG +GP+ +A F
Sbjct: 121 ASPSSVCVSLKNEVYIADSGNHSIRKILENGNIVTIAGNGQQGYNGDNGPAVNAPL---F 177
Query: 194 D--VVYVRPTCSLLVIDRGNAALRQISLN 220
D V+V P + + D GN +R+I N
Sbjct: 178 DPSSVFVTPNDEVYIADTGNHRIRKILRN 206
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 66/142 (46%), Gaps = 6/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V+ + E++ D N I KI L + + QGY+G +G A+ +
Sbjct: 179 PSSVFVTPNDEVYIADTGNHRIRKI---LRNGNIITIAGNGQQGYSGD-NGPAVNAKLSS 234
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P V + K +Y+AD N IRKI + G + TIAG G +G + +AK +
Sbjct: 235 PVDVFVSSKNEVYIADYQNTRIRKILENGNIITIAGNGKLGYGGDNGLAVNAKLYSPVG- 293
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
V+V + + D N +R+I
Sbjct: 294 VFVSQDNEVYIADYQNNRIRKI 315
>gi|182437571|ref|YP_001825290.1| hypothetical protein SGR_3778 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|178466087|dbj|BAG20607.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
NBRC 13350]
Length = 605
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 63/126 (50%), Gaps = 11/126 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P K + DG L D +V++ + G V F G G DG P+EARF+
Sbjct: 177 PGKALLLPDGGLLVSDTTRHRLVELD------ADGETVRRHFGTGERGLRDGGPDEARFS 230
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVT-TIAG-GKSNVAGF-RDGPSEDAKFSN 191
P+G+ + G + VADT+N AIR + GVT T+AG G+ G GP+ + S+
Sbjct: 231 EPQGLAVLPDGRVAVADTVNHAIRALDLTTGVTSTLAGTGRQWWQGTPTSGPAREVDLSS 290
Query: 192 DFDVVY 197
+DV +
Sbjct: 291 PWDVAW 296
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 5/96 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS DGE V + ++ ++ ++ + G F GH DG ++A H
Sbjct: 345 PSGLAVSADGERLWVADSETSALRWVD-RDEHVHTAVGTGLFD--FGHRDGAADQALLQH 401
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
P GVT G++ ++DT N A+R+ A VTT+A
Sbjct: 402 PIGVTALPDGSVAISDTYNHALRRYDPASGEVTTLA 437
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 46/96 (47%), Gaps = 3/96 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR 180
G VDG EA F P G+ + G L+VAD+ A+R + D V T G G R
Sbjct: 331 GLVDGPAAEAWFAQPSGLAVSADGERLWVADSETSALRWVDRDEHVHTAVGTGLFDFGHR 390
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
DG ++ A + V + P S+ + D N ALR+
Sbjct: 391 DGAADQALLQHPIGVTAL-PDGSVAISDTYNHALRR 425
>gi|284044371|ref|YP_003394711.1| NHL repeat containing protein [Conexibacter woesei DSM 14684]
gi|283948592|gb|ADB51336.1| NHL repeat containing protein [Conexibacter woesei DSM 14684]
Length = 623
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 52/123 (42%), Gaps = 5/123 (4%)
Query: 115 AGSFQGYTGHVDGKPNEA-----RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTI 169
AG G TG G+ A R N P+GV+ G +ADTLN IR++ AG+ T
Sbjct: 107 AGPANGATGTFAGEGVPATDATVRLNQPRGVSPTGDGGFLIADTLNNRIRRVDAAGLITT 166
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYN 229
G A DG A N V P S L+ D GN +R++ E
Sbjct: 167 VAGTGAAAFTGDGGPATAAALNAPQGVAALPDGSFLIADTGNHRIRRVDAASGRIERVVG 226
Query: 230 SIS 232
+++
Sbjct: 227 NLT 229
Score = 41.6 bits (96), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 62/134 (46%), Gaps = 14/134 (10%)
Query: 44 SSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITP 103
+S + + GN+ GY + N +VP ++ G D N+ I ++
Sbjct: 217 ASGRIERVVGNLTMPNGGYAGDNDQAINAAVLVPARVAPLAGGGFLIADTGNNRIRRVD- 275
Query: 104 PLSQYSRGRL--VAGSFQ--GYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAI 158
+RG + VAGS G+ G DG P A N P+GV G++ +AD+ N I
Sbjct: 276 -----TRGVITTVAGSAAPAGFGG--DGGPATAAALNQPEGVAAGGDGSILIADSTNERI 328
Query: 159 RKIGDAGVT-TIAG 171
R++G GV T+AG
Sbjct: 329 RQVGADGVIRTLAG 342
>gi|375146530|ref|YP_005008971.1| NHL repeat containing protein [Niastella koreensis GR20-10]
gi|361060576|gb|AEV99567.1| NHL repeat containing protein [Niastella koreensis GR20-10]
Length = 434
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 65/122 (53%), Gaps = 11/122 (9%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGV----TMDDKGNLYVADTLNLAIRKIG-DAGVTT 168
+AGS G G DGK A FN GV ++D+ GN+YV D N IRKI D VTT
Sbjct: 126 LAGS--GNAGFADGKGANASFNFA-GVRCQLSVDNIGNVYVPDGGNQRIRKIAPDGTVTT 182
Query: 169 IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQY 228
IAG N G+ +GP+ AKF+N + ++ V +R +R+I+ + + + Y
Sbjct: 183 IAGTGEN--GYNEGPASGAKFNNPCATA-MDANGNMYVAERNGRRIRKITPDGNVSTHAY 239
Query: 229 NS 230
+S
Sbjct: 240 SS 241
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 47/89 (52%), Gaps = 9/89 (10%)
Query: 77 PYKIRVSEDGELFAV---DEVNSNIVK-ITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEA 132
P+ I + L+ + NSN V+ ITP + + +AG+ G +G+ +G +
Sbjct: 348 PWGIAMDNKSNLYVTGLGEGRNSNCVRMITPDVWNVT---TIAGA--GDSGYAEGTGSSV 402
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
RFN P G+ +D GN+Y+ D N +RKI
Sbjct: 403 RFNGPTGIAVDKNGNMYILDMANNRVRKI 431
Score = 45.1 bits (105), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 71/154 (46%), Gaps = 16/154 (10%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQ--YSRGRLVAGSFQGYTGHVDGKPNEAR 133
+P I + G ++A + K S + G +AG +G VDG AR
Sbjct: 286 LPCTITIDGAGNIYATHYDAQTVRKYVYNASSDAFDNGADIAG-MDHQSGWVDGIGTVAR 344
Query: 134 FNHPKGVTMDDKGNLYV--------ADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSE 185
FN+P G+ MD+K NLYV ++ + + + + VTTIAG +G+ +G
Sbjct: 345 FNNPWGIAMDNKSNLYVTGLGEGRNSNCVRMITPDVWN--VTTIAGAGD--SGYAEGTGS 400
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+F+ + V ++ ++D N +R+I++
Sbjct: 401 SVRFNGPTGIA-VDKNGNMYILDMANNRVRKITV 433
Score = 42.0 bits (97), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 6/86 (6%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
V ++ V G ++ D N I KI P G + + G G+ +G + A+FN
Sbjct: 149 VRCQLSVDNIGNVYVPDGGNQRIRKIAPD------GTVTTIAGTGENGYNEGPASGAKFN 202
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI 161
+P MD GN+YVA+ IRKI
Sbjct: 203 NPCATAMDANGNMYVAERNGRRIRKI 228
>gi|290993799|ref|XP_002679520.1| predicted protein [Naegleria gruberi]
gi|284093137|gb|EFC46776.1| predicted protein [Naegleria gruberi]
Length = 1414
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 78/145 (53%), Gaps = 13/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ--GYTGHVDGKPNEARF 134
P I + G+++ VD +++ +++ +AG Q GY G + N+A+
Sbjct: 56 PTAIAFNSRGDMYVVDAQ----LQVIRKIAKDGITTTIAGFPQDAGYNGD-NILANKAKL 110
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGP-SEDAKFSND 192
N+P+GV +++ ++++D+ N IRKI ++G ++T+AG N GF D + + KF N
Sbjct: 111 NYPRGVAVNENDEIFISDSGNYRIRKISNSGIISTVAGTGEN--GFMDHVLAINGKFGNP 168
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
++Y T L + D+ N +R++
Sbjct: 169 SHLLYTNST--LYINDQSNNKIRKL 191
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 54/100 (54%), Gaps = 8/100 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK-PNEARFN 135
P I ++ +GE++ D N I K++ G + + G G++D E++ N
Sbjct: 826 PIGISLNSEGEVYIADLSNHRIRKVSKS------GIISTIAGLGAAGYIDNVLATESQLN 879
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKS 174
PKGV + G +++AD+ N +RKI +G ++TIAGG S
Sbjct: 880 APKGVVVAPSGEVFIADSNNNKVRKISTSGIISTIAGGGS 919
Score = 42.0 bits (97), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 53/107 (49%), Gaps = 4/107 (3%)
Query: 114 VAGSFQGYTGHVDGK-PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
VAG + + D K A +P G++++ +G +Y+AD N IRK+ +G ++TIAG
Sbjct: 802 VAGMIYNFGYNGDRKIATTAHLWNPIGISLNSEGEVYIADLSNHRIRKVSKSGIISTIAG 861
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
AG+ D N V V P+ + + D N +R+IS
Sbjct: 862 --LGAAGYIDNVLATESQLNAPKGVVVAPSGEVFIADSNNNKVRKIS 906
Score = 38.5 bits (88), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 51/117 (43%), Gaps = 14/117 (11%)
Query: 108 YSRGRLVAGSFQGYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
Y+ LV G Q + DG P ++ +P + + +G++YV D IRKI G+
Sbjct: 30 YNISLLVGGGNQ----YGDGLPADKINLYNPTAIAFNSRGDMYVVDAQLQVIRKIAKDGI 85
Query: 167 TTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC-----SLLVIDRGNAALRQIS 218
TT G AG+ D +N + Y R + + D GN +R+IS
Sbjct: 86 TTTIAGFPQDAGYNG----DNILANKAKLNYPRGVAVNENDEIFISDSGNYRIRKIS 138
>gi|423072738|ref|ZP_17061487.1| fibronectin type III domain protein [Desulfitobacterium hafniense
DP7]
gi|361856502|gb|EHL08400.1| fibronectin type III domain protein [Desulfitobacterium hafniense
DP7]
Length = 1926
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 57/124 (45%), Gaps = 15/124 (12%)
Query: 67 VIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD 126
+I G IG++P + +E+ + +N + VAG+ GY+G D
Sbjct: 244 IINGARIGIIPARDYGAEEALYGIATAMEAN------------KLYTVAGTSTGYSG--D 289
Query: 127 GKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSE 185
G P A+ P + D GN+Y++D+ N IRK+ G+ T GK GP+E
Sbjct: 290 GGPATAAQLKTPNALAFDGAGNMYISDSGNRRIRKVDPQGIITTIAGKETNGDPTPGPAE 349
Query: 186 DAKF 189
D+K
Sbjct: 350 DSKI 353
>gi|417772389|ref|ZP_12420278.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Pomona]
gi|418684105|ref|ZP_13245295.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Kennewicki LC82-25]
gi|400324210|gb|EJO76509.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Kennewicki LC82-25]
gi|409945760|gb|EKN95775.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Pomona]
gi|455670390|gb|EMF35375.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
Fox 32256]
Length = 358
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 56/109 (51%), Gaps = 5/109 (4%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIA 170
L AGS + G +G + F+ P + +D + NLYV + N IRKI V+T++
Sbjct: 156 LFAGSSSAFGGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLS 215
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
GG ++G+ DG A+F + + Y R SLL D + +R+I L
Sbjct: 216 GG---ISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDL 261
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
L AG+ G VDG + A F P G+ +D GN+YV+D +N IRKI +G
Sbjct: 59 LFAGT--GVAASVDGTTSTASFKTPFGLEVDTSGNIYVSDQINNLIRKIDPSG 109
>gi|290985345|ref|XP_002675386.1| predicted protein [Naegleria gruberi]
gi|284088982|gb|EFC42642.1| predicted protein [Naegleria gruberi]
Length = 788
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 73/144 (50%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P + VS+D E++ D N+ + KI L + +G++G DG P A+ N
Sbjct: 212 PCSVFVSKDDEVYIADSQNNRVRKI---LRNGIINTIAGTGEEGFSG--DGGPATNAQLN 266
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDF 193
P GV + +Y+ D+ N IRKI G + TIAG G+ +G GP+ +A+ SN +
Sbjct: 267 VPCGVFVSATNEVYIVDSGNCRIRKILRNGIINTIAGTGEEGYSG-DGGPAINAQISNVY 325
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
+ ++V + + D N +R+I
Sbjct: 326 N-IFVSQNDEVYIADTNNHRIRKI 348
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 76/148 (51%), Gaps = 14/148 (9%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQ-GYTGHVDGKPNEARF 134
+ I +S++ EL+ VD N I KI G +V AG+ + G++G G A+
Sbjct: 157 WNIALSQNNELYFVDYSNCRIRKIL------RNGTIVTIAGTGEDGFSGD-GGLAINAKL 209
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSND 192
+P V + +Y+AD+ N +RKI G + TIAG G+ +G GP+ +A+ +
Sbjct: 210 GYPCSVFVSKDDEVYIADSQNNRVRKILRNGIINTIAGTGEEGFSG-DGGPATNAQLNVP 268
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLN 220
V+V T + ++D GN +R+I N
Sbjct: 269 CG-VFVSATNEVYIVDSGNCRIRKILRN 295
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 59/127 (46%), Gaps = 13/127 (10%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARF 134
VP + VS E++ VD N I KI L + +GY+G DG P A+
Sbjct: 267 VPCGVFVSATNEVYIVDSGNCRIRKI---LRNGIINTIAGTGEEGYSG--DGGPAINAQI 321
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAG-GKSNVAGFRDGP---SEDAKF 189
++ + + +Y+ADT N IRKI D + TIAG G+ +G D P S
Sbjct: 322 SNVYNIFVSQNDEVYIADTNNHRIRKILKDGTIETIAGNGEKGFSG--DSPFDFSSHPHI 379
Query: 190 SNDFDVV 196
ND+ ++
Sbjct: 380 GNDYTII 386
>gi|421117009|ref|ZP_15577380.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
Fiocruz LV133]
gi|410011441|gb|EKO69561.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
Fiocruz LV133]
Length = 358
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 56/109 (51%), Gaps = 5/109 (4%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIA 170
L AGS + G +G + F+ P + +D + NLYV + N IRKI V+T++
Sbjct: 156 LFAGSSSAFGGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLS 215
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
GG ++G+ DG A+F + + Y R SLL D + +R+I L
Sbjct: 216 GG---ISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDL 261
Score = 42.4 bits (98), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
L AG+ G VDG + A F P G+ +D GN+YV+D +N IRKI +G
Sbjct: 59 LFAGT--GVAASVDGTISTASFKTPFGLEVDTSGNIYVSDQINNLIRKIDPSG 109
>gi|413950688|gb|AFW83337.1| hypothetical protein ZEAMMB73_852374 [Zea mays]
Length = 345
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 40/68 (58%), Gaps = 13/68 (19%)
Query: 61 GYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG 120
GY +ETV + + +G+ PY V++ GEL +D V+ SR +LVAGS +G
Sbjct: 138 GYTMETVFDSSMLGIEPYSREVTQSGELLVMDNVD-------------SRPKLVAGSPEG 184
Query: 121 YTGHVDGK 128
+ GH+DGK
Sbjct: 185 FPGHIDGK 192
>gi|418688943|ref|ZP_13250072.1| putative lipoprotein [Leptospira interrogans str. FPW2026]
gi|421128059|ref|ZP_15588277.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. 2006006986]
gi|421134520|ref|ZP_15594655.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. Andaman]
gi|400361935|gb|EJP17894.1| putative lipoprotein [Leptospira interrogans str. FPW2026]
gi|410021314|gb|EKO88104.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. Andaman]
gi|410434526|gb|EKP83664.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. 2006006986]
Length = 358
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 56/109 (51%), Gaps = 5/109 (4%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIA 170
L AGS + G +G + F+ P + +D + NLYV + N IRKI V+T++
Sbjct: 156 LFAGSSSAFGGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLS 215
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
GG ++G+ DG A+F + + Y R SLL D + +R+I L
Sbjct: 216 GG---ISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDL 261
Score = 42.4 bits (98), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
L AG+ G VDG + A F P G+ +D GN+YV+D +N IRKI +G
Sbjct: 59 LFAGT--GVAASVDGTISTASFKTPFGLEVDTSGNIYVSDQINNLIRKIDPSG 109
>gi|383120609|ref|ZP_09941337.1| hypothetical protein BSIG_2379 [Bacteroides sp. 1_1_6]
gi|382985045|gb|EES68418.2| hypothetical protein BSIG_2379 [Bacteroides sp. 1_1_6]
Length = 483
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 68/146 (46%), Gaps = 7/146 (4%)
Query: 80 IRVSEDGELFAVDEVN-SNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPK 138
+ ++E +++ VD N K T P Y+ ++ G G G DG A+F++P
Sbjct: 338 VAMAEGNQIWRVDVGNLEGKDKNTYPGEGYAGKAILEGQVAG-AGWEDGLLRNAKFDNPH 396
Query: 139 GVTMDDKGNLYVADTLNLAIRKIG-----DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ + G LY+AD N IR I D + T G + G++DG + A F++ F
Sbjct: 397 QICFTEDGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMKGYKDGGPDIALFNHPF 456
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISL 219
V + V D GN +R++S+
Sbjct: 457 GVAVSADGQIVYVADTGNKVIRKLSI 482
>gi|336180276|ref|YP_004585651.1| NHL repeat containing protein [Frankia symbiont of Datisca
glomerata]
gi|334861256|gb|AEH11730.1| NHL repeat containing protein [Frankia symbiont of Datisca
glomerata]
Length = 398
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 76/149 (51%), Gaps = 14/149 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKP-NEARF 134
P V DG ++ D N I KI P L + +AG+ G++G DG P A+
Sbjct: 138 PATTAVDSDGNVYISDSHNHRIRKIDP-LGIIT---TIAGTGTAGFSG--DGGPATAAQL 191
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSN 191
N P G+ + ++Y++D N IRKI G +TTIAG + AGF GP+ A+ N
Sbjct: 192 NEPYGLAVATDDSIYISDYENQRIRKIDPLGIITTIAG--TGTAGFSGDGGPALQAQIKN 249
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISLN 220
++ V +L + + GNA +R++S N
Sbjct: 250 PNNLA-VAADGTLYISELGNARIRKVSPN 277
Score = 45.1 bits (105), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 66/143 (46%), Gaps = 8/143 (5%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHP 137
Y + + G L+ D + I KI P + + GS G G +A N P
Sbjct: 81 YDVTIDRAGALYITDWTSHQIRKIQPDGTVVTVAG--TGSGNAGFGGDGGPATQAELNSP 138
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSNDFD 194
+D GN+Y++D+ N IRKI G +TTIAG + AGF GP+ A+ + +
Sbjct: 139 ATTAVDSDGNVYISDSHNHRIRKIDPLGIITTIAG--TGTAGFSGDGGPATAAQLNEPYG 196
Query: 195 VVYVRPTCSLLVIDRGNAALRQI 217
+ V S+ + D N +R+I
Sbjct: 197 LA-VATDDSIYISDYENQRIRKI 218
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 73/147 (49%), Gaps = 11/147 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPN-EARF 134
PY + V+ D ++ D N I KI P L + +AG+ G++G DG P +A+
Sbjct: 194 PYGLAVATDDSIYISDYENQRIRKIDP-LGIIT---TIAGTGTAGFSG--DGGPALQAQI 247
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+P + + G LY+++ N IRK+ GV TT+AG ++ G GP+ A+
Sbjct: 248 KNPNNLAVAADGTLYISELGNARIRKVSPNGVITTVAGNGTSGYGGDGGPATAAQLR--V 305
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
V + P L + D N +R+++ N
Sbjct: 306 PSVALSPDGVLYIADYSNERIRRVATN 332
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 47/95 (49%), Gaps = 9/95 (9%)
Query: 79 KIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARFNH 136
+ +S DG L+ D N I ++ + G + VAG+ + A+ +
Sbjct: 307 SVALSPDGVLYIADYSNERIRRVA------TNGVITTVAGNGTAGSAGDGAAAIRAQLSS 360
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIA 170
P GVT+D GNLY+AD N +R++ G +TT+A
Sbjct: 361 PTGVTVDGAGNLYIADDKNDRVRRVSTTGIITTVA 395
>gi|332668794|ref|YP_004451801.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Cellulomonas fimi ATCC 484]
gi|332337831|gb|AEE44414.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Cellulomonas fimi ATCC 484]
Length = 639
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 58/117 (49%), Gaps = 22/117 (18%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKG-------NLYVADTLNLAIR--KIGDAGVTTIA 170
G G VDG P+EARF+ P G+ + G ++ VADT+N A+R ++ D VTT+A
Sbjct: 219 GERGFVDGGPDEARFSEPNGLALVPDGLRATLGYDVLVADTVNHALRGVRLSDGSVTTVA 278
Query: 171 G-GKSNVAGFRDG------------PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
G G+ + G D P+ D + S+ +DV + + +V GN L
Sbjct: 279 GTGEQLMVGAADNVRGGGTGAGYDGPARDVRLSSPWDVAWSPVLGAFVVAMAGNHTL 335
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 70/159 (44%), Gaps = 15/159 (9%)
Query: 52 DGNVLQFENGYLVETVIEGNEIGVV---PYKIRVSEDGELFAVDEVNSNIVKITPPLSQY 108
DG ++ G + E +++G G P + V DG ++ D S + + P +
Sbjct: 339 DGTDVRHVAGTMNEGLLDGPGEGAWFAQPSGLSVGPDGRVWVADSETSALRWLDP--ADG 396
Query: 109 SRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK-------- 160
+ G V + GH DG+ +EAR HP GV G++ VADT N A+R+
Sbjct: 397 TVGTAVGEGLFDF-GHRDGRADEARMQHPLGVAALPDGSVLVADTYNGAVRRYVPPAADE 455
Query: 161 -IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 198
G VTT+A + +G + A S DV+ V
Sbjct: 456 PAGAGEVTTVASDLAEPSGLVVVARDGAGSSGALDVLVV 494
>gi|290982010|ref|XP_002673724.1| NHL repeat-containing protein [Naegleria gruberi]
gi|284087309|gb|EFC40980.1| NHL repeat-containing protein [Naegleria gruberi]
Length = 456
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 70/147 (47%), Gaps = 9/147 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFN 135
P I V G ++ D N I K T + S +AG+ + GY G + A+ +
Sbjct: 141 PTSIAVDSSGNIYIADTQNHRIRKFTALTGKLST---IAGTEEEGYNGD-NIIATTAKLS 196
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P GV +D GN+Y+AD+ N IRK I + ++TIAG + DG S A N+
Sbjct: 197 SPTGVALDSIGNVYIADSQNHRIRKITISNGKISTIAGDGDAGSPLYDGLSAIASSLNNP 256
Query: 194 DVVYVRPTCSLLV--IDRGNAALRQIS 218
V V + S V D N+ +R+I+
Sbjct: 257 TSVAVDQSSSNTVYFADTNNSKIRKIT 283
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 53/114 (46%), Gaps = 14/114 (12%)
Query: 59 ENGYLVETVIEG---------NEIGVV--PYKIRVSEDGELFAVDEVNSNIVKITPPLSQ 107
+NGY T I G E + P+ I + +G+++ D+ N+ I KI P
Sbjct: 345 QNGYYTMTTIAGGCRPGFPSTTEFACLDSPFGIAIGPNGDVYVADKSNARIKKINPTTKT 404
Query: 108 YSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
S + GS G+ G +G + N P G+ + G +Y+ADT N IRK+
Sbjct: 405 IST--IAGGSGAGFDGD-NGPAILSSLNDPCGIAVQSNGAVYIADTKNNRIRKV 455
>gi|390369862|ref|XP_798415.2| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
purpuratus]
Length = 322
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 66/135 (48%), Gaps = 11/135 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG-YTGHVDGKPNEARFN 135
P K+ + +G L AV + N V I + +V G TG DG EARF+
Sbjct: 173 PGKVTTNPEGTLLAVSDTGHNRVIIV------ALDGVVQHCIGGPETGFNDGLYQEARFH 226
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGFRDGPSEDAKFSND 192
P+G+ + +YVADT N AIRKI + VTTIAG G+ V + G + S+
Sbjct: 227 SPQGLCWA-QDVIYVADTENHAIRKIDLKEKRVTTIAGTGEQGVDWYGAGRGTEQVISSP 285
Query: 193 FDVVYVRPTCSLLVI 207
+DVV P +L I
Sbjct: 286 WDVVLGPPNEDVLFI 300
>gi|421740691|ref|ZP_16178929.1| NHL repeat protein [Streptomyces sp. SM8]
gi|406690906|gb|EKC94689.1| NHL repeat protein [Streptomyces sp. SM8]
Length = 444
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 59/125 (47%), Gaps = 7/125 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA-GFRD 181
G VDG EA F P G+ D L+VAD+ A+R + G A G G RD
Sbjct: 164 GLVDGPGAEAWFAQPSGLAADGDARLWVADSETSALRWVDPEGTVHTAVGTGLFDFGHRD 223
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVG 241
G + A F + V V P S+ V D N ALR+ D + ++++ TD+ G
Sbjct: 224 GEAAQALFQHPL-AVTVLPDGSVAVSDTYNHALRRY----DPASGEVSTLA-TDLREPSG 277
Query: 242 AVLVG 246
AVLVG
Sbjct: 278 AVLVG 282
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 50/84 (59%), Gaps = 5/84 (5%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAG--VTTIAG-GKS 174
+G G DG P EARF+ P+G+ + +G+ + VADT+N A+R++ A VTT AG G+
Sbjct: 46 EGERGLADGGPAEARFSEPQGLALTPEGDAVIVADTVNHALRRVELASGVVTTPAGTGRQ 105
Query: 175 NVAGF-RDGPSEDAKFSNDFDVVY 197
G GP+ + S+ +DV +
Sbjct: 106 WWQGSPTSGPAREVDLSSPWDVAW 129
>gi|351729451|ref|ZP_08947142.1| NHL repeat-containing protein [Acidovorax radicis N35]
Length = 663
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 35/68 (51%)
Query: 124 HVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGP 183
++D ARFN P+ V +D GN YVADT N +RKI +G T G+ V G DG
Sbjct: 48 YLDAAGTAARFNAPQAVAVDTAGNTYVADTGNHLVRKIDASGNVTTLAGQEGVCGNADGT 107
Query: 184 SEDAKFSN 191
A +
Sbjct: 108 GTAATLCS 115
Score = 41.6 bits (96), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 36/76 (47%), Gaps = 8/76 (10%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVAD-------TLNLAIRKIGDAGVTTIAGGKSN 175
GH DG A F + +D GNLYV D + IRKI AG+ + G++N
Sbjct: 499 GHADGPAGSALFAGLGSMALDPAGNLYVVDGVYHFITGVGPTIRKITPAGMVSTLAGRAN 558
Query: 176 V-AGFRDGPSEDAKFS 190
V G DGP A+ +
Sbjct: 559 VPPGLVDGPVSQAQLT 574
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 49/96 (51%), Gaps = 7/96 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P ++ G L+A D N ++V+ + R +++G+ QG GHVDG + A+F
Sbjct: 182 PVQLATDSAGALYAADP-NDHVVR---KFAFGGRATVLSGTVAQGNAGHVDGSASAAKFG 237
Query: 136 HPKGVTMDDKGNLYVADT-LNLAIRKI-GDAGVTTI 169
+ + +D ++V D L +R+I D VTT+
Sbjct: 238 ELQAIAIDRSNRIFVLDRHLGTPLREIAADGSVTTV 273
Score = 38.9 bits (89), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 52/118 (44%), Gaps = 13/118 (11%)
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGG--KSNVAGFRDGPS 184
G+ N ARF +P + D G LY AD + +RK G T+ G AG DG +
Sbjct: 173 GRNNPARF-YPVQLATDSAGALYAADPNDHVVRKFAFGGRATVLSGTVAQGNAGHVDGSA 231
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRG-NAALRQISLNQD--------DCEYQYNSISP 233
AKF + + + + + V+DR LR+I+ + DC + + + P
Sbjct: 232 SAAKF-GELQAIAIDRSNRIFVLDRHLGTPLREIAADGSVTTVRRAADCAFGADGLPP 288
>gi|293372451|ref|ZP_06618835.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
gi|292632634|gb|EFF51228.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
Length = 446
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 58/123 (47%), Gaps = 19/123 (15%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHP-KGVTM----------DDKGNLYVADTLNLAIRKIGD 163
AG F Y G+ D N ARF++P +G + D + Y+ D N IRKI
Sbjct: 324 AGEFGSY-GYKDASQNSARFDNPMQGCFVLNEEYVAEQRADIYDFYLTDAANHCIRKITP 382
Query: 164 AGVTTIAGGKSN------VAGFRDG-PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
G+ T G+ + V+G+ DG P E A+F+ + Y T + V D GN +R
Sbjct: 383 DGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLCYEESTGTFYVGDNGNHRVRT 442
Query: 217 ISL 219
I+L
Sbjct: 443 IAL 445
>gi|340370782|ref|XP_003383925.1| PREDICTED: NHL repeat-containing protein 2-like [Amphimedon
queenslandica]
Length = 730
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 50/102 (49%), Gaps = 5/102 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGF 179
G VDGK + + HP GV DD LY+AD+ N I+ + +T+AG S G
Sbjct: 459 GDVDGKGRDVKLQHPMGVAWDDTNQLLYIADSFNHKIKVVNPKTKVCSTLAGTGS--PGL 516
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
DG E A+FS + + +L V D N A+R + L +
Sbjct: 517 VDGSFEVAQFSEPAGLCMSQEGDTLYVADTNNHAIRILDLKE 558
Score = 42.0 bits (97), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 48/95 (50%), Gaps = 13/95 (13%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVA 177
G G DG+ EA FN P+G+ + ++ +YVADT N IRKI D V T+AG
Sbjct: 267 GSPGFKDGRGKEAEFNCPQGLVICEEC-VYVADTENHLIRKISLSDDFVLTVAG-----T 320
Query: 178 GFRDGPSEDAKFSNDFDV-----VYVRPTCSLLVI 207
G++ E K + ++ + + CS++ I
Sbjct: 321 GYQGNDKEGGKVGKEQEISSPWDLAINSDCSIIYI 355
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 38/73 (52%), Gaps = 6/73 (8%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
L+ D N I + P S +AG+ G G VDG A+F+ P G+ M +G+
Sbjct: 485 LYIADSFNHKIKVVNPKTKVCST---LAGT--GSPGLVDGSFEVAQFSEPAGLCMSQEGD 539
Query: 148 -LYVADTLNLAIR 159
LYVADT N AIR
Sbjct: 540 TLYVADTNNHAIR 552
>gi|398331208|ref|ZP_10515913.1| hypothetical protein LalesM3_04029 [Leptospira alexanderi serovar
Manhao 3 str. L 60]
Length = 357
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 68/142 (47%), Gaps = 20/142 (14%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ E+ ++F +D T S Y+ + G FQ +G + F P
Sbjct: 137 VSCKENAQIFKIDH--------TDQFSLYAGNSDIEG-FQ------NGDRLNSLFKGPFF 181
Query: 140 VTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 197
+ +D + NLYV + N AIRKI V+T++GG V+G+ DG A+F + + Y
Sbjct: 182 MDLDRERNLYVGELGNHAIRKINLNSGTVSTLSGG---VSGYLDGDLASAQFKSPLGITY 238
Query: 198 VRPTCSLLVIDRGNAALRQISL 219
+ T SLLV D N +R+I L
Sbjct: 239 DQKTDSLLVADLQNHRIRKIDL 260
>gi|290983036|ref|XP_002674235.1| predicted protein [Naegleria gruberi]
gi|284087824|gb|EFC41491.1| predicted protein [Naegleria gruberi]
Length = 737
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 71/143 (49%), Gaps = 11/143 (7%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD--GKPNEARFNHP 137
+ +S + E++ D NS I KI S G + + G +GH G A+ HP
Sbjct: 63 VTISPNREIYIADTGNSVIRKIN------STGYIHTIAGTGTSGHSGDGGLAVNAKLFHP 116
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF--RDGPSEDAKFSNDFDV 195
+ VT+ +G + +ADT N IRKI G GK + +G+ G ++DA S+
Sbjct: 117 RTVTVTSEGEILIADTSNHVIRKITKDGYIFTIAGKPSTSGYGGDGGLAKDALLSSP-QC 175
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V + P+ + +++ G A +R+I+
Sbjct: 176 VRISPSGDIFIVEYGGARIRKIN 198
>gi|153867703|ref|ZP_01997987.1| NHL repeat protein [Beggiatoa sp. SS]
gi|152144978|gb|EDN72014.1| NHL repeat protein [Beggiatoa sp. SS]
Length = 263
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 71/149 (47%), Gaps = 11/149 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG-SFQGYTGHVDGKP-NEARF 134
P + + G L+ D N+ + K+ S VAG S G G DG P A+
Sbjct: 89 PMWVSLDSAGNLYIADTGNNVVRKLDIASGNIST---VAGDSSVGVAG--DGGPATAAQL 143
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSND 192
++P G+ +D GNLY+AD+ N IRK+ G +TTIAG G AG DG A N
Sbjct: 144 SYPTGIDIDTAGNLYIADSSNHRIRKVDTTGNITTIAGDGTPGFAG--DGKIATAAQLNA 201
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
V V T + + D N +R++S +
Sbjct: 202 PTQVMVDSTGQVYIADTSNHRIRKVSTDN 230
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 52/102 (50%), Gaps = 5/102 (4%)
Query: 120 GYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
G+ G DG P A+ N P G+ +D GNLY+ADT N IRKI +G + G N
Sbjct: 17 GFAG--DGGPATAAQLNSPLGMGIDSAGNLYIADTENHRIRKIDSSGNISTVAGDGNRGY 74
Query: 179 FRDGPSE-DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
DG + AK +N V + +L + D GN +R++ +
Sbjct: 75 SGDGAAAVSAKLNNPM-WVSLDSAGNLYIADTGNNVVRKLDI 115
>gi|298383938|ref|ZP_06993499.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|298263542|gb|EFI06405.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 483
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 68/146 (46%), Gaps = 7/146 (4%)
Query: 80 IRVSEDGELFAVDEVN-SNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPK 138
+ ++E +++ VD N K T P Y+ ++ G G G DG A+F++P
Sbjct: 338 VAMAEGNQIWRVDVGNLEGKDKNTYPGEGYAGKAILEGQVAG-AGWEDGLLRNAKFDNPH 396
Query: 139 GVTMDDKGNLYVADTLNLAIRKIG-----DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ + G LY+AD N IR I D + T G + G++DG + A F++ F
Sbjct: 397 QICFTEDGKLYIADCGNNCIRVIDTKLPLDRAMVTTPIGLPGMKGYKDGGPDIALFNHPF 456
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISL 219
V + + D GN +R++S+
Sbjct: 457 GVAVSADGQIVYIADTGNKVIRKLSI 482
>gi|291223251|ref|XP_002731624.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 434
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 64/133 (48%), Gaps = 11/133 (8%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
F D N I +++ P + + +AG +G+ DGK A F HP GVT+D N
Sbjct: 197 FFVADTDNHAIREVSLPDGEVT---TIAGGEKGFK---DGKGTGATFYHPAGVTIDPIRN 250
Query: 148 -LYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
L++AD N AIR IG VTT+AG S GF +G A F+ + Y L
Sbjct: 251 ILFIADHYNHAIRMIGVESKIVTTLAG--SGKPGFVNGMGNQAMFNYPEGMAYDTENKVL 308
Query: 205 LVIDRGNAALRQI 217
V++ N +R +
Sbjct: 309 YVVEFDNNCVRIV 321
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 45/114 (39%), Positives = 57/114 (50%), Gaps = 17/114 (14%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG-NLYVADTLNLAIRKIG--DAGVTTIA 170
+AGS QGY DG +A F+HP G++M K L+VAD+ N IR I VTT A
Sbjct: 108 LAGSLQGYQ---DGVGEKAMFHHPAGMSMHRKTRTLFVADSANSRIRAINVVTGEVTTFA 164
Query: 171 G-GKSNVAGFRDGPSEDAKFSN----DFDVVYVRPTCSLLVIDRGNAALRQISL 219
G GK + +DG A F N D VY V D N A+R++SL
Sbjct: 165 GSGKEEL---KDGLKTIASFFNPQAVAVDHVYKD---RFFVADTDNHAIREVSL 212
Score = 45.4 bits (106), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 56/107 (52%), Gaps = 7/107 (6%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAG-VTTIAG 171
+AGS G G V+G N+A FN+P+G+ D + LYV + N +R + D G V + G
Sbjct: 275 LAGS--GKPGFVNGMGNQAMFNYPEGMAYDTENKVLYVVEFDNNCVRIVDDEGEVRSFVG 332
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G+ G DG E+AKF + + + + + D+ N +R IS
Sbjct: 333 GRE---GKSDGLGEEAKFFHPTGLTFDEKEKIIYITDQYNHQIRGIS 376
Score = 38.5 bits (88), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 44/95 (46%), Gaps = 8/95 (8%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTM 142
+E+ L+ V E ++N V+I + V G G DG EA+F HP G+T
Sbjct: 303 TENKVLYVV-EFDNNCVRIVDDEGE------VRSFVGGREGKSDGLGEEAKFFHPTGLTF 355
Query: 143 DDKGN-LYVADTLNLAIRKIGDAGVTTIAGGKSNV 176
D+K +Y+ D N IR I G T GK +
Sbjct: 356 DEKEKIIYITDQYNHQIRGISGIGAKTSPPGKITI 390
>gi|299144811|ref|ZP_07037879.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
gi|298515302|gb|EFI39183.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
Length = 459
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 58/123 (47%), Gaps = 19/123 (15%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHP-KGVTM----------DDKGNLYVADTLNLAIRKIGD 163
AG F Y G+ D N ARF++P +G + D + Y+ D N IRKI
Sbjct: 337 AGEFGSY-GYKDASQNSARFDNPMQGCFVLNEEYVAEQRADIYDFYLTDAANHCIRKITP 395
Query: 164 AGVTTIAGGKSN------VAGFRDG-PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
G+ T G+ + V+G+ DG P E A+F+ + Y T + V D GN +R
Sbjct: 396 DGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLCYEESTGTFYVGDNGNHRVRT 455
Query: 217 ISL 219
I+L
Sbjct: 456 IAL 458
>gi|255311905|pdb|3HRP|A Chain A, Crystal Structure Of Structural Genomics Protein Of
Unknown Function (np_812590.1) From Bacteroides
Thetaiotaomicron Vpi-5482 At 1.70 A Resolution
Length = 409
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 60/125 (48%), Gaps = 11/125 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP-LSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
PY I D + D+ S++ KITP ++ G + Q DG EA F
Sbjct: 270 PYLIYYFVDSNFYXSDQNLSSVYKITPDGECEWFCGSATQKTVQ------DGLREEALFA 323
Query: 136 HPKGVTMDDKGNLYVADTL-NLAIRK--IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
P G T+D+ GN Y+ D +RK I D V+T+A G+ +VA DG +A F+
Sbjct: 324 QPNGXTVDEDGNFYIVDGFKGYCLRKLDILDGYVSTVA-GQVDVASQIDGTPLEATFNYP 382
Query: 193 FDVVY 197
+D+ Y
Sbjct: 383 YDICY 387
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 37/84 (44%), Gaps = 3/84 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P V EDG + VD ++ L Y VAG +DG P EA FN+
Sbjct: 325 PNGXTVDEDGNFYIVDGFKGYCLRKLDILDGYVS--TVAGQVD-VASQIDGTPLEATFNY 381
Query: 137 PKGVTMDDKGNLYVADTLNLAIRK 160
P + D +G ++A+ AIRK
Sbjct: 382 PYDICYDGEGGYWIAEAWGKAIRK 405
>gi|451981204|ref|ZP_21929574.1| NHL repeat containing protein [Nitrospina gracilis 3/211]
gi|451761567|emb|CCQ90828.1| NHL repeat containing protein [Nitrospina gracilis 3/211]
Length = 488
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 60/122 (49%), Gaps = 9/122 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P KI SE +LFAV + N N V I R R+ +G+ G DG EA F
Sbjct: 178 PGKIAYSESDKLFAVADANHNQVVIADREGTI-RHRIGSGA----VGKADGGFEEASFYR 232
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDG-PSEDAKFSNDF 193
P+G+ D ++VADT N +RKI VTT+AG RD P+++ S+ +
Sbjct: 233 PQGLCFQDD-VVWVADTENHLLRKIDLNSKQVTTVAGTGDQGGFLRDTHPAKETALSSPW 291
Query: 194 DV 195
DV
Sbjct: 292 DV 293
>gi|428162807|gb|EKX31917.1| hypothetical protein GUITHDRAFT_57527, partial [Guillardia theta
CCMP2712]
Length = 203
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 68/137 (49%), Gaps = 9/137 (6%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV-TMDDKG 146
L V EV +++V+ +S G L AGS GY G DG+ A FNHP + T D
Sbjct: 33 LVLVSEVGNHVVRYINVIS-LQVGTL-AGS--GYVGFADGEARAAMFNHPIALATSPDNS 88
Query: 147 NLYVADTLNLAIR--KIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 204
+ +AD N +R + + ++T+AG AGF+DG DAKF+ + + +
Sbjct: 89 IVLIADGFNHRVRSFNVSNMSISTLAGDGG--AGFQDGIGTDAKFNFPSALSFFGDGTKV 146
Query: 205 LVIDRGNAALRQISLNQ 221
V D N +R I+L+
Sbjct: 147 AVTDMYNNKIRIITLDS 163
>gi|289707042|ref|ZP_06503373.1| antioxidant, AhpC/TSA family [Micrococcus luteus SK58]
gi|289556228|gb|EFD49588.1| antioxidant, AhpC/TSA family [Micrococcus luteus SK58]
Length = 703
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 44/93 (47%), Gaps = 8/93 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT----GHVDGKPNEA 132
P + V DG L+ D S + + GR G+ G GHVDG+ + A
Sbjct: 419 PSGLSVGADGALWVADSETSAVRWV----RTGEDGRREVGTAVGAGLFDFGHVDGEADRA 474
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
R HP GVT G++ +ADT N AIR+ AG
Sbjct: 475 RLQHPLGVTALPDGSVLIADTYNGAIRRYAPAG 507
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 55/119 (46%), Gaps = 6/119 (5%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR--KIGDAG--- 165
G L + G +DG A F P G+++ G L+VAD+ A+R + G+ G
Sbjct: 393 GALAVHAGTALEGLLDGDAGRAWFAQPSGLSVGADGALWVADSETSAVRWVRTGEDGRRE 452
Query: 166 VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC 224
V T G G DG ++ A+ + V + P S+L+ D N A+R+ + +D
Sbjct: 453 VGTAVGAGLFDFGHVDGEADRARLQHPLGVTAL-PDGSVLIADTYNGAIRRYAPAGEDA 510
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 37/61 (60%), Gaps = 9/61 (14%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTM--DDKG-----NLYVADTLNLAIR--KIGDAGVTTIA 170
G GH DG P+EA FN P+G+ + +D ++ VAD++N +R ++ D V+T+A
Sbjct: 260 GEKGHADGGPDEALFNEPQGLALLPEDVAERVGYDVVVADSVNHRLRGLRLSDGHVSTLA 319
Query: 171 G 171
G
Sbjct: 320 G 320
>gi|290995833|ref|XP_002680487.1| serine/threonine kinase [Naegleria gruberi]
gi|284094108|gb|EFC47743.1| serine/threonine kinase [Naegleria gruberi]
Length = 1078
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 175
GY G DG N+PK +D KG++Y ADT N IRK+ + +TTIAG K N
Sbjct: 177 GYNG--DGLATATDLNYPKYAQVDSKGDVYFADTYNGLIRKVSNGYITTIAGSKQN 230
>gi|455790313|gb|EMF42184.1| putative lipoprotein [Leptospira interrogans serovar Lora str. TE
1992]
Length = 294
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 56/109 (51%), Gaps = 5/109 (4%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIA 170
L AGS +G +G + F+ P + +D + NLYV + N IRKI V+T++
Sbjct: 156 LFAGSSSDLSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLS 215
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
GG ++G+ DG A+F + + Y R SLL D + +R+I L
Sbjct: 216 GG---ISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDL 261
Score = 43.1 bits (100), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
L AG+ G VDG + A F P G+ +D GN+YV+D +N IRKI +G
Sbjct: 59 LFAGT--GVAASVDGTTSTASFKTPFGLEVDTSGNIYVSDQINNLIRKIDPSG 109
>gi|312372811|gb|EFR20689.1| hypothetical protein AND_19679 [Anopheles darlingi]
Length = 745
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 47/98 (47%), Gaps = 7/98 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDK-GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD 181
G VDGK EA+F HP GV + K G++YVADT N I+KI DA + FR+
Sbjct: 474 GDVDGKGYEAKFQHPLGVAYNAKDGHIYVADTYNHKIKKI-DASTNC-----ATTCEFRE 527
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
E +FS + R L + D N L SL
Sbjct: 528 ANGEIKRFSEPAGLCLDRTGQLLFIADTNNHELLVASL 565
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 52/101 (51%), Gaps = 13/101 (12%)
Query: 77 PYKIRVSEDGE--LFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEA 132
P K+ E GE L+AV + ++ + I P +Y+ + G G+ VDG EA
Sbjct: 226 PGKVVCCEGGEEELYAVSDSGNHRILIFEPSGTVRYT----IGGKSSGF---VDGGFREA 278
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
RFN P+GV LYVAD N AIR+I V+T+AG
Sbjct: 279 RFNAPQGVAFRGGDELYVADNENHAIRRIDLRTRTVSTVAG 319
>gi|290975027|ref|XP_002670245.1| predicted protein [Naegleria gruberi]
gi|284083802|gb|EFC37501.1| predicted protein [Naegleria gruberi]
Length = 1363
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 70/144 (48%), Gaps = 13/144 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNI--VKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
PY I ++ + E++ D +N I V + +S VAG+ G++G G A+
Sbjct: 466 PYGIAITANDEIYIADNLNHRIRFVDVNGNIST------VAGTSIGFSGD-SGLATAAKL 518
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDF 193
N P V++ G+LY+AD N IR++ ++T AG G+S +G G + A S +
Sbjct: 519 NAPMDVSLSASGDLYIADRDNYRIRRVSSGVISTFAGNGQSGYSG-DGGQATSAALSQAY 577
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
V + + + D N +R++
Sbjct: 578 GVKVING--EVYISDSNNYKVRKV 599
Score = 47.0 bits (110), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 71/146 (48%), Gaps = 12/146 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRG---RLVAGSFQGYTGHVDGKPNEAR 133
P KI +G++ D + +I KI+ YS G R+ G++G G ++A+
Sbjct: 409 PQKIAKLSNGDIIVSDTGDHSIKKIS-----YSTGVISRIAGTGVAGFSGD-GGLASQAQ 462
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSND 192
N P G+ + +Y+AD LN IR + G ++T+AG +G G + AK +
Sbjct: 463 LNKPYGIAITANDEIYIADNLNHRIRFVDVNGNISTVAGTSIGFSG-DSGLATAAKLNAP 521
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
D V + + L + DR N +R++S
Sbjct: 522 MD-VSLSASGDLYIADRDNYRIRRVS 546
Score = 42.4 bits (98), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 46/98 (46%), Gaps = 9/98 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD--GKPNEARF 134
P + +S GEL+ D N I K+ + G +V + G +GH G A
Sbjct: 161 PMDVSLSTSGELYIADMYNYRIRKV------LTNGTIVTFAGSGQSGHTGDGGLATNAAM 214
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
GV + G +Y++D+ N +RK+ G +TT+AG
Sbjct: 215 ALAYGVKVFSNGEVYISDSFNFKVRKVDVNGTITTVAG 252
Score = 41.6 bits (96), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 72/147 (48%), Gaps = 12/147 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFN 135
P + +S G+L+ D N I +++ + AG+ Q GY+G G+ A +
Sbjct: 521 PMDVSLSASGDLYIADRDNYRIRRVSSGVIS-----TFAGNGQSGYSGD-GGQATSAALS 574
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR-DGPSEDAKFSNDF 193
GV + + G +Y++D+ N +RK+ +G +TTIAG S A F DG + N
Sbjct: 575 QAYGVKVIN-GEVYISDSNNYKVRKVDGSGIITTIAG--SGAAPFNGDGLMATSSNMNHP 631
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
V + P+ +++ D N +R + LN
Sbjct: 632 TDVALLPSGEMIIADTDNYRIRMVLLN 658
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 65/249 (26%), Positives = 106/249 (42%), Gaps = 32/249 (12%)
Query: 52 DGNVLQF----ENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQ 107
+G ++ F ++G+ + + N + Y ++V +GE++ D N + K+ +
Sbjct: 188 NGTIVTFAGSGQSGHTGDGGLATNAAMALAYGVKVFSNGEVYISDSFNFKVRKVDVNGTI 247
Query: 108 YSRGRLVAGSFQG-YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
+ VAGS Y+G + A N+P V D G L +ADT N IR + G
Sbjct: 248 TT----VAGSGVAPYSGD-NILATAANLNYPSDVFRFDTGELVIADTNNNRIRLVMPNG- 301
Query: 167 TTIAGGKSNVAGFRDGP-SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCE 225
T I + A F DG +E+ S +V ++ LL+ D GN +R + C
Sbjct: 302 TIITTAGNGTASFSDGEFAEENGLSLPTGLVLIQN--GLLIADAGNKRIRLLQRTIFSCY 359
Query: 226 YQYNSISPTDILMVVG-AVLVGYVTCMLQQGF-GPFFFSRTQQPSE-------SEFNEET 276
+ S D + G VG C GF GP T PS+ ++F +
Sbjct: 360 GK----STLDSNVCSGQGRCVGNNICQCNAGFEGP-----TCLPSKYYITTVATDFKDPQ 410
Query: 277 EIKELSNKE 285
+I +LSN +
Sbjct: 411 KIAKLSNGD 419
>gi|418707100|ref|ZP_13267936.1| putative lipoprotein [Leptospira interrogans serovar Hebdomadis
str. R499]
gi|410763306|gb|EKR34037.1| putative lipoprotein [Leptospira interrogans serovar Hebdomadis
str. R499]
Length = 358
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 56/109 (51%), Gaps = 5/109 (4%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIA 170
L AGS +G +G + F+ P + +D + NLYV + N IRKI V+T++
Sbjct: 156 LFAGSSSDLSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLS 215
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
GG ++G+ DG A+F + + Y R SLL D + +R+I L
Sbjct: 216 GG---ISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDL 261
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
L AG+ G VDG + A F P G+ +D GN+YV+D +N IRKI +G
Sbjct: 59 LFAGT--GVAASVDGTTSTASFKTPFGLEVDTSGNIYVSDQINNLIRKIDPSG 109
>gi|290995061|ref|XP_002680150.1| predicted protein [Naegleria gruberi]
gi|284093769|gb|EFC47406.1| predicted protein [Naegleria gruberi]
Length = 880
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 52/90 (57%), Gaps = 5/90 (5%)
Query: 131 EARFNHPKGVTMDD-KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGP-SEDA 187
A+ P+GV++D KG++Y+AD+ N IRK+ + ++TIAG S AGF DG + A
Sbjct: 426 NAKLYFPRGVSVDPFKGDVYIADSYNNRIRKVSNGFISTIAGTGS--AGFTGDGELAIAA 483
Query: 188 KFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
K + V Y + + ++D NA +R I
Sbjct: 484 KLDTPYSVAYSNTSGLVYILDTNNARIRNI 513
>gi|262405960|ref|ZP_06082510.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|262356835|gb|EEZ05925.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
Length = 459
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 58/123 (47%), Gaps = 19/123 (15%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHP-KGVTMD----------DKGNLYVADTLNLAIRKIGD 163
AG F Y G+ D N ARF++P +G + D + Y+ D N IRKI
Sbjct: 337 AGEFGSY-GYKDASQNSARFDNPMQGCFVLNEEYVAEQRLDVYDFYLTDAANHCIRKITP 395
Query: 164 AGVTTIAGGKSN------VAGFRDG-PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
G+ T G+ + V+G+ DG P E A+F+ + Y T + V D GN +R
Sbjct: 396 DGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLCYEESTGTFYVGDNGNHRVRT 455
Query: 217 ISL 219
I+L
Sbjct: 456 IAL 458
>gi|418714767|ref|ZP_13275259.1| putative lipoprotein [Leptospira interrogans str. UI 08452]
gi|410788941|gb|EKR82646.1| putative lipoprotein [Leptospira interrogans str. UI 08452]
Length = 358
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 56/109 (51%), Gaps = 5/109 (4%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIA 170
L AGS +G +G + F+ P + +D + NLYV + N IRKI V+T++
Sbjct: 156 LFAGSSSDLSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLS 215
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
GG ++G+ DG A+F + + Y R SLL D + +R+I L
Sbjct: 216 GG---ISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDL 261
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
L AG+ G VDG + A F P G+ +D GN+YV+D +N IRKI +G
Sbjct: 59 LFAGT--GVAASVDGTTSTASFKTPFGLEVDTSGNIYVSDQINNLIRKIDPSG 109
>gi|167534306|ref|XP_001748831.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772793|gb|EDQ86441.1| predicted protein [Monosiga brevicollis MX1]
Length = 5844
Score = 50.4 bits (119), Expect = 0.002, Method: Composition-based stats.
Identities = 40/148 (27%), Positives = 72/148 (48%), Gaps = 5/148 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + + +D + + E N+N +++ L+ + G H DG+ ++A F+
Sbjct: 780 PYGVALYKDDSIL-LTERNNNSIRLVNLLTGETECLNHGGHDVPLAAHRDGRLSDACFHR 838
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGD--AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P G+ + G+ VAD+ N IR I + V T A G+ AG +G A+F+ D
Sbjct: 839 PSGICQLEDGSFLVADSANHCIRSISRSLSTVKTFA-GQPGQAGGAEGAVSKAQFNQPSD 897
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLNQD 222
++ V +L+ D N ++R + L +D
Sbjct: 898 LL-VLDEKRILISDEANNSIRLLQLQKD 924
>gi|290995104|ref|XP_002680171.1| predicted protein [Naegleria gruberi]
gi|284093791|gb|EFC47427.1| predicted protein [Naegleria gruberi]
Length = 928
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 68/144 (47%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P+ + VS + E++ D N I KI G +V + G G +G A+
Sbjct: 13 PFGVFVSSNNEVYIADYGNQRIRKIL------KNGNIVTIAGNGTAGFRGDNGPATNAQL 66
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+P V + +Y+AD N IRKI + G + TIAG + +GP+ +A+ +N +
Sbjct: 67 YNPYSVFVSSNNEVYIADFSNHRIRKILENGKIVTIAGNGTGGFSGDNGPATNAQLNNPY 126
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
V+V + ++D N +R+I
Sbjct: 127 S-VFVSSNNEVYIVDYNNHRIRKI 149
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 67/144 (46%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQGYTGHVDGKPNEARF 134
PY + VS + E++ D N I KI G++V AG+ G +G A+
Sbjct: 69 PYSVFVSSNNEVYIADFSNHRIRKIL------ENGKIVTIAGNGTGGFSGDNGPATNAQL 122
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
N+P V + +Y+ D N IRKI G + TIAG + +GP+ +A+ +N
Sbjct: 123 NNPYSVFVSSNNEVYIVDYNNHRIRKILKNGNIVTIAGNGTGGFSGDNGPATNAQLNNPM 182
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
V+V + + D N +R+I
Sbjct: 183 G-VFVSSNNEVYIADYYNHRIRKI 205
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 11/106 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQGYTGHVDGKPNEARF 134
PY + VS + E++ VD N I KI G +V AG+ G +G A+
Sbjct: 452 PYSVFVSSNNEVYIVDYNNHRIRKIL------ENGNIVTIAGNGTGGFSGDNGPATNAQL 505
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGF 179
N+P GV + +Y+AD N IRKI + G + TIAG + AGF
Sbjct: 506 NNPMGVFVSSNNEVYIADYYNHRIRKILENGNIVTIAGNGT--AGF 549
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 11/106 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQGYTGHVDGKPNEARF 134
PY + VS + E++ VD N I KI G +V AG+ G +G A+
Sbjct: 125 PYSVFVSSNNEVYIVDYNNHRIRKIL------KNGNIVTIAGNGTGGFSGDNGPATNAQL 178
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGF 179
N+P GV + +Y+AD N IRKI + G + TIAG + AGF
Sbjct: 179 NNPMGVFVSSNNEVYIADYYNHRIRKILENGNIVTIAGNGT--AGF 222
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 68/156 (43%), Gaps = 17/156 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG------------- 123
PY + VS + E++ VD N I KI + + G F G G
Sbjct: 379 PYSVFVSSNNEVYIVDYNNHRIRKILKNGNIVTIAGNGTGGFSGDNGPATNAQLNNPMVV 438
Query: 124 -HVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRD 181
H+ G A+ N+P V + +Y+ D N IRKI + G + TIAG + +
Sbjct: 439 IHL-GPATNAQLNNPYSVFVSSNNEVYIVDYNNHRIRKILENGNIVTIAGNGTGGFSGDN 497
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
GP+ +A+ +N V+V + + D N +R+I
Sbjct: 498 GPATNAQLNNPMG-VFVSSNNEVYIADYYNHRIRKI 532
Score = 45.4 bits (106), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 48/89 (53%), Gaps = 6/89 (6%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAK 188
A+ N P GV + +Y+AD N IRKI G + TIAG + AGFR +GP+ +A+
Sbjct: 8 AQLNSPFGVFVSSNNEVYIADYGNQRIRKILKNGNIVTIAGNGT--AGFRGDNGPATNAQ 65
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
N + V+V + + D N +R+I
Sbjct: 66 LYNPYS-VFVSSNNEVYIADFSNHRIRKI 93
>gi|116620348|ref|YP_822504.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223510|gb|ABJ82219.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 915
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 57/117 (48%), Gaps = 9/117 (7%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
VP G L+ D N +I K+T + +V G +G +G AR
Sbjct: 541 VPGACAFDGKGNLYIADTGNHSIRKVT---ADGVISTVVGNGTVGASGD-EGAAASARLA 596
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVAGFR--DGPSEDAKF 189
P+G+T+DD GNLY+ DT N +R++ G+ TIAG + AGF GP+ DA
Sbjct: 597 SPRGLTVDDNGNLYIGDTGNNRVRQVTADGIMHTIAG--TGPAGFAGDGGPAADAAL 651
Score = 46.6 bits (109), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 48/88 (54%), Gaps = 7/88 (7%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFNHPKGVTMDD 144
G L+ D N + K++ + + +AG+ G++G G AR N P G+ +DD
Sbjct: 61 GNLYVADTNNHRVRKVSLGGTVTT----IAGTGIAGFSGD-GGLAVNARLNLPYGLALDD 115
Query: 145 KGNLYVADTLNLAIRKIG-DAGVTTIAG 171
GN+YVAD N +R+IG D + TIAG
Sbjct: 116 FGNIYVADLGNERVRRIGSDGAIVTIAG 143
Score = 45.4 bits (106), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 54/104 (51%), Gaps = 21/104 (20%)
Query: 126 DGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--D 181
DG P A+F+ G+ D GNLYVADT N +RK+ G VTTIAG + +AGF
Sbjct: 40 DGGPATAAQFSDMSGIAADRLGNLYVADTNNHRVRKVSLGGTVTTIAG--TGIAGFSGDG 97
Query: 182 GPSEDAKFS-------NDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G + +A+ + +DF +YV D GN +R+I
Sbjct: 98 GLAVNARLNLPYGLALDDFGNIYVA--------DLGNERVRRIG 133
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 68/143 (47%), Gaps = 6/143 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G L+ VD N +++ L+ + VAG+ G G A+
Sbjct: 486 PKGVCLDRGGTLYIVDTSNHRVLR----LAADGTVQTVAGNGSGGYAGDGGSARFAQLRV 541
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P D KGNLY+ADT N +IRK+ D ++T+ G + A +G + A+ ++ +
Sbjct: 542 PGACAFDGKGNLYIADTGNHSIRKVTADGVISTVVGNGTVGASGDEGAAASARLASPRGL 601
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V +L + D GN +RQ++
Sbjct: 602 T-VDDNGNLYIGDTGNNRVRQVT 623
>gi|220914451|ref|YP_002489760.1| NHL repeat containing protein [Arthrobacter chlorophenolicus A6]
gi|219861329|gb|ACL41671.1| NHL repeat containing protein [Arthrobacter chlorophenolicus A6]
Length = 674
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 58/109 (53%), Gaps = 8/109 (7%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTI- 169
+VAG+ G G +DG +E+ F G+ D GN++VAD+ A+R I D G T+
Sbjct: 380 IVAGN--GLEGLLDGPAHESWFAQSSGLAEDADGNIWVADSETSALRTLVIDDTGSLTVK 437
Query: 170 -AGGKSNVA-GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
A GK GFRDG + +A+ + V V P S+ + D N A+R+
Sbjct: 438 TAVGKGLFDFGFRDGTAAEARLQHPLGVT-VLPDGSVAIADTYNGAVRR 485
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA--GVTTIAGG 172
G DG EAR HP GVT+ G++ +ADT N A+R+ A V+T+A G
Sbjct: 448 GFRDGTAAEARLQHPLGVTVLPDGSVAIADTYNGAVRRFDPATGTVSTLARG 499
>gi|294647916|ref|ZP_06725468.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
gi|294806312|ref|ZP_06765159.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
gi|345510467|ref|ZP_08790034.1| hypothetical protein BSAG_00676 [Bacteroides sp. D1]
gi|292636824|gb|EFF55290.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
gi|294446568|gb|EFG15188.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
gi|295085612|emb|CBK67135.1| IPT/TIG domain./NHL repeat. [Bacteroides xylanisolvens XB1A]
gi|345454427|gb|EEO48966.2| hypothetical protein BSAG_00676 [Bacteroides sp. D1]
Length = 476
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 58/123 (47%), Gaps = 19/123 (15%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHP-KGVTMD----------DKGNLYVADTLNLAIRKIGD 163
AG F Y G+ D N ARF++P +G + D + Y+ D N IRKI
Sbjct: 354 AGEFGSY-GYKDASQNSARFDNPMQGCFVLNEEYVAEQRLDVYDFYLTDAANHCIRKITP 412
Query: 164 AGVTTIAGGKSN------VAGFRDG-PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
G+ T G+ + V+G+ DG P E A+F+ + Y T + V D GN +R
Sbjct: 413 DGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLCYEESTGTFYVGDNGNHRVRT 472
Query: 217 ISL 219
I+L
Sbjct: 473 IAL 475
>gi|417783289|ref|ZP_12431009.1| putative lipoprotein [Leptospira interrogans str. C10069]
gi|409953418|gb|EKO07917.1| putative lipoprotein [Leptospira interrogans str. C10069]
Length = 358
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 59/109 (54%), Gaps = 5/109 (4%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAG-VTTIA 170
L AGS +G +G + F+ P + +D + NLYV + N IRKI ++G V+T++
Sbjct: 156 LFAGSSSDLSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGMVSTLS 215
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
GG ++G+ DG A+F + + Y R SLL D + +R+I L
Sbjct: 216 GG---ISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDL 261
Score = 43.1 bits (100), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
L AG+ G VDG + A F P G+ +D GN+YV+D +N IRKI +G
Sbjct: 59 LFAGT--GVAASVDGTTSTASFKTPFGLEVDTSGNIYVSDQINNLIRKIDPSG 109
>gi|24215844|ref|NP_713325.1| hypothetical protein LA_3145 [Leptospira interrogans serovar Lai
str. 56601]
gi|45656856|ref|YP_000942.1| hypothetical protein LIC10968 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|386074980|ref|YP_005989298.1| hypothetical protein LIF_A2535 [Leptospira interrogans serovar Lai
str. IPAV]
gi|417765006|ref|ZP_12412972.1| putative lipoprotein [Leptospira interrogans serovar Bulgarica str.
Mallika]
gi|418666993|ref|ZP_13228411.1| putative lipoprotein [Leptospira interrogans serovar Pyrogenes str.
2006006960]
gi|418699338|ref|ZP_13260303.1| putative lipoprotein [Leptospira interrogans serovar Bataviae str.
L1111]
gi|418723579|ref|ZP_13282417.1| putative lipoprotein [Leptospira interrogans str. UI 12621]
gi|418727930|ref|ZP_13286513.1| putative lipoprotein [Leptospira interrogans str. UI 12758]
gi|421083673|ref|ZP_15544545.1| putative lipoprotein [Leptospira santarosai str. HAI1594]
gi|421102269|ref|ZP_15562874.1| putative lipoprotein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|421122244|ref|ZP_15582528.1| putative lipoprotein [Leptospira interrogans str. Brem 329]
gi|24197042|gb|AAN50343.1| hypothetical protein LA_3145 [Leptospira interrogans serovar Lai
str. 56601]
gi|45600092|gb|AAS69579.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|353458770|gb|AER03315.1| hypothetical protein LIF_A2535 [Leptospira interrogans serovar Lai
str. IPAV]
gi|400352606|gb|EJP04785.1| putative lipoprotein [Leptospira interrogans serovar Bulgarica str.
Mallika]
gi|409962948|gb|EKO26678.1| putative lipoprotein [Leptospira interrogans str. UI 12621]
gi|410344796|gb|EKO95953.1| putative lipoprotein [Leptospira interrogans str. Brem 329]
gi|410367934|gb|EKP23317.1| putative lipoprotein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|410433778|gb|EKP78117.1| putative lipoprotein [Leptospira santarosai str. HAI1594]
gi|410757315|gb|EKR18927.1| putative lipoprotein [Leptospira interrogans serovar Pyrogenes str.
2006006960]
gi|410761708|gb|EKR27881.1| putative lipoprotein [Leptospira interrogans serovar Bataviae str.
L1111]
gi|410777322|gb|EKR57287.1| putative lipoprotein [Leptospira interrogans str. UI 12758]
gi|456822102|gb|EMF70597.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
LT1962]
Length = 358
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 56/109 (51%), Gaps = 5/109 (4%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIA 170
L AGS +G +G + F+ P + +D + NLYV + N IRKI V+T++
Sbjct: 156 LFAGSSSDLSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSTLS 215
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
GG ++G+ DG A+F + + Y R SLL D + +R+I L
Sbjct: 216 GG---ISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDL 261
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
L AG+ G VDG + A F P G+ +D GN+YV+D +N IRKI +G
Sbjct: 59 LFAGT--GVAASVDGTTSTASFKTPFGLEVDTSGNIYVSDQINNLIRKIDPSG 109
>gi|222054715|ref|YP_002537077.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
gi|221564004|gb|ACM19976.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
Length = 2393
Score = 50.1 bits (118), Expect = 0.002, Method: Composition-based stats.
Identities = 42/145 (28%), Positives = 66/145 (45%), Gaps = 10/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARFN 135
P V DG ++ D N+ I ++ + S + ++G DG P A N
Sbjct: 1589 PEDAAVDLDGNIYIADSANNRIRRVDK--ASGSIATIAGNGLAEFSG--DGGPAYMAGLN 1644
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNVAGFRDGPSEDAKFSND 192
P V +D GN+Y++D LN +RKI A +TT+AG G S G DG A D
Sbjct: 1645 RPFAVALDTSGNIYISDNLNYRVRKIDLASFIITTVAGNGTSGATG--DGGLATAASLGD 1702
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
+ V ++ + D G+ +R++
Sbjct: 1703 IRGLAVDTAGNIYIADSGSNGIRKV 1727
Score = 44.3 bits (103), Expect = 0.15, Method: Composition-based stats.
Identities = 46/150 (30%), Positives = 74/150 (49%), Gaps = 10/150 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI---TPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEA 132
P I S + FA D VNS V+ + L++ VAG+ G+TG G +
Sbjct: 1526 PLYITASTTLKFFARDAVNSEPVQTQVYSITLAEQGIINTVAGNGVAGFTGD-GGIAVGS 1584
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA--GVTTIAG-GKSNVAGFRDGPSEDAKF 189
N+P+ +D GN+Y+AD+ N IR++ A + TIAG G + +G GP+ A
Sbjct: 1585 SLNYPEDAAVDLDGNIYIADSANNRIRRVDKASGSIATIAGNGLAEFSG-DGGPAYMAGL 1643
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+ F V + + ++ + D N +R+I L
Sbjct: 1644 NRPF-AVALDTSGNIYISDNLNYRVRKIDL 1672
Score = 40.8 bits (94), Expect = 1.5, Method: Composition-based stats.
Identities = 34/114 (29%), Positives = 55/114 (48%), Gaps = 8/114 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + G L+ + + IV+I S S +VAG+ G G +A
Sbjct: 1743 PEGVTLDRAGNLYYAETWGNVIVRIDKATSTKS---IVAGNGMGGYSGDGGPATQASLYA 1799
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR-DGPSEDA 187
P + +D GN+Y++DT N IRK+ A ++T+AG + AG+ DG + A
Sbjct: 1800 PHRIVLDGNGNIYISDTFNHRIRKVNSATGLISTLAG--TGTAGYSGDGGAATA 1851
>gi|29349707|ref|NP_813210.1| hypothetical protein BT_4299 [Bacteroides thetaiotaomicron
VPI-5482]
gi|298383965|ref|ZP_06993526.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|383120644|ref|ZP_09941370.1| hypothetical protein BSIG_2353 [Bacteroides sp. 1_1_6]
gi|29341617|gb|AAO79404.1| putative cell surface protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251840310|gb|EES68392.1| hypothetical protein BSIG_2353 [Bacteroides sp. 1_1_6]
gi|298263569|gb|EFI06432.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 495
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 59/134 (44%), Gaps = 12/134 (8%)
Query: 97 NIVKITPPLSQYSRGRLVAGS--FQGYT---GHVDGKPNEARFNHPKGVTMDDKGNLYVA 151
++ KITP G AG ++G G DG A+F HP+ + D G +Y+A
Sbjct: 362 DVSKITPEDKDTYNGESYAGKAIYEGVMNGKGWEDGLLKNAKFRHPRQICFTDDGKMYIA 421
Query: 152 DTLNLAIRKIGD------AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 205
D+ N IR I A VTT G G++DG E AKF V +
Sbjct: 422 DSGNSCIRVIDTTMPKERAAVTTPI-GLPGAEGYKDGGPEIAKFHFPCGVAVNSDGTIVY 480
Query: 206 VIDRGNAALRQISL 219
V D N +R++S+
Sbjct: 481 VADTQNKVIRKLSI 494
>gi|290972086|ref|XP_002668792.1| predicted protein [Naegleria gruberi]
gi|284082314|gb|EFC36048.1| predicted protein [Naegleria gruberi]
Length = 679
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 72/148 (48%), Gaps = 18/148 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA----GSFQGYTGHVDGKPNEA 132
P + VS + E++ D N I KI G +V G++ G++G +G A
Sbjct: 13 PSSVFVSSNNEVYIADFCNHRIRKILE------NGNIVTIAGNGNY-GFSGD-NGPATNA 64
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKF 189
+FN+P V + K +Y+ D N +IRKI + G + TIAG + GF GP+ +A+
Sbjct: 65 QFNYPCSVFVSSKNEVYITDYSNHSIRKILENGNIITIAGNGT--VGFSGDSGPATNAQL 122
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQI 217
N V+V + D+ N +R+I
Sbjct: 123 YNP-SSVFVSSKNEVYFTDQHNNRIRKI 149
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 72/146 (49%), Gaps = 16/146 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA----GSFQGYTGHVDGKPNEA 132
PY + VS + E++ D N I KI G +V G++ G++G +G A
Sbjct: 181 PYSVFVSSNNEVYITDYSNHRIRKILE------NGNIVTIAGNGNY-GFSGD-NGPATNA 232
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSN 191
+ N P V + + +Y++D N IRKI + G + TIAG + +GP+ +A+ N
Sbjct: 233 QLNRPNSVFVSNN-EVYISDQSNQRIRKILENGNIITIAGNGNYGFSGDNGPATNAQL-N 290
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
+ V+V + + D+ N +R+I
Sbjct: 291 RPNSVFVSNN-EVYISDQSNQRIRKI 315
Score = 38.9 bits (89), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 67/142 (47%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS E++ D+ N+ I KI L + + G++G +G A+ +
Sbjct: 125 PSSVFVSSKNEVYFTDQHNNRIRKI---LENGNIITIAGNGTYGFSGD-NGPATNAQLYN 180
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P V + +Y+ D N IRKI + G + TIAG + +GP+ +A+ N +
Sbjct: 181 PYSVFVSSNNEVYITDYSNHRIRKILENGNIVTIAGNGNYGFSGDNGPATNAQL-NRPNS 239
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
V+V + + D+ N +R+I
Sbjct: 240 VFVSNN-EVYISDQSNQRIRKI 260
Score = 38.5 bits (88), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 45/87 (51%), Gaps = 2/87 (2%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
A+ +P V + +Y+AD N IRKI + G + TIAG + +GP+ +A+F+
Sbjct: 8 AQLFNPSSVFVSSNNEVYIADFCNHRIRKILENGNIVTIAGNGNYGFSGDNGPATNAQFN 67
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQI 217
V+V + + D N ++R+I
Sbjct: 68 YPCS-VFVSSKNEVYITDYSNHSIRKI 93
>gi|395225280|ref|ZP_10403807.1| NHL repeat protein, partial [Thiovulum sp. ES]
gi|394446527|gb|EJF07348.1| NHL repeat protein, partial [Thiovulum sp. ES]
Length = 4349
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/59 (45%), Positives = 36/59 (61%), Gaps = 2/59 (3%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGG 172
AG Y G DG + RF +P+G+ D++GNL+VAD+ N I+KI D VTT A G
Sbjct: 788 AGEMDAY-GDEDGNLSTVRFYNPRGLAFDEEGNLFVADSNNSKIKKITPDGDVTTFASG 845
>gi|290992574|ref|XP_002678909.1| predicted protein [Naegleria gruberi]
gi|284092523|gb|EFC46165.1| predicted protein [Naegleria gruberi]
Length = 1626
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 66/144 (45%), Gaps = 9/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY +++ DG ++ D N+ I KI G GS + + A+ +
Sbjct: 183 PYDVKIGSDGAIYIADYGNNRIRKIANNTITTVAGNGTIGS-----DSDNAEGTSAKLSG 237
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ + G++Y++DT N IRK+ G+ T G + V+GF G DAK +
Sbjct: 238 PSGIAISSTGDVYISDTNNNKIRKLSTNGIITTFAG-TGVSGFF-GDKADAKRARLSGPR 295
Query: 197 YVRPTCS--LLVIDRGNAALRQIS 218
+ T S L + D N +RQ+S
Sbjct: 296 GLAITASGVLYIADTNNNRIRQVS 319
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 49/90 (54%), Gaps = 2/90 (2%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
A N P+G+T+D +Y AD+ N IRKI D V TIAG K+ + P A+ +
Sbjct: 122 ASANSPRGITVDSNDEVYFADSKNHRIRKILKDGTVITIAGTKTGGYNGDNIPQTTAQLN 181
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
N +D V + ++ + D GN +R+I+ N
Sbjct: 182 NPYD-VKIGSDGAIYIADYGNNRIRKIANN 210
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 73/160 (45%), Gaps = 28/160 (17%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G+++ D N + KIT LV+G+ + + A+ N+P GVT+D K
Sbjct: 360 GDIYFADSTNQRLSKITSD----GEIELVSGTGEANFFGENVVATTAKLNNPSGVTVDPK 415
Query: 146 -GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDG----------PSEDAKFSNDF 193
G+LY ADT N IRKI + ++TIAG ++ + G DG PSE
Sbjct: 416 TGDLYFADTKNNRIRKITPSKIISTIAGTQTTILG--DGELATKASLIAPSE-------- 465
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISP 233
+ + PT + + D G+ +R+I N Y + P
Sbjct: 466 --IILSPTGEIYISDSGHHRIRKILTNGTIITYAGTGLKP 503
Score = 45.4 bits (106), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 63/145 (43%), Gaps = 7/145 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P I V + E++ D N I KI + + G + G D P A+ N
Sbjct: 127 PRGITVDSNDEVYFADSKNHRIRKILKDGTVITIAGTKTGGYNG-----DNIPQTTAQLN 181
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
+P V + G +Y+AD N IRKI + +TT+AG + + + AK S +
Sbjct: 182 NPYDVKIGSDGAIYIADYGNNRIRKIANNTITTVAGNGTIGSDSDNAEGTSAKLSGPSGI 241
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
+ T + + D N +R++S N
Sbjct: 242 A-ISSTGDVYISDTNNNKIRKLSTN 265
Score = 42.0 bits (97), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 67/152 (44%), Gaps = 18/152 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG---YTGHVDGKPNEAR 133
PY I ++ DGELF D+ N + K+ ++ G +V + G Y G + A
Sbjct: 519 PYGIALNSDGELFIADQNNYRVRKV------FTNGTIVTIAGNGERTYNGD-NILATSAA 571
Query: 134 FNHPKGVTMDDK-GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG---FRDGPSEDAKF 189
P + +D K G LYV + IRKI GV + G V G + DG E
Sbjct: 572 VEAPTNIALDSKSGALYVGSS---RIRKIDSKGVISTVAGTGIVTGIFLYTDGI-ESTSA 627
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
+ + +Y+ L+ DR + +R+++ +
Sbjct: 628 NINTGGLYLLSNGDLIFSDRSSQLIRRLTFTR 659
>gi|410096564|ref|ZP_11291551.1| hypothetical protein HMPREF1076_00729 [Parabacteroides goldsteinii
CL02T12C30]
gi|409226528|gb|EKN19437.1| hypothetical protein HMPREF1076_00729 [Parabacteroides goldsteinii
CL02T12C30]
Length = 457
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 67/134 (50%), Gaps = 4/134 (2%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G L+ ++ ++ TP ++Q +V G+ +G +GK A+ +G+ D +
Sbjct: 327 GRLYRLNPYHTPAGHSTPWITQNEIEHIV-GTGKGAAKEGNGKA--AQLGEIEGMAADQE 383
Query: 146 GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 205
GN+Y+AD N I K+ + TI G +G++DG ++A F+ +DV P L
Sbjct: 384 GNVYLADYTNHVIWKVDEEFNATIFAGVPGESGYKDGKPQEALFNKPYDVA-ATPDGILY 442
Query: 206 VIDRGNAALRQISL 219
V D N +R I++
Sbjct: 443 VADTYNYLIRCIAI 456
>gi|290989086|ref|XP_002677176.1| predicted protein [Naegleria gruberi]
gi|284090782|gb|EFC44432.1| predicted protein [Naegleria gruberi]
Length = 2103
Score = 50.1 bits (118), Expect = 0.002, Method: Composition-based stats.
Identities = 40/115 (34%), Positives = 59/115 (51%), Gaps = 13/115 (11%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQ-GYTGHVDGKP-NEARFNHPKGVT 141
GE++ D N I KI YS G + +AG+ + Y G D P A+ +P G+
Sbjct: 125 GEVYFADYANHRIRKI------YSNGTITTIAGNGEPNYNG--DNIPATSAQLGYPIGIA 176
Query: 142 MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
+ G +++ADTLN IRKI + G +TTIAG + GP+ +A + F V
Sbjct: 177 VSSGGEVFIADTLNNRIRKISNNGMITTIAGNGTGGYSGDGGPAVNAMLNTPFGV 231
Score = 47.4 bits (111), Expect = 0.017, Method: Composition-based stats.
Identities = 36/106 (33%), Positives = 46/106 (43%), Gaps = 4/106 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I VS GE+F D +N+ I KI S +AG+ G G A N
Sbjct: 172 PIGIAVSSGGEVFIADTLNNRIRKI----SNNGMITTIAGNGTGGYSGDGGPAVNAMLNT 227
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
P GV +D GN+ AD +N IRK+ +G G A DG
Sbjct: 228 PFGVAIDSDGNIVFADLVNNRIRKVYSSGTIVTIVGTGEAANSPDG 273
Score = 46.2 bits (108), Expect = 0.034, Method: Composition-based stats.
Identities = 37/120 (30%), Positives = 54/120 (45%), Gaps = 21/120 (17%)
Query: 132 ARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 189
A N+P G+ +D N Y+AD+ N IRK+ +G +TTIAGG S+ F
Sbjct: 335 ATLNNPVGIAIDSSNNEFYIADSNNNRIRKVSSSGKITTIAGGTSS-------------F 381
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVT 249
N+ P + G+ +L L+ D Q+ I+ T L+ A VGY T
Sbjct: 382 GNNV------PAIPAFISPIGSMSLNNNQLSFADSSGQFRQINLTSNLITTLAGTVGYTT 435
Score = 45.4 bits (106), Expect = 0.061, Method: Composition-based stats.
Identities = 39/141 (27%), Positives = 59/141 (41%), Gaps = 10/141 (7%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD--GKPNEARFNHPKGV 140
S +GEL D N I K++ + G + + G G G N P G+
Sbjct: 507 STNGELIISDMNNHRIRKVS------NNGIITTIAGNGTAGFCGDGGLAVNTCLNRPNGI 560
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG-PSEDAKFSNDFDVVYVR 199
+ G LY+AD N IRK+ + G+ T G N DG + +A + DV +
Sbjct: 561 AISSSGELYIADYGNHRIRKVSNNGIITTIAGNGNTIYNGDGIDAANASLYSPVDVS-IG 619
Query: 200 PTCSLLVIDRGNAALRQISLN 220
+ + D GN +R+I N
Sbjct: 620 ANNEIYIADAGNYRIRKIFTN 640
Score = 44.3 bits (103), Expect = 0.13, Method: Composition-based stats.
Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 8/143 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS +GEL D N+ I K+ + + + GY G +G A+ +
Sbjct: 890 PKDVAVSSNGELIIADYSNNRIRKV---FTNGTITTIAGTGTNGYGGD-NGLATTAKLSL 945
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV++ G +Y+++T IRK+ G + TIAG SN +G + A N
Sbjct: 946 PVGVSISSGGEIYISETNR--IRKVLTNGTIYTIAGTLSNGYKGDNGLASLASL-NAPGT 1002
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V + P+ L D N+ +R+++
Sbjct: 1003 VSIGPSGELYFADTENSVIRKLT 1025
Score = 40.0 bits (92), Expect = 2.8, Method: Composition-based stats.
Identities = 32/111 (28%), Positives = 50/111 (45%), Gaps = 20/111 (18%)
Query: 53 GNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGR 112
G+ L N +LV T+ + +S GE++ D N I KI ++ G
Sbjct: 822 GDGLNASNAFLVSTLFD------------ISNSGEIYIADTGNHRIRKI------FTNGT 863
Query: 113 LV--AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
++ AG+ G G+ A+ N+PK V + G L +AD N IRK+
Sbjct: 864 IITIAGNGIGGYAGDGGQATSAQLNNPKDVAVSSNGELIIADYSNNRIRKV 914
Score = 39.7 bits (91), Expect = 3.6, Method: Composition-based stats.
Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 14/147 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGS-FQGYTGHVDGKPNEAR 133
P + + + E++ D N I KI ++ G +V AG+ G++G +G + A
Sbjct: 613 PVDVSIGANNEIYIADAGNYRIRKI------FTNGTIVTIAGTGTNGFSGD-NGLGSNAT 665
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA--GFRDGPSEDAKFSN 191
+P V + GN+Y D + IRKI G T GK+ G G + +A+ S
Sbjct: 666 IGYPSSVLFN-SGNVYFTDIVYCVIRKIYSNGTITTISGKAGTCTYGGDGGKASNAQLSY 724
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ + T + + D N +R IS
Sbjct: 725 PAGIA-ISSTGDIYISDNYNHRIRVIS 750
Score = 38.9 bits (89), Expect = 6.2, Method: Composition-based stats.
Identities = 29/96 (30%), Positives = 47/96 (48%), Gaps = 14/96 (14%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT---GHVDGKPNEARFNHPKGVTM 142
G ++ D V I KI YS G + S + T G GK + A+ ++P G+ +
Sbjct: 677 GNVYFTDIVYCVIRKI------YSNGTITTISGKAGTCTYGGDGGKASNAQLSYPAGIAI 730
Query: 143 DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
G++Y++D N IR I +++ G SN+AG
Sbjct: 731 SSTGDIYISDNYNHRIRVI-----SSVTGIISNIAG 761
>gi|195456364|ref|XP_002075106.1| GK23381 [Drosophila willistoni]
gi|194171191|gb|EDW86092.1| GK23381 [Drosophila willistoni]
Length = 721
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 63/129 (48%), Gaps = 14/129 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY-TGHVDGKPNEARFN 135
P KI S+ G +A+ + +N V + +V G+ +G +DG +RFN
Sbjct: 231 PAKIARSQLGHRYAIADAGNNRVLVVL------NDGVVEHKIGGFESGFIDGDLTTSRFN 284
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVA---GFRDGPSEDAKFS 190
+P+G+ D+ L VADT N A+R+I G V T+AG + G R GP + S
Sbjct: 285 NPQGIAFLDEDTLIVADTKNHALRQISLQGGIVETLAGTGHQCSERTGGRTGPVQ--SMS 342
Query: 191 NDFDVVYVR 199
+ +DV R
Sbjct: 343 SPWDVAIFR 351
>gi|418707795|ref|ZP_13268612.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. UI 08368]
gi|410771897|gb|EKR47094.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. UI 08368]
gi|456968671|gb|EMG09839.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
str. LT2186]
Length = 358
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 55/108 (50%), Gaps = 3/108 (2%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG 171
L AGS + G +G + F+ P + +D + NLYV + N IRKI ++G +I
Sbjct: 156 LFAGSSSAFGGLQNGDRLNSLFDSPFFMDIDPERNLYVGELSNHTIRKINLNSGTVSILS 215
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G ++G+ DG A+F + + Y R SLL D + +R+I L
Sbjct: 216 G--GISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDL 261
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
L AG+ G VDG + A F P G+ +D GN+YV+D +N IRKI +G
Sbjct: 59 LFAGT--GVAASVDGTTSTASFKTPFGLEVDTSGNIYVSDQINNLIRKIDPSG 109
>gi|359151030|ref|ZP_09183771.1| Redoxin domain-containing protein [Streptomyces sp. S4]
Length = 568
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 67/149 (44%), Gaps = 9/149 (6%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA-GFRD 181
G VDG EA F P G+ D L+VAD+ A+R + G A G G RD
Sbjct: 288 GLVDGPGAEAWFAQPSGLAADGDARLWVADSETSALRWVDPEGTVHTAVGTGLFDFGHRD 347
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVG 241
G + A F + V V P S+ V D N ALR+ D + ++++ TD+ G
Sbjct: 348 GEAAQALFQHPL-AVTVLPDGSVAVSDTYNHALRRY----DPASGEVSTLA-TDLREPSG 401
Query: 242 AVLVGYVTCMLQQGFGPFFFSRTQQPSES 270
AVL G +++ +R + P E+
Sbjct: 402 AVLAGEEIVVVESAR--HRLTRLRLPEEA 428
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 50/84 (59%), Gaps = 5/84 (5%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAG--VTTIAG-GKS 174
+G G DG P EARF+ P+G+ + +G+ + VADT+N A+R++ A VTT AG G+
Sbjct: 170 EGERGLADGGPAEARFSEPQGLVLTPEGDAVIVADTVNHALRRVELASGVVTTPAGTGRQ 229
Query: 175 NVAGF-RDGPSEDAKFSNDFDVVY 197
G GP+ + S+ +DV +
Sbjct: 230 WWQGSPTSGPAREVDLSSPWDVAW 253
>gi|417763451|ref|ZP_12411429.1| putative lipoprotein [Leptospira interrogans str. 2002000624]
gi|417776933|ref|ZP_12424763.1| putative lipoprotein [Leptospira interrogans str. 2002000621]
gi|418671130|ref|ZP_13232484.1| putative lipoprotein [Leptospira interrogans str. 2002000623]
gi|409940657|gb|EKN86296.1| putative lipoprotein [Leptospira interrogans str. 2002000624]
gi|410573292|gb|EKQ36344.1| putative lipoprotein [Leptospira interrogans str. 2002000621]
gi|410581799|gb|EKQ49606.1| putative lipoprotein [Leptospira interrogans str. 2002000623]
Length = 358
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 56/109 (51%), Gaps = 5/109 (4%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIA 170
L AGS +G +G + F+ P + +D + NLYV + N IRKI V+T++
Sbjct: 156 LFAGSSSDLSGLQNGDRLNSLFDSPFFMDIDPERNLYVGELNNHTIRKINLNSGTVSTLS 215
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
GG ++G+ DG A+F + + Y R SLL D + +R+I L
Sbjct: 216 GG---ISGYLDGDLASARFKSPLGIAYNRKMNSLLAADIQDHRIRKIDL 261
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
L AG+ G VDG + A F P G+ +D GN+YV+D +N IRKI +G
Sbjct: 59 LFAGT--GVAASVDGTTSTASFKTPFGLEVDTSGNIYVSDQINNLIRKIDPSG 109
>gi|290986743|ref|XP_002676083.1| predicted protein [Naegleria gruberi]
gi|284089683|gb|EFC43339.1| predicted protein [Naegleria gruberi]
Length = 733
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 70/144 (48%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I VS + E++ D N I KI L + + G++G +G A+ +
Sbjct: 13 PVGIFVSSNNEVYIADYSNHRIRKI---LKNGNIATIAGKGTCGFSGD-NGPATNAQIYY 68
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSNDF 193
P V + +Y+AD N IRKI + G + TIAG + + GF +GP+ +A+ +
Sbjct: 69 PSSVFVSSNNEVYIADQSNHRIRKILENGNIVTIAG--NGIGGFSGDNGPATNAQIYYPY 126
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
V+V + ++D GN +R+I
Sbjct: 127 S-VFVSSNNVVYIVDYGNNRVRKI 149
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 80/177 (45%), Gaps = 27/177 (15%)
Query: 58 FENGYLVETVIEGNEIG--------------VVPYKIRVSEDGELFAVDEVNSNIVKITP 103
ENG +V I GN IG PY + VS + ++ VD N+ + KI
Sbjct: 94 LENGNIV--TIAGNGIGGFSGDNGPATNAQIYYPYSVFVSSNNVVYIVDYGNNRVRKIL- 150
Query: 104 PLSQYSRGRLVAGSFQGYTGHV--DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
G +V + G +G +G A+ N+P GV + +Y+AD N IRKI
Sbjct: 151 -----GNGNIVTIAGNGTSGFSGDNGPATNAQLNNPVGVFVSSNNEVYIADQSNHRIRKI 205
Query: 162 GDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+ G + TIAG + G +GP+ +A+ + V + ++D GN +R+I
Sbjct: 206 LENGNIVTIAGNGTGGFGGDNGPATNAQLYIPYSVFV--SNNEVYIVDYGNNRIRKI 260
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 50/99 (50%), Gaps = 11/99 (11%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEAR 133
+PY + VS + E++ VD N+ I KI G +V + G +G +G A+
Sbjct: 236 IPYSVFVSNN-EVYIVDYGNNRIRKIL------GNGNIVTIAGNGTSGFSGDNGPATNAQ 288
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
N P V + + +Y+AD N IRKI + G + TIAG
Sbjct: 289 LNRPSSVFVSNN-EVYIADLNNHKIRKILENGNIITIAG 326
>gi|373253064|ref|ZP_09541182.1| thiol-disulfide isomerase-like thioredoxin [Nesterenkonia sp. F]
Length = 659
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 66/147 (44%), Gaps = 13/147 (8%)
Query: 77 PYKIRVSED-GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P+ + SE G L I + P + R++AG+ G G DG +EA F
Sbjct: 328 PWDVLWSEAAGTLIVAMAGTHQIFEFVPTTGEL---RVLAGT--GLEGLTDGSADEAWFA 382
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG------VTTIAGGKSNVAGFRDGPSEDAKF 189
G+ D G++++AD+ ++R + A V TI G GFRDG A+
Sbjct: 383 QTSGLAADAAGDVWIADSETSSLRVLRTAEDPKAPRVETIVGEGMFDFGFRDGDPAQARL 442
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQ 216
+ V + P S+LV D N A+R+
Sbjct: 443 QHPLGVAAL-PDGSVLVADTYNGAIRR 468
>gi|288801999|ref|ZP_06407440.1| conserved hypothetical protein [Prevotella melaninogenica D18]
gi|288335434|gb|EFC73868.1| conserved hypothetical protein [Prevotella melaninogenica D18]
Length = 486
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 48/101 (47%), Gaps = 4/101 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI----GDAGVTTIAGGKSNVAG 178
G DG A+F +P+ +T G LY+AD+ N IR I G T G AG
Sbjct: 385 GWEDGALKNAKFCYPRQMTFTKDGKLYIADSGNHCIRMIDTTLGKNARVTTPIGVPQSAG 444
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
F+DG E AKF+ V ++ V D N +R++S+
Sbjct: 445 FQDGGVELAKFNWPTGVAVSADGSTVYVADSKNQVIRELSI 485
>gi|290974765|ref|XP_002670115.1| predicted protein [Naegleria gruberi]
gi|284083670|gb|EFC37371.1| predicted protein [Naegleria gruberi]
Length = 784
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 100/240 (41%), Gaps = 34/240 (14%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG-GKSNVA 177
GY+G +G AR N P GV + G +Y+AD N IRKI + ++TIAG G+ +
Sbjct: 141 GYSGD-NGLATRARLNTPTGVAVASNGEVYIADKSNHVIRKISLNGNISTIAGNGEEGFS 199
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDIL 237
G DG + N + + T L + D N +R+I N + N
Sbjct: 200 G--DGGNAKTAQLNSPIGLSISSTGELYIADSKNHVIRKIDANGIISTFAGN-------- 249
Query: 238 MVVGAVLVGY-------VTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPI 290
+ GY +L +G FF+ T + ++ + I+++S K I
Sbjct: 250 ----GTINGYGGDGSQAKQALLNTPYGVFFYESTGEVYIAD-TLNSLIRKVS-KSGIIST 303
Query: 291 VESMKEEPGW-------PSFGQLIIDLSKLALEAMGSMFL-NFVPFRFRSSGTKGNLTPL 342
V + G+ P+ L+ + +AL ++G MF+ + R TKGN L
Sbjct: 304 VAGVPNSSGYSREDENVPATSALLSAPTSVALSSLGEMFIADNGNLYIRKVDTKGNSITL 363
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 61/123 (49%), Gaps = 10/123 (8%)
Query: 58 FENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKI-TPPLSQYSRGRLVAG 116
F + V+TV+ G +G P + ++E GEL D + I KI T + G L G
Sbjct: 472 FASNGTVDTVVSG--LGS-PQAVILTESGELLVADRSSHVIRKISTNGVMSIIAGVLEDG 528
Query: 117 SFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSN 175
F G DG + +F+ P+ + + G+LY+AD N IRK+ G +TT+AG +
Sbjct: 529 GFNG-----DGLATKTKFSGPQDIALAPNGDLYIADYDNYLIRKLSKNGNITTVAGNGDS 583
Query: 176 VAG 178
G
Sbjct: 584 TVG 586
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 64/147 (43%), Gaps = 21/147 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P +I S++G++F D N I +I S + +V+G
Sbjct: 447 PTRIACSKNGDIFIADLFNGAIRRIFA--SNGTVDTVVSG-----------------LGS 487
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDV 195
P+ V + + G L VAD + IRKI GV +I G GF DG + KFS D+
Sbjct: 488 PQAVILTESGELLVADRSSHVIRKISTNGVMSIIAGVLEDGGFNGDGLATKTKFSGPQDI 547
Query: 196 VYVRPTCSLLVIDRGNAALRQISLNQD 222
+ P L + D N +R++S N +
Sbjct: 548 A-LAPNGDLYIADYDNYLIRKLSKNGN 573
Score = 38.9 bits (89), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 63/140 (45%), Gaps = 31/140 (22%)
Query: 87 ELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSF-QGYTGHVDGKPNE-ARFNHPKGVTM 142
E F + I KI+P +SQY AG+ GY+G DG + A+FN P +
Sbjct: 401 ETFIAESAYHKIRKISPNGIISQY------AGTGDHGYSG--DGSDAKLAKFNEPTRIAC 452
Query: 143 DDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 200
G++++AD N AIR+I + V T+ G + P V +
Sbjct: 453 SKNGDIFIADLFNGAIRRIFASNGTVDTVVSGLGS-------P----------QAVILTE 495
Query: 201 TCSLLVIDRGNAALRQISLN 220
+ LLV DR + +R+IS N
Sbjct: 496 SGELLVADRSSHVIRKISTN 515
>gi|395646873|ref|ZP_10434733.1| PKD domain containing protein [Methanofollis liminatans DSM 4140]
gi|395443613|gb|EJG08370.1| PKD domain containing protein [Methanofollis liminatans DSM 4140]
Length = 2489
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 13/83 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ I V DG +F D N+NI K T S G + + G G DG+ FN
Sbjct: 1170 PHGIVVDADGNVFVSDAGNNNIQKFT------STGTFI--TKWGTAGSGDGQ-----FNV 1216
Query: 137 PKGVTMDDKGNLYVADTLNLAIR 159
P+G+ +D +GN++VAD+LN I+
Sbjct: 1217 PRGIAVDSRGNVFVADSLNHRIQ 1239
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 62/143 (43%), Gaps = 22/143 (15%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P+ I V D ++ D NS I K S G +A G G DG+ F+
Sbjct: 1075 MPHGIAVDSDSNVYVTDTWNSRIQKFD------STGTFIAK--WGSYGTGDGQ-----FD 1121
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+G+T+D G++YVAD N I+K G G G + D +F +
Sbjct: 1122 FPQGITIDADGSIYVADNANQRIQKFDSNGTFITKWGS--------GGTGDGEFDRPHGI 1173
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V V ++ V D GN +++ +
Sbjct: 1174 V-VDADGNVFVSDAGNNNIQKFT 1195
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 29/41 (70%), Gaps = 1/41 (2%)
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTI 169
P ++ F++PKGV +DD GN+YVADT N I+K DA T I
Sbjct: 1683 PGDSEFSYPKGVAVDDAGNVYVADTNNHRIQKF-DANGTFI 1722
Score = 41.6 bits (96), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 43/95 (45%), Gaps = 13/95 (13%)
Query: 72 EIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE 131
+ G+ P + V G L+ D +NS I K + G +A ++ Y +
Sbjct: 1545 QFGLGPDSVAVDRAGNLYVTDPINSRIQKFD------NSGTFIA-AWGSYGAGI------ 1591
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
+F P G+ +D N+YVAD+LN I+K G
Sbjct: 1592 GQFRSPTGIAVDADSNVYVADSLNNRIQKFSSTGT 1626
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 42/106 (39%), Gaps = 16/106 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + DG ++ D N I K S G + G TG + F+
Sbjct: 1123 PQGITIDADGSIYVADNANQRIQKFD------SNGTFITKWGSGGTG-------DGEFDR 1169
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
P G+ +D GN++V+D N I+K G K AG DG
Sbjct: 1170 PHGIVVDADGNVFVSDAGNNNIQKFTSTGTFIT---KWGTAGSGDG 1212
>gi|290970020|ref|XP_002668012.1| predicted protein [Naegleria gruberi]
gi|284081028|gb|EFC35268.1| predicted protein [Naegleria gruberi]
Length = 243
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 71/142 (50%), Gaps = 7/142 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS + E+F D+ N+ I KI + L+AG+ Q G +G ++ +
Sbjct: 34 PSGVFVSTNNEVFIADKNNNRIRKIVKNGNIV----LIAGNGQTGCGGDNGSATSSQLYY 89
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDV 195
P+ V + + +ADT N IRKI + + TIAG G+ +G +GP+ A+ +
Sbjct: 90 PQSVFVSTNNEICIADTFNHRIRKIENGRIVTIAGNGQPGYSG-DNGPATTAQLHRPYS- 147
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
V+V + + D N ++R+I
Sbjct: 148 VFVSANNEVYIADTFNHSIRKI 169
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 70/145 (48%), Gaps = 13/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQ-GYTGHVDGKPNEAR 133
P + VS + E+ D N I KI GR+V AG+ Q GY+G +G A+
Sbjct: 90 PQSVFVSTNNEICIADTFNHRIRKI-------ENGRIVTIAGNGQPGYSGD-NGPATTAQ 141
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSND 192
+ P V + +Y+ADT N +IRKI ++G + TIAG + G G + +A+ N
Sbjct: 142 LHRPYSVFVSANNEVYIADTFNHSIRKIDESGNIETIAGNEQPGYGGDGGYATNAQL-NH 200
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
V++ + + + N R+I
Sbjct: 201 PSGVFISTNYEIYITETNNHTTRKI 225
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 49/97 (50%), Gaps = 7/97 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFN 135
PY + VS + E++ D N +I KI + +AG+ Q GY G G A+ N
Sbjct: 145 PYSVFVSANNEVYIADTFNHSIRKI----DESGNIETIAGNEQPGYGGD-GGYATNAQLN 199
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
HP GV + +Y+ +T N RKI + G + TIAG
Sbjct: 200 HPSGVFISTNYEIYITETNNHTTRKILENGNIITIAG 236
>gi|86738767|ref|YP_479167.1| serine/threonine protein kinase [Frankia sp. CcI3]
gi|86565629|gb|ABD09438.1| serine/threonine protein kinase [Frankia sp. CcI3]
Length = 863
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 65/146 (44%), Gaps = 6/146 (4%)
Query: 74 GVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
G+ PY + V DG L I KITP + +AG+ G G A+
Sbjct: 543 GLSPYSVDVDPDGSLLVSSLATDRIQKITPAGAVSD----LAGTGAGGISGDGGPATAAQ 598
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSND 192
+ P D GN+Y+ D N IRKI AG +TTIAG G GP+ AK N
Sbjct: 599 LDGPGSTARDKAGNIYIGDAKNNRIRKISPAGIITTIAGTGDAGYGGDGGPATAAKI-NS 657
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
+ V P S+ + D N +R+IS
Sbjct: 658 AEKVTTGPDGSVYLSDYENHRIRKIS 683
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 70/145 (48%), Gaps = 6/145 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P +++++DG L+ D + I K+TP + VAG+ +G G AR N
Sbjct: 714 PNDLQMTDDGTLYFADLASDTIQKVTPDGIITT----VAGTGEGGFSGDGGPATRARLNV 769
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P D LY+AD N IR++ GV TTIAG +G GP+ A+F N V
Sbjct: 770 PSLTVGPDGRTLYLADYRNHRIRRVDPNGVITTIAGTGGEGSGGDGGPATAAQFKNPSSV 829
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V + +L + D GN +R+I N
Sbjct: 830 A-VDGSGALYIADNGNDRVRRIDPN 853
Score = 45.8 bits (107), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 48/97 (49%), Gaps = 11/97 (11%)
Query: 79 KIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
K+ DG ++ D N I KI+P ++ Y V GYTG DG P A+ N
Sbjct: 660 KVTTGPDGSVYLSDYENHRIRKISPQGIITTY-----VGTGVAGYTG--DGGPATAAKIN 712
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG 171
P + M D G LY AD + I+K+ D +TT+AG
Sbjct: 713 GPNDLQMTDDGTLYFADLASDTIQKVTPDGIITTVAG 749
>gi|365861024|ref|ZP_09400809.1| alkyl hydroperoxide reductase [Streptomyces sp. W007]
gi|364009547|gb|EHM30502.1| alkyl hydroperoxide reductase [Streptomyces sp. W007]
Length = 560
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 62/126 (49%), Gaps = 11/126 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P K + DG D +V++ + G V F G G DG P+EARF+
Sbjct: 132 PGKALLLPDGGFLVSDTTRHRLVELD------ADGETVRRHFGTGERGLSDGGPDEARFS 185
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVT-TIAG-GKSNVAGF-RDGPSEDAKFSN 191
P+G+ + G + VADT+N A+R + GVT T+AG G+ G GP+ + S+
Sbjct: 186 EPQGLAVLPDGRVAVADTVNHALRALDLTTGVTSTLAGTGRQWWQGTPTSGPAREVDLSS 245
Query: 192 DFDVVY 197
+DV +
Sbjct: 246 PWDVAW 251
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 50/97 (51%), Gaps = 7/97 (7%)
Query: 77 PYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + VS DGE L+ D S++ + ++ R + G F GH DG ++A
Sbjct: 300 PSGLAVSADGERLWVADSETSSLRWVD--RDEHVRTAVGTGLFD--FGHRDGAADQALLQ 355
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
HP GVT G++ ++DT N A+R+ A VTT+A
Sbjct: 356 HPLGVTALPDGSVAISDTYNHALRRYDPASGEVTTLA 392
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 5/107 (4%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG-DAGVTTI 169
R+ AG+ G VDG EA F P G+ + G L+VAD+ ++R + D V T
Sbjct: 277 RVAAGTTN--EGLVDGPAAEAWFAQPSGLAVSADGERLWVADSETSSLRWVDRDEHVRTA 334
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
G G RDG ++ A + V + P S+ + D N ALR+
Sbjct: 335 VGTGLFDFGHRDGAADQALLQHPLGVTAL-PDGSVAISDTYNHALRR 380
>gi|380694052|ref|ZP_09858911.1| hypothetical protein BfaeM_08738 [Bacteroides faecis MAJ27]
Length = 495
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 59/134 (44%), Gaps = 12/134 (8%)
Query: 97 NIVKITPPLSQYSRGRLVAGS--FQGYT---GHVDGKPNEARFNHPKGVTMDDKGNLYVA 151
++ KITP G AG ++G G DG A+F HP+ + D G +Y+A
Sbjct: 362 DVSKITPEDKDTYNGEPYAGKAIYEGVMNGKGWEDGLLKNAKFRHPRQICFTDDGKMYIA 421
Query: 152 DTLNLAIRKIGD------AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 205
D+ N IR + A VTT G G++DG E AKF V +
Sbjct: 422 DSGNSCIRVVDTTMPKERAAVTTPI-GLPGAEGYKDGGPEIAKFHFPCGVAVNSDGTIVY 480
Query: 206 VIDRGNAALRQISL 219
V D N +R++S+
Sbjct: 481 VADTQNKVIRKLSI 494
>gi|116754580|ref|YP_843698.1| NHL repeat-containing protein [Methanosaeta thermophila PT]
gi|116666031|gb|ABK15058.1| NHL repeat containing protein [Methanosaeta thermophila PT]
Length = 504
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 49/97 (50%), Gaps = 7/97 (7%)
Query: 121 YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAG 178
Y G +DG AR HP G+ + G++YVADT N I+K + + T AG S G
Sbjct: 395 YFGDIDGDFGRARLQHPLGLFYKE-GSIYVADTYNHRIKKADLSSGSIHTTAGTGS--PG 451
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALR 215
F DGP A F+ + ++ SL + D N A+R
Sbjct: 452 FADGPGAQAAFNEPSGLTFLGD--SLFIADTNNHAVR 486
>gi|381205317|ref|ZP_09912388.1| NHL repeat containing protein [SAR324 cluster bacterium JCVI-SC
AAA005]
Length = 458
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 79/164 (48%), Gaps = 19/164 (11%)
Query: 64 VETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG 123
V+T++E P + VS L+ +D+ ++K+ PL+ + + G GYT
Sbjct: 232 VQTLLE-------PVDVTVSST-HLYILDQGFRRVLKV--PLAGGAPTSIAGGGSIGYT- 280
Query: 124 HVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGP 183
DG A+FN P T + K +L+VADT N IR+I D + G G
Sbjct: 281 --DGSGISAQFNAPNAFT-NTKDSLFVADTCNNTIRRI-DLDTNIVTSLLPPEPGCGSGS 336
Query: 184 SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQ 227
S+ K ++ D+ L + D+GN+A+++I LN DC+ Q
Sbjct: 337 SDSDKLNSPTDLT--TNGYQLYIADKGNSAIKRIDLN--DCKEQ 376
>gi|329935738|ref|ZP_08285543.1| hypothetical protein SGM_1035 [Streptomyces griseoaurantiacus M045]
gi|329304829|gb|EGG48702.1| hypothetical protein SGM_1035 [Streptomyces griseoaurantiacus M045]
Length = 617
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAG-GKSNV 176
G G DG A F+ P+G+ + + G++ VADT+N A+R+ + GVTT+AG G+
Sbjct: 228 GARGFADGTAEAAAFSEPQGLALLEDGSVVVADTVNHALRRLDVATGGVTTLAGTGRQWW 287
Query: 177 AGF-RDGPSEDAKFSNDFDV 195
G GP+ + S+ +DV
Sbjct: 288 QGSPTSGPAREVDLSSPWDV 307
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
GH DG +A HP GVT G++ V+DT N A+R+ A VTT+A
Sbjct: 399 GHRDGAAEQALLQHPLGVTALPDGSVAVSDTYNHALRRFDPATGEVTTLA 448
>gi|359728310|ref|ZP_09267006.1| hypothetical protein Lwei2_15904 [Leptospira weilii str.
2006001855]
gi|417778551|ref|ZP_12426356.1| putative lipoprotein [Leptospira weilii str. 2006001853]
gi|410781344|gb|EKR65918.1| putative lipoprotein [Leptospira weilii str. 2006001853]
Length = 357
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 57/110 (51%), Gaps = 11/110 (10%)
Query: 118 FQGYTG--HVDGKPNEARFNH----PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTI 169
F Y G +DG N R N P + +D + NLYV + N AIRKI V+T+
Sbjct: 154 FSLYAGISDIDGFQNGDRLNSLFKGPFFMDLDRERNLYVGELGNHAIRKINLNSETVSTL 213
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+GG V+G+ DG A+F + + Y + T SLLV D + +R+I L
Sbjct: 214 SGG---VSGYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDL 260
>gi|456865078|gb|EMF83443.1| putative lipoprotein [Leptospira weilii serovar Topaz str. LT2116]
Length = 357
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 57/110 (51%), Gaps = 11/110 (10%)
Query: 118 FQGYTG--HVDGKPNEARFNH----PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTI 169
F Y G +DG N R N P + +D + NLYV + N AIRKI V+T+
Sbjct: 154 FSLYAGISDIDGFQNGDRLNSLFKGPFFMDLDRERNLYVGELGNHAIRKINLNSETVSTL 213
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+GG V+G+ DG A+F + + Y + T SLLV D + +R+I L
Sbjct: 214 SGG---VSGYLDGDLVSAQFKSPLGITYDQKTDSLLVADLQDHRIRKIDL 260
>gi|395226993|ref|ZP_10405365.1| hypothetical protein ThvES_00021150, partial [Thiovulum sp. ES]
gi|394444658|gb|EJF05823.1| hypothetical protein ThvES_00021150, partial [Thiovulum sp. ES]
Length = 240
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 47/93 (50%), Gaps = 7/93 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I + E+ ++ IVKI+P G + G G+ DG ++ N
Sbjct: 138 PYDIEIDENNNIYVSLPDEHRIVKISP------SGEVQNFVGSGIAGYQDGVGFNSQLNK 191
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTT 168
P+G+T D+KGNL V D+ N IR+I D ++T
Sbjct: 192 PRGMTFDEKGNLLVVDSYNEMIREISPDGKIST 224
>gi|383636240|ref|ZP_09950646.1| redoxin domain-containing protein [Streptomyces chartreusis NRRL
12338]
Length = 605
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSNV 176
G G DG A FN P+G+ + D+G++ VADT+N A+R++ VTT+AG G+
Sbjct: 219 GTRGFADGPTGTAAFNEPQGLALLDEGSVVVADTVNHALRRLDLTTGEVTTLAGTGRQWW 278
Query: 177 AGF-RDGPSEDAKFSNDFDV 195
G GP+ + S+ +DV
Sbjct: 279 QGSPTSGPAREVDLSSPWDV 298
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 42/85 (49%), Gaps = 6/85 (7%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
L+ D S + +TP S ++ G+ GH DG +A HP GVT G+
Sbjct: 359 LWLADSETSALRWVTPDGSVHT----AVGTGLFDFGHRDGSAGQALLQHPLGVTALPDGS 414
Query: 148 LYVADTLNLAIRKIGDAG--VTTIA 170
+ VADT N A+R+ A VTT+A
Sbjct: 415 VAVADTYNHALRRYDPATGEVTTLA 439
>gi|290971380|ref|XP_002668485.1| predicted protein [Naegleria gruberi]
gi|284081915|gb|EFC35741.1| predicted protein [Naegleria gruberi]
Length = 277
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 175
GY G DG N+PK +D KG++Y ADT N IRK+ + +TTIAG K N
Sbjct: 177 GYNG--DGLATATDLNYPKYAQVDSKGDVYFADTYNGLIRKVSNGYITTIAGSKQN 230
>gi|290975594|ref|XP_002670527.1| predicted protein [Naegleria gruberi]
gi|284084087|gb|EFC37783.1| predicted protein [Naegleria gruberi]
Length = 443
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 69/146 (47%), Gaps = 14/146 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P + VS + E++ DE N I K+ G +V + G G +G A+
Sbjct: 13 PRCVFVSSNNEVYIADEGNQRIRKVVE------NGNIVTIAGNGTAGFSGDNGPATSAQL 66
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSN 191
+P G+ + +Y++D+ N IRK+ + G + TIAG + GF +GP+ A+ N
Sbjct: 67 YNPFGIFVSSNNEVYISDSGNNRIRKVLENGNIITIAGNGT--VGFSGDNGPATSAQLYN 124
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
F ++V + + D N +R+I
Sbjct: 125 PFG-IFVSANNEVYISDSNNNIIRKI 149
Score = 46.2 bits (108), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 49/89 (55%), Gaps = 6/89 (6%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAK 188
A+ N+P+ V + +Y+AD N IRK+ + G + TIAG + AGF +GP+ A+
Sbjct: 8 AQLNYPRCVFVSSNNEVYIADEGNQRIRKVVENGNIVTIAGNGT--AGFSGDNGPATSAQ 65
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
N F ++V + + D GN +R++
Sbjct: 66 LYNPFG-IFVSSNNEVYISDSGNNRIRKV 93
Score = 38.9 bits (89), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 9/98 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P+ I VS + E++ D N+ I K+ G ++ + G G +G A+
Sbjct: 69 PFGIFVSSNNEVYISDSGNNRIRKVLE------NGNIITIAGNGTVGFSGDNGPATSAQL 122
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
+P G+ + +Y++D+ N IRKI + G + TIAG
Sbjct: 123 YNPFGIFVSANNEVYISDSNNNIIRKILENGNIVTIAG 160
>gi|345882347|ref|ZP_08833836.1| hypothetical protein HMPREF0666_00012 [Prevotella sp. C561]
gi|345045612|gb|EGW49528.1| hypothetical protein HMPREF0666_00012 [Prevotella sp. C561]
Length = 496
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 47/101 (46%), Gaps = 4/101 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI----GDAGVTTIAGGKSNVAG 178
G DG A+F P+ +T G LY+AD+ N IR I G T G AG
Sbjct: 395 GWEDGALKNAKFCFPRQMTFTKDGKLYIADSGNQCIRMIDTTQGKNARVTTPIGVPQSAG 454
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
F+DG E AKF+ V ++ V D N +R++S+
Sbjct: 455 FQDGGVELAKFNWPTGVAVSADGSTVYVADSKNQVIRELSI 495
>gi|290973472|ref|XP_002669472.1| basal body protein NBP-1 [Naegleria gruberi]
gi|284083020|gb|EFC36728.1| basal body protein NBP-1 [Naegleria gruberi]
Length = 2352
Score = 48.9 bits (115), Expect = 0.006, Method: Composition-based stats.
Identities = 41/150 (27%), Positives = 70/150 (46%), Gaps = 17/150 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD-----GKPNE 131
P+ I + +G+++ + V + I KI+ + ++ LVAG TG D G
Sbjct: 153 PWGIHIPSNGDIYITEYVGNKIRKIS---ASDNKIYLVAG-----TGSYDFSGDGGNAVS 204
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
A+ P V ++ G +++ADT N IRKI G +TTIAG S+ + DG
Sbjct: 205 AKLKSPWSVFVNAIGEIFIADTDNDRIRKIATNGIITTIAGSGSSTS---DGVLATTASL 261
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
V++ P L + + +R++ L+
Sbjct: 262 KKPTSVFISPANELFIAEADGGRIRKVDLS 291
Score = 42.0 bits (97), Expect = 0.69, Method: Composition-based stats.
Identities = 41/154 (26%), Positives = 68/154 (44%), Gaps = 17/154 (11%)
Query: 71 NEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQGYTGHVDGK 128
N GV+ Y + EL+ +D N+ + KI S G LV AG+ + +G
Sbjct: 97 NPGGVLEY------NNELYIMDYGNNRVRKIN------SEGVLVTIAGTGTRSSAGDNGA 144
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSED 186
A N P G+ + G++Y+ + + IRKI D + +AG S G +
Sbjct: 145 ATSASLNGPWGIHIPSNGDIYITEYVGNKIRKISASDNKIYLVAGTGSYDFSGDGGNAVS 204
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
AK + + V+V + + D N +R+I+ N
Sbjct: 205 AKLKSPWS-VFVNAIGEIFIADTDNDRIRKIATN 237
>gi|290971625|ref|XP_002668588.1| predicted protein [Naegleria gruberi]
gi|284082057|gb|EFC35844.1| predicted protein [Naegleria gruberi]
Length = 731
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 64/127 (50%), Gaps = 17/127 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKP-NEA 132
P I + +G+L+ + N+ I+K YS +L VAG+ + GY+G DG P A
Sbjct: 151 PNGIMYTPNGDLYIASQGNNKIMK-------YSNSQLSTVAGNGYAGYSG--DGGPARNA 201
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGK--SNVAGFRDGPSEDAK 188
+ N+P VT+ G +Y+AD N A+RK+ G +T I G S + G P+ D
Sbjct: 202 KLNYPADVTVSSTGEVYIADWGNRAVRKVFRNGTIITLIVSGNGLSELNGVTLTPNGDLL 261
Query: 189 FSNDFDV 195
+S V
Sbjct: 262 YSTQLWV 268
Score = 45.1 bits (105), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 48/92 (52%), Gaps = 5/92 (5%)
Query: 128 KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSE 185
PN N P G+ G+LY+A N I K ++ ++T+AG + AG+ GP+
Sbjct: 142 NPNNDGINGPNGIMYTPNGDLYIASQGNNKIMKYSNSQLSTVAG--NGYAGYSGDGGPAR 199
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+AK + DV V T + + D GN A+R++
Sbjct: 200 NAKLNYPADVT-VSSTGEVYIADWGNRAVRKV 230
>gi|398337816|ref|ZP_10522521.1| hypothetical protein LkmesMB_21054 [Leptospira kmetyi serovar
Malaysia str. Bejo-Iso9]
Length = 359
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 58/110 (52%), Gaps = 5/110 (4%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIA 170
L AGS G G +G + A FN P + D + NLY+ + N IRK + V+T++
Sbjct: 157 LFAGSPGGTAGLQNGDNSGALFNGPFFMDFDRERNLYLGELGNHDIRKFNLNFQSVSTLS 216
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
G G++DG A F + +VY R SLLV+D + +R+I+L+
Sbjct: 217 GSS---LGYQDGSVSTALFKSPIGLVYDRKKNSLLVVDIQDHRIRKINLD 263
>gi|308178347|ref|YP_003917753.1| NHL repeat-containing protein [Arthrobacter arilaitensis Re117]
gi|307745810|emb|CBT76782.1| NHL repeat-containing protein [Arthrobacter arilaitensis Re117]
Length = 635
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 66/145 (45%), Gaps = 12/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + G++ I P S S + AG+ G G DG P+EA F
Sbjct: 313 PWDVLYHPSGQVIIAMAGTHQIFAFNPRTSAVS---VFAGT--GLEGLADGAPDEAWFAQ 367
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI----GDA-GVTTIAGGKSNVAGFRDGPSEDAKFSN 191
G+ +D + N+++AD+ A+R I G A V T G GFRDG ++ A+ +
Sbjct: 368 SSGLALDGQ-NVWIADSETSALRWIEVEEGKAQSVNTAVGTGLFDFGFRDGDAQQARLQH 426
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQ 216
V V P S+ + D N A+R+
Sbjct: 427 PLGVA-VLPDHSIAIADSYNGAVRR 450
>gi|290983716|ref|XP_002674574.1| predicted protein [Naegleria gruberi]
gi|284088165|gb|EFC41830.1| predicted protein [Naegleria gruberi]
Length = 2271
Score = 48.9 bits (115), Expect = 0.006, Method: Composition-based stats.
Identities = 37/139 (26%), Positives = 70/139 (50%), Gaps = 8/139 (5%)
Query: 82 VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVT 141
+S DG ++ D N+ I K ++ + LV S +G+ +DG + A N+P+ +
Sbjct: 481 ISNDGIIYIADYYNNRIAKFE--INNSTLTTLVGRSLKGF---LDGIGSNALLNYPESLI 535
Query: 142 MDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 199
+ +Y +D N AIR + A VTT+AG + N GP++ ++ + + +
Sbjct: 536 IGPDNMIYFSDRDNNAIRSVSTSSALVTTLAGDRFNGFLGDGGPAKSSRLDSP-GPIQLT 594
Query: 200 PTCSLLVIDRGNAALRQIS 218
++ +DRGN +R+I+
Sbjct: 595 LGGEIIFMDRGNQRIRKIT 613
Score = 43.5 bits (101), Expect = 0.22, Method: Composition-based stats.
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 8/108 (7%)
Query: 115 AGS-FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK 173
AGS GY+G G A N P + D GNL ++D+ N IRK+ + ++T+AG
Sbjct: 292 AGSGTSGYSGD-GGLGTSALLNGPSALAFDSSGNLLISDSFNNRIRKVANGTISTLAGNS 350
Query: 174 SNVAGFRDGP-SEDAKFSNDFDVVYVRPTCS---LLVIDRGNAALRQI 217
+ F +G A FS+ V Y S +L+ D N LR++
Sbjct: 351 NR--NFGNGVLGTLASFSSPNSVYYTGNDDSAGGILISDTNNHVLRRL 396
>gi|428166914|gb|EKX35881.1| hypothetical protein GUITHDRAFT_79273, partial [Guillardia theta
CCMP2712]
Length = 318
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 70/155 (45%), Gaps = 18/155 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY ++VS G V + +N+++ S Y+ +AG Q G +G A+FN
Sbjct: 115 PYGVKVSPCGNYVIVSDTGNNMIRKVDIESGYTN--TLAG--QSLAGTANGVGTLAQFNM 170
Query: 137 PKGVTMD-DKGNLYVADTLNLAIRKI-----------GDAGVTTIAGGKSNVAGFRDGPS 184
P VT+D ++ YV+D N IRKI + +AG S VAG D
Sbjct: 171 PVDVTVDWNETVAYVSDQGNNCIRKIDLLTAALDWTSATPSLVVVAG--SGVAGLTDAVG 228
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
A+F N V SLLV D ++ +R+I L
Sbjct: 229 LSAQFYNPTGVAVDWYGASLLVADSMDSTIRRIDL 263
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 68/135 (50%), Gaps = 6/135 (4%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG-DAGVTTIAG 171
VA G G+V G +AR N P GV + GN + V+DT N IRK+ ++G T
Sbjct: 92 VAWIGDGTAGNVQGYGTKARINTPYGVKVSPCGNYVIVSDTGNNMIRKVDIESGYTNTLA 151
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSI 231
G+S +AG +G A+F+ DV V D+GN +R+I L + S
Sbjct: 152 GQS-LAGTANGVGTLAQFNMPVDVTVDWNETVAYVSDQGNNCIRKIDLLT--AALDWTSA 208
Query: 232 SPTDILMVVGAVLVG 246
+P+ +++V G+ + G
Sbjct: 209 TPS-LVVVAGSGVAG 222
>gi|290972172|ref|XP_002668832.1| predicted protein [Naegleria gruberi]
gi|284082360|gb|EFC36088.1| predicted protein [Naegleria gruberi]
Length = 2254
Score = 48.9 bits (115), Expect = 0.006, Method: Composition-based stats.
Identities = 40/140 (28%), Positives = 67/140 (47%), Gaps = 12/140 (8%)
Query: 30 LIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELF 89
++ +L L++ +T+ S + G+ Q + V ++ G P + ++DG+LF
Sbjct: 16 VVFYLLHLIEASTKPSYYITKVGGSK-QCDGALAVNALLGG------PSHLTFNQDGDLF 68
Query: 90 AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD-KGNL 148
+ V I KI +Y +AG + G ++ N P+GV DD G+L
Sbjct: 69 VAEYVFHRIRKI----DRYGYISTIAGQMTKGSDGDGGLATASQMNSPRGVFFDDVTGSL 124
Query: 149 YVADTLNLAIRKIGDAGVTT 168
+VADT N IRKI + GV +
Sbjct: 125 FVADTNNHKIRKIDNFGVIS 144
>gi|290976488|ref|XP_002670972.1| predicted protein [Naegleria gruberi]
gi|284084536|gb|EFC38228.1| predicted protein [Naegleria gruberi]
Length = 704
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 71/149 (47%), Gaps = 13/149 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQ-GYTGHVDGK-PNEA 132
P +I +S + E +NI++ ++ G+LV AG+ G++G DGK +A
Sbjct: 121 PRQITMSTTENALYIAESGNNIIRKLNLMT----GQLVIVAGNLTAGFSG--DGKIATQA 174
Query: 133 RFNHPKGVTMDDKGN--LYVADTLNLAIRKIGD-AGVTTIAGGKSNVAGFRDGPSEDAKF 189
N P+GVT D LY++DTLN +RK+ G+ T G + +GF D +
Sbjct: 175 MLNGPRGVTFDTTTQKYLYISDTLNHIVRKLDIFTGIITTIAGTAGSSGFTDNVLSTSAK 234
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
N V + + + D N +R+I+
Sbjct: 235 LNGPQAVAIMSNGDIYIADTQNNRIRKIT 263
Score = 45.4 bits (106), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 55/108 (50%), Gaps = 8/108 (7%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG- 171
AGS G+T +V A+ N P+ V + G++Y+ADT N IRKI A ++TI G
Sbjct: 219 AGS-SGFTDNV--LSTSAKLNGPQAVAIMSNGDIYIADTQNNRIRKITAATGIISTICGT 275
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G +AG DG + + N +++ L + D N LR+I L
Sbjct: 276 GNGGIAG--DGSAATSAMINSPRDLFLGLQNDLYIADSWNHRLRRIDL 321
>gi|239989050|ref|ZP_04709714.1| hypothetical protein SrosN1_17240 [Streptomyces roseosporus NRRL
11379]
gi|291446053|ref|ZP_06585443.1| alkyl hydroperoxide reductase/Thiol specific antioxidant/Mal
allergen [Streptomyces roseosporus NRRL 15998]
gi|291349000|gb|EFE75904.1| alkyl hydroperoxide reductase/Thiol specific antioxidant/Mal
allergen [Streptomyces roseosporus NRRL 15998]
Length = 605
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 62/126 (49%), Gaps = 11/126 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P K + DG D +V++ + G V F G G DG P+EARF+
Sbjct: 177 PGKALLLPDGGFLVSDTTRHRLVELD------ADGETVRRHFGTGERGLNDGGPDEARFS 230
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVT-TIAG-GKSNVAGF-RDGPSEDAKFSN 191
P+G+ + G + VADT+N AIR + GVT T+AG G+ G GP+ + S+
Sbjct: 231 EPQGLAVLPDGRVAVADTVNHAIRALDLTTGVTSTLAGTGRQWWQGTPTSGPAREVDLSS 290
Query: 192 DFDVVY 197
+D+ +
Sbjct: 291 PWDLAW 296
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 51/107 (47%), Gaps = 5/107 (4%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG-DAGVTTI 169
R+ AG+ G VDG EA F P G+ + G L+VAD+ A+R + D V T
Sbjct: 322 RVAAGTTN--EGLVDGPAAEAWFAQPSGLAVSADGERLWVADSETSALRWVDRDEHVHTA 379
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
G G RDG ++ A + V + P S+ + D N ALR+
Sbjct: 380 VGTGLFDFGHRDGAADQALLQHPIGVTAL-PDGSVAISDMYNHALRR 425
>gi|269926393|ref|YP_003323016.1| alkyl hydroperoxide reductase [Thermobaculum terrenum ATCC BAA-798]
gi|269790053|gb|ACZ42194.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Thermobaculum terrenum ATCC BAA-798]
Length = 493
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 54/98 (55%), Gaps = 10/98 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ ED + + + N N + S + R + G+ G G VDG +EA+FNH
Sbjct: 179 PGKVLALEDEDSLYIADSNHNRIL---ECSLSGKIRRIWGN--GEEGLVDGSASEAKFNH 233
Query: 137 PKGVTMDDKGN-LYVADTLNLAIR--KIGDAGVTTIAG 171
P+G+ + +GN LYVADT N A+R + + V TIAG
Sbjct: 234 PQGMAI--RGNELYVADTENHALRLLHLNEGKVETIAG 269
>gi|290986797|ref|XP_002676110.1| predicted protein [Naegleria gruberi]
gi|284089710|gb|EFC43366.1| predicted protein [Naegleria gruberi]
Length = 1218
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 66/138 (47%), Gaps = 8/138 (5%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFNHPKGVT 141
S +G+++ D +N I KI + S +AG+ GY+G G A N P GV
Sbjct: 215 STNGDIYFCDSLNHRIRKIDGSTAIIST---IAGNGDVGYSGD-GGLATNAALNFPTGVV 270
Query: 142 MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKFSNDFDVVYVRP 200
G LY+ D N+ IR + +G + GG + + D GP+ +AKF+ + V
Sbjct: 271 STPDGTLYIIDNNNV-IRMVNSSGYISTIGGSMDSGNYGDGGPAINAKFAYPTG-ISVSK 328
Query: 201 TCSLLVIDRGNAALRQIS 218
+ V D GN +R+IS
Sbjct: 329 NGDVYVTDTGNLKIRKIS 346
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 78/170 (45%), Gaps = 22/170 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSR--GRLVAGSFQGYTGHVDGKPN-EAR 133
P + + DG L+ +D N+N++++ S G + +G++ DG P A+
Sbjct: 266 PTGVVSTPDGTLYIID--NNNVIRMVNSSGYISTIGGSMDSGNYG------DGGPAINAK 317
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
F +P G+++ G++YV DT NL IRKI G+ G +NV G P+ A
Sbjct: 318 FAYPTGISVSKNGDVYVTDTGNLKIRKISKQTGIIETVIGNTNVYGGDGLPASKAALGYI 377
Query: 193 FDVV------YVRPTCSLLVIDRGNAALRQISLNQ---DDCEYQYNSISP 233
VV Y+ C + +D+ N + I+ N DD N I+P
Sbjct: 378 SQVVSIDGITYISHDCRIRKVDK-NQIVSTIAGNGNRLDDSTIVPNVINP 426
Score = 45.8 bits (107), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 68/148 (45%), Gaps = 17/148 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + ++ DG V +S + IT LS + +GY G D +A+ N
Sbjct: 641 PLDVALTNDGLYLLVLLSDSTLNNIT--LSTLQNSIICGTGVEGYNGD-DILATDAQLNK 697
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P V++D G++Y++D N +RKI + ++TIAG + E+ +N VV
Sbjct: 698 PTSVSVDQNGDIYISD--NSRLRKISNGVISTIAGNGNTT------DIENDVLANMTSVV 749
Query: 197 ----YVRPTCSLLVIDRGNAALRQISLN 220
V P L +IDR A +R +S N
Sbjct: 750 PSFSKVGPNGDLYIIDR--AYIRTVSTN 775
Score = 41.6 bits (96), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 59/113 (52%), Gaps = 8/113 (7%)
Query: 112 RLVAGSFQGYTGHVDGKPNE----ARFNHPKGVTMD-DKGNLYVADTLNLAIRKIG-DAG 165
+ +AG+ + + DG N+ A+ N P+G+ + + G++Y +D N +RKI +G
Sbjct: 68 KAIAGTGELPGSYSDGGANQLALLAQINEPRGIVVSKETGDIYFSDYYNDLVRKIDMKSG 127
Query: 166 VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
++ G +GF DG ++ +K S F + + ++ DR N +R+IS
Sbjct: 128 KISVIAGIPGSSGFNDGQAKLSKVSRPFGIGLINS--DVIFADRNNHRIRKIS 178
>gi|295394954|ref|ZP_06805167.1| NHL repeat containing protein [Brevibacterium mcbrellneri ATCC
49030]
gi|294972287|gb|EFG48149.1| NHL repeat containing protein [Brevibacterium mcbrellneri ATCC
49030]
Length = 655
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 65/147 (44%), Gaps = 33/147 (22%)
Query: 120 GYTGHVDGKPNEA---------------RFNHPKGVTMDDKGNLYVADTLNLAIRKI--G 162
G H+ G NE F P GV + GN++VAD+ A+RK+
Sbjct: 358 GTVAHISGSLNEGLRDTSAAATPFDLDEWFAQPSGVRLASDGNVWVADSETSALRKLDPS 417
Query: 163 DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
VT+ G GF+DGP+E A+ + + + P ++ + D N A+R+
Sbjct: 418 TGAVTSYVGVGLFDFGFQDGPAESARLQHPLGALEL-PDGTVAIADTYNGAIRRF----- 471
Query: 223 DCEYQYNSIS--------PTDILMVVG 241
+ Q N++S P+DIL++ G
Sbjct: 472 --DPQTNTVSTLARGLKEPSDILLMQG 496
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 39/87 (44%), Gaps = 9/87 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITP---PLSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
P +R++ DG ++ D S + K+ P ++ Y L FQ DG AR
Sbjct: 390 PSGVRLASDGNVWVADSETSALRKLDPSTGAVTSYVGVGLFDFGFQ------DGPAESAR 443
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRK 160
HP G G + +ADT N AIR+
Sbjct: 444 LQHPLGALELPDGTVAIADTYNGAIRR 470
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 64/158 (40%), Gaps = 28/158 (17%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ +DG L D + + L L+ G G+ DG A+F+
Sbjct: 195 PGKVVRLDDGRLIVADSGHHSYA-----LYDSDGANLLGRIGSGERGNADGDFATAQFSE 249
Query: 137 PKGVTM------DDKG-NLYVADTLNLAIRKIG--DAGVTTIAG--------GKSNVA-- 177
P GV G +L ADT+N +R I VTT+AG NV
Sbjct: 250 PGGVAQLPPEIAQQVGYHLVAADTVNHTLRGINLETQTVTTVAGTGEQYMVGASDNVPDQ 309
Query: 178 ----GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGN 211
G DGP+ K S+ +DVVY T ++++ GN
Sbjct: 310 PGFFGRYDGPATAVKLSSPWDVVYAPKTSNVVIAMAGN 347
>gi|302543897|ref|ZP_07296239.1| putative NHL repeat protein [Streptomyces hygroscopicus ATCC 53653]
gi|302461515|gb|EFL24608.1| putative NHL repeat protein [Streptomyces himastatinicus ATCC
53653]
Length = 608
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%), Gaps = 4/82 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNV 176
G G DG+ + ARF+ P+G+ G + VADT+N A+R+ A VTT+AG G+
Sbjct: 215 GERGLADGRADRARFSEPQGLAALPDGTVAVADTVNHALRRFDPATGEVTTLAGTGRQWW 274
Query: 177 AGF-RDGPSEDAKFSNDFDVVY 197
G +GP+ D S+ +D+ +
Sbjct: 275 QGSPAEGPARDIDLSSPWDLAW 296
Score = 39.3 bits (90), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
GH DG ++A HP GVT G++ VADT N A+R+
Sbjct: 389 GHRDGAADQALLQHPLGVTALPDGSVAVADTYNHALRR 426
>gi|307202136|gb|EFN81636.1| NHL repeat-containing protein 2 [Harpegnathos saltator]
Length = 688
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 48/91 (52%), Gaps = 9/91 (9%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTI 169
++ G+ QG+ DG A+FN P+GV M D +YVAD N AIRKI + V+TI
Sbjct: 257 HIIGGTNQGFE---DGDFKSAKFNSPQGVCMLDDV-IYVADNNNHAIRKINLSEKSVSTI 312
Query: 170 AGGKS---NVAGFRDGPSEDAKFSNDFDVVY 197
AG S + G R G ++D D + Y
Sbjct: 313 AGTGSQSHDRCGGRSGTNQDLSSPWDVAIYY 343
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 11/100 (11%)
Query: 123 GHVDGKPNEARFNHPKGVTMDD-KGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGF 179
G +DGK A+ HP G+ + + +YVADT N I+ + A T+ G + + F
Sbjct: 459 GDMDGKEYSAKLQHPLGIAWNHLEKQIYVADTYNHKIKSVDVATGYCRTLFGDRKPNSMF 518
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
D S D +++YV D N AL+ I+L
Sbjct: 519 SFNEPSDLAVSPDGNILYVA--------DTNNHALKVINL 550
>gi|302768705|ref|XP_002967772.1| hypothetical protein SELMODRAFT_409051 [Selaginella moellendorffii]
gi|302826179|ref|XP_002994615.1| hypothetical protein SELMODRAFT_432523 [Selaginella moellendorffii]
gi|300137312|gb|EFJ04320.1| hypothetical protein SELMODRAFT_432523 [Selaginella moellendorffii]
gi|300164510|gb|EFJ31119.1| hypothetical protein SELMODRAFT_409051 [Selaginella moellendorffii]
Length = 172
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 36/58 (62%), Gaps = 8/58 (13%)
Query: 166 VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC---SLLVIDRGNAALRQISLN 220
VTTIAGG S GF DGP + A+FS++F + C SLL+ DRGN + +I ++
Sbjct: 41 VTTIAGGSSRKPGFADGPGDTARFSSEFSL-----ACSFGSLLIADRGNRLIHEIQID 93
>gi|290991612|ref|XP_002678429.1| predicted protein [Naegleria gruberi]
gi|284092041|gb|EFC45685.1| predicted protein [Naegleria gruberi]
Length = 992
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 61/130 (46%), Gaps = 15/130 (11%)
Query: 53 GNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGR 112
GN+L +GY + ++ N P + VS GE++ D NS I KI S G+
Sbjct: 343 GNMLDLNSGYNGDNILAVNAKLKSPESVVVSNSGEVYISDTGNSRIRKI------LSNGQ 396
Query: 113 LVA----GSFQG---YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
+V G+F+ Y G N A +P G+ + LY+ADT N IRK+ G
Sbjct: 397 IVTVVGRGNFRNSPSYNGDYILAIN-ANIKNPSGILLSSTNELYIADTENYRIRKVLTNG 455
Query: 166 -VTTIAGGKS 174
+ TIAG S
Sbjct: 456 TIVTIAGTGS 465
>gi|158319005|ref|YP_001511513.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158114410|gb|ABW16607.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 807
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 64/145 (44%), Gaps = 10/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARFN 135
P + +EDG L+ V +S +VAG+ DG P A
Sbjct: 599 PSAVVAAEDGTLYLTAGYRVRKVTTDGLIS------VVAGNADESGTAGDGGPAVNATLT 652
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDF 193
P G+ + D G LYVAD+ +R+I G T+ GK +G+ DGP+ DA ++
Sbjct: 653 SPSGLALADDGTLYVADSSAHTVRRITKDGKITLVAGKPEQSGYEGDDGPAVDALLASPQ 712
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
D+ + P L + D N +R ++
Sbjct: 713 DLA-LGPAGELYIADTYNNVIRMVT 736
Score = 45.1 bits (105), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 53/109 (48%), Gaps = 8/109 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ--GYTGHVDGKPNEARF 134
P + +++DG L+ D + +IT + + LVAG + GY G DG +A
Sbjct: 654 PSGLALADDGTLYVADSSAHTVRRIT----KDGKITLVAGKPEQSGYEGD-DGPAVDALL 708
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDG 182
P+ + + G LY+ADT N IR + D +TT AG + A DG
Sbjct: 709 ASPQDLALGPAGELYIADTYNNVIRMVTPDGKITTFAGSDESTADDADG 757
>gi|326431538|gb|EGD77108.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
Length = 1455
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 64/141 (45%), Gaps = 23/141 (16%)
Query: 113 LVAGS--FQGYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKI---GDAGV 166
+VAG+ +GY H DG+P A N P G+ + G+LY+ADT N IR + D +
Sbjct: 1282 VVAGTPFRKGY--HGDGRPATTAWLNAPTGIAVRGAGDLYIADTGNNRIRYVDLDSDNII 1339
Query: 167 TTIAGGKSNVAGFRDGPSEDAKFSNDFD-----VVYVRPTCSLLVIDRGNAALRQISLNQ 221
T+ G G DG + A + D V T +L++ D LR +SLN
Sbjct: 1340 DTLVG-TGRYGGDVDGSTTTAALETNLDHPEGVAVNAEGT-TLVIADTNKHVLRSVSLNG 1397
Query: 222 DDCEYQYNSISPTDILMVVGA 242
DD SP +L G+
Sbjct: 1398 DD--------SPVTVLAGTGS 1410
>gi|326432343|gb|EGD77913.1| NHL repeat protein [Salpingoeca sp. ATCC 50818]
Length = 1505
Score = 48.5 bits (114), Expect = 0.008, Method: Composition-based stats.
Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 7/110 (6%)
Query: 114 VAGS-FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK----IGDAGVTT 168
VAG+ G++G G P +AR + P+GV + G+L +AD+ N +R+ +G AG+ T
Sbjct: 1395 VAGTGVAGFSGD-GGLPTDARLDTPRGVAVHSSGSLAIADSGNHRVREFNIGVGAAGIIT 1453
Query: 169 IAGGKSNVAGFRDG-PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
G DG + D + + + T ++L +DR N +RQI
Sbjct: 1454 TTAGNGQRGYNGDGMVATDTALNFPTGITFSPLTDNVLFVDRRNRRVRQI 1503
>gi|288921011|ref|ZP_06415303.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288347586|gb|EFC81871.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 801
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 73/163 (44%), Gaps = 18/163 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRG--RLVAGSFQGYTGHVDGKPNEARF 134
P + VS++G+L+ + K+T S G R++AG+ T +G +A F
Sbjct: 594 PRAVAVSDEGDLYIAS--GYRVRKVT------SDGIIRVIAGTSDSGTTGDNGLAEKATF 645
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSND 192
P G+ + D G LYVAD+ +R+I G T+ G G+ G + AK N
Sbjct: 646 TSPSGLALADDGTLYVADSSAHTVRRISTDGKITLVAGTPGAYGYSGDGGAATSAKLYNP 705
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD 235
+ V P L + D N +R ++ + + N+ + TD
Sbjct: 706 HSLA-VGPEGELYIADTYNDVIRVVT-----TDGKINTFAGTD 742
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 70/151 (46%), Gaps = 16/151 (10%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV----AGSFQGYTGHVDGKPN 130
+ P + DG+++ D + I ++ + G++ AG +G++G GK
Sbjct: 536 IAPVAVAAGPDGDVYVADSDDGRIRRV-------ADGKITTVAGAGPGEGFSG-TSGKAV 587
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKF 189
A+ + P+ V + D+G+LY+A +RK+ G+ + G S+ D G +E A F
Sbjct: 588 GAKLSDPRAVAVSDEGDLYIASGYR--VRKVTSDGIIRVIAGTSDSGTTGDNGLAEKATF 645
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
++ + +L V D +R+IS +
Sbjct: 646 TSPSGLALAD-DGTLYVADSSAHTVRRISTD 675
>gi|116007528|ref|NP_001036460.1| CG12547, isoform A [Drosophila melanogaster]
gi|442622271|ref|NP_001260704.1| CG12547, isoform B [Drosophila melanogaster]
gi|17945781|gb|AAL48938.1| RE33981p [Drosophila melanogaster]
gi|30923523|gb|EAA46001.1| CG12547, isoform A [Drosophila melanogaster]
gi|220948326|gb|ACL86706.1| CG12547-PA [synthetic construct]
gi|440214080|gb|AGB93239.1| CG12547, isoform B [Drosophila melanogaster]
Length = 717
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 66/130 (50%), Gaps = 17/130 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT-GHVDGKPNEARFN 135
P KI +G L+A+ + +N V + + G +V G+ G VDG ARFN
Sbjct: 228 PAKIVQGPNG-LYAIADTGNNRVLV------LTAGGVVQHRIGGHQPGFVDGDLTVARFN 280
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG----GKSNVAGFRDGPSEDAKF 189
+P+G+ ++ L VADT N AIR+I +A V T+AG G + G R GP +
Sbjct: 281 NPQGIAFLNENVLIVADTKNHAIRQISLTNAMVETLAGTGIQGNDRIGG-RLGPLQ--PL 337
Query: 190 SNDFDVVYVR 199
S+ +DV R
Sbjct: 338 SSPWDVAIFR 347
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKI 161
G +DG+ A+ HP GVT +D N LYVADT N I+ I
Sbjct: 471 GDIDGRLFSAKLQHPLGVTFNDTNNKLYVADTYNHKIKII 510
>gi|411003633|ref|ZP_11379962.1| hypothetical protein SgloC_12561 [Streptomyces globisporus C-1027]
Length = 605
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 60/126 (47%), Gaps = 11/126 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P K + DG D +V++ + G V F G G DG P EARF+
Sbjct: 177 PGKALLLPDGGFLVSDTTRHRLVELD------ADGETVRRHFGTGERGLSDGGPGEARFS 230
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVT-TIAG-GKSNVAGF-RDGPSEDAKFSN 191
P+G+ + G + VADT+N AIR + GVT T+AG G+ G GP+ S+
Sbjct: 231 EPQGLAVLPDGRVAVADTVNHAIRALDLTTGVTSTLAGTGRQWWQGTPTSGPARGVDLSS 290
Query: 192 DFDVVY 197
+DV +
Sbjct: 291 PWDVAW 296
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 51/107 (47%), Gaps = 5/107 (4%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG-DAGVTTI 169
R+ AG+ G VDG EA F P G+ + G L+VAD+ A+R + D V T
Sbjct: 322 RVAAGTTN--EGLVDGPAAEAWFAQPSGLAVSADGERLWVADSETSALRWVDRDEHVHTA 379
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
G G RDG ++ A + V + P S+ + D N ALR+
Sbjct: 380 VGTGLFDFGHRDGAADQALLQHPIGVTAL-PDGSVAISDTYNHALRR 425
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 5/96 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS DGE V + ++ ++ ++ + G F GH DG ++A H
Sbjct: 345 PSGLAVSADGERLWVADSETSALRWVD-RDEHVHTAVGTGLFD--FGHRDGAADQALLQH 401
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
P GVT G++ ++DT N A+R+ A VTT+A
Sbjct: 402 PIGVTALPDGSVAISDTYNHALRRYDPASGEVTTLA 437
>gi|326435583|gb|EGD81153.1| hypothetical protein PTSG_13151 [Salpingoeca sp. ATCC 50818]
Length = 1551
Score = 48.5 bits (114), Expect = 0.008, Method: Composition-based stats.
Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 7/110 (6%)
Query: 114 VAGS-FQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK----IGDAGVTT 168
VAG+ G++G G P +AR + P+GV + G+L +AD+ N +R+ +G AG+ T
Sbjct: 1441 VAGTGVAGFSGD-GGLPTDARLDTPRGVAVHSSGSLAIADSGNHRVREFNIGVGAAGIIT 1499
Query: 169 IAGGKSNVAGFRDG-PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
G DG + D + + + T ++L +DR N +RQI
Sbjct: 1500 TTAGNGQRGYNGDGMVATDTALNFPTGITFSPLTDNVLFVDRRNRRVRQI 1549
>gi|283779991|ref|YP_003370746.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
gi|283438444|gb|ADB16886.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
Length = 364
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 72/147 (48%), Gaps = 14/147 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKP-NEARF 134
P+ + + DG L+ + N + ++ + VAGS + GY G DG P EA
Sbjct: 52 PFGVEIGPDGALYITEVRNHRVRRLDLKTGAMTT---VAGSGKMGYAG--DGGPATEALL 106
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAG-GKSNVAGFRDGPSEDAKFS 190
N P V D GN+ + N IRK+ DA ++TIAG GK AG GP+++A FS
Sbjct: 107 NEPYEVRFDSHGNMIFVEMKNFVIRKV-DAKTGIISTIAGSGKEGFAG-DGGPAKEAVFS 164
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQI 217
+ + ++ V D GN +R+I
Sbjct: 165 IPHSIA-LDADDNIYVCDLGNHRIRKI 190
Score = 41.6 bits (96), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 11/100 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKP-NEARF 134
PY++R G + V+ N I K+ S +AGS +G+ G DG P EA F
Sbjct: 109 PYEVRFDSHGNMIFVEMKNFVIRKVDAKTGIIST---IAGSGKEGFAG--DGGPAKEAVF 163
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAG 171
+ P + +D N+YV D N IRKI DA ++TIAG
Sbjct: 164 SIPHSIALDADDNIYVCDLGNHRIRKI-DAKTGLISTIAG 202
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 40/94 (42%), Gaps = 19/94 (20%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP------- 129
P + V DG + VD N I KI + AG+ GH K
Sbjct: 279 PKGVAVCPDGGVVVVDTENHVIRKID----------VKAGTISTVPGHTPKKAGGDGDGG 328
Query: 130 --NEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
+A N P G+ +D G++Y+ DTLN +R++
Sbjct: 329 DATKATMNRPHGICVDKDGSIYIGDTLNHRVRRV 362
>gi|197121735|ref|YP_002133686.1| hypothetical protein AnaeK_1325 [Anaeromyxobacter sp. K]
gi|196171584|gb|ACG72557.1| YD repeat protein [Anaeromyxobacter sp. K]
Length = 2350
Score = 48.1 bits (113), Expect = 0.009, Method: Composition-based stats.
Identities = 41/145 (28%), Positives = 70/145 (48%), Gaps = 15/145 (10%)
Query: 79 KIRVSEDGELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNE-ARFN 135
K+ V DG ++ + I ++ P + ++ AG G+ G DG P A+
Sbjct: 1205 KVAVGPDGSVYLTAAAHDAIRRVRPDGIIERF------AGLSSGFGG--DGGPARFAKLR 1256
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVAGF--RDGPSEDAKFSND 192
P GV++ G++Y+ADT N IR++ +G+ +IAG F GP+ A+ +
Sbjct: 1257 GPNGVSVGPDGSVYIADTYNARIRRVDPSGIIESIAGSGFAQPQFWGDGGPALAARLNGT 1316
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
+ V V P L + D NA +R++
Sbjct: 1317 WQAV-VAPDGDLFIADSFNARIRRV 1340
>gi|290976072|ref|XP_002670765.1| predicted protein [Naegleria gruberi]
gi|284084327|gb|EFC38021.1| predicted protein [Naegleria gruberi]
Length = 854
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 78/162 (48%), Gaps = 7/162 (4%)
Query: 60 NGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ 119
N YL + + N +PY+I V+ D ++F VD + N+V+ S +
Sbjct: 252 NSYLGDGGLGINAWLGIPYQIAVNSDLDVFFVDSL-YNVVRRIKQSSGIINAVIGT---G 307
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF 179
G G+ A+ + PKG+ + G +++AD+ N IRK+ G+ G S+V
Sbjct: 308 SGFGGDGGQATSAQLSSPKGLVVSSSGEMFLADSGNNRIRKVFSNGIIVTIAGTSSVGFS 367
Query: 180 RDGP-SEDAKFSN--DFDVVYVRPTCSLLVIDRGNAALRQIS 218
DG S DA+ +N + + +R + + D GN A+R+++
Sbjct: 368 GDGGLSTDAQLNNPVNLAIRQLRSLSEIYISDAGNHAIRKLT 409
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 56/107 (52%), Gaps = 17/107 (15%)
Query: 59 ENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAG 116
+ G +ET++ P + VSE GE++ D N+ I K+ Y G + +AG
Sbjct: 87 DGGLAIETMLSS------PQGVAVSESGEVYLSDSTNNIIRKV------YLNGTIQTIAG 134
Query: 117 S-FQGYTGHVDGKPNEARFNHPKGVTMD-DKGNLYVADTLNLAIRKI 161
S QGY+G +G +F P+G+++ +LY+ADT N +R++
Sbjct: 135 SRTQGYSGD-NGPAVNCQFFSPQGLSLSPSNSDLYIADTFNNVVRRL 180
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 51/102 (50%), Gaps = 2/102 (1%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAG 178
GY G G E + P+GV + + G +Y++D+ N IRK+ G + TIAG ++
Sbjct: 83 GYNGD-GGLAIETMLSSPQGVAVSESGEVYLSDSTNNIIRKVYLNGTIQTIAGSRTQGYS 141
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+GP+ + +F + + L + D N +R++ LN
Sbjct: 142 GDNGPAVNCQFFSPQGLSLSPSNSDLYIADTFNNVVRRLDLN 183
>gi|428169673|gb|EKX38605.1| hypothetical protein GUITHDRAFT_56349, partial [Guillardia theta
CCMP2712]
Length = 210
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 61/116 (52%), Gaps = 9/116 (7%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMD-DKGNLYVADTLNLAIRKIGD--AGVTTI 169
L+AG+ + TG DG N+ F++P G+++ D + +AD LN IR + VTT+
Sbjct: 5 LLAGNHR--TGSQDGAGNDTSFSNPVGISVSADSQYVVIADYLNSLIRLLFVPLLQVTTV 62
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCE 225
AG ++ +G+ DG AKF DVV+ +L+ D N +R SLN C
Sbjct: 63 AG--TSGSGYEDGTCAMAKFDGPIDVVWGIDKSLVLIADFWNNRIR--SLNLSSCS 114
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 69/150 (46%), Gaps = 11/150 (7%)
Query: 77 PYKIRVSEDGELFAV-DEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P I VS D + + D +NS I + PL Q + VAG+ +G+ DG A+F+
Sbjct: 27 PVGISVSADSQYVVIADYLNSLIRLLFVPLLQVTT---VAGT--SGSGYEDGTCAMAKFD 81
Query: 136 HPKGVTMD-DKGNLYVADTLNLAIR--KIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
P V DK + +AD N IR + V+TIAG S G DG + A F+
Sbjct: 82 GPIDVVWGIDKSLVLIADFWNNRIRSLNLSSCSVSTIAG--SGAFGQLDGSTAAATFAGP 139
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
+ LLV + ++R I L+++
Sbjct: 140 CSLALTSQGSKLLVAEFSGHSVRMIDLSKN 169
Score = 38.5 bits (88), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 46/85 (54%), Gaps = 7/85 (8%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKI--GDAGVTTIA 170
+AGS G G +DG A F P + + +G+ L VA+ ++R I VTT+A
Sbjct: 118 IAGS--GAFGQLDGSTAAATFAGPCSLALTSQGSKLLVAEFSGHSVRMIDLSKNTVTTLA 175
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDV 195
G S +AG +DG A+F+N +D+
Sbjct: 176 G--SGIAGLQDGVGLIAQFTNPWDL 198
>gi|241584846|ref|XP_002403871.1| NHL repeat-containing protein [Ixodes scapularis]
gi|215500274|gb|EEC09768.1| NHL repeat-containing protein [Ixodes scapularis]
Length = 621
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 7/81 (8%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGG---KSNVA 177
G DG N+ARFN P+GV D +YVADT N +R++ A V T+AG S++
Sbjct: 280 GWRDGALNDARFNGPQGVVWRDPHFVYVADTGNHTVREVDLEQAQVRTVAGTGQRGSDLE 339
Query: 178 GFRDGPSEDAKFSNDFDVVYV 198
G R GP + S+ +D+ V
Sbjct: 340 GGRQGPQQ--PLSSPWDLCLV 358
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 48/102 (47%), Gaps = 8/102 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMD-DKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGF 179
G DG + R HP GV DK LYVAD+ N IRK+ TT+AG + VAG
Sbjct: 466 GDADGSALDVRLQHPLGVCWSSDKQLLYVADSYNHKIRKVDVQKRLCTTLAG--TGVAGD 523
Query: 180 RDGP-SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
G S++ +F + V L V D N ++ + L+
Sbjct: 524 ATGSFSDEVQFDEPGGLCVV--GSRLYVADTNNHCVKLVHLD 563
>gi|449435944|ref|XP_004135754.1| PREDICTED: NHL repeat-containing protein 2-like [Cucumis sativus]
Length = 1086
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 6/101 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR 180
G DG +E HP GV G +YVAD+ N ++ + VTTIAG + AGF+
Sbjct: 827 GDHDGVGSEVLLQHPLGVFCSKDGQIYVADSYNHKVKMLDPVSKKVTTIAG--TGKAGFK 884
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
DG + +A+ S + L + D N +R + LN
Sbjct: 885 DGTALEAQLSEPSGI--TEAGGRLFIADTNNNVIRYLYLNN 923
Score = 42.0 bits (97), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 58/117 (49%), Gaps = 9/117 (7%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
+ LF D ++ IV ++ S L+ G G DG ++A FN P+G+ +
Sbjct: 614 NNRLFISDSNHNRIV-----VTDLSGNFLLQIGSTGEDGLRDGNFDDATFNRPQGLAYNA 668
Query: 145 KGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSND-FDVVY 197
K N LYVADT N A+R++ V T+AG S + ++ G ++ N +DV +
Sbjct: 669 KKNLLYVADTENHALREVDFVKERVRTLAGDGSKGSDYQGGKEGTSQLLNSPWDVCF 725
>gi|219853189|ref|YP_002467621.1| PKD domain-containing protein [Methanosphaerula palustris E1-9c]
gi|219547448|gb|ACL17898.1| PKD domain containing protein [Methanosphaerula palustris E1-9c]
Length = 930
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 54/116 (46%), Gaps = 17/116 (14%)
Query: 69 EGNEIG--VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD 126
+G+E G + PY I V G ++ VD N+ + K T S G + + G G D
Sbjct: 367 KGSEDGQFIYPYSIAVDSAGNVYVVDTGNNRVQKFT------STGTFI--TQWGGEGFGD 418
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR 180
G+ FN P G+T D GN+YV DT N ++K G +T G S V F
Sbjct: 419 GQ-----FNFPGGITADSAGNVYVVDTENDRVQKFTSTGEFITKWGGDGSGVGEFN 469
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 60/142 (42%), Gaps = 22/142 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G ++ VD N+ + K T S G + + G G NH
Sbjct: 283 PRGIALDSAGNVYVVDTGNNRVQKFT------STGTFITK----WGGEGSGA---GELNH 329
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P GV +DD GN+Y+ DT N ++ G +A + SED +F + +
Sbjct: 330 PHGVALDDAGNVYIIDTWNNRVQIFTSTG--------EFIAKWGSKGSEDGQFIYPYSIA 381
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V ++ V+D GN +++ +
Sbjct: 382 -VDSAGNVYVVDTGNNRVQKFT 402
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 61/143 (42%), Gaps = 22/143 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I V G ++ VD N+ + T S G +A G +G DG+ FN+
Sbjct: 471 PYGIAVDRAGNVYVVDTGNNRVQIFT------STGTFIAQ--WGGSGSRDGQ-----FNY 517
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D GN+YV D N +K G G + D +F+ DV
Sbjct: 518 PGGIAVDSAGNVYVVDESNNRFQKFTSTGEFITKWGSEGLG--------DGEFTYPRDVA 569
Query: 197 YVRPTCSLLVIDRGNAALRQISL 219
V ++ ++D N+ +++ S
Sbjct: 570 -VDSGGNVYIVDESNSRIQKFSW 591
Score = 38.9 bits (89), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 62/142 (43%), Gaps = 24/142 (16%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G +F D +++ I+K T S G + + G G +G+ F +
Sbjct: 48 PEGVAVDGTGAVFVADRLSNRILKFT------STGEFI--TQWGSEGSGNGQ-----FEN 94
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ +D GN+YV DT N ++K G T GG+ S D +FS + +
Sbjct: 95 PGGIAVDSAGNIYVTDTWNHRVQKFTSTGEFITKWGGEG---------SGDGQFSYPYGI 145
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
R ++ V D N +++
Sbjct: 146 AADRA-GNVYVTDTWNHRIQKF 166
>gi|290992224|ref|XP_002678734.1| predicted protein [Naegleria gruberi]
gi|284092348|gb|EFC45990.1| predicted protein [Naegleria gruberi]
Length = 1366
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 89/217 (41%), Gaps = 40/217 (18%)
Query: 77 PYKIRVSEDGELFAVDEVNSNI--VKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
PY I V+ + E++ D +N I + + +S V G+ G++G G A+
Sbjct: 472 PYGIAVTINDEIYIADSLNHRIRFIDVNGNIST------VVGTSIGFSGD-GGLATAAKL 524
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSND 192
N P V++ G+LY+AD N IRK+ + + T AG G+S G G + A S
Sbjct: 525 NAPMDVSLSASGDLYIADRDNYRIRKVLANGTIVTFAGNGQSGHIG-DGGQAISAALSQA 583
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCML 252
+ V V + + D N +R+I ++ G +T +
Sbjct: 584 YGVRVVND--EVYISDSNNFKVRKIDVS-------------------------GVITTIA 616
Query: 253 QQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIP 289
G GPF + + N T++ LSN E I
Sbjct: 617 GTGAGPFNGDNV-LATAANLNHPTDVAFLSNGEMLIA 652
Score = 45.8 bits (107), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 76/163 (46%), Gaps = 15/163 (9%)
Query: 62 YLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRG---RLVAGSF 118
Y + TV G + P KI +G+L D + +I KI+ YS G R+
Sbjct: 403 YYITTVARGLK---DPQKISKLSNGDLIVSDTGDHSIKKIS-----YSTGVVTRIAGTGV 454
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVA 177
G++G G A+FN P G+ + +Y+AD+LN IR I G ++T+ G +
Sbjct: 455 AGFSGD-GGLATLAQFNKPYGIAVTINDEIYIADSLNHRIRFIDVNGNISTVVGTSIGFS 513
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
G G + AK + D V + + L + DR N +R++ N
Sbjct: 514 G-DGGLATAAKLNAPMD-VSLSASGDLYIADRDNYRIRKVLAN 554
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 76/171 (44%), Gaps = 16/171 (9%)
Query: 52 DGNVLQF----ENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQ 107
+G ++ F ++G++ + + N + Y ++V +GE++ D N + K+
Sbjct: 194 NGTIVTFAGNGQSGHIGDGGLATNAAMALAYGVKVFSNGEVYISDSFNFKVRKVDV---- 249
Query: 108 YSRGRL--VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
G + VAGS G + A NHP V D G L +ADT N IR + G
Sbjct: 250 --NGNITTVAGSGAGPFNGDNVLATAANLNHPTDVLRLDTGELIIADTDNYRIRLVMPNG 307
Query: 166 VTTIAGGKSNVAGFRDGP-SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALR 215
T I + A F DG +E+ S ++ ++ LL+ D GN +R
Sbjct: 308 -TIITTAGNGTASFSDGEIAEENGLSLPTGLLMIQN--GLLIADAGNKRIR 355
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 44/96 (45%), Gaps = 5/96 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I VS + E++ D +N I KI + + A +F G G + N
Sbjct: 111 PYGIAVSSNDEIYFADSMNHRIRKIDITGNITTVVGTGANTFSGD----GGLATDCTMNT 166
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
P V++ G LY+AD N IRK+ G + T AG
Sbjct: 167 PMDVSLSASGELYIADMYNYRIRKVLTNGTIVTFAG 202
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 67/147 (45%), Gaps = 10/147 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKI--TPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
P K+ +G++ D + I KI T ++ + G VAG F G +G A+
Sbjct: 55 PQKLARLSNGDILVTDALGHAIKKINGTGVITTIA-GTGVAG-FAGD----NGPAINAQV 108
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
N P G+ + +Y AD++N IRKI G +TT+ G +N G + D +
Sbjct: 109 NKPYGIAVSSNDEIYFADSMNHRIRKIDITGNITTVVGTGANTFSGDGGLATDCTMNTPM 168
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
D V + + L + D N +R++ N
Sbjct: 169 D-VSLSASGELYIADMYNYRIRKVLTN 194
>gi|290975761|ref|XP_002670610.1| predicted protein [Naegleria gruberi]
gi|284084171|gb|EFC37866.1| predicted protein [Naegleria gruberi]
Length = 308
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 69/144 (47%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS + E++ D N I KI L + + G++G +G A+FN+
Sbjct: 13 PSSVFVSSNNEVYIADFCNHRIRKI---LENGNIVTIAGNGNYGFSGD-NGPATNAQFNY 68
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSNDF 193
P V + K +Y+ D N IRKI + G + TIAG + GF +GP+ +A+ N
Sbjct: 69 PCSVFVSSKNEVYITDYSNHRIRKILENGNIITIAGNGT--VGFSGDNGPATNAQLYNP- 125
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
V+V + + D N +R+I
Sbjct: 126 SSVFVSSNNEVYIADFCNHRIRKI 149
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 69/144 (47%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS + E++ D N I KI L + + G++G +G A+FN+
Sbjct: 125 PSSVFVSSNNEVYIADFCNHRIRKI---LENGNIVTIAGNGNYGFSGD-NGPATNAQFNY 180
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSNDF 193
P V + K +Y+ D N IRKI + G + TIAG + GF +GP+ +A+ N
Sbjct: 181 PCSVFVSSKNEVYITDYSNHRIRKILENGNIITIAGNGT--VGFSGDNGPATNAQLYNP- 237
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
V+V + D+ N +R+I
Sbjct: 238 SSVFVSSNNEVYFTDQHNNRIRKI 261
Score = 42.0 bits (97), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 66/146 (45%), Gaps = 14/146 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P + VS E++ D N I KI G ++ + G G +G A+
Sbjct: 69 PCSVFVSSKNEVYITDYSNHRIRKILE------NGNIITIAGNGTVGFSGDNGPATNAQL 122
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSN 191
+P V + +Y+AD N IRKI + G + TIAG + GF +GP+ +A+F+
Sbjct: 123 YNPSSVFVSSNNEVYIADFCNHRIRKILENGNIVTIAGNGN--YGFSGDNGPATNAQFNY 180
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
V+V + + D N +R+I
Sbjct: 181 PCS-VFVSSKNEVYITDYSNHRIRKI 205
>gi|219851088|ref|YP_002465520.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219545347|gb|ACL15797.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 831
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 64/142 (45%), Gaps = 22/142 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + + G ++ VD N + K T S G VA G +G +G+ F++
Sbjct: 132 PYGVAIDSAGNVYVVDRGNHRVQKFT------SSGIFVAK--WGSSGSENGQ-----FSY 178
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+G+ +D GN+YVAD N + K G A G SED +F+ D V
Sbjct: 179 PEGIAIDGTGNVYVADESNHRVEKFTSIGTFLTAWGTKG--------SEDGQFAYP-DGV 229
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V ++ V D GN +++ +
Sbjct: 230 AVDSLGNVYVADSGNNRVQKFT 251
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 64/142 (45%), Gaps = 22/142 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + + G ++ VD N + K T S G VA G +G +G+ F++
Sbjct: 432 PYGVAIDGAGNVYVVDRGNHRVQKFT------SSGIFVAK--WGSSGSENGQ-----FSY 478
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+G+ +D GN+YVAD N + K G A G SED +F+ D V
Sbjct: 479 PEGIAIDGTGNVYVADESNHRVEKFTSIGTFLTAWGTKG--------SEDGQFAYP-DGV 529
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V ++ V D GN +++ +
Sbjct: 530 AVDSLGNVYVADSGNNRVQKFT 551
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 61/142 (42%), Gaps = 22/142 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I G ++ D N+ + K T S G + + G +G +G+ F
Sbjct: 338 PYGIAFDSAGNVYVADSGNNRVQKFT------SSGLFI--TMWGTSGSDNGQ-----FRT 384
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D GN+YVAD N ++K G G S +FS+ + V
Sbjct: 385 PTGIAVDSAGNVYVADRDNNRVQKFTSTGTYLAKWGTSGTG--------SGQFSSPYGVA 436
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+ ++ V+DRGN +++ +
Sbjct: 437 -IDGAGNVYVVDRGNHRVQKFT 457
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 61/142 (42%), Gaps = 22/142 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I G ++ D N+ + K T S G + + G +G +G+ F
Sbjct: 38 PYGIAFDSAGNVYIADSGNNRVQKFT------SSGLFI--TMWGTSGSDNGQ-----FRT 84
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D GN+YVAD N ++K G G S +FS+ + V
Sbjct: 85 PTGIAVDSAGNVYVADRDNNRVQKFTSTGTYLAKWGTSGTG--------SGQFSSPYGVA 136
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
+ ++ V+DRGN +++ +
Sbjct: 137 -IDSAGNVYVVDRGNHRVQKFT 157
Score = 38.9 bits (89), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 41/89 (46%), Gaps = 13/89 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G ++ DE N + K T S G + G G DG+ F +
Sbjct: 179 PEGIAIDGTGNVYVADESNHRVEKFT------SIGTFLTA--WGTKGSEDGQ-----FAY 225
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
P GV +D GN+YVAD+ N ++K +G
Sbjct: 226 PDGVAVDSLGNVYVADSGNNRVQKFTSSG 254
Score = 38.9 bits (89), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 41/89 (46%), Gaps = 13/89 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G ++ DE N + K T S G + G G DG+ F +
Sbjct: 479 PEGIAIDGTGNVYVADESNHRVEKFT------SIGTFLTA--WGTKGSEDGQ-----FAY 525
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
P GV +D GN+YVAD+ N ++K +G
Sbjct: 526 PDGVAVDSLGNVYVADSGNNRVQKFTSSG 554
>gi|392965583|ref|ZP_10331002.1| Tripartite motif-containing protein 71 AltName: Full=Lin-41 homolog
[Fibrisoma limi BUZ 3]
gi|387844647|emb|CCH53048.1| Tripartite motif-containing protein 71 AltName: Full=Lin-41 homolog
[Fibrisoma limi BUZ 3]
Length = 929
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 43/132 (32%), Positives = 60/132 (45%), Gaps = 22/132 (16%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK-PNEARFN 135
PY + V G ++ D N+ I + P S G VAG G+ +G PN+ N
Sbjct: 109 PYGVFVDMLGNVYVADASNNRIQRWAP---GASVGVTVAG------GNGEGSAPNQ--LN 157
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGV--TTIAGGKSNVAGFRDGPSE----DAKF 189
P G+ +D GN+YVAD N I+K G TT+AGG G P++ + F
Sbjct: 158 FPTGLFVDGSGNVYVADNFNHRIQKWGPGATSGTTVAGGN----GLGSAPNQLLSPNGLF 213
Query: 190 SNDFDVVYVRPT 201
+ VYV T
Sbjct: 214 VDGAGNVYVADT 225
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 50/116 (43%), Gaps = 18/116 (15%)
Query: 66 TVIEGNEIGVVPYKIR------VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ 119
TV GN G P ++ V G ++ D N I K P + G VAG
Sbjct: 142 TVAGGNGEGSAPNQLNFPTGLFVDGSGNVYVADNFNHRIQKWGP---GATSGTTVAGG-- 196
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV--TTIAGGK 173
+ PN+ P G+ +D GN+YVADT N I+K + TT+AGG
Sbjct: 197 ---NGLGSAPNQ--LLSPNGLFVDGAGNVYVADTGNDRIQKWAPGALSGTTVAGGN 247
>gi|333025645|ref|ZP_08453709.1| hypothetical protein STTU_3149 [Streptomyces sp. Tu6071]
gi|332745497|gb|EGJ75938.1| hypothetical protein STTU_3149 [Streptomyces sp. Tu6071]
Length = 610
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%), Gaps = 4/82 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNV 176
G G +DG ARFN P+G+ + G + VADT+N A+R++ A V+T+AG G+
Sbjct: 221 GERGLLDGDATGARFNEPQGLALLPDGAVVVADTVNHALRRVDLASGEVSTLAGTGRQWW 280
Query: 177 AGF-RDGPSEDAKFSNDFDVVY 197
G GP+ + S+ +DV +
Sbjct: 281 QGEPVAGPAREVSLSSPWDVAW 302
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 5/96 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS DGE + + ++ V+ + + R G+ GH DG +A H
Sbjct: 351 PSGLAVSLDGERLWIADAETSAVRW---IDREGTVRTAVGTGLFDFGHRDGAAGQALLQH 407
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
P GVT G++ VADT N A+R+ A VTT+A
Sbjct: 408 PLGVTALPDGSVAVADTYNHALRRYDPASEEVTTLA 443
>gi|290997788|ref|XP_002681463.1| predicted protein [Naegleria gruberi]
gi|284095087|gb|EFC48719.1| predicted protein [Naegleria gruberi]
Length = 2297
Score = 47.8 bits (112), Expect = 0.012, Method: Composition-based stats.
Identities = 47/164 (28%), Positives = 75/164 (45%), Gaps = 12/164 (7%)
Query: 61 GYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG 120
G L E + IG V + VS G L+ D I KI+ + + G S G
Sbjct: 169 GALTEGTATSSTIGQV-LGLAVSSSGVLYFSDYTYHCIGKISGTVVSVAAGTC---SLLG 224
Query: 121 YTGHVDGKPNEARFNHPKGVTMD-DKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAG 178
Y G +G A+ N P V + G L++ADT N IRK+G D + T+ G + V+G
Sbjct: 225 YGGD-NGVATSAKLNSPNDVAISPTTGELFIADTGNNVIRKVGLDNKIITVVG--TGVSG 281
Query: 179 F--RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ G ++ A+ S+ + + +L+ D N +R++ N
Sbjct: 282 YLGDGGQAKQAQLSSPTSIAFTSAG-EMLISDSDNYVIRKVYSN 324
Score = 46.6 bits (109), Expect = 0.029, Method: Composition-based stats.
Identities = 36/127 (28%), Positives = 61/127 (48%), Gaps = 4/127 (3%)
Query: 97 NIVKITPPLSQYSRGRLVAGSFQGYTGHVDG-KPNEARFNHPKGVTMDDKGNLYVADTLN 155
N V +T + +++ V +G +G DG P+ A + + + G+LY+AD
Sbjct: 33 NGVVVTDQIERFTITSYVGDGNEGNSG--DGVSPSHASIGYVDAIEVGSNGDLYIADKTY 90
Query: 156 LAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
IRK+ + + TIAG S+ F D G + AK + + + +L + D N +
Sbjct: 91 HRIRKVSNGIIKTIAGTGSSGGYFGDNGVATSAKLNKPQALAFYSGDGTLYLGDSLNYRI 150
Query: 215 RQISLNQ 221
R+IS NQ
Sbjct: 151 RKISTNQ 157
Score = 44.7 bits (104), Expect = 0.11, Method: Composition-based stats.
Identities = 39/141 (27%), Positives = 59/141 (41%), Gaps = 17/141 (12%)
Query: 57 QFENGYLVETVIEGNE----IGVVPYK--------IRVSEDGELFAVDEVNSNIVKITPP 104
Q E + V +GNE GV P I V +G+L+ D+ I K++
Sbjct: 40 QIERFTITSYVGDGNEGNSGDGVSPSHASIGYVDAIEVGSNGDLYIADKTYHRIRKVSNG 99
Query: 105 LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTM-DDKGNLYVADTLNLAIRKIGD 163
+ + G GS GY G +G A+ N P+ + G LY+ D+LN IRKI
Sbjct: 100 IIKTIAG---TGSSGGYFGD-NGVATSAKLNKPQALAFYSGDGTLYLGDSLNYRIRKIST 155
Query: 164 AGVTTIAGGKSNVAGFRDGPS 184
+ + GK +G +
Sbjct: 156 NQIISTVIGKGTKGALTEGTA 176
>gi|345000781|ref|YP_004803635.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces sp. SirexAA-E]
gi|344316407|gb|AEN11095.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Streptomyces sp. SirexAA-E]
Length = 608
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 41/86 (47%), Gaps = 7/86 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P K DG D +V++ + G V G F G G DG P EARF+
Sbjct: 182 PGKALALADGGFLVSDTTRHRLVELD------ADGETVRGHFGTGERGLTDGGPGEARFS 235
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI 161
P+G+ + G + VADT+N AIR +
Sbjct: 236 EPQGLALLPDGRIAVADTVNHAIRAL 261
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 51/106 (48%), Gaps = 5/106 (4%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIA 170
R+ AG+ G VDG EA F P G+ D L+VAD+ A+R + D V T
Sbjct: 327 RVAAGTTN--EGLVDGPGPEAWFAQPSGLAATDD-RLWVADSETSALRYVDLDGAVHTAV 383
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
G G RDG ++ A F + V + P S+ V D N ALR+
Sbjct: 384 GTGLFDFGHRDGAADQALFQHPLGVTAL-PDGSVAVSDTYNHALRR 428
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 33/50 (66%), Gaps = 2/50 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAG-VTTIA 170
GH DG ++A F HP GVT G++ V+DT N A+R+ ++G VTT+A
Sbjct: 391 GHRDGAADQALFQHPLGVTALPDGSVAVSDTYNHALRRFDPESGRVTTLA 440
>gi|408405002|ref|YP_006862985.1| NHL repeat-containing protein [Candidatus Nitrososphaera gargensis
Ga9.2]
gi|408365598|gb|AFU59328.1| NHL repeat-containing protein [Candidatus Nitrososphaera gargensis
Ga9.2]
Length = 501
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 57/114 (50%), Gaps = 6/114 (5%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKI--GDAGVT 167
GR+ + GY VDG E++F P G+ + GN L+VAD+ A+R+I G V
Sbjct: 323 GRIGPFAGSGYENIVDGSLEESQFAQPSGLAV--FGNYLFVADSEVSAVRRIDLGRKVVQ 380
Query: 168 TIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
T G V G +DGP E+A+ + V + + V D N A+R I L +
Sbjct: 381 TAVGEGLFVFGHKDGPLEEARLQHPLGVA-CESSNKIYVADTYNHAVRLIDLAE 433
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAG 171
GH DG EAR HP GV + +YVADT N A+R I + ++T+ G
Sbjct: 391 GHKDGPLEEARLQHPLGVACESSNKIYVADTYNHAVRLIDLAEQRISTLVG 441
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ +S DG + A+ + N N + + S G+++ G DG +ARF
Sbjct: 181 PGKLSLSPDGRMLAISDSNHNRILVVDADS----GKIIHKVGGGSKDLRDGSFEQARFFR 236
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
P+GV +YVADT N A+R+I V T+AG
Sbjct: 237 PQGVLWVGYDKIYVADTENHALREIDLQSRMVKTLAG 273
>gi|389875076|ref|YP_006374432.1| NHL repeat-containing protein [Tistrella mobilis KA081020-065]
gi|388532256|gb|AFK57450.1| NHL repeat-containing protein [Tistrella mobilis KA081020-065]
Length = 487
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 53/103 (51%), Gaps = 6/103 (5%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNV 176
G G VDG ARF P+G+ D G ++VADT N A+R+I A VTTIAG G+
Sbjct: 215 GEPGLVDGPAESARFQRPQGLIAAD-GAIFVADTWNHAVRRIDVASGEVTTIAGTGRRGP 273
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
P+ D+ ++ +D+ Y L V + G L +I L
Sbjct: 274 ILKAPAPAIDSALASPWDLEYREGV--LYVANAGTHQLARIDL 314
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 6/86 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA-GSFQGYTGHVDGKPNEARFN 135
P + + ED L+ +D S++ ++ L+ + LV G F+ G VDG ARF
Sbjct: 343 PSALALKED-RLWFIDAETSSLRRLD--LAGGTVSTLVGDGLFE--FGMVDGPARTARFQ 397
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI 161
HP G+ D GN+ VAD N AIR +
Sbjct: 398 HPLGLCFDMNGNILVADAYNDAIRVV 423
>gi|239918387|ref|YP_002957945.1| thiol-disulfide isomerase-like thioredoxin [Micrococcus luteus NCTC
2665]
gi|239839594|gb|ACS31391.1| thiol-disulfide isomerase-like thioredoxin [Micrococcus luteus NCTC
2665]
Length = 699
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 43/93 (46%), Gaps = 8/93 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT----GHVDGKPNEA 132
P + V DG L+ D S + + GR G+ G GHVDG+ + A
Sbjct: 415 PSGLSVGADGALWVADSETSAVRWV----RTGEDGRREVGTAVGAGLFDFGHVDGEADRA 470
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
R H GVT G++ +ADT N AIR+ AG
Sbjct: 471 RLQHALGVTALPDGSVLIADTYNGAIRRYAPAG 503
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 55/119 (46%), Gaps = 6/119 (5%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR--KIGDAG--- 165
G L + G +DG A F P G+++ G L+VAD+ A+R + G+ G
Sbjct: 389 GALAVHAGTALEGLLDGDAGRAWFAQPSGLSVGADGALWVADSETSAVRWVRTGEDGRRE 448
Query: 166 VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC 224
V T G G DG ++ A+ + V + P S+L+ D N A+R+ + +D
Sbjct: 449 VGTAVGAGLFDFGHVDGEADRARLQHALGVTAL-PDGSVLIADTYNGAIRRYAPAGEDA 506
>gi|336399828|ref|ZP_08580628.1| NHL repeat containing protein [Prevotella multisaccharivorax DSM
17128]
gi|336069564|gb|EGN58198.1| NHL repeat containing protein [Prevotella multisaccharivorax DSM
17128]
Length = 494
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 50/102 (49%), Gaps = 6/102 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-----GDAGVTTIAGGKSNVA 177
G DG A+FN+P+ + G LY+AD+ N IR I +A VTT G + +
Sbjct: 393 GWEDGSLKNAKFNYPRQICFTKDGRLYIADSQNHCIRSIDTTQGANARVTTPIGVPGS-S 451
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G+ DG E AKF+ V + + D N +R++S+
Sbjct: 452 GYADGGVELAKFNIPMGVAVSSDGEKVYIADMKNQVIRELSI 493
>gi|317504749|ref|ZP_07962709.1| NHL repeat containing protein [Prevotella salivae DSM 15606]
gi|315664131|gb|EFV03838.1| NHL repeat containing protein [Prevotella salivae DSM 15606]
Length = 448
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 4/58 (6%)
Query: 118 FQGY--TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
F G+ G +DG A F +P G+ D GN+Y+ D++ IRKIG D VTT+AG
Sbjct: 342 FAGFNGVGSMDGPRMSASFRNPTGMAFDVDGNMYITDSMGFTIRKIGHADGMVTTVAG 399
Score = 42.0 bits (97), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 66/155 (42%), Gaps = 32/155 (20%)
Query: 49 PQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGE------------LFAV----- 91
P +L GYL + VI G+ G P KI V +G L+ +
Sbjct: 48 PVVVSKILPDSGGYLTQFVINGSNFGTDPSKIEVVFNGNRKATVVSSNGTTLYGICPKQE 107
Query: 92 ---DEVNSNIVKITPPL--------SQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
++V + + P+ ++ R ++AG G G+VDG P +ARFN+ GV
Sbjct: 108 NGFNKVTVRVDSVGEPVMCPNDFKYTKVERVSVLAGK-TGAGGYVDGNPIDARFNYMYGV 166
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGG 172
+ N+ V + N +R I + +T +AGG
Sbjct: 167 GVVTGNNVIVMEGRNNRVRMISETDNKVITLLAGG 201
>gi|281415417|ref|ZP_06247159.1| thiol-disulfide isomerase-like thioredoxin [Micrococcus luteus NCTC
2665]
Length = 699
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 43/93 (46%), Gaps = 8/93 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT----GHVDGKPNEA 132
P + V DG L+ D S + + GR G+ G GHVDG+ + A
Sbjct: 415 PSGLSVGADGALWVADSETSAVRWV----RTGEDGRREVGTAVGAGLFDFGHVDGEADRA 470
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
R H GVT G++ +ADT N AIR+ AG
Sbjct: 471 RLQHALGVTALPDGSVLIADTYNGAIRRYAPAG 503
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 55/119 (46%), Gaps = 6/119 (5%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR--KIGDAG--- 165
G L + G +DG A F P G+++ G L+VAD+ A+R + G+ G
Sbjct: 389 GALAVHAGTALEGLLDGDAGRAWFAQPSGLSVGADGALWVADSETSAVRWVRTGEDGRRE 448
Query: 166 VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC 224
V T G G DG ++ A+ + V + P S+L+ D N A+R+ + +D
Sbjct: 449 VGTAVGAGLFDFGHVDGEADRARLQHALGVTAL-PDGSVLIADTYNGAIRRYAPAGEDA 506
>gi|269796827|ref|YP_003316282.1| thiol-disulfide isomerase-like thioredoxin [Sanguibacter keddieii
DSM 10542]
gi|269099012|gb|ACZ23448.1| thiol-disulfide isomerase-like thioredoxin [Sanguibacter keddieii
DSM 10542]
Length = 634
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 61/124 (49%), Gaps = 7/124 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-----GDAGVTTIAGGKSNVA 177
G +DG +A F P G+++ G++++AD A+R++ G A +T++ G
Sbjct: 356 GLLDGPLAQAWFAQPSGLSVGPDGSVWLADAETSALRRVDVADDGSATITSLVGQGLFDF 415
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSIS-PTDI 236
G RDG + A + V + P S++V D N ALR+ D+ ++ P+D
Sbjct: 416 GHRDGAAAQALLQHPLGVAAL-PDGSVVVTDTYNGALRRYDPATDEVTTLVGGLAEPSDA 474
Query: 237 LMVV 240
L+ V
Sbjct: 475 LVQV 478
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 56/126 (44%), Gaps = 14/126 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT--GHVDGKPNEARF 134
P + V DG ++ D S + ++ ++ + + QG GH DG +A
Sbjct: 370 PSGLSVGPDGSVWLADAETSALRRVD--VADDGSATITSLVGQGLFDFGHRDGAAAQALL 427
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSND 192
HP GV G++ V DT N A+R+ A VTT+ GG + PS DA D
Sbjct: 428 QHPLGVAALPDGSVVVTDTYNGALRRYDPATDEVTTLVGGLAE-------PS-DALVQVD 479
Query: 193 FDVVYV 198
D V+V
Sbjct: 480 GDEVHV 485
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 57/131 (43%), Gaps = 11/131 (8%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
+G +V +P P V + GNL VAD + ++ ++ G T + S G
Sbjct: 165 RGSGPYVPAEPTPGTLRFPAAVIALESGNLLVADAGHHSLAELAADGETLVRRIGSGERG 224
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTC------SLLVIDRGNAALRQISLNQDDCEYQYNSIS 232
DG +DA FS + V T +LV D N ALR + L+ + +++
Sbjct: 225 LSDGGRQDATFSEPNGLCLVPATLRDQLGYDVLVADTVNHALRGVRLS----DGHVTTVA 280
Query: 233 PT-DILMVVGA 242
T D LMV GA
Sbjct: 281 GTGDQLMVGGA 291
>gi|225420274|ref|ZP_03762577.1| hypothetical protein CLOSTASPAR_06617 [Clostridium asparagiforme
DSM 15981]
gi|225041091|gb|EEG51337.1| hypothetical protein CLOSTASPAR_06617 [Clostridium asparagiforme
DSM 15981]
Length = 336
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 44/85 (51%), Gaps = 8/85 (9%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
+G L+ D N ++K+ ++ G G G DG +ARF+ P + + +
Sbjct: 220 EGALYVSDTGNHRVLKVENGEIVWTAG--------GAEGQADGGFGQARFSSPTYLAVSE 271
Query: 145 KGNLYVADTLNLAIRKIGDAGVTTI 169
+G LYV+DT N A+RKI + V T+
Sbjct: 272 QGTLYVSDTGNAAVRKIENGTVATV 296
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 46/87 (52%), Gaps = 13/87 (14%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
+VPY GE +AV + +++V+ R +AG +G G+ D + A F
Sbjct: 124 IVPY-------GEGYAVSDRENHMVRYIDAQGS----RTLAG--RGREGYEDNRGGRALF 170
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI 161
+ P G+ DD GNLY+ADT N IR++
Sbjct: 171 SGPTGLAADDAGNLYIADTGNDVIRRL 197
>gi|308799191|ref|XP_003074376.1| NHL repeat (ISS) [Ostreococcus tauri]
gi|116000547|emb|CAL50227.1| NHL repeat (ISS) [Ostreococcus tauri]
Length = 1783
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 93/198 (46%), Gaps = 26/198 (13%)
Query: 45 SSKAPQADGNVLQFENGY-LVETVIEGNEIGVV------PYKIRVSED--GELFAVDEVN 95
+S A A G+VL + G V + G+ VV P ++ V E G+++ + +
Sbjct: 257 TSVALLASGDVLVADTGNKAVRNISNGDVTTVVASLPFAPIELVVDESNGGDMYVLGQSQ 316
Query: 96 SNIVKITPPLSQYSRGRLVAGSFQGYTGHVD-GKPNEARFNHPKGVTMDD-KGNLYVADT 153
++KI+ + +AGS Q +G VD ARF P+G+ +D LYVADT
Sbjct: 317 HGVMKISVSTLAVTT---IAGS-QTTSGFVDHNTGTSARFTLPRGLALDSLNSKLYVADT 372
Query: 154 LNLAIRKI--GDAGVTTIAGGKS---NVAGF-RDGP-SEDAKFSNDFDVVY----VRPTC 202
N A+R I VTT+ G S N + +DG S A+F++ + Y +
Sbjct: 373 GNHAVRMIDLSTGVVTTVLGDGSPALNASTLNKDGVLSTPARFNDPVGIAYNYDSALSSG 432
Query: 203 SLLVIDRGNAALRQISLN 220
LLV D G LR++ LN
Sbjct: 433 VLLVSDAGTHQLRKLILN 450
Score = 41.6 bits (96), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 48/99 (48%), Gaps = 15/99 (15%)
Query: 132 ARFNHPKGVTMD-----DKGNLYVADTLNLAIRKI-------GDAG-VTTIAGGKSNVAG 178
ARFN P G+ + G L V+D +RK+ G+A V T+AG + AG
Sbjct: 413 ARFNDPVGIAYNYDSALSSGVLLVSDAGTHQLRKLILNDSTAGNAATVVTVAGSYTGTAG 472
Query: 179 FR-DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
FR D A F N V Y+ + + +V DR N A+R+
Sbjct: 473 FRDDAVGSAAMFYNPEAVSYIG-SATYIVADRSNHAIRK 510
>gi|229822379|ref|YP_002883905.1| alkyl hydroperoxide reductase [Beutenbergia cavernae DSM 12333]
gi|229568292|gb|ACQ82143.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Beutenbergia cavernae DSM 12333]
Length = 641
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 52/112 (46%), Gaps = 6/112 (5%)
Query: 107 QYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG- 165
Q S RL +G DG EA F P G+ + G ++VAD+ A+R + AG
Sbjct: 347 QGSLTRLAGTEHEGLR---DGPAAEAWFAQPSGLAVAQDGRIWVADSETSALRWLDPAGG 403
Query: 166 -VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
V T G G RDG ++ A + V + P S+LV D N ALR+
Sbjct: 404 DVHTAVGQGLFEFGHRDGAADQALLQHPLGVAAL-PDASVLVADTYNGALRR 454
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 71/165 (43%), Gaps = 32/165 (19%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K E+G L D + ++ ++ P LV G G VDG PN ARF+
Sbjct: 181 PAKAIALENGNLLVADAGHHSLAELGP-----DGETLVRRVGSGERGLVDGGPNVARFSE 235
Query: 137 PKGVTM---DDKGNL----YVADTLNLAIR--KIGDAGVTTIAG-GKSNVAGFRDGPSE- 185
P G+ + + + L VADT+N A+R ++ D V+T+AG G+ V G D S+
Sbjct: 236 PNGLCLVPVELRARLGYDVLVADTVNHALRGVRLADGHVSTVAGTGQQYVVGAPDNASDP 295
Query: 186 ----------------DAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
K S+ +DV + + +V GN L
Sbjct: 296 HGGTHPVGFGDQFVPTSVKLSSPWDVAWSPDLAAFVVAMAGNHTL 340
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 48/100 (48%), Gaps = 9/100 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT--GHVDGKPNEARF 134
P + V++DG ++ D S + + P + G + QG GH DG ++A
Sbjct: 374 PSGLAVAQDGRIWVADSETSALRWLDP-----AGGDVHTAVGQGLFEFGHRDGAADQALL 428
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGG 172
HP GV ++ VADT N A+R+ A VTTIA G
Sbjct: 429 QHPLGVAALPDASVLVADTYNGALRRHDPATGVVTTIATG 468
>gi|452992643|emb|CCQ95895.1| hypothetical protein CULT_320031 [Clostridium ultunense Esp]
Length = 329
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 68/141 (48%), Gaps = 17/141 (12%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGH-------VDGKP 129
Y +R +G ++ +D N + KI L+AG+ + GYTG G
Sbjct: 178 YDLRRGPEGNIYILDSKNYAVRKINIDKETV---ELIAGTGKPGYTGDGGDAKDATFGGN 234
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAK 188
E+ F+ P +++D+ GN+YV DT N +R I + G V TIAG + + G ++ +E
Sbjct: 235 KESYFDGPWSLSLDEVGNIYVGDTQNHVVRMITNKGKVYTIAGNINVIKGRKNRTNEKDL 294
Query: 189 FSNDFDVVYVRPTCSLLVIDR 209
F + ++ CS+ DR
Sbjct: 295 FKINLPLI-----CSMDYYDR 310
>gi|395502099|ref|XP_003755423.1| PREDICTED: NHL repeat-containing protein 2 [Sarcophilus harrisii]
Length = 1044
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 80/179 (44%), Gaps = 32/179 (17%)
Query: 68 IEGNEIGVVPYKIRVSEDGELF----AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG 123
I+ N+IG+ YK + LF A+D+V ++V ++ R++ G
Sbjct: 524 IKDNKIGIKLYKDSLPSSPLLFPGKVAIDDVTDSLV-----IADTGHHRILVVRKNGQIQ 578
Query: 124 HVDGKPNEAR---------FNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG- 171
H G PN R FN P+GV + + +YVADT N IRKI V+T+AG
Sbjct: 579 HSIGGPNSGRKDGTFLESTFNSPQGVAIKNN-IIYVADTENHLIRKIDLETKMVSTVAGI 637
Query: 172 ---GKSNVAGFRDGPSEDAKFSNDFDVVY---VRPTCSLLVIDRGNAALRQI-SLNQDD 223
G G + E+ S+ +DVV+ V C ++ A + QI +L DD
Sbjct: 638 GIQGTDKEGGAK---GEEQPISSPWDVVFGISVSEICKDDILWIAMAGIHQIWALLLDD 693
>gi|290971766|ref|XP_002668650.1| predicted protein [Naegleria gruberi]
gi|284082136|gb|EFC35906.1| predicted protein [Naegleria gruberi]
Length = 728
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 66/145 (45%), Gaps = 12/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P + VS + E++ D++N I KI G +V + G G +G A+
Sbjct: 125 PRCVFVSSNNEVYIADQINHRIRKILE------NGNIVTIAGNGPYGFCGDNGLATNAQL 178
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
N P GV + +Y+AD N IRKI + G + TIAG + AGF K +F
Sbjct: 179 NSPAGVFVSSNNEIYIADYDNHRIRKILENGNIVTIAGKGT--AGFSGDNGLATKEKLNF 236
Query: 194 -DVVYVRPTCSLLVIDRGNAALRQI 217
V+V + + D+ N +R+I
Sbjct: 237 PRCVFVSSNNEVYIADQINHRIRKI 261
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 62/144 (43%), Gaps = 10/144 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P + VS + E++ D N I KI G +V + G G +G +A+
Sbjct: 13 PESVFVSSNNEVYIADYGNHRIRKILE------NGNIVTIAGNGTAGFSGDNGIATKAQL 66
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF- 193
N P GV + +Y+AD N IRKI + G I GK AGF K +F
Sbjct: 67 NGPVGVFVSSNNEVYIADYDNHRIRKILENGNIVIIAGKG-TAGFSGDNGLATKEKLNFP 125
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
V+V + + D+ N +R+I
Sbjct: 126 RCVFVSSNNEVYIADQINHRIRKI 149
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 68/145 (46%), Gaps = 12/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P + VS + E++ D N I KI G +V + +G G +G + +
Sbjct: 181 PAGVFVSSNNEIYIADYDNHRIRKILE------NGNIVTIAGKGTAGFSGDNGLATKEKL 234
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSND 192
N P+ V + +Y+AD +N IRKI + G + TIAG G G +G + +A+ N
Sbjct: 235 NFPRCVFVSSNNEVYIADQINHRIRKILENGNIVTIAGNGPYGFCG-DNGLATNAQL-NS 292
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
V+V + + + GN +R+I
Sbjct: 293 PAGVFVSSNNEIYIAEYGNHRIRKI 317
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 53/111 (47%), Gaps = 12/111 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P + VS + E++ D++N I KI G +V + G G +G A+
Sbjct: 237 PRCVFVSSNNEVYIADQINHRIRKILE------NGNIVTIAGNGPYGFCGDNGLATNAQL 290
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR-DGP 183
N P GV + +Y+A+ N IRKI + G + T+AG + AGF D P
Sbjct: 291 NSPAGVFVSSNNEIYIAEYGNHRIRKILENGNIVTVAGNGT--AGFSGDSP 339
Score = 41.6 bits (96), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 67/145 (46%), Gaps = 12/145 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P + VS + E++ D N I KI G +V + +G G +G + +
Sbjct: 69 PVGVFVSSNNEVYIADYDNHRIRKILE------NGNIVIIAGKGTAGFSGDNGLATKEKL 122
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSND 192
N P+ V + +Y+AD +N IRKI + G + TIAG G G +G + +A+ N
Sbjct: 123 NFPRCVFVSSNNEVYIADQINHRIRKILENGNIVTIAGNGPYGFCG-DNGLATNAQL-NS 180
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
V+V + + D N +R+I
Sbjct: 181 PAGVFVSSNNEIYIADYDNHRIRKI 205
>gi|298245110|ref|ZP_06968916.1| NHL repeat containing protein [Ktedonobacter racemifer DSM 44963]
gi|297552591|gb|EFH86456.1| NHL repeat containing protein [Ktedonobacter racemifer DSM 44963]
Length = 496
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 50/92 (54%), Gaps = 8/92 (8%)
Query: 110 RGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVT 167
R R + GS G G DG EA+FNHP+GV + + LYVADT N IR++ V
Sbjct: 208 RVRTIIGS--GVQGQADGSFAEAQFNHPQGVALVND-LLYVADTDNHLIRRVDLRTKQVE 264
Query: 168 TIAG-GKSN--VAGFRDGPSEDAKFSNDFDVV 196
T+AG G+ N V GP+ S+ +D+V
Sbjct: 265 TLAGTGEQNGMVRTRLQGPARSIALSSPWDLV 296
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 52/99 (52%), Gaps = 7/99 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFR 180
G VDG EA+ H +G+ + G LY+ADT N I+ + V T+AG + AG
Sbjct: 391 GDVDGIGEEAQLQHVQGICAAN-GLLYLADTYNNRIKALDPQTREVRTLAG--TGEAGIH 447
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+GP+E A+F+ D+ T L + D N A+R + +
Sbjct: 448 NGPAEQAQFNEPGDLAITGNT--LYIADTNNHAIRLLDM 484
>gi|290995422|ref|XP_002680294.1| predicted protein [Naegleria gruberi]
gi|284093914|gb|EFC47550.1| predicted protein [Naegleria gruberi]
Length = 1170
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 58/107 (54%), Gaps = 7/107 (6%)
Query: 115 AGSFQGYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGG 172
AGS Q Y+ DG P AR + P+G+ + G ++++D + IRKI GV +T+AG
Sbjct: 344 AGS-QAYSSG-DGYPATAARLSSPEGIAISTSGEVFISDKGSHTIRKIDSKGVISTVAGT 401
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
S AG+ DGP+ AK N + P LL+ + N +R++S
Sbjct: 402 GS--AGYVDGPALKAKL-NGPGFLAFTPNGDLLITESSNNRIRKLSF 445
Score = 41.6 bits (96), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 48/97 (49%), Gaps = 8/97 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I +S GE+F D+ + I KI S+G + + G G+VDG +A+ N
Sbjct: 365 PEGIAISTSGEVFISDKGSHTIRKID------SKGVISTVAGTGSAGYVDGPALKAKLNG 418
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
P + G+L + ++ N IRK+ +TTIAG
Sbjct: 419 PGFLAFTPNGDLLITESSNNRIRKLSFVSGNITTIAG 455
>gi|344924159|ref|ZP_08777620.1| hypothetical protein COdytL_05892 [Candidatus Odyssella
thessalonicensis L13]
Length = 1320
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 39/81 (48%), Gaps = 17/81 (20%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I V G ++ D N+N+ K+TP + Y++ L+ G FN PK
Sbjct: 57 IAVDTSGNVYVTDRANNNVYKLTPSGATYTQSTLLTG-----------------FNDPKS 99
Query: 140 VTMDDKGNLYVADTLNLAIRK 160
+ +D+ GNLY+ D+ N + K
Sbjct: 100 IAVDNSGNLYITDSGNGNVVK 120
>gi|282879766|ref|ZP_06288496.1| NHL repeat protein [Prevotella timonensis CRIS 5C-B1]
gi|281306435|gb|EFA98465.1| NHL repeat protein [Prevotella timonensis CRIS 5C-B1]
Length = 486
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 54/115 (46%), Gaps = 7/115 (6%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI------GDA 164
GR +G DG A+F +P+ + G +Y+AD++N IR I A
Sbjct: 372 GRACWEGTSAGSGWEDGLLKNAKFKYPRQICFTADGKMYIADSMNSCIRCIDTNQPDNKA 431
Query: 165 GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
VTT+ G + AGF++G + A+F V + V D GN +R++++
Sbjct: 432 TVTTVIGLPGS-AGFKEGGPDVAQFRYPRGVAVNADGSIVYVADTGNRCIRKLTI 485
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 50/88 (56%), Gaps = 5/88 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNI--VKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
P +I + DG+++ D +NS I + P ++ + ++ G G +G P+ A+F
Sbjct: 398 PRQICFTADGKMYIADSMNSCIRCIDTNQPDNKATVTTVIG--LPGSAGFKEGGPDVAQF 455
Query: 135 NHPKGVTMDDKGNL-YVADTLNLAIRKI 161
+P+GV ++ G++ YVADT N IRK+
Sbjct: 456 RYPRGVAVNADGSIVYVADTGNRCIRKL 483
>gi|405960880|gb|EKC26754.1| NHL repeat-containing protein 2 [Crassostrea gigas]
Length = 699
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 57/112 (50%), Gaps = 10/112 (8%)
Query: 129 PNEARFNHPKGVTMDD--KGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSN-----VAG 178
P +A F P G+T+ + +G+ L+VAD+ + +R + D V + GG+ + G
Sbjct: 392 PEKAAFAQPSGLTLGNTPEGSFLFVADSESSTVRSVALKDGAVKHVVGGERDPMNLFAYG 451
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNS 230
+DG DAK + V +V + S+LV D N L+ + +++ C + S
Sbjct: 452 DKDGTGIDAKLQHPLGVAWVADSGSILVADSYNHKLKLVDISKKQCTTVWGS 503
Score = 42.4 bits (98), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGF 179
G +G + RF+ P+GV M + +Y+ADT N AIRKI VTTIAG GK F
Sbjct: 256 GFKNGTFTQCRFDSPQGVVMKEN-VIYLADTENHAIRKIDLQKQEVTTIAGNGKQGNDKF 314
Query: 180 RDGPSEDAKFSNDFDVV 196
G + + S+ +D+V
Sbjct: 315 GGGIGIEQELSSPWDLV 331
>gi|303274116|ref|XP_003056382.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462466|gb|EEH59758.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 490
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
+P G+ +D G ++VAD+ N IR+I GV T+ G S AGF DG +A+F+N
Sbjct: 3 YPAGIALDGNGAIFVADSGNNRIRQIKSGGVVTVLAG-SGTAGFLDGSLLEAEFNN 57
Score = 38.1 bits (87), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 70/147 (47%), Gaps = 12/147 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + +G +F D N+ I +I S G + + G G +DG EA FN+
Sbjct: 4 PAGIALDGNGAIFVADSGNNRIRQIK------SGGVVTVLAGSGTAGFLDGSLLEAEFNN 57
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGP-SEDAKFSNDF 193
P+ V + G +YVADT N IR I + VTT AG + ++ +RD + A S
Sbjct: 58 PQSVVVTSSGVVYVADTNNHRIRLINEENDVVTTYAG--AGISDYRDAQVAIQAYISKPL 115
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
+ + T L V D G++ +R I N
Sbjct: 116 GLAFRTLTKDLFVSD-GDSRIRIIRPN 141
>gi|384245978|gb|EIE19470.1| hypothetical protein COCSUDRAFT_67905 [Coccomyxa subellipsoidea
C-169]
Length = 838
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 32/67 (47%)
Query: 142 MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 201
M G L VAD+ N IRK+ G T G+ G DG A FSN + V P
Sbjct: 1 MSPNGFLAVADSTNACIRKVTLEGEVTTLAGECGFPGDEDGEMHSALFSNGIEDVACLPN 60
Query: 202 CSLLVID 208
C++LV D
Sbjct: 61 CTILVTD 67
>gi|262195555|ref|YP_003266764.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
gi|262078902|gb|ACY14871.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
Length = 404
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 56/106 (52%), Gaps = 6/106 (5%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
DG L V V +I++I P L+AG+ + G DG EARF+ P GV + D
Sbjct: 205 DGRLVLV-YVFQDIIQIFDPEGLPPAPELLAGAAED-PGFADGNGAEARFDRPLGVAVVD 262
Query: 145 KGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 189
+YVAD+ N IRKI D V+T+AG + VA DG +A F
Sbjct: 263 D-EIYVADSANHRIRKITLDGEVSTLAG--TGVAASSDGSLAEATF 305
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 51/108 (47%), Gaps = 12/108 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V +D E++ D N I KIT G + + G DG EA F+
Sbjct: 255 PLGVAVVDD-EIYVADSANHRIRKIT------LDGEVSTLAGTGVAASSDGSLAEATFDT 307
Query: 137 PKGVTMDDKGNLYVADTLN-LAIRKIGDAG--VTTIAGGKSNVAGFRD 181
P +T D GNLYV + + +RKI V TIAG S +AGF+D
Sbjct: 308 PYAMTRDSDGNLYVTELGDSFRVRKIDLVAETVETIAG--SGMAGFKD 353
>gi|345008031|ref|YP_004810385.1| redoxin domain-containing protein [Streptomyces violaceusniger Tu
4113]
gi|344034380|gb|AEM80105.1| Redoxin domain protein [Streptomyces violaceusniger Tu 4113]
Length = 615
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 46/82 (56%), Gaps = 4/82 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNV 176
G G DG ARF+ P+G+ + G + VADT+N A+R+ A VTT+AG G+
Sbjct: 226 GERGLTDGTGERARFSEPQGLALLPDGTVAVADTVNHALRRFDPATGEVTTLAGTGRQWW 285
Query: 177 AGF-RDGPSEDAKFSNDFDVVY 197
G +GP+ + S+ +DV +
Sbjct: 286 QGSPTEGPAREVDLSSPWDVAW 307
>gi|30696124|ref|NP_564718.2| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|17529226|gb|AAL38840.1| unknown protein [Arabidopsis thaliana]
gi|332195279|gb|AEE33400.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 1055
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 49/101 (48%), Gaps = 5/101 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR 180
G DG E HP GV + G +Y+ D+ N I+K+ V T+AG + AGF+
Sbjct: 791 GDNDGVGAEVLLQHPLGVLCANDGQIYLTDSYNHKIKKLDPVTKRVVTLAG--TGKAGFK 848
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
DG + A+ S + + L V D N+ +R I LN+
Sbjct: 849 DGKVKGAQLSEPAGLA-ITENGRLFVADTNNSLIRYIDLNK 888
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 60/123 (48%), Gaps = 19/123 (15%)
Query: 89 FAVDEVNSNIVKITPPLSQYSRGRLVA----GSF------QGYTGHVDGKPNEARFNHPK 138
A+D +N+ + +S + R++ G+F G G DG +A FN P+
Sbjct: 572 LAIDTLNNRLF-----ISDSNHNRIIVTDLEGNFIVQIGSSGEEGFQDGSFEDAAFNRPQ 626
Query: 139 GVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSND-FD 194
G+ + K N LYVADT N A+R+I + V T+AG + + ++ G + N +D
Sbjct: 627 GLAYNAKKNLLYVADTENHALREIDFVNERVQTLAGNGTKGSDYQGGRKGTKQLLNSPWD 686
Query: 195 VVY 197
V +
Sbjct: 687 VCF 689
>gi|440733650|ref|ZP_20913345.1| nhl repeat protein [Xanthomonas translucens DAR61454]
gi|440359672|gb|ELP96967.1| nhl repeat protein [Xanthomonas translucens DAR61454]
Length = 473
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 48/85 (56%), Gaps = 8/85 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + +++D L+ D + IV+ T S GR++ G +DG EA F+
Sbjct: 170 PFGLALADD-RLYIADSGHHRIVECT------SGGRMLRQFGLGTADFIDGGIGEAAFHR 222
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P+G+ + ++G LYVADT N A+R+I
Sbjct: 223 PRGLAL-ERGVLYVADTGNHALRRI 246
>gi|12323029|gb|AAG51506.1|AC058785_9 hypothetical protein [Arabidopsis thaliana]
Length = 1041
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 49/101 (48%), Gaps = 5/101 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR 180
G DG E HP GV + G +Y+ D+ N I+K+ V T+AG + AGF+
Sbjct: 777 GDNDGVGAEVLLQHPLGVLCANDGQIYLTDSYNHKIKKLDPVTKRVVTLAG--TGKAGFK 834
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
DG + A+ S + + L V D N+ +R I LN+
Sbjct: 835 DGKVKGAQLSEPAGLA-ITENGRLFVADTNNSLIRYIDLNK 874
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 54/107 (50%), Gaps = 18/107 (16%)
Query: 89 FAVDEVNSNIVKITPPLSQYSRGRLVA----GSF------QGYTGHVDGKPNEARFNHPK 138
A+D +N+ + +S + R++ G+F G G DG +A FN P+
Sbjct: 572 LAIDTLNNRLF-----ISDSNHNRIIVTDLEGNFIVQIGSSGEEGFQDGSFEDAAFNRPQ 626
Query: 139 GVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDG 182
G+ + K N LYVADT N A+R+I + V T+AG + + ++ G
Sbjct: 627 GLAYNAKKNLLYVADTENHALREIDFVNERVQTLAGNGTKGSDYQGG 673
>gi|428214156|ref|YP_007087300.1| thiol-disulfide isomerase-like thioredoxin [Oscillatoria acuminata
PCC 6304]
gi|428002537|gb|AFY83380.1| thiol-disulfide isomerase-like thioredoxin [Oscillatoria acuminata
PCC 6304]
Length = 506
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 63/123 (51%), Gaps = 14/123 (11%)
Query: 113 LVAGSFQGYTG-----HVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG---DA 164
L+ G+ Y+G +DG+ +EA F P G+T DD+ L+VAD+ +IR IG +
Sbjct: 319 LMDGTVGTYSGIGAEAGMDGELDEAAFAQPSGITSDDE-ELFVADSEISSIRGIGLGNEP 377
Query: 165 GVTTIAGGKSNVAGF--RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
V T+ G ++ GF RDG +E+A F + + Y L V D N ++ +
Sbjct: 378 RVRTLC-GSGDLFGFGDRDGTAEEALFQHCLGIDYAEGL--LWVADTYNHKIKTVHPKTG 434
Query: 223 DCE 225
+C+
Sbjct: 435 ECK 437
>gi|326436774|gb|EGD82344.1| NHL repeat protein [Salpingoeca sp. ATCC 50818]
Length = 1484
Score = 47.4 bits (111), Expect = 0.019, Method: Composition-based stats.
Identities = 50/155 (32%), Positives = 72/155 (46%), Gaps = 20/155 (12%)
Query: 70 GNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS--FQGYTGHVDG 127
GN P + ++DG L DE N + + S +VAG+ F GY+G DG
Sbjct: 1277 GNAAIHSPVGLAWADDGSLLFSDEENHVVYMVNRRSLIIS---VVAGTPRFAGYSGD-DG 1332
Query: 128 KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSE 185
AR N P+G+ + KG LY+AD+ N IR + +TT+AG S AGF S
Sbjct: 1333 LSIGARLNRPRGIALS-KGVLYIADSGNHRIRAVDMRTQVITTVAGTGS--AGF----SG 1385
Query: 186 DAKFSNDFDV-----VYVRPTCSLLVIDRGNAALR 215
D + + V V + +L + D GN +R
Sbjct: 1386 DGGLPTNAALRVPHGVAVHSSGTLAIADSGNHRVR 1420
>gi|433676068|ref|ZP_20508221.1| RING finger protein nhl-1 [Xanthomonas translucens pv. translucens
DSM 18974]
gi|430818813|emb|CCP38486.1| RING finger protein nhl-1 [Xanthomonas translucens pv. translucens
DSM 18974]
Length = 473
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 48/85 (56%), Gaps = 8/85 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + +++D L+ D + IV+ T S GR++ G +DG EA F+
Sbjct: 170 PFGLALADD-RLYIADSGHHRIVECT------SGGRMLRQFGLGTADFIDGGIGEAAFHR 222
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P+G+ + ++G LYVADT N A+R+I
Sbjct: 223 PRGLAL-ERGVLYVADTGNHALRRI 246
>gi|288923319|ref|ZP_06417453.1| serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288345335|gb|EFC79730.1| serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 752
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 52/95 (54%), Gaps = 7/95 (7%)
Query: 130 NEARFNHPKGVTMDD-KGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGF--RDGPSE 185
+A F+ + +T+D +G +YVADT N IR+I G +TT+AG + AGF GP+
Sbjct: 413 TQAEFDSARDLTIDTTRGVIYVADTENNRIRRIDRTGRITTVAG--NGTAGFDGDGGPAT 470
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
A VV V P +L + D GN +R+I N
Sbjct: 471 AASLQEPSGVV-VAPDGTLFIADNGNHRIRRIGTN 504
Score = 47.0 bits (110), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 52/110 (47%), Gaps = 9/110 (8%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKP-NEARFNHPKGVTMD 143
G ++ D N+ I +I + R VAG+ G+ G DG P A P GV +
Sbjct: 430 GVIYVADTENNRIRRI----DRTGRITTVAGNGTAGFDG--DGGPATAASLQEPSGVVVA 483
Query: 144 DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAK-FSND 192
G L++AD N IR+IG GV T G+S GF SED F+ D
Sbjct: 484 PDGTLFIADNGNHRIRRIGTNGVITTIAGQSGSGGFAGEVSEDGLIFAGD 533
>gi|399157177|ref|ZP_10757244.1| NHL repeat containing protein, partial [SAR324 cluster bacterium
SCGC AAA001-C10]
Length = 352
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 46/99 (46%), Gaps = 6/99 (6%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRD 181
G D A+FN+PKG+T D NLYVAD N IRKI GV T G S G D
Sbjct: 250 GSTDATGTSAQFNYPKGITTDGT-NLYVADYSNHRIRKIVISTGVVTTLAGSSQ--GSTD 306
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
A F + +L V DR N +R+I ++
Sbjct: 307 ATGTSASFY--YPSGITTDGTNLYVADRYNHRIRKIVIS 343
>gi|290976430|ref|XP_002670943.1| predicted protein [Naegleria gruberi]
gi|284084507|gb|EFC38199.1| predicted protein [Naegleria gruberi]
Length = 4502
Score = 47.0 bits (110), Expect = 0.021, Method: Composition-based stats.
Identities = 39/150 (26%), Positives = 67/150 (44%), Gaps = 10/150 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY---TGHVDG-KPNEA 132
P + VS DG+ V + +N++K T Y G V + G T VDG +
Sbjct: 269 PQGVDVSFDGDFVYVADTGNNLIKYT----SYKDGVFVISNLAGTGNTTSLVDGVNALQV 324
Query: 133 RFNHPKGVTMDDK-GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFS 190
N P V K G +Y+A+ N IRK+ + ++A G ++G+ D S F
Sbjct: 325 DLNRPVSVRFSKKTGEIYIAEKGNHVIRKVRKDNIASVAVGSIGISGYNGDFKSSLDSFL 384
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
++ + + + + + D GN +R++ N
Sbjct: 385 SEPSDIAISTSGVIYIADSGNNRVRKVYQN 414
Score = 40.8 bits (94), Expect = 1.7, Method: Composition-based stats.
Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 3/84 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS G+++ D N+ + KI S +S L+AG+ G P A N+
Sbjct: 2423 PKGVTVSSSGDIYFSDTSNNKLKKILQ--SDWSVS-LIAGTGTGSFSGDGSSPTAATINN 2479
Query: 137 PKGVTMDDKGNLYVADTLNLAIRK 160
P G+ + + G +Y AD+ N +RK
Sbjct: 2480 PIGLDITENGEVYFADSNNNRVRK 2503
Score = 38.9 bits (89), Expect = 5.2, Method: Composition-based stats.
Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 5/90 (5%)
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAG-GKSNVAGFRDGPSEDAKFS 190
N PKGVT+ G++Y +DT N ++KI D V+ IAG G + +G DG S A
Sbjct: 2420 LNKPKGVTVSSSGDIYFSDTSNNKLKKILQSDWSVSLIAGTGTGSFSG--DGSSPTAATI 2477
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
N+ + + + D N +R+ S +
Sbjct: 2478 NNPIGLDITENGEVYFADSNNNRVRKCSYD 2507
>gi|253748633|gb|EET02664.1| Haloacid dehalogenase-like hydrolase family [Giardia intestinalis
ATCC 50581]
Length = 963
Score = 47.0 bits (110), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 52/102 (50%), Gaps = 7/102 (6%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDD-KGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNV 176
G H+DG ++A FN P V D G +Y+ DT N IR + A V T+A G V
Sbjct: 126 GEAAHLDGGFSQAAFNSPHSVVSDPVNGIIYLTDTKNHCIRTLSLATRKVRTLA-GTPGV 184
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G+RDG + F+ +V + +++V D N ALR IS
Sbjct: 185 FGYRDG--LNPLFNEPLGLVLTEDS-NIIVCDSKNGALRHIS 223
>gi|383100947|emb|CCD74491.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis halleri subsp. halleri]
Length = 1028
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 6/106 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR 180
G DG E HP GV G +Y+ D+ N I+K+ V T+AG + AGF+
Sbjct: 764 GDNDGVGAEVLLQHPLGVLCAKDGQIYLTDSYNHKIKKLDPVTKRVVTVAG--TGKAGFK 821
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ-DDCE 225
DG + A+ S + + L V D N+ +R I LN+ +D E
Sbjct: 822 DGKVKGAQLSEPAGLA-ITENGRLFVADTNNSLIRYIDLNKGEDAE 866
>gi|383775243|ref|YP_005459809.1| hypothetical protein AMIS_730 [Actinoplanes missouriensis 431]
gi|381368475|dbj|BAL85293.1| hypothetical protein AMIS_730 [Actinoplanes missouriensis 431]
Length = 601
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 50/99 (50%), Gaps = 8/99 (8%)
Query: 77 PYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + VS DG+ L+ D S + + + + G+ G F GHVDGK EA F
Sbjct: 345 PSGLSVSRDGKRLWIADSETSALRYVEDGVLHTAVGQ---GLFD--FGHVDGKAEEALFQ 399
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDA--GVTTIAGG 172
HP GV G++ VADT N A+R+ A V+T+ G
Sbjct: 400 HPLGVAALPDGSVLVADTYNGAVRRFDPAADAVSTVDSG 438
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 55/115 (47%), Gaps = 3/115 (2%)
Query: 126 DGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPS 184
DG + P G+++ G L++AD+ A+R + D + T G G DG +
Sbjct: 334 DGPLPDVWMAQPSGLSVSRDGKRLWIADSETSALRYVEDGVLHTAVGQGLFDFGHVDGKA 393
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSIS-PTDILM 238
E+A F + V + P S+LV D N A+R+ D + ++ P+D+L+
Sbjct: 394 EEALFQHPLGVAAL-PDGSVLVADTYNGAVRRFDPAADAVSTVDSGLAEPSDVLV 447
>gi|406944252|gb|EKD76066.1| Serine/threonine protein kinase, partial [uncultured bacterium]
Length = 212
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 67/144 (46%), Gaps = 20/144 (13%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTM 142
+ +GE++ VD +N+ + KI + G L + G G+ DG + A F P+ + +
Sbjct: 82 AANGEIYLVDTINNRVEKIG------TDGILTNVAGAGDYGYRDGSSDYALFAQPQDIAI 135
Query: 143 --DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 200
D L++ADT N IRKI D V+T+ G S+ G + D D V++
Sbjct: 136 YGDTASELFIADTNNNVIRKIKDGEVSTLLSGLSSPQG----------VAVDGDTVFISD 185
Query: 201 TCS--LLVIDRGNAALRQISLNQD 222
T + +L ID A + N D
Sbjct: 186 TGNNRILGIDHNGGATVEFGKNLD 209
>gi|310657686|ref|YP_003935407.1| exported protein of unknown function [[Clostridium] sticklandii]
gi|308824464|emb|CBH20502.1| exported protein of unknown function [[Clostridium] sticklandii]
Length = 780
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 34/50 (68%), Gaps = 1/50 (2%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKG-NLYVADTLNLAIRKIGDAGVTTIAG 171
G+ DG+ EA F +G+T DD+ +LY+ADT N IRK+ + V+T+AG
Sbjct: 566 GYKDGESQEALFQSIRGITFDDRTKSLYIADTYNNRIRKVQNGIVSTVAG 615
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 4/84 (4%)
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN--VAGFRDGPSEDAKFSNDF 193
+P G+T D GN Y+A+ + I KI GV I G+ N G++DG S++A F +
Sbjct: 524 YPTGLTSDVNGNFYIAEQHH--ILKINSKGVVEIFAGRLNRDEYGYKDGESQEALFQSIR 581
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
+ + T SL + D N +R++
Sbjct: 582 GITFDDRTKSLYIADTYNNRIRKV 605
>gi|262199596|ref|YP_003270805.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
gi|262082943|gb|ACY18912.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
Length = 344
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 50/104 (48%), Gaps = 18/104 (17%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRG---RLVAGSFQGYTGHVDGKPNEAR 133
P + V E G L+ D N +V RG RLV GS G G DG +A
Sbjct: 34 PGHVAVDEHGTLWTADTGNHRLV-------MADRGGAVRLVVGS--GDPGWRDGDGADAA 84
Query: 134 FNHPKGVTMDDKGN-LYVADTLNLAIRKI-----GDAGVTTIAG 171
+ P+G+ + G LYVAD+ IR++ GDA VTT+AG
Sbjct: 85 LHAPRGIAVCASGRRLYVADSGTCTIRRVDLDAVGDATVTTVAG 128
>gi|347756512|ref|YP_004864075.1| Thiol-disulfide isomerase / thioredoxin [Candidatus
Chloracidobacterium thermophilum B]
gi|347589029|gb|AEP13558.1| Thiol-disulfide isomerase / thioredoxin [Candidatus
Chloracidobacterium thermophilum B]
Length = 676
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 55/119 (46%), Gaps = 7/119 (5%)
Query: 105 LSQYSRGRLVAGSFQG--YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG 162
+ QY+ V G + G G DG A F P G++ D K LYVAD+ A+R I
Sbjct: 350 IWQYNLDTGVVGPYAGTGAEGRQDGTLETAVFAQPSGLSTDGK-RLYVADSEISAVRAID 408
Query: 163 DAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
A VTT+AGG G +G E+A+F + V L V D N LR I L
Sbjct: 409 LATGQVTTLAGGDLFDFGDTNGKGENARFQHPLGVAAAER--KLYVADTYNHKLRTIDL 465
>gi|195641946|gb|ACG40441.1| hypothetical protein [Zea mays]
Length = 79
Score = 47.0 bits (110), Expect = 0.025, Method: Composition-based stats.
Identities = 22/34 (64%), Positives = 25/34 (73%)
Query: 165 GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 198
GVTTI GKS G DGPS +AKFSNDF+V Y+
Sbjct: 43 GVTTIDWGKSGRGGHVDGPSGEAKFSNDFEVHYI 76
>gi|334335815|ref|YP_004540967.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ Mal
allergen [Isoptericola variabilis 225]
gi|334106183|gb|AEG43073.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Isoptericola variabilis 225]
Length = 668
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 42/90 (46%), Gaps = 2/90 (2%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
DG L+ D S + +TP + R G GH DG ++A HP GV
Sbjct: 401 DGSLWLADAETSALRLVTPAADGQVQVRTAVGQGLFDFGHRDGAADQALLQHPLGVAALS 460
Query: 145 KGNLYVADTLNLAIRKIGDAG--VTTIAGG 172
G+ VADT N A+R+ A VTT+A G
Sbjct: 461 DGSAVVADTYNGALRRYDPASNEVTTLASG 490
>gi|406947628|gb|EKD78528.1| NHL repeat containing protein [uncultured bacterium]
Length = 810
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 48/97 (49%), Gaps = 5/97 (5%)
Query: 128 KPNEARFNHPKGVTMDDKGNL-YVADTLNLAIRKIG----DAGVTTIAGGKSNVAGFRDG 182
+ N A F+ P G+ +D G YVADT N IR++ + G T + G +G++D
Sbjct: 504 RKNCAYFSRPGGIAVDSNGKYAYVADTGNEVIRRVTLYGKNKGKTKLIAGSPQESGYQDA 563
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+DA+F+ V L V DR N +RQ+ +
Sbjct: 564 TRKDAQFNVPIAVTIDNADNYLYVADRDNHTIRQVRI 600
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 38/85 (44%), Gaps = 6/85 (7%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G + D N I KI S G + G G G + +F P V +D
Sbjct: 57 GSFYLADTFNGVIRKIN------SNGVVSTVVGAGGYGDTTGSGSSTKFALPAAVGLDSS 110
Query: 146 GNLYVADTLNLAIRKIGDAGVTTIA 170
GN+Y+ADT N I+K + V+TIA
Sbjct: 111 GNVYIADTGNGKIKKFNGSTVSTIA 135
>gi|298385841|ref|ZP_06995398.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|298261069|gb|EFI03936.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 432
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 53/108 (49%), Gaps = 9/108 (8%)
Query: 119 QGYTGHVDG-KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG------DAGVTTIAG 171
Q TG DG +EA F +G+T+D+ GN++V N IRKI VTTI
Sbjct: 326 QNNTGTTDGFLTDEAEFQFLRGLTIDEDGNIFVCQENNHVIRKIAYDEKMEKRYVTTIL- 384
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G + V G DG + A F+N D+ Y + R N ALR+ S+
Sbjct: 385 GTAGVKGDDDGSPDIALFANPQDISY-DGNGGFWIAQRENPALRKYSV 431
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 36/81 (44%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
+ + EDG +F E N I KI R G G DG P+ A F +P+
Sbjct: 348 LTIDEDGNIFVCQENNHVIRKIAYDEKMEKRYVTTILGTAGVKGDDDGSPDIALFANPQD 407
Query: 140 VTMDDKGNLYVADTLNLAIRK 160
++ D G ++A N A+RK
Sbjct: 408 ISYDGNGGFWIAQRENPALRK 428
>gi|430743901|ref|YP_007203030.1| thiol-disulfide isomerase-like thioredoxin [Singulisphaera
acidiphila DSM 18658]
gi|430015621|gb|AGA27335.1| thiol-disulfide isomerase-like thioredoxin [Singulisphaera
acidiphila DSM 18658]
Length = 696
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 71/151 (47%), Gaps = 13/151 (8%)
Query: 77 PYKIRVSEDG-ELFAVDEVNSNIVKITPPLSQY-SRGRLVAGSFQGYTGHVDGKPNEARF 134
P K+ E +LF D ++ IV L+ R +V GS G G VDG +A F
Sbjct: 229 PGKVLADEKSKQLFIADTAHNRIV-----LTDLDGRKSVVVGS--GGIGMVDGDYAKAEF 281
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSN 191
N P+G+ + D LYVADT N AIR I V+T+AG G+ G + S+
Sbjct: 282 NRPQGLCLVDD-TLYVADTENHAIRAIHLKTKQVSTVAGTGQQGHRRSGAGAGKATSLSS 340
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
+D+V + T +L + G + + + D
Sbjct: 341 PWDLVLIPGTKTLAIAMAGTHQIWRYDIPSD 371
>gi|152964340|ref|YP_001360124.1| fibronectin type III domain-containing protein [Kineococcus
radiotolerans SRS30216]
gi|151358857|gb|ABS01860.1| Fibronectin type III domain protein [Kineococcus radiotolerans
SRS30216]
Length = 841
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 45/94 (47%), Gaps = 7/94 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD-GKPNEARFN 135
P + V+ DG L+ D N+ + K+T + G L + G+ G G N++
Sbjct: 159 PTGVVVAPDGTLYVADSENNQVEKVT------ASGALTIFAGTGFAGSPQAGDANKSPLA 212
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTI 169
P GV +D GNL+VAD N + KI G ++
Sbjct: 213 SPTGVALDAAGNLHVADADNHVVEKITPTGTLSV 246
>gi|386384782|ref|ZP_10070129.1| redoxin domain-containing protein [Streptomyces tsukubaensis
NRRL18488]
gi|385667750|gb|EIF91146.1| redoxin domain-containing protein [Streptomyces tsukubaensis
NRRL18488]
Length = 591
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 48/96 (50%), Gaps = 3/96 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR 180
G VDG +EA F P G+ + G L++AD+ A+R IG D V + G G R
Sbjct: 320 GLVDGPADEAWFAQPSGLAVAADGERLWIADSETSALRWIGRDGSVHSAVGTGLFDFGHR 379
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
DG + A F + V + P S+ V D N ALR+
Sbjct: 380 DGDAAQALFQHPLGVTAL-PDGSVAVCDTYNHALRR 414
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 47/97 (48%), Gaps = 7/97 (7%)
Query: 77 PYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + V+ DGE L+ D S + I S +S G+ GH DG +A F
Sbjct: 334 PSGLAVAADGERLWIADSETSALRWIGRDGSVHS----AVGTGLFDFGHRDGDAAQALFQ 389
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
HP GVT G++ V DT N A+R+ A VTT+A
Sbjct: 390 HPLGVTALPDGSVAVCDTYNHALRRYDPATGQVTTLA 426
>gi|290991466|ref|XP_002678356.1| predicted protein [Naegleria gruberi]
gi|284091968|gb|EFC45612.1| predicted protein [Naegleria gruberi]
Length = 1017
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 56/109 (51%), Gaps = 15/109 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDG-KPNEA 132
P + VS GE+F D N I KI + G + +AG+ GY+G DG +
Sbjct: 878 PLNVAVSSSGEIFIADTNNHRIRKI------FLNGTITTIAGNGTAGYSG--DGLDSTKC 929
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR 180
+ N+P V + G +++ DT N IRKI D ++TIAG + +AGF
Sbjct: 930 QLNYPSAVAVSSGGEIFIVDTHNHRIRKIAIDGIISTIAG--NGIAGFN 976
Score = 46.2 bits (108), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 67/145 (46%), Gaps = 16/145 (11%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+PY I ++ GELF D++N I K++ + Y G + + ++
Sbjct: 717 LPYSISINALGELFISDQLNQRIRKVST--TNYITTIGGNGGIGFNGDGLSATSTQLKY- 773
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF--SNDF 193
P G++ +Y AD+LN +RKI + +TTIAG G DG + + + SN F
Sbjct: 774 -PFGISASST-EVYFADSLNSRVRKISNGKITTIAG------GIGDGLAATSAYLNSNSF 825
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
P+ ++ D N +R+IS
Sbjct: 826 TTT---PSGEFIIADSNNNLIRKIS 847
Score = 45.1 bits (105), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 71/144 (49%), Gaps = 11/144 (7%)
Query: 82 VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG-SFQGYTGHVDGKPNEARFNHPKGV 140
V D L+ VD +N I K++ + +AG G++G G +A+ N+P +
Sbjct: 492 VLNDQNLYIVDTLNHRIRKVSLTFGNITT---IAGIGTAGFSGD-GGLATKAKLNYPTHM 547
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVY 197
+ G ++++D N IRKI G ++TIAG + + GF +G + A F N + +
Sbjct: 548 AISASGEIFISDNGNQRIRKIATNGKISTIAG--NGIVGFSGDNGLATKATF-NSRNGIA 604
Query: 198 VRPTCSLLVIDRGNAALRQISLNQ 221
V + V D N +R+IS++
Sbjct: 605 VASNGDVYVADTRNHRIRKISVSN 628
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 64/133 (48%), Gaps = 7/133 (5%)
Query: 89 FAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNL 148
F + + N+N+++ +S +AG+ G + A+ N+P V + G +
Sbjct: 833 FIIADSNNNLIR---KISTSGIISTIAGTGAATFGGDNANATIAKLNNPLNVAVSSSGEI 889
Query: 149 YVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF-DVVYVRPTCSLLV 206
++ADT N IRKI G +TTIAG + AG+ + K ++ V V + +
Sbjct: 890 FIADTNNHRIRKIFLNGTITTIAG--NGTAGYSGDGLDSTKCQLNYPSAVAVSSGGEIFI 947
Query: 207 IDRGNAALRQISL 219
+D N +R+I++
Sbjct: 948 VDTHNHRIRKIAI 960
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 67/145 (46%), Gaps = 14/145 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ I S E++ D +NS + KI S G++ + G G DG + + +
Sbjct: 774 PFGISASST-EVYFADSLNSRVRKI-------SNGKIT--TIAGGIG--DGLAATSAYLN 821
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
T G +AD+ N IRKI +G ++TIAG + G + + AK +N +V
Sbjct: 822 SNSFTTTPSGEFIIADSNNNLIRKISTSGIISTIAGTGAATFGGDNANATIAKLNNPLNV 881
Query: 196 VYVRPTCSLLVIDRGNAALRQISLN 220
V + + + D N +R+I LN
Sbjct: 882 A-VSSSGEIFIADTNNHRIRKIFLN 905
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 4/94 (4%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG--GKSNVAGFRDGPSED 186
A N P GV + G++Y+AD N IRK+ + +TTIAG G++ + + + + +
Sbjct: 108 NAELNFPSGVAVHSNGDVYIADKSNHVIRKVSALNGKITTIAGIAGETELNKYSNSLATN 167
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
++ + T +++ D N +R++ LN
Sbjct: 168 TTLNSPQYLAVNSSTAEVIISDTNNNVIRKVYLN 201
>gi|440789641|gb|ELR10946.1| NHL repeat containing 2 isoform 3, putative [Acanthamoeba
castellanii str. Neff]
Length = 934
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 59/124 (47%), Gaps = 8/124 (6%)
Query: 77 PYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P+ + V +G+ +F D + I+ LS R GS TG VDG ARF+
Sbjct: 298 PHGVAVDREGDRMFVADSGHHRILV----LSLDGAFRTAIGSDDATTGLVDGDYATARFH 353
Query: 136 HPKGVTMDDKGNLYVADTLNLAIR--KIGDAGVTTIAG-GKSNVAGFRDGPSEDAKFSND 192
P G++ + LYVAD+ N IR ++ + V T+AG G+ G + + SN
Sbjct: 354 SPLGLSYAGEDKLYVADSENHCIRCVRLAEERVVTVAGTGQRGYERKGGGRALEWSLSNP 413
Query: 193 FDVV 196
+DV
Sbjct: 414 WDVA 417
>gi|383111216|ref|ZP_09932033.1| hypothetical protein BSGG_2316 [Bacteroides sp. D2]
gi|313694781|gb|EFS31616.1| hypothetical protein BSGG_2316 [Bacteroides sp. D2]
Length = 419
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 47/88 (53%), Gaps = 6/88 (6%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLS-QYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
+ + DG L+ D + I K+TP S YS+G L +AGS G +G DGK +A F
Sbjct: 331 FGVAAGSDGCLYLSDNTYNCIRKLTPDASGDYSKGTLETIAGS--GKSGFSDGKGLKATF 388
Query: 135 NHPKGVTM-DDKGNLYVADTLNLAIRKI 161
N P + + D +YVA +N IR+I
Sbjct: 389 NQPYEIIITKDCKTMYVAGAVNYLIRRI 416
Score = 38.1 bits (87), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 61/131 (46%), Gaps = 13/131 (9%)
Query: 98 IVKITPPLSQYS-RGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNL 156
+++I+P +Q + G V G Q Y G P A+ GV G LY++D
Sbjct: 292 LIQISPDGTQKTIAGDGVKG--QEYYDGDPGNPLSAKVGATFGVAAGSDGCLYLSDNTYN 349
Query: 157 AIRKI-----GD---AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVID 208
IRK+ GD + TIAG S +GF DG A F+ ++++ + ++ V
Sbjct: 350 CIRKLTPDASGDYSKGTLETIAG--SGKSGFSDGKGLKATFNQPYEIIITKDCKTMYVAG 407
Query: 209 RGNAALRQISL 219
N +R+I++
Sbjct: 408 AVNYLIRRITV 418
>gi|297848174|ref|XP_002891968.1| hypothetical protein ARALYDRAFT_892829 [Arabidopsis lyrata subsp.
lyrata]
gi|297337810|gb|EFH68227.1| hypothetical protein ARALYDRAFT_892829 [Arabidopsis lyrata subsp.
lyrata]
Length = 1058
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 6/106 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFR 180
G DG E HP GV G +Y+ D+ N I+K+ V T+AG + AGF+
Sbjct: 794 GDNDGVGAEVLLQHPLGVLCAKDGQIYLTDSYNHKIKKLDPITKRVVTVAG--TGKAGFK 851
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ-DDCE 225
DG + A+ S + + L V D N+ +R I LN+ +D E
Sbjct: 852 DGKVKGAQLSEPAGLA-ITENGRLFVADTNNSLIRYIDLNKGEDAE 896
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 4/82 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNV 176
G G DG +A FN P+G+ + K N LYVADT N A+R+I + V T+AG +
Sbjct: 611 GEEGFQDGSFEDAAFNRPQGLAYNAKKNLLYVADTENHALREIDFVNERVRTLAGNGTKG 670
Query: 177 AGFRDGPSEDAKFSND-FDVVY 197
+ ++ G + N +DV +
Sbjct: 671 SDYQGGRKGTKQLLNSPWDVCF 692
>gi|421872066|ref|ZP_16303685.1| NHL repeat family protein [Brevibacillus laterosporus GI-9]
gi|372458678|emb|CCF13234.1| NHL repeat family protein [Brevibacillus laterosporus GI-9]
Length = 633
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 65/143 (45%), Gaps = 22/143 (15%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
+VP ++ + G ++ D VN I K T + G VA G G++ G F
Sbjct: 403 IVPRQVAIDAYGFVYVADSVNHRIQKFT------NTGVFVAT--YGSMGYLSG-----FF 449
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P GV +D KGN++VADTLN I+K + G+ ++D +F+
Sbjct: 450 QFPAGVAVDSKGNIFVADTLNHRIQKFNPFFIYMTEWGQKG--------TKDGQFNQPMQ 501
Query: 195 VVYVRPTCSLLVIDRGNAALRQI 217
+ + ++ V+DR N +++
Sbjct: 502 LA-IDSKDNIYVVDRNNHRIQKF 523
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 42/105 (40%), Gaps = 13/105 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G +F D +N I K P + G G DG+ FN
Sbjct: 452 PAGVAVDSKGNIFVADTLNHRIQKFNPFFIYMTE--------WGQKGTKDGQ-----FNQ 498
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD 181
P + +D K N+YV D N I+K ++G G + AG D
Sbjct: 499 PMQLAIDSKDNIYVVDRNNHRIQKFDNSGRFLAKWGTNGGAGATD 543
>gi|290995458|ref|XP_002680312.1| predicted protein [Naegleria gruberi]
gi|284093932|gb|EFC47568.1| predicted protein [Naegleria gruberi]
Length = 744
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 71/148 (47%), Gaps = 12/148 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQ-GYTGHVDGKPNEAR 133
P + VS +GE++ D N I K+ + G++V AG+ Q G++G +G A
Sbjct: 32 PSSVFVSSNGEVYIADIGNHKIRKVL------TNGKMVTIAGNGQNGFSGD-NGPATNAM 84
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSND 192
N+P GV + +Y+AD N IRKI G + TIAG N G + + +A N
Sbjct: 85 LNYPIGVFVSPYNEVYIADYFNNRIRKILTNGNIITIAGNNVNGKGVYNRLAINAML-NR 143
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLN 220
V+V + + D+ +R+I N
Sbjct: 144 PSGVFVTLNDEVYIADQNRNKIRKIESN 171
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 80/186 (43%), Gaps = 15/186 (8%)
Query: 42 TRSSSKAPQADGNVLQF----ENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSN 97
R+ + +++GN++ + GY + V N + P + V+ + E++ D N
Sbjct: 161 NRNKIRKIESNGNIVTIAGNGKEGYGDDNVPAVNAMLNRPSSVFVTSNNEVYIADTGNHR 220
Query: 98 IVKITPPLSQYSRGRLVAGSFQGYTG-HVDG-KPNEARFNHPKGVTMDDKGNLYVADTLN 155
I KI G +V + G G H D N PK V + G +Y+ADT N
Sbjct: 221 IRKIL------RNGNIVTIAGNGLVGFHEDNILATNTSLNFPKSVFVSSNGEVYIADTGN 274
Query: 156 LAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
IRKI + + TIAG G + P+ D N V+V + + D+ N +
Sbjct: 275 HRIRKILANGNIATIAGNGKEGYGDDNVPAIDTML-NRPSSVFVSSN-EVYIADQSNQRI 332
Query: 215 RQISLN 220
R+I N
Sbjct: 333 RKILQN 338
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 43/90 (47%), Gaps = 2/90 (2%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
A F P V + G +Y+AD N IRK+ G + TIAG N +GP+ +A +
Sbjct: 27 ANFKGPSSVFVSSNGEVYIADIGNHKIRKVLTNGKMVTIAGNGQNGFSGDNGPATNAMLN 86
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
V+V P + + D N +R+I N
Sbjct: 87 YPIG-VFVSPYNEVYIADYFNNRIRKILTN 115
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 72/174 (41%), Gaps = 23/174 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQGYTGHVDGKPNEARF 134
P + VS E++ D N+ I KI + G ++ AG+ G + A
Sbjct: 88 PIGVFVSPYNEVYIADYFNNRIRKIL------TNGNIITIAGNNVNGKGVYNRLAINAML 141
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
N P GV + +Y+AD IRKI G + TIAG G + P+ +A N
Sbjct: 142 NRPSGVFVTLNDEVYIADQNRNKIRKIESNGNIVTIAGNGKEGYGDDNVPAVNAML-NRP 200
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGY 247
V+V + + D GN +R+I N +I+ + G LVG+
Sbjct: 201 SSVFVTSNNEVYIADTGNHRIRKILRN-------------GNIVTIAGNGLVGF 241
>gi|219851969|ref|YP_002466401.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546228|gb|ACL16678.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 676
Score = 46.6 bits (109), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%), Gaps = 15/90 (16%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK-PNEARFN 135
P + V DG ++ VD N I K G+T G P + +FN
Sbjct: 47 PSGVAVDSDGNIYVVDTNNFRIQKFNAT--------------GGFTTQWGGSGPGDGQFN 92
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
+P+GV +D+ GN+Y+AD N I+K +G
Sbjct: 93 NPEGVAVDNNGNVYIADRDNNRIQKFNSSG 122
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 54/112 (48%), Gaps = 16/112 (14%)
Query: 108 YSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGV 166
++ G V + G +G DG+ FN P GV +D GN+YV DT N I+K G
Sbjct: 23 WAEGGYVYTTQWGSSGSGDGQ-----FNQPSGVAVDSDGNIYVVDTNNFRIQKFNATGGF 77
Query: 167 TTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
TT GG GP D +F+N + V V ++ + DR N +++ +
Sbjct: 78 TTQWGGS--------GPG-DGQFNNP-EGVAVDNNGNVYIADRDNNRIQKFN 119
Score = 41.6 bits (96), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 60/142 (42%), Gaps = 21/142 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V ++ VD N I K S G + G G GH DG+ F
Sbjct: 235 PTGVAVDSVNNVYVVDTGNDRIQKFNS-----SGGFITTG---GSFGHGDGQ-----FWS 281
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+G+T D N+YV DTLN I+K G G + S D +FS DV
Sbjct: 282 PEGITADSANNVYVVDTLNDRIQKFNATGGFITKWGSAL-------GSFDGQFSGLSDVA 334
Query: 197 YVRPTCSLLVIDRGNAALRQIS 218
V T ++ V + GN +++ +
Sbjct: 335 -VDSTGNVYVAESGNCRIQKFN 355
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 40/89 (44%), Gaps = 13/89 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V +G ++ D N+ I K S G L+ G G DG+ FN
Sbjct: 94 PEGVAVDNNGNVYIADRDNNRIQKFNS-----SGGFLMK---WGSIGSGDGQ-----FNQ 140
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
P GV +D GN+YV D N I+K +G
Sbjct: 141 PSGVALDSAGNVYVTDKQNNRIQKFNSSG 169
>gi|168024767|ref|XP_001764907.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683943|gb|EDQ70349.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1040
Score = 46.6 bits (109), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 54/105 (51%), Gaps = 11/105 (10%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLN------LAIRKIGDAG--VTTIAGGKS 174
G DG + A+ HP GV +G +YVAD+ N L I+ + + VTT+AG +
Sbjct: 767 GDRDGVGSNAQLQHPLGVLYSSEGLVYVADSYNHKAISCLQIKVLNPSTQKVTTLAG--T 824
Query: 175 NVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
AGF+DG +++ + S + + P L V D N+ +R + L
Sbjct: 825 GTAGFKDGTAQEGQLSEPAGIA-LGPDGKLYVADTNNSLIRVLDL 868
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 9/88 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIV-----KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE 131
P + S +G ++ D N + K+ P +Q + +AG+ G G DG E
Sbjct: 781 PLGVLYSSEGLVYVADSYNHKAISCLQIKVLNPSTQ--KVTTLAGT--GTAGFKDGTAQE 836
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIR 159
+ + P G+ + G LYVADT N IR
Sbjct: 837 GQLSEPAGIALGPDGKLYVADTNNSLIR 864
>gi|380300595|ref|ZP_09850288.1| NHL repeat protein [Brachybacterium squillarum M-6-3]
Length = 644
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 40/85 (47%), Gaps = 3/85 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I DG L D +S I + P + S LV + GHVDG AR H
Sbjct: 373 PSGIDELPDGRLVVADSESSAIRVLDPQTMEVST--LVGQGLFDF-GHVDGPAEVARLQH 429
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P GVT+ G + VADT N AIR +
Sbjct: 430 PLGVTVLPDGRIAVADTYNGAIRIV 454
>gi|322435234|ref|YP_004217446.1| hypothetical protein AciX9_1613 [Granulicella tundricola MP5ACTX9]
gi|321162961|gb|ADW68666.1| NHL repeat containing protein [Granulicella tundricola MP5ACTX9]
Length = 781
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 14/103 (13%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGK-PN 130
++P + G L+ + I++++P G L VAG+ QGY G DG P
Sbjct: 23 LLPSGLVYDAQGNLYLAETAAHLILRVSP------SGALTIVAGTGTQGYAG--DGTLPT 74
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG 171
+A + P + + G+LY+ADT N AIR+I A +TT+AG
Sbjct: 75 QALLDSPTALAITPTGDLYLADTHNHAIRRIDAATQIITTVAG 117
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 66/145 (45%), Gaps = 10/145 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARFN 135
P I + L+ D N I ++ + + L QG++G D P A+ +
Sbjct: 138 PTAIALDSSQNLYIADTRNHIIRRVD--ATTHLITTLAGTGTQGFSG--DAGPALAAQID 193
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRDGPSEDAKFSND 192
P G+ +D NLY+ADT N IR+I DA +TTIAG + + + A
Sbjct: 194 TPTGLALDASNNLYLADTHNHRIRRI-DAVTHIITTIAGNGTPAFTSDNIAATSATLYLP 252
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQI 217
+ + P+ +LL+ D N +R+I
Sbjct: 253 RGIT-LDPSGNLLIADSANHRIRRI 276
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 37/81 (45%)
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ D +GNLY+A+T I ++ +G TI G DG + +
Sbjct: 25 PSGLVYDAQGNLYLAETAAHLILRVSPSGALTIVAGTGTQGYAGDGTLPTQALLDSPTAL 84
Query: 197 YVRPTCSLLVIDRGNAALRQI 217
+ PT L + D N A+R+I
Sbjct: 85 AITPTGDLYLADTHNHAIRRI 105
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 42/91 (46%), Gaps = 5/91 (5%)
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRDGPSED 186
A P+G+T+D GNL +AD+ N IR+I DA +TT+AG + P+
Sbjct: 245 TSATLYLPRGITLDPSGNLLIADSANHRIRRI-DAVTGLITTLAGDGTQTYAGDSTPATT 303
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
A + + P + D N +RQI
Sbjct: 304 ASLDTP-RAIALSPATLPTLTDTANQRIRQI 333
>gi|290995072|ref|XP_002680155.1| predicted protein [Naegleria gruberi]
gi|284093775|gb|EFC47411.1| predicted protein [Naegleria gruberi]
Length = 1951
Score = 46.2 bits (108), Expect = 0.033, Method: Composition-based stats.
Identities = 48/171 (28%), Positives = 77/171 (45%), Gaps = 26/171 (15%)
Query: 77 PYKIRVSEDGELFA--VDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDG----- 127
PY + +S E++ +D N+ I KIT +S + G + T +++
Sbjct: 92 PYGVAISSSDEIYIADMDMFNNRIRKITTSGIISTIAGG--IGDGLSATTAYINAITGDG 149
Query: 128 -KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPS 184
A+ +P GV + +YVAD N IRKI +G ++TIAG G S +G DG S
Sbjct: 150 SSATSAQLYNPYGVAISSSDEIYVADYNNNRIRKITTSGIISTIAGTGTSGYSG--DGSS 207
Query: 185 E-DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS--------LNQDDCEY 226
A+ N + V + + + + D N +R++S L CEY
Sbjct: 208 AISAQLYNPYGVA-ISSSDEIYITDTNNNRIRKLSPWCTGNAILLDGSCEY 257
>gi|410938688|ref|ZP_11370531.1| putative lipoprotein [Leptospira noguchii str. 2006001870]
gi|410786234|gb|EKR75182.1| putative lipoprotein [Leptospira noguchii str. 2006001870]
Length = 358
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 51/96 (53%), Gaps = 5/96 (5%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGP 183
+G + FN P + +D + NLYV + N IRKI V+T++GG ++G+ DG
Sbjct: 169 NGDRLNSLFNGPFFMDIDPERNLYVGELGNHTIRKIDLNSGNVSTLSGG---ISGYLDGD 225
Query: 184 SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
A+F + + Y T SLL D + +R+I+L
Sbjct: 226 LASARFKSPLGIAYNHKTDSLLAADIQDHRIRKINL 261
>gi|298388124|ref|ZP_06997668.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|298259116|gb|EFI01996.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 435
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 40/78 (51%), Gaps = 6/78 (7%)
Query: 84 EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMD 143
EDG L+ D N+ I KITP G + + G V+G P ++ F P+G+T
Sbjct: 361 EDGNLYVSDRDNNCIRKITPD------GNVTIYAGNRSEGLVNGLPLKSSFRRPEGLTRS 414
Query: 144 DKGNLYVADTLNLAIRKI 161
G +YVAD N IRKI
Sbjct: 415 KDGVIYVADHDNHVIRKI 432
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 83 SEDGELFAVDEVNSNIV--KITPPLSQYSRG--RLVAGSFQGYTGHVDGKPNEARFNHPK 138
S+DG FA N +I+ + P+ G L AG ++ +G EARFN P
Sbjct: 297 SKDGRTFARIIRNKHIIYKRTYDPVKHEFVGDETLWAGKYEK-AEFANGIGEEARFNQPC 355
Query: 139 GVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKS 174
D+ GNLYV+D N IRKI D VT AG +S
Sbjct: 356 QGVFDEDGNLYVSDRDNNCIRKITPDGNVTIYAGNRS 392
>gi|374988208|ref|YP_004963703.1| hypothetical protein SBI_05452 [Streptomyces bingchenggensis BCW-1]
gi|297158860|gb|ADI08572.1| hypothetical protein SBI_05452 [Streptomyces bingchenggensis BCW-1]
Length = 609
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 46/82 (56%), Gaps = 4/82 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNV 176
G G VDG + ARF+ P+G+ + G + VADT+N A+R + A V T+AG G+
Sbjct: 214 GERGLVDGPADRARFSEPQGLALLPDGTVIVADTVNHALRSLDLATGEVRTVAGTGRQWW 273
Query: 177 AGF-RDGPSEDAKFSNDFDVVY 197
G GP+ + S+ +DV +
Sbjct: 274 QGSPAAGPAREVDLSSPWDVAW 295
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
GH DG ++A F HP GVT G++ V+DT N A+R+ A V+T+A
Sbjct: 392 GHRDGHADQALFQHPLGVTALPDGSVAVSDTYNHALRRFDPASGEVSTLA 441
>gi|290977893|ref|XP_002671671.1| predicted protein [Naegleria gruberi]
gi|284085242|gb|EFC38927.1| predicted protein [Naegleria gruberi]
Length = 3330
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 55/121 (45%), Gaps = 5/121 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VSE G+++ D N I I P + G G++G +G A N+
Sbjct: 2385 PSGVFVSEFGKIYISDSGNHRIRAILPNGIISTVG---GNGIPGFSGD-NGLATNASLNN 2440
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ G+L ++D+ N IR I G+ T G GF+DG DA F+N +
Sbjct: 2441 PYGIVETYSGDLIISDSDNNRIRLIDRYGIITTLAGTGE-KGFQDGFFLDALFNNPSQLF 2499
Query: 197 Y 197
Y
Sbjct: 2500 Y 2500
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 47/89 (52%), Gaps = 4/89 (4%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAK 188
EA N P GV + + G +Y++D+ N IR I G+ + GG + + GF +G + +A
Sbjct: 2379 EASLNSPSGVFVSEFGKIYISDSGNHRIRAILPNGIISTVGG-NGIPGFSGDNGLATNAS 2437
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+N + +V + L++ D N +R I
Sbjct: 2438 LNNPYGIVETY-SGDLIISDSDNNRIRLI 2465
>gi|357400623|ref|YP_004912548.1| hypothetical protein SCAT_3039 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|386356676|ref|YP_006054922.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|337767032|emb|CCB75743.1| conserved protein of unknown function [Streptomyces cattleya NRRL
8057 = DSM 46488]
gi|365807185|gb|AEW95401.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
Length = 613
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 64/126 (50%), Gaps = 10/126 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K + G L D +V++ P R R+ GS G G DG + ARF+
Sbjct: 184 PGKALLLPTGNLLVSDSTRHALVELAPDGETVVR-RI--GS--GERGLADGPADRARFSE 238
Query: 137 PKGVT-MDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNVAGF-RDGPSEDAKFSN 191
P+G+ + D ++ VADT+N A+R++ A VTT+AG G+ G GP+ + S+
Sbjct: 239 PQGLALLPDGESVIVADTVNHALRRVDLASGEVTTVAGTGRQWWQGSPVAGPAREVALSS 298
Query: 192 DFDVVY 197
+DV +
Sbjct: 299 PWDVAW 304
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 45/91 (49%), Gaps = 2/91 (2%)
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAK 188
P + RF P + GNL V+D+ A+ ++ G T + S G DGP++ A+
Sbjct: 178 PTDLRF--PGKALLLPTGNLLVSDSTRHALVELAPDGETVVRRIGSGERGLADGPADRAR 235
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
FS + + S++V D N ALR++ L
Sbjct: 236 FSEPQGLALLPDGESVIVADTVNHALRRVDL 266
>gi|147865073|emb|CAN83232.1| hypothetical protein VITISV_021876 [Vitis vinifera]
Length = 1694
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 165 GVTTIAGGKSNVA-GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDR 209
G IA GK G DGP +D K SNDFD VY+ +CSL+VID+
Sbjct: 1646 GGYNIARGKWGRGEGHVDGPIQDTKVSNDFDKVYIGSSCSLMVIDK 1691
>gi|290987128|ref|XP_002676275.1| predicted protein [Naegleria gruberi]
gi|284089876|gb|EFC43531.1| predicted protein [Naegleria gruberi]
Length = 1182
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 57/110 (51%), Gaps = 6/110 (5%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
L+AGS GY G N A F +G+ + G ++++D N +RKI +TT+AG
Sbjct: 315 LIAGSSYGYAGDNQLASN-AMFRVTRGIAISSSGEIFISDGGNYRVRKINTNNIITTVAG 373
Query: 172 GKSNVAGFRDGPSEDAKFSNDF--DVVYVRPTCSLLVIDRGNAALRQISL 219
SN+ +G + +A + F + V V P ++V D N LR+ISL
Sbjct: 374 TGSNLG--YNGDNIEAVKAKLFGPESVAVAPNNEVIVADTRNYRLRKISL 421
>gi|339008739|ref|ZP_08641312.1| cell surface protein [Brevibacillus laterosporus LMG 15441]
gi|338774539|gb|EGP34069.1| cell surface protein [Brevibacillus laterosporus LMG 15441]
Length = 633
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 44/87 (50%), Gaps = 13/87 (14%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
+VP ++ + G ++ D VN I K T + G VA G G++ G F
Sbjct: 403 IVPRQVAIDAYGFVYVADSVNHRIQKFT------NTGVFVAT--YGSMGYLSG-----FF 449
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI 161
P GV +D KGN++VADTLN I+K
Sbjct: 450 QFPAGVAVDSKGNIFVADTLNHRIQKF 476
>gi|325110177|ref|YP_004271245.1| hypothetical protein Plabr_3626 [Planctomyces brasiliensis DSM
5305]
gi|324970445|gb|ADY61223.1| NHL repeat containing protein [Planctomyces brasiliensis DSM 5305]
Length = 370
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 7/87 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKP-NEARF 134
PY+IR + G +F V+ +N+ + ++ S VAG+ + G++G DGK EA F
Sbjct: 115 PYEIRFDKQGNMFFVEMLNNIVRRVDAKTGVIS---TVAGTGEKGFSG--DGKAATEATF 169
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI 161
N P + D G+LY+ D N +R +
Sbjct: 170 NRPHSICFDSAGHLYICDIGNHRVRVV 196
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 93/220 (42%), Gaps = 35/220 (15%)
Query: 4 THLLLTLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYL 63
T L LL F LQ A G +I L T + A +A Q E
Sbjct: 6 TGRFLPAILLCLFVLQLPATVQAEGKVITILGDGKAGYTGDNGSADEA-----QCEQ--- 57
Query: 64 VETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYT 122
P+ + + DG L+ + N I +I ++ + VAG+ + GY+
Sbjct: 58 -------------PFGLVIGPDGALYVCETANHVIRRIDLKTNKVT---TVAGTGEKGYS 101
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAG-GKSNVAG 178
G G EA+ + P + D +GN++ + LN +R++ DA ++T+AG G+ +G
Sbjct: 102 GD-GGSALEAKLDEPYEIRFDKQGNMFFVEMLNNIVRRV-DAKTGVISTVAGTGEKGFSG 159
Query: 179 FRDG-PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
DG + +A F+ + + L + D GN +R +
Sbjct: 160 --DGKAATEATFNRPHSICF-DSAGHLYICDIGNHRVRVV 196
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 3/92 (3%)
Query: 126 DGKPNE-ARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGP 183
DG+P AR PKG+++D +GN+++ADT + IR+I G G+ T G DG
Sbjct: 274 DGRPARLARLAGPKGISVDKQGNIFLADTESHTIRRIDGQTGIITTLVGNGKKGDGPDGN 333
Query: 184 SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALR 215
+ K N V+V + + D N +R
Sbjct: 334 PLECKL-NRPHGVFVAADGKVYIGDSSNHRVR 364
>gi|302520584|ref|ZP_07272926.1| NHL repeat containing protein [Streptomyces sp. SPB78]
gi|302429479|gb|EFL01295.1| NHL repeat containing protein [Streptomyces sp. SPB78]
Length = 634
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 46/82 (56%), Gaps = 4/82 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNV 176
G G DG ARFN P+G+ + G + VADT+N A+R++ A V+T+AG G+
Sbjct: 221 GERGLRDGDAAGARFNEPQGLALLPDGAVVVADTVNHALRRVDPASGEVSTLAGTGRQWW 280
Query: 177 AGF-RDGPSEDAKFSNDFDVVY 197
G GP+ + S+ +DV +
Sbjct: 281 QGEPVAGPAREVSLSSPWDVAW 302
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 5/96 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS DGE + + ++ V+ + + R G+ GH DG +A H
Sbjct: 351 PSGLAVSLDGERLWIADAETSAVRW---IDREGTVRTAVGTGLFDFGHRDGAAGQALLQH 407
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
P GVT G++ VADT N A+R+ A VTT+A
Sbjct: 408 PLGVTALPDGSVAVADTYNHALRRYDPASEEVTTLA 443
>gi|160942444|ref|ZP_02089750.1| hypothetical protein CLOBOL_07327 [Clostridium bolteae ATCC
BAA-613]
gi|158434643|gb|EDP12410.1| hypothetical protein CLOBOL_07327 [Clostridium bolteae ATCC
BAA-613]
Length = 329
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 43/85 (50%), Gaps = 7/85 (8%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
+G L+ D N+ IVK +S G G G DG ++A F+ P+ +T +
Sbjct: 212 EGALYVADTGNNRIVKAMDGAVVWSAG-------TGEDGFADGPVSQAMFSGPQRITAAE 264
Query: 145 KGNLYVADTLNLAIRKIGDAGVTTI 169
G LYV+DT N +RKI V+T+
Sbjct: 265 DGALYVSDTGNSVVRKIWGDNVSTL 289
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 40/84 (47%), Gaps = 4/84 (4%)
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
+ P G+ + G LYVADT N I K D V AG + GF DGP A FS
Sbjct: 202 LSAPTGICFYE-GALYVADTGNNRIVKAMDGAVVWSAGTGED--GFADGPVSQAMFSGPQ 258
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
+ +L V D GN+ +R+I
Sbjct: 259 RITAAEDG-ALYVSDTGNSVVRKI 281
>gi|378548897|ref|ZP_09824113.1| hypothetical protein CCH26_02380 [Citricoccus sp. CH26A]
Length = 708
Score = 46.2 bits (108), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 54/117 (46%), Gaps = 12/117 (10%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG----- 165
G L + G G DG P A F G+ G ++VAD+ + A+R+I A
Sbjct: 390 GDLAVYAGTGLEGLEDGDPTVAWFAQSSGLAESADGTIWVADSESSALRRISPADPAAAD 449
Query: 166 ------VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
V++ G GFRDG ++ A+F + V + P S+LV D N A+R+
Sbjct: 450 GALSRRVSSAVGTGLFDFGFRDGAADQARFQHPLGVAAL-PDGSVLVADTYNGAIRR 505
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 46/93 (49%), Gaps = 10/93 (10%)
Query: 83 SEDGELFAVDEVNSNIVKITPP--------LSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
S DG ++ D +S + +I+P LS+ + G F G DG ++ARF
Sbjct: 422 SADGTIWVADSESSALRRISPADPAAADGALSRRVSSAVGTGLFD--FGFRDGAADQARF 479
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT 167
HP GV G++ VADT N AIR+ AGV
Sbjct: 480 QHPLGVAALPDGSVLVADTYNGAIRRYAPAGVA 512
>gi|380692874|ref|ZP_09857733.1| hypothetical protein BfaeM_02721 [Bacteroides faecis MAJ27]
Length = 429
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 45/83 (54%), Gaps = 6/83 (7%)
Query: 85 DGELFA---VDEVNSNIVKITPP-LSQYSRGRLVAGSFQGYTGHVDGKP-NEARFNHPKG 139
DG+++ VD+V + +ITP + GR + YTG++DG P +EARFN P+
Sbjct: 344 DGDMYDFCFVDQVGHCMWRITPDRICSIIAGRSNYTADGKYTGYIDGDPLHEARFNWPRA 403
Query: 140 VTMD-DKGNLYVADTLNLAIRKI 161
T D D+ Y+ D N IR +
Sbjct: 404 CTYDADEETFYLVDNGNHCIRYL 426
>gi|195164496|ref|XP_002023083.1| GL21149 [Drosophila persimilis]
gi|194105168|gb|EDW27211.1| GL21149 [Drosophila persimilis]
Length = 595
Score = 46.2 bits (108), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 63/130 (48%), Gaps = 17/130 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP-LSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P KI S G D N+ ++ ++ + ++ G L AG VDGK +RFN
Sbjct: 228 PAKIARSSLGRYAIADAGNNRVLVVSADGVVEHKIGGLQAG-------FVDGKLALSRFN 280
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG----GKSNVAGFRDGPSEDAKF 189
P+GV ++ L VADT N A+RKI + V T+AG G + G R GP +
Sbjct: 281 SPQGVAFLNESTLIVADTENHALRKISLENELVETLAGTGHQGNERIGG-RVGPLQ--AL 337
Query: 190 SNDFDVVYVR 199
S+ +D+ R
Sbjct: 338 SSPWDLAVFR 347
>gi|433645639|ref|YP_007290641.1| NHL repeat protein [Mycobacterium smegmatis JS623]
gi|433295416|gb|AGB21236.1| NHL repeat protein [Mycobacterium smegmatis JS623]
Length = 293
Score = 46.2 bits (108), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 72/180 (40%), Gaps = 46/180 (25%)
Query: 8 LTLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQA---DGNVLQFENGYL- 63
+T+ +++ +Q P+ P I H + ++ S PQ DGN N Y+
Sbjct: 13 VTVAAIVSSAMQHSETTLPSPPSIAHPSGPIELP-FGSLHGPQGVAVDGN----GNVYIA 67
Query: 64 -VETVIEGNEI----------------GVVPYKIRVSEDGELFAVDEVNSNIVKITPPLS 106
V T I N++ G P + V G L+A D VN ++K+ P S
Sbjct: 68 DVATDIADNQVLKMSAAHTGAAKLPFTGAGPVSVAVDSTGTLYATDYVNDRVLKLAPGAS 127
Query: 107 QYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDA 164
+ + A N P G+ +D GN+YVADT N + KI GDA
Sbjct: 128 NTTELSI------------------AGLNRPSGIAVDGGGNVYVADTYNDRVLKISGGDA 169
>gi|406832436|ref|ZP_11092030.1| ABC transporter [Schlesneria paludicola DSM 18645]
Length = 635
Score = 46.2 bits (108), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 50/105 (47%), Gaps = 13/105 (12%)
Query: 59 ENGYLVETVIEGNEIGVV--PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG 116
E G L+ + + G V PY I V+ GELF V+ + K T +G+L+
Sbjct: 540 EEGQLLRIMGDSENAGAVHYPYDITVNPRGELFVVEYGAGRVTKFT------RQGKLL-- 591
Query: 117 SFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
G GH A+F+ P G+T+D + LYV DT N + +I
Sbjct: 592 ---GRYGHSGPGQTLAQFSTPWGLTIDRRNRLYVCDTGNRRVVEI 633
>gi|406836139|ref|ZP_11095733.1| NHL repeat containing protein [Schlesneria paludicola DSM 18645]
Length = 365
Score = 46.2 bits (108), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 72/149 (48%), Gaps = 13/149 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ--GYTGHVDGKPN-EAR 133
P+ + ++ DG L+ D N I ++ + VAG+ + G+ G DG P A+
Sbjct: 54 PFGLEIAPDGMLYFCDFTNHVIRRMDLKTGFLT---TVAGTPRNPGFAG--DGGPALRAK 108
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRDGPSEDAKFS 190
F+ P + D GN Y++D + IR+I DA +TT+AG GP+ A+F+
Sbjct: 109 FHEPHEIRFDRNGNYYISDMKSDVIRRI-DAKTQIITTVAGTAKPGFTGDGGPATKAEFN 167
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISL 219
N V + LL+ D N +RQ+ L
Sbjct: 168 NPI-AVSLDGDARLLICDIKNHRVRQVDL 195
>gi|336180277|ref|YP_004585652.1| serine/threonine protein kinase [Frankia symbiont of Datisca
glomerata]
gi|334861257|gb|AEH11731.1| serine/threonine protein kinase [Frankia symbiont of Datisca
glomerata]
Length = 852
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 68/147 (46%), Gaps = 10/147 (6%)
Query: 77 PYKIRVSEDGELF-AVDEVNSNIVKITPPLSQYSRGRLVAGS---FQGYTGHVDGKPNEA 132
P + V G ++ A + I +IT + +VAGS G+TG+ +G A
Sbjct: 570 PSSVAVDTTGAIYVATSTSDGQIFRIT----RDGGISVVAGSGPELDGFTGN-NGAATAA 624
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
+ P G+ +DD G++ ++ + + + +AG ++ +GP+ DA S
Sbjct: 625 ELSQPHGIAVDDNGDILFSEGSRVRKVTVATGRIAAVAGSSTSGTSGDNGPAADALLSVP 684
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISL 219
DVV R S+ ++D +R+ISL
Sbjct: 685 TDVVIAR-DGSIYILDGEAETVRKISL 710
>gi|149176433|ref|ZP_01855047.1| serine/threonine protein kinase [Planctomyces maris DSM 8797]
gi|148844785|gb|EDL59134.1| serine/threonine protein kinase [Planctomyces maris DSM 8797]
Length = 368
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 75/149 (50%), Gaps = 14/149 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVK-ITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARF 134
PY + + DG L+ V E+ S++++ I + S VAGS + GY+G G A
Sbjct: 50 PYGLTLGPDGALY-VCEIKSHVIRRIDEKTGKIS---TVAGSGKKGYSGD-SGLALAAEL 104
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDA---GVTTIAG-GKSNVAGFRDGPSEDAKFS 190
N P V D GN+Y + +N +R++ DA ++T+AG G+ +G GP+ A FS
Sbjct: 105 NEPYEVRFDKAGNMYFVEMINNIVRRV-DAKTGKISTVAGTGEKGFSG-DGGPATSATFS 162
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+ +L + D GN +R++ L
Sbjct: 163 RPHSIALDNDN-NLYICDIGNHRIRRVDL 190
>gi|219852041|ref|YP_002466473.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546300|gb|ACL16750.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 930
Score = 45.8 bits (107), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 70/168 (41%), Gaps = 25/168 (14%)
Query: 55 VLQFENGYLVETVI----EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSR 110
V+ E GY T G+E P + V G ++ D N+ I K T S
Sbjct: 22 VVSAEGGYAYATQWGSSGSGDEQFSSPSGVAVDSVGNVYVADVGNNRIQKFT------ST 75
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA 170
G + G +G DG +F+ P GV +D GN+YVADT N I+K G+
Sbjct: 76 GTFIKK--WGSSGSGDG-----QFSSPSGVAVDSAGNVYVADTGNNRIQKFTSMGIFIKQ 128
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G S G FS+ F V V ++ V D GN +++ +
Sbjct: 129 WGSS-------GSGNGQFFSSPFGVA-VDNAGNVYVADTGNNRIQKFT 168
>gi|336403610|ref|ZP_08584322.1| hypothetical protein HMPREF0127_01635 [Bacteroides sp. 1_1_30]
gi|335945536|gb|EGN07347.1| hypothetical protein HMPREF0127_01635 [Bacteroides sp. 1_1_30]
Length = 419
Score = 45.8 bits (107), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 47/88 (53%), Gaps = 6/88 (6%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLS-QYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
+ + DG L+ D + I K+TP + YS+G L +AGS G G DGK +A F
Sbjct: 331 FGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETIAGS--GKAGFSDGKGLKATF 388
Query: 135 NHPKGVTM-DDKGNLYVADTLNLAIRKI 161
N P + + +D +YVA +N IR+I
Sbjct: 389 NQPYEIIITEDCKTMYVAGAVNYLIRRI 416
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 17/133 (12%)
Query: 98 IVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE---ARFNHPKGVTMDDKGNLYVADTL 154
+++I+P +Q + +AG + DG+P A+ GV G LY++D
Sbjct: 292 LIQISPDGTQ----KTIAGDGVKGQEYYDGEPGNPLSAKVGATFGVAAGSDGCLYLSDNT 347
Query: 155 NLAIRKI-----GD---AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 206
IRK+ GD + TIAG S AGF DG A F+ ++++ ++ V
Sbjct: 348 YNCIRKLTPDANGDYSKGTLETIAG--SGKAGFSDGKGLKATFNQPYEIIITEDCKTMYV 405
Query: 207 IDRGNAALRQISL 219
N +R+I++
Sbjct: 406 AGAVNYLIRRITV 418
>gi|21218701|ref|NP_624480.1| hypothetical protein SCO0142 [Streptomyces coelicolor A3(2)]
gi|6066645|emb|CAB58305.1| conserved hypothetical protein SCJ33.06c [Streptomyces coelicolor
A3(2)]
Length = 601
Score = 45.8 bits (107), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 51/107 (47%), Gaps = 3/107 (2%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTI 169
G + + G G VDG EA F P G++ D L+VAD+ A+R + D V T
Sbjct: 319 GSVTVTAGTGNEGLVDGPGPEAWFAQPSGLSATDD-RLWVADSETSALRWVELDGTVRTA 377
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
G G RDGP+ A + V + P S++V D N ALR+
Sbjct: 378 VGTGLFDFGHRDGPAARALLQHPLGVAAL-PDGSVVVADTYNHALRR 423
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 45/82 (54%), Gaps = 8/82 (9%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK----IGDAGVTTIAG-GKS 174
G G DG + A F+ P+G+ + G++ VADT+N A+R+ G+ VTT+AG G+
Sbjct: 215 GRRGFTDGPADRAEFSEPQGLALLGDGSVVVADTVNHALRRYEPVTGE--VTTLAGTGRQ 272
Query: 175 NVAGF-RDGPSEDAKFSNDFDV 195
+ G G + S+ +DV
Sbjct: 273 HRPGEPSRGAARAVSLSSPWDV 294
>gi|423214921|ref|ZP_17201449.1| hypothetical protein HMPREF1074_02981 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692184|gb|EIY85422.1| hypothetical protein HMPREF1074_02981 [Bacteroides xylanisolvens
CL03T12C04]
Length = 516
Score = 45.8 bits (107), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 44/78 (56%), Gaps = 4/78 (5%)
Query: 89 FAVDEVNSNIVKITPPLSQYSR--GRLVAGSFQGYTGHVDGKPNE-ARFNHPKGVTMDDK 145
F + + +++ ++I PL Q + GR A G+ +G+ E ARFN PKG+ D++
Sbjct: 427 FYLTDRHNHAIRILSPLGQVTTYAGRGSASLNSNPWGYANGRLREDARFNRPKGIAWDER 486
Query: 146 GN-LYVADTLNLAIRKIG 162
N +YV D N IRKIG
Sbjct: 487 DNTIYVGDANNYRIRKIG 504
>gi|289774196|ref|ZP_06533574.1| NHL repeat containing protein [Streptomyces lividans TK24]
gi|289704395|gb|EFD71824.1| NHL repeat containing protein [Streptomyces lividans TK24]
Length = 605
Score = 45.8 bits (107), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 52/109 (47%), Gaps = 3/109 (2%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTI 169
G + + G G VDG EA F P G++ D L+VAD+ A+R + D V T
Sbjct: 323 GSVTVTAGTGNEGLVDGPGPEAWFAQPSGLSATDD-RLWVADSETSALRWVELDGTVRTA 381
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G G RDGP+ A + V + P S++V D N ALR+ +
Sbjct: 382 VGTGLFDFGHRDGPAARALLQHPLGVAAL-PDGSVVVADTYNHALRRYA 429
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 45/82 (54%), Gaps = 8/82 (9%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK----IGDAGVTTIAG-GKS 174
G G DG + A F+ P+G+ + G++ VADT+N A+R+ G+ VTT+AG G+
Sbjct: 219 GRRGFTDGPADRAEFSEPQGLALLGDGSVVVADTVNHALRRYEPVTGE--VTTLAGTGRQ 276
Query: 175 NVAGF-RDGPSEDAKFSNDFDV 195
+ G G + S+ +DV
Sbjct: 277 HRPGEPSRGAARAVSLSSPWDV 298
>gi|423214677|ref|ZP_17201205.1| hypothetical protein HMPREF1074_02737 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692583|gb|EIY85820.1| hypothetical protein HMPREF1074_02737 [Bacteroides xylanisolvens
CL03T12C04]
Length = 419
Score = 45.8 bits (107), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 47/88 (53%), Gaps = 6/88 (6%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLS-QYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
+ + DG L+ D + I K+TP + YS+G L +AGS G G DGK +A F
Sbjct: 331 FGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETIAGS--GKAGFSDGKGLKATF 388
Query: 135 NHPKGVTM-DDKGNLYVADTLNLAIRKI 161
N P + + +D +YVA +N IR+I
Sbjct: 389 NQPYEIIITEDCKTMYVAGAVNYLIRRI 416
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 17/133 (12%)
Query: 98 IVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE---ARFNHPKGVTMDDKGNLYVADTL 154
+++I+P +Q + +AG + DG+P A+ GV G LY++D
Sbjct: 292 LIQISPDGTQ----KTIAGDGVKGQEYYDGEPGNPLSAKVGATFGVAAGSDGCLYLSDNT 347
Query: 155 NLAIRKI-----GD---AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 206
IRK+ GD + TIAG S AGF DG A F+ ++++ ++ V
Sbjct: 348 YNCIRKLTPDANGDYSKGTLETIAG--SGKAGFSDGKGLKATFNQPYEIIITEDCKTMYV 405
Query: 207 IDRGNAALRQISL 219
N +R+I++
Sbjct: 406 AGAVNYLIRRITV 418
>gi|262405589|ref|ZP_06082139.1| NHL repeat containing protein [Bacteroides sp. 2_1_22]
gi|294644531|ref|ZP_06722285.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
gi|294805793|ref|ZP_06764670.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
gi|345510784|ref|ZP_08790345.1| hypothetical protein BSAG_00303 [Bacteroides sp. D1]
gi|262356464|gb|EEZ05554.1| NHL repeat containing protein [Bacteroides sp. 2_1_22]
gi|292640084|gb|EFF58348.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
gi|294447014|gb|EFG15604.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
gi|345454355|gb|EEO48593.2| hypothetical protein BSAG_00303 [Bacteroides sp. D1]
Length = 419
Score = 45.8 bits (107), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 47/88 (53%), Gaps = 6/88 (6%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLS-QYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
+ + DG L+ D + I K+TP + YS+G L +AGS G G DGK +A F
Sbjct: 331 FGVAAGSDGCLYLSDNTYNCIRKLTPDANGDYSKGTLETIAGS--GKAGFSDGKGLKATF 388
Query: 135 NHPKGVTM-DDKGNLYVADTLNLAIRKI 161
N P + + +D +YVA +N IR+I
Sbjct: 389 NQPYEIIITEDCKTMYVAGAVNYLIRRI 416
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 17/133 (12%)
Query: 98 IVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE---ARFNHPKGVTMDDKGNLYVADTL 154
+++I+P +Q + +AG + DG+P A+ GV G LY++D
Sbjct: 292 LIQISPDGTQ----KTIAGDGVKGQEYYDGEPGNPLSAKVGATFGVAAGSDGCLYLSDNT 347
Query: 155 NLAIRKI-----GD---AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 206
IRK+ GD + TIAG S AGF DG A F+ ++++ ++ V
Sbjct: 348 YNCIRKLTPDANGDYSKGTLETIAG--SGKAGFSDGKGLKATFNQPYEIIITEDCKTMYV 405
Query: 207 IDRGNAALRQISL 219
N +R+I++
Sbjct: 406 AGAVNYLIRRITV 418
>gi|433458786|ref|ZP_20416675.1| NHL repeat containing protein [Arthrobacter crystallopoietes
BAB-32]
gi|432192788|gb|ELK49609.1| NHL repeat containing protein [Arthrobacter crystallopoietes
BAB-32]
Length = 680
Score = 45.8 bits (107), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 69/159 (43%), Gaps = 15/159 (9%)
Query: 71 NEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYS----RGRLVAGSFQGYTGHVD 126
N +G P + +S ++ + +V + +S G L + G G D
Sbjct: 337 NHLGNEPLNVSLSSPWDVLWSSALGKVVVAMAGVHQIFSFDPLTGELAVLAGTGLEGLQD 396
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV---------TTIAGGKSNVA 177
G+ N A F G+ D G++++AD+ A+R + + V T I G +
Sbjct: 397 GEANAAWFAQSSGLAQDANGSIWIADSETSALRVLTFSDVNGSERVQVQTAIGTGLFDF- 455
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
GFRDG + +A+ + V V P S+ V D N A+R+
Sbjct: 456 GFRDGGAAEARLQHPLGVT-VLPDGSVAVADTYNGAVRR 493
>gi|302036371|ref|YP_003796693.1| hypothetical protein NIDE1006 [Candidatus Nitrospira defluvii]
gi|300604435|emb|CBK40767.1| conserved protein of unknown function, contains NHL repeats
[Candidatus Nitrospira defluvii]
Length = 404
Score = 45.8 bits (107), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 51/106 (48%), Gaps = 23/106 (21%)
Query: 126 DGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGP 183
DG P AR N P V +D GNLY+ADT+N +RK+ G G+ T NVAG
Sbjct: 137 DGGPARRARLNFPSAVAVDRAGNLYIADTMNHRVRKVDGATGIIT------NVAG----- 185
Query: 184 SEDAKFSND----FDVVYVRPTC------SLLVIDRGNAALRQISL 219
+ A++S D PT +L + D+ N +R++ L
Sbjct: 186 TGQARYSGDGGPAVSAAINEPTGLAVSDEALYIADQSNNRVRRVDL 231
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 28/40 (70%), Gaps = 2/40 (5%)
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG 171
N PKG+ +D KGNLY+AD+ N +R++ A +TT+AG
Sbjct: 34 LNEPKGLCVDRKGNLYIADSENHVVRRVDRATGIITTVAG 73
>gi|290985545|ref|XP_002675486.1| predicted protein [Naegleria gruberi]
gi|284089082|gb|EFC42742.1| predicted protein [Naegleria gruberi]
Length = 819
Score = 45.8 bits (107), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 73/148 (49%), Gaps = 12/148 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARFN 135
P + VS + E++ VD N I KI L + +G++G DG P A+ N
Sbjct: 590 PCGVFVSNNDEVYIVDYNNHRIRKI---LRNGIINTIAGTGEEGFSG--DGGPAINAQVN 644
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFR--DGPSEDAKFSND 192
HP GV + +Y+ ++ N IRKI +A +TTIAG + V G+ G + +A+ S
Sbjct: 645 HPCGVFVSSTNEVYIMNSGNYRIRKILRNANITTIAG--TGVKGYSGDGGLAINAQISY- 701
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLN 220
D ++V + + D N +R+I N
Sbjct: 702 VDNIFVSRNDEVYIADTENHRIRKILRN 729
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 10/147 (6%)
Query: 79 KIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFNHP 137
I VS++ E++ D N I KI L + + +G++G DG P A+ + P
Sbjct: 536 NIFVSQNDEIYFSDFGNHRIRKI---LRNGTIVTIAGTGEKGFSG--DGGPATSAQLDSP 590
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDV 195
GV + + +Y+ D N IRKI G + TIAG G+ +G GP+ +A+ ++
Sbjct: 591 CGVFVSNNDEVYIVDYNNHRIRKILRNGIINTIAGTGEEGFSG-DGGPAINAQVNHPCG- 648
Query: 196 VYVRPTCSLLVIDRGNAALRQISLNQD 222
V+V T + +++ GN +R+I N +
Sbjct: 649 VFVSSTNEVYIMNSGNYRIRKILRNAN 675
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 68/144 (47%), Gaps = 8/144 (5%)
Query: 79 KIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARFNHP 137
I VS++ E++ D N I KI L + + QG++G DG P A+ N P
Sbjct: 153 NIFVSQNDEVYFSDFRNHRIRKI---LRNGTIVTIAGTGEQGFSG--DGGPAINAKLNTP 207
Query: 138 KGVTMDDKGNLYVADTLNLAIRK-IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
GV + + +Y+ D + IRK + D + TIAG G GP+ A+ S+ V
Sbjct: 208 CGVFVSNNDEVYIVDYKSHRIRKMLQDGTIITIAGTGEQGFGGDGGPATSAQLSHPCG-V 266
Query: 197 YVRPTCSLLVIDRGNAALRQISLN 220
+V T + + D N +R+I N
Sbjct: 267 FVSSTNEVYITDSYNYRIRKILRN 290
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 47/92 (51%), Gaps = 4/92 (4%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAG-GKSNVAGFRDGPSEDAK 188
+A N P GV + +Y+AD N +RKI D + TIAG G+ +G D P+ A+
Sbjct: 416 KAMLNCPGGVFVAPNDEVYMADCQNHRVRKILKDGTIVTIAGTGEEGFSGDGD-PATSAQ 474
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
S+ V+V T + D GN +R+I N
Sbjct: 475 LSHPCS-VFVSSTNEVFFADSGNYRIRKILRN 505
>gi|238062013|ref|ZP_04606722.1| LOW QUALITY PROTEIN: alkyl hydroperoxide reductase/thiol specific
antioxidant/Mal allergen [Micromonospora sp. ATCC 39149]
gi|237883824|gb|EEP72652.1| LOW QUALITY PROTEIN: alkyl hydroperoxide reductase/thiol specific
antioxidant/Mal allergen [Micromonospora sp. ATCC 39149]
Length = 582
Score = 45.8 bits (107), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 47/100 (47%), Gaps = 10/100 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT--GHVDGKPNEARF 134
P + VS DG V + ++ V+ G L QG GHVDG + A
Sbjct: 322 PSGLAVSADGARLWVADSETSAVRYV------ENGVLGTAVGQGLFDFGHVDGPADRALL 375
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGG 172
HP GV G++ +ADT N A+R+ A V+T+AGG
Sbjct: 376 QHPLGVCALPDGSVLIADTYNGAVRRFDPATGLVSTVAGG 415
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 62/139 (44%), Gaps = 6/139 (4%)
Query: 112 RLVAGSFQGYTGHV--DGKPNEARFNHPKGVTMD-DKGNLYVADTLNLAIRKIGDAGVTT 168
R AG + G T DG EA P G+ + D L+VAD+ A+R + + + T
Sbjct: 295 RRTAGMYAGTTVEALRDGPLAEAWLAQPSGLAVSADGARLWVADSETSAVRYVENGVLGT 354
Query: 169 IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQY 228
G G DGP++ A + V + P S+L+ D N A+R+
Sbjct: 355 AVGQGLFDFGHVDGPADRALLQHPLGVCAL-PDGSVLIADTYNGAVRRFDPATGLVSTVA 413
Query: 229 NSIS-PTDILMVV-GAVLV 245
++ P+D+++ G VLV
Sbjct: 414 GGLAEPSDLVLTADGGVLV 432
>gi|111219919|ref|YP_710713.1| serine/threonine protein kinase [Frankia alni ACN14a]
gi|111147451|emb|CAJ59101.1| Putative serine/threonine protein kinase [Frankia alni ACN14a]
Length = 779
Score = 45.8 bits (107), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 12/87 (13%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKP-NEARFNHPKGVT 141
G+L+ D + I +I SRG + VAG+ +G++G DG P A N+P GV
Sbjct: 635 GDLYIADSEANRIRRID------SRGIVTTVAGTGAKGFSG--DGGPATRAMLNNPTGVA 686
Query: 142 MDDKGNLYVADTLNLAIRKIGDAGVTT 168
+D GNLY+AD+ N IR++ G+ T
Sbjct: 687 VDRAGNLYIADSANHRIRRVTPDGIIT 713
>gi|423072880|ref|ZP_17061629.1| repeat protein [Desulfitobacterium hafniense DP7]
gi|361856375|gb|EHL08282.1| repeat protein [Desulfitobacterium hafniense DP7]
Length = 2017
Score = 45.8 bits (107), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 61/137 (44%), Gaps = 22/137 (16%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I V +G ++ VD + P L +++ + G+F G G P FN P G
Sbjct: 830 IAVDNNGHIYVVDSND-------PRLQKFTSDGVYVGAFGG--SDSTGGP----FNLPLG 876
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 199
VT+D GN+Y+ADTLN I+K G G + V + G +D V
Sbjct: 877 VTVDQDGNIYIADTLNHRIQKYSAEGEFLTKWGSNGVGNVQFGAPQD---------VATD 927
Query: 200 PTCSLLVIDRGNAALRQ 216
+ ++ V+D N +++
Sbjct: 928 SSGNVYVVDTSNKRIQK 944
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 37/89 (41%), Gaps = 13/89 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I V DG ++ D N I K AG F G +FN
Sbjct: 968 PYGIAVDSDGNIYVADSNNHRIQKFN-----------AAGVFITTWGSY--GTGLGQFNS 1014
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
PKG+ +D GN+YVAD N ++K G
Sbjct: 1015 PKGIAVDSNGNVYVADIENDRVQKFDSMG 1043
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 13/99 (13%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
VP I + D ++ D N+ I K + + G L+ G TG DG +F
Sbjct: 731 VPGHIAIDSDNNVYVADTGNNRIQKFSS-----TGGYLME---FGSTGSGDG-----QFR 777
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS 174
+PK V +D+ GN+YVADT N I+K G G S
Sbjct: 778 NPKSVAVDNDGNIYVADTTNKRIQKFDSDGTLITKWGDS 816
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 39/90 (43%), Gaps = 13/90 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G ++ VD N I K +GSF G +E FN
Sbjct: 921 PQDVATDSSGNVYVVDTSNKRIQKFDS-----------SGSFLTKWGS--NGSDEGEFNR 967
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
P G+ +D GN+YVAD+ N I+K AGV
Sbjct: 968 PYGIAVDSDGNIYVADSNNHRIQKFNAAGV 997
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 43/100 (43%), Gaps = 19/100 (19%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP---LSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
P I V G ++ D +N+ I K L ++ RG G DG+
Sbjct: 274 PRSIAVDSAGNIYVSDSMNNRIQKFASSGTFLKKWGRG-----------GSADGE----- 317
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK 173
F P GV +D +G +YV+DT N I K AGV + G
Sbjct: 318 FYQPYGVAVDHEGTIYVSDTGNNRIEKFDAAGVFVTSWGN 357
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 40/89 (44%), Gaps = 14/89 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + V +G ++ D N+ I K AG F G+ +E + N
Sbjct: 321 PYGVAVDHEGTIYVSDTGNNRIEKFD-----------AAGVFVTSWGN---GSDEDQLNM 366
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
P G+ +D GN+YVAD N I+K AG
Sbjct: 367 PMGIVVDSAGNVYVADMGNNRIQKYDSAG 395
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 73/162 (45%), Gaps = 22/162 (13%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
VP V +G ++ VD +I K++ S G ++A G G+ +G + N
Sbjct: 684 VPAAAAVDSNGNIYIVDSNRHHIKKLS------STGEILA--TWGSYGNAEG-----QLN 730
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAK---FSND 192
P + +D N+YVADT N I+K G + G + G DG + K ND
Sbjct: 731 VPGHIAIDSDNNVYVADTGNNRIQKFSSTGGYLMEFGST---GSGDGQFRNPKSVAVDND 787
Query: 193 FDVVYVRPTCSLLV--IDRGNAALRQISLNQDDCEYQYNSIS 232
+ +YV T + + D + + +QD+ +YQ++ IS
Sbjct: 788 GN-IYVADTTNKRIQKFDSDGTLITKWGDSQDNGDYQFSLIS 828
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 37/85 (43%), Gaps = 13/85 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLS-QYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + V DG ++ V+ +N + K QY G G+ G +FN
Sbjct: 1062 PSGVAVDGDGNIYVVEALNHRMQKFDSTFQPQYIWGGTSYGNGNG------------QFN 1109
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRK 160
P GV +D GN+YV D N ++K
Sbjct: 1110 SPSGVAVDSSGNIYVLDNNNNRVQK 1134
Score = 38.5 bits (88), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 45/93 (48%), Gaps = 15/93 (16%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYS--RGRLVAGSFQGYTGHVDGKPNEA 132
+ P + V G ++ +D NS + K S+++ L GSF G N+
Sbjct: 465 IAPGGVAVDSSGNIYVLDTQNSRVQK----FSEFNIESFDLEWGSF--------GGAND- 511
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
+F +P G+ +D GN+YV+D+ N I+K G
Sbjct: 512 QFRYPNGIAVDSAGNVYVSDSGNYRIQKFSSTG 544
>gi|299147160|ref|ZP_07040227.1| conserved hypothetical protein, with a conserved domain protein
[Bacteroides sp. 3_1_23]
gi|298515045|gb|EFI38927.1| conserved hypothetical protein, with a conserved domain protein
[Bacteroides sp. 3_1_23]
Length = 511
Score = 45.8 bits (107), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 44/78 (56%), Gaps = 4/78 (5%)
Query: 89 FAVDEVNSNIVKITPPLSQYSR--GRLVAGSFQGYTGHVDGKPNE-ARFNHPKGVTMDDK 145
F + + +++ ++I PL Q + GR A G+ +G+ E ARFN PKG+ D++
Sbjct: 427 FYLTDRHNHAIRILSPLGQVTTYAGRGSASLNSNPWGYANGRLREDARFNRPKGIAWDER 486
Query: 146 GN-LYVADTLNLAIRKIG 162
N +YV D N IRKIG
Sbjct: 487 DNTIYVGDANNYRIRKIG 504
>gi|318059218|ref|ZP_07977941.1| hypothetical protein SSA3_14816 [Streptomyces sp. SA3_actG]
gi|318079124|ref|ZP_07986456.1| hypothetical protein SSA3_21010 [Streptomyces sp. SA3_actF]
Length = 610
Score = 45.8 bits (107), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 46/82 (56%), Gaps = 4/82 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNV 176
G G DG ARFN P+G+ + G + VADT+N A+R++ A V+T+AG G+
Sbjct: 221 GERGLRDGDAAGARFNEPQGLALLPDGAVVVADTVNHALRRVDLASGEVSTLAGTGRQWW 280
Query: 177 AGF-RDGPSEDAKFSNDFDVVY 197
G GP+ + S+ +DV +
Sbjct: 281 QGEPVAGPAREVSLSSPWDVAW 302
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 5/96 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS DGE + + ++ V+ + + R G+ GH DG +A H
Sbjct: 351 PSGLAVSLDGERLWIADAETSAVRW---IDREGTVRTAVGTGLFDFGHRDGAAGQALLQH 407
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
P GVT G++ VADT N A+R+ A VTT+A
Sbjct: 408 PLGVTALPDGSVAVADTYNHALRRYDPASEEVTTLA 443
>gi|196009344|ref|XP_002114537.1| hypothetical protein TRIADDRAFT_11855 [Trichoplax adhaerens]
gi|190582599|gb|EDV22671.1| hypothetical protein TRIADDRAFT_11855 [Trichoplax adhaerens]
Length = 507
Score = 45.8 bits (107), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 53/99 (53%), Gaps = 7/99 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT-GHVDGKPNEARFN 135
P K+ V DG+ + + + + I + + R G+ + + G VDG +ARFN
Sbjct: 171 PGKVCVVNDGDDIVISDSGHHRILI---VGKDGITRSCVGNKEYFEEGFVDGVFQKARFN 227
Query: 136 HPKGVTMDDKG-NLYVADTLNLAIRKIG--DAGVTTIAG 171
+P+G+T G ++VADT N AIRKI VTTIAG
Sbjct: 228 NPQGITCSRNGKTIFVADTNNHAIRKIDLEYCEVTTIAG 266
>gi|430749689|ref|YP_007212597.1| copper amine oxidase family protein [Thermobacillus composti KWC4]
gi|430733654|gb|AGA57599.1| copper amine oxidase family protein [Thermobacillus composti KWC4]
Length = 187
Score = 45.8 bits (107), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 11/104 (10%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK 173
+AG+ G G DG ++ F +P V + G +YVADTLN IRKI + VTT+
Sbjct: 4 IAGT--GILGLKDGSASQENFYYPMDVAVSPSGVIYVADTLNYVIRKIENGRVTTLNAQS 61
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
VA G E A D + L+ D GN +R +
Sbjct: 62 GRVAEVFPGVVEPAGDYRD---------GAFLIADGGNNRIRIV 96
>gi|290994368|ref|XP_002679804.1| predicted protein [Naegleria gruberi]
gi|284093422|gb|EFC47060.1| predicted protein [Naegleria gruberi]
Length = 1359
Score = 45.8 bits (107), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 65/138 (47%), Gaps = 10/138 (7%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGK-PNEARFNHPKGV 140
+E ++ D N I KIT S VAGS + GY G DGK P E +FN+P GV
Sbjct: 551 AEYSAIYVADSNNHVIRKITLSTGIVST---VAGSGEPGYNG--DGKLPLETQFNNPMGV 605
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 200
G L VADT N IR I D V T+AG + + +GP+ ++ V Y
Sbjct: 606 ISAQMG-LIVADTNNHRIRVITDVTVKTLAGTGIDGSSGDEGPAIRSQLFYPHSVSYGYS 664
Query: 201 TCSLLVIDRGNAALRQIS 218
+ V D N +R IS
Sbjct: 665 --DIYVADTFNNKIRAIS 680
Score = 45.4 bits (106), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 56/109 (51%), Gaps = 8/109 (7%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIA 170
L+AG G GH+DG + FNHP G+ + +YVAD+ N IRKI V+T+A
Sbjct: 523 LIAGQ-PGIKGHIDGIALNSTFNHPSGIHAEYSA-IYVADSNNHVIRKITLSTGIVSTVA 580
Query: 171 G-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G G+ G P E +F+N V+ + L+V D N +R I+
Sbjct: 581 GSGEPGYNGDGKLPLE-TQFNNPMGVISAQM--GLIVADTNNHRIRVIT 626
>gi|290971645|ref|XP_002668597.1| predicted protein [Naegleria gruberi]
gi|284082068|gb|EFC35853.1| predicted protein [Naegleria gruberi]
Length = 223
Score = 45.8 bits (107), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 62/135 (45%), Gaps = 6/135 (4%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFNHPKGVTMDDK 145
E++ D N I K+ S+ +AG+ +G++G DG +A+ N P ++
Sbjct: 72 EVYITDSANHRIRKV----SKSGIIETIAGNGNEGFSGD-DGLATQAQLNCPMSTFVNSN 126
Query: 146 GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 205
G +Y+ D+ N IRK+ G+ G N DG S N V+V +
Sbjct: 127 GEIYITDSNNFRIRKVQRNGIIKTIAGNGNDGFEGDGSSATNAQLNYPMHVFVSSNDEMY 186
Query: 206 VIDRGNAALRQISLN 220
+ D GN A+R++S N
Sbjct: 187 ISDSGNNAIRKVSRN 201
>gi|29830929|ref|NP_825563.1| hypothetical protein SAV_4386 [Streptomyces avermitilis MA-4680]
gi|29608042|dbj|BAC72098.1| hypothetical protein SAV_4386 [Streptomyces avermitilis MA-4680]
Length = 341
Score = 45.8 bits (107), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 47/80 (58%), Gaps = 4/80 (5%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNV 176
G G DG + A F+ P+G+T+ D+ + VADT+N A+R++ A VTT+AG G+
Sbjct: 219 GIRGFADGPADAASFSEPQGMTLLDEDVVVVADTVNHALRRLDLATGEVTTLAGTGRQWW 278
Query: 177 AGF-RDGPSEDAKFSNDFDV 195
G GP+ + S+ +DV
Sbjct: 279 QGSPASGPAREVDLSSPWDV 298
>gi|293373845|ref|ZP_06620188.1| NHL repeat protein [Bacteroides ovatus SD CMC 3f]
gi|292631189|gb|EFF49824.1| NHL repeat protein [Bacteroides ovatus SD CMC 3f]
Length = 506
Score = 45.8 bits (107), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 44/78 (56%), Gaps = 4/78 (5%)
Query: 89 FAVDEVNSNIVKITPPLSQYSR--GRLVAGSFQGYTGHVDGKPNE-ARFNHPKGVTMDDK 145
F + + +++ ++I PL Q + GR A G+ +G+ E ARFN PKG+ D++
Sbjct: 422 FYLTDRHNHAIRILSPLGQVTTYAGRGSASLNSNPWGYANGRLREDARFNRPKGIAWDER 481
Query: 146 GN-LYVADTLNLAIRKIG 162
N +YV D N IRKIG
Sbjct: 482 DNTIYVGDANNYRIRKIG 499
>gi|148654423|ref|YP_001274628.1| NHL repeat-containing protein [Roseiflexus sp. RS-1]
gi|148566533|gb|ABQ88678.1| NHL repeat containing protein [Roseiflexus sp. RS-1]
Length = 1146
Score = 45.8 bits (107), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 73/173 (42%), Gaps = 29/173 (16%)
Query: 70 GNEIG--VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG 127
GN G V P I + G + D N ++ P + GSF G DG
Sbjct: 838 GNAPGQLVQPRGIAIDPQGNIIVSDSANHRLIVFDPSGTPIR----TIGSF----GSGDG 889
Query: 128 KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR----D 181
+ F P+GV +D +GN+YVADT N I K+ G + + G+ + R
Sbjct: 890 Q-----FYEPRGVAVDAQGNIYVADTWNARIVKLDPQGTFLASWGVGRDDFGDGRRASPT 944
Query: 182 GPSEDAKFSNDFDV-----VYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYN 229
G ++DA + D+ V V ++ + D GN ++I + D Y+Y
Sbjct: 945 GGTQDANLARPLDLFGPRGVAVDADGNVYIADTGN---KRIVVTDTDGNYRYQ 994
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 38/88 (43%), Gaps = 13/88 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V DG ++ D N IV +T Y GY G G+ FN
Sbjct: 961 PRGVAVDADGNVYIADTGNKRIV-VTDTDGNYRYQ-------WGYDGSASGQ-----FNE 1007
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDA 164
P GV +D+ G ++VADT N ++ A
Sbjct: 1008 PTGVAVDENGTVFVADTWNSRVQAFARA 1035
>gi|159035770|ref|YP_001535023.1| alkyl hydroperoxide reductase [Salinispora arenicola CNS-205]
gi|157914605|gb|ABV96032.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Salinispora arenicola CNS-205]
Length = 612
Score = 45.8 bits (107), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 64/136 (47%), Gaps = 6/136 (4%)
Query: 115 AGSFQGYTGHV--DGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAGVTTIAG 171
AG + G T DG EA P G+++ G+ L+VAD+ AIR + D + T G
Sbjct: 328 AGMYAGSTVEALKDGPLAEAWLAQPSGLSVSADGSRLWVADSETSAIRYVQDGVLNTAVG 387
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSI 231
G DGP+ A + V + P S+L+ D N A+R+ D + +
Sbjct: 388 QGLFEFGHVDGPAAQALLQHPLGVCAL-PDGSVLIADTYNGAVRRYDPESDSVGTVADGL 446
Query: 232 S-PTDILMVV-GAVLV 245
+ P+D+++ G VLV
Sbjct: 447 AEPSDLVLTPDGGVLV 462
Score = 41.6 bits (96), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 42/85 (49%), Gaps = 6/85 (7%)
Query: 77 PYKIRVSEDG-ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + VS DG L+ D S I + + + G+ G F+ GHVDG +A
Sbjct: 352 PSGLSVSADGSRLWVADSETSAIRYVQDGVLNTAVGQ---GLFE--FGHVDGPAAQALLQ 406
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRK 160
HP GV G++ +ADT N A+R+
Sbjct: 407 HPLGVCALPDGSVLIADTYNGAVRR 431
>gi|57529823|ref|NP_001006504.1| NHL repeat-containing protein 2 [Gallus gallus]
gi|82081010|sp|Q5ZI67.1|NHLC2_CHICK RecName: Full=NHL repeat-containing protein 2
gi|53136494|emb|CAG32576.1| hypothetical protein RCJMB04_29n5 [Gallus gallus]
Length = 727
Score = 45.8 bits (107), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 62/126 (49%), Gaps = 9/126 (7%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
+ P K+ V + GE + + + + +T G+++ +G DG+ +EA F
Sbjct: 219 LFPGKVTVDKSGERLVIADTGHHRILVT-----LKNGQILHTIGGPNSGRKDGRFSEAAF 273
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNVAGFRDGPSEDAKFSN 191
N P+GV + + +YVADT N IRKI VTT+AG G V E+ S+
Sbjct: 274 NSPQGVAIKNN-VIYVADTENHLIRKIDLELEIVTTVAGIGIQGVDKEGGAKGEEQPISS 332
Query: 192 DFDVVY 197
+DVV+
Sbjct: 333 PWDVVF 338
>gi|324997987|ref|ZP_08119099.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Pseudonocardia sp. P1]
Length = 614
Score = 45.4 bits (106), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 60/121 (49%), Gaps = 7/121 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ G D + +V++ P L + R + +GYT DG ARF+
Sbjct: 189 PGKVAALPGGTFLVSDTAHHQLVELEPDL--VTERRRIGDGGRGYT---DGPAGSARFSE 243
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P+G+ + D + VADT+N A+R++ D V+T+AG S + D A+ S+ +D
Sbjct: 244 PQGLLVLDPSTVLVADTVNHAVRRVSLDDGTVSTVAGTGSQLRERVDPGGTAAELSSPWD 303
Query: 195 V 195
+
Sbjct: 304 L 304
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 62/141 (43%), Gaps = 8/141 (5%)
Query: 110 RGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAG--- 165
+ R++AG+ G DG EA P G+ G L+VAD+ A+R++ D G
Sbjct: 330 QARVLAGTTN--EGLRDGSFAEAFLAQPSGLATGPDGTLWVADSEISALRRVDVDPGAGP 387
Query: 166 -VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC 224
V+T G G RDGP+ +A + V V P S+ V D N A+R+
Sbjct: 388 AVSTAVGQGLFEFGHRDGPAAEALLQHPLGVA-VLPDGSVAVADTYNGAVRRFDPAAGSV 446
Query: 225 EYQYNSISPTDILMVVGAVLV 245
++ L+V G LV
Sbjct: 447 STLAEGLAEPSDLLVDGETLV 467
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 46/100 (46%), Gaps = 6/100 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNI--VKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
P + DG L+ D S + V + P + G F+ GH DG EA
Sbjct: 355 PSGLATGPDGTLWVADSEISALRRVDVDPGAGPAVSTAVGQGLFE--FGHRDGPAAEALL 412
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGG 172
HP GV + G++ VADT N A+R+ A V+T+A G
Sbjct: 413 QHPLGVAVLPDGSVAVADTYNGAVRRFDPAAGSVSTLAEG 452
>gi|196228192|ref|ZP_03127059.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
gi|196227595|gb|EDY22098.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
Length = 357
Score = 45.4 bits (106), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 42/85 (49%), Gaps = 3/85 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + DG + D N I + +P +R VAG+ + G +DG P + + +
Sbjct: 274 PKYIALDRDGSVLIADAENHVIRRYSPKTGLITR---VAGTGKPGNGGLDGDPLQCQLHR 330
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P GVT+ G LY+ D+ N I KI
Sbjct: 331 PHGVTVGPDGALYITDSYNDRILKI 355
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 58/113 (51%), Gaps = 10/113 (8%)
Query: 113 LVAGS-FQGYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VT 167
++AG+ +G TG DG P A FN + G+L +AD+ N +R+I DA +T
Sbjct: 79 VIAGTGAKGATG--DGGPATAATFNGIHNFVVLRNGDLLLADSFNNLLRRI-DAKTGVIT 135
Query: 168 TIAGG-KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
T+AGG K AG GP++DA FS ++ L D GN +R I L
Sbjct: 136 TVAGGVKKGFAG-DGGPAKDALFSTLIEIALNPAGTKLYCADIGNRRVRCIDL 187
>gi|443625938|ref|ZP_21110373.1| putative NHL repeat protein [Streptomyces viridochromogenes Tue57]
gi|443340614|gb|ELS54821.1| putative NHL repeat protein [Streptomyces viridochromogenes Tue57]
Length = 632
Score = 45.4 bits (106), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 42/83 (50%), Gaps = 4/83 (4%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
QG G DG + ARF P+G+ + G + VADT+N +R+ TT+AG G
Sbjct: 222 QGSRGSTDGYADRARFQEPQGLALLPDGTVAVADTVNHLVRRFDPETGRTTTLAGTGIQW 281
Query: 176 VAG-FRDGPSEDAKFSNDFDVVY 197
+ G GP D S+ +DV +
Sbjct: 282 MQGQATSGPGRDVSLSSPWDVAW 304
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
GH DG+ EA HP GVT G++ V+DT N A+R+ A VTT+A
Sbjct: 394 GHRDGRAEEALLQHPLGVTALPDGSIAVSDTYNHALRRYDPATGEVTTLA 443
>gi|423216520|ref|ZP_17203043.1| hypothetical protein HMPREF1074_04575 [Bacteroides xylanisolvens
CL03T12C04]
gi|392690730|gb|EIY83986.1| hypothetical protein HMPREF1074_04575 [Bacteroides xylanisolvens
CL03T12C04]
Length = 423
Score = 45.4 bits (106), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 53/118 (44%), Gaps = 15/118 (12%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLN------LAIRKIG---- 162
+AGS +G + G P A+F G +D +G +Y D + I++I
Sbjct: 306 CIAGSEKGDSDGTSGNPRSAKFYQLYGFAIDSEGTIYTVDGNDGNVGSGQKIKRITRGKK 365
Query: 163 ---DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
D V T+ G S DG ++A F N +D++ +L V DR N A+R+I
Sbjct: 366 GYEDGTVVTLVG--SAGGAIVDGSVDEAVFGNPYDIILDEANRALYVSDRSNNAIRKI 421
>gi|299142776|ref|ZP_07035905.1| NHL repeat-containing protein [Prevotella oris C735]
gi|298575805|gb|EFI47682.1| NHL repeat-containing protein [Prevotella oris C735]
Length = 448
Score = 45.4 bits (106), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 37/68 (54%), Gaps = 5/68 (7%)
Query: 108 YSRGRLVAGSFQGY--TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--D 163
Y G+ V F G+ G DG A F +P G+ D GN+YV +++ IRKIG D
Sbjct: 333 YKDGQTVE-QFAGFNGVGSQDGPRMSASFKNPTGMAFDVDGNMYVTESMGFTIRKIGHTD 391
Query: 164 AGVTTIAG 171
VTT+AG
Sbjct: 392 GMVTTVAG 399
Score = 41.6 bits (96), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 65/155 (41%), Gaps = 32/155 (20%)
Query: 49 PQADGNVLQFENGYLVETVIEGNEIGVVPYKIRV-----------SEDGELF-------- 89
P A +L GYL + VI+G+ G P KI V S DG
Sbjct: 48 PVAVSEILPDSGGYLTQFVIKGSNFGTDPSKIDVIFNGNRKATVVSSDGTTLYGICPKQE 107
Query: 90 -AVDEVNSNIVKITPP--------LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
+++V + + P ++ R ++AG G G+VDG P +ARFN+ GV
Sbjct: 108 NGLNQVTVRVDSVGDPTVCPNRFKYTKVERVSVLAGK-TGNGGYVDGNPIDARFNYMYGV 166
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGG 172
+ N+ V + N +R I + +T + GG
Sbjct: 167 GVVTGNNVIVMEGRNNRVRMISETDNKVITLLTGG 201
>gi|386773539|ref|ZP_10095917.1| NHL repeat protein [Brachybacterium paraconglomeratum LC44]
Length = 634
Score = 45.4 bits (106), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 38/77 (49%), Gaps = 3/77 (3%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
DG + D +S + + P Q S LV + GHVDG + AR HP GVT
Sbjct: 381 DGRIVIADSESSAVRLLDPQTMQVST--LVGKGLFDF-GHVDGPLDRARLQHPLGVTALP 437
Query: 145 KGNLYVADTLNLAIRKI 161
G + +ADT N AIR +
Sbjct: 438 DGRIAIADTYNGAIRLL 454
>gi|290995346|ref|XP_002680256.1| predicted protein [Naegleria gruberi]
gi|284093876|gb|EFC47512.1| predicted protein [Naegleria gruberi]
Length = 1226
Score = 45.1 bits (105), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 2/81 (2%)
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P G+ G+LY D+ N AIRKI GV T G AGF+DG + A F++ +
Sbjct: 387 QPAGMAFASNGDLYFCDSNNHAIRKIDTQGVITTLTGNG-TAGFQDGDAASALFNSPASI 445
Query: 196 VYVRPTCSLLVIDRGNAALRQ 216
+ P LV D N+ +R+
Sbjct: 446 ALL-PNGDFLVADFTNSRIRK 465
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 64/148 (43%), Gaps = 12/148 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + +G+L+ D N I KI ++G + + G G DG A FN
Sbjct: 388 PAGMAFASNGDLYFCDSNNHAIRKID------TQGVITTLTGNGTAGFQDGDAASALFNS 441
Query: 137 PKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSND 192
P + + G+ VAD N IRK V TIAG + GF +G + A +
Sbjct: 442 PASIALLPNGDFLVADFTNSRIRKYVASTKQVVTIAGNST--LGFYGDNGLASKALLNKP 499
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLN 220
V Y + +L+ D N +R+IS N
Sbjct: 500 TGVFYNSTSGDILIADYFNFRIRKISNN 527
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 60/147 (40%), Gaps = 11/147 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSN-IVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARF 134
PY I G+ V E SN IVK T + L SF G DG + A
Sbjct: 156 PYGIAFDSSGDNLYVAEYGSNKIVKFTNGVRSVIVNTLGTKSFSG-----DGGSSLSATI 210
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGFRDGPSEDAKFSN 191
N P G+ ++ NLY ADT N IR + + V TIAG G + G G D
Sbjct: 211 NLPAGIALNGN-NLYFADTANNRIRVVNLQTSIVNTIAGDGTARFLGAYSGSRSDVSLFR 269
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQIS 218
Y T LLV D N + +I+
Sbjct: 270 PETTFYNSQTDELLVADTLNHRVVRIN 296
>gi|219852010|ref|YP_002466442.1| periplasmic copper-binding protein [Methanosphaerula palustris
E1-9c]
gi|219546269|gb|ACL16719.1| periplasmic copper-binding [Methanosphaerula palustris E1-9c]
Length = 709
Score = 45.1 bits (105), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 40/89 (44%), Gaps = 12/89 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ I V G ++ D N I+K T G L+ T V P + F
Sbjct: 181 PFGIDVDNTGNVYVADTANHRILKFT------DDGSLID------TWGVTPGPWDGHFVA 228
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
P GVT+DD GN++V DT N I+K G
Sbjct: 229 PYGVTVDDAGNVFVTDTYNSRIQKFSSVG 257
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 56/123 (45%), Gaps = 18/123 (14%)
Query: 48 APQADGNVLQF-ENGYLVET--VIEGNEIG--VVPYKIRVSEDGELFAVDEVNSNIVKIT 102
A A+ +L+F ++G L++T V G G V PY + V + G +F D NS I K +
Sbjct: 195 ADTANHRILKFTDDGSLIDTWGVTPGPWDGHFVAPYGVTVDDAGNVFVTDTYNSRIQKFS 254
Query: 103 PPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG 162
S G + G TG E +F P +D N+YVADT N I+K
Sbjct: 255 ------SVGTFITKWGTGGTG-------EGQFGLPSATAIDTSENVYVADTANNRIQKFT 301
Query: 163 DAG 165
G
Sbjct: 302 SQG 304
Score = 38.9 bits (89), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 44/88 (50%), Gaps = 9/88 (10%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
+F+ P G+T+D+ GN++VADT N I+K G A G + E+ +F
Sbjct: 39 SGQFSAPSGITVDELGNIFVADTNNNRIQKFSSTGAFINAWGSKGI--------ENGQFK 90
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ + V T ++ + D GN +++ S
Sbjct: 91 SPRGIA-VDKTGNIFIADTGNYRIQKFS 117
Score = 38.5 bits (88), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 42/89 (47%), Gaps = 13/89 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V E G +F D N+ I K + S G + + G G +G+ F
Sbjct: 45 PSGITVDELGNIFVADTNNNRIQKFS------STGAFI--NAWGSKGIENGQ-----FKS 91
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
P+G+ +D GN+++ADT N I+K G
Sbjct: 92 PRGIAVDKTGNIFIADTGNYRIQKFSSTG 120
>gi|386849777|ref|YP_006267790.1| Teneurin-4 [Actinoplanes sp. SE50/110]
gi|359837281|gb|AEV85722.1| Teneurin-4 [Actinoplanes sp. SE50/110]
Length = 631
Score = 45.1 bits (105), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 48/96 (50%), Gaps = 8/96 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE-ARFN 135
PY I + +G ++ D N+ I+ I G AG+ DG A+ N
Sbjct: 146 PYGID-TANGMVYVADTGNNRILMINGAAVSVIAGTGTAGTSP------DGTAASLAKLN 198
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
P+ V +D G++YVADT N +RKI + ++T+AG
Sbjct: 199 APRDVKVDASGDIYVADTGNHLVRKISNGTISTVAG 234
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 4/85 (4%)
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDF 193
HP GV +D G +Y+AD+ N +R++ +TT+AG + VAG DG + +D
Sbjct: 35 HPGGVAVDPAGTMYIADSGNNVVRQVAAGIITTVAG--TGVAGGTPADGGLATSGNLSDP 92
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
+ V V L + D G+ +R+++
Sbjct: 93 EDVTVDSGGVLYIADTGHHRIRRVA 117
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 66/149 (44%), Gaps = 13/149 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N+ + ++ + G VAG T G +
Sbjct: 36 PGGVAVDPAGTMYIADSGNNVVRQVAAGIITTVAGTGVAGG----TPADGGLATSGNLSD 91
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG-FRDGPSEDAKFSNDFDV 195
P+ VT+D G LY+ADT + IR++ +TT+AG + VAG G + S+ + +
Sbjct: 92 PEDVTVDSGGVLYIADTGHHRIRRVAGGVITTVAG--TGVAGSLMTGVAATTLLSSPYGI 149
Query: 196 ------VYVRPTCSLLVIDRGNAALRQIS 218
VYV T + ++ AA+ I+
Sbjct: 150 DTANGMVYVADTGNNRILMINGAAVSVIA 178
>gi|322788140|gb|EFZ13922.1| hypothetical protein SINV_02735 [Solenopsis invicta]
Length = 717
Score = 45.1 bits (105), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 61/125 (48%), Gaps = 13/125 (10%)
Query: 77 PYKIRV--SEDGELFAVDEVNSNIVKITPPLSQYSR-GRLVAGSFQGYTGHVDGKPNEAR 133
P K+ V SE G + + +N + IT +++ R ++ G QG+ DG AR
Sbjct: 251 PGKLTVLQSEQGTKLVISDSGNNRIVIT---NEHGRVEHVIGGCSQGFK---DGDFKNAR 304
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGFRDGPSEDAKFS 190
FN P+GV + +Y+AD N AIRKI + V+TIAG G G D S
Sbjct: 305 FNSPQGVCALNN-TIYIADNNNHAIRKINLSEKTVSTIAGTGLQGCDRNGGGHGTDQALS 363
Query: 191 NDFDV 195
+ +DV
Sbjct: 364 SPWDV 368
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 53/104 (50%), Gaps = 9/104 (8%)
Query: 123 GHVDGKPNEARFNHPKGVTMD--DKGNLYVADTLNLAIRKIGDA---GVTTIAGGKSNVA 177
G VDGK N + HP GVT + DK +YVADT N I+ + A T GGK +
Sbjct: 488 GDVDGKGNSVKLQHPLGVTWNHSDK-QIYVADTYNHKIKSVNTATGHCKTLFGGGKPDNT 546
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTC--SLLVIDRGNAALRQISL 219
+ PS A S + +++YV T SL +ID N + +S+
Sbjct: 547 FIFNEPSGLA-VSPNGNILYVADTNNHSLKIIDLKNEKISTMSI 589
>gi|290995436|ref|XP_002680301.1| predicted protein [Naegleria gruberi]
gi|284093921|gb|EFC47557.1| predicted protein [Naegleria gruberi]
Length = 699
Score = 45.1 bits (105), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 68/146 (46%), Gaps = 14/146 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P + VS + E++ D N+ I KI G +V + G G +G A+
Sbjct: 13 PNNVFVSSNNEVYIADCFNNRIRKILE------NGTIVTIAGNGTKGSSGDNGLATNAQL 66
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSN 191
N P V + +Y+AD N IRKI + G + TIAG + + GF +G + +A+
Sbjct: 67 NRPYSVFVSSNNEVYIADQGNNRIRKILENGNIITIAG--NGIHGFSGDNGLATNAQLYT 124
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
V+V + + D+GN +R+I
Sbjct: 125 PCS-VFVSSNNEVYIADQGNHRIRKI 149
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
A+ N P V + +Y+AD N IRKI + G + TIAG + + +G + +A+ +
Sbjct: 8 AQLNRPNNVFVSSNNEVYIADCFNNRIRKILENGTIVTIAGNGTKGSSGDNGLATNAQLN 67
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQI 217
+ V+V + + D+GN +R+I
Sbjct: 68 RPYS-VFVSSNNEVYIADQGNNRIRKI 93
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 56/126 (44%), Gaps = 22/126 (17%)
Query: 77 PYKIRVSEDGELFAVDEVNS---------NIVKITPPLSQYSRGR-------LVAGS-FQ 119
Y + VS + E++ D N+ NI+ I + G +AG+
Sbjct: 181 SYSVFVSSNNEVYIADYFNNRIRKILENGNIITIAGNGTHGFNGDNENGNIITIAGNGIH 240
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAG 178
G+ G +G AR NHP V + +Y+AD N IRKI + G + TIAG + AG
Sbjct: 241 GFNGD-NGLATNARLNHPFSVFVSSNNEVYIADYYNNRIRKILENGNIITIAGNGT--AG 297
Query: 179 FR-DGP 183
F D P
Sbjct: 298 FSGDSP 303
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 70/162 (43%), Gaps = 26/162 (16%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS + E++ D+ N I KI L + + G++G +G A+ N
Sbjct: 125 PCSVFVSSNNEVYIADQGNHRIRKI---LENGNIVTIAGNGIHGFSGD-NGLATNAQLNS 180
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSN------------------VA 177
V + +Y+AD N IRKI + G + TIAG ++ +
Sbjct: 181 SYSVFVSSNNEVYIADYFNNRIRKILENGNIITIAGNGTHGFNGDNENGNIITIAGNGIH 240
Query: 178 GFR--DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
GF +G + +A+ ++ F V+V + + D N +R+I
Sbjct: 241 GFNGDNGLATNARLNHPFS-VFVSSNNEVYIADYYNNRIRKI 281
>gi|86742862|ref|YP_483262.1| serine/threonine protein kinase [Frankia sp. CcI3]
gi|86569724|gb|ABD13533.1| serine/threonine protein kinase [Frankia sp. CcI3]
Length = 719
Score = 45.1 bits (105), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 50/98 (51%), Gaps = 9/98 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG-HVDGKPN-EARF 134
PY + + DG ++ + +++I GR V + G +G H DG P A
Sbjct: 423 PYSVVAAPDGAVYVSQRLRHRVLRIE------RDGRTVHVAGSGKSGPHGDGGPAVNAEL 476
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG 171
++P G+ + G+L++AD+ N IR++ D + T+AG
Sbjct: 477 DNPCGLALGPDGSLFIADSFNNRIRRVAPDGRIVTVAG 514
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 62/146 (42%), Gaps = 12/146 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP--NEARF 134
P + + DG LF D N+ I ++ P GR+V + G G G + A
Sbjct: 479 PCGLALGPDGSLFIADSFNNRIRRVAP------DGRIVTVAGSGRHGPPAGPAARHAASL 532
Query: 135 N--HPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
N HP GV +D G +YVA+T + +I D +AG GP++ A+
Sbjct: 533 NLAHPHGVYVDAAGLVYVANTGGHQVIRIDPDLRAAPLAGAGVPGLSGDHGPAQFAQLRR 592
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQI 217
DV P ++ + D N LR +
Sbjct: 593 PHDVT-APPGRNVYLADTDNHLLRAV 617
>gi|344923601|ref|ZP_08777062.1| hypothetical protein COdytL_03000 [Candidatus Odyssella
thessalonicensis L13]
Length = 1165
Score = 45.1 bits (105), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 61/146 (41%), Gaps = 9/146 (6%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P I V DG L D N++ +++ P + + GYT DG A F
Sbjct: 123 MPQGITVMNDGRLLVAD-TNNDRIRLVDPGNGFQTSDYAGTGSSGYT---DGAAAGATFT 178
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKS-NVAGFRDGPSEDAKFSND 192
P G+ G +YVAD IR I + VTT AG S ++ DGP +F +
Sbjct: 179 WPMGMATGPDGTIYVADMAR--IRSINPSTNQVTTFAGTVSTTMSDATDGPQNGPRFQYN 236
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
V + V DRG+ R ++
Sbjct: 237 MSVALNAAGDKVYVGDRGHCYFRIVT 262
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 45/79 (56%), Gaps = 5/79 (6%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAG 171
VAG+ G GH +G A+FN P+G+T+ + G L VADT N IR + G+ T+
Sbjct: 103 VAGT--GTPGHTNGPGGSAQFNMPQGITVMNDGRLLVADTNNDRIRLVDPGNGFQTSDYA 160
Query: 172 GKSNVAGFRDGPSEDAKFS 190
G + +G+ DG + A F+
Sbjct: 161 GTGS-SGYTDGAAAGATFT 178
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 7/60 (11%)
Query: 109 SRGRLVAGSFQG-------YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
S G ++AG +G V+G A F+ P G+ D G+LY+AD+ N IRKI
Sbjct: 311 STGDIIAGGLDADRSARVPNSGLVNGSGATALFDGPSGMVQDADGSLYIADSYNHVIRKI 370
>gi|168069696|ref|XP_001786545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162661045|gb|EDQ48643.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 602
Score = 45.1 bits (105), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 69/143 (48%), Gaps = 25/143 (17%)
Query: 61 GYLVETVIEGNEIGVVPYK----IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG 116
Y +++ G ++GV P+K I V+ D E++ +D N+ IV L ++ + V
Sbjct: 399 AYEATSLLNGAQLGVGPFKNPSDIHVTADKEIYVLDSGNNRIVV----LDEHFKPVKVVD 454
Query: 117 SFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNV 176
SF D K + FN+P+G+ + D+ L++ADT N + + D+ + V
Sbjct: 455 SF-------DYKGKKDTFNNPQGIFVTDRKELFIADTGNKRVVHL-DSNL--------RV 498
Query: 177 AGFRDGPSEDAKFSNDFDVVYVR 199
AG + P D +N FD VR
Sbjct: 499 AGVLEAPKSDLLQAN-FDFQPVR 520
>gi|134103740|ref|YP_001109401.1| NHL repeat-containing protein [Saccharopolyspora erythraea NRRL
2338]
gi|291007734|ref|ZP_06565707.1| NHL repeat-containing protein [Saccharopolyspora erythraea NRRL
2338]
gi|133916363|emb|CAM06476.1| NHL repeat containing protein [Saccharopolyspora erythraea NRRL
2338]
Length = 619
Score = 45.1 bits (105), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 70/148 (47%), Gaps = 15/148 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ + G L D + ++V+ T R R+ GS G G DG +EA F
Sbjct: 181 PAKVLPLDGGTLLVSDSAHHSLVEFTADGETPVR-RI--GS--GERGRADGAADEASFAE 235
Query: 137 PKGVTM--DDKG-----NLYVADTLNLAIR--KIGDAGVTTIAG-GKSNVAGFRDGPSED 186
P G+ + +D ++ VADT+N +R ++ D V T+AG G+ G GP+ +
Sbjct: 236 PAGLALLPEDVAAEVGYDVVVADTVNHLLRGVQLADGEVVTVAGTGEQWRDGSDAGPALE 295
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
++ +DV + P +++ GN L
Sbjct: 296 TPLTSPWDVAWWEPAGGVVIAMAGNHTL 323
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
GH DG ++A F HP GV + G++ V DT N AIR+
Sbjct: 401 GHADGAADQALFQHPLGVAVLPDGSVAVCDTYNGAIRR 438
>gi|295837701|ref|ZP_06824634.1| NHL repeat protein [Streptomyces sp. SPB74]
gi|197695928|gb|EDY42861.1| NHL repeat protein [Streptomyces sp. SPB74]
Length = 610
Score = 45.1 bits (105), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 5/96 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS DGE + + ++ V+ + + R G+ GH DG +A H
Sbjct: 351 PSGLAVSADGERLWIADAETSAVRW---IDRAGTVRTAVGTGLFDFGHRDGAAAQALLQH 407
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
P GVT G++ VADT N A+R+ A VTT+A
Sbjct: 408 PLGVTALPDGSVAVADTYNHALRRYDPASGEVTTLA 443
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 44/96 (45%), Gaps = 3/96 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAGVTTIAGGKSNVA-GFR 180
G VDG EA F P G+ + G L++AD A+R I AG A G G R
Sbjct: 337 GLVDGPAAEAWFAQPSGLAVSADGERLWIADAETSAVRWIDRAGTVRTAVGTGLFDFGHR 396
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
DG + A + V + P S+ V D N ALR+
Sbjct: 397 DGAAAQALLQHPLGVTAL-PDGSVAVADTYNHALRR 431
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 125 VDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNVAGF-R 180
+DG ARFN P+G+ + + VADT+N A+R++ A VTT+AG G+ G
Sbjct: 226 LDGDAATARFNEPQGLALLPDRAVVVADTVNHALRRVDLASGEVTTLAGTGRQWWQGEPV 285
Query: 181 DGPSEDAKFSNDFDVVY 197
G + + S+ +DV +
Sbjct: 286 AGAAREVSLSSPWDVAW 302
>gi|149279484|ref|ZP_01885614.1| hypothetical protein PBAL39_16831 [Pedobacter sp. BAL39]
gi|149229777|gb|EDM35166.1| hypothetical protein PBAL39_16831 [Pedobacter sp. BAL39]
Length = 742
Score = 45.1 bits (105), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 47/101 (46%), Gaps = 1/101 (0%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
G G+ DG A PKG D KGN+Y +D + RK+ T G
Sbjct: 642 NGNFGNADGPIKSAMLVSPKGFVGDGKGNMYFSDGFSQNTRKVDLIQNTISTVSAQTTRG 701
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
F+DG +A+F D + + ++ ++D GN A+R++ L
Sbjct: 702 FKDGGLNEAQFMQR-DAMAMDKDGNIYIVDMGNNAIRKLFL 741
Score = 41.6 bits (96), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 45/91 (49%), Gaps = 6/91 (6%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHP-- 137
I V G L+ ++ + I K+TP L AGS G +G+ +G A FN
Sbjct: 470 IAVDSHGNLYIMELTMNRIKKVTPD----GNVTLFAGSPTGASGNQNGTGTAALFNFSFI 525
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
G+ D + N+Y+ D N ++RKI +GV +
Sbjct: 526 SGIVFDQQDNMYLTDGANSSVRKITPSGVVS 556
>gi|284097436|ref|ZP_06385541.1| NHL repeat containing protein [Candidatus Poribacteria sp. WGA-A3]
gi|283831050|gb|EFC35055.1| NHL repeat containing protein [Candidatus Poribacteria sp. WGA-A3]
Length = 387
Score = 45.1 bits (105), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 58/128 (45%), Gaps = 14/128 (10%)
Query: 34 LTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDE 93
L+ ++++ T +SSK + G + G VE + P + V+EDG ++ D
Sbjct: 98 LSGMVRYVTGTSSKDQRFAG-----DGGPAVEAALN------FPSAVAVAEDGTVYIADT 146
Query: 94 VNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADT 153
N I ++ P S +AG+ Q +G +A N P + +D LY+AD
Sbjct: 147 WNHRIRRVDPGTGAIS---TIAGTGQAKCYGDNGPAVKAALNEPVALALDGPDRLYIADQ 203
Query: 154 LNLAIRKI 161
N +RK+
Sbjct: 204 SNNRVRKL 211
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 64/148 (43%), Gaps = 8/148 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN-EARFN 135
P + + L+ D+ N+ + K+ L+ +V GY G DG P E
Sbjct: 187 PVALALDGPDRLYIADQSNNRVRKLD--LTSGVMTTVVGTGESGYNG--DGAPGPETALA 242
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P G+ +D +GNLY+ADT + IRK V T+AGG F G +E + +
Sbjct: 243 GPSGLAVDQEGNLYIADTFSGRIRKWDRHTGTVETVAGGTGAFQ-FTPGENESSPNLSRP 301
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLNQ 221
+ + P L + D N +R L +
Sbjct: 302 YAIALHPDGRLFITDSDNHLIRVWDLQK 329
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 50/87 (57%), Gaps = 7/87 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG-YTGHVDGK-PNEARF 134
PY I + DG LF D N +++++ Q L+AG+ + ++G DGK P +
Sbjct: 301 PYAIALHPDGRLFITDSDN-HLIRVWD--LQKREMSLLAGNGKAEFSG--DGKDPLHSSL 355
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKI 161
N+P GV +D +G++Y+ADT + IR +
Sbjct: 356 NYPFGVALDTRGHVYIADTFSHRIRAV 382
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 6/98 (6%)
Query: 126 DGKPN-EARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAG-GKSNVAGFRD 181
DG P EA N P V + + G +Y+ADT N IR++ G ++TIAG G++ G +
Sbjct: 118 DGGPAVEAALNFPSAVAVAEDGTVYIADTWNHRIRRVDPGTGAISTIAGTGQAKCYG-DN 176
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
GP+ A + + P L + D+ N +R++ L
Sbjct: 177 GPAVKAALNEPVALALDGPD-RLYIADQSNNRVRKLDL 213
Score = 38.5 bits (88), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 30/48 (62%), Gaps = 4/48 (8%)
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA--GV-TTIAG 171
G A N PK V +D GNLY+AD+ N +RK+ DA G+ TTIAG
Sbjct: 10 GPATAAMLNEPKTVALDAAGNLYIADSENHLVRKV-DARTGIMTTIAG 56
>gi|320107313|ref|YP_004182903.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319925834|gb|ADV82909.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 1821
Score = 45.1 bits (105), Expect = 0.083, Method: Composition-based stats.
Identities = 35/99 (35%), Positives = 50/99 (50%), Gaps = 9/99 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG--SFQGYTGHVDGKPNEA-R 133
P + + G L+ D N I +I P + L AG + + G DG P A R
Sbjct: 295 PGGLAMDAAGFLYIADTGNDTIRRIDPSTGMIT---LFAGIPTVADFAG--DGGPASAGR 349
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
FN+P G+ +D G+LY+AD N IR I G ++T+AG
Sbjct: 350 FNNPLGLAIDSGGSLYIADQANHRIRMITAGGSLSTVAG 388
Score = 39.3 bits (90), Expect = 4.3, Method: Composition-based stats.
Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 2/88 (2%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA-GVTTIAGGKSNVAGFR-DGPSEDAKF 189
A N+P + ++ G+LY+ADT N AIRK+ A G+ + G+ V G DG S
Sbjct: 232 APVNYPTALVLNGAGDLYIADTRNNAIRKLTLATGILSTIAGRLGVPGSSGDGGSATLAT 291
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQI 217
N + + L + D GN +R+I
Sbjct: 292 LNAPGGLAMDAAGFLYIADTGNDTIRRI 319
Score = 38.9 bits (89), Expect = 5.7, Method: Composition-based stats.
Identities = 27/91 (29%), Positives = 43/91 (47%), Gaps = 2/91 (2%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V L+ D N + K++ S + G+ Y+G +G N A+ N
Sbjct: 409 PAAVAVDVARNLYIADSNNHLVRKVSSTTGLISSIVGIPGA-AAYSGD-NGPANVAKING 466
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVT 167
P + +D G+LY AD LN IRK+ ++ T
Sbjct: 467 PYALALDSAGDLYFADLLNNRIRKVSNSYAT 497
>gi|395226792|ref|ZP_10405227.1| Cadherin domain-containing protein, partial [Thiovulum sp. ES]
gi|394444937|gb|EJF05961.1| Cadherin domain-containing protein, partial [Thiovulum sp. ES]
Length = 1326
Score = 45.1 bits (105), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 79/189 (41%), Gaps = 41/189 (21%)
Query: 14 IAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNV----------LQFENGYL 63
+ F++ HA ++LT+L+ T S S P DGN+ L ++ YL
Sbjct: 91 LYFSVYNTIHAIFKMSPDRNLTNLI---TTSGSITPSIDGNLNVATVQMPTSLLIKDDYL 147
Query: 64 -------------------VET-VIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITP 103
VET I N + I++ +DG ++ S+I KI
Sbjct: 148 YFSEKLSNDIRRIDFNSSTVETPYINQNYYFQIIENIKLDKDGNFVLSNKNRSSIEKINK 207
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD 163
++V GS+ G DG + A+ P G D N++VADT N +RKI
Sbjct: 208 DGE---ISKIVGGSY----GFADGDSSIAQLKSPSGFDFDKNWNIFVADTNNGRVRKITP 260
Query: 164 AG-VTTIAG 171
G VTTIAG
Sbjct: 261 NGDVTTIAG 269
>gi|334116916|ref|ZP_08491008.1| NHL repeat containing protein [Microcoleus vaginatus FGP-2]
gi|333461736|gb|EGK90341.1| NHL repeat containing protein [Microcoleus vaginatus FGP-2]
Length = 546
Score = 45.1 bits (105), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 60/135 (44%), Gaps = 6/135 (4%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT-GHVDGKPNEARFNHPKGVTMDDK 145
ELF D S+I + + + + R V GS + + G DG+ +E R H GV +
Sbjct: 396 ELFVADSEISSIRAVG--IDEDPKVRTVCGSGELFGFGDKDGRGDEVRLQHCLGVEYA-Q 452
Query: 146 GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 205
L+VADT N I++I G T AG D A+F + + P L
Sbjct: 453 NYLWVADTYNHKIKRIDHRGGTCRTMIGEGTAGLVDAKGLKARFFEPSGLSAIGP--HLF 510
Query: 206 VIDRGNAALRQISLN 220
V D N +R++ L+
Sbjct: 511 VADTNNHVIRRVELD 525
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 43/92 (46%), Gaps = 12/92 (13%)
Query: 92 DEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV---------DGKPNEARFNHPKGVTM 142
+ V N+V ++ R+V + +G HV DG EA F P+G+
Sbjct: 219 NSVIQNLVGSCLFIADSGHNRIVVSTLEGEVLHVIGSGQPGLTDGDFEEAEFFAPQGMAF 278
Query: 143 DDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
D + LYVADT N A+RKI V T+AG
Sbjct: 279 DAESQILYVADTENHALRKIDFTTQRVETVAG 310
>gi|290997640|ref|XP_002681389.1| hypothetical protein NAEGRDRAFT_78414 [Naegleria gruberi]
gi|284095013|gb|EFC48645.1| hypothetical protein NAEGRDRAFT_78414 [Naegleria gruberi]
Length = 630
Score = 45.1 bits (105), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 75/158 (47%), Gaps = 29/158 (18%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSF--QGYTGHVDGKPNEA 132
P+ I ++++GE+F V+ + + KI + G ++ AG QGY+G G A
Sbjct: 67 PFGICLNDEGEIFIVERASHVVRKI------LTNGTIIVFAGIVNQQGYSGD-GGLAVNA 119
Query: 133 RFNHPKGVTMDDK-GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 190
N P V D K G++Y++D N IRK+ G +TTIAG +S G ++D +
Sbjct: 120 ALNWPYDVACDLKTGDVYISDNENQLIRKVFKNGTITTIAGSRS------GGTTQDGSLA 173
Query: 191 NDFDVVYVRPTCSLLVIDRGN-------AALRQISLNQ 221
V RP + + GN +R+IS NQ
Sbjct: 174 T--QTVINRPM-MMSLSSNGNLYFTSNICTIRKISTNQ 208
>gi|426253136|ref|XP_004020256.1| PREDICTED: NHL repeat-containing protein 2 [Ovis aries]
Length = 726
Score = 45.1 bits (105), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 68/148 (45%), Gaps = 22/148 (14%)
Query: 68 IEGNEIGVVPYKIRVSEDGELF----AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG 123
I N+IG+ YK + LF AVD V++ +V ++ R++ G T
Sbjct: 204 IRANKIGIKLYKDSLPPSPLLFPGKIAVDHVSNRLV-----IADTGHHRILVVWKNGQTQ 258
Query: 124 HVDGKPN---------EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGG 172
H G PN E+ FN P+GV + + +YVADT N IRKI V+T+AG
Sbjct: 259 HSIGGPNPGRKDGIFSESSFNSPQGVAIMNN-IIYVADTENHLIRKIDLEAEMVSTVAGI 317
Query: 173 KSNVAGFRDGPSEDAK-FSNDFDVVYVR 199
G D + S+ +DVV+ R
Sbjct: 318 GIQGTDKEGGAKGDEQPISSPWDVVFGR 345
Score = 42.4 bits (98), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIA--GGKSNVA 177
G VDG +AR HP GVT D K N LYVAD+ N I+ + TT+A G SN+
Sbjct: 463 GDVDGVGIDARLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGNASNII 522
Query: 178 G 178
G
Sbjct: 523 G 523
>gi|384568064|ref|ZP_10015168.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
glauca K62]
gi|384523918|gb|EIF01114.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
glauca K62]
Length = 634
Score = 45.1 bits (105), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 65/149 (43%), Gaps = 17/149 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P K V+ +G + D N +I + + R F G G VDG + ARF
Sbjct: 195 PSKAVVTAEGRVLVADTANHSIAEFASDVETLLR------RFGDGRRGSVDGAFDVARFA 248
Query: 136 HPKGVTM------DDKG-NLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGFRDGPSE 185
P G+T+ D G +L +ADT N +R + V T+AG G+ G GP+
Sbjct: 249 EPSGLTLLPTHVADRVGYHLLIADTANHLLRGVDLRTGAVRTVAGTGRQWRDGDDSGPAL 308
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
D ++ +DV + ++V GN L
Sbjct: 309 DVDLTSPWDVRWWDVAGGVVVAMAGNHTL 337
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
GH DG + A F HP G+T+ G + VADT N A+R+
Sbjct: 417 GHRDGAADSALFQHPLGITVLPDGAIGVADTYNGAVRR 454
>gi|281423977|ref|ZP_06254890.1| NHL repeat-containing domain protein [Prevotella oris F0302]
gi|281401902|gb|EFB32733.1| NHL repeat-containing domain protein [Prevotella oris F0302]
Length = 469
Score = 45.1 bits (105), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Query: 118 FQGY--TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
F G+ G DG A F +P G+ D GN+YV +++ IRKIG D VTT+AG
Sbjct: 363 FAGFNGVGSQDGPRMSASFKNPTGMAFDVDGNMYVTESMGFTIRKIGHTDGMVTTVAG 420
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 66/155 (42%), Gaps = 32/155 (20%)
Query: 49 PQADGNVLQFENGYLVETVIEGNEIGVVPYKIRV-----------SEDGELF-------- 89
P A +L GYL + VI+G+ G P KI V S DG
Sbjct: 69 PMAVSEILPDSGGYLTQFVIKGSNFGTDPSKIDVIFNGNRKATVVSSDGTTLYGICPKQE 128
Query: 90 -AVDEVNSNIVKITPPL--------SQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
+++V + + P+ ++ R ++AG G G+VDG P +ARFN+ GV
Sbjct: 129 NGLNQVTVRVDSVGDPIVCPNRFKYTKVERVSVLAGK-TGNGGYVDGNPIDARFNYMYGV 187
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGG 172
+ N+ V + N +R I + +T + GG
Sbjct: 188 GVVTGNNVIVMEGRNNRVRMISETNNKVITLLTGG 222
>gi|440683842|ref|YP_007158637.1| NHL repeat containing protein [Anabaena cylindrica PCC 7122]
gi|428680961|gb|AFZ59727.1| NHL repeat containing protein [Anabaena cylindrica PCC 7122]
Length = 506
Score = 45.1 bits (105), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 57/103 (55%), Gaps = 11/103 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ + G LF D + IV ++++ G+++ G +G DG N+ +F+
Sbjct: 175 PGKVLATSSG-LFIADSGHHRIV-----MTRFD-GKILYVIGTGKSGLTDGAFNQVQFSA 227
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
P+G+T DD+ LY+ADT N A+R+I V TIAG G+ N
Sbjct: 228 PQGMTFDDESQILYIADTENYALRQIDLQRQVVETIAGTGQQN 270
>gi|348688202|gb|EGZ28016.1| hypothetical protein PHYSODRAFT_321722 [Phytophthora sojae]
Length = 490
Score = 45.1 bits (105), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 38/87 (43%), Gaps = 1/87 (1%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + GEL D +N+ I + PP S R+ Q GH DG A F+
Sbjct: 230 PSTVLAGPSGELLVADTMNNCIRGLLPPPDGTSPWRVKTVCGQTRPGHADGNCEVALFDQ 289
Query: 137 PKGVTM-DDKGNLYVADTLNLAIRKIG 162
P + +D +VAD N IR++G
Sbjct: 290 PLSLCWGEDSNTFFVADRGNACIRQVG 316
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 69/174 (39%), Gaps = 34/174 (19%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRG---------RLVAGSFQGYTGHV 125
+ P I S DG V + + +K S+ + + A + G G
Sbjct: 84 LAPKGIAASADGRHLFVCDTGHHKIKFAALPSRSALADVDAVTDDVEMFAFAGSGKKGWR 143
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-----GDAGVTTIAGGKSN----- 175
DG EA FN P GV G++ VADT N IR+I G V TIAG ++
Sbjct: 144 DGLALEANFNSPSGVCEYADGSIIVADTGNHCIRQIRRGANGKLVVKTIAGAYASFETKR 203
Query: 176 ------------VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
AG+RDG E + F + V P+ LLV D N +R +
Sbjct: 204 VGVQGAGIKDQRTAGYRDG--ERSLFRSP-STVLAGPSGELLVADTMNNCIRGL 254
Score = 42.0 bits (97), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 44/168 (26%), Positives = 65/168 (38%), Gaps = 33/168 (19%)
Query: 74 GVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY------------ 121
GV Y DG + D N I +I + + +AG++ +
Sbjct: 157 GVCEYA-----DGSIIVADTGNHCIRQIRRGANGKLVVKTIAGAYASFETKRVGVQGAGI 211
Query: 122 -----TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-----GDA--GVTTI 169
G+ DG+ + F P V G L VADT+N IR + G + V T+
Sbjct: 212 KDQRTAGYRDGE--RSLFRSPSTVLAGPSGELLVADTMNNCIRGLLPPPDGTSPWRVKTV 269
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
G G DG E A F + + + + V DRGNA +RQ+
Sbjct: 270 CG--QTRPGHADGNCEVALFDQPLSLCWGEDSNTFFVADRGNACIRQV 315
>gi|440794391|gb|ELR15552.1| haloacid dehalogenase family hydrolase family protein [Acanthamoeba
castellanii str. Neff]
Length = 731
Score = 45.1 bits (105), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 66/151 (43%), Gaps = 12/151 (7%)
Query: 77 PYKIRV-SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P KI + +E G LF D N IV ++ L +G G DG EA+FN
Sbjct: 226 PGKIAIDNEHGRLFISDSNNHRIV-----VTDLQGAFLEEIGTRGSLGFKDGSYAEAKFN 280
Query: 136 HPKGVT---MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPS-EDAKF 189
+GV D+ LYVAD N A+R + VTT+AG + F G S + +
Sbjct: 281 RLQGVAYHKTDEHERLYVADAENHALRVVDLLAKTVTTLAGDGTQGNDFVGGKSGREQQL 340
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
S +DV L V G + ++SL+
Sbjct: 341 STPWDVALSPDGQQLFVAMAGTHQIWEVSLD 371
>gi|428317077|ref|YP_007114959.1| NHL repeat containing protein [Oscillatoria nigro-viridis PCC 7112]
gi|428240757|gb|AFZ06543.1| NHL repeat containing protein [Oscillatoria nigro-viridis PCC 7112]
Length = 546
Score = 45.1 bits (105), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 68/154 (44%), Gaps = 7/154 (4%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT-GHVDGKPNEARFNHPKGVTMDDK 145
ELF D S+I + + + + R V GS + + G DG+ ++ R H GV +
Sbjct: 396 ELFVADSEISSIRAVG--IDEDPKVRTVCGSGELFGFGDKDGRADQVRLQHCLGVEYA-Q 452
Query: 146 GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 205
L+VADT N I++I G T AG D A+F + + P L
Sbjct: 453 NYLWVADTYNHKIKRIDHRGGTCRTMIGDGTAGLVDAKGLKARFFEPSGLSAIGP--HLF 510
Query: 206 VIDRGNAALRQISLNQDDC-EYQYNSISPTDILM 238
V D N +R++ L+ + + S+ P ++ +
Sbjct: 511 VADTNNHVIRRVELDTLEVTTLNFPSLCPPEVCL 544
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 43/92 (46%), Gaps = 12/92 (13%)
Query: 92 DEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV---------DGKPNEARFNHPKGVTM 142
+ V N+V ++ R+V + +G HV DG EA F P+G+
Sbjct: 219 NSVIQNLVGSCLFIADSGHNRIVVSTLEGEVLHVIGSGKPGLTDGDFEEAEFFAPQGMAF 278
Query: 143 DDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
D + LYVADT N A+RKI V T+AG
Sbjct: 279 DAESQILYVADTENHALRKIDFTTQRVETVAG 310
>gi|348689157|gb|EGZ28971.1| hypothetical protein PHYSODRAFT_474561 [Phytophthora sojae]
Length = 98
Score = 45.1 bits (105), Expect = 0.092, Method: Composition-based stats.
Identities = 32/99 (32%), Positives = 47/99 (47%), Gaps = 3/99 (3%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGG 172
++AG +G+ DG N F +P+ + + G+LYVAD N I K GV T+ G
Sbjct: 3 VLAGYRYWVSGYADGYGNAVLFKNPRSLAVGSDGHLYVADDNNRVI-KFTSGGVMTVLAG 61
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGN 211
S +G DG A+F D + + L + DR N
Sbjct: 62 -STSSGLVDGTGTSARFKTP-DSLVIGSDGYLYIADRVN 98
>gi|441185716|ref|ZP_20970504.1| hypothetical protein SRIM_41417, partial [Streptomyces rimosus
subsp. rimosus ATCC 10970]
gi|440613960|gb|ELQ77297.1| hypothetical protein SRIM_41417, partial [Streptomyces rimosus
subsp. rimosus ATCC 10970]
Length = 108
Score = 44.7 bits (104), Expect = 0.096, Method: Composition-based stats.
Identities = 21/50 (42%), Positives = 27/50 (54%)
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
R +P+GVT+D GNL++AD N IRK+ G T G N DG
Sbjct: 43 RVYYPQGVTVDKNGNLFIADRYNHRIRKVTPNGTITTVAGDGNAGYISDG 92
>gi|184201467|ref|YP_001855674.1| hypothetical protein KRH_18210 [Kocuria rhizophila DC2201]
gi|183581697|dbj|BAG30168.1| hypothetical protein [Kocuria rhizophila DC2201]
Length = 246
Score = 44.7 bits (104), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 34/56 (60%), Gaps = 2/56 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNV 176
G+ DG EAR +HP+ + M G++ VADT N A+R + D VTTI GG +V
Sbjct: 151 GYQDGPATEARLDHPEDLQMLYDGSVVVADTGNHALRHVDVTDGEVTTICGGPDHV 206
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 63/140 (45%), Gaps = 20/140 (14%)
Query: 96 SNIVKITPPLSQYSRGRLVAGSFQ-----GYTGHVDGKPNEARFNHPKGVTMDDKGNLYV 150
S +V P +Q R Q G G DG+ +A+F +T G L+
Sbjct: 58 SAVVAAVPDRNQLFRWDAYTDVLQEWAGTGEPGRRDGRVAQAKFAATVSITEGGDGRLWF 117
Query: 151 ADTLNLAIRKI--------GDAGVTTIAGGKSNVAGFRDGPSEDAKFSN--DFDVVYVRP 200
D + ++R + GD V T+ G + AG++DGP+ +A+ + D ++Y
Sbjct: 118 VDRDSSSLRCLTIDTDRPDGDPHVVTVVG--RHGAGYQDGPATEARLDHPEDLQMLY--- 172
Query: 201 TCSLLVIDRGNAALRQISLN 220
S++V D GN ALR + +
Sbjct: 173 DGSVVVADTGNHALRHVDVT 192
>gi|145592666|ref|YP_001156963.1| alkyl hydroperoxide reductase [Salinispora tropica CNB-440]
gi|145302003|gb|ABP52585.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Salinispora tropica CNB-440]
Length = 612
Score = 44.7 bits (104), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 60/123 (48%), Gaps = 4/123 (3%)
Query: 126 DGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPS 184
DG EA P G+++ G+ L+VAD+ A+R + + +TT G G DGP+
Sbjct: 341 DGPLAEAWLAQPSGLSVSADGSRLWVADSETSAVRYVENGILTTAVGQGLFEFGHVDGPA 400
Query: 185 EDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSIS-PTDILMVV-GA 242
A + V + P S+L+ D N A+R+ D + ++ P+D+L+ G
Sbjct: 401 AQALLQHPLGVCAL-PDGSVLIADTYNGAVRRYDPESDSVGTVADGLAEPSDLLLTPDGG 459
Query: 243 VLV 245
VLV
Sbjct: 460 VLV 462
Score = 42.0 bits (97), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 39/86 (45%), Gaps = 8/86 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT--GHVDGKPNEARF 134
P + VS DG V + ++ V+ G L QG GHVDG +A
Sbjct: 352 PSGLSVSADGSRLWVADSETSAVRYV------ENGILTTAVGQGLFEFGHVDGPAAQALL 405
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRK 160
HP GV G++ +ADT N A+R+
Sbjct: 406 QHPLGVCALPDGSVLIADTYNGAVRR 431
>gi|290986946|ref|XP_002676184.1| NHL repeat-containing protein [Naegleria gruberi]
gi|284089785|gb|EFC43440.1| NHL repeat-containing protein [Naegleria gruberi]
Length = 1290
Score = 44.7 bits (104), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 57/110 (51%), Gaps = 6/110 (5%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
L+AGS GY G N A F +G+ + G ++++D N +RKI + T+AG
Sbjct: 312 LIAGSGAGYAGDNQLASN-AMFRVTRGIAISSSGEIFISDGGNNRVRKINTNNIIATVAG 370
Query: 172 GKSNVAGFRDGPSEDAKFSNDF--DVVYVRPTCSLLVIDRGNAALRQISL 219
SN+A +G + +A + F + V V P ++V D N LR+ISL
Sbjct: 371 TGSNLA--YNGDNIEAVKAKLFGPESVAVAPNNEVIVADTRNDRLRKISL 418
>gi|158318814|ref|YP_001511322.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
gi|158114219|gb|ABW16416.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
Length = 732
Score = 44.7 bits (104), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 49/98 (50%), Gaps = 9/98 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPN-EARF 134
P + V G ++ D N I ++ P R+VAGS GY+G DG P A
Sbjct: 548 PRSVTVDPHGVIYLADTGNHRIWRLDP----GDGARVVAGSGTPGYSG--DGGPAVHASL 601
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
P+ V +D +G L +AD + IR++ G +TTIAG
Sbjct: 602 TRPQAVAVDAQGRLLIADQEHRRIRRVDTTGRITTIAG 639
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 60/143 (41%), Gaps = 22/143 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I +S DG L D +N + +++P G ++ P
Sbjct: 506 PAGIAMSSDGSLLIADCLNDRLRRVSP------------------DGRIETMPALPGLRQ 547
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P+ VT+D G +Y+ADT N I ++ GD G +AG + GP+ A +
Sbjct: 548 PRSVTVDPHGVIYLADTGNHRIWRLDPGD-GARVVAGSGTPGYSGDGGPAVHASLTRP-Q 605
Query: 195 VVYVRPTCSLLVIDRGNAALRQI 217
V V LL+ D+ + +R++
Sbjct: 606 AVAVDAQGRLLIADQEHRRIRRV 628
>gi|290988149|ref|XP_002676784.1| predicted protein [Naegleria gruberi]
gi|284090388|gb|EFC44040.1| predicted protein [Naegleria gruberi]
Length = 2167
Score = 44.7 bits (104), Expect = 0.099, Method: Composition-based stats.
Identities = 27/95 (28%), Positives = 44/95 (46%), Gaps = 3/95 (3%)
Query: 124 HVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGP 183
++DG A+FN P GV +Y+AD +N IR I G ++G S ++G
Sbjct: 231 YIDGNTTIAKFNSPTGVAYGPSKEIYIADKMNHVIRMIEFVG---MSGTVSVISGIAKSS 287
Query: 184 SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
D N V V + ++D GN +R+++
Sbjct: 288 GNDTLRLNSPVDVAVNSQKEVFILDSGNNRIRKLT 322
Score = 43.5 bits (101), Expect = 0.25, Method: Composition-based stats.
Identities = 25/86 (29%), Positives = 44/86 (51%), Gaps = 2/86 (2%)
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
FN P+G+ + + G +YV+DT N +++I ++G TI G + + AK +
Sbjct: 40 FNQPEGIALGNSGEIYVSDTKNHLVKRIDSNSGAITIVAGTGAFGSDTNTNATSAKLYSP 99
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
+ V LL+ D N A+R++S
Sbjct: 100 SGLA-VTIDGKLLIADTSNHAIREVS 124
Score = 40.4 bits (93), Expect = 2.2, Method: Composition-based stats.
Identities = 27/95 (28%), Positives = 44/95 (46%), Gaps = 4/95 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + GE++ D N + +I S +VAG+ + + A+
Sbjct: 43 PEGIALGNSGEIYVSDTKNHLVKRID---SNSGAITIVAGT-GAFGSDTNTNATSAKLYS 98
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
P G+ + G L +ADT N AIR++ + + TIAG
Sbjct: 99 PSGLAVTIDGKLLIADTSNHAIREVSNGIIRTIAG 133
>gi|436838118|ref|YP_007323334.1| NHL repeat-containing protein [Fibrella aestuarina BUZ 2]
gi|384069531|emb|CCH02741.1| NHL repeat-containing protein [Fibrella aestuarina BUZ 2]
Length = 375
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 53/105 (50%), Gaps = 7/105 (6%)
Query: 120 GYT-GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGG---K 173
GY+ G+ DG A+F++P + MD G LY+ D LN IRK+ V+T AG
Sbjct: 161 GYSFGYRDGPLARAKFSNPWSIIMDKAGMLYIGDGLNHCIRKVDLSTNTVSTFAGKPLEN 220
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ V + DG A+ D+ + +L + D GN ++R ++
Sbjct: 221 TAVCKYADGKGPAAQVCTVEDLT-LDQDGNLFIPDWGNNSVRMVT 264
>gi|284039570|ref|YP_003389500.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
gi|283818863|gb|ADB40701.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
Length = 1079
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 46/97 (47%), Gaps = 12/97 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ I V E G ++ D N + K S + G VAG TG N+ NH
Sbjct: 78 PFGIYVDEAGSMYVADYNNHRVQKW---ASGATSGTTVAG-----TGTAGSAANQ--LNH 127
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAG 171
P GV +D G +YV+DT N ++K G TT+AG
Sbjct: 128 PLGVYVDGAGAIYVSDTDNNRVQKWASGATSGTTVAG 164
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 58/137 (42%), Gaps = 20/137 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N+ + K S + G VAG TG N+ N+
Sbjct: 128 PLGVYVDGAGAIYVSDTDNNRVQKW---ASGATSGTTVAG-----TGTAGSAANQ--LNY 177
Query: 137 PKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P G+ +D G YVAD N I+K G TT+AGG G + + +S
Sbjct: 178 PIGIYVDGAGATYVADASNSRIQKWAAGATSGTTVAGGNG------QGSAANQLWSAAG- 230
Query: 195 VVYVRPTCSLLVIDRGN 211
VY+ ++ V D GN
Sbjct: 231 -VYIDGAGAIYVADGGN 246
>gi|302834543|ref|XP_002948834.1| hypothetical protein VOLCADRAFT_80356 [Volvox carteri f.
nagariensis]
gi|300266025|gb|EFJ50214.1| hypothetical protein VOLCADRAFT_80356 [Volvox carteri f.
nagariensis]
Length = 774
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 48/96 (50%), Gaps = 6/96 (6%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGF 179
G DG EA HP V G+ +YVAD+ N I+ + + T+AG S AGF
Sbjct: 485 GDKDGFGAEALLQHPLAVLSSADGSAIYVADSYNHRIKALNPNTNEIVTLAG--SGAAGF 542
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALR 215
RDG A+FS + P ++L+ D N+A+R
Sbjct: 543 RDGVGTAAQFSEPAGLCR-GPNGTILIADTNNSAVR 577
Score = 38.9 bits (89), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 50/111 (45%), Gaps = 14/111 (12%)
Query: 54 NVLQF--ENGYLVETVIEGNEIGVVPYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSR 110
N+ +F ++G+ E +++ P + S DG ++ D N I + P +
Sbjct: 480 NLFRFGDKDGFGAEALLQH------PLAVLSSADGSAIYVADSYNHRIKALNP-----NT 528
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
+V + G G DG A+F+ P G+ G + +ADT N A+R +
Sbjct: 529 NEIVTLAGSGAAGFRDGVGTAAQFSEPAGLCRGPNGTILIADTNNSAVRIL 579
>gi|157112324|ref|XP_001657496.1| hypothetical protein AaeL_AAEL000965 [Aedes aegypti]
gi|108883769|gb|EAT47994.1| AAEL000965-PA [Aedes aegypti]
Length = 812
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
G VDG +ARFN P+GV + L+VAD N AIRKI VTT+ G
Sbjct: 354 AGFVDGDFRKARFNAPQGVAFQNDTTLFVADNENHAIRKIDLKSQQVTTVVG 405
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 64/139 (46%), Gaps = 12/139 (8%)
Query: 101 ITPPLSQYSRGRLVAGSFQGYTGHVDGK-PNEARFNHPKGVTMD-DKGNLYVADTLNLAI 158
I +YS G +A + G + + PN A F+ P G+ ++ D +Y+AD+ + ++
Sbjct: 474 IWWKFKKYSAGSCLAIAGNGREENRNNSYPNNAAFSQPSGLAINKDAKEIYLADSESSSV 533
Query: 159 RK--IGDAGVTTIAGGKSN-----VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGN 211
RK + D V +AGG N G DG AKF + V Y + V D N
Sbjct: 534 RKMSLSDGKVLAVAGGDRNPLDLFSFGDIDGKQYSAKFQHPLGVAYNSKDNCIYVADTYN 593
Query: 212 AALRQISLNQD---DCEYQ 227
+++I+ + + C +Q
Sbjct: 594 HKIKKINASTNVATTCSFQ 612
>gi|309792745|ref|ZP_07687190.1| YD repeat protein [Oscillochloris trichoides DG-6]
gi|308225197|gb|EFO78980.1| YD repeat protein [Oscillochloris trichoides DG6]
Length = 667
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 52/97 (53%), Gaps = 7/97 (7%)
Query: 126 DGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG-GKSNVAG-FRD 181
DG P + A N P G+ + G+LY+ADT + +R++G D +TTI G G + + G F
Sbjct: 246 DGGPASSAVLNTPVGLVLGSDGSLYIADTGHHRVRQVGPDGIITTILGTGSAGINGNFSY 305
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
GP+ + D + PT L V D GN L ++S
Sbjct: 306 GPNMMLRSPRDM---AIDPTGGLYVTDTGNNRLLRLS 339
>gi|290976601|ref|XP_002671028.1| predicted protein [Naegleria gruberi]
gi|284084593|gb|EFC38284.1| predicted protein [Naegleria gruberi]
Length = 925
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 56/108 (51%), Gaps = 9/108 (8%)
Query: 68 IEGNEIGVVPYKIRVS-EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHV 125
+ ++ + P K++V + G ++ D +NS I KI + +AG+ GY+G
Sbjct: 50 VNAGQVSISPTKVKVDRKTGNVYVSDRINSRIRKIDI---NTGKATTIAGNGIIGYSG-- 104
Query: 126 DGK-PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAG 171
DG+ +A N+P V + ++Y ADT N IRKI + + TIAG
Sbjct: 105 DGQLATQANINNPGKVELSSTNDIYFADTENHRIRKIFANGTIITIAG 152
>gi|296122042|ref|YP_003629820.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
gi|296014382|gb|ADG67621.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
Length = 425
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 76/161 (47%), Gaps = 15/161 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS--FQGYTGHVDGKP-NEAR 133
P+ + + DG L+ E +I++ Q + +AG+ +G+ G DG P +A
Sbjct: 112 PFGLVIGPDGALYFC-EYTGHIIRRLDLEKQTAT--TIAGTPGKKGFAG--DGGPATKAL 166
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAG-VTTIAGGKSNVAGFRD--GPSEDAKF 189
N P + G++ +AD IRKI G G ++T+AG + AGF GP+E A+
Sbjct: 167 MNEPHELRFTPAGDIVIADMRTHTIRKIDGKTGMISTLAG--TGTAGFSGDGGPAEKAQL 224
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNS 230
N + + P LL+ D GN +R++ + Y +
Sbjct: 225 -NMPHSIQIDPAGDLLICDTGNHRVRKVDMKTGLISTAYGT 264
>gi|395768739|ref|ZP_10449254.1| hypothetical protein Saci8_03116 [Streptomyces acidiscabies 84-104]
Length = 596
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 47/82 (57%), Gaps = 9/82 (10%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI----GDAGVTTIAG-GKS 174
G G DG EA F+ P+G+ + D G + VADT+N A+R++ GD VTT+AG G+
Sbjct: 211 GTRGFADGT-AEAAFSEPQGLALLDDGAVVVADTVNHALRRVDLATGD--VTTLAGTGRQ 267
Query: 175 NVAGF-RDGPSEDAKFSNDFDV 195
G GP+ + S+ +DV
Sbjct: 268 WWQGSPTSGPAREIDLSSPWDV 289
Score = 38.1 bits (87), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
GH DG ++A HP GVT G++ ++DT N A+R+ A VTT+A
Sbjct: 381 GHRDGAADQALLQHPLGVTALPDGSVAISDTYNHALRRYDPATGEVTTLA 430
>gi|290984233|ref|XP_002674832.1| predicted protein [Naegleria gruberi]
gi|284088424|gb|EFC42088.1| predicted protein [Naegleria gruberi]
Length = 2158
Score = 44.7 bits (104), Expect = 0.11, Method: Composition-based stats.
Identities = 40/142 (28%), Positives = 63/142 (44%), Gaps = 12/142 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + V D ++ D+ N I ++ + S G G G TG + +H
Sbjct: 101 PYGVFVM-DNIIYIADQYNCKIRMVSSNIIATSVGVPGCGDAIGATG-------VTQLDH 152
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDV 195
P + + G Y+AD N ++K+ VTTIAG G +N G DG N+
Sbjct: 153 PPAIFVPYTGLYYIADFYNHKVKKVESGTVTTIAGTGTANYNG--DGIQASTAQLNEPHG 210
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
V+VR + ++D+ N +R I
Sbjct: 211 VFVRND-KIYIVDKKNHRIRMI 231
>gi|219852663|ref|YP_002467095.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546922|gb|ACL17372.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 522
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 62/147 (42%), Gaps = 22/147 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + V G ++ D N I K T V G+F G N+ FN
Sbjct: 94 PHGVAVDSTGNVYVADTGNHQIQKFT-----------VNGNFNTQWGIKGSGTNQ--FNS 140
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P+G+ +D GN+YVADT N I K +G + T G + G + P+ A +
Sbjct: 141 PEGIAVDGAGNVYVADTGNNRIEKFTSSGDIVTSWGSYGSEVGQFNRPTSVAVDNTGIGY 200
Query: 196 VYVRPTCSLLVIDRGNAALRQISLNQD 222
+YV D GN +++ +L D
Sbjct: 201 IYVA--------DTGNNRIQKFTLTGD 219
>gi|254447301|ref|ZP_05060768.1| NHL repeat containing protein [gamma proteobacterium HTCC5015]
gi|198263440|gb|EDY87718.1| NHL repeat containing protein [gamma proteobacterium HTCC5015]
Length = 2515
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 49/106 (46%), Gaps = 10/106 (9%)
Query: 85 DGELFAVDE--VNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
DG L+ N ++KITP + +Y+ +GY+G G EA N P+G+
Sbjct: 1195 DGSLYWSGNRTANQRVLKITPEGEIEKYA-----GKETRGYSGD-GGAATEAELNLPRGM 1248
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSED 186
+ G L++AD N IRK+ G+ T G A DG S +
Sbjct: 1249 AVSSGGTLFIADWGNHRIRKVAPNGIITTVAGNGEEAYSGDGLSAE 1294
>gi|227540165|ref|ZP_03970214.1| conserved protein of hypothetical function [Sphingobacterium
spiritivorum ATCC 33300]
gi|227240026|gb|EEI90041.1| conserved protein of hypothetical function [Sphingobacterium
spiritivorum ATCC 33300]
Length = 658
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 60/125 (48%), Gaps = 26/125 (20%)
Query: 98 IVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVAD---TL 154
IVKI ++ ++ GS G G+VDG +A+ N GV +D+KG+LY D T
Sbjct: 369 IVKI----DRFGTASVLIGS--GGAGYVDGSLADAKTNGAWGVVVDNKGDLYFTDNNATG 422
Query: 155 NLAIRKI-----GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDR 209
IRKI G+ V+TIAG S G DG A F+ RP S L ID+
Sbjct: 423 GNRIRKITFDASGNGTVSTIAGQAS--TGNADGIGAAASFN--------RP--SGLAIDK 470
Query: 210 GNAAL 214
N L
Sbjct: 471 NNNIL 475
>gi|290978007|ref|XP_002671728.1| predicted protein [Naegleria gruberi]
gi|284085299|gb|EFC38984.1| predicted protein [Naegleria gruberi]
Length = 2520
Score = 44.7 bits (104), Expect = 0.12, Method: Composition-based stats.
Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 7/103 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGH-VDGKPNEARFN 135
P I V + ELF D N + ++ + + +VAG+ G G+ + A N
Sbjct: 39 PEGIAVGNNNELFISDSKNHLVKRLDLTTLKVT---IVAGT--GIAGNDTNTTATGAMLN 93
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
+P G+ + +G L +ADTLN IR++ + + TI GK V+G
Sbjct: 94 YPGGICVTSEGKLLIADTLNHVIREVSNGQIKTII-GKCGVSG 135
Score = 43.9 bits (102), Expect = 0.18, Method: Composition-based stats.
Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 7/103 (6%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI----GDAGVTTIAGGKSN 175
G G+ DG A +N P GV +LY+AD N +R + G G +A S
Sbjct: 230 GSAGYADGSAFNALYNAPTGVAYGPNKDLYIADRGNHVVRVVKFSYGTTGTQNLA-TISL 288
Query: 176 VAGFRDGPSED-AKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+G P D K +N +D V V + ++D GN +R+I
Sbjct: 289 FSGVAKTPGSDLVKMNNTYD-VDVSSEGHIFILDTGNLRIRKI 330
Score = 42.7 bits (99), Expect = 0.44, Method: Composition-based stats.
Identities = 43/184 (23%), Positives = 70/184 (38%), Gaps = 44/184 (23%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V+ +G+L D +N I +++ + G+ G ++ P A N
Sbjct: 95 PGGICVTSEGKLLIADTLNHVIREVSNGQIKTIIGKC---GVSGVANDLNTLPTNAYLNQ 151
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG------------------------------- 165
PK VT GN+Y +D+LN +RK A
Sbjct: 152 PKYVTSLPNGNIYFSDSLNYRVRKYTKANNFISTQQLIVQGSLSAGVAAFSSTEMTFAEP 211
Query: 166 -----VTTIAGGKSNV----AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
+ T +G S + AG+ DG + +A ++ V Y P L + DRGN +R
Sbjct: 212 GTHRLIRTGSGATSYLGVGSAGYADGSAFNALYNAPTGVAY-GPNKDLYIADRGNHVVRV 270
Query: 217 ISLN 220
+ +
Sbjct: 271 VKFS 274
>gi|300771890|ref|ZP_07081761.1| NHL repeat containing protein [Sphingobacterium spiritivorum ATCC
33861]
gi|300761276|gb|EFK58101.1| NHL repeat containing protein [Sphingobacterium spiritivorum ATCC
33861]
Length = 658
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 60/125 (48%), Gaps = 26/125 (20%)
Query: 98 IVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVAD---TL 154
IVKI ++ ++ GS G G+VDG +A+ N GV +D+KG+LY D T
Sbjct: 369 IVKI----DRFGTASVLIGS--GGAGYVDGSLADAKTNGAWGVVVDNKGDLYFTDNNATG 422
Query: 155 NLAIRKI-----GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDR 209
IRKI G+ V+TIAG S G DG A F+ RP S L ID+
Sbjct: 423 GNRIRKITFDASGNGTVSTIAGQAS--TGNADGIGAAASFN--------RP--SGLAIDK 470
Query: 210 GNAAL 214
N L
Sbjct: 471 NNNIL 475
>gi|290979059|ref|XP_002672252.1| predicted protein [Naegleria gruberi]
gi|284085827|gb|EFC39508.1| predicted protein [Naegleria gruberi]
Length = 1239
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 8/95 (8%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
GEL+ V+E N I K++ + + +AG+ G G+ DG + N+P V + +
Sbjct: 64 GELYFVEEENHIIRKLSFVTNNITT---IAGN--GTAGYSDGVATLSMLNYPHSVHVSNT 118
Query: 146 -GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF 179
G +Y+ADT N +R++ + + TIAG N GF
Sbjct: 119 TGEIYIADTFNCKLRRVFNGQMETIAG--YNGCGF 151
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 10/98 (10%)
Query: 77 PYKIRVSED-GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK-PNEARF 134
P+ + VS GE++ D N + ++ + +AG + G VDGK E +
Sbjct: 110 PHSVHVSNTTGEIYIADTFNCKLRRVFN-----GQMETIAG-YNGCGFDVDGKRATETKL 163
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAG 171
N+P+ + + + +Y AD+ N IRKI D +TTIAG
Sbjct: 164 NYPQAIRVSNN-EVYFADSANNRIRKITTDGSITTIAG 200
>gi|91076402|ref|XP_969236.1| PREDICTED: similar to CG12547 CG12547-PA [Tribolium castaneum]
gi|270002449|gb|EEZ98896.1| hypothetical protein TcasGA2_TC004511 [Tribolium castaneum]
Length = 712
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 62/135 (45%), Gaps = 9/135 (6%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKG 146
E+ A+ + +N + I S G ++ G G DG + FN P+G+ ++
Sbjct: 230 EILAISDTGNNRILILK-----SDGTILHQVGSGEIGFRDGSFTQCEFNAPQGLVFQNEN 284
Query: 147 NLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF-RDGPSEDAK--FSNDFDVVYVR-PTC 202
L+VADT N AIRKI A T + S V G R G + + S+ +D+ R P
Sbjct: 285 TLFVADTENHAIRKIDLAKKTVESVVGSGVQGHDRVGGKQWTQQVISSPWDLCIFRTPDM 344
Query: 203 SLLVIDRGNAALRQI 217
+ GN +R+I
Sbjct: 345 DMSFYPEGNPPIREI 359
>gi|290990638|ref|XP_002677943.1| predicted protein [Naegleria gruberi]
gi|284091553|gb|EFC45199.1| predicted protein [Naegleria gruberi]
Length = 1317
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 46/185 (24%), Positives = 82/185 (44%), Gaps = 31/185 (16%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF 179
GY D ++ FN PK + + G+LYV D+ N IR+I +TTI G K N++
Sbjct: 219 GYNTESDA--SKTLFNQPKDIHYTNDGDLYVFDSKNNLIRRISKGIITTIVGSKYNISTI 276
Query: 180 RDG-----PSE---DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSI 231
G P E S D+ Y L V+D + ++++SL + +
Sbjct: 277 AGGETLLSPDELPIGRPLSMDYYQNY------LFVVDGMSGLIKRLSLIEGE-------- 322
Query: 232 SPTDILMVVGAV--LVGYVTCML--QQGFGPFFFSRTQQPSESEFNEE--TEIKELSNKE 285
P+ ++ V+ V VG+ + QQ P + +++ + F E EIK++ +
Sbjct: 323 -PSKVVEVIAGVSGFVGWFESQMATQQKISPHALTVSKKSGQVYFVETFGNEIKKIDSSG 381
Query: 286 KPIPI 290
+ + +
Sbjct: 382 RIVLV 386
>gi|108707787|gb|ABF95582.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
protein, expressed [Oryza sativa Japonica Group]
Length = 772
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 59/122 (48%), Gaps = 18/122 (14%)
Query: 89 FAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY---------TGHVDGKPNEARFNHPKG 139
A+D +N+ + +S + R+V + +G G +DG + A FN P+G
Sbjct: 577 LAIDVLNNRLF-----ISDSNHNRIVVTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQG 631
Query: 140 VTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSND-FDV 195
+ + K N LYVADT N A+R+I V T+AG + + +R G + N +DV
Sbjct: 632 LAYNSKKNILYVADTENHALREINFVSETVKTLAGNGTKGSDYRGGGQGTNQVLNSPWDV 691
Query: 196 VY 197
Y
Sbjct: 692 CY 693
>gi|456988165|gb|EMG23306.1| hypothetical protein LEP1GSC150_2908 [Leptospira interrogans
serovar Copenhageni str. LT2050]
Length = 231
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
L AG+ G VDG + A F P G+ +D GN+YV+D +N IRKI +G
Sbjct: 59 LFAGT--GVAASVDGTTSTASFKTPFGLEVDTSGNIYVSDQINNLIRKIDPSG 109
>gi|357391051|ref|YP_004905892.1| hypothetical protein KSE_41500 [Kitasatospora setae KM-6054]
gi|311897528|dbj|BAJ29936.1| hypothetical protein KSE_41500 [Kitasatospora setae KM-6054]
Length = 608
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 48/96 (50%), Gaps = 4/96 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS DGE V + ++ ++ + R + G F GH DG +A H
Sbjct: 350 PSGLAVSADGERLWVADSETSALRWVSRGTHEVRTAVGTGLFD--FGHRDGAAGQALLQH 407
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
P GVT+ G++ V+DT N A+R+ A V+T+A
Sbjct: 408 PLGVTVLPDGSVAVSDTYNQALRRYDPAAGEVSTLA 443
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 51/108 (47%), Gaps = 6/108 (5%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKI--GDAGVTT 168
R+ AG+ G VDG P EA F P G+ + G L+VAD+ A+R + G V T
Sbjct: 327 RVAAGTTN--EGLVDGPPGEAWFAQPSGLAVSADGERLWVADSETSALRWVSRGTHEVRT 384
Query: 169 IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
G G RDG + A + V V P S+ V D N ALR+
Sbjct: 385 AVGTGLFDFGHRDGAAGQALLQHPLGVT-VLPDGSVAVSDTYNQALRR 431
>gi|428297525|ref|YP_007135831.1| NHL repeat containing protein [Calothrix sp. PCC 6303]
gi|428234069|gb|AFY99858.1| NHL repeat containing protein [Calothrix sp. PCC 6303]
Length = 511
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 7/104 (6%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG---DAGVTTIAGGKS-NVAGFRD 181
DG+ +E+ F P G+T D + NLY+AD+ IR +G A V +I GG+ + G +D
Sbjct: 333 DGELSESVFAQPSGITSDGE-NLYIADSEVSTIRSVGIGESAQVKSICGGEDLYLFGDKD 391
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCE 225
G + + + + Y+ SL + D N ++ + +C+
Sbjct: 392 GIGTEVRLQHCLGIEYIDN--SLWIADTYNHKIKLVDSQSRNCQ 433
Score = 42.4 bits (98), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGN--LYVADTLNLAIRKI--GDAGVTTIAG 171
G G VDG A F+ P+G+T D+ GN LY+ADT N A+R++ G+ V TIAG
Sbjct: 213 GKAGFVDGDFEAATFSTPQGMTFDE-GNQILYIADTGNHALRQVDLGNQLVRTIAG 267
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 6/77 (7%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
D L+ D N I + SQ + + G Q +G +D K +ARF P G++ D
Sbjct: 409 DNSLWIADTYNHKIKLVD---SQSRNCQTILGDTQ--SGFIDAKGTDARFFEPSGLSYWD 463
Query: 145 KGNLYVADTLNLAIRKI 161
LY+ADT N AIRK+
Sbjct: 464 S-YLYIADTNNHAIRKV 479
>gi|124006033|ref|ZP_01690870.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
gi|123988440|gb|EAY28086.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
Length = 789
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 70/145 (48%), Gaps = 12/145 (8%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE-ARFNH 136
Y I + + ++AV++V+ + KI + + AG DG P A +
Sbjct: 192 YDIALDANNNIYAVNDVDKYVRKIDATTGSIT---IFAGDGNALN---DGGPAHLASLRN 245
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
PKG+ +D GN+Y+AD L+ IRK+ DA +TTIAG + G + A+ + F
Sbjct: 246 PKGLAIDAAGNVYIADVLDDRIRKV-DARTGIITTIAGTGAGGYSGDGGVATSARINYPF 304
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
V V ++ D N ALR+++
Sbjct: 305 RVA-VDALGNVYFTDWDNDALRKLT 328
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 63/116 (54%), Gaps = 17/116 (14%)
Query: 113 LVAGS-FQGYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VT 167
++AG+ +GY+G DG P A ++P G+T+D GN+Y AD N +RKI DA ++
Sbjct: 70 IIAGTGTEGYSG--DGGPATAAMLDYPTGITVDAAGNVYFADGSNDVVRKI-DATTHVIS 126
Query: 168 TIAG-GKSNVAGFRDGPSEDAK---FSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
TIAG G AG DG A F +D V + ++ ++D N +R++ +
Sbjct: 127 TIAGNGNRGFAG--DGGQATAAQLHFPSD---VALDTAGNIYIVDHRNDRIRKVDI 177
>gi|428183007|gb|EKX51866.1| hypothetical protein GUITHDRAFT_84912 [Guillardia theta CCMP2712]
Length = 351
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 68/152 (44%), Gaps = 12/152 (7%)
Query: 77 PYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + S DG+ L+ D N+ + K+ + G + +G G DG+ +
Sbjct: 85 PCGVSTSPDGQWLYVADSGNNRVRKV-----NTTSGATLTLVGKGDLGSADGEAATVSLS 139
Query: 136 HPKGVTMDDKGN-LYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
P V G LY+AD+L+ IR++ D T+AG S F DGP ++ F+
Sbjct: 140 TPLSVAASPDGRYLYIADSLSQRIRQVRLADGWTRTLAG--SGNQSFSDGPPSESSFNTP 197
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISLNQDDC 224
+ ++ V D N +R+I++ +D C
Sbjct: 198 AAIAVSPDGLTVYVADLNNERVRKIAV-RDVC 228
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 42/89 (47%), Gaps = 5/89 (5%)
Query: 133 RFNHPKGVTMD-DKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKF 189
RFN P G+ + D LYV D N IRK+ D TT+AG + GF DG + F
Sbjct: 25 RFNCPYGLALSPDDLWLYVGDRDNHRIRKVSMRDGSTTTLAG--TGTPGFADGDWNSSSF 82
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ V L V D GN +R+++
Sbjct: 83 NQPCGVSTSPDGQWLYVADSGNNRVRKVN 111
>gi|326923951|ref|XP_003208196.1| PREDICTED: NHL repeat-containing protein 2-like [Meleagris
gallopavo]
Length = 727
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 62/126 (49%), Gaps = 11/126 (8%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
+ P K+ V + GE + + + + +T G+++ +G DG+ +EA F
Sbjct: 219 LFPGKVTVDKSGERLVIADTGHHRILVT-----LKNGQILHTIGGPNSGRKDGRFSEAAF 273
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAG-GKSNVAGFRDGPSEDAKFS 190
N P+GV + + +YVADT N IRKI D VTT+AG G V E+ S
Sbjct: 274 NSPQGVAIKNN-VIYVADTENHLIRKI-DLDLEIVTTVAGIGIQGVDKEGGAKGEEQPIS 331
Query: 191 NDFDVV 196
+ +DVV
Sbjct: 332 SPWDVV 337
>gi|290985802|ref|XP_002675614.1| predicted protein [Naegleria gruberi]
gi|284089211|gb|EFC42870.1| predicted protein [Naegleria gruberi]
Length = 848
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 61/117 (52%), Gaps = 9/117 (7%)
Query: 48 APQADGNVLQFENGYLVETVIEG-NEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLS 106
A + V + N + ++T+I+ N++ +P ++ +L+ + + I K +
Sbjct: 507 ADTWNNRVRRVYNDFTIQTIIQAPNQVQFLPQQLIYGPSCDLYIACTLENQIFKYSS--- 563
Query: 107 QYSRG-RLVAGSFQ-GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
S G L+AG+ + GYTG G AR N+PK + ++ G +Y+AD+ N +RKI
Sbjct: 564 --SEGFVLIAGNGKVGYTGDY-GPAIYARLNNPKAIAVNKFGEVYIADSFNKVVRKI 617
>gi|290976432|ref|XP_002670944.1| predicted protein [Naegleria gruberi]
gi|284084508|gb|EFC38200.1| predicted protein [Naegleria gruberi]
Length = 1514
Score = 44.3 bits (103), Expect = 0.15, Method: Composition-based stats.
Identities = 36/99 (36%), Positives = 52/99 (52%), Gaps = 10/99 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGH-VDGKPNEARF 134
P+ + + +L D N I KI + ++AG+ Q GY G +D K A+
Sbjct: 101 PFAVAFDLETQLIVSDSGNHVIRKIL----RNGTMVIIAGTGQSGYNGDGIDAKI--AKL 154
Query: 135 NHPKGVTMDDK-GNLYVADTLNLAIRKIGDAG-VTTIAG 171
N+P G+ +D K G L++ADT N IRKI G + TIAG
Sbjct: 155 NNPSGIVVDSKTGELFIADTANNRIRKILTNGTIITIAG 193
Score = 43.5 bits (101), Expect = 0.24, Method: Composition-based stats.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 12/147 (8%)
Query: 78 YKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK-PNEARFNH 136
Y I + + EL+ D N I KI S G ++ + G G +D A+
Sbjct: 215 YGIALDVNSELYIADANNYRIRKI------LSNGTIITIAGNGGYGFIDNVLATNAKLAF 268
Query: 137 PKGVTMDDKGNLYVADT----LNLAIRKIGDAGVTTIAGGKSNVAGFRDG-PSEDAKFSN 191
G+ +D G +Y+ ++ N IRKI G G + + D + DAKF
Sbjct: 269 VNGIAVDTNGEVYITESENGFSNHRIRKILTNGTIITFSGTGSRGYYGDNVEAADAKFYT 328
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ V + T +L+ D GN LR +S
Sbjct: 329 PYFVSVRQTTGEVLISDTGNNFLRLVS 355
Score = 42.0 bits (97), Expect = 0.70, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 45/92 (48%), Gaps = 9/92 (9%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV-DG-KPNEARFNHPKGV 140
S+ GELF D N+ I KI + G ++ + G TG+ DG A+ G+
Sbjct: 164 SKTGELFIADTANNRIRKI------LTNGTIITIAGTGETGYNGDGIDATSAQIRGVYGI 217
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
+D LY+AD N IRKI G + TIAG
Sbjct: 218 ALDVNSELYIADANNYRIRKILSNGTIITIAG 249
>gi|418721636|ref|ZP_13280812.1| hypothetical protein LEP1GSC101_0633 [Leptospira borgpetersenii
str. UI 09149]
gi|410741989|gb|EKQ90740.1| hypothetical protein LEP1GSC101_0633 [Leptospira borgpetersenii
str. UI 09149]
Length = 246
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 9/106 (8%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G+ F + + I+KI P+ Q+S L AG+ G G +G + F P + +D +
Sbjct: 133 GDKFISCQDTAQILKI-DPMDQFS---LYAGNSSGIGGFQNGDRLNSLFKGPLFMDLDRE 188
Query: 146 GNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
NLYV + N IRKI V+T++GG V+G+ DG A+F
Sbjct: 189 RNLYVGELGNHTIRKINLNSETVSTLSGG---VSGYLDGDLTSAQF 231
>gi|319786925|ref|YP_004146400.1| NHL repeat containing protein [Pseudoxanthomonas suwonensis 11-1]
gi|317465437|gb|ADV27169.1| NHL repeat containing protein [Pseudoxanthomonas suwonensis 11-1]
Length = 469
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 60/126 (47%), Gaps = 17/126 (13%)
Query: 48 APQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQ 107
AP DG G LVETV E P + S D LF D + I++ T
Sbjct: 145 APHPDG-------GRLVETVPEPRLSLRFPSAVAASAD-RLFIADSGHHRILECT----- 191
Query: 108 YSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAG 165
GR++ G +DG ++A FN P+G+ + ++ LYVADT N ++R+I +
Sbjct: 192 -HSGRVLRQFGLGTADSMDGPHDQAAFNRPQGLAL-EREVLYVADTGNHSLRRINLLNGT 249
Query: 166 VTTIAG 171
V T+ G
Sbjct: 250 VDTLCG 255
>gi|449137215|ref|ZP_21772546.1| NHL repeat containing protein [Rhodopirellula europaea 6C]
gi|448884292|gb|EMB14794.1| NHL repeat containing protein [Rhodopirellula europaea 6C]
Length = 351
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 36/152 (23%), Positives = 62/152 (40%), Gaps = 11/152 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG----SFQGYTGHVDGKPNEA 132
P+ + ++ V+ +++ P + +S ++AG + G+VDG A
Sbjct: 58 PFAVEFDSQNRMWIVEFDGGRVMRCEP--NDFSNPHVIAGPESATEPNALGYVDGPARSA 115
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKI-----GDAGVTTIAGGKSNVAGFRDGPSEDA 187
RFN + +D LY++D N ++R++ G V T AG + DA
Sbjct: 116 RFNKLHNLMIDSNDVLYLSDHANHSVRRLMESEDGKWVVDTYAGNGKRGPAVDNVDRRDA 175
Query: 188 KFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
F V LL+ D GN +R + L
Sbjct: 176 TFHEPISVTLDVKDNRLLIADIGNQVIRSLDL 207
>gi|332981053|ref|YP_004462494.1| NHL repeat containing protein [Mahella australiensis 50-1 BON]
gi|332698731|gb|AEE95672.1| NHL repeat containing protein [Mahella australiensis 50-1 BON]
Length = 703
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 62/117 (52%), Gaps = 15/117 (12%)
Query: 53 GNVLQFENGYLVETVIEGNEIGV----VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQY 108
GNVL + YL + VI+G+ +G+ P + VS+DG+++ D N+ I+ +
Sbjct: 38 GNVLYAPHAYLPDRVIDGSSLGIGNFNTPGDLFVSKDGKIYIADSGNNRIIIADSNWNIL 97
Query: 109 SRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
+ + T DGK + F +P+G+ + ++G++Y+ADT N + + G +G
Sbjct: 98 NELK---------TFEHDGKNDT--FKNPQGLYVTEEGHIYIADTDNGRLIEFGPSG 143
>gi|297722063|ref|NP_001173395.1| Os03g0311300 [Oryza sativa Japonica Group]
gi|255674457|dbj|BAH92123.1| Os03g0311300, partial [Oryza sativa Japonica Group]
Length = 100
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 47/91 (51%), Gaps = 8/91 (8%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR 180
G DG ++ HP GV +YVAD+ N I+++ VTTIAG + AG++
Sbjct: 12 GDHDGTGSDVLLQHPLGVVYASDNQIYVADSYNHKIKRLDPVTRKVTTIAG--TGRAGYK 69
Query: 181 DGPSEDAKFSNDFDVVYVRPT----CSLLVI 207
DGP+ A+ S +V V C++L++
Sbjct: 70 DGPALSAQLSEPAGLVEVGDGNLFPCAVLLV 100
>gi|198421727|ref|XP_002130106.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 454
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 56/109 (51%), Gaps = 7/109 (6%)
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDK-GNLYVADTLNLAIR--KIGDAGVTTIAG 171
AGS QG+ +G +A+F H G+ +D K +Y+AD+ N AIR +G VTT AG
Sbjct: 101 AGSSQGFA---NGVGLKAQFFHIAGMALDHKRQKMYIADSGNNAIRLMDLGTNEVTTFAG 157
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ F+DG A +N + Y L + D N A+R I+++
Sbjct: 158 TNGD-KAFKDGSLLTAGLNNPQQLEYDDTNDRLFIADTDNHAIRIINVD 205
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 52/101 (51%), Gaps = 6/101 (5%)
Query: 126 DGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG-DAG---VTTIAGGKSNVAGFR 180
DG A N+P+ + DD + L++ADT N AIR I D+ VTT+ GG + GF+
Sbjct: 166 DGSLLTAGLNNPQQLEYDDTNDRLFIADTDNHAIRIINVDSATPKVTTLTGGP-RMEGFK 224
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
DG +A + + + Y L V D N A+R + L +
Sbjct: 225 DGSFAEAMWRHPTGMAYDVRKDVLYVSDHYNHAVRMLDLKK 265
>gi|116621305|ref|YP_823461.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116224467|gb|ABJ83176.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 1042
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 69/149 (46%), Gaps = 30/149 (20%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPP--LSQYS-RGRLVAGS--FQGYTGHVDGKP-NEAR 133
I V G L+ + + + I K+TP +S Y+ G G+ F G DG+ +A
Sbjct: 481 IAVDSSGNLYLSENLYNKIKKVTPAGSMSTYAGTGGAPIGTPAFSG-----DGQAATQAG 535
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRDGPSEDAK-- 188
F +P +T+D GNLY+ D++ IRK+ DA VTTIAG +G + A+
Sbjct: 536 FGYPSALTVDKGGNLYITDSIGTRIRKV-DATTGIVTTIAGTGGCCYSGDEGKATSARID 594
Query: 189 -------------FSNDFDVVYVRPTCSL 204
F++ + Y+RP S+
Sbjct: 595 PYGLTVDAAGGIFFTDPLGIRYIRPDGSI 623
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 8/96 (8%)
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR--DGP 183
G A + P+G G++Y DT N +RKIG D +TTIAG + GF GP
Sbjct: 356 GAATAATLHRPQGTVTGTDGSIYFTDTANHRVRKIGTDGKITTIAG--TGDLGFSGDGGP 413
Query: 184 SEDAKFSNDF-DVVYVRPTCSLLVIDRGNAALRQIS 218
+ A S F D + + T +L VID+ +R+I+
Sbjct: 414 ATQATMS--FPDALAIDSTNNLYVIDQKQLRVRKIT 447
>gi|224052833|ref|XP_002198291.1| PREDICTED: NHL repeat-containing protein 2 [Taeniopygia guttata]
Length = 727
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 65/146 (44%), Gaps = 22/146 (15%)
Query: 68 IEGNEIGVVPYKIRVSEDGELF----AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG 123
I+ N IG+ YK + LF AVD+ +V ++ R++ G
Sbjct: 199 IKDNNIGIKLYKDSLPPSPLLFPGKVAVDDSGGRLV-----IADTGHHRILVTRKNGQIL 253
Query: 124 HVDGKPN---------EARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG- 171
H G PN EA FN P+G+ + + +YVADT N IRKI VTT+AG
Sbjct: 254 HTIGGPNSGRRDGRFSEAAFNSPQGIAIKNN-VIYVADTENHLIRKIDLELQMVTTVAGI 312
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVY 197
G V E+ S+ +DVV+
Sbjct: 313 GVQGVDKEGGAKGEEQPISSPWDVVF 338
>gi|398346052|ref|ZP_10530755.1| NHL repeat containing protein [Leptospira broomii str. 5399]
Length = 415
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 40/77 (51%), Gaps = 1/77 (1%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I+V G ++ D NS I+ + P++ LV GS +T ++G +
Sbjct: 271 PSSIKVDSSGGVWISDTGNSRILHYSAPITNAMSADLVIGSTN-FTSTLNGVTSANALWA 329
Query: 137 PKGVTMDDKGNLYVADT 153
P GV++D KG ++V+D+
Sbjct: 330 PSGVSLDSKGGMWVSDS 346
>gi|87311714|ref|ZP_01093830.1| NHL repeat protein [Blastopirellula marina DSM 3645]
gi|87285608|gb|EAQ77526.1| NHL repeat protein [Blastopirellula marina DSM 3645]
Length = 652
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 49/171 (28%), Positives = 70/171 (40%), Gaps = 24/171 (14%)
Query: 65 ETVIEGNEIGVVPYKIRVSE---------DGE-LFAVDEVNSNIVKITPPLSQYSRGRLV 114
E +++G + +PY+ S DG L+ D S+I + P + R V
Sbjct: 330 EDIVDGPLLPPIPYEQGFSSFAQPSGLTSDGTWLYVADSEGSSIRAV--PFDPKEKVRTV 387
Query: 115 AGSFQGYTGHV------DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
GS +G + DG EAR H GV D G +Y+ADT N IR + DA
Sbjct: 388 TGSAHLGSGRLFAFGDKDGSATEARLQHALGVCYVD-GQIYIADTYNCKIR-VADAKT-- 443
Query: 169 IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G VAG + ++DA + D L + D N +R I L
Sbjct: 444 --GEVRTVAGTGEHGADDAAPTFDEPAGLSYAAGKLYIADTNNHLIRTIDL 492
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 45/97 (46%), Gaps = 8/97 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ E + + + N N + I G+L+ G G DG A F+H
Sbjct: 163 PGKVLADEKSDRLFIADSNHNRIVIAD-----LDGKLIETIGNGAIGTTDGDYATASFHH 217
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
+G+ +D LYVADT N +RK+ VTTIAG
Sbjct: 218 LQGMALDGD-TLYVADTENHMLRKVDLKTKQVTTIAG 253
>gi|256833654|ref|YP_003162381.1| alkyl hydroperoxide reductase [Jonesia denitrificans DSM 20603]
gi|256687185|gb|ACV10078.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Jonesia denitrificans DSM 20603]
Length = 628
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 55/117 (47%), Gaps = 13/117 (11%)
Query: 120 GYTGHVDGKPNEARFNHPKG-------VTMDDKGNLYVADTLNLAIR--KIGDAGVTTIA 170
G G VDG + A+FN P G V + ++ VADT+N +R + V T+A
Sbjct: 230 GERGLVDGPAHTAQFNEPNGLLVLPPDVAAEVGYDVVVADTVNHCVRSLSLSTGAVGTLA 289
Query: 171 G-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEY 226
G G+ + G GP + + ++ +DVV+ +++ GN L + C++
Sbjct: 290 GNGRQYMVG---GPDNEGRLTSPWDVVWSAHRKQVIIAMAGNHTLWTLDPRTGQCQW 343
>gi|323454387|gb|EGB10257.1| hypothetical protein AURANDRAFT_62887 [Aureococcus anophagefferens]
Length = 434
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 50/109 (45%), Gaps = 4/109 (3%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMD-DKGNLYVADTLNLAIRKIGDAGVTTIA 170
R++AG G +G DG+ + A F P GV G++YV++ IR I VTT+A
Sbjct: 166 RVLAGC--GMSGGRDGRADRATFAGPWGVAAHPATGDVYVSEYFGCKIRVIRRGDVTTLA 223
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
G G DG A F+ V L V D GN +R++ L
Sbjct: 224 GSGPR-GGRADGVGPAASFAGPRGVCLDAARNCLYVADSGNHCVRRLDL 271
Score = 43.1 bits (100), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 51/98 (52%), Gaps = 6/98 (6%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGF 179
G DG A F P+GV +D N LYVAD+ N +R+ + VTT+AG + G+
Sbjct: 230 GRADGVGPAASFAGPRGVCLDAARNCLYVADSGNHCVRRLDLASGAVTTVAGDGTR--GY 287
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
DG + A+F ++ V + +L V D+ N +R +
Sbjct: 288 ADGDAGGARF-DEPTAVALDADGALYVADQENRRVRCL 324
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 38/82 (46%), Gaps = 5/82 (6%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
L+ D N + ++ + G + + G G+ DG ARF+ P V +D G
Sbjct: 256 LYVADSGNHCVRRL-----DLASGAVTTVAGDGTRGYADGDAGGARFDEPTAVALDADGA 310
Query: 148 LYVADTLNLAIRKIGDAGVTTI 169
LYVAD N +R + V+T+
Sbjct: 311 LYVADQENRRVRCLRGRTVSTL 332
>gi|261855185|ref|YP_003262468.1| redoxin [Halothiobacillus neapolitanus c2]
gi|261835654|gb|ACX95421.1| Redoxin domain protein [Halothiobacillus neapolitanus c2]
Length = 498
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 53/109 (48%), Gaps = 12/109 (11%)
Query: 71 NEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN 130
N V+P KI V+++ A+ + + V+I G+L+A G G DG
Sbjct: 194 NHFMVLPTKIAVNDN--RVAISDTGHDQVRIFD-----HEGKLLATIGDGKAGFADGSFA 246
Query: 131 EARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAG--VTTIAGGKSNV 176
A FN P+G+ GN LYVADT N IR+I A V+T+ G V
Sbjct: 247 TAEFNRPQGIAW--LGNDLYVADTDNQRIRRIDFASQTVSTVVGNGQRV 293
>gi|406909052|gb|EKD49392.1| lipoprotein, partial [uncultured bacterium]
Length = 552
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 72/163 (44%), Gaps = 31/163 (19%)
Query: 80 IRVSEDGE-LFAVDEVNSNIVKIT-------------------PPLSQYSRGRLVAGSFQ 119
I +S DG L+ D+ N I K++ P S Y G G
Sbjct: 285 IDISPDGNWLYIADQYNHRIRKMSTKTKKTEYITGAGEVNFYSPNYSAYQEG----GPCD 340
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAGVTT--IAGGKSNV 176
G TG K A FN P + ++ G YV D+ N +R+I + T +AG +S
Sbjct: 341 GETGA--NKSGCAYFNKPSAIVVNRAGTKAYVTDSGNNMLREINLSTGKTRKLAGTRS-- 396
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
AGF++G A F+ D+ +L V D+GN A+R+I+L
Sbjct: 397 AGFKNGFGPLATFNGPADLDLDAAGANLYVADKGNHAIRRINL 439
>gi|380693454|ref|ZP_09858313.1| hypothetical protein BfaeM_05669 [Bacteroides faecis MAJ27]
Length = 440
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 4/85 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I + G L+ D + KI+ + VAG +Q +G VDG P E+ FN+
Sbjct: 357 PAGITFDKSGNLYFCDSQGYTLRKISRIDGMLTT---VAGQYQK-SGGVDGLPLESTFNY 412
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P V MDD+ N ++ + +RK
Sbjct: 413 PYVVCMDDEENFFIGERWGCVVRKF 437
Score = 41.6 bits (96), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 48/99 (48%), Gaps = 4/99 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFR 180
G DG EA +P G+T D GNLY D+ +RKI D +TT+AG G
Sbjct: 343 GGKDGPRLEATLRNPAGITFDKSGNLYFCDSQGYTLRKISRIDGMLTTVAGQYQKSGGV- 401
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
DG ++ F+ + VV + + + +R +R+ ++
Sbjct: 402 DGLPLESTFNYPY-VVCMDDEENFFIGERWGCVVRKFAV 439
>gi|254417234|ref|ZP_05030979.1| Redoxin family [Coleofasciculus chthonoplastes PCC 7420]
gi|196176040|gb|EDX71059.1| Redoxin family [Coleofasciculus chthonoplastes PCC 7420]
Length = 496
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 56/123 (45%), Gaps = 7/123 (5%)
Query: 107 QYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG---D 163
Q G + + G +DG NEA F P G+T D + LYVAD+ +IR +G +
Sbjct: 316 QLETGEIGTYAGTGAEACLDGSLNEAAFAQPSGITSDGR-ELYVADSEVSSIRGVGLINN 374
Query: 164 AGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
V TI G G+ G RDG + + + V Y + L V D N ++++
Sbjct: 375 LPVRTICGSGELFEFGDRDGHKSEVRLQHCLGVEYTQN--YLWVADTYNHKIKRVDPTTG 432
Query: 223 DCE 225
C+
Sbjct: 433 SCQ 435
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 53/112 (47%), Gaps = 10/112 (8%)
Query: 112 RLVAGSFQGYT-GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG---DAGVT 167
R + GS + + G DG +E R H GV + L+VADT N I+++ + T
Sbjct: 378 RTICGSGELFEFGDRDGHKSEVRLQHCLGVEYT-QNYLWVADTYNHKIKRVDPTTGSCQT 436
Query: 168 TIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+ GK AG RDG S +A+F+ V + L + D N +R + L
Sbjct: 437 LLGNGK---AGHRDGQSHEAEFAEPSGVSAI--AARLYIADTNNHVIRCVDL 483
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 44/94 (46%), Gaps = 13/94 (13%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
L+ D N I ++ P + G G GH DG+ +EA F P GV+
Sbjct: 414 LWVADTYNHKIKRVDP-----TTGSCQTLLGNGKAGHRDGQSHEAEFAEPSGVSA-IAAR 467
Query: 148 LYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGF 179
LY+ADT N IR + G VTT+ N++G+
Sbjct: 468 LYIADTNNHVIRCVDLGTLEVTTL-----NLSGY 496
>gi|428308867|ref|YP_007119844.1| thiol-disulfide isomerase-like thioredoxin [Microcoleus sp. PCC
7113]
gi|428250479|gb|AFZ16438.1| thiol-disulfide isomerase-like thioredoxin [Microcoleus sp. PCC
7113]
Length = 515
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 73/151 (48%), Gaps = 18/151 (11%)
Query: 77 PYKIRVSEDGE-LFAVDEVNSNIV--KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
P K+ E+GE LF D + IV ++ + Q V G+ G G DG EA+
Sbjct: 182 PGKVLTDEEGERLFIADSGHHRIVVSSLSGEVQQ------VIGT--GKPGLTDGSFEEAQ 233
Query: 134 FNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFR--DGPSEDAK 188
F P+G+T+D + LYVADT N +R++ + V TIAG R G + + +
Sbjct: 234 FFAPQGMTLDSENQLLYVADTENHCLRQVDLKNQQVKTIAGTGEQSHHIRPHSGKALETR 293
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
++ +DV V LL+ G+ + ++ L
Sbjct: 294 LNSPWDVEKVGH--CLLIAMAGSHQIWEMQL 322
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 37/75 (49%), Gaps = 6/75 (8%)
Query: 88 LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN 147
L+ D N I ++ P G QG GH DG+ +ARF+ P G++ +
Sbjct: 419 LWVADTYNHKIKRVDP-----RTGTCQTMIGQGTAGHQDGQSIKARFSEPSGLSA-IAAH 472
Query: 148 LYVADTLNLAIRKIG 162
LYVADT N AIR I
Sbjct: 473 LYVADTNNHAIRCIA 487
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 54/123 (43%), Gaps = 7/123 (5%)
Query: 107 QYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DA 164
Q G L + G +DG P++A F P G+T D + LY+AD+ +IR +G D
Sbjct: 321 QLETGWLRTYAGTGAEACLDGTPDQAAFAQPSGLTTDGR-ELYIADSEVSSIRAVGLVDH 379
Query: 165 GVTTIAGGKSNVAGFRDGPSEDA--KFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
G + GF D EDA + + V Y + L V D N ++++
Sbjct: 380 LPVRTVCGSGELFGFGDVDGEDADVRLQHALGVEYTQN--YLWVADTYNHKIKRVDPRTG 437
Query: 223 DCE 225
C+
Sbjct: 438 TCQ 440
>gi|219851898|ref|YP_002466330.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219546157|gb|ACL16607.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 387
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 58/143 (40%), Gaps = 24/143 (16%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N+ I K T S G + S+ G F +
Sbjct: 142 PGGVAVDSVGNVYVADSANNQIQKFT------STGGFIT-SWNG------SASAGGAFEY 188
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
P GV +D GN+YVADT N I+K G T G ++ G D PS
Sbjct: 189 PGGVAVDSAGNVYVADTFNGQIQKFTSTGEFITRWNGSASAGGVFDKPSG---------- 238
Query: 196 VYVRPTCSLLVIDRGNAALRQIS 218
V V ++ V D GN +++ +
Sbjct: 239 VAVDSAGNVYVADWGNNWVQKFT 261
>gi|255513298|gb|EET89564.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Candidatus Micrarchaeum acidiphilum ARMAN-2]
Length = 579
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 82/193 (42%), Gaps = 25/193 (12%)
Query: 37 LLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGN--EIGVV-PYKIRVSEDGELFAVDE 93
+ ++ R + AP A G E +I+GN E P I V + E++ D
Sbjct: 387 IWEYIMRDGTAAPYA---------GNCAENIIDGNLKEASFAQPSGIWV-DGNEIYVADS 436
Query: 94 VNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADT 153
S I I + + GS G DG AR HP GV+ D G +YVADT
Sbjct: 437 EASAIRSIN---MKDGYASTITGSGLFTFGDQDGSLASARLQHPIGVSAKD-GLIYVADT 492
Query: 154 LNLAIRKI---GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVI-D 208
N AIR+I + T ++G G+ + F D + + D V+P L+I D
Sbjct: 493 YNSAIREIDVKANRITTLVSGPGRKSACRFGDPKCDTLQLYEPND---VKPFGRALIIAD 549
Query: 209 RGNAALRQISLNQ 221
N +R+ +++
Sbjct: 550 TNNNLIRRFDIDE 562
>gi|434403812|ref|YP_007146697.1| thiol-disulfide isomerase-like thioredoxin [Cylindrospermum
stagnale PCC 7417]
gi|428258067|gb|AFZ24017.1| thiol-disulfide isomerase-like thioredoxin [Cylindrospermum
stagnale PCC 7417]
Length = 506
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 58/122 (47%), Gaps = 16/122 (13%)
Query: 112 RLVAGSFQGYTGHV---------DGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRK- 160
RLV SF G H+ DG +EA+F+ P+G+ D + LYVADT N A+RK
Sbjct: 196 RLVMSSFNGEVFHIIGTGKSGFTDGNFSEAQFSAPQGMAYDAENQILYVADTENHALRKV 255
Query: 161 -IGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
I V TIAG R G + + ++ +D+ + SL + GN + Q+
Sbjct: 256 DIKRQVVGTIAGTGEQSHNTRPHSGAALETALNSPWDLEKIGN--SLFIAMAGNHQIWQL 313
Query: 218 SL 219
L
Sbjct: 314 DL 315
>gi|167041535|gb|ABZ06284.1| putative NHL repeat protein [uncultured marine microorganism
HF4000_008B14]
Length = 426
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 37/71 (52%), Gaps = 7/71 (9%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK---IGDAGVTTIAGGKSNVAGF 179
G VDG + R HP GV D G +Y+ADT N I+K + + T + G+ G
Sbjct: 320 GDVDGADHNIRLQHPIGVAHHD-GTIYLADTYNHKIKKVLPVMQSSFTMLGNGQP---GD 375
Query: 180 RDGPSEDAKFS 190
RDGP A+FS
Sbjct: 376 RDGPGNQAQFS 386
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 41/76 (53%), Gaps = 6/76 (7%)
Query: 84 EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMD 143
DG ++ D N I K+ P + Q S L G G DG N+A+F+ P G+++
Sbjct: 340 HDGTIYLADTYNHKIKKVLPVM-QSSFTML----GNGQPGDRDGPGNQAQFSEPSGLSI- 393
Query: 144 DKGNLYVADTLNLAIR 159
D G +Y+ADT N AIR
Sbjct: 394 DGGKIYIADTNNHAIR 409
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 48/98 (48%), Gaps = 10/98 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ D + + N N + +T + V GS G G VDG A F+H
Sbjct: 111 PGKVVADADSNRLFIADSNHNRIIVTTLDGDALK---VVGS--GEPGMVDGSFAAASFDH 165
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRKIGDAG--VTTIAG 171
P+G+ + G+ LYVAD N AIRK+ A V TIAG
Sbjct: 166 PQGMVL--VGDLLYVADAENHAIRKVDLAAEQVETIAG 201
>gi|407792264|ref|ZP_11139332.1| hypothetical protein B3C1_18191 [Gallaecimonas xiamenensis 3-C-1]
gi|407197721|gb|EKE67773.1| hypothetical protein B3C1_18191 [Gallaecimonas xiamenensis 3-C-1]
Length = 488
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 47/97 (48%), Gaps = 10/97 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+RV G+L A+ + N + P GR+V G DG EA F
Sbjct: 188 PGKVRVK--GDLLAISDSGHNRIIFARP-----DGRVVKVVGSGEACAKDGSAEEACFAS 240
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
P+G D+ LYVADT N IR+I + V+T+AG
Sbjct: 241 PQGTLFQDQA-LYVADTNNHLIRRIDLRNYQVSTVAG 276
>gi|390342982|ref|XP_001201679.2| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
purpuratus]
Length = 338
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 42/99 (42%), Gaps = 2/99 (2%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFR 180
G DGK EAR HP GV D L+VAD+ N I+ I + T AG G
Sbjct: 76 GDSDGKGLEARLQHPLGVAWDHSKLLFVADSYNHKIKMIDPEERYCATYAGTGEPGKGGD 135
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
D A+F+ + C + V D N +R I +
Sbjct: 136 DEHILKAQFNEPGGLAISPCGCKIYVADTNNHTIRCIDI 174
>gi|307106270|gb|EFN54516.1| hypothetical protein CHLNCDRAFT_31604 [Chlorella variabilis]
Length = 1072
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 5/95 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA--GVTTIAGGKSNVAGFR 180
G DG ++A HP V G +YVAD+ N ++++ A + T+AG S AG+R
Sbjct: 800 GDKDGTGSDALLQHPLAVLAAPSGKVYVADSYNHRLKELDPATNTIRTLAG--SGAAGYR 857
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALR 215
DG A+ S + P ++++ D N+ +R
Sbjct: 858 DGVGTTAQLSEPAGLA-AGPDGTVIICDTNNSLIR 891
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 51/109 (46%), Gaps = 9/109 (8%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G LF D N+ IV GR + G G VDG +A F P+G+ K
Sbjct: 583 GRLFISDSNNNRIVVCD------ENGRFLEAIGCGAPGLVDGSYEDAAFFRPQGIAYSAK 636
Query: 146 GN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
+ LYVADT + A+R++ V T+AG + + +R G S ++ N
Sbjct: 637 RDCLYVADTESHALRQVDLQKQTVFTLAGNGAKGSDYRGGGSGASQLMN 685
>gi|196233139|ref|ZP_03131986.1| ABC transporter-related protein [Chthoniobacter flavus Ellin428]
gi|196222783|gb|EDY17306.1| ABC transporter-related protein [Chthoniobacter flavus Ellin428]
Length = 598
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 40/83 (48%), Gaps = 13/83 (15%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+PY I DGEL+ ++ + K++P GRL+ G GH G E F+
Sbjct: 524 LPYDIAAGGDGELYIIEYGGGRLTKVSPD------GRLL-----GRYGHTGG--GEGEFS 570
Query: 136 HPKGVTMDDKGNLYVADTLNLAI 158
P G+T+D L +ADT N I
Sbjct: 571 TPWGLTIDSHMRLRIADTKNRRI 593
>gi|391330742|ref|XP_003739813.1| PREDICTED: NHL repeat-containing protein 2-like [Metaseiulus
occidentalis]
Length = 685
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 59/118 (50%), Gaps = 12/118 (10%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTI 169
+L+ G+ G G +DG RF+ P+GV D G ++VAD N AIR++ V+T+
Sbjct: 259 QLIVGT--GEAGFLDGAYGICRFSSPQGVCFYD-GGIFVADAGNHAIRRVDFSTKCVSTV 315
Query: 170 AG-GKSNV--AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC 224
G GK V G DG ++ S +DVV R S+L+I + +D+C
Sbjct: 316 VGTGKQGVDLVGNLDGNVQE--ISTPWDVVLYRH--SVLIIAMAGSHQIWAYATEDNC 369
>gi|307182571|gb|EFN69764.1| NHL repeat-containing protein 2 [Camponotus floridanus]
Length = 687
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 47/97 (48%), Gaps = 8/97 (8%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I SE G + + +N + I + + G QG+ DG ARFN P+G
Sbjct: 227 ILSSEHGTKLIISDSGNNRIVIATKHGEVEH--FIGGCNQGFK---DGSFKNARFNSPQG 281
Query: 140 VTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKS 174
V + + +YVAD N AIRKI + V+TIAG S
Sbjct: 282 VCVLNN-TIYVADNNNHAIRKINLTENNVSTIAGTGS 317
>gi|332027687|gb|EGI67755.1| NHL repeat-containing protein 2 [Acromyrmex echinatior]
Length = 686
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 63/120 (52%), Gaps = 12/120 (10%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSR-GRLVAGSFQGYTGHVDGKPNEARFNHPK 138
+++ ++ +L D N+ IV IT +++ R ++ G QG+ DG ARFN P+
Sbjct: 226 LQLEQETKLVISDSGNNRIV-IT---NEHGRVEHVIGGCNQGFK---DGDFKNARFNSPQ 278
Query: 139 GVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGP-SEDAKFSNDFDV 195
GV + + +YVAD N AIRKI A V+TIAG S + G D S+ +DV
Sbjct: 279 GVCVLNNI-IYVADNNNHAIRKIDLAKRIVSTIAGTGSQGCDRKGGKHGTDQALSSPWDV 337
>gi|303248818|ref|ZP_07335068.1| YD repeat protein [Desulfovibrio fructosovorans JJ]
gi|302489759|gb|EFL49690.1| YD repeat protein [Desulfovibrio fructosovorans JJ]
Length = 2115
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 29/63 (46%), Positives = 37/63 (58%), Gaps = 5/63 (7%)
Query: 112 RLVAGSFQG-YTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTT 168
R+VAG+ Y G DG P A HP G+ +D +GNLY+AD N IRK+ G +TT
Sbjct: 858 RVVAGNGSTHYNG--DGIPATAASLYHPNGLAVDAQGNLYIADQYNNRIRKVDQNGIITT 915
Query: 169 IAG 171
AG
Sbjct: 916 FAG 918
>gi|111219920|ref|YP_710714.1| serine/threonine protein kinase [Frankia alni ACN14a]
gi|111147452|emb|CAJ59102.1| Putative serine/threonine protein kinase [Frankia alni ACN14a]
Length = 725
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 14/121 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPN-EARF 134
P + V G L+ VD+ N + ++ + VAG+ +G+TG DG P A
Sbjct: 398 PDAMAVDTAGNLYIVDKSNQRVRRVDRDGVVTT----VAGNGIRGFTG--DGGPAIRAEL 451
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGF--RDGPSEDAKFSN 191
P G+ +D G++Y++D N +R++ AGV TT AG + V GF +GP + FS
Sbjct: 452 ADPAGIAVDAAGDIYISDQGNQRVRRVNPAGVITTFAG--TGVFGFSGENGP-KIGGFSG 508
Query: 192 D 192
D
Sbjct: 509 D 509
Score = 38.9 bits (89), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 47/88 (53%), Gaps = 6/88 (6%)
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR--DGPSEDAKFS 190
N P + +D GNLY+ D N +R++ D VTT+AG + + GF GP+ A+ +
Sbjct: 395 LNKPDAMAVDTAGNLYIVDKSNQRVRRVDRDGVVTTVAG--NGIRGFTGDGGPAIRAELA 452
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQIS 218
+ + V + + D+GN +R+++
Sbjct: 453 DPAGIA-VDAAGDIYISDQGNQRVRRVN 479
Score = 38.9 bits (89), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 12/106 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV-DGKPNEARFN 135
P + V G ++ D+ N+ + +I + G + + G G DG P
Sbjct: 577 PESLAVDVAGTMYVTDQGNNRVRRID------THGIITTAAGTGTMGFSGDGGPATRAAI 630
Query: 136 HP--KGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAG 178
H GVT+DD GN+Y+AD IR+I G +TTIAG + VAG
Sbjct: 631 HTVGAGVTVDDAGNIYLADPQVNRIRRIDTHGIITTIAG--TGVAG 674
>gi|407780651|ref|ZP_11127872.1| NHL repeat-containing protein [Oceanibaculum indicum P24]
gi|407208878|gb|EKE78785.1| NHL repeat-containing protein [Oceanibaculum indicum P24]
Length = 498
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 42/80 (52%), Gaps = 4/80 (5%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSNV 176
G G G + ARF P+G+ D G +YVADT N AIR+I VTT+AG G+
Sbjct: 219 GEAGFASGPADRARFRDPQGLAAD-TGAIYVADTGNHAIRRIDRQTGQVTTLAGNGRRGY 277
Query: 177 AGFRDGPSEDAKFSNDFDVV 196
P DA+ ++ +D+
Sbjct: 278 VLQSAVPFADAELASPWDLA 297
>gi|425469489|ref|ZP_18848421.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9701]
gi|389880728|emb|CCI38582.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9701]
Length = 340
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 43/90 (47%), Gaps = 13/90 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I V G+++ D N+ I S G + S G +G DG+ FN
Sbjct: 64 PYSIAVGSGGKIYVADTANNRIQVFD------SNGGFL--STFGSSGAGDGE-----FNS 110
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
P GV +D GN+YVADT N ++ +GV
Sbjct: 111 PVGVAVDGVGNIYVADTFNERVQIFNSSGV 140
>gi|291000698|ref|XP_002682916.1| dual specificity protein phosphatase [Naegleria gruberi]
gi|284096544|gb|EFC50172.1| dual specificity protein phosphatase [Naegleria gruberi]
Length = 1502
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 54/109 (49%), Gaps = 4/109 (3%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
R+ + G G G+ + F+ P G+ + N++V+DT N + +I + G T A
Sbjct: 420 RITNFAGNGNRGCSTGRVTDVMFDTPTGICVIG-DNVFVSDTGNHRVVRINN-GFVTNAA 477
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
G AGF DG +++ ++ D+ T L + D GN+ +RQ+ N
Sbjct: 478 GNPRTAGFMDGKVQNSILNSPTDLFSHENT--LYINDFGNSRIRQVFAN 524
>gi|290977087|ref|XP_002671270.1| predicted protein [Naegleria gruberi]
gi|284084837|gb|EFC38526.1| predicted protein [Naegleria gruberi]
Length = 1095
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 64/131 (48%), Gaps = 25/131 (19%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVA----GSFQGYTGHVDG-KPNEARFNHPKGV 140
G ++ D N + KI+ Y G + GSF GY G DG A+ N P G+
Sbjct: 399 GNIYIADTSNHRVRKIS-----YLDGTITTIAGTGSF-GYNG--DGILATSAQVNKPTGI 450
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR-DG-PSEDAKFS------- 190
D GN+Y+AD+ N IRKI G + TIAG + G+ DG P+ +AK +
Sbjct: 451 AFDSIGNIYIADSGNNRIRKILTNGTILTIAG--VGLGGYNGDGIPATNAKLNSPVSVTV 508
Query: 191 NDFDVVYVRPT 201
+ D+VY+ T
Sbjct: 509 DSNDLVYITDT 519
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 68/146 (46%), Gaps = 13/146 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQ-GYTGHVDG-KPNEA 132
P I ++ D N+ I K+ ++ G + VAG+ GY G DG + A
Sbjct: 558 PQSIAFDSSNNMYIADTYNNRIRKM------FTNGTIITVAGTGTLGYNG--DGIEATNA 609
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
+ +P+G+ +D L +AD+ N IR++ G T G ++ DG + A N+
Sbjct: 610 QLKYPQGIAIDGD-ELLIADSYNNRIRRVLSNGNITTITGTGDLGYSEDGTTASASKINN 668
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQIS 218
V +R ++VID N+ LR IS
Sbjct: 669 PSGVILRRNGEIIVIDSDNSRLRVIS 694
Score = 42.7 bits (99), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 51/168 (30%), Positives = 74/168 (44%), Gaps = 31/168 (18%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG---YTG-HVDGKPNEARFNHPKGVTM 142
+L+ D+ N I KI S G + + QG Y G +VD + + + PKG +
Sbjct: 345 DLYIADKGNHRIRKI-------SNGFITTIAGQGSPSYCGENVDSRL--SALSKPKGAAL 395
Query: 143 DDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVR 199
D GN+Y+ADT N +RKI D +TTIAG S G+ DG + N +
Sbjct: 396 DSLGNIYIADTSNHRVRKISYLDGTITTIAGTGS--FGYNGDGILATSAQVNKPTGIAFD 453
Query: 200 PTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGY 247
++ + D GN +R+I N IL + G L GY
Sbjct: 454 SIGNIYIADSGNNRIRKILTNG-------------TILTIAGVGLGGY 488
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 45/96 (46%), Gaps = 6/96 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V + ++ D N I I P + + V G G+ G PN + N+
Sbjct: 503 PVSVTVDSNDLVYITDTYNHRIRLILPSGNIST----VIGGSVGFNGDYL-LPNNTKLNY 557
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
P+ + D N+Y+ADT N IRK+ G + T+AG
Sbjct: 558 PQSIAFDSSNNMYIADTYNNRIRKMFTNGTIITVAG 593
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 10/88 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGS-FQGYTGHVDGKPNEAR 133
PY + ++ EL D N+ I ++ ++ G + VAGS QGY G + K A
Sbjct: 169 PYGLTLNSKEELLFTDVNNNRIRQV------FNNGTIITVAGSNSQGYNGD-NMKATSAT 221
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKI 161
GV +D K N+Y+ADT N IRK+
Sbjct: 222 LFLSFGVAVDTKDNIYIADTNNNRIRKV 249
>gi|257069643|ref|YP_003155898.1| NHL repeat protein [Brachybacterium faecium DSM 4810]
gi|256560461|gb|ACU86308.1| NHL repeat protein [Brachybacterium faecium DSM 4810]
Length = 697
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 46/98 (46%), Gaps = 6/98 (6%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
DG + D ++ V++ P S + G F GHVDG + AR HP VT
Sbjct: 381 DGRIVVADS-ETSAVRVLDPRSMQVTTLVGEGLFD--FGHVDGPASTARLQHPLAVTALP 437
Query: 145 KGNLYVADTLNLAIRKIGDA---GVTTIAGGKSNVAGF 179
G + V+DT N AIR + +A G GG + ++G
Sbjct: 438 DGRIAVSDTYNGAIRLVEEAPAEGERATTGGIAALSGV 475
>gi|430741385|ref|YP_007200514.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
gi|430013105|gb|AGA24819.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
Length = 361
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 70/147 (47%), Gaps = 10/147 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ + + DG + V+ + + +I + R + VAG+ + G +A FN
Sbjct: 49 PFGVGFAPDGTIIFVEMLGERVRRI----DKDGRVQTVAGTGKNGDSGDGGPATKAEFNG 104
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAG-GKSNVAGFRDGPSEDAKFSND 192
+ + G++ V+DT N +RKI DA +TTIAG GK +G GP+ A+F
Sbjct: 105 MHSLAVMKNGDILVSDTWNNRVRKI-DARTGVITTIAGTGKKGFSG-DGGPATAAEFGGI 162
Query: 193 FDVVYVRPTCSLLVIDRGNAALRQISL 219
+ + P +L + D N +R + L
Sbjct: 163 YCIALDEPGQALYLADLDNRRIRAVDL 189
>gi|326503030|dbj|BAJ99140.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 43/84 (51%), Gaps = 10/84 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY V + G ++A++ N+ + + +P G ++ G G G E + N
Sbjct: 378 PYSAIVDQSGYVYALERSNNRVTRWSPGAKT---GTVIIG------GRGAGSAAE-QLNS 427
Query: 137 PKGVTMDDKGNLYVADTLNLAIRK 160
P +T DD+GN+Y+ADT N ++K
Sbjct: 428 PTDITFDDEGNIYIADTYNHRVQK 451
>gi|154151143|ref|YP_001404761.1| NHL repeat-containing protein [Methanoregula boonei 6A8]
gi|153999695|gb|ABS56118.1| NHL repeat containing protein [Methanoregula boonei 6A8]
Length = 491
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 62/142 (43%), Gaps = 28/142 (19%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+P+ I V+ G + V + N+N +++ P Y AGS G +F+
Sbjct: 182 LPWVITVNTTGYAY-VSDWNNNRIQVFGPSGNYVSQWGSAGSGNG------------QFD 228
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 195
HP GV +D G +YVAD++N I+ +G G +GF D PS A
Sbjct: 229 HPYGVAIDSTGYVYVADSVNNRIQVFDLSGNYVTQWG----SGFND-PSGIA-------- 275
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
V T + V D GN +++
Sbjct: 276 --VNSTGYIYVADAGNNRIQEF 295
>gi|386845932|ref|YP_006263945.1| Tripartite motif-containing protein 71 [Actinoplanes sp. SE50/110]
gi|359833436|gb|AEV81877.1| Tripartite motif-containing protein 71 [Actinoplanes sp. SE50/110]
Length = 378
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 39/62 (62%), Gaps = 5/62 (8%)
Query: 113 LVAGSFQ-GYTGHVDGKPN-EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVT-TI 169
LVAG+ G++G DG P EA+ N P G + G++Y+ADTLN +R +G G+ T+
Sbjct: 42 LVAGAGAPGFSG--DGGPAAEAKLNRPLGAAVAADGSVYIADTLNRRVRVVGKDGIIRTV 99
Query: 170 AG 171
AG
Sbjct: 100 AG 101
>gi|37519956|ref|NP_923333.1| hypothetical protein gll0387 [Gloeobacter violaceus PCC 7421]
gi|35210948|dbj|BAC88328.1| gll0387 [Gloeobacter violaceus PCC 7421]
Length = 544
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 66/147 (44%), Gaps = 11/147 (7%)
Query: 82 VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVT 141
V DG L+ + I K P GR+ + G+ ++G + + F P G+T
Sbjct: 334 VVADGALYVAMAGSHQIWKCDP-----HSGRISTFAGNGHESTLNGTRDGSAFAQPSGIT 388
Query: 142 MDDKGNLYVADTLNLAIRKIGDA-GVTTIAGGKSNVAGF--RDGPSEDAKFSNDFDVVYV 198
+D L+VAD+ + ++R +G A +T + G ++ GF +DG E + V +
Sbjct: 389 TNDH-RLFVADSESSSVRTVGIAEDITALLCGSGDLFGFGDQDGLGEAVLLQHPLGVHWD 447
Query: 199 RPTCSLLVIDRGNAALRQISLNQDDCE 225
T L + D N +++I CE
Sbjct: 448 GGT--LWLTDTYNHKIKRIDPESGRCE 472
>gi|290986699|ref|XP_002676061.1| predicted protein [Naegleria gruberi]
gi|284089661|gb|EFC43317.1| predicted protein [Naegleria gruberi]
Length = 819
Score = 43.1 bits (100), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 49/106 (46%), Gaps = 6/106 (5%)
Query: 68 IEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG 127
+ NE + P I V++ GE+ D S+I +I L+ + + QGY G
Sbjct: 165 VLANETNIYPTGIYVTKLGEVLIAD---SSIHRIRKVLNNGTIITIAGNGKQGYNGEGLS 221
Query: 128 KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAG 171
+ A P V MD GN+Y AD N IRKI + V T+AG
Sbjct: 222 GLDSA-LKEPYSVDMDSDGNIYFADKGNSIIRKILASNGAVITVAG 266
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 67/147 (45%), Gaps = 12/147 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
P I + +G ++ E NS+ ++ S G + V GS G T DG + +
Sbjct: 452 PTSISIDSNGTIW-FSEANSHRIR-----KLLSNGTVITVVGSTTGST--ADGPALQTKL 503
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P G+ D G+L +AD N IRK+ G + TIAG + G D + ++ N
Sbjct: 504 TIPNGIGFDLDGSLLIADRYNHRIRKLLSNGTLITIAGTGVSGKGGDDILATNSSL-NQP 562
Query: 194 DVVYVRPTCSLLVIDRGNAALRQISLN 220
V P +L+ DR N +R++ +N
Sbjct: 563 SGVCTMPNGDILIADRANHQIRRLFVN 589
>gi|449690346|ref|XP_002169020.2| PREDICTED: uncharacterized protein LOC100205242, partial [Hydra
magnipapillata]
Length = 169
Score = 43.1 bits (100), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 34/74 (45%), Positives = 39/74 (52%), Gaps = 13/74 (17%)
Query: 126 DGKPNEARFNHPKGVTMD-DKGNLYVADTLNL-------AIRKIG--DAGVTTIAGGKSN 175
DG EARFN+P GV D +K +YVAD L IRKI VTTIAG +
Sbjct: 95 DGIGTEARFNYPWGVAYDHNKHAVYVADCGCLDSVHRTDKIRKIDIKTGTVTTIAG---S 151
Query: 176 VAGFRDGPSEDAKF 189
GF DG +DA F
Sbjct: 152 TQGFSDGFGKDAHF 165
>gi|290973047|ref|XP_002669261.1| predicted protein [Naegleria gruberi]
gi|284082806|gb|EFC36517.1| predicted protein [Naegleria gruberi]
Length = 618
Score = 43.1 bits (100), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 6/89 (6%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAK 188
A+ +P GV + +Y+AD N A+RKI + G + TIAG + AGF +GP+ +A+
Sbjct: 8 AQLFNPFGVFVSSNNEVYIADFCNHAVRKILENGNIVTIAGNGT--AGFSGDNGPATNAQ 65
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
F N V+V + + D N +R+I
Sbjct: 66 F-NYPSSVFVSSNNEVCIADLHNHRIRKI 93
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 46/98 (46%), Gaps = 9/98 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGKPNEARF 134
P+ + VS + E++ D N + KI G +V + G G +G A+F
Sbjct: 13 PFGVFVSSNNEVYIADFCNHAVRKILE------NGNIVTIAGNGTAGFSGDNGPATNAQF 66
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
N+P V + + +AD N IRKI + G + TIAG
Sbjct: 67 NYPSSVFVSSNNEVCIADLHNHRIRKILENGSIITIAG 104
>gi|345324001|ref|XP_001513492.2| PREDICTED: NHL repeat-containing protein 2-like [Ornithorhynchus
anatinus]
Length = 725
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 32/61 (52%), Gaps = 5/61 (8%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIR----KIGDAGVTTIAGGKSNVA 177
G +DG A+ HP GVT D+KGN LYVAD+ N I+ K + G SNV
Sbjct: 463 GDIDGAGINAKLQHPLGVTWDEKGNLLYVADSYNHKIKVVDPKTKNCATLAGTGEASNVI 522
Query: 178 G 178
G
Sbjct: 523 G 523
>gi|242017281|ref|XP_002429120.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212513984|gb|EEB16382.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 791
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 57/112 (50%), Gaps = 14/112 (12%)
Query: 67 VIEGNEI---GVVPYKIRVSEDGELFAVDEVNSNIVKI--TPPLSQYSRGRLVAGSFQGY 121
++E +EI + Y +++ G LFAV + ++ + I + +Y+ G G QG+
Sbjct: 196 IVESDEIISSSALKYPSKITRWGNLFAVSDSGNHRIIIFNVEGVVKYTIG----GKEQGF 251
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR--KIGDAGVTTIAG 171
DG EA F+ P G+ ++ LY+ADT N AIR + VTT+ G
Sbjct: 252 K---DGNFQEALFDTPHGLAFFNESILYIADTNNHAIRLVDLNKMIVTTVVG 300
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 73/155 (47%), Gaps = 25/155 (16%)
Query: 77 PYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGS------FQGYTGHVDGKP 129
P I SED + LF D +S+I +++ L++ S L GS F G VDGK
Sbjct: 518 PCGITYSEDLKVLFIADTESSSIRRLS--LTKGSVTHLAGGSRFPEDLFS--FGDVDGKG 573
Query: 130 NEARFNHPKGVTMDDK-GNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDA 187
R HP G+T + K +L+V D+ N I++I + G T+ +SN+ G
Sbjct: 574 YNVRLQHPLGLTWNKKRKSLFVCDSYNHKIKQINVETGTTSSLHKESNLFSEPGGCC--- 630
Query: 188 KFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
S+ D++Y V D N ++ I+L ++
Sbjct: 631 -MSSCDDIIY--------VCDTNNHKIKTINLCKE 656
>gi|354612773|ref|ZP_09030715.1| NHL repeat containing protein [Saccharomonospora paurometabolica
YIM 90007]
gi|353222911|gb|EHB87206.1| NHL repeat containing protein [Saccharomonospora paurometabolica
YIM 90007]
Length = 436
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 60/118 (50%), Gaps = 11/118 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPN-EARF 134
P+ + + G + D + ++T + ++ VAG+ Q G +G DG P +A
Sbjct: 125 PWGLALDTQGNFYVADADGCQVRRVTADGASFA---AVAGTGQNGDSG--DGGPAVDAEL 179
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGFRDGPSEDAKF 189
HP V +D GN+Y+ D + +RK+ D ++T+AG G+ +G GP+ DA+
Sbjct: 180 RHPTTVALDRHGNVYLTDPESRRVRKVSATDHTISTVAGTGREGDSG-DGGPATDAEL 236
>gi|302864652|ref|YP_003833289.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Micromonospora aurantiaca ATCC 27029]
gi|302567511|gb|ADL43713.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Micromonospora aurantiaca ATCC 27029]
Length = 611
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 41/86 (47%), Gaps = 8/86 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT--GHVDGKPNEARF 134
P + VS DG V + ++ ++ S G + QG GHVDG ++A
Sbjct: 350 PSGLSVSADGSRLWVADSETSAIR------WVSDGEMHTAVGQGLFDFGHVDGPADQALL 403
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRK 160
HP GV G++ +ADT N A+R+
Sbjct: 404 QHPLGVCALPDGSVLIADTYNGAVRR 429
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 50/105 (47%), Gaps = 4/105 (3%)
Query: 115 AGSFQGYTGHV--DGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAGVTTIAG 171
AG + G T DG + P G+++ G+ L+VAD+ AIR + D + T G
Sbjct: 326 AGMYAGTTVEALRDGPLPDVWMAQPSGLSVSADGSRLWVADSETSAIRWVSDGEMHTAVG 385
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
G DGP++ A + V + P S+L+ D N A+R+
Sbjct: 386 QGLFDFGHVDGPADQALLQHPLGVCAL-PDGSVLIADTYNGAVRR 429
>gi|223939922|ref|ZP_03631790.1| NHL repeat containing protein [bacterium Ellin514]
gi|223891413|gb|EEF57906.1| NHL repeat containing protein [bacterium Ellin514]
Length = 359
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 73/150 (48%), Gaps = 17/150 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKP-NEARF 134
P I DG L+ D N I K+T + + VAG+ + G++G DG P A+
Sbjct: 54 PTGIVRGSDGSLYICDTENHRIRKVTADGNIST----VAGTGEHGWSG--DGGPATAAKL 107
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFR--DGPSEDAKF 189
N P V +D NL+ + L+ +RK DA VTTIAG + AGF GP+ A+
Sbjct: 108 NEPYEVRLDQAENLFWVERLSHTVRKC-DAKTGIVTTIAG--NGTAGFSGDGGPATKAQM 164
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+ + + + +L + D N +R++ +
Sbjct: 165 NEPHSIGFDKAG-NLYICDVRNHRIRKVDM 193
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 9/88 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQ-GYTGHVDGKPNEAR 133
P + DG L+ + + KI +GR+ VAG+ + G+TG+ G A
Sbjct: 224 PRALDFGPDGNLWVALREGNAVYKI-----DMEKGRIFHVAGTGKNGFTGN-GGPAKAAT 277
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKI 161
F PKG+++ GN++VADT N AIR I
Sbjct: 278 FKGPKGLSVASNGNVFVADTENHAIRMI 305
Score = 39.3 bits (90), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
A N P G+ G+LY+ DT N IRK+ D ++T+AG + GP+ AK +
Sbjct: 49 AELNGPTGIVRGSDGSLYICDTENHRIRKVTADGNISTVAGTGEHGWSGDGGPATAAKLN 108
Query: 191 NDFDV 195
++V
Sbjct: 109 EPYEV 113
>gi|315500945|ref|YP_004079832.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ mal
allergen [Micromonospora sp. L5]
gi|315407564|gb|ADU05681.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Micromonospora sp. L5]
Length = 610
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 41/86 (47%), Gaps = 8/86 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT--GHVDGKPNEARF 134
P + VS DG V + ++ ++ S G + QG GHVDG ++A
Sbjct: 350 PSGLSVSADGSRLWVADSETSAIR------WVSDGEMHTAVGQGLFDFGHVDGPADQALL 403
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRK 160
HP GV G++ +ADT N A+R+
Sbjct: 404 QHPLGVCALPDGSVLIADTYNGAVRR 429
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 50/105 (47%), Gaps = 4/105 (3%)
Query: 115 AGSFQGYTGHV--DGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAGVTTIAG 171
AG + G T DG + P G+++ G+ L+VAD+ AIR + D + T G
Sbjct: 326 AGMYAGTTVEALRDGPLPDVWMAQPSGLSVSADGSRLWVADSETSAIRWVSDGEMHTAVG 385
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
G DGP++ A + V + P S+L+ D N A+R+
Sbjct: 386 QGLFDFGHVDGPADQALLQHPLGVCAL-PDGSVLIADTYNGAVRR 429
>gi|290973961|ref|XP_002669715.1| predicted protein [Naegleria gruberi]
gi|284083266|gb|EFC36971.1| predicted protein [Naegleria gruberi]
Length = 710
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 11/144 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFN 135
P + VS + E++ D N I K+ Q +AG+ +G++G +G A+ N
Sbjct: 111 PRNVYVSSNNEVYIADFCNQRIRKVL----QNGNIITIAGNGTKGFSGD-NGPATNAQLN 165
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDF 193
P GV + + +Y+AD N IRKI G + TIAG GK +G +G + +A+ N
Sbjct: 166 GPAGVFVSNN-EVYIADYSNHVIRKISQNGTIVTIAGNGKPGFSG-DNGLATNAQLYNP- 222
Query: 194 DVVYVRPTCSLLVIDRGNAALRQI 217
+V + + D N +R+I
Sbjct: 223 SGTFVSSNNEVYISDCFNHVIRKI 246
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 3/108 (2%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG 172
+AG+ + G +G A+ N+P+ V + +Y+AD N IRK+ G + TIAG
Sbjct: 88 IAGNGKPGFGGDNGLATNAQLNYPRNVYVSSNNEVYIADFCNQRIRKVLQNGNIITIAGN 147
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ +GP+ +A+ N V+V + + D N +R+IS N
Sbjct: 148 GTKGFSGDNGPATNAQL-NGPAGVFVSNN-EVYIADYSNHVIRKISQN 193
>gi|290978635|ref|XP_002672041.1| predicted protein [Naegleria gruberi]
gi|284085614|gb|EFC39297.1| predicted protein [Naegleria gruberi]
Length = 706
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 66/136 (48%), Gaps = 12/136 (8%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMD-DK 145
+++ +D VN+ I +I+ S G+ G G A+ N P G+ +
Sbjct: 165 DIYIIDFVNNKIRRISNSSGIIST--FAGNGTAGFCGE-GGFATNAQLNGPSGLDFNPTT 221
Query: 146 GNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 202
G++Y+AD+ N +RK+ +TT+AG GK AG+ DG DA+ + +DV +
Sbjct: 222 GDVYIADSNNHRVRKVNCKSGIITTLAGTGK---AGYSDGI--DAQLNYPYDVSFCTRGQ 276
Query: 203 SLLVIDRGNAALRQIS 218
+ V DR N + +S
Sbjct: 277 IIYVTDRSNNRICTVS 292
>gi|398342355|ref|ZP_10527058.1| hypothetical protein LinasL1_04604 [Leptospira inadai serovar Lyme
str. 10]
Length = 414
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 41/94 (43%), Gaps = 9/94 (9%)
Query: 69 EGNEIGVV-------PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY 121
EGN G V P + V +G L+AVD S ++ P + R+
Sbjct: 254 EGNSCGAVNATSVCYPTDVVVDSNGGLYAVDYARSRVLYF--PSGSTTATRVYGQPDLTS 311
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLN 155
+G +G N PKG+ MD +G LYVAD N
Sbjct: 312 SGVNNGGRTANSLNSPKGIAMDTQGGLYVADKFN 345
>gi|375103275|ref|ZP_09749538.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
cyanea NA-134]
gi|374664007|gb|EHR63885.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
cyanea NA-134]
Length = 638
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 66/152 (43%), Gaps = 23/152 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP----LSQYSRGRLVAGSFQGYTGHVDGKPNEA 132
P K V+ +G + D N ++ + L ++ GR G VDG + A
Sbjct: 194 PSKAVVTAEGRVLVADTANHSVAEFASDAETLLRRFGDGR---------RGAVDGAFDVA 244
Query: 133 RFNHPKGVTM------DDKG-NLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGFRDG 182
RF P G+T+ D+ G +L VADT N +R + V T+AG G+ G G
Sbjct: 245 RFAEPSGLTLLPTHVADEVGYHLLVADTANHLLRGVDLRTGAVRTVAGTGRQWRDGDDSG 304
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
P+ ++ +DV + ++V GN L
Sbjct: 305 PALGVDLTSPWDVRWWDAANGVVVAMAGNHTL 336
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 24/38 (63%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
GH DG + A F HP GVT+ G + VADT N A+R+
Sbjct: 420 GHRDGAADSALFQHPLGVTVLPDGAIGVADTYNGAVRR 457
>gi|428225769|ref|YP_007109866.1| NHL repeat containing protein [Geitlerinema sp. PCC 7407]
gi|427985670|gb|AFY66814.1| NHL repeat containing protein [Geitlerinema sp. PCC 7407]
Length = 498
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 10/98 (10%)
Query: 82 VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVT 141
++++ LF D + +V T G++ A G G+ DG A+F P+G+T
Sbjct: 175 IAQENHLFVADTGHHRLVIAT------LEGKVRATVGSGTPGYQDGPWETAQFRAPQGLT 228
Query: 142 MDDKGN-LYVADTLNLAIRKIGDAG--VTTIAG-GKSN 175
D G+ LYVADT N IR + VTT+AG G+ N
Sbjct: 229 YDPAGDRLYVADTGNHLIRCVHGRSRTVTTLAGTGQQN 266
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 36/75 (48%), Gaps = 6/75 (8%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G L+ D N I ++ + G V G GH DG EA F+ P G+ + +
Sbjct: 404 GNLWIADTYNHKIKRLN-----LAEGHCVNIVGSGLPGHQDGWGPEASFSEPSGLASEGQ 458
Query: 146 GNLYVADTLNLAIRK 160
LY+ADT N AIR+
Sbjct: 459 -TLYIADTNNHAIRR 472
Score = 39.3 bits (90), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 63/136 (46%), Gaps = 11/136 (8%)
Query: 85 DGE-LFAVDEVNSNIVKIT---PPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
DG+ L+ D +S++ I+ PP Q G G F G +G+ + AR H G+
Sbjct: 344 DGDTLYVADSESSSVRAISLADPPSVQTLCGS--GGLFD--FGDREGRGDRARLQHCLGL 399
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 200
GNL++ADT N I+++ A + S + G +DG +A FS +
Sbjct: 400 A-HGPGNLWIADTYNHKIKRLNLAEGHCVNIVGSGLPGHQDGWGPEASFSEPSGLASEGQ 458
Query: 201 TCSLLVIDRGNAALRQ 216
T L + D N A+R+
Sbjct: 459 T--LYIADTNNHAIRR 472
>gi|408792280|ref|ZP_11203890.1| concanavalin A-like lectin/glucanases family protein [Leptospira
meyeri serovar Hardjo str. Went 5]
gi|408463690|gb|EKJ87415.1| concanavalin A-like lectin/glucanases family protein [Leptospira
meyeri serovar Hardjo str. Went 5]
Length = 1234
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 33/50 (66%), Gaps = 5/50 (10%)
Query: 128 KPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG-DAGVTTI-AGGKS 174
P ARFN PK +T D GN +YVADT+N IRKI +G TTI AGG +
Sbjct: 262 SPTAARFNGPKMITTD--GNFIYVADTMNAVIRKIDKSSGTTTILAGGNT 309
>gi|309813024|ref|ZP_07706752.1| putative septum site-determining protein MinC [Dermacoccus sp.
Ellin185]
gi|308433096|gb|EFP57000.1| putative septum site-determining protein MinC [Dermacoccus sp.
Ellin185]
Length = 651
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGG 172
GHVDG + A HP GVT G++ VADT N AIR+ VTT+A G
Sbjct: 428 GHVDGAADAALLQHPLGVTELPDGSVAVADTYNGAIRRFDPSSGEVTTLATG 479
>gi|381163732|ref|ZP_09872962.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
azurea NA-128]
gi|379255637|gb|EHY89563.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
azurea NA-128]
Length = 633
Score = 43.1 bits (100), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 63/149 (42%), Gaps = 17/149 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P K V+ +G + D N +I + R F G+ G VDG + ARF
Sbjct: 194 PSKAVVTAEGRVLVADTANHSIAEFASDAETLLR------RFGDGHRGAVDGAFDVARFA 247
Query: 136 HPKGVTM------DDKG-NLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGFRDGPSE 185
P G+T+ D G +L VADT N +R + V T+AG G G GP+
Sbjct: 248 EPSGLTLLPTHVADQVGYHLIVADTANHLLRGVDLRTGSVRTVAGTGAQWRDGDDSGPAT 307
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
+ ++ +D + ++V GN L
Sbjct: 308 EVDLTSPWDARWWDAAGGVVVAMAGNHTL 336
>gi|195030130|ref|XP_001987921.1| GH10881 [Drosophila grimshawi]
gi|193903921|gb|EDW02788.1| GH10881 [Drosophila grimshawi]
Length = 728
Score = 43.1 bits (100), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
G VDG ARFN+P+GV D+ L VADT N A+R+I + V T+AG
Sbjct: 266 GLVDGDLLTARFNNPQGVAFLDEHTLIVADTDNHALRQISLKNGIVETLAG 316
>gi|424792718|ref|ZP_18218917.1| hypothetical protein XTG29_01651 [Xanthomonas translucens pv.
graminis ART-Xtg29]
gi|422796954|gb|EKU25368.1| hypothetical protein XTG29_01651 [Xanthomonas translucens pv.
graminis ART-Xtg29]
Length = 473
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 8/85 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +++D L+ D + I++ T + GR++ G +DG EA F+
Sbjct: 170 PVGLALADD-RLYIADSGHHRILECT------TGGRVLRQFGLGTADFIDGGIGEAAFHR 222
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P+G+ ++ +G LYVADT N A+R+I
Sbjct: 223 PRGLALE-RGVLYVADTGNHALRRI 246
>gi|336313754|ref|ZP_08568676.1| thiol-disulfide isomerase-like thioredoxin [Rheinheimera sp. A13L]
gi|335881693|gb|EGM79570.1| thiol-disulfide isomerase-like thioredoxin [Rheinheimera sp. A13L]
Length = 498
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 57/112 (50%), Gaps = 8/112 (7%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKI--GDAGVTTI 169
L AG+ G +DGK +A FN P G+ + +GN L+VAD A+R+I V T+
Sbjct: 337 LFAGT--GQEALLDGKRRDAAFNQPSGLAL--RGNKLWVADAEASAVRQIDLSSGKVDTL 392
Query: 170 AGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
G G +DG + A ++ DVV + +L V D N +R + L++
Sbjct: 393 VGQGLFEFGLKDGGFKRALLQHNKDVVALDKN-TLAVADTYNHKIRLLDLDK 443
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 3/64 (4%)
Query: 110 RGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VT 167
+G+LV G DG + + F+ P+G+ + DK LYVADT N AIR+I + V
Sbjct: 220 QGKLVKRLGSGIAELKDGHSDSSAFSSPQGLVLTDKA-LYVADTGNHAIRRIDLSTFQVN 278
Query: 168 TIAG 171
TIAG
Sbjct: 279 TIAG 282
>gi|418461883|ref|ZP_13032943.1| Peroxiredoxin [Saccharomonospora azurea SZMC 14600]
gi|359738010|gb|EHK86922.1| Peroxiredoxin [Saccharomonospora azurea SZMC 14600]
Length = 633
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 63/149 (42%), Gaps = 17/149 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P K V+ +G + D N +I + R F G+ G VDG + ARF
Sbjct: 194 PSKAVVTAEGRVLVADTANHSIAEFASDAETLLR------RFGDGHRGAVDGAFDVARFA 247
Query: 136 HPKGVTM------DDKG-NLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGFRDGPSE 185
P G+T+ D G +L VADT N +R + V T+AG G G GP+
Sbjct: 248 EPSGLTLLPTHVADQVGYHLIVADTANHLLRGVDLRTGSVRTVAGTGAQWRDGDDSGPAT 307
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
+ ++ +D + ++V GN L
Sbjct: 308 EVDLTSPWDARWWDAAGGVVVAMAGNHTL 336
>gi|443708985|gb|ELU03866.1| hypothetical protein CAPTEDRAFT_52631, partial [Capitella teleta]
Length = 507
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 26/39 (66%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
G VDGK EA+ HP VT+ + G L+VAD+ N I+K+
Sbjct: 393 GDVDGKGIEAKLQHPLAVTLAEDGQLFVADSYNHKIKKV 431
>gi|449524308|ref|XP_004169165.1| PREDICTED: NHL repeat-containing protein 2-like, partial [Cucumis
sativus]
Length = 892
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 4/68 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR 180
G DG +E HP GV G +YVAD+ N ++ + VTTIAG + AGF+
Sbjct: 827 GDHDGVGSEVLLQHPLGVFCSKDGQIYVADSYNHKVKMLDPVSKKVTTIAG--TGKAGFK 884
Query: 181 DGPSEDAK 188
DG + +A+
Sbjct: 885 DGTALEAQ 892
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 4/82 (4%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNV 176
G G DG ++A FN P+G+ + K N LYVADT N A+R++ V T+AG S
Sbjct: 644 GEDGLRDGNFDDATFNRPQGLAYNAKKNLLYVADTENHALREVDFVKERVRTLAGDGSKG 703
Query: 177 AGFRDGPSEDAKFSND-FDVVY 197
+ ++ G ++ N +DV +
Sbjct: 704 SDYQGGKEGTSQLLNSPWDVCF 725
>gi|348690525|gb|EGZ30339.1| hypothetical protein PHYSODRAFT_295167 [Phytophthora sojae]
Length = 383
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
++AGS +G VDG ARF HP + + G+LYVAD N IR++
Sbjct: 288 VLAGSTS--SGLVDGTGTSARFYHPDALAIGSDGDLYVADKANYCIRRL 334
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 60/140 (42%), Gaps = 24/140 (17%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I V G ++ D N I KIT S+G V+ +F ++P G
Sbjct: 221 IAVDSIGNVYVTDYYNGKITKIT------SKG--VSSTFS------------TSLSNPSG 260
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 199
+ D N+YV D N + K G T+ G ++ +G DG A+F + D + +
Sbjct: 261 IVTDSSDNIYVTD--NNRVMKFTSGGAMTVLAGSTS-SGLVDGTGTSARFYHP-DALAIG 316
Query: 200 PTCSLLVIDRGNAALRQISL 219
L V D+ N +R+++L
Sbjct: 317 SDGDLYVADKANYCIRRLNL 336
>gi|384248465|gb|EIE21949.1| hypothetical protein COCSUDRAFT_16964 [Coccomyxa subellipsoidea
C-169]
Length = 1021
Score = 42.7 bits (99), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 38/83 (45%), Gaps = 5/83 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +G++ D N + + P R VAGS G G DG AR +
Sbjct: 765 PLAVLALSNGDVIVADSYNHRLKLLDPATDSI---RTVAGS--GAAGLTDGTGGNARVSE 819
Query: 137 PKGVTMDDKGNLYVADTLNLAIR 159
P G+ +G +++ADT N AIR
Sbjct: 820 PGGLCAGPRGTVFIADTNNSAIR 842
>gi|29348891|ref|NP_812394.1| hypothetical protein BT_3482 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383125072|ref|ZP_09945731.1| hypothetical protein BSIG_5391 [Bacteroides sp. 1_1_6]
gi|29340797|gb|AAO78588.1| conserved hypothetical protein containing NHL repeat [Bacteroides
thetaiotaomicron VPI-5482]
gi|382983525|gb|EES65521.2| hypothetical protein BSIG_5391 [Bacteroides sp. 1_1_6]
Length = 513
Score = 42.7 bits (99), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 55/115 (47%), Gaps = 12/115 (10%)
Query: 51 ADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSR 110
A GN+ +F++ Y V N G + D F + + +++ ++I PL Q S
Sbjct: 400 AVGNLAKFDSAYQGVFVKNDNYKGEI--------DEYDFYIADKHNHAIRILTPLGQVST 451
Query: 111 --GRLVAGSFQGYTGHVDGK-PNEARFNHPKGVTMDDK-GNLYVADTLNLAIRKI 161
GR A G+V+G EARF PK + D+K G +YV D N IRKI
Sbjct: 452 FAGRGSASLNSNPWGYVNGDLRKEARFERPKAIAYDEKTGIVYVGDAYNHRIRKI 506
>gi|124003658|ref|ZP_01688507.1| NHL repeat [Microscilla marina ATCC 23134]
gi|123991227|gb|EAY30679.1| NHL repeat [Microscilla marina ATCC 23134]
Length = 508
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 50/98 (51%), Gaps = 11/98 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P KI + GEL+ D N+ I+KI G+++ G G DG ++A F
Sbjct: 210 PSKIIQGDKGELYISDSGNNRILKID------KNGKVLLQIGSGTQGLKDGDFDKATFYE 263
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
P G+ + KG+ LYVADT N IR++ V TIAG
Sbjct: 264 PHGLAL--KGDFLYVADTKNNVIRRVDLKRKKVKTIAG 299
>gi|348681654|gb|EGZ21470.1| hypothetical protein PHYSODRAFT_329412 [Phytophthora sojae]
Length = 251
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 6/79 (7%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
+VAG QG+ ++ ARF+ P +T+ GNLYV D+ N IRK+ VTT A
Sbjct: 144 VVAGRVQGF---LNALGELARFSTPWALTIGSDGNLYVVDSDNNCIRKVDLTTREVTTYA 200
Query: 171 GGKSNVAGFRDGPSEDAKF 189
G +G DG + DA F
Sbjct: 201 -GICLTSGTTDGLATDATF 218
>gi|302821463|ref|XP_002992394.1| hypothetical protein SELMODRAFT_430636 [Selaginella moellendorffii]
gi|300139810|gb|EFJ06544.1| hypothetical protein SELMODRAFT_430636 [Selaginella moellendorffii]
Length = 225
Score = 42.7 bits (99), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 30/65 (46%), Positives = 37/65 (56%), Gaps = 11/65 (16%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGG 172
+AGS G +G+VDG FN P+ + D G ++V NLAIRKI D VTTIAGG
Sbjct: 29 LAGSLVGQSGYVDGP----LFNRPQ---ICDNGAVFVG---NLAIRKISKDGEVTTIAGG 78
Query: 173 KSNVA 177
A
Sbjct: 79 SQKWA 83
>gi|290979154|ref|XP_002672299.1| predicted protein [Naegleria gruberi]
gi|284085875|gb|EFC39555.1| predicted protein [Naegleria gruberi]
Length = 430
Score = 42.7 bits (99), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARF 134
+PY + + +GELF D N+ I K++ + + +AGS GY+G G A+
Sbjct: 54 IPYGVALGMNGELFITDTNNNVIRKVSSTIGIITT---IAGSVTGGYSGD-GGLAIAAKL 109
Query: 135 NHPKGVTMDD-KGNLYVADTLNLAIRKI 161
+ P G+ +D G +YV D N +RK+
Sbjct: 110 SSPYGIVVDQLNGTIYVCDFGNSRVRKL 137
Score = 42.0 bits (97), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 54/107 (50%), Gaps = 4/107 (3%)
Query: 114 VAGSFQ-GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD--AGVTTIA 170
+AG+ Q Y+G G A F P GV + G L++ DT N IRK+ +TTIA
Sbjct: 32 IAGNNQIAYSGD-GGLAIAASFRIPYGVALGMNGELFITDTNNNVIRKVSSTIGIITTIA 90
Query: 171 GGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
G + G + AK S+ + +V + ++ V D GN+ +R++
Sbjct: 91 GSVTGGYSGDGGLAIAAKLSSPYGIVVDQLNGTIYVCDFGNSRVRKL 137
>gi|290995663|ref|XP_002680402.1| predicted protein [Naegleria gruberi]
gi|284094023|gb|EFC47658.1| predicted protein [Naegleria gruberi]
Length = 974
Score = 42.7 bits (99), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 53/109 (48%), Gaps = 8/109 (7%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG 172
+AG+ G G D E P + M+ KG L ++D IRKI G + T+AG
Sbjct: 613 IAGTGYGDYGADDILATETYLARPTDIAMNSKGELIISDAKR--IRKIDLNGIIVTLAG- 669
Query: 173 KSNVAGFR--DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
SNVAGF GP+ DA F +Y+ + V D N +R+ISL
Sbjct: 670 -SNVAGFSGDGGPAADATFGT-LGGIYLDSNDDIYVSDPDNHRIRKISL 716
>gi|443708903|gb|ELU03822.1| hypothetical protein CAPTEDRAFT_223323 [Capitella teleta]
Length = 886
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 48/99 (48%), Gaps = 15/99 (15%)
Query: 70 GNEIGVV--PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG 127
GN+ G + PY I V D ++ D N+ I T S+G V SF G G DG
Sbjct: 759 GNQPGKLENPYYITVGHDRNIYVSDTHNNRIQVFT------SQGGFV-NSFGG-PGDQDG 810
Query: 128 KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
K F HPKG+T+D G + VAD+ N I+ + G
Sbjct: 811 K-----FKHPKGITVDHNGFMVVADSGNHRIQVLRSDGT 844
>gi|170066897|ref|XP_001868267.1| NHL repeat containing 2 [Culex quinquefasciatus]
gi|167863075|gb|EDS26458.1| NHL repeat containing 2 [Culex quinquefasciatus]
Length = 734
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 3/81 (3%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAG 178
+G VDG +ARFN P+G+ + ++VAD N AIR+I VTT+AG G+
Sbjct: 273 SGFVDGDFRKARFNAPQGLAFQNDDVIFVADNENHAIRRIDLKSKQVTTVAGSGQQGCDR 332
Query: 179 FRDGPSEDAKFSNDFDVVYVR 199
D S+ +DV R
Sbjct: 333 IGGKIGRDQIISSPWDVAVYR 353
>gi|398342352|ref|ZP_10527055.1| hypothetical protein LinasL1_04589 [Leptospira inadai serovar Lyme
str. 10]
Length = 438
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 12/84 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE----- 131
P + V +G L+A D NS ++ + G L A G G PN
Sbjct: 293 PTDVVVDSNGGLYATDFANSRVL-------YFPSGSLTATRVYGQPDFTSGGPNNGGITA 345
Query: 132 ARFNHPKGVTMDDKGNLYVADTLN 155
+ N PKG+ +D +G LYV+D N
Sbjct: 346 SSLNKPKGIALDSQGGLYVSDKFN 369
>gi|443288042|ref|ZP_21027136.1| Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Micromonospora lupini str. Lupac 08]
gi|385881808|emb|CCH22229.1| Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Micromonospora lupini str. Lupac 08]
Length = 612
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 66/140 (47%), Gaps = 14/140 (10%)
Query: 115 AGSFQGYTGHV--DGKPNEARFNHPKGVTMDDKG-NLYVADTLNLAIRKIGDAGVTTIAG 171
AG + G T DG EA P G+++ G L++AD+ A+R + + + T G
Sbjct: 328 AGMYAGTTVEALRDGPLAEAWMAQPSGLSVSADGVRLWIADSETSAVRYVENGVLGTAVG 387
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSI 231
G DGP+E A + V + P S+L+ D N A+R+ D Q +++
Sbjct: 388 QGLFDFGHVDGPAESALLQHPLGVCAL-PDGSVLIADTYNGAVRRF----DPATGQVSTV 442
Query: 232 S-----PTDILMV-VGAVLV 245
+ P+D+++ G VLV
Sbjct: 443 ADGLAEPSDLVLTPAGEVLV 462
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 45/100 (45%), Gaps = 10/100 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT--GHVDGKPNEARF 134
P + VS DG + + ++ V+ G L QG GHVDG A
Sbjct: 352 PSGLSVSADGVRLWIADSETSAVRYV------ENGVLGTAVGQGLFDFGHVDGPAESALL 405
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGG 172
HP GV G++ +ADT N A+R+ A V+T+A G
Sbjct: 406 QHPLGVCALPDGSVLIADTYNGAVRRFDPATGQVSTVADG 445
>gi|269957976|ref|YP_003327765.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Xylanimonas cellulosilytica DSM 15894]
gi|269306657|gb|ACZ32207.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Xylanimonas cellulosilytica DSM 15894]
Length = 688
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 52/104 (50%), Gaps = 14/104 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K G L D + ++ ++ P R R+ GS G G +DG P+EARF
Sbjct: 178 PAKAITLPGGTLLVADAGHHSLAELAPDGETLVR-RI--GS--GQRGLLDGGPDEARFAE 232
Query: 137 PKGVTM---DDKGNL----YVADTLNLAIR--KIGDAGVTTIAG 171
P G+ + D +G L VADT+N A+R ++ D V T+AG
Sbjct: 233 PNGLCLVPEDLRGRLGYDVVVADTVNHALRGVRLADGLVVTLAG 276
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 50/208 (24%), Positives = 76/208 (36%), Gaps = 46/208 (22%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLS----QYSRGRLVAGSFQGYTGHVDGKPNEA 132
P+ + S +F V ++ + + P + Y RL+AG+ G VDG +
Sbjct: 330 PWDVAWSGVAGVFLVAMAGTHTLWLFDPTAPRPGAYGTVRLLAGTMN--EGLVDGVGDTC 387
Query: 133 RFNHPKGVTMDDKG---------------------NLYVADTLNLAIRKIG--------- 162
F P G+ +G +VAD A+R +
Sbjct: 388 WFAQPSGLARSTRGAYLMGGDDVFSAADAEQLGQETFWVADAETSALRSVVVSRQPPAVV 447
Query: 163 DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQD 222
A V T G G RDG DA+ + VV + P S++V D N ALR++ D
Sbjct: 448 AASVLTWVGAGLFDFGHRDGDLVDARLQHPLGVVAL-PDGSVVVADTYNGALRRVETWWD 506
Query: 223 DCEYQYNS---------ISPTDILMVVG 241
D Q P+D+L+ G
Sbjct: 507 DVAQQETGRVTTLATGLAEPSDVLVETG 534
>gi|32472743|ref|NP_865737.1| hypothetical protein RB3815 [Rhodopirellula baltica SH 1]
gi|32443980|emb|CAD73422.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length = 351
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 46/103 (44%), Gaps = 5/103 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-----GDAGVTTIAGGKSNVA 177
G+VDG ARFN + +D + LY++D N ++R++ G+ V T AG
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTTDGEWMVDTYAGQGKEGP 165
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ DA F V LL+ D GN +R I L+
Sbjct: 166 ATDNVDRRDATFHEPISVTLDAEGNRLLIADIGNQVVRSIDLS 208
>gi|290976675|ref|XP_002671065.1| predicted protein [Naegleria gruberi]
gi|284084630|gb|EFC38321.1| predicted protein [Naegleria gruberi]
Length = 343
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 63/122 (51%), Gaps = 22/122 (18%)
Query: 114 VAGSFQ-GYTGHVDGKPNEARFNHPKGVTMDDK-GNLYVADTLNLAIRKIGDAGV-TTIA 170
+AG+ + G+ G + K +A N P V ++ + GN+Y +D N IRKI + G+ +TI
Sbjct: 48 IAGTGEAGFNGD-NIKATQATINKPVCVIVNHRNGNVYFSDLGNNRIRKIDNNGIISTIV 106
Query: 171 G-------GKSNVA--GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
G G N+A F + P SND + +Y+ DR N A+R++SLN
Sbjct: 107 GCGDYGLVGDGNLAINSFLNSP-RGICLSNDGNYLYIA--------DRDNHAIRKVSLND 157
Query: 222 DD 223
DD
Sbjct: 158 DD 159
Score = 38.9 bits (89), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 44/97 (45%), Gaps = 25/97 (25%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEAR 133
PY I +S+D EL+ + N I KI G+L VAG +DGK
Sbjct: 189 CPYGIAISKDDELYFSEFSNHTIRKIDK------NGKLWKVAG--------IDGKNGNNE 234
Query: 134 FNH---------PKGVTMDDKGNLYVADTLNLAIRKI 161
++H P + D+ NLY+AD N +IRKI
Sbjct: 235 YDHHAFNNLLNYPAYICFDNNNNLYIADRDNNSIRKI 271
>gi|383763992|ref|YP_005442974.1| hypothetical protein CLDAP_30370 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381384260|dbj|BAM01077.1| hypothetical protein CLDAP_30370 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 1106
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 69/155 (44%), Gaps = 26/155 (16%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSFQGYTGHVDGKPNEARF 134
P+ + V E G++F D N I S GR + G F TG G P A F
Sbjct: 881 PWGVAVDEAGQIFVADTWNGRIQVFD------SEGRFLRKWGVFAT-TGGELGDP-YALF 932
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF 193
P+G+ +D GNL VADT N I K G + GG +AG + P+
Sbjct: 933 G-PRGLAIDLDGNLLVADTGNKRILKFTPTGELIKQVGGGGVIAGRFEEPTA-------- 983
Query: 194 DVVYVRPT-CSLLVIDRGNAALRQISLNQDDCEYQ 227
V V PT S+L+ D N ++++S D EY+
Sbjct: 984 --VAVDPTDGSVLIADAWNGRIQRLS---PDLEYR 1013
>gi|320103300|ref|YP_004178891.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
gi|319750582|gb|ADV62342.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
Length = 698
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 56/117 (47%), Gaps = 7/117 (5%)
Query: 108 YSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVT--MDDKG-NLYVADTLNLAIRKIG-- 162
+S+G + S G VDG N+A + P G+ +DD+G +YVAD+ IR++
Sbjct: 381 FSKGEIGLWSGTGREDIVDGPANQANYAQPSGIDFGVDDQGPAVYVADSEVSGIRRVDLK 440
Query: 163 DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+T+ G V G DGP A+ + V + R L V D N ++Q+ L
Sbjct: 441 TGTASTLIGRGLFVFGDVDGPLAQARLQHCLGVAFDRGV--LYVADTYNNKVKQLDL 495
Score = 38.9 bits (89), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 61/123 (49%), Gaps = 16/123 (13%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRL-VAGSFQGYTGHVDGKPNEARFNHPKGVT 141
++ G LF D ++ IV ++ RL V GS G TG DG A FN +G+
Sbjct: 247 AQGGRLFISDTGHNRIV-----VTNLEGKRLDVIGS--GATGFADGSFETASFNRQQGIR 299
Query: 142 MDDKGNLYVADTLNLAIR--KIGDAGVTTIAG-GKSNV---AGFRDGPSEDAKFSNDFDV 195
+ + LYVADT N AIR + V T+AG GK G R GP++ ++ +D+
Sbjct: 300 LVGE-TLYVADTENHAIRAANLTTRTVATVAGTGKQTYHRGWGVR-GPAKTTGLNSPWDL 357
Query: 196 VYV 198
V +
Sbjct: 358 VKI 360
>gi|347757619|ref|YP_004865181.1| NHL repeat family protein [Micavibrio aeruginosavorus ARL-13]
gi|347590137|gb|AEP09179.1| NHL repeat family protein [Micavibrio aeruginosavorus ARL-13]
Length = 495
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
+ + R+ + G+ V + N + + L + ++ GS G GH DG A+FN
Sbjct: 188 LSFPARIEDAGDFIFVADSGHNRILV---LDRTGAVKMTIGS--GELGHQDGSFVMAKFN 242
Query: 136 HPKGVTMDDKGNLYVADTLN--LAIRKIGDAGVTTIAG-GKSNVA 177
HP+G+ + + NLY+ADT N L + + V T++G GK A
Sbjct: 243 HPRGMVLIGR-NLYIADTDNHMLRVANLDSGSVMTLSGNGKRTFA 286
>gi|309790072|ref|ZP_07684646.1| NHL repeat-containing protein [Oscillochloris trichoides DG-6]
gi|308227927|gb|EFO81581.1| NHL repeat-containing protein [Oscillochloris trichoides DG6]
Length = 1124
Score = 42.4 bits (98), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 41/86 (47%), Gaps = 14/86 (16%)
Query: 71 NEIGVV-PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP 129
N +G+ P + + +G L+ D N IV G+++ G+ G+P
Sbjct: 938 NPLGLFGPRGLALDAEGNLYIADTGNKRIVVTD-----------TEGNYRYQWGYAGGEP 986
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLN 155
FN P GV +D +GN+YVADT N
Sbjct: 987 GA--FNEPVGVAVDAQGNVYVADTWN 1010
>gi|402571883|ref|YP_006621226.1| hypothetical protein Desmer_1348 [Desulfosporosinus meridiei DSM
13257]
gi|402253080|gb|AFQ43355.1| conserved repeat protein [Desulfosporosinus meridiei DSM 13257]
Length = 1977
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 24/94 (25%), Positives = 44/94 (46%), Gaps = 19/94 (20%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPP---LSQYSRGRLVAGSFQGYTGHVDGKPNEA 132
+P I + +G+++ +D NS + K P L Q+ G L ++
Sbjct: 235 IPASIDIDSNGDVYVLDSANSRVQKFQPDGSFLQQWGSGGL----------------SDG 278
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
+F P+G+ + G+++VADT N I+K +G
Sbjct: 279 QFFSPRGIAVSSSGSVFVADTQNKRIQKFTSSGT 312
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 65/148 (43%), Gaps = 39/148 (26%)
Query: 40 WTTRSSSK------------APQADGNVL----------QFE-NG-YLVETVIEGNEIGV 75
WTT SS A ADGNV +F+ NG YL + G++ G
Sbjct: 126 WTTWGSSGSGNKQFSSPSGVAVDADGNVYVADTFNSRIQKFDSNGNYLTQWGSLGSDEGQ 185
Query: 76 V--PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
+ P+ + VS +G ++ D+ NS I + +G+ QG G + E +
Sbjct: 186 LKYPFDLAVSSNGTVYVADQNNSRIQMFS-----------TSGTPQGMWG--EKGSGEGQ 232
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKI 161
FN P + +D G++YV D+ N ++K
Sbjct: 233 FNIPASIDIDSNGDVYVLDSANSRVQKF 260
>gi|299145011|ref|ZP_07038079.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
gi|298515502|gb|EFI39383.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
Length = 459
Score = 42.4 bits (98), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 54/108 (50%), Gaps = 9/108 (8%)
Query: 55 VLQFENGYLVETVIEGNEIGVV-PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
V Q+E E +GNE + P ++ + E+G ++ I KITP G L
Sbjct: 357 VGQYEKAGFTEG--QGNEAKLEEPCQMDLDEEGNIYVAVRKKHRIAKITPD------GML 408
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
+ G +G DG ++A+FNHP+GV G LY++D N IRKI
Sbjct: 409 TNYTGTGTSGTTDGPLDKAQFNHPEGVQFGPDGALYISDYWNHKIRKI 456
>gi|428184600|gb|EKX53455.1| hypothetical protein GUITHDRAFT_101156 [Guillardia theta CCMP2712]
Length = 367
Score = 42.4 bits (98), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 45/87 (51%), Gaps = 6/87 (6%)
Query: 84 EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMD 143
+ G++ VD N I IT ++ G S +G +D ++A FN P G+
Sbjct: 129 QSGKILVVDRENHCIRAITDNHVIHAYG-----SRSSESGWMDAATSKALFNRPFGIAAS 183
Query: 144 DKGNLYVADTLNLAIRKIGDAG-VTTI 169
G L+VADT N ++RKI +G V+T+
Sbjct: 184 STGELFVADTGNHSVRKIDLSGNVSTL 210
Score = 41.6 bits (96), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 45/102 (44%), Gaps = 6/102 (5%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
+ P I V E ++ + I ++ P S + G+ Q G +DGK
Sbjct: 224 ISPVGICVDEQNCVYVSEWGGHRIRRVQPDGSSV----IWVGTSQ--KGKLDGKGINGSL 277
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNV 176
HP G+++D KG +YVAD N +R+I G G NV
Sbjct: 278 FHPAGMSVDTKGIVYVADFGNHCVRRINTTGEMITLGVAGNV 319
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 70/180 (38%), Gaps = 29/180 (16%)
Query: 69 EGNEIGV---VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV-AGSFQGY--- 121
EGN + P I + + ++ D N I ++ Q + G+ +F G
Sbjct: 26 EGNRMTCKMNCPSGIAMGQGDCIYVSDRQNHCIRRLPGIEEQTAPGKEDDVSTFAGVPRK 85
Query: 122 TGHVDGKPNEARFNHPKGV---------------------TMDDKGNLYVADTLNLAIRK 160
+G DG A FN P G+ T G + V D N IR
Sbjct: 86 SGMEDGHNRRALFNEPGGICCLKVGSFPVACFHRLASGLLTRLQSGKILVVDRENHCIRA 145
Query: 161 IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
I D V G +S+ +G+ D + A F+ F + T L V D GN ++R+I L+
Sbjct: 146 ITDNHVIHAYGSRSSESGWMDAATSKALFNRPFGIA-ASSTGELFVADTGNHSVRKIDLS 204
>gi|423214448|ref|ZP_17200976.1| hypothetical protein HMPREF1074_02508 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692863|gb|EIY86099.1| hypothetical protein HMPREF1074_02508 [Bacteroides xylanisolvens
CL03T12C04]
Length = 464
Score = 42.4 bits (98), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 54/108 (50%), Gaps = 9/108 (8%)
Query: 55 VLQFENGYLVETVIEGNEIGVV-PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL 113
V Q+E E +GNE + P ++ + E+G ++ I KITP G L
Sbjct: 362 VGQYEKAGFTEG--QGNEAKLEEPCQMDLDEEGNIYVAVRRKHRIAKITPD------GML 413
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
+ G +G DG ++A+FNHP+GV G LY++D N IRKI
Sbjct: 414 TNYTGTGTSGTTDGPLDKAQFNHPEGVQFGPDGALYISDYWNHKIRKI 461
>gi|289669010|ref|ZP_06490085.1| hypothetical protein XcampmN_11072, partial [Xanthomonas campestris
pv. musacearum NCPPB 4381]
Length = 284
Score = 42.4 bits (98), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 71/153 (46%), Gaps = 19/153 (12%)
Query: 9 TLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
TL LL A+ + Q G L K LT+L + + P +D ++ +G + E
Sbjct: 112 TLVLLDAYGRERQRLVGVTGDLDKALTALCE------GQQPPSDADL----HG-VAERGP 160
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
E + P + +ED L+ D + +++ T GR++ G +DG
Sbjct: 161 EPHLALRFPTGLAATED-RLYVADTGHHRVLECT------HSGRVLRQFGHGNADLIDGG 213
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
EA F P+G+ + ++ LYVADT N A+R+I
Sbjct: 214 VGEAAFRRPQGLAL-ERDQLYVADTGNHALRRI 245
>gi|290974478|ref|XP_002669972.1| predicted protein [Naegleria gruberi]
gi|284083526|gb|EFC37228.1| predicted protein [Naegleria gruberi]
Length = 2346
Score = 42.4 bits (98), Expect = 0.57, Method: Composition-based stats.
Identities = 29/102 (28%), Positives = 47/102 (46%), Gaps = 4/102 (3%)
Query: 121 YTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGG--KSNVAG 178
YT +G A P G+ + + G LY+AD+ N IRK+ + ++T+ G + AG
Sbjct: 842 YTCGDNGDAKLANLKQPTGLFLTESGVLYIADSGNHRIRKVENGTISTVIGSGYSCSSAG 901
Query: 179 FRDGPSEDAKFSNDFD--VVYVRPTCSLLVIDRGNAALRQIS 218
S D VY+ + + V D GN +R++S
Sbjct: 902 AASCDSTDPLLVKLLTPAAVYITKSGDVYVADTGNNKVRKLS 943
>gi|167565350|ref|ZP_02358266.1| hypothetical protein BoklE_22544 [Burkholderia oklahomensis EO147]
Length = 727
Score = 42.4 bits (98), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 36/78 (46%)
Query: 74 GVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
GV+P + +S G+L D S + + + + G+ G V G P + R
Sbjct: 261 GVLPADLAISPSGQLLIADSGPSQQIYVYAKPAAMPVLSSILGTRSGIFHAVKGAPGDWR 320
Query: 134 FNHPKGVTMDDKGNLYVA 151
FN P G+ D GNLYV+
Sbjct: 321 FNGPTGIGFDRGGNLYVS 338
>gi|432922359|ref|XP_004080313.1| PREDICTED: NHL repeat-containing protein 2-like [Oryzias latipes]
Length = 720
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 28/53 (52%), Positives = 35/53 (66%), Gaps = 5/53 (9%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAG--VTTIAG 171
+G DG +EA FN P+GV KG+ +YVADT N IRKI +G V+TIAG
Sbjct: 261 SGRKDGSLSEASFNSPQGVAF--KGDAVYVADTENHLIRKIDLSGGRVSTIAG 311
>gi|423223182|ref|ZP_17209651.1| hypothetical protein HMPREF1062_01837 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392639433|gb|EIY33254.1| hypothetical protein HMPREF1062_01837 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 447
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 40/86 (46%), Gaps = 3/86 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V +G ++ V+ +I KI+ P + + AG DG P E F +
Sbjct: 363 PNGVTVDNEGNIYVVNAGGQSIRKISYPDGYVT---IAAGGTMSGETETDGLPLECTFLY 419
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG 162
PK + MD + N Y+A L +RK+
Sbjct: 420 PKDIAMDSEENFYIAGGAGLNVRKLA 445
>gi|319652355|ref|ZP_08006472.1| cell surface protein [Bacillus sp. 2_A_57_CT2]
gi|317396016|gb|EFV76737.1| cell surface protein [Bacillus sp. 2_A_57_CT2]
Length = 615
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 51/111 (45%), Gaps = 9/111 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA--GSFQGYTGHVDGKPNEARF 134
PY ++ G +F D N I+K + G LV+ GS G G +F
Sbjct: 331 PYDVQSDSKGNVFVSDSFNHRILKYN------TNGSLVSKWGSMYGTGGPFGYGSYAGQF 384
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA-GGKSNVAGFRDGPS 184
P+ + +D N+YVAD++N I+K ++G A G ++GF PS
Sbjct: 385 FVPRQIAVDRYDNVYVADSVNHRIQKFSNSGTFLAAYGSLGTLSGFFQFPS 435
Score = 38.5 bits (88), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 43/86 (50%), Gaps = 13/86 (15%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
P + G L+ D N IVK+ S G+++ G+F G GH G+ FN
Sbjct: 46 TPVALAKDSSGNLYMEDMGNHRIVKMD------SSGKIL-GAF-GSLGHSPGE-----FN 92
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ +D +GN+ VADT N I+K
Sbjct: 93 MPFGIAIDREGNILVADTGNYRIQKF 118
>gi|224537023|ref|ZP_03677562.1| hypothetical protein BACCELL_01899 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521331|gb|EEF90436.1| hypothetical protein BACCELL_01899 [Bacteroides cellulosilyticus
DSM 14838]
Length = 447
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 40/86 (46%), Gaps = 3/86 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V +G ++ V+ +I KI+ P + + AG DG P E F +
Sbjct: 363 PNGVTVDNEGNIYVVNAGGQSIRKISYPDGYVT---IAAGGTMSGETETDGLPLECTFLY 419
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG 162
PK + MD + N Y+A L +RK+
Sbjct: 420 PKDIAMDSEENFYIAGGAGLNVRKLA 445
>gi|167572454|ref|ZP_02365328.1| hypothetical protein BoklC_21631 [Burkholderia oklahomensis C6786]
Length = 727
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 36/78 (46%)
Query: 74 GVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
GV+P + +S G+L D S + + + + G+ G V G P + R
Sbjct: 261 GVLPADLAISPSGQLLIADSGPSQQIYVYAKPAAMPVLSSILGTRSGIFHAVKGAPGDWR 320
Query: 134 FNHPKGVTMDDKGNLYVA 151
FN P G+ D GNLYV+
Sbjct: 321 FNGPTGIGFDRGGNLYVS 338
>gi|328717168|ref|XP_001944227.2| PREDICTED: NHL repeat-containing protein 2-like [Acyrthosiphon
pisum]
Length = 943
Score = 42.4 bits (98), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 50/121 (41%), Gaps = 21/121 (17%)
Query: 100 KITPPLSQYSRGRLVAGSFQGYTGHV------------------DGKPNEARFNHPKGVT 141
KI ++ R+V + QG HV DG EA F+ P+G+
Sbjct: 293 KILVAIADSGHHRIVISTIQGRVKHVIGGGGTGLFSSTTKKGFKDGNFTEALFHSPQGIC 352
Query: 142 MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSND-FDVVYV 198
+ L+V DT N AIR I + V T+AG G +A+ N +DVVY
Sbjct: 353 FQNSHILFVCDTENHAIRMIDLKEKTVKTVAGNGKKGHDKYGGQMWNAQILNTPWDVVYY 412
Query: 199 R 199
+
Sbjct: 413 K 413
>gi|330464985|ref|YP_004402728.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ mal
allergen [Verrucosispora maris AB-18-032]
gi|328807956|gb|AEB42128.1| alkyl hydroperoxide reductase/ thiol specific antioxidant/ mal
allergen [Verrucosispora maris AB-18-032]
Length = 611
Score = 42.4 bits (98), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 62/136 (45%), Gaps = 6/136 (4%)
Query: 115 AGSFQGYTGHV--DGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIGDAGVTTIAG 171
AG + G T DG E P G+ + G L+VAD+ + AIR + D + T G
Sbjct: 326 AGMYAGTTVEALRDGPLAETWLAQPSGLAVSADGRRLWVADSESSAIRYVEDDVMGTAVG 385
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSI 231
G DGP+ A + V + P S+L+ D N A+R+ D+ +
Sbjct: 386 QGLFDFGHVDGPAAQALLQHPLGVCAL-PDGSVLIADTYNGAVRRYDPATDEVSTVATDL 444
Query: 232 S-PTDILMVV-GAVLV 245
+ P+D+ + GAVLV
Sbjct: 445 AEPSDLALTPDGAVLV 460
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 49/108 (45%), Gaps = 12/108 (11%)
Query: 54 NVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGR 112
V +G L ET + P + VS DG L+ D +S I + + + G+
Sbjct: 333 TVEALRDGPLAETWL------AQPSGLAVSADGRRLWVADSESSAIRYVEDDVMGTAVGQ 386
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
G F GHVDG +A HP GV G++ +ADT N A+R+
Sbjct: 387 ---GLFD--FGHVDGPAAQALLQHPLGVCALPDGSVLIADTYNGAVRR 429
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 57/133 (42%), Gaps = 17/133 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ V DG L D +V++ R G G VDG + A F+
Sbjct: 174 PGKLVVGPDGTLLVSDSARHRVVELAADGETLLRA-----IGTGSRGRVDGTADTATFSE 228
Query: 137 PKGVTM------DDKG-NLYVADTLNLAIR--KIGDAGVTTIAGGKSNVAGFRDGPSEDA 187
P+G+ + + G +L VADT+N +R ++G V T+AG D + DA
Sbjct: 229 PQGLCLLPPHVAEVAGYDLVVADTVNHLLRGVRLGSGEVVTVAGTGRQWRSTVDDHAHDA 288
Query: 188 ---KFSNDFDVVY 197
S+ +D+ +
Sbjct: 289 LSVDLSSPWDLAW 301
>gi|386811933|ref|ZP_10099158.1| conserved hypothetical protein [planctomycete KSU-1]
gi|386404203|dbj|GAB62039.1| conserved hypothetical protein [planctomycete KSU-1]
Length = 669
Score = 42.0 bits (97), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 65/141 (46%), Gaps = 11/141 (7%)
Query: 63 LVETVIEGNEIGVVPYKIRVSE-DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY 121
L+E +G + P K+ E LF D ++ IV T G+++ G
Sbjct: 199 LLEKGKKGPSLFSFPGKVLADEISNRLFIADSNHNRIVIAT------LDGKVLDVVGNGK 252
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAG 178
G DG EA FNHP+G+ + LYVADT N IRK+ V TIAG GK
Sbjct: 253 IGKDDGTFTEASFNHPQGMAF-HRDYLYVADTENHLIRKLDLKTKTVKTIAGTGKQANFM 311
Query: 179 FRDGPSEDAKFSNDFDVVYVR 199
G + ++ +D+V+++
Sbjct: 312 AAGGMGTSSPLNSPWDLVFLK 332
Score = 41.6 bits (96), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 42/75 (56%), Gaps = 8/75 (10%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
+G ++ D N I K+ PL + R L +G F GH DGK +A+F P G+++ +
Sbjct: 442 NGLIYIADTYNHKI-KVLNPLDKTCRTILGSGQF----GHDDGK--DAQFYEPGGLSIAN 494
Query: 145 KGNLYVADTLNLAIR 159
LY+ADT N AIR
Sbjct: 495 N-KLYIADTNNHAIR 508
>gi|163787673|ref|ZP_02182120.1| hypothetical protein FBALC1_04002 [Flavobacteriales bacterium
ALC-1]
gi|159877561|gb|EDP71618.1| hypothetical protein FBALC1_04002 [Flavobacteriales bacterium
ALC-1]
Length = 399
Score = 42.0 bits (97), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 54/122 (44%), Gaps = 12/122 (9%)
Query: 70 GNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP 129
GN + P + + E+G L+ V+ N+NIVKI P + + G V G+ GH+D
Sbjct: 158 GNGLLACPNGMTIDENGNLYVVNFSNTNIVKIQPDGTATNIGNTVGGN-----GHIDYDV 212
Query: 130 NEARFNHPK-GVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAK 188
N G + +Y D NL AG T +AG VAG DG + +
Sbjct: 213 NTNNLYIASFGTNL-----IYYLDLDNLTSGTSIIAG-TVVAGTGMGVAGNDDGSALSST 266
Query: 189 FS 190
FS
Sbjct: 267 FS 268
>gi|290975268|ref|XP_002670365.1| predicted protein [Naegleria gruberi]
gi|284083923|gb|EFC37621.1| predicted protein [Naegleria gruberi]
Length = 1034
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 37/69 (53%), Gaps = 4/69 (5%)
Query: 105 LSQYSRGRLVAGSFQGYTGHVDGKPN-EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD 163
LSQ L G+ G DG P A+ N P GV+M G +Y+AD LN IRKI
Sbjct: 30 LSQGIVSTLAGTKEAGFNG--DGIPAVSAQLNSPLGVSMSKGGEIYIADKLNHRIRKIDI 87
Query: 164 AG-VTTIAG 171
G +TTIAG
Sbjct: 88 FGKITTIAG 96
>gi|158293254|ref|XP_314572.4| AGAP010610-PA [Anopheles gambiae str. PEST]
gi|157016870|gb|EAA09865.4| AGAP010610-PA [Anopheles gambiae str. PEST]
Length = 730
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 50/98 (51%), Gaps = 8/98 (8%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
VP SE+ +LFAV + ++ V I S G ++ +G VDG +ARFN
Sbjct: 240 VPTGAGGSEE-QLFAVSDSGNHRVLIVD-----SAGTVLHKVGGKQSGFVDGNFTKARFN 293
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG 171
P+GV ++VAD N A+R+I V+TIAG
Sbjct: 294 APQGVAFQGTDVVFVADNENHAVRRIDLKARLVSTIAG 331
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 45/103 (43%), Gaps = 9/103 (8%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDK-GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD 181
G VDGK A+F HP GV + + G +Y ADT N I+KI DA + FR+
Sbjct: 487 GDVDGKQYGAKFQHPLGVAYNPQDGFIYFADTYNHKIKKI-DAATNC-----ATTCEFRE 540
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDC 224
+F+ + R L + D N L + N DC
Sbjct: 541 ANGAVRRFNEPAGLCLDRSGQLLYIADTNNHEL--LVANLTDC 581
>gi|430744164|ref|YP_007203293.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
gi|430015884|gb|AGA27598.1| NHL repeat protein [Singulisphaera acidiphila DSM 18658]
Length = 358
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 56/111 (50%), Gaps = 17/111 (15%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAG-VTT 168
GRL AG G EA N P V D +GNLY++DT+N IR++ G +G +TT
Sbjct: 30 GRLSAG----------GPAVEATLNMPFDVAFDSRGNLYLSDTMNHCIRRVDGKSGIITT 79
Query: 169 IAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+AG S GF G + AK + +V + +L DR N +R++
Sbjct: 80 VAG--SGTKGFSGDGGVALKAKLDEPYGIV-LDSRGNLYFADRLNRRVRRV 127
>gi|380511998|ref|ZP_09855405.1| nhl repeat protein [Xanthomonas sacchari NCPPB 4393]
Length = 473
Score = 42.0 bits (97), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 43/80 (53%), Gaps = 7/80 (8%)
Query: 82 VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVT 141
V D L+ D + I++ + + GR++ G +DG EA F+ P+G+
Sbjct: 174 VVADDLLYVADSGHHRILECS------TNGRVLRQFGIGTADFIDGAIGEAAFHRPRGLA 227
Query: 142 MDDKGNLYVADTLNLAIRKI 161
++ +G LYVADT N A+R+I
Sbjct: 228 LE-RGVLYVADTGNHALRRI 246
>gi|290971688|ref|XP_002668616.1| predicted protein [Naegleria gruberi]
gi|284082092|gb|EFC35872.1| predicted protein [Naegleria gruberi]
Length = 348
Score = 42.0 bits (97), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 72/145 (49%), Gaps = 11/145 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFN 135
P + VS + E++ D N I KI SQ +AG+ + G++G +G A+
Sbjct: 166 PAGVFVSNN-EVYIADYSNHVIRKI----SQNGTIVTIAGNGKPGFSGD-NGLATNAQLY 219
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDF 193
+P G + +Y++D N IRKI G + TIAG GK +G +GP+ +A+ +
Sbjct: 220 NPSGTFVSSNNEVYISDCFNHVIRKISQNGTIVTIAGNGKGGFSG-DNGPATNAQLYSPL 278
Query: 194 DVVYVRPTCSLLVIDRGNAALRQIS 218
V+V + + D N +R+IS
Sbjct: 279 G-VFVSSDNEVYISDCFNHRIRKIS 302
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 3/108 (2%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGG 172
+AG+ + G +G A+ N+P+ V + +Y+AD N IRK+ G + TIAG
Sbjct: 87 IAGNGKPGFGGDNGLATNAQLNYPRNVYVSSNNEVYIADFCNQRIRKVLQNGNIITIAGN 146
Query: 173 KSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ +GP+ +A+ N V+V + + D N +R+IS N
Sbjct: 147 GTKGFSGDNGPATNAQL-NGPAGVFVSNN-EVYIADYSNHVIRKISQN 192
>gi|290995474|ref|XP_002680320.1| predicted protein [Naegleria gruberi]
gi|284093940|gb|EFC47576.1| predicted protein [Naegleria gruberi]
Length = 992
Score = 42.0 bits (97), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 63/130 (48%), Gaps = 8/130 (6%)
Query: 46 SKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVS-EDGELFAVDEVNSNIVKITPP 104
S A ++ ++ G L++ + V PY + V +G++F D + + KI
Sbjct: 415 SAADKSVSTIIGTNQGKLIDNIPATTASVVDPYGVVVDPSNGDVFISDGYLNCVRKIDGK 474
Query: 105 LSQYSRGRLVAGSFQGYTGHV--DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG 162
+ VAG+ G G V +G N+A+ P G+++ G+L +AD N AIRK+
Sbjct: 475 SGIVTT---VAGT--GEAGDVGDNGPSNKAQLFSPSGLSLTSSGDLLIADNGNQAIRKVS 529
Query: 163 DAGVTTIAGG 172
+ +TTI G
Sbjct: 530 NGIITTIVSG 539
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 33/65 (50%), Gaps = 2/65 (3%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAG 178
G +G D K ++ N P GV + KGN+ +ADT+N +R I D GV T G N
Sbjct: 850 GLSGDGD-KAANSKLNIPTGVAVTKKGNIIIADTINGRLRMINNDTGVITTIAGAENDKL 908
Query: 179 FRDGP 183
D P
Sbjct: 909 VLDNP 913
>gi|398342351|ref|ZP_10527054.1| hypothetical protein LinasL1_04584 [Leptospira inadai serovar Lyme
str. 10]
Length = 473
Score = 42.0 bits (97), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 36/84 (42%), Gaps = 12/84 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE----- 131
P + V +G L+AVD NS ++ Y G A G PN
Sbjct: 328 PTDVVVDSNGGLYAVDFANSRVL-------YYPSGSTTATRVYGQPDFTSSGPNNGGITA 380
Query: 132 ARFNHPKGVTMDDKGNLYVADTLN 155
+ N PKG+ +D +G LYVAD N
Sbjct: 381 SSLNKPKGIALDSQGGLYVADKFN 404
>gi|424868210|ref|ZP_18291970.1| NHL repeat domain protein [Leptospirillum sp. Group II 'C75']
gi|124516337|gb|EAY57845.1| NHL repeat domain protein [Leptospirillum rubarum]
gi|387221511|gb|EIJ76065.1| NHL repeat domain protein [Leptospirillum sp. Group II 'C75']
Length = 494
Score = 42.0 bits (97), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTM--DDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GK 173
QG G DG ++A+F P+G+ DD VADT N +R++ VTTIAG G
Sbjct: 220 QGTPGSADGSFDQAQFRDPQGIRFCPDDPDIAIVADTGNHLLRRVDFRRGSVTTIAGTGV 279
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVR 199
A F P+ A ++ +D+++ R
Sbjct: 280 QGWAIFEPVPAMSAVLNSPWDILFHR 305
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLS-QYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVT 141
S+ ++ VD S++ + P +S + SR + G G+ DG EAR HP G+
Sbjct: 358 SDGTRIYFVDSETSSVRVLHPGVSPRQSRVETLVGRGLFEFGNRDGSFQEARLQHPLGIL 417
Query: 142 MDDKGNLYVADTLNLAIRKI 161
DD G L VADT N IR +
Sbjct: 418 WDD-GLLLVADTYNHRIRAL 436
>gi|29348885|ref|NP_812388.1| hypothetical protein BT_3476 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383125062|ref|ZP_09945721.1| hypothetical protein BSIG_1192 [Bacteroides sp. 1_1_6]
gi|29340791|gb|AAO78582.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251840786|gb|EES68867.1| hypothetical protein BSIG_1192 [Bacteroides sp. 1_1_6]
Length = 457
Score = 42.0 bits (97), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 56/121 (46%), Gaps = 13/121 (10%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTM---------DDKGNLYVADTLNLAIRKIGD 163
+V G QG VDG +AR + P+ T D+ + Y D N IR +
Sbjct: 338 IVCGQ-QGAKDWVDGVGKKARMHAPRQGTFVKNPAYKGSSDEYDFYFCDRENHCIRILTP 396
Query: 164 AG-VTTIAG-GKSNVAGFRDGP-SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
G VTT AG G + +G+ DG ++A+F++ +VY + DR N +R+I
Sbjct: 397 QGRVTTFAGRGSNGTSGYNDGDLRQEARFNHPEGIVYDEERECFFIGDRENRRIRKIGYE 456
Query: 221 Q 221
+
Sbjct: 457 E 457
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 13/86 (15%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA----GSFQGYTGHVDGK-PNEARFNHP 137
S++ + + D N I +TP +GR+ GS G +G+ DG EARFNHP
Sbjct: 376 SDEYDFYFCDRENHCIRILTP------QGRVTTFAGRGS-NGTSGYNDGDLRQEARFNHP 428
Query: 138 KGVTMD-DKGNLYVADTLNLAIRKIG 162
+G+ D ++ ++ D N IRKIG
Sbjct: 429 EGIVYDEERECFFIGDRENRRIRKIG 454
>gi|20090869|ref|NP_616944.1| hypothetical protein MA2021 [Methanosarcina acetivorans C2A]
gi|19915942|gb|AAM05424.1| hypothetical protein MA_2021 [Methanosarcina acetivorans C2A]
Length = 341
Score = 42.0 bits (97), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 51/110 (46%), Gaps = 21/110 (19%)
Query: 61 GYLVE--TVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPP---LSQYSRGRLVA 115
GYL + ++ GN + P+ + V G ++ D N+ I K L+Q+
Sbjct: 129 GYLTQWGSLGSGNGQFIYPHGVAVDSSGNVYVTDAGNNRIQKFNSTGGYLTQW------- 181
Query: 116 GSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
GS+ G N +FN P+GV +D GN+YV D+ N I+K G
Sbjct: 182 GSY--------GSGN-GQFNDPEGVAVDSSGNVYVVDSDNNRIQKFNGTG 222
Score = 38.9 bits (89), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 39/92 (42%), Gaps = 19/92 (20%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP---LSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
PY I V G ++ D N I K L+Q+ G L +G+ Q
Sbjct: 100 PYGIAVDSSGNVYVADTGNKRIQKFNGTGGYLTQW--GSLGSGNGQ-------------- 143
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
F +P GV +D GN+YV D N I+K G
Sbjct: 144 FIYPHGVAVDSSGNVYVTDAGNNRIQKFNSTG 175
>gi|257057817|ref|YP_003135649.1| Peroxiredoxin [Saccharomonospora viridis DSM 43017]
gi|256587689|gb|ACU98822.1| Peroxiredoxin [Saccharomonospora viridis DSM 43017]
Length = 644
Score = 42.0 bits (97), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 64/157 (40%), Gaps = 33/157 (21%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPP----LSQYSRGRLVAGSFQGYTGHVDGKPNEA 132
P K V+ +G + D N ++ + L ++ GR G VDG + A
Sbjct: 196 PAKAVVTAEGRILVADTANHSVAEFASDAETLLRRFGDGR---------RGAVDGAFDVA 246
Query: 133 RFNHPKGVTM------DDKG-NLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG----FRD 181
RF P G+T+ D+ G +L VADT N +R GV G VAG +RD
Sbjct: 247 RFAEPSGLTLLPTHIADEVGYHLLVADTANHLLR-----GVDLRTGAVRTVAGTGRQWRD 301
Query: 182 GPSEDAKFSND----FDVVYVRPTCSLLVIDRGNAAL 214
G A D +DV + ++V GN L
Sbjct: 302 GDDSGAALEVDLTSPWDVRWWDVAGGVVVAMAGNHTL 338
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 38/79 (48%), Gaps = 6/79 (7%)
Query: 88 LFAVDEVNSNI--VKITPPLSQYSRGRLVAGSFQGYT----GHVDGKPNEARFNHPKGVT 141
L+ VD S + V+ P + R V G+ G+ GH DG EA HP G T
Sbjct: 382 LWLVDAETSALRYVERDPGSADEDTPRFVVGTAVGHDLFTFGHRDGSAAEALLQHPLGAT 441
Query: 142 MDDKGNLYVADTLNLAIRK 160
+ G + VADT N A+R+
Sbjct: 442 VLPDGAVAVADTYNGAVRR 460
>gi|397781307|ref|YP_006545780.1| Tripartite motif-containing protein 71 [Methanoculleus bourgensis
MS2]
gi|396939809|emb|CCJ37064.1| Tripartite motif-containing protein 71 AltName: Full=Lin-41 homolog
[Methanoculleus bourgensis MS2]
Length = 1996
Score = 42.0 bits (97), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 40/86 (46%), Gaps = 13/86 (15%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I V G +F VD +N I K S G + S G G DG+ N+P
Sbjct: 1498 IAVDSAGHVFVVDHLNCRIQKFD------SSGTFI--STWGSKGSSDGQ-----LNNPSD 1544
Query: 140 VTMDDKGNLYVADTLNLAIRKIGDAG 165
+ +D GN+YVADT N ++K +G
Sbjct: 1545 IAIDTAGNIYVADTYNNRVQKFDKSG 1570
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 38/81 (46%), Gaps = 13/81 (16%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I V G +F VD +N I K S G + S G G DG+ F P G
Sbjct: 1780 IAVDSAGHVFVVDHLNCRIQKFD------SSGTFI--STWGSEGFSDGQ-----FYRPSG 1826
Query: 140 VTMDDKGNLYVADTLNLAIRK 160
+ +D GN+YVADT + I+K
Sbjct: 1827 IALDSAGNVYVADTYSNRIQK 1847
>gi|290973488|ref|XP_002669480.1| predicted protein [Naegleria gruberi]
gi|284083028|gb|EFC36736.1| predicted protein [Naegleria gruberi]
Length = 1071
Score = 42.0 bits (97), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 70/155 (45%), Gaps = 6/155 (3%)
Query: 68 IEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAG-SFQGYTGHVD 126
I GN + VP + V + G ++ +N+ K+ Q L AG S G +G+
Sbjct: 455 IGGNLMVNVPRGLFVDDSGNIYYAVVFYNNVYKMDVTTGQQ---ILFAGISRLGNSGN-G 510
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSE 185
G A N P V D +Y+ ++ + IRK+ +G+ + G S G +G E
Sbjct: 511 GPATSAMLNRPVQVFGDYSNMIYIVESGSSKIRKVDLQSGIISDFCGTSIGYGGDNGLPE 570
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
AK +N Y R T + + D GN +R+IS N
Sbjct: 571 YAKLANPAGGFYHRSTGDIYIADTGNGRIRKISAN 605
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 49/103 (47%), Gaps = 8/103 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V++ E++ D +N+ I K+ S G++ + G + +
Sbjct: 173 PDSIYVNQQEEIYICDTLNAKIKKV-------SNGKIYTIAGNGKLVNTGNLATQIFLAM 225
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
P+GV +D+K N +Y+ADT + IRKI G+ + G S G
Sbjct: 226 PQGVYVDEKKNEVYIADTNSYLIRKIDSNGIISTLAGISGKLG 268
>gi|298386786|ref|ZP_06996341.1| hypothetical protein HMPREF9007_03540 [Bacteroides sp. 1_1_14]
gi|298260460|gb|EFI03329.1| hypothetical protein HMPREF9007_03540 [Bacteroides sp. 1_1_14]
Length = 429
Score = 41.6 bits (96), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 40/75 (53%), Gaps = 4/75 (5%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLN-LAIRKIG--DAGVTTIAGGKSNVAGFRDG 182
DG EA F P G+ +D+ GN ++ + +RKI D V+T+A G +VA DG
Sbjct: 334 DGYREEAYFAQPNGMAIDEDGNFFIVEGFKAYVLRKISVEDGYVSTVA-GMVDVASQIDG 392
Query: 183 PSEDAKFSNDFDVVY 197
+A F+ +D+ Y
Sbjct: 393 TPLEATFNYPYDICY 407
Score = 41.6 bits (96), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 3/84 (3%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + EDG F V+ + +++ Y VAG +DG P EA FN+
Sbjct: 345 PNGMAIDEDGNFFIVEGFKAYVLRKISVEDGYVS--TVAGMVD-VASQIDGTPLEATFNY 401
Query: 137 PKGVTMDDKGNLYVADTLNLAIRK 160
P + D +G ++A+ A+RK
Sbjct: 402 PYDICYDGEGGFWIAEGWGQAVRK 425
>gi|219852941|ref|YP_002467373.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219547200|gb|ACL17650.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 579
Score = 41.6 bits (96), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 41/89 (46%), Gaps = 13/89 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I ++ G ++ V+ N + + T S G +A G TG + +F
Sbjct: 98 PKGIAINSAGNVYIVNNWNDRVQRFT------STGIFLARWGTGGTG-------DGQFKS 144
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
P GV +D GN+YVAD N ++K AG
Sbjct: 145 PSGVAVDSAGNVYVADMYNYRVQKFSSAG 173
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 36/144 (25%), Positives = 62/144 (43%), Gaps = 21/144 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ D N + K + S G L+A G G DG+ F++
Sbjct: 145 PSGVAVDSAGNVYVADMYNYRVQKFS------SAGTLLAK--WGTEGGGDGQ-----FDY 191
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D + N+YV D+ N ++K G G S D +F++ + +
Sbjct: 192 PTGIAVDSENNVYVVDSYNNRVQKFTSNGTFLAKWGARG--------SGDGEFADFPEEI 243
Query: 197 YVRPTCSLLVIDRGNAALRQISLN 220
V T ++ V D GN + + + N
Sbjct: 244 AVDSTGNVFVTDTGNNRIEKFTSN 267
>gi|440714668|ref|ZP_20895245.1| NHL repeat containing protein [Rhodopirellula baltica SWK14]
gi|436440433|gb|ELP33756.1| NHL repeat containing protein [Rhodopirellula baltica SWK14]
Length = 283
Score = 41.6 bits (96), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 45/102 (44%), Gaps = 5/102 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-----GDAGVTTIAGGKSNVA 177
G+VDG ARFN + +D + LY++D N ++R++ G+ V T AG
Sbjct: 38 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTTDGEWMVDTYAGQGKAGP 97
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+ DA F V LL+ D GN +R I L
Sbjct: 98 ATNNVDRRDATFHEPISVTLDAEGNRLLIADIGNQVVRSIDL 139
>gi|290970758|ref|XP_002668244.1| predicted protein [Naegleria gruberi]
gi|284081534|gb|EFC35500.1| predicted protein [Naegleria gruberi]
Length = 735
Score = 41.6 bits (96), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 63/155 (40%), Gaps = 10/155 (6%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQG--YTGHVDGKPNEAR 133
+P I V++DG ++ D I KI + G + G H P
Sbjct: 402 LPTSIFVTDDGTIYVSDTQLHQIRKIQNGIITAIAGTGTQCTTAGSNTCDHGSTSPLRVD 461
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAG-----FRDGPSED 186
P G+ + G +Y ADT N +RKI G +TT+AG N A F ++
Sbjct: 462 LYTPMGIYVTKSGEVYFADTSNHKVRKISSDGLTITTVAGSAGNCASGTCATFSASATDS 521
Query: 187 -AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
AK VV + S+ + D+G +R +S N
Sbjct: 522 TAKLFYPTGVVVNETSGSIYIADQGTHTIRVVSSN 556
>gi|402571882|ref|YP_006621225.1| hypothetical protein Desmer_1347 [Desulfosporosinus meridiei DSM
13257]
gi|402253079|gb|AFQ43354.1| conserved repeat protein [Desulfosporosinus meridiei DSM 13257]
Length = 2025
Score = 41.6 bits (96), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 13/89 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + G ++ VD N I K +GSF G ++ FN
Sbjct: 931 PQDVATDSSGNVYVVDSGNKRIQKFDS-----------SGSFLKKWGS--NGSDQGEFNR 977
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
P+G+ +D +GN+YVAD+ N I+K AG
Sbjct: 978 PQGIAVDSEGNIYVADSNNHRIQKFNAAG 1006
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 14/108 (12%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
+ P+ I V G ++ D + I K TP G L+A G G+ G +F
Sbjct: 1166 LFPHGIAVDSAGNVYVADTSANWIRKYTP------EGTLLAK--WGTRGNSAG-----QF 1212
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDA-GVTTIAGGKSNVAGFRD 181
++P G+ D GN+YVADT N I+K+ A VT + G S VA +
Sbjct: 1213 DNPYGIAADSAGNVYVADTNNNRIQKLAPAYTVTFDSNGGSPVAAISN 1260
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 37/89 (41%), Gaps = 13/89 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V +G ++ D N I K AG+F G +FN
Sbjct: 978 PQGIAVDSEGNIYVADSNNHRIQKFN-----------AAGAFMTTWGSF--GTELGQFNS 1024
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
PKG+ +D GN+YVAD N ++K G
Sbjct: 1025 PKGIAVDSNGNVYVADVENDRVQKFDSMG 1053
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 22/31 (70%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
E +F +PK VT+D GN+YVADT N I+K
Sbjct: 783 EGQFRNPKSVTVDSAGNIYVADTTNKRIQKF 813
>gi|219851408|ref|YP_002465840.1| carbohydrate binding family 6 [Methanosphaerula palustris E1-9c]
gi|219545667|gb|ACL16117.1| Carbohydrate binding family 6 [Methanosphaerula palustris E1-9c]
Length = 627
Score = 41.6 bits (96), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 34/60 (56%), Gaps = 8/60 (13%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
+ +FN+P+G+ +D GN+YVAD N ++K +G A G + GP ED +FS
Sbjct: 45 DGQFNYPQGIAVDGAGNIYVADNGNKQVQKFTSSGTFVTAWGTA-------GP-EDDRFS 96
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 36/85 (42%), Gaps = 13/85 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G ++ D N + K T S G V T P + RF+
Sbjct: 51 PQGIAVDGAGNIYVADNGNKQVQKFT------SSGTFV-------TAWGTAGPEDDRFSE 97
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ +D +GN+YV DT ++K
Sbjct: 98 PSGIAVDSEGNVYVTDTGKFRVQKF 122
>gi|417302109|ref|ZP_12089222.1| NHL repeat containing protein [Rhodopirellula baltica WH47]
gi|327541551|gb|EGF28082.1| NHL repeat containing protein [Rhodopirellula baltica WH47]
Length = 351
Score = 41.6 bits (96), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 45/103 (43%), Gaps = 5/103 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-----GDAGVTTIAGGKSNVA 177
G+VDG ARFN + +D + LY++D N ++R++ G+ V T AG
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHSVRRLIQTSDGEWMVDTYAGQGKEGP 165
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ DA F V LL+ D GN +R I +
Sbjct: 166 ATNNVDRRDATFHEPISVTLDAEGNRLLIADIGNQVVRSIDFS 208
>gi|326434843|gb|EGD80413.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
Length = 1449
Score = 41.6 bits (96), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 73/155 (47%), Gaps = 16/155 (10%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN----E 131
P I V G+L+ D+ N+ I + L Y + G+ + Y G VDG E
Sbjct: 1300 APTGIAVRGAGDLYIADKGNNRIRYV--DLDSYKIIDTLVGTGR-YGGDVDGSTTTAALE 1356
Query: 132 ARFNHPKGVTMDDKGN-LYVADTLNLAIRKI----GDAGVTTIAGGKSNVAGFR-DGPSE 185
+HP+GV ++ KG L +ADT +R + GD+ VT +AG S GF DG
Sbjct: 1357 TNLDHPEGVAVNAKGTMLVIADTNKHVLRSVSLDGGDSPVTVLAGTGSR--GFNGDGNEP 1414
Query: 186 DA-KFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+A + V Y +++ +D+ N +R++ L
Sbjct: 1415 NATNLHSPVSVAYDIGERAIVFVDQQNRRIRRVHL 1449
>gi|358455779|ref|ZP_09166005.1| serine/threonine protein kinase [Frankia sp. CN3]
gi|357080952|gb|EHI90385.1| serine/threonine protein kinase [Frankia sp. CN3]
Length = 802
Score = 41.6 bits (96), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 46/104 (44%), Gaps = 7/104 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKIT------PPLSQYSRGRLVAGSFQGYTGHVDGKPN 130
P + + +G L D N ++++T P S R +AG+ Q G
Sbjct: 592 PRAVALDPEGRLLVADTGNHRVLRLTLAQPTDPGRPGASEARRLAGTGQPGDAGDGGAAV 651
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGK 173
AR P GV G + VADT N +R + +G +T +AGG+
Sbjct: 652 RARLLRPAGVAALRDGRVLVADTGNGRVRAVAPSGQITPLAGGR 695
>gi|78047814|ref|YP_363989.1| hypothetical protein XCV2258 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78036244|emb|CAJ23935.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 499
Score = 41.6 bits (96), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 71/153 (46%), Gaps = 19/153 (12%)
Query: 9 TLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
TL LL A+ + Q G L K LT+L + + P +D ++ +G + E
Sbjct: 139 TLVLLDAYGRERQRLVGVTGDLDKALTALCE------GQQPPSDADL----HG-VAEREP 187
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
E + P + +ED L+ D + +++ T GR++ G +DG
Sbjct: 188 EPHLALRFPTGLAATED-RLYVADTGHHRVLECT------HSGRVLRQFGHGNADFIDGG 240
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
EA F P+G+ + ++ LYVADT N A+R+I
Sbjct: 241 VGEAAFRRPQGLAL-ERDQLYVADTGNHALRRI 272
>gi|321469552|gb|EFX80532.1| hypothetical protein DAPPUDRAFT_304106 [Daphnia pulex]
Length = 689
Score = 41.6 bits (96), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 44/88 (50%), Gaps = 7/88 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP-NEARFN 135
P + + +L V + ++ +K PL++ L G G DG +E +FN
Sbjct: 452 PLAVAWNTANQLLYVADSYNHKIKTVDPLTKLCSTYL----GNGKAGLADGSSEDEIQFN 507
Query: 136 HPKGVTMDDKGN--LYVADTLNLAIRKI 161
P G+ +D++G LYVADT N IRK+
Sbjct: 508 EPGGLCIDEEGGGILYVADTNNHCIRKV 535
>gi|290986346|ref|XP_002675885.1| serine/threonine kinase [Naegleria gruberi]
gi|284089484|gb|EFC43141.1| serine/threonine kinase [Naegleria gruberi]
Length = 1623
Score = 41.6 bits (96), Expect = 0.95, Method: Composition-based stats.
Identities = 33/119 (27%), Positives = 56/119 (47%), Gaps = 5/119 (4%)
Query: 51 ADGNVLQF-ENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYS 109
++ N L+ N + V T+ P + + +G+L+ + S I+K L+ +
Sbjct: 515 SNANTLRMVANDFTVTTLANSVSNVRYPTGLYYTANGDLYFANTGTSQIMK----LAHDN 570
Query: 110 RGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
++AG+ + G A N+P VT+ G LY+ADT N AIRKI G+ +
Sbjct: 571 TIAVIAGTGEKGNKGDGGLAVNAALNYPTQVTVTSNGILYIADTGNNAIRKIDSNGIIS 629
>gi|290978943|ref|XP_002672194.1| predicted protein [Naegleria gruberi]
gi|284085769|gb|EFC39450.1| predicted protein [Naegleria gruberi]
Length = 1483
Score = 41.6 bits (96), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 45/95 (47%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG 182
G D P+E P GVT+ +YV+D LN IR+I + +TT+AG + D
Sbjct: 763 GFGDNGPSENGIGTPFGVTISPNNEIYVSDVLNNLIRRIKNGKITTVAGSGAYGYNGDDI 822
Query: 183 PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+ A ++ ++V T ++ D N +R I
Sbjct: 823 SALKASLASPSNIVVNPLTNDIIFSDTNNHRIRII 857
>gi|152965577|ref|YP_001361361.1| fibronectin type III domain-containing protein [Kineococcus
radiotolerans SRS30216]
gi|151360094|gb|ABS03097.1| Fibronectin type III domain protein [Kineococcus radiotolerans
SRS30216]
Length = 805
Score = 41.6 bits (96), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 85/203 (41%), Gaps = 21/203 (10%)
Query: 44 SSSKAPQADGNVLQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITP 103
+SSKAP DG ++ G ++ + P ++ V DG ++AVDE S +V +
Sbjct: 162 TSSKAP--DGTLVGPRPGPARQSPLN-------PIEVAVGPDGTVYAVDEDASAVVAVRD 212
Query: 104 PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGD 163
R + G TG PKG+ + G L V++ + I D
Sbjct: 213 GTLTVVAARADVNAALGVTGP--------PLFGPKGLVVQADGTLLVSNKFAHDVVAIRD 264
Query: 164 AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDD 223
+T +AGG++ GP+ + + + + V P +L V D GN ++ + +
Sbjct: 265 GTITVVAGGRNGT--VVPGPATASGLDHP-ERIAVDPQGNLYVADAGNDSIDVVDSDGRI 321
Query: 224 CEYQYNSISPTDILMVV-GAVLV 245
PT + + V G+VL+
Sbjct: 322 SALPVTLNEPTGVAVAVDGSVLI 344
>gi|425467041|ref|ZP_18846325.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|389830285|emb|CCI27873.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9809]
Length = 342
Score = 41.6 bits (96), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 42/91 (46%), Gaps = 15/91 (16%)
Query: 77 PYKIRVSEDGELFAVDEVNSNI-VKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFN 135
PY I V G ++ D N+ I V + + Q++ G +G F+
Sbjct: 197 PYGIAVGSSGNIYVADTFNNRIQVFNSTGVFQFAFGSQGSG--------------NGEFS 242
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
P+G+ +D GN+YV DTLN ++ +GV
Sbjct: 243 LPQGIAVDSSGNIYVTDTLNDRVQVFNPSGV 273
>gi|21242830|ref|NP_642412.1| hypothetical protein XAC2091 [Xanthomonas axonopodis pv. citri str.
306]
gi|21108317|gb|AAM36948.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 499
Score = 41.6 bits (96), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 71/153 (46%), Gaps = 19/153 (12%)
Query: 9 TLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
TL LL A+ + Q G L K LT+L + + P +D ++ +G + E
Sbjct: 139 TLVLLDAYGRERQRLVGVTGDLDKALTALCE------GQPPPSDADL----HG-VAEREP 187
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
E + P + +ED L+ D + +++ T GR++ G +DG
Sbjct: 188 EPHLALRFPTGLAATED-RLYVADTGHHRVLECT------HSGRVLRQFGHGNADLIDGG 240
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
EA F P+G+ + ++ LYVADT N A+R+I
Sbjct: 241 VGEAAFRRPQGLAL-ERDQLYVADTGNHALRRI 272
>gi|386845144|ref|YP_006263157.1| NHL repeat-containing protein 2 [Actinoplanes sp. SE50/110]
gi|359832648|gb|AEV81089.1| NHL repeat-containing protein 2 [Actinoplanes sp. SE50/110]
Length = 600
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 29/52 (55%), Gaps = 2/52 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGG 172
GHVDG A F HP GV G++ VADT N A+R+ A V+T+ G
Sbjct: 382 GHVDGPAGAALFQHPLGVAALADGSVLVADTYNGAVRRFDPASNEVSTVDAG 433
>gi|427430403|ref|ZP_18920257.1| hypothetical protein C882_1420 [Caenispirillum salinarum AK4]
gi|425878863|gb|EKV27574.1| hypothetical protein C882_1420 [Caenispirillum salinarum AK4]
Length = 498
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 37/76 (48%), Gaps = 2/76 (2%)
Query: 105 LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA 164
L Q +R + G GH DG AR HP GV + G + VAD+ N AIR I +A
Sbjct: 377 LGQTARVETLVGRGLFDFGHDDGPMTAARMQHPLGVAVMPDGRVAVADSYNHAIRLIDEA 436
Query: 165 G--VTTIAGGKSNVAG 178
V T+ G+ + G
Sbjct: 437 AGTVETLKTGRLHCTG 452
>gi|390991569|ref|ZP_10261830.1| NHL repeat family protein [Xanthomonas axonopodis pv. punicae str.
LMG 859]
gi|372553665|emb|CCF68805.1| NHL repeat family protein [Xanthomonas axonopodis pv. punicae str.
LMG 859]
Length = 472
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 71/153 (46%), Gaps = 19/153 (12%)
Query: 9 TLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
TL LL A+ + Q G L K LT+L + + P +D ++ +G + E
Sbjct: 112 TLVLLDAYGRERQRLVGVTGDLDKALTALCE------GQPPPSDADL----HG-VAEREP 160
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
E + P + +ED L+ D + +++ T GR++ G +DG
Sbjct: 161 EPHLALRFPTGLAATED-RLYVADTGHHRVLECT------HSGRVLRQFGHGNADLIDGG 213
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
EA F P+G+ + ++ LYVADT N A+R+I
Sbjct: 214 VGEAAFRRPQGLAL-ERDQLYVADTGNHALRRI 245
>gi|348681648|gb|EGZ21464.1| hypothetical protein PHYSODRAFT_329405 [Phytophthora sojae]
Length = 145
Score = 41.6 bits (96), Expect = 1.0, Method: Composition-based stats.
Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 5/82 (6%)
Query: 112 RLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA- 170
++VAGS G G V+G ARF+ P +T G LYV D N IRK+ D T +
Sbjct: 31 KVVAGS--GAKGFVNGFGESARFSTPWALTFGSDGYLYVPDLDNDCIRKV-DITTTEVMT 87
Query: 171 -GGKSNVAGFRDGPSEDAKFSN 191
G +G +G + +A F N
Sbjct: 88 YAGICQTSGAVNGLTTNATFDN 109
>gi|256380948|ref|YP_003104608.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Actinosynnema mirum DSM 43827]
gi|255925251|gb|ACU40762.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Actinosynnema mirum DSM 43827]
Length = 612
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 68/149 (45%), Gaps = 19/149 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P K ++ G L D + ++V++ + G V G G DG E F+
Sbjct: 184 PAKAVLTPAGTLLVSDSAHHSLVELA------ADGETVLRRVGTGERGRADGL--EPTFS 235
Query: 136 HPKGVTM-------DDKGNLYVADTLNLAIRKIG-DAG-VTTIAG-GKSNVAGFRDGPSE 185
P G+ + + VADT+N +R + D G VTT+AG G+ G DGP++
Sbjct: 236 EPAGIALLPPEVAATTGYHAVVADTVNHLLRGLNLDTGEVTTVAGTGEQWRDGATDGPAD 295
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
+ ++ +DVV+ P ++ + GN L
Sbjct: 296 EIPLTSPWDVVWWEPANAVAIALAGNHTL 324
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 49/92 (53%), Gaps = 7/92 (7%)
Query: 82 VSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
++ DGE L+ VD S + + P + ++ + G F GH DG ++A HP GV
Sbjct: 361 LASDGERLWLVDSETSALRWLDPDRTVHTA--VGKGLFD--FGHRDGPADQALLQHPLGV 416
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
T G++ +ADT N A+R+ A V+TIA
Sbjct: 417 TALPDGSVAIADTYNGAVRRYDPATGEVSTIA 448
>gi|325926967|ref|ZP_08188244.1| NHL repeat protein [Xanthomonas perforans 91-118]
gi|325542663|gb|EGD14128.1| NHL repeat protein [Xanthomonas perforans 91-118]
Length = 443
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 71/153 (46%), Gaps = 19/153 (12%)
Query: 9 TLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
TL LL A+ + Q G L K LT+L + + P +D ++ +G + E
Sbjct: 112 TLVLLDAYGRERQRLVGVTGDLDKALTALCE------GQQPPSDADL----HG-VAEREP 160
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
E + P + +ED L+ D + +++ T GR++ G +DG
Sbjct: 161 EPHLALRFPTGLAATED-RLYVADTGHHRVLECT------HSGRVLRQFGHGNADLIDGG 213
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
EA F P+G+ + ++ LYVADT N A+R+I
Sbjct: 214 VGEAAFRRPQGLAL-ERDQLYVADTGNHALRRI 245
>gi|410477767|ref|YP_006765404.1| NHL repeat domain-containing protein [Leptospirillum ferriphilum
ML-04]
gi|406773019|gb|AFS52444.1| NHL repeat domain protein [Leptospirillum ferriphilum ML-04]
Length = 494
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTM--DDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GK 173
QG G DG ++A+F P+G+ DD VADT N +R++ VTTIAG G
Sbjct: 220 QGTPGSADGSFDQAQFRDPQGIRFCPDDPDIAIVADTGNHLLRRVDFRRRSVTTIAGTGV 279
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVR 199
A F P+ A ++ +D+++ R
Sbjct: 280 QGWAIFEPVPAMSAVLNSPWDILFHR 305
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLS-QYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVT 141
S+ ++ VD S++ + P +S + SR + G G+ DG EAR HP G+
Sbjct: 358 SDGTRIYFVDSETSSVRVLHPGVSPRQSRVETLVGRGLFEFGNRDGSFQEARLQHPLGIL 417
Query: 142 MDDKGNLYVADTLNLAIRKI 161
DD G L VADT N IR +
Sbjct: 418 WDD-GLLLVADTYNHRIRAL 436
>gi|410896258|ref|XP_003961616.1| PREDICTED: NHL repeat-containing protein 2-like [Takifugu rubripes]
Length = 716
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 59/123 (47%), Gaps = 11/123 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P KI + + + A+ + + V L + G+L+ +G DG +EA FN
Sbjct: 221 PGKIAIDDGNKRLAIADTGHHRV-----LMVSTTGQLLQVIGGPESGRKDGDLSEASFNS 275
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDG--PSEDAKFSND 192
P+GV + KG+ +YVADT N IRKI AG + G +DG P + S+
Sbjct: 276 PQGVAI--KGDTVYVADTENHLIRKIDLSAGKVSTLAGTGEQGTDKDGGAPGPEQPISSP 333
Query: 193 FDV 195
+DV
Sbjct: 334 WDV 336
>gi|421614357|ref|ZP_16055418.1| NHL repeat containing protein [Rhodopirellula baltica SH28]
gi|408494877|gb|EKJ99474.1| NHL repeat containing protein [Rhodopirellula baltica SH28]
Length = 351
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 44/103 (42%), Gaps = 5/103 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-----GDAGVTTIAGGKSNVA 177
G+VDG ARFN + +D + LY++D N +R++ G+ V T AG
Sbjct: 106 GYVDGPARSARFNKLHNLVIDAEDVLYLSDHANHCVRRLIQTSDGEWMVDTYAGQGEEGP 165
Query: 178 GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
+ DA F V LL+ D GN +R I +
Sbjct: 166 ATDNVDRRDATFHEPISVTLDAEGNRLLIADIGNQVVRSIDFS 208
>gi|381173685|ref|ZP_09882761.1| NHL repeat family protein [Xanthomonas citri pv. mangiferaeindicae
LMG 941]
gi|380685855|emb|CCG39248.1| NHL repeat family protein [Xanthomonas citri pv. mangiferaeindicae
LMG 941]
Length = 472
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 71/153 (46%), Gaps = 19/153 (12%)
Query: 9 TLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
TL LL A+ + Q G L K LT+L + + P +D ++ +G + E
Sbjct: 112 TLVLLDAYGRERQRLVGVTGDLDKALTALCE------GQPPPSDADL----HG-VAEREP 160
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
E + P + +ED L+ D + +++ T GR++ G +DG
Sbjct: 161 EPHLALRFPTGLAATED-RLYVADTGHHRVLECT------HSGRVLRQFGHGNADLIDGG 213
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
EA F P+G+ + ++ LYVADT N A+R+I
Sbjct: 214 VGEAAFRRPQGLAL-ERDQLYVADTGNHALRRI 245
>gi|156740315|ref|YP_001430444.1| NHL repeat-containing protein [Roseiflexus castenholzii DSM 13941]
gi|156231643|gb|ABU56426.1| NHL repeat containing protein [Roseiflexus castenholzii DSM 13941]
Length = 1163
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 84/201 (41%), Gaps = 39/201 (19%)
Query: 46 SKAPQADGNVLQFENGYLVETVIEGNEIG------VVPYKIRVSEDGELFAVDEVNSNIV 99
+ A A G+V++ LV T I G E G + P + V G + D N ++
Sbjct: 833 APAVAATGSVIK-----LVATDIFG-EFGSEPGQLIQPRGVAVDPQGNVIVSDSGNHRLI 886
Query: 100 KITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR 159
P + R + G G+ DG+ F P+GV +D GN+YVADT N I
Sbjct: 887 VFDPSGTPI---RTIGG-----FGNGDGQ-----FYEPRGVAVDAAGNMYVADTWNARIV 933
Query: 160 KIGDAG--VTTIAGGKSNVAGFR----DGPSEDAKFSNDFDV-----VYVRPTCSLLVID 208
K+ G +++ G+ + R G S++ + D V V ++ + D
Sbjct: 934 KLDPQGRFLSSWGVGREDFGDGRRASPTGGSQEQNLARPLDFFGPRGVAVDAEGNVYIAD 993
Query: 209 RGNAALRQISLNQDDCEYQYN 229
GN ++I + D YQY
Sbjct: 994 TGN---KRIVVTDSDGNYQYQ 1011
>gi|289665900|ref|ZP_06487481.1| hypothetical protein XcampvN_23187 [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 472
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 71/153 (46%), Gaps = 19/153 (12%)
Query: 9 TLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
TL LL A+ + Q G L K LT+L + + P +D ++ +G + E
Sbjct: 112 TLVLLDAYGRERQRLVGVTGDLDKALTALCE------GQQPPSDADL----HG-VAERGP 160
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
E + P + +ED L+ D + +++ T GR++ G +DG
Sbjct: 161 EPHLALRFPTGLAATED-RLYVADTGHHRVLECT------HSGRVLRQFGHGNADLIDGG 213
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
EA F P+G+ + ++ LYVADT N A+R+I
Sbjct: 214 VGEAAFRRPQGLAL-ERDQLYVADTGNHALRRI 245
>gi|302036773|ref|YP_003797095.1| hypothetical protein NIDE1420 [Candidatus Nitrospira defluvii]
gi|190343188|gb|ACE75576.1| putative NHL repeat protein [Candidatus Nitrospira defluvii]
gi|300604837|emb|CBK41169.1| protein of unknown function, contains NHL repeats [Candidatus
Nitrospira defluvii]
Length = 989
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 61/152 (40%), Gaps = 36/152 (23%)
Query: 76 VPYKIRVSEDGELFAVDEVNSNIVKIT---PPLSQYSRGRLVAGSFQGYTGHVDGKPNEA 132
P+ + +DG ++ D N+ I K PL ++ R GSF G
Sbjct: 271 APWGVTTDQDGNIYVTDTNNARIQKFKGDGTPLLKWGR----DGSFDG------------ 314
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSND 192
F P+GV +D GN YVAD N I+K G G+ GP + F
Sbjct: 315 AFFFPRGVAVDFVGNTYVADEGNNRIQKFDTRGSFLTKWGREG-----SGPGQ---FKAP 366
Query: 193 FDVVYVRPTCSLL----VIDRGNAALRQISLN 220
+ V TC L V+D+GN +++ N
Sbjct: 367 WGV-----TCDALGNVYVVDQGNHRIQKFDGN 393
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 46/96 (47%), Gaps = 13/96 (13%)
Query: 66 TVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV 125
+++ G E Y + + + G +F VD N+ I K + ++ G+F
Sbjct: 673 SLLHGGEFMAPMYGLAIDQQGSIFVVDNGNNRIQK----FDRTGNFIILWGNF------- 721
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
G N A F++P G+ D KG++YV DT N ++K
Sbjct: 722 -GAAN-ANFHNPTGIACDAKGDVYVVDTNNHRVQKF 755
>gi|427719807|ref|YP_007067801.1| NHL repeat containing protein [Calothrix sp. PCC 7507]
gi|427352243|gb|AFY34967.1| NHL repeat containing protein [Calothrix sp. PCC 7507]
Length = 505
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 51/110 (46%), Gaps = 7/110 (6%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA---GVTTIAG-GKSN 175
G VDG E+ F P G+T D K LY+AD+ +IR +G V T+ G G
Sbjct: 327 GAEACVDGSLTESAFAQPSGITTDGK-ELYIADSEVSSIRSVGIVEPRQVKTVCGSGGLF 385
Query: 176 VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCE 225
G DG E+ + + V Y + L V D N ++ +S + +C+
Sbjct: 386 SFGDVDGQGENVRLQHCLGVEYAQN--YLWVADTYNHKIKLVSPSTRNCQ 433
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 39/72 (54%), Gaps = 12/72 (16%)
Query: 112 RLVAGSFQGYTGHV---------DGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKI 161
RLV SF G HV DG +EA+F P+G+ D++ LY+ADT N A+R++
Sbjct: 196 RLVRSSFDGEILHVIGTGKPGFIDGDFSEAQFFAPQGMAFDEENQILYIADTENHALRRV 255
Query: 162 G--DAGVTTIAG 171
V TIAG
Sbjct: 256 DLRHQLVKTIAG 267
>gi|375098226|ref|ZP_09744491.1| Peroxiredoxin [Saccharomonospora marina XMU15]
gi|374658959|gb|EHR53792.1| Peroxiredoxin [Saccharomonospora marina XMU15]
Length = 673
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 33/59 (55%), Gaps = 2/59 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGF 179
GH DG+ +A HP GV + G + VADT N A+R+ A V+TI G + +GF
Sbjct: 457 GHRDGRAGQALLQHPLGVAVLPDGAVAVADTYNGAVRRYDPASGEVSTIMTGLAEPSGF 515
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 62/148 (41%), Gaps = 15/148 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K V+ +G + D ++ + LV G GH DG + A F
Sbjct: 236 PAKAVVTAEGRVLVADTAQHSVAEFASDAET-----LVRRFGSGVRGHADGAFDVAAFAE 290
Query: 137 PKGVTM------DDKG-NLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGFRDGPSED 186
P G+T+ + G +L VADT N +R + VTT+AG G+ G G + D
Sbjct: 291 PSGLTLLPTEIAERVGYHLVVADTGNHLLRGVDLRTGAVTTVAGTGEQWRDGADAGTALD 350
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
++ +DV + ++V GN L
Sbjct: 351 VALTSPWDVRWWEAAGGVVVAMAGNHTL 378
>gi|443692107|gb|ELT93780.1| hypothetical protein CAPTEDRAFT_99644, partial [Capitella teleta]
Length = 241
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 62/143 (43%), Gaps = 40/143 (27%)
Query: 59 ENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVD-------------EVNSNIVKITP-- 103
++G + T+ G + PY + ++ DG L +D E + +VK +P
Sbjct: 50 DDGQFIRTI--GAAVLQRPYGLALTNDGHLLVIDHGRKAGDASVVRFEQDGRLVKRSPFE 107
Query: 104 PLSQ--YSRGRLVA-----------GSFQGYTGHVDGKP---------NEARFNHPKGVT 141
PLS YS+ R +A G Y DGK NE F P G++
Sbjct: 108 PLSNAAYSKCRFIAVWRDSLYVVDLGKSVVYVTDFDGKEKRRFGQFGRNEGDFREPSGIS 167
Query: 142 MDDKGNLYVADTLNLAIRKIGDA 164
+D KG +Y+AD+ N I +I DA
Sbjct: 168 VDPKGFVYIADSKNDRI-QIFDA 189
>gi|380693445|ref|ZP_09858304.1| hypothetical protein BfaeM_05612, partial [Bacteroides faecis
MAJ27]
Length = 412
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
G DG EA +P G+T D GNLY D+ +RKI D +TT+AG
Sbjct: 343 GGKDGPRLEATLRNPAGITFDKSGNLYFCDSQGYTLRKISRIDGMLTTVAG 393
>gi|346725095|ref|YP_004851764.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
pv. citrumelo F1]
gi|418518756|ref|ZP_13084891.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
pv. malvacearum str. GSPB1386]
gi|346649842|gb|AEO42466.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
pv. citrumelo F1]
gi|410702450|gb|EKQ60955.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
pv. malvacearum str. GSPB1386]
Length = 472
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 71/153 (46%), Gaps = 19/153 (12%)
Query: 9 TLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
TL LL A+ + Q G L K LT+L + + P +D ++ +G + E
Sbjct: 112 TLVLLDAYGRERQRLVGVTGDLDKALTALCE------GQQPPSDADL----HG-VAEREP 160
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
E + P + +ED L+ D + +++ T GR++ G +DG
Sbjct: 161 EPHLALRFPTGLAATED-RLYVADTGHHRVLECT------HSGRVLRQFGHGNADLIDGG 213
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
EA F P+G+ + ++ LYVADT N A+R+I
Sbjct: 214 VGEAAFRRPQGLAL-ERDQLYVADTGNHALRRI 245
>gi|428186137|gb|EKX54988.1| hypothetical protein GUITHDRAFT_59145, partial [Guillardia theta
CCMP2712]
Length = 231
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 6/71 (8%)
Query: 149 YVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 206
+V D L+ +RK+ + V+ AG S + GFRDG + DA+FS +++ R L V
Sbjct: 1 FVTDRLDHRVRKVSKSNNFVSYFAG--SGIQGFRDGAAGDARFSEPLGIIHFRN--KLYV 56
Query: 207 IDRGNAALRQI 217
D N A+R I
Sbjct: 57 ADSNNHAIRSI 67
>gi|157279030|gb|AAI34672.1| NHLRC2 protein [Bos taurus]
Length = 376
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 33/61 (54%), Gaps = 5/61 (8%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIA--GGKSNVA 177
G VDG AR HP GVT D K N LYVAD+ N I+ + TT+A G SN+
Sbjct: 113 GDVDGVGINARLQHPLGVTWDQKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGNASNMI 172
Query: 178 G 178
G
Sbjct: 173 G 173
>gi|57107627|ref|XP_544027.1| PREDICTED: NHL repeat-containing protein 2 isoform 1 [Canis lupus
familiaris]
Length = 726
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
G VDG AR HP GVT D K N LYVAD+ N I+ + TT+AG
Sbjct: 463 GDVDGVGVNARLQHPLGVTWDKKRNLLYVADSYNHKIKAVDPKTKNCTTLAG 514
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 66/146 (45%), Gaps = 22/146 (15%)
Query: 68 IEGNEIGVVPYKIRVSEDGELF----AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG 123
I N+I + YK + LF VD V++ +V ++ R++ G
Sbjct: 204 IRDNKIEIKLYKDSLPPSPLLFPGKVTVDHVSNRLV-----IADTGHHRILVVWKNGQIQ 258
Query: 124 HVDGKPN---------EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG- 171
++ G PN E+ FN P+GV + + +YVADT N IRKI V+T+AG
Sbjct: 259 YIIGGPNPGRKDGIFSESTFNSPQGVAIRNN-IIYVADTENHLIRKIDLEAEMVSTVAGI 317
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVY 197
G P E+ S+ +DVV+
Sbjct: 318 GIQGTDKEGGAPGEEQPISSPWDVVF 343
>gi|290975165|ref|XP_002670314.1| predicted protein [Naegleria gruberi]
gi|284083871|gb|EFC37570.1| predicted protein [Naegleria gruberi]
Length = 1504
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 47/89 (52%), Gaps = 7/89 (7%)
Query: 84 EDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGH-VDGKPNEARFNHPKGVTM 142
++GE++ D N I KIT G +V + G +G+ + G P + ++P G+ +
Sbjct: 454 QNGEIYFADSQNHRIRKIT------QDGFVVNVAGTGISGYSLIGDPLLSYLSNPNGIVV 507
Query: 143 DDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
G++Y++DT N IR+I + ++ G
Sbjct: 508 LKNGDIYISDTDNHCIRQIKNGTLSNFVG 536
>gi|383123946|ref|ZP_09944616.1| hypothetical protein BSIG_4035 [Bacteroides sp. 1_1_6]
gi|251838822|gb|EES66907.1| hypothetical protein BSIG_4035 [Bacteroides sp. 1_1_6]
Length = 451
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 48/107 (44%), Gaps = 10/107 (9%)
Query: 57 QFENGYLVETVIEGNEIGVVPYKIRVSEDGE--LFAVDEVNSNIVKITPPLSQYSRGRLV 114
Q NGY + + N I V E G+ ++ D I KI + V
Sbjct: 350 QVINGYRTDCLFNWNN----QMTIIVDESGQKNMYIADAGTQTIRKIDMNTGMVTT---V 402
Query: 115 AGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
G Q G+ G P EA FN PKGV + +G+LY++D + +RK+
Sbjct: 403 VGR-QNVKGNQSGTPLEATFNWPKGVGLTAEGDLYISDCGSGCVRKL 448
>gi|407801334|ref|ZP_11148178.1| PKD domain-containing protein [Alcanivorax sp. W11-5]
gi|407024771|gb|EKE36514.1| PKD domain-containing protein [Alcanivorax sp. W11-5]
Length = 1447
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 41/90 (45%), Gaps = 13/90 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY + V+ G+L+ D NS ++++ G Q GH A+F +
Sbjct: 187 PYGLDVTATGDLWLADTGNSRVLRLP-----------AGGGAQTAFGHF--GTGSAQFRY 233
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
P+ V + D G +YVADT N I + D G
Sbjct: 234 PRDVAVGDDGRVYVADTDNHRIVVLEDTGA 263
>gi|139949007|ref|NP_001077192.1| NHL repeat-containing protein 2 [Bos taurus]
gi|166233893|sp|A4IF69.1|NHLC2_BOVIN RecName: Full=NHL repeat-containing protein 2
gi|134025213|gb|AAI34431.1| NHLRC2 protein [Bos taurus]
gi|296472617|tpg|DAA14732.1| TPA: NHL repeat-containing protein 2 [Bos taurus]
Length = 726
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 33/61 (54%), Gaps = 5/61 (8%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIA--GGKSNVA 177
G VDG AR HP GVT D K N LYVAD+ N I+ + TT+A G SN+
Sbjct: 463 GDVDGVGINARLQHPLGVTWDQKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGNASNMI 522
Query: 178 G 178
G
Sbjct: 523 G 523
Score = 38.9 bits (89), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 66/148 (44%), Gaps = 22/148 (14%)
Query: 68 IEGNEIGVVPYKIRVSEDGELF----AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG 123
I N+IG+ YK + LF VD V++ +V ++ R++ G
Sbjct: 204 IRANKIGIKLYKDSLPPSPLLFPGKITVDHVSNRLV-----IADTGHHRILVVWKNGQIQ 258
Query: 124 HVDGKPN---------EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGG 172
+ G PN E+ FN P+GV + + +YVADT N IRKI V+T+AG
Sbjct: 259 YSIGGPNPGRKDGIFSESSFNSPQGVAIMNN-IIYVADTENHLIRKIDLEAEMVSTVAGI 317
Query: 173 KSNVAGFRDGPSEDAK-FSNDFDVVYVR 199
G D + S+ +DVV+ R
Sbjct: 318 GIQGTDKEGGAKGDEQPISSPWDVVFGR 345
>gi|440911916|gb|ELR61535.1| NHL repeat-containing protein 2, partial [Bos grunniens mutus]
Length = 760
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 33/61 (54%), Gaps = 5/61 (8%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIA--GGKSNVA 177
G VDG AR HP GVT D K N LYVAD+ N I+ + TT+A G SN+
Sbjct: 497 GDVDGVGINARLQHPLGVTWDQKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGNASNMI 556
Query: 178 G 178
G
Sbjct: 557 G 557
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 66/148 (44%), Gaps = 22/148 (14%)
Query: 68 IEGNEIGVVPYKIRVSEDGELF----AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG 123
I N+IG+ YK + LF VD V++ +V ++ R++ G
Sbjct: 220 IRANKIGIKLYKDSLPPSPLLFPGKITVDHVSNRLV-----IADTGHHRILVVWKNGQIQ 274
Query: 124 HVDGKPN---------EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGG 172
+ G PN E+ FN P+GV + + +YVADT N IRKI V+T+AG
Sbjct: 275 YSIGGPNPGRKDGIFSESSFNSPQGVAIMNN-IIYVADTENHLIRKIDLEAEMVSTVAGI 333
Query: 173 KSNVAGFRDGPSEDAK-FSNDFDVVYVR 199
G D + S+ +DVV+ R
Sbjct: 334 GIQGTDKEGGAKGDEQPISSPWDVVFGR 361
>gi|343780954|ref|NP_001230490.1| NHL repeat-containing protein 2 [Sus scrofa]
Length = 725
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 67/148 (45%), Gaps = 22/148 (14%)
Query: 68 IEGNEIGVVPYKIRVSEDGELF----AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG 123
I+ N+IG+ YK ++ LF VD V + +V ++ R++ G
Sbjct: 204 IKDNKIGIKLYKDSLTPSPLLFPGKVTVDHVTNRLV-----IADTGHHRILVVWKNGQIQ 258
Query: 124 HVDGKPN---------EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG- 171
+ G PN E+ FN P+GV + + +YVADT N IRKI V+T+AG
Sbjct: 259 YTIGGPNPGRKDGIFSESTFNSPQGVAIRNN-IIYVADTENHLIRKIDLEAEMVSTVAGI 317
Query: 172 GKSNVAGFRDGPSEDAKFSNDFDVVYVR 199
G E+ S+ +DVV+ R
Sbjct: 318 GIQGTDKEGGANGEEQPISSPWDVVFGR 345
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 3/52 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
G VDG A+ HP GVT D+K N LYVAD+ N I+ + TT+AG
Sbjct: 463 GDVDGVGINAKLQHPLGVTWDEKRNLLYVADSYNHKIKVVDPKTKSCTTLAG 514
>gi|126273410|ref|XP_001377730.1| PREDICTED: NHL repeat-containing protein 2 [Monodelphis domestica]
Length = 777
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
G VDG A+ HP GVT D K N LYVAD+ N I+ + TTIAG
Sbjct: 513 GDVDGAGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTIAG 564
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 68/149 (45%), Gaps = 28/149 (18%)
Query: 68 IEGNEIGVVPYKIRVSEDGELF----AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG 123
I+ N+IG+ Y+ + LF VD ++ +V ++ R++ G
Sbjct: 254 IKDNKIGIKLYRDSLPPSPLLFPGKVTVDNLSDRLV-----IADTGHHRILVVRKNGQIQ 308
Query: 124 HVDGKPN---------EARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG- 171
H G PN E+ FN P+GV++ + +YVADT N IRKI V+T+AG
Sbjct: 309 HSIGGPNSGSKDGTFLESTFNSPQGVSIKNN-IIYVADTENHLIRKIDLETEMVSTVAGI 367
Query: 172 ---GKSNVAGFRDGPSEDAKFSNDFDVVY 197
G G + E+ S+ +DVV+
Sbjct: 368 GIQGTDKEGGAK---GEEQPISSPWDVVF 393
>gi|188501691|gb|ACD54804.1| NHL repeat containing protein-like protein [Adineta vaga]
Length = 224
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 65/148 (43%), Gaps = 20/148 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I + + +L+ D VN I+K P S G +VAG+ TG + N
Sbjct: 81 PYGIFIDIEFKLYLADSVNHRIIKFLPGQST---GIVVAGT--SATGSI-------TLNE 128
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P+GV +D G +Y+AD N I G I ++ R G + + N ++
Sbjct: 129 PRGVILDGNGYVYIADYGNHRILGSEPNGFQCI------ISRSRVGSASNQL--NYPQII 180
Query: 197 YVRPTCSLLVIDRGNAALRQISLNQDDC 224
S+ V D+GN +++ +L + C
Sbjct: 181 GFDTYGSIFVTDKGNNRIQKFTLKSNFC 208
>gi|427731644|ref|YP_007077881.1| thiol-disulfide isomerase-like thioredoxin [Nostoc sp. PCC 7524]
gi|427367563|gb|AFY50284.1| thiol-disulfide isomerase-like thioredoxin [Nostoc sp. PCC 7524]
Length = 505
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 3/55 (5%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
G +G DG +EA+F+ P+G+T D LYVADT N A+R+I V TIAG
Sbjct: 213 GKSGFTDGAFDEAQFSAPQGMTFDAANQILYVADTDNHAVRQINLKRQIVETIAG 267
>gi|332980628|ref|YP_004462069.1| NHL repeat containing protein [Mahella australiensis 50-1 BON]
gi|332698306|gb|AEE95247.1| NHL repeat containing protein [Mahella australiensis 50-1 BON]
Length = 484
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 39/84 (46%), Gaps = 15/84 (17%)
Query: 82 VSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVT 141
+ E G L+ VD N+ I+K+ +G V G P + FN PKGV
Sbjct: 69 IDERGLLYVVDTGNARIIKLDK---------------KGAVLGVYGGPKDKPFNTPKGVF 113
Query: 142 MDDKGNLYVADTLNLAIRKIGDAG 165
+D G++YVADT N I + G
Sbjct: 114 VDWSGDMYVADTGNGRIVHLSQDG 137
>gi|427739164|ref|YP_007058708.1| thiol-disulfide isomerase-like thioredoxin [Rivularia sp. PCC 7116]
gi|427374205|gb|AFY58161.1| thiol-disulfide isomerase-like thioredoxin [Rivularia sp. PCC 7116]
Length = 487
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 43/92 (46%), Gaps = 16/92 (17%)
Query: 112 RLVAGSFQGYTGHV---------DGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKI 161
R+V S G H+ DG E F P+G+ D+ N LYVADT N AIRK+
Sbjct: 207 RIVVSSLNGEVLHLIGSGKAGLKDGSFEETEFFAPQGMAFDEVDNILYVADTENNAIRKV 266
Query: 162 G--DAGVTTIAGGKSNVAGFRDGPSEDAKFSN 191
V TIAG K ++ + P + K N
Sbjct: 267 DFKSQKVETIAGIKESL----NSPWDLVKLGN 294
>gi|39998315|ref|NP_954266.1| NHL repeat domain-containing protein [Geobacter sulfurreducens PCA]
gi|409913665|ref|YP_006892130.1| NHL repeat domain-containing protein [Geobacter sulfurreducens
KN400]
gi|39985261|gb|AAR36616.1| NHL repeat domain lipoprotein [Geobacter sulfurreducens PCA]
gi|298507249|gb|ADI85972.1| NHL repeat domain protein [Geobacter sulfurreducens KN400]
Length = 365
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 59/126 (46%), Gaps = 17/126 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V DG L D +NS I T + G+ AGSF G G G RF
Sbjct: 232 PTDLAVMADGRLLVTDSLNSRIQIFT------ADGK-PAGSF-GEAGDTPG-----RFTR 278
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIA-GGKSNVAGFRDGPSEDAKFSNDFDV 195
PKGV +D +G++YV D+ ++ + G +A G K ++ G PS +N D+
Sbjct: 279 PKGVAVDSEGHIYVCDSQQDMVQIFDETGRLLLAFGDKGSLPGQFWMPS-GIHIAN--DM 335
Query: 196 VYVRPT 201
+YV T
Sbjct: 336 IYVSDT 341
>gi|302531208|ref|ZP_07283550.1| NHL repeat-containing protein [Streptomyces sp. AA4]
gi|302440103|gb|EFL11919.1| NHL repeat-containing protein [Streptomyces sp. AA4]
Length = 609
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 85 DGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMD 143
DG+ L+ VD S + I P +S V + GH DG ++A HP G+ +
Sbjct: 352 DGQKLWLVDAETSALRWIEPAGESFSVHTAVGTDLFTF-GHADGPSDQALLQHPLGLAVL 410
Query: 144 DKGNLYVADTLNLAIRK 160
G + +ADT N AIR+
Sbjct: 411 SDGRIAIADTYNGAIRR 427
>gi|418523208|ref|ZP_13089230.1| hypothetical protein WS7_19596 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410700170|gb|EKQ58738.1| hypothetical protein WS7_19596 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 472
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 69/153 (45%), Gaps = 19/153 (12%)
Query: 9 TLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
TL LL A+ + Q G L K LT+L + + P +D ++ + E
Sbjct: 112 TLVLLDAYGRERQRLVGVTGDLDKALTALCE------GQPPPSDADLRG-----VAEREP 160
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
E + P + +ED L+ D + +++ T GR++ G +DG
Sbjct: 161 EPHLALRFPTGLAATED-RLYIADTGHHRVLECT------HSGRVLRQFGHGNADLIDGG 213
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
EA F P+G+ + ++ LYVADT N A+R+I
Sbjct: 214 VGEAAFRRPQGLAL-ERDQLYVADTGNHALRRI 245
>gi|219851493|ref|YP_002465925.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
gi|219545752|gb|ACL16202.1| NHL repeat containing protein [Methanosphaerula palustris E1-9c]
Length = 390
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 13/90 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V G ++ VD N + K T S G +A + H G F+
Sbjct: 140 PTSLAVDGVGSIYVVDSWNDRVQKFT------SSGSFLAA----WGSHGSGV---GEFDE 186
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
P G+ +D GN+YVADT N ++K AG+
Sbjct: 187 PFGIAVDGTGNVYVADTYNNRVQKFTSAGL 216
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 25/43 (58%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS 174
+F+HP GV +D GN+YVADT N I+K +G G S
Sbjct: 41 GQFDHPGGVAVDSAGNVYVADTENDRIQKFTSSGTFVTVWGSS 83
>gi|347447722|pdb|3TC9|A Chain A, Crystal Structure Of A Hypothetical Hydrolase (Bt_3476)
From Bacteroides Thetaiotaomicron Vpi-5482 At 2.23 A
Resolution
gi|347447723|pdb|3TC9|B Chain B, Crystal Structure Of A Hypothetical Hydrolase (Bt_3476)
From Bacteroides Thetaiotaomicron Vpi-5482 At 2.23 A
Resolution
Length = 430
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 56/121 (46%), Gaps = 13/121 (10%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTM---------DDKGNLYVADTLNLAIRKIGD 163
+V G QG VDG +AR + P+ T D+ + Y D N IR +
Sbjct: 311 IVCGQ-QGAKDWVDGVGKKARXHAPRQGTFVKNPAYKGSSDEYDFYFCDRENHCIRILTP 369
Query: 164 AG-VTTIAG-GKSNVAGFRDGP-SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
G VTT AG G + +G+ DG ++A+F++ +VY + DR N +R+I
Sbjct: 370 QGRVTTFAGRGSNGTSGYNDGDLRQEARFNHPEGIVYDEERECFFIGDRENRRIRKIGYE 429
Query: 221 Q 221
+
Sbjct: 430 E 430
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 13/86 (15%)
Query: 83 SEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA----GSFQGYTGHVDGK-PNEARFNHP 137
S++ + + D N I +TP +GR+ GS G +G+ DG EARFNHP
Sbjct: 349 SDEYDFYFCDRENHCIRILTP------QGRVTTFAGRGS-NGTSGYNDGDLRQEARFNHP 401
Query: 138 KGVTMD-DKGNLYVADTLNLAIRKIG 162
+G+ D ++ ++ D N IRKIG
Sbjct: 402 EGIVYDEERECFFIGDRENRRIRKIG 427
>gi|253701709|ref|YP_003022898.1| NHL repeat containing protein [Geobacter sp. M21]
gi|251776559|gb|ACT19140.1| NHL repeat containing protein [Geobacter sp. M21]
Length = 436
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 40/84 (47%), Gaps = 15/84 (17%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARF 134
V PY I S +G+L+ D + ++++ + P + +R + G N R
Sbjct: 83 VKPYGI-TSANGKLYICDTLQADVITVDLPNKKMTR--------------LSGNVNAGRL 127
Query: 135 NHPKGVTMDDKGNLYVADTLNLAI 158
P V +D +GN+YVADT L +
Sbjct: 128 KKPVNVAVDARGNIYVADTSRLEV 151
>gi|434391812|ref|YP_007126759.1| NHL repeat containing protein [Gloeocapsa sp. PCC 7428]
gi|428263653|gb|AFZ29599.1| NHL repeat containing protein [Gloeocapsa sp. PCC 7428]
Length = 509
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 54/110 (49%), Gaps = 6/110 (5%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
+G +DG E+ F P G+T D K +LY+AD+ +IR+I + VTT+ G G+
Sbjct: 325 RGAEACIDGALAESAFAQPSGITTDGK-DLYIADSEVSSIRRISLENLQVTTLCGSGELF 383
Query: 176 VAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCE 225
G DG + + + V Y + L + D N ++++ + C+
Sbjct: 384 GFGDVDGVGAEVRLQHCLGVEYFQE--QLWIADTYNHKIKRVDPHTSICQ 431
>gi|290982063|ref|XP_002673750.1| predicted protein [Naegleria gruberi]
gi|284087336|gb|EFC41006.1| predicted protein [Naegleria gruberi]
Length = 1759
Score = 40.8 bits (94), Expect = 1.6, Method: Composition-based stats.
Identities = 32/112 (28%), Positives = 53/112 (47%), Gaps = 15/112 (13%)
Query: 116 GSFQGYTGHVDGKP---------NEARFNHPKGVTMDD-KGNLYVADTLNLAIRKIGDAG 165
G Y G DG P ++A+ N+P VT+D+ NL +ADT N +R +
Sbjct: 67 GIISTYAG--DGSPAYSGDGVLASQAKLNYPNFVTIDNLSNNLLIADTANFVVRYVNRTT 124
Query: 166 --VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALR 215
++TIAG N PS+ + FS+ F + ++ ++D N+ +R
Sbjct: 125 RIISTIAGYNVNGNSGDGLPSKQSSFSSLFGIT-TDMEGNVYIVDAPNSVVR 175
>gi|167839136|ref|ZP_02465913.1| hypothetical protein Bpse38_21319 [Burkholderia thailandensis
MSMB43]
gi|424905350|ref|ZP_18328857.1| hypothetical protein A33K_16751 [Burkholderia thailandensis MSMB43]
gi|390929744|gb|EIP87147.1| hypothetical protein A33K_16751 [Burkholderia thailandensis MSMB43]
Length = 732
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 39/84 (46%), Gaps = 12/84 (14%)
Query: 74 GVVPYKIRVSEDGELFAVDEVNSNIVKI------TPPLSQYSRGRLVAGSFQGYTGHVDG 127
GV+P + +S G+L D S + + P LS G L G+ G V G
Sbjct: 266 GVLPADLAISPSGQLLIADSGPSQQIYVYAKPAGVPVLS----GML--GTRSGIFHAVKG 319
Query: 128 KPNEARFNHPKGVTMDDKGNLYVA 151
P + RFN P G+ D GNLYV+
Sbjct: 320 APGDWRFNGPTGIGFDRGGNLYVS 343
>gi|260793565|ref|XP_002591782.1| hypothetical protein BRAFLDRAFT_123529 [Branchiostoma floridae]
gi|229276992|gb|EEN47793.1| hypothetical protein BRAFLDRAFT_123529 [Branchiostoma floridae]
Length = 535
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 61/142 (42%), Gaps = 29/142 (20%)
Query: 91 VDEVNSNIVKITP-------------PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHP 137
VD+ S ++K P L + G LV F G+ P + +F HP
Sbjct: 338 VDQNRSGLIKFRPYGGIGELVKLGVGELDSRTVGELVV-KFGGW------GPQQGQFKHP 390
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 197
GV + G++ VADT N I+ GV A GF G ++DA FS+ DV
Sbjct: 391 GGVAVSSAGHIIVADTGNHRIQVFDSRGVFLRA------FGFY-GSADDA-FSHPHDVAM 442
Query: 198 VRPTCSLLVIDRGNAALRQISL 219
+LV D+GN ++ +L
Sbjct: 443 TTDD-RILVTDKGNKLVKLFTL 463
>gi|425458112|ref|ZP_18837700.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9807]
gi|389800280|emb|CCI20342.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9807]
Length = 342
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 37/90 (41%), Gaps = 13/90 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I V G ++ D N+ + +G FQ G +FN
Sbjct: 197 PYGIAVGSGGNIYVADTFNNRVQVFNS-----------SGVFQSTFGSF--GTGNGQFNS 243
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGV 166
P G+ +D GN+YV DT N ++ +GV
Sbjct: 244 PYGIAVDSSGNVYVTDTFNDRVQVFNPSGV 273
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 33/79 (41%), Gaps = 13/79 (16%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I V G ++ D N + P +G FQ G E F+
Sbjct: 244 PYGIAVDSSGNVYVTDTFNDRVQVFNP-----------SGVFQSTFGSFGTGNGE--FSS 290
Query: 137 PKGVTMDDKGNLYVADTLN 155
P G+ + +GN+YVADT N
Sbjct: 291 PYGIAVGSRGNIYVADTAN 309
>gi|290974456|ref|XP_002669961.1| predicted protein [Naegleria gruberi]
gi|284083515|gb|EFC37217.1| predicted protein [Naegleria gruberi]
Length = 1106
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 47/96 (48%), Gaps = 13/96 (13%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQ--GYTGHVDGKPNEARFN 135
+ VS D ELF D ++ I K++ S G + +AG Q GY + P E+ N
Sbjct: 785 VVVSPDNELFIADSMDHTIRKVS------SDGIVSTIAGVSQIHGYNLY---DPQESILN 835
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
P ++ G+L+ D N IRKI + VTTI G
Sbjct: 836 GPTSLSSAKNGDLFFIDQKNYVIRKISNGIVTTIIG 871
Score = 38.5 bits (88), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 50/118 (42%), Gaps = 21/118 (17%)
Query: 112 RLVAGSFQGYTG-HVDG-KPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTI 169
++ GS G H DG EARF V + L++AD+++ IRK+ G+ +
Sbjct: 755 KISNGSLSNVAGSHGDGLNALEARFMDISSVVVSPDNELFIADSMDHTIRKVSSDGIVST 814
Query: 170 AGGKSNVAGFR---------DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G S + G+ +GP+ + N L ID+ N +R+IS
Sbjct: 815 IAGVSQIHGYNLYDPQESILNGPTSLSSAKN----------GDLFFIDQKNYVIRKIS 862
>gi|308159172|gb|EFO61716.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia P15]
Length = 967
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 48/102 (47%), Gaps = 7/102 (6%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDD-KGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNV 176
G H+DG A FN P + D G +Y+ DT N IR + V T+A G V
Sbjct: 126 GEAAHLDGGFLHAAFNFPHSIVSDPVNGIIYLTDTKNHCIRTLSLMTRKVKTLA-GTPGV 184
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G++DG + F+ +V +L+V D N ALR IS
Sbjct: 185 FGYKDG--LNPLFNEPLGLVLTEDG-NLIVCDSKNGALRYIS 223
>gi|206603181|gb|EDZ39661.1| NHL repeat domain protein [Leptospirillum sp. Group II '5-way CG']
Length = 494
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 2/80 (2%)
Query: 83 SEDGELFAVDEVNSNIVKITP-PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVT 141
S+ ++ VD S++ + P P SR + G G+ DG EAR HP G+
Sbjct: 358 SDGNRIYFVDSETSSVRVLHPGPSPWQSRIETLVGRGLFEFGNRDGSFQEARLQHPLGIL 417
Query: 142 MDDKGNLYVADTLNLAIRKI 161
DD G L VADT N IR +
Sbjct: 418 WDD-GLLLVADTYNHRIRAL 436
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 44/86 (51%), Gaps = 5/86 (5%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTM--DDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GK 173
QG G DG ++A+F P+GV DD VADT N +R++ VTTIAG G
Sbjct: 220 QGIPGSADGSFDQAQFRDPQGVRFCPDDPDTAIVADTGNHLLRRVDFRRRSVTTIAGTGV 279
Query: 174 SNVAGFRDGPSEDAKFSNDFDVVYVR 199
A F + A ++ +D+++ R
Sbjct: 280 QGWAIFEPVSAMSAVLNSPWDILFHR 305
>gi|403363814|gb|EJY81656.1| NHL repeat-containing protein 2 [Oxytricha trifallax]
Length = 478
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 70/148 (47%), Gaps = 15/148 (10%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTM-- 142
D L + + +N + I + +G V G+ G G VDG E F+HP+G+
Sbjct: 104 DNNLMIISDTGNNRLIIVNEETLECQG--VIGN--GKIGLVDGNYEEGSFHHPQGMCHIY 159
Query: 143 -DDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGF-RDGPSED--AKFSNDFDVV 196
D+ +Y+ DT N AIR+I V T+ G + GF R+G E K S+ +D+V
Sbjct: 160 RDNMHFIYLCDTKNHAIREINLTKKEVLTVIG--TGEKGFDREGNKEPELQKLSSPWDIV 217
Query: 197 YVRPTCSLLVIDRGNAALRQISLNQDDC 224
V +LL+ G + ++L + C
Sbjct: 218 AVNRD-TLLIAMAGVHQIWALNLKTNRC 244
>gi|422325157|ref|ZP_16406193.1| hypothetical protein HMPREF0737_01303 [Rothia mucilaginosa M508]
gi|353343435|gb|EHB87752.1| hypothetical protein HMPREF0737_01303 [Rothia mucilaginosa M508]
Length = 656
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 22/114 (19%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKG-------NLYVADTLNLAIRKIG--DAGVTTIAG-- 171
GHVDG EARFN P G+ + + ++ VADT+N +R + V T+AG
Sbjct: 240 GHVDGTGTEARFNEPAGLALVPENLREQLGYDVLVADTVNHRLRSLNLRTGEVCTLAGNG 299
Query: 172 ------GKSNVAGFRDG-----PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
G + V G + P+ D S+ +D VY R ++ G L
Sbjct: 300 VQRVIDGDNAVTGDPNHIPAGVPATDIALSSPWDAVYSREAGQFIIAMAGTHQL 353
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 52/108 (48%), Gaps = 7/108 (6%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGV-TTI 169
+ AGS G G DG +A F P G+ G+L+VA + +R I D GV T
Sbjct: 365 IFAGS--GVEGLADGPAADAWFAQPSGIIEARDGSLWVACSETSGLRHITFTDNGVQVTS 422
Query: 170 AGGKSNVA-GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
A GK GF DG S+ A+ + + + P S+ V D N A+R+
Sbjct: 423 AVGKGLFDFGFVDGDSDTARMQHPLGLTEL-PDGSIAVADTYNGAIRR 469
>gi|285018581|ref|YP_003376292.1| nhl repeat protein [Xanthomonas albilineans GPE PC73]
gi|283473799|emb|CBA16302.1| hypothetical nhl repeat protein [Xanthomonas albilineans GPE PC73]
Length = 473
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 8/85 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + ED L+ D + I++ + GRL+ G +DG+ +A F+
Sbjct: 170 PSGLALYED-RLYIADSGHHRILECN------TGGRLLRQFGTGTADFIDGELGQAAFHR 222
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P+G+ + +G LYVADT N A+R+I
Sbjct: 223 PQGLAL-QRGVLYVADTGNHALRRI 246
>gi|219853190|ref|YP_002467622.1| SMP-30/Gluconolaconase/LRE domain-containing protein
[Methanosphaerula palustris E1-9c]
gi|219547449|gb|ACL17899.1| SMP-30/Gluconolaconase/LRE domain protein [Methanosphaerula
palustris E1-9c]
Length = 343
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 40/84 (47%), Gaps = 17/84 (20%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA--GSFQGYTGHVDGKPNEARFNHP 137
I V G ++ DE NS + K S G + GS+ P + +F P
Sbjct: 69 IAVDSAGNVYVTDEHNSRVQKFD------STGIFITKWGSY---------GPGDGQFKSP 113
Query: 138 KGVTMDDKGNLYVADTLNLAIRKI 161
+G+ +D+ GN+YV DT+N I+K
Sbjct: 114 EGIAVDNAGNVYVVDTVNNQIQKF 137
>gi|374855656|dbj|BAL58511.1| NHL repeat containing protein [uncultured candidate division OP1
bacterium]
Length = 419
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 57/143 (39%), Gaps = 22/143 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I VS G ++ +D N + K + GS+ G G + + +F
Sbjct: 298 PNDIAVSNQGIVYVLDTGNGRVQKF-----------MADGSYLGQWGSL--GERDGQFQA 344
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D G +YVADT N I+K G I G D +F D+
Sbjct: 345 PLGIAVDTYGFVYVADTGNDRIQKFSSDGAFIIKWGSLGAG--------DGQFDQPVDLA 396
Query: 197 YVRPTCSLLVIDRGNAALRQISL 219
V P + V + GN +++I
Sbjct: 397 -VDPEGVIYVAEEGNHRIQKIKF 418
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 62/154 (40%), Gaps = 17/154 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G ++ D N + K T G+F G + E +F+
Sbjct: 204 PLGIAVDFQGFVYVADAGNQRVQKFTE-----------GGAFVAKWGSL--GTGEGQFSF 250
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P GV D GN+YVAD +N I+K GV G A + D SN +V
Sbjct: 251 PTGVATDPAGNVYVADKVNNRIQKFSPTGVLIARWGTFGQAESQFNSPNDIAVSNQ-GIV 309
Query: 197 YVRPTCSLLV---IDRGNAALRQISLNQDDCEYQ 227
YV T + V + G+ + SL + D ++Q
Sbjct: 310 YVLDTGNGRVQKFMADGSYLGQWGSLGERDGQFQ 343
>gi|290974156|ref|XP_002669812.1| predicted protein [Naegleria gruberi]
gi|284083364|gb|EFC37068.1| predicted protein [Naegleria gruberi]
Length = 724
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 49/94 (52%), Gaps = 8/94 (8%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGS--FQGYTGHVDGKPNEARFNHPKGVTMD 143
G+++ + + I KIT + S VAG+ + GY+G +G N+A+ + P G+ +D
Sbjct: 225 GDIYWAEYGTNRIRKITSSTNIIST---VAGTVIYLGYSGD-NGPANQAQLSGPYGMVID 280
Query: 144 DKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSN 175
GNLY + + IR + ++TIAG N
Sbjct: 281 KAGNLYFTEVWGMRIRFVSAQSGNISTIAGNGQN 314
>gi|84623828|ref|YP_451200.1| hypothetical protein XOO_2171 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|84367768|dbj|BAE68926.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 499
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 69/153 (45%), Gaps = 19/153 (12%)
Query: 9 TLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
TL LL A+ + Q G L K LT+L + + P +D ++ + E
Sbjct: 139 TLVLLDAYGRERQRLVGVTGDLDKALTALCE------GQQPPSDADLYG-----VAERDP 187
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
E + P + +ED L+ D + +++ T GR++ G +DG
Sbjct: 188 EPHLALRFPTGLAATED-LLYVADTGHHRVLECT------HSGRVLRQFGHGNADLIDGG 240
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
EA F P+G+ + ++ LYVADT N A+R+I
Sbjct: 241 VGEAAFRRPQGLAL-ERDQLYVADTGNHALRRI 272
>gi|58581916|ref|YP_200932.1| hypothetical protein XOO2293 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58426510|gb|AAW75547.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 472
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 69/153 (45%), Gaps = 19/153 (12%)
Query: 9 TLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
TL LL A+ + Q G L K LT+L + + P +D ++ + E
Sbjct: 112 TLVLLDAYGRERQRLVGVTGDLDKALTALCE------GQQPPSDADLYG-----VAERDP 160
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
E + P + +ED L+ D + +++ T GR++ G +DG
Sbjct: 161 EPHLALRFPTGLAATED-LLYVADTGHHRVLECT------HSGRVLRQFGHGNADLIDGG 213
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
EA F P+G+ + ++ LYVADT N A+R+I
Sbjct: 214 VGEAAFRRPQGLAL-ERDQLYVADTGNHALRRI 245
>gi|188576512|ref|YP_001913441.1| hypothetical protein PXO_00583 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|188520964|gb|ACD58909.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
PXO99A]
Length = 472
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 69/153 (45%), Gaps = 19/153 (12%)
Query: 9 TLTLLIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVI 68
TL LL A+ + Q G L K LT+L + + P +D ++ + E
Sbjct: 112 TLVLLDAYGRERQRLVGVTGDLDKALTALCE------GQQPPSDADLYG-----VAERDP 160
Query: 69 EGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK 128
E + P + +ED L+ D + +++ T GR++ G +DG
Sbjct: 161 EPHLALRFPTGLAATED-LLYVADTGHHRVLECT------HSGRVLRQFGHGNADLIDGG 213
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
EA F P+G+ + ++ LYVADT N A+R+I
Sbjct: 214 VGEAAFRRPQGLAL-ERDQLYVADTGNHALRRI 245
>gi|134293028|ref|YP_001116764.1| NHL repeat-containing protein [Burkholderia vietnamiensis G4]
gi|134136185|gb|ABO57299.1| NHL repeat containing protein [Burkholderia vietnamiensis G4]
Length = 283
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 8/107 (7%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
EA FNHP V + +G++YVAD G+A V A S + G+ +FS
Sbjct: 120 EAPFNHPTDVAVGAEGDIYVAD-------GYGNAQVHRFASDGSYLNGWGTPGISAGEFS 172
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDIL 237
++V P +LV+DR N ++ N + + P DI
Sbjct: 173 TPH-AIWVLPDERVLVVDRDNDRIQVFDRNGAVLDEWRGLVRPMDIW 218
>gi|449670570|ref|XP_002158620.2| PREDICTED: NHL repeat-containing protein 2-like [Hydra
magnipapillata]
Length = 629
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 51/99 (51%), Gaps = 7/99 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIR--KIGDAGVTTIAGGKSNVAGFR 180
G VDG A+ HP V + G+LYVAD+ N I+ + + TT+AG + V G +
Sbjct: 449 GDVDGIGLNAKLQHPLDVAAVN-GHLYVADSYNHKIKVVNMSNLSCTTLAG--NGVVGIK 505
Query: 181 DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
+G A+F+ ++ + T + + D N +R ++L
Sbjct: 506 NGDFLSAEFNEPGGIICLDDT--IYIADTNNHLIRVLNL 542
>gi|342182683|emb|CCC92162.1| putative flagellum-adhesion glycoprotein [Trypanosoma congolense
IL3000]
Length = 585
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 4/82 (4%)
Query: 158 IRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
IR + G+ TIAG K+ V G +DGP+ A F+N V + + V DR N +R+I
Sbjct: 97 IRSVNTTGIDTIAGSKT-VRGNQDGPAAAALFNNPTSVAGIGD--DIFVADRDNKCIRRI 153
Query: 218 SLNQDDCEY-QYNSISPTDILM 238
+ + + Q PTDI++
Sbjct: 154 DASGNVTTFGQVTLNQPTDIIV 175
>gi|373955430|ref|ZP_09615390.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
gi|373892030|gb|EHQ27927.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
Length = 434
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
G +D N A FN G+ D +GN+YV D+ +RKI V+TIAG
Sbjct: 337 GQLDAAKNTASFNSIAGICTDSEGNIYVVDSNTNLVRKINRKSGYVSTIAG 387
>gi|170739674|ref|YP_001768329.1| NHL repeat-containing protein [Methylobacterium sp. 4-46]
gi|168193948|gb|ACA15895.1| NHL repeat containing protein [Methylobacterium sp. 4-46]
Length = 319
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 56/135 (41%), Gaps = 21/135 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P +VS DG LF D N + P GRL+A G TG P E F
Sbjct: 192 PEAAKVSPDGRLFVTDLRNDRLQVFDP------EGRLLA--VWGRTGT---GPGE--FRS 238
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ D GN+YV++ N ++ +G G G V G S FSN V+
Sbjct: 239 PAGLGFDRDGNVYVSEIGNSRVQVLGPDGRFL---GMWGVPG-----SAPGAFSNLHGVL 290
Query: 197 YVRPTCSLLVIDRGN 211
R T + V D GN
Sbjct: 291 VDRDTGLVYVADTGN 305
>gi|326781271|ref|ZP_08240536.1| Ricin B lectin [Streptomyces griseus XylebKG-1]
gi|326661604|gb|EGE46450.1| Ricin B lectin [Streptomyces griseus XylebKG-1]
Length = 508
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 62/123 (50%), Gaps = 11/123 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
PY++ VS G L+ D N + +IT + G++ VAG+ + G A+
Sbjct: 44 PYEMAVSGTGILYVSDYSNHRVRRIT------TDGKIGTVAGTGAAGSAGDGGPATRAQL 97
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAG-GKSNVAGFRDGPSEDAKFSND 192
N P+ + +D G LY+AD +R++G GV T+AG G + AG GP+ A+ +
Sbjct: 98 NCPRQIAVDGDGALYIADAGGNRVRRVGADGVIVTVAGTGVAGSAG-DGGPAAKAQLNKP 156
Query: 193 FDV 195
F V
Sbjct: 157 FGV 159
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 47/94 (50%), Gaps = 8/94 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV-DGKPN-EARF 134
P +I V DG L+ D + + ++ + G +V + G G DG P +A+
Sbjct: 100 PRQIAVDGDGALYIADAGGNRVRRVG------ADGVIVTVAGTGVAGSAGDGGPAAKAQL 153
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
N P GV +D +G LYVA+ +R++G GV +
Sbjct: 154 NKPFGVAVDGEGVLYVAEFGGHRVRRVGADGVIS 187
>gi|428208104|ref|YP_007092457.1| NHL repeat containing protein [Chroococcidiopsis thermalis PCC
7203]
gi|428010025|gb|AFY88588.1| NHL repeat containing protein [Chroococcidiopsis thermalis PCC
7203]
Length = 386
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 60/135 (44%), Gaps = 21/135 (15%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G L+A D VN+N + + P Y + G F +G ++G+P F P G+ D+K
Sbjct: 115 GNLYAGD-VNNNRINVFDPQGNYVN-SIAQGQF---SGLIEGRP----FFGPSGIVFDNK 165
Query: 146 GNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 203
GN YV D + I K + VT G G GPS + VR +
Sbjct: 166 GNGYVGDYSSDRILKFNSSSGEVTGSIGSSGTTPGQFQGPSG----------IAVRSDGN 215
Query: 204 LLVIDRGNAALRQIS 218
L+V D+ N ++ +S
Sbjct: 216 LVVTDQFNNRIQVVS 230
>gi|195356025|ref|XP_002044483.1| GM15465 [Drosophila sechellia]
gi|194131671|gb|EDW53697.1| GM15465 [Drosophila sechellia]
Length = 320
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKI 161
G +DG+ A+ HP GVT +D N LYVADT N I+ I
Sbjct: 74 GDIDGRLFSAKLQHPLGVTFNDANNRLYVADTYNHKIKII 113
>gi|398784412|ref|ZP_10547676.1| redoxin domain-containing protein [Streptomyces auratus AGR0001]
gi|396995335|gb|EJJ06353.1| redoxin domain-containing protein [Streptomyces auratus AGR0001]
Length = 602
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
GH DG +A HP GVT G++ +ADT N A+R+ A VTT+A
Sbjct: 386 GHRDGAAGQALLQHPLGVTALPDGSVAIADTYNHALRRFDPASGEVTTLA 435
>gi|290983130|ref|XP_002674282.1| predicted protein [Naegleria gruberi]
gi|284087871|gb|EFC41538.1| predicted protein [Naegleria gruberi]
Length = 1572
Score = 40.4 bits (93), Expect = 2.3, Method: Composition-based stats.
Identities = 29/120 (24%), Positives = 57/120 (47%), Gaps = 3/120 (2%)
Query: 102 TPPLSQYSRGRLVAGSFQGYTGHV-DG-KPNEARFNHPKGVTMDDKGNLYVADTLNLAIR 159
TPP +YS G+ Q + DG + + + ++ K + G++Y+AD AIR
Sbjct: 28 TPPNVKYSVSTFAGGNSQLFQQRFGDGFESSMSCLSYIKDIIPTKDGSMYIADASMAAIR 87
Query: 160 KIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISL 219
KI + G+ ++ G DG + + F +V + +++ I +GN +R++ +
Sbjct: 88 KIDNNGIISLVAGNYRKTIPGDGSALSVSVISPFSMVLTQDE-TIMYIAQGNGCIRKLDI 146
>gi|255327417|ref|ZP_05368491.1| NHL repeat containing protein [Rothia mucilaginosa ATCC 25296]
gi|255295697|gb|EET75040.1| NHL repeat containing protein [Rothia mucilaginosa ATCC 25296]
Length = 656
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 48/114 (42%), Gaps = 22/114 (19%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKG-------NLYVADTLNL----------AIRKIGDAG 165
GHVDG EARFN P G+ + + ++ VADT+N +R + G
Sbjct: 240 GHVDGTGTEARFNEPAGLALVPENLREQLGYDVLVADTVNHRLRSLNLRTGEVRTLAGNG 299
Query: 166 VTTIAGGKSNVAGFRDG-----PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
V + G + + G + P+ D S+ +D VY R ++ G L
Sbjct: 300 VQRVIDGDNAITGDPNHIPAGVPATDIALSSPWDAVYSREAGQFIIAMAGTHQL 353
Score = 38.9 bits (89), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 51/108 (47%), Gaps = 7/108 (6%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG---DAGVTTI 169
+ AGS G G DG ++A F P G+ G+L+VA + +R I D T
Sbjct: 365 IFAGS--GVEGLADGPADDAWFAQPSGIIEARDGSLWVACSETSGLRHITFTEDGVQVTS 422
Query: 170 AGGKSNVA-GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ 216
A GK GF DG S+ A+ + + + P S+ V D N A+R+
Sbjct: 423 AVGKGLFDFGFVDGDSDTARMQHPLGLTEL-PDGSIAVADTYNGAIRR 469
>gi|182440597|ref|YP_001828316.1| hypothetical protein SGR_6804 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|178469113|dbj|BAG23633.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC
13350]
Length = 508
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 13/124 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRL--VAGSFQGYTGHVDGKPNEARF 134
PY++ VS G L+ D N + +IT + G++ VAG+ + G A+
Sbjct: 44 PYEMAVSGTGILYVSDYSNHRVRRIT------TDGKIGTVAGTGAAGSAGDGGPATRAQL 97
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFR--DGPSEDAKFSN 191
N P+ + +D G LY+AD +R++G GV T+AG + VAG GP+ A+ +
Sbjct: 98 NCPRQIAVDGDGALYIADAGGNRVRRVGADGVMVTVAG--TGVAGSAGDGGPAAKAQLNK 155
Query: 192 DFDV 195
F V
Sbjct: 156 PFGV 159
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 47/94 (50%), Gaps = 8/94 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV-DGKPN-EARF 134
P +I V DG L+ D + + ++ + G +V + G G DG P +A+
Sbjct: 100 PRQIAVDGDGALYIADAGGNRVRRVG------ADGVMVTVAGTGVAGSAGDGGPAAKAQL 153
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
N P GV +D +G LYVA+ +R++G GV +
Sbjct: 154 NKPFGVAVDGEGVLYVAEFGGHRVRRVGADGVIS 187
>gi|374855491|dbj|BAL58347.1| NHL repeat containing protein [uncultured candidate division OP1
bacterium]
Length = 325
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 41/85 (48%), Gaps = 13/85 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I +DG L+ D N + K P G+ + GSF G G DG+ FN
Sbjct: 158 PIGIAFDKDGNLYVTDAFNHRVQKFDPT------GKFL-GSF-GSFGSGDGQ-----FNV 204
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
G+ +D +GNLYV+D N ++K
Sbjct: 205 TAGIAIDTEGNLYVSDNKNDRVQKF 229
>gi|389844275|ref|YP_006346355.1| NHL repeat protein [Mesotoga prima MesG1.Ag.4.2]
gi|387859021|gb|AFK07112.1| NHL repeat protein [Mesotoga prima MesG1.Ag.4.2]
Length = 496
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 114 VAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTI 169
VAGS G+VDG + F HP G+ + + LYVADT N AIRKI D G+ +
Sbjct: 376 VAGSGLFSFGYVDGILKRSLFQHPIGIHGEGRF-LYVADTYNHAIRKI-DLGIRRV 429
Score = 38.5 bits (88), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 60/120 (50%), Gaps = 17/120 (14%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
DGE + + N++ + +T LS + + G +G DG AR N P+G+ +
Sbjct: 181 DGENLFISDTNNDRILLTE-LSTPFVAKTIDQIGSGLSGLEDGPFENARLNKPQGIVYSN 239
Query: 145 KGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRD------GPSEDAKFSNDFDV 195
G LYVADT N A+R I D ++T++G GF+D G + A+ ++ +D+
Sbjct: 240 -GRLYVADTENHALR-IADMNQRCLSTLSGD-----GFQDNDWNYNGDASKARLNSPWDL 292
>gi|290969880|ref|XP_002667977.1| predicted protein [Naegleria gruberi]
gi|284080923|gb|EFC35233.1| predicted protein [Naegleria gruberi]
Length = 366
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 47/112 (41%), Gaps = 28/112 (25%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV----------- 125
PY + ++ DG +F D+ N I +I P GS + Y G V
Sbjct: 143 PYGMALAADGTMFVADKANHIIRRINPD-----------GSVELYAGTVGTQCPSATASP 191
Query: 126 ---DGKPN-EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG 171
DG F +P V +D N+YV D N IRKI ++ V+T+ G
Sbjct: 192 ACGDGSDRLSVTFTNPNAVIVDSNNNVYVFDYGNFRIRKIDNSTGLVSTLVG 243
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 63/142 (44%), Gaps = 20/142 (14%)
Query: 94 VNSNIVKITPPLSQYSR-----GRLV---AGSFQGYTGHVDGK-PNEARFNHPKGVTMDD 144
V+S V IT ++ R LV AGS G +G +G P ARFN+P G+ +
Sbjct: 92 VDSTFVYITDSMNHCIRKINKISNLVYTHAGS-CGTSGSTNGNTPTAARFNNPYGMALAA 150
Query: 145 KGNLYVADTLNLAIRKIGDAGVTTIAGGK------SNVAGFRDGPSED---AKFSNDFDV 195
G ++VAD N IR+I G + G S A G D F+N +
Sbjct: 151 DGTMFVADKANHIIRRINPDGSVELYAGTVGTQCPSATASPACGDGSDRLSVTFTNP-NA 209
Query: 196 VYVRPTCSLLVIDRGNAALRQI 217
V V ++ V D GN +R+I
Sbjct: 210 VIVDSNNNVYVFDYGNFRIRKI 231
>gi|423216521|ref|ZP_17203044.1| hypothetical protein HMPREF1074_04576 [Bacteroides xylanisolvens
CL03T12C04]
gi|392690731|gb|EIY83987.1| hypothetical protein HMPREF1074_04576 [Bacteroides xylanisolvens
CL03T12C04]
Length = 589
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 43/89 (48%), Gaps = 5/89 (5%)
Query: 74 GVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
G+V K R +++ D N+ I +I P + Y+ + + G G+ DG A+
Sbjct: 499 GIVVDKTR----NDIYFTDGYNNKIRRIRPGKNGYTDATVSTIAGSGTKGNQDGDGATAQ 554
Query: 134 FNHPKGVTMDDKGN-LYVADTLNLAIRKI 161
P G+TM GN +Y +D N IRKI
Sbjct: 555 LAMPHGITMTADGNTIYFSDLDNYIIRKI 583
>gi|428177953|gb|EKX46830.1| hypothetical protein GUITHDRAFT_70199, partial [Guillardia theta
CCMP2712]
Length = 305
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 61/149 (40%), Gaps = 9/149 (6%)
Query: 77 PYKIRVSEDGELFAV-DEVNSNIVKITPPLSQYSRGRLVAGSF--QGYTGHVDGKPNEAR 133
P + +S D V D N I +I +S + + G G DG A+
Sbjct: 41 PSDVALSPDESFLVVADSRNHVIRRINKGAQGWSSNDVTVTTIAGNGQPGLADGSATTAQ 100
Query: 134 FNHPKGVTMDDKGN-LYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
F P GV + G + VAD+ N IRK+ V TIAG G++DG +KF
Sbjct: 101 FQKPLGVDISPDGTWVAVADSANNRIRKVTLATGNVETIAGFTQ---GYQDGVGTLSKFW 157
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISL 219
V + L V D N +R ISL
Sbjct: 158 LPASVSFHPQGTGLAVADTNNNRIRWISL 186
>gi|260792760|ref|XP_002591382.1| hypothetical protein BRAFLDRAFT_86889 [Branchiostoma floridae]
gi|229276587|gb|EEN47393.1| hypothetical protein BRAFLDRAFT_86889 [Branchiostoma floridae]
Length = 888
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 60/141 (42%), Gaps = 18/141 (12%)
Query: 13 LIAFTLQFQAHAAPAGPLIKHLTSLLKWTTRSSSKAPQADGNVLQFENGYLVETVIEGNE 72
L+ Q Q P++ H+ SLL T+ + KA + +EG+E
Sbjct: 474 LVRTVGQKQGMKPADAPVLGHVKSLLPSTSSNVQKA-------------QTITLGVEGSE 520
Query: 73 IGVV--PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPN 130
G P + VS++GE+F D N I T + + SF+ ++ + +
Sbjct: 521 SGEFNRPLGVAVSDEGEIFVADCYNQRIQVFTLQGTFVCHFLTIVPSFRLFSFPLQFQQT 580
Query: 131 EARFNHPKGVTMDDKGNLYVA 151
+ HP+ V +D KGNL+V
Sbjct: 581 ---YMHPQSVAIDGKGNLWVV 598
>gi|116620267|ref|YP_822423.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223429|gb|ABJ82138.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 380
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 47/82 (57%), Gaps = 12/82 (14%)
Query: 114 VAGSFQ-GYTGHVDGKPNE-ARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTI 169
VAG+ + GY+G DG P + A+ + PKG+ G+LY+ADT + IR++ +TT+
Sbjct: 276 VAGTGESGYSG--DGGPAKLAKLSGPKGIAWAPDGSLYLADTESHTIRRVDLKSGVITTV 333
Query: 170 AG-GKSNVAGFRDGPSEDAKFS 190
AG GK DGP DA+
Sbjct: 334 AGTGKRG-----DGPDGDARMC 350
>gi|21231560|ref|NP_637477.1| hypothetical protein XCC2116 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66768320|ref|YP_243082.1| hypothetical protein XC_1999 [Xanthomonas campestris pv. campestris
str. 8004]
gi|21113245|gb|AAM41401.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66573652|gb|AAY49062.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 499
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 8/85 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +ED L+ D + I++ T GR++ G +DG EA F
Sbjct: 196 PTGLAATED-RLYIADTGHHRILECT------HSGRVLRQFGHGNADLIDGGVGEAAFRR 248
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P+G+ ++ + LYVADT N A+R+I
Sbjct: 249 PQGLALERE-ELYVADTGNHALRRI 272
>gi|397779898|ref|YP_006544371.1| NHL repeat-containing protein 2 [Methanoculleus bourgensis MS2]
gi|396938400|emb|CCJ35655.1| NHL repeat-containing protein 2 [Methanoculleus bourgensis MS2]
Length = 483
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 65/137 (47%), Gaps = 13/137 (9%)
Query: 82 VSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
+ DGE L+ D S I I + + L+ S + + G +D AR +HP G+
Sbjct: 341 IVTDGEALYIADSGASAIRHIQRGVVE----TLIGHSLEDF-GDLDTIARMARIHHPMGI 395
Query: 141 TMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 198
+G LY+ADT N I+++ G V T G S G+++G S DA+ S +V +
Sbjct: 396 A-SHQGLLYIADTYNHKIKELDPGTGWVLTRVG--SGDRGYQNGVSGDARLSEPGGLVNL 452
Query: 199 RPTCSLLVIDRGNAALR 215
+ D GN A+R
Sbjct: 453 GGLW--YIADTGNHAVR 467
Score = 38.1 bits (87), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDK-GNLYVADTLNLAIRKIG--DAGVTTIAG 171
G G+ DG +EA F P+G+ D++ G LYVAD N IR++ D V T+AG
Sbjct: 208 GAPGNADGSSSEAAFYLPEGLAFDEEAGVLYVADAGNHTIRQVSWPDLRVETVAG 262
>gi|296124331|ref|YP_003632109.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
gi|296016671|gb|ADG69910.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
Length = 581
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 49/98 (50%), Gaps = 10/98 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ V E + + N N + + Q + V GS G G DG A+F+H
Sbjct: 250 PGKLLVDPAHERVFISDSNHNRIVVASLAGQLLK---VIGS--GKIGAKDGPAESAQFDH 304
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
P+G+ +D GN LYVADT N +R + V+T+AG
Sbjct: 305 PQGMALD--GNTLYVADTENHLLRTVNLTTWEVSTLAG 340
>gi|291242634|ref|XP_002741211.1| PREDICTED: tripartite motif protein 2-like [Saccoglossus
kowalevskii]
Length = 721
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 72/180 (40%), Gaps = 19/180 (10%)
Query: 70 GNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKP 129
GN+ P I +S + V + +++ ++I +Y + G+ Q
Sbjct: 547 GNQELTSPMGIAISPVNDRVYVSDCSAHCIRIYTQDGEYVKSFGSQGNGQ---------- 596
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
+F HP G+T+D+KGN+ VA+ N I+ GV + G A + D
Sbjct: 597 --CQFYHPWGMTIDNKGNVIVAEYNNHRIQVCNSDGVFLFSFGSQGSANNQFNGLWDVAC 654
Query: 190 SNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVT 249
ND ++ ++ D + +I +QD +Y PT I V G GYV
Sbjct: 655 DNDGNMYVCDHNNRVMKYDSHGVFVSRIDSDQDALQY------PTGI-SVTGDKPFGYVV 707
>gi|187250772|ref|YP_001875254.1| hypothetical protein Emin_0359 [Elusimicrobium minutum Pei191]
gi|186970932|gb|ACC97917.1| hypothetical protein Emin_0359 [Elusimicrobium minutum Pei191]
Length = 448
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 112 RLVAG-SFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTI 169
R VAG G+ G ++ ++P + +D+ LY+ D N IRK+ D +TT+
Sbjct: 118 RPVAGKGIAGFNGDDQLGALQSSLSNPCAIAVDNLSALYILDKGNKRIRKVFADGMITTL 177
Query: 170 AG-GKSNVAGFRDG-PSEDAKFSNDFDVVYVRPTCSLLVIDRG 210
AG G+S + F++G +ED +FS D + + P ++ +ID G
Sbjct: 178 AGNGRSGM--FQEGLVAEDFRFS-DLQDIALSPEGTIYIIDSG 217
>gi|384428019|ref|YP_005637378.1| hypothetical protein XCR_2380 [Xanthomonas campestris pv. raphani
756C]
gi|341937121|gb|AEL07260.1| conserved hypothetical protein [Xanthomonas campestris pv. raphani
756C]
Length = 472
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 8/85 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +ED L+ D + I++ T GR++ G +DG EA F
Sbjct: 169 PTGLAATED-RLYIADTGHHRILECT------HSGRVLRQFGHGNADLIDGGVGEAAFRR 221
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P+G+ + ++ LYVADT N A+R+I
Sbjct: 222 PQGLAL-EREELYVADTGNHALRRI 245
>gi|348508723|ref|XP_003441903.1| PREDICTED: NHL repeat-containing protein 2-like [Oreochromis
niloticus]
Length = 719
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 35/53 (66%), Gaps = 5/53 (9%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
+G DG +EA FN P+GV + KG+ +YVADT N IRKI + V+T+AG
Sbjct: 261 SGRKDGDVSEASFNSPQGVAI--KGDTVYVADTENHLIRKIDLLEGKVSTLAG 311
>gi|423454569|ref|ZP_17431422.1| hypothetical protein IEE_03313 [Bacillus cereus BAG5X1-1]
gi|401135538|gb|EJQ43135.1| hypothetical protein IEE_03313 [Bacillus cereus BAG5X1-1]
Length = 617
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 9/111 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA--GSFQGYTGHVDGKPNEARF 134
PY + +G +F D N I+K G++V GS G G + +F
Sbjct: 333 PYDVERDTNGNVFVSDSFNHRILKYDIS------GKIVGKWGSLFGAGGPLGFGSLPGQF 386
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA-GFRDGPS 184
P+ + MD N+YV+D++N I+K ++G+ + G V GF PS
Sbjct: 387 FVPRQIAMDRYNNVYVSDSVNHRIQKFTNSGIVLASYGSFGVLPGFFQFPS 437
>gi|290996889|ref|XP_002681014.1| predicted protein [Naegleria gruberi]
gi|284094637|gb|EFC48270.1| predicted protein [Naegleria gruberi]
Length = 1023
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 26/41 (63%), Gaps = 2/41 (4%)
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSN 175
PKG +D GN+Y+ADT N +RKI D + TIAG SN
Sbjct: 357 PKGAVIDSLGNIYIADTNNNRVRKISYLDGTINTIAGTGSN 397
Score = 38.5 bits (88), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 49/105 (46%), Gaps = 7/105 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + + + ++ D N I KI L + +V G+ G N + N+
Sbjct: 449 PVSVTIDSNDNVYIADTYNHRIRKI---LQNGNLTTIVGLGSSGFNGDYL-LSNGTKLNY 504
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR 180
P+ + D GN+Y+AD N IRK+ G + T+AG + V+G+
Sbjct: 505 PQSIAFDSNGNMYIADMNNNRIRKMLTNGTIITVAG--TGVSGYN 547
>gi|188991459|ref|YP_001903469.1| hypothetical protein xccb100_2064 [Xanthomonas campestris pv.
campestris str. B100]
gi|167733219|emb|CAP51417.1| Conserved hypothetical protein [Xanthomonas campestris pv.
campestris]
Length = 481
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 8/85 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +ED L+ D + I++ T GR++ G +DG EA F
Sbjct: 178 PTGLAATED-RLYIADTGHHRILECT------HSGRVLRQFGHGNADLIDGGVGEAAFRR 230
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P+G+ ++ + LYVADT N A+R+I
Sbjct: 231 PQGLALERE-ELYVADTGNHALRRI 254
>gi|290972869|ref|XP_002669173.1| predicted protein [Naegleria gruberi]
gi|284082716|gb|EFC36429.1| predicted protein [Naegleria gruberi]
Length = 1010
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 48/99 (48%), Gaps = 12/99 (12%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRG--RLVAG-SFQGYTGHVDGKPN-EA 132
P I + +LF D I ++ +S G L AG F GY+ DG ++
Sbjct: 67 PSGICAGQSDDLFISDSSRHVIFRM------FSNGTISLFAGIGFAGYSK--DGYSALDS 118
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
FN P G+ M GNLY+AD+ N IR + + V++I G
Sbjct: 119 LFNSPNGIAMSPNGNLYIADSQNDKIRIVSNGIVSSIDG 157
>gi|290997205|ref|XP_002681172.1| predicted protein [Naegleria gruberi]
gi|284094795|gb|EFC48428.1| predicted protein [Naegleria gruberi]
Length = 2212
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAK 188
++ + P+GV + G++Y+ D N IRKI G+ + G V+G+RDG + A+
Sbjct: 490 SQLSQPRGVCVGPTGDIYIVDAGNYVIRKIDSNGIISTFIGDG-VSGYRDGDALTAR 545
>gi|339239807|ref|XP_003375829.1| NHL repeat-containing domain protein [Trichinella spiralis]
gi|316975491|gb|EFV58926.1| NHL repeat-containing domain protein [Trichinella spiralis]
Length = 1266
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 126 DGKPNE-ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAG 171
DG P E A+ PKG+T D KGNLY+AD + + + D + TI G
Sbjct: 876 DGGPAEIAKLTFPKGITFDSKGNLYIADDRKIRMVTV-DGLIKTILG 921
>gi|407792655|ref|ZP_11139692.1| NHL repeat containing protein [Idiomarina xiamenensis 10-D-4]
gi|407217768|gb|EKE87600.1| NHL repeat containing protein [Idiomarina xiamenensis 10-D-4]
Length = 510
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 75/171 (43%), Gaps = 14/171 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ I + +D + A+ + I + +P + S ++AGS G +DG +A FN
Sbjct: 311 PWDITMYQDDVIIAMAGTH-QIWRYSPSQQRLS---ILAGS--GREALLDGTAKQAAFNQ 364
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P G+ + LYVAD AIR I V T+ G G +DG A +
Sbjct: 365 PSGLAVVGD-TLYVADAEASAIRAIDLTTQKVRTVVGQGLFEFGSKDGDFARALLQHPLA 423
Query: 195 VVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLV 245
V + P L+V D N LR + L Q +++ +D L+ G V +
Sbjct: 424 VAALTPD-QLVVADTYNHQLRLLDLGTQ----QVSTLHLSDSLLEPGGVSI 469
>gi|162450283|ref|YP_001612650.1| hypothetical protein sce2011 [Sorangium cellulosum So ce56]
gi|161160865|emb|CAN92170.1| hypothetical protein predicted by Glimmer/Critica [Sorangium
cellulosum So ce56]
Length = 2257
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 60/126 (47%), Gaps = 15/126 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHV--DGK-PNEAR 133
P ++ + DG ++ +DE N + KI+P G + + G G DG
Sbjct: 982 PRRLTIGPDGTIYFIDERNYRVRKISP------DGIITTVAGNGLLGTRPPDGVLATSGS 1035
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGG---KSNVAGFRDGPSEDAKF 189
G+ + GNLY+A N +IRK+ D +TT+AG ++V+ GP+ DA+F
Sbjct: 1036 SGSVYGLDVSSDGNLYLA--FNNSIRKVAPDGIITTVAGSWNTNASVSSGDGGPAVDARF 1093
Query: 190 SNDFDV 195
+D+
Sbjct: 1094 RLIYDM 1099
>gi|332212861|ref|XP_003255539.1| PREDICTED: NHL repeat-containing protein 2 [Nomascus leucogenys]
Length = 726
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 69/149 (46%), Gaps = 28/149 (18%)
Query: 68 IEGNEIGVVPYKIRVSEDGELF----AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG 123
I N+IG+ YK + LF VD+V + +V ++ R++ G
Sbjct: 204 IRDNKIGIKLYKDSLPPSPLLFPGKVTVDQVTNRLV-----IADTGHHRILVVWKNGQIQ 258
Query: 124 HVDGKPN---------EARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAG-VTTIAG- 171
+ G PN E+ FN P+GV + + +YVADT N IRKI +AG V+T+AG
Sbjct: 259 YSIGGPNPGRKDGIFSESTFNSPQGVAIMNN-IIYVADTENHLIRKIDLEAGKVSTVAGI 317
Query: 172 ---GKSNVAGFRDGPSEDAKFSNDFDVVY 197
G G + E S+ +DVV+
Sbjct: 318 GIQGTDKEGGAK---GEQQPISSPWDVVF 343
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 463 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 519
>gi|290969801|ref|XP_002667957.1| predicted protein [Naegleria gruberi]
gi|284080864|gb|EFC35213.1| predicted protein [Naegleria gruberi]
Length = 235
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 44/90 (48%), Gaps = 4/90 (4%)
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN---VAGFRDGPSEDAK 188
A+F++ + + + G L +AD N IRK+ G+ + G N V+ GP+ AK
Sbjct: 26 AKFDYLQHIAFNSLGELVIADVYNHRIRKVAVNGIVSTIAGSLNLNYVSSGDGGPATSAK 85
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
N + Y + L++ + N LR++S
Sbjct: 86 LINPYGFCYTKYN-ELIIAETNNNRLRKVS 114
>gi|221633398|ref|YP_002522623.1| hypothetical protein trd_1418 [Thermomicrobium roseum DSM 5159]
gi|221156519|gb|ACM05646.1| conserved hypothetical protein [Thermomicrobium roseum DSM 5159]
Length = 443
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 37/76 (48%), Gaps = 8/76 (10%)
Query: 87 ELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV-TMDDK 145
EL D N I ++ P + S + G GH DG P ARF P G+ T D+
Sbjct: 355 ELLVADTYNHKIKRLDPVARRCS-----SWLGTGQPGHEDGPPERARFWEPSGLATTFDR 409
Query: 146 GNLYVADTLNLAIRKI 161
+YVADT N A+R I
Sbjct: 410 --VYVADTNNHAVRVI 423
>gi|269837841|ref|YP_003320069.1| alkyl hydroperoxide reductase [Sphaerobacter thermophilus DSM
20745]
gi|269787104|gb|ACZ39247.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Sphaerobacter thermophilus DSM 20745]
Length = 494
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 47/109 (43%), Gaps = 4/109 (3%)
Query: 82 VSEDGELFAV-DEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGV 140
++ DG L V D S + +T P R + G F G VDG +E R HP V
Sbjct: 349 LATDGRLLYVADSETSAVRTVTFPPGDEVRTLVGTGLFD--FGDVDGIGDEVRLQHPLAV 406
Query: 141 TMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 189
D G LYVAD+ N I+++ A S G DG A+F
Sbjct: 407 ACGD-GVLYVADSYNHKIKRLDPATRRCETWLGSGEPGDADGTGTAARF 454
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 36/75 (48%), Gaps = 6/75 (8%)
Query: 85 DGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
DG L+ D N I ++ P R GS G G DG ARF+ P G+++
Sbjct: 410 DGVLYVADSYNHKIKRLDP---ATRRCETWLGS--GEPGDADGTGTAARFHEPGGLSLAG 464
Query: 145 KGNLYVADTLNLAIR 159
LY+ADT N AIR
Sbjct: 465 S-RLYIADTNNHAIR 478
>gi|403259474|ref|XP_003922237.1| PREDICTED: NHL repeat-containing protein 2 [Saimiri boliviensis
boliviensis]
Length = 726
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 28/149 (18%)
Query: 68 IEGNEIGVVPYKIRVSEDGELF----AVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTG 123
I ++IG+ PYK + LF VD+V + +V ++ R++ G
Sbjct: 204 IRDDKIGIKPYKDSLPPSPLLFPGKVTVDQVTNRLV-----IADTGHHRILVVWKNGQIQ 258
Query: 124 HVDGKPN---------EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG- 171
+ G PN E+ FN P+GV + + +YVADT N IRKI V+T+AG
Sbjct: 259 YSIGGPNPGRKDGIFSESTFNSPQGVAIMNN-IIYVADTENHLIRKIDLEAEKVSTVAGI 317
Query: 172 ---GKSNVAGFRDGPSEDAKFSNDFDVVY 197
G G + E S+ +DVV+
Sbjct: 318 GIQGTDKEGGAK---GEQQPISSPWDVVF 343
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 463 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 519
>gi|156406833|ref|XP_001641249.1| predicted protein [Nematostella vectensis]
gi|156228387|gb|EDO49186.1| predicted protein [Nematostella vectensis]
Length = 708
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 37/61 (60%), Gaps = 4/61 (6%)
Query: 114 VAGSFQGYT-GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIA 170
V G G++ G DG EA+F+ P+GV M + +YVADT N AIR+I VTT+A
Sbjct: 237 VIGGEDGFSAGFQDGTFKEAKFHAPQGVAMLGE-VIYVADTENHAIREINLDSKKVTTVA 295
Query: 171 G 171
G
Sbjct: 296 G 296
>gi|325917587|ref|ZP_08179787.1| NHL repeat protein [Xanthomonas vesicatoria ATCC 35937]
gi|325536221|gb|EGD08017.1| NHL repeat protein [Xanthomonas vesicatoria ATCC 35937]
Length = 472
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 51/102 (50%), Gaps = 11/102 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + +ED L+ D + +++ T GR++ G +DG EA F
Sbjct: 169 PTGLAATED-RLYIADTGHHRVLECT------HGGRVLRQFGHGNADLIDGGTGEAAFRR 221
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
P+G+ + ++ LYVADT N A+R+I + V T+ G G+S
Sbjct: 222 PQGLAL-ERDQLYVADTGNHALRRINLRNGQVDTLCGTGRSG 262
>gi|317419409|emb|CBN81446.1| NHL repeat-containing protein 2 [Dicentrarchus labrax]
Length = 719
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 35/53 (66%), Gaps = 5/53 (9%)
Query: 122 TGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
+G DG +EA FN P+GV + KG+ +YVADT N IRKI + V+T+AG
Sbjct: 261 SGRRDGDLSEASFNSPQGVAI--KGDTVYVADTENHLIRKIDLLEGRVSTLAG 311
>gi|163841552|ref|YP_001625957.1| hypothetical protein RSal33209_2819 [Renibacterium salmoninarum
ATCC 33209]
gi|162955028|gb|ABY24543.1| conserved hypothetical protein [Renibacterium salmoninarum ATCC
33209]
Length = 425
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 69/143 (48%), Gaps = 17/143 (11%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVA--DTLNLAIRKIGDAGV---- 166
++AG+ G G +DG ++A F G+ +D G+++VA +T L + ++ D V
Sbjct: 228 ILAGT--GLEGLLDGAADQAWFAQSSGLAVDANGDIWVAASETSALRVLRVTDGRVNRVE 285
Query: 167 TTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQ-------ISL 219
T + G + GF+DG + A+ + V + P S+ + D N A+R+ +S
Sbjct: 286 TAVGEGLFDF-GFQDGDASAARLQHPLGVAAL-PDGSVAIADTYNGAVRRYDPASKTVST 343
Query: 220 NQDDCEYQYNSISPTDILMVVGA 242
D + + D+L+VV A
Sbjct: 344 LARDLAEPSDVVLDGDLLIVVEA 366
>gi|348175195|ref|ZP_08882089.1| NHL repeat-containing protein [Saccharopolyspora spinosa NRRL
18395]
Length = 630
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 24/38 (63%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
GH DG ++A F HP GV + G++ V DT N A+R+
Sbjct: 412 GHADGTADKALFQHPLGVAVLPDGSVAVCDTYNGAVRR 449
>gi|406832294|ref|ZP_11091888.1| NHL repeat containing protein [Schlesneria paludicola DSM 18645]
Length = 707
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
G++DG +E RF HP G+ D G+L+VAD+ N +R++ V+T AG
Sbjct: 455 GNIDGSGDEVRFQHPLGLVHHD-GSLFVADSYNHQVRQVDLKTRAVSTWAG 504
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 35/63 (55%), Gaps = 3/63 (4%)
Query: 111 GRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTT 168
G+L+ G GH DG A+F+HP+G+T+ LYVADT N +R I V+T
Sbjct: 261 GKLIDVIGSGAIGHKDGGYAVAQFDHPQGMTLVGD-TLYVADTENHLLRAIDLKQKHVST 319
Query: 169 IAG 171
AG
Sbjct: 320 FAG 322
>gi|423555305|ref|ZP_17531608.1| hypothetical protein II3_00510 [Bacillus cereus MC67]
gi|401196709|gb|EJR03647.1| hypothetical protein II3_00510 [Bacillus cereus MC67]
Length = 617
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 9/111 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA--GSFQGYTGHVDGKPNEARF 134
PY + +G +F D N I+K G++V GS G G + +F
Sbjct: 333 PYDVERDTNGNVFVSDSFNHRILKYDIS------GKIVGKWGSLFGAGGPLGFGSLPGQF 386
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA-GFRDGPS 184
P+ + MD N+YV+D++N I+K ++G+ + G V GF PS
Sbjct: 387 FVPRQIAMDRYNNVYVSDSVNHRIQKFTNSGIVLASYGSFGVLPGFFQFPS 437
>gi|29830930|ref|NP_825564.1| hypothetical protein SAV_4387 [Streptomyces avermitilis MA-4680]
gi|29608043|dbj|BAC72099.1| hypothetical protein SAV_4387 [Streptomyces avermitilis MA-4680]
Length = 294
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 62/139 (44%), Gaps = 8/139 (5%)
Query: 109 SRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVT 167
+ G + G+ G VDG EA F P G+ L++AD+ A+R + D V
Sbjct: 10 AAGTVEVGAGTTNEGLVDGPGEEAWFAQPSGLAA-TADRLWLADSETSALRWVDLDGAVH 68
Query: 168 TIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQ 227
T G G RDG +E A + V + P S+ V D N ALR+ D +
Sbjct: 69 TAVGTGLFDFGHRDGDAEQALLQHPLGVTAL-PDGSVAVSDTYNHALRRY----DPATGE 123
Query: 228 YNSISPTDILMVVGAVLVG 246
+++ TD+ AVLVG
Sbjct: 124 VTTLA-TDLREPSDAVLVG 141
>gi|383829668|ref|ZP_09984757.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
xinjiangensis XJ-54]
gi|383462321|gb|EID54411.1| thiol-disulfide isomerase-like thioredoxin [Saccharomonospora
xinjiangensis XJ-54]
Length = 681
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 63/149 (42%), Gaps = 17/149 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSF-QGYTGHVDGKPNEARFN 135
P + V+ +G + D N +I + R F G G VDG + ARF
Sbjct: 227 PSRAVVTAEGRVLVADAANHSIAEFASDAETLLR------RFGDGERGAVDGAFDIARFA 280
Query: 136 HPKGVTM------DDKG-NLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAGFRDGPSE 185
P G+T+ + G +L VADT N +R I V T+AG G+ G G +
Sbjct: 281 EPSGLTLLPTHIAEQVGYHLLVADTANHLLRAIDLRTGAVRTVAGTGRQWRDGDDSGAAL 340
Query: 186 DAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
D ++ +DV + ++V GN L
Sbjct: 341 DVDLTSPWDVRWWDTVGGVVVAMAGNHTL 369
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 33/59 (55%), Gaps = 2/59 (3%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGF 179
GH DG + A HP GVT+ G++ VADT N A+R+ VTT+A + +GF
Sbjct: 461 GHRDGAADTALLQHPLGVTVLPDGSIGVADTYNGAVRRFDPSTGVVTTLARDLAEPSGF 519
>gi|410976139|ref|XP_003994481.1| PREDICTED: NHL repeat-containing protein 2 [Felis catus]
Length = 637
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 3/52 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG 171
G VDG +A+ HP GVT D K N LYVAD+ N I+ + TT+AG
Sbjct: 374 GDVDGVGIDAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAG 425
>gi|189462806|ref|ZP_03011591.1| hypothetical protein BACCOP_03504 [Bacteroides coprocola DSM 17136]
gi|189430422|gb|EDU99406.1| NHL repeat protein [Bacteroides coprocola DSM 17136]
Length = 500
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 87 ELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGK-PNEARFNHPKGVTMD 143
+ + D N + +TP ++ Y+ GR A G+VDG EARFN+P+G+ D
Sbjct: 398 DFYFTDSGNHCVRVLTPEGVVTTYA-GRGSANLNNEARGYVDGTLRGEARFNNPRGLAYD 456
Query: 144 DKGNL-YVADTLNLAIRKI 161
++ N+ YV D N IRKI
Sbjct: 457 EQNNVFYVGDMDNHIIRKI 475
>gi|290972323|ref|XP_002668903.1| predicted protein [Naegleria gruberi]
gi|284082440|gb|EFC36159.1| predicted protein [Naegleria gruberi]
Length = 296
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 4/94 (4%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG--GKSNVAGFRDGPSED 186
A N P GV + G++Y+AD N IRK+ + +TTIAG G++ + + + + +
Sbjct: 108 NAELNFPSGVAVHSNGDVYIADKSNHVIRKVSALNGKITTIAGIAGETELNKYSNSLATN 167
Query: 187 AKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLN 220
++ + T +++ D N +R++ LN
Sbjct: 168 TTLNSPQYLAVNSSTAEVIISDTNNNVIRKVYLN 201
>gi|405965443|gb|EKC30819.1| Teneurin-3 [Crassostrea gigas]
Length = 2798
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 53/100 (53%), Gaps = 11/100 (11%)
Query: 85 DGELFAVDEVNSNIVKITP--PLSQYSRG-RLVAGSFQ-GYTGHVD-----GKPNEARFN 135
+G+L+ D + +++I P+ + +++AG+ + TG VD G +AR
Sbjct: 1299 NGKLYISDYMKHRVIQIATMGPVQNLEQNYKVIAGNGEECSTGLVDECGDGGLAIQARLL 1358
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 175
PKG+ ++ +G +Y+AD LN IR+I G+ + G N
Sbjct: 1359 GPKGIAINKEGVIYIADNLN--IRQISTTGIISTLIGSHN 1396
>gi|430751035|ref|YP_007213943.1| NHL repeat protein [Thermobacillus composti KWC4]
gi|430735000|gb|AGA58945.1| NHL repeat protein [Thermobacillus composti KWC4]
Length = 487
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 65/147 (44%), Gaps = 30/147 (20%)
Query: 31 IKHLTSLLKWT----TRSSSKAPQADGNVLQFENGYLVETV-----------IEGNEIGV 75
IKH+ LL + A +ADG + + + +TV + G ++GV
Sbjct: 7 IKHVVVLLAMAGLLGAGMGTAAARADGVPVAYNYSFWGDTVSSPAPYLPAALLNGRDLGV 66
Query: 76 VPYK----IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNE 131
P+K + V+ DG+++ +D N+ IV + L S V SF GH D
Sbjct: 67 GPFKEPNDLFVTADGDIYVLDSGNNRIVVLDRQLQPAS----VIDSFV-RDGHAD----- 116
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAI 158
F +P+G+ + D + +ADT N I
Sbjct: 117 -TFANPQGIFVTDDKEVLIADTGNRRI 142
>gi|453054414|gb|EMF01867.1| redoxin domain-containing protein [Streptomyces mobaraensis NBRC
13819 = DSM 40847]
Length = 608
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
GH DG +A F HP GVT G++ V+DT N A+R+ A VTT+A
Sbjct: 392 GHRDGDAAQALFQHPLGVTALPDGSVAVSDTYNHALRRYDPATGQVTTLA 441
>gi|423300918|ref|ZP_17278942.1| hypothetical protein HMPREF1057_02083 [Bacteroides finegoldii
CL09T03C10]
gi|408472253|gb|EKJ90781.1| hypothetical protein HMPREF1057_02083 [Bacteroides finegoldii
CL09T03C10]
Length = 459
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 45/85 (52%), Gaps = 6/85 (7%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P ++ + E+G ++ I KITP G + + + G +G DG ++A+FNH
Sbjct: 378 PCQMDLDEEGNIYVAVRKKHRIAKITPD------GVVTSYTGTGTSGTTDGPLDKAQFNH 431
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKI 161
P+GV G LYV+D N IRKI
Sbjct: 432 PEGVQFGPDGALYVSDYWNHKIRKI 456
Score = 38.9 bits (89), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 52/98 (53%), Gaps = 2/98 (2%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGF 179
G TG V+G+ N+AR P + +D++GN+YVA I KI GV T + +G
Sbjct: 361 GKTGFVEGQGNDARLQEPCQMDLDEEGNIYVAVRKKHRIAKITPDGVVTSY-TGTGTSGT 419
Query: 180 RDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
DGP + A+F N + V P +L V D N +R+I
Sbjct: 420 TDGPLDKAQF-NHPEGVQFGPDGALYVSDYWNHKIRKI 456
>gi|428206267|ref|YP_007090620.1| NHL repeat containing protein [Chroococcidiopsis thermalis PCC
7203]
gi|428008188|gb|AFY86751.1| NHL repeat containing protein [Chroococcidiopsis thermalis PCC
7203]
Length = 502
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 52/107 (48%), Gaps = 7/107 (6%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNV 176
G G DG +A+F P+G+ +DD+ LY+ADT N AIR+I V TIAG
Sbjct: 214 GKPGLTDGDFTQAQFFAPQGMVLDDENQLLYLADTENHAIRRINLPLQRVETIAGTGEQS 273
Query: 177 AGFR--DGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
R G + ++ +D+ ++ L + GN + Q+ L +
Sbjct: 274 RNIRPYGGIGQKIALNSPWDLTMLKN--YLFIAMAGNHQIWQMDLEK 318
>gi|189352530|ref|YP_001948157.1| peptidylglycine monooxygenase [Burkholderia multivorans ATCC 17616]
gi|189336552|dbj|BAG45621.1| peptidylglycine monooxygenase [Burkholderia multivorans ATCC 17616]
Length = 204
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 8/107 (7%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
EA FNHP V + +G++YVAD G+A V A S + G+ +FS
Sbjct: 41 EAPFNHPTDVAVGAEGDIYVADGY-------GNAQVHRFASDGSYLNGWGTPGISAGEFS 93
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDIL 237
++V P +LV+DR N ++ N + + P DI
Sbjct: 94 TPH-AIWVLPDERVLVVDRDNDRIQVFDRNGAVLDEWRGLVRPMDIW 139
>gi|161521298|ref|YP_001584725.1| NHL repeat-containing protein [Burkholderia multivorans ATCC 17616]
gi|160345348|gb|ABX18433.1| NHL repeat containing protein [Burkholderia multivorans ATCC 17616]
Length = 211
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 8/107 (7%)
Query: 131 EARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS 190
EA FNHP V + +G++YVAD G+A V A S + G+ +FS
Sbjct: 48 EAPFNHPTDVAVGAEGDIYVADGY-------GNAQVHRFASDGSYLNGWGTPGISAGEFS 100
Query: 191 NDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDIL 237
++V P +LV+DR N ++ N + + P DI
Sbjct: 101 TPH-AIWVLPDERVLVVDRDNDRIQVFDRNGAVLDEWRGLVRPMDIW 146
>gi|147920395|ref|YP_685830.1| hypothetical protein RCIX1195 [Methanocella arvoryzae MRE50]
gi|110621226|emb|CAJ36504.1| hypothetical protein RCIX1195 [Methanocella arvoryzae MRE50]
Length = 673
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 67/178 (37%), Gaps = 39/178 (21%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSR----GRLVAGSFQGYTGHVDGKPNEA 132
P + V G ++ D +N N VKI Y R G L G ++
Sbjct: 176 PMSVAVDSAGSIYVADYMN-NKVKIFDGAGTYLRSIGTGTLGTGDYE------------- 221
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPS-------- 184
F PKGVT+D GN+YV D N I+ AG G S GF GPS
Sbjct: 222 -FRRPKGVTVDGSGNVYVVDGYNNRIQVFDSAGTYLRTIGAS---GF--GPSGTTHFFVP 275
Query: 185 EDAKFSNDFDVVYVRPTCSLL--VIDRGNAALRQISL--NQDDCEYQYNSISPTDILM 238
+D K D VYV + V A +R I YQ+N P D+ +
Sbjct: 276 KDCKVGAD-GTVYVADEGGMCVHVFSNTGAWIRTIGTPGTSGSGNYQFN--CPCDVAV 330
>gi|393904741|gb|EFO23437.2| NCL-1 protein [Loa loa]
Length = 808
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 79/182 (43%), Gaps = 23/182 (12%)
Query: 48 APQADGNVLQFENGYLVETVIEGNEIGVVP---YKIRVSEDGELFAVDEVNSNIVKITPP 104
+ Q+ G + +NG+ E +G+ P ++ S D EL E N I + PP
Sbjct: 463 SSQSLGTFVTVDNGF--SNQYEKWSMGLEPASGLQLAESPDLEL----EKNMGISNLYPP 516
Query: 105 LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA 164
+Q R +++ G G ++G+ F P GV ++ + ++ VADT N I+
Sbjct: 517 RAQIKRQKMIYHCKFGEFGVMEGQ-----FTEPSGVAVNAQNDIIVADTNNHRIQVFDKE 571
Query: 165 GVTTIAGGKSNVAGFRDG----PSEDA--KFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G G+ G RDG P+ A + + DF V PT + + ++ LR+
Sbjct: 572 GRFKFQFGE---CGKRDGQLLYPNRVAVNRLTGDFIVTERSPTHQIQIYNQYGQFLRKFG 628
Query: 219 LN 220
N
Sbjct: 629 AN 630
>gi|311112964|ref|YP_003984186.1| NHL repeat-containing protein [Rothia dentocariosa ATCC 17931]
gi|310944458|gb|ADP40752.1| NHL repeat-containing protein [Rothia dentocariosa ATCC 17931]
Length = 667
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 4/88 (4%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYT----GHVDGKPNEA 132
P I + DG L+ V S + +T + R + S G G VDG +
Sbjct: 393 PSGIIEARDGSLWVVCSETSGLRHVTFTRDDHGRQSVQVTSAVGLGLFDFGFVDGDSQTS 452
Query: 133 RFNHPKGVTMDDKGNLYVADTLNLAIRK 160
R HP G+ G++ VADT N AIR+
Sbjct: 453 RMQHPLGLAELPDGSIAVADTYNGAIRR 480
>gi|431895427|gb|ELK04943.1| NHL repeat-containing protein 2 [Pteropus alecto]
Length = 726
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 33/61 (54%), Gaps = 5/61 (8%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIA--GGKSNVA 177
G +DG A+ HP GVT D K N LYVAD+ N I+ + TT+A G SNV
Sbjct: 463 GDIDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKHCTTLAGTGDASNVI 522
Query: 178 G 178
G
Sbjct: 523 G 523
>gi|111224062|ref|YP_714856.1| serine/threonine protein kinase [Frankia alni ACN14a]
gi|111151594|emb|CAJ63313.1| Putative eukaryotic-type serine/threonine protein kinase [Frankia
alni ACN14a]
Length = 730
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P ++ V D ++ +D+ N ++ K+ P R L AG+ G G A+ N
Sbjct: 640 PLQVAVGPDSSVYILDDAN-DVRKVDP----SGRITLFAGNGTGGFSGDGGPATRAQMNQ 694
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIA 170
P + + G++Y+AD N IRKI AG + TIA
Sbjct: 695 PSAIAVGSDGSVYIADEGNKRIRKIDPAGRINTIA 729
>gi|343476735|emb|CCD12251.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 582
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 4/82 (4%)
Query: 158 IRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
IR + G+ TIAG K+ V G +DGP+ A F+N V + + V DR N +R+I
Sbjct: 97 IRSVNTTGIDTIAGSKT-VRGNQDGPAAAALFNNPTSVAGIGD--DIFVADRDNKCIRRI 153
Query: 218 SLNQDDCEY-QYNSISPTDILM 238
+ + + Q PTDI++
Sbjct: 154 DASGNVTIFGQVTLNQPTDIIV 175
>gi|320168169|gb|EFW45068.1| peptidyl-alpha-hydroxyglycine-alpha-amidating lyase-PA [Capsaspora
owczarzaki ATCC 30864]
Length = 1374
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 119 QGYTGHVDGK--PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTT 168
+G G +DG+ P F+ P VT D KGN Y+AD N I+ + +AGV T
Sbjct: 723 KGIFGAIDGQDLPPAGMFHIPHSVTFDSKGNSYIADRENGRIQVLSEAGVFT 774
>gi|89899073|ref|YP_521544.1| NHL repeat-containing protein [Rhodoferax ferrireducens T118]
gi|89343810|gb|ABD68013.1| NHL repeat protein [Rhodoferax ferrireducens T118]
Length = 384
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 25/97 (25%), Positives = 41/97 (42%), Gaps = 13/97 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V DG+++ V+ N + + TP G F + G P + F
Sbjct: 227 PTNLAVGRDGDIYVVETGNFRVARFTP-----------EGKFVRHYGEAGQAPGQ--FAR 273
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK 173
PKG+ MD G +YV+D ++ G +A G+
Sbjct: 274 PKGIAMDRSGRMYVSDAAFQNVQIFDGQGRVLMAFGQ 310
>gi|111220793|ref|YP_711587.1| hypothetical protein FRAAL1337 [Frankia alni ACN14a]
gi|111148325|emb|CAJ59997.1| Hypothetical protein FRAAL1337 [Frankia alni ACN14a]
Length = 722
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 50/109 (45%), Gaps = 13/109 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD--GKPNEARF 134
P + V DG + D N + ++ P R+VAGS G GH G A
Sbjct: 538 PRSVTVDGDGVIHLADTGNHRVWRLDP----DGTARVVAGS--GTPGHSGDGGLAIHASL 591
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIA----GGKSNVAG 178
P+ V +D +G L VAD + +R++ AG + TIA GG+ AG
Sbjct: 592 RGPQAVAVDAQGRLLVADQEHRRVRRVDAAGRIETIAGTAYGGRPATAG 640
>gi|116625316|ref|YP_827472.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116228478|gb|ABJ87187.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 981
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 45/89 (50%), Gaps = 9/89 (10%)
Query: 86 GELFAVDEVNSNIVKITPP--LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMD 143
G + D N+ I KI+ +S Y+ G AG F G +G A+ N P V +D
Sbjct: 101 GNAYFADSSNNVIRKISAAGIISTYA-GNGTAG-FAGDSGAA----TSAQLNGPTDVAID 154
Query: 144 DKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
GNLY+ D+ N +RK+ G +TT AG
Sbjct: 155 GNGNLYICDSSNNRVRKVTPGGTITTFAG 183
>gi|402881549|ref|XP_003904331.1| PREDICTED: NHL repeat-containing protein 2 [Papio anubis]
Length = 688
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 425 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 481
>gi|21618524|gb|AAH32598.1| NHLRC2 protein [Homo sapiens]
gi|325464307|gb|ADZ15924.1| NHL repeat containing 2 [synthetic construct]
Length = 367
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 104 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 160
>gi|295700709|ref|YP_003608602.1| NHL repeat containing protein [Burkholderia sp. CCGE1002]
gi|295439922|gb|ADG19091.1| NHL repeat containing protein [Burkholderia sp. CCGE1002]
Length = 727
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 42/82 (51%), Gaps = 7/82 (8%)
Query: 74 GVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGY---TGH-VDGKP 129
G VP + V+ G+L D S + + ++ + G++ AG+ G T H V G P
Sbjct: 261 GTVPTDLTVTPSGQLLVADNGVSQQILV---FNKGAGGQMQAGTSIGTRNGTFHPVRGVP 317
Query: 130 NEARFNHPKGVTMDDKGNLYVA 151
+ RFN G+ +D GNLYVA
Sbjct: 318 GDTRFNGITGIGVDAAGNLYVA 339
>gi|29789158|ref|NP_080087.1| NHL repeat-containing protein 2 [Mus musculus]
gi|81898236|sp|Q8BZW8.1|NHLC2_MOUSE RecName: Full=NHL repeat-containing protein 2
gi|26329039|dbj|BAC28258.1| unnamed protein product [Mus musculus]
gi|63146329|gb|AAH95956.1| NHL repeat containing 2 [Mus musculus]
gi|148669816|gb|EDL01763.1| NHL repeat containing 2 [Mus musculus]
gi|187951881|gb|AAI38191.1| NHL repeat containing 2 [Mus musculus]
gi|223459878|gb|AAI38192.1| NHL repeat containing 2 [Mus musculus]
Length = 725
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 31/51 (60%), Gaps = 3/51 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG 171
G DG +E+ FN P+GV + D +YVADT N IRKI VTT+AG
Sbjct: 267 GRKDGMFSESSFNSPQGVAIADN-VIYVADTENHLIRKIDLEAEKVTTVAG 316
>gi|326924577|ref|XP_003208502.1| PREDICTED: teneurin-1-like [Meleagris gallopavo]
Length = 2704
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 3/46 (6%)
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAG 171
GK +EA N P+G+T+D G +Y D IRKI + GV TTI G
Sbjct: 1278 GKASEASLNSPRGITIDKHGFIYFVD--GTMIRKIDENGVITTIIG 1321
>gi|312075930|ref|XP_003140635.1| NCL-1 protein [Loa loa]
Length = 789
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 79/182 (43%), Gaps = 23/182 (12%)
Query: 48 APQADGNVLQFENGYLVETVIEGNEIGVVP---YKIRVSEDGELFAVDEVNSNIVKITPP 104
+ Q+ G + +NG+ E +G+ P ++ S D EL E N I + PP
Sbjct: 444 SSQSLGTFVTVDNGF--SNQYEKWSMGLEPASGLQLAESPDLEL----EKNMGISNLYPP 497
Query: 105 LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA 164
+Q R +++ G G ++G+ F P GV ++ + ++ VADT N I+
Sbjct: 498 RAQIKRQKMIYHCKFGEFGVMEGQ-----FTEPSGVAVNAQNDIIVADTNNHRIQVFDKE 552
Query: 165 GVTTIAGGKSNVAGFRDG----PSEDA--KFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G G+ G RDG P+ A + + DF V PT + + ++ LR+
Sbjct: 553 GRFKFQFGE---CGKRDGQLLYPNRVAVNRLTGDFIVTERSPTHQIQIYNQYGQFLRKFG 609
Query: 219 LN 220
N
Sbjct: 610 AN 611
>gi|124009915|ref|ZP_01694581.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
gi|123984066|gb|EAY24439.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
Length = 768
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVAD-TLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGP 183
G A+ P+G+ +D GNLY A+ + IRKI D ++T+AGG +N G G
Sbjct: 101 GLATSAKVASPRGIFLDKNGNLYFAEGGASKKIRKINASDGKISTVAGGGANNPG-DGGA 159
Query: 184 SEDAKFSNDFDVVYV 198
+ DAK ++ F VYV
Sbjct: 160 ATDAKLNDPF-FVYV 173
>gi|290996648|ref|XP_002680894.1| predicted protein [Naegleria gruberi]
gi|284094516|gb|EFC48150.1| predicted protein [Naegleria gruberi]
Length = 966
Score = 39.3 bits (90), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 52/101 (51%), Gaps = 14/101 (13%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLV--AGSF--QGYTGHVDGK-PNE 131
P I S +G+L+ + I KI ++ G +V AG + +GY+ DG+
Sbjct: 130 PMGISFSANGDLYLTEAEKHRIRKI------FTNGTIVTVAGVYGTEGYS--ADGQLAIN 181
Query: 132 ARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-VTTIAG 171
+ P G+ + + G +Y AD+ N +RKIG G ++TIAG
Sbjct: 182 SNLRFPFGINVANDGTVYFADSFNCLLRKIGANGIISTIAG 222
>gi|380812570|gb|AFE78159.1| NHL repeat-containing protein 2 [Macaca mulatta]
Length = 726
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 463 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 519
>gi|383418201|gb|AFH32314.1| NHL repeat-containing protein 2 [Macaca mulatta]
Length = 726
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 463 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 519
>gi|326431736|gb|EGD77306.1| NHL repeat-containing protein [Salpingoeca sp. ATCC 50818]
Length = 1384
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 45/96 (46%), Gaps = 5/96 (5%)
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG-----VTTIAGGKSNVAGFRD 181
G P AR N P+G+ G + +ADT N IR + G + T+AG +
Sbjct: 1287 GAPMAARLNTPRGIAAMPAGEIAIADTGNSRIRTLNVGGQGAGVIETVAGTGARGFSGDG 1346
Query: 182 GPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
G + +A + + + T +++ +DR N +RQI
Sbjct: 1347 GVATEANLNFPTGITFSPSTNNIVFVDRRNRRVRQI 1382
>gi|262091900|gb|ACY25449.1| putative PKD domain containing protein [uncultured microorganism]
Length = 340
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 38/89 (42%), Gaps = 13/89 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + DG L+ VD N + + TP G G G P + FN
Sbjct: 128 PSGFAFNADGTLWIVDTQNHRLQQFTP-----------EGECLGGFGQFGAGPGQ--FNM 174
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
P G+T D +G++YV+D N ++K G
Sbjct: 175 PWGITFDSEGDMYVSDWRNDRVQKFAPDG 203
>gi|290972902|ref|XP_002669189.1| predicted protein [Naegleria gruberi]
gi|284082733|gb|EFC36445.1| predicted protein [Naegleria gruberi]
Length = 536
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 44/92 (47%), Gaps = 4/92 (4%)
Query: 76 VPYKIRVSEDGEL--FAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
+ YK ++S E+ A+D N N + I+ S SR ++ G K N+
Sbjct: 310 LEYKYQISTQSEIRGMALDPTNDNSILISCDDSIVSRYTIMESWHVEMYGSKGCKSNQ-- 367
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
FN P G+ +D++G ++V D LN I + G
Sbjct: 368 FNGPTGMVVDEEGRIFVCDCLNSRIVVLSKEG 399
>gi|225012691|ref|ZP_03703126.1| NHL repeat containing protein [Flavobacteria bacterium MS024-2A]
gi|225003224|gb|EEG41199.1| NHL repeat containing protein [Flavobacteria bacterium MS024-2A]
Length = 405
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 47/84 (55%), Gaps = 12/84 (14%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK-PNEARFN 135
P+ I + ++G ++ + N+ + K P + G +VAG G+ +G PN+
Sbjct: 225 PFGIALDKEGYIYVSEIGNNRVTKWAPGADE---GIIVAG------GNGEGNAPNQLAL- 274
Query: 136 HPKGVTMDDKGNLYVADTLNLAIR 159
P G+T+D++GN+YVADT N I+
Sbjct: 275 -PLGLTVDNEGNVYVADTYNHRIQ 297
>gi|109090607|ref|XP_001091193.1| PREDICTED: NHL repeat-containing protein 2 [Macaca mulatta]
Length = 726
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 463 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 519
>gi|114632912|ref|XP_508046.2| PREDICTED: NHL repeat-containing protein 2 [Pan troglodytes]
gi|410221294|gb|JAA07866.1| NHL repeat containing 2 [Pan troglodytes]
gi|410252040|gb|JAA13987.1| NHL repeat containing 2 [Pan troglodytes]
gi|410252042|gb|JAA13988.1| NHL repeat containing 2 [Pan troglodytes]
gi|410252044|gb|JAA13989.1| NHL repeat containing 2 [Pan troglodytes]
gi|410252046|gb|JAA13990.1| NHL repeat containing 2 [Pan troglodytes]
gi|410292446|gb|JAA24823.1| NHL repeat containing 2 [Pan troglodytes]
gi|410341451|gb|JAA39672.1| NHL repeat containing 2 [Pan troglodytes]
gi|410341453|gb|JAA39673.1| NHL repeat containing 2 [Pan troglodytes]
Length = 726
Score = 39.3 bits (90), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 463 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 519
>gi|147918859|ref|YP_687415.1| hypothetical protein RRC374 [Methanocella arvoryzae MRE50]
gi|110622811|emb|CAJ38089.1| hypothetical protein RRC374 [Methanocella arvoryzae MRE50]
Length = 514
Score = 39.3 bits (90), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 37/83 (44%), Gaps = 13/83 (15%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P+ I V G ++ D N I + ++ TG D P +F+
Sbjct: 135 PHAIAVDGAGNIYVADTFNDRIQVWNKATNTWTT-----------TGSPDDDP--GQFSS 181
Query: 137 PKGVTMDDKGNLYVADTLNLAIR 159
P+G+ +D GN+YVADT N I+
Sbjct: 182 PRGIAVDGAGNIYVADTYNYRIQ 204
>gi|355783113|gb|EHH65034.1| hypothetical protein EGM_18374, partial [Macaca fascicularis]
Length = 723
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 460 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 516
>gi|333381581|ref|ZP_08473261.1| hypothetical protein HMPREF9455_01427 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829828|gb|EGK02470.1| hypothetical protein HMPREF9455_01427 [Dysgonomonas gadei ATCC
BAA-286]
Length = 455
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 53/119 (44%), Gaps = 11/119 (9%)
Query: 113 LVAGSFQGYTGHVDGKPNEARFNHPK-GVTM--------DDKGNLYVADTLNLAIRKIGD 163
L+AG Q G+ DG + AR N P+ GV + +D+ + Y D N +R +
Sbjct: 338 LIAGKDQT-AGYADGVGSYARVNEPQQGVFVKNPEYEGQEDEYDFYFCDKANHVVRIMTP 396
Query: 164 AGVTTIAGGKSNVAGFRDGPSE-DAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQ 221
G + G +G+RDG DA+F+ + Y + D N +R+++ +
Sbjct: 397 QGRVSTFAGTPQASGYRDGDLRLDARFAYPAGIAYDEERQCFYIGDSNNRRIRKVAYEE 455
>gi|297687408|ref|XP_002821208.1| PREDICTED: NHL repeat-containing protein 2 [Pongo abelii]
Length = 726
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 463 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 519
>gi|290976426|ref|XP_002670941.1| predicted protein [Naegleria gruberi]
gi|284084505|gb|EFC38197.1| predicted protein [Naegleria gruberi]
Length = 872
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 71/148 (47%), Gaps = 11/148 (7%)
Query: 75 VVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGK-PNEAR 133
+ + + S +GE++ D ++V I L ++ G G Q Y G DG +++
Sbjct: 298 LTAFGVSTSNEGEVYFADYARVSVVTING-LIKHVAGN---GKLQ-YLG--DGNLATQSQ 350
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGF-RDGPSEDAKFSN 191
P+GV + G L+++D+ IRK+ GV TIAGG + G+ +DG F N
Sbjct: 351 LFKPRGVAVSTSGELFISDSAQDTIRKVSTEGVINTIAGGTFPLGGYIQDGVDPLNTFLN 410
Query: 192 DFDVV-YVRPTCSLLVIDRGNAALRQIS 218
+ + + + L + D N +R+IS
Sbjct: 411 NPSALEFSSSSGELFLSDSHNFRVRKIS 438
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 67/299 (22%), Positives = 126/299 (42%), Gaps = 69/299 (23%)
Query: 69 EGNEIGVVPYKIRVSEDGE-LFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDG 127
+ ++ + P K++ E ++ D +N+ I K+ + S +V F G++G DG
Sbjct: 14 DAKQVCLAPTKVKFDRKSENIYISDRINNRIRKLDVMSGKIST--IVGNEFIGFSG--DG 69
Query: 128 KPNE-ARFNHPKGVTMDDKGNLYVADTLNLAIRKI-GDAGVTTIAGG---------KSNV 176
+ A N+P V + ++Y ADT N IRK+ + + TIAG KS V
Sbjct: 70 QAATLAHINNPGKVEVSSNNDIYFADTNNHRIRKVFANGTILTIAGCGEFGFDLEEKSPV 129
Query: 177 AGFRDGPSEDAK--------FSNDFDVVYVRPTCSLLV----IDRGNAALRQISLNQDDC 224
P+ +AK S+D ++Y+ T S ++ + G ++ SL +D
Sbjct: 130 ------PALEAKINYPKAVLLSSDEKILYITDTASHMIKALSFETGLVSIIAGSLYRDSS 183
Query: 225 EYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNK 284
+ + ++Q GP + + SE F ++ IK+++N+
Sbjct: 184 QN----------------------STIVQ---GPISLALSPLTSEVYFVDDDGIKKITNE 218
Query: 285 EKPIPIVESMKEEPGWPSFGQLIID--LSKLALEAMGS----MFLNFVPFRFRSSGTKG 337
K +E + + G L I+ +S + +E +G+ + ++ R R GT G
Sbjct: 219 GK----IEMITRNSKFGGDGGLAINAGVSAVDIEFLGTSNTLLIADYGNNRIRIIGTNG 273
>gi|397510538|ref|XP_003825652.1| PREDICTED: NHL repeat-containing protein 2 [Pan paniscus]
Length = 726
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 463 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 519
>gi|119569873|gb|EAW49488.1| NHL repeat containing 2, isoform CRA_b [Homo sapiens]
Length = 726
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 463 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 519
>gi|42476013|ref|NP_940916.2| NHL repeat-containing protein 2 [Homo sapiens]
gi|74762548|sp|Q8NBF2.1|NHLC2_HUMAN RecName: Full=NHL repeat-containing protein 2
gi|21748831|dbj|BAC03493.1| unnamed protein product [Homo sapiens]
Length = 726
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 463 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 519
>gi|290988662|ref|XP_002677016.1| predicted protein [Naegleria gruberi]
gi|284090621|gb|EFC44272.1| predicted protein [Naegleria gruberi]
Length = 450
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
DG P +P G+ MD++GN++++DT N IRK+
Sbjct: 189 DGPPRSIPLYYPTGIAMDEEGNIFISDTRNNLIRKV 224
>gi|223940572|ref|ZP_03632418.1| NHL repeat containing protein [bacterium Ellin514]
gi|223890764|gb|EEF57279.1| NHL repeat containing protein [bacterium Ellin514]
Length = 791
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 38/87 (43%), Gaps = 19/87 (21%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITP---PLSQYSRGRLVAGSFQGYTGHVDGKPNEAR 133
PY IRV + G + + NS I PL G V G P E
Sbjct: 703 PYDIRVDKAGRQYVCEFGNSRIQIFDANDKPLE--------------VLGGVGGAPGE-- 746
Query: 134 FNHPKGVTMDDKGNLYVADTLNLAIRK 160
F++P + D KGNLYVAD+ N ++K
Sbjct: 747 FSNPWSIAFDSKGNLYVADSQNHRVQK 773
>gi|224098054|ref|XP_002197115.1| PREDICTED: teneurin-1-like isoform 2 [Taeniopygia guttata]
Length = 2705
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 3/46 (6%)
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAG 171
GK +EA N P+G+T+D G +Y D IRKI + GV TTI G
Sbjct: 1278 GKASEASLNSPRGITVDKHGFIYFVD--GTMIRKIDENGVITTIIG 1321
>gi|254383375|ref|ZP_04998727.1| NHL repeat containing protein [Streptomyces sp. Mg1]
gi|194342272|gb|EDX23238.1| NHL repeat containing protein [Streptomyces sp. Mg1]
Length = 607
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 37/76 (48%), Gaps = 9/76 (11%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFR 180
GH DG +A HP GVT G++ V DT N A+R+ A VTT+A R
Sbjct: 392 GHRDGDAGQALLQHPLGVTALPDGSVAVCDTYNHALRRFDPATGRVTTLA------TDLR 445
Query: 181 DGPSEDAKFSNDFDVV 196
+ PS+ ND VV
Sbjct: 446 E-PSDAVLAGNDIVVV 460
>gi|17228508|ref|NP_485056.1| hypothetical protein alr1013 [Nostoc sp. PCC 7120]
gi|17130359|dbj|BAB72970.1| alr1013 [Nostoc sp. PCC 7120]
Length = 503
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 10/98 (10%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ + G LF D + IV +S ++ G ++ G +G DG EA+F+
Sbjct: 175 PGKVLATPSG-LFVADSGHHRIV-----VSDFN-GEILHLIGNGKSGLTDGNFQEAQFSA 227
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRK--IGDAGVTTIAG 171
P+G+ D + LY+ADT N A+R+ I V TIAG
Sbjct: 228 PQGMAFDMENQILYLADTENHALRRVDINQQTVETIAG 265
>gi|307727531|ref|YP_003910744.1| NHL repeat containing protein [Burkholderia sp. CCGE1003]
gi|307588056|gb|ADN61453.1| NHL repeat containing protein [Burkholderia sp. CCGE1003]
Length = 396
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 60/112 (53%), Gaps = 12/112 (10%)
Query: 113 LVAGSFQ-GYTGHVDGKP-NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTT 168
++AG+ Q GY+G DG P +A ++P GV +D GN+++A+ +RK+ +TT
Sbjct: 86 IIAGTGQAGYSG--DGGPATQAMLHYPTGVALDRAGNVFIAEPFAHVVRKVDRHTGVITT 143
Query: 169 IAG-GKSNVAGFRD-GPSEDAK-FSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
+AG G +V D GP+ A ++ND V V L + D +R++
Sbjct: 144 VAGPGTPDVPNRGDGGPATQATMYAND---VAVNREGDLYIADSAAGRIRRV 192
>gi|20092638|ref|NP_618713.1| cell surface protein [Methanosarcina acetivorans C2A]
gi|19917919|gb|AAM07193.1| cell surface protein [Methanosarcina acetivorans C2A]
Length = 526
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 54/132 (40%), Gaps = 22/132 (16%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P I V G ++ VD NS I+K G + G G DG +F
Sbjct: 194 PEGIAVDSSGNIYVVDSGNSRIMKF--------DGTGTYLTEWGTPGQEDG-----QFRS 240
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 196
P G+ +D G +YV DT N I+K G S V + + D KF N +V
Sbjct: 241 PHGIAIDSSGAIYVTDTGNRRIQKFDSTG--------SYVTKWVSPENGDGKFQNPVGIV 292
Query: 197 YVRPTCSLLVID 208
V + ++ V+D
Sbjct: 293 -VDSSNNVYVVD 303
>gi|449281653|gb|EMC88689.1| Teneurin-1, partial [Columba livia]
Length = 2479
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 3/46 (6%)
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGV-TTIAG 171
GK +EA N P+G+T+D G +Y D IRKI + GV TTI G
Sbjct: 1052 GKASEASLNSPRGITVDKHGFIYFVD--GTMIRKIDENGVITTIIG 1095
>gi|432329836|ref|YP_007247979.1| hypothetical protein Metfor_0399 [Methanoregula formicicum SMSP]
gi|432136545|gb|AGB01472.1| hypothetical protein Metfor_0399 [Methanoregula formicicum SMSP]
Length = 711
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 51/115 (44%), Gaps = 17/115 (14%)
Query: 51 ADGNVLQFENG--YLVETVIEGNEIGVVPYKIRVSED--GELFAVDEVNSNIVKITPPLS 106
A+ V +F +G Y+ + EG G Y V+ D G ++ D N + K +
Sbjct: 276 ANERVQKFSSGGSYISQWGTEGTGNGQFNYLHGVATDTSGNVYVADSSNHRVQKFS---- 331
Query: 107 QYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
S G+ +A TG+ +FN P G+ D GN+YV DT N ++K
Sbjct: 332 --SDGKYIAQWGTRGTGN-------GQFNRPYGIATDTSGNVYVVDTWNSRVQKF 377
>gi|342182680|emb|CCC92159.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 565
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 4/82 (4%)
Query: 158 IRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 217
IR + G+ TIAG K+ V G +DGP+ A F+N V + + V DR N +R I
Sbjct: 97 IRSVSTTGIDTIAGNKT-VRGNQDGPAAAALFNNPTSVAGIGD--DIFVADRDNNCIRHI 153
Query: 218 SLNQDDCEYQYNSIS-PTDILM 238
+ + + ++ PTDI++
Sbjct: 154 DASGNVTTLELIDLNQPTDIIV 175
>gi|21758510|dbj|BAC05316.1| unnamed protein product [Homo sapiens]
Length = 440
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSN 175
G VDG A+ HP GVT D K N LYVAD+ N I+ + TT+AG G +N
Sbjct: 177 GDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTN 233
>gi|284034698|ref|YP_003384629.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Kribbella flavida DSM 17836]
gi|283813991|gb|ADB35830.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Kribbella flavida DSM 17836]
Length = 594
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 24/35 (68%)
Query: 126 DGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
DGK +EA HP GVT+ G++ +ADT N A+R+
Sbjct: 382 DGKADEALLQHPLGVTVLPDGSIAIADTYNGAVRR 416
>gi|397781344|ref|YP_006545817.1| Internalin-A [Methanoculleus bourgensis MS2]
gi|396939846|emb|CCJ37101.1| Internalin-A [Methanoculleus bourgensis MS2]
Length = 2759
Score = 39.3 bits (90), Expect = 4.9, Method: Composition-based stats.
Identities = 31/93 (33%), Positives = 44/93 (47%), Gaps = 14/93 (15%)
Query: 119 QGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAG 178
+G +G DG+ F P G+T+D GN+YVADT N I+K +G G
Sbjct: 226 KGSSGSGDGQ-----FLSPYGITVDAAGNVYVADTWNHRIQKFDSSGNFLTKWGSRG--- 277
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGN 211
S + +FS F V V ++ V +RGN
Sbjct: 278 -----SGNGQFSEPFGVA-VDSAGNVYVTERGN 304
>gi|229029660|ref|ZP_04185736.1| Cell surface protein [Bacillus cereus AH1271]
gi|228731664|gb|EEL82570.1| Cell surface protein [Bacillus cereus AH1271]
Length = 617
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 52/111 (46%), Gaps = 9/111 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA--GSFQGYTGHVDGKPNEARF 134
PY + ++G +F D N I+K + G++V GS G G + +F
Sbjct: 333 PYDVERDKNGNVFVSDSFNHRILKYD------TSGKVVGKWGSLFGAGGPLGYGSLPGQF 386
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA-GFRDGPS 184
P+ + D N+YV+D++N I+K ++G+ + G V GF PS
Sbjct: 387 YVPRQIATDRYNNVYVSDSVNHRIQKFTNSGIALASYGSFGVLPGFFQFPS 437
>gi|27552887|gb|AAH41104.1| Nhlrc2 protein, partial [Mus musculus]
Length = 522
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 31/51 (60%), Gaps = 3/51 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG 171
G DG +E+ FN P+GV + D +YVADT N IRKI VTT+AG
Sbjct: 64 GRKDGMFSESSFNSPQGVAIADN-VIYVADTENHLIRKIDLEAEKVTTVAG 113
>gi|328783461|ref|XP_396242.3| PREDICTED: NHL repeat-containing protein 2 [Apis mellifera]
Length = 699
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 58/118 (49%), Gaps = 9/118 (7%)
Query: 56 LQFENGYLVETVIEGNEIGVVPYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA 115
LQ E +L+ TV N + +I +E GE + + +N + I ++ + ++
Sbjct: 211 LQLE-CHLLATVDNKNLLFPSKLEIFQNEQGENLIIADTGNNRILIVD--TKGNVQHVIG 267
Query: 116 GSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG--DAGVTTIAG 171
GS + DG ARFN P+GV + D +YVAD N AIRKI VTT+ G
Sbjct: 268 GSNPDFR---DGDFENARFNAPQGVCILDTF-IYVADNENHAIRKIDLIKKMVTTVVG 321
Score = 38.9 bits (89), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 51/104 (49%), Gaps = 7/104 (6%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAG-GKSNVAG 178
G DG A+ HP G+T K N +Y+ DT N I+KI TI G GK N
Sbjct: 466 GDSDGIKYAAKLQHPLGITWHSKDNAVYITDTYNHKIKKIDVTTQNCKTIYGDGKPNEKF 525
Query: 179 FRDGPSEDAKFSNDFDVVYVRPTCS--LLVIDRGNAALRQISLN 220
D PS A S++ D++YV T + + VID + +S+N
Sbjct: 526 SFDEPSGIA-ISSEKDLLYVADTNNHEVKVIDTKKENITTLSIN 568
>gi|290987760|ref|XP_002676590.1| predicted protein [Naegleria gruberi]
gi|284090193|gb|EFC43846.1| predicted protein [Naegleria gruberi]
Length = 836
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 38/66 (57%), Gaps = 4/66 (6%)
Query: 113 LVAG-SFQGYTGHVDGKPNEARFNHPKGVTMD-DKGNLYVADTLNLAIRKIGDAG-VTTI 169
++AG FQG++G G A+ N P V D + GN+Y+AD+ N IR I G +TTI
Sbjct: 508 VIAGIGFQGFSGD-GGLAINAQLNKPHSVHFDKNSGNIYIADSGNHRIRMIFPNGTITTI 566
Query: 170 AGGKSN 175
G +N
Sbjct: 567 VGSGTN 572
>gi|290983792|ref|XP_002674612.1| predicted protein [Naegleria gruberi]
gi|284088203|gb|EFC41868.1| predicted protein [Naegleria gruberi]
Length = 589
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 54/106 (50%), Gaps = 9/106 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGS-FQGYTGHVDGKPNEARFN 135
PY + + GEL + + ++ +++ S + R +AGS +G+ G + F
Sbjct: 52 PYGLEI--HGELLYIADTYNDRIRVMNLTSH--KIRTIAGSGVRGFNGDFSMDAMQVMFR 107
Query: 136 HPKGVTMD-DKGNLYVADTLNLAIRKIG--DAGVTTIAG-GKSNVA 177
P+ V +D GNL ++DT N IRK+ V+TIAG G +N A
Sbjct: 108 QPEDVKIDPTDGNLIISDTFNNRIRKMWKESGNVSTIAGTGATNTA 153
>gi|256556932|gb|ACU83555.1| thiol-disulfide isomerase/thioredoxin [uncultured bacterium
HF130_AEPn_1]
Length = 497
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 8/78 (10%)
Query: 86 GELFAV-DEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDD 144
GELF + D N ++ I + ++V GS G G+ DG ARF P G+ D+
Sbjct: 199 GELFFLTDSSNHRVLAIDLS----GKVKMVIGS--GKEGNKDGDIKVARFRRPHGLAFDE 252
Query: 145 KGN-LYVADTLNLAIRKI 161
K + LY+ADT N +I+ +
Sbjct: 253 KNDLLYIADTDNHSIKSL 270
>gi|408679290|ref|YP_006879117.1| hypothetical protein SVEN_3572 [Streptomyces venezuelae ATCC 10712]
gi|328883619|emb|CCA56858.1| hypothetical protein SVEN_3572 [Streptomyces venezuelae ATCC 10712]
Length = 597
Score = 39.3 bits (90), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIA 170
GH DG +A HP GVT G++ V+DT N A+R+ A VTT+A
Sbjct: 382 GHRDGAAEQALLQHPLGVTALPDGSVAVSDTYNHALRRFDPASGEVTTLA 431
>gi|170575166|ref|XP_001893127.1| B-box type zinc-finger protein ncl-1 [Brugia malayi]
gi|158601023|gb|EDP38039.1| B-box type zinc-finger protein ncl-1, putative [Brugia malayi]
Length = 788
Score = 38.9 bits (89), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 79/182 (43%), Gaps = 23/182 (12%)
Query: 48 APQADGNVLQFENGYLVETVIEGNEIGVVP---YKIRVSEDGELFAVDEVNSNIVKITPP 104
+ Q+ G + +NG+ E +G+ P ++ S D EL E N I + PP
Sbjct: 444 SSQSLGTFVTVDNGF--SNQYEKWSMGLEPASGLQLAESPDLEL----EKNMGISNLYPP 497
Query: 105 LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA 164
+Q R +++ G G ++G+ F P GV ++ + ++ VADT N I+
Sbjct: 498 RAQIKRQKMIYHCKFGEFGVMEGQ-----FTEPSGVAVNAQNDIIVADTNNHRIQVFDKE 552
Query: 165 GVTTIAGGKSNVAGFRDG----PSEDA--KFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G G+ G RDG P+ A + + DF V PT + + ++ LR+
Sbjct: 553 GRFKFQFGE---CGKRDGQLLYPNRVAVNRLTGDFIVTERSPTHQIQIYNQYGQFLRKFG 609
Query: 219 LN 220
N
Sbjct: 610 AN 611
>gi|260808708|ref|XP_002599149.1| hypothetical protein BRAFLDRAFT_81816 [Branchiostoma floridae]
gi|229284425|gb|EEN55161.1| hypothetical protein BRAFLDRAFT_81816 [Branchiostoma floridae]
Length = 610
Score = 38.9 bits (89), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 53/129 (41%), Gaps = 23/129 (17%)
Query: 42 TRSSSKAPQADGNVLQFE-NGYLVETVIEGNEIGV-VPYKIRVSEDGELFAVDEVNSNIV 99
T+S+ PQ +G V F +G LV V G + G+ P I V E+G + D N I
Sbjct: 455 TQSTGDWPQRNGEVQVFRPDGKLVRVV--GQQQGMKYPQHITVDEEGSILVSDYDNHCIY 512
Query: 100 KITPPLSQYSRGRLVAGSFQ---GYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNL 156
G F G G V+G+ HP+G+ D GN+ VAD+ N
Sbjct: 513 VFNDD-----------GQFLFKFGGEGSVEGQ-----LKHPRGICTDKSGNIIVADSGNS 556
Query: 157 AIRKIGDAG 165
+ G
Sbjct: 557 RVEMFDKTG 565
>gi|290976287|ref|XP_002670872.1| predicted protein [Naegleria gruberi]
gi|284084435|gb|EFC38128.1| predicted protein [Naegleria gruberi]
Length = 1223
Score = 38.9 bits (89), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 49/107 (45%), Gaps = 21/107 (19%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + VS +G+L+ D N+ I R+V+ S TG V A FN
Sbjct: 26 PSSLSVSPNGDLYIADTQNNKI-------------RIVSAS----TGLVSSIS--ATFNK 66
Query: 137 PKGVTMDDKGNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRD 181
P G+ + LY+ADT N I+K I ++TI G K+ + G D
Sbjct: 67 PLGIVVSSNNILYIADTQNNLIKKYDISTRVLSTIGGEKAYLDGSYD 113
>gi|254390353|ref|ZP_05005570.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
27064]
gi|294813833|ref|ZP_06772476.1| NHL repeat containing protein [Streptomyces clavuligerus ATCC
27064]
gi|326442251|ref|ZP_08216985.1| hypothetical protein SclaA2_14354 [Streptomyces clavuligerus ATCC
27064]
gi|197704057|gb|EDY49869.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
27064]
gi|294326432|gb|EFG08075.1| NHL repeat containing protein [Streptomyces clavuligerus ATCC
27064]
Length = 631
Score = 38.9 bits (89), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRK 160
GH DG +A F HP GVT G++ V+DT N A+R+
Sbjct: 399 GHRDGAAAQALFQHPLGVTALPDGSVAVSDTYNHALRR 436
>gi|196230114|ref|ZP_03128977.1| SMP-30/Gluconolaconase/LRE domain protein [Chthoniobacter flavus
Ellin428]
gi|196225711|gb|EDY20218.1| SMP-30/Gluconolaconase/LRE domain protein [Chthoniobacter flavus
Ellin428]
Length = 1549
Score = 38.9 bits (89), Expect = 5.6, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 129 PNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPS 184
P FN P+G+T D GNLYVA+ L +I K+ D V+T A G GF PS
Sbjct: 1068 PFATGFNAPEGMTFDTTGNLYVANFLGDSISKVTPDGEVSTFASGFP-TTGFTSYPS 1123
>gi|119509213|ref|ZP_01628363.1| NHL repeat protein [Nodularia spumigena CCY9414]
gi|119466055|gb|EAW46942.1| NHL repeat protein [Nodularia spumigena CCY9414]
Length = 502
Score = 38.9 bits (89), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 68/148 (45%), Gaps = 14/148 (9%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P K+ + G LF D + +V +S Y G ++ G +G DG NEA+F
Sbjct: 177 PGKVLATPAG-LFIADSGHHRVV-----MSSYD-GEILHLIGTGKSGLTDGAFNEAQFFA 229
Query: 137 PKGVTMDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGF--RDGPSEDAKFSN 191
P+G+ D + LYVADT N +R++ V TIAG G + ++
Sbjct: 230 PQGMAFDPENQMLYVADTENHTLRRVDLKRQIVKTIAGTGEQSRNIYPHGGAGIETALNS 289
Query: 192 DFDVVYVRPTCSLLVIDRGNAALRQISL 219
+DVV + T L + G+ + Q++L
Sbjct: 290 PWDVVQLGQT--LFIAMAGSHQIWQMNL 315
>gi|159113256|ref|XP_001706855.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia ATCC
50803]
gi|157434955|gb|EDO79181.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia ATCC
50803]
Length = 969
Score = 38.9 bits (89), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 7/102 (6%)
Query: 120 GYTGHVDGKPNEARFNHPKGVTMDD-KGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNV 176
G H+DG ++A FN P + D G +Y+ D N IR + V T+A G V
Sbjct: 126 GEAAHLDGGFSQAAFNFPHSIVSDPVNGIIYLTDAKNHCIRTLSLMTRKVKTLA-GTPGV 184
Query: 177 AGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G++DG + F+ +V +++V D N ALR +S
Sbjct: 185 FGYKDG--LNPLFNEPLGLVLTEDG-NVIVCDSKNGALRYVS 223
>gi|166364150|ref|YP_001656423.1| hypothetical protein MAE_14090 [Microcystis aeruginosa NIES-843]
gi|166086523|dbj|BAG01231.1| hypothetical protein MAE_14090 [Microcystis aeruginosa NIES-843]
Length = 342
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 49/118 (41%), Gaps = 13/118 (11%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
PY I VS G ++ D N N V++ +G FQ G +F
Sbjct: 111 PYGIAVSRGGNVYVADTDN-NRVQVFDS----------SGVFQSAFG--TNGTGTGQFQG 157
Query: 137 PKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 194
P + + GN+YVADT N I+ GV A G S+ G G S + ++ F+
Sbjct: 158 PYAIAVGSSGNVYVADTFNNRIQVFNSTGVFQFAFGSSSPYGIAVGSSGNIYVADTFN 215
>gi|163848101|ref|YP_001636145.1| NHL repeat-containing protein [Chloroflexus aurantiacus J-10-fl]
gi|163669390|gb|ABY35756.1| NHL repeat containing protein [Chloroflexus aurantiacus J-10-fl]
Length = 1139
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 13/81 (16%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I + + G+++ D N I P G LV +F G +D +F P+G
Sbjct: 848 IAIDQQGQIYVADTYNHRIEVFAP------DGTLVR-NFGGQGNGLD------QFYEPRG 894
Query: 140 VTMDDKGNLYVADTLNLAIRK 160
+ D +GNLY+ADT N + K
Sbjct: 895 LAFDSQGNLYIADTWNARVVK 915
>gi|73667964|ref|YP_303979.1| hypothetical protein Mbar_A0416 [Methanosarcina barkeri str.
Fusaro]
gi|72395126|gb|AAZ69399.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 346
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 49/110 (44%), Gaps = 21/110 (19%)
Query: 61 GYLVETVIEGNEIGVVPY--KIRVSEDGELFAVDEVNSNIVKITPP---LSQYSRGRLVA 115
G+L + GN G + + V G ++ D N+ I K L+Q+
Sbjct: 129 GHLTQWGSSGNGNGQFYFLNGVAVDSSGNVYVADSGNNRIQKFNSNGGYLTQW------- 181
Query: 116 GSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG 165
GS+ G N +FN P+GV +D GN+YVAD+ N I+K G
Sbjct: 182 GSY--------GSGN-GQFNDPEGVAVDSSGNVYVADSGNNRIQKFNSTG 222
>gi|75909875|ref|YP_324171.1| NHL repeat-containing protein [Anabaena variabilis ATCC 29413]
gi|75703600|gb|ABA23276.1| NHL repeat protein [Anabaena variabilis ATCC 29413]
Length = 503
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 46/89 (51%), Gaps = 9/89 (10%)
Query: 86 GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDK 145
G LF D + IV +S ++ G ++ G +G DG EA+F+ P+G+ D +
Sbjct: 183 GGLFVADSGHHRIV-----VSDFN-GEILHLIGNGKSGLTDGNFQEAQFSAPQGMAFDME 236
Query: 146 GN-LYVADTLNLAIRK--IGDAGVTTIAG 171
LYVADT N +R+ I V TIAG
Sbjct: 237 NQILYVADTDNHVVRRADIQQQTVETIAG 265
>gi|260793567|ref|XP_002591783.1| hypothetical protein BRAFLDRAFT_123530 [Branchiostoma floridae]
gi|229276993|gb|EEN47794.1| hypothetical protein BRAFLDRAFT_123530 [Branchiostoma floridae]
Length = 299
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 61/142 (42%), Gaps = 29/142 (20%)
Query: 91 VDEVNSNIVKITP-------------PLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHP 137
VD+ S ++K P L + G LV F G+ P + +F HP
Sbjct: 16 VDQNRSGLIKFRPYGGIGELVKLGVGELDSRTVGELVV-KFGGW------GPQQGQFKHP 68
Query: 138 KGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 197
GV + G++ VADT N I+ GV A GF G ++DA FS+ DV
Sbjct: 69 GGVAVSSAGHIIVADTGNHRIQVFDSRGVFLRA------FGFY-GSADDA-FSHPHDVAM 120
Query: 198 VRPTCSLLVIDRGNAALRQISL 219
+LV D+GN ++ +L
Sbjct: 121 TTDD-RILVTDKGNKLVKLFTL 141
>gi|222525995|ref|YP_002570466.1| NHL repeat containing protein [Chloroflexus sp. Y-400-fl]
gi|222449874|gb|ACM54140.1| NHL repeat containing protein [Chloroflexus sp. Y-400-fl]
Length = 1122
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 13/81 (16%)
Query: 80 IRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKG 139
I + + G+++ D N I P G LV +F G +D +F P+G
Sbjct: 831 IAIDQQGQIYVADTYNHRIEVFAP------DGTLVR-NFGGQGNGLD------QFYEPRG 877
Query: 140 VTMDDKGNLYVADTLNLAIRK 160
+ D +GNLY+ADT N + K
Sbjct: 878 LAFDSQGNLYIADTWNARVVK 898
>gi|371777392|ref|ZP_09483714.1| hypothetical protein AnHS1_08238 [Anaerophaga sp. HS1]
Length = 448
Score = 38.9 bits (89), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 43/87 (49%), Gaps = 5/87 (5%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVDGKPNEARFNH 136
P + V EDG ++ V+ + IVK S L+AG GH DG P EA+F
Sbjct: 364 PGGLTVDEDGNIYVVNIDGNTIVKCDRKSDFVS---LLAG-VPLTMGHKDGPPLEAQFYL 419
Query: 137 PKGVTMDDKGNLYVADTLNLA-IRKIG 162
P G+++D+ GN + + +RK+
Sbjct: 420 PFGISIDEDGNFIICECWGSGVVRKLA 446
>gi|283458725|ref|YP_003363360.1| thiol-disulfide isomerase [Rothia mucilaginosa DY-18]
gi|283134775|dbj|BAI65540.1| thiol-disulfide isomerase [Rothia mucilaginosa DY-18]
Length = 656
Score = 38.9 bits (89), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 49/114 (42%), Gaps = 22/114 (19%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKG-------NLYVADT-------LNLA---IRKIGDAG 165
GHVDG EARFN P G+ + + ++ VADT LNL +R + G
Sbjct: 240 GHVDGTGTEARFNEPAGLALVPENLREQLGYDVLVADTVNHRLRSLNLTTGEVRTLAGNG 299
Query: 166 VTTIAGGKSNVAGFRDG-----PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAAL 214
V + G + + G + P+ + S+ +D VY R ++ G L
Sbjct: 300 VQRVIDGDNAITGDPNHIPAGVPATEIALSSPWDAVYSREAGQFIIAMAGTHQL 353
>gi|195998269|ref|XP_002109003.1| hypothetical protein TRIADDRAFT_20343 [Trichoplax adhaerens]
gi|190589779|gb|EDV29801.1| hypothetical protein TRIADDRAFT_20343 [Trichoplax adhaerens]
Length = 666
Score = 38.9 bits (89), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 11/87 (12%)
Query: 130 NEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA-GFRDGPSEDAK 188
N +F P+GVT+D +GN+ VAD+ N I+ G G G + + G D PS
Sbjct: 588 NNGQFLRPQGVTIDPEGNIIVADSRNHRIQVFGSNGNFLCKFGSNGINDGCLDRPSG--- 644
Query: 189 FSNDFDVVYVRPTCSLLVIDRGNAALR 215
+ V P ++V+D GN L+
Sbjct: 645 -------IAVTPEGLIIVVDFGNNRLQ 664
>gi|290976772|ref|XP_002671113.1| predicted protein [Naegleria gruberi]
gi|284084679|gb|EFC38369.1| predicted protein [Naegleria gruberi]
Length = 520
Score = 38.9 bits (89), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 41/86 (47%), Gaps = 6/86 (6%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQ-GYTGHVDGKPNEARFN 135
P I + + GE+F D N I +ITP + +AG+ GY+G G A +
Sbjct: 281 PKSIAIGKHGEIFFTDSDNQVIRRITPDGIITT----IAGTGNFGYSGD-GGLATSADIS 335
Query: 136 HPKGVTMDDKGNLYVADTLNLAIRKI 161
P G+ +D G +Y D N +RK+
Sbjct: 336 KPTGIAVDSNGTIYFCDNNNNRVRKL 361
>gi|395218326|ref|ZP_10401983.1| alkyl hydroperoxide reductase [Pontibacter sp. BAB1700]
gi|394454578|gb|EJF09204.1| alkyl hydroperoxide reductase [Pontibacter sp. BAB1700]
Length = 474
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 48/95 (50%), Gaps = 9/95 (9%)
Query: 69 EGNEIGVVPYKIRVSED--GELFAVDEVNSNIVKITPPLSQYSRGRLVAGSFQGYTGHVD 126
E E GV+ + ++ D G L+ D ++ I+K LSQ + V GS G G +
Sbjct: 163 EVKEAGVLSFPSKLISDAAGNLYLSDSGHNRILK----LSQQGQVLEVIGS--GERGFNN 216
Query: 127 GKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKI 161
G +A FN P G+ + LYVAD N AIRK+
Sbjct: 217 GGYADATFNEPHGLALHGS-TLYVADAKNHAIRKV 250
>gi|229017270|ref|ZP_04174175.1| Cell surface protein [Bacillus cereus AH1273]
gi|229023443|ref|ZP_04179943.1| Cell surface protein [Bacillus cereus AH1272]
gi|423420069|ref|ZP_17397158.1| hypothetical protein IE3_03541 [Bacillus cereus BAG3X2-1]
gi|228737853|gb|EEL88349.1| Cell surface protein [Bacillus cereus AH1272]
gi|228744023|gb|EEL94120.1| Cell surface protein [Bacillus cereus AH1273]
gi|401101978|gb|EJQ09965.1| hypothetical protein IE3_03541 [Bacillus cereus BAG3X2-1]
Length = 598
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 9/111 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA--GSFQGYTGHVDGKPNEARF 134
PY + +G +F D N I+K G++VA GS G G + +F
Sbjct: 314 PYDVERDTNGNVFVSDSFNHRILKYDIS------GKIVAKWGSLFGTGGPLGFGSLPGQF 367
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA-GFRDGPS 184
P+ + D N+YV+D++N I+K ++G+ + G V GF PS
Sbjct: 368 FVPRQIATDRYNNVYVSDSVNHRIQKFTNSGIALASYGSFGVLPGFFQFPS 418
>gi|402594976|gb|EJW88902.1| hypothetical protein WUBG_00188 [Wuchereria bancrofti]
Length = 788
Score = 38.5 bits (88), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 79/182 (43%), Gaps = 23/182 (12%)
Query: 48 APQADGNVLQFENGYLVETVIEGNEIGVVP---YKIRVSEDGELFAVDEVNSNIVKITPP 104
+ Q+ G + +NG+ E +G+ P ++ S D EL E N I + PP
Sbjct: 444 SSQSLGTFVTVDNGF--SNQYEKWSMGLEPASGLQLAESPDLEL----EKNMGINNLYPP 497
Query: 105 LSQYSRGRLVAGSFQGYTGHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDA 164
+Q R +++ G G ++G+ F P GV ++ + ++ VADT N I+
Sbjct: 498 RAQIKRQKMIYHCKFGEFGVMEGQ-----FTEPSGVAVNAQNDIIVADTNNHRIQVFDKE 552
Query: 165 GVTTIAGGKSNVAGFRDG----PSEDA--KFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 218
G G+ G RDG P+ A + + DF V PT + + ++ LR+
Sbjct: 553 GRFKFQFGE---CGKRDGQLLYPNRVAVNRLTGDFIVTERSPTHQIQIYNQYGQFLRKFG 609
Query: 219 LN 220
N
Sbjct: 610 AN 611
>gi|351706822|gb|EHB09741.1| NHL repeat-containing protein 2 [Heterocephalus glaber]
Length = 726
Score = 38.5 bits (88), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 31/51 (60%), Gaps = 3/51 (5%)
Query: 123 GHVDGKPNEARFNHPKGVTMDDKGNLYVADTLNLAIRKIGDAG--VTTIAG 171
G DG +E+ FN P+GV M D +YVADT N IRKI V+T+AG
Sbjct: 267 GRKDGLFSESTFNSPQGVAMMDN-VIYVADTENHLIRKIDLEAERVSTVAG 316
>gi|423509851|ref|ZP_17486382.1| hypothetical protein IG3_01348 [Bacillus cereus HuA2-1]
gi|402456083|gb|EJV87861.1| hypothetical protein IG3_01348 [Bacillus cereus HuA2-1]
Length = 598
Score = 38.5 bits (88), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 9/111 (8%)
Query: 77 PYKIRVSEDGELFAVDEVNSNIVKITPPLSQYSRGRLVA--GSFQGYTGHVDGKPNEARF 134
PY + +G +F D N I+K G++VA GS G G + +F
Sbjct: 314 PYDVERDTNGNVFVSDSFNHRILKYDIS------GKIVAKWGSLFGTGGPLGFGSLPGQF 367
Query: 135 NHPKGVTMDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA-GFRDGPS 184
P+ + D N+YV+D++N I+K ++G+ + G V GF PS
Sbjct: 368 FVPRQIATDRYNNVYVSDSVNHRIQKFTNSGIALASYGSFGVLPGFFQFPS 418
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.134 0.387
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,067,558,148
Number of Sequences: 23463169
Number of extensions: 352615223
Number of successful extensions: 710009
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 483
Number of HSP's successfully gapped in prelim test: 681
Number of HSP's that attempted gapping in prelim test: 705261
Number of HSP's gapped (non-prelim): 4212
length of query: 486
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 339
effective length of database: 8,910,109,524
effective search space: 3020527128636
effective search space used: 3020527128636
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 79 (35.0 bits)