BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 004302
(762 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255560179|ref|XP_002521107.1| catalytic, putative [Ricinus communis]
gi|223539676|gb|EEF41258.1| catalytic, putative [Ricinus communis]
Length = 767
Score = 936 bits (2419), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/777 (60%), Positives = 579/777 (74%), Gaps = 25/777 (3%)
Query: 1 MALRNGCLRRISRFLPHIYSGSHFQQSRRAIINSLASSLITFP-------RECEQISRNG 53
M+ R R+IS+ LP I G+ + R+A A S I+ P R + R
Sbjct: 1 MSRRYQRFRQISKLLPRILPGNQLHKCRKATSIHSAFSTISVPNSDYDGMRNTLILHRGF 60
Query: 54 VNFSFSTIAQAS-PAESLSQSDTLSFIESTFNEFQGPHHLWFNIVED-NIHFFKRGGAFL 111
+ +STI++ S + S D LSFI+STFNE QGP+H W N V+ + + G FL
Sbjct: 61 YSQRYSTISEVSHESNSPPVDDVLSFIKSTFNELQGPNHCWLNKVDHRDKDCLNKDGIFL 120
Query: 112 VLAGRFVDNCDSLIAGCGTVVTFEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEML 171
++AG+ ++N V EK+KSIQQ FPQL ++GF G S SA D+TRLVE++
Sbjct: 121 LIAGQLINNSQ-------IVFMIEKIKSIQQRFPQLCIVGFHCGSSIGSADDRTRLVELI 173
Query: 172 MKEYITFPILLSNKNFPQMENGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQE 231
MKE++TFP+LLS+KNF QMENGACY+L KDF N+ ++H+ LDI +LNKA+EEL MQQ
Sbjct: 174 MKEFLTFPVLLSSKNFLQMENGACYILFKDFKNSVIYHDRDLDIEILNKAIEELHMQQ-- 231
Query: 232 NSSSPSG------LKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSN 285
N + SG LK +W KQAEV KEP + S ++NL+L+FPGC+SADESG+RLFLSDSN
Sbjct: 232 NGYTNSGISNLRDLKSSWVKQAEVTKEPCSSSFLQNLVLYFPGCVSADESGDRLFLSDSN 291
Query: 286 HHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRR 345
HHRII+FDGNGKI+D IGSCPGFEDGEFES+KL+RPAASFYH +DCLYIVD+EN AIRR
Sbjct: 292 HHRIIIFDGNGKIMDSIGSCPGFEDGEFESAKLLRPAASFYHNSEDCLYIVDAENQAIRR 351
Query: 346 ADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSE 405
ADM RRVLET+YPT ISK N+S+W WI+ K+GF R++D KS++ D Q L+FPWHL KS
Sbjct: 352 ADMERRVLETLYPTCSISKNNSSVWTWIVNKMGFGRNSDMKSKEFDSQLLMFPWHLFKSV 411
Query: 406 DDNLLIINRSFETLWIMDLASGEIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQ 465
DD+LLIINRSFE+LWIMDLASG+IKE ++GF K+LE CG L+ EKV LLKQMP DWL Q
Sbjct: 412 DDSLLIINRSFESLWIMDLASGKIKEIIRGFPKILETCGQLITEKVSLLKQMPNDWLQQQ 471
Query: 466 IDSSCSLKELPYAGLISSSIAFQNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLP 525
ID+SCS + LP+A L+SS FQNH+++CD V QR+++LNRESG+CSN QFSNF ILG P
Sbjct: 472 IDASCSPEGLPFASLLSSVTTFQNHLIMCDTVAQRVVKLNRESGICSNIQFSNFGILGFP 531
Query: 526 YWFAFPLERVYAVAGGHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWR 585
YW +FPLERVY+ A G W DH+Q SLLPGRIDI++NVDIP D +LVE LQEGCIWR
Sbjct: 532 YWSSFPLERVYSEAPP-DGGWMDHLQSFSLLPGRIDIRLNVDIPVDVDLVEPLQEGCIWR 590
Query: 586 QARGTASVVLRAEDVAGSLEKVGVAQLWYDELDTLALSTPESESNIEDETTTSDLRSEDD 645
QARG A+ +L E V G+ EKVGVAQ WYD+LD LA STPESE ED +SD++SED
Sbjct: 591 QARGAATEILGREGVVGTSEKVGVAQQWYDDLDNLAFSTPESEMVKEDSCASSDVKSEDK 650
Query: 646 TVHIDCAVNTSPGTSEVIISAALYLKLRRYPDQQDDGREKYAARISDILKLGRSGAMQRD 705
VHIDC+VNTSPGTSEVII ALYLKLRR PD Q+ +EKYAARI+DIL R G + RD
Sbjct: 651 RVHIDCSVNTSPGTSEVIIYVALYLKLRRDPDSQEVSQEKYAARIADILNPARKGGIGRD 710
Query: 706 SFIRFLLKSNQDLRDVIFVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSLNT 762
S I+ LLKSN DLRD+IF++PLHV I+ D DHPK++N KDIILT+ ++EV+V L T
Sbjct: 711 SCIQLLLKSNADLRDLIFMRPLHVRIKMDCPDHPKSENGKDIILTNDSIEVNVMLQT 767
>gi|359480128|ref|XP_002267331.2| PREDICTED: uncharacterized protein LOC100267131 [Vitis vinifera]
Length = 705
Score = 856 bits (2212), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/703 (59%), Positives = 532/703 (75%), Gaps = 4/703 (0%)
Query: 58 FSTIAQASPAESLSQSDTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLAGRF 117
FSTI++ S ES + + L +IES F+ +GP H W N VE++ +F KR FLVLAG F
Sbjct: 7 FSTISELS-HESCPEVEILPYIESAFDSLEGPSHCWLNKVEESKNFSKRDRIFLVLAGVF 65
Query: 118 VDNCDSLIAGCGTVVTFEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYIT 177
+ DSL+ G +VV FEK+KS+QQ +P L V+GFL GCS+ S D++ L++++MKEYIT
Sbjct: 66 CE--DSLMLGFDSVVMFEKLKSLQQRYPLLHVMGFLSGCSSFSGADKSHLLQIIMKEYIT 123
Query: 178 FPILLSNKNFPQMENGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPS 237
FPILLSNK+F +M GACY+L K+F + ++HE +D+G LNKAVEEL Q + S++
Sbjct: 124 FPILLSNKDFSEMAGGACYILFKEFRSPLIYHEKDVDVGTLNKAVEELYTQDKGESATLQ 183
Query: 238 GLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK 297
K KQ++ KEP+ S++NLLL++PGCIS DESGNRLFLSDSNHHRII+F GNGK
Sbjct: 184 NWKSNGLKQSDSTKEPYV-GSLQNLLLYYPGCISVDESGNRLFLSDSNHHRIIIFSGNGK 242
Query: 298 ILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVY 357
ILDCIGS PGFEDGEFES+KL RPAASFYH D+DCLY VDSENHAIRRADMG RVLET+Y
Sbjct: 243 ILDCIGSAPGFEDGEFESAKLARPAASFYHADEDCLYFVDSENHAIRRADMGTRVLETLY 302
Query: 358 PTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFE 417
P +KK N LW+WI+ KLG E+D DTK + D L+FPWHL+K D +L IINRSFE
Sbjct: 303 PPCNTNKKKNGLWSWIVNKLGMEKDADTKPAEFDSGLLMFPWHLIKLVDGDLFIINRSFE 362
Query: 418 TLWIMDLASGEIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPY 477
TLWIM LASGEIKE V+G KVLEICG ++MEK+ +L +MP DWL Q+DS+ SL+ +PY
Sbjct: 363 TLWIMALASGEIKEVVRGIPKVLEICGEMIMEKLAVLNKMPHDWLQQQVDSNFSLEGIPY 422
Query: 478 AGLISSSIAFQNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVYA 537
AGL+SS FQ+ I++CD V QRI+RLN ESG +N +FSNF ILGLPYWF+FPLERV A
Sbjct: 423 AGLMSSLATFQDDIVICDTVAQRILRLNGESGDFTNLEFSNFGILGLPYWFSFPLERVCA 482
Query: 538 VAGGHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRA 597
V +G+ DH Q S LPG+I+I++ V+IP DTELVE LQ+GCIWR ARG A+VV
Sbjct: 483 VGNLIRGANADHFQSFSFLPGKINIQLMVEIPEDTELVEPLQDGCIWRLARGAAAVVSGV 542
Query: 598 EDVAGSLEKVGVAQLWYDELDTLALSTPESESNIEDETTTSDLRSEDDTVHIDCAVNTSP 657
ED+ S KVGVAQ WYDELD LA TPE ES+ E+E TT D +++ V IDCAVNTSP
Sbjct: 543 EDLVASSAKVGVAQQWYDELDNLAFFTPEPESDAEEENTTLDTPFQEEKVRIDCAVNTSP 602
Query: 658 GTSEVIISAALYLKLRRYPDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLKSNQD 717
GTSEVI+ AALYLKL+R P+ Q+ EK AARI +IL+ +SG + R+S I+FL K N+D
Sbjct: 603 GTSEVIVYAALYLKLKRNPNSQERNPEKNAARILNILEPEKSGKITRESCIQFLSKWNED 662
Query: 718 LRDVIFVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSL 760
+ D+IF+KPLHV I ++L HPKA+N+K+ +LT+++++V+V L
Sbjct: 663 VEDLIFMKPLHVRIGLESLSHPKAENAKETVLTETSIQVNVFL 705
>gi|449453417|ref|XP_004144454.1| PREDICTED: uncharacterized protein LOC101209700 [Cucumis sativus]
Length = 836
Score = 726 bits (1873), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/763 (49%), Positives = 512/763 (67%), Gaps = 15/763 (1%)
Query: 1 MALRNGCLRRISRFLPHIYSGSHFQQSRRAIINSLASSLITFPRECEQISRN-GVNFSFS 59
MA R L+ ISR +P IYS + Q RR ++SLA S+ F R E+I R N +
Sbjct: 88 MAFRFRRLKEISRSIPQIYSEFYHQHHRRYGVSSLALSVAPF-RVSERIGRRLFYNGRYF 146
Query: 60 TIAQASPAESLSQSDTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLAGRFVD 119
T + P + L + ++ +GP+H W N +N F+ G +L+LA +F++
Sbjct: 147 TSCGSPPYKPLYKLIRFYHLDCLVRR-RGPNHYWLNTSNENKVIFEEDGKYLILANQFLE 205
Query: 120 NCDSLIAGCGTVVTFEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFP 179
S +VV E VK +QQ FP L VIGF ST+S +++ +++ +M+EYI+FP
Sbjct: 206 MTSS-----DSVVLVENVKFLQQRFPHLHVIGF-QCSSTLSVAEKSDMIQFIMREYISFP 259
Query: 180 ILLSNKNFPQMENGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGL 239
ILLSNK F E CY++SKD N + E +D+ +L KA+EEL + E S +
Sbjct: 260 ILLSNKIF---EVAGCYIISKDLSNPLLVSERGMDLSILRKAIEELHEPENEKSGLSNMG 316
Query: 240 KCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKIL 299
K T+ KQAE++KEP++CS + N LLH+PGCISADE G RLFLSDSNH+RI++F+ GKIL
Sbjct: 317 KTTYLKQAEMIKEPNSCSFMHNFLLHYPGCISADEEGGRLFLSDSNHNRIVIFNSYGKIL 376
Query: 300 DCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
D IGS PGFEDGEFE KL RPAASFYH +CLY VDSENHAIR+AD+G+RV+ET+YP
Sbjct: 377 DMIGSYPGFEDGEFELVKLARPAASFYHSTQNCLYFVDSENHAIRKADLGKRVVETLYPE 436
Query: 360 SGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETL 419
+ +KK+ W+WIM+K G D + + +PQS++FPWH+++ DD LLI+NRS TL
Sbjct: 437 NYSNKKSTQFWSWIMDKFGLGSIPDREVKDFNPQSIMFPWHMIRYMDDRLLILNRSLGTL 496
Query: 420 WIMDLASGEIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAG 479
W MDL SG+I E V+G S+++E G L+M+++ ++KQ+P L D++ ++ PY
Sbjct: 497 WTMDLVSGKIIEVVRGLSRIMESYGQLIMDRLSVIKQIPDGMLQRPSDANIAIGGSPYLD 556
Query: 480 LISSSIAFQNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVYAVA 539
L+SS +F+N I++CD VGQ +++ N++SG CS+FQFSNF +LGLPYWFA PL
Sbjct: 557 LLSSLTSFENCIIICDSVGQVVLKCNKKSGECSSFQFSNFGVLGLPYWFA-PLPEKVITT 615
Query: 540 GGHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAED 599
+G+ DH+Q LLPG + I++NVD+PSD ELVESL E IWRQARGTA+ + E
Sbjct: 616 AEFRGAGIDHLQFFKLLPGEVGIQINVDLPSDIELVESLHEDSIWRQARGTATEISIVEQ 675
Query: 600 VAGSLEKVGVAQLWYDELDTLALSTPESESNIEDETTTSDLRSEDDTVHIDCAVNTSPGT 659
VAG EKVG AQ WYDELD+LA S ESE +ED + D+ VHI+CAVNTSPGT
Sbjct: 676 VAGPSEKVGSAQQWYDELDSLAFSPQESEM-VEDNVRAQNYIG-DNKVHIECAVNTSPGT 733
Query: 660 SEVIISAALYLKLRRYPDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLKSNQDLR 719
SEVI+ AALYL+LRR D + +G + +A RI+D L G M +++ I+FL+ S +DLR
Sbjct: 734 SEVIVYAALYLRLRRNQDSEGNGEKHHATRIADFLYPESRGKMIKENCIQFLINSKRDLR 793
Query: 720 DVIFVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSLNT 762
++IFVKPLHV I+ D+ HPKA+NSK IILT S +EV+VSL++
Sbjct: 794 ELIFVKPLHVRIKLDSSGHPKAENSKSIILTKSKVEVNVSLSS 836
>gi|356570396|ref|XP_003553375.1| PREDICTED: uncharacterized protein LOC100806465 [Glycine max]
Length = 765
Score = 713 bits (1841), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/784 (48%), Positives = 509/784 (64%), Gaps = 42/784 (5%)
Query: 1 MALRNGCL-RRISRFL-PHIYSGSHFQQSRRAIINSLASSLITFPRECEQIS-----RNG 53
MALR+ L +++S+FL P + +G+ QQS +A+ N +A S++ P I+ R
Sbjct: 1 MALRHRRLTKQLSKFLTPVLNAGNCCQQSSKAL-NLVALSIMHMPSAQSPIANWTQVRGF 59
Query: 54 VNFSFSTIAQASPAESLSQSDTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVL 113
+ ST + E ++ D LSFI+++ +EF+G HH W N ++N F G FLVL
Sbjct: 60 REYRISTKSDEL-CEPATEIDLLSFIKASLDEFEGTHHYWLNRSDNNDQLFGADGIFLVL 118
Query: 114 AGRFVDNCDSLIAGCGTVVTFEKVKSIQQSFPQLQVIGFLHGCSTI-SAVDQTRLVEMLM 172
+ R D CG + EK+K+IQ+ FP + ++G TI S+ DQ L+++LM
Sbjct: 119 SARVFD--------CG--IKLEKLKTIQKRFPHINIMGL----KTIHSSSDQVHLIQLLM 164
Query: 173 KEYITFPILLSNKNFPQMENGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQEN 232
E ITFPILLS + FP++E GACY+L K+F + ++HE + +L+KAV+EL Q +
Sbjct: 165 TENITFPILLSRQKFPKIEKGACYILFKNFRSPVIYHEKDAGLEILSKAVQELQEQPGGD 224
Query: 233 SSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVF 292
S S + ++CT KQ ++K+ + S ++NLLL++PGC+S DES NRLF SD NHHRI+V
Sbjct: 225 SKSLNVVRCTSLKQDGIIKDECSFSPLQNLLLYYPGCVSTDESNNRLFFSDCNHHRILVS 284
Query: 293 DGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRV 352
GNG+ILDCIGS PGFEDG+FES+KL RPA S+YH +DCLY VDSEN+AIR+ADMG R
Sbjct: 285 GGNGEILDCIGSSPGFEDGDFESAKLRRPAGSYYHATEDCLYFVDSENNAIRKADMGART 344
Query: 353 LETVYPTSGISKKNNSLWAWIMEKLGFERDNDT----KSEKLDPQSLIFPWHLMKSEDDN 408
+ET+YPTS +K +W WIM KLG E T +SE D + L FPWHL+KS DD
Sbjct: 345 VETLYPTSAPNKGGIQIWNWIMSKLGLESSGKTSVQERSEVFDSK-LYFPWHLLKSPDDT 403
Query: 409 LLIINRSFETLWIMDLASGEIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDS 468
I++R F+TLW+MD+ SG+I E +G ++L+ICG L+M+ + ++ Q+P DW Q +
Sbjct: 404 FYIMDRRFQTLWVMDINSGKIDEVFEGSPRILKICGQLIMKSLSIIDQIPSDWFQQQTKN 463
Query: 469 SCSLKELPYAGLISSSIAFQNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWF 528
L L ++ L+SS NHI +CD VGQRI+++NRESGVCSNF SN ILGLPYW
Sbjct: 464 GFLLGGLQHSDLLSSLATLHNHIFICDPVGQRILKVNRESGVCSNFWLSNLGILGLPYWL 523
Query: 529 AFPLERVYAV-----------AGGHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVES 577
FPLE YAV A G G+ DH+Q LLPGRIDI + VD+P D ELV
Sbjct: 524 NFPLETFYAVELQYSTALSVKANGLSGTPIDHLQHFDLLPGRIDINLRVDLPMDIELVVP 583
Query: 578 LQEGCIWRQARGTASVVLRAEDVAGSLEKVGVAQLWYDELDTLALSTPESESNIEDETTT 637
LQE CIW QARG A+ ++V SL K G+AQ WYDELD LA PESE N++D+
Sbjct: 584 LQESCIWYQARGAATETSGMDEV--SLNKSGLAQQWYDELDDLAAPKPESEINVQDDNLD 641
Query: 638 SDLRSEDDTVHIDCAVNTSPGTSEVIISAALYLKLRRYPDQQDDGREKYAARISDILKLG 697
ED+ V V TSPGTSEVII AALY KLR P + RE++AARI DIL
Sbjct: 642 KKSVVEDEKVPFSVGVFTSPGTSEVIIYAALYCKLRSVPKSNEGNREEHAARILDILSSK 701
Query: 698 RSGAMQRDSFIRFLLKSNQDLRDVIFVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVD 757
RSG ++RD + FLL+S DLRD+IF+KPLH+ ++ DHPKADN +D ILTDS+++VD
Sbjct: 702 RSGKIERDLWKAFLLQSKGDLRDLIFMKPLHIRLRLSCQDHPKADNGRDFILTDSSIKVD 761
Query: 758 VSLN 761
V LN
Sbjct: 762 VLLN 765
>gi|186509871|ref|NP_187362.3| NHL domain-containing protein [Arabidopsis thaliana]
gi|332640973|gb|AEE74494.1| NHL domain-containing protein [Arabidopsis thaliana]
Length = 774
Score = 680 bits (1755), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/698 (49%), Positives = 470/698 (67%), Gaps = 19/698 (2%)
Query: 74 DTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLAGRFVDNCDSLIAGCGTVVT 133
D LSFI+++ ++ +GP H W N N FK G ++VLAG +D L
Sbjct: 85 DLLSFIKASLDKLEGPSHHWLNRDFGNKQLFKDKGTYVVLAGHLLDGTSDLSG------F 138
Query: 134 FEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPILLSNKNFPQMENG 193
FEK+K +QQ P + +G A D+T L E+++KEY+TFP+LLS K FP+
Sbjct: 139 FEKLKLLQQRSPGVCFMGIHFSDQARIADDRTALAELILKEYLTFPVLLSEKEFPKTSGE 198
Query: 194 ACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEP 253
Y++ KDF N ++ E LDI + KA++ L+ Q E S S TW+KQAE +KE
Sbjct: 199 VRYIVFKDFKNPLIYEEKDLDIASVVKALDSLLTQDTEKSKSVRLFTNTWSKQAEAIKES 258
Query: 254 HACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEF 313
H S ++LLL+FPGCISADE G+RLFLSD+NHHRII+F+ +GKI+D IG PGFEDG+F
Sbjct: 259 HFPSFFQDLLLYFPGCISADEVGDRLFLSDTNHHRIIIFENSGKIVDSIGCFPGFEDGDF 318
Query: 314 ESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWI 373
ES+K++RP + Y + +DCLYIVDSENHAIRRA++ RVLETVYP + KK LW+WI
Sbjct: 319 ESAKMLRPTGTLYDEAEDCLYIVDSENHAIRRANINSRVLETVYPK--VIKKTGGLWSWI 376
Query: 374 MEKLGFERDNDT------KSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASG 427
MEK+G +D+DT KSE+ D +SL+FPWH++K +D++LL+IN+SF LWI++ ASG
Sbjct: 377 MEKMGLGKDDDTTVDADTKSEEFDARSLLFPWHILKRDDESLLVINKSFSKLWIINFASG 436
Query: 428 EIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGLISSSIAF 487
EI+E V+GFSK++EICG + EK+ +L+ MP +WL Q + S KE P A L+SS
Sbjct: 437 EIEEVVEGFSKIIEICGQSITEKLSVLEHMPSNWLQQQTAAIASFKEQPSASLLSSFTKL 496
Query: 488 QNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVYAVAGGHQGSWT 547
+ I++ DI QR+++LNR+SG CS+ QFSN ILGLPYW PLERV+ +A G Q +
Sbjct: 497 GDDIVMTDIACQRVLKLNRDSGACSSIQFSNSGILGLPYWLFIPLERVFNLANGVQEAHL 556
Query: 548 DHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAEDVAGSLEKV 607
H Q LLPG+I I++N++IP TELVE +QE CIWRQ RG S A EK+
Sbjct: 557 SHTQELRLLPGKISIRLNIEIPPCTELVEPIQESCIWRQTRGAISEFSSAGSAVEPSEKI 616
Query: 608 GVAQLWYDELDTLA--LSTP--ESESNIEDETTTSDLRSEDDTVHIDCAVNTSPGTSEVI 663
GV+Q WYDELD+LA ++ P E ED + R ED +HIDC V TSPG+SE+I
Sbjct: 617 GVSQQWYDELDSLAKEIANPEAAEEEEEEDVNPSEVDREEDGRIHIDCPVKTSPGSSELI 676
Query: 664 ISAALYLKLRRYPDQQDDGREKYAARISDILKLGRS-GAMQRDSFIRFLLKSNQDLRDVI 722
+ AALYL+L R + + +E+ A +I+ ILK R+ M+ D F+ L KS ++LRD++
Sbjct: 677 VYAALYLRLARNEETESATQEELARKIAKILKPVRNITTMKEDLFVNLLSKSKRELRDIV 736
Query: 723 FVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSL 760
F+KP+HV I+ D+ DHPKADNS+D+ILTDS++EVDVSL
Sbjct: 737 FIKPMHVRIRLDSKDHPKADNSRDVILTDSSVEVDVSL 774
>gi|297829252|ref|XP_002882508.1| EMB1974 [Arabidopsis lyrata subsp. lyrata]
gi|297328348|gb|EFH58767.1| EMB1974 [Arabidopsis lyrata subsp. lyrata]
Length = 771
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/698 (49%), Positives = 473/698 (67%), Gaps = 20/698 (2%)
Query: 74 DTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLAGRFVDNCDSLIAGCGTVVT 133
D LS+I+++ ++ +GP H W N N FK G ++VLAG ++ L
Sbjct: 83 DLLSYIKASLDKLEGPSHHWLNRDIGNKQLFKDKGTYVVLAGHLLNETSDLSG------F 136
Query: 134 FEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPILLSNKNFPQMENG 193
FEK+K +QQ P + + A D+T L E+++KEY+TFP+LLS K FP+
Sbjct: 137 FEKLKLLQQRSPGVCFMCIHFSDQAQIADDRTALAELIVKEYLTFPVLLSEKEFPKTTGE 196
Query: 194 ACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEP 253
Y++ KDF N ++ E LDI + KA++ L Q E S S TW+KQAE +KE
Sbjct: 197 VRYIVFKDFKNPLIYEEKDLDIASVAKALDSL-TQYTEKSKSVRIFTNTWSKQAEAIKES 255
Query: 254 HACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEF 313
H S ++LLL+FPGCI+ADE G RLFLSD+NHHRII+F+ NGKILD IG PGFEDGEF
Sbjct: 256 HFPSFFQDLLLYFPGCITADEVGERLFLSDTNHHRIIMFEKNGKILDSIGCFPGFEDGEF 315
Query: 314 ESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWI 373
ES+K++RP + Y +++DCLYIVDSENHAIRRA++ RVLETVYP + KK+ LW+WI
Sbjct: 316 ESAKMLRPTGTLYDEEEDCLYIVDSENHAIRRANINSRVLETVYPK--VIKKSGGLWSWI 373
Query: 374 MEKLGFERDNDT------KSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASG 427
MEK+G +D+DT KSE+ D +SL+FPWH++K D++LL+IN+SF LWI+++A+
Sbjct: 374 MEKMGLGKDDDTTVDADAKSEEFDARSLLFPWHILKRNDESLLVINKSFSKLWIINIATR 433
Query: 428 EIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGLISSSIAF 487
EI+E V+GFSK++EICG + EK+ +LK MP +WL Q ++ S KE P A L+SS
Sbjct: 434 EIEEVVEGFSKIMEICGQSITEKLSVLKHMPSNWLQQQTEAIISCKEQPSASLLSSFTKL 493
Query: 488 QNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVYAVAGGHQGSWT 547
+ I++ DI QR+++LN +S CS+ QFSN ILGLPYW PLERV+ +A G Q +
Sbjct: 494 GDDIVMTDIDCQRVLKLNIDSRACSSIQFSNIGILGLPYWLFIPLERVFNLANGVQEAHL 553
Query: 548 DHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAEDVAGSLEKV 607
H Q LLPG+I +++N++IP TELVE +QE CIWRQ RG S V A EKV
Sbjct: 554 SHTQELRLLPGKISLRLNIEIPPCTELVEPIQESCIWRQTRGAISEVSSAGSAVEPSEKV 613
Query: 608 GVAQLWYDELDTLA--LSTPES--ESNIEDETTTSDLRSEDDTVHIDCAVNTSPGTSEVI 663
GV+Q WYDELD+LA ++ PE+ E ED + R ED +HIDC V TSPG+SE+I
Sbjct: 614 GVSQQWYDELDSLAKEIANPEAVEEEEEEDVNPSEVEREEDGRIHIDCTVKTSPGSSELI 673
Query: 664 ISAALYLKLRRYPDQQDDGREKYAARISDILKLGRSG-AMQRDSFIRFLLKSNQDLRDVI 722
+ AALYL+L R + + +E+ A RI++ILK R+ M+ D F++ L KS ++LRD++
Sbjct: 674 VYAALYLRLPRNEETESASQEELARRIAEILKPVRNNTTMKEDLFVKLLSKSKRELRDIV 733
Query: 723 FVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSL 760
F+KP+HV I+ D++DHPKADNS+D+ILTDS++EVD+SL
Sbjct: 734 FMKPMHVRIRLDSMDHPKADNSRDVILTDSSVEVDISL 771
>gi|357503569|ref|XP_003622073.1| Ribosomal protein L15 [Medicago truncatula]
gi|355497088|gb|AES78291.1| Ribosomal protein L15 [Medicago truncatula]
Length = 816
Score = 641 bits (1653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/691 (47%), Positives = 460/691 (66%), Gaps = 19/691 (2%)
Query: 74 DTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLAGRFVDNCDSLIAGCGTVVT 133
D LSFI+S+ + +G H W N N FF G FLVLA D CG +
Sbjct: 142 DLLSFIKSSLDTLEGTDHYWLNRSVKNEEFFGIHGTFLVLAANNFD--------CG--IM 191
Query: 134 FEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPILLSNKNFPQMENG 193
F+K+K+IQ+ FP + ++G I++ D+ ++ LM E ITFPILLS + FPQ++ G
Sbjct: 192 FQKLKTIQERFPHITIMGI----KLINSSDRENQIQFLMTENITFPILLSQRTFPQIKEG 247
Query: 194 ACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEP 253
ACY+L ++F N ++++EN + +L + ++EL MQ ++S + ++ T +Q + K+
Sbjct: 248 ACYILFRNFKNPKIYNENDVSPEILCQDIQELKMQPSDDSDWLNVVRSTTWRQDLIAKDE 307
Query: 254 HACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEF 313
+ CS ++NL+L++PGC+SADES NRLF+SD NHHRIIV D NGKI+DCIGS PGFEDG+F
Sbjct: 308 YICSPLQNLVLYYPGCVSADESANRLFISDCNHHRIIVCDDNGKIMDCIGSSPGFEDGDF 367
Query: 314 ESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWI 373
ES+KL RPA S+Y+ +DCLY +DSENHAIRRADM R++ET+YP S K ++ WI
Sbjct: 368 ESAKLRRPAGSYYNATEDCLYFLDSENHAIRRADMEARLVETLYPIS-TDNKGGGIFNWI 426
Query: 374 MEKLGFERD--NDTKSEKLDPQSLIFPWHLMKS-EDDNLLIINRSFETLWIMDLASGEIK 430
+ KLG E N KSE LDP+ L FPWHL+KS +DD + II+R F+TLW MD SG++
Sbjct: 427 LNKLGLETSVRNMEKSEVLDPKRLYFPWHLLKSDDDDTIYIIDRRFQTLWTMDSGSGKVD 486
Query: 431 EAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGLISSSIAFQNH 490
+ +G K+LEICG L+ + + ++P D + ++ +L LP++ +SS QNH
Sbjct: 487 KIFEGSPKILEICGQLIRQNLSTFDKIPCDQFQQKTNNVFALDGLPHSDRLSSLTTLQNH 546
Query: 491 ILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVYAVAGGHQGSWTDHI 550
+ +CD V QRI+++N ESGVC +F+ SNF +LG PYW PLE YA G + DH+
Sbjct: 547 MFICDKVRQRILKVNIESGVCLDFELSNFGLLGFPYWLNSPLETCYAGGNGLSDTAIDHL 606
Query: 551 QRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAEDVAGSLEKVGVA 610
Q+ LLPG IDIK++VD+ +D E+VE L+E CIWRQARG A+ + +D S++KVGVA
Sbjct: 607 QQFDLLPGNIDIKLSVDVHADIEVVEPLRESCIWRQARGAAAEITGMDD-PRSMDKVGVA 665
Query: 611 QLWYDELDTLALSTPESESNIEDETTTSDLRSEDDTVHIDCAVNTSPGTSEVIISAALYL 670
Q WYDELD LA + ES + ++ + EDD + I+ V TSPGTSEVII A LY
Sbjct: 666 QQWYDELDDLASPKADPESEVIEDDLDQNTAMEDDKIRINSCVGTSPGTSEVIIFAVLYC 725
Query: 671 KLRRYPDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLKSNQDLRDVIFVKPLHVS 730
KLR+ P+ +EKYAARI D L RSG +RDS+ +LL+S DLRD+IF K +H+
Sbjct: 726 KLRKIPNTNYGNQEKYAARILDFLSSKRSGKRERDSWNAYLLQSKGDLRDLIFTKLVHIR 785
Query: 731 IQFDTLDHPKADNSKDIILTDSNMEVDVSLN 761
++ +T DHPKA+N +D ILTDS ++V+V LN
Sbjct: 786 VRINTSDHPKAENDRDFILTDSTIKVNVLLN 816
>gi|357503571|ref|XP_003622074.1| Ribosomal protein L15 [Medicago truncatula]
gi|355497089|gb|AES78292.1| Ribosomal protein L15 [Medicago truncatula]
Length = 814
Score = 628 bits (1619), Expect = e-177, Method: Compositional matrix adjust.
Identities = 325/693 (46%), Positives = 455/693 (65%), Gaps = 21/693 (3%)
Query: 72 QSDTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLAGRFVDNCDSLIAGCGTV 131
+ D LSFI+S+ + +G H W N N FF G FLVLA D CG
Sbjct: 140 EVDLLSFIKSSLDTLEGTDHYWLNRSVKNEEFFGIHGTFLVLAANNFD--------CG-- 189
Query: 132 VTFEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPILLSNKNFPQME 191
+ F+K+K+IQ+ FP + ++G I++ D+ ++ LM E ITFPILLS + FPQ++
Sbjct: 190 IMFQKLKTIQERFPHITIMGI----KLINSSDRENQIQFLMTENITFPILLSQRTFPQIK 245
Query: 192 NGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLK 251
GACY+L ++F N ++++EN + +L + ++EL MQ ++S + ++ T +Q + K
Sbjct: 246 EGACYILFRNFKNPKIYNENDVSPEILCQDIQELKMQPSDDSDWLNVVRSTTWRQDLIAK 305
Query: 252 EPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDG 311
+ + CS ++NL+L++PGC+SADES NRLF+SD NHHRIIV D NGKI+DCIGS PGFEDG
Sbjct: 306 DEYICSPLQNLVLYYPGCVSADESANRLFISDCNHHRIIVCDDNGKIMDCIGSSPGFEDG 365
Query: 312 EFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWA 371
+FES+KL RPA S+Y+ +DCLY +DSENHAIRRADM R++ET+YP S K ++
Sbjct: 366 DFESAKLRRPAGSYYNATEDCLYFLDSENHAIRRADMEARLVETLYPIS-TDNKGGGIFN 424
Query: 372 WIMEKLGFERD--NDTKSEKLDPQSLIFPWHLMKS-EDDNLLIINRSFETLWIMDLASGE 428
WI+ KLG E N KSE LDP+ L FPWHL+KS +DD + II+R F+TLW MD SG+
Sbjct: 425 WILNKLGLETSVRNMEKSEVLDPKRLYFPWHLLKSDDDDTIYIIDRRFQTLWTMDSGSGK 484
Query: 429 IKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGLISSSIAFQ 488
+ + +G K+LEICG L+ + + ++P D + ++ +L LP++ +SS Q
Sbjct: 485 VDKIFEGSPKILEICGQLIRQNLSTFDKIPCDQFQQKTNNVFALDGLPHSDRLSSLTTLQ 544
Query: 489 NHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVYAVAGGHQGSWTD 548
NH+ +CD V QRI+++N ESGVC +F+ SNF +LG PYW PLE YA G + D
Sbjct: 545 NHMFICDKVRQRILKVNIESGVCLDFELSNFGLLGFPYWLNSPLETCYAGGNGLSDTAID 604
Query: 549 HIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAEDVAGSLEKVG 608
H+Q+ LLPG IDIK++VD+ +D E+VE L+E CIWRQAR + + L +VG
Sbjct: 605 HLQQFDLLPGNIDIKLSVDVHADIEVVEPLRESCIWRQARAVWLCICTH---SPFLVQVG 661
Query: 609 VAQLWYDELDTLALSTPESESNIEDETTTSDLRSEDDTVHIDCAVNTSPGTSEVIISAAL 668
VAQ WYDELD LA + ES + ++ + EDD + I+ V TSPGTSEVII A L
Sbjct: 662 VAQQWYDELDDLASPKADPESEVIEDDLDQNTAMEDDKIRINSCVGTSPGTSEVIIFAVL 721
Query: 669 YLKLRRYPDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLKSNQDLRDVIFVKPLH 728
Y KLR+ P+ +EKYAARI D L RSG +RDS+ +LL+S DLRD+IF K +H
Sbjct: 722 YCKLRKIPNTNYGNQEKYAARILDFLSSKRSGKRERDSWNAYLLQSKGDLRDLIFTKLVH 781
Query: 729 VSIQFDTLDHPKADNSKDIILTDSNMEVDVSLN 761
+ ++ +T DHPKA+N +D ILTDS ++V+V LN
Sbjct: 782 IRVRINTSDHPKAENDRDFILTDSTIKVNVLLN 814
>gi|357469427|ref|XP_003604998.1| NHL repeat-containing protein [Medicago truncatula]
gi|355506053|gb|AES87195.1| NHL repeat-containing protein [Medicago truncatula]
Length = 784
Score = 619 bits (1595), Expect = e-174, Method: Compositional matrix adjust.
Identities = 348/801 (43%), Positives = 492/801 (61%), Gaps = 61/801 (7%)
Query: 1 MALRNGCLRRISRFLPHIYSGSHFQQSRRAIINSLASSLITFPRECEQISRNGV--NFSF 58
++LR +++ISR L YSG F+ +NSL P + I +G+ F F
Sbjct: 5 LSLRFRRVKQISRLL--YYSGDCFKHQCDKGLNSL-------PFSPKPIFNDGLVRGFHF 55
Query: 59 STIAQASPAESLSQS----DTLSFIESTFNEFQGPHHLWFNIVEDNIHFF---KRGGAFL 111
++ + L + D LSFI+S+ + +G H W N N FF G FL
Sbjct: 56 HQHRFSTSVDVLHKHAPEVDLLSFIKSSLDTLEGTDHYWLNRSVKNEEFFGIHGIHGTFL 115
Query: 112 VLAGRFVDNCDSLIAGCGTVVTFEKVKSIQQSF--------PQLQVIGFLH--------- 154
VLA D CG + F+K+K+IQ+ + P+L I H
Sbjct: 116 VLAANNFD--------CG--IMFQKLKAIQERYRKVIISIPPKLDGIRDGHFIIVMIIIK 165
Query: 155 -----------GCSTISAVDQTRLVEMLMKEYITFPILLSNKNFPQMENGACYLLSKDFG 203
G I++ D+ ++ LM E ITFPILLS + FPQ++ GACY+L ++F
Sbjct: 166 NCCRFPHITIMGIKLINSSDRENQIQFLMTENITFPILLSQQTFPQIKEGACYILFRNFK 225
Query: 204 NARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLL 263
+ ++++E + +L + + EL MQ ++S + ++ T +Q + K+ + CS ++NL+
Sbjct: 226 SPKIYNEKDVSPEILCQDILELQMQPSDDSDWLNVVRSTTWRQDHIAKDEYICSPLQNLV 285
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAA 323
L++PGC+SADES +RLF+SD NHHRIIV D +GKI+DCIGS PGFEDG+FES+KL RPA
Sbjct: 286 LYYPGCVSADESESRLFISDCNHHRIIVCDDDGKIMDCIGSSPGFEDGDFESAKLRRPAG 345
Query: 324 SFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD- 382
S+Y+ +DCLY +DSENHAIRRADM R++ET+YP S K ++ WI+ KLG E
Sbjct: 346 SYYNATEDCLYFLDSENHAIRRADMEARLVETLYPIS-TDNKGGGIFNWILNKLGLETSV 404
Query: 383 -NDTKSEKLDPQSLIFPWHLMKS-EDDNLLIINRSFETLWIMDLASGEIKEAVKGFSKVL 440
N KSE LDP+ L FPWHL+KS +DD + II+R F+TLW MD SG++ + +G K+L
Sbjct: 405 RNMEKSEVLDPKRLYFPWHLLKSDDDDTIYIIDRRFQTLWTMDSGSGKVDKIFEGSPKIL 464
Query: 441 EICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGLISSSIAFQNHILLCDIVGQR 500
EICG L+ + + ++P D + ++ +L LP++ +SS QNH+ +CD V QR
Sbjct: 465 EICGQLIRQNLSTFDKIPCDQFQQKTNNVFALDGLPHSDRLSSLTTLQNHMFICDKVRQR 524
Query: 501 IMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVYAVAGGHQGSWTDHIQRCSLLPGRI 560
I+++N ESGVC +FQ SNF +LG PYW PLE YA G + DH+Q+ LLPG I
Sbjct: 525 ILKVNIESGVCLDFQLSNFGLLGFPYWLNSPLETCYAGGNGLSDTAIDHLQQFDLLPGNI 584
Query: 561 DIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAEDVAGSLEKVGVAQLWYDELDTL 620
DIK++VD+ +D E+VE L E CIWRQARG A+ + +D S++KVGVAQ WYDELD L
Sbjct: 585 DIKLSVDVHADIEVVEPLHESCIWRQARGAAAEITGMDD-PRSMDKVGVAQQWYDELDDL 643
Query: 621 ALSTPESESNIEDETTTSDLRSEDDTVHIDCAVNTSPGTSEVIISAALYLKLRRYPDQQD 680
A + ES + ++ + EDD + I+ V TSPGTSEVII A LY KLR+ P+ D
Sbjct: 644 ASPKADPESEVTEDDLDQNTAMEDDKIRINSCVGTSPGTSEVIIFAVLYCKLRKIPNSND 703
Query: 681 DGREKYAARISDILKLGRSGAMQRDSFIRFLLKSNQDLRDVIFVKPLHVSIQFDTLDHPK 740
+EKYAARI D L RSG +RDS+ +LL+S DLRD+IF K +H+ ++ +T DHPK
Sbjct: 704 GNQEKYAARILDFLSSKRSGKRERDSWNAYLLQSKGDLRDLIFTKLVHIRVRINTSDHPK 763
Query: 741 ADNSKDIILTDSNMEVDVSLN 761
A+N +D ILTDS ++V+V LN
Sbjct: 764 AENDRDFILTDSTIKVNVLLN 784
>gi|224065332|ref|XP_002301778.1| predicted protein [Populus trichocarpa]
gi|222843504|gb|EEE81051.1| predicted protein [Populus trichocarpa]
Length = 525
Score = 597 bits (1538), Expect = e-167, Method: Compositional matrix adjust.
Identities = 314/548 (57%), Positives = 384/548 (70%), Gaps = 65/548 (11%)
Query: 216 GMLNKAVEELIMQQQ---ENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISA 272
G +EEL +Q+ +N S LK TWAKQAEV+KEP+ CS ++NLLL+FPGC+SA
Sbjct: 40 GGKRNVIEELNVQENMNFDNGISRPKLKTTWAKQAEVIKEPYMCSPLQNLLLYFPGCVSA 99
Query: 273 DESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDC 332
DESGNRLFLSDSNHHRIIV DGNGKILD IGS PGFEDGEFES+KL RPAASFY ++DC
Sbjct: 100 DESGNRLFLSDSNHHRIIVSDGNGKILDSIGSGPGFEDGEFESAKLARPAASFYDDEEDC 159
Query: 333 LYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDP 392
LYIVDSENHAIRRAD+ RVLETVYP S SKKNNS+W WIM+KLG + D KSE+ D
Sbjct: 160 LYIVDSENHAIRRADLESRVLETVYPKS-FSKKNNSIWTWIMDKLGSRINVDAKSEEFDS 218
Query: 393 QSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKEAVKGFSKVLEICGVLVMEKVF 452
Q L+FPWHL+KS D+ LII+RSFETLW++DL SGE+KE +KGF +LE CG L+ KV
Sbjct: 219 QPLVFPWHLLKSVDNTFLIISRSFETLWVIDLVSGEMKECIKGFPNILETCGQLITGKVS 278
Query: 453 LLKQMPQDWLLHQIDSSCSLKELPYAGLISSSIAFQNHILLCDIVGQRIMRLNRESGVCS 512
LL K+LP I L +++ V
Sbjct: 279 LL------------------KQLP------------------------IDYLKQQTDVNC 296
Query: 513 NFQFSNFAILGLPYWFAFPLERVYAVAGGHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDT 572
+ + FP YA + ++ + I C GR+DI++N+DIP DT
Sbjct: 297 SLK-------------EFP----YATLVSNLTTFENDIVLCD--TGRVDIRLNIDIPMDT 337
Query: 573 ELVESLQEGCIWRQARGTASVVLRAEDVAGSLEKVGVAQLWYDELDTLALSTPESESNIE 632
ELVE LQEGCIWRQARG+A+V+L AEDV GS EK GV+Q WYDELD LA STP E E
Sbjct: 338 ELVEPLQEGCIWRQARGSATVILGAEDVVGSSEKAGVSQQWYDELDNLAFSTPGLEMATE 397
Query: 633 DETTTSDLRSEDDTVHIDCAVNTSPGTSEVIISAALYLKLRRYPDQQDDGREKYAARISD 692
+++ TSD+ +D+ +HIDCAVNTSPGTSE+II AALYLKLRR+ D ++ G++K+AARI+D
Sbjct: 398 EDSATSDVNYQDERLHIDCAVNTSPGTSELIIHAALYLKLRRHLDLEEGGQQKHAARIAD 457
Query: 693 ILKLGRSGAMQRDSFIRFLLKSNQDLRDVIFVKPLHVSIQFDTLDHPKADNSKDIILTDS 752
IL GR G +++DS I+ LLKSN +LRD+IFVKPLH+ I DTLDHPKADNSKDIILTDS
Sbjct: 458 ILNPGRGGGLEKDSCIQLLLKSNCNLRDLIFVKPLHLRINLDTLDHPKADNSKDIILTDS 517
Query: 753 NMEVDVSL 760
+EV+VSL
Sbjct: 518 AIEVNVSL 525
>gi|449521828|ref|XP_004167931.1| PREDICTED: uncharacterized LOC101209700, partial [Cucumis sativus]
Length = 491
Score = 553 bits (1425), Expect = e-154, Method: Compositional matrix adjust.
Identities = 270/494 (54%), Positives = 357/494 (72%), Gaps = 3/494 (0%)
Query: 269 CISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHK 328
CISADE G RLFLSDSNH+RI++F+ GKILD IGS PGFEDGEFE KL RPAASFYH
Sbjct: 1 CISADEEGGRLFLSDSNHNRIVIFNSYGKILDMIGSYPGFEDGEFELVKLARPAASFYHS 60
Query: 329 DDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSE 388
+CLY VDSENHAIR+AD+G+RV+ET+YP + +KK+ W+WIM+K G D + +
Sbjct: 61 TQNCLYFVDSENHAIRKADLGKRVVETLYPENYSNKKSTQFWSWIMDKFGLGSIPDREVK 120
Query: 389 KLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKEAVKGFSKVLEICGVLVM 448
+PQS++FPWH+++ DD LLI+NRS TLW MDL SG+I E V+G S+++E G L+M
Sbjct: 121 DFNPQSIMFPWHMIRYMDDRLLILNRSLGTLWTMDLVSGKIIEVVRGLSRIMESYGQLIM 180
Query: 449 EKVFLLKQMPQDWLLHQIDSSCSLKELPYAGLISSSIAFQNHILLCDIVGQRIMRLNRES 508
+++ ++KQ+P L D++ ++ PY L+SS +F+N I++CD VGQ +++ N++S
Sbjct: 181 DRLSVIKQIPDGMLQRPSDANIAIGGSPYLDLLSSLTSFENCIIICDSVGQVVLKCNKKS 240
Query: 509 GVCSNFQFSNFAILGLPYWFAFPLERVYAVAGGHQGSWTDHIQRCSLLPGRIDIKVNVDI 568
G CS+FQFSNF +LGLPYWFA PL +G+ DH+Q LLPG + I++NVD+
Sbjct: 241 GECSSFQFSNFGVLGLPYWFA-PLPEKVITTAEFRGAGIDHLQFFKLLPGEVGIQINVDL 299
Query: 569 PSDTELVESLQEGCIWRQARGTASVVLRAEDVAGSLEKVGVAQLWYDELDTLALSTPESE 628
PSD ELVESL E IWRQARGTA+ + E VAG EKVG AQ WYDELD+LA S ESE
Sbjct: 300 PSDIELVESLHEDSIWRQARGTATEISIVEQVAGPSEKVGSAQQWYDELDSLAFSPQESE 359
Query: 629 SNIEDETTTSDLRSEDDTVHIDCAVNTSPGTSEVIISAALYLKLRRYPDQQDDGREKYAA 688
+ED + D+ VHI+CAVNTSPGTSEVI+ AALYL+LRR D + +G + A
Sbjct: 360 M-VEDNVRAQNYIG-DNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGEKHRAT 417
Query: 689 RISDILKLGRSGAMQRDSFIRFLLKSNQDLRDVIFVKPLHVSIQFDTLDHPKADNSKDII 748
RI+D L G M +++ I+FL+ S +DLR++IFVKPLHV I+ D+ HPKA+NSK II
Sbjct: 418 RIADFLYPESRGKMIKENCIQFLINSKRDLRELIFVKPLHVRIKLDSSGHPKAENSKSII 477
Query: 749 LTDSNMEVDVSLNT 762
LT S +EV+VSL++
Sbjct: 478 LTKSKVEVNVSLSS 491
>gi|357139579|ref|XP_003571358.1| PREDICTED: uncharacterized protein LOC100836563 [Brachypodium
distachyon]
Length = 794
Score = 526 bits (1356), Expect = e-146, Method: Compositional matrix adjust.
Identities = 292/711 (41%), Positives = 436/711 (61%), Gaps = 32/711 (4%)
Query: 60 TIAQASPAESLSQSDTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLAGRFVD 119
++ +PA SQS L FI+STF +G +H W N F + G +LVL
Sbjct: 107 SMEATTPA---SQSQLLGFIKSTFGALEGQNHCWLNSANGIWRTFDQEGIYLVL---LYQ 160
Query: 120 NCDSLIAGCGTVVTFEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFP 179
+C +L + FE++K +QQ +P L V G + S Q++ V +MKEYITFP
Sbjct: 161 SCGTLDSQNKRSAAFERLKFLQQRYPHLNVFALQFGSAASSLAAQSQAVRTIMKEYITFP 220
Query: 180 ILLSNKNFPQMENGACYLL---SKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSP 236
ILLS+K+F M NGACYLL SKD + E + ++ K +E E S +
Sbjct: 221 ILLSDKDFTNMTNGACYLLFEGSKDHILSVKLDE---EPELMTKGLEGCTALSAEPSENV 277
Query: 237 SGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNG 296
K +W K+ EV+KEP+ S+RNLLL+ PGC+S DE G+R+F+SDSNHHRII+ + +G
Sbjct: 278 LESKVSWQKE-EVIKEPYV-GSLRNLLLYHPGCVSVDEDGDRIFISDSNHHRIIISNSDG 335
Query: 297 KILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
I+D IGS PGFEDGEFE +KL++PAASFYH +DCLYIVDSENHA+R+AD RR LETV
Sbjct: 336 IIMDFIGSSPGFEDGEFEYAKLLQPAASFYHAAEDCLYIVDSENHAVRKADFSRRFLETV 395
Query: 357 YPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSF 416
YP +KK++ +W+WI +KLG ++ + + D S+ FPWHL++ +D+LL+ + +F
Sbjct: 396 YPV--FNKKSSGIWSWITDKLGLTKEVASNIQDFDADSVTFPWHLLQISEDDLLVADLNF 453
Query: 417 ETLWIMDLASGEIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELP 476
ET WI+++++GE ++ KG ++ +E+C + E+ LLK + + L + S L+++
Sbjct: 454 ETSWILNMSTGEKQDIAKGRAEAMELCQQTINERRALLKDILMNGSLGDKEHS-YLEKIS 512
Query: 477 YAGLISSSIAFQNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVY 536
++SS FQ +++ CD GQR+++ + ++ SN F+N +LGLPYWF LERV
Sbjct: 513 CNEILSSISRFQKYVVFCDTDGQRVLKHDLDTRDTSNIHFTNLGVLGLPYWFVCSLERV- 571
Query: 537 AVAGGHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLR 596
+ G G + +H ++ ++LPGR +IKV+VDIP DTEL L E CIWRQ RG+ + +
Sbjct: 572 STWGHSVGQFREHTRKVNVLPGRCNIKVSVDIPVDTELATPLVESCIWRQVRGSGAEISE 631
Query: 597 AEDVAGSLEKVGVAQLWYDELDTLALS--TPESESNIEDETTTSDLRSEDD-TVHIDCAV 653
++ + EKVG+AQ WYDE+D LA S E ++ D+ D +D TVH CA+
Sbjct: 632 SDGQDTNPEKVGIAQQWYDEIDNLAFSEAAEEPAAHKGDDDKPGDGNYQDQKTVHFTCAI 691
Query: 654 NTSPGTSEVIISAALYLKLRRYPDQQDDGREKYAARISDILKL-GRSGAMQRDSFIRFLL 712
N SPGT E++ SAALYLK+ R +D + A I IL+ GR + + + L
Sbjct: 692 NVSPGTCELVASAALYLKIDRSKSNLEDQK----AVIKRILQCQGR----EEHAGVELLT 743
Query: 713 KSN--QDLRDVIFVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSLN 761
S+ D R ++ +KP+H+ ++ D DHP +K+ I T+S++E+ +SL+
Sbjct: 744 GSSGGDDARGLVLMKPVHLRLRLDCADHPAGATNKETINTESSLEIIISLD 794
>gi|293331825|ref|NP_001169202.1| hypothetical protein [Zea mays]
gi|223975493|gb|ACN31934.1| unknown [Zea mays]
gi|413921351|gb|AFW61283.1| hypothetical protein ZEAMMB73_133482 [Zea mays]
Length = 791
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 275/695 (39%), Positives = 412/695 (59%), Gaps = 22/695 (3%)
Query: 72 QSDTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLAGRFVDNCDSLIAGCGTV 131
S+ + FI+S F + +G +H WFNI+ D G +L+L +C +L +
Sbjct: 114 HSELVDFIKSAFGKLEGQNHCWFNIMNDTWMNLNHEGIYLLL---LYQSCGTLNNDGKHL 170
Query: 132 VTFEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPILLSNKNFPQME 191
+ FE +K +QQ +P+L V HG S Q++ ++KEYITFPIL+S+KNF M
Sbjct: 171 IAFESLKHLQQRYPRLNVFALQHGSDVSSLASQSQAFRTIVKEYITFPILISDKNFSNMT 230
Query: 192 NGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLK 251
NGACYLL + + +F + ++ KA++EL +++ S + + +W K+ EV+K
Sbjct: 231 NGACYLLFEGSKDPVLFTNWVEEPDVMIKAIDELTTSKEKPSENVLS-RVSWQKE-EVVK 288
Query: 252 EPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDG 311
EP + S RNLLL+ P C+S DE G+R+F+SDSNHHRII+ + NG ILD IGS PG+EDG
Sbjct: 289 EP-SVGSFRNLLLYHPACVSVDEDGDRIFISDSNHHRIIISNSNGMILDYIGSSPGYEDG 347
Query: 312 EFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWA 371
EFES+K +RPA+SFYH +DCL+IVDSENHA+R+AD+GRR L+TVYP + K+N +W+
Sbjct: 348 EFESAKFLRPASSFYHAAEDCLFIVDSENHAVRKADLGRRTLQTVYP---VFNKSNGIWS 404
Query: 372 WIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKE 431
WI +KLG ++ + S+ PWHL++ +D LL+ +RSFET W + +++GE ++
Sbjct: 405 WITDKLGMRKEATPTIQDFHADSMALPWHLIQISEDTLLVADRSFETPWTLTISTGEKQD 464
Query: 432 AVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGLISSSIAFQNHI 491
+G ++++E V E+ L+K M +W S S++++P LISS FQN+I
Sbjct: 465 IGRGRAEIMESYQKTVKERCALIKDMHMNWSSSARVLSDSVQKIPNNELISSIARFQNYI 524
Query: 492 LLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVYAVAGGHQGSWTDHIQ 551
+ CD GQR+++ + ++ S QFS+ +LGLPYW LERV + G G +H Q
Sbjct: 525 VFCDTDGQRVLKHDLDTKTTSTIQFSDCEVLGLPYWSVCNLERV-STWGYSSGQLQEHAQ 583
Query: 552 RCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAEDVAGSLEKVGVAQ 611
+ +LPGR +I V VDIP TEL L + CIWRQ RG+ + V ++ + EKVG+AQ
Sbjct: 584 QVDVLPGRCNITVYVDIPVGTELAAPLVDNCIWRQVRGSGAEVSGSDGPDTATEKVGIAQ 643
Query: 612 LWYDELDTLALSTPESESNIE---DETTTSDLRSEDDTVHIDCAVNTSPGTSEVIISAAL 668
WYDELD LA S E D T E V C VN SPGT E++ S AL
Sbjct: 644 QWYDELDNLAFSEVAEEPTAAAHGDGNTDDQSYQEQRRVQFTCTVNVSPGTCELVASVAL 703
Query: 669 YLKLRRYPDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLKSNQ--DLRDVIFVKP 726
YL+L DG E R+ +G + + + L + + D ++ VKP
Sbjct: 704 YLRLAG-KRTTTDGVE-LVKRV-----MGGCQFREEQAGVELLARCREDDDATGLVVVKP 756
Query: 727 LHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSLN 761
+H+ ++ DHP +K+ I T+S+++++V+L+
Sbjct: 757 VHLRLRLVCGDHPAGATNKETISTESSLKINVTLD 791
>gi|18481710|gb|AAL73532.1|AF466200_11 hypothetical protein S250_18C08.21 [Sorghum bicolor]
Length = 869
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 287/769 (37%), Positives = 421/769 (54%), Gaps = 101/769 (13%)
Query: 76 LSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLAGRFVDNCDSLIAGCGTVVTFE 135
+ FI+S + +G +H W N V G +LVL +C +L + + FE
Sbjct: 119 VDFIKSALGKLEGQNHCWLNTVNGTWRNLNEEGVYLVL---LYQSCGTLNSNNKYPIAFE 175
Query: 136 KVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPILLSNKNFP------- 188
++K +QQ +P+L V HG S Q++ + KEYITFPIL+S+K+F
Sbjct: 176 RLKHLQQRYPRLNVFALQHGGDISSLASQSQAFRTIAKEYITFPILMSDKDFSNVRLRPY 235
Query: 189 ---------------------------------------QMENGACYLLSKDFGNARVFH 209
QM NGACYLL + + +F
Sbjct: 236 PFSHIVSIYLPDLCCSCFMHFSYHMKYYVKTRNVIFMSVQMTNGACYLLFEGSKDPVLFT 295
Query: 210 ENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGC 269
+ ++ KA++EL ++E S + + +W K+ EV+KEP + S RNLLL+ P C
Sbjct: 296 NWVEEPDVMIKAIDELTSLKEEPSENVLS-RVSWQKE-EVVKEP-SVGSFRNLLLYHPAC 352
Query: 270 ISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKD 329
IS DE G+R+F+SDSNHHRII+ + NG ILD IGS PGFEDGEFES+K + PA+SFYH
Sbjct: 353 ISVDEDGDRIFISDSNHHRIIISNSNGMILDYIGSSPGFEDGEFESAKFLHPASSFYHAA 412
Query: 330 DDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEK 389
+DCL+IVDSENHA+R+AD GRR LETVYP + K+N +W+WI +KLG ++ +
Sbjct: 413 EDCLFIVDSENHAVRKADFGRRTLETVYP---VFNKSNGIWSWITDKLGLRKEVAPTIQD 469
Query: 390 LDPQSLIFPWHLMKSEDDNLLIINR-------------------SFETLWIMDLASGEIK 430
D S+ PWHL++ +DNLLI +R SFE WI+ +++GE +
Sbjct: 470 FDADSIALPWHLIQISEDNLLIADRSCLYNLKSGTHWEQITWLLSFEAPWILRISTGEKQ 529
Query: 431 EA--------------VKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELP 476
+ +G ++V E +V E+ L+K M +W + S++++P
Sbjct: 530 DIERDLFSSHLIELKLCEGRAEVTESYQQIVKERCALIKDMHMNWSSSARVLTDSMEKIP 589
Query: 477 YAGLISSSIAFQNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVY 536
+ LISS FQN+I+ CD GQR+++ + ++ SN QFSN +LGLPYWF LERV
Sbjct: 590 NSELISSVARFQNYIIFCDTDGQRVLKHDLDTKTTSNIQFSNCEVLGLPYWFVCNLERV- 648
Query: 537 AVAGGHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLR 596
+ G G + +H+ + +LPGR +I V VDIP DTEL L E CIWRQ RG+ + V
Sbjct: 649 STWGHSSGQFQEHVHQVDVLPGRCNITVYVDIPVDTELAAPLAENCIWRQVRGSGAEVSG 708
Query: 597 AEDVAGSLEKVGVAQLWYDELDTLALSTPESESNIE--DETTTSDLRSEDD-TVHIDCAV 653
++ + EKVG+AQ WYDELD LA S E D++ D +D V CAV
Sbjct: 709 SDGPDTTTEKVGIAQQWYDELDNLAFSEVAEEPTTAHGDDSKPGDQSYQDQRRVQFTCAV 768
Query: 654 NTSPGTSEVIISAALYLKLRRYPDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLK 713
N SPGT E++ SAALYLKL R D D A + ++ R + + + L++
Sbjct: 769 NVSPGTCELVASAALYLKLARTTDGLDQ-----KALVKRVIGCQRR---EEHAGVELLMR 820
Query: 714 S-NQDLRDVIFVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSLN 761
S D R ++ VKP+H+ ++ + DHP +K+ I T+S+++++V+L+
Sbjct: 821 SCGDDARGLVVVKPVHLRLRLECSDHPAGTTNKETISTESSLKINVTLD 869
>gi|242080483|ref|XP_002445010.1| hypothetical protein SORBIDRAFT_07g002690 [Sorghum bicolor]
gi|241941360|gb|EES14505.1| hypothetical protein SORBIDRAFT_07g002690 [Sorghum bicolor]
Length = 870
Score = 494 bits (1272), Expect = e-137, Method: Compositional matrix adjust.
Identities = 285/764 (37%), Positives = 416/764 (54%), Gaps = 101/764 (13%)
Query: 76 LSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLAGRFVDNCDSLIAGCGTVVTFE 135
+ FI+S + +G +H W N V G +LVL +C +L + + FE
Sbjct: 119 VDFIKSALGKLEGQNHCWLNTVNGTWRNLNEEGVYLVL---LYQSCGTLNSNNKYPIAFE 175
Query: 136 KVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPILLSNKNFP------- 188
++K +QQ +P+L V HG S Q++ + KEYITFPIL+S+K+F
Sbjct: 176 RLKHLQQRYPRLNVFALQHGGDISSLASQSQAFRTIAKEYITFPILMSDKDFSNVRLRPY 235
Query: 189 ---------------------------------------QMENGACYLLSKDFGNARVFH 209
QM NGACYLL + + +F
Sbjct: 236 PFSHIVSIYLPDLCCSCFMHFSYHMKYYVKTRNVIFMSVQMTNGACYLLFEGSKDPVLFT 295
Query: 210 ENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGC 269
+ ++ KA++EL ++E S + + +W K+ EV+KEP + S RNLLL+ P C
Sbjct: 296 NWVEEPDVMIKAIDELTSLKEEPSENVLS-RVSWQKE-EVVKEP-SVGSFRNLLLYHPAC 352
Query: 270 ISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKD 329
IS DE G+R+F+SDSNHHRII+ + NG ILD IGS PGFEDGEFES+K + PA+SFYH
Sbjct: 353 ISVDEDGDRIFISDSNHHRIIISNSNGMILDYIGSSPGFEDGEFESAKFLHPASSFYHAA 412
Query: 330 DDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEK 389
+DCL+IVDSENHA+R+AD GRR LETVYP + K+N +W+WI +KLG ++ +
Sbjct: 413 EDCLFIVDSENHAVRKADFGRRTLETVYP---VFNKSNGIWSWITDKLGLRKEVAPTIQD 469
Query: 390 LDPQSLIFPWHLMKSEDDNLLIINR-------------------SFETLWIMDLASGEIK 430
D S+ PWHL++ +DNLLI +R SFE WI+ +++GE +
Sbjct: 470 FDADSIALPWHLIQISEDNLLIADRSCLYNLKSGTHWEQITWLLSFEAPWILRISTGEKQ 529
Query: 431 EA--------------VKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELP 476
+ +G ++V E +V E+ L+K M +W + S++++P
Sbjct: 530 DIERDLFSSHLIELKLCEGRAEVTESYQQIVKERCALIKDMHMNWSSSARVLTDSMEKIP 589
Query: 477 YAGLISSSIAFQNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVY 536
+ LISS FQN+I+ CD GQR+++ + ++ SN QFSN +LGLPYWF LERV
Sbjct: 590 NSELISSVARFQNYIIFCDTDGQRVLKHDLDTKTTSNIQFSNCEVLGLPYWFVCNLERV- 648
Query: 537 AVAGGHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLR 596
+ G G + +H+ + +LPGR +I V VDIP DTEL L E CIWRQ RG+ + V
Sbjct: 649 STWGHSSGQFQEHVHQVDVLPGRCNITVYVDIPVDTELAAPLAENCIWRQVRGSGAEVSG 708
Query: 597 AEDVAGSLEKVGVAQLWYDELDTLALSTPESESNIE--DETTTSDLRSEDD-TVHIDCAV 653
++ + EKVG+AQ WYDELD LA S E D++ D +D V CAV
Sbjct: 709 SDGPDTTTEKVGIAQQWYDELDNLAFSEVAEEPTTAHGDDSKPGDQSYQDQRRVQFTCAV 768
Query: 654 NTSPGTSEVIISAALYLKLRRYPDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLK 713
N SPGT E++ SAALYLKL R D D A + ++ R + + + L++
Sbjct: 769 NVSPGTCELVASAALYLKLARTTDGLDQ-----KALVKRVIGCQRR---EEHAGVELLMR 820
Query: 714 S-NQDLRDVIFVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEV 756
S D R ++ VKP+H+ ++ + DHP +K+ I T+S++++
Sbjct: 821 SCGDDARGLVVVKPVHLRLRLECSDHPAGTTNKETISTESSLKI 864
>gi|218200436|gb|EEC82863.1| hypothetical protein OsI_27722 [Oryza sativa Indica Group]
Length = 643
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 242/550 (44%), Positives = 354/550 (64%), Gaps = 16/550 (2%)
Query: 214 DIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISAD 273
D+ +L K ++ +++ E S + + +W K+ EV KEP+ SVRNLLL+ P CIS D
Sbjct: 108 DLEVLIKGMDNAVLKA-EPSDAALESRVSWQKE-EVPKEPYV-GSVRNLLLYHPACISVD 164
Query: 274 ESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCL 333
E G+R+F+SDSNHHR+I+ D NG ILD IG PGFEDGEFES+K +RPAASFY +DCL
Sbjct: 165 EDGDRIFISDSNHHRVIISDSNGTILDYIGCSPGFEDGEFESAKFLRPAASFYDAAEDCL 224
Query: 334 YIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQ 393
YIVDSENHA+R+AD+GRR+LETVYP S SK ++ +W+WI++KLG R+ D SE
Sbjct: 225 YIVDSENHAVRKADLGRRMLETVYPVS--SKSSSGIWSWIVDKLGLRRE-DAPSENFVAD 281
Query: 394 SLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKEAVKGFSKVLEICGVLVMEKVFL 453
S+ FPWHL+K +D+ L+ +R+FET WI+ ++G+ +E +G ++ +E + E+ L
Sbjct: 282 SIAFPWHLLKLTEDDFLVADRNFETSWILSGSTGQKQEIGRGRAEEMESYQQTIHERCAL 341
Query: 454 LKQMPQDWLLHQIDSSCSLKELPYAGLISSSIAFQNHILLCDIVGQRIMRLNRESGVCSN 513
LK M +W S L ++P+ L+SS FQ +I+ CD GQR+++ + ++ SN
Sbjct: 342 LKDMNMNWSSSAKKHSDLLGKIPFKELVSSVARFQKYIIFCDADGQRVLKHDLDTNDTSN 401
Query: 514 FQFSNFAILGLPYWFAFPLERVYAVAGGHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTE 573
QFSN +LGLPYWF LERV + G G + +HI++ ++LPGR +IKV+VDIPSDT+
Sbjct: 402 IQFSNLGVLGLPYWFVCHLERV-STWGRSIGQFQEHIRKVNVLPGRCNIKVSVDIPSDTQ 460
Query: 574 LVESLQEGCIWRQARGTASVVLRAEDVAGSLEKVGVAQLWYDELDTLALSTPESESNIE- 632
L L E CIWRQ RG+ + V +++ + S EKVG+AQ WYDE+D LA S E +
Sbjct: 461 LAAPLVESCIWRQVRGSGAEVSGSDEPSTSTEKVGIAQQWYDEIDNLAFSEVPEEPTVHE 520
Query: 633 -DETTTSDLRSEDDTVHIDCAVNTSPGTSEVIISAALYLKLRRYPDQQDDGREKYAARIS 691
DE + E VH C VN SPGT E++ SAALYLK+ D D + A +
Sbjct: 521 GDENLSDGSYQEQRRVHFSCVVNVSPGTCELVASAALYLKI----DSAKDDHGEQQALVK 576
Query: 692 DILKLGRSGAMQRDSFIRFLLKSNQDLRDVIFVKPLHVSIQFDTLDHPKADNSKDIILTD 751
I++ R + + + L++S +D RD+ +KP+H+ + + DHP +K+ I ++
Sbjct: 577 RIIQCQR---REDHAGVELLMESCKDARDLTIMKPVHLRLMLECADHPAGTTNKETISSE 633
Query: 752 SNMEVDVSLN 761
S+ E+++SL+
Sbjct: 634 SSFEINISLD 643
>gi|297744354|emb|CBI37324.3| unnamed protein product [Vitis vinifera]
Length = 696
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 206/346 (59%), Positives = 265/346 (76%)
Query: 415 SFETLWIMDLASGEIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKE 474
SFETLWIM LASGEIKE V+G KVLEICG ++MEK+ +L +MP DWL Q+DS+ SL+
Sbjct: 351 SFETLWIMALASGEIKEVVRGIPKVLEICGEMIMEKLAVLNKMPHDWLQQQVDSNFSLEG 410
Query: 475 LPYAGLISSSIAFQNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLER 534
+PYAGL+SS FQ+ I++CD V QRI+RLN ESG +N +FSNF ILGLPYWF+FPLER
Sbjct: 411 IPYAGLMSSLATFQDDIVICDTVAQRILRLNGESGDFTNLEFSNFGILGLPYWFSFPLER 470
Query: 535 VYAVAGGHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVV 594
V AV +G+ DH Q S LPG+I+I++ V+IP DTELVE LQ+GCIWR ARG A+VV
Sbjct: 471 VCAVGNLIRGANADHFQSFSFLPGKINIQLMVEIPEDTELVEPLQDGCIWRLARGAAAVV 530
Query: 595 LRAEDVAGSLEKVGVAQLWYDELDTLALSTPESESNIEDETTTSDLRSEDDTVHIDCAVN 654
ED+ S KVGVAQ WYDELD LA TPE ES+ E+E TT D +++ V IDCAVN
Sbjct: 531 SGVEDLVASSAKVGVAQQWYDELDNLAFFTPEPESDAEEENTTLDTPFQEEKVRIDCAVN 590
Query: 655 TSPGTSEVIISAALYLKLRRYPDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLKS 714
TSPGTSEVI+ AALYLKL+R P+ Q+ EK AARI +IL+ +SG + R+S I+FL K
Sbjct: 591 TSPGTSEVIVYAALYLKLKRNPNSQERNPEKNAARILNILEPEKSGKITRESCIQFLSKW 650
Query: 715 NQDLRDVIFVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSL 760
N+D+ D+IF+KPLHV I ++L HPKA+N+K+ +LT+++++V+V L
Sbjct: 651 NEDVEDLIFMKPLHVRIGLESLSHPKAENAKETVLTETSIQVNVFL 696
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 128/194 (65%), Positives = 150/194 (77%), Gaps = 1/194 (0%)
Query: 221 AVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLF 280
AVEEL Q + S++ K KQ++ KEP+ S++NLLL++PGCIS DESGNRLF
Sbjct: 36 AVEELYTQDKGESATLQNWKSNGLKQSDSTKEPYV-GSLQNLLLYYPGCISVDESGNRLF 94
Query: 281 LSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSEN 340
LSDSNHHRII+F GNGKILDCIGS PGFEDGEFES+KL RPAASFYH D+DCLY VDSEN
Sbjct: 95 LSDSNHHRIIIFSGNGKILDCIGSAPGFEDGEFESAKLARPAASFYHADEDCLYFVDSEN 154
Query: 341 HAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWH 400
HAIRRADMG RVLET+YP +KK N LW+WI+ KLG E+D DTK + D L+FPWH
Sbjct: 155 HAIRRADMGTRVLETLYPPCNTNKKKNGLWSWIVNKLGMEKDADTKPAEFDSGLLMFPWH 214
Query: 401 LMKSEDDNLLIINR 414
L+K D +L IINR
Sbjct: 215 LIKLVDGDLFIINR 228
>gi|222639869|gb|EEE68001.1| hypothetical protein OsJ_25956 [Oryza sativa Japonica Group]
Length = 635
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 193/462 (41%), Positives = 296/462 (64%), Gaps = 13/462 (2%)
Query: 302 IGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSG 361
IG PGFEDGEFES+K +RPAASFY+ +DCLYIVDSENHA+R+AD+GRR+LETVYP S
Sbjct: 185 IGCSPGFEDGEFESAKFLRPAASFYNTAEDCLYIVDSENHAVRKADLGRRMLETVYPVS- 243
Query: 362 ISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWI 421
SK ++ +W+WI++KLG R+ D SE S+ FPWHL+ +D+ L+ +R+FE WI
Sbjct: 244 -SKSSSGIWSWIVDKLGLRRE-DAPSENFIADSIAFPWHLLNITEDDFLVADRNFEASWI 301
Query: 422 MDLASGEIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGLI 481
+ +++G+ +E +G ++ +E ++ E+ LLK M +W + S L ++P+ L+
Sbjct: 302 LSVSTGQKQEIGRGIAEEMESYQQIIHERCALLKDMNMNWSSSAKEHSDLLGKIPFKELV 361
Query: 482 SSSIAFQNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVYAVAGG 541
SS FQ +I+ CD GQR+++ + ++ SN QFSNF +LGLPYWF LERV + G
Sbjct: 362 SSVARFQKYIIFCDADGQRVLKHDLDTNGTSNIQFSNFGVLGLPYWFVCHLERV-STRGH 420
Query: 542 HQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAEDVA 601
G + +H ++ ++LPGR +IKV+VDIP+DT+L L E CIWRQ RG+ + V ++ +
Sbjct: 421 SIGKFQEHTRKVNVLPGRCNIKVSVDIPADTQLAAPLVESCIWRQVRGSGAEVSGFDEPS 480
Query: 602 GSLEKVGVAQLWYDELDTLALSTPESESNIE--DETTTSDLRSEDDTVHIDCAVNTSPGT 659
S EKVG+AQ WYDE+D LA S E + DE + E H +C VN SPGT
Sbjct: 481 TSTEKVGIAQQWYDEIDNLAFSEVPEEPTVHEGDENLSDGSYQEQRRAHFNCVVNVSPGT 540
Query: 660 SEVIISAALYLKLRRYPDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLKSNQDLR 719
E++ SAALYLK+ D ++ + + S + ++ + + + + L++S +D R
Sbjct: 541 CELVASAALYLKI-------DSAKDDHGEQKSLVKRIIQCQRREDHAGVELLMESCKDSR 593
Query: 720 DVIFVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSLN 761
D+ +KP+H+ + + DHP +K+ I ++S+ E+++SL+
Sbjct: 594 DLTIMKPVHLRLMLECADHPAGTTNKETISSESSFEINISLD 635
>gi|6729013|gb|AAF27010.1|AC016827_21 hypothetical protein [Arabidopsis thaliana]
Length = 648
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 199/447 (44%), Positives = 278/447 (62%), Gaps = 35/447 (7%)
Query: 74 DTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLAGRFVDNCDSLIAGCGTVVT 133
D LSFI+++ ++ +GP H W N N FK G ++VLAG +D L
Sbjct: 40 DLLSFIKASLDKLEGPSHHWLNRDFGNKQLFKDKGTYVVLAGHLLDGTSDLSG------F 93
Query: 134 FEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPILLSNKNFPQMENG 193
FEK+K +QQ P + +G A D+T L E+++KEY+TFP+LLS K FP+
Sbjct: 94 FEKLKLLQQRSPGVCFMGIHFSDQARIADDRTALAELILKEYLTFPVLLSEKEFPKTSGE 153
Query: 194 ACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEP 253
Y++ KDF N ++ E LDI + KA++ L+ Q E S S TW+KQAE +KE
Sbjct: 154 VRYIVFKDFKNPLIYEEKDLDIASVVKALDSLLTQDTEKSKSVRLFTNTWSKQAEAIKES 213
Query: 254 HACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEF 313
H S ++LLL+FPGCISADE G+RLFLSD+NHHRII+F+ +GKI+D IG PGFEDG+F
Sbjct: 214 HFPSFFQDLLLYFPGCISADEVGDRLFLSDTNHHRIIIFENSGKIVDSIGCFPGFEDGDF 273
Query: 314 ESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWI 373
ES+K++RP + Y + +DCLYIVDSE + KK LW+WI
Sbjct: 274 ESAKMLRPTGTLYDEAEDCLYIVDSE----------------------VIKKTGGLWSWI 311
Query: 374 MEKLGFER------DNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASG 427
MEK+G + D DTKSE+ D +SL+FPWH++K +D++LL+IN+SF LWI++ ASG
Sbjct: 312 MEKMGLGKDDDTTVDADTKSEEFDARSLLFPWHILKRDDESLLVINKSFSKLWIINFASG 371
Query: 428 EIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGLISSSIAF 487
EI+E V+GFSK++EICG + EK+ +L+ MP +WL Q + S KE P A L+SS
Sbjct: 372 EIEEVVEGFSKIIEICGQSITEKLSVLEHMPSNWLQQQTAAIASFKEQPSASLLSSFTKL 431
Query: 488 QNHILLCDIVGQRIMRLNRESGVCSNF 514
+ I++ DI G+ +RLN E C+
Sbjct: 432 GDDIVMTDI-GKISIRLNIEIPPCTEL 457
Score = 196 bits (499), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 104/208 (50%), Positives = 143/208 (68%), Gaps = 5/208 (2%)
Query: 558 GRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAEDVAGSLEKVGVAQLWYDEL 617
G+I I++N++IP TELVE +QE CIWRQ RG S A EK+GV+Q WYDEL
Sbjct: 441 GKISIRLNIEIPPCTELVEPIQESCIWRQTRGAISEFSSAGSAVEPSEKIGVSQQWYDEL 500
Query: 618 DTLA--LSTPESESNIE--DETTTSDLRSEDDTVHIDCAVNTSPGTSEVIISAALYLKLR 673
D+LA ++ PE+ E D + R ED +HIDC V TSPG+SE+I+ AALYL+L
Sbjct: 501 DSLAKEIANPEAAEEEEEEDVNPSEVDREEDGRIHIDCPVKTSPGSSELIVYAALYLRLA 560
Query: 674 RYPDQQDDGREKYAARISDILKLGRS-GAMQRDSFIRFLLKSNQDLRDVIFVKPLHVSIQ 732
R + + +E+ A +I+ ILK R+ M+ D F+ L KS ++LRD++F+KP+HV I+
Sbjct: 561 RNEETESATQEELARKIAKILKPVRNITTMKEDLFVNLLSKSKRELRDIVFIKPMHVRIR 620
Query: 733 FDTLDHPKADNSKDIILTDSNMEVDVSL 760
D+ DHPKADNS+D+ILTDS++EVDVSL
Sbjct: 621 LDSKDHPKADNSRDVILTDSSVEVDVSL 648
>gi|325260813|gb|ADZ04632.1| hypothetical protein [Oryza glaberrima]
Length = 484
Score = 323 bits (829), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 203/575 (35%), Positives = 310/575 (53%), Gaps = 96/575 (16%)
Query: 189 QMENGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAE 248
Q+ NGAC LL + + +F ++ +L K ++ +++ E S + + +W K+ E
Sbjct: 4 QITNGACCLLFEGSKDHALFSNLDEELEVLIKGMDNAVLKA-EPSDTALESRVSWQKE-E 61
Query: 249 VLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGF 308
V KEP+ S NL+ C + + D N I F IG PGF
Sbjct: 62 VPKEPYVGSLGLNLI-----CTECRTAA----MQDDNC--ICPFQ--------IGCSPGF 102
Query: 309 EDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNS 368
EDGEFES+K +RPAASFY+ +DCLYIVDSENHA+R+AD+GRR+LETVYP S SK ++
Sbjct: 103 EDGEFESAKFLRPAASFYNTAEDCLYIVDSENHAVRKADLGRRMLETVYPVS--SKSSSG 160
Query: 369 LWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGE 428
+W+WI++KLG R+ D SE S+ FPWHL+ +D+ L+ +R+FET WI+ +++G+
Sbjct: 161 IWSWIVDKLGLRRE-DAPSENFIADSIAFPWHLLNITEDDFLVADRNFETSWILSVSTGQ 219
Query: 429 IKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGLISSSIAFQ 488
+E + +VL+ H +D++ +S+I F
Sbjct: 220 KQEIGRDGQRVLK----------------------HDLDTNG-----------TSNIQFS 246
Query: 489 NHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVYAVAGGHQGSWTD 548
N F +LGLPYWF LERV + G G + +
Sbjct: 247 N-----------------------------FGVLGLPYWFVCHLERV-STRGHSIGKFQE 276
Query: 549 HIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAEDVAGSLEKVG 608
H ++ ++LPGR +IKV+VDIP+DT+L L E CIWRQ RG+ + V ++ + S EKVG
Sbjct: 277 HTRKVNVLPGRCNIKVSVDIPADTQLAAPLVESCIWRQVRGSGAEVSGFDEPSTSTEKVG 336
Query: 609 VAQLWYDELDTLALSTPESESNIED--ETTTSDLRSEDDTVHIDCAVNTSPGTSEVIISA 666
+AQ WYDE+D LA S E + + E + E H +C VN SPGT E++ SA
Sbjct: 337 IAQQWYDEIDNLAFSEVPEEPTVHEGNENLSDGSYQEQRRAHFNCVVNVSPGTCELVASA 396
Query: 667 ALYLKLRRYPDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLKSNQDLRDVIFVKP 726
ALYLK+ D D + A + I++ R + + + L++S +D RD+ +KP
Sbjct: 397 ALYLKI----DSAKDDHGEQQALVKRIIQCQRR---EDHAGVELLMESCKDSRDLTIMKP 449
Query: 727 LHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSLN 761
+H+ + + DHP +K+ I ++S+ E+++SL+
Sbjct: 450 VHLRLMLECADHPAGTTNKETISSESSFEINISLD 484
>gi|325260830|gb|ADZ04648.1| hypothetical protein [Oryza punctata]
Length = 630
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 179/462 (38%), Positives = 266/462 (57%), Gaps = 75/462 (16%)
Query: 302 IGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSG 361
IG PGFEDGEFES+K +RPAASFY+ +DCLYIVDSENHA+R+AD+GRR+LETVYP S
Sbjct: 242 IGCSPGFEDGEFESAKFLRPAASFYNAAEDCLYIVDSENHAVRKADLGRRMLETVYPVS- 300
Query: 362 ISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWI 421
SK ++ +W+WI++KLG R+ D SE S+ FPWHL+K +D+ L+ +R+FET WI
Sbjct: 301 -SKSSSGIWSWIVDKLGLRRE-DAPSETFVADSIAFPWHLLKITEDDFLVADRNFETSWI 358
Query: 422 MDLASGEIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGLI 481
+ +++G+ +E + +VL+ H +D+
Sbjct: 359 LSVSTGQKQEIGRDGQRVLK----------------------HDLDTK-----------D 385
Query: 482 SSSIAFQNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVYAVAGG 541
+S+I F N F +LGLPYWF LERV G
Sbjct: 386 TSNIQFSN-----------------------------FGVLGLPYWFVCHLERV-PTWGH 415
Query: 542 HQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAEDVA 601
G + +H ++ ++LPGR +IKV+VDIP+DT+L L E CIWRQ RG+ + V +++ +
Sbjct: 416 SIGQFQEHTRKVNVLPGRCNIKVSVDIPADTQLAAPLVESCIWRQVRGSGAEVSGSDEPS 475
Query: 602 GSLEKVGVAQLWYDELDTLALS-TPESESNIEDETTTSDLRSEDD-TVHIDCAVNTSPGT 659
S EK+G+AQ WYDE+D LA S PE + E + SD +D VH C VN SPGT
Sbjct: 476 TSTEKIGIAQQWYDEIDNLAFSEVPEEPTVHEGDEKPSDGSYQDQRRVHFTCVVNVSPGT 535
Query: 660 SEVIISAALYLKLRRYPDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLKSNQDLR 719
E++ SA LYLK+ D D G +K A + I++ R + + + L+++ +D R
Sbjct: 536 CELVASAGLYLKIDSAKD--DHGEQK--ALVKRIIQCQRH---EDHAGVELLMENCKDAR 588
Query: 720 DVIFVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSLN 761
D+ + PL++ + + DHP +K+ I ++S+ E+++SL+
Sbjct: 589 DLTIMTPLNLRLMLECADHPAGTTNKETISSESSFEINISLD 630
>gi|167997357|ref|XP_001751385.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697366|gb|EDQ83702.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 663
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 142/450 (31%), Positives = 216/450 (48%), Gaps = 59/450 (13%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P I+ + +F+SDS H RIIV DG+G++LD IGS GFEDG FES++L RP++ +
Sbjct: 180 PRGIATCSEEDLIFVSDSLHDRIIVLDGSGRVLDYIGSIAGFEDGSFESARLRRPSSIVF 239
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTK 386
K+ LYI D ENHAIRRA + R+++T+YP W + +LG + + K
Sbjct: 240 DKEKKLLYIADCENHAIRRAHISSRMVDTLYPKPDECPGFLQRW---LHRLGILQGSSQK 296
Query: 387 SEKL-DPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI-----KEAVKGFSKVL 440
+ + + +PWHL +++ + F+ W++D+ +G + +E F K++
Sbjct: 297 RSNVSNDYEMTYPWHLSVVSSGHVIASTQWFKQSWMIDVDTGALTVEPDQEPATTFKKLM 356
Query: 441 EICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGL---ISSSIAFQNHILLCDIV 497
+ K+ L+ DW + + K L GL +S A I++ D
Sbjct: 357 K-------SKIAELEDNNLDW--ESVYEALVSKGLACEGLSNVVSQVAALAREIVVIDPE 407
Query: 498 GQRIMRLNRESGVCSNFQFSNFAILGLPYWFA-------------------FPLERVYAV 538
++ RL+ SG S Q SN AILGLP W+ LER+
Sbjct: 408 THQVSRLDLSSGYLSPMQLSNVAILGLPAWWCSYDNPQVASSLNNDKPVCLTSLERL--- 464
Query: 539 AGGHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAE 598
H+G + I ++ PGR I VNV +P T L E E C+WRQ+RG+ V+
Sbjct: 465 ---HEGRTEEDIVTFAIQPGRHLICVNVALPRGTLLAEPFTEECLWRQSRGS---VVDLS 518
Query: 599 DVAGSLE--KVGVAQLWYDELDTLALSTPESESNIEDETTTSDLRSEDDTVHIDCA-VNT 655
G L+ KV AQ ++D+L+ L E ++ED + + + C+ V+
Sbjct: 519 TFEGELKSRKVTAAQRYFDDLNFL---DGTKEYDLEDPAEVLHILQQSPSRLSLCSTVDV 575
Query: 656 SPGTSEVIISAALYLKLRRYPDQQDDGREK 685
S GT EV+ A LYLK PD D K
Sbjct: 576 SLGTGEVVFDAVLYLK----PDWSVDSEIK 601
>gi|413941601|gb|AFW74250.1| hypothetical protein ZEAMMB73_090152 [Zea mays]
Length = 767
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/218 (38%), Positives = 131/218 (60%), Gaps = 26/218 (11%)
Query: 186 NFPQMENGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAK 245
N P ++N KDF +++G +++EL +++ S + + +W K
Sbjct: 561 NSPSIDN-------KDFNVGDHIDVKLIEVGADGGSIDELTAFKEKPSENVLS-RVSWQK 612
Query: 246 QAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC 305
+ EV+KEP + S RNLLL+ C+S DE G+ +F+SDSNHHRII+ + NG ILD
Sbjct: 613 E-EVVKEP-SVGSFRNLLLYHQACVSVDEDGDHIFISDSNHHRIIISNSNGMILD----- 665
Query: 306 PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKK 365
++S+K +RPA+SFYH +DC +IVDSENHA+R+AD+G R L+TVYP + K
Sbjct: 666 -------YKSAKFLRPASSFYHAAEDCQFIVDSENHAVRKADLG-RTLQTVYP---VFNK 714
Query: 366 NNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMK 403
+N +W+WI +KLG ++ + S+ PWHL++
Sbjct: 715 SNGIWSWITDKLGMRKEATPTIQDFHADSMPLPWHLIR 752
>gi|17228508|ref|NP_485056.1| hypothetical protein alr1013 [Nostoc sp. PCC 7120]
gi|17130359|dbj|BAB72970.1| alr1013 [Nostoc sp. PCC 7120]
Length = 503
Score = 95.9 bits (237), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 105/206 (50%), Gaps = 31/206 (15%)
Query: 226 IMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSN 285
++Q+ + + + L T KQ + L P L FPG + A SG LF++DS
Sbjct: 144 VIQEHQGAINFQQLSLTLEKQRQPLITP----------LAFPGKVLATPSG--LFVADSG 191
Query: 286 HHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIR 344
HHRI+V D NG+IL IG+ G DG F+ ++ P + ++ LY+ D+ENHA+R
Sbjct: 192 HHRIVVSDFNGEILHLIGNGKSGLTDGNFQEAQFSAPQGMAFDMENQILYLADTENHALR 251
Query: 345 RADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKS 404
R D+ ++ +ET+ +G +++ ++ +G E +L PW L+
Sbjct: 252 RVDINQQTVETI---AGTGEQSRNIQP--HGGVGLET------------ALNSPWDLVNI 294
Query: 405 EDDNLLIINRSFETLWIMDLASGEIK 430
E ++L I +W MDLASG IK
Sbjct: 295 E-NSLYIAMAGVHQIWQMDLASGVIK 319
>gi|427719807|ref|YP_007067801.1| NHL repeat containing protein [Calothrix sp. PCC 7507]
gi|427352243|gb|AFY34967.1| NHL repeat containing protein [Calothrix sp. PCC 7507]
Length = 505
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 71/227 (31%), Positives = 115/227 (50%), Gaps = 34/227 (14%)
Query: 205 ARVFHENSLDIGMLNKAVEELIMQQQENSS-SPSGLKCTWAKQAEVLKEPHACSSVRNLL 263
A V E + D+ L++ + +LI Q QE + + L T KQ + L P
Sbjct: 126 ASVSGEGNRDV--LDEFIAKLINQHQEKGTLNFQELSLTLEKQRQPLITP---------- 173
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG + A SG LF++DS HHR++ +G+IL IG+ PGF DG+F ++ P
Sbjct: 174 LAFPGKVLATASG--LFIADSGHHRLVRSSFDGEILHVIGTGKPGFIDGDFSEAQFFAPQ 231
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
+ +++ LYI D+ENHA+RR D+ ++++T+ +G K++ ++ G E +
Sbjct: 232 GMAFDEENQILYIADTENHALRRVDLRHQLVKTI---AGTGKQSRNIHP--HSGAGLETE 286
Query: 383 NDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
++ PW L+K +NL I +W MDL +G I
Sbjct: 287 LNS------------PWDLVKV-GNNLFIAMAGPHQIWQMDLETGII 320
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 45/83 (54%), Gaps = 6/83 (7%)
Query: 269 CISADESGNRLFLSDSNHHRIIVFDGNGKILDCI---GSCPGFEDGEFESSKLMRPAASF 325
C+ + + N L+++D+ +H+I + + + + GS G +DGE ++S+ P+
Sbjct: 402 CLGVEYAQNYLWVADTYNHKIKLVSPSTRNCQTVLGDGSA-GLQDGEGKNSRFFEPSG-- 458
Query: 326 YHKDDDCLYIVDSENHAIRRADM 348
+ LY+ D+ NH IRR D+
Sbjct: 459 LSVINSHLYVADTNNHVIRRVDL 481
>gi|87311714|ref|ZP_01093830.1| NHL repeat protein [Blastopirellula marina DSM 3645]
gi|87285608|gb|EAQ77526.1| NHL repeat protein [Blastopirellula marina DSM 3645]
Length = 652
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 141/318 (44%), Gaps = 54/318 (16%)
Query: 134 FEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPILLSN--------- 184
++K ++ FP V+ +H + D + E +++ I P++ N
Sbjct: 36 LPELKKLEHEFPNELVVIGVHSAKFENEGDAKNIAEAILRYEIEHPVVNDNQHRIWNSFF 95
Query: 185 -KNFPQMENGACYLLSKD----FGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGL 239
+++P M YL+ + +G + F + +LN+A+ QQ +P
Sbjct: 96 VRSWPSM-----YLIDPEGNVVYGKSGEFEAKEI-AAVLNRAIP-YYEQQGTLDRTPIRF 148
Query: 240 KCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKIL 299
+ Q EP L FPG + ADE +RLF++DSNH+RI++ D +GK++
Sbjct: 149 ELLAYSQ-----EPTP--------LRFPGKVLADEKSDRLFIADSNHNRIVIADLDGKLI 195
Query: 300 DCIGSCP-GFEDGEFESSKLMRPAASFYH-----KDDDCLYIVDSENHAIRRADMGRRVL 353
+ IG+ G DG++ + ASF+H D D LY+ D+ENH +R+ D+ + +
Sbjct: 196 ETIGNGAIGTTDGDYAT-------ASFHHLQGMALDGDTLYVADTENHMLRKVDLKTKQV 248
Query: 354 ETVYPTSGISKKNNSLWAWIMEKLGFERDN--DTKSEKLDPQSLIFPWHLMKSEDDNLLI 411
T+ G+ +N+ W I E G R D K +L PW L DD L I
Sbjct: 249 TTIA-GKGVQGRNS--WPGIPES-GLLRSQVPDRFVGKPLETALNSPWALWVKGDD-LYI 303
Query: 412 INRSFETLWIMDLASGEI 429
+W M L EI
Sbjct: 304 AMAGPHQIWKMKLDGSEI 321
>gi|307106270|gb|EFN54516.1| hypothetical protein CHLNCDRAFT_31604 [Chlorella variabilis]
Length = 1072
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 92/213 (43%), Gaps = 47/213 (22%)
Query: 264 LHFPG-CISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRP 321
L FPG C + + RLF+SDSN++RI+V D NG+ L+ IG PG DG +E + RP
Sbjct: 569 LRFPGACKAVSDRAGRLFISDSNNNRIVVCDENGRFLEAIGCGAPGLVDGSYEDAAFFRP 628
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT-----------SGISKKNNSLW 370
Y DCLY+ D+E+HA+R+ D+ ++ + T+ SG S+ NS W
Sbjct: 629 QGIAYSAKRDCLYVADTESHALRQVDLQKQTVFTLAGNGAKGSDYRGGGSGASQLMNSPW 688
Query: 371 -----------------------AWIMEKLGFERDNDTKSEKLDPQSLIFPWHL-----M 402
A GFER + P +L W +
Sbjct: 689 DRYVYVAMAGLHQIWRFDTSTGIASSFSGTGFERFQN------GPSALTTAWAQPSGLSL 742
Query: 403 KSEDDNLLIINRSFETLWIMDLASGEIKEAVKG 435
D L + + T+ +DL SG + V G
Sbjct: 743 APTGDALFVADSESSTVRRLDLGSGGSQACVGG 775
>gi|428308867|ref|YP_007119844.1| thiol-disulfide isomerase-like thioredoxin [Microcoleus sp. PCC
7113]
gi|428250479|gb|AFZ16438.1| thiol-disulfide isomerase-like thioredoxin [Microcoleus sp. PCC
7113]
Length = 515
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 89/174 (51%), Gaps = 22/174 (12%)
Query: 217 MLNKAVEELIMQQQENSS-SPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADES 275
+L++ +E++I + QE + + L T KQ + L P L FPG + DE
Sbjct: 141 VLDQLIEQVIREHQEKGTINFQELSLTLEKQRQPLATP----------LAFPGKVLTDEE 190
Query: 276 GNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHKDDDCLY 334
G RLF++DS HHRI+V +G++ IG+ PG DG FE ++ P ++ LY
Sbjct: 191 GERLFIADSGHHRIVVSSLSGEVQQVIGTGKPGLTDGSFEEAQFFAPQGMTLDSENQLLY 250
Query: 335 IVDSENHAIRRADMGRRVLETVY----------PTSGISKKNNSLWAWIMEKLG 378
+ D+ENH +R+ D+ + ++T+ P SG + + W +EK+G
Sbjct: 251 VADTENHCLRQVDLKNQQVKTIAGTGEQSHHIRPHSGKALETRLNSPWDVEKVG 304
>gi|75909875|ref|YP_324171.1| NHL repeat-containing protein [Anabaena variabilis ATCC 29413]
gi|75703600|gb|ABA23276.1| NHL repeat protein [Anabaena variabilis ATCC 29413]
Length = 503
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 62/206 (30%), Positives = 98/206 (47%), Gaps = 31/206 (15%)
Query: 226 IMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSN 285
++Q+ + + + L T KQ + L P L FPG + A G LF++DS
Sbjct: 144 VIQEHQGAINFQQLSLTLEKQRQPLITP----------LAFPGKVLATPGG--LFVADSG 191
Query: 286 HHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIR 344
HHRI+V D NG+IL IG+ G DG F+ ++ P + ++ LY+ D++NH +R
Sbjct: 192 HHRIVVSDFNGEILHLIGNGKSGLTDGNFQEAQFSAPQGMAFDMENQILYVADTDNHVVR 251
Query: 345 RADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKS 404
RAD+ ++ +ET+ T S+ +E +L PW L+K
Sbjct: 252 RADIQQQTVETIAGTGEQSRNIQPHGGAGLET-----------------ALNSPWDLVKV 294
Query: 405 EDDNLLIINRSFETLWIMDLASGEIK 430
++L I +W MDL SG +K
Sbjct: 295 -GNSLYIAMAGTHQIWQMDLPSGFVK 319
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 48/91 (52%), Gaps = 8/91 (8%)
Query: 269 CISADESGNRLFLSDSNHHRIIVFD---GNGKILDCIGSCPGFEDGEFESSKLMRPAASF 325
C+ + N L+++D+ +H+I + GN + + GS G ++G+ ++++ P+
Sbjct: 400 CLGVEYFQNYLWVADTYNHKIKLVSPHTGNCQTVLGDGSA-GLQNGQGKNTRFFEPSG-- 456
Query: 326 YHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
D LYI D+ NH IRR D+ R LE
Sbjct: 457 LSAMDSYLYISDTNNHVIRRVDL--RTLEVT 485
>gi|320103300|ref|YP_004178891.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
gi|319750582|gb|ADV62342.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
Length = 698
Score = 89.0 bits (219), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 60/168 (35%), Positives = 86/168 (51%), Gaps = 20/168 (11%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPA 322
L +PG + AD G RLF+SD+ H+RI+V + GK LD IGS GF DG FE++ R
Sbjct: 237 LLYPGKVLADAQGGRLFISDTGHNRIVVTNLEGKRLDVIGSGATGFADGSFETASFNRQQ 296
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
+ LY+ D+ENHAIR A++ R + TV +G K+ G+
Sbjct: 297 G--IRLVGETLYVADTENHAIRAANLTTRTVATV---AGTGKQTY--------HRGWGVR 343
Query: 383 NDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSF-ETLWIMDLASGEI 429
K+ L+ PW L+K D++ LII + +W D + GEI
Sbjct: 344 GPAKTTGLN-----SPWDLVKIPDEDRLIIAMAGPHQIWDYDFSKGEI 386
>gi|428297525|ref|YP_007135831.1| NHL repeat containing protein [Calothrix sp. PCC 6303]
gi|428234069|gb|AFY99858.1| NHL repeat containing protein [Calothrix sp. PCC 6303]
Length = 511
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 104/212 (49%), Gaps = 32/212 (15%)
Query: 217 MLNKAVEELIMQ-QQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADES 275
+L++ + +LI Q +Q+ S + +K T KQ++ L P L FPG + A
Sbjct: 136 ILDELINQLIQQYEQQGSINFQRIKLTLEKQSQPLLTP----------LAFPGKVLATSV 185
Query: 276 GNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPAASFYHKDDDCLY 334
G LF++D+ HHRII +G++L IG GF DG+FE++ P + + + LY
Sbjct: 186 G--LFIADTAHHRIIWTKFDGEVLQIIGVGKAGFVDGDFEAATFSTPQGMTFDEGNQILY 243
Query: 335 IVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQS 394
I D+ NHA+R+ D+G +++ T+ T GI +N I G + S
Sbjct: 244 IADTGNHALRQVDLGNQLVRTIAGT-GIQSRN------IRPHCGKALETPLNS------- 289
Query: 395 LIFPWHLMKSEDDNLLIINRSFETLWIMDLAS 426
PW L+K + NL I LW MDL +
Sbjct: 290 ---PWDLVKIK-GNLWIAIAGSHQLWKMDLET 317
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 4/82 (4%)
Query: 269 CISADESGNRLFLSDSNHHRIIVFDGNGKILDCI--GSCPGFEDGEFESSKLMRPAASFY 326
C+ + N L+++D+ +H+I + D + I + GF D + ++ P+ Y
Sbjct: 402 CLGIEYIDNSLWIADTYNHKIKLVDSQSRNCQTILGDTQSGFIDAKGTDARFFEPSGLSY 461
Query: 327 HKDDDCLYIVDSENHAIRRADM 348
D LYI D+ NHAIR+ ++
Sbjct: 462 W--DSYLYIADTNNHAIRKVNL 481
>gi|302786076|ref|XP_002974809.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
gi|300157704|gb|EFJ24329.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
Length = 1049
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 98/218 (44%), Gaps = 29/218 (13%)
Query: 214 DIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISAD 273
D+ L A + +++ ++ P L+ +K A +L P L FPG ++ D
Sbjct: 528 DLDNLVDAALQYYGEKKLLNAQPIPLQLEKSKSARILTTP----------LSFPGKLATD 577
Query: 274 ESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPAASFYHKDDDC 332
+ +RLF+SDSNH+RI+V D NG + +GS G DG F + RP Y + +
Sbjct: 578 LANSRLFISDSNHNRIVVTDLNGNYILQVGSGAEGLTDGSFSLASFNRPQGVAYDTEKNI 637
Query: 333 LYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDP 392
LY+ D+ENHA+R D V ETV +G ++ GF+ +
Sbjct: 638 LYVADTENHALREVDF---VKETVRTLAGTGER------------GFDYRGGKAGQS--- 679
Query: 393 QSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIK 430
Q L PW L D L I +W + ++G K
Sbjct: 680 QPLNSPWDLCLHPDGMLYIAMAGEHQIWEFNRSNGVAK 717
>gi|283781091|ref|YP_003371846.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
gi|283439544|gb|ADB17986.1| NHL repeat containing protein [Pirellula staleyi DSM 6068]
Length = 782
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 82/170 (48%), Gaps = 13/170 (7%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPA 322
L FPG I AD +G RLF+SDSNH+RI++ +GK+++ IGS G DG F + P
Sbjct: 293 LRFPGKILADAAGQRLFISDSNHNRIVITSLDGKLIETIGSGVIGKADGSFAEASFDHPQ 352
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
D + LY+ D+ENH +R+ D+ ++ + T+ +G K+ S W I E
Sbjct: 353 GCAL--DGETLYVADTENHLLRKIDLTKKTVTTI---AGTGKQAESAWPGIEE---VRLT 404
Query: 383 NDTKSEKLDPQSLI---FPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
+ + P PW L E NL I +W M L EI
Sbjct: 405 AEVPERWVGPPKTTGINSPWALWVHE-KNLYIAMAGPHQIWKMPLDESEI 453
>gi|302760605|ref|XP_002963725.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
gi|300168993|gb|EFJ35596.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
Length = 1052
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 97/218 (44%), Gaps = 29/218 (13%)
Query: 214 DIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISAD 273
D+ L A + +++ ++ P L+ +K A +L P L FPG ++ D
Sbjct: 531 DLDNLVDAALQYYGEKKLLNAQPIPLQLEKSKSARILTTP----------LSFPGKLATD 580
Query: 274 ESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPAASFYHKDDDC 332
+ RLF+SDSNH+RI+V D NG + +GS G DG F + RP Y + +
Sbjct: 581 LANGRLFISDSNHNRIVVTDLNGNYILQVGSGSEGLTDGSFSLASFNRPQGVAYDTEKNI 640
Query: 333 LYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDP 392
LY+ D+ENHA+R D V ETV +G ++ GF+ +
Sbjct: 641 LYVADTENHALREVDF---VKETVRTLAGTGER------------GFDYRGGKAGQS--- 682
Query: 393 QSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIK 430
Q L PW L D L I +W + ++G K
Sbjct: 683 QPLNSPWDLCLHPDGMLYIAMAGEHQIWEFNRSNGVAK 720
>gi|254417234|ref|ZP_05030979.1| Redoxin family [Coleofasciculus chthonoplastes PCC 7420]
gi|196176040|gb|EDX71059.1| Redoxin family [Coleofasciculus chthonoplastes PCC 7420]
Length = 496
Score = 86.7 bits (213), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 100/215 (46%), Gaps = 30/215 (13%)
Query: 217 MLNKAVEELIMQQQENSS-SPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADES 275
+L++ + ELI + Q + + L T KQ L P L FPG + ADE
Sbjct: 136 VLDQLIGELIQKHQHKGTMTMEPLHFTLEKQQHPLTNP----------LAFPGKVLADEF 185
Query: 276 GNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHKDDDCLY 334
+RLF++DS HHRI+V G++L+ IG+ PG DG F ++ P D+ LY
Sbjct: 186 SDRLFIADSGHHRILVSTLTGEVLNIIGTGEPGLTDGTFAEAQFFAPQGITLDPDNQRLY 245
Query: 335 IVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQS 394
+ D++NH +R + + +ET+ T S++ LG + +
Sbjct: 246 VADTDNHCLRCIHLETKQVETIAGTGEQSRQIQP-----HSGLGLQ------------TA 288
Query: 395 LIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
L PW + + D+ LLI +W M L +GEI
Sbjct: 289 LNSPWDI-EIIDNRLLIAMAGSHQIWEMQLETGEI 322
>gi|386811933|ref|ZP_10099158.1| conserved hypothetical protein [planctomycete KSU-1]
gi|386404203|dbj|GAB62039.1| conserved hypothetical protein [planctomycete KSU-1]
Length = 669
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 69/113 (61%), Gaps = 6/113 (5%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRP 321
L FPG + ADE NRLF++DSNH+RI++ +GK+LD +G+ G +DG F + P
Sbjct: 209 LFSFPGKVLADEISNRLFIADSNHNRIVIATLDGKVLDVVGNGKIGKDDGTFTEASFNHP 268
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIM 374
+H+ D LY+ D+ENH IR+ D+ + ++T+ +G K+ N + A M
Sbjct: 269 QGMAFHR--DYLYVADTENHLIRKLDLKTKTVKTI---AGTGKQANFMAAGGM 316
>gi|440683842|ref|YP_007158637.1| NHL repeat containing protein [Anabaena cylindrica PCC 7122]
gi|428680961|gb|AFZ59727.1| NHL repeat containing protein [Anabaena cylindrica PCC 7122]
Length = 506
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 69/215 (32%), Positives = 104/215 (48%), Gaps = 34/215 (15%)
Query: 220 KAVEELIMQ-QQENSSSPS--GLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESG 276
+A+E +I+Q QQ+N + + + T KQ + L P L FPG + A SG
Sbjct: 135 QAIENVIIQLQQKNQAHINFQQFQFTLEKQHQPLITP----------LAFPGKVLATSSG 184
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAASFYHKDDDCLYI 335
LF++DS HHRI++ +GKIL IG+ G DG F + P + + LYI
Sbjct: 185 --LFIADSGHHRIVMTRFDGKILYVIGTGKSGLTDGAFNQVQFSAPQGMTFDDESQILYI 242
Query: 336 VDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSL 395
D+EN+A+R+ D+ R+V+ET+ +G ++N+ I G + S
Sbjct: 243 ADTENYALRQIDLQRQVVETI---AGTGQQNHI----ISPHGGVALETALNS-------- 287
Query: 396 IFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIK 430
PW L K + LLI +W MDL G I+
Sbjct: 288 --PWDLQKV-GNKLLIAMAGSHQIWEMDLERGTIE 319
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 50/90 (55%), Gaps = 4/90 (4%)
Query: 269 CISADESGNRLFLSDSNHHRI-IVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPAASFY 326
C+ + N L+++D+ +H+I +V G +G PGF+DG+ ++++ P+
Sbjct: 400 CLGVEYFQNYLWVADTYNHKIKLVHPTTGNCQTVLGDGLPGFQDGQGKNTRFSEPSG--L 457
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETV 356
D LYI D+ NHAIRR ++ ++T+
Sbjct: 458 SGIDSYLYISDTNNHAIRRVNLKTLTVQTL 487
>gi|167041535|gb|ABZ06284.1| putative NHL repeat protein [uncultured marine microorganism
HF4000_008B14]
Length = 426
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 80/172 (46%), Gaps = 32/172 (18%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG + AD NRLF++DSNH+RIIV +G L +GS PG DG F A
Sbjct: 108 LSFPGKVVADADSNRLFIADSNHNRIIVTTLDGDALKVVGSGEPGMVDGSFA-------A 160
Query: 323 ASFYHKD-----DDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKL 377
ASF H D LY+ D+ENHAIR+ D+ +ET+ T S+ L
Sbjct: 161 ASFDHPQGMVLVGDLLYVADAENHAIRKVDLAAEQVETIAGTG----TQGSMREGTGPGL 216
Query: 378 GFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
E ++ PW L E +L I LW MDL+SG +
Sbjct: 217 HTELNS--------------PWDLAFHE-GSLYIAMAGLHQLWSMDLSSGTV 253
>gi|186683945|ref|YP_001867141.1| redoxin domain-containing protein [Nostoc punctiforme PCC 73102]
gi|186466397|gb|ACC82198.1| Redoxin domain protein [Nostoc punctiforme PCC 73102]
Length = 509
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 100/215 (46%), Gaps = 32/215 (14%)
Query: 218 LNKAVEELIMQQQENSS-SPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESG 276
L++ +++LI Q Q+ + + L KQ + L P L FPG + A ++G
Sbjct: 137 LDELIQKLIQQHQDKGTINFQELSLILEKQRQPLITP----------LAFPGKVLATQTG 186
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAASFYHKDDDCLYI 335
LF++DS HHR+I+ +G+IL IG+ G DG F ++ P Y ++ LY+
Sbjct: 187 --LFIADSGHHRLIMSSFDGEILHLIGTGKSGLTDGAFNEAQFFAPQGMAYDAENQILYV 244
Query: 336 VDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSL 395
D+ENH +RR DM R+V++ + T S+ I G + S
Sbjct: 245 ADTENHTLRRVDMKRQVVDAIAGTGEQSRN-------IHPHGGVGLETPLNS-------- 289
Query: 396 IFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIK 430
PW L+K + L I +W MDL + IK
Sbjct: 290 --PWDLVKV-GNTLFIAMAGPHQIWEMDLETSMIK 321
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 47/82 (57%), Gaps = 5/82 (6%)
Query: 269 CISADESGNRLFLSDSNHHRI--IVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
C+ + + N ++++D+ +H+I + GN + + G+ G +DG+ ++S+ P+
Sbjct: 402 CLGVEFAQNFIWVTDTYNHKIKLVSPSGNCQTIQGDGTA-GLQDGQGKNSRFFEPSG--L 458
Query: 327 HKDDDCLYIVDSENHAIRRADM 348
+ LYI D+ NHAIRR D+
Sbjct: 459 SINGSYLYIADTNNHAIRRVDL 480
>gi|434403812|ref|YP_007146697.1| thiol-disulfide isomerase-like thioredoxin [Cylindrospermum
stagnale PCC 7417]
gi|428258067|gb|AFZ24017.1| thiol-disulfide isomerase-like thioredoxin [Cylindrospermum
stagnale PCC 7417]
Length = 506
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/182 (31%), Positives = 91/182 (50%), Gaps = 29/182 (15%)
Query: 209 HENSLDIGMLNKAVEELIMQQQENSS-SPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFP 267
H +SLD + +E LI + +E + + L T KQ + L P L FP
Sbjct: 133 HRDSLD-----QLIENLIHEHREKGTINFQELSLTLEKQRQPLITP----------LAFP 177
Query: 268 GCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAASFY 326
G + A +G LF++DS HHR+++ NG++ IG+ GF DG F ++ P Y
Sbjct: 178 GKVLATPAG--LFIADSGHHRLVMSSFNGEVFHIIGTGKSGFTDGNFSEAQFSAPQGMAY 235
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETVY----------PTSGISKKNNSLWAWIMEK 376
++ LY+ D+ENHA+R+ D+ R+V+ T+ P SG + + W +EK
Sbjct: 236 DAENQILYVADTENHALRKVDIKRQVVGTIAGTGEQSHNTRPHSGAALETALNSPWDLEK 295
Query: 377 LG 378
+G
Sbjct: 296 IG 297
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 46/83 (55%), Gaps = 6/83 (7%)
Query: 269 CISADESGNRLFLSDSNHHRIIVFD---GNGKILDCIGSCPGFEDGEFESSKLMRPAASF 325
C+ D N L+++D+ +H+I + + G+ + + GS G ++G+ ++++ P+
Sbjct: 402 CLGVDFFQNHLWVTDTYNHKIKLVNPITGDCQTVLGDGSA-GLQNGQGKNTRFFEPSG-- 458
Query: 326 YHKDDDCLYIVDSENHAIRRADM 348
D LYI D+ NH IRR D+
Sbjct: 459 LSAIDSYLYISDTNNHVIRRVDL 481
>gi|347756512|ref|YP_004864075.1| Thiol-disulfide isomerase / thioredoxin [Candidatus
Chloracidobacterium thermophilum B]
gi|347589029|gb|AEP13558.1| Thiol-disulfide isomerase / thioredoxin [Candidatus
Chloracidobacterium thermophilum B]
Length = 676
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/106 (40%), Positives = 63/106 (59%), Gaps = 6/106 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPA 322
L FPG + AD + RLF++D+NHHRI+V +GK+LD IGS PG +G + ++ P
Sbjct: 216 LRFPGKVYADAASQRLFIADTNHHRIVVAGFDGKVLDTIGSGAPGTRNGPYNFAEFQYPQ 275
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNS 368
D D LY+ D+ NH +RR ++ R++ETV +G K S
Sbjct: 276 G--MALDGDFLYVADTGNHLLRRVNLKTRMVETV---AGTGKNERS 316
>gi|119509213|ref|ZP_01628363.1| NHL repeat protein [Nodularia spumigena CCY9414]
gi|119466055|gb|EAW46942.1| NHL repeat protein [Nodularia spumigena CCY9414]
Length = 502
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 106/215 (49%), Gaps = 32/215 (14%)
Query: 218 LNKAVEELIMQQQENSS-SPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESG 276
L++ +++LI + QE + + + T KQ + L P L FPG + A +G
Sbjct: 137 LDELIQQLISEHQEKGTINFQEISVTLEKQRQALTTP----------LAFPGKVLATPAG 186
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAASFYHKDDDCLYI 335
LF++DS HHR+++ +G+IL IG+ G DG F ++ P + ++ LY+
Sbjct: 187 --LFIADSGHHRVVMSSYDGEILHLIGTGKSGLTDGAFNEAQFFAPQGMAFDPENQMLYV 244
Query: 336 VDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSL 395
D+ENH +RR D+ R++++T+ +G +++ +++ G E +L
Sbjct: 245 ADTENHTLRRVDLKRQIVKTI---AGTGEQSRNIYP--HGGAGIET------------AL 287
Query: 396 IFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIK 430
PW +++ L I +W M+L SG +K
Sbjct: 288 NSPWDVVQL-GQTLFIAMAGSHQIWQMNLESGIVK 321
Score = 47.0 bits (110), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 49/90 (54%), Gaps = 4/90 (4%)
Query: 269 CISADESGNRLFLSDSNHHRI-IVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPAASFY 326
C+ + + N L+++D+ +H+I +V G +G G +DG+ ++++ P+
Sbjct: 402 CLGVEYAQNYLWVADTYNHKIKLVSPSTGNCQTILGDGSAGLQDGQGQNTRFFEPSG--L 459
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETV 356
D LYI D+ NHAIRR D+ + + T+
Sbjct: 460 SVMGDYLYISDTNNHAIRRGDLNKFEVTTL 489
>gi|384248465|gb|EIE21949.1| hypothetical protein COCSUDRAFT_16964 [Coccomyxa subellipsoidea
C-169]
Length = 1021
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 88/179 (49%), Gaps = 25/179 (13%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGN--------GKILDCIG-SCPGFEDGEFE 314
L +PG ++AD +G RLF+SDSN+HR G+ LD IG + P DG ++
Sbjct: 510 LRYPGKLTADLAGGRLFISDSNNHRHAAHATALLHSTPHPGRFLDQIGGNGPALRDGAYQ 569
Query: 315 SSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIM 374
++ L RP Y DCLY+ D+EN+A+R ++ R L T+ ++++
Sbjct: 570 AAALYRPQGLAYSARRDCLYVADTENNALREVNLKTRTLRTLAGDGAKAEQD-------- 621
Query: 375 EKLGFERDNDTKSEKLD-PQSLIF---PWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
+ ++++L+ P L F H + + +D+L I +W DLASG +
Sbjct: 622 ----YTGGRSGRAQRLNSPWDLAFNAQANHQLGNPEDSLYIAMAGQHQIWRHDLASGSM 676
>gi|12323029|gb|AAG51506.1|AC058785_9 hypothetical protein [Arabidopsis thaliana]
Length = 1041
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRP 321
L FPG ++ D NRLF+SDSNH+RIIV D G + IGS GF+DG FE + RP
Sbjct: 566 LKFPGKLAIDTLNNRLFISDSNHNRIIVTDLEGNFIVQIGSSGEEGFQDGSFEDAAFNRP 625
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
Y+ + LY+ D+ENHA+R D ++T+
Sbjct: 626 QGLAYNAKKNLLYVADTENHALREIDFVNERVQTL 660
>gi|30696124|ref|NP_564718.2| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|17529226|gb|AAL38840.1| unknown protein [Arabidopsis thaliana]
gi|332195279|gb|AEE33400.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 1055
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRP 321
L FPG ++ D NRLF+SDSNH+RIIV D G + IGS GF+DG FE + RP
Sbjct: 566 LKFPGKLAIDTLNNRLFISDSNHNRIIVTDLEGNFIVQIGSSGEEGFQDGSFEDAAFNRP 625
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
Y+ + LY+ D+ENHA+R D ++T+
Sbjct: 626 QGLAYNAKKNLLYVADTENHALREIDFVNERVQTL 660
>gi|126179662|ref|YP_001047627.1| NHL repeat-containing protein [Methanoculleus marisnigri JR1]
gi|125862456|gb|ABN57645.1| NHL repeat containing protein [Methanoculleus marisnigri JR1]
Length = 487
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 59/102 (57%), Gaps = 1/102 (0%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEF 313
A + R L+ PG ++AD +G RLF+SD+ HHRI+V G+GKIL+ IGS G DG F
Sbjct: 162 APETARETSLYRPGKVAADNAGMRLFISDTGHHRIVVAGGDGKILETIGSGAAGNADGPF 221
Query: 314 ESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLET 355
+ + P + ++ LY+ D+ NH IRR R + T
Sbjct: 222 DEAAFYLPEGLAFDEEAGILYVADTGNHTIRRVSFTERRVAT 263
>gi|427739164|ref|YP_007058708.1| thiol-disulfide isomerase-like thioredoxin [Rivularia sp. PCC 7116]
gi|427374205|gb|AFY58161.1| thiol-disulfide isomerase-like thioredoxin [Rivularia sp. PCC 7116]
Length = 487
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 71/116 (61%), Gaps = 8/116 (6%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG + A +S +LF++DS HHRI+V NG++L IGS G +DG FE ++ P
Sbjct: 185 LSFPGKVLATDS--KLFIADSGHHRIVVSSLNGEVLHLIGSGKAGLKDGSFEETEFFAPQ 242
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLG 378
+ + D+ LY+ D+EN+AIR+ D + +ET+ +GI + NS W + KLG
Sbjct: 243 GMAFDEVDNILYVADTENNAIRKVDFKSQKVETI---AGIKESLNSPWDLV--KLG 293
>gi|297848174|ref|XP_002891968.1| hypothetical protein ARALYDRAFT_892829 [Arabidopsis lyrata subsp.
lyrata]
gi|297337810|gb|EFH68227.1| hypothetical protein ARALYDRAFT_892829 [Arabidopsis lyrata subsp.
lyrata]
Length = 1058
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 53/87 (60%), Gaps = 2/87 (2%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRP 321
L FPG ++ D NRLF+SDSNH+RIIV D G + IGS GF+DG FE + RP
Sbjct: 569 LKFPGKLAIDTLNNRLFISDSNHNRIIVTDLEGNFIVQIGSTGEEGFQDGSFEDAAFNRP 628
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADM 348
Y+ + LY+ D+ENHA+R D
Sbjct: 629 QGLAYNAKKNLLYVADTENHALREIDF 655
>gi|356558815|ref|XP_003547698.1| PREDICTED: uncharacterized protein LOC100803900 [Glycine max]
Length = 90
Score = 82.4 bits (202), Expect = 8e-13, Method: Composition-based stats.
Identities = 38/89 (42%), Positives = 58/89 (65%), Gaps = 4/89 (4%)
Query: 134 FEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPILLSNKNFPQMENG 193
EK+K+IQ+SFP + ++G I + DQ L+++LM E ITFPILLS + P++E G
Sbjct: 2 LEKLKTIQKSFPHINIMGL----KAIHSSDQVHLIQLLMTENITFPILLSRQKTPKIEKG 57
Query: 194 ACYLLSKDFGNARVFHENSLDIGMLNKAV 222
ACY+L K F + ++HE + +L+K V
Sbjct: 58 ACYILFKIFRSPVIYHEKDAGLEILSKGV 86
>gi|383100947|emb|CCD74491.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis halleri subsp. halleri]
Length = 1028
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/83 (48%), Positives = 52/83 (62%), Gaps = 2/83 (2%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRP 321
L FPG ++ D NRLF+SDSNH+RIIV D G + IGS GF+DG FE + RP
Sbjct: 569 LKFPGKLAIDTLNNRLFISDSNHNRIIVTDLEGNFIVQIGSTGEEGFQDGSFEDAAFNRP 628
Query: 322 AASFYHKDDDCLYIVDSENHAIR 344
Y+ + LY+ D+ENHA+R
Sbjct: 629 QGLAYNAKKNLLYVADTENHALR 651
>gi|428214156|ref|YP_007087300.1| thiol-disulfide isomerase-like thioredoxin [Oscillatoria acuminata
PCC 6304]
gi|428002537|gb|AFY83380.1| thiol-disulfide isomerase-like thioredoxin [Oscillatoria acuminata
PCC 6304]
Length = 506
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 77/167 (46%), Gaps = 18/167 (10%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPA 322
L FPG + AD + +RLF++DS HHRIIV NG + IG+ PG DG F ++ P
Sbjct: 175 LAFPGKVLADAASSRLFIADSGHHRIIVTTLNGTVQSIIGNGIPGLTDGSFSEAQFFGPQ 234
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
+ + LY+ D+ENHAIR+ D ++T+ +G ++N R
Sbjct: 235 GMSWDANSQRLYVADTENHAIRQIDFTTEQVQTL---AGTGEQN--------------RT 277
Query: 383 NDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
+ L PW L + LL+ +W +DL G +
Sbjct: 278 QGPQGGHGLETPLNSPWDLELIGGEVLLMAMAGSHQIWALDLMDGTV 324
>gi|221633398|ref|YP_002522623.1| hypothetical protein trd_1418 [Thermomicrobium roseum DSM 5159]
gi|221156519|gb|ACM05646.1| conserved hypothetical protein [Thermomicrobium roseum DSM 5159]
Length = 443
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/300 (24%), Positives = 125/300 (41%), Gaps = 42/300 (14%)
Query: 137 VKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPILLSN-----KNFPQME 191
++++++ +P V+ +H + + + + +++E IT P++ +++
Sbjct: 1 MRAVEERYPTQVVVIGVHSPKFPNEREPQNVRQAVLREEITHPVVHDPALQIWRSYAIRA 60
Query: 192 NGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLK 251
LS D G HE + L +AV L+ + +S P L VL+
Sbjct: 61 WPTLVFLSPD-GYLLGIHEGEISAESLVRAVGRLLARTGVTTSEPFALLS-------VLE 112
Query: 252 EPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFED 310
P L FPG ++ D S +RL +SD+ HHR+++ +G + IG PG D
Sbjct: 113 RPQGP-------LAFPGKLAVDPSRDRLVVSDTGHHRLVIARLDGTVTAVIGDGRPGLVD 165
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLW 370
G F ++ P + C ++ D NHAIRR D+ +ET+ T + +
Sbjct: 166 GTFAEARFREPQG-IALVGETC-FVADRGNHAIRRIDLAAGTVETLAGTGRLGQG----- 218
Query: 371 AWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIK 430
M G D +S PW L L + +W++DLASG I+
Sbjct: 219 ---MLSAGPAHQVDLRS----------PWALWHRR-GLLFVAMAGSHQIWVLDLASGIIQ 264
>gi|302834543|ref|XP_002948834.1| hypothetical protein VOLCADRAFT_80356 [Volvox carteri f.
nagariensis]
gi|300266025|gb|EFJ50214.1| hypothetical protein VOLCADRAFT_80356 [Volvox carteri f.
nagariensis]
Length = 774
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 80/166 (48%), Gaps = 20/166 (12%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCI-GSCPGFEDGEFESSKLMRPA 322
L +PG I++D S RLF+SDSN+HRI++ D NG+ ++ I G+ P DG FE++ RP
Sbjct: 260 LRYPGKIASDLSSGRLFVSDSNNHRIVITDLNGRFIEQIGGNGPALRDGSFETAAFNRPQ 319
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
+ + L++ D+ENHA+R D+ + T + G +
Sbjct: 320 GLVFSPRRNQLFVADTENHAVRCCDLTSNTVIT------------------LAGNGSKGR 361
Query: 383 NDTKSEKLDPQSLIFPWHL-MKSEDDNLLIINRSFETLWIMDLASG 427
+ Q L PW + + + D L I +W ++L+SG
Sbjct: 362 DYRGGRGGSAQPLNSPWDVELDARQDYLYIALAGQHQIWDLELSSG 407
>gi|434391812|ref|YP_007126759.1| NHL repeat containing protein [Gloeocapsa sp. PCC 7428]
gi|428263653|gb|AFZ29599.1| NHL repeat containing protein [Gloeocapsa sp. PCC 7428]
Length = 509
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 76/149 (51%), Gaps = 14/149 (9%)
Query: 218 LNKAVEELIMQQQENSS-SPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESG 276
+++ V +LI + QE + S L KQ + L P L FPG + A +
Sbjct: 136 IDQLVAQLIQKHQEKGTFSAQVLNLILEKQRQPLITP----------LAFPGKVLATTAS 185
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHKDDDCLYI 335
LF++DS HHRI+V G++ IG+ PG DG F ++ P + + LY+
Sbjct: 186 --LFIADSGHHRIVVSTLAGELQYVIGTGKPGLRDGSFSEAQFFSPQGMCFDTEKQLLYV 243
Query: 336 VDSENHAIRRADMGRRVLETVYPTSGISK 364
D+ENHAIRR D R+V+ET+ T S+
Sbjct: 244 ADTENHAIRRIDFQRQVVETIAGTGDQSR 272
>gi|440794391|gb|ELR15552.1| haloacid dehalogenase family hydrolase family protein [Acanthamoeba
castellanii str. Neff]
Length = 731
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 83/171 (48%), Gaps = 23/171 (13%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS--CPGFEDGEFESSKLMRP 321
L +PG I+ D RLF+SDSN+HRI+V D G L+ IG+ GF+DG + +K R
Sbjct: 223 LLYPGKIAIDNEHGRLFISDSNNHRIVVTDLQGAFLEEIGTRGSLGFKDGSYAEAKFNRL 282
Query: 322 AASFYHKDDDC--LYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
YHK D+ LY+ D+ENHA+R D+ + +TV +G + N ++ K G
Sbjct: 283 QGVAYHKTDEHERLYVADAENHALRVVDL---LAKTVTTLAGDGTQGND---FVGGKSGR 336
Query: 380 ERDNDTKSEKLDPQSLIFPWHLMKSED-DNLLIINRSFETLWIMDLASGEI 429
E Q L PW + S D L + +W + L S +
Sbjct: 337 E------------QQLSTPWDVALSPDGQQLFVAMAGTHQIWEVSLDSSAV 375
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/90 (25%), Positives = 41/90 (45%), Gaps = 18/90 (20%)
Query: 277 NRLFLSDSNHHRIIVFD----------GNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
NR+ +D+ +H++ + D GNGK PG DG ++ P+ +
Sbjct: 472 NRVVTTDTYNHKVKLLDPATNVISFWLGNGK--------PGLSDGVGAEAQFYEPSGACV 523
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETV 356
D +++ D+ NHAIR D+ + T+
Sbjct: 524 TADGKTVFVCDTNNHAIRVIDVDTAAVRTL 553
>gi|296089788|emb|CBI39607.3| unnamed protein product [Vitis vinifera]
Length = 1002
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/106 (42%), Positives = 60/106 (56%), Gaps = 5/106 (4%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRP 321
L FPG ++ D NRLF+SDSNH+RI+V D NG + IGS G DG F+ + RP
Sbjct: 514 LKFPGKLAIDVINNRLFISDSNHNRIVVTDLNGNYILQIGSTGEEGLRDGSFDDATFNRP 573
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNN 367
Y+ + LY+ D+ENHA+R D V ETV +G K +
Sbjct: 574 QGLAYNAKKNLLYVADTENHALREIDF---VNETVQTLAGNGTKGS 616
>gi|225450563|ref|XP_002277564.1| PREDICTED: NHL repeat-containing protein 2-like [Vitis vinifera]
Length = 1078
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRP 321
L FPG ++ D NRLF+SDSNH+RI+V D NG + IGS G DG F+ + RP
Sbjct: 590 LKFPGKLAIDVINNRLFISDSNHNRIVVTDLNGNYILQIGSTGEEGLRDGSFDDATFNRP 649
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
Y+ + LY+ D+ENHA+R D ++T+
Sbjct: 650 QGLAYNAKKNLLYVADTENHALREIDFVNETVQTL 684
>gi|168024767|ref|XP_001764907.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683943|gb|EDQ70349.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1040
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 40/94 (42%), Positives = 58/94 (61%), Gaps = 1/94 (1%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK-ILDCIGSCPGFEDGEFESSKLMRPA 322
L FPG ++ D RLF+SDSNHHRI+V D +G I++ G G +DG FE+S RP
Sbjct: 543 LKFPGKLTTDLVNGRLFISDSNHHRIVVTDLDGNFIVNVGGGGEGLKDGSFETSLFNRPQ 602
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
Y+ + LY+ D+ENHA+R D+ + + T+
Sbjct: 603 GVAYNAFKNVLYVADTENHALREIDLAKETVRTL 636
>gi|428206267|ref|YP_007090620.1| NHL repeat containing protein [Chroococcidiopsis thermalis PCC
7203]
gi|428008188|gb|AFY86751.1| NHL repeat containing protein [Chroococcidiopsis thermalis PCC
7203]
Length = 502
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 100/213 (46%), Gaps = 32/213 (15%)
Query: 217 MLNKAVEELIMQQQENSSSPSG-LKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADES 275
+L++ + +LI + ++ + S L KQ + L P A FPG + A +
Sbjct: 137 VLDELIAKLIQEYRDKGAIASQELSLILEKQRQPLTTPIA----------FPGKVLATAT 186
Query: 276 GNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHKDDDCLY 334
G LF++DS HHRII +G+IL IG+ PG DG+F ++ P ++ LY
Sbjct: 187 G--LFIADSGHHRIIWSTLDGEILHVIGTGKPGLTDGDFTQAQFFAPQGMVLDDENQLLY 244
Query: 335 IVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQS 394
+ D+ENHAIRR ++ + +ET+ T G +N + I +K+ +
Sbjct: 245 LADTENHAIRRINLPLQRVETIAGT-GEQSRNIRPYGGIGQKI----------------A 287
Query: 395 LIFPWHLMKSEDDNLLIINRSFETLWIMDLASG 427
L PW L + + L I +W MDL G
Sbjct: 288 LNSPWDLTMLK-NYLFIAMAGNHQIWQMDLEKG 319
>gi|449435944|ref|XP_004135754.1| PREDICTED: NHL repeat-containing protein 2-like [Cucumis sativus]
Length = 1086
Score = 79.7 bits (195), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRP 321
L FPG ++ D NRLF+SDSNH+RI+V D +G L IGS G DG F+ + RP
Sbjct: 602 LKFPGKLAIDILNNRLFISDSNHNRIVVTDLSGNFLLQIGSTGEDGLRDGNFDDATFNRP 661
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
Y+ + LY+ D+ENHA+R D + + T+
Sbjct: 662 QGLAYNAKKNLLYVADTENHALREVDFVKERVRTL 696
>gi|224052833|ref|XP_002198291.1| PREDICTED: NHL repeat-containing protein 2 [Taeniopygia guttata]
Length = 727
Score = 79.7 bits (195), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 85/177 (48%), Gaps = 28/177 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+SG RL ++D+ HHRI+V NG+IL IG G DG F + P
Sbjct: 218 LLFPGKVAVDDSGGRLVIADTGHHRILVTRKNGQILHTIGGPNSGRRDGRFSEAAFNSPQ 277
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ +++ TV +GI + G +++
Sbjct: 278 GIAIK--NNVIYVADTENHLIRKIDLELQMVTTV---AGIGVQ------------GVDKE 320
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKEA 432
K E+ Q + PW ++ EDD L I +W + L G++ +
Sbjct: 321 GGAKGEE---QPISSPWDVVFGSSISGTQEDDVLWIAMAGIHQIWALMLEGGKLPKG 374
>gi|224125014|ref|XP_002319481.1| predicted protein [Populus trichocarpa]
gi|222857857|gb|EEE95404.1| predicted protein [Populus trichocarpa]
Length = 1065
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 61/118 (51%), Gaps = 5/118 (4%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRP 321
L FPG ++ D NRLF+SDSNH+RI+V D +G + IGS G DG F+ + RP
Sbjct: 575 LKFPGKLAIDVLNNRLFISDSNHNRIVVTDLDGNFIAQIGSSGEEGLRDGSFDDATFNRP 634
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
Y+ + LY+ D+ENHA+R D V E V +G K + L F
Sbjct: 635 QGLAYNSKKNILYVADTENHALREVDF---VSEKVRTLAGNGTKGSDYQGAFGASLTF 689
>gi|449524308|ref|XP_004169165.1| PREDICTED: NHL repeat-containing protein 2-like, partial [Cucumis
sativus]
Length = 892
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRP 321
L FPG ++ D NRLF+SDSNH+RI+V D +G L IGS G DG F+ + RP
Sbjct: 602 LKFPGKLAIDILNNRLFISDSNHNRIVVTDLSGNFLLQIGSTGEDGLRDGNFDDATFNRP 661
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
Y+ + LY+ D+ENHA+R D + + T+
Sbjct: 662 QGLAYNAKKNLLYVADTENHALREVDFVKERVRTL 696
>gi|390559793|ref|ZP_10244078.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
gi|390173637|emb|CCF83377.1| conserved hypothetical protein [Nitrolancetus hollandicus Lb]
Length = 442
Score = 79.0 bits (193), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 92/217 (42%), Gaps = 33/217 (15%)
Query: 209 HENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPG 268
HE +D G L V+ LI + A+ A P L FPG
Sbjct: 81 HEGEIDAGSLAATVQRLIEHYDKEGIISRASIPGIARMAR----PSGA-------LAFPG 129
Query: 269 CISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYH 327
+ AD +GNRLF++DS H+RI+V +G IGS PG DG+ +++ P
Sbjct: 130 KVLADAAGNRLFIADSGHNRIVVTRLDGAEPWMIGSGQPGLADGDAATAQFNHPQGMALA 189
Query: 328 KDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKS 387
D LY+ D+ NH IRR D+ R +ET+ +G + S G R D +S
Sbjct: 190 G--DILYLADTRNHVIRRVDLASRQVETI---AGTGEPGMS-----YASAGSARTIDLRS 239
Query: 388 EKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDL 424
PW ++ + D L I +W +D+
Sbjct: 240 ----------PWDVLL-QGDTLYIAMAGMHQIWALDI 265
>gi|326923951|ref|XP_003208196.1| PREDICTED: NHL repeat-containing protein 2-like [Meleagris
gallopavo]
Length = 727
Score = 79.0 bits (193), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 85/177 (48%), Gaps = 28/177 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+SG RL ++D+ HHRI+V NG+IL IG G +DG F + P
Sbjct: 218 LLFPGKVTVDKSGERLVIADTGHHRILVTLKNGQILHTIGGPNSGRKDGRFSEAAFNSPQ 277
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ ++ TV +GI + G +++
Sbjct: 278 GVAIK--NNVIYVADTENHLIRKIDLDLEIVTTV---AGIGIQ------------GVDKE 320
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKEA 432
K E+ Q + PW ++ EDD L I +W + L G++ +
Sbjct: 321 GGAKGEE---QPISSPWDVVLGNSVAGTEEDDVLWIAMAGIHQVWALMLEGGKLPKG 374
>gi|57529823|ref|NP_001006504.1| NHL repeat-containing protein 2 [Gallus gallus]
gi|82081010|sp|Q5ZI67.1|NHLC2_CHICK RecName: Full=NHL repeat-containing protein 2
gi|53136494|emb|CAG32576.1| hypothetical protein RCJMB04_29n5 [Gallus gallus]
Length = 727
Score = 79.0 bits (193), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 85/177 (48%), Gaps = 28/177 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+SG RL ++D+ HHRI+V NG+IL IG G +DG F + P
Sbjct: 218 LLFPGKVTVDKSGERLVIADTGHHRILVTLKNGQILHTIGGPNSGRKDGRFSEAAFNSPQ 277
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ ++ TV +GI + G +++
Sbjct: 278 GVAIK--NNVIYVADTENHLIRKIDLELEIVTTV---AGIGIQ------------GVDKE 320
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKEA 432
K E+ Q + PW ++ EDD L I +W + L G++ +
Sbjct: 321 GGAKGEE---QPISSPWDVVFGNSVSGTQEDDVLWIAMAGIHQVWALMLEGGKLPKG 374
>gi|427731644|ref|YP_007077881.1| thiol-disulfide isomerase-like thioredoxin [Nostoc sp. PCC 7524]
gi|427367563|gb|AFY50284.1| thiol-disulfide isomerase-like thioredoxin [Nostoc sp. PCC 7524]
Length = 505
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 103/215 (47%), Gaps = 32/215 (14%)
Query: 217 MLNKAVEELIMQQQENSS-SPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADES 275
+L++ + ELI + QE + + T KQ + L P L FPG + A
Sbjct: 136 VLDQLIAELIQKHQEKGTINFQQFSLTLEKQRQPLITP----------LAFPGKVLATPV 185
Query: 276 GNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAASFYHKDDDCLY 334
G LF++DS H+R+++ + G++L IG+ GF DG F+ ++ P + + LY
Sbjct: 186 G--LFIADSGHNRLVLSNLKGEVLHLIGTGKSGFTDGAFDEAQFSAPQGMTFDAANQILY 243
Query: 335 IVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQS 394
+ D++NHA+R+ ++ R+++ET+ T S+ + +G E +
Sbjct: 244 VADTDNHAVRQINLKRQIVETIAGTGEQSRNIHP-----HGGVGLE------------TA 286
Query: 395 LIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
L PW L+K + L I +W +DL +G I
Sbjct: 287 LNSPWDLVKV-GNTLFIAMAGSHQIWRIDLDTGMI 320
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 45/82 (54%), Gaps = 4/82 (4%)
Query: 269 CISADESGNRLFLSDSNHHRI-IVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFY 326
C+ + + N L+++D+ +H+I +V G +G G +DG+ +S++ P+
Sbjct: 402 CLGIEHANNFLWVADTYNHKIKLVSPNTGNCQTILGDGNAGLQDGQGKSARFFEPSG--L 459
Query: 327 HKDDDCLYIVDSENHAIRRADM 348
D LYI D+ NHAIRR D+
Sbjct: 460 SAMDSNLYICDTNNHAIRRVDL 481
>gi|356571070|ref|XP_003553704.1| PREDICTED: NHL repeat-containing protein 2-like [Glycine max]
Length = 1098
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRP 321
L FPG ++ D NRLF+SDSNH+RI+V D +G + IGS G +DG F+ + RP
Sbjct: 609 LKFPGKLAIDVLNNRLFISDSNHNRIVVTDLDGNFIVQIGSSGEEGLQDGSFDDATFNRP 668
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
Y+ + LY+ D+ENHA+R D + T+
Sbjct: 669 QGLAYNAKKNILYVADTENHALREIDFVNEKVRTL 703
>gi|149177401|ref|ZP_01856005.1| hypothetical protein PM8797T_18906 [Planctomyces maris DSM 8797]
gi|148843734|gb|EDL58093.1| hypothetical protein PM8797T_18906 [Planctomyces maris DSM 8797]
Length = 683
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 88/179 (49%), Gaps = 32/179 (17%)
Query: 257 SSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFES 315
+ ++ L FPG + AD + RLF+SDSNH+RI++ +GK++D IGS G +DG++++
Sbjct: 213 AKLKQTPLKFPGKLLADPAQQRLFISDSNHNRIVIASLDGKLIDVIGSGQIGNKDGDYKT 272
Query: 316 SKLMRPAASFYHKD-----DDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLW 370
ASF H + LY+ D+ENH+IR D+ ++ + T+ T
Sbjct: 273 -------ASFDHPQGMALVGNTLYVADTENHSIRTVDLDKKQVSTLAGTG---------- 315
Query: 371 AWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
E+ F + KL +L PW L + DD L I +W L S EI
Sbjct: 316 ----EQARFR----SAGGKLKEAALNSPWALA-AIDDVLYICMAGPHQIWSHKLGSDEI 365
>gi|115474687|ref|NP_001060940.1| Os08g0135300 [Oryza sativa Japonica Group]
gi|113622909|dbj|BAF22854.1| Os08g0135300, partial [Oryza sativa Japonica Group]
Length = 139
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 78/146 (53%), Gaps = 9/146 (6%)
Query: 618 DTLALSTPESESNIE--DETTTSDLRSEDDTVHIDCAVNTSPGTSEVIISAALYLKLRRY 675
D LA S E + DE + E H +C VN SPGT E++ SAALYLK+
Sbjct: 1 DNLAFSEVPEEPTVHEGDENLSDGSYQEQRRAHFNCVVNVSPGTCELVASAALYLKI--- 57
Query: 676 PDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLKSNQDLRDVIFVKPLHVSIQFDT 735
D ++ + + S + ++ + + + + L++S +D RD+ +KP+H+ + +
Sbjct: 58 ----DSAKDDHGEQKSLVKRIIQCQRREDHAGVELLMESCKDSRDLTIMKPVHLRLMLEC 113
Query: 736 LDHPKADNSKDIILTDSNMEVDVSLN 761
DHP +K+ I ++S+ E+++SL+
Sbjct: 114 ADHPAGTTNKETISSESSFEINISLD 139
>gi|255542920|ref|XP_002512523.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
gi|223548484|gb|EEF49975.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
Length = 1016
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 55/95 (57%), Gaps = 2/95 (2%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRP 321
L FPG ++ D RLF+SDSNH+RI+V D +G + IGS G DG F+ + RP
Sbjct: 528 LKFPGKLAIDVLNKRLFISDSNHNRIVVTDLDGNFIVQIGSTGEEGLRDGPFDEATFNRP 587
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
Y+ + LY+ D+ENHA+R D ++ T+
Sbjct: 588 QGLAYNAKKNLLYVADTENHALREIDFVNEMVRTL 622
>gi|108707787|gb|ABF95582.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
protein, expressed [Oryza sativa Japonica Group]
Length = 772
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 73/138 (52%), Gaps = 16/138 (11%)
Query: 231 ENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRII 290
+NSS P L+ K + +L P L FPG ++ D NRLF+SDSNH+RI+
Sbjct: 550 QNSSLPLALEKD--KDSRLLASP----------LKFPGKLAIDVLNNRLFISDSNHNRIV 597
Query: 291 VFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMG 349
V + G+ + IGS G DG F+++ RP Y+ + LY+ D+ENHA+R +
Sbjct: 598 VTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINF- 656
Query: 350 RRVLETVYPTSGISKKNN 367
V ETV +G K +
Sbjct: 657 --VSETVKTLAGNGTKGS 672
>gi|37519956|ref|NP_923333.1| hypothetical protein gll0387 [Gloeobacter violaceus PCC 7421]
gi|35210948|dbj|BAC88328.1| gll0387 [Gloeobacter violaceus PCC 7421]
Length = 544
Score = 75.5 bits (184), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 74/141 (52%), Gaps = 15/141 (10%)
Query: 217 MLNKAVEELIMQQQENSSS-PSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADES 275
+L++ + EL+++Q P GL+ T ++A P L FPG ++
Sbjct: 177 LLDQLIGELVLRQTNQGRPLPGGLR-TRLEKASAPPTP----------LAFPGKLTV--Y 223
Query: 276 GNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPAASFYHKDDDCLY 334
G+RLF+SDS HHRI+ +G + IGS PG+ DG FE ++ P D L+
Sbjct: 224 GDRLFVSDSGHHRIVTSSLDGSSHESIGSGIPGWRDGNFEEAEFWAPQGLALSADGRTLF 283
Query: 335 IVDSENHAIRRADMGRRVLET 355
+ D+ENH +R+ D+ RR + T
Sbjct: 284 VCDTENHLLRKIDLVRRQVST 304
Score = 43.1 bits (100), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 51/97 (52%), Gaps = 6/97 (6%)
Query: 262 LLLHFPGCISADESGNRLFLSDSNHHRIIVFDG-NGKILDCIGSC-PGFEDGEFESSKLM 319
+LL P + D G L+L+D+ +H+I D +G+ G G+ DG+ ++
Sbjct: 436 VLLQHPLGVHWD--GGTLWLTDTYNHKIKRIDPESGRCETLTGHIDSGYLDGDLAEARFW 493
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
PA + ++ D +YI D+ NHAIR D+ R + T+
Sbjct: 494 EPAGLW--RNGDRIYIADTNNHAIRVIDLVSRRVSTL 528
>gi|222624800|gb|EEE58932.1| hypothetical protein OsJ_10596 [Oryza sativa Japonica Group]
Length = 1017
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 73/138 (52%), Gaps = 16/138 (11%)
Query: 231 ENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRII 290
+NSS P L+ K + +L P L FPG ++ D NRLF+SDSNH+RI+
Sbjct: 476 QNSSLPLALEKD--KDSRLLASP----------LKFPGKLAIDVLNNRLFISDSNHNRIV 523
Query: 291 VFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMG 349
V + G+ + IGS G DG F+++ RP Y+ + LY+ D+ENHA+R +
Sbjct: 524 VTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINF- 582
Query: 350 RRVLETVYPTSGISKKNN 367
V ETV +G K +
Sbjct: 583 --VSETVKTLAGNGTKGS 598
>gi|427706390|ref|YP_007048767.1| alkyl hydroperoxide reductase [Nostoc sp. PCC 7107]
gi|427358895|gb|AFY41617.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Nostoc sp. PCC 7107]
Length = 505
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 89/174 (51%), Gaps = 24/174 (13%)
Query: 217 MLNKAVEELIMQQQENSS-SPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADES 275
+L+K + ++I + Q+ S+ + + KQ + P L F G + A
Sbjct: 136 VLDKLIAKVIAEHQQKSTINFQQINHILEKQQQPTISP----------LAFTGKVLATPV 185
Query: 276 GNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAASFYHKDDDCLY 334
G LF++DS HHR+++ + +G++L IG+ G DG F +++ P + ++ Y
Sbjct: 186 G--LFIADSGHHRLVMSNFDGEVLHIIGTGKSGLTDGSFSTAQFSAPQGMVFDSENQIFY 243
Query: 335 IVDSENHAIRRADMGRRVLET----------VYPTSGISKKNNSLWAWIMEKLG 378
I D+ENHA+RR D+ ++++ET ++P SG++ + W + K+G
Sbjct: 244 IADTENHALRRVDLKQQIVETIAGTGEQSRNIHPHSGVALETALNSPWDLVKVG 297
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 45/82 (54%), Gaps = 4/82 (4%)
Query: 269 CISADESGNRLFLSDSNHHRI-IVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPAASFY 326
C+ + + N L+++D+ +H+I +V G +G G +DG+ ++S+ P+
Sbjct: 402 CMGVEYADNFLWVADTYNHKIKLVSPSTGNCQTILGDGFAGLQDGQGKNSRFFEPSG--L 459
Query: 327 HKDDDCLYIVDSENHAIRRADM 348
+ LYI D+ NHAIRR D+
Sbjct: 460 SAINSYLYISDTNNHAIRRVDL 481
>gi|218190788|gb|EEC73215.1| hypothetical protein OsI_07299 [Oryza sativa Indica Group]
Length = 1103
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 73/138 (52%), Gaps = 16/138 (11%)
Query: 231 ENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRII 290
+NSS P L+ K + +L P L FPG ++ D NRLF+SDSNH+RI+
Sbjct: 562 QNSSLPLALEKD--KDSRLLASP----------LKFPGKLAIDVLNNRLFISDSNHNRIV 609
Query: 291 VFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMG 349
V + G+ + IGS G DG F+++ RP Y+ + LY+ D+ENHA+R +
Sbjct: 610 VTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALREINF- 668
Query: 350 RRVLETVYPTSGISKKNN 367
V ETV +G K +
Sbjct: 669 --VSETVKTLAGNGTKGS 684
>gi|269837841|ref|YP_003320069.1| alkyl hydroperoxide reductase [Sphaerobacter thermophilus DSM
20745]
gi|269787104|gb|ACZ39247.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Sphaerobacter thermophilus DSM 20745]
Length = 494
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 74/168 (44%), Gaps = 22/168 (13%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG + AD + +RL ++DS HHR+IV +G IGS G DG + P
Sbjct: 177 LAFPGKVLADAATDRLIVADSGHHRLIVARLDGSDAWVIGSGEAGLRDGPGAIAAFHDPQ 236
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
+ D LY+ D+ NH +RR D+ R +ET+ T I W + G R
Sbjct: 237 GLALYG--DTLYVADTRNHVVRRVDLAARTVETIAGTGHI--------GWSYARAGAARA 286
Query: 383 NDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIK 430
D +S PW L+ D L I +W +DL E++
Sbjct: 287 IDLRS----------PWDLVL-HDGVLYIAMAGMHQIWALDLDEAELR 323
>gi|116754580|ref|YP_843698.1| NHL repeat-containing protein [Methanosaeta thermophila PT]
gi|116666031|gb|ABK15058.1| NHL repeat containing protein [Methanosaeta thermophila PT]
Length = 504
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 84/321 (26%), Positives = 137/321 (42%), Gaps = 49/321 (15%)
Query: 113 LAGRFV--DNCDSLIAGCGTVVTFEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEM 170
L GRFV D C VVT +K +++ +P+L VIG +H + + + +
Sbjct: 50 LRGRFVLIDFWTYCCINCMHVVT--DLKMLEERYPELVVIG-VHTAKFENEMRVENIEKA 106
Query: 171 LMKEYITFPILLSNKNFPQMENG----ACYLLSKDFGN--ARVFHENSLDIGMLNKAVEE 224
+M+ I P+++ + G ++L G R E I L +E+
Sbjct: 107 IMRYGIEHPVIVDSDRTLWRAYGIRAWPSFVLIAPDGEILGRTSGEGIFSI--LMPIMEQ 164
Query: 225 LIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDS 284
LI + ++ S G P A + L FPG + + G+ +F++DS
Sbjct: 165 LIPEYEKRGSLHHGKPA-----------PRATHKGVSGALSFPGKVIS--GGDNIFIADS 211
Query: 285 NHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAI 343
N++RI++ +G ++D IGS G+ DG+F ++L RP D +YI D+ NH +
Sbjct: 212 NNNRILIVSPDGDLMDVIGSGERGYSDGDFSEARLFRPQGIAIVG--DVVYIADTGNHMV 269
Query: 344 RRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMK 403
R AD+ RR L + T ++ L E SL PW L+
Sbjct: 270 RAADLRRRTLVRMAGTG--KSRHPGLGGRGAE-----------------VSLSSPWDLVF 310
Query: 404 SEDDNLLIINRSFETLWIMDL 424
+ D+L I +W MDL
Sbjct: 311 VQ-DHLYIAMAGSHQIWRMDL 330
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 89/219 (40%), Gaps = 43/219 (19%)
Query: 224 ELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLL--------LHFPGCISADES 275
+L+ Q + +G W E + EP+A S + L L P ++ D
Sbjct: 307 DLVFVQDHLYIAMAGSHQIWRMDLEGMVEPYAGSGIEGLADGPLEQARLAQPSGLTTD-- 364
Query: 276 GNRLFLSDSNHHRIIVFDGN-----GKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDD 330
GNR++ DS + V DG+ G+ L G DG+F ++L P FY +
Sbjct: 365 GNRIYFVDSESSSLRVIDGDVRTLIGRDLFYFGDI----DGDFGRARLQHPLGLFYK--E 418
Query: 331 DCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKL 390
+Y+ D+ NH I++AD+ +++ T+G GF ++
Sbjct: 419 GSIYVADTYNHRIKKADLSS---GSIHTTAGTGSP------------GFADGPGAQAAFN 463
Query: 391 DPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
+P L F D+L I + + + I D SG++
Sbjct: 464 EPSGLTF-------LGDSLFIADTNNHAVRIYDQRSGDV 495
>gi|408405002|ref|YP_006862985.1| NHL repeat-containing protein [Candidatus Nitrososphaera gargensis
Ga9.2]
gi|408365598|gb|AFU59328.1| NHL repeat-containing protein [Candidatus Nitrososphaera gargensis
Ga9.2]
Length = 501
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 93/205 (45%), Gaps = 26/205 (12%)
Query: 240 KCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGN-GKI 298
+ T A + +K P A S +L +PG +S G L +SDSNH+RI+V D + GKI
Sbjct: 157 QGTLAPEPIEIKRPEAAHS---RVLSYPGKLSLSPDGRMLAISDSNHNRILVVDADSGKI 213
Query: 299 LDCI-GSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVY 357
+ + G DG FE ++ RP + D +Y+ D+ENHA+R D+ R+++T+
Sbjct: 214 IHKVGGGSKDLRDGSFEQARFFRPQGVLWVGYDK-IYVADTENHALREIDLQSRMVKTL- 271
Query: 358 PTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFE 417
N WI + D K +L PW L + +LI
Sbjct: 272 ------AGNGKQGYWI------QSPQDGKVTQLSS-----PWDLTH-DSGFILIAMAGLH 313
Query: 418 TLWIMDLASGEIKE-AVKGFSKVLE 441
+W + +G I A G+ +++
Sbjct: 314 QIWAYHIQTGRIGPFAGSGYENIVD 338
>gi|334116916|ref|ZP_08491008.1| NHL repeat containing protein [Microcoleus vaginatus FGP-2]
gi|333461736|gb|EGK90341.1| NHL repeat containing protein [Microcoleus vaginatus FGP-2]
Length = 546
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/301 (24%), Positives = 125/301 (41%), Gaps = 72/301 (23%)
Query: 145 PQLQVIGFLHGCSTISAVDQTRLVEMLMKEY----------ITFPILLSNK--NFPQMEN 192
P+ V+GF+ G A+D+ LV L+ EY +TF + K + P
Sbjct: 119 PESYVVGFVSGEGNREALDE--LVGKLIAEYKDKNALKFKQLTFSLEKQRKPVSTPLAFP 176
Query: 193 GACYLLSKDFGNARVFHENSLDIGML---NKAVEELIMQQQENSSSPSGLKCTWAKQAEV 249
G + A+ H+ +G + N++V+E ++ NS
Sbjct: 177 GKVLAFAIAATEAKQLHQQGALLGEMSQGNESVDETVLPGPANSV--------------- 221
Query: 250 LKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGF 308
++NL+ G+ LF++DS H+RI+V G++L IGS PG
Sbjct: 222 ---------IQNLV------------GSCLFIADSGHNRIVVSTLEGEVLHVIGSGQPGL 260
Query: 309 EDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNS 368
DG+FE ++ P + + LY+ D+ENHA+R+ D + +ETV +G ++++
Sbjct: 261 TDGDFEEAEFFAPQGMAFDAESQILYVADTENHALRKIDFTTQRVETV---AGTGEQSHE 317
Query: 369 LWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGE 428
+ +S K L PW L + + L I +W M L +G
Sbjct: 318 I--------------SPRSGKGLETQLNSPWDLERV-GNRLFIAMAGSHQIWEMQLDTGT 362
Query: 429 I 429
I
Sbjct: 363 I 363
>gi|298245110|ref|ZP_06968916.1| NHL repeat containing protein [Ktedonobacter racemifer DSM 44963]
gi|297552591|gb|EFH86456.1| NHL repeat containing protein [Ktedonobacter racemifer DSM 44963]
Length = 496
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 87/177 (49%), Gaps = 20/177 (11%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEF 313
+ ++ + L FPG ++ D +RL +SDS HHR++ + G++ IGS G DG F
Sbjct: 166 SAATAPSSFLAFPGKLAVDAQADRLIISDSAHHRLVETNLQGRVRTIIGSGVQGQADGSF 225
Query: 314 ESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWI 373
++ P +D LY+ D++NH IRR D+ + +ET+ +G ++N + +
Sbjct: 226 AEAQFNHPQGVAL--VNDLLYVADTDNHLIRRVDLRTKQVETL---AGTGEQNGMVRTRL 280
Query: 374 MEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIK 430
G R S PW L+ E + L I +++++LA+GEI+
Sbjct: 281 Q---GPARSIALSS----------PWDLVAFE-EKLYIAMAGMHRIFVLNLATGEIE 323
>gi|168702582|ref|ZP_02734859.1| NHL repeat [Gemmata obscuriglobus UQM 2246]
Length = 599
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 53/94 (56%), Gaps = 3/94 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPA 322
L FPG + ADE G RLF++DS HHR++V D G L IG+ PG DG F+ ++ P
Sbjct: 262 LFFPGKVVADEPGRRLFIADSTHHRVVVTDLAGNKLHVIGTGAPGKTDGPFDKAQFDDPQ 321
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
D L++ D +NH +R D+ + + T+
Sbjct: 322 GMAVR--GDTLFVADRKNHLVREIDLKAKTVATI 353
>gi|108707786|gb|ABF95581.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
protein, expressed [Oryza sativa Japonica Group]
Length = 1064
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 64/115 (55%), Gaps = 13/115 (11%)
Query: 231 ENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRII 290
+NSS P L+ K + +L P L FPG ++ D NRLF+SDSNH+RI+
Sbjct: 550 QNSSLPLALEKD--KDSRLLASP----------LKFPGKLAIDVLNNRLFISDSNHNRIV 597
Query: 291 VFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIR 344
V + G+ + IGS G DG F+++ RP Y+ + LY+ D+ENHA+R
Sbjct: 598 VTNLEGEFICQIGSSEEGLLDGTFDTASFNRPQGLAYNSKKNILYVADTENHALR 652
>gi|397779898|ref|YP_006544371.1| NHL repeat-containing protein 2 [Methanoculleus bourgensis MS2]
gi|396938400|emb|CCJ35655.1| NHL repeat-containing protein 2 [Methanoculleus bourgensis MS2]
Length = 483
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 63/253 (24%), Positives = 109/253 (43%), Gaps = 20/253 (7%)
Query: 113 LAGRFVDNCDSLIAGCGTVVTFEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLM 172
LAGR V C A + +++ +++ P+L VI +H S L E +
Sbjct: 27 LAGRIVLLCFGTFACSNCMRMVPELRRLEEDHPELVVI-EVHTPGFESPAVTGNLQEAIR 85
Query: 173 KEYITFPILLSNKNFP----QMENGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQ 228
I +P+++ + + + N ++L G+ G LN ++ +
Sbjct: 86 CAGIDYPVVIDHDHLLWQAFGIRNWPTFVLIDPEGHVLGKTAGEGLYGRLNPRIDRIARD 145
Query: 229 QQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHR 288
+ ++ ++K+ ++ +L+ P I+AD +G RLF+SDS HHR
Sbjct: 146 --------------FEQRGMLVKKRLQFAAAPATVLYHPDKITADHAGMRLFISDSGHHR 191
Query: 289 IIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRAD 347
IIV +G IL IG+ PG DG + P + ++ LY+ D+ NH IR+
Sbjct: 192 IIVASRDGAILGVIGTGAPGNADGSSSEAAFYLPEGLAFDEEAGVLYVADAGNHTIRQVS 251
Query: 348 MGRRVLETVYPTS 360
+ETV T
Sbjct: 252 WPDLRVETVAGTG 264
>gi|12835819|dbj|BAB23375.1| unnamed protein product [Mus musculus]
Length = 748
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 78/176 (44%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D + RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 246 LLFPGKVAVDHATGRLVVADTGHHRILVIQKNGRIQSSIGGPNPGRKDGMFPESSFNSPQ 305
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
D+ +Y+ D+ENH IR+ D+ + TV +G+ G +
Sbjct: 306 GVAIA--DNVIYVADTENHLIRKTDLEAEKVTTV---AGV---------------GIQGT 345
Query: 383 NDTKSEKLDPQSLIFPWHLMKS-------EDDNLLIINRSFETLWIMDLASGEIKE 431
+ E+ D Q + PW + +D L I +W + L SG + +
Sbjct: 346 DTEGGEEGDKQPISSPWDVALGTSGSEVQRNDILWIAMAGTHQIWALLLDSGTLPK 401
>gi|27552887|gb|AAH41104.1| Nhlrc2 protein, partial [Mus musculus]
Length = 522
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 76/176 (43%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D + RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 20 LLFPGKVAVDHATGRLVVADTGHHRILVIQKNGRIQSSIGGPNPGRKDGMFSESSFNSPQ 79
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
D+ +Y+ D+ENH IR+ D+ + TV +G +
Sbjct: 80 GVAIA--DNVIYVADTENHLIRKIDLEAEKVTTV------------------AGVGIQGT 119
Query: 383 NDTKSEKLDPQSLIFPWHLMKS-------EDDNLLIINRSFETLWIMDLASGEIKE 431
+ E+ D Q + PW + +D L I +W + L SG + +
Sbjct: 120 DTEGGEEGDKQPISSPWDVALGTSGSEVQRNDILWIAMAGTHQIWALLLDSGTLPK 175
>gi|256556932|gb|ACU83555.1| thiol-disulfide isomerase/thioredoxin [uncultured bacterium
HF130_AEPn_1]
Length = 497
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 86/170 (50%), Gaps = 22/170 (12%)
Query: 264 LHFPGCISADES---GNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLM 319
L+FP + +S G FL+DS++HR++ D +GK+ IGS G +DG+ + ++
Sbjct: 184 LYFPSKMVYSKSKSLGELFFLTDSSNHRVLAIDLSGKVKMVIGSGKEGNKDGDIKVARFR 243
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
RP + + +D LYI D++NH+I+ D+ + + T+ SG ++ GF
Sbjct: 244 RPHGLAFDEKNDLLYIADTDNHSIKSLDLKSKKVLTL---SGNGER------------GF 288
Query: 380 ERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
+R + K P + PW L + + + LLI LW MDL G +
Sbjct: 289 KRKAENAKAKGHPMA--SPWDL-QLKGEKLLIAMAGTHQLWEMDLKKGTV 335
>gi|157818439|ref|NP_001100914.1| NHL repeat-containing protein 2 [Rattus norvegicus]
gi|149040462|gb|EDL94500.1| NHL repeat containing 2 (predicted) [Rattus norvegicus]
Length = 725
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 78/176 (44%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++AD + RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVAADPTTGRLVVADTGHHRILVLQKNGQIQYSIGGPNPGRKDGVFSESSFHSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
D+ +Y+ D+ENH IR+ D+ + TV +GI G +
Sbjct: 283 GVAIA--DNVIYVADTENHLIRKIDLEAEKVSTV---AGI---------------GVQGT 322
Query: 383 NDTKSEKLDPQSLIFPWHLMKS-------EDDNLLIINRSFETLWIMDLASGEIKE 431
+ E+ D Q + PW + +D L I +W L SG + +
Sbjct: 323 DTEGGEEGDKQPISSPWDVALGTSGSKVQRNDILWIAMAGTHQIWAFLLDSGTLPK 378
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 42/95 (44%), Gaps = 2/95 (2%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDG--EFESSKLMRP 321
L P ++ DE + L+++DS +H+I V D N K+ + D F S P
Sbjct: 474 LQHPLGVAWDEKRHLLYVADSYNHKIKVVDPNTKVCTTLAGTGDANDASSSFAESAFNEP 533
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ LY+ D+ NH I+ D+ R + +
Sbjct: 534 GGLCIGERGRLLYVADTNNHQIKVMDLEARTVSVL 568
>gi|327277470|ref|XP_003223487.1| PREDICTED: NHL repeat-containing protein 2-like [Anolis
carolinensis]
Length = 724
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 79/177 (44%), Gaps = 28/177 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D SG+RL ++D+ HHRI+V G+IL +G G +DG + + P
Sbjct: 219 LLFPGKVTVDSSGDRLVIADTGHHRILVVKKTGQILYTVGGPDSGRKDGTYSEATFNSPQ 278
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ LE V +G+ G +
Sbjct: 279 GVAIK--NNIIYVADTENHLIRKVDLQ---LEMVSTVAGV---------------GIQGT 318
Query: 383 NDTKSEKLDPQSLIFPWHL-------MKSEDDNLLIINRSFETLWIMDLASGEIKEA 432
+ + D Q + PW + EDD L I +W + L G++ +
Sbjct: 319 DKEGGARGDEQPISSPWDVAFGTSVSATHEDDVLWIAMAGIHQIWALMLDCGKLPKG 375
>gi|428304139|ref|YP_007140964.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
gi|428245674|gb|AFZ11454.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
Length = 537
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 77/174 (44%), Gaps = 33/174 (18%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPA 322
L FP + A + N LF++D+ HHR+++ NG+ IG+ P + DG+ E ++ P
Sbjct: 203 LAFPSKVIACQQSNSLFIADTGHHRLVIASLNGETQAVIGTGSPAWVDGDLEIAQFCEPM 262
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
+ + +Y+ D+ NH +R+ D+ R + T+
Sbjct: 263 GMVFDYEQQVIYVADTVNHLLRKIDLKTRQVSTI------------------------AG 298
Query: 383 NDTKSEKLDPQ-------SLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
N T+S L P +L PW L+K + D L I +W+MDLA I
Sbjct: 299 NGTQSRYLFPHGGKALETALNSPWDLVKIK-DKLYITMAGSHQIWMMDLAQETI 351
>gi|29789158|ref|NP_080087.1| NHL repeat-containing protein 2 [Mus musculus]
gi|81898236|sp|Q8BZW8.1|NHLC2_MOUSE RecName: Full=NHL repeat-containing protein 2
gi|26329039|dbj|BAC28258.1| unnamed protein product [Mus musculus]
gi|63146329|gb|AAH95956.1| NHL repeat containing 2 [Mus musculus]
gi|148669816|gb|EDL01763.1| NHL repeat containing 2 [Mus musculus]
gi|187951881|gb|AAI38191.1| NHL repeat containing 2 [Mus musculus]
gi|223459878|gb|AAI38192.1| NHL repeat containing 2 [Mus musculus]
Length = 725
Score = 72.8 bits (177), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 76/176 (43%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D + RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVAVDHATGRLVVADTGHHRILVIQKNGRIQSSIGGPNPGRKDGMFSESSFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
D+ +Y+ D+ENH IR+ D+ + TV +G +
Sbjct: 283 GVAIA--DNVIYVADTENHLIRKIDLEAEKVTTV------------------AGVGIQGT 322
Query: 383 NDTKSEKLDPQSLIFPWHLMKS-------EDDNLLIINRSFETLWIMDLASGEIKE 431
+ E+ D Q + PW + +D L I +W + L SG + +
Sbjct: 323 DTEGGEEGDKQPISSPWDVALGTSGSEVQRNDILWIAMAGTHQIWALLLDSGTLPK 378
>gi|428317077|ref|YP_007114959.1| NHL repeat containing protein [Oscillatoria nigro-viridis PCC 7112]
gi|428240757|gb|AFZ06543.1| NHL repeat containing protein [Oscillatoria nigro-viridis PCC 7112]
Length = 546
Score = 72.4 bits (176), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 76/155 (49%), Gaps = 19/155 (12%)
Query: 276 GNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHKDDDCLY 334
G+ LF++DS H+RI+V G++L IGS PG DG+FE ++ P + + LY
Sbjct: 227 GSCLFIADSGHNRIVVSTLEGEVLHVIGSGKPGLTDGDFEEAEFFAPQGMAFDAESQILY 286
Query: 335 IVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQS 394
+ D+ENHA+R+ D + +ETV +G ++++ + +S K
Sbjct: 287 VADTENHALRKIDFTTQRVETV---AGTGEQSHEI--------------SPRSGKGLETQ 329
Query: 395 LIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
L PW L + + L I +W M L +G I
Sbjct: 330 LNSPWDLERV-GNRLFIAMAGSHQIWEMQLDTGTI 363
>gi|403259474|ref|XP_003922237.1| PREDICTED: NHL repeat-containing protein 2 [Saimiri boliviensis
boliviensis]
Length = 726
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 83/176 (47%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAEKVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ DD L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDVVFGTSGSEVQRDDILWIAMAGTHQIWALLLDSGKLPK 378
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ +++ +
Sbjct: 531 FNEPGGLCIGENGQLLYVADTNNHQIKVMDLETKMVSVL 569
>gi|428225769|ref|YP_007109866.1| NHL repeat containing protein [Geitlerinema sp. PCC 7407]
gi|427985670|gb|AFY66814.1| NHL repeat containing protein [Geitlerinema sp. PCC 7407]
Length = 498
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 124/295 (42%), Gaps = 62/295 (21%)
Query: 146 QLQVIGFLHGCSTISAVDQTRLVEMLMKEY-ITFPILLSN----------KNFPQME--N 192
+LQVIG G S VE + Y I P+++ K +P + +
Sbjct: 61 RLQVIGIHTG--KFSQEQHPASVEQAIARYGIRHPVVVDQNQYLWDQYAIKAWPTLVLID 118
Query: 193 GACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKE 252
A Y++++ G H ++ D + + EL+ + +S +P+ ++
Sbjct: 119 PAGYVVTQRAGEG---HSDTFD-----RQIGELLAGRSPDSGAPA------------PEQ 158
Query: 253 PHACSSVRNLLLHFPG--CISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFE 309
P A L FPG I A N LF++D+ HHR+++ GK+ +GS PG++
Sbjct: 159 PIASP------LAFPGKVAIGAIAQENHLFVADTGHHRLVIATLEGKVRATVGSGTPGYQ 212
Query: 310 DGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSL 369
DG +E+++ P Y D LY+ D+ NH IR R + T+ +G ++N SL
Sbjct: 213 DGPWETAQFRAPQGLTYDPAGDRLYVADTGNHLIRCVHGRSRTVTTL---AGTGQQNRSL 269
Query: 370 WAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDL 424
+ +L+ L PW + + D L I +W++ L
Sbjct: 270 -------------RPQQGRRLE-TPLNSPWDVAL-KGDRLFIAMAGSHQIWVLSL 309
Score = 42.7 bits (99), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 43/89 (48%), Gaps = 4/89 (4%)
Query: 269 CISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGS-CPGFEDGEFESSKLMRPAASFY 326
C+ L+++D+ +H+I + G ++ +GS PG +DG + P S
Sbjct: 396 CLGLAHGPGNLWIADTYNHKIKRLNLAEGHCVNIVGSGLPGHQDGWGPEASFSEP--SGL 453
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLET 355
+ LYI D+ NHAIRR D + + T
Sbjct: 454 ASEGQTLYIADTNNHAIRRWDWATQQVTT 482
>gi|395218326|ref|ZP_10401983.1| alkyl hydroperoxide reductase [Pontibacter sp. BAB1700]
gi|394454578|gb|EJF09204.1| alkyl hydroperoxide reductase [Pontibacter sp. BAB1700]
Length = 474
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 82/168 (48%), Gaps = 23/168 (13%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRP 321
+L FP + +D +GN L+LSDS H+RI+ G++L+ IGS GF +G + + P
Sbjct: 169 VLSFPSKLISDAAGN-LYLSDSGHNRILKLSQQGQVLEVIGSGERGFNNGGYADATFNEP 227
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFER 381
H LY+ D++NHAIR+ D+ R++ T T +L +
Sbjct: 228 HGLALHG--STLYVADAKNHAIRKVDLQHRMVSTAAGTG---------------ELEYYF 270
Query: 382 DNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
+D + E ++P S PW L+ ++ I N + MDL + ++
Sbjct: 271 HDDKRREPVNPNS---PWDLLIY-GRSMFIANAGNHQILRMDLETEQV 314
>gi|428304361|ref|YP_007141186.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
gi|428245896|gb|AFZ11676.1| NHL repeat containing protein [Crinalium epipsammum PCC 9333]
Length = 491
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/213 (24%), Positives = 98/213 (46%), Gaps = 24/213 (11%)
Query: 222 VEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFL 281
++ L+ Q E SS G K + + EP+ ++R L FP + A + N LF+
Sbjct: 144 LDNLVQQLIEESS---GKKTVDGQSGKFTLEPN--QTLRLSPLAFPSKVIACQQSNSLFI 198
Query: 282 SDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSEN 340
+D+ HHR+++ +G+ IG+ G+ DG+ E ++ P ++ + +Y+ D+ N
Sbjct: 199 ADTGHHRLVIASLDGETKAVIGTGSAGWVDGDLEIAQFCEPMGMVFNHEQQVIYVADTVN 258
Query: 341 HAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWH 400
H +R+ D+ R + T+ +G ++ L+ K +L PW
Sbjct: 259 HLLRKIDLKTRQVSTL---AGTGTQSRYLFP--------------HGGKALETALNSPWD 301
Query: 401 LMKSEDDNLLIINRSFETLWIMDLASGEIKEAV 433
L+K + D L I +W+MDL I+ +
Sbjct: 302 LVKIK-DQLYITMAGSHQIWMMDLTEKTIQTLI 333
>gi|149689662|ref|XP_001495693.1| PREDICTED: NHL repeat-containing protein 2 [Equus caballus]
Length = 726
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDRVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGTFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ ++ TV +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAEMVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
+ E+ Q + PW ++ DD L I +W + L SG + +
Sbjct: 326 GGAEGEQ---QPISSPWDVVFGTSGSEVQRDDILWIAMAGTHQIWALLLDSGRLPK 378
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 24/99 (24%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKI---LDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N ++++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLVYVADSYNHKIKVVDPKTKTCTTLAGTGDASNVISSSFTQST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ +++ +
Sbjct: 531 FNEPGGLCIGENGQLLYVADTNNHQIKVMDLETKMISVL 569
>gi|344274719|ref|XP_003409162.1| PREDICTED: NHL repeat-containing protein 2 [Loxodonta africana]
Length = 726
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG + D NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVIVDHVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGTFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ ++ TV +GI + G +++
Sbjct: 283 GVAIM--NNVIYVADTENHLIRKIDLEAEIVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLMKS-------EDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ +D L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDVVLGTSGSEIQRNDILWIAMAGTHQIWALLLDSGKLPK 378
>gi|440789641|gb|ELR10946.1| NHL repeat containing 2 isoform 3, putative [Acanthamoeba
castellanii str. Neff]
Length = 934
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 77/169 (45%), Gaps = 23/169 (13%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS---CPGFEDGEFESSKLMR 320
L FP ++ D G+R+F++DS HHRI+V +G IGS G DG++ +++
Sbjct: 295 LRFPHGVAVDREGDRMFVADSGHHRILVLSLDGAFRTAIGSDDATTGLVDGDYATARFHS 354
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFE 380
P Y +D LY+ DSENH IR R E V +G ++ G+E
Sbjct: 355 PLGLSYAGEDK-LYVADSENHCIRCV---RLAEERVVTVAGTGQR------------GYE 398
Query: 381 RDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
R ++ + SL PW + S +L I +W + S EI
Sbjct: 399 RKGGGRALEW---SLSNPWDVA-SHGHDLYIAMAGTHQIWKYNEQSEEI 443
>gi|296221249|ref|XP_002756652.1| PREDICTED: NHL repeat-containing protein 2 [Callithrix jacchus]
Length = 726
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 83/176 (47%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 283 GVAIM--NNTIYVADTENHLIRKIDLEAEKVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
+ E+ Q + PW ++ DD L I +W + L SG++ +
Sbjct: 326 GGAQGEQ---QPISSPWDVVFGTSGPEVQRDDILWIAMAGTHQIWALLLDSGKLPK 378
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKHCTTLAGTGDTNNVTSSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ +++ +
Sbjct: 531 FNEPGGLCIGENGQLLYVADTNNHQIKVMDLETKMVSVL 569
>gi|301755504|ref|XP_002913612.1| PREDICTED: NHL repeat-containing protein 2-like [Ailuropoda
melanoleuca]
gi|281347739|gb|EFB23323.1| hypothetical protein PANDA_001408 [Ailuropoda melanoleuca]
Length = 726
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 84/176 (47%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDHVSNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGVFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH +R+ D+ ++ TV +GI + G +++
Sbjct: 283 GVAIR--NNIIYVADTENHLVRKIDLEAEMVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM----KSE---DDNLLIINRSFETLWIMDLASGEIKE 431
+ E+ Q + PW ++ SE DD L I +W + L SG + +
Sbjct: 326 GGAQGEE---QPISSPWDVVFGTSGSEIQRDDILWIAMAGTHQIWALLLDSGRLPK 378
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 45/108 (41%), Gaps = 6/108 (5%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCSTLAGTGDASNVITSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRR---VLETVYPTSGI 362
P +D LY+ D+ NH I+ D+ + VL P S +
Sbjct: 531 FNEPGGLCIGEDGQLLYVADTNNHQIKVMDLETKTISVLPVFIPDSAV 578
>gi|351706822|gb|EHB09741.1| NHL repeat-containing protein 2 [Heterocephalus glaber]
Length = 726
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 68/137 (49%), Gaps = 21/137 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D + NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDHATNRLVIADTGHHRILVIWKNGQIQYSIGGPNPGRKDGLFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
D+ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 283 GVAMM--DNVIYVADTENHLIRKIDLEAERVSTV---AGIGVQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPW 399
K E+ Q + PW
Sbjct: 326 GGAKGEQ---QPICSPW 339
>gi|139949007|ref|NP_001077192.1| NHL repeat-containing protein 2 [Bos taurus]
gi|166233893|sp|A4IF69.1|NHLC2_BOVIN RecName: Full=NHL repeat-containing protein 2
gi|134025213|gb|AAI34431.1| NHLRC2 protein [Bos taurus]
gi|296472617|tpg|DAA14732.1| TPA: NHL repeat-containing protein 2 [Bos taurus]
Length = 726
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG I+ D NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKITVDHVSNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESSFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ ++ TV +GI G +
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAEMVSTV---AGI---------------GIQGT 322
Query: 383 NDTKSEKLDPQSLIFPWHLM------KSEDDNLLIINRS-FETLWIMDLASGEIKE 431
+ K D Q + PW ++ + + DN+L I + +W + L G + +
Sbjct: 323 DKEGGAKGDEQPISSPWDVVFGRSGPEVQRDNILWIAMAGTHQIWALLLDCGRLPK 378
Score = 43.1 bits (100), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 46/102 (45%), Gaps = 4/102 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G+ F S
Sbjct: 471 NARLQHPLGVTWDQKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGNASNMIGSSFTDST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
P ++ LY+ D+ NH I+ D+ + + +V+P
Sbjct: 531 FNEPGGLCIGENGQLLYVADTNNHQIKVLDLETKTV-SVFPV 571
>gi|241584846|ref|XP_002403871.1| NHL repeat-containing protein [Ixodes scapularis]
gi|215500274|gb|EEC09768.1| NHL repeat-containing protein [Ixodes scapularis]
Length = 621
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 83/184 (45%), Gaps = 26/184 (14%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
LHFP + A E+ RL ++D+ HHR++V D +G++L +GS PG+ DG ++ P
Sbjct: 238 LHFPAKVFATET--RLVVADTGHHRVLVADHDGRVLYTVGSPEPGWRDGALNDARFNGPQ 295
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
+ +D +Y+ D+ NH +R D+ + + TV T +R
Sbjct: 296 GVVW-RDPHFVYVADTGNHTVREVDLEQAQVRTVAGTG-------------------QRG 335
Query: 383 NDTKSEKLDP-QSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKEAVK-GFSKVL 440
+D + + P Q L PW L E D L I LW + L ++ + G L
Sbjct: 336 SDLEGGRQGPQQPLSSPWDLCLVE-DILFIAMAGSHQLWALALRDSQLFGKLHLGAGTCL 394
Query: 441 EICG 444
I G
Sbjct: 395 RIAG 398
>gi|440911916|gb|ELR61535.1| NHL repeat-containing protein 2, partial [Bos grunniens mutus]
Length = 760
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG I+ D NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 239 LLFPGKITVDHVSNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESSFNSPQ 298
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ ++ TV +GI G +
Sbjct: 299 GVAIM--NNIIYVADTENHLIRKIDLEAEMVSTV---AGI---------------GIQGT 338
Query: 383 NDTKSEKLDPQSLIFPWHLM------KSEDDNLLIINRS-FETLWIMDLASGEIKE 431
+ K D Q + PW ++ + + DN+L I + +W + L G + +
Sbjct: 339 DKEGGAKGDEQPISSPWDVVFGRSGPEVQRDNILWIAMAGTHQIWALLLDCGRLPK 394
Score = 42.7 bits (99), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 46/102 (45%), Gaps = 4/102 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G+ F S
Sbjct: 505 NARLQHPLGVTWDQKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGNASNMIGSSFTDST 564
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
P ++ LY+ D+ NH I+ D+ + + +V+P
Sbjct: 565 FNEPGGLCIGENGQLLYVADTNNHQIKVLDLETKTV-SVFPV 605
>gi|114632912|ref|XP_508046.2| PREDICTED: NHL repeat-containing protein 2 [Pan troglodytes]
gi|410221294|gb|JAA07866.1| NHL repeat containing 2 [Pan troglodytes]
gi|410252040|gb|JAA13987.1| NHL repeat containing 2 [Pan troglodytes]
gi|410252042|gb|JAA13988.1| NHL repeat containing 2 [Pan troglodytes]
gi|410252044|gb|JAA13989.1| NHL repeat containing 2 [Pan troglodytes]
gi|410252046|gb|JAA13990.1| NHL repeat containing 2 [Pan troglodytes]
gi|410292446|gb|JAA24823.1| NHL repeat containing 2 [Pan troglodytes]
gi|410341451|gb|JAA39672.1| NHL repeat containing 2 [Pan troglodytes]
gi|410341453|gb|JAA39673.1| NHL repeat containing 2 [Pan troglodytes]
Length = 726
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAEKVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ D L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDVVFGTSGSEAQRGDILWIAMAGTHQIWALLLDSGKLPK 378
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ +++ +
Sbjct: 531 FNEPGGLCIGENGQLLYVADTNNHQIKVMDLETKMVSVL 569
>gi|397510538|ref|XP_003825652.1| PREDICTED: NHL repeat-containing protein 2 [Pan paniscus]
Length = 726
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAEKVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ D L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDVVFGTSGSEAQRGDILWIAMAGTHQIWALLLDSGKLPK 378
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ +++ +
Sbjct: 531 FNEPGGLCIGENGQLLYVADTNNHQIKVMDLETKMVSVL 569
>gi|432922359|ref|XP_004080313.1| PREDICTED: NHL repeat-containing protein 2-like [Oryzias latipes]
Length = 720
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 71/152 (46%), Gaps = 19/152 (12%)
Query: 244 AKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG 303
A + + K+ ACS +L FPG ++ D+S RL ++D+ HHRI+V G++L IG
Sbjct: 203 ALKIRLYKDSLACS-----ILSFPGKVAIDKSTQRLVIADTGHHRILVVSSAGQLLHVIG 257
Query: 304 S-CPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS-- 360
G +DG + P + D +Y+ D+ENH IR+ D+ + T+
Sbjct: 258 GPASGRKDGSLSEASFNSPQGVAF--KGDAVYVADTENHLIRKIDLSGGRVSTIAGAGVQ 315
Query: 361 ---------GISKKNNSLWAWIMEKLGFERDN 383
G+ + +S W + G E DN
Sbjct: 316 GTDREGGAVGVQQPISSPWDVTLGTAGGEEDN 347
>gi|406832294|ref|ZP_11091888.1| NHL repeat containing protein [Schlesneria paludicola DSM 18645]
Length = 707
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 56/93 (60%), Gaps = 3/93 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPA 322
L +PG I AD + +RLF+SDSNH+R+IV GK++D IGS G +DG + ++ P
Sbjct: 229 LRYPGKILADAANDRLFISDSNHNRLIVCTLTGKLIDVIGSGAIGHKDGGYAVAQFDHPQ 288
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLET 355
D LY+ D+ENH +R D+ ++ + T
Sbjct: 289 GMTLV--GDTLYVADTENHLLRAIDLKQKHVST 319
>gi|389844275|ref|YP_006346355.1| NHL repeat protein [Mesotoga prima MesG1.Ag.4.2]
gi|387859021|gb|AFK07112.1| NHL repeat protein [Mesotoga prima MesG1.Ag.4.2]
Length = 496
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 88/171 (51%), Gaps = 27/171 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGN----GKILDCIGS-CPGFEDGEFESSKL 318
L FPG I+ + G LF+SD+N+ RI++ + + K +D IGS G EDG FE+++L
Sbjct: 171 LSFPGKIAFGD-GENLFISDTNNDRILLTELSTPFVAKTIDQIGSGLSGLEDGPFENARL 229
Query: 319 MRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLG 378
+P Y + LY+ D+ENHA+R ADM +R L T+ SG ++N W +
Sbjct: 230 NKPQGIVY--SNGRLYVADTENHALRIADMNQRCLSTL---SGDGFQDND-WNY------ 277
Query: 379 FERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
+ D +L+ PW L +++ L I +W +DL + I
Sbjct: 278 ---NGDASKARLNS-----PWDL-QTDGRFLYIAMAGMHQIWRLDLENNTI 319
Score = 40.8 bits (94), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 39/84 (46%), Gaps = 6/84 (7%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCP----GFEDGEFESSKLMRPAASFYHKDDDC 332
N L+++DS I D + + G+ DG + S P H +
Sbjct: 351 NSLYVADSEVSAIRFVDLEAGTVQTVAGSGLFSFGYVDGILKRSLFQHPIG--IHGEGRF 408
Query: 333 LYIVDSENHAIRRADMGRRVLETV 356
LY+ D+ NHAIR+ D+G R +ETV
Sbjct: 409 LYVADTYNHAIRKIDLGIRRVETV 432
>gi|297687408|ref|XP_002821208.1| PREDICTED: NHL repeat-containing protein 2 [Pongo abelii]
Length = 726
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAEKVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ D L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDIVFGTSGSEVQRGDILWIAMAGTHQIWALLLDSGKLPK 378
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ +++ +
Sbjct: 531 FNEPGGLCVGENGQLLYVADTNNHQIKVMDLETKMVSVL 569
>gi|383418201|gb|AFH32314.1| NHL repeat-containing protein 2 [Macaca mulatta]
Length = 726
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAERVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ D L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDVVFGTSGSEVQRGDILWIAMAGTHQIWALLLDSGKLPK 378
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ +++ +
Sbjct: 531 FNEPGGLCIGENGQLLYVADTNNHQIKIMDLETKMVSVL 569
>gi|380812570|gb|AFE78159.1| NHL repeat-containing protein 2 [Macaca mulatta]
Length = 726
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAERVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ D L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDVVFGTSGSEVQRGDILWIAMAGTHQIWALLLDSGKLPK 378
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ +++ +
Sbjct: 531 FNEPGGLCIGENGQLLYVADTNNHQIKIMDLETKMVSVL 569
>gi|109090607|ref|XP_001091193.1| PREDICTED: NHL repeat-containing protein 2 [Macaca mulatta]
Length = 726
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAERVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ D L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDVVFGTSGSEVQRGDILWIAMAGTHQIWALLLDSGKLPK 378
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ +++ +
Sbjct: 531 FNEPGGLCIGENGQLLYVADTNNHQIKIMDLETKMVSVL 569
>gi|380016202|ref|XP_003692077.1| PREDICTED: membrane-associated protein Hem-like [Apis florea]
Length = 1439
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 63/111 (56%), Gaps = 5/111 (4%)
Query: 252 EPHACSSVRNLLLHFPGC--ISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGF 308
E H ++V N L FP I +E G L ++D+ ++RI++ D G I IG S P F
Sbjct: 208 ECHLLAAVDNRNLLFPSKLEIFQNEQGENLIIADTGNNRILIIDTKGNIQHVIGGSNPDF 267
Query: 309 EDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
DG+FE+++ P D +Y+ D+ENHAIR+ D+ ++++ TV T
Sbjct: 268 RDGDFENARFNAPQGVCIL--DTFVYVADNENHAIRKIDLIKKIVTTVVGT 316
>gi|355783113|gb|EHH65034.1| hypothetical protein EGM_18374, partial [Macaca fascicularis]
Length = 723
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 220 LLFPGKVTVDQVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 279
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 280 GVAIM--NNIIYVADTENHLIRKIDLEAERVSTV---AGIGIQ------------GTDKE 322
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ D L I +W + L SG++ +
Sbjct: 323 GGAKGEQ---QPISSPWDVVFGTSGSEVQRGDILWIAMAGTHQIWALLLDSGKLPK 375
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 468 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 527
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ +++ +
Sbjct: 528 FNEPGGLCIGENGQLLYVADTNNHQIKIMDLETKMVSVL 566
>gi|307202136|gb|EFN81636.1| NHL repeat-containing protein 2 [Harpegnathos saltator]
Length = 688
Score = 68.9 bits (167), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 65/109 (59%), Gaps = 6/109 (5%)
Query: 254 HACSSVRNLLLHFPG--CISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFED 310
H SS+++ LL FPG I + E G +L +SDS + RI++ + +G+I IG GFED
Sbjct: 210 HLLSSLKDGLL-FPGKLAIFSSEQGVKLIISDSGNSRILITNQHGEIEHIIGGTNQGFED 268
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
G+F+S+K P DD +Y+ D+ NHAIR+ ++ + + T+ T
Sbjct: 269 GDFKSAKFNSPQGVCML--DDVIYVADNNNHAIRKINLSEKSVSTIAGT 315
>gi|332212861|ref|XP_003255539.1| PREDICTED: NHL repeat-containing protein 2 [Nomascus leucogenys]
Length = 726
Score = 68.9 bits (167), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAGKVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ D L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDVVFGTSGSEVQRGDILWIAMAGTHQIWALLLDSGKLPK 378
Score = 42.7 bits (99), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G + F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTNSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ +++ +
Sbjct: 531 FNEPGGLCIGENGQLLYVADTNNHQIKVMDLETKMVSVL 569
>gi|325106982|ref|YP_004268050.1| redoxin [Planctomyces brasiliensis DSM 5305]
gi|324967250|gb|ADY58028.1| Redoxin domain protein [Planctomyces brasiliensis DSM 5305]
Length = 563
Score = 68.9 bits (167), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 79/175 (45%), Gaps = 22/175 (12%)
Query: 256 CSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFE 314
+ N L FPG + AD +G RLF++DSNH+RII+ +G++ IG+ G +DG +E
Sbjct: 226 ADKLENTPLRFPGKVLADAAGKRLFIADSNHNRIIIAGLDGQLQAVIGTGKIGRQDGSYE 285
Query: 315 SSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIM 374
+ P + LY+ D+ENH +R D+ + + T+ T
Sbjct: 286 QASFDHPQGMALVG--NTLYVADTENHLLRTIDLESKQVATLAGTG-------------- 329
Query: 375 EKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
E+ F + +E +L PW L + + L I LW L S EI
Sbjct: 330 EQARFRAGGGSLAET----ALNSPWALQEL-NGTLYIAMAGPHQLWKHQLGSQEI 379
>gi|344257216|gb|EGW13320.1| NHL repeat-containing protein 2 [Cricetulus griseus]
Length = 552
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 77/176 (43%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D + RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 50 LLFPGKVAVDHATGRLVVADTGHHRILVIQKNGQIECSIGGPNPGRKDGMFSESSFNSPQ 109
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +G +
Sbjct: 110 GVAIV--NNVIYVADTENHLIRKIDLEAEKVSTV------------------AGVGVQGT 149
Query: 383 NDTKSEKLDPQSLIFPWHLMKSEDDN-------LLIINRSFETLWIMDLASGEIKE 431
+ E+ + Q + PW ++ D+ L I +W + L SG + +
Sbjct: 150 DTEGGERGEKQPISSPWDVVVGTSDSEVQRNDILWIAMAGTHQIWALLLDSGTLPK 205
>gi|426366237|ref|XP_004050167.1| PREDICTED: NHL repeat-containing protein 2 [Gorilla gorilla
gorilla]
Length = 726
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 283 GVAII--NNIIYVADTENHLIRKIDLEAEKVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ D L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDVVFGTSGSEVQRGDILWIAMAGTHQIWALLLDSGKLPK 378
>gi|426253136|ref|XP_004020256.1| PREDICTED: NHL repeat-containing protein 2 [Ovis aries]
Length = 726
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 81/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG I+ D NRL ++D+ HHRI+V NG+ IG PG +DG F S P
Sbjct: 223 LLFPGKIAVDHVSNRLVIADTGHHRILVVWKNGQTQHSIGGPNPGRKDGIFSESSFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ ++ TV +GI G +
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAEMVSTV---AGI---------------GIQGT 322
Query: 383 NDTKSEKLDPQSLIFPWHLM------KSEDDNLLIINRS-FETLWIMDLASGEIKE 431
+ K D Q + PW ++ + + DN+L I + +W + L G + +
Sbjct: 323 DKEGGAKGDEQPISSPWDVVFGRSGSEVQRDNILWIAMAGTHQIWALLLDCGRLPK 378
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 45/99 (45%), Gaps = 4/99 (4%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSKLMR 320
L P ++ D+ N L+++DS +H+I V D K L G+ F S
Sbjct: 474 LQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGNASNIIGSSFTDSTFNE 533
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
P ++ LY+ D+ NH I+ D+ + + +V+P
Sbjct: 534 PGGLCIGENGQLLYVADTNNHQIKVLDLETKTV-SVFPV 571
>gi|343780954|ref|NP_001230490.1| NHL repeat-containing protein 2 [Sus scrofa]
Length = 725
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 80/176 (45%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDHVTNRLVIADTGHHRILVVWKNGQIQYTIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ ++ TV +GI + G +++
Sbjct: 283 GVAIR--NNIIYVADTENHLIRKIDLEAEMVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
E+ Q + PW ++ DD L I +W + L G + +
Sbjct: 326 GGANGEE---QPISSPWDVVFGRSGSEAPGDDILWIAMAGTHQIWALLLDCGRLPK 378
Score = 44.3 bits (103), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 45/106 (42%), Gaps = 17/106 (16%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFD----------GNGKILDCIGSCPGFED 310
N L P ++ DE N L+++DS +H+I V D G G D IGS
Sbjct: 471 NAKLQHPLGVTWDEKRNLLYVADSYNHKIKVVDPKTKSCTTLAGTGDAGDTIGS------ 524
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
F S P ++ LY+ D+ NH I+ D+ + + +
Sbjct: 525 -SFTKSTFNEPGGLCIGENGQLLYVADTNNHQIKVMDLETKTVSVL 569
>gi|57107627|ref|XP_544027.1| PREDICTED: NHL repeat-containing protein 2 isoform 1 [Canis lupus
familiaris]
Length = 726
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 53/94 (56%), Gaps = 3/94 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDHVSNRLVIADTGHHRILVVWKNGQIQYIIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
++ +Y+ D+ENH IR+ D+ ++ TV
Sbjct: 283 GVAIR--NNIIYVADTENHLIRKIDLEAEMVSTV 314
>gi|354501729|ref|XP_003512941.1| PREDICTED: NHL repeat-containing protein 2 [Cricetulus griseus]
Length = 699
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 77/176 (43%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D + RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 197 LLFPGKVAVDHATGRLVVADTGHHRILVIQKNGQIECSIGGPNPGRKDGMFSESSFNSPQ 256
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +G +
Sbjct: 257 GVAIV--NNVIYVADTENHLIRKIDLEAEKVSTV------------------AGVGVQGT 296
Query: 383 NDTKSEKLDPQSLIFPWHLMKSEDDN-------LLIINRSFETLWIMDLASGEIKE 431
+ E+ + Q + PW ++ D+ L I +W + L SG + +
Sbjct: 297 DTEGGERGEKQPISSPWDVVVGTSDSEVQRNDILWIAMAGTHQIWALLLDSGTLPK 352
>gi|348508723|ref|XP_003441903.1| PREDICTED: NHL repeat-containing protein 2-like [Oreochromis
niloticus]
Length = 719
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 81/188 (43%), Gaps = 31/188 (16%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRP 321
+L FPG ++ D S RL ++D+ HHRI+V G +L IG G +DG+ + P
Sbjct: 217 ILSFPGKVAIDNSTKRLVIADTGHHRILVVSSTGDLLHVIGGPESGRKDGDVSEASFNSP 276
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFER 381
D +Y+ D+ENH IR+ D+ + T+ +G+ +
Sbjct: 277 QGVAIK--GDTVYVADTENHLIRKIDLLEGKVSTL---AGVGTQG--------------- 316
Query: 382 DNDTKSEKLDPQSLI-FPWHLM-----KSEDDNLLIINRSFETLWIMDLASGEIKEAVKG 435
D + + PQ I PW +M ED+ L + +W + LA G++ KG
Sbjct: 317 -TDKEGGAMGPQQPISSPWDVMLGTASGDEDNVLWVAMAGTHQIWALFLADGKLP---KG 372
Query: 436 FSKVLEIC 443
IC
Sbjct: 373 SESKAGIC 380
>gi|328783461|ref|XP_396242.3| PREDICTED: NHL repeat-containing protein 2 [Apis mellifera]
Length = 699
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 63/111 (56%), Gaps = 5/111 (4%)
Query: 252 EPHACSSVRNLLLHFPGC--ISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGF 308
E H ++V N L FP I +E G L ++D+ ++RI++ D G + IG S P F
Sbjct: 214 ECHLLATVDNKNLLFPSKLEIFQNEQGENLIIADTGNNRILIVDTKGNVQHVIGGSNPDF 273
Query: 309 EDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
DG+FE+++ P D +Y+ D+ENHAIR+ D+ ++++ TV T
Sbjct: 274 RDGDFENARFNAPQGVCIL--DTFIYVADNENHAIRKIDLIKKMVTTVVGT 322
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 29/51 (56%)
Query: 307 GFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVY 357
G DG ++KL P +H D+ +YI D+ NH I++ D+ + +T+Y
Sbjct: 466 GDSDGIKYAAKLQHPLGITWHSKDNAVYITDTYNHKIKKIDVTTQNCKTIY 516
>gi|395828052|ref|XP_003787200.1| PREDICTED: NHL repeat-containing protein 2 [Otolemur garnettii]
Length = 726
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 80/176 (45%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG I+ D RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKITVDHVTKRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G + +
Sbjct: 283 GVAIK--NNIIYVADTENHLIRKIDLEAEKVSTV---AGIGIQ------------GTDTE 325
Query: 383 NDTKSEKLDPQSLIFPWHLMKS-------EDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ DD L I +W + L SG++ +
Sbjct: 326 GGAKGEE---QPINSPWDVILGTSDSEVQRDDILWIAMAGTHQIWALLLDSGKLPK 378
>gi|119569872|gb|EAW49487.1| NHL repeat containing 2, isoform CRA_a [Homo sapiens]
Length = 683
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 84/174 (48%), Gaps = 26/174 (14%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ +RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTDRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + T+ +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAEKVSTI---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM--KSE---DDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ SE D L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDVVFGTSEVQRGDILWIAMAGTHQIWALLLDSGKLPK 376
>gi|326672461|ref|XP_003199671.1| PREDICTED: NHL repeat-containing protein 2-like [Danio rerio]
Length = 719
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 77/174 (44%), Gaps = 25/174 (14%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRP 321
+L FPG I+ D SG +L ++D+ HHR++V G++L +G G DG ++ P
Sbjct: 217 ILSFPGKIAIDPSGKQLAIADTGHHRVLVVSNTGQVLHTVGGPSSGKRDGNLSEAQFSSP 276
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFER 381
F D +Y+ D+ENH +R+ ++ + T+ +GI +
Sbjct: 277 QGLFI--KGDTVYVADTENHLVRKINLTEGKVSTL---AGIGVQG--------------- 316
Query: 382 DNDTKSEKLDPQSLI-FPWH--LMKSEDDNLLIINRSFETLWIMDLASGEIKEA 432
D + L PQ I PW L + D L I +W + L G++ +
Sbjct: 317 -TDKEGGALGPQQPISSPWDVTLGSAGGDVLWIAMAGTHQIWALFLEDGKLPKG 369
>gi|431895427|gb|ELK04943.1| NHL repeat-containing protein 2 [Pteropus alecto]
Length = 726
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 80/176 (45%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDHVSKRLVIADTGHHRILVIWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ ++ T+ +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAEMVSTI---AGIGIQ------------GSDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ DD L I +W + L G + +
Sbjct: 326 GGAKGEQ---QPISSPWDVVIGTSDSEVQRDDILWIAMAGTHQIWALLLDCGRLPK 378
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 42/99 (42%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKHCTTLAGTGDASNVIGSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ + + +
Sbjct: 531 FNEPGGLCIGENGQLLYVADTNNHQIKVMDLETKTISVL 569
>gi|443708985|gb|ELU03866.1| hypothetical protein CAPTEDRAFT_52631, partial [Capitella teleta]
Length = 507
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 54/105 (51%), Gaps = 5/105 (4%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPAAS 324
FP I D G RL +SD+ HRI+V NG + IG G DG F SS+ P
Sbjct: 161 FPSKICIDAKGERLVMSDAVRHRIVVASMNGMVQHVIGGRGHGLTDGSFISSQFFGPQGV 220
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSL 369
+H DD + + D+ENHAIR+ D+ V TV +G K+ L
Sbjct: 221 CFHG-DDLIIVADTENHAIRKIDL---VAGTVRTIAGNGKQGTDL 261
>gi|417404183|gb|JAA48864.1| Putative nhl repeat-containing protein 2 [Desmodus rotundus]
Length = 725
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 80/176 (45%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDHVSKRLVIADTGHHRILVVWKNGQIQYSIGGPDPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ ++ TV +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAEMVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ DD L + +W + L G + +
Sbjct: 326 GGAKGEE---QPISSPWDVVWGSSDSEVQRDDILWVAMAGTHQIWALLLDCGRLPK 378
>gi|170066897|ref|XP_001868267.1| NHL repeat containing 2 [Culex quinquefasciatus]
gi|167863075|gb|EDS26458.1| NHL repeat containing 2 [Culex quinquefasciatus]
Length = 734
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 63/111 (56%), Gaps = 10/111 (9%)
Query: 255 ACSSVRNLLLHFPGCIS-------ADESGNRLF-LSDSNHHRIIVFDGNGKILDCIGSCP 306
A S V L FPG I+ +D + N L+ +SDS +HRI+V + G +L IG
Sbjct: 213 ASSQVETSHLQFPGKIACSYREGISDGNDNPLYAISDSGNHRILVVNSEGTVLHRIGGKK 272
Query: 307 -GFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
GF DG+F ++ P + ++DD +++ D+ENHAIRR D+ + + TV
Sbjct: 273 SGFVDGDFRKARFNAPQGLAF-QNDDVIFVADNENHAIRRIDLKSKQVTTV 322
Score = 43.1 bits (100), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 44/98 (44%), Gaps = 9/98 (9%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGS--------CPGFEDGEFESSK 317
P ++ + L+L+DS I +GK+L G G DG+ ++K
Sbjct: 430 PSGLALNRDAKELYLADSESSSIRKMSLADGKVLAVAGGDRNPLDLFSFGDIDGKLYAAK 489
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLET 355
L P Y+ D+C+Y+ D+ NH I++ + V T
Sbjct: 490 LQHPLGVAYNARDNCIYVADTYNHKIKKINASTNVATT 527
>gi|124003658|ref|ZP_01688507.1| NHL repeat [Microscilla marina ATCC 23134]
gi|123991227|gb|EAY30679.1| NHL repeat [Microscilla marina ATCC 23134]
Length = 508
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 62/109 (56%), Gaps = 4/109 (3%)
Query: 249 VLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPG 307
+L++ + ++ L FP I + G L++SDS ++RI+ D NGK+L IGS G
Sbjct: 192 MLEKDQQGNKTKHTFLRFPSKIIQGDKG-ELYISDSGNNRILKIDKNGKVLLQIGSGTQG 250
Query: 308 FEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+DG+F+ + P D LY+ D++N+ IRR D+ R+ ++T+
Sbjct: 251 LKDGDFDKATFYEPHGLALK--GDFLYVADTKNNVIRRVDLKRKKVKTI 297
>gi|326672463|ref|XP_003199672.1| PREDICTED: NHL repeat-containing protein 2-like [Danio rerio]
Length = 719
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 77/174 (44%), Gaps = 25/174 (14%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRP 321
+L FPG I+ D SG +L ++D+ HHR++V G++L +G G DG ++ P
Sbjct: 217 ILSFPGKIAIDPSGKQLAIADTGHHRVLVVSNTGQVLHTVGGPSSGKRDGNLSEAQFSSP 276
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFER 381
F D +Y+ D+ENH +R+ ++ + T+ +GI +
Sbjct: 277 QGLFI--KGDTVYVADTENHLVRKINLTEGKVSTL---AGIGVQG--------------- 316
Query: 382 DNDTKSEKLDPQSLI-FPWH--LMKSEDDNLLIINRSFETLWIMDLASGEIKEA 432
D + L PQ I PW L + D L I +W + L G++ +
Sbjct: 317 -TDKEGGALGPQQPISSPWDVTLGSAGGDVLWIAMAGTHQIWALFLEDGKLPKG 369
>gi|42476013|ref|NP_940916.2| NHL repeat-containing protein 2 [Homo sapiens]
gi|74762548|sp|Q8NBF2.1|NHLC2_HUMAN RecName: Full=NHL repeat-containing protein 2
gi|21748831|dbj|BAC03493.1| unnamed protein product [Homo sapiens]
Length = 726
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ +RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTDRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + TV +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAEKVSTV---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ D L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDVVFGTSGSEVQRGDILWIAMAGTHQIWALLLDSGKLPK 378
Score = 42.7 bits (99), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ + LY+ D+ NH I+ D+ +++ +
Sbjct: 531 FNEPGGLCIGENGELLYVADTNNHQIKVMDLETKMVSVL 569
>gi|126273410|ref|XP_001377730.1| PREDICTED: NHL repeat-containing protein 2 [Monodelphis domestica]
Length = 777
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 81/174 (46%), Gaps = 28/174 (16%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D +RL ++D+ HHRI+V NG+I IG G +DG F S P
Sbjct: 273 LLFPGKVTVDNLSDRLVIADTGHHRILVVRKNGQIQHSIGGPNSGSKDGTFLESTFNSPQ 332
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ ++ TV +GI + G +++
Sbjct: 333 G--VSIKNNIIYVADTENHLIRKIDLETEMVSTV---AGIGIQ------------GTDKE 375
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEI 429
K E+ Q + PW ++ ++D L I +W + L G++
Sbjct: 376 GGAKGEE---QPISSPWDVVFGTSGSQVQQNDVLWIAMAGIHQIWALMLEDGKL 426
>gi|348578653|ref|XP_003475097.1| PREDICTED: NHL repeat-containing protein 2-like [Cavia porcellus]
Length = 726
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 67/137 (48%), Gaps = 21/137 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG IS D + N L ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKISVDHATNTLVMADTGHHRILVIWENGQIQYSIGGPNPGRKDGLFSESAFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
D+ +++ D+ENH IR+ D+ + E V +GI + G +++
Sbjct: 283 GVAMM--DNIIFVADTENHLIRKIDL---IAEKVSTVAGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPW 399
K E+ Q + PW
Sbjct: 326 GGAKGEE---QPISSPW 339
>gi|119569873|gb|EAW49488.1| NHL repeat containing 2, isoform CRA_b [Homo sapiens]
Length = 726
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ +RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTDRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ + T+ +GI + G +++
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKIDLEAEKVSTI---AGIGIQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM-------KSEDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ D L I +W + L SG++ +
Sbjct: 326 GGAKGEQ---QPISSPWDVVFGTSGSEVQRGDILWIAMAGTHQIWALLLDSGKLPK 378
Score = 42.7 bits (99), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 471 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 530
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ + LY+ D+ NH I+ D+ +++ +
Sbjct: 531 FNEPGGLCIGENGELLYVADTNNHQIKVMDLETKMVSVL 569
>gi|345482264|ref|XP_001607897.2| PREDICTED: NHL repeat-containing protein 2-like [Nasonia
vitripennis]
Length = 768
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 62/99 (62%), Gaps = 5/99 (5%)
Query: 264 LHFPGCISA--DESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMR 320
L FPG I + ++G +L +SD+ ++RI++ +GK+ +G PGF+DG+FE+++
Sbjct: 299 LLFPGKIHSFKTDAGEKLVVSDTGNNRILIMGKDGKVEHVVGGYSPGFKDGDFETARFNA 358
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
P KDD +++ D+ENHAIR+ ++ + + ++ T
Sbjct: 359 PQGVCVLKDD--VFVADNENHAIRKINLKTKTVISIAGT 395
Score = 43.5 bits (101), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 49/105 (46%), Gaps = 17/105 (16%)
Query: 253 PHACSSVRNLLLHFPGCISADESGNRLFLSDSNHH--RIIVFDGNGKILDCIGS------ 304
PHA S L P ++ + +F +DS R + D +GK+ G+
Sbjct: 482 PHAAS------LAQPSGLAVSQELKSVFFADSESSTVRRVCLD-DGKVFAVCGADNDPSN 534
Query: 305 --CPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRAD 347
C G +DG+ S+KL P ++K D +YI D+ NH I++ D
Sbjct: 535 LHCFGDQDGKKYSAKLQHPLGVAWNKFDQMVYIADTYNHKIKKTD 579
>gi|355707227|gb|AES02894.1| NHL repeat containing 2 [Mustela putorius furo]
Length = 498
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 53/94 (56%), Gaps = 3/94 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D +RL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 148 LLFPGKVTVDHVSHRLVIADTGHHRILVVRKNGEIQYSIGGPNPGRKDGIFSESTFNSPQ 207
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
++ +Y+ D+ENH IR+ D+ ++ TV
Sbjct: 208 GVAIR--NNIIYVADTENHLIRKIDLEAEMVSTV 239
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 44/103 (42%), Gaps = 3/103 (2%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 396 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCSTLAGTGDASNVTTSSFTEST 455
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS 360
P ++D LY+ D+ NH I+ D+ + + TS
Sbjct: 456 FNEPGGLCIGENDQLLYVADTNNHQIKVMDLETKTISVDLLTS 498
>gi|72094692|ref|XP_795849.1| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
purpuratus]
Length = 706
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 80/173 (46%), Gaps = 23/173 (13%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L +PG ++ + G L +SD+ H+R+I+ +G + CIG GF DG + ++ P
Sbjct: 209 LLYPGKVTTNPEGTLLAVSDTGHNRVIIVALDGVVQHCIGGPETGFNDGLYREARFHSPQ 268
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
+ + D +Y+ D+ENHAIR+ D+ + + T+ +G ++ W G E
Sbjct: 269 GLCWAQ--DVIYVADTENHAIRKIDLKEQCVTTI---AGTGEQGVD---WYGAGRGTE-- 318
Query: 383 NDTKSEKLDPQSLIFPWHLM--KSEDDNLLIINRSFETLWIMDLASGEIKEAV 433
Q + PW ++ +D L I LW + L+ G +AV
Sbjct: 319 ----------QVISSPWDVVLGPPNEDVLFIAMAGTHQLWGLFLSDGHWLKAV 361
>gi|390369862|ref|XP_798415.2| PREDICTED: NHL repeat-containing protein 2-like [Strongylocentrotus
purpuratus]
Length = 322
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 81/173 (46%), Gaps = 23/173 (13%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L +PG ++ + G L +SD+ H+R+I+ +G + CIG GF DG ++ ++ P
Sbjct: 170 LLYPGKVTTNPEGTLLAVSDTGHNRVIIVALDGVVQHCIGGPETGFNDGLYQEARFHSPQ 229
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
+ + D +Y+ D+ENHAIR+ D+ + + T+ +G ++ W G E
Sbjct: 230 GLCWAQ--DVIYVADTENHAIRKIDLKEKRVTTI---AGTGEQGVD---WYGAGRGTE-- 279
Query: 383 NDTKSEKLDPQSLIFPWHLM--KSEDDNLLIINRSFETLWIMDLASGEIKEAV 433
Q + PW ++ +D L I LW + L+ G +AV
Sbjct: 280 ----------QVISSPWDVVLGPPNEDVLFIAMAGTHQLWGLFLSDGHWLKAV 322
>gi|312372811|gb|EFR20689.1| hypothetical protein AND_19679 [Anopheles darlingi]
Length = 745
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 92/212 (43%), Gaps = 34/212 (16%)
Query: 264 LHFPGCISADESGNRLF--LSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMR 320
L FPG + E G +SDS +HRI++F+ +G + IG GF DG F ++
Sbjct: 223 LKFPGKVVCCEGGEEELYAVSDSGNHRILIFEPSGTVRYTIGGKSSGFVDGGFREARFNA 282
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFE 380
P + D+ LY+ D+ENHAIRR D+ R + TV G +
Sbjct: 283 PQGVAFRGGDE-LYVADNENHAIRRIDLRTRTVSTVAGN------------------GVQ 323
Query: 381 RDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKEAVKGFSKVL 440
++ T + Q+L PW D + R + + +D + +K+ + ++
Sbjct: 324 GNDRTGGKTGREQALSSPW-------DVAVYTTRDLDMSFHVDESKAPLKDVL-----LI 371
Query: 441 EICGVLVMEKVFLLKQMPQDWLLHQIDSSCSL 472
I G+ + +FL + + + S C++
Sbjct: 372 AIAGIHQIWAIFLEDTIWWKFKKYTAGSCCAI 403
>gi|449670570|ref|XP_002158620.2| PREDICTED: NHL repeat-containing protein 2-like [Hydra
magnipapillata]
Length = 629
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 61/101 (60%), Gaps = 3/101 (2%)
Query: 257 SSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFES 315
SS++ L +PG ++A G L +SDS HH++++ + NG ++ IG+ PGF+DGE
Sbjct: 203 SSLKFKNLRYPGKLTASPCGKMLAVSDSGHHQVLIIERNGYVVKRIGTGFPGFKDGEAHV 262
Query: 316 SKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
S+ P + ++ +++ D+ENH+IR ++ + +T+
Sbjct: 263 SQFSSPQGLSW--IENVIFVADTENHSIREINVANWITKTI 301
>gi|345324001|ref|XP_001513492.2| PREDICTED: NHL repeat-containing protein 2-like [Ornithorhynchus
anatinus]
Length = 725
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 80/176 (45%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D + RL ++D+ HHRI+V NG+I IG G +DG F S P
Sbjct: 223 LLFPGKVAVDRATGRLVIADTGHHRILVVKKNGQIQYSIGGPNSGRKDGTFSESAFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ +E V +G+ + G +++
Sbjct: 283 GVAIK--NNIIYVADTENHLIRKIDL---EVEQVSTVAGVGVQ------------GTDKE 325
Query: 383 NDTKSEKLDPQSLIFPWHLM----KSEDDN---LLIINRSFETLWIMDLASGEIKE 431
K E Q + PW + SED+ L I +W + L SG + +
Sbjct: 326 GGAKGEA---QPISSPWDVAFGTSGSEDNRDYILWIAMAGTHQIWALLLDSGTLPK 378
Score = 46.2 bits (108), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 49/108 (45%), Gaps = 18/108 (16%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFD----------GNGKILDCIGSCPGFED 310
N L P ++ DE GN L+++DS +H+I V D G G+ + IGS
Sbjct: 471 NAKLQHPLGVTWDEKGNLLYVADSYNHKIKVVDPKTKNCATLAGTGEASNVIGS------ 524
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYP 358
+ P +D LY+ D+ NH I+ D+ + + +V+P
Sbjct: 525 -SLTQTTFNEPGGLCIGEDGHLLYVADTNNHQIKVLDLEAKTV-SVFP 570
>gi|322788140|gb|EFZ13922.1| hypothetical protein SINV_02735 [Solenopsis invicta]
Length = 717
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 65/109 (59%), Gaps = 6/109 (5%)
Query: 254 HACSSVRNLLLHFPGCISA--DESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFED 310
H S ++ LL FPG ++ E G +L +SDS ++RI++ + +G++ IG C GF+D
Sbjct: 239 HLLPSSKDGLL-FPGKLTVLQSEQGTKLVISDSGNNRIVITNEHGRVEHVIGGCSQGFKD 297
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
G+F++++ P ++ +YI D+ NHAIR+ ++ + + T+ T
Sbjct: 298 GDFKNARFNSPQGVC--ALNNTIYIADNNNHAIRKINLSEKTVSTIAGT 344
>gi|452822384|gb|EME29404.1| haloacid dehalogenase-like hydrolase family protein [Galdieria
sulphuraria]
Length = 679
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 52/103 (50%), Gaps = 13/103 (12%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-------------GFEDGEF 313
PG I+ D RLF+SDS HHRI++ G+ LD IG G++DG F
Sbjct: 332 PGKIALDLKNGRLFISDSGHHRILIVTMEGQFLDQIGGLRVEDDFILSEKESMGWKDGSF 391
Query: 314 ESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
E ++ P Y+++ D L I D N AIR A + +R + T
Sbjct: 392 EETRFRNPQGMIYNEEYDELVIADMWNDAIRIAHLKQRQVRTT 434
>gi|332027687|gb|EGI67755.1| NHL repeat-containing protein 2 [Acromyrmex echinatior]
Length = 686
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 66/110 (60%), Gaps = 6/110 (5%)
Query: 254 HACSSVRNLLLHFPGCISAD--ESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFED 310
H S ++ LL FPG ++ E +L +SDS ++RI++ + +G++ IG C GF+D
Sbjct: 208 HLLPSSKDGLL-FPGKLTVLQLEQETKLVISDSGNNRIVITNEHGRVEHVIGGCNQGFKD 266
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS 360
G+F++++ P ++ +Y+ D+ NHAIR+ D+ +R++ T+ T
Sbjct: 267 GDFKNARFNSPQGVCVL--NNIIYVADNNNHAIRKIDLAKRIVSTIAGTG 314
>gi|452825888|gb|EME32883.1| hypothetical protein Gasu_02330 [Galdieria sulphuraria]
Length = 631
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 53/109 (48%), Gaps = 3/109 (2%)
Query: 251 KEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFE 309
+ P V + FP + +SD+ HHRI+V + NGKIL IG F
Sbjct: 182 RGPRVLLQVPQTIFRFPRDAFIHPDKKWILVSDTFHHRIVVLETNGKILGTIGCGVAWFR 241
Query: 310 DGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYP 358
DG+F S +P+ + + D +YI D+EN +R AD + T+YP
Sbjct: 242 DGDFSSCAFHQPSGMAW--NGDAIYIADTENGCLRMADWKAGHVSTLYP 288
>gi|395502099|ref|XP_003755423.1| PREDICTED: NHL repeat-containing protein 2 [Sarcophilus harrisii]
Length = 1044
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 82/176 (46%), Gaps = 28/176 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ + L ++D+ HHRI+V NG+I IG G +DG F S P
Sbjct: 543 LLFPGKVAIDDVTDSLVIADTGHHRILVVRKNGQIQHSIGGPNSGRKDGTFLESTFNSPQ 602
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
++ +Y+ D+ENH IR+ D+ +++ TV +GI + G +++
Sbjct: 603 GVAI--KNNIIYVADTENHLIRKIDLETKMVSTV---AGIGIQ------------GTDKE 645
Query: 383 NDTKSEKLDPQSLIFPWHLMKS-------EDDNLLIINRSFETLWIMDLASGEIKE 431
K E+ Q + PW ++ +DD L I +W + L G + +
Sbjct: 646 GGAKGEE---QPISSPWDVVFGISVSEICKDDILWIAMAGIHQIWALLLDDGRLPK 698
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 44/99 (44%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCI---GSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K I G S
Sbjct: 791 NAKLQHPLGVAWDKKRNLLYVADSYNHKIKVVDPKTKSCTTIAGTGEASNVITSTSTEST 850
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P +D++ LYI D+ NH I+ D+ +++ +
Sbjct: 851 FNEPGGLCIGEDNNLLYIADTNNHQIKVMDLETKIISVL 889
>gi|156406833|ref|XP_001641249.1| predicted protein [Nematostella vectensis]
gi|156228387|gb|EDO49186.1| predicted protein [Nematostella vectensis]
Length = 708
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 57/109 (52%), Gaps = 6/109 (5%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS----CPGFED 310
A S+ L +PG + D +G RL ++D+ HHR+IV G + + IG GF+D
Sbjct: 191 AKESIPKSKLLYPGKVCLDGAGRRLVIADTGHHRVIVCSTEGVVHEVIGGEDGFSAGFQD 250
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
G F+ +K P + +Y+ D+ENHAIR ++ + + TV T
Sbjct: 251 GTFKEAKFHAPQGVAML--GEVIYVADTENHAIREINLDSKKVTTVAGT 297
>gi|430743901|ref|YP_007203030.1| thiol-disulfide isomerase-like thioredoxin [Singulisphaera
acidiphila DSM 18658]
gi|430015621|gb|AGA27335.1| thiol-disulfide isomerase-like thioredoxin [Singulisphaera
acidiphila DSM 18658]
Length = 696
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 56/98 (57%), Gaps = 3/98 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPA 322
L +PG + ADE +LF++D+ H+RI++ D +G+ +GS G DG++ ++ RP
Sbjct: 226 LLYPGKVLADEKSKQLFIADTAHNRIVLTDLDGRKSVVVGSGGIGMVDGDYAKAEFNRPQ 285
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS 360
DD LY+ D+ENHAIR + + + TV T
Sbjct: 286 GLCLV--DDTLYVADTENHAIRAIHLKTKQVSTVAGTG 321
>gi|307182571|gb|EFN69764.1| NHL repeat-containing protein 2 [Camponotus floridanus]
Length = 687
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 63/109 (57%), Gaps = 6/109 (5%)
Query: 254 HACSSVRNLLLHFPG--CISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFED 310
H SS +N LL FPG I + E G +L +SDS ++RI++ +G++ IG C GF+D
Sbjct: 210 HLLSSFKNGLL-FPGKLAILSSEHGTKLIISDSGNNRIVIATKHGEVEHFIGGCNQGFKD 268
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
G F++++ P ++ +Y+ D+ NHAIR+ ++ + T+ T
Sbjct: 269 GSFKNARFNSPQGVCVL--NNTIYVADNNNHAIRKINLTENNVSTIAGT 315
>gi|296124331|ref|YP_003632109.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
gi|296016671|gb|ADG69910.1| NHL repeat containing protein [Planctomyces limnophilus DSM 3776]
Length = 581
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 51/86 (59%), Gaps = 3/86 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPA 322
L FPG + D + R+F+SDSNH+RI+V G++L IGS G +DG ES++ P
Sbjct: 247 LRFPGKLLVDPAHERVFISDSNHNRIVVASLAGQLLKVIGSGKIGAKDGPAESAQFDHPQ 306
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADM 348
D + LY+ D+ENH +R ++
Sbjct: 307 GMAL--DGNTLYVADTENHLLRTVNL 330
>gi|157112324|ref|XP_001657496.1| hypothetical protein AaeL_AAEL000965 [Aedes aegypti]
gi|108883769|gb|EAT47994.1| AAEL000965-PA [Aedes aegypti]
Length = 812
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 60/100 (60%), Gaps = 8/100 (8%)
Query: 264 LHFPGCISA-----DESGNRLF-LSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESS 316
L FPG IS+ D + N L+ +SDS +HRI+V + +G IL IG GF DG+F +
Sbjct: 305 LQFPGKISSSYRGGDNTDNCLYAISDSGNHRILVVNSDGLILYRIGGKRAGFVDGDFRKA 364
Query: 317 KLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ P + ++D L++ D+ENHAIR+ D+ + + TV
Sbjct: 365 RFNAPQGVAF-QNDTTLFVADNENHAIRKIDLKSQQVTTV 403
Score = 40.4 bits (93), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 24/98 (24%), Positives = 44/98 (44%), Gaps = 9/98 (9%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGS--------CPGFEDGEFESSK 317
P ++ ++ ++L+DS + +GK+L G G DG+ S+K
Sbjct: 511 PSGLAINKDAKEIYLADSESSSVRKMSLSDGKVLAVAGGDRNPLDLFSFGDIDGKQYSAK 570
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLET 355
P Y+ D+C+Y+ D+ NH I++ + V T
Sbjct: 571 FQHPLGVAYNSKDNCIYVADTYNHKIKKINASTNVATT 608
>gi|350403739|ref|XP_003486886.1| PREDICTED: NHL repeat-containing protein 2-like [Bombus impatiens]
Length = 696
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 78/163 (47%), Gaps = 28/163 (17%)
Query: 270 ISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHK 328
I ++ G L ++D+ H+RI+V D G + IG P F+DG+FE++K P
Sbjct: 231 ILQNDQGESLVIADTGHNRILVTDTTGNVQHIIGGPNPDFKDGDFETAKFNAPQG--VST 288
Query: 329 DDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSE 388
+Y+ D++NHAIR+ D+ RR++ T+ T GI + ++ K+G +
Sbjct: 289 LGSLIYVADTKNHAIRKIDLMRRIVITIAGT-GIKGHD-----YVGGKIGKD-------- 334
Query: 389 KLDPQSLIFPWHL--MKSEDDN-----LLIINRSFETLWIMDL 424
Q L PW L K E DN LLI +W + L
Sbjct: 335 ----QVLSSPWDLAVYKHEHDNNITPILLIAMAGNHQIWALFL 373
>gi|402881549|ref|XP_003904331.1| PREDICTED: NHL repeat-containing protein 2 [Papio anubis]
Length = 688
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 49/87 (56%), Gaps = 3/87 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FPG ++ D+ NRL ++D+ HHRI+V NG+I IG PG +DG F S P
Sbjct: 223 LLFPGKVTVDQVTNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQ 282
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMG 349
++ +Y+ D+ENH IR+ G
Sbjct: 283 GVAIM--NNIIYVADTENHLIRKLTKG 307
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 433 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 492
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ LY+ D+ NH I+ D+ +++ +
Sbjct: 493 FNEPGGLCIGENGQLLYVADTNNHQIKIMDLETKMVSVL 531
>gi|451981204|ref|ZP_21929574.1| NHL repeat containing protein [Nitrospina gracilis 3/211]
gi|451761567|emb|CCQ90828.1| NHL repeat containing protein [Nitrospina gracilis 3/211]
Length = 488
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 65/137 (47%), Gaps = 21/137 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPA 322
L FPG I+ ES ++D+NH+++++ D G I IGS G DG FE + RP
Sbjct: 175 LSFPGKIAYSESDKLFAVADANHNQVVIADREGTIRHRIGSGAVGKADGGFEEASFYRPQ 234
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
+ DD +++ D+ENH +R+ D+ + + TV T ++ GF RD
Sbjct: 235 GLCFQ--DDVVWVADTENHLLRKIDLNSKQVTTVAGTG--------------DQGGFLRD 278
Query: 383 NDTKSEKLDPQSLIFPW 399
E +L PW
Sbjct: 279 THPAKET----ALSSPW 291
>gi|156399714|ref|XP_001638646.1| predicted protein [Nematostella vectensis]
gi|156225768|gb|EDO46583.1| predicted protein [Nematostella vectensis]
Length = 286
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGSCPGFEDGEFESSKLMRPAASF 325
P +S D+ G+RLF+ D+++H I V +G + +G GF+DG SK P
Sbjct: 155 PQSLSLDDEGDRLFVGDTDNHAIRVISLKDGSVQTLVGGSLGFKDGVGLKSKFYHPTGIA 214
Query: 326 YHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS 360
Y +++D LY+ D NH IR + RV+ T+ T
Sbjct: 215 YDRENDILYVSDHYNHVIRAVKVSERVVTTLAGTG 249
>gi|317419409|emb|CBN81446.1| NHL repeat-containing protein 2 [Dicentrarchus labrax]
Length = 719
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 79/181 (43%), Gaps = 26/181 (14%)
Query: 258 SVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESS 316
S+ L FPG ++ D S RL ++D+ HHRI++ G++L IG G DG+ +
Sbjct: 212 SLPTSFLSFPGKVAIDSSEKRLAIADTGHHRILLLSSTGQLLHVIGGPESGRRDGDLSEA 271
Query: 317 KLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEK 376
P D +Y+ D+ENH IR+ D+ + T+ +G+ +
Sbjct: 272 SFNSPQGVAIK--GDTVYVADTENHLIRKIDLLEGRVSTL---AGVGIQ----------- 315
Query: 377 LGFERDNDTKSEKLDPQSLIFPWHLMKS-----EDDNLLIINRSFETLWIMDLASGEIKE 431
G ++D + Q + PW + ED+ L I +W + LA G++ +
Sbjct: 316 -GTDKDGGAMGPQ---QPISSPWDVTLGTAGGVEDNVLWIAMAGTHQIWALFLADGKLPK 371
Query: 432 A 432
Sbjct: 372 G 372
>gi|405960880|gb|EKC26754.1| NHL repeat-containing protein 2 [Crassostrea gigas]
Length = 699
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 59/108 (54%), Gaps = 8/108 (7%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLM 319
N L FPG I SG + +SD+ H+R++V +G + CIG GF++G F +
Sbjct: 211 NFSLKFPGKICV--SGKNIVVSDTGHNRVLVLSKDGVVQHCIGGKERGFKNGTFTQCRFD 268
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNN 367
P ++ +Y+ D+ENHAIR+ D+ ++ + T+ +G K+ N
Sbjct: 269 SPQGVVMK--ENVIYLADTENHAIRKIDLQKQEVTTI---AGNGKQGN 311
>gi|261855185|ref|YP_003262468.1| redoxin [Halothiobacillus neapolitanus c2]
gi|261835654|gb|ACX95421.1| Redoxin domain protein [Halothiobacillus neapolitanus c2]
Length = 498
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 251 KEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFE 309
K P S N + P I+ ++ NR+ +SD+ H ++ +FD GK+L IG GF
Sbjct: 184 KLPLKPLSADNHFMVLPTKIAVND--NRVAISDTGHDQVRIFDHEGKLLATIGDGKAGFA 241
Query: 310 DGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
DG F +++ RP + +D LY+ D++N IRR D + + TV
Sbjct: 242 DGSFATAEFNRPQGIAWLGND--LYVADTDNQRIRRIDFASQTVSTV 286
>gi|91076402|ref|XP_969236.1| PREDICTED: similar to CG12547 CG12547-PA [Tribolium castaneum]
gi|270002449|gb|EEZ98896.1| hypothetical protein TcasGA2_TC004511 [Tribolium castaneum]
Length = 712
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 59/97 (60%), Gaps = 5/97 (5%)
Query: 264 LHFPGCISADESGNR---LFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLM 319
L FPG I+ + N+ L +SD+ ++RI++ +G IL +GS GF DG F +
Sbjct: 214 LLFPGKITNFVTKNQEEILAISDTGNNRILILKSDGTILHQVGSGEIGFRDGSFTQCEFN 273
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P + ++++ L++ D+ENHAIR+ D+ ++ +E+V
Sbjct: 274 APQGLVF-QNENTLFVADTENHAIRKIDLAKKTVESV 309
>gi|196009344|ref|XP_002114537.1| hypothetical protein TRIADDRAFT_11855 [Trichoplax adhaerens]
gi|190582599|gb|EDV22671.1| hypothetical protein TRIADDRAFT_11855 [Trichoplax adhaerens]
Length = 507
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 59/115 (51%), Gaps = 7/115 (6%)
Query: 259 VRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC----PGFEDGEFE 314
++ + L FPG + G+ + +SDS HHRI++ +G C+G+ GF DG F+
Sbjct: 163 IKPMTLSFPGKVCVVNDGDDIVISDSGHHRILIVGKDGITRSCVGNKEYFEEGFVDGVFQ 222
Query: 315 SSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSL 369
++ P ++ +++ D+ NHAIR+ D+ + T+ +G + N L
Sbjct: 223 KARFNNPQGITCSRNGKTIFVADTNNHAIRKIDLEYCEVTTI---AGTGNQGNDL 274
>gi|320103301|ref|YP_004178892.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
gi|319750583|gb|ADV62343.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
Length = 676
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 55/102 (53%), Gaps = 3/102 (2%)
Query: 260 RNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKL 318
R L +PG + A +G+ LF++D+ H+RI++ +G + +G+ G DG +E +
Sbjct: 200 RETPLLYPGKVVA--AGDSLFIADTGHNRIVIAGLDGSLKAVVGNGKIGMRDGAYERASF 257
Query: 319 MRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS 360
RP + LY+ D+ENHAIR D+ R + TV T
Sbjct: 258 NRPQGIRLDALRNRLYVADTENHAIRAIDLTTRSVTTVAGTG 299
>gi|340725457|ref|XP_003401086.1| PREDICTED: NHL repeat-containing protein 2-like [Bombus terrestris]
Length = 692
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 50/95 (52%), Gaps = 6/95 (6%)
Query: 274 ESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHKDDDC 332
E G L ++D+ H+RI+V D G + IG P F DG FE +K P D
Sbjct: 231 EQGENLVVADTGHNRILVMDTAGNVQHIIGGPNPDFRDGNFEIAKFNGPQG--MSTLDSV 288
Query: 333 LYIVDSENHAIRRADMGRRVLETVYPTSGISKKNN 367
+Y+ D++NHAIR+ D+ ++ TV +G K N
Sbjct: 289 IYVADTKNHAIRKIDLIKK---TVITIAGTGTKGN 320
>gi|410896258|ref|XP_003961616.1| PREDICTED: NHL repeat-containing protein 2-like [Takifugu rubripes]
Length = 716
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 52/98 (53%), Gaps = 3/98 (3%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRP 321
+L FPG I+ D+ RL ++D+ HHR+++ G++L IG G +DG+ + P
Sbjct: 217 ILSFPGKIAIDDGNKRLAIADTGHHRVLMVSTTGQLLQVIGGPESGRKDGDLSEASFNSP 276
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
D +Y+ D+ENH IR+ D+ + T+ T
Sbjct: 277 QGVAIK--GDTVYVADTENHLIRKIDLSAGKVSTLAGT 312
>gi|336313754|ref|ZP_08568676.1| thiol-disulfide isomerase-like thioredoxin [Rheinheimera sp. A13L]
gi|335881693|gb|EGM79570.1| thiol-disulfide isomerase-like thioredoxin [Rheinheimera sp. A13L]
Length = 498
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 77/170 (45%), Gaps = 23/170 (13%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLM 319
N +L PG ++ DE + +SD+ HH+I++ D GK++ +GS +DG +SS
Sbjct: 188 NTILAAPGAVTVDE--RYIAISDTLHHQIVLMDHQGKLVKRLGSGIAELKDGHSDSSAFS 245
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
P D LY+ D+ NHAIRR D+ + T+ +++ G
Sbjct: 246 SPQGLVL--TDKALYVADTGNHAIRRIDLSTFQVNTIAGNGELAQ-------------GR 290
Query: 380 ERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI 429
T ++ SL PW L +++ L I +W +DL S E+
Sbjct: 291 LISGSTPAQV----SLRSPWDL-ALDNNTLYIAMAGSHQIWTLDLKSSEL 335
>gi|47219684|emb|CAG12606.1| unnamed protein product [Tetraodon nigroviridis]
Length = 805
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/187 (25%), Positives = 80/187 (42%), Gaps = 19/187 (10%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRP 321
+L FPG I+ D+ R+ ++D+ HHR+++ G++L IG G +DG+ + P
Sbjct: 217 ILSFPGKIAIDDRSKRVAIADTGHHRVLLVSTTGQLLQVIGGPESGRKDGDLSEASFNSP 276
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNS--------LWAWI 373
D +Y+ D+ENH IR+ + L + P+ +S N+ L A
Sbjct: 277 QGVAI--KGDTVYVADTENHLIRKVQLITASLCSTLPSRCLSPCLNASVCIYQIDLSAGK 334
Query: 374 MEKLGFERDNDTKSEKLDP---QSLIFPWHLM-----KSEDDNLLIINRSFETLWIMDLA 425
+ L T E P Q + PW + E + L I +W + LA
Sbjct: 335 VSTLAGTGAQGTDKEGGAPGPRQPISSPWDVTLGSAGGDEANVLWIAMAGTHQIWALFLA 394
Query: 426 SGEIKEA 432
G++ +
Sbjct: 395 DGKLPKG 401
>gi|402814801|ref|ZP_10864394.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
gi|402507172|gb|EJW17694.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
Length = 571
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 47/82 (57%), Gaps = 3/82 (3%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPAASF 325
P ++AD+ GN L+++D+ +H I DG GK+ G PG++DG +++ P A
Sbjct: 144 PTGLAADKKGN-LYVADAGNHVIRKIDGAGKVTTVAGDGIPGWKDGAAGTARFHTPRA-I 201
Query: 326 YHKDDDCLYIVDSENHAIRRAD 347
DD +Y+ DS NH IRR D
Sbjct: 202 AVADDGAIYVTDSLNHVIRRID 223
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 67/141 (47%), Gaps = 10/141 (7%)
Query: 218 LNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPH----ACSSVRNLLLHFPGCISAD 273
LNK E+ Q E+ +SP+ W + + H A ++ + L FP I+A
Sbjct: 38 LNKKSEQ--AQAFEHFTSPAIEMTAWKAASHWYETVHLSGKAMANKQVERLQFPTGIAAG 95
Query: 274 ESGNRLFLSDSNHHRIIVFDGNGKILDCIG--SCPGFEDGEFESSKLMRPAASFYHKDDD 331
E+ + ++D+ +H + D G+I IG + G+EDGE + +P K +
Sbjct: 96 EN-QSIIVTDTRNHVLRRLDAEGRITATIGMINKQGWEDGEQAQVQFDQPTGLAADKKGN 154
Query: 332 CLYIVDSENHAIRRADMGRRV 352
LY+ D+ NH IR+ D +V
Sbjct: 155 -LYVADAGNHVIRKIDGAGKV 174
>gi|340370782|ref|XP_003383925.1| PREDICTED: NHL repeat-containing protein 2-like [Amphimedon
queenslandica]
Length = 730
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 53/99 (53%), Gaps = 2/99 (2%)
Query: 260 RNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSK 317
R++ L P ++ D++ L+++DS +H+I V + K+ + PG DG FE ++
Sbjct: 466 RDVKLQHPMGVAWDDTNQLLYIADSFNHKIKVVNPKTKVCSTLAGTGSPGLVDGSFEVAQ 525
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
PA ++ D LY+ D+ NHAIR D+ + + V
Sbjct: 526 FSEPAGLCMSQEGDTLYVADTNNHAIRILDLKEKKVSQV 564
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 88/193 (45%), Gaps = 27/193 (13%)
Query: 259 VRNLLLHFPGCISADESGN---RLFLSDSNHHRIIVFDG-NGKILDCIGS-CPGFEDGEF 313
V +L FPG + + + G L +SDS++HR++V D G + GS PGF+DG
Sbjct: 217 VEESILSFPGKLVSFKRGGVSELLCISDSSNHRVLVVDAVTGLVKQVYGSGSPGFKDGRG 276
Query: 314 ESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWI 373
+ ++ P ++C+Y+ D+ENH IR+ + + TV T
Sbjct: 277 KEAEFNCPQGLVIC--EECVYVADTENHLIRKISLSDDFVLTVAGT-------------- 320
Query: 374 MEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRS-FETLWIMDLASGEIKEA 432
G++ ++ + Q + PW L + D +++ I + LW L + ++ +
Sbjct: 321 ----GYQGNDKEGGKVGKEQEISSPWDLAINSDCSIIYIAMAGTHQLWAYFLNNAQLFKK 376
Query: 433 VK-GFSKVLEICG 444
V+ G V+ + G
Sbjct: 377 VEYGRGSVVRMVG 389
>gi|403364957|gb|EJY82253.1| Haloacid dehalogenase-like hydrolase domain-containing protein
[Oxytricha trifallax]
Length = 621
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 51/194 (26%), Positives = 89/194 (45%), Gaps = 32/194 (16%)
Query: 244 AKQAEVLKEPHACSSVRNLLLHFPGCISA--DESG-------NRLFLSDSNHHRIIVFDG 294
+KQ E L+E + +NL + PG + A D G + + ++DS+++R IV D
Sbjct: 232 SKQTEGLREEQIRAVKQNL--NNPGKVIAIVDTEGLPYPDTTDIMVVADSSNNRYIVIDM 289
Query: 295 NG-KILDCIGSCP-GFEDGEFESSKLMRPAASFY---HKDDDCLYIVDSENHAIRRADMG 349
N K +D IG+ GF+DG F+ ++ + K + CL D++NH IR ++
Sbjct: 290 NTWKCVDVIGNGKIGFKDGNFQEAEFHHTQGMCHFQNEKGEHCLMACDTKNHCIREINLH 349
Query: 350 RRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNL 409
++ + V A + E GF+RD K + Q + PW ++ D
Sbjct: 350 KKTVTKV--------------AGLPEVRGFDRDGGKKPA--NEQMIASPWDILNVGDGQF 393
Query: 410 LIINRSFETLWIMD 423
+ N +W++D
Sbjct: 394 IYCNAGTHQIWMLD 407
>gi|158293254|ref|XP_314572.4| AGAP010610-PA [Anopheles gambiae str. PEST]
gi|157016870|gb|EAA09865.4| AGAP010610-PA [Anopheles gambiae str. PEST]
Length = 730
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 57/101 (56%), Gaps = 9/101 (8%)
Query: 264 LHFPGCIS------ADESGNRLF-LSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFES 315
+ +PG ++ A S +LF +SDS +HR+++ D G +L +G GF DG F
Sbjct: 230 MKYPGKVACSVPTGAGGSEEQLFAVSDSGNHRVLIVDSAGTVLHKVGGKQSGFVDGNFTK 289
Query: 316 SKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
++ P + + D +++ D+ENHA+RR D+ R++ T+
Sbjct: 290 ARFNAPQGVAF-QGTDVVFVADNENHAVRRIDLKARLVSTI 329
>gi|242017281|ref|XP_002429120.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212513984|gb|EEB16382.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 791
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 54/95 (56%), Gaps = 6/95 (6%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRP- 321
L +P I+ GN +SDS +HRII+F+ G + IG GF+DG F+ + P
Sbjct: 208 LKYPSKIT--RWGNLFAVSDSGNHRIIIFNVEGVVKYTIGGKEQGFKDGNFQEALFDTPH 265
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+F+ ++ LYI D+ NHAIR D+ + ++ TV
Sbjct: 266 GLAFF--NESILYIADTNNHAIRLVDLNKMIVTTV 298
>gi|403363814|gb|EJY81656.1| NHL repeat-containing protein 2 [Oxytricha trifallax]
Length = 478
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 81/180 (45%), Gaps = 31/180 (17%)
Query: 258 SVRNLLLHFPG---CISADES--GNRLFLSDSNHHRIIVFDGNGKILDCIGSCP----GF 308
+ +N L +PG CI + N + +SD+ ++R+I+ N + L+C G G
Sbjct: 82 TAQNQNLRYPGKVLCIEKQKGLDNNLMIISDTGNNRLIIV--NEETLECQGVIGNGKIGL 139
Query: 309 EDGEFESSKLMRPAA--SFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKN 366
DG +E P Y + +Y+ D++NHAIR ++ ++ + TV T
Sbjct: 140 VDGNYEEGSFHHPQGMCHIYRDNMHFIYLCDTKNHAIREINLTKKEVLTVIGTG------ 193
Query: 367 NSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLAS 426
EK GF+R+ + + E Q L PW ++ D LLI +W ++L +
Sbjct: 194 --------EK-GFDREGNKEPEL---QKLSSPWDIVAVNRDTLLIAMAGVHQIWALNLKT 241
>gi|449017726|dbj|BAM81128.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 924
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 48/90 (53%), Gaps = 2/90 (2%)
Query: 257 SSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS--CPGFEDGEFE 314
+S + L L FPG + +RLF++DS HHRI+V +G+ ++ IG GF DG F
Sbjct: 460 TSRQALALRFPGKLDVHAESDRLFIADSGHHRILVTKLSGEFIEQIGGREGAGFRDGTFS 519
Query: 315 SSKLMRPAASFYHKDDDCLYIVDSENHAIR 344
+ P + + L + DS N+A+R
Sbjct: 520 EALFQYPQGLVFDPLGNRLIVADSGNNALR 549
>gi|262199596|ref|YP_003270805.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
gi|262082943|gb|ACY18912.1| NHL repeat containing protein [Haliangium ochraceum DSM 14365]
Length = 344
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/177 (26%), Positives = 74/177 (41%), Gaps = 17/177 (9%)
Query: 257 SSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFES 315
SS R + PG ++ DE G L+ +D+ +HR+++ D G + +GS PG+ DG+
Sbjct: 24 SSKRAFPVRAPGHVAVDEHGT-LWTADTGNHRLVMADRGGAVRLVVGSGDPGWRDGDGAD 82
Query: 316 SKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIME 375
+ L P LY+ DS IRR D+ TV +G ++ +L
Sbjct: 83 AALHAPRGIAVCASGRRLYVADSGTCTIRRVDLDAVGDATVTTVAGTGERGEALAPGRRP 142
Query: 376 KLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKEA 432
L +L P + ED LL +W+++L SGE A
Sbjct: 143 AL--------------RTALPGPVAVAVDEDRELLFATMDAGQVWVVEL-SGERAAA 184
>gi|254524393|ref|ZP_05136448.1| NHL repeat protein [Stenotrophomonas sp. SKA14]
gi|219721984|gb|EED40509.1| NHL repeat protein [Stenotrophomonas sp. SKA14]
Length = 470
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/205 (25%), Positives = 89/205 (43%), Gaps = 37/205 (18%)
Query: 163 DQTRLVEMLMKEYITFPILLSN----------KNFPQMENGACYLLSKDFGNARVFHENS 212
D +++L ++ +T P+LL +P M L ++ R+ + +
Sbjct: 76 DPGAALKLLRRQGLTMPVLLDADWDGWRRFGINAWPTM----VLLDAQGREQQRLVGQGA 131
Query: 213 LDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISA 272
G L +A+ L Q S+ P G +E+ EP L FP ++
Sbjct: 132 --PGELERALNALC--QGAPSAPPRG-------GSELHPEPRQA-------LRFPLGLAV 173
Query: 273 DESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPAASFYHKDDD 331
S RL+++DS HHRI+ G+IL G F DG + RP A + D
Sbjct: 174 --STERLYIADSGHHRILECSHGGRILRQFGLGTADFMDGNLAEAAFHRPQALVLER--D 229
Query: 332 CLYIVDSENHAIRRADMGRRVLETV 356
LY+ D+ NHA+RR ++ +++T+
Sbjct: 230 ALYVADTGNHAVRRINLLTGIVDTL 254
>gi|78047814|ref|YP_363989.1| hypothetical protein XCV2258 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78036244|emb|CAJ23935.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 499
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 53/190 (27%), Positives = 87/190 (45%), Gaps = 28/190 (14%)
Query: 166 RLVEMLMKEYITFPILLSN-----KNFPQMENGACYLLSKDFGNARVFHENSLDI-GMLN 219
R ++ L ++ PILL + F +E+ +L +G R + + + G L+
Sbjct: 106 RSLKQLRGHGVSAPILLDKDWETWRRF-GIESWPTLVLLDAYGRER---QRLVGVTGDLD 161
Query: 220 KAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRL 279
KA+ L QQ S + AE EPH L L FP ++A E +RL
Sbjct: 162 KALTALCEGQQPPSDA------DLHGVAEREPEPH-------LALRFPTGLAATE--DRL 206
Query: 280 FLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
+++D+ HHR++ +G++L G F DG + RP +D LY+ D+
Sbjct: 207 YVADTGHHRVLECTHSGRVLRQFGHGNADFIDGGVGEAAFRRPQGLALERDQ--LYVADT 264
Query: 339 ENHAIRRADM 348
NHA+RR ++
Sbjct: 265 GNHALRRINL 274
>gi|408824784|ref|ZP_11209674.1| NHL repeat containing protein [Pseudomonas geniculata N1]
Length = 470
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 50/94 (53%), Gaps = 5/94 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPA 322
L FP ++ S RL+++DS HHRI+ G+IL G F DG + RP
Sbjct: 165 LRFPLGLAV--STERLYIADSGHHRILECSHGGRILRQFGLGTADFMDGNLAEAAFHRPQ 222
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
A + D LY+ D+ NHA+RR ++ +++T+
Sbjct: 223 ALVLER--DALYVADTGNHAVRRINLLTGIVDTL 254
>gi|424667903|ref|ZP_18104928.1| hypothetical protein A1OC_01487 [Stenotrophomonas maltophilia
Ab55555]
gi|401068165|gb|EJP76689.1| hypothetical protein A1OC_01487 [Stenotrophomonas maltophilia
Ab55555]
Length = 470
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 50/94 (53%), Gaps = 5/94 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPA 322
L FP ++ S RL+++DS HHRI+ G+IL G F DG + RP
Sbjct: 165 LRFPLGLAV--STERLYIADSGHHRILECSHGGRILRQFGLGTADFMDGNLAEAAFHRPQ 222
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
A + D LY+ D+ NHA+RR ++ +++T+
Sbjct: 223 ALVLER--DALYVADTGNHAVRRINLLTGIVDTL 254
>gi|456733989|gb|EMF58811.1| Hypothetical protein EPM1_4079 [Stenotrophomonas maltophilia EPM1]
Length = 470
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 50/94 (53%), Gaps = 5/94 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPA 322
L FP ++ S RL+++DS HHRI+ G+IL G F DG + RP
Sbjct: 165 LRFPLGLAV--STERLYIADSGHHRILECSHGGRILRQFGLGTADFMDGNLAEAAFHRPQ 222
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
A + D LY+ D+ NHA+RR ++ +++T+
Sbjct: 223 ALVLER--DALYVADTGNHAVRRINLLTGIVDTL 254
>gi|427796943|gb|JAA63923.1| Putative tlpa-like family, partial [Rhipicephalus pulchellus]
Length = 719
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 52/98 (53%), Gaps = 4/98 (4%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FP + A E RL ++D+ HHRI++ + +G++L +G PG++DG + P
Sbjct: 247 LCFPAKVFATEM--RLIIADTGHHRIVITNHSGEVLQVVGGYEPGYQDGPLNGALFSEPN 304
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS 360
+ +D + + D+ NHAIR D+ + T+ T
Sbjct: 305 GVQW-RDPHFILVADTGNHAIREVDLENGTVSTLAGTG 341
>gi|386717879|ref|YP_006184205.1| hypothetical protein SMD_1470 [Stenotrophomonas maltophilia D457]
gi|384077441|emb|CCH12027.1| Alr1013 protein [Stenotrophomonas maltophilia D457]
Length = 470
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 66/142 (46%), Gaps = 21/142 (14%)
Query: 216 GMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADES 275
G L +A+ L Q S+ P G +E+ EP L FP ++ S
Sbjct: 133 GELERALNTLC--QGAPSAPPRG-------GSELHPEPRQA-------LRFPLGLAV--S 174
Query: 276 GNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPAASFYHKDDDCLY 334
RL+++DS HHRI+ G+IL G F DG + RP A + D LY
Sbjct: 175 TERLYIADSGHHRILECSHGGRILRQFGLGTADFMDGNLAEAAFHRPQALVLER--DSLY 232
Query: 335 IVDSENHAIRRADMGRRVLETV 356
+ D+ NHA+RR ++ +++T+
Sbjct: 233 VADTGNHAVRRINLLTGIVDTL 254
>gi|190573634|ref|YP_001971479.1| hypothetical protein Smlt1644 [Stenotrophomonas maltophilia K279a]
gi|190011556|emb|CAQ45175.1| conserved hypothetical protein [Stenotrophomonas maltophilia K279a]
Length = 470
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 50/94 (53%), Gaps = 5/94 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPA 322
L FP ++ S RL+++DS HHRI+ G+IL G F DG + RP
Sbjct: 165 LRFPLGLAV--STERLYIADSGHHRILECSHGGRILRQFGLGTADFMDGNLAEAAFHRPQ 222
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
A + D LY+ D+ NHA+RR ++ +++T+
Sbjct: 223 ALVLER--DALYVADTGNHAVRRINLLTGIVDTL 254
>gi|344206806|ref|YP_004791947.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
gi|343778168|gb|AEM50721.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
Length = 470
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 66/142 (46%), Gaps = 21/142 (14%)
Query: 216 GMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADES 275
G L +A+ L Q S+ P G +E+ EP L FP ++ S
Sbjct: 133 GELERALNALC--QGAPSAPPRG-------GSELHPEPRQA-------LRFPLGLAV--S 174
Query: 276 GNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPAASFYHKDDDCLY 334
RL+++DS HHRI+ G+IL G F DG + RP A + D LY
Sbjct: 175 TERLYIADSGHHRILECSHGGRILRQFGLGTADFMDGNLAEAAFHRPQALVLER--DSLY 232
Query: 335 IVDSENHAIRRADMGRRVLETV 356
+ D+ NHA+RR ++ +++T+
Sbjct: 233 VADTGNHAVRRINLLTGIVDTL 254
>gi|255513298|gb|EET89564.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Candidatus Micrarchaeum acidiphilum ARMAN-2]
Length = 579
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 53/98 (54%), Gaps = 4/98 (4%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNG-KILDCIGS-CPGFEDGEFESSKLMRP 321
L +PG + G ++ +S+SN + I++ D +LD G+ G DG FE+S+ RP
Sbjct: 247 LRYPGKLCMSPDGTKIAISNSNANEIVIIDAKSLAVLDTAGNGAKGLSDGSFETSEFYRP 306
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
+ + +YI D+EN+A+R ++ R + TV T
Sbjct: 307 QGLEWVGNR--IYIADTENNAVREINLDDRTVATVLGT 342
>gi|194365180|ref|YP_002027790.1| NHL repeat containing protein [Stenotrophomonas maltophilia R551-3]
gi|194347984|gb|ACF51107.1| NHL repeat containing protein [Stenotrophomonas maltophilia R551-3]
Length = 470
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 50/94 (53%), Gaps = 5/94 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPA 322
L FP ++ S RL+++DS HHRI+ G+IL G F DG + RP
Sbjct: 165 LRFPLGLAV--STERLYIADSGHHRILECSHGGRILRQFGLGTADFMDGNLAEAAFHRPQ 222
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
A + D LY+ D+ NHA+RR ++ +++T+
Sbjct: 223 ALVLER--DSLYVADTGNHAVRRINLLTGIVDTL 254
>gi|392397842|ref|YP_006434443.1| NHL repeat protein,AhpC/TSA family protein [Flexibacter litoralis
DSM 6794]
gi|390528920|gb|AFM04650.1| NHL repeat protein,AhpC/TSA family protein [Flexibacter litoralis
DSM 6794]
Length = 543
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 54/224 (24%), Positives = 92/224 (41%), Gaps = 53/224 (23%)
Query: 235 SPSGLKCTWAKQAE-----------VLKEPHAC-----------SSVRNLLLHFPGCI-- 270
SP+G K W + E LKE H + V L FP I
Sbjct: 179 SPTG-KVVWQRAGESFYEDARNQILALKEQHKAELNTQKFDFQLAKVEKKELMFPSKIIE 237
Query: 271 --SADESGNR-----LFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPA 322
+ +++ N+ +++DS ++RII + GK+L+ IG S G DG FE + P
Sbjct: 238 VKTENDTDNKNNEPTFWIADSGNNRIIKINLEGKVLETIGNSKKGNTDGTFEETSFYEPH 297
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
+ + LYI D++N+ I+ AD+ + ++ + T + G+
Sbjct: 298 GLALSPNGEKLYIADTKNNVIKEADLVNKTVKIINGTG---------------ETGYYFG 342
Query: 383 NDTKSEKLDPQSLIFPWHLM--KSEDDNLLIINRSFETLWIMDL 424
+ E ++P S PW L+ K + + I N + M+L
Sbjct: 343 EKSWGENINPNS---PWDLLIDKKYPNTMYIANAGNHQILKMNL 383
>gi|410477767|ref|YP_006765404.1| NHL repeat domain-containing protein [Leptospirillum ferriphilum
ML-04]
gi|406773019|gb|AFS52444.1| NHL repeat domain protein [Leptospirillum ferriphilum ML-04]
Length = 494
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 70/158 (44%), Gaps = 25/158 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFD------GNGKILDCIG-SCPGFEDGEFESS 316
L FP I D SG ++ +SDS +HR+++ D + + IG PG DG F+ +
Sbjct: 176 LRFPAKI--DISGKKVLVSDSGNHRLLLLDWEEHSPEKAALREVIGQGTPGSADGSFDQA 233
Query: 317 KLMRPAASFYHKDD-DCLYIVDSENHAIRRADMGRRVLETVY-----------PTSGISK 364
+ P + DD D + D+ NH +RR D RR + T+ P +S
Sbjct: 234 QFRDPQGIRFCPDDPDIAIVADTGNHLLRRVDFRRRSVTTIAGTGVQGWAIFEPVPAMSA 293
Query: 365 KNNSLWAWIMEKLGF---ERDNDTKSEKLDPQSL-IFP 398
NS W + + G + + +LDP+ IFP
Sbjct: 294 VLNSPWDILFHRDGMLYVAQAGPHQIIRLDPERQEIFP 331
>gi|285018581|ref|YP_003376292.1| nhl repeat protein [Xanthomonas albilineans GPE PC73]
gi|283473799|emb|CBA16302.1| hypothetical nhl repeat protein [Xanthomonas albilineans GPE PC73]
Length = 473
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 18/129 (13%)
Query: 218 LNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGN 277
L+KA+ L +Q P L + E EP L+L FP ++ E +
Sbjct: 134 LDKALHALCAEQ------PLSLDDELPEVPETQPEP-------RLMLRFPSGLALYE--D 178
Query: 278 RLFLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPAASFYHKDDDCLYIV 336
RL+++DS HHRI+ + G++L G+ F DGE + RP + LY+
Sbjct: 179 RLYIADSGHHRILECNTGGRLLRQFGTGTADFIDGELGQAAFHRPQGLALQR--GVLYVA 236
Query: 337 DSENHAIRR 345
D+ NHA+RR
Sbjct: 237 DTGNHALRR 245
>gi|389875076|ref|YP_006374432.1| NHL repeat-containing protein [Tistrella mobilis KA081020-065]
gi|388532256|gb|AFK57450.1| NHL repeat-containing protein [Tistrella mobilis KA081020-065]
Length = 487
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 60/238 (25%), Positives = 96/238 (40%), Gaps = 32/238 (13%)
Query: 281 LSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSE 339
++D+ HH+I +FD G + +GS PG DG ES++ RP D +++ D+
Sbjct: 191 IADAGHHQIALFDNAGNEIRRVGSGEPGLVDGPAESARFQRPQGLI--AADGAIFVADTW 248
Query: 340 NHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPW 399
NHA+RR D+ + T+ +G ++ L A + +D +L PW
Sbjct: 249 NHAVRRIDVASGEVTTI---AGTGRRGPILKA--------------PAPAID-SALASPW 290
Query: 400 HLMKSEDDNLLIINRSFETLWIMDLASGEIKEAVKGFSKVLEICG-----VLVMEKVFLL 454
L E L + N L +DL +G + E + G + G L L
Sbjct: 291 DLEYRE-GVLYVANAGTHQLARIDLETG-VLEPLAGTGAEALVDGPADQAALAQPSALAL 348
Query: 455 KQMPQDWLLHQIDSSCSLKELPYAGLISSSIAFQNHILLCDIVGQ-RIMRLNRESGVC 511
K +D L + SL+ L AG S++ + G R R G+C
Sbjct: 349 K---EDRLWFIDAETSSLRRLDLAGGTVSTLVGDGLFEFGMVDGPARTARFQHPLGLC 403
>gi|289665900|ref|ZP_06487481.1| hypothetical protein XcampvN_23187 [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 472
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 86/190 (45%), Gaps = 28/190 (14%)
Query: 166 RLVEMLMKEYITFPILLSN-----KNFPQMENGACYLLSKDFGNARVFHENSLDI-GMLN 219
R ++ L ++ PILL + F +E+ +L +G R + + + G L+
Sbjct: 79 RALKQLRGHGVSAPILLDKDWETWRRF-GIESWPTLVLLDAYGRER---QRLVGVTGDLD 134
Query: 220 KAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRL 279
KA+ L QQ S + AE EPH L L FP ++A E +RL
Sbjct: 135 KALTALCEGQQPPSDA------DLHGVAERGPEPH-------LALRFPTGLAATE--DRL 179
Query: 280 FLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
+++D+ HHR++ +G++L G DG + RP +D LY+ D+
Sbjct: 180 YVADTGHHRVLECTHSGRVLRQFGHGNADLIDGGVGEAAFRRPQGLALERDQ--LYVADT 237
Query: 339 ENHAIRRADM 348
NHA+RR ++
Sbjct: 238 GNHALRRINL 247
>gi|431807572|ref|YP_007234470.1| hypothetical protein BPP43_04715 [Brachyspira pilosicoli P43/6/78]
gi|430780931|gb|AGA66215.1| tetratricopeptide repeat-containing protein [Brachyspira pilosicoli
P43/6/78]
Length = 688
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 49/90 (54%), Gaps = 9/90 (10%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P + AD G RL++SD+ ++RI+VFD NG LD G G DGEF S PA
Sbjct: 250 PSSLYADNGG-RLYVSDTGNNRIVVFDVNGNYLDSFGKM-GENDGEFFS-----PAGIV- 301
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETV 356
D++ +Y+ D N +++ D+ + T+
Sbjct: 302 -VDNNYIYVADMGNKRVQKFDLNGNYISTI 330
>gi|198421727|ref|XP_002130106.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 454
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 51/98 (52%), Gaps = 5/98 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHH--RIIVFDGNGKILDCIGSCP---GFEDGEFESSKL 318
L+ P + D++ +RLF++D+++H RII D + + P GF+DG F +
Sbjct: 174 LNNPQQLEYDDTNDRLFIADTDNHAIRIINVDSATPKVTTLTGGPRMEGFKDGSFAEAMW 233
Query: 319 MRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P Y D LY+ D NHA+R D+ ++ + T+
Sbjct: 234 RHPTGMAYDVRKDVLYVSDHYNHAVRMLDLKKKTVTTI 271
Score = 39.3 bits (90), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 44/85 (51%), Gaps = 4/85 (4%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGSC--PGFEDGEFESSKLM 319
H G ++ D +++++DS ++ I + D G ++ G+ F+DG ++ L
Sbjct: 117 FFHIAG-MALDHKRQKMYIADSGNNAIRLMDLGTNEVTTFAGTNGDKAFKDGSLLTAGLN 175
Query: 320 RPAASFYHKDDDCLYIVDSENHAIR 344
P Y +D L+I D++NHAIR
Sbjct: 176 NPQQLEYDDTNDRLFIADTDNHAIR 200
>gi|407780651|ref|ZP_11127872.1| NHL repeat-containing protein [Oceanibaculum indicum P24]
gi|407208878|gb|EKE78785.1| NHL repeat-containing protein [Oceanibaculum indicum P24]
Length = 498
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/224 (25%), Positives = 92/224 (41%), Gaps = 30/224 (13%)
Query: 137 VKSIQQSFPQ-LQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPIL-----LSNKNFPQM 190
+K ++Q+FP+ + VIG +H + + + + + I P++ L + +
Sbjct: 54 LKRLEQAFPEDVAVIG-VHSPKFAAEREPEAVRHAIARHGIAHPVVHDPDFLLWQEYAVR 112
Query: 191 ENGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQ-QQENSSSPSGLKCTWAKQAEV 249
+S D G H D L + VE+ + +++ + P L
Sbjct: 113 AWPTLIFVSPD-GYVIGQHSGEPDAERLQEVVEKTLKDYRRKGTMRPRPLDL-------- 163
Query: 250 LKEPHACSSVRNLLLHFPGCI-----SADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS 304
EP S R FPG I + D G L+DS HH+I++ D G+ L GS
Sbjct: 164 --EPERMPSAR---FRFPGKIKPLPEAPDSPGKLWILADSGHHQIVILDDEGRELMRFGS 218
Query: 305 C-PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRAD 347
GF G + ++ P D +Y+ D+ NHAIRR D
Sbjct: 219 GEAGFASGPADRARFRDPQG--LAADTGAIYVADTGNHAIRRID 260
>gi|346725095|ref|YP_004851764.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
pv. citrumelo F1]
gi|418518756|ref|ZP_13084891.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
pv. malvacearum str. GSPB1386]
gi|346649842|gb|AEO42466.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
pv. citrumelo F1]
gi|410702450|gb|EKQ60955.1| Thiol-disulfide isomerase and thioredoxin [Xanthomonas axonopodis
pv. malvacearum str. GSPB1386]
Length = 472
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 86/190 (45%), Gaps = 28/190 (14%)
Query: 166 RLVEMLMKEYITFPILLSN-----KNFPQMENGACYLLSKDFGNARVFHENSLDI-GMLN 219
R ++ L ++ PILL + F +E+ +L +G R + + + G L+
Sbjct: 79 RSLKQLRGHGVSAPILLDKDWETWRRF-GIESWPTLVLLDAYGRER---QRLVGVTGDLD 134
Query: 220 KAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRL 279
KA+ L QQ S + AE EPH L L FP ++A E +RL
Sbjct: 135 KALTALCEGQQPPSDA------DLHGVAEREPEPH-------LALRFPTGLAATE--DRL 179
Query: 280 FLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
+++D+ HHR++ +G++L G DG + RP +D LY+ D+
Sbjct: 180 YVADTGHHRVLECTHSGRVLRQFGHGNADLIDGGVGEAAFRRPQGLALERDQ--LYVADT 237
Query: 339 ENHAIRRADM 348
NHA+RR ++
Sbjct: 238 GNHALRRINL 247
>gi|289669010|ref|ZP_06490085.1| hypothetical protein XcampmN_11072, partial [Xanthomonas campestris
pv. musacearum NCPPB 4381]
Length = 284
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 86/190 (45%), Gaps = 28/190 (14%)
Query: 166 RLVEMLMKEYITFPILLSN-----KNFPQMENGACYLLSKDFGNARVFHENSLDI-GMLN 219
R ++ L ++ PILL + F +E+ +L +G R + + + G L+
Sbjct: 79 RALKQLRGHGVSAPILLDKDWETWRRF-GIESWPTLVLLDAYGRER---QRLVGVTGDLD 134
Query: 220 KAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRL 279
KA+ L QQ S + AE EPH L L FP ++A E +RL
Sbjct: 135 KALTALCEGQQPPSDA------DLHGVAERGPEPH-------LALRFPTGLAATE--DRL 179
Query: 280 FLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
+++D+ HHR++ +G++L G DG + RP +D LY+ D+
Sbjct: 180 YVADTGHHRVLECTHSGRVLRQFGHGNADLIDGGVGEAAFRRPQGLALERDQ--LYVADT 237
Query: 339 ENHAIRRADM 348
NHA+RR ++
Sbjct: 238 GNHALRRINL 247
>gi|347757619|ref|YP_004865181.1| NHL repeat family protein [Micavibrio aeruginosavorus ARL-13]
gi|347590137|gb|AEP09179.1| NHL repeat family protein [Micavibrio aeruginosavorus ARL-13]
Length = 495
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 77/178 (43%), Gaps = 29/178 (16%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPA 322
L FP I +++G+ +F++DS H+RI+V D G + IGS G +DG F +K P
Sbjct: 188 LSFPARI--EDAGDFIFVADSGHNRILVLDRTGAVKMTIGSGELGHQDGSFVMAKFNHPR 245
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERD 382
+ LYI D++NH +R A++ + T+ SG K+ +
Sbjct: 246 GMVLIGRN--LYIADTDNHMLRVANLDSGSVMTL---SGNGKRTFA-------------- 286
Query: 383 NDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLW-------IMDLASGEIKEAV 433
K++K L PW + D ++ I LW M + G EAV
Sbjct: 287 PPFKAQKAMTTELASPWDVEALPDGSVAIAMAGVHQLWRYAPTARTMQVMVGSGAEAV 344
>gi|325926967|ref|ZP_08188244.1| NHL repeat protein [Xanthomonas perforans 91-118]
gi|325542663|gb|EGD14128.1| NHL repeat protein [Xanthomonas perforans 91-118]
Length = 443
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 86/190 (45%), Gaps = 28/190 (14%)
Query: 166 RLVEMLMKEYITFPILLSN-----KNFPQMENGACYLLSKDFGNARVFHENSLDI-GMLN 219
R ++ L ++ PILL + F +E+ +L +G R + + + G L+
Sbjct: 79 RSLKQLRGHGVSAPILLDKDWETWRRF-GIESWPTLVLLDAYGRER---QRLVGVTGDLD 134
Query: 220 KAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRL 279
KA+ L QQ S + AE EPH L L FP ++A E +RL
Sbjct: 135 KALTALCEGQQPPSDA------DLHGVAEREPEPH-------LALRFPTGLAATE--DRL 179
Query: 280 FLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
+++D+ HHR++ +G++L G DG + RP +D LY+ D+
Sbjct: 180 YVADTGHHRVLECTHSGRVLRQFGHGNADLIDGGVGEAAFRRPQGLALERDQ--LYVADT 237
Query: 339 ENHAIRRADM 348
NHA+RR ++
Sbjct: 238 GNHALRRINL 247
>gi|405375498|ref|ZP_11029528.1| putative hemolysin [Chondromyces apiculatus DSM 436]
gi|397086225|gb|EJJ17355.1| putative hemolysin [Myxococcus sp. (contaminant ex DSM 436)]
Length = 855
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 45/85 (52%), Gaps = 1/85 (1%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAA 323
L+ P I+ D +GN +++SD++H+ I D GK+ GS PG +DG + + +PA
Sbjct: 616 LNGPMGIATDAAGN-VYVSDTDHYVIRRIDPEGKVELFAGSTPGLQDGPAKQAAFNQPAG 674
Query: 324 SFYHKDDDCLYIVDSENHAIRRADM 348
D L + D N IRR D+
Sbjct: 675 LTVTPDGSALLVADLNNGVIRRIDL 699
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 53/102 (51%), Gaps = 4/102 (3%)
Query: 245 KQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRII-VFDGNGKILDCIG 303
++ +++ E H ++++ L+ P ++ E G L++ +S R++ + DG ++ G
Sbjct: 695 RRIDLVAEGHPVTTLQGDWLYRPSGVTVSEDGGTLYVVESGMSRVVRIRDGVTSVV--AG 752
Query: 304 SCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRR 345
+ PGF DG ESS+ + P D L + D N+ +RR
Sbjct: 753 TTPGFRDGAPESSQFL-PYLGIAVLKDGSLAVSDPGNYRVRR 793
>gi|407792264|ref|ZP_11139332.1| hypothetical protein B3C1_18191 [Gallaecimonas xiamenensis 3-C-1]
gi|407197721|gb|EKE67773.1| hypothetical protein B3C1_18191 [Gallaecimonas xiamenensis 3-C-1]
Length = 488
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 5/106 (4%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPA 322
L PG + G+ L +SDS H+RII +G+++ +GS +DG E + P
Sbjct: 185 LAAPGKVRV--KGDLLAISDSGHNRIIFARPDGRVVKVVGSGEACAKDGSAEEACFASPQ 242
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNS 368
+ + D LY+ D+ NH IRR D+ + TV T + N+
Sbjct: 243 GTLFQ--DQALYVADTNNHLIRRIDLRNYQVSTVAGTGKLGGYLNA 286
>gi|357008076|ref|ZP_09073075.1| copper amine oxidase domain-containing protein [Paenibacillus elgii
B69]
Length = 533
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 81/181 (44%), Gaps = 26/181 (14%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRP 321
L + P ++AD +GN ++++DS +H I D G++ G+ G +DGE + + RP
Sbjct: 126 LFNEPQGLAADANGN-VYVADSGNHAIRKIDTAGQVSTVAGNGLLGRKDGEGKDALFYRP 184
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIME------ 375
D LY+ DS NHAIR ++ P SG K N+L ++E
Sbjct: 185 T-DVAVAADGTLYVADSLNHAIR----------SISP-SGEVKTLNALSPRVVELFPGQV 232
Query: 376 -KLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKEAVK 434
G D D KS K + P L+ NL++ + + + +DL G++
Sbjct: 233 SPAGDFADGDLKSAKFNE-----PTALVLDAKGNLIVSDSGNQRIRYIDLQQGKVTTLAG 287
Query: 435 G 435
G
Sbjct: 288 G 288
>gi|269926393|ref|YP_003323016.1| alkyl hydroperoxide reductase [Thermobaculum terrenum ATCC BAA-798]
gi|269790053|gb|ACZ42194.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Thermobaculum terrenum ATCC BAA-798]
Length = 493
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 78/169 (46%), Gaps = 21/169 (12%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRP 321
+L FPG + A E + L+++DSNH+RI+ +GKI G+ G DG +K P
Sbjct: 175 ILAFPGKVLALEDEDSLYIADSNHNRILECSLSGKIRRIWGNGEEGLVDGSASEAKFNHP 234
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFER 381
++ LY+ D+ENHA+R + +ET+ +G ++ + F+
Sbjct: 235 QGMAIRGNE--LYVADTENHALRLLHLNEGKVETI---AGTGEQGYP--------ISFQP 281
Query: 382 DNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIK 430
+E L PW L + +D L I +W M++ SG ++
Sbjct: 282 SIGKYTE------LNSPWDL-EIVNDVLYIAMAGCHQIWAMEMPSGVVR 323
>gi|170759724|ref|YP_001788082.1| NHL repeat-containing protein [Clostridium botulinum A3 str. Loch
Maree]
gi|169406713|gb|ACA55124.1| NHL repeat protein [Clostridium botulinum A3 str. Loch Maree]
Length = 464
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 55/92 (59%), Gaps = 4/92 (4%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAAS 324
+P IS +E+G + S+ +RI++ D N + + IGS G +DG+F+ ++ P
Sbjct: 175 YPSKISINENG-LIAFSEKGKNRIVILDQNLEKIKTIGSSEYGLKDGDFKETQFKAPEGL 233
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ ++ +Y+ D+ENH+IR D+ + +++T+
Sbjct: 234 RFIENK--IYVADTENHSIRECDLEKEIVKTI 263
>gi|418523208|ref|ZP_13089230.1| hypothetical protein WS7_19596 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410700170|gb|EKQ58738.1| hypothetical protein WS7_19596 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 472
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 52/103 (50%), Gaps = 12/103 (11%)
Query: 247 AEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP 306
AE EPH L L FP ++A E +RL+++D+ HHR++ +G++L G
Sbjct: 156 AEREPEPH-------LALRFPTGLAATE--DRLYIADTGHHRVLECTHSGRVLRQFGHGN 206
Query: 307 G-FEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADM 348
DG + RP +D LY+ D+ NHA+RR ++
Sbjct: 207 ADLIDGGVGEAAFRRPQGLALERDQ--LYVADTGNHALRRINL 247
>gi|398337803|ref|ZP_10522508.1| NHL repeat-containing protein [Leptospira kmetyi serovar Malaysia
str. Bejo-Iso9]
Length = 396
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 54/102 (52%), Gaps = 2/102 (1%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGSCPGFEDGEFESS-KL 318
N+ L+ P ++ D+ +++SD+N+HRI +D N + +G+ G+ DG +L
Sbjct: 136 NVKLNRPSYVAVDDQAKYIYISDTNNHRIRKYDIANDMLSTVVGTVAGYHDGAGTYDVRL 195
Query: 319 MRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS 360
P D+ L+I D N+ +RR D+ +++T+ P +
Sbjct: 196 NTPKGIDLSFDNKYLFIADYGNNKVRRYDIANDMVDTIVPNA 237
>gi|381173685|ref|ZP_09882761.1| NHL repeat family protein [Xanthomonas citri pv. mangiferaeindicae
LMG 941]
gi|380685855|emb|CCG39248.1| NHL repeat family protein [Xanthomonas citri pv. mangiferaeindicae
LMG 941]
Length = 472
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 52/103 (50%), Gaps = 12/103 (11%)
Query: 247 AEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP 306
AE EPH L L FP ++A E +RL+++D+ HHR++ +G++L G
Sbjct: 156 AEREPEPH-------LALRFPTGLAATE--DRLYVADTGHHRVLECTHSGRVLRQFGHGN 206
Query: 307 G-FEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADM 348
DG + RP +D LY+ D+ NHA+RR ++
Sbjct: 207 ADLIDGGVGEAAFRRPQGLALERDQ--LYVADTGNHALRRINL 247
>gi|206603181|gb|EDZ39661.1| NHL repeat domain protein [Leptospirillum sp. Group II '5-way CG']
Length = 494
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 60/135 (44%), Gaps = 21/135 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVF------DGNGKILDCIG-SCPGFEDGEFESS 316
L FP I D +G ++ +SDS +HR+++ + + IG PG DG F+ +
Sbjct: 176 LRFPAKI--DIAGKKVLVSDSGNHRLLLLAWEEHSPEKAALREVIGQGIPGSADGSFDQA 233
Query: 317 KLMRPAASFYHKDD-DCLYIVDSENHAIRRADMGRRVLETVY-----------PTSGISK 364
+ P + DD D + D+ NH +RR D RR + T+ P S +S
Sbjct: 234 QFRDPQGVRFCPDDPDTAIVADTGNHLLRRVDFRRRSVTTIAGTGVQGWAIFEPVSAMSA 293
Query: 365 KNNSLWAWIMEKLGF 379
NS W + + G
Sbjct: 294 VLNSPWDILFHRDGM 308
>gi|390991569|ref|ZP_10261830.1| NHL repeat family protein [Xanthomonas axonopodis pv. punicae str.
LMG 859]
gi|372553665|emb|CCF68805.1| NHL repeat family protein [Xanthomonas axonopodis pv. punicae str.
LMG 859]
Length = 472
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 52/103 (50%), Gaps = 12/103 (11%)
Query: 247 AEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP 306
AE EPH L L FP ++A E +RL+++D+ HHR++ +G++L G
Sbjct: 156 AEREPEPH-------LALRFPTGLAATE--DRLYVADTGHHRVLECTHSGRVLRQFGHGN 206
Query: 307 G-FEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADM 348
DG + RP +D LY+ D+ NHA+RR ++
Sbjct: 207 ADLIDGGVGEAAFRRPQGLALERDQ--LYVADTGNHALRRINL 247
>gi|168183170|ref|ZP_02617834.1| NHL repeat protein [Clostridium botulinum Bf]
gi|237796201|ref|YP_002863753.1| NHL repeat-containing protein [Clostridium botulinum Ba4 str. 657]
gi|182673679|gb|EDT85640.1| NHL repeat protein [Clostridium botulinum Bf]
gi|229261965|gb|ACQ52998.1| NHL repeat protein [Clostridium botulinum Ba4 str. 657]
Length = 467
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 55/92 (59%), Gaps = 4/92 (4%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAAS 324
+P IS +E+G + S+ +RI++ D N + + IGS G +DG+F+ ++ P
Sbjct: 178 YPSKISINENG-LIAFSEKGKNRIVILDQNLEKIKIIGSGKYGLKDGDFKEAQFKAPEGL 236
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ ++ +Y+ D+ENH+IR D+ + +++T+
Sbjct: 237 RFIENK--IYVADTENHSIRECDLEKEIVKTI 266
>gi|187778661|ref|ZP_02995134.1| hypothetical protein CLOSPO_02256 [Clostridium sporogenes ATCC
15579]
gi|187772286|gb|EDU36088.1| NHL repeat protein [Clostridium sporogenes ATCC 15579]
Length = 467
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 55/92 (59%), Gaps = 4/92 (4%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAAS 324
+P IS +E+G + S+ +RI++ D N + + IGS G +DG+F+ ++ P
Sbjct: 178 YPSKISINENG-LIAFSEKGRNRIVILDQNLEKIKTIGSSEYGLKDGDFKEAQFKAPEGL 236
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ ++ +Y+ D+ENH+IR D+ + +++T+
Sbjct: 237 RFIENK--IYVADTENHSIRECDLEKEMVKTI 266
>gi|195456364|ref|XP_002075106.1| GK23381 [Drosophila willistoni]
gi|194171191|gb|EDW86092.1| GK23381 [Drosophila willistoni]
Length = 721
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 55/97 (56%), Gaps = 2/97 (2%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FP I+ + G+R ++D+ ++R++V +G + IG GF DG+ +S+ P
Sbjct: 228 LRFPAKIARSQLGHRYAIADAGNNRVLVVLNDGVVEHKIGGFESGFIDGDLTTSRFNNPQ 287
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
+ D+D L + D++NHA+R+ + ++ET+ T
Sbjct: 288 GIAF-LDEDTLIVADTKNHALRQISLQGGIVETLAGT 323
>gi|434381753|ref|YP_006703536.1| tetratricopeptide repeat-containing protein [Brachyspira pilosicoli
WesB]
gi|404430402|emb|CCG56448.1| tetratricopeptide repeat-containing protein [Brachyspira pilosicoli
WesB]
Length = 700
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 9/90 (10%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P + AD G RL++SD+ ++RI+VFD NG LD G G +GEF S PA
Sbjct: 262 PSSLYADNGG-RLYVSDTGNNRIVVFDVNGNYLDSFGKM-GENEGEFFS-----PAGIV- 313
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETV 356
D++ +Y+ D N +++ D+ + T+
Sbjct: 314 -VDNNYIYVADMGNKRVQKFDLNGNYISTI 342
>gi|21242830|ref|NP_642412.1| hypothetical protein XAC2091 [Xanthomonas axonopodis pv. citri str.
306]
gi|21108317|gb|AAM36948.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 499
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 52/103 (50%), Gaps = 12/103 (11%)
Query: 247 AEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP 306
AE EPH L L FP ++A E +RL+++D+ HHR++ +G++L G
Sbjct: 183 AEREPEPH-------LALRFPTGLAATE--DRLYVADTGHHRVLECTHSGRVLRQFGHGN 233
Query: 307 G-FEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADM 348
DG + RP +D LY+ D+ NHA+RR ++
Sbjct: 234 ADLIDGGVGEAAFRRPQGLALERDQ--LYVADTGNHALRRINL 274
>gi|404476394|ref|YP_006707825.1| hypothetical protein B2904_orf1746 [Brachyspira pilosicoli B2904]
gi|404437883|gb|AFR71077.1| tetratricopeptide repeat-containing protein [Brachyspira pilosicoli
B2904]
Length = 700
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 9/90 (10%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P + AD G RL++SD+ ++RI+VFD NG LD G G +GEF S PA
Sbjct: 262 PSSLYADNGG-RLYVSDTGNNRIVVFDVNGNYLDSFGKM-GENEGEFFS-----PAGIV- 313
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETV 356
D++ +Y+ D N +++ D+ + T+
Sbjct: 314 -VDNNYIYVADMGNKRVQKFDLNGNYISTI 342
>gi|300871685|ref|YP_003786558.1| hypothetical protein BP951000_2080 [Brachyspira pilosicoli 95/1000]
gi|300689386|gb|ADK32057.1| tetratricopeptide repeat family protein [Brachyspira pilosicoli
95/1000]
Length = 657
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 9/90 (10%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P + AD G RL++SD+ ++RI+VFD NG LD G G +GEF S PA
Sbjct: 219 PSSLYADNGG-RLYVSDTGNNRIVVFDVNGNYLDSFGKM-GENEGEFFS-----PAGIV- 270
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETV 356
D++ +Y+ D N +++ D+ + T+
Sbjct: 271 -VDNNYIYVADMGNKRVQKFDLNGNYISTI 299
>gi|319786925|ref|YP_004146400.1| NHL repeat containing protein [Pseudoxanthomonas suwonensis 11-1]
gi|317465437|gb|ADV27169.1| NHL repeat containing protein [Pseudoxanthomonas suwonensis 11-1]
Length = 469
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 67/141 (47%), Gaps = 18/141 (12%)
Query: 218 LNKAVEELIMQQQEN-SSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESG 276
LN ++ + EN + P G + E + EP L L FP ++A S
Sbjct: 129 LNGDLDRALSALAENVAPHPDG-----GRLVETVPEPR-------LSLRFPSAVAA--SA 174
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPAASFYHKDDDCLYI 335
+RLF++DS HHRI+ +G++L G DG + + RP + + LY+
Sbjct: 175 DRLFIADSGHHRILECTHSGRVLRQFGLGTADSMDGPHDQAAFNRPQGLALER--EVLYV 232
Query: 336 VDSENHAIRRADMGRRVLETV 356
D+ NH++RR ++ ++T+
Sbjct: 233 ADTGNHSLRRINLLNGTVDTL 253
>gi|424835068|ref|ZP_18259738.1| NHL repeat-containing protein [Clostridium sporogenes PA 3679]
gi|365978195|gb|EHN14287.1| NHL repeat-containing protein [Clostridium sporogenes PA 3679]
Length = 467
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 53/92 (57%), Gaps = 4/92 (4%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAAS 324
+P IS +E+G + S+ +RI++ D N + + IGS G +DG+F ++ P
Sbjct: 178 YPSKISINENG-LIAFSEKGKNRIVILDQNLEKIKTIGSSEYGLKDGDFREAQFKAPEGL 236
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ K+ +Y+ D+ENH IR D+ + +++T+
Sbjct: 237 RFIKNK--IYVADTENHFIRECDLEKEMVKTI 266
>gi|424868210|ref|ZP_18291970.1| NHL repeat domain protein [Leptospirillum sp. Group II 'C75']
gi|124516337|gb|EAY57845.1| NHL repeat domain protein [Leptospirillum rubarum]
gi|387221511|gb|EIJ76065.1| NHL repeat domain protein [Leptospirillum sp. Group II 'C75']
Length = 494
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 69/158 (43%), Gaps = 25/158 (15%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFD------GNGKILDCIG-SCPGFEDGEFESS 316
L FP I D SG ++ +SDS +HR+++ D + + IG PG DG F+ +
Sbjct: 176 LRFPAKI--DISGKKVLVSDSGNHRLLLLDWEEHSPEKAALREVIGQGTPGSADGSFDQA 233
Query: 317 KLMRPAASFYHKDD-DCLYIVDSENHAIRRADMGRRVLETVY-----------PTSGISK 364
+ P + DD D + D+ NH +RR D R + T+ P +S
Sbjct: 234 QFRDPQGIRFCPDDPDIAIVADTGNHLLRRVDFRRGSVTTIAGTGVQGWAIFEPVPAMSA 293
Query: 365 KNNSLWAWIMEKLGF---ERDNDTKSEKLDPQSL-IFP 398
NS W + + G + + +LDP+ IFP
Sbjct: 294 VLNSPWDILFHRDGMLYVAQAGPHQIIRLDPERQEIFP 331
>gi|407792655|ref|ZP_11139692.1| NHL repeat containing protein [Idiomarina xiamenensis 10-D-4]
gi|407217768|gb|EKE87600.1| NHL repeat containing protein [Idiomarina xiamenensis 10-D-4]
Length = 510
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/156 (23%), Positives = 69/156 (44%), Gaps = 22/156 (14%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAAS 324
F A +G+ + ++D+ HH+I+V D +GK + IG+ F DG ++ P
Sbjct: 198 FAAPGKAVSNGDYIAVADTLHHQIVVTDKHGKTVQRIGTGNAAFNDGSASAASFNSPQGL 257
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDND 384
+ + LY+ D+ NHA+R D+ + TV T + + N+
Sbjct: 258 AFA--EQTLYVADTGNHALRAIDLSSWQVTTVAGTGQLGRNYNA---------------- 299
Query: 385 TKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLW 420
S K +L PW + +DD ++ + + + +W
Sbjct: 300 --SGKATAVALRSPWDITMYQDDVIIAMAGTHQ-IW 332
>gi|170755613|ref|YP_001782372.1| NHL repeat-containing protein [Clostridium botulinum B1 str. Okra]
gi|429244666|ref|ZP_19208103.1| NHL repeat-containing protein [Clostridium botulinum CFSAN001628]
gi|169120825|gb|ACA44661.1| NHL repeat protein [Clostridium botulinum B1 str. Okra]
gi|428758305|gb|EKX80740.1| NHL repeat-containing protein [Clostridium botulinum CFSAN001628]
Length = 467
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 53/92 (57%), Gaps = 4/92 (4%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAAS 324
+P IS +E G + S+ +RI++ D N + + IGS G +DG+F+ ++ P
Sbjct: 178 YPSKISINEDG-LIAFSEKGKNRIVILDQNLEKIKTIGSSKYGLKDGDFKEAQFKAPEGL 236
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ ++ +Y+ D+ENH IR D+ + +++T+
Sbjct: 237 RFIENK--IYVADTENHFIRECDLEKEIVKTI 266
>gi|325917587|ref|ZP_08179787.1| NHL repeat protein [Xanthomonas vesicatoria ATCC 35937]
gi|325536221|gb|EGD08017.1| NHL repeat protein [Xanthomonas vesicatoria ATCC 35937]
Length = 472
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 18/134 (13%)
Query: 216 GMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADES 275
G L+KA+ L QQ S + Q +EP L L FP ++A E
Sbjct: 131 GDLDKALAALCEGQQPPSDAD--------LQGIAEREPEP-----RLALRFPTGLAATE- 176
Query: 276 GNRLFLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPAASFYHKDDDCLY 334
+RL+++D+ HHR++ G++L G DG + RP +D LY
Sbjct: 177 -DRLYIADTGHHRVLECTHGGRVLRQFGHGNADLIDGGTGEAAFRRPQGLALERDQ--LY 233
Query: 335 IVDSENHAIRRADM 348
+ D+ NHA+RR ++
Sbjct: 234 VADTGNHALRRINL 247
>gi|226950178|ref|YP_002805269.1| NHL repeat-containing protein [Clostridium botulinum A2 str. Kyoto]
gi|226843692|gb|ACO86358.1| NHL repeat protein [Clostridium botulinum A2 str. Kyoto]
Length = 467
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 53/92 (57%), Gaps = 4/92 (4%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAAS 324
+P IS +E G + S+ +RI++ D N + + IGS G +DG+F+ ++ P
Sbjct: 178 YPSKISINEDG-LIAFSEKGKNRIVILDQNLEKIKTIGSSEYGLKDGDFKEAQFKAPEGL 236
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ ++ +Y+ D+ENH IR D+ + +++T+
Sbjct: 237 RFIENK--IYVADTENHFIRECDLEKEIVKTI 266
>gi|153938821|ref|YP_001392044.1| NHL repeat-containing protein [Clostridium botulinum F str.
Langeland]
gi|384463040|ref|YP_005675635.1| NHL repeat protein [Clostridium botulinum F str. 230613]
gi|152934717|gb|ABS40215.1| NHL repeat protein [Clostridium botulinum F str. Langeland]
gi|295320057|gb|ADG00435.1| NHL repeat protein [Clostridium botulinum F str. 230613]
Length = 467
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 53/92 (57%), Gaps = 4/92 (4%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAAS 324
+P IS +E G + S+ +RI++ D N + + IGS G +DG+F+ ++ P
Sbjct: 178 YPSKISINEDG-LIAFSEKGKNRIVILDQNLEKIKTIGSSEYGLKDGDFKEAQFKAPEGL 236
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ ++ +Y+ D+ENH IR D+ + +++T+
Sbjct: 237 RFIENK--IYVADTENHFIRECDLEKEIVKTI 266
>gi|159470937|ref|XP_001693613.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283116|gb|EDP08867.1| predicted protein [Chlamydomonas reinhardtii]
Length = 480
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 50/109 (45%), Gaps = 23/109 (21%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAA 323
L +PG ++ D SG RLF+SDSN+HR+++ D G ++ A
Sbjct: 234 LRYPGKLATDLSGGRLFVSDSNNHRVVITDLAGNFIE---------------------AG 272
Query: 324 SFYHKDDDCLYIVDSENHA--IRRADMGRRVLETVYPTSGISKKNNSLW 370
Y + LY+ D+ENHA + A G + + G ++ NS W
Sbjct: 273 LAYSPRRNTLYVADTENHATVVTLAGNGTKGRDYRGGKGGSAQPLNSPW 321
>gi|386773539|ref|ZP_10095917.1| NHL repeat protein [Brachybacterium paraconglomeratum LC44]
Length = 634
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/221 (22%), Positives = 90/221 (40%), Gaps = 27/221 (12%)
Query: 218 LNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGN 277
L++ V EL+ + + + S G P + L FP ++ G
Sbjct: 147 LDRLVAELVAEGEADGSLRRG------------PAPTVLAERSEQTLRFPSKLAVLPDG- 193
Query: 278 RLFLSDSNHHRIIVFDGNGKILDC-IGSCP-GFEDGEFESSKLMRP------AASFYHKD 329
RL +SD+ HR++VF +G +D IG+ G DG+ ++++ P A H+
Sbjct: 194 RLVVSDAGQHRLVVFQADGATVDAIIGTGERGHADGDEDTAQFAEPNGVLALPAEVAHEV 253
Query: 330 DDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEK 389
L + D+ H +R +G+ L T+ ++ + W+ + + D ++
Sbjct: 254 GYDLLVADTAGHRLRGVKIGQDRLLRSRTTTEVTTLAGTGEQWMQGEPLPRGEGDART-- 311
Query: 390 LDPQSLIFPWHLMKSEDDNLLIINRS-FETLWIMDLASGEI 429
SL PW L S N +I + LW D +G +
Sbjct: 312 ---YSLSTPWDLTWSHSLNRAVIAMAGIHQLWTYDPVTGAL 349
>gi|58581916|ref|YP_200932.1| hypothetical protein XOO2293 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58426510|gb|AAW75547.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 472
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 85/190 (44%), Gaps = 28/190 (14%)
Query: 166 RLVEMLMKEYITFPILLSN-----KNFPQMENGACYLLSKDFGNARVFHENSLDI-GMLN 219
R ++ L ++ PILL + F +E+ +L +G R + + + G L+
Sbjct: 79 RALKQLRGHGVSAPILLDKDWETWRRF-GIESWPTLVLLDAYGRER---QRLVGVTGDLD 134
Query: 220 KAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRL 279
KA+ L QQ S + AE EPH L L FP ++A E + L
Sbjct: 135 KALTALCEGQQPPSDA------DLYGVAERDPEPH-------LALRFPTGLAATE--DLL 179
Query: 280 FLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
+++D+ HHR++ +G++L G DG + RP +D LY+ D+
Sbjct: 180 YVADTGHHRVLECTHSGRVLRQFGHGNADLIDGGVGEAAFRRPQGLALERDQ--LYVADT 237
Query: 339 ENHAIRRADM 348
NHA+RR ++
Sbjct: 238 GNHALRRINL 247
>gi|84623828|ref|YP_451200.1| hypothetical protein XOO_2171 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|84367768|dbj|BAE68926.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 499
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 85/190 (44%), Gaps = 28/190 (14%)
Query: 166 RLVEMLMKEYITFPILLSN-----KNFPQMENGACYLLSKDFGNARVFHENSLDI-GMLN 219
R ++ L ++ PILL + F +E+ +L +G R + + + G L+
Sbjct: 106 RALKQLRGHGVSAPILLDKDWETWRRF-GIESWPTLVLLDAYGRER---QRLVGVTGDLD 161
Query: 220 KAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRL 279
KA+ L QQ S + AE EPH L L FP ++A E + L
Sbjct: 162 KALTALCEGQQPPSDA------DLYGVAERDPEPH-------LALRFPTGLAATE--DLL 206
Query: 280 FLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
+++D+ HHR++ +G++L G DG + RP +D LY+ D+
Sbjct: 207 YVADTGHHRVLECTHSGRVLRQFGHGNADLIDGGVGEAAFRRPQGLALERDQ--LYVADT 264
Query: 339 ENHAIRRADM 348
NHA+RR ++
Sbjct: 265 GNHALRRINL 274
>gi|188576512|ref|YP_001913441.1| hypothetical protein PXO_00583 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|188520964|gb|ACD58909.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
PXO99A]
Length = 472
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 85/190 (44%), Gaps = 28/190 (14%)
Query: 166 RLVEMLMKEYITFPILLSN-----KNFPQMENGACYLLSKDFGNARVFHENSLDI-GMLN 219
R ++ L ++ PILL + F +E+ +L +G R + + + G L+
Sbjct: 79 RALKQLRGHGVSAPILLDKDWETWRRF-GIESWPTLVLLDAYGRER---QRLVGVTGDLD 134
Query: 220 KAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRL 279
KA+ L QQ S + AE EPH L L FP ++A E + L
Sbjct: 135 KALTALCEGQQPPSDA------DLYGVAERDPEPH-------LALRFPTGLAATE--DLL 179
Query: 280 FLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
+++D+ HHR++ +G++L G DG + RP +D LY+ D+
Sbjct: 180 YVADTGHHRVLECTHSGRVLRQFGHGNADLIDGGVGEAAFRRPQGLALERDQ--LYVADT 237
Query: 339 ENHAIRRADM 348
NHA+RR ++
Sbjct: 238 GNHALRRINL 247
>gi|108761192|ref|YP_633033.1| lipoprotein [Myxococcus xanthus DK 1622]
gi|108465072|gb|ABF90257.1| putative lipoprotein [Myxococcus xanthus DK 1622]
Length = 868
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 1/85 (1%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAA 323
L+ P I+AD +GN ++++D++H+ I D GK+ G PG DG + + +P
Sbjct: 629 LNGPMGIAADAAGN-VYVADTDHYVIRRIDPEGKVEVFAGGTPGLMDGPAKQAAFNQPTG 687
Query: 324 SFYHKDDDCLYIVDSENHAIRRADM 348
D L + D N IRR D+
Sbjct: 688 LAVTPDGTALLVADMNNGVIRRIDL 712
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 53/102 (51%), Gaps = 4/102 (3%)
Query: 245 KQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRII-VFDGNGKILDCIG 303
++ +++ E H ++++ L+ P ++ GN LF+ +S R++ + DG ++ G
Sbjct: 708 RRIDLVAEGHPVTTLQGDWLYRPSGVAVSADGNTLFVVESGMSRVVRIRDGLTSVV--AG 765
Query: 304 SCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRR 345
+ PGF DG ESS+ + P D L + D N+ +RR
Sbjct: 766 TTPGFRDGAPESSQFL-PYLGIAVLKDGSLAVSDPGNYRVRR 806
>gi|325923856|ref|ZP_08185461.1| NHL repeat protein [Xanthomonas gardneri ATCC 19865]
gi|325545661|gb|EGD16910.1| NHL repeat protein [Xanthomonas gardneri ATCC 19865]
Length = 472
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 5/89 (5%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLM 319
L L FP ++A E +RL+++D+ HHR++ G++L G DG +
Sbjct: 163 RLELQFPTGLAATE--DRLYIADTGHHRVLECTHGGRVLRQFGHGNADLIDGGVGEAAFR 220
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADM 348
RP +D LY+ D+ NHA+RR ++
Sbjct: 221 RPQGLALERDQ--LYVADTGNHALRRINL 247
>gi|380511998|ref|ZP_09855405.1| nhl repeat protein [Xanthomonas sacchari NCPPB 4393]
Length = 473
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 7/103 (6%)
Query: 248 EVLKEPHACSSVRNLLLHFP-GCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SC 305
E L++ L L FP G + AD+ L+++DS HHRI+ NG++L G
Sbjct: 151 EDLRDARETQPEPRLPLRFPTGLVVADDL---LYVADSGHHRILECSTNGRVLRQFGIGT 207
Query: 306 PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADM 348
F DG + RP + LY+ D+ NHA+RR ++
Sbjct: 208 ADFIDGAIGEAAFHRPRGLALER--GVLYVADTGNHALRRINL 248
>gi|338536093|ref|YP_004669427.1| putative lipoprotein [Myxococcus fulvus HW-1]
gi|337262189|gb|AEI68349.1| putative lipoprotein [Myxococcus fulvus HW-1]
Length = 855
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 43/85 (50%), Gaps = 1/85 (1%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAA 323
L+ P I+ D +GN ++++D++H+ I D GK+ G PG +DG + + +P
Sbjct: 616 LNGPMGIAVDAAGN-VYVADTDHYVIRRIDVEGKVEVFAGGTPGLQDGPAKQAAFNQPTG 674
Query: 324 SFYHKDDDCLYIVDSENHAIRRADM 348
D L + D N IRR D+
Sbjct: 675 MTVTPDGTALLVADMNNGVIRRIDL 699
>gi|384428019|ref|YP_005637378.1| hypothetical protein XCR_2380 [Xanthomonas campestris pv. raphani
756C]
gi|341937121|gb|AEL07260.1| conserved hypothetical protein [Xanthomonas campestris pv. raphani
756C]
Length = 472
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 48/89 (53%), Gaps = 5/89 (5%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLM 319
L L FP ++A E +RL+++D+ HHRI+ +G++L G DG +
Sbjct: 163 RLALRFPTGLAATE--DRLYIADTGHHRILECTHSGRVLRQFGHGNADLIDGGVGEAAFR 220
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADM 348
RP +++ LY+ D+ NHA+RR ++
Sbjct: 221 RPQGLALEREE--LYVADTGNHALRRINL 247
>gi|21231560|ref|NP_637477.1| hypothetical protein XCC2116 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66768320|ref|YP_243082.1| hypothetical protein XC_1999 [Xanthomonas campestris pv. campestris
str. 8004]
gi|21113245|gb|AAM41401.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66573652|gb|AAY49062.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 499
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 86/190 (45%), Gaps = 28/190 (14%)
Query: 166 RLVEMLMKEYITFPILLSN-----KNFPQMENGACYLLSKDFGNARVFHENSLDI-GMLN 219
R ++ L ++ PILL + F +E+ +L +G R + + + G L+
Sbjct: 106 RSLKQLRGHGVSAPILLDKDWETWRRF-GIESWPTLVLLDAYGRER---QRLVGVTGDLD 161
Query: 220 KAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRL 279
KA+ L QQ S + AE EP L L FP ++A E +RL
Sbjct: 162 KALAALCEGQQPPSDA------DLHGVAERDPEP-------RLALRFPTGLAATE--DRL 206
Query: 280 FLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
+++D+ HHRI+ +G++L G DG + RP +++ LY+ D+
Sbjct: 207 YIADTGHHRILECTHSGRVLRQFGHGNADLIDGGVGEAAFRRPQGLALEREE--LYVADT 264
Query: 339 ENHAIRRADM 348
NHA+RR ++
Sbjct: 265 GNHALRRINL 274
>gi|168180827|ref|ZP_02615491.1| NHL repeat protein [Clostridium botulinum NCTC 2916]
gi|421838034|ref|ZP_16272034.1| NHL repeat-containing protein [Clostridium botulinum CFSAN001627]
gi|182668214|gb|EDT80193.1| NHL repeat protein [Clostridium botulinum NCTC 2916]
gi|409739668|gb|EKN40278.1| NHL repeat-containing protein [Clostridium botulinum CFSAN001627]
Length = 467
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 54/92 (58%), Gaps = 4/92 (4%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAAS 324
+P S +E+G + S+ +RI++ D N + + IGS G +DG+F+ ++ P
Sbjct: 178 YPSKASINENG-LIAFSEKGKNRIVILDQNLEKIKIIGSGEYGLKDGDFKEAQFKAPEGL 236
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ ++ +Y+ D+ENH+IR D+ + +++T+
Sbjct: 237 RFIENK--IYVADTENHSIRECDLEKEIVKTI 266
>gi|321469552|gb|EFX80532.1| hypothetical protein DAPPUDRAFT_304106 [Daphnia pulex]
Length = 689
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 47/90 (52%), Gaps = 4/90 (4%)
Query: 272 ADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPAASFYHKDD 330
+ E + +SD+ HHRI++ G+I IG PG+ +G F ++K P +
Sbjct: 200 SKEDSTYMAVSDTGHHRILIISLQGEIKVTIGGPTPGWNEGAFSTAKFRSPQGIAFLPP- 258
Query: 331 DC-LYIVDSENHAIRRADMGRRVLETVYPT 359
C LY+ D+ NH+IR D + ++TV T
Sbjct: 259 -CHLYVADTGNHSIRLIDFSTKQVKTVAGT 287
>gi|387819032|ref|YP_005679379.1| hypothetical protein H04402_02846 [Clostridium botulinum H04402
065]
gi|322807076|emb|CBZ04650.1| gll0387 protein [Clostridium botulinum H04402 065]
Length = 467
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 54/92 (58%), Gaps = 4/92 (4%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAAS 324
+P S +E+G + S+ +RI++ D N + + IGS G +DG+F+ ++ P
Sbjct: 178 YPSKASINENG-LIAFSEKGKNRIVILDQNLEKIKIIGSGEYGLKDGDFKEAQFKAPEGL 236
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ ++ +Y+ D+ENH+IR D+ + +++T+
Sbjct: 237 RFIENK--IYVADTENHSIRECDLEKEIVKTI 266
>gi|188991459|ref|YP_001903469.1| hypothetical protein xccb100_2064 [Xanthomonas campestris pv.
campestris str. B100]
gi|167733219|emb|CAP51417.1| Conserved hypothetical protein [Xanthomonas campestris pv.
campestris]
Length = 481
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 48/89 (53%), Gaps = 5/89 (5%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLM 319
L L FP ++A E +RL+++D+ HHRI+ +G++L G DG +
Sbjct: 172 RLALRFPTGLAATE--DRLYIADTGHHRILECTHSGRVLRQFGHGNADLIDGGVGEAAFR 229
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADM 348
RP +++ LY+ D+ NHA+RR ++
Sbjct: 230 RPQGLALEREE--LYVADTGNHALRRINL 256
>gi|148380724|ref|YP_001255265.1| NHL repeat protein [Clostridium botulinum A str. ATCC 3502]
gi|153933286|ref|YP_001385008.1| NHL repeat-containing protein [Clostridium botulinum A str. ATCC
19397]
gi|153935024|ref|YP_001388478.1| NHL repeat-containing protein [Clostridium botulinum A str. Hall]
gi|148290208|emb|CAL84327.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
3502]
gi|152929330|gb|ABS34830.1| NHL repeat protein [Clostridium botulinum A str. ATCC 19397]
gi|152930938|gb|ABS36437.1| NHL repeat protein [Clostridium botulinum A str. Hall]
Length = 467
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 54/92 (58%), Gaps = 4/92 (4%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPAAS 324
+P S +E+G + S+ +RI++ D N + + IGS G +DG+F+ ++ P
Sbjct: 178 YPSKASINENG-LIAFSEKGKNRIVILDQNLEKIKIIGSGEYGLKDGDFKEAQFKAPEGL 236
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ ++ +Y+ D+ENH+IR D+ + +++T+
Sbjct: 237 RFIENK--IYVADTENHSIRECDLEKEIVKTI 266
>gi|194877446|ref|XP_001973882.1| GG16549 [Drosophila erecta]
gi|190657069|gb|EDV54282.1| GG16549 [Drosophila erecta]
Length = 717
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 53/97 (54%), Gaps = 3/97 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FP I +G R ++D+ ++R++V G G + IG PGF DG +++ P
Sbjct: 225 LRFPAKIVRSPNG-RYAIADTGNNRVLVLTGGGVVQHKIGGYQPGFVDGNSTAARFNNPQ 283
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
+ +++ L + D++NHAIR+ + ++ET+ T
Sbjct: 284 GIAF-LNENTLIVADTKNHAIRQISLTNGMVETLAGT 319
>gi|328717168|ref|XP_001944227.2| PREDICTED: NHL repeat-containing protein 2-like [Acyrthosiphon
pisum]
Length = 943
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 52/104 (50%), Gaps = 14/104 (13%)
Query: 281 LSDSNHHRIIVFDGNGKILDCIG----------SCPGFEDGEFESSKLMRPAASFYHKDD 330
++DS HHRI++ G++ IG + GF+DG F + P + ++
Sbjct: 298 IADSGHHRIVISTIQGRVKHVIGGGGTGLFSSTTKKGFKDGNFTEALFHSPQGICF-QNS 356
Query: 331 DCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIM 374
L++ D+ENHAIR D+ + ++TV +G KK + + M
Sbjct: 357 HILFVCDTENHAIRMIDLKEKTVKTV---AGNGKKGHDKYGGQM 397
>gi|433676068|ref|ZP_20508221.1| RING finger protein nhl-1 [Xanthomonas translucens pv. translucens
DSM 18974]
gi|430818813|emb|CCP38486.1| RING finger protein nhl-1 [Xanthomonas translucens pv. translucens
DSM 18974]
Length = 473
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 49/103 (47%), Gaps = 7/103 (6%)
Query: 248 EVLKEPHACSSVRNLLLHFP-GCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SC 305
E L++ L L FP G AD+ RL+++DS HHRI+ G++L G
Sbjct: 151 EELRDARETQPEPRLSLRFPFGLALADD---RLYIADSGHHRIVECTSGGRMLRQFGLGT 207
Query: 306 PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADM 348
F DG + RP + LY+ D+ NHA+RR ++
Sbjct: 208 ADFIDGGIGEAAFHRPRGLALER--GVLYVADTGNHALRRINL 248
>gi|302552781|ref|ZP_07305123.1| NHL repeat containing protein [Streptomyces viridochromogenes DSM
40736]
gi|302470399|gb|EFL33492.1| NHL repeat containing protein [Streptomyces viridochromogenes DSM
40736]
Length = 605
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 66/141 (46%), Gaps = 18/141 (12%)
Query: 253 PHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK-ILDCIGS-CPGFED 310
P+ +L FPG SGN +SD+ H+++ G+G+ ++ IGS GF D
Sbjct: 167 PYVAPEPEPTVLRFPGKALLLPSGN-FLVSDTTRHQLVELAGDGESVVRRIGSGARGFAD 225
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVY----------PTS 360
G + + P DD + + D+ NHA+RR D+ + T+ PTS
Sbjct: 226 GSADEAAFNEPQG-LTLLDDGSVVVADTVNHALRRLDLATGEVTTLAGTGRQWWQGSPTS 284
Query: 361 GISKKNN--SLW--AWIMEKL 377
G +++ + S W AW +K+
Sbjct: 285 GPAREVDLSSPWDVAWWQDKV 305
>gi|440733650|ref|ZP_20913345.1| nhl repeat protein [Xanthomonas translucens DAR61454]
gi|440359672|gb|ELP96967.1| nhl repeat protein [Xanthomonas translucens DAR61454]
Length = 473
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 49/103 (47%), Gaps = 7/103 (6%)
Query: 248 EVLKEPHACSSVRNLLLHFP-GCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SC 305
E L++ L L FP G AD+ RL+++DS HHRI+ G++L G
Sbjct: 151 EELRDARETQPEPRLSLRFPFGLALADD---RLYIADSGHHRIVECTSGGRMLRQFGLGT 207
Query: 306 PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADM 348
F DG + RP + LY+ D+ NHA+RR ++
Sbjct: 208 ADFIDGGIGEAAFHRPRGLALER--GVLYVADTGNHALRRINL 248
>gi|302036371|ref|YP_003796693.1| hypothetical protein NIDE1006 [Candidatus Nitrospira defluvii]
gi|300604435|emb|CBK40767.1| conserved protein of unknown function, contains NHL repeats
[Candidatus Nitrospira defluvii]
Length = 404
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 7/101 (6%)
Query: 260 RNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDG-NGKILDCIGSCPG---FEDGEFES 315
R L+FP ++ D +GN L+++D+ +HR+ DG G I + G+ + G S
Sbjct: 142 RRARLNFPSAVAVDRAGN-LYIADTMNHRVRKVDGATGIITNVAGTGQARYSGDGGPAVS 200
Query: 316 SKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+ + P D+ LYI D N+ +RR D+ V+ TV
Sbjct: 201 AAINEPTG--LAVSDEALYIADQSNNRVRRVDLATGVITTV 239
>gi|290971766|ref|XP_002668650.1| predicted protein [Naegleria gruberi]
gi|284082136|gb|EFC35906.1| predicted protein [Naegleria gruberi]
Length = 728
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 62/122 (50%), Gaps = 9/122 (7%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GF--EDGEFESSKLMR 320
L+FP C+ S N ++++D +HRI NG I+ G+ P GF ++G +++L
Sbjct: 122 LNFPRCVFV-SSNNEVYIADQINHRIRKILENGNIVTIAGNGPYGFCGDNGLATNAQLNS 180
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRA-DMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
PA F +++ +YI D +NH IR+ + G V T+G S N EKL F
Sbjct: 181 PAGVFVSSNNE-IYIADYDNHRIRKILENGNIVTIAGKGTAGFSGDN---GLATKEKLNF 236
Query: 380 ER 381
R
Sbjct: 237 PR 238
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 48/85 (56%), Gaps = 5/85 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GF--EDGEFESSKLMR 320
L+FP C+ S N ++++D +HRI NG I+ G+ P GF ++G +++L
Sbjct: 234 LNFPRCVFV-SSNNEVYIADQINHRIRKILENGNIVTIAGNGPYGFCGDNGLATNAQLNS 292
Query: 321 PAASFYHKDDDCLYIVDSENHAIRR 345
PA F +++ +YI + NH IR+
Sbjct: 293 PAGVFVSSNNE-IYIAEYGNHRIRK 316
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 275 SGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGF--EDGEFESSKLMRPAASFYHKDDD 331
S N ++++D +HRI NG I+ G+ GF ++G ++L P F +++
Sbjct: 20 SNNEVYIADYGNHRIRKILENGNIVTIAGNGTAGFSGDNGIATKAQLNGPVGVFVSSNNE 79
Query: 332 CLYIVDSENHAIRRA-DMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFER 381
+YI D +NH IR+ + G V+ T+G S N EKL F R
Sbjct: 80 -VYIADYDNHRIRKILENGNIVIIAGKGTAGFSGDN---GLATKEKLNFPR 126
>gi|424792718|ref|ZP_18218917.1| hypothetical protein XTG29_01651 [Xanthomonas translucens pv.
graminis ART-Xtg29]
gi|422796954|gb|EKU25368.1| hypothetical protein XTG29_01651 [Xanthomonas translucens pv.
graminis ART-Xtg29]
Length = 473
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 49/103 (47%), Gaps = 7/103 (6%)
Query: 248 EVLKEPHACSSVRNLLLHFP-GCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SC 305
E L++ L L FP G AD+ RL+++DS HHRI+ G++L G
Sbjct: 151 EELRDARETQPEPRLSLRFPVGLALADD---RLYIADSGHHRILECTTGGRVLRQFGLGT 207
Query: 306 PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADM 348
F DG + RP + LY+ D+ NHA+RR ++
Sbjct: 208 ADFIDGGIGEAAFHRPRGLALER--GVLYVADTGNHALRRINL 248
>gi|384419507|ref|YP_005628867.1| hypothetical protein XOC_2585 [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353462420|gb|AEQ96699.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 472
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 84/190 (44%), Gaps = 28/190 (14%)
Query: 166 RLVEMLMKEYITFPILLSN-----KNFPQMENGACYLLSKDFGNARVFHENSLDI-GMLN 219
R ++ L ++ PILL + F +E+ +L +G R + + + G L+
Sbjct: 79 RALKQLRGHGVSAPILLDKDWETWRRF-GIESWPTLVLLDAYGRER---QRLVGVTGDLD 134
Query: 220 KAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRL 279
KA+ L QQ S + AE EPH L L FP ++A E + L
Sbjct: 135 KALTALCEGQQPPSDA------DLHGVAEHDPEPH-------LALRFPTGLAATE--DLL 179
Query: 280 FLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
+++D+ HHR++ +G++L G DG + RP D LY+ D+
Sbjct: 180 YVADTGHHRVLECMHSGRVLRQFGHGNADLIDGGVGEAAFRRPQGLALESDQ--LYVADT 237
Query: 339 ENHAIRRADM 348
NHA+RR ++
Sbjct: 238 GNHALRRINL 247
>gi|330824371|ref|YP_004387674.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
gi|329309743|gb|AEB84158.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
Length = 362
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 44/83 (53%), Gaps = 2/83 (2%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPAASF 325
P ++ + SG ++++D N+HRI +G ++ G GF DG+ +S++ +P
Sbjct: 113 PIAVAVERSGA-VYVADRNNHRIRKIRPDGTVITLAGGDSAGFVDGDLKSARFNQPYGVA 171
Query: 326 YHKDDDCLYIVDSENHAIRRADM 348
LY+ D NHAIRR D+
Sbjct: 172 LDAAQTTLYVADYLNHAIRRIDL 194
>gi|260904571|ref|ZP_05912893.1| NHL repeat-containing protein [Brevibacterium linens BL2]
Length = 647
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 54/116 (46%), Gaps = 9/116 (7%)
Query: 253 PHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK-ILDCIGSCP-GFED 310
P+ L +PG ++ SGN L ++DS HH ++ +D +G+ I+ IG+ G D
Sbjct: 172 PYVPPPAPETELFYPGKVTELPSGN-LLVADSGHHSLVEYDASGQNIIRRIGTGERGAND 230
Query: 311 GEFESSKLMRPAASFYHKDDDC------LYIVDSENHAIRRADMGRRVLETVYPTS 360
G+F S+ P DD L + D+ NH +R ++ + TV T
Sbjct: 231 GDFTSASFSEPGGITVLPDDVAAKAGYHLVVADTVNHTLRGINLDTETVTTVAGTG 286
>gi|241613940|ref|XP_002407473.1| hypothetical protein IscW_ISCW020206 [Ixodes scapularis]
gi|215502822|gb|EEC12316.1| hypothetical protein IscW_ISCW020206 [Ixodes scapularis]
Length = 393
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 16/92 (17%)
Query: 260 RNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLM 319
+N L+FP ++A GN + + D+ +HR+IVF+ G+ L C + PG + +
Sbjct: 294 KNDYLNFPIGVTATSGGN-IIVGDTGNHRVIVFNAEGRPL-CKAALPG-------TVRFR 344
Query: 320 RPAA-------SFYHKDDDCLYIVDSENHAIR 344
RP+A SF KDD C+Y+ IR
Sbjct: 345 RPSAVVALDDGSFAVKDDRCVYLFSKAGEYIR 376
>gi|391330742|ref|XP_003739813.1| PREDICTED: NHL repeat-containing protein 2-like [Metaseiulus
occidentalis]
Length = 685
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 55/100 (55%), Gaps = 6/100 (6%)
Query: 264 LHFPGCISADESGNRLF--LSDSNHHRIIVFDG-NGKILDCIGSC-PGFEDGEFESSKLM 319
L PG I+ D +G+ + +SDS ++RI+VFD + ++ +G+ GF DG + +
Sbjct: 221 LRSPGKIALDYTGDTSYYVVSDSANNRILVFDRFSNEVQLIVGTGEAGFLDGAYGICRFS 280
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
P ++ D +++ D+ NHAIRR D + + TV T
Sbjct: 281 SPQGVCFY--DGGIFVADAGNHAIRRVDFSTKCVSTVVGT 318
>gi|402571883|ref|YP_006621226.1| hypothetical protein Desmer_1348 [Desulfosporosinus meridiei DSM
13257]
gi|402253080|gb|AFQ43355.1| conserved repeat protein [Desulfosporosinus meridiei DSM 13257]
Length = 1977
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 55/102 (53%), Gaps = 9/102 (8%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAA 323
FP ++AD+SGN +++D++ HRI+ +D +G +L+ G G G+F S PA
Sbjct: 45 FFFPNSVAADDSGN-FYVADTSSHRILKYDPSGVLLENWGPY-GTGAGKFYS-----PAG 97
Query: 324 SFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKK 365
+ +Y+ D+ NH I++ + G V T + +SG K
Sbjct: 98 VAVDGAGN-VYVADTSNHRIQKRNAGTGVW-TTWGSSGSGNK 137
>gi|451982091|ref|ZP_21930422.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
gi|451760645|emb|CCQ91702.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
Length = 724
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 78/181 (43%), Gaps = 26/181 (14%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESS---KLM 319
LL +P I DESGN L+++D ++R+ +G I G F G+F + L
Sbjct: 221 LLKYPFGIHVDESGN-LYIADRGNNRVRKVTPDGIITTVAGEGTHFFSGDFGPATRCSLA 279
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
P D LYI D N+ +R+ D ++ TV T KN +
Sbjct: 280 YP-TDVVTDDRGNLYIADRNNNRVRKVDT-NGIITTVMGTG----KNE-----------Y 322
Query: 380 ERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKEAVKGFSKV 439
DN+ SE SL P+ L + D +LLI++R+ + M L + + V G +
Sbjct: 323 NGDNEIASET----SLHLPFALAFTPDQHLLIVDRNHHRVRSMHLVQHTV-QTVAGNGQA 377
Query: 440 L 440
L
Sbjct: 378 L 378
>gi|291000392|ref|XP_002682763.1| predicted protein [Naegleria gruberi]
gi|284096391|gb|EFC50019.1| predicted protein [Naegleria gruberi]
Length = 1260
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRP 321
L +P + +++ LF+SD+ H+RI+ +G + G G +DG F K RP
Sbjct: 919 LFQYPTFL--EKADKHLFISDTFHNRIVKMKPDGHVEYVFGDGFAGLKDGIFSDCKFYRP 976
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
F+ + + + D+ N+ IRR ++ + T+
Sbjct: 977 QGLFHDIQQNQIIVADTNNNTIRRINLNENFVSTL 1011
>gi|153867703|ref|ZP_01997987.1| NHL repeat protein [Beggiatoa sp. SS]
gi|152144978|gb|EDN72014.1| NHL repeat protein [Beggiatoa sp. SS]
Length = 263
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 3/84 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFE-DGEFESSKLMRP 321
L +P I D +GN L+++DS++HRI D G I G PGF DG+ ++ +
Sbjct: 143 LSYPTGIDIDTAGN-LYIADSSNHRIRKVDTTGNITTIAGDGTPGFAGDGKIATAAQLNA 201
Query: 322 AASFYHKDDDCLYIVDSENHAIRR 345
+YI D+ NH IR+
Sbjct: 202 PTQVMVDSTGQVYIADTSNHRIRK 225
>gi|194767578|ref|XP_001965892.1| GF16360 [Drosophila ananassae]
gi|190619368|gb|EDV34892.1| GF16360 [Drosophila ananassae]
Length = 300
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 51/98 (52%), Gaps = 3/98 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPA 322
L FP I+ +G R +SD+ ++R++V G + IG G DG+F ++ P
Sbjct: 45 LRFPAKIARSPAG-RFAISDAGNNRVLVVSSRGLVEHIIGDHKAGLIDGKFTEARFKHPQ 103
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS 360
+ D+ L + D+ENHA+R+ + +++T+ T
Sbjct: 104 GLTF-LDEHTLIVADTENHALRKISLADGIVKTLAGTG 140
>gi|357620506|gb|EHJ72671.1| hypothetical protein KGM_13091 [Danaus plexippus]
Length = 747
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 8/103 (7%)
Query: 264 LHFPGCISADE----SGNRLFL--SDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESS 316
L+FP I+ + G FL SD+ HHR+++ D +G IL +G PGF+DG+ +
Sbjct: 216 LYFPSKIALNPFYRGRGEEPFLAISDTGHHRVLLTDCSGIILRVVGGKTPGFKDGKLTDA 275
Query: 317 KLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
+ P + L + D+ NHA+R + +E + T
Sbjct: 276 QFNSPQGLCW-LSSSVLVVCDTNNHALRAVHLDEGTVEVLAGT 317
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 28/44 (63%)
Query: 316 SKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
+KL P A Y++ + LY+ D+ NH I++ D+G + + T+ PT
Sbjct: 494 AKLQHPMAVAYNEANKTLYVADTYNHKIKKVDVGPQKVSTINPT 537
>gi|195335766|ref|XP_002034534.1| GM21930 [Drosophila sechellia]
gi|194126504|gb|EDW48547.1| GM21930 [Drosophila sechellia]
Length = 784
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NGK+L +G G +DG+F K R A D +++
Sbjct: 675 NRVIVSDSNNHRIQIFDVNGKVLSTVGG-EGSDDGQF---KFPRGVAV---DDQGYIFVA 727
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 728 DSGNNRIQ 735
>gi|115376708|ref|ZP_01463935.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
gi|310822897|ref|YP_003955255.1| hypothetical protein STAUR_5665 [Stigmatella aurantiaca DW4/3-1]
gi|115366257|gb|EAU65265.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
gi|309395969|gb|ADO73428.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
Length = 856
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 54/106 (50%), Gaps = 6/106 (5%)
Query: 247 AEVLKEPHACSSVRNLL--LHFPGCISADESGNRLFLSDSNHH--RIIVFDGNGKILDCI 302
A +K A +R +L L+ P I+AD SGN ++++DS++H R I DG +
Sbjct: 597 ANSVKRVSAMGDIRTILTGLNGPMGIAADASGN-IYVADSDNHCIRRIAPDGTATVFAGA 655
Query: 303 GSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADM 348
PG DG ++++ +PA + + L + D N IRR D+
Sbjct: 656 VMEPGQGDGTAKAARFNQPAGLAFGPGGELL-VADLGNGVIRRVDL 700
>gi|298387997|ref|ZP_06997545.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
gi|298259263|gb|EFI02139.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
Length = 439
Score = 48.1 bits (113), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 46/80 (57%), Gaps = 3/80 (3%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCI-GSCPGFEDGEFESSKLMRPAASF 325
P I+ D+ GN L+++D N+HRI D I+ I GS G++DG+ ++ +P +
Sbjct: 358 PRGIAIDKEGN-LYIADVNNHRIRKVDTKLNIVTTIAGSGKGYKDGDPLEAQFDQPWGVY 416
Query: 326 YHKDDDCLYIVDSENHAIRR 345
K++ LYI D NH IR+
Sbjct: 417 LDKNE-FLYIADQNNHCIRK 435
>gi|260817547|ref|XP_002603647.1| hypothetical protein BRAFLDRAFT_235607 [Branchiostoma floridae]
gi|229288969|gb|EEN59658.1| hypothetical protein BRAFLDRAFT_235607 [Branchiostoma floridae]
Length = 219
Score = 48.1 bits (113), Expect = 0.015, Method: Composition-based stats.
Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 4/82 (4%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEF--ESSKLMRPAA 323
FP ++ D G +F++D+ ++R+ VFDG G G+ G DGEF S+ ++ PA
Sbjct: 11 FPIGVAVDSYGG-IFVTDTGNNRVQVFDGEGHFKFKFGTIGG-RDGEFLRPSAIVVTPAG 68
Query: 324 SFYHKDDDCLYIVDSENHAIRR 345
KDD + + DSE +RR
Sbjct: 69 DIVVKDDKKIQVFDSEGRFVRR 90
>gi|380692433|ref|ZP_09857292.1| NHL repeat-containing protein [Bacteroides faecis MAJ27]
Length = 439
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 46/80 (57%), Gaps = 3/80 (3%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCI-GSCPGFEDGEFESSKLMRPAASF 325
P I+ D+ GN L+++D N+HRI D I+ I GS G++DG+ ++ +P +
Sbjct: 358 PRGIAIDKEGN-LYIADVNNHRIRKVDTKLNIVTTIAGSGKGYKDGDPLEAQFDQPWGVY 416
Query: 326 YHKDDDCLYIVDSENHAIRR 345
K++ LYI D NH IR+
Sbjct: 417 LDKNE-FLYIADQNNHCIRK 435
>gi|427430403|ref|ZP_18920257.1| hypothetical protein C882_1420 [Caenispirillum salinarum AK4]
gi|425878863|gb|EKV27574.1| hypothetical protein C882_1420 [Caenispirillum salinarum AK4]
Length = 498
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 49/95 (51%), Gaps = 4/95 (4%)
Query: 264 LHFPGCISADESG-NRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRP 321
L FPG + R ++D +H+++V + +G++L G+ PG +DG+ E + P
Sbjct: 173 LRFPGKVKPLAGAEGRWAVADGGNHQVLVCESDGRVLARYGTGLPGTDDGQAERAAFTSP 232
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+D +++ D+ NH +RR D+ + TV
Sbjct: 233 QG--LACTEDAIFVADTGNHLVRRIDLCSGAVTTV 265
>gi|345487473|ref|XP_003425698.1| PREDICTED: RING finger protein nhl-1-like [Nasonia vitripennis]
Length = 510
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 52/92 (56%), Gaps = 13/92 (14%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
+C S R L H P I A S NR+ +SDSN+HR+ +FD NG++L GS G ++G+F
Sbjct: 381 SCGSGRGQLEH-PHYI-AVSSTNRVIVSDSNNHRVQIFDVNGRVLTAFGS-EGSDEGQF- 436
Query: 315 SSKLMRPAASFYHKDDDCLYIV--DSENHAIR 344
K R A DD YIV DS N+ I+
Sbjct: 437 --KFPRGVAV-----DDQGYIVVADSGNNRIQ 461
>gi|195029981|ref|XP_001987850.1| GH22138 [Drosophila grimshawi]
gi|193903850|gb|EDW02717.1| GH22138 [Drosophila grimshawi]
Length = 1353
Score = 48.1 bits (113), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NGK+L +G G +DG+F K R A D +++
Sbjct: 1244 NRVIVSDSNNHRIQIFDVNGKVLSTVGG-EGSDDGQF---KFPRGVAV---DDQGYIFVA 1296
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 1297 DSGNNRIQ 1304
>gi|195384463|ref|XP_002050937.1| GJ19924 [Drosophila virilis]
gi|194145734|gb|EDW62130.1| GJ19924 [Drosophila virilis]
Length = 1346
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NGK+L +G G +DG+F K R A D +++
Sbjct: 1237 NRVIVSDSNNHRIQIFDVNGKVLSTVGG-EGSDDGQF---KFPRGVAV---DDQGYIFVA 1289
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 1290 DSGNNRIQ 1297
>gi|221330419|ref|NP_001137707.1| another B-box affiliate, isoform C [Drosophila melanogaster]
gi|220902287|gb|ACL83161.1| another B-box affiliate, isoform C [Drosophila melanogaster]
Length = 1517
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NGK+L +G G +DG+F K R A D +++
Sbjct: 1408 NRVIVSDSNNHRIQIFDVNGKVLSTVGG-EGSDDGQF---KFPRGVAV---DDQGYIFVA 1460
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 1461 DSGNNRIQ 1468
>gi|290991059|ref|XP_002678153.1| serine/threonine protein kinase [Naegleria gruberi]
gi|284091764|gb|EFC45409.1| serine/threonine protein kinase [Naegleria gruberi]
Length = 1397
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 79/162 (48%), Gaps = 22/162 (13%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFE-DGEFESSKL 318
N +P I +S +F+SDS ++RI NG I+ +G+ GF+ D S+ L
Sbjct: 223 NASFSYPQGIYVSQSSGEIFVSDSGNNRIRKILTNGTIVTLVGTVNIGFDSDSSGLSTTL 282
Query: 319 MRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLG 378
P + +K + LYI D+ N+ IR+ D ++ T+ G + N S+ A
Sbjct: 283 NNPTKLWMNKLETELYIADTGNNRIRKYDTLSGMVTTI--AGGGTSDNISIQA------- 333
Query: 379 FERDNDTKSEKLDPQSLIF-----PWHLMKSEDDNLLIINRS 415
T+++ +P+S++ W++++S D + +++++
Sbjct: 334 ------TQAQLYEPRSIMRNPFTEEWYILESRADRISLLSQT 369
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 314 ESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWA 371
+S+ L P FY D+CLY+ DS NH IR+ D V+ T+ +G NN+ +
Sbjct: 162 KSALLNNPTGLFYSAFDECLYVCDSGNHVIRKIDTNTNVITTI---AGNVNTNNTFYG 216
>gi|194881256|ref|XP_001974764.1| GG21942 [Drosophila erecta]
gi|190657951|gb|EDV55164.1| GG21942 [Drosophila erecta]
Length = 1353
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NGK+L +G G +DG+F K R A D +++
Sbjct: 1244 NRVIVSDSNNHRIQIFDVNGKVLSTVGG-EGSDDGQF---KFPRGVAV---DDQGYIFVA 1296
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 1297 DSGNNRIQ 1304
>gi|195487256|ref|XP_002091832.1| GE12016 [Drosophila yakuba]
gi|194177933|gb|EDW91544.1| GE12016 [Drosophila yakuba]
Length = 1354
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NGK+L +G G +DG+F K R A D +++
Sbjct: 1245 NRVIVSDSNNHRIQIFDVNGKVLSTVGG-EGSDDGQF---KFPRGVAV---DDQGYIFVA 1297
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 1298 DSGNNRIQ 1305
>gi|195426818|ref|XP_002061491.1| GK20936 [Drosophila willistoni]
gi|194157576|gb|EDW72477.1| GK20936 [Drosophila willistoni]
Length = 1354
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NGK+L +G G +DG+F K R A D +++
Sbjct: 1245 NRVIVSDSNNHRIQIFDVNGKVLSTVGG-EGSDDGQF---KFPRGVAV---DDQGYIFVA 1297
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 1298 DSGNNRIQ 1305
>gi|22024206|ref|NP_611390.2| another B-box affiliate, isoform A [Drosophila melanogaster]
gi|24655386|ref|NP_725843.1| another B-box affiliate, isoform B [Drosophila melanogaster]
gi|21464288|gb|AAM51947.1| GH06739p [Drosophila melanogaster]
gi|21626957|gb|AAF57612.2| another B-box affiliate, isoform A [Drosophila melanogaster]
gi|21626958|gb|AAF57611.2| another B-box affiliate, isoform B [Drosophila melanogaster]
Length = 1353
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NGK+L +G G +DG+F K R A D +++
Sbjct: 1244 NRVIVSDSNNHRIQIFDVNGKVLSTVGG-EGSDDGQF---KFPRGVAV---DDQGYIFVA 1296
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 1297 DSGNNRIQ 1304
>gi|198457486|ref|XP_001360687.2| GA13495 [Drosophila pseudoobscura pseudoobscura]
gi|198135998|gb|EAL25262.2| GA13495 [Drosophila pseudoobscura pseudoobscura]
Length = 1349
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NGK+L +G G +DG+F K R A D +++
Sbjct: 1240 NRVIVSDSNNHRIQIFDVNGKVLSTVGG-EGSDDGQF---KFPRGVAV---DDQGYIFVA 1292
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 1293 DSGNNRIQ 1300
>gi|195150695|ref|XP_002016286.1| GL11502 [Drosophila persimilis]
gi|194110133|gb|EDW32176.1| GL11502 [Drosophila persimilis]
Length = 1349
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NGK+L +G G +DG+F K R A D +++
Sbjct: 1240 NRVIVSDSNNHRIQIFDVNGKVLSTVGG-EGSDDGQF---KFPRGVAV---DDQGYIFVA 1292
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 1293 DSGNNRIQ 1300
>gi|195120698|ref|XP_002004858.1| GI19367 [Drosophila mojavensis]
gi|193909926|gb|EDW08793.1| GI19367 [Drosophila mojavensis]
Length = 1340
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NGK+L +G G +DG+F K R A D +++
Sbjct: 1231 NRVIVSDSNNHRIQIFDVNGKVLSTVGG-EGSDDGQF---KFPRGVAV---DDQGYIFVA 1283
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 1284 DSGNNRIQ 1291
>gi|295394954|ref|ZP_06805167.1| NHL repeat containing protein [Brevibacterium mcbrellneri ATCC
49030]
gi|294972287|gb|EFG48149.1| NHL repeat containing protein [Brevibacterium mcbrellneri ATCC
49030]
Length = 655
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 46/194 (23%), Positives = 83/194 (42%), Gaps = 31/194 (15%)
Query: 253 PHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNG-KILDCIGSCP-GFED 310
P+ + R L +PG + + G RL ++DS HH ++D +G +L IGS G D
Sbjct: 181 PYVPPAPREGDLFYPGKVVRLDDG-RLIVADSGHHSYALYDSDGANLLGRIGSGERGNAD 239
Query: 311 GEFESSKLMRPAASF-----------YHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
G+F +++ P YH L D+ NH +R ++ + + TV T
Sbjct: 240 GDFATAQFSEPGGVAQLPPEIAQQVGYH-----LVAADTVNHTLRGINLETQTVTTVAGT 294
Query: 360 SG---ISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKS-EDDNLLIINRS 415
+ +N + ++ GF D + + L PW ++ + + N++I
Sbjct: 295 GEQYMVGASDN-----VPDQPGFFGRYDGPATAV---KLSSPWDVVYAPKTSNVVIAMAG 346
Query: 416 FETLWIMDLASGEI 429
T+W D +G +
Sbjct: 347 NHTMWTFDPVAGTV 360
>gi|322790121|gb|EFZ15148.1| hypothetical protein SINV_01508 [Solenopsis invicta]
Length = 472
Score = 47.8 bits (112), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 51/92 (55%), Gaps = 13/92 (14%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
+C S R L H P I+ + NR+ +SD N+HR+ +FD NG++L GS G +DG+F
Sbjct: 343 SCGSGRGQLEH-PHYIAVSNT-NRVIVSDGNNHRVQIFDVNGRVLTSFGS-EGSDDGQF- 398
Query: 315 SSKLMRPAASFYHKDDDCLYIV--DSENHAIR 344
K R A DD YIV DS N+ I+
Sbjct: 399 --KFPRGVAV-----DDQGYIVVADSGNNRIQ 423
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 46/114 (40%), Gaps = 29/114 (25%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCL--- 333
N + ++DS++HR+ VFDGNG + G+ G +GEF DCL
Sbjct: 222 NSIVVADSSNHRVQVFDGNGNFIKEFGTY-GSGEGEF-----------------DCLAGV 263
Query: 334 -------YIV-DSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
YI+ D NH I+ D R L + N W + LGF
Sbjct: 264 AVNRIGQYIIADRYNHRIQVLDPSGRFLRAFGSQGTADGRFNYPWGITTDALGF 317
>gi|389612944|dbj|BAM19866.1| similar to CG12547, partial [Papilio xuthus]
Length = 372
Score = 47.8 bits (112), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 44/82 (53%), Gaps = 2/82 (2%)
Query: 279 LFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPAASFYHKDDDCLYIVD 337
L +SD+ +HR+I+ D +G IL IG PGF+DG+ ++L P + L + D
Sbjct: 237 LAISDTGNHRVILTDCSGIILRIIGGPEPGFKDGKLSEARLNSPQGVCW-LSGSVLAVCD 295
Query: 338 SENHAIRRADMGRRVLETVYPT 359
+ NHA+R + +E + T
Sbjct: 296 TNNHAVRAVHLDDGTVEVLVGT 317
>gi|114568006|ref|YP_755160.1| leucine-rich repeat-containing protein [Syntrophomonas wolfei
subsp. wolfei str. Goettingen]
gi|114338941|gb|ABI69789.1| Leucine-rich repeat (LRR) protein-like protein [Syntrophomonas
wolfei subsp. wolfei str. Goettingen]
Length = 1351
Score = 47.8 bits (112), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 71/149 (47%), Gaps = 19/149 (12%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGF--EDGEFESSKLMR 320
L++P ++ D SGN L+++D +HRI D +G I G+ G+ + G S++L
Sbjct: 474 LNYPFGVACDGSGN-LYIADRGNHRIRKVDTSGIISTVAGNGTAGYSGDGGPATSAQLKD 532
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT-----SGISKKNNSLWAWIME 375
P K+ + LYIVD +N+ IR+ D ++ TV +G S W+
Sbjct: 533 PVGVACDKNGN-LYIVDKDNNRIRKVDNTGKI-STVAGNGTGGYAGDGSPATSAQIWVPY 590
Query: 376 KLGFERD--------NDTKSEKLDPQSLI 396
+ F+ + N+ + K+DP +I
Sbjct: 591 GVTFDNNGNMYIADMNNKRIRKVDPSGII 619
>gi|357622762|gb|EHJ74161.1| hypothetical protein KGM_17545 [Danaus plexippus]
Length = 1198
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 34/72 (47%), Positives = 42/72 (58%), Gaps = 11/72 (15%)
Query: 275 SGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLY 334
S NR+ +SDSN+HRI VFD NG++L G G EDG+F K R A DD Y
Sbjct: 1087 STNRVLVSDSNNHRIQVFDVNGRVLSSFGE-EGSEDGQF---KFPRGVAV-----DDQGY 1137
Query: 335 IV--DSENHAIR 344
IV DS N+ I+
Sbjct: 1138 IVVADSGNNRIQ 1149
>gi|225619604|ref|YP_002720861.1| NHL repeat containing protein [Brachyspira hyodysenteriae WA1]
gi|225214423|gb|ACN83157.1| NHL repeat containing protein [Brachyspira hyodysenteriae WA1]
Length = 700
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 51/102 (50%), Gaps = 9/102 (8%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
S V N L P ++ D GN L++SD+ ++RI++FD G L G G +GEF
Sbjct: 250 GVSGVSNGQLLGPSSVAVDREGN-LYVSDTGNNRIVIFDIEGNFLYSFGKL-GENNGEFF 307
Query: 315 SSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
S PA DD +Y+ D N I++ D+ ++++
Sbjct: 308 S-----PAGI--AVDDKYIYVADMGNKRIQQFDLSGNYIQSI 342
>gi|195164496|ref|XP_002023083.1| GL21149 [Drosophila persimilis]
gi|194105168|gb|EDW27211.1| GL21149 [Drosophila persimilis]
Length = 595
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 55/108 (50%), Gaps = 8/108 (7%)
Query: 254 HACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGE 312
H SS+R FP I+ G R ++D+ ++R++V +G + IG GF DG+
Sbjct: 220 HPTSSLR-----FPAKIARSSLG-RYAIADAGNNRVLVVSADGVVEHKIGGLQAGFVDGK 273
Query: 313 FESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS 360
S+ P + ++ L + D+ENHA+R+ + ++ET+ T
Sbjct: 274 LALSRFNSPQGVAF-LNESTLIVADTENHALRKISLENELVETLAGTG 320
>gi|425734502|ref|ZP_18852820.1| NHL repeat-containing protein [Brevibacterium casei S18]
gi|425481116|gb|EKU48277.1| NHL repeat-containing protein [Brevibacterium casei S18]
Length = 666
Score = 47.4 bits (111), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 54/116 (46%), Gaps = 9/116 (7%)
Query: 253 PHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK-ILDCIGSCP-GFED 310
P+ L +PG ++ SGN L ++DS HH ++ + +G+ +L IG+ G D
Sbjct: 186 PYVPPPAPETELFYPGKVTVLPSGN-LLVADSGHHSLVEYTPDGQTVLRRIGTGERGLTD 244
Query: 311 GEFESSKLMRPAASFYHKDDDC------LYIVDSENHAIRRADMGRRVLETVYPTS 360
G+F S++ P +D L + D+ NH +R D+ + T+ T
Sbjct: 245 GDFASAQFSEPGGITVLPEDIAARAGYQLVVADTVNHVLRGIDLDGETVRTIAGTG 300
>gi|147818279|emb|CAN64724.1| hypothetical protein VITISV_026725 [Vitis vinifera]
Length = 423
Score = 47.4 bits (111), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 40/79 (50%), Gaps = 5/79 (6%)
Query: 290 IVFDGNGKILDCIGSC--PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRAD 347
+V D NG + IGS G DG F+ + RP Y+ + LY+ D+ENHA+R D
Sbjct: 43 VVTDLNGNYILQIGSTGEEGLRDGSFDDATFNRPQGLAYNAKKNLLYVADTENHALREID 102
Query: 348 MGRRVLETVYPTSGISKKN 366
V ETV +G K+
Sbjct: 103 F---VNETVQTLAGNGTKD 118
>gi|148654532|ref|YP_001274737.1| NHL repeat-containing protein [Roseiflexus sp. RS-1]
gi|148566642|gb|ABQ88787.1| NHL repeat containing protein [Roseiflexus sp. RS-1]
Length = 1030
Score = 47.4 bits (111), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 36/72 (50%), Gaps = 6/72 (8%)
Query: 274 ESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCL 333
+S NRL++ D +HRI VFD NG L IG G G+F R A + +
Sbjct: 619 DSANRLYVGDEWNHRIQVFDANGAYLTTIGGSAGPRTGQF------RGARGVAVDNAGNI 672
Query: 334 YIVDSENHAIRR 345
Y+ D NH I++
Sbjct: 673 YVADRLNHRIQK 684
>gi|384208589|ref|YP_005594309.1| NHL repeat containing protein [Brachyspira intermedia PWS/A]
gi|343386239|gb|AEM21729.1| NHL repeat containing protein [Brachyspira intermedia PWS/A]
Length = 716
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 52/102 (50%), Gaps = 9/102 (8%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
+ V N L P ++ D+ GN +++SD+ ++RI+VFD G L G G +GEF
Sbjct: 266 GVTGVSNGQLLGPSAVAVDKEGN-IYVSDTGNNRIVVFDVEGNFLYSFGKL-GENNGEFF 323
Query: 315 SSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
S PA +D +Y+ D N I++ D+ ++T+
Sbjct: 324 S-----PAGI--AVNDQYIYVADMGNKRIQQFDLSGNYIQTI 358
>gi|223934991|ref|ZP_03626910.1| NHL repeat containing protein [bacterium Ellin514]
gi|223896444|gb|EEF62886.1| NHL repeat containing protein [bacterium Ellin514]
Length = 1064
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 54/116 (46%), Gaps = 9/116 (7%)
Query: 235 SPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRI--IVF 292
+P G T A E +S N ++P ++ D +GN +F++D +H I ++
Sbjct: 537 TPGGAVTTLAGNFENFGSSDGTNS--NARFYWPSGVAVDNAGN-VFVADYMNHTIRELIP 593
Query: 293 DGNGKILDCIGSCPGFE---DGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRR 345
G I++ + GF DG S++ +P S LY+ DS NHAIR+
Sbjct: 594 SGTNWIVNTVAGLAGFWGSIDGTNTSARFFQPR-SLSVDASGALYVADSGNHAIRK 648
>gi|428172619|gb|EKX41527.1| hypothetical protein GUITHDRAFT_153825 [Guillardia theta CCMP2712]
Length = 285
Score = 47.0 bits (110), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 86/201 (42%), Gaps = 38/201 (18%)
Query: 189 QMENGACYLLSKDFGN---ARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAK 245
Q +GA +LL DF N +VF + N E ++ N S P GL+
Sbjct: 66 QDMDGAEHLLVADFDNNCIRKVF--------ITNGVGEHVVTWAGSNLSKP-GLRD---- 112
Query: 246 QAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVF----DGNGKILDC 301
S++ L + PG ISAD+SG +F++D+ +H I D N +
Sbjct: 113 -----------GSIKEALFNHPGGISADQSGENIFIADTYNHAIRHITRDEDRNWTVTLI 161
Query: 302 IGS---CPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYP 358
GS GFEDGE ES++ P + + L I D N AIR + RRV
Sbjct: 162 AGSRSGQSGFEDGEGESARFNCPTGLAVVNEQEIL-ISDFSNSAIR---LLRRVRGDWLV 217
Query: 359 TSGISKKNNSLWAWIMEKLGF 379
T+ + K + ++ + G+
Sbjct: 218 TTVVGKTSKDSEGRMLPESGY 238
>gi|290979906|ref|XP_002672674.1| predicted protein [Naegleria gruberi]
gi|284086252|gb|EFC39930.1| predicted protein [Naegleria gruberi]
Length = 747
Score = 47.0 bits (110), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 65/328 (19%), Positives = 127/328 (38%), Gaps = 55/328 (16%)
Query: 191 ENGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVL 250
NG+ Y FG + +++ D+ + ++++ G++ A A +
Sbjct: 256 NNGSVYTYVSSFGPKGIVFDSNGDLYVAEATYDQIVKFSN-------GVRTVIANNAGI- 307
Query: 251 KEPHACSSVRNLL-LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS----- 304
P N L F G S G +L+++D++++RI + I++ I
Sbjct: 308 -TPGFSGDGGNALNATFHGITSLAIYGKKLYIADASNNRIRHINLTTNIINTIAGDGSEI 366
Query: 305 CPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISK 364
G G + L RP SFY+ D L I D+ N+ + R + +
Sbjct: 367 YTGGTSGYQSTIALKRPYTSFYNNQTDELLIADTNNYRVLRVNSLSNI------------ 414
Query: 365 KNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDL 424
+++SL + +G +D D + + +L +P+ + +SE + I +F+ L
Sbjct: 415 ESSSLIETVAGVVGLVQD-DIDGKLGNETALNYPYSVTQSEISGDVFIGTTFKIL----- 468
Query: 425 ASGEIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGLISSS 484
++ + K S V L + +S S P AGL +
Sbjct: 469 ---KVSKKDKRVSTVAGSVSTLAGD---------------NFNSIASQLNEP-AGL---A 506
Query: 485 IAFQNHILLCDIVGQRIMRLNRESGVCS 512
A + +CD + I +++ E+G+ S
Sbjct: 507 FAINGDLYICDSINHAIRKIDHETGIIS 534
>gi|380011633|ref|XP_003689904.1| PREDICTED: uncharacterized protein LOC100869288, partial [Apis
florea]
Length = 1660
Score = 47.0 bits (110), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 51/92 (55%), Gaps = 13/92 (14%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
+C S R L H P I+ + NR+ +SD N+HRI +FD NG++L GS G ++G+F
Sbjct: 1531 SCGSGRGQLEH-PHYIAVSNT-NRVIVSDGNNHRIQIFDVNGRVLTSFGS-EGSDEGQF- 1586
Query: 315 SSKLMRPAASFYHKDDDCLYIV--DSENHAIR 344
K R A DD YIV DS N+ I+
Sbjct: 1587 --KFPRGVAV-----DDQGYIVVADSGNNRIQ 1611
>gi|328779273|ref|XP_391967.4| PREDICTED: RING finger protein nhl-1-like [Apis mellifera]
Length = 1281
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 51/92 (55%), Gaps = 13/92 (14%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
+C S R L H P I+ + NR+ +SD N+HRI +FD NG++L GS G ++G+F
Sbjct: 1152 SCGSGRGQLEH-PHYIAVSNT-NRVIVSDGNNHRIQIFDVNGRVLTSFGS-EGSDEGQF- 1207
Query: 315 SSKLMRPAASFYHKDDDCLYIV--DSENHAIR 344
K R A DD YIV DS N+ I+
Sbjct: 1208 --KFPRGVAV-----DDQGYIVVADSGNNRIQ 1232
>gi|383857485|ref|XP_003704235.1| PREDICTED: RING finger protein nhl-1-like isoform 2 [Megachile
rotundata]
Length = 1297
Score = 46.6 bits (109), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 51/92 (55%), Gaps = 13/92 (14%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
+C S R L H P I+ + NR+ +SD N+HRI +FD NG++L GS G ++G+F
Sbjct: 1168 SCGSGRGQLEH-PHYIAVSNT-NRVIVSDGNNHRIQIFDVNGRVLTSFGS-EGSDEGQF- 1223
Query: 315 SSKLMRPAASFYHKDDDCLYIV--DSENHAIR 344
K R A DD YIV DS N+ I+
Sbjct: 1224 --KFPRGVAV-----DDQGYIVVADSGNNRIQ 1248
>gi|383857483|ref|XP_003704234.1| PREDICTED: RING finger protein nhl-1-like isoform 1 [Megachile
rotundata]
Length = 1233
Score = 46.6 bits (109), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 51/92 (55%), Gaps = 13/92 (14%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
+C S R L H P I+ + NR+ +SD N+HRI +FD NG++L GS G ++G+F
Sbjct: 1104 SCGSGRGQLEH-PHYIAVSNT-NRVIVSDGNNHRIQIFDVNGRVLTSFGS-EGSDEGQF- 1159
Query: 315 SSKLMRPAASFYHKDDDCLYIV--DSENHAIR 344
K R A DD YIV DS N+ I+
Sbjct: 1160 --KFPRGVAV-----DDQGYIVVADSGNNRIQ 1184
>gi|336178839|ref|YP_004584214.1| serine/threonine protein kinase [Frankia symbiont of Datisca
glomerata]
gi|334859819|gb|AEH10293.1| serine/threonine protein kinase [Frankia symbiont of Datisca
glomerata]
Length = 776
Score = 46.6 bits (109), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 50/87 (57%), Gaps = 5/87 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGF--EDGEFESSKLMR 320
L PG ++AD +GN L+++D+ ++RI +G I+ GS G+ + G +++L
Sbjct: 632 LARPGGLAADTAGN-LYIADNANNRIRRVSSDGVIITVAGSGTSGYSGDGGPATAAQLAN 690
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRAD 347
P S DD +YI D++N+ +RR D
Sbjct: 691 PG-SVAVTDDGRVYIADTDNNRVRRVD 716
>gi|374606289|ref|ZP_09679172.1| copper amine oxidase domain-containing protein [Paenibacillus
dendritiformis C454]
gi|374388103|gb|EHQ59542.1| copper amine oxidase domain-containing protein [Paenibacillus
dendritiformis C454]
Length = 564
Score = 46.6 bits (109), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 44/82 (53%), Gaps = 3/82 (3%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPAASF 325
P ++ D GN L+++D+ +H I D +G++ GS G++DG+ ++ P
Sbjct: 141 PMGMAEDRQGN-LYIADAANHVIRKLDKSGRVTTVAGSGLAGWKDGKGAEARFNEPR-DV 198
Query: 326 YHKDDDCLYIVDSENHAIRRAD 347
+D LY+ D+ NH IRR D
Sbjct: 199 AVAEDGSLYVADALNHVIRRID 220
>gi|153868931|ref|ZP_01998653.1| receptor protein kinase [Beggiatoa sp. PS]
gi|152074497|gb|EDN71345.1| receptor protein kinase [Beggiatoa sp. PS]
Length = 3115
Score = 46.6 bits (109), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 73/150 (48%), Gaps = 21/150 (14%)
Query: 222 VEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFL 281
V ++++Q +NS+ T ++A +++ A L+ P +S D SGN ++
Sbjct: 2097 VSDIVIQPNDNSA-----IWTLVEEAAAIEQTGAN-------LNSPKSMSFDASGNA-YI 2143
Query: 282 SDSNHHRIIVFDGNGKILDCIGSCP---GFEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
+DS +HRI+ D G + G+ +DG +KL P + + + LYI D+
Sbjct: 2144 ADSLNHRILKRDTQGNLTVVAGTGAKGSTGDDGPAIEAKLKNPQGTAIDHEGN-LYIADT 2202
Query: 339 ENHAIRRADMGRRVLETVYPTSGISKKNNS 368
NH IR+ D ++ TV +GI K N+
Sbjct: 2203 LNHRIRKVD-SNGIITTV---AGIGKAGNT 2228
>gi|307166850|gb|EFN60780.1| RING finger protein nhl-1 [Camponotus floridanus]
Length = 510
Score = 46.6 bits (109), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 50/92 (54%), Gaps = 13/92 (14%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
+C R L H P I+ + NR+ +SD N+HR+ +FD NG++L GS G +DG+F
Sbjct: 381 SCGGGRGQLEH-PHYIAVSNT-NRVIVSDGNNHRVQIFDVNGRVLTSFGS-EGSDDGQF- 436
Query: 315 SSKLMRPAASFYHKDDDCLYIV--DSENHAIR 344
K R A DD YIV DS N+ I+
Sbjct: 437 --KFPRGVAV-----DDQGYIVVADSGNNRIQ 461
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 46/114 (40%), Gaps = 29/114 (25%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCL--- 333
N + ++DS++HRI VFDGNG + G+ G +GEF DCL
Sbjct: 260 NSIVVADSSNHRIQVFDGNGNFMKEFGTY-GSGEGEF-----------------DCLAGV 301
Query: 334 -------YIV-DSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
YI+ D NH I+ D R L + N W + LGF
Sbjct: 302 AVNRIGQYIIADRYNHRIQVLDPSGRFLRAFGSQGTADGRFNYPWGITTDALGF 355
>gi|194757852|ref|XP_001961176.1| GF11130 [Drosophila ananassae]
gi|190622474|gb|EDV37998.1| GF11130 [Drosophila ananassae]
Length = 1348
Score = 46.6 bits (109), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 41/70 (58%), Gaps = 11/70 (15%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYI- 335
NR+ +SDSN+HRI +FD NGK+L +G G +DG+F K R A DD YI
Sbjct: 1239 NRVIVSDSNNHRIQIFDVNGKVLSTVGG-EGSDDGQF---KFPRGVAV-----DDLGYIF 1289
Query: 336 -VDSENHAIR 344
DS N+ I+
Sbjct: 1290 VADSGNNRIQ 1299
>gi|429124393|ref|ZP_19184925.1| NHL repeat containing protein [Brachyspira hampsonii 30446]
gi|426279632|gb|EKV56653.1| NHL repeat containing protein [Brachyspira hampsonii 30446]
Length = 700
Score = 46.6 bits (109), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 9/102 (8%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
S V N L P ++ D+ GN L++SD+ ++RI+VFD +G L G G +GEF
Sbjct: 250 GVSGVSNGQLLGPSSVALDKDGN-LYVSDTGNNRIVVFDVDGNFLFSFGKL-GENEGEFF 307
Query: 315 SSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
S PA ++ +Y+ D N I++ D+ ++T+
Sbjct: 308 S-----PAGI--AVNEQYIYVADMGNKRIQQFDLSGNYIQTI 342
>gi|357417484|ref|YP_004930504.1| NHL repeat containing protein [Pseudoxanthomonas spadix BD-a59]
gi|355335062|gb|AER56463.1| NHL repeat containing protein [Pseudoxanthomonas spadix BD-a59]
Length = 474
Score = 46.2 bits (108), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 64/143 (44%), Gaps = 20/143 (13%)
Query: 215 IGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADE 274
+G L AV L Q E P G + E EP A L FP +
Sbjct: 134 VGALEAAVSALCEQVLE----PPGTT----RAVETNPEPRAP-------LRFPTGLVV-- 176
Query: 275 SGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFE-DGEFESSKLMRPAASFYHKDDDCL 333
+ RL+++D+ HH+I+ G++L G+ DGE E + RP +++ L
Sbjct: 177 TAERLYVADTGHHQILECTHAGRVLRRFGNGTADALDGEPEIASFCRPNGLSLLREE--L 234
Query: 334 YIVDSENHAIRRADMGRRVLETV 356
Y+ D+ NH +RR ++ ++TV
Sbjct: 235 YVADTGNHLLRRINLRSGAVQTV 257
>gi|449684055|ref|XP_002166330.2| PREDICTED: NHL repeat-containing protein 2-like [Hydra
magnipapillata]
Length = 193
Score = 46.2 bits (108), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 22/84 (26%), Positives = 41/84 (48%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMR 320
NL++H+P + D N L++ + +H I V G + G G +DG +++
Sbjct: 41 NLIIHYPEGLVYDPDYNLLYVCEFENHNIRVVSMQGLVKTLAGEFKGKKDGVGRTARFYY 100
Query: 321 PAASFYHKDDDCLYIVDSENHAIR 344
P+ + + + LY+ D NH +R
Sbjct: 101 PSGLAFDRKNRMLYVSDQYNHLVR 124
>gi|148657095|ref|YP_001277300.1| PA14 domain-containing protein [Roseiflexus sp. RS-1]
gi|148569205|gb|ABQ91350.1| PA14 domain protein [Roseiflexus sp. RS-1]
Length = 1293
Score = 46.2 bits (108), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 39/67 (58%), Gaps = 7/67 (10%)
Query: 279 LFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
++++DS++HRI F G++L GS PG DG+F S + + A D +Y+ D+
Sbjct: 71 VYVADSDNHRIQRFSAAGELLGAWGS-PGTGDGQFSSPRSVAVAP------DGTVYVADT 123
Query: 339 ENHAIRR 345
NH I+R
Sbjct: 124 GNHRIQR 130
Score = 42.4 bits (98), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 39/75 (52%), Gaps = 7/75 (9%)
Query: 271 SADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDD 330
+A S ++++DSN+HRI F +G L G G DG+F + + A+ D
Sbjct: 334 AAVASDGTVYVADSNNHRIQRFSADGTFLGAWG-AEGSGDGQFVYPRSVAVAS------D 386
Query: 331 DCLYIVDSENHAIRR 345
+Y+ DS NH I+R
Sbjct: 387 GTVYVADSNNHRIQR 401
>gi|320106842|ref|YP_004182432.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
gi|319925363|gb|ADV82438.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
Length = 382
Score = 46.2 bits (108), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 51/118 (43%), Gaps = 17/118 (14%)
Query: 280 FLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
+ DS +H I + +G ++ G+ G+ DG+ S++ +P Y+ D YI D+
Sbjct: 249 LIVDSGNHAIRLLNGANQVTTLAGNGTAGYADGQGNSARFNKPTQVVYNSGDGAYYIADT 308
Query: 339 ENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLI 396
N+ IRR D S N + +A I + G D + S K D + I
Sbjct: 309 FNNCIRRMD---------------SAGNVTTYAGIGGQSGLV-DGASTSAKFDKPTGI 350
>gi|116749588|ref|YP_846275.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
gi|116698652|gb|ABK17840.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
Length = 1750
Score = 46.2 bits (108), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 48/95 (50%), Gaps = 5/95 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFE-DGEFESSKLMRP 321
L+ P I+ D S ++++D+N+HRI DG G I G+ PG+ DG ++ +
Sbjct: 485 LNNPSAIAVDGS-ESIYIADTNNHRIRKVDGGGTITTVAGNGTPGYSGDGASATAASLNF 543
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
D ++I D+ NH +R D G V+ TV
Sbjct: 544 PNGVAVDADGNVFIADTSNHRVRMVDSG--VITTV 576
Score = 43.5 bits (101), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 44/89 (49%), Gaps = 6/89 (6%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGS-CPGF--EDGEFESSKLM 319
L P ++ D SGN ++LSD+N+HRI D G+I G PG+ + G + L
Sbjct: 37 LDSPSGVAVDTSGN-VYLSDTNNHRIRKLDVAAGQIQTLAGGQSPGYSGDGGTAAKAGLN 95
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADM 348
RP + +Y DS NH IR+ D
Sbjct: 96 RPRGIAVDAAGN-VYFADSNNHCIRKIDT 123
>gi|29830929|ref|NP_825563.1| hypothetical protein SAV_4386 [Streptomyces avermitilis MA-4680]
gi|29608042|dbj|BAC72098.1| hypothetical protein SAV_4386 [Streptomyces avermitilis MA-4680]
Length = 341
Score = 46.2 bits (108), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 53/110 (48%), Gaps = 4/110 (3%)
Query: 253 PHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK-ILDCIGS-CPGFED 310
P+ +L FPG SGN +SD+ H+++ +G+ ++ IGS GF D
Sbjct: 167 PYVAPEPEPTVLRFPGKALLLPSGN-FLVSDTTRHQLVELAADGESVVRRIGSGIRGFAD 225
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS 360
G +++ P D+D + + D+ NHA+RR D+ + T+ T
Sbjct: 226 GPADAASFSEPQGMTL-LDEDVVVVADTVNHALRRLDLATGEVTTLAGTG 274
>gi|380300595|ref|ZP_09850288.1| NHL repeat protein [Brachybacterium squillarum M-6-3]
Length = 644
Score = 45.8 bits (107), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 67/164 (40%), Gaps = 20/164 (12%)
Query: 278 RLFLSDSNHHRIIVFDGNGKILD-CIGSC-PGFEDGEFESSKLMRPAASFYHKDDDC--- 332
R+ +SD+ HR++VFD +G +D IG G DGE + P D
Sbjct: 194 RIVVSDAGQHRLVVFDPDGTTVDEEIGDGRRGLLDGESAGVRFAEPNGVLALPSDVAAEV 253
Query: 333 ---LYIVDSENHAIRRADMGR-RVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSE 388
L + D+ NH +R +G+ R L S+ +W + + +
Sbjct: 254 GYDLLVADTANHVLRGVKVGQDRFLR--------SRTATEVWTVAGSGAQWMQGDALPRG 305
Query: 389 KLDPQS--LIFPWHLMKSEDDNLLIINRS-FETLWIMDLASGEI 429
+ DP+S L PW L S N ++I + LW D +G +
Sbjct: 306 EGDPRSYALSTPWDLGWSHALNRVVITMAGIHQLWTFDPVTGSL 349
>gi|383454552|ref|YP_005368541.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
gi|380728713|gb|AFE04715.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
Length = 861
Score = 45.8 bits (107), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 3/94 (3%)
Query: 257 SSVRNLL--LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
+R +L L+ P I+AD GN +++SD+ + I GK G+ G++DG
Sbjct: 614 GKIRTVLTGLYGPMGIAADALGN-VYVSDTENAVIRRISPEGKAEVFAGTTWGYQDGPAL 672
Query: 315 SSKLMRPAASFYHKDDDCLYIVDSENHAIRRADM 348
S+ +PA + D L + D N IRR DM
Sbjct: 673 SAAFNQPAGLSFTPDGTALLVADLNNSVIRRIDM 706
>gi|253748633|gb|EET02664.1| Haloacid dehalogenase-like hydrolase family [Giardia intestinalis
ATCC 50581]
Length = 963
Score = 45.8 bits (107), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 31/140 (22%), Positives = 58/140 (41%), Gaps = 3/140 (2%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRPAASFYHKDDDCLY 334
N + LSDS H+R++ +G D + DG F + P + + +Y
Sbjct: 97 NGILLSDSGHNRLVFINGETGHCDHLAGTGEAAHLDGGFSQAAFNSPHSVVSDPVNGIIY 156
Query: 335 IVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQS 394
+ D++NH IR + R + T+ T G+ + L E LG D+ D ++
Sbjct: 157 LTDTKNHCIRTLSLATRKVRTLAGTPGVFGYRDGLNPLFNEPLGLVLTEDSNIIVCDSKN 216
Query: 395 LIFPWHLMKSEDDNLLIINR 414
H+ + + + ++ R
Sbjct: 217 GAL-RHISRENGETITLVGR 235
>gi|189241070|ref|XP_001808548.1| PREDICTED: similar to AGAP007135-PA [Tribolium castaneum]
Length = 1245
Score = 45.8 bits (107), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 32/70 (45%), Positives = 41/70 (58%), Gaps = 11/70 (15%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYI- 335
NR+ +SDSN+HRI +FD NGK+L GS G +DG+F K R A DD YI
Sbjct: 1136 NRVVVSDSNNHRIQIFDVNGKVLSSFGS-EGSDDGQF---KFPRGVAV-----DDQGYIC 1186
Query: 336 -VDSENHAIR 344
DS N+ I+
Sbjct: 1187 VADSGNNRIQ 1196
>gi|270013854|gb|EFA10302.1| hypothetical protein TcasGA2_TC012517 [Tribolium castaneum]
Length = 2020
Score = 45.8 bits (107), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 40/68 (58%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NGK+L GS G +DG+F K R A D + +
Sbjct: 1126 NRVVVSDSNNHRIQIFDVNGKVLSSFGS-EGSDDGQF---KFPRGVAV---DDQGYICVA 1178
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 1179 DSGNNRIQ 1186
>gi|116007528|ref|NP_001036460.1| CG12547, isoform A [Drosophila melanogaster]
gi|442622271|ref|NP_001260704.1| CG12547, isoform B [Drosophila melanogaster]
gi|17945781|gb|AAL48938.1| RE33981p [Drosophila melanogaster]
gi|30923523|gb|EAA46001.1| CG12547, isoform A [Drosophila melanogaster]
gi|220948326|gb|ACL86706.1| CG12547-PA [synthetic construct]
gi|440214080|gb|AGB93239.1| CG12547, isoform B [Drosophila melanogaster]
Length = 717
Score = 45.8 bits (107), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 51/98 (52%), Gaps = 3/98 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FP I +G ++D+ ++R++V G + IG PGF DG+ ++ P
Sbjct: 225 LRFPAKIVQGPNG-LYAIADTGNNRVLVLTAGGVVQHRIGGHQPGFVDGDLTVARFNNPQ 283
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS 360
+ +++ L + D++NHAIR+ + ++ET+ T
Sbjct: 284 GIAF-LNENVLIVADTKNHAIRQISLTNAMVETLAGTG 320
>gi|421861496|ref|ZP_16293498.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
gi|410828922|dbj|GAC43935.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
Length = 575
Score = 45.8 bits (107), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 42/82 (51%), Gaps = 3/82 (3%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFESSKLMRPAASF 325
P ++ D GN L+++D+ +H I D +G++ GS G+ DG ++ P
Sbjct: 152 PMGMAEDRQGN-LYIADAGNHVIRKLDKSGRVTTVAGSGLAGWRDGTGSEARFNEPR-DV 209
Query: 326 YHKDDDCLYIVDSENHAIRRAD 347
+D LY+ D+ NH +RR D
Sbjct: 210 AVAEDGSLYVADALNHVLRRID 231
>gi|340723937|ref|XP_003400343.1| PREDICTED: hypothetical protein LOC100643462 [Bombus terrestris]
Length = 1787
Score = 45.8 bits (107), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 50/92 (54%), Gaps = 13/92 (14%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
+C S R L H P I+ + NR+ +SD N+HRI +FD NG +L GS G ++G+F
Sbjct: 1658 SCGSGRGQLEH-PHYIAVSNT-NRVIVSDGNNHRIQIFDVNGHVLTSFGS-EGSDEGQF- 1713
Query: 315 SSKLMRPAASFYHKDDDCLYIV--DSENHAIR 344
K R A DD YIV DS N+ I+
Sbjct: 1714 --KFPRGVAV-----DDQGYIVVADSGNNRIQ 1738
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 46/114 (40%), Gaps = 29/114 (25%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCL--- 333
N + ++DS++HR+ VFDGNG + G+ G +GEF DCL
Sbjct: 1537 NSIVVADSSNHRVQVFDGNGNFMKEFGTY-GSGEGEF-----------------DCLAGV 1578
Query: 334 -------YIV-DSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
YI+ D NH I+ D R L + N W + LGF
Sbjct: 1579 AVNRIGQYIIADRYNHRIQVLDPSGRFLRAFGSQGTADGRFNYPWGITTDVLGF 1632
>gi|350405986|ref|XP_003487620.1| PREDICTED: hypothetical protein LOC100746052 [Bombus impatiens]
Length = 1795
Score = 45.8 bits (107), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 50/92 (54%), Gaps = 13/92 (14%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
+C S R L H P I+ + NR+ +SD N+HRI +FD NG +L GS G ++G+F
Sbjct: 1666 SCGSGRGQLEH-PHYIAVSNT-NRVIVSDGNNHRIQIFDVNGHVLTSFGS-EGSDEGQF- 1721
Query: 315 SSKLMRPAASFYHKDDDCLYIV--DSENHAIR 344
K R A DD YIV DS N+ I+
Sbjct: 1722 --KFPRGVAV-----DDQGYIVVADSGNNRIQ 1746
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 46/114 (40%), Gaps = 29/114 (25%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCL--- 333
N + ++DS++HR+ VFDGNG + G+ G +GEF DCL
Sbjct: 1545 NSIVVADSSNHRVQVFDGNGNFMKEFGTY-GSGEGEF-----------------DCLAGV 1586
Query: 334 -------YIV-DSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
YI+ D NH I+ D R L + N W + LGF
Sbjct: 1587 AVNRIGQYIIADRYNHRIQVLDPSGRFLRAFGSQGTADGRFNYPWGITTDVLGF 1640
>gi|257069643|ref|YP_003155898.1| NHL repeat protein [Brachybacterium faecium DSM 4810]
gi|256560461|gb|ACU86308.1| NHL repeat protein [Brachybacterium faecium DSM 4810]
Length = 697
Score = 45.8 bits (107), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 53/221 (23%), Positives = 93/221 (42%), Gaps = 27/221 (12%)
Query: 218 LNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGN 277
L++ VE+L+ + + + S G T VL E + L FP ++ G
Sbjct: 147 LDRLVEQLVAEGEADGSLRRGPAPT------VLAERSPQT------LRFPSKLTLLPDG- 193
Query: 278 RLFLSDSNHHRIIVFDGNGKILD-CIGSC-PGFEDGEFESSKLMRPAASFY---HKDDDC 332
RL +SD+ HR++VF+ +G +D IG+ G DG+ E+++ P D+
Sbjct: 194 RLVVSDAGQHRLVVFENDGVTVDAAIGTGRRGHADGDEETAQFAEPNGVLALPREVADEV 253
Query: 333 LY---IVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEK 389
Y +VD+ H +R +G+ L + ++ S W+ + D ++
Sbjct: 254 GYDLLVVDTAGHRLRGVKVGQDRLLRSRTATEVTTVAGSGEQWMQGAPLPRGEGDART-- 311
Query: 390 LDPQSLIFPWHLMKSEDDNLLIINRS-FETLWIMDLASGEI 429
+L PW L S N +I + LW D +G +
Sbjct: 312 ---FALSTPWDLTWSHVLNRAVIAMAGVHQLWTFDPVAGTL 349
>gi|195119117|ref|XP_002004078.1| GI18254 [Drosophila mojavensis]
gi|193914653|gb|EDW13520.1| GI18254 [Drosophila mojavensis]
Length = 734
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 4/97 (4%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FP I+ E ++ ++D+ ++RIIV G + IG GF DG S+ P
Sbjct: 224 LRFPAKIAMLE--DQYAIADAGNNRIIVVTNCGVVKYKIGGLEAGFVDGSLNMSRFNNPQ 281
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
+ D + L + D++NHA+R+ + V+ET+ T
Sbjct: 282 GVAF-LDKNTLIVADTDNHALRQISLNNGVVETLAGT 317
Score = 40.4 bits (93), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 9/91 (9%)
Query: 275 SGNRLFLSDSNHHRIIVFDG-NGKILDCIGS--------CPGFEDGEFESSKLMRPAASF 325
S + L+++DS I +GK++ +G G EDG+ ++KL P
Sbjct: 428 SSDYLYIADSESSSIRKASMVDGKVMPVVGGDRNPLNLFAFGDEDGKLYNAKLQHPLGVA 487
Query: 326 YHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
Y+ D LY+ D+ NH I++ D ++ T+
Sbjct: 488 YNNVDRKLYVADTYNHKIKQVDTETNIISTL 518
>gi|423072880|ref|ZP_17061629.1| repeat protein [Desulfitobacterium hafniense DP7]
gi|361856375|gb|EHL08282.1| repeat protein [Desulfitobacterium hafniense DP7]
Length = 2017
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 48/85 (56%), Gaps = 8/85 (9%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMR 320
N +P I+ D +GN +++SDS ++RI F GK ++ GS PG DG+F LM
Sbjct: 510 NDQFRYPNGIAVDSAGN-VYVSDSGNYRIQKFSSTGKSIEKWGS-PGVADGQF----LMP 563
Query: 321 PAASFYHKDDDCLYIVDSENHAIRR 345
+ ++ +Y+VDS N+ +++
Sbjct: 564 GGITLDSSNN--IYVVDSGNNRVQK 586
Score = 43.9 bits (102), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 53/103 (51%), Gaps = 9/103 (8%)
Query: 243 WAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCI 302
++ + E L + + V N+ P ++ D SGN +++ D+++ RI FD +G L
Sbjct: 898 YSAEGEFLTK-WGSNGVGNVQFGAPQDVATDSSGN-VYVVDTSNKRIQKFDSSGSFLTKW 955
Query: 303 GSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRR 345
GS G ++GEF RP D + +Y+ DS NH I++
Sbjct: 956 GSN-GSDEGEFN-----RPYGIAVDSDGN-IYVADSNNHRIQK 991
>gi|329935738|ref|ZP_08285543.1| hypothetical protein SGM_1035 [Streptomyces griseoaurantiacus M045]
gi|329304829|gb|EGG48702.1| hypothetical protein SGM_1035 [Streptomyces griseoaurantiacus M045]
Length = 617
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 58/138 (42%), Gaps = 14/138 (10%)
Query: 253 PHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRII--VFDGNGKILDCIGSCPGFED 310
P+ L FPG SGN L +SD+ H+++ DG ++ GF D
Sbjct: 176 PYVAPEPEPTTLRFPGKALLLPSGN-LLVSDTTRHQLVELAADGESEVRRIGSGARGFAD 234
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVY----------PTS 360
G E++ P +D + + D+ NHA+RR D+ + T+ PTS
Sbjct: 235 GTAEAAAFSEPQ-GLALLEDGSVVVADTVNHALRRLDVATGGVTTLAGTGRQWWQGSPTS 293
Query: 361 GISKKNNSLWAWIMEKLG 378
G +++ + W + G
Sbjct: 294 GPAREVDLSSPWDVAVFG 311
>gi|296133348|ref|YP_003640595.1| NHL repeat containing protein [Thermincola potens JR]
gi|296031926|gb|ADG82694.1| NHL repeat containing protein [Thermincola potens JR]
Length = 335
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 45/80 (56%), Gaps = 7/80 (8%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
N+L+++D H ++VFD GK+L IG PG E G+F + P A +D + +Y+V
Sbjct: 175 NKLYVTDVKKHAVLVFDLQGKLLKTIGK-PGREPGQF-----LAPNAVTVDRDGN-IYVV 227
Query: 337 DSENHAIRRADMGRRVLETV 356
D+ N ++ D + ++ +
Sbjct: 228 DTGNQRVQVFDKNGKFMKII 247
>gi|290990564|ref|XP_002677906.1| NHL domain-containing protein kinase [Naegleria gruberi]
gi|284091516|gb|EFC45162.1| NHL domain-containing protein kinase [Naegleria gruberi]
Length = 2733
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 52/99 (52%), Gaps = 6/99 (6%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGF--EDGEFESSKLMR 320
L+FP ++ +G+ ++++D+ +HRI + NG I GS G+ + G S+K +
Sbjct: 515 LNFPSGVAVSSNGD-IYIADTENHRIRKIETNGYIATVAGSGASGYSGDGGLLTSAKFQQ 573
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
P + + +YI D+ENH +R+ VL T+ T
Sbjct: 574 PQGVAVSSNGE-IYIADTENHVVRKIST-SNVLSTIAGT 610
Score = 44.3 bits (103), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 5/89 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE---SSKLMR 320
L+ P CI+ D +G ++SDS +HRI NG I G+ DG+ +++L
Sbjct: 203 LYAPACIAIDSNG-EFYISDSMNHRIRKIATNGIITTIAGTGTHGYDGDGALAINAQLYS 261
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRADMG 349
P + + D +YI DS N+ IR+ G
Sbjct: 262 PTGVVVNSNGD-VYIADSGNNRIRKISKG 289
>gi|359151030|ref|ZP_09183771.1| Redoxin domain-containing protein [Streptomyces sp. S4]
Length = 568
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 81/191 (42%), Gaps = 26/191 (13%)
Query: 221 AVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLF 280
A+E+L+ + + ++ L+ V EP A + L FPG SGN
Sbjct: 94 AIEKLVEELEAEHAAKGTLRR--GDGPYVAPEPVAGA------LRFPGRAVLLPSGN-FL 144
Query: 281 LSDSNHHRIIVFDGNGK-ILDCIGSCP--GFEDGEFESSKLMRPAASFYHKDDDCLYIVD 337
+SD+ H+++ +G+ ++ IG G DG ++ P + D + + D
Sbjct: 145 VSDTTRHQLVELAEDGESVVRRIGGEGERGLADGGPAEARFSEPQGLVLTPEGDAVIVAD 204
Query: 338 SENHAIRRADMGRRVLETVY----------PTSGISKKNN--SLW--AWIMEKLGFERDN 383
+ NHA+RR ++ V+ T PTSG +++ + S W AW KL
Sbjct: 205 TVNHALRRVELASGVVTTPAGTGRQWWQGSPTSGPAREVDLSSPWDVAWFGGKLWIAMAG 264
Query: 384 DTKSEKLDPQS 394
+ DPQ+
Sbjct: 265 THQLWTYDPQA 275
>gi|357400623|ref|YP_004912548.1| hypothetical protein SCAT_3039 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|386356676|ref|YP_006054922.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|337767032|emb|CCB75743.1| conserved protein of unknown function [Streptomyces cattleya NRRL
8057 = DSM 46488]
gi|365807185|gb|AEW95401.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
Length = 613
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 70/171 (40%), Gaps = 21/171 (12%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK-ILDCIGSCP-GFEDGEFESSKLMRP 321
L FPG +GN L +SDS H ++ +G+ ++ IGS G DG + ++ P
Sbjct: 181 LRFPGKALLLPTGN-LLVSDSTRHALVELAPDGETVVRRIGSGERGLADGPADRARFSEP 239
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFER 381
D + + + D+ NHA+RR D+ + TV +G ++ W G R
Sbjct: 240 QGLALLPDGESVIVADTVNHALRRVDLASGEVTTV---AGTGRQ----WWQGSPVAGPAR 292
Query: 382 DNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKEA 432
+ +L PW + DD + I LW D A + A
Sbjct: 293 E----------VALSSPWDVAWY-DDRVWIAMAGVHQLWTYDPADATVAVA 332
>gi|428183007|gb|EKX51866.1| hypothetical protein GUITHDRAFT_84912 [Guillardia theta CCMP2712]
Length = 351
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 4/72 (5%)
Query: 279 LFLSDSNHHRI---IVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYI 335
L++ D ++HRI + DG+ L G+ PGF DG++ SS +P D LY+
Sbjct: 41 LYVGDRDNHRIRKVSMRDGSTTTLAGTGT-PGFADGDWNSSSFNQPCGVSTSPDGQWLYV 99
Query: 336 VDSENHAIRRAD 347
DS N+ +R+ +
Sbjct: 100 ADSGNNRVRKVN 111
>gi|329908044|ref|ZP_08274804.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
IMCC9480]
gi|327546777|gb|EGF31711.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
IMCC9480]
Length = 642
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 5/83 (6%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG---SCPGFEDGEFESSKLMRPA 322
+PG + D +GNR +++D N+++I +G + G S PG DG +++ P
Sbjct: 22 YPGSMVVDSAGNR-YIADQNNNQIRKVRPDGSVSTLAGASASTPGAVDGRGAAARFYSPE 80
Query: 323 ASFYHKDDDCLYIVDSENHAIRR 345
S LY+ DS NH IR+
Sbjct: 81 -SIAIDRAGMLYVADSVNHTIRK 102
>gi|324517576|gb|ADY46859.1| RING finger protein nhl-1 [Ascaris suum]
Length = 309
Score = 45.1 bits (105), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 44/91 (48%), Gaps = 12/91 (13%)
Query: 254 HACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGE 312
H SS R + P + DE+G ++ D ++HRI VFD NG+ + GS PG
Sbjct: 134 HGPSSGR---FNNPWGVCVDETG-IIYAVDKDNHRIQVFDSNGQFVSKFGSMGPG----- 184
Query: 313 FESSKLMRPAASFYHKDDDCLYIVDSENHAI 343
+L P YHK LY+ DS NH +
Sbjct: 185 --PGQLHNPQFIAYHKQTQHLYVTDSSNHRV 213
>gi|332025715|gb|EGI65873.1| RING finger protein nhl-1 [Acromyrmex echinatior]
Length = 1199
Score = 45.1 bits (105), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 51/92 (55%), Gaps = 13/92 (14%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
+C S R L H P I+ + NR+ +SD N+HR+ +FD NG++L GS G ++G+F
Sbjct: 1070 SCGSGRGQLEH-PHYIAVSNT-NRVIVSDGNNHRVQIFDVNGRVLTSFGS-EGSDEGQF- 1125
Query: 315 SSKLMRPAASFYHKDDDCLYIV--DSENHAIR 344
K R A DD YI+ DS N+ I+
Sbjct: 1126 --KFPRGVAV-----DDQGYIIVADSGNNRIQ 1150
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 46/114 (40%), Gaps = 29/114 (25%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCL--- 333
N + ++DS++HR+ VFDGNG + G+ G +GEF DCL
Sbjct: 949 NSIVVADSSNHRVQVFDGNGNFMKEFGTY-GSNEGEF-----------------DCLAGV 990
Query: 334 -------YIV-DSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
YI+ D NH I+ D R L + N W + LGF
Sbjct: 991 AVNRIGQYIIADRYNHRIQVLDPSGRFLRAFGSQGTADGRFNYPWGITTDALGF 1044
>gi|86610239|ref|YP_479001.1| NHL repeat-containing protein [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86558781|gb|ABD03738.1| NHL repeat protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 649
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 38/170 (22%), Positives = 70/170 (41%), Gaps = 27/170 (15%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P ++ D+ GN L+++D HR+ G++ G+ PG+ DG ++ +
Sbjct: 504 PTALAVDKEGN-LWIADG--HRLRRLSAGGRVTTLSGAEPGYRDGPLAEARFQTLGGLAF 560
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTK 386
+ L++ D +NH IRR G +V S ++ +N W +D T
Sbjct: 561 DAAGN-LWVADRDNHRIRRLQPGGQV-------STLAGQNQPGW----------QDGPTS 602
Query: 387 SEKLDPQSLIFPWHLMKSEDDNLLIINRSFETL-WIMDLASGEIKEAVKG 435
+ D P L D ++++++ L I +ASG + G
Sbjct: 603 VARFD-----QPAGLAVLPDGSVVVVDAGLPGLRRISRVASGPVTPTPGG 647
>gi|291000698|ref|XP_002682916.1| dual specificity protein phosphatase [Naegleria gruberi]
gi|284096544|gb|EFC50172.1| dual specificity protein phosphatase [Naegleria gruberi]
Length = 1502
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 46/83 (55%), Gaps = 5/83 (6%)
Query: 276 GNRLFLSDSNHHRIIVFDGNGKILDCIGS--CPGFEDGEFESSKLMRPAASFYHKDDDCL 333
G+ +F+SD+ +HR++ + NG + + G+ GF DG+ ++S L P F H ++ L
Sbjct: 452 GDNVFVSDTGNHRVVRIN-NGFVTNAAGNPRTAGFMDGKVQNSILNSPTDLFSH--ENTL 508
Query: 334 YIVDSENHAIRRADMGRRVLETV 356
YI D N IR+ V+ TV
Sbjct: 509 YINDFGNSRIRQVFANYSVVRTV 531
>gi|290980410|ref|XP_002672925.1| predicted protein [Naegleria gruberi]
gi|284086505|gb|EFC40181.1| predicted protein [Naegleria gruberi]
Length = 536
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 60/140 (42%), Gaps = 28/140 (20%)
Query: 279 LFLSDSNHHRIIVFDGNGKILDCI-GSCPGFEDGE---FESSKLMRPAASFYHKDDDCLY 334
+++SDS +H I DGN ++ GS +DG S+KL P +K++ LY
Sbjct: 79 VYMSDSTYHVIRKVDGNTGVISVFAGSYASGDDGNNIVATSAKLNAPTGMVLNKNETWLY 138
Query: 335 IVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQS 394
V S N +R D+ + T+ G S N+S DP S
Sbjct: 139 FVSSSN-CVRGVDLTTGNISTIVGKCGSSGYNDS---------------------TDPYS 176
Query: 395 LIFPWHL--MKSEDDNLLII 412
++F L S+DDNL+ I
Sbjct: 177 VLFASTLSVALSDDDNLMFI 196
>gi|445064338|ref|ZP_21376408.1| NHL repeat containing protein [Brachyspira hampsonii 30599]
gi|444504298|gb|ELV04990.1| NHL repeat containing protein [Brachyspira hampsonii 30599]
Length = 700
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 51/102 (50%), Gaps = 9/102 (8%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
S V N L P ++ D+ GN L++SD+ ++RI VFD G L G G +GEF
Sbjct: 250 GVSGVSNGQLLGPSSVALDKDGN-LYVSDTGNNRIAVFDIYGNFLFSFGKL-GENEGEFF 307
Query: 315 SSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
S PA +D +Y+ D N +++ D+ ++T+
Sbjct: 308 S-----PAGI--AVNDQYIYVADMGNKRVQQFDLSGNYIQTI 342
>gi|291452573|ref|ZP_06591963.1| NHL repeat containing protein [Streptomyces albus J1074]
gi|291355522|gb|EFE82424.1| NHL repeat containing protein [Streptomyces albus J1074]
Length = 613
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 81/191 (42%), Gaps = 26/191 (13%)
Query: 221 AVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLF 280
A+E+L+ + + ++ L+ V EP A + L FPG SGN
Sbjct: 139 AIEKLVEELEAEHAAKGTLRR--GDGPYVAPEPVAGA------LRFPGRAVLLPSGN-FL 189
Query: 281 LSDSNHHRIIVFDGNGK-ILDCIGSCP--GFEDGEFESSKLMRPAASFYHKDDDCLYIVD 337
+SD+ H+++ +G+ ++ IG G DG ++ P + D + + D
Sbjct: 190 VSDTTRHQLVELAEDGESVVRRIGGEGERGLADGGPAEARFSEPQGLALTPEGDAVIVAD 249
Query: 338 SENHAIRRADMGRRVLETVY----------PTSGISKKNN--SLW--AWIMEKLGFERDN 383
+ NHA+RR ++ V+ T PTSG +++ + S W AW KL
Sbjct: 250 TVNHALRRVELASGVVTTPAGTGRQWWQGSPTSGPAREVDLSSPWDVAWFGGKLWIAMAG 309
Query: 384 DTKSEKLDPQS 394
+ DPQ+
Sbjct: 310 THQLWTYDPQA 320
>gi|336180276|ref|YP_004585651.1| NHL repeat containing protein [Frankia symbiont of Datisca
glomerata]
gi|334861256|gb|AEH11730.1| NHL repeat containing protein [Frankia symbiont of Datisca
glomerata]
Length = 398
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 3/86 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFE-DGEFESSKLMRP 321
L+ P + D GN +++SDS++HRI D G I G+ GF DG ++ +
Sbjct: 135 LNSPATTAVDSDGN-VYISDSHNHRIRKIDPLGIITTIAGTGTAGFSGDGGPATAAQLNE 193
Query: 322 AASFYHKDDDCLYIVDSENHAIRRAD 347
DD +YI D EN IR+ D
Sbjct: 194 PYGLAVATDDSIYISDYENQRIRKID 219
>gi|421740691|ref|ZP_16178929.1| NHL repeat protein [Streptomyces sp. SM8]
gi|406690906|gb|EKC94689.1| NHL repeat protein [Streptomyces sp. SM8]
Length = 444
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 64/148 (43%), Gaps = 18/148 (12%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK-ILDCIGSCP--GFEDGEFESSKLMR 320
L FPG SGN +SD+ H+++ +G+ ++ IG G DG ++
Sbjct: 5 LRFPGRAVLLPSGN-FLVSDTTRHQLVELAEDGESVVRRIGGEGERGLADGGPAEARFSE 63
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVY----------PTSGISKKNN--S 368
P + D + + D+ NHA+RR ++ V+ T PTSG +++ + S
Sbjct: 64 PQGLALTPEGDAVIVADTVNHALRRVELASGVVTTPAGTGRQWWQGSPTSGPAREVDLSS 123
Query: 369 LW--AWIMEKLGFERDNDTKSEKLDPQS 394
W AW KL + DPQ+
Sbjct: 124 PWDVAWFGGKLWIAMAGTHQLWTYDPQA 151
>gi|290990712|ref|XP_002677980.1| predicted protein [Naegleria gruberi]
gi|284091590|gb|EFC45236.1| predicted protein [Naegleria gruberi]
Length = 1555
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 53/99 (53%), Gaps = 8/99 (8%)
Query: 262 LLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFE-DGEFES-SK 317
L+L+ P I A S +++SDS +HRII ++ + C GF+ DG + +K
Sbjct: 458 LVLNSPKGIFATTS--YVYISDSGNHRIIKVHRTSGVVIGVAGCGTSGFDGDGGLATLAK 515
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
L P + D++ LY+ D+ NH IR ++ +++T+
Sbjct: 516 LNSPTGIYL--DNNVLYVADTNNHRIRAVNITSGIIQTI 552
>gi|451946198|ref|YP_007466793.1| NHL repeat protein [Desulfocapsa sulfexigens DSM 10523]
gi|451905546|gb|AGF77140.1| NHL repeat protein [Desulfocapsa sulfexigens DSM 10523]
Length = 282
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 43/79 (54%), Gaps = 8/79 (10%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGSCPGFEDGEFESSKLMRPAASF 325
P + AD SG F++D+N+HRI+ FD K+++ G PG E EF LM
Sbjct: 108 PTDVVADSSGKWCFVADNNNHRILQFDIATKKLINSYGK-PGAEKWEFRYPFLMH----- 161
Query: 326 YHKDDDCLYIVDSENHAIR 344
H+D D LYIVD N ++
Sbjct: 162 LHRDKD-LYIVDVINTRVQ 179
>gi|290978674|ref|XP_002672060.1| predicted protein [Naegleria gruberi]
gi|284085634|gb|EFC39316.1| predicted protein [Naegleria gruberi]
Length = 1219
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 47/97 (48%), Gaps = 7/97 (7%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDG---EFESSKLMR 320
L+ P I D + +F+SDS +HRI NGKI G+ +G E + +KL
Sbjct: 137 LNTPSGIVIDPNNGEIFISDSKNHRIRKILQNGKITTIAGTGEAGYNGDGIEAKFAKLYL 196
Query: 321 PAA-SFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P Y K+ L+I D NH IR+ + ++ T+
Sbjct: 197 PNGIDLYEKE---LFIADQNNHRIRKVSLDTGLISTI 230
Score = 43.5 bits (101), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 3/85 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGF--EDGEFESSKLMR 320
L+ P I D + ++++DSN+HRI NGKI G+ G+ +D +S++L
Sbjct: 80 LNNPVGIYVDTNSREVYIADSNNHRIRKILQNGKITTIAGTGIAGYNGDDKSADSAQLNT 139
Query: 321 PAASFYHKDDDCLYIVDSENHAIRR 345
P+ ++ ++I DS+NH IR+
Sbjct: 140 PSGIVIDPNNGEIFISDSKNHRIRK 164
>gi|294630411|ref|ZP_06708971.1| NHL repeat protein [Streptomyces sp. e14]
gi|292833744|gb|EFF92093.1| NHL repeat protein [Streptomyces sp. e14]
Length = 609
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 45/97 (46%), Gaps = 4/97 (4%)
Query: 253 PHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK-ILDCIGS-CPGFED 310
P+ L FPG SGN +SD+ H+++ +G+ ++ IGS GF D
Sbjct: 171 PYVAPEPEPTTLRFPGKALLLPSGN-FLVSDTTRHQLVELAADGESVVRRIGSGARGFAD 229
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRAD 347
G E + P DD + + D+ NHA+RR D
Sbjct: 230 GAAEDAAFNEPQ-GLALLDDGAVVVADTVNHALRRFD 265
>gi|374855491|dbj|BAL58347.1| NHL repeat containing protein [uncultured candidate division OP1
bacterium]
Length = 325
Score = 44.3 bits (103), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 57/119 (47%), Gaps = 19/119 (15%)
Query: 235 SPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDG 294
+P W KQ + +A +V + D GN ++++D+ +HR+ FD
Sbjct: 25 TPPVFVTQWGKQGSGNGQFNALEAV-----------AVDAQGN-VYVADTENHRVQKFDT 72
Query: 295 NGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVL 353
NGK L GS G +G+FES P ++++ +Y+ D+ NH I++ D + L
Sbjct: 73 NGKFLLTWGS-KGVNNGQFES-----PGGIALDRENN-VYVADTFNHRIQKFDATGKFL 124
Score = 42.4 bits (98), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 8/74 (10%)
Query: 272 ADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDD 331
A +S RL+++D HHR+ VFD G ++ G G GEF+ K + +F
Sbjct: 256 AIDSSQRLYVTDQGHHRVQVFDTRGNLITLWGK-EGSGPGEFQFLKGIAIYQNF------ 308
Query: 332 CLYIVDSENHAIRR 345
+YI DS+NH +++
Sbjct: 309 -VYIADSDNHRVQK 321
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 47/90 (52%), Gaps = 8/90 (8%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAA 323
+FP I+ D+ GN L+++D+ +HR+ FD GK L GS G DG+F + A
Sbjct: 155 FNFPIGIAFDKDGN-LYVTDAFNHRVQKFDPTGKFLGSFGSF-GSGDGQFNVT------A 206
Query: 324 SFYHKDDDCLYIVDSENHAIRRADMGRRVL 353
+ LY+ D++N +++ D R L
Sbjct: 207 GIAIDTEGNLYVSDNKNDRVQKFDPTGRFL 236
>gi|251771416|gb|EES51995.1| conserved protein of unknown function [Leptospirillum
ferrodiazotrophum]
Length = 373
Score = 44.3 bits (103), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 44/90 (48%), Gaps = 4/90 (4%)
Query: 269 CISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGSC-PGFEDGEFESSKLMRPAASFY 326
IS+D G L+++DS ++RI D +G + G PG DG ++ P A
Sbjct: 121 AISSD--GKTLYIADSRNNRIRKMDLSSGAVTTLAGRAFPGSNDGTGGAAGFYGPRALAL 178
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETV 356
D LYI DS N+ IR+ D+ + TV
Sbjct: 179 TPDGKTLYISDSGNNMIRKLDVATGTVTTV 208
>gi|290995061|ref|XP_002680150.1| predicted protein [Naegleria gruberi]
gi|284093769|gb|EFC47406.1| predicted protein [Naegleria gruberi]
Length = 880
Score = 44.3 bits (103), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 82/190 (43%), Gaps = 37/190 (19%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRI-IVFDGNGKILDCIGSCPGFEDGEFE-SSKL 318
N L+FP +S D ++++DS ++RI V +G + GS DGE ++KL
Sbjct: 426 NAKLYFPRGVSVDPFKGDVYIADSYNNRIRKVSNGFISTIAGTGSAGFTGDGELAIAAKL 485
Query: 319 MRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLG 378
P + Y +YI+D+ N IR + PT+G ++ +I G
Sbjct: 486 DTPYSVAYSNTSGLVYILDTNNARIRN----------IKPTTGF------IYTYIGGTRG 529
Query: 379 FERDNDTKSEKLDPQSLI-FPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKEAVKGFS 437
+ D + + Q+ I F + + SE D+L+ I D SG A++ S
Sbjct: 530 YSGD----GQVVGSQTQIDFSYGISLSEKDDLV----------IADTFSG----AIRVVS 571
Query: 438 KVLEICGVLV 447
K +I LV
Sbjct: 572 KATDIISTLV 581
>gi|116620267|ref|YP_822423.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223429|gb|ABJ82138.1| NHL repeat containing protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 380
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 61/136 (44%), Gaps = 10/136 (7%)
Query: 237 SGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGN-RLFLSDSNH-HRIIVFDG 294
SG TWA E P + + L+ P I++D GN L L + N +R+ V
Sbjct: 213 SGKIETWAGTGERKPTPDG-APIAGTPLNGPRAITSDPEGNLYLVLREGNAVYRMDVR-- 269
Query: 295 NGKILDCIGSCPGFEDGEFESSKLMRPAA--SFYHKDDDCLYIVDSENHAIRRADMGRRV 352
GKI G+ G+ +KL + + D LY+ D+E+H IRR D+ V
Sbjct: 270 AGKIFHVAGTGESGYSGDGGPAKLAKLSGPKGIAWAPDGSLYLADTESHTIRRVDLKSGV 329
Query: 353 LETVYPTSGISKKNNS 368
+ TV +G K+ +
Sbjct: 330 ITTV---AGTGKRGDG 342
>gi|144899324|emb|CAM76188.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense
MSR-1]
Length = 477
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 69/180 (38%), Gaps = 23/180 (12%)
Query: 253 PHACSSVRNL--LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFED 310
P C V + L FPG I G L+DS HH++++ D +G + GS D
Sbjct: 159 PTPCQPVSDPGGRLRFPGKIKRGADG-VWALADSGHHQVVLLDDHGGEIARFGSGLAGRD 217
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLW 370
++S + D +Y+ D+ NH IRR D SG + +
Sbjct: 218 DGIDASFNGPEGVAI---DRAFIYVADTRNHLIRRIDR----------QSGGDRHHRRAQ 264
Query: 371 AWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIK 430
W LG + +L PW M+ D +L N L +DL +G I+
Sbjct: 265 GWRGGTLG------PGGARGWETALASPWD-MEVHDGSLYFANAGSHQLGALDLDTGMIR 317
>gi|321468757|gb|EFX79740.1| hypothetical protein DAPPUDRAFT_304337 [Daphnia pulex]
Length = 1319
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/72 (44%), Positives = 42/72 (58%), Gaps = 11/72 (15%)
Query: 275 SGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLY 334
S NR+ +SDSN+HRI VFD NGK++ G G ++G+F K R A DD Y
Sbjct: 1208 STNRVIVSDSNNHRIQVFDVNGKVIASFGG-EGADEGQF---KFPRGVAV-----DDQGY 1258
Query: 335 IV--DSENHAIR 344
IV DS N+ I+
Sbjct: 1259 IVVADSGNNRIQ 1270
>gi|290977893|ref|XP_002671671.1| predicted protein [Naegleria gruberi]
gi|284085242|gb|EFC38927.1| predicted protein [Naegleria gruberi]
Length = 3330
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 51/100 (51%), Gaps = 4/100 (4%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLM 319
N L+ P I SG+ L +SDS+++RI + D G I G+ GF+DG F +
Sbjct: 2435 NASLNNPYGIVETYSGD-LIISDSDNNRIRLIDRYGIITTLAGTGEKGFQDGFFLDALFN 2493
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
P+ FY LYI D+ IR A++ + ++T+ T
Sbjct: 2494 NPSQLFYFHSR--LYISDTYGQRIREANLESKTVKTIIGT 2531
>gi|153868997|ref|ZP_01998705.1| protein kinase-like protein [Beggiatoa sp. PS]
gi|152074439|gb|EDN71293.1| protein kinase-like protein [Beggiatoa sp. PS]
Length = 1432
Score = 43.9 bits (102), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 5/85 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE---SSKLMR 320
L P ++ D GN L+++D+N+HRI D +G I +G+ G+ E +++L +
Sbjct: 467 LSNPEGLAIDAQGN-LYIADTNNHRIRKIDSDGIITTVVGTGEAGYAGDNEFAIAAQLKK 525
Query: 321 PAASFYHKDDDCLYIVDSENHAIRR 345
P A + + YI DS N++IR+
Sbjct: 526 PTAIVFDHNGH-FYIADSGNNSIRK 549
>gi|284097436|ref|ZP_06385541.1| NHL repeat containing protein [Candidatus Poribacteria sp. WGA-A3]
gi|283831050|gb|EFC35055.1| NHL repeat containing protein [Candidatus Poribacteria sp. WGA-A3]
Length = 387
Score = 43.9 bits (102), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 50/94 (53%), Gaps = 6/94 (6%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPG---FEDGEFESS-KLM 319
L P ++ D+ GN L+++D+ RI +D + ++ + G F GE ESS L
Sbjct: 241 LAGPSGLAVDQEGN-LYIADTFSGRIRKWDRHTGTVETVAGGTGAFQFTPGENESSPNLS 299
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVL 353
RP A H D L+I DS+NH IR D+ +R +
Sbjct: 300 RPYAIALHPDG-RLFITDSDNHLIRVWDLQKREM 332
Score = 40.0 bits (92), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 50/102 (49%), Gaps = 8/102 (7%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGS----CPGFEDGEFESSKL 318
L+FP ++ E G ++++D+ +HRI D G G I G+ C G ++G + L
Sbjct: 127 LNFPSAVAVAEDGT-VYIADTWNHRIRRVDPGTGAISTIAGTGQAKCYG-DNGPAVKAAL 184
Query: 319 MRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS 360
P A D LYI D N+ +R+ D+ V+ TV T
Sbjct: 185 NEPVA-LALDGPDRLYIADQSNNRVRKLDLTSGVMTTVVGTG 225
>gi|374849371|dbj|BAL52388.1| PKD domain protein [uncultured candidate division OP1 bacterium]
gi|374857216|dbj|BAL60069.1| hypothetical protein HGMM_OP4C705 [uncultured candidate division
OP1 bacterium]
Length = 635
Score = 43.9 bits (102), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 42/82 (51%), Gaps = 7/82 (8%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P I+ D +GN +++ DS +HRI FD GK L G+ G DG+FE+ P
Sbjct: 392 PTDIAVDAAGN-VYVVDSGNHRIQKFDSTGKFLGKWGTR-GSGDGQFET-----PIGIAL 444
Query: 327 HKDDDCLYIVDSENHAIRRADM 348
+Y+ D NH I++ D+
Sbjct: 445 DGSGKFIYVADKGNHRIQKFDI 466
>gi|406947628|gb|EKD78528.1| NHL repeat containing protein [uncultured bacterium]
Length = 810
Score = 43.9 bits (102), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 79/172 (45%), Gaps = 27/172 (15%)
Query: 260 RNLLLHFPGCISADESGNRLFLSDSNHHRI-IVFDGNGKILDCIG--SCPGFEDGEFESS 316
++ + P ++ D + N L+++D ++H I V +G++ +G S G+ DG+F +
Sbjct: 566 KDAQFNVPIAVTIDNADNYLYVADRDNHTIRQVRISDGQVTTLVGNPSKKGYLDGKFAEA 625
Query: 317 KLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEK 376
L P FY++ + +Y ++ H +R AD G ++ V G NN
Sbjct: 626 YLNLPVEVFYNRGN--VYFSEAGTHMVRVADRGDEAVKLV-AGDGNRGYNNG-------- 674
Query: 377 LGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGE 428
+RDN + D P L + + DNLLI + + + +DL GE
Sbjct: 675 ---DRDN----TQFDN-----PVGLAR-KGDNLLIADSLNDLIRKVDLGDGE 713
>gi|290975813|ref|XP_002670636.1| predicted protein [Naegleria gruberi]
gi|284084197|gb|EFC37892.1| predicted protein [Naegleria gruberi]
Length = 2442
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 5/88 (5%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGF--EDGEFESSK 317
N +L+ P + + G +++SD+ +HR+ D NG I GS GF + G ++K
Sbjct: 883 NAVLNSPESLFVNSKG-EIYVSDTVNHRVRKIDLNGTITTVAGSGLAGFYGDGGLATNAK 941
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRR 345
L P + K+ D +YI D NH IR+
Sbjct: 942 LSYPMGIYVSKNGD-IYIADYGNHRIRK 968
>gi|261404592|ref|YP_003240833.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
Y412MC10]
gi|261281055|gb|ACX63026.1| copper amine oxidase domain protein [Paenibacillus sp. Y412MC10]
Length = 1280
Score = 43.5 bits (101), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 44/94 (46%), Gaps = 17/94 (18%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGSCP--------------GFEDG 311
P ++ D GN L++SDS +H I D G G + GS P G+ DG
Sbjct: 264 PAGLALDGKGN-LYVSDSGNHAIRYIDFGKGTVSTAAGSVPSSDGYAKDALYADPGYLDG 322
Query: 312 EFESSKLMRPAASFYHKDDDCLYIVDSENHAIRR 345
++++ P + +D L I DS NHA+R+
Sbjct: 323 AADAARFNSPRGLAWSVEDGLL-IADSHNHAVRQ 355
>gi|373488690|ref|ZP_09579354.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
gi|372005635|gb|EHP06271.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
Length = 438
Score = 43.5 bits (101), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 3/107 (2%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRPAAS 324
P ++ D SGN L+++D +H I D G + G G DG+ S++ P
Sbjct: 180 PEGLALDASGN-LYVADYGNHTIRKIDTLGAVTTFAGQAEASGTVDGDRLSARFNHPIGL 238
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWA 371
++ LY+ DS NH IR ++ + + T T+G ++S+ +
Sbjct: 239 AFNASYSVLYVADSGNHTIRAINIKSQTVSTFAGTAGSPGTSSSMLS 285
>gi|428179226|gb|EKX48098.1| hypothetical protein GUITHDRAFT_69001 [Guillardia theta CCMP2712]
Length = 334
Score = 43.5 bits (101), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 52/112 (46%), Gaps = 7/112 (6%)
Query: 235 SPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDG 294
SP G T A + K+ A + L ++P ++ D G+ +F+SD +H I DG
Sbjct: 129 SPDGTVTTIAGSKKGFKDGPAGKA----LFNYPAYVAVDSKGS-IFVSDFGNHCIRKIDG 183
Query: 295 NGKILDCIGSCP-GFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRR 345
G + G+ G+ DG+ ++ P K +D +YI D N IR+
Sbjct: 184 EGMVTTVAGNGKMGWADGKGAKARFNSPQGMCIDK-EDTVYIADYGNQRIRK 234
>gi|157134133|ref|XP_001663162.1| tripartite motif protein trim2,3 [Aedes aegypti]
gi|108881414|gb|EAT45639.1| AAEL003104-PA [Aedes aegypti]
Length = 1293
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 11/70 (15%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYI- 335
NR+ +SDSN+HRI +FD NG++L GS G ++G+F K R A DD YI
Sbjct: 1184 NRVIVSDSNNHRIQIFDVNGRVLTTFGS-EGSDEGQF---KFPRGVAV-----DDQGYIC 1234
Query: 336 -VDSENHAIR 344
DS N+ I+
Sbjct: 1235 VADSGNNRIQ 1244
>gi|290991506|ref|XP_002678376.1| NHL repeat domain-containing protein [Naegleria gruberi]
gi|284091988|gb|EFC45632.1| NHL repeat domain-containing protein [Naegleria gruberi]
Length = 985
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 75/161 (46%), Gaps = 25/161 (15%)
Query: 202 FGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAK---QAEVLKEPHACSS 258
FGNA F E SL+I + ++ MQ +GL T A Q+ E + +S
Sbjct: 224 FGNAFYFAETSLNI------IRKVDMQ--------TGLLTTVAGIQGQSGDSGEGYLATS 269
Query: 259 VRNLLLHFPGCISADESGNRLFLSDSNHHRII-VFDGNGKILDCIGSCPGFEDGEFESSK 317
L+ P ++ + +G+ +F+SDS +++I VF NG I+ G+ G+ ++K
Sbjct: 270 A---FLNGPQSVAFNSNGD-MFISDSKNNKIKKVFFNNGTIITIAGTGSANFTGDGSNAK 325
Query: 318 ---LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLET 355
L P +YI+DS+N+ IR+ V+ T
Sbjct: 326 FATLNLPTFISVSNSTGDIYIIDSQNYRIRKITASTNVIST 366
>gi|290987760|ref|XP_002676590.1| predicted protein [Naegleria gruberi]
gi|284090193|gb|EFC43846.1| predicted protein [Naegleria gruberi]
Length = 836
Score = 43.1 bits (100), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 43/87 (49%), Gaps = 5/87 (5%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGFEDGEFE--SSK 317
N L+ P + D++ ++++DS +HRI + NG I +GS GF +
Sbjct: 526 NAQLNKPHSVHFDKNSGNIYIADSGNHRIRMIFPNGTITTIVGSGTNGFNGDGLNPLETH 585
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIR 344
L P FY D + LYI DS NH +R
Sbjct: 586 LNLPKYVFY--DSEILYISDSFNHRVR 610
>gi|242011481|ref|XP_002426477.1| nhl repeat-containing protein, putative [Pediculus humanus corporis]
gi|212510603|gb|EEB13739.1| nhl repeat-containing protein, putative [Pediculus humanus corporis]
Length = 1324
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NG++L G+ G ++G+F K R A D +++
Sbjct: 1219 NRVIVSDSNNHRIQIFDVNGRVLSSFGN-EGSDEGQF---KFPRGVAV---DDQGYIFVG 1271
Query: 337 DSENHAIR 344
DS N+ I+
Sbjct: 1272 DSGNNRIQ 1279
>gi|157279030|gb|AAI34672.1| NHLRC2 protein [Bos taurus]
Length = 376
Score = 43.1 bits (100), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 46/101 (45%), Gaps = 4/101 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G+ F S
Sbjct: 121 NARLQHPLGVTWDQKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGNASNMIGSSFTDST 180
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYP 358
P ++ LY+ D+ NH I+ D+ + + +V+P
Sbjct: 181 FNEPGGLCIGENGQLLYVADTNNHQIKVLDLETKTV-SVFP 220
>gi|21758510|dbj|BAC05316.1| unnamed protein product [Homo sapiens]
Length = 440
Score = 43.1 bits (100), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 46/101 (45%), Gaps = 4/101 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 185 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 244
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYP 358
P ++ + LY+ D+ NH I+ D+ +++ +V P
Sbjct: 245 FNEPGGLCIGENGELLYVADTNNHQIKVMDLETKMV-SVLP 284
>gi|290996232|ref|XP_002680686.1| predicted protein [Naegleria gruberi]
gi|284094308|gb|EFC47942.1| predicted protein [Naegleria gruberi]
Length = 655
Score = 43.1 bits (100), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 54/114 (47%), Gaps = 5/114 (4%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP--GFEDGEFESSKLMRP 321
++ P +S + ++ +DS+++ I V + I+ I GF +G SSKL +P
Sbjct: 101 VNSPKGVSISQISGDIYFADSSNNLIRVMNAATNIVSTIAGTGSFGFSEGNATSSKLNKP 160
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIME 375
Y L DS N+ IR+ D+ ++ T+ T GIS+ N A I+
Sbjct: 161 TEPVYWNGQ--LLFADSLNYRIRKVDLSNGMISTLTGT-GISEFNGDGLALIVS 211
>gi|158286215|ref|XP_308626.4| AGAP007135-PA [Anopheles gambiae str. PEST]
gi|157020362|gb|EAA04156.5| AGAP007135-PA [Anopheles gambiae str. PEST]
Length = 1351
Score = 43.1 bits (100), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 48/91 (52%), Gaps = 13/91 (14%)
Query: 256 CSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFES 315
C + L H P I+ + NR+ +SDSN+HR+ +FD NG++L G G E+G+F
Sbjct: 1223 CGKEKGQLEH-PHYIAVSNT-NRVIVSDSNNHRVQIFDVNGRVLTTFGG-EGSEEGQF-- 1277
Query: 316 SKLMRPAASFYHKDDDCLYI--VDSENHAIR 344
K R A DD YI DS N+ I+
Sbjct: 1278 -KFPRGVAV-----DDQGYICVADSGNNRIQ 1302
>gi|260821256|ref|XP_002605949.1| hypothetical protein BRAFLDRAFT_132236 [Branchiostoma floridae]
gi|229291286|gb|EEN61959.1| hypothetical protein BRAFLDRAFT_132236 [Branchiostoma floridae]
Length = 446
Score = 43.1 bits (100), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 41/88 (46%), Gaps = 3/88 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP---GFEDGEFESSKLMR 320
+ P + D NR+F++D+++H I ++ + P G +G+ ++K
Sbjct: 187 FYHPQGLYLDAKQNRMFVADTDNHVIREMSMPDAVVKTVAGTPKEKGLVNGQGRAAKFYH 246
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRADM 348
P Y D LY+ D NHAIR ++
Sbjct: 247 PTQMAYDPYTDILYVSDHFNHAIRTINL 274
>gi|402571882|ref|YP_006621225.1| hypothetical protein Desmer_1347 [Desulfosporosinus meridiei DSM
13257]
gi|402253079|gb|AFQ43354.1| conserved repeat protein [Desulfosporosinus meridiei DSM 13257]
Length = 2025
Score = 43.1 bits (100), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 46/79 (58%), Gaps = 8/79 (10%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P I+ D +GN +++SD ++RI F +G+ L+ GS PG + G+F LM S
Sbjct: 526 PNGIAVDSAGN-VYVSDGGNYRIQKFSSSGEFLEKWGS-PGVDAGQF----LM--PGSVM 577
Query: 327 HKDDDCLYIVDSENHAIRR 345
++ +Y+VDS N+ I++
Sbjct: 578 VDGNNSIYVVDSGNNRIQK 596
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 53/113 (46%), Gaps = 9/113 (7%)
Query: 243 WAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCI 302
++ + E L + V N+ P ++ D SGN +++ DS + RI FD +G L
Sbjct: 908 YSAEGEFLAK-WGSQGVGNVQFGAPQDVATDSSGN-VYVVDSGNKRIQKFDSSGSFLKKW 965
Query: 303 GSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLET 355
GS G + GEF RP + + +Y+ DS NH I++ + + T
Sbjct: 966 GSN-GSDQGEFN-----RPQGIAVDSEGN-IYVADSNNHRIQKFNAAGAFMTT 1011
>gi|241998856|ref|XP_002434071.1| nhl repeat-containing protein, putative [Ixodes scapularis]
gi|215495830|gb|EEC05471.1| nhl repeat-containing protein, putative [Ixodes scapularis]
Length = 533
Score = 42.7 bits (99), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 41/70 (58%), Gaps = 11/70 (15%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
NR+ +SDSN+HRI +FD NG+ L GS G +DG+F K R A DD YI+
Sbjct: 424 NRVIVSDSNNHRIQIFDVNGRSLSTFGS-EGSDDGQF---KFPRGVAV-----DDQGYIM 474
Query: 337 --DSENHAIR 344
DS N+ I+
Sbjct: 475 VGDSGNNRIQ 484
>gi|31874765|emb|CAD98082.1| hypothetical protein [Homo sapiens]
Length = 252
Score = 42.7 bits (99), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 154 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 213
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ + LY+ D+ NH I+ D+ +++ +
Sbjct: 214 FNEPGGLCIGENGELLYVADTNNHQIKVMDLETKMVSVI 252
>gi|326429396|gb|EGD74966.1| hypothetical protein PTSG_12546 [Salpingoeca sp. ATCC 50818]
Length = 1448
Score = 42.7 bits (99), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 79/190 (41%), Gaps = 31/190 (16%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDG-NGKILDCIGSCPGFEDGEFESSK---LM 319
L+ P C++ D +GN +++S+ HR+ V D +G + +G+ G SS+ +
Sbjct: 1178 LNRPTCVALDAAGN-VYISEEGSHRVSVVDATSGDLSVLVGTGHSGHRGMGGSSRAADIH 1236
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
P D LY+ D+ENH + D+ R L +++A + G+
Sbjct: 1237 TPRGLAVDNDAKVLYVADAENHVVYSVDL--RTLSI------------NVYAGTPNQRGY 1282
Query: 380 ERDNDTKSEKLDPQSLIF---PWHLMKSEDDN-LLIINRSFETLWIMDLASGEIKE-AVK 434
DN P L F P L S N L I + + ++DL SG I A
Sbjct: 1283 RGDN-------GPAVLGFLNGPTALALSASSNRLYIADTDNNRIRLVDLDSGVIDTIAGS 1335
Query: 435 GFSKVLEICG 444
G S ++ G
Sbjct: 1336 GLSGASDVVG 1345
>gi|290970020|ref|XP_002668012.1| predicted protein [Naegleria gruberi]
gi|284081028|gb|EFC35268.1| predicted protein [Naegleria gruberi]
Length = 243
Score = 42.7 bits (99), Expect = 0.74, Method: Composition-based stats.
Identities = 26/87 (29%), Positives = 53/87 (60%), Gaps = 6/87 (6%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGF--EDGEFESSKLMR 320
L++P + + N + ++D+ +HRI + NG+I+ G+ PG+ ++G +++L R
Sbjct: 87 LYYPQSVFV-STNNEICIADTFNHRIRKIE-NGRIVTIAGNGQPGYSGDNGPATTAQLHR 144
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRAD 347
P + F +++ +YI D+ NH+IR+ D
Sbjct: 145 PYSVFVSANNE-VYIADTFNHSIRKID 170
Score = 42.4 bits (98), Expect = 1.1, Method: Composition-based stats.
Identities = 29/93 (31%), Positives = 51/93 (54%), Gaps = 7/93 (7%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS----CPGFEDGEFESSKLM 319
L++P + + N +F++D N++RI NG I+ G+ C G ++G SS+L
Sbjct: 31 LNYPSGVFV-STNNEVFIADKNNNRIRKIVKNGNIVLIAGNGQTGCGG-DNGSATSSQLY 88
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRV 352
P + F +++ + I D+ NH IR+ + GR V
Sbjct: 89 YPQSVFVSTNNE-ICIADTFNHRIRKIENGRIV 120
>gi|196003538|ref|XP_002111636.1| hypothetical protein TRIADDRAFT_23736 [Trichoplax adhaerens]
gi|190585535|gb|EDV25603.1| hypothetical protein TRIADDRAFT_23736 [Trichoplax adhaerens]
Length = 689
Score = 42.7 bits (99), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 41/157 (26%), Positives = 64/157 (40%), Gaps = 37/157 (23%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P C++ D GN + +SD+N++R+ VF G + +GS G E GEF P
Sbjct: 477 PSCLTLDIDGN-VIVSDTNNNRVQVFTRKGIFIKSLGS-GGTEPGEFHD-----PHGITI 529
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTK 386
KD +++VDS NH I+ V+ + G + G + DN
Sbjct: 530 DKDGR-IFVVDSGNHRIQ-----------VFSSDG----------EYLYHFGAKGDN--- 564
Query: 387 SEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMD 423
P L PW + N+LI + + + D
Sbjct: 565 -----PGQLGLPWGIAIDHSGNILITDYKNHRIQVFD 596
>gi|21618524|gb|AAH32598.1| NHLRC2 protein [Homo sapiens]
gi|325464307|gb|ADZ15924.1| NHL repeat containing 2 [synthetic construct]
Length = 367
Score = 42.7 bits (99), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 3/99 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK---ILDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N L+++DS +H+I V D K L G F S
Sbjct: 112 NAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTEST 171
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P ++ + LY+ D+ NH I+ D+ +++ +
Sbjct: 172 FNEPGGLCIGENGELLYVADTNNHQIKVMDLETKMVSVL 210
>gi|290985160|ref|XP_002675294.1| predicted protein [Naegleria gruberi]
gi|284088889|gb|EFC42550.1| predicted protein [Naegleria gruberi]
Length = 1229
Score = 42.7 bits (99), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 2/69 (2%)
Query: 273 DESGNRLFLSDSNHHRIIVFDGNGKILDCIG--SCPGFEDGEFESSKLMRPAASFYHKDD 330
D+S L+ +DSN++ + D NGKI D G G++D SS + P F H +
Sbjct: 118 DDSNGDLYFTDSNNYLVRKIDVNGKIFDIAGIPRTSGYDDRSLNSSTFVGPRQLFVHGPN 177
Query: 331 DCLYIVDSE 339
+Y+ D E
Sbjct: 178 KDIYLADGE 186
>gi|290973488|ref|XP_002669480.1| predicted protein [Naegleria gruberi]
gi|284083028|gb|EFC36736.1| predicted protein [Naegleria gruberi]
Length = 1071
Score = 42.7 bits (99), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 49/97 (50%), Gaps = 4/97 (4%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGSCPGF--EDGEFESSKLM 319
+L+ P + D S N +++ +S +I D +G I D G+ G+ ++G E +KL
Sbjct: 517 MLNRPVQVFGDYS-NMIYIVESGSSKIRKVDLQSGIISDFCGTSIGYGGDNGLPEYAKLA 575
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
PA FYH+ +YI D+ N IR+ + TV
Sbjct: 576 NPAGGFYHRSTGDIYIADTGNGRIRKISANDSTITTV 612
>gi|290986699|ref|XP_002676061.1| predicted protein [Naegleria gruberi]
gi|284089661|gb|EFC43317.1| predicted protein [Naegleria gruberi]
Length = 819
Score = 42.7 bits (99), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 41/83 (49%), Gaps = 3/83 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPG-FEDGEFESSKLMRPA 322
L+ P IS D +G ++ S++N HRI NG ++ +GS G DG +KL P
Sbjct: 449 LYNPTSISIDSNG-TIWFSEANSHRIRKLLSNGTVITVVGSTTGSTADGPALQTKLTIPN 507
Query: 323 ASFYHKDDDCLYIVDSENHAIRR 345
+ D L I D NH IR+
Sbjct: 508 GIGFDLDGSLL-IADRYNHRIRK 529
>gi|290979059|ref|XP_002672252.1| predicted protein [Naegleria gruberi]
gi|284085827|gb|EFC39508.1| predicted protein [Naegleria gruberi]
Length = 1239
Score = 42.7 bits (99), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 46/90 (51%), Gaps = 5/90 (5%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFD-GNGKILDCIGSCPGFEDGEFES--SKLM 319
LL+FP + D S L++SDSN+HR+I ++ + + G+ +GE + +L
Sbjct: 333 LLNFPQSLVMDSSYQSLYISDSNNHRVIRYNISSDTVYRVAGNGTAGYNGEGLAINCQLN 392
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMG 349
P FY ++ LYI D+ N IR G
Sbjct: 393 NPNGLFY--NNGILYISDTGNRRIRIVSNG 420
>gi|431798812|ref|YP_007225716.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
gi|430789577|gb|AGA79706.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
Length = 428
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 42/79 (53%), Gaps = 4/79 (5%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P IS ++G +L ++D ++H+I + G ++ GS GF DG +S+ P Y
Sbjct: 350 PYGISVADNG-QLVVADLSNHKIRLIQGE-NVITIAGSVAGFLDGVGVTSQFYNPTDVTY 407
Query: 327 HKDDDCLYIVDSENHAIRR 345
H D +Y+ D NH +R+
Sbjct: 408 H--DGVIYVADLGNHRVRK 424
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 39/78 (50%), Gaps = 1/78 (1%)
Query: 279 LFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDS 338
L+++D+ +HRI + +G + GS G +DG E + P+ D +Y+ D
Sbjct: 253 LYVTDNRNHRIRKIEVDGTVSTVAGSEQGNQDGALEEATFRYPSG-LDVDDMGNIYVADR 311
Query: 339 ENHAIRRADMGRRVLETV 356
NH IR+ D+ + TV
Sbjct: 312 INHLIRKIDLNAGQVSTV 329
>gi|124009517|ref|ZP_01694192.1| cell surface protein [Microscilla marina ATCC 23134]
gi|123984863|gb|EAY24831.1| cell surface protein [Microscilla marina ATCC 23134]
Length = 508
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 8/93 (8%)
Query: 255 ACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
S N L FP ++ D GN SN HRI FD +G L G+ G DG+F
Sbjct: 73 GGSGTTNGKLEFPYWLAKDSQGNIFVTDGSN-HRIQKFDKDGNYLSQFGTN-GNGDGQFN 130
Query: 315 SSKLMRPAASFYHKDDDCLYIVDSENHAIRRAD 347
+ + + AS D +Y+VD N+ +++ D
Sbjct: 131 TPRAIVIDAS------DNIYVVDFNNNRVQKFD 157
>gi|340374379|ref|XP_003385715.1| PREDICTED: tripartite motif-containing protein 71-like [Amphimedon
queenslandica]
Length = 671
Score = 42.4 bits (98), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 70/180 (38%), Gaps = 44/180 (24%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P I AD GN + + D ++HRI VFD G GS G G+F RPA
Sbjct: 406 PWGICADHKGN-IIIGDRSNHRIQVFDSKGAFSHAFGS-EGSHPGQFN-----RPAGVAV 458
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDTK 386
K+ D + + D +NH I++ L +M G ND +
Sbjct: 459 TKEGD-IVVADKDNHRIQKF---------------------QLNGKLMHHFGSRGSNDGQ 496
Query: 387 SEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEI--KEAVKGFSKVLEICG 444
+I+P+ + + D +++ + +M + G + K KG+ +CG
Sbjct: 497 --------MIYPYDVAVHQTDGRIVVTDTGNHRILMFTSDGSLLGKFGYKGY-----LCG 543
>gi|410976139|ref|XP_003994481.1| PREDICTED: NHL repeat-containing protein 2 [Felis catus]
Length = 637
Score = 42.4 bits (98), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 27/41 (65%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS 304
L FPG ++ D N+L ++D+ HHRI+V NG+I IG+
Sbjct: 169 LLFPGKVTVDHVSNKLVIADTGHHRILVVLKNGQIQYSIGA 209
>gi|421094122|ref|ZP_15554843.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
gi|410363263|gb|EKP14295.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
gi|456886734|gb|EMF97860.1| putative lipoprotein [Leptospira borgpetersenii str. 200701203]
Length = 358
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 25/97 (25%), Positives = 46/97 (47%), Gaps = 2/97 (2%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCI-GSCPGFEDGEFESSKLM 319
N L P + D N L++ + +H I + N + + + G G+ DG+ S++
Sbjct: 174 NSLFKGPLFMDLDRERN-LYVGELGNHTIRKINLNSETVSTLSGGVSGYLDGDLTSAQFK 232
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P+ Y + D L + D +NH IR+ D+ + T+
Sbjct: 233 SPSGIIYDQKTDSLLVADLQNHRIRKIDLKTSTVSTL 269
>gi|291221599|ref|XP_002730807.1| PREDICTED: tripartite motif protein 2-like, partial [Saccoglossus
kowalevskii]
Length = 686
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/133 (21%), Positives = 65/133 (48%), Gaps = 6/133 (4%)
Query: 221 AVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLF 280
++E+ +QQE K ++ +++ E + S+ +P +++ E G +
Sbjct: 383 TIQEMTKEQQELEKKQIERKAAEMRKTKLMYELND-STHEGGEFDWPSGVASTEDGEYVV 441
Query: 281 LSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSEN 340
++D ++ RI +++ +GK +C G +G+FE + + +D+ C+YI D N
Sbjct: 442 ITDRDNDRIQIYNRDGK-FECKFGSRGKRNGQFE----LPLDVAISEEDEPCVYITDEYN 496
Query: 341 HAIRRADMGRRVL 353
H +++ + R +
Sbjct: 497 HRVQKFTLYGRYM 509
>gi|288801999|ref|ZP_06407440.1| conserved hypothetical protein [Prevotella melaninogenica D18]
gi|288335434|gb|EFC73868.1| conserved hypothetical protein [Prevotella melaninogenica D18]
Length = 486
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 46/94 (48%), Gaps = 7/94 (7%)
Query: 257 SSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFD----GNGKILDCIG--SCPGFED 310
+++N +P ++ + G +L+++DS +H I + D N ++ IG GF+D
Sbjct: 389 GALKNAKFCYPRQMTFTKDG-KLYIADSGNHCIRMIDTTLGKNARVTTPIGVPQSAGFQD 447
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIR 344
G E +K P D +Y+ DS+N IR
Sbjct: 448 GGVELAKFNWPTGVAVSADGSTVYVADSKNQVIR 481
>gi|328719579|ref|XP_001944980.2| PREDICTED: hypothetical protein LOC100167755 [Acyrthosiphon pisum]
Length = 1280
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 48/114 (42%), Gaps = 29/114 (25%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCL--- 333
N + ++DS++HRI VFD +GKIL GS G +GEF DCL
Sbjct: 1030 NLIVVADSSNHRIQVFDNSGKILKDFGSY-GNSEGEF-----------------DCLAGV 1071
Query: 334 -------YIV-DSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
YI+ D NH I+ D R L + + N W + LGF
Sbjct: 1072 AVNRIGQYIIADRYNHRIQVLDPSGRFLRSFGSQGSSDGRFNYPWGITTDALGF 1125
Score = 40.4 bits (93), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 28/38 (73%), Gaps = 1/38 (2%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFE 314
NR+ +SD N+HRI +FD NG+++ G+ G E+G+F+
Sbjct: 1171 NRVIVSDCNNHRIQIFDVNGRVISSFGT-EGSENGQFK 1207
>gi|291232537|ref|XP_002736217.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 361
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 56/122 (45%), Gaps = 24/122 (19%)
Query: 246 QAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC 305
Q L PH + P +S D NRLF++D + R+IVF+ + K I
Sbjct: 223 QTPGLPAPH--------VFKVPHSLSLDADHNRLFVADRENGRVIVFNSDSKTF--IDQY 272
Query: 306 PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSEN--HAIRRADMGRR---VLETVYPTS 360
GF GE R A+ Y+K +D LY+V+ ++ H ++ R ++ET P
Sbjct: 273 TGF--GE-------RVFAADYNKINDTLYVVNGDDGYHVVQGYTFSVRTKQIIETWKPEQ 323
Query: 361 GI 362
G+
Sbjct: 324 GV 325
>gi|406886261|gb|EKD33314.1| hypothetical protein ACD_76C00046G0005 [uncultured bacterium]
Length = 1682
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 48/93 (51%), Gaps = 8/93 (8%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSK-------LM 319
P ++ D++GNRLF++D ++R++VFD + I+D + +F+SS +
Sbjct: 1066 PIDLAFDDAGNRLFVTDQGNNRVLVFDVD-VIVDGEDAVAVLGQPDFDSSDPATTDIGMN 1124
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRV 352
P Y D + LY+ + +NH + D+ V
Sbjct: 1125 APQGLAYDSDANLLYVSEVDNHRVTVFDVAEIV 1157
>gi|406901359|gb|EKD44040.1| NHL repeat containing protein, partial [uncultured bacterium]
Length = 667
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 45/88 (51%), Gaps = 5/88 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRI-IVFDGNGKILDCIG--SCPGFEDGEFESSKLMR 320
+ P ++ D + N L+++D ++H I V +GK+ G S PG+ DG E + L
Sbjct: 574 FNVPIALTIDSADNYLYVADRDNHAIRKVRISDGKVTTVTGNPSTPGYLDGRLEDAYLNY 633
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRADM 348
P +Y++ + +Y +S +R DM
Sbjct: 634 PVEVYYNRGN--IYFSESGTQRVRVVDM 659
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 44/96 (45%), Gaps = 6/96 (6%)
Query: 267 PGCISADESGNRLFLSDSNH---HRIIVFDGNGKILDCIGSCP---GFEDGEFESSKLMR 320
PG I D SG +++D+ + R+ + I P GF+DG +++
Sbjct: 517 PGGIVVDGSGKYAYVADTGNEVIRRVTLTGSKQGQTKLIAGSPTESGFKDGTKTAAEFNV 576
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P A D+ LY+ D +NHAIR+ + + TV
Sbjct: 577 PIALTIDSADNYLYVADRDNHAIRKVRISDGKVTTV 612
>gi|114566902|ref|YP_754056.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
gi|114337837|gb|ABI68685.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
Length = 1030
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 51/100 (51%), Gaps = 3/100 (3%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKI--LDCIGSCPGFEDGEFESSKLMRP 321
L+ P I+ D SGN ++++DSN++ I D +G I G+ F DG +S +R
Sbjct: 744 LNNPNGITFDSSGN-MYIADSNNNCIRKVDHSGMISTFAGNGTSGHFGDGGPATSAQLRN 802
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSG 361
+ L+I D +H+IR+ + + T+ PT+G
Sbjct: 803 PVGVALDNSGNLFIADYFDHSIRKVVLAAQQNSTISPTTG 842
>gi|325110177|ref|YP_004271245.1| hypothetical protein Plabr_3626 [Planctomyces brasiliensis DSM
5305]
gi|324970445|gb|ADY61223.1| NHL repeat containing protein [Planctomyces brasiliensis DSM 5305]
Length = 370
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 39/84 (46%), Gaps = 5/84 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCI---GSCPGFEDGEFESSKLMR 320
L P IS D+ GN +FL+D+ H I DG I+ + G DG KL R
Sbjct: 283 LAGPKGISVDKQGN-IFLADTESHTIRRIDGQTGIITTLVGNGKKGDGPDGNPLECKLNR 341
Query: 321 PAASFYHKDDDCLYIVDSENHAIR 344
P F D +YI DS NH +R
Sbjct: 342 PHGVFVAADGK-VYIGDSSNHRVR 364
>gi|395646873|ref|ZP_10434733.1| PKD domain containing protein [Methanofollis liminatans DSM 4140]
gi|395443613|gb|EJG08370.1| PKD domain containing protein [Methanofollis liminatans DSM 4140]
Length = 2489
Score = 42.0 bits (97), Expect = 1.4, Method: Composition-based stats.
Identities = 29/83 (34%), Positives = 43/83 (51%), Gaps = 12/83 (14%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFES--SKLMRP 321
+ P I+ D GN +F++DS +HRI +FD NG L GS G +GEF L+
Sbjct: 1214 FNVPRGIAVDSRGN-VFVADSLNHRIQIFDTNGTFLTEFGSY-GTGEGEFNEPWDTLVDN 1271
Query: 322 AASFYHKDDDCLYIVDSENHAIR 344
A + +Y+ D+ NH I+
Sbjct: 1272 AGN--------VYVTDARNHRIQ 1286
>gi|290976675|ref|XP_002671065.1| predicted protein [Naegleria gruberi]
gi|284084630|gb|EFC38321.1| predicted protein [Naegleria gruberi]
Length = 343
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 76/183 (41%), Gaps = 32/183 (17%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVF-----DGNGKILDCI--GSCPGFEDGEF 313
N L+ P I GN L+++D ++H I D NG I + GS E+G
Sbjct: 122 NSFLNSPRGICLSNDGNYLYIADRDNHAIRKVSLNDDDDNGLIETIVGNGSSGESENGSL 181
Query: 314 E-SSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLW-- 370
+S++ P KDD+ LY + NH IR+ D KN LW
Sbjct: 182 ALNSQINCPYGIAISKDDE-LYFSEFSNHTIRKID-----------------KNGKLWKV 223
Query: 371 AWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIK 430
A I K G N+ L +P ++ ++NL I +R ++ +DL +G I
Sbjct: 224 AGIDGKNG----NNEYDHHAFNNLLNYPAYICFDNNNNLYIADRDNNSIRKIDLKTGLIN 279
Query: 431 EAV 433
AV
Sbjct: 280 TAV 282
>gi|395768739|ref|ZP_10449254.1| hypothetical protein Saci8_03116 [Streptomyces acidiscabies 84-104]
Length = 596
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 56/125 (44%), Gaps = 15/125 (12%)
Query: 253 PHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGK-ILDCIGSCP-GFED 310
P+ L FPG SGN +SD+ H+++ +G+ ++ IG+ GF D
Sbjct: 159 PYVAPEPEPTALRFPGKALLLPSGN-FLVSDTTRHQLVELAQDGESVVRRIGTGTRGFAD 217
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVY----------PTS 360
G E++ DD + + D+ NHA+RR D+ + T+ PTS
Sbjct: 218 GTAEAA--FSEPQGLALLDDGAVVVADTVNHALRRVDLATGDVTTLAGTGRQWWQGSPTS 275
Query: 361 GISKK 365
G +++
Sbjct: 276 GPARE 280
>gi|153864226|ref|ZP_01997193.1| NHL repeat protein [Beggiatoa sp. SS]
gi|152146280|gb|EDN72808.1| NHL repeat protein [Beggiatoa sp. SS]
Length = 313
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 4/81 (4%)
Query: 279 LFLSDSNHHRIIVFDGNGKILDCIGSCP---GFEDGEFESSKLMRPAASFYHKDDDCLYI 335
++++D +HRI D +G I +G+ G + GE ++L P+ D LYI
Sbjct: 81 VYITDKGNHRIRKVDTSGIITTIVGNGVASFGGDGGEAPGAQLNSPS-DIAIDDSGSLYI 139
Query: 336 VDSENHAIRRADMGRRVLETV 356
+D+ NH IR+ + ++ T+
Sbjct: 140 IDTNNHRIRKVEKATNIITTL 160
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 46/84 (54%), Gaps = 5/84 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCI-GSCPGF--EDGEFESSKLMR 320
L+ P I+ D+SG+ L++ D+N+HRI + I+ + G GF +DG+ S++
Sbjct: 123 LNSPSDIAIDDSGS-LYIIDTNNHRIRKVEKATNIITTLAGGVSGFSGDDGQALSAQFNT 181
Query: 321 PAASFYHKDDDCLYIVDSENHAIR 344
P D+ L I DSENH +R
Sbjct: 182 PDG-IAISDEGHLLIADSENHRVR 204
>gi|403382146|ref|ZP_10924203.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
JC66]
Length = 531
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 3/82 (3%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIG-SCPGFEDGEFESSKLMRPAASF 325
P ++ D +GN LF++D +H + D G + G G +DG E S P
Sbjct: 127 PAGLALDHNGN-LFVADKGNHAVRKVDAEGNVTTYAGQGVLGHKDGTAEESLFYAPE-DV 184
Query: 326 YHKDDDCLYIVDSENHAIRRAD 347
D +Y+ D+ NH IR+ D
Sbjct: 185 VVASDGTVYVADTLNHVIRKID 206
>gi|156370888|ref|XP_001628499.1| predicted protein [Nematostella vectensis]
gi|156215477|gb|EDO36436.1| predicted protein [Nematostella vectensis]
Length = 688
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 40/71 (56%), Gaps = 6/71 (8%)
Query: 278 RLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVD 337
++ +SDS H + VF +G ++ G G +DG+F M P A + K + CLY+ D
Sbjct: 437 QIVVSDSLKHHLQVFTPDGNLMFDFGG-EGSDDGKF-----MHPMAIAFDKSEKCLYVAD 490
Query: 338 SENHAIRRADM 348
S+N+ I+ D+
Sbjct: 491 SDNNRIQVVDV 501
>gi|428175994|gb|EKX44881.1| hypothetical protein GUITHDRAFT_60229, partial [Guillardia theta
CCMP2712]
Length = 181
Score = 41.6 bits (96), Expect = 1.7, Method: Composition-based stats.
Identities = 23/96 (23%), Positives = 45/96 (46%), Gaps = 6/96 (6%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP------GFEDGEFESSKLMR 320
P I+ + G+ L ++D+N+H I D ++ I CP G +DG ++
Sbjct: 22 PLDIAFSKDGSWLAVADTNNHAIRRIDAMTGVVTTIAGCPRSSSCLGSQDGPASAASFNS 81
Query: 321 PAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P + D +++ D+ N+ IR+ D+ + T+
Sbjct: 82 PTSIALDPQDKFIFVADTSNNMIRQIDLTAMTVTTI 117
>gi|301064161|ref|ZP_07204608.1| RHS repeat-associated core domain protein [delta proteobacterium
NaphS2]
gi|300441781|gb|EFK06099.1| RHS repeat-associated core domain protein [delta proteobacterium
NaphS2]
Length = 2050
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 40/82 (48%), Gaps = 3/82 (3%)
Query: 270 ISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFE-DGEFESSKLMRPAASFYH 327
I+ D +GN L++ D ++HR+ D NG I GS GF DG + A
Sbjct: 947 IAIDSAGN-LYILDGSNHRVRKVDTNGIITTVAGSDDYGFSGDGGPAIEASLGYAVGIAI 1005
Query: 328 KDDDCLYIVDSENHAIRRADMG 349
+ LYI DS NH IRR D G
Sbjct: 1006 DSAENLYISDSSNHCIRRVDTG 1027
>gi|159113256|ref|XP_001706855.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia ATCC
50803]
gi|157434955|gb|EDO79181.1| Haloacid dehalogenase-like hydrolase family [Giardia lamblia ATCC
50803]
Length = 969
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/111 (24%), Positives = 47/111 (42%), Gaps = 14/111 (12%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFE--------DGEFESSKLMRPAASFYHK 328
N + L+D+ H+R++ +G G C F DG F + P +
Sbjct: 97 NGVLLADAGHNRLVFINGE------TGHCEHFAGTGEAAHLDGGFSQAAFNFPHSIVSDP 150
Query: 329 DDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
+ +Y+ D++NH IR + R ++T+ T G+ + L E LG
Sbjct: 151 VNGIIYLTDAKNHCIRTLSLMTRKVKTLAGTPGVFGYKDGLNPLFNEPLGL 201
>gi|326436774|gb|EGD82344.1| NHL repeat protein [Salpingoeca sp. ATCC 50818]
Length = 1484
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 24/162 (14%)
Query: 203 GNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNL 262
GN + E + + +++ A ++LI+ + G+ T N
Sbjct: 1236 GNVYITEEGNHRVSIVDVASKKLIVGAGTGNPGHRGMGGTPG----------------NA 1279
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGF-----EDGEFESSK 317
+H P ++ + G+ LF SD +H + + + I+ + P F +DG ++
Sbjct: 1280 AIHSPVGLAWADDGSLLF-SDEENHVVYMVNRRSLIISVVAGTPRFAGYSGDDGLSIGAR 1338
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
L RP K LYI DS NH IR DM +V+ TV T
Sbjct: 1339 LNRPRGIALSKG--VLYIADSGNHRIRAVDMRTQVITTVAGT 1378
>gi|418737091|ref|ZP_13293489.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
str. 200801910]
gi|410747250|gb|EKR00156.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
str. 200801910]
Length = 358
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 25/97 (25%), Positives = 46/97 (47%), Gaps = 2/97 (2%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCI-GSCPGFEDGEFESSKLM 319
N L P + D N L++ + +H I + N + + + G G+ DG+ S++
Sbjct: 174 NSLFKGPLFMDLDRERN-LYVGELGNHTIRKINLNSETVSTLSGGVSGYLDGDLTSAQFK 232
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P+ Y + D L + D +NH IR+ D+ + T+
Sbjct: 233 FPSGIIYDQKTDSLLVADLQNHRIRKIDLKTSTVSTL 269
>gi|290989086|ref|XP_002677176.1| predicted protein [Naegleria gruberi]
gi|284090782|gb|EFC44432.1| predicted protein [Naegleria gruberi]
Length = 2103
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 54/125 (43%), Gaps = 10/125 (8%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGS-CPGF--EDGEFESSKLM 319
++++P + + L +SD N+HRI NG I G+ GF + G ++ L
Sbjct: 496 IMNYPSHCVINSTNGELIISDMNNHRIRKVSNNGIITTIAGNGTAGFCGDGGLAVNTCLN 555
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVL------ETVYPTSGISKKNNSLWAWI 373
RP + LYI D NH IR+ + T+Y GI N SL++ +
Sbjct: 556 RPNGIAISSSGE-LYIADYGNHRIRKVSNNGIITTIAGNGNTIYNGDGIDAANASLYSPV 614
Query: 374 MEKLG 378
+G
Sbjct: 615 DVSIG 619
>gi|195030130|ref|XP_001987921.1| GH10881 [Drosophila grimshawi]
gi|193903921|gb|EDW02788.1| GH10881 [Drosophila grimshawi]
Length = 728
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 50/97 (51%), Gaps = 4/97 (4%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP-GFEDGEFESSKLMRPA 322
L FP IS +R ++D+ ++R++V G + IG G DG+ +++ P
Sbjct: 224 LRFPAKIS--RFSDRYAIADAGNNRVLVVTIAGIVEHKIGGLDNGLVDGDLLTARFNNPQ 281
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
+ D+ L + D++NHA+R+ + ++ET+ T
Sbjct: 282 GVAF-LDEHTLIVADTDNHALRQISLKNGIVETLAGT 317
>gi|156386715|ref|XP_001634057.1| predicted protein [Nematostella vectensis]
gi|156221135|gb|EDO41994.1| predicted protein [Nematostella vectensis]
Length = 549
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 69/142 (48%), Gaps = 16/142 (11%)
Query: 204 NARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVL-KEPHACSSVRNL 262
NA + S G LNK + +++ +S+S + + W+K +V+ K H S N
Sbjct: 239 NATIGGLESNTFGSLNKKIGQVV-----SSASSADILPDWSKAGKVIHKIGHKGSRKGNF 293
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPA 322
PG ++++E G + +SD + RI VFD GK L G G +DG F+ P
Sbjct: 294 --KSPGGVASNEFG-EIAVSDFFNDRIQVFDSKGKFLFQFGK-KGTKDGLFQG-----PT 344
Query: 323 ASFYHKDDDCLYIVDSENHAIR 344
Y + + + ++DS NH I+
Sbjct: 345 GVAYTVNSE-IMVLDSRNHRIQ 365
>gi|290995104|ref|XP_002680171.1| predicted protein [Naegleria gruberi]
gi|284093791|gb|EFC47427.1| predicted protein [Naegleria gruberi]
Length = 928
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 4/74 (5%)
Query: 275 SGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPG---FEDGEFESSKLMRPAASFYHKDDD 331
S N ++++D ++HRI NGKI+ G+ G ++G +++L P + F +++
Sbjct: 76 SNNEVYIADFSNHRIRKILENGKIVTIAGNGTGGFSGDNGPATNAQLNNPYSVFVSSNNE 135
Query: 332 CLYIVDSENHAIRR 345
+YIVD NH IR+
Sbjct: 136 -VYIVDYNNHRIRK 148
>gi|348681654|gb|EGZ21470.1| hypothetical protein PHYSODRAFT_329412 [Phytophthora sojae]
Length = 251
Score = 41.2 bits (95), Expect = 2.0, Method: Composition-based stats.
Identities = 46/167 (27%), Positives = 77/167 (46%), Gaps = 12/167 (7%)
Query: 192 NGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLKCTWAKQAEVLK 251
NG YL S N V S DI + + L++Q SS + + + + +V+K
Sbjct: 41 NGDFYLGS----NEAVMKITSTDIITIASLADSLVLQGITVDSSEN-VYVSRLTEFDVVK 95
Query: 252 -EPHACSSV--RNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGF 308
P S+ + + I+ D S N ++++D HRI+ F G++ G GF
Sbjct: 96 IAPDGSVSIFANSSMFTLAMGIAIDSSDN-IYITD--QHRILKFTLAGEMGVVAGRVQGF 152
Query: 309 EDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLET 355
+ E ++ P A D + LY+VDS+N+ IR+ D+ R + T
Sbjct: 153 LNALGELARFSTPWALTIGSDGN-LYVVDSDNNCIRKVDLTTREVTT 198
>gi|39995724|ref|NP_951675.1| NHL repeat domain-containing protein [Geobacter sulfurreducens PCA]
gi|39982488|gb|AAR33948.1| NHL repeat domain lipoprotein [Geobacter sulfurreducens PCA]
Length = 361
Score = 41.2 bits (95), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 270 ISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEF 313
I+ DE G R+ + D+ HRI + D G +L G G EDGEF
Sbjct: 173 IAVDEKGQRILVVDAREHRIAILDMQGGLLSAFGKR-GIEDGEF 215
>gi|195387367|ref|XP_002052367.1| GJ17513 [Drosophila virilis]
gi|194148824|gb|EDW64522.1| GJ17513 [Drosophila virilis]
Length = 736
Score = 41.2 bits (95), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 39/174 (22%), Positives = 77/174 (44%), Gaps = 35/174 (20%)
Query: 275 SGNRLFLSDSNHHRIIVFDG-NGKILDCIGS--------CPGFEDGEFESSKLMRPAASF 325
S + LF++DS I +GK++ +G G EDG+ ++KL P
Sbjct: 428 SNDYLFIADSESSSIRKASMVDGKVMPVVGGDRNPLNLFAFGDEDGKLYNAKLQHPLGVA 487
Query: 326 YHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDT 385
+++ D LY+ D+ NH I+ D ++ T+ I K++S+ A+
Sbjct: 488 FNQIDRKLYVADTYNHKIKVIDTDTNIISTLV----IKNKDDSILAF------------- 530
Query: 386 KSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKEAVKGFSKV 439
+P + + +E + LL+ + + T+ I+DLA+ + + F ++
Sbjct: 531 ----REPSGIC-----VDAEGNRLLVADTNNHTIHIIDLATHTAQPFLLDFKQI 575
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 25/97 (25%), Positives = 49/97 (50%), Gaps = 4/97 (4%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
L FP I+ + ++D+ ++R++V +G + +G GF DG +++ P
Sbjct: 224 LRFPAKIAM--FNDYYAIADAGNNRVLVVTTSGVVEYKVGGLEAGFVDGNLNTARFNNPQ 281
Query: 323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
+ D D L + D++NHA+R+ + +ET+ T
Sbjct: 282 GVAF-LDKDTLIVADTDNHALRQISLKSGAVETLAGT 317
>gi|290999745|ref|XP_002682440.1| predicted protein [Naegleria gruberi]
gi|284096067|gb|EFC49696.1| predicted protein [Naegleria gruberi]
Length = 731
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 50/85 (58%), Gaps = 5/85 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGF-EDG-EFESSKLMR 320
L++P +++D GN +++SDS +HR+ + NG I G+ GF +DG + SS+L
Sbjct: 425 LNYPHGVASDSIGN-IYISDSYNHRVRIVFTNGTISTIAGNGNSGFNKDGIQATSSQLNY 483
Query: 321 PAASFYHKDDDCLYIVDSENHAIRR 345
P + +D+ L+I D NH +R+
Sbjct: 484 PFGIALNGNDE-LFISDRSNHRVRK 507
>gi|297568314|ref|YP_003689658.1| hypothetical protein [Desulfurivibrio alkaliphilus AHT2]
gi|296924229|gb|ADH85039.1| YD repeat protein [Desulfurivibrio alkaliphilus AHT2]
Length = 2439
Score = 41.2 bits (95), Expect = 2.4, Method: Composition-based stats.
Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 16/104 (15%)
Query: 254 HACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGF----- 308
H+ N L+ P C+S D G ++++D+ +HRI +G I GS GF
Sbjct: 1080 HSGGLAVNARLYRPSCVSIDPDGT-IYIADTGNHRIRSVGTDGIITTHAGS--GFRAGGL 1136
Query: 309 -------EDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRR 345
+DG +++L P D +YI DS NH IRR
Sbjct: 1137 GDSGYSGDDGPAVNARLNSPT-DIALGPDGSIYIADSINHRIRR 1179
>gi|349603355|gb|AEP99217.1| NHL repeat-containing protein 2-like protein, partial [Equus
caballus]
Length = 456
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 4/101 (3%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKI---LDCIGSCPGFEDGEFESSK 317
N L P ++ D+ N ++++DS +H+I V D K L G F S
Sbjct: 201 NAKLQHPLGVTWDKKRNLVYVADSYNHKIKVVDPKTKTCTTLAGTGDASNVISSSFTQST 260
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYP 358
P ++ LY+ D+ NH I+ D+ +++ +V P
Sbjct: 261 FNEPGGLCIGENGQLLYVADTNNHQIKVMDLETKMI-SVLP 300
>gi|116622401|ref|YP_824557.1| Ig domain-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116225563|gb|ABJ84272.1| Ig domain protein, group 1 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 2770
Score = 40.8 bits (94), Expect = 2.5, Method: Composition-based stats.
Identities = 31/99 (31%), Positives = 49/99 (49%), Gaps = 4/99 (4%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDG-NGKILDCIGSCPGFEDGEFESSKLMRPA 322
L+ P I+ D +GN L+++D+ H RI DG G I G+ G+ + + L +
Sbjct: 108 LNHPSGIAFDTAGN-LYIADTGHDRIRRVDGVTGIITTVAGTGERGYSGDGQPATLAKIN 166
Query: 323 ASFYHKDDD--CLYIVDSENHAIRRADMGRRVLETVYPT 359
+ ++ D L+I D NH +RR D V+ TV T
Sbjct: 167 SPYHIALDGHGNLFIADDGNHRVRRVDGNSGVITTVAGT 205
>gi|291223251|ref|XP_002731624.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 434
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 1/81 (1%)
Query: 277 NRLFLSDSNHHRII-VFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYI 335
+R F++D+++H I V +G++ G GF+DG+ + PA + L+I
Sbjct: 195 DRFFVADTDNHAIREVSLPDGEVTTIAGGEKGFKDGKGTGATFYHPAGVTIDPIRNILFI 254
Query: 336 VDSENHAIRRADMGRRVLETV 356
D NHAIR + +++ T+
Sbjct: 255 ADHYNHAIRMIGVESKIVTTL 275
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 19/82 (23%), Positives = 40/82 (48%)
Query: 263 LLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPA 322
+ ++P ++ D L++ + +++ + + D G++ +G G DG E +K P
Sbjct: 292 MFNYPEGMAYDTENKVLYVVEFDNNCVRIVDDEGEVRSFVGGREGKSDGLGEEAKFFHPT 351
Query: 323 ASFYHKDDDCLYIVDSENHAIR 344
+ + + +YI D NH IR
Sbjct: 352 GLTFDEKEKIIYITDQYNHQIR 373
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 28/116 (24%), Positives = 49/116 (42%), Gaps = 6/116 (5%)
Query: 234 SSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFD 293
S P G T A + K+ + + P ++ D N LF++D +H I +
Sbjct: 211 SLPDGEVTTIAGGEKGFKDGKGTGAT----FYHPAGVTIDPIRNILFIADHYNHAIRMIG 266
Query: 294 GNGKILDCIGSC--PGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRAD 347
KI+ + PGF +G + P Y ++ LY+V+ +N+ +R D
Sbjct: 267 VESKIVTTLAGSGKPGFVNGMGNQAMFNYPEGMAYDTENKVLYVVEFDNNCVRIVD 322
>gi|290982010|ref|XP_002673724.1| NHL repeat-containing protein [Naegleria gruberi]
gi|284087309|gb|EFC40980.1| NHL repeat-containing protein [Naegleria gruberi]
Length = 456
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 49/97 (50%), Gaps = 6/97 (6%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDG-NGKILDCIGSCPGFEDGE---FESSKLM 319
L FP I+ D SGN ++++D+ +HRI F GK+ G+ +G+ ++KL
Sbjct: 138 LLFPTSIAVDSSGN-IYIADTQNHRIRKFTALTGKLSTIAGTEEEGYNGDNIIATTAKLS 196
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P + +YI DS+NH IR+ + + T+
Sbjct: 197 SPTGVALDSIGN-VYIADSQNHRIRKITISNGKISTI 232
>gi|380693812|ref|ZP_09858671.1| hypothetical protein BfaeM_07498 [Bacteroides faecis MAJ27]
Length = 456
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 45/105 (42%), Gaps = 5/105 (4%)
Query: 243 WAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNH--HRIIVFDGNGKILD 300
WA Q K + + + + PG + D GN + NH +I + G L
Sbjct: 351 WAGQEG--KSGYLDGPIGQAMFNKPGQMCVDSEGNIILTDTENHCIRKITMSTGYVSTLA 408
Query: 301 CIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRR 345
G+ +G E ++ +P +DD +YI DSEN AIRR
Sbjct: 409 GKPQNSGYVNGSAEDAQFKKPLG-ICIDNDDVMYIGDSENRAIRR 452
>gi|290996598|ref|XP_002680869.1| predicted protein [Naegleria gruberi]
gi|284094491|gb|EFC48125.1| predicted protein [Naegleria gruberi]
Length = 1407
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 6/77 (7%)
Query: 273 DESGNRLFLSDSNHHRII-VFDGNGKILDCIGSCPGFEDG---EFESSKLMRPAASFYHK 328
D S LF+SD+++HRI+ VF NG +L G+ +G + +++L P F
Sbjct: 62 DLSSGDLFISDNSNHRIVRVFAANGTVLTIAGTGVSGYNGDNIQATTAQLNTPTGVFVFN 121
Query: 329 DDDCLYIVDSENHAIRR 345
+ +YI DS+N +R+
Sbjct: 122 SE--IYIADSQNSRVRK 136
>gi|296132542|ref|YP_003639789.1| NHL repeat containing protein [Thermincola potens JR]
gi|296031120|gb|ADG81888.1| NHL repeat containing protein [Thermincola potens JR]
Length = 332
Score = 40.8 bits (94), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 7/88 (7%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASF 325
FP ++ D SGN +++SD + HRI+VFD +G+ + G G GE +
Sbjct: 166 FPVTVAVDRSGN-VYVSDVSQHRIVVFDDSGRFVRYFGRA-GSNPGELSYVNGIA----- 218
Query: 326 YHKDDDCLYIVDSENHAIRRADMGRRVL 353
++D +Y+ +S N I D+ R L
Sbjct: 219 VNEDAGEIYLANSNNGRIDTFDLNGRYL 246
>gi|291239557|ref|XP_002739689.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 744
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 45/90 (50%), Gaps = 8/90 (8%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P IS GN + ++D+++HR+ VFDG+G+ L G C D +L P
Sbjct: 624 PWGISIGVDGN-VIVADNSNHRMQVFDGDGEYLYSFGDCGNGSD------RLQNPLGVST 676
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETV 356
+D + +Y+ D NH I++ D + L +
Sbjct: 677 DRDGN-VYVCDHNNHRIQKYDSHGKYLTRI 705
>gi|291222861|ref|XP_002731433.1| PREDICTED: tripartite motif protein 2-like [Saccoglossus
kowalevskii]
Length = 693
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 9/98 (9%)
Query: 266 FPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASF 325
+P I+ D GN + ++D N+HRI V G G+ L GSC G DG+ + S
Sbjct: 572 YPWGITTDNKGN-VIVADQNNHRIQVMTGEGEFLFKFGSC-GNSDGQLQY------PGSV 623
Query: 326 YHKDDDCLYIVDSENHAIRRAD-MGRRVLETVYPTSGI 362
D +Y+ D N+ +++ D G+ V P G+
Sbjct: 624 ATDTDGYVYVSDCVNNRVQKYDSHGQFVCRIDSPGDGL 661
>gi|290977429|ref|XP_002671440.1| predicted protein [Naegleria gruberi]
gi|284085009|gb|EFC38696.1| predicted protein [Naegleria gruberi]
Length = 1494
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 55/116 (47%), Gaps = 3/116 (2%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAA 323
L+ P ++ +G +++SDS++ I D NG I +GS GF D ++L P
Sbjct: 496 LNGPTGVAISPNG-EVYISDSSNDVIRKIDTNGAISTVVGSSSGFLDASARRAQLANPMG 554
Query: 324 SFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWAWIMEKLGF 379
+ + D L I D++N+ IR+ + + T+ T G++ N + + +
Sbjct: 555 IAFLPNGD-LIISDAKNNRIRKFSISSGNVTTIAGT-GLTTYNGEGLVGVATNINY 608
>gi|326432343|gb|EGD77913.1| NHL repeat protein [Salpingoeca sp. ATCC 50818]
Length = 1505
Score = 40.4 bits (93), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 51/101 (50%), Gaps = 8/101 (7%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP---GFE-DGEFE-SSKL 318
+H P ++ E GN LF SD +H + + + + I+ + P G E DG+ ++L
Sbjct: 1301 IHSPQAMAWTEEGNLLF-SDEENHVVYMVNPHTTIISVVAGTPRVAGDEGDGQLAIGARL 1359
Query: 319 MRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
PA + D LYI DS NH +R D+ +V+ TV T
Sbjct: 1360 NMPAGIAVY--DHMLYIADSGNHRVRAVDLYTQVITTVAGT 1398
>gi|398337816|ref|ZP_10522521.1| hypothetical protein LkmesMB_21054 [Leptospira kmetyi serovar
Malaysia str. Bejo-Iso9]
Length = 359
Score = 40.4 bits (93), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 28/108 (25%), Positives = 50/108 (46%), Gaps = 2/108 (1%)
Query: 278 RLFLSDSNHHRIIVFDGNGKILDCI-GSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIV 336
L+L + +H I F+ N + + + GS G++DG ++ P Y + + L +V
Sbjct: 191 NLYLGELGNHDIRKFNLNFQSVSTLSGSSLGYQDGSVSTALFKSPIGLVYDRKKNSLLVV 250
Query: 337 DSENHAIRRADMGRRVLETVYPT-SGISKKNNSLWAWIMEKLGFERDN 383
D ++H IR+ ++ + T+ T +G S N L A DN
Sbjct: 251 DIQDHRIRKINLDTMQVSTLLGTGTGASVDGNGLNASFFGPAHIALDN 298
>gi|260803617|ref|XP_002596686.1| hypothetical protein BRAFLDRAFT_78416 [Branchiostoma floridae]
gi|229281945|gb|EEN52698.1| hypothetical protein BRAFLDRAFT_78416 [Branchiostoma floridae]
Length = 498
Score = 40.4 bits (93), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 45/92 (48%), Gaps = 5/92 (5%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMR--P 321
L +P I+ DES +R +++D H++ VFD GK+ G G +GE E +R P
Sbjct: 376 LQYPWFIAVDESRDRFYVTDCRAHKVFVFDLEGKLKFSFGK-QGQNEGELECPTGIRVDP 434
Query: 322 AASF--YHKDDDCLYIVDSENHAIRRADMGRR 351
A + ++D+ L + D + R +R
Sbjct: 435 AGNIIVINRDNGHLQVFDPDGMYQRTVATVKR 466
>gi|290997788|ref|XP_002681463.1| predicted protein [Naegleria gruberi]
gi|284095087|gb|EFC48719.1| predicted protein [Naegleria gruberi]
Length = 2297
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 47/85 (55%), Gaps = 4/85 (4%)
Query: 275 SGNRLFLSDSNHHRI-IVFDGNGKILDCIGSCPGF--EDGEFESSKLMRPAASFYHKDDD 331
S L+++D +HRI V +G K + GS G+ ++G S+KL +P A ++ D
Sbjct: 79 SNGDLYIADKTYHRIRKVSNGIIKTIAGTGSSGGYFGDNGVATSAKLNKPQALAFYSGDG 138
Query: 332 CLYIVDSENHAIRRADMGRRVLETV 356
LY+ DS N+ IR+ +++ TV
Sbjct: 139 TLYLGDSLNYRIRKIST-NQIISTV 162
>gi|326435583|gb|EGD81153.1| hypothetical protein PTSG_13151 [Salpingoeca sp. ATCC 50818]
Length = 1551
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 51/101 (50%), Gaps = 8/101 (7%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCP---GFE-DGEFE-SSKL 318
+H P ++ E GN LF SD +H + + + + I+ + P G E DG+ ++L
Sbjct: 1347 IHSPQAMAWTEEGNLLF-SDEENHAVYMVNPHTTIISVVAGTPRVAGDEGDGQLAIGARL 1405
Query: 319 MRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPT 359
PA + D LYI DS NH +R D+ +V+ TV T
Sbjct: 1406 NMPAGIAVY--DHMLYIADSGNHRVRAVDLYTQVITTVAGT 1444
>gi|29349488|ref|NP_812991.1| hypothetical protein BT_4080 [Bacteroides thetaiotaomicron
VPI-5482]
gi|298384984|ref|ZP_06994543.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|383120427|ref|ZP_09941155.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
gi|29341397|gb|AAO79185.1| putative cell surface protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251840523|gb|EES68605.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
gi|298262128|gb|EFI04993.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 456
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 45/105 (42%), Gaps = 5/105 (4%)
Query: 243 WAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNH--HRIIVFDGNGKILD 300
WA Q K + + + + PG + D GN + NH +I + G L
Sbjct: 351 WAGQEG--KSGYLDGPIGQAMFNKPGQMCVDSEGNIILTDTENHCIRKITMSTGYVSTLA 408
Query: 301 CIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRR 345
G+ +G E ++ +P +DD +YI DSEN AIRR
Sbjct: 409 GKPQNSGYVNGSAEDAQFKKPLG-ICIDNDDVMYIGDSENRAIRR 452
>gi|390440891|ref|ZP_10229086.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
gi|389835800|emb|CCI33212.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
Length = 340
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 29/44 (65%), Gaps = 1/44 (2%)
Query: 272 ADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFES 315
A SG +++++D+ ++RI VFD NG L GS PG +GEF S
Sbjct: 68 AVGSGGKIYVADTANNRIQVFDSNGGFLSTFGS-PGTGNGEFSS 110
>gi|402587633|gb|EJW81568.1| RING finger protein nhl-1 [Wuchereria bancrofti]
Length = 794
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 17/95 (17%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAA 323
H P I+ ++ + ++++DS++HR+ +FD NG + GS GF++G+ K R A
Sbjct: 673 FHHPQFIAVNKRTHNIYVTDSSNHRVCIFDHNGNSIFQFGS-EGFQNGQM---KFPRGIA 728
Query: 324 SFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYP 358
DD IV AD G ++ YP
Sbjct: 729 V----DDQGFIIV---------ADSGNNRVQIFYP 750
>gi|170573447|ref|XP_001892474.1| Hypothetical RING finger protein F54G8.4 in chromosome III,
putative [Brugia malayi]
gi|158601961|gb|EDP38697.1| Hypothetical RING finger protein F54G8.4 in chromosome III,
putative [Brugia malayi]
Length = 1024
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 17/95 (17%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAA 323
H P I+ ++ + ++++DS++HR+ +FD NG + GS GF++G+ K R A
Sbjct: 903 FHHPQFIAVNKRTHNIYVTDSSNHRVCIFDHNGNPIFQFGS-EGFQNGQM---KFPRGIA 958
Query: 324 SFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYP 358
DD IV AD G ++ YP
Sbjct: 959 V----DDQGFIIV---------ADSGNNRVQIFYP 980
>gi|398331208|ref|ZP_10515913.1| hypothetical protein LalesM3_04029 [Leptospira alexanderi serovar
Manhao 3 str. L 60]
Length = 357
Score = 40.0 bits (92), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 26/96 (27%), Positives = 43/96 (44%), Gaps = 2/96 (2%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGN-GKILDCIGSCPGFEDGEFESSKLM 319
N L P + D N L++ + +H I + N G + G G+ DG+ S++
Sbjct: 173 NSLFKGPFFMDLDRERN-LYVGELGNHAIRKINLNSGTVSTLSGGVSGYLDGDLASAQFK 231
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLET 355
P Y + D L + D +NH IR+ D+ + T
Sbjct: 232 SPLGITYDQKTDSLLVADLQNHRIRKIDLKTSTVST 267
>gi|336399828|ref|ZP_08580628.1| NHL repeat containing protein [Prevotella multisaccharivorax DSM
17128]
gi|336069564|gb|EGN58198.1| NHL repeat containing protein [Prevotella multisaccharivorax DSM
17128]
Length = 494
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 7/94 (7%)
Query: 257 SSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFD----GNGKILDCIG--SCPGFED 310
S++N ++P I + G RL+++DS +H I D N ++ IG G+ D
Sbjct: 397 GSLKNAKFNYPRQICFTKDG-RLYIADSQNHCIRSIDTTQGANARVTTPIGVPGSSGYAD 455
Query: 311 GEFESSKLMRPAASFYHKDDDCLYIVDSENHAIR 344
G E +K P D + +YI D +N IR
Sbjct: 456 GGVELAKFNIPMGVAVSSDGEKVYIADMKNQVIR 489
>gi|290975027|ref|XP_002670245.1| predicted protein [Naegleria gruberi]
gi|284083802|gb|EFC37501.1| predicted protein [Naegleria gruberi]
Length = 1363
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 38/75 (50%), Gaps = 3/75 (4%)
Query: 277 NRLFLSDSNHHRIIVFDGNGKILDCIGSCPGF--EDGEFESSKLMRPAASFYHKDDDCLY 334
+ ++++D+ +HRI D NG I G+ GF + G ++KL P D LY
Sbjct: 475 DEIYIADNLNHRIRFVDVNGNISTVAGTSIGFSGDSGLATAAKLNAPMDVSLSASGD-LY 533
Query: 335 IVDSENHAIRRADMG 349
I D +N+ IRR G
Sbjct: 534 IADRDNYRIRRVSSG 548
>gi|421097867|ref|ZP_15558546.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
str. 200901122]
gi|410799150|gb|EKS01231.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
str. 200901122]
Length = 357
Score = 39.7 bits (91), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 45/97 (46%), Gaps = 2/97 (2%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGN-GKILDCIGSCPGFEDGEFESSKLM 319
N L P + D N L++ + ++H I + N G + G G+ DG+ S++
Sbjct: 173 NSLFKSPFFMDLDRERN-LYVGELSNHAIRKINLNSGTVSTLSGGVLGYLDGDLASAQFK 231
Query: 320 RPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
P Y + D L + D +NH IR+ D+ + T+
Sbjct: 232 SPLGITYDQKTDSLLVADIQNHNIRKIDLKASTVSTL 268
>gi|319652355|ref|ZP_08006472.1| cell surface protein [Bacillus sp. 2_A_57_CT2]
gi|317396016|gb|EFV76737.1| cell surface protein [Bacillus sp. 2_A_57_CT2]
Length = 615
Score = 39.7 bits (91), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 42/81 (51%), Gaps = 8/81 (9%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P ++ D SGN L++ D +HRI+ D +GKIL GS G GEF P
Sbjct: 47 PVALAKDSSGN-LYMEDMGNHRIVKMDSSGKILGAFGSL-GHSPGEFN-----MPFGIAI 99
Query: 327 HKDDDCLYIVDSENHAIRRAD 347
++ + L + D+ N+ I++ D
Sbjct: 100 DREGNIL-VADTGNYRIQKFD 119
>gi|291223217|ref|XP_002731607.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 730
Score = 39.7 bits (91), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 39/83 (46%), Gaps = 15/83 (18%)
Query: 275 SGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDD---- 330
S N++ +SD +HRI++ D N K L C GS G DGEF ++ H D
Sbjct: 620 SKNQILVSDLGNHRILLLDANFKYLSCFGS-RGTGDGEF----------NYPHGIDVDGN 668
Query: 331 DCLYIVDSENHAIRRADMGRRVL 353
D +YI D N I + R L
Sbjct: 669 DNIYICDPNNRRICKYSPDGRFL 691
>gi|40062498|gb|AAR37450.1| NHL repeat domain protein [uncultured marine bacterium 105]
Length = 1281
Score = 39.7 bits (91), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 47/101 (46%), Gaps = 13/101 (12%)
Query: 273 DESGNRLFLSDSNHHRIIVFDGNGKILD------CIGSCPGFEDGE--FESSKLMRPAAS 324
DE RLF+SD +++RI+++D + L + P F E +S L P +
Sbjct: 117 DEKHQRLFVSDGDNNRILIWDVDPARLTETPNAMAVIGQPDFYQNEPHVGASGLNDPGSI 176
Query: 325 FYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKK 365
Y D L++VD NH I D+ LET G+S K
Sbjct: 177 HYDAATDRLFVVDGGNHRILTFDVTPESLET-----GMSAK 212
>gi|408828763|ref|ZP_11213653.1| hypothetical protein SsomD4_16351 [Streptomyces somaliensis DSM
40738]
Length = 614
Score = 39.7 bits (91), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 14/114 (12%)
Query: 264 LHFPGCISADESGNRLFLSDSNHHRI--IVFDGNGKILDCIGSCPGFEDGEFESSKLMRP 321
L FPG A ESGN L +SD+ HR+ + DG + G DG ES++ P
Sbjct: 186 LRFPGKALALESGN-LLVSDTTRHRLVEVAPDGETVVRRYGDGRRGLVDGPAESARFSEP 244
Query: 322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVY----------PTSGISKK 365
D + + D+ NHA+R D + T+ PTSG +++
Sbjct: 245 -QGMCALPDGRIVVADTVNHALRALDPETGAVSTLAGTGAQWMQGSPTSGPARE 297
>gi|290972172|ref|XP_002668832.1| predicted protein [Naegleria gruberi]
gi|284082360|gb|EFC36088.1| predicted protein [Naegleria gruberi]
Length = 2254
Score = 39.3 bits (90), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 55/114 (48%), Gaps = 5/114 (4%)
Query: 261 NLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGE---FESSK 317
N LL P ++ ++ G+ LF+++ HRI D G I G DG+ +S+
Sbjct: 50 NALLGGPSHLTFNQDGD-LFVAEYVFHRIRKIDRYGYISTIAGQMTKGSDGDGGLATASQ 108
Query: 318 LMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTSGISKKNNSLWA 371
+ P F+ L++ D+ NH IR+ D V+ T + GI K N+S+ +
Sbjct: 109 MNSPRGVFFDDVTGSLFVADTNNHKIRKID-NFGVISTDFDCFGIKKSNSSVCS 161
>gi|391336193|ref|XP_003742466.1| PREDICTED: RING finger protein nhl-1-like [Metaseiulus
occidentalis]
Length = 851
Score = 39.3 bits (90), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 45/85 (52%), Gaps = 16/85 (18%)
Query: 267 PGCIS-----ADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRP 321
PGC+ A S NR+ +SD+N+HR+ VFD NGK G+ G E+G F + R
Sbjct: 727 PGCLEHPSYIAVSSTNRVIVSDTNNHRVQVFDVNGKCQFTFGA-EGSEEGLF---RFPRG 782
Query: 322 AASFYHKDDDCLYIV--DSENHAIR 344
A DD YI+ DS N+ I+
Sbjct: 783 IAV-----DDQGYIIVSDSGNNRIQ 802
>gi|156370856|ref|XP_001628483.1| predicted protein [Nematostella vectensis]
gi|156215461|gb|EDO36420.1| predicted protein [Nematostella vectensis]
Length = 789
Score = 39.3 bits (90), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 22/90 (24%), Positives = 45/90 (50%), Gaps = 9/90 (10%)
Query: 267 PGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFY 326
P I D++G R+ ++D ++HR+ VF+ G+ GS P +L P + Y
Sbjct: 587 PNGIFVDQNG-RMIITDWHNHRVQVFNSEGRFQFAFGSSP--------QDQLKHPRDAIY 637
Query: 327 HKDDDCLYIVDSENHAIRRADMGRRVLETV 356
H+ + ++ D+ ++ ++ D + L T+
Sbjct: 638 HEPAEKFFVSDTGHNVLKVYDKKGKFLRTI 667
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.136 0.405
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,698,705,000
Number of Sequences: 23463169
Number of extensions: 485269471
Number of successful extensions: 1066441
Number of sequences better than 100.0: 467
Number of HSP's better than 100.0 without gapping: 162
Number of HSP's successfully gapped in prelim test: 305
Number of HSP's that attempted gapping in prelim test: 1064576
Number of HSP's gapped (non-prelim): 1672
length of query: 762
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 611
effective length of database: 8,816,256,848
effective search space: 5386732934128
effective search space used: 5386732934128
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)