BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 036974
(522 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255540187|ref|XP_002511158.1| conserved hypothetical protein [Ricinus communis]
gi|223550273|gb|EEF51760.1| conserved hypothetical protein [Ricinus communis]
Length = 534
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 285/543 (52%), Positives = 364/543 (67%), Gaps = 52/543 (9%)
Query: 7 ALGNEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEV 66
+L +ED++++ILSRLPALSF SA+CV+K WNKVC +ILS+PKLASALSL+PSLH AV EV
Sbjct: 9 SLVSEDVIENILSRLPALSFVSASCVSKCWNKVCVRILSRPKLASALSLNPSLHEAVDEV 68
Query: 67 LDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEIC 126
L KVL +PI PHF IA +G Q L THQL+T R G+R PVITNA +GIIGLDA DE+
Sbjct: 69 LGKVLLQPIVPHFVIACIGKQFSLEITHQLLTKRFGTRVPVITNAASGIIGLDAATDEVR 128
Query: 127 EVKWTLL------------EDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFS 174
EV+W +NLLN GIVL+VG+VPGLKVE IPLLRSK P+ +
Sbjct: 129 EVRWESSDDEDDNNDPDSEANNLLN-----RGIVLVVGFVPGLKVEAIPLLRSKTVPQPT 183
Query: 175 MVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCF 234
+VDKFL DI+++S S+S C+SP GIILFGD++ID+KP+LA MDY L EETV+VGDA+ CF
Sbjct: 184 LVDKFLTDIKNFSVSVSDCTSPAGIILFGDRSIDLKPVLARMDYALNEETVMVGDASGCF 243
Query: 235 LFKTGENSQNYNGALYFFDAVALVFSRD---SDNSNVPEIQFDITMSTGVLPFGPELKAV 291
L ++ +NS N G +Y DAVALVFS+D S +++ E QF IT+STG++PFGP+L+A+
Sbjct: 244 LCRSVDNSHNNYGDMYLLDAVALVFSKDKHKSHGADIGETQFHITLSTGLMPFGPQLQAI 303
Query: 292 SVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHI-DDKYPYLYIGVIHQR----GSLQ 346
V D S L+ARMEG +L+GE +L DI + D+ +P LYIGV+ QR G+
Sbjct: 304 CVIARGTDNSWLSARMEGQYDVLNGEGLLTDINDQFTDEDFPELYIGVVQQREYPIGAES 363
Query: 347 FGSRSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNAS 406
SR+ M+ YEV+G E+QFF++NGVGI+PGD F+FYHSDS TASSS D L L +
Sbjct: 364 TISRASMAFYEVMGGENQFFVINGVGIRPGDYFLFYHSDSGTASSSCSDAYRDLATLKSE 423
Query: 407 SC---CGTIGRNVT-------NANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFE 456
S C + VT KEVFGGLIFSC+ R + D
Sbjct: 424 STHKNCNNPLKEVTGSSSSSSGKEKEVFGGLIFSCYLRGEIFHPNVD------------- 470
Query: 457 SYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYT 516
S P NFP LAG++C GEIGRG + +EE+ S RC LH++S VYLV+SY
Sbjct: 471 SSPIHENFPGVALAGMYCNGEIGRGSSSSISQEDDEEN---SARCCLHYHSAVYLVLSY- 526
Query: 517 IPP 519
+PP
Sbjct: 527 VPP 529
>gi|224136209|ref|XP_002322272.1| predicted protein [Populus trichocarpa]
gi|222869268|gb|EEF06399.1| predicted protein [Populus trichocarpa]
Length = 541
Score = 462 bits (1190), Expect = e-127, Method: Compositional matrix adjust.
Identities = 259/544 (47%), Positives = 348/544 (63%), Gaps = 51/544 (9%)
Query: 7 ALGNEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHV----- 61
+L E+++Q+ILSRLPAL+FA AACVNK W K+C+QIL +PKLASALSL+PSLH
Sbjct: 14 SLVTEEIMQNILSRLPALAFAYAACVNKRWYKICSQILKRPKLASALSLNPSLHKTNRAH 73
Query: 62 ---------AVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAV 112
AV EV+++VLSEPIRPHFAIA + + L TH LI +LGS P ITN
Sbjct: 74 QFLCYDCKDAVEEVIEQVLSEPIRPHFAIACISKEFNLELTHGLIIKKLGSSIPFITNIA 133
Query: 113 TGIIGLDAHLDEICEVKW-TLLEDNLLNDFDHC-YGIVLIVGYVPGLKVETIPLLRSKEE 170
+GIIG+D DE+ E KW T D D G+VL+VG++PGLK+ TIPLLR +E
Sbjct: 134 SGIIGVDGIADELYEEKWETTTAGPNSQDSDRVDRGLVLLVGFLPGLKIGTIPLLRPMQE 193
Query: 171 PEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDA 230
++VDKF+MDI HY++++S C +P GII+FGD+ D+KPI++ MD +PEETVIVGDA
Sbjct: 194 SN-TLVDKFVMDILHYTSAVSDCPAPTGIIIFGDKTTDMKPIVSNMDCAMPEETVIVGDA 252
Query: 231 TSCFLFKTGENSQNYNGALYFFDAVALVFSRDS-DNSNVPEIQFDITMSTGVLPFGPELK 289
++ F+F+ G+NS N+ +F AVALVF+RD + EIQF +TMS GV+PFGP L+
Sbjct: 253 SANFIFRNGDNSLNHLAHTCYFQAVALVFARDRYKPEGIGEIQFHVTMSKGVMPFGPTLE 312
Query: 290 AVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDD--KYPYLYIGVIHQRGSLQ- 346
A SV + +++CS ++A+++G +G++ EIL D+K+ D K +YIGV + S
Sbjct: 313 AASVLQKDSECSWISAKLKGQNGIVAAGEILNDLKQQFRDANKSADIYIGVTKETISTND 372
Query: 347 ---FGSRSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLL 403
+ + YEV G ++F VNGVGI+PGDSF+FY SDS+TASS+ + L L
Sbjct: 373 SGIWTPGRCLDFYEVRGGGGRYFNVNGVGIQPGDSFLFYQSDSETASSTCDHAFNKLLAL 432
Query: 404 NASSCCGTIGRNV--------TNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
A +N + KEV GGLIFSC+ R D +
Sbjct: 433 KAE----LKSKNYLHLSKFADKDDKKEVLGGLIFSCYRRGESFF-----------GDPFV 477
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
+SYPFC +FP P+AG+FC GEIGRG L+N+E ED + S RC LH YST+YLVMSY
Sbjct: 478 DSYPFCDSFPTAPVAGLFCRGEIGRGP--ESLMNEEYEDVN-SPRCCLHVYSTIYLVMSY 534
Query: 516 TIPP 519
+PP
Sbjct: 535 -LPP 537
>gi|147863571|emb|CAN79767.1| hypothetical protein VITISV_019403 [Vitis vinifera]
Length = 527
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 262/531 (49%), Positives = 352/531 (66%), Gaps = 32/531 (6%)
Query: 3 GGSAALGNEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVA 62
GG AL +EDLLQ+ILSRLPALSFA+A CV++SW + +LS+PKLASA+SL+PS A
Sbjct: 6 GGGVALLSEDLLQNILSRLPALSFANAGCVSRSWRRAAGDVLSRPKLASAISLNPSFQDA 65
Query: 63 VSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHL 122
V EVLD VLS PIRPHFAIA +G++ L TH+LIT +LGS TPVIT+ GIIG DA
Sbjct: 66 VKEVLDSVLSRPIRPHFAIACIGLKFSLERTHKLITKKLGSATPVITSVARGIIGSDAIT 125
Query: 123 DEICEVKWTL-LED-NLLNDFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFL 180
+E EVKW + +ED NL + D GIVLIVG++PGLKV+ IPLLR EEP S++DKF+
Sbjct: 126 EEFKEVKWGVDVEDFNLPANKDR--GIVLIVGFMPGLKVDAIPLLRELEEPGISLIDKFV 183
Query: 181 MDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGE 240
MDIR++SA++SGC+SP GI++FGD++ D+KP+L +MDY + ETVI+G+ + F++++G+
Sbjct: 184 MDIRNFSAAVSGCTSPTGIVMFGDKHADMKPVLEKMDYAMSMETVILGEESGHFMYRSGD 243
Query: 241 NSQNYNGALY-FFDAVALVFSRDSDN-SNVPEIQFDITMSTGVLPFGPELKAVSVK---E 295
+S+N +G+L D VALVF+RD+D V E QF + +STGV+P GP LKA SVK +
Sbjct: 244 DSRNISGSLKNSCDGVALVFARDNDKPQGVGESQFHVALSTGVVPVGPTLKAASVKVKGD 303
Query: 296 HNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY--LYIGVIHQR----GSLQFGS 349
+ + LTAR EG L GE +L DI + ++++ LYIGV +R GS +
Sbjct: 304 GSERSTWLTARKEGLKEALDGERLLHDIYDEMENENASHDLYIGVTKRRKCSIGSEKVRW 363
Query: 350 RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCC 409
+ + ++VLG ++++ V+GVGIK GD F FY SDSDTA SS V + R L +
Sbjct: 364 VTTLEFHDVLGGDEEYLFVDGVGIKTGDPFRFYRSDSDTALSSCRHVSEEFRNLKQAWTH 423
Query: 410 GTIG--RNVTNA--NKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFP 465
R V + EV GG+IFSC+ R GD +V +S PF NFP
Sbjct: 424 KNSYHFRGVADGGDKTEVCGGIIFSCYGR---------GDSFFGQANV--DSSPFLENFP 472
Query: 466 ETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYT 516
PLAGI C GEIGR L+ +Q ++ S S R LH+YSTVYLV+S+T
Sbjct: 473 GFPLAGIMCGGEIGRVH-LSSADHQGGQEES-SPRSYLHYYSTVYLVISHT 521
>gi|359491092|ref|XP_002283895.2| PREDICTED: F-box/LRR-repeat protein At5g63520-like [Vitis vinifera]
gi|297734433|emb|CBI15680.3| unnamed protein product [Vitis vinifera]
Length = 527
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 261/531 (49%), Positives = 351/531 (66%), Gaps = 32/531 (6%)
Query: 3 GGSAALGNEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVA 62
GG AL +EDLLQ+ILSRLPALSFA+A CV++SW + +LS+PKLASA+SL+PS A
Sbjct: 6 GGGVALLSEDLLQNILSRLPALSFANAGCVSRSWRRAAGDVLSRPKLASAISLNPSFQDA 65
Query: 63 VSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHL 122
V EVLD VLS PIRPHFAIA +G++ L TH+LIT +LGS TPVIT+ GIIG DA
Sbjct: 66 VKEVLDSVLSRPIRPHFAIACIGLKFSLERTHKLITKKLGSATPVITSVARGIIGSDAIT 125
Query: 123 DEICEVKWTL-LED-NLLNDFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFL 180
+E EVKW + +ED NL + D GIVLIVG++PGLKV+ IPLLR EEP S++DKF+
Sbjct: 126 EEFKEVKWGVDVEDFNLPANKDR--GIVLIVGFMPGLKVDAIPLLRELEEPGISLIDKFV 183
Query: 181 MDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGE 240
MDIR++SA++SGC+SP GI++FGD++ D+KP+L +MDY + ETVI+G+ + F++++G+
Sbjct: 184 MDIRNFSAAVSGCTSPTGIVMFGDKHADMKPVLEKMDYAMSMETVILGEESGHFMYRSGD 243
Query: 241 NSQNYNGALY-FFDAVALVFSRDSDN-SNVPEIQFDITMSTGVLPFGPELKAVSVK---E 295
+S+N +G+L D VALVF+RD+D V E QF + +STGV+P GP KA SVK +
Sbjct: 244 DSRNISGSLKNSCDGVALVFARDNDKPQGVGETQFHVALSTGVVPVGPTHKAASVKVKGD 303
Query: 296 HNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY--LYIGVIHQR----GSLQFGS 349
+ + LTAR EG L GE +L DI + ++++ LYIGV +R GS +
Sbjct: 304 GSERSTWLTARKEGLKEALDGERLLHDIYDEMENENASHDLYIGVTKRRKCSIGSEKVRW 363
Query: 350 RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCC 409
+ + ++VLG ++++ V+GVGIK GD F FY SDSDTA SS V + R L +
Sbjct: 364 VTTLEFHDVLGGDEEYLFVDGVGIKTGDPFRFYRSDSDTALSSCRHVSEEFRNLKQAWTH 423
Query: 410 GTIG--RNVTNA--NKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFP 465
R V + EV GG+IFSC+ R GD +V +S PF NFP
Sbjct: 424 KNSYHFRGVADGGDKTEVCGGIIFSCYGR---------GDSFFGQANV--DSSPFLENFP 472
Query: 466 ETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYT 516
PLAGI C GEIGR L+ +Q ++ S S R LH+YSTVYLV+S+T
Sbjct: 473 GFPLAGIMCGGEIGRVH-LSSADHQGGQEES-SPRSYLHYYSTVYLVISHT 521
>gi|224136205|ref|XP_002322271.1| f-box family protein [Populus trichocarpa]
gi|222869267|gb|EEF06398.1| f-box family protein [Populus trichocarpa]
Length = 533
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 235/524 (44%), Positives = 322/524 (61%), Gaps = 34/524 (6%)
Query: 10 NEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEVLDK 69
NEDL+ +IL RLPALSFASAACV+KSWN++CNQIL KPK ASA SL+P VA+ EV++K
Sbjct: 23 NEDLILNILKRLPALSFASAACVSKSWNQICNQILYKPKFASAFSLNPDEKVALEEVVNK 82
Query: 70 VLSEPIRPHFAIASV-GMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEICEV 128
VLSEPIRPHFAIA+V G L+ + +LG +TP+I + +GI+G DA DE EV
Sbjct: 83 VLSEPIRPHFAIANVIGSGVDLSERLNFLATKLGFQTPIIVSCTSGIMGRDAVTDEHREV 142
Query: 129 KWTLLEDNLLN-DFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHYS 187
+LE+ ++ + + C GI+L VG++PGLKV+ IPL + ++ +MVD F+MDI+ Y+
Sbjct: 143 ---MLEEYWVDGESNPCNGIILTVGFLPGLKVDAIPLFQPRKGCRATMVDNFVMDIKDYA 199
Query: 188 ASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQN-YN 246
SISGC+SP GII+FGD++ D KP++ ++D+ + +T+I+GD + FL++ G S+N Y
Sbjct: 200 TSISGCASPVGIIMFGDEDADQKPVMEKLDHAMSSDTIIIGDERAQFLYRNGVESRNDYE 259
Query: 247 GALYFFDAVALVFSRDSDN-SNVPEIQFDITMSTGVLPFGPELKAVSVKE---HNADCSL 302
+ YF AVALVF+RD D EIQF +S+GV GP KAVSVK+ +
Sbjct: 260 SSEYFSAAVALVFARDRDKPCGTGEIQFHAALSSGVSAVGPRYKAVSVKKIVSGTGHTTW 319
Query: 303 LTARMEGYDGLLHGEEILEDIKEHIDDK--YPYLYIGVIHQR----GSLQFGSRSYMSLY 356
LTAR EG + G+ IL+DI + ++ +P LYIGV QR GS + +++ +
Sbjct: 320 LTARREGEHEIQDGQRILDDINNELVNQVGHPDLYIGVTEQRRCFIGSQKSRVMTFLVFH 379
Query: 357 EVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLN----ASSCCGTI 412
V+G + ++ +GVGI+ GD F FYH D A SS +V R LN + +C
Sbjct: 380 GVMGGDQEYLFADGVGIRTGDYFQFYHPDPSAALSSCSNVSKNFRNLNLDWSSRNCLHAR 439
Query: 413 GRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGI 472
G NKE+ GG +FSC R E + D S PF NFP P+AGI
Sbjct: 440 GVYDNVCNKELVGGFVFSCCGRGESFFERCNVD-----------SSPFLDNFPGFPMAGI 488
Query: 473 FCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYT 516
FC GEIGRG + +EE S C LH YS VYL++SYT
Sbjct: 489 FCRGEIGRGFSVFNADEGQEERTS---HCCLHVYSAVYLLVSYT 529
>gi|224122060|ref|XP_002318743.1| predicted protein [Populus trichocarpa]
gi|222859416|gb|EEE96963.1| predicted protein [Populus trichocarpa]
Length = 551
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 238/550 (43%), Positives = 330/550 (60%), Gaps = 54/550 (9%)
Query: 10 NEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEVLDK 69
NEDL+Q+I+ RLPA SFASAACV+KSWN++CNQILSKPK ASA SL+P+ VA+ EV++K
Sbjct: 19 NEDLVQNIVKRLPASSFASAACVSKSWNQICNQILSKPKFASAFSLNPNEKVALEEVVNK 78
Query: 70 VLSEPIRPHFAIASV-GMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEICEV 128
VLSEPIRPHFAIA+V G L + +LGS+TP+I + +GI+G DA E EV
Sbjct: 79 VLSEPIRPHFAIANVIGSGVDLREKLDFLATKLGSQTPIIVSCASGIMGRDAVTGEHREV 138
Query: 129 KWTLLEDNLLN-DFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHYS 187
+LE+ + + C+GI+L VG++PGLKV+ IPLL+ ++ ++VD F+M+IR Y+
Sbjct: 139 ---MLEEYWADGESISCFGIILTVGFLPGLKVDVIPLLQPRKVHRPALVDYFVMNIRDYA 195
Query: 188 ASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQN-YN 246
AS+SG +SP GIILFGD+ D KP++ ++D+ + +TVIVGD + FL+++G S+N Y
Sbjct: 196 ASVSGWASPAGIILFGDEGADQKPVMEKLDHAMSRDTVIVGDERAQFLYRSGVESRNDYG 255
Query: 247 GALYFFDAVALVFSRDSDN----------------------SNVPEIQFDITMSTGVLPF 284
+ YF AVALVF+RD D EIQF +S+GV
Sbjct: 256 SSEYFPAAVALVFARDRDKPCGIGLLISVFAISSFATKMDWQCTGEIQFHAALSSGVSAI 315
Query: 285 GPELKAVSVKEHNAD---CSLLTARMEGYDGLLHGEEILEDIKEHIDDK--YPYLYIGVI 339
GP KAVSV++ ++ +LLTAR EG + G+ IL+DI + ++ P LYIGV
Sbjct: 316 GPRYKAVSVRKIGSETGCTTLLTARREGEQEIQDGQRILDDINNELVNQIGRPDLYIGVT 375
Query: 340 HQR----GSLQFGSRSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
QR GS + +++ + V+G + ++ +GVGI+ GD F FYHSD TA SS +
Sbjct: 376 EQRKCFIGSEKSRVMTFLVFHGVMGGDQEYLFADGVGIRTGDYFQFYHSDPTTALSSCNE 435
Query: 396 VLDGLRLLN---ASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDND 452
V R L +S C G + +KE+ GG +FSC R E + D
Sbjct: 436 VSKNFRKLKLDWSSRNCLQAGVSDNVCSKELVGGFVFSCCGRGESFFERCNVD------- 488
Query: 453 VYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLV 512
S PF NFP P+AG+FC GEIGRG ++N +E + C LH YST YL+
Sbjct: 489 ----SSPFLDNFPGVPMAGVFCRGEIGRG---FSVLNADEGPEERTLHCCLHVYSTAYLL 541
Query: 513 MSYTIPPLLH 522
+SYT P H
Sbjct: 542 VSYTPAPAEH 551
>gi|224115878|ref|XP_002317147.1| predicted protein [Populus trichocarpa]
gi|222860212|gb|EEE97759.1| predicted protein [Populus trichocarpa]
Length = 558
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 243/532 (45%), Positives = 327/532 (61%), Gaps = 41/532 (7%)
Query: 10 NEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEVLDK 69
NEDL+Q+IL R PA SFASAACV KSWN+ CNQILSKPKLASA SL+P VA EV++K
Sbjct: 49 NEDLVQNILKRTPASSFASAACVCKSWNQTCNQILSKPKLASAFSLNPDQKVASQEVVNK 108
Query: 70 VLSEPIRPHFAIASV-GMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEICEV 128
VLSEPIRP FAIA+V G L+ T + A+LGS+TP+I + GIIG DA DE EV
Sbjct: 109 VLSEPIRPQFAIANVIGSGVDLSETLNFLAAKLGSKTPIIVSCANGIIGRDAVTDEHQEV 168
Query: 129 KWTLLEDNLLN--DFDHCYGIVLIVGYVPGLKVETIPLLR-SKEEPEFSMVDKFLMDIRH 185
+LED + + +G++L VG++PGL+VE IPLLR K ++VDKF+MDIR+
Sbjct: 169 ---MLEDFWADAASKNSGFGVLLTVGFLPGLQVEAIPLLRPRKAASRMALVDKFVMDIRN 225
Query: 186 YSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQNY 245
Y+A++SG +SP +I+FG + + KP++ ++D+ + ET I GD + FL+K+G S+N
Sbjct: 226 YAANVSGSTSPALVIMFGGEKAEQKPVMEKLDHAMSRETFIAGDERAQFLYKSGIESRNV 285
Query: 246 NGA--LYFFDAVALVFSRDSDN-SNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCSL 302
+G+ Y DAV LVF+RD S+V EIQF +S+GV GP K VSVKE + L
Sbjct: 286 HGSGNEYISDAVVLVFARDRHRASDVGEIQFHSALSSGVSTIGPRYKVVSVKEIQPETDL 345
Query: 303 LT---ARMEGYDGLLHGEEILEDIKEHIDDKYPYLYIGVIHQR----GSLQFGSRSYMSL 355
T AR EG +L G+ I++DI + +K L+IGV QR GS ++
Sbjct: 346 TTCLKARREGEQEILGGQRIIDDINNELVNKTE-LFIGVSKQRQCVIGSENPKLLRSLAF 404
Query: 356 YEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLR--LLNASSC---CG 410
+EV G + + V+G GI GD F FYHSDS A S++ +V R L+ SS G
Sbjct: 405 HEVKGGDGEHLFVSGDGIGSGDYFHFYHSDSKAALSATSNVSKNFRNLKLDWSSSQLHAG 464
Query: 411 TIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLA 470
+G +KEV GGL+FSC+ R E G + D S PF NFP P+A
Sbjct: 465 GVG------SKEVVGGLVFSCWGR----GESFFGHSNVD-------SSPFLDNFPGIPMA 507
Query: 471 GIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYTIPPLLH 522
GIFCYGE+GRG + + E+++ S C LH YST+Y+++SYT PL H
Sbjct: 508 GIFCYGEVGRGFTMLNADDHEDQEEKTSC-CCLHVYSTIYVLVSYTPAPLKH 558
>gi|356516535|ref|XP_003526949.1| PREDICTED: F-box/LRR-repeat protein At5g63520-like [Glycine max]
Length = 540
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 246/532 (46%), Positives = 328/532 (61%), Gaps = 41/532 (7%)
Query: 5 SAALGNEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVS 64
S ++ NEDLLQ+IL+RLP+L FASAACV+KSWN +C++ILS+PKL+SA+SL+PSL AV+
Sbjct: 28 SLSMLNEDLLQNILARLPSLHFASAACVSKSWNSLCSRILSRPKLSSAISLNPSLPDAVN 87
Query: 65 EVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDE 124
EV+ KVLSEPIRPHFAIA++G A T LI LG PVI GI+G DA DE
Sbjct: 88 EVVHKVLSEPIRPHFAIANIGTGFNTAKTLCLIRKSLGFNIPVIVTVANGIMGRDAVTDE 147
Query: 125 ICEVKWTLL-----EDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKF 179
EVKW L E++ + G+VL VGY+PGLKVE +PL R + + + VD F
Sbjct: 148 FKEVKWGALFSGFGEESYTRFINE--GLVLTVGYLPGLKVEALPLRRPTKTSQATWVDNF 205
Query: 180 LMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLF--K 237
+ DI+ YSAS+S P GIILFG+ + D+K +L ++D+ +P + VIVGD F F K
Sbjct: 206 IKDIKEYSASVSSSPFPVGIILFGEASSDMKLVLEKLDHAMPMDMVIVGDERGSFDFVHK 265
Query: 238 TGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPFGPELKAVSVKEHN 297
+G +S+ +AVALVF++D D S + I+F + +S GV GP KA SVK +N
Sbjct: 266 SGNDSRIICSKKGNIEAVALVFAQDRDRS-LGTIRFHVALSNGVSTVGPRYKAASVKSNN 324
Query: 298 ADCS-LLTARMEGYDGLLHGEEILEDIKEHIDD--KYPYLYIGVIHQR----GSLQFGSR 350
ADCS LTAR EG L G+ IL DI +D+ + P LYIGVI R G+ + R
Sbjct: 325 ADCSTWLTARREGQQENLDGQSILLDINNLLDNHIESPDLYIGVIKHRKLSTGAEKPMPR 384
Query: 351 SYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLL----NAS 406
+ ++ + V+G ++++ V+G+GIK GD F FY+SD +TA +S V D L+ + N+
Sbjct: 385 TCIAYHGVVGGDEEYLYVDGIGIKTGDIFQFYYSDPNTALASLTKVHDALKSIHLEKNSK 444
Query: 407 SCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPE 466
S G G N TN VFGG++F+C+ R E G + D S PF NFP
Sbjct: 445 SSKGD-GDNATN----VFGGIVFACYGR----GESFFGRHNVD-------SSPFLENFPG 488
Query: 467 TPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYTIP 518
P++GIFC GE+ R T +I Q E IS C LH YSTVYL MSYT P
Sbjct: 489 VPVSGIFCGGEM--VRPCTTVIGQCEGASPIS--CCLHVYSTVYLAMSYTPP 536
>gi|224122064|ref|XP_002318744.1| predicted protein [Populus trichocarpa]
gi|222859417|gb|EEE96964.1| predicted protein [Populus trichocarpa]
Length = 465
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 225/473 (47%), Positives = 297/473 (62%), Gaps = 31/473 (6%)
Query: 62 AVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAH 121
AV EV+++VLSEPIRPHFAIA + + L H LI +LGSR P+ITN +GIIG+D
Sbjct: 8 AVKEVIEQVLSEPIRPHFAIACISKEFNLELAHGLIIEKLGSRIPIITNVSSGIIGVDGI 67
Query: 122 LDEICEVKWTLLEDNLLNDFDHC-YGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFL 180
DE+ E KW + + D G+VL+VG++PGLK+ TIPLL+ ++E ++VDKF+
Sbjct: 68 ADELFEEKWETTSGPNIQESDTAERGLVLLVGFLPGLKIGTIPLLQPRQESN-TLVDKFV 126
Query: 181 MDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGE 240
MDI HY+A++S C++P GII+FGD+ D+KPI+A+MD +PEETVIVGDA++ F+F+TG+
Sbjct: 127 MDILHYTAAVSDCAAPAGIIMFGDKTTDMKPIVAKMDCAMPEETVIVGDASADFIFRTGD 186
Query: 241 NSQNYNGALYFFDAVALVFSRDS-DNSNVPEIQFDITMSTGVLPFGPELKAVSVKEHNAD 299
+S N F AVALVF+RD + EIQF +T STGVLPFGP LKAV V +++
Sbjct: 187 DSLNQLVYTCCFQAVALVFARDRYKPEGLGEIQFHVTKSTGVLPFGPNLKAVCVVPKDSE 246
Query: 300 CSLLTARMEGYDGLLHGEEILEDIKEHID--DKYPYLYIGVIH--QR----GSLQFGSRS 351
S L AR+EG DG++ IL +IK+ D + LYIGV QR G L G
Sbjct: 247 RSCLFARLEGQDGIMAAGAILNEIKQQFREADTFADLYIGVTKETQRTSDSGILTPGKS- 305
Query: 352 YMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGT 411
+ Y+V+G + +F VNG+GI+ GDSF+FY SDS TASSS + L L A
Sbjct: 306 -LDFYKVIGGGEYYFTVNGIGIRTGDSFLFYQSDSATASSSCDHAFNKLLALKAELKSKN 364
Query: 412 IGRNVTNANK----EVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPET 467
R A+K EV GG IFSC+ R D + +SYPFC NFP
Sbjct: 365 YLRLSNLADKDDKEEVLGGFIFSCYHRGESFF-----------GDTFVDSYPFCNNFPTA 413
Query: 468 PLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYTIPPL 520
P+AG+FC GEI RG L+N+E +D S RC +H YST+YLVMSY PPL
Sbjct: 414 PVAGLFCRGEIARGP--KSLMNEEYDD-ETSPRCCVHVYSTIYLVMSYLPPPL 463
>gi|118488987|gb|ABK96301.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 533
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 233/531 (43%), Positives = 326/531 (61%), Gaps = 38/531 (7%)
Query: 10 NEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEVLDK 69
NEDL Q+IL R+PALSFASAACV+KSWN+ CNQIL KPKLAS+ SL+P VA+ EV++K
Sbjct: 23 NEDLFQNILKRIPALSFASAACVSKSWNRNCNQILYKPKLASSFSLNPVQKVALEEVVNK 82
Query: 70 VLSEPIRPHFAIASV-GMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEICEV 128
VLSEPIRP FAIA+V G L+ + A+LGS+TP+I + GI+G DA DE EV
Sbjct: 83 VLSEPIRPQFAIANVIGSGVHLSGMLDFLAAKLGSKTPIIVSCAGGIMGRDAVTDEYKEV 142
Query: 129 KWTLLEDNLLNDF-DHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHYS 187
++ED ++ + +GI+L VG++PGLKV+ IPLLR ++ +MVDKF+MDIR+Y+
Sbjct: 143 ---MIEDFWVDGASNSSFGIMLSVGFLPGLKVDAIPLLRPRKARGVAMVDKFVMDIRNYA 199
Query: 188 ASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQN-YN 246
A +S +SP+ II+FG + D KP++ ++D+ + ET++VGD + FL+++G S+N Y
Sbjct: 200 ALVSDSTSPSLIIMFGSEKTDQKPVMEKLDHAMSRETIVVGDERAQFLYRSGIESRNVYY 259
Query: 247 GAL--YFFDAVALVFSRDSDN-SNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCSL- 302
G++ YF DAVALVF+RD + S EI F +S+GV GP KAVS E ++ L
Sbjct: 260 GSVDHYFSDAVALVFARDQNRPSGTGEIHFHSALSSGVSAIGPRFKAVSANEIESETGLS 319
Query: 303 --LTARMEGYDGLLHGEEILEDIKEHIDDKYPYLYIGVIHQR----GSLQFGSRSYMSLY 356
LT R E +L G+ I++DI + ++ L+IGV QR G + ++ +
Sbjct: 320 TWLTVRREAEQEILGGQRIIDDINNELGNQTK-LFIGVSEQRKCFVGPEKPRQMRSLAFH 378
Query: 357 EVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNA----SSCCGTI 412
EV+G +++ V+GVGIK GD F YH D A SS ++ R L SC
Sbjct: 379 EVMGGDEEHLFVDGVGIKTGDYFHLYHPDPSAALSSCSNISKNFRNLKLDWSFRSCQLHA 438
Query: 413 GRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGI 472
R V KEV GG +F+C+ R G+ N+V +S PF NFP +AGI
Sbjct: 439 ARGV--GEKEVIGGFVFACWGR---------GESFFGHNNV--DSSPFLDNFPGVLMAGI 485
Query: 473 FCYGEIGRGRGLTRLINQEEEDCSISGRCL-LHHYSTVYLVMSYTIPPLLH 522
F YGEIGRG ++N +E + C +H YSTVYL++SYT P+ H
Sbjct: 486 FTYGEIGRG---FSILNTDESGQEVKTLCFCVHVYSTVYLLVSYTPAPIEH 533
>gi|356508825|ref|XP_003523154.1| PREDICTED: F-box/LRR-repeat protein At5g63520-like [Glycine max]
Length = 538
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 246/526 (46%), Positives = 323/526 (61%), Gaps = 40/526 (7%)
Query: 10 NEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEVLDK 69
NEDLLQ+IL+RLPAL FASAACV+KSWN +CN+IL++PKL+SA+SL+PSL AV+EV+ K
Sbjct: 32 NEDLLQNILARLPALHFASAACVSKSWNSLCNRILTRPKLSSAISLNPSLPDAVNEVVHK 91
Query: 70 VLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEICEVK 129
VLSEPIRPHFAIA++G A T LI LGS PVI +GI+G DA DE EVK
Sbjct: 92 VLSEPIRPHFAIANIGTGFSTAKTLCLIRQSLGSNIPVIVTVASGIMGRDAVTDEFKEVK 151
Query: 130 WTLL-----EDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIR 184
W L E++ + G+VL VGY+PGLKVE +PL R + VD F+ DI+
Sbjct: 152 WGALFSGFGEESYTRFINE--GLVLTVGYLPGLKVEAVPLRRPTKTQAI-WVDNFIKDIK 208
Query: 185 HYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLF--KTGENS 242
YSAS+S P GIILFG+ + D+K +L ++D+ +P +TVIVGD F F K+G +S
Sbjct: 209 EYSASVSSSPFPVGIILFGEASSDMKLVLEKLDHAMPMDTVIVGDERGSFDFVHKSGNDS 268
Query: 243 QNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCS- 301
+ +AVALVF+RD N ++ I+F + +S GV GP KA SVK +NADCS
Sbjct: 269 RIICSKKGNIEAVALVFARDR-NRSLGTIRFHVALSNGVSTVGPRYKAASVKSNNADCST 327
Query: 302 LLTARMEGYDGLLHGEEILEDIKEHIDD--KYPYLYIGVIHQR----GSLQFGSRSYMSL 355
LTAR EG L G+ IL DI +D+ + P L+IGVI R G+ + R+ +S
Sbjct: 328 WLTARREGQQENLDGQSILLDINNLLDNHVESPDLHIGVIKHRKLSTGAEKPMPRTCISY 387
Query: 356 YEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTI--- 412
+ V+G ++++ V+G+GIK GD F FY+SD + A +S V D L+ + +
Sbjct: 388 HGVVGGDEEYLYVDGIGIKTGDFFQFYYSDPNIALASLTKVHDALKSIKLEKKSKSSKGD 447
Query: 413 GRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGI 472
G N TN VFGG+IF+C+SR E G + D S PF NFP P++GI
Sbjct: 448 GDNATN----VFGGIIFACYSR----GESFFGRQNVD-------SSPFLENFPGVPVSGI 492
Query: 473 FCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYTIP 518
FC GE+ R T +I Q E IS C LH YSTVYL MSYT P
Sbjct: 493 FCGGEM--VRPCTTVIGQCEGASPIS--CCLHVYSTVYLAMSYTPP 534
>gi|224118086|ref|XP_002331554.1| predicted protein [Populus trichocarpa]
gi|222873778|gb|EEF10909.1| predicted protein [Populus trichocarpa]
Length = 530
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 235/532 (44%), Positives = 325/532 (61%), Gaps = 39/532 (7%)
Query: 10 NEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEVLDK 69
NEDL Q+IL R+PALSFASAACV+KSW++ CNQIL KPKLASA SL+P VA+ EV++K
Sbjct: 19 NEDLFQNILKRIPALSFASAACVSKSWSRNCNQILYKPKLASAFSLNPVQKVALEEVVNK 78
Query: 70 VLSEPIRPHFAIASV-GMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEICEV 128
VLSEPIRP FAIA+V G L+ + A+LGS+TP+I + GI+G DA DE EV
Sbjct: 79 VLSEPIRPQFAIANVIGSGVDLSGILDFLAAKLGSKTPIIVSCAGGIMGRDAVTDEYKEV 138
Query: 129 KWTLLEDNLLNDF-DHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHYS 187
++ED ++ + +GI+L VG++PGLKV+ IPLLR ++ +MVDKF+MDIR+Y+
Sbjct: 139 ---MIEDFWVDGASNSSFGIMLAVGFLPGLKVDAIPLLRPRKAQGVAMVDKFVMDIRNYA 195
Query: 188 ASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQN-YN 246
A +S +SP+ II+FG + D KP++ ++D+ + ET++VGD + FL+++G S+N Y
Sbjct: 196 ALVSDSTSPSLIIMFGSEKTDQKPVMEKLDHAMSRETIVVGDERAQFLYRSGIESRNVYY 255
Query: 247 GAL--YFFDAVALVFSRDSDN-SNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCSL- 302
G++ YF DAVALVF+RD + S EI F +S+GV GP KAVS E ++ L
Sbjct: 256 GSVDQYFSDAVALVFARDQNRPSGTGEIHFHSALSSGVSAIGPRFKAVSANEIESETGLS 315
Query: 303 --LTARMEGYDGLLHGEEILEDIKEHIDDKYPYLYIGVIHQR----GSLQFGSRSYMSLY 356
L+ R EG +L G+ I++DI + ++ L+IGV QR G + ++ +
Sbjct: 316 TWLSVRREGGQEILGGQRIIDDINNELGNQTK-LFIGVSEQRKCFVGPEKPRQMRSLAFH 374
Query: 357 EVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNA----SSCCGTI 412
EV+G + + V+GVGIK GD F YH D A SS ++ R L SC
Sbjct: 375 EVMGGDVEHLFVDGVGIKTGDYFHLYHPDPSAALSSCSNISKNFRNLKLDWSFRSCQLHA 434
Query: 413 GRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGI 472
R V KEV GG +F+C+ R E G + D S PF NFP P+AGI
Sbjct: 435 ARGV--GEKEVIGGFVFACWGR----GESFFGHSNVD-------SSPFLDNFPGVPMAGI 481
Query: 473 FCYGEIGRGRG-LTRLINQEEEDCSISGRCL-LHHYSTVYLVMSYTIPPLLH 522
F YGEIGRG L +ED ++ C +H YSTVYL++SYT P+ H
Sbjct: 482 FTYGEIGRGFSILNTDFESGQEDKTL---CFCVHVYSTVYLLVSYTPAPIEH 530
>gi|224115874|ref|XP_002317146.1| predicted protein [Populus trichocarpa]
gi|222860211|gb|EEE97758.1| predicted protein [Populus trichocarpa]
Length = 637
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 234/521 (44%), Positives = 321/521 (61%), Gaps = 50/521 (9%)
Query: 10 NEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEVLDK 69
NEDL+Q+IL R PA SFASAACV+KSWN CNQILSKPKLASA SL+P VA+ EV+ K
Sbjct: 19 NEDLVQNILKRTPATSFASAACVSKSWNHNCNQILSKPKLASAFSLNPDPKVALQEVVSK 78
Query: 70 VLSEPIRPHFAIASV---GMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEIC 126
VLSEPIRP FAIA+V G++ L+ T L+ A+LGS+TP+I + GIIG DA E
Sbjct: 79 VLSEPIRPQFAIANVIESGVE-YLSETLYLLAAKLGSKTPIIVSCTNGIIGRDAVTSEHK 137
Query: 127 EVKWTLLEDNLLN--DFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIR 184
EV +LED ++ + +G++L VGY+PGLKVE +PLLR ++ +M+D F+MDI+
Sbjct: 138 EV---MLEDFWVDAASKNSGFGMLLTVGYLPGLKVEALPLLRPRKAGPVAMIDNFVMDIK 194
Query: 185 HYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQN 244
+YSAS+SG +SP II+FG + D+KP++ ++D+ + ET+I G S FL++ G S+N
Sbjct: 195 NYSASVSGSTSPALIIMFGGEEADLKPVMEKLDHAMSRETIIAGGMRSQFLYRRGIESRN 254
Query: 245 YNGA--LYFFDAVALVFSRDSDN-SNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCS 301
G+ YF DAVALVF+RD D S +IQF +S+GV GP KAVSVKE ++
Sbjct: 255 IYGSSTKYFTDAVALVFARDEDKPSGEGKIQFHSAISSGVSAIGPRYKAVSVKETQSETG 314
Query: 302 L---LTARMEGYDGLLHGEEILEDIKEHIDDKYPYLYIGVIHQRGSLQFGSRS-----YM 353
L LT+R EG +L G+ I++ I+ + +K L+IGV QR S+ GS + +
Sbjct: 315 LTTWLTSRREGEQEILGGQMIIDSIESELVNKTE-LFIGVSKQRQSV-IGSENPKLLRSL 372
Query: 354 SLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLR--LLNASSC--- 408
+L++V G + + V+G GI GD F FYHSD A S++ +V R L+ SC
Sbjct: 373 ALHQVKGGDGEHLFVSGDGIGSGDYFHFYHSDPKAALSATSNVSKYFRNLKLDWRSCQLH 432
Query: 409 CGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETP 468
G +G +KEV GGL+FSC+ R + D S PF NFP P
Sbjct: 433 AGDVG------SKEVVGGLVFSCWGRGASFFGHSNVD-----------SSPFLDNFPGIP 475
Query: 469 LAGIFCYGEIGRGRGLTRL---INQEEED---CSISGRCLL 503
+AGIF GE+GRG + ++QEE+ C SG+ LL
Sbjct: 476 MAGIFGCGEVGRGFTMLNADDHVDQEEKTSCCCLHSGKNLL 516
>gi|255540189|ref|XP_002511159.1| conserved hypothetical protein [Ricinus communis]
gi|223550274|gb|EEF51761.1| conserved hypothetical protein [Ricinus communis]
Length = 523
Score = 367 bits (943), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 224/521 (42%), Positives = 301/521 (57%), Gaps = 35/521 (6%)
Query: 10 NEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEVLDK 69
+ED+++ IL +LPALS ASAACV KSW N+ILS+PKLASA+SL+PSL +A+ EV+DK
Sbjct: 22 SEDIVEKILRKLPALSLASAACVCKSWYHNSNRILSRPKLASAISLNPSLDIALQEVVDK 81
Query: 70 VLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEICEVK 129
VLSE IRPHFAIAS G +KL L + GSRTP+I GI+G DA +E
Sbjct: 82 VLSESIRPHFAIAS-GFGNKLG----LEKRKFGSRTPLIVTWANGIMGRDAVTNEDDSSN 136
Query: 130 WTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHYSAS 189
+ D + G +L VG+VPGLKV+ IP LR M+D F+MDIR+Y+ S
Sbjct: 137 YDGDGDGDDEHMEINSGFLLTVGFVPGLKVDVIPYLRKIRPAPMEMIDIFVMDIRNYTTS 196
Query: 190 ISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQNYNGAL 249
SGC+SP GII+F ++ D+KPI+ ++DY + +ET+IVGD + FL+++ +S N
Sbjct: 197 ASGCTSPVGIIMFASEDFDLKPIMEKLDYAMSKETIIVGDERTKFLYRSRIDSTN----- 251
Query: 250 YFFDAVALVFSRDSDNSN-VPEIQFDITMSTGVLPFGPELKAVSVKEHNAD-CSLLTARM 307
F A+ALVF++D + + + EIQF +S GV GP K S +E D + LTAR
Sbjct: 252 PFAKAIALVFAKDREKPHGLGEIQFHAALSNGVSAIGPRYKTASAREAFHDRNTWLTARQ 311
Query: 308 EGYDGLLHGEEILEDIKEHIDDKY--PYLYIGVIHQRGSLQFGSR----SYMSLYEVLGA 361
EG +L G+ IL DI + ++++ LYIGV R + S +S Y V+G
Sbjct: 312 EGQPEILDGQRILNDINDELENRIGDTDLYIGVTELRKRRIRKEKPRLMSSLSFYGVMGG 371
Query: 362 EDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNAN- 420
++++ V+G+GI+ D F FYHSD A SS +V LR L C AN
Sbjct: 372 DEEYLFVHGIGIRTADYFQFYHSDPSAALSSCRNVSANLRNLRLDWSCKKYLYPTDGANE 431
Query: 421 --KEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEI 478
KE GG IFSC R + D S PF NFP PLAGIFC GEI
Sbjct: 432 FKKECIGGFIFSCCGRGEAFFGSPNVD-----------SSPFLENFPGVPLAGIFCGGEI 480
Query: 479 GRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYTIPP 519
GR ++ ++ EE S RC LH +S+VYLV+SYT P
Sbjct: 481 GRSFSISNTLDDREE--STPSRC-LHVFSSVYLVLSYTPSP 518
>gi|357465029|ref|XP_003602796.1| F-box/LRR-repeat protein [Medicago truncatula]
gi|355491844|gb|AES73047.1| F-box/LRR-repeat protein [Medicago truncatula]
Length = 579
Score = 364 bits (934), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 227/574 (39%), Positives = 323/574 (56%), Gaps = 85/574 (14%)
Query: 10 NEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEVLDK 69
++DLL +I +RLPA+SFASA CVNKSWN VCN+I+S+PKLASALSL+PSL AV+EV+DK
Sbjct: 30 SDDLLLNIFTRLPAISFASATCVNKSWNSVCNRIISRPKLASALSLNPSLRDAVNEVVDK 89
Query: 70 VLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEICEVK 129
VLSEPIRP+FAI ++G + +L+ R+G PV+ GIIG DA DE EVK
Sbjct: 90 VLSEPIRPYFAIVNIGCGFDPSKILRLVKRRVGFNIPVVVTVNNGIIGRDAVTDEFKEVK 149
Query: 130 WTLLEDNLLNDFDHCY----GIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRH 185
W L ++D ++ GIVL +G +PGLKVE IPL+R + P+ VD F MDI+
Sbjct: 150 WGALFSG-IDDEEYARHINEGIVLTIGCLPGLKVEAIPLIRPAKTPQEPCVDSFSMDIKE 208
Query: 186 YSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQNY 245
YSAS+SG P GIILFG+ + D+K ++ ++DY +P +TV+VGD C +F+ G +S++
Sbjct: 209 YSASVSGHQFPVGIILFGEASSDMKLVMEKLDYAMPMDTVVVGDERGCSVFRCGNDSRHA 268
Query: 246 NGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCS-LLT 304
G+ +AVALVF++D + S+ I+F + S GV P G KA SV+ + +DCS LT
Sbjct: 269 CGSKGCIEAVALVFAQDRNRSS-GNIRFHVAFSNGVSPVGGRYKAASVRTNKSDCSTWLT 327
Query: 305 ARMEGYDGLLHGEEILEDIKEHIDD--KYPYLYIGVIHQR----GSLQFGSRSYMSLYEV 358
A+ EG+ L G+ IL DI +++ + P LYIGV R G+ + R+ ++ + V
Sbjct: 328 AKREGHQQPLDGQTILHDINTLLENHIEPPELYIGVTKHRKVSIGAEKPMPRTCIAYHGV 387
Query: 359 LG--------------------------------------------AEDQFFIVNGVGIK 374
+G ++++ V+G+GIK
Sbjct: 388 VGWVVPIVEKMIESHFRWFGHALKRPKELIKRIDEVEAISFHVLDRGDEEYLYVDGMGIK 447
Query: 375 PGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKE------VFGGLI 428
GD F FYHSD + A +S +V + +GRN ++ + VFGG++
Sbjct: 448 TGDIFQFYHSDPNVALASLTEVRGSFKKFK-------LGRNSRSSENDGDNAINVFGGIV 500
Query: 429 FSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLI 488
F+C+ R + D S PF NFP PLAG+FC GE+ R T +I
Sbjct: 501 FACYGRGESFFGRLNAD-----------SSPFLENFPGVPLAGMFCGGEM--VRPCTTMI 547
Query: 489 NQEEEDCSISGRCLLHHYSTVYLVMSYTIPPLLH 522
+ IS C LH YS+VYL+MSY P + H
Sbjct: 548 GLCPDAKPIS--CFLHVYSSVYLLMSYDPPSVDH 579
>gi|297797349|ref|XP_002866559.1| F-box/LRR-repeat protein At5g63520 [Arabidopsis lyrata subsp. lyrata]
gi|297312394|gb|EFH42818.1| F-box/LRR-repeat protein At5g63520 [Arabidopsis lyrata subsp.
lyrata]
Length = 526
Score = 316 bits (809), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 216/525 (41%), Positives = 294/525 (56%), Gaps = 46/525 (8%)
Query: 7 ALGNEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEV 66
A NEDLL +IL RLPA SFA A+CVN+SW+ VCN+ILS+PK+ SA S +P A EV
Sbjct: 32 AAMNEDLLHNILLRLPAKSFAFASCVNRSWSSVCNRILSRPKMISAFSRNPDQLRAGEEV 91
Query: 67 LDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEIC 126
LDKVLSEPIRPHF IA++ + T LIT R+GSR P+I + VTGI+G +A D+
Sbjct: 92 LDKVLSEPIRPHFVIANITC-GNMEETLTLITERVGSRVPIIVSVVTGILGKEACNDKAA 150
Query: 127 EVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHY 186
EVK D+ L + + I+L +GY+PG+KV+ IP++++K E E + DKF+MDIR+Y
Sbjct: 151 EVKQHSTSDDELFIVPN-FAILLTIGYLPGMKVDVIPVIQAKGESESDIGDKFVMDIRNY 209
Query: 187 SASISG-CSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQNY 245
+ +SG ++P +ILFG+ +P+L ++DY +P ETVIVGD FL K G S+N
Sbjct: 210 VSMVSGHAAAPACLILFGEDTHATEPVLHKLDYAMPAETVIVGDQIGEFLHKRGNESRNV 269
Query: 246 NGALYFFDAVA-LVFSRDS-DNSNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADC--S 301
+A L+F+RD + IQFD +S G+ KA +V C +
Sbjct: 270 QLPKDDCRVLAGLIFARDRLRPAQAERIQFDTAISRGMSSVDLRYKAANVNVSRPRCPST 329
Query: 302 LLTARMEGYDGLLHGEEILEDI----KEHIDDKYPYLYIGVIHQRG-SLQFGSR----SY 352
LLTA+ G +L GE+IL+DI + HI + PYL GVI +R S+ + S
Sbjct: 330 LLTAKRRGEAEVLDGEQILDDIDNILENHIWENDPYL--GVIKRRKYSIGLEEKPKIMSS 387
Query: 353 MSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTI 412
+ ++V G++DQ +V+G GIK GD F Y D A +S V R L
Sbjct: 388 LVFHQVNGSDDQDLLVDGAGIKTGDQFQVYLPDLKVAEASLKAVTSQHRNLK-------- 439
Query: 413 GRNVTNANK-EVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAG 471
+ ANK E+ GG F SR D D S PF NFPE G
Sbjct: 440 ----SKANKPEIVGGFAFVGNSRGDLFFGRPDAD-----------SSPFLENFPELRFGG 484
Query: 472 IFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYT 516
IFC EIGR + + + EE +S R LH YS+VYL++SYT
Sbjct: 485 IFCDSEIGR----SLFVEEGEEKKEVSIRRFLHVYSSVYLIVSYT 525
>gi|75262730|sp|Q9FMV0.1|FBL91_ARATH RecName: Full=F-box/LRR-repeat protein At5g63520
gi|9758293|dbj|BAB08817.1| unnamed protein product [Arabidopsis thaliana]
Length = 529
Score = 311 bits (796), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 208/530 (39%), Positives = 295/530 (55%), Gaps = 55/530 (10%)
Query: 7 ALGNEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEV 66
A NEDLL +IL RLPA SFA A+CVN+ W+ VCN+ILS+PK+ SA S +P A EV
Sbjct: 34 AAMNEDLLHNILLRLPAKSFAFASCVNRFWSSVCNRILSRPKMISAFSRNPDQLRAGEEV 93
Query: 67 LDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEIC 126
LDKVLSEPIRP F IA++ + T LIT R+GSR P+I + VTGI+G +A D+
Sbjct: 94 LDKVLSEPIRPQFVIANITC-GNMEETLTLITERVGSRVPIIVSVVTGILGKEACNDKAG 152
Query: 127 EVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHY 186
EV+ D+ L D + + I+L +GY+PG+KV+ IP++++K E M DKF+MDIR+Y
Sbjct: 153 EVRLHSTSDDELFDVAN-FAILLTIGYLPGMKVDIIPVIQAKGESGAEMEDKFVMDIRNY 211
Query: 187 SASISG-CSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQNY 245
+ +SG ++P +ILF + +P+L ++DY +P ETVIVG FL K G +N
Sbjct: 212 MSMVSGHAAAPACLILFAEDTHATEPVLHKLDYAMPAETVIVGGQIGEFLHKRGNEPRNV 271
Query: 246 NGALYFFDAVA-LVFSRDSDN-SNVPEIQFDITMSTGVLPFGPELKAVSVKEH---NADC 300
+A L+F+RD + IQFD +S G+ KA +V +
Sbjct: 272 QLQKDDIRVLAGLIFARDRHRPAQAERIQFDTAISNGMSSVDLRYKAANVNVSLGPSCPS 331
Query: 301 SLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY---LYIGVIHQRG-SLQFGSR----SY 352
+LLTA+ G +L G++IL+DI ++I + Y + Y+GVI +R S+ + S
Sbjct: 332 TLLTAKRRGEAEVLDGDQILDDI-DNILENYIWENDSYLGVIKRRKYSIGLEEKPKIMSS 390
Query: 353 MSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTI 412
+ ++V G++DQ +V+G GIK GD F Y D A ++ DV LR L
Sbjct: 391 LVFHQVNGSDDQDLLVDGAGIKTGDQFQVYLPDLKVAEAALNDVSAQLRNLK-------- 442
Query: 413 GRNVTNANK-EVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF-----ESYPFCRNFPE 466
+ NK EV GG F R D +F +S PF NFPE
Sbjct: 443 ----SKPNKPEVVGGFAFVGSCRG----------------DSFFGCPNADSSPFLENFPE 482
Query: 467 TPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYT 516
P GIFC GEIGR + ++ + EE +S + LH YS+VYL++SYT
Sbjct: 483 LPFGGIFCDGEIGR----SLILEEGEEKKEVSIQRFLHVYSSVYLIVSYT 528
>gi|79546803|ref|NP_201157.3| uncharacterized protein [Arabidopsis thaliana]
gi|332010379|gb|AED97762.1| uncharacterized protein [Arabidopsis thaliana]
Length = 519
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 208/530 (39%), Positives = 295/530 (55%), Gaps = 55/530 (10%)
Query: 7 ALGNEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEV 66
A NEDLL +IL RLPA SFA A+CVN+ W+ VCN+ILS+PK+ SA S +P A EV
Sbjct: 24 AAMNEDLLHNILLRLPAKSFAFASCVNRFWSSVCNRILSRPKMISAFSRNPDQLRAGEEV 83
Query: 67 LDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEIC 126
LDKVLSEPIRP F IA++ + T LIT R+GSR P+I + VTGI+G +A D+
Sbjct: 84 LDKVLSEPIRPQFVIANITC-GNMEETLTLITERVGSRVPIIVSVVTGILGKEACNDKAG 142
Query: 127 EVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHY 186
EV+ D+ L D + + I+L +GY+PG+KV+ IP++++K E M DKF+MDIR+Y
Sbjct: 143 EVRLHSTSDDELFDVAN-FAILLTIGYLPGMKVDIIPVIQAKGESGAEMEDKFVMDIRNY 201
Query: 187 SASISG-CSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQNY 245
+ +SG ++P +ILF + +P+L ++DY +P ETVIVG FL K G +N
Sbjct: 202 MSMVSGHAAAPACLILFAEDTHATEPVLHKLDYAMPAETVIVGGQIGEFLHKRGNEPRNV 261
Query: 246 NGALYFFDAVA-LVFSRDSDN-SNVPEIQFDITMSTGVLPFGPELKAVSVKEH---NADC 300
+A L+F+RD + IQFD +S G+ KA +V +
Sbjct: 262 QLQKDDIRVLAGLIFARDRHRPAQAERIQFDTAISNGMSSVDLRYKAANVNVSLGPSCPS 321
Query: 301 SLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY---LYIGVIHQRG-SLQFGSR----SY 352
+LLTA+ G +L G++IL+DI ++I + Y + Y+GVI +R S+ + S
Sbjct: 322 TLLTAKRRGEAEVLDGDQILDDI-DNILENYIWENDSYLGVIKRRKYSIGLEEKPKIMSS 380
Query: 353 MSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTI 412
+ ++V G++DQ +V+G GIK GD F Y D A ++ DV LR L
Sbjct: 381 LVFHQVNGSDDQDLLVDGAGIKTGDQFQVYLPDLKVAEAALNDVSAQLRNLK-------- 432
Query: 413 GRNVTNANK-EVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF-----ESYPFCRNFPE 466
+ NK EV GG F R D +F +S PF NFPE
Sbjct: 433 ----SKPNKPEVVGGFAFVGSCRG----------------DSFFGCPNADSSPFLENFPE 472
Query: 467 TPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYT 516
P GIFC GEIGR + ++ + EE +S + LH YS+VYL++SYT
Sbjct: 473 LPFGGIFCDGEIGR----SLILEEGEEKKEVSIQRFLHVYSSVYLIVSYT 518
>gi|26451740|dbj|BAC42965.1| unknown protein [Arabidopsis thaliana]
Length = 482
Score = 264 bits (675), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 177/442 (40%), Positives = 253/442 (57%), Gaps = 32/442 (7%)
Query: 7 ALGNEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLHVAVSEV 66
A NEDLL +IL RLPA SFA A+CVN+ W+ VCN+ILS+PK+ SA S +P A EV
Sbjct: 24 AAMNEDLLHNILLRLPAKSFAFASCVNRFWSSVCNRILSRPKMISAFSRNPDQLRAGEEV 83
Query: 67 LDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPVITNAVTGIIGLDAHLDEIC 126
LDKVLSEPIRP F IA++ + T LIT R+GSR P+I + VTGI+G +A D+
Sbjct: 84 LDKVLSEPIRPQFVIANITC-GNMEETLTLITERVGSRVPIIVSVVTGILGKEACNDKAG 142
Query: 127 EVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHY 186
EV+ D+ L D + + I+L +GY+PG+KV+ IP++++K E M DKF+MDIR+Y
Sbjct: 143 EVRLHSTSDDELFDVAN-FAILLTIGYLPGMKVDIIPVIQAKGESGAEMEDKFVMDIRNY 201
Query: 187 SASISG-CSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQNY 245
+ +SG ++P +ILF + +P+L ++DY +P ETVIVG FL K G +N
Sbjct: 202 MSMVSGHAAAPACLILFAEDTHATEPVLHKLDYAMPAETVIVGGQIGEFLHKRGNEPRNV 261
Query: 246 NGALYFFDAVA-LVFSRDSDN-SNVPEIQFDITMSTGVLPFGPELKAVSVKEH---NADC 300
+A L+F+RD + IQFD +S G+ KA +V +
Sbjct: 262 QLQKDDIRVLAGLIFARDRHRPAQAERIQFDTAISNGMSSVDLRYKAANVNVSLGPSCPS 321
Query: 301 SLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY---LYIGVIHQRG-SLQFGSR----SY 352
+LLTA+ G +L G++IL+DI ++I + Y + Y+GVI +R S+ + S
Sbjct: 322 TLLTAKRRGEAEVLDGDQILDDI-DNILENYIWENDSYLGVIKRRKYSIGLEEKPKIMSS 380
Query: 353 MSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTI 412
+ ++V G++DQ +V+G GIK GD F Y D A ++ DV LR L
Sbjct: 381 LVFHQVNGSDDQDLLVDGAGIKTGDQFQVYLPDLKVAEAALNDVSAQLRNLK-------- 432
Query: 413 GRNVTNANK-EVFGGLIF--SC 431
+ NK EV GG F SC
Sbjct: 433 ----SKPNKPEVVGGFAFVGSC 450
>gi|118484799|gb|ABK94267.1| unknown [Populus trichocarpa]
Length = 342
Score = 237 bits (605), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 146/356 (41%), Positives = 204/356 (57%), Gaps = 28/356 (7%)
Query: 181 MDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGE 240
M+IR Y+AS+SG +SP GIILFGD+ D KP++ ++D+ + +TVIVGD + FL+++G
Sbjct: 1 MNIRDYAASVSGWASPAGIILFGDEGADQKPVMEKLDHAMSRDTVIVGDERAQFLYRSGV 60
Query: 241 NSQN-YNGALYFFDAVALVFSRDSDN-SNVPEIQFDITMSTGVLPFGPELKAVSVKEHNA 298
S+N Y + YF AVALVF+RD D EIQF +S+GV GP KAVSV++ +
Sbjct: 61 ESRNDYGSSEYFPAAVALVFARDRDKPCGTGEIQFHAALSSGVSAIGPRYKAVSVRKIGS 120
Query: 299 D---CSLLTARMEGYDGLLHGEEILEDIKEHIDDK--YPYLYIGVIHQR----GSLQFGS 349
+ +LLTAR EG + G+ IL+DI + ++ P LYIGV QR GS +
Sbjct: 121 ETGCTTLLTARREGEQEIQDGQRILDDINNELVNQIGRPDLYIGVTEQRKCFIGSEKSRV 180
Query: 350 RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLN---AS 406
+++ + V+G + ++ +GVGI+ GD F FYHSD TA SS +V R L +S
Sbjct: 181 MTFLVFHGVMGGDQEYLFADGVGIRTGDYFQFYHSDPTTALSSCNEVSKNFRKLKLDWSS 240
Query: 407 SCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPE 466
C G + +KE+ GG +FSC R E + D S PF NFP
Sbjct: 241 RNCLQAGVSDNVCSKELVGGFVFSCCGRGESFFERCNVD-----------SSPFLDNFPG 289
Query: 467 TPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYTIPPLLH 522
P+AG+FC GEIGRG ++N +E + C LH YST YL++SYT P H
Sbjct: 290 VPMAGVFCRGEIGRG---FSVLNADEGPEERTLHCCLHVYSTAYLLVSYTPAPAEH 342
>gi|224122068|ref|XP_002318745.1| predicted protein [Populus trichocarpa]
gi|222859418|gb|EEE96965.1| predicted protein [Populus trichocarpa]
Length = 77
Score = 84.7 bits (208), Expect = 9e-14, Method: Composition-based stats.
Identities = 37/54 (68%), Positives = 49/54 (90%)
Query: 7 ALGNEDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKPKLASALSLSPSLH 60
+L +E+++Q+ILSRLPAL+FA AACVNK W K+C++IL +PKLASALSL+PSLH
Sbjct: 14 SLVDEEIVQNILSRLPALTFAYAACVNKRWYKICSKILKRPKLASALSLNPSLH 67
>gi|422301994|ref|ZP_16389358.1| Genome sequencing data, contig C308 [Microcystis aeruginosa PCC
9806]
gi|389788899|emb|CCI15183.1| Genome sequencing data, contig C308 [Microcystis aeruginosa PCC
9806]
Length = 417
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 115/482 (23%), Positives = 193/482 (40%), Gaps = 88/482 (18%)
Query: 51 SALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV--- 107
+ALS PSL AV+EV++KV + + +A + + S A+ + + + + PV
Sbjct: 9 NALSTRPSLEAAVTEVVEKV-QDKLVGSADLAIIFISSAYASDYPRLVPLILDKLPVPVL 67
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRS 167
I GI+G+ E + + + L V ++PG++V+ + +
Sbjct: 68 IGCGGAGIVGMG--------------EREKAREIEASPALSLTVAHLPGVEVQPF-YIEA 112
Query: 168 KEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIV 227
E P+ ++ + +P I+L + I +L +D+ P I
Sbjct: 113 AEMPDLDSSPSSWTEL----LGVEAAKNPQFILLADPFSSRINDLLEGLDFAYPGSAKIG 168
Query: 228 GDATSCFLFKTG-----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
G + + ++G + + N LY V + S + I + ++ G
Sbjct: 169 GLVSGGMIERSGGLFYHDQQKPRNSYLYRQGTVGIALSGN--------IIVETIVAQGCR 220
Query: 283 PFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGE-EILEDIKEHIDDK-----YPYLYI 336
P GP + VS E N S+ +G DG +L D+ + +K L+I
Sbjct: 221 PIGP-IYQVSEGERNIIISMTG---KGADGTSQPPLNLLRDLIASLSEKDRELAQHSLFI 276
Query: 337 GVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
G+ +Q + ++ + VLG + Q I G ++PG F+ D+DT++
Sbjct: 277 GIARDEFKMQLRAGDFL-IRNVLGVDPRQGAIAIGDRVRPGQRVQFHLRDADTSA----- 330
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
LD LL A + N++ EV G LIFSC R L E D F
Sbjct: 331 -LDLELLLQA------FPQEKLNSS-EVLGALIFSCLGRGENLYEKPD-----------F 371
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
+S F R F PLAG FC GEIG ++GR LH Y++ + +
Sbjct: 372 DSGLFQRYFANVPLAGFFCNGEIG----------------PVAGRTFLHGYTSAFALFRQ 415
Query: 516 TI 517
I
Sbjct: 416 GI 417
>gi|390438259|ref|ZP_10226743.1| Genome sequencing data, contig C308 [Microcystis sp. T1-4]
gi|389838325|emb|CCI30867.1| Genome sequencing data, contig C308 [Microcystis sp. T1-4]
Length = 417
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 114/482 (23%), Positives = 192/482 (39%), Gaps = 88/482 (18%)
Query: 51 SALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV--- 107
+ALS PSL AV+EV++KV + + +A + + S A+ + + + + PV
Sbjct: 9 NALSTRPSLEAAVTEVVEKV-QDKLVGSADLAIIFISSAYASDYPRLVPLILDKLPVPVL 67
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRS 167
I GI+G+ E + + + L V ++P ++V+ + +
Sbjct: 68 IGCGGAGIVGMG--------------EREKAREIEASPALSLTVAHLPDVEVQPF-YIEA 112
Query: 168 KEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIV 227
E P+ ++ + +P I+L + I +L +D+ P I
Sbjct: 113 AEMPDLDSSPSSWTEL----LGVEAAKNPQFILLADPFSSRINDLLEGLDFAYPGSAKIG 168
Query: 228 GDATSCFLFKTG-----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
G + + ++G + + N LY V + S + I + ++ G
Sbjct: 169 GLVSGGMIERSGGLFYHDQQKPRNSYLYRQGTVGIALSGN--------IIVETIVAQGCR 220
Query: 283 PFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGE-EILEDIKEHIDDK-----YPYLYI 336
P GP + VS E N S+ +G DG +L D+ + +K L+I
Sbjct: 221 PIGP-IYQVSEGERNIIISMTG---KGADGTPQPPLNLLRDLIASLSEKDRELAQHSLFI 276
Query: 337 GVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
G+ +Q + ++ + VLG + Q I G ++PG F+ D+DT++
Sbjct: 277 GIARDEFKMQLRAGDFL-IRNVLGVDPRQGAIAIGDRVRPGQRVQFHLRDADTSA----- 330
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
LD LL A R +++ EV G LIFSC R L E D F
Sbjct: 331 -LDLELLLQA------FPREKPDSS-EVLGALIFSCLGRGENLYEKPD-----------F 371
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
+S F R F PLAG FC GEIG ++GR LH Y++ + +
Sbjct: 372 DSGLFQRYFANVPLAGFFCNGEIG----------------PVAGRTFLHGYTSAFALFRQ 415
Query: 516 TI 517
I
Sbjct: 416 GI 417
>gi|443656594|ref|ZP_21131716.1| FIST C domain protein [Microcystis aeruginosa DIANCHI905]
gi|159028345|emb|CAO87243.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443333392|gb|ELS47955.1| FIST C domain protein [Microcystis aeruginosa DIANCHI905]
Length = 417
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 114/482 (23%), Positives = 192/482 (39%), Gaps = 88/482 (18%)
Query: 51 SALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV--- 107
+ALS PSL AV+EV++KV + + IA + + S A+ + + + + PV
Sbjct: 9 NALSTRPSLEAAVTEVVEKV-QDKLVGSADIAIIFISSAYASDYPRLVPLILDKLPVPVL 67
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRS 167
I GI+G+ + + + + L V ++P ++V+ + +
Sbjct: 68 IGCGGAGIVGMG--------------DREKAREIEASPALSLTVAHLPDVEVQPF-YIEA 112
Query: 168 KEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIV 227
E P+ ++ + +P I+L + I +L +D+ P I
Sbjct: 113 AEMPDLDSSPSSWTEL----LGVEAAKNPQFILLADPFSSRINDLLEGLDFAYPGSAKIG 168
Query: 228 GDATSCFLFKTG-----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
G + + ++G + + N LY V + S + I + ++ G
Sbjct: 169 GLVSGGMIERSGGLFYHDQQKPRNSYLYRQGTVGIALSGN--------IIVETIVAQGCR 220
Query: 283 PFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGE-EILEDIKEHIDDK-----YPYLYI 336
P GP + VS E N S+ +G DG +L D+ + +K L+I
Sbjct: 221 PIGP-IYQVSEGERNIIISMTG---KGADGTPQPPLNLLRDLIPSLREKDRELVQNSLFI 276
Query: 337 GVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
G+ +Q + ++ + VLG + Q I G ++PG F+ D+DT++
Sbjct: 277 GIARDEFKMQLRAGDFL-IRSVLGVDPRQGAIAIGDRVRPGQRVQFHLRDADTSA----- 330
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
LD LL A + N++ EV G LIFSC R L E D F
Sbjct: 331 -LDLELLLQA------FPQERPNSS-EVLGALIFSCLGRGENLYEKPD-----------F 371
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
+S F R F PLAG FC GEIG ++GR LH Y++ + +
Sbjct: 372 DSGLFQRYFANVPLAGFFCNGEIG----------------PVAGRTFLHGYTSAFALFRQ 415
Query: 516 TI 517
I
Sbjct: 416 GI 417
>gi|425434741|ref|ZP_18815205.1| Genome sequencing data, contig C308 [Microcystis aeruginosa PCC
9432]
gi|389675774|emb|CCH95162.1| Genome sequencing data, contig C308 [Microcystis aeruginosa PCC
9432]
Length = 417
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 112/482 (23%), Positives = 191/482 (39%), Gaps = 88/482 (18%)
Query: 51 SALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV--- 107
+ALS PSL AV+EV++KV + + +A + + S A+ + + + + PV
Sbjct: 9 NALSTRPSLEAAVTEVVEKV-QDKLVGSADLAIIFISSAYASDYPRLVPLILDKLPVPVL 67
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRS 167
I GI+G+ E + + + L V ++P ++++ + +
Sbjct: 68 IGCGGAGIVGMG--------------EREKAREIEASPALSLTVAHLPNVEIQPF-YIEA 112
Query: 168 KEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIV 227
E P+ ++ + +P I+L + I +L +D+ P I
Sbjct: 113 AEMPDLDSSPSSWTEL----LGVEAAKNPQFILLADPFSSRINDLLEGLDFAYPSSAKIG 168
Query: 228 GDATSCFLFKTG-----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
G + + ++G + + N LY V + S + I + ++ G
Sbjct: 169 GLVSGGMIERSGGLFYHDQQKPRNSYLYRQGTVGIALSGN--------IIVETIVAQGCR 220
Query: 283 PFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGE-EILEDIKEHIDDK-----YPYLYI 336
P GP + VS E N S+ +G DG +L D+ + +K L+I
Sbjct: 221 PIGP-IYQVSEGERNIIISMTG---KGADGTPQPPLNLLRDLIPSLREKDRELVQNSLFI 276
Query: 337 GVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
G+ +Q + ++ + VLG + Q I G ++PG F+ D+DT++
Sbjct: 277 GIARDEFKMQLRAGDFL-IRSVLGVDPRQGAIAIGDRVRPGQRVQFHLRDADTSA----- 330
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
LD LL A + N++ EV G LIFSC R L E D F
Sbjct: 331 -LDLELLLQA------FPQERPNSS-EVLGALIFSCLGRGENLYEKPD-----------F 371
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
+S F R F PLAG F GEIG ++GR LH Y++ + +
Sbjct: 372 DSGLFQRYFANVPLAGFFGNGEIG----------------PVAGRTFLHGYTSAFALFRQ 415
Query: 516 TI 517
I
Sbjct: 416 GI 417
>gi|425468793|ref|ZP_18847781.1| Genome sequencing data, contig C308 [Microcystis aeruginosa PCC
9701]
gi|389884556|emb|CCI35164.1| Genome sequencing data, contig C308 [Microcystis aeruginosa PCC
9701]
Length = 417
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 112/482 (23%), Positives = 192/482 (39%), Gaps = 88/482 (18%)
Query: 51 SALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV--- 107
+ALS PSL AV+EV++KV ++ + +A + + S A+ + + + + PV
Sbjct: 9 NALSTRPSLEAAVTEVVEKV-TDKLVGSADLAIIFISSAYASDYPRLVPLILDKLPVPVL 67
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRS 167
I GI+G+ + + + + L V ++P ++V+ + +
Sbjct: 68 IGCGGAGIVGMG--------------DREKAREIEASPALSLTVAHLPDVEVQPF-YIEA 112
Query: 168 KEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIV 227
E P+ ++ + +P I+L + I +L +D+ P I
Sbjct: 113 AEMPDLDSSPSSWTEL----LGVEAAKNPQFILLADPFSSRINDLLEGLDFAYPGSAKIG 168
Query: 228 GDATSCFLFKTG-----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
G + + ++G + + N LY V + S + I + ++ G
Sbjct: 169 GLVSGGMIERSGGLFYHDQQKPRNSYLYRQGTVGIALSGN--------IIVETIVAQGCR 220
Query: 283 PFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGE-EILEDIKEHIDDK-----YPYLYI 336
P GP + VS E N S+ +G DG +L D+ + +K L+I
Sbjct: 221 PIGP-IYQVSEGERNIIISMTG---KGADGTPQPPLNLLRDLIPSLREKDRELVQNSLFI 276
Query: 337 GVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
G+ +Q + ++ + VLG + Q I G ++PG F+ D+DT++
Sbjct: 277 GIARDEFKMQLRAGDFL-IRSVLGVDPRQGAIAIGDRVRPGQRVQFHLRDADTSA----- 330
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
LD LL A + N++ EV G LIFSC R L E D F
Sbjct: 331 -LDLELLLQA------FPQERPNSS-EVLGALIFSCLGRGENLYEKPD-----------F 371
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
+S F R F PLAG F GEIG ++GR LH Y++ + +
Sbjct: 372 DSGLFQRYFANVPLAGFFGNGEIG----------------PVAGRTFLHGYTSAFALFRQ 415
Query: 516 TI 517
I
Sbjct: 416 GI 417
>gi|440756839|ref|ZP_20936039.1| FIST C domain protein [Microcystis aeruginosa TAIHU98]
gi|440172868|gb|ELP52352.1| FIST C domain protein [Microcystis aeruginosa TAIHU98]
Length = 417
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 111/482 (23%), Positives = 191/482 (39%), Gaps = 88/482 (18%)
Query: 51 SALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV--- 107
+ALS PSL AV+EV++KV + + +A + + S A+ + + + + PV
Sbjct: 9 NALSTRPSLEAAVTEVVEKV-QDKLVGSADLAIIFISSAYASDYPRLVPLILDKLPVPVL 67
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRS 167
I GI+G+ + + + + L V ++P ++V+ + +
Sbjct: 68 IGCGGAGIVGMG--------------DREKAREIEASPALSLTVAHLPNVEVQPF-YIEA 112
Query: 168 KEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIV 227
E P+ ++ + +P I+L + I +L +D+ P I
Sbjct: 113 AEMPDLDSSPSSWTEL----LGVEAAKNPQFILLADPFSSRINDLLEGLDFAYPGSAKIG 168
Query: 228 GDATSCFLFKTG-----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
G + + ++G + + N LY V + S + I + ++ G
Sbjct: 169 GLVSGGMIERSGGLFYHDQQKPRNSYLYRQGTVGIALSGN--------IIVETIVAQGCR 220
Query: 283 PFGPELKAVSVKEHNADCSLLTARMEGYD----GLLHG--EEILEDIKEHIDDKYPYLYI 336
P GP + VS E N S+ + +G LL + E +E + + L+I
Sbjct: 221 PIGP-IYQVSEGERNIIISMTSKEADGTPQPPLNLLRDLIPSLREKDRELVQNS---LFI 276
Query: 337 GVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
G+ +Q + ++ + VLG + Q I G ++PG F+ D+DT++
Sbjct: 277 GIARDEFKMQLRAGDFL-IRSVLGVDPRQGAIAIGDRVRPGQRVQFHLRDADTSA----- 330
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
LD LL A + N++ EV G LIFSC R L E D F
Sbjct: 331 -LDLELLLQA------FPQERPNSS-EVLGALIFSCLGRGENLYEKPD-----------F 371
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
+S F R F PLAG F GEIG ++GR LH Y++ + +
Sbjct: 372 DSGLFQRYFANVPLAGFFGNGEIG----------------PVAGRTFLHGYTSAFALFRQ 415
Query: 516 TI 517
I
Sbjct: 416 GI 417
>gi|425460981|ref|ZP_18840461.1| Genome sequencing data, contig C308 [Microcystis aeruginosa PCC
9808]
gi|389826235|emb|CCI23414.1| Genome sequencing data, contig C308 [Microcystis aeruginosa PCC
9808]
Length = 417
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 112/482 (23%), Positives = 191/482 (39%), Gaps = 88/482 (18%)
Query: 51 SALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV--- 107
+ALS PSL AV+EV++KV + + +A + + S A+ + + + + PV
Sbjct: 9 NALSTRPSLEAAVTEVVEKV-QDKLVGSADLAIIFISSAYASDYPRLVPLILDKLPVPVL 67
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRS 167
I GI+G+ + + + + L V ++P ++V+ + +
Sbjct: 68 IGCGGAGIVGMG--------------DREKAREIEASPALSLTVAHLPDVEVQPF-YIEA 112
Query: 168 KEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIV 227
E P+ ++ + +P I+L + I +L +D+ P I
Sbjct: 113 AEMPDLDSSPSSWTEL----LGVEAAKNPQFILLADPFSSRINDLLEGLDFAYPGSAKIG 168
Query: 228 GDATSCFLFKTG-----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
G + + ++G + + N LY V + S + I + ++ G
Sbjct: 169 GLVSGGMIERSGGLFYHDQQKPRNSYLYRQGTVGIALSGN--------IIVETIVAQGCR 220
Query: 283 PFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGE-EILEDIKEHIDDK-----YPYLYI 336
P GP + VS E N S+ +G DG +L D+ + +K L+I
Sbjct: 221 PIGP-IYQVSEGERNIIISMTG---KGADGTPQPPLNLLRDLIPSLREKDRELVQNSLFI 276
Query: 337 GVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
G+ +Q + ++ + VLG + Q I G ++PG F+ D+DT++
Sbjct: 277 GIARDEFKMQLRAGDFL-IRSVLGVDPRQGAIAIGDRVRPGQRVQFHLRDADTSA----- 330
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
LD LL A + N++ EV G LIFSC R L E D F
Sbjct: 331 -LDLELLLQA------FPQERPNSS-EVLGALIFSCLGRGENLYEKPD-----------F 371
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
+S F R F PLAG F GEIG ++GR LH Y++ + +
Sbjct: 372 DSGLFQRYFANVPLAGFFGNGEIG----------------PVAGRTFLHGYTSAFALFRQ 415
Query: 516 TI 517
I
Sbjct: 416 GI 417
>gi|425451214|ref|ZP_18831036.1| Genome sequencing data, contig C308 [Microcystis aeruginosa PCC
7941]
gi|389767595|emb|CCI07053.1| Genome sequencing data, contig C308 [Microcystis aeruginosa PCC
7941]
Length = 417
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 112/482 (23%), Positives = 190/482 (39%), Gaps = 88/482 (18%)
Query: 51 SALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV--- 107
+ALS PSL AV+EV++KV + + +A + + S A+ + + + + PV
Sbjct: 9 NALSTRPSLEAAVTEVVEKV-QDKLVGSADLAIIFISSAYASDYPRLVPLILDKLPVPVL 67
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRS 167
I GI+G+ + + + + L V ++P ++V+ + +
Sbjct: 68 IGCGGAGIVGMG--------------DREKAREIEASPALSLTVAHLPNVEVQPF-YIEA 112
Query: 168 KEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIV 227
E P+ ++ + +P I+L + I +L +D+ P I
Sbjct: 113 AEMPDLDSSPSSWTEL----LGVEAAKNPQFILLADPFSSRINDLLEGLDFAYPGSAKIG 168
Query: 228 GDATSCFLFKTG-----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
G + + ++G + + N LY V + S + I + ++ G
Sbjct: 169 GLVSGGMIERSGGLFYHDQQKPRNSYLYRQGTVGIALSGN--------IIVETIVAQGCR 220
Query: 283 PFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGE-EILEDIKEHIDDK-----YPYLYI 336
P GP + VS E N S+ +G DG +L D+ + +K L+I
Sbjct: 221 PIGP-IYQVSEGERNIIISMTG---KGADGTPQPPLNLLRDLIPSLREKDRELVQNSLFI 276
Query: 337 GVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
G+ +Q ++ + VLG + Q I G ++PG F+ D+DT++
Sbjct: 277 GIARDEFKMQLRPGDFL-IRSVLGVDPRQGAIAIGDRVRPGQRVQFHLRDADTSA----- 330
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
LD LL A + N++ EV G LIFSC R L E D F
Sbjct: 331 -LDLELLLQA------FPQERPNSS-EVLGALIFSCLGRGENLYEKPD-----------F 371
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
+S F R F PLAG F GEIG ++GR LH Y++ + +
Sbjct: 372 DSGLFQRYFANVPLAGFFGNGEIG----------------PVAGRTFLHGYTSAFALFRQ 415
Query: 516 TI 517
I
Sbjct: 416 GI 417
>gi|425463916|ref|ZP_18843246.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389828576|emb|CCI30095.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 417
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 108/482 (22%), Positives = 188/482 (39%), Gaps = 88/482 (18%)
Query: 51 SALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV--- 107
+ALS PSL AV+EV++KV + + +A + + S A+ + + + + PV
Sbjct: 9 NALSTRPSLEAAVTEVVEKV-QDKLVGSADLAIIFISSAYASDYPRLVPLILDKLPVPVL 67
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRS 167
I GI+G+D + + + + L V ++P ++V+ + +
Sbjct: 68 IGCGGAGIVGMD--------------DREKAREIEASPALSLTVAHLPNVEVQPF-YIEA 112
Query: 168 KEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIV 227
E P+ ++ + +P I+L + I +L +D+ P I
Sbjct: 113 AEMPDLDSSPSSWTEL----LGVEAAKNPQFILLADPFSSRINDLLEGLDFAYPSSAKIG 168
Query: 228 GDATSCFLFKTG-----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
G + + ++G + + N LY V + S + I + ++ G
Sbjct: 169 GLVSGGMIERSGGLFYHDQQKPRNTYLYRQGTVGIALSGN--------IIVETIVAQGCR 220
Query: 283 PFGPELKAVSVKEHNADCSLLTARMEGYDGLLHG-----EEILEDIKEHIDDKYPY-LYI 336
P GP + VS E N S+ +G DG ++ ++E + + L+I
Sbjct: 221 PIGP-IYQVSEGERNIIISMTG---KGADGTPQPPLNLLRALIPSLREKDRELAQHSLFI 276
Query: 337 GVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
G+ +Q ++ + VLG + Q I G ++PG F+ D++T++
Sbjct: 277 GIARDEFKMQLRPGDFL-IRNVLGVDPRQGAIAIGDRVRPGQRVQFHLRDAETSA----- 330
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
LD LL A A+ ++ G LIFSC R L E D F
Sbjct: 331 -LDLELLLQAFP-------QEKPASSDILGALIFSCLGRGENLYEKPD-----------F 371
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
+S F R F PLAG F GEIG + GR LH Y++ + +
Sbjct: 372 DSGLFQRYFANVPLAGFFGNGEIG----------------PVGGRTFLHGYTSAFALFRQ 415
Query: 516 TI 517
I
Sbjct: 416 GI 417
>gi|166366981|ref|YP_001659254.1| hypothetical protein MAE_42400 [Microcystis aeruginosa NIES-843]
gi|166089354|dbj|BAG04062.1| hypothetical protein MAE_42400 [Microcystis aeruginosa NIES-843]
Length = 417
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 107/482 (22%), Positives = 189/482 (39%), Gaps = 88/482 (18%)
Query: 51 SALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSR--TPVI 108
+ALS PSL AV+EV++KV + + +A + + S A+ + + + + PV+
Sbjct: 9 NALSTRPSLEAAVTEVVEKV-QDKLVGSADLAIIFISSAYASDYPRLVPLILDKLSVPVL 67
Query: 109 TN-AVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRS 167
GI+G+D + + + + L V ++P ++V+ + +
Sbjct: 68 IGCGGAGIVGMD--------------DREKAREIEASPALSLTVAHLPNVEVQPF-YIEA 112
Query: 168 KEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIV 227
E P+ ++ + +P I+L + I +L +D+ P I
Sbjct: 113 AEMPDLDSSPSSWTEL----LGVEAAKNPQFILLADPFSSRINDLLEGLDFAYPSSAKIG 168
Query: 228 GDATSCFLFKTG-----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
G + + ++G + + N LY V + S + I + ++ G
Sbjct: 169 GLVSGGMIERSGGLFYHDQQKPRNTYLYRQGTVGIALSGN--------IIVETIVAQGCR 220
Query: 283 PFGPELKAVSVKEHNADCSLLTARMEGYDGLLHG-----EEILEDIKEHIDDKYPY-LYI 336
P GP + VS E N S+ +G DG ++ ++E + + L+I
Sbjct: 221 PIGP-IYQVSEGERNIIISMTG---KGADGTPQPPLNLLRALIPSLREKDRELAQHSLFI 276
Query: 337 GVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
G+ +Q + ++ + VLG + Q I G ++PG F+ D++T++
Sbjct: 277 GIARDEFKMQLRAGDFL-IRNVLGVDPRQGAIAIGDRVRPGQRVQFHLRDAETSA----- 330
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
LD LL A A+ ++ G LIFSC R L E D F
Sbjct: 331 -LDLELLLQAFP-------QEKPASSDILGALIFSCLGRGENLYEKPD-----------F 371
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
+S F R F PLAG F GEIG + GR LH Y++ + +
Sbjct: 372 DSGLFQRYFANVPLAGFFGNGEIG----------------PVGGRTFLHGYTSAFALFRQ 415
Query: 516 TI 517
I
Sbjct: 416 GI 417
>gi|427418383|ref|ZP_18908566.1| hypothetical protein Lepto7375DRAFT_4144 [Leptolyngbya sp. PCC
7375]
gi|425761096|gb|EKV01949.1| hypothetical protein Lepto7375DRAFT_4144 [Leptolyngbya sp. PCC
7375]
Length = 409
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 110/459 (23%), Positives = 174/459 (37%), Gaps = 101/459 (22%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSE-PIRPHFAI--ASVGMQSKLAATHQLITARLGSR 104
K ASA+S PSL +A+ EV+++VL++ + P+ AI S S+ + L+ LG
Sbjct: 2 KWASAVSTHPSLELALREVIERVLTQLEMAPNLAIIFISSAFASEYSRVLPLLKGPLGG- 60
Query: 105 TPVITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL 164
++ + G+IG ++ E+ EV+ T GI L V Y+P + ++
Sbjct: 61 AHIVGCSGGGVIG-RSNTGELIEVEET-------------AGISLTVAYLPDVNIQGFH- 105
Query: 165 LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEET 224
L E P+ +I +S P+ +++ + +L +D+ PE
Sbjct: 106 LSIDELPDLDSPPSEWTEI----IGVSPAEKPHFLLMADPFASGMNDLLQGLDFAYPESV 161
Query: 225 VIVGDA------TSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMS 278
+ G A SC LF G+ LY V + S + I D ++
Sbjct: 162 KVGGLAGIESISRSCGLF-CGQQ-------LYRQGVVGVALSGN--------IVIDAIVA 205
Query: 279 TGVLPFGPELKAVSVKEHNADCSLLTARMEGYDG---LLHGEEILEDIKEHIDDKYPY-- 333
G P GP + V + A + D L +E+ +D+ E D +
Sbjct: 206 QGCRPIGPTFRVVEGDRNVVTKVAAQASQDEADTQTPLEALQELFQDLDE-TDRQLAQES 264
Query: 334 LYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQFFIVNGVG-------------IKPGDSFI 380
L+IG+ S + LG D F I N VG I+PG
Sbjct: 265 LFIGLAQS------------SFKQALGQGD-FLIRNLVGVDPKVGAIAIADRIRPGQRIQ 311
Query: 381 FYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSE 440
F+ D+ TA + +L RL N + G L+F+C R L +
Sbjct: 312 FHLRDAHTAKDDLVALLKTYRLDNQERSAPS-------------GALLFACNGRGTSLFD 358
Query: 441 DDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIG 479
+ D + F R PL G FC GEIG
Sbjct: 359 TPNCDTKQ-----------FSRQLGPVPLGGFFCNGEIG 386
>gi|428214117|ref|YP_007087261.1| hypothetical protein Oscil6304_3783 [Oscillatoria acuminata PCC
6304]
gi|428002498|gb|AFY83341.1| hypothetical protein Oscil6304_3783 [Oscillatoria acuminata PCC
6304]
Length = 407
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 114/478 (23%), Positives = 195/478 (40%), Gaps = 93/478 (19%)
Query: 48 KLASALSLSPSLHVAVSEVLDK---VLSEPIRPHFAIASVGMQSKLAATHQLITARLGSR 104
K A+ALS PSL AV EV+++ +LS P F S S+ + L+ +LG
Sbjct: 2 KWANALSTRPSLEGAVLEVVERSQQLLSAPADLGFVFISSAFASEYSRLMPLLQEQLG-- 59
Query: 105 TPVITN-AVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIP 163
PV+ + +GIIG+D+ L L E+ + L + +PG++VE
Sbjct: 60 VPVLIGCSGSGIIGMDSQL-----AARELEEEQ--------PALSLTLACLPGVQVEAFH 106
Query: 164 LLRSKEEPEF-SMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPE 222
L +E P+ S D ++ I + S P+ +IL I +L +D+ P
Sbjct: 107 L-DGEELPDLDSPPDAWVEAI-----GVPASSHPHFVILADPFTSKINDLLQGLDFAYPG 160
Query: 223 ETVIVGDATSCFLFKTGENSQNYNG-ALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGV 281
+ G A+ + G ++G LY AV + S + I + ++ G
Sbjct: 161 SAKVGGLASGGAM---GNTIGLFSGDRLYREGAVGVALSGN--------IVLETIVAQGC 209
Query: 282 LPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIK---EHIDDK-----YPY 333
P G +VKE + ++L A +G G + LE +K EH+ +
Sbjct: 210 KPIG---HPFTVKE--CERNILLALDDGPCGSGVSQRPLEALKTVIEHLSESDRQLAQHS 264
Query: 334 LYIGVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSS 392
L+IG+ + ++ + VLG + + I G ++PG F+ D+ T++
Sbjct: 265 LFIGIARNEFKDELEQGDFL-IRNVLGVDPREGAIAIGDRLRPGQRIQFHLRDAQTSA-- 321
Query: 393 SIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDND 452
+ L +L ++ +A+ G L+F+C R L + D
Sbjct: 322 -----EDLEMLLQRY------QSRFSASNSSIGALMFACLGRGEQLYDQPD--------- 361
Query: 453 VYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
F+S+ F + P++G FC GEIG + G LH Y++V+
Sbjct: 362 --FDSHLFRQYVGNIPVSGFFCNGEIG----------------PVGGGTFLHGYTSVF 401
>gi|126659056|ref|ZP_01730197.1| hypothetical protein CY0110_28949 [Cyanothece sp. CCY0110]
gi|126619713|gb|EAZ90441.1| hypothetical protein CY0110_28949 [Cyanothece sp. CCY0110]
Length = 405
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 116/478 (24%), Positives = 184/478 (38%), Gaps = 94/478 (19%)
Query: 50 ASALSLSPSLHVAVSEVLDKV---LSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTP 106
+ALS PSL AV+EV + LS P S S LI +L S
Sbjct: 4 TNALSTRPSLEGAVTEVTQTIGNTLSSPADVGIFFISSAYASDYPRLIPLILEKL-SLPI 62
Query: 107 VITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLR 166
+I GI G + + EICE+ E+N L+VG +P +++ P +
Sbjct: 63 LIGCGGAGITGRNEN-KEICEI-----ENN--------PAFSLMVGCLPDVQIN--PFIL 106
Query: 167 SKEE-PEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETV 225
S E P+ + + +S P+ I+L + I +LA +D+ PE
Sbjct: 107 SPESLPDLDSSPETWQRL----MGVSPQEKPHFILLSDPFSTQINDLLAGLDFAYPEAVK 162
Query: 226 IVGDATSCFLFKTGENSQNYNG----ALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGV 281
+ G ++S + G + NG Y V + S + I D ++ G
Sbjct: 163 VGGLSSSSLMGVPGTVFYHQNGDPQSGFYHQGTVGVGLSGN--------IAIDSIVAQGC 214
Query: 282 LPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY-LYIGVIH 340
P G + V E N L + E L E++E+++E + L++GV+
Sbjct: 215 RPIGQAYQVVK-GERNVILEL-SQNGETRSPLDWLRELMENLEESDRQLAQHSLFVGVVR 272
Query: 341 Q--RGSLQFGS---RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
+ LQ G R+ + + LGA I G ++PG F+ D+ T++ D
Sbjct: 273 DEFKQELQPGDFLIRNILGVDPRLGA-----IAIGDRVRPGQRLQFHLRDAQTSAD---D 324
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
+ L+ N S+ + G IFSC R L + + F
Sbjct: 325 LETLLKQYNQST--------------PIQGAFIFSCLGRGQTLYQMPN-----------F 359
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVM 513
+S F FP L G FC GEIG + QE LH Y++V+ ++
Sbjct: 360 DSQLFTNYFPGVSLGGFFCNGEIGP-------VGQE---------TFLHGYTSVFALV 401
>gi|172038301|ref|YP_001804802.1| hypothetical protein cce_3388 [Cyanothece sp. ATCC 51142]
gi|354554350|ref|ZP_08973655.1| protein of unknown function DUF1745 [Cyanothece sp. ATCC 51472]
gi|171699755|gb|ACB52736.1| unknown [Cyanothece sp. ATCC 51142]
gi|353554029|gb|EHC23420.1| protein of unknown function DUF1745 [Cyanothece sp. ATCC 51472]
Length = 406
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 111/478 (23%), Positives = 173/478 (36%), Gaps = 94/478 (19%)
Query: 50 ASALSLSPSLHVAVSEVLDKV---LSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTP 106
+ALS PSL AV+EV+ + LS P S S LI +L P
Sbjct: 4 TNALSTRPSLETAVTEVIQTIENSLSTPADVGIFFISSAYASDYPRLIPLILEKL--PLP 61
Query: 107 VITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLR 166
+ I EI E+ E+N L+VG +P +K+ P
Sbjct: 62 RLIGCGGAGIIGQHSGHEISEI-----ENN--------SAFSLMVGCLPNVKIN--PFFL 106
Query: 167 SKEE-PEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETV 225
S E P+ + D +S P+ I+L + I +LA +D+ PE
Sbjct: 107 SPESLPDLDSSPETWQDF----MGVSPQEKPHFILLSDPFSTRINDLLAGLDFAYPESVK 162
Query: 226 IVGDATSCFLFKTG----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGV 281
+ G ++S + G S ++ Y V L S D I D ++ G
Sbjct: 163 VGGLSSSSLMGVPGTVFYHQSGDHQSGYYHQGTVGLALSGD--------ITIDTIVAQGC 214
Query: 282 LPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY-LYIGVIH 340
P G + V E N L + E L E++E + E + L++GV+
Sbjct: 215 RPIGQPYQVVK-GERNIILEL-SQNGETRSPLDCLRELMESLDESDRQLAQHSLFVGVVR 272
Query: 341 Q--RGSLQFGS---RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
+ +LQ G R+ + + LGA I G ++PG F+ D+ T++
Sbjct: 273 DEFKQNLQSGDFLIRNLLGIDPKLGA-----IAVGDRVRPGQRLQFHLRDAQTSADDLET 327
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
+L R + + G L+FSC R L + + F
Sbjct: 328 LLKAYR-----------------QSTSIQGALMFSCLGRGQTLYQMPN-----------F 359
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVM 513
+S F F PL G FC GEIG + LH Y++V+ ++
Sbjct: 360 DSQLFANYFSGVPLGGFFCNGEIG----------------PVGRETFLHGYTSVFALV 401
>gi|428778473|ref|YP_007170259.1| hypothetical protein Dacsa_0083 [Dactylococcopsis salina PCC 8305]
gi|428692752|gb|AFZ48902.1| hypothetical protein Dacsa_0083 [Dactylococcopsis salina PCC 8305]
Length = 418
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/395 (21%), Positives = 153/395 (38%), Gaps = 71/395 (17%)
Query: 134 EDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHYSASISGC 193
E+N + + + L V ++PG+ V+ + ++E P+ K ++ +S
Sbjct: 80 EENQAQEIEGNPALSLTVAHLPGVNVQGFHI-SAEEIPDLDSGAKAWTNL----TGVSPE 134
Query: 194 SSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFL-FKTGENSQN--YNGALY 250
P+ I+L + +L +D+ P+ + G A++ + +TG QN N L
Sbjct: 135 QEPDFILLADPFFSKVNDLLEGLDFAYPQGKKVGGLASAMAMGMQTGLFYQNGTENTELL 194
Query: 251 FFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCSLLTARMEGY 310
V + S + I+ D ++ G P GP+ + ++ E N L M G
Sbjct: 195 RGGMVGVALSGN--------IKVDTIVAQGCRPVGPQFQ-ITKGERNV---LAEVAMVGE 242
Query: 311 DGLLHGEEILEDIKEHIDDKYP--------YLYIGVIHQRGSLQFGSRSYMSLYEVLGAE 362
+G G+ L+ ++E +++ P L++G+ L+ ++ + +LG +
Sbjct: 243 NGAEAGKPPLQALRELMNELSPDDQQLAQDSLFLGIARSEFKLELQEGDFL-IRNLLGID 301
Query: 363 DQF-FIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANK 421
+ I G ++PG F+ D +T S+ ++VL N
Sbjct: 302 PKVGAIAVGDKLRPGQRIQFHLRDGNT-SAEDLEVL-------------LEKYQREEKNN 347
Query: 422 EVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRG 481
G L+FSC R L + F+S F R E PL G FC GEIG
Sbjct: 348 SAVGALLFSCLGRGKELYGKPN-----------FDSELFRRYVGEIPLGGFFCNGEIG-- 394
Query: 482 RGLTRLINQEEEDCSISGRCLLHHYSTVYLVMSYT 516
+ LH Y++ + + T
Sbjct: 395 --------------PVGSETFLHGYTSSFAIFRPT 415
>gi|298490695|ref|YP_003720872.1| hypothetical protein Aazo_1561 ['Nostoc azollae' 0708]
gi|298232613|gb|ADI63749.1| domain of unknown function DUF1745 ['Nostoc azollae' 0708]
Length = 404
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 102/475 (21%), Positives = 180/475 (37%), Gaps = 88/475 (18%)
Query: 46 KPKLASALSLSPSLHVAVSEVLDKVLSEPIRPH---FAIASVGMQSKLAATHQLITARLG 102
K + A+ALS SL AV++V+ + +S P S S+ + L+T +L
Sbjct: 4 KMQWANALSTHHSLETAVTDVVQQAVSSLTAPADLGLVFISSAFTSEYSRLLPLLTEKL- 62
Query: 103 SRTPVITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETI 162
S +I + G++G + N + + I L + ++PG+ +
Sbjct: 63 SVPMLIGCSAAGVVGTKS--------------GNKTQEIESEPAISLTLAHLPGVDIRAF 108
Query: 163 PLLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPE 222
+L + P+ +D+ + S+P I+L + +L +D+ P
Sbjct: 109 HIL-GDQLPDLDCSPDAWIDL----VGVLPSSAPQFILLSSAFSSGTNDLLQGLDFAYPS 163
Query: 223 ETVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
++ G A+ F+ + + N LY V L S D I + ++ G
Sbjct: 164 SVIVGGQASGGFV--SDRIALFCNDRLYRQGTVGLALSGD--------IVLETIVAQGCR 213
Query: 283 PFGPELKAVSVKEHNA----DCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPYLYIGV 338
P G EL V+ E N D + + L EE + + +H L++G+
Sbjct: 214 PIG-ELLQVTKAERNIILELDEQVPLVVLRNLISSLSEEEKM--LTQH------SLFVGL 264
Query: 339 IHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSSSIDVL 397
L ++ + +LG + I G ++PG F+ D+ AS+ ++++
Sbjct: 265 AMNEFQLSLKQGDFL-IRNLLGVDPSAGAIAIGDRVRPGQRLQFHLRDAQ-ASAEDLELI 322
Query: 398 DGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFES 457
L+ S G+ L+FSC R L F+S
Sbjct: 323 --LQEYQEQSTSGS----------SPLAALMFSCVGRGAGLY-----------GKANFDS 359
Query: 458 YPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLV 512
F R F + P+ G FC GEIG +SGR LH Y++V+ +
Sbjct: 360 ELFKRYFHDIPMGGYFCAGEIG----------------PVSGRTFLHGYTSVFAI 398
>gi|376007333|ref|ZP_09784531.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375324293|emb|CCE20284.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 413
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 103/453 (22%), Positives = 177/453 (39%), Gaps = 83/453 (18%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV 107
+ SA+S PSL AV+EV+ K E ++ + V + S ++ + + L + V
Sbjct: 2 EWVSAISTRPSLEAAVTEVV-KQCQESLKSSPDLGLVFISSAFSSDYPRLMPLLAEQLSV 60
Query: 108 ---ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL 164
I GI+G+ + + + + + L + ++PG V+ P
Sbjct: 61 RVLIGCTGGGIVGMQT--------------ETQVKEIEGKPALCLCLAHLPG--VDICPF 104
Query: 165 -LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEE 223
L + + P+ + +++ + + P I+L I +L +DY PE
Sbjct: 105 HLNTNDLPDLDNPPEDWVEL----IGVHPQNDPQFILLIDPFYGKINDLLQGLDYAYPES 160
Query: 224 TVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLP 283
+ G A+S + + + NG +Y +AV + + + I + ++ G P
Sbjct: 161 PKVGGLASSGMMGRA--TAVFCNGEMYSAEAVGVALTGN--------IVLETIVAQGCRP 210
Query: 284 FGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEI----LEDIKEHI-----DDK---Y 331
G + V E + L + DGL + LE ++E I DD+
Sbjct: 211 IG---EPYRVSEGERNIILTVEKCSETDGLNINDRREVAPLEALQELIAELGEDDRKLAQ 267
Query: 332 PYLYIGVIHQ--RGSLQFGS---RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDS 386
L++GV + L+ G R+ M + +GA + G ++PG F+ DS
Sbjct: 268 NSLFVGVARDEFKAKLESGDFLIRNLMGVDPRVGA-----MAIGDRVRPGQRIQFHLRDS 322
Query: 387 DTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDD 446
T++ +L G + L T A G LIFSC R E+ G
Sbjct: 323 RTSAEDLKGLLSGHQKLTE-----------TTAPVATEGALIFSCLGRG----ENLYGQP 367
Query: 447 DEDDNDVYFESYPFCRNFPETPLAGIFCYGEIG 479
D D N R + + PL G FC GEIG
Sbjct: 368 DFDSN--------LFREYFKIPLTGFFCNGEIG 392
>gi|209524312|ref|ZP_03272861.1| protein of unknown function DUF1745 [Arthrospira maxima CS-328]
gi|423063414|ref|ZP_17052204.1| hypothetical protein SPLC1_S100340 [Arthrospira platensis C1]
gi|209495103|gb|EDZ95409.1| protein of unknown function DUF1745 [Arthrospira maxima CS-328]
gi|406714846|gb|EKD10004.1| hypothetical protein SPLC1_S100340 [Arthrospira platensis C1]
Length = 413
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 107/454 (23%), Positives = 180/454 (39%), Gaps = 85/454 (18%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV 107
+ SA+S PSL AV+EV+ K E ++ + V + S ++ + + L + V
Sbjct: 2 EWVSAISTRPSLEAAVTEVV-KQCQESLKSSPDLGLVFISSAFSSDYPRLMPLLAEQLSV 60
Query: 108 ---ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL 164
I GI+G+ + + + + + L + ++PG V+ P
Sbjct: 61 RVLIGCTGGGIVGMQT--------------ETQVKEIEGKPALGLCLAHLPG--VDICPF 104
Query: 165 -LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEE 223
L + + P+ + +++ + + P I+L I +L +DY PE
Sbjct: 105 HLNTNDLPDLDNPPEDWVEL----IGVHPQNDPQFILLIDPFYGKINDLLQGLDYAYPES 160
Query: 224 TVIVGDATSCFLFKTGENSQNY-NGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
+ G A+S + G + + NG +Y +AV + + + I + ++ G
Sbjct: 161 PKVGGLASSGMM---GWATAVFCNGEMYSAEAVGVALTGN--------IVLETIVAQGCR 209
Query: 283 PFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEI----LEDIKEHI-----DDK--- 330
P G L+ VS E N L + DGL + LE ++E I DD+
Sbjct: 210 PIGEPLR-VSQGERN--IILTVEKCSETDGLNINDRREVAPLEALQELIAELGEDDRKLA 266
Query: 331 YPYLYIGVIHQ--RGSLQFGS---RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSD 385
L++GV + L+ G R+ M + +GA + G ++PG F+ D
Sbjct: 267 QNSLFVGVARDEFKAKLESGDFLIRNLMGVDPRVGA-----MAIGDRVRPGQRIQFHLRD 321
Query: 386 SDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGD 445
S T++ +L G + L T A G LIFSC R E+ G
Sbjct: 322 SRTSAEDLKGLLSGHQKLTE-----------TTAPVATEGALIFSCLGRG----ENLYGQ 366
Query: 446 DDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIG 479
D D N R + + PL G FC GEIG
Sbjct: 367 PDFDSN--------LFREYFKIPLTGFFCNGEIG 392
>gi|67921944|ref|ZP_00515460.1| similar to Uncharacterized protein conserved in bacteria
[Crocosphaera watsonii WH 8501]
gi|67856160|gb|EAM51403.1| similar to Uncharacterized protein conserved in bacteria
[Crocosphaera watsonii WH 8501]
Length = 406
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 99/453 (21%), Positives = 165/453 (36%), Gaps = 96/453 (21%)
Query: 50 ASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGM---QSKLAATHQLITARLGSRTP 106
+ALS PSL AV+EV +LS P A VG+ S A+ + + + + P
Sbjct: 4 TNALSTQPSLETAVTEVSQTILSRLSSP----ADVGIFWISSAYASDYSRLIPLILEKFP 59
Query: 107 VITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLR 166
L + E+ ++ ++ + L VG +P +K+ L
Sbjct: 60 -----------LPILIGCGGAGIIGQNENKESSEIENNPALSLTVGCLPNVKINPF-FLN 107
Query: 167 SKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVI 226
+ P+ + ++ +S PN I+L + I +LA +D+ P+ +
Sbjct: 108 PESLPDLDSSPEIWQEL----MGVSPQEQPNFILLSDPFSTPINDLLAGLDFAYPQSVKV 163
Query: 227 VGDATSCFLFKTG----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
G ++S + G + + Y V L S + I D ++ G
Sbjct: 164 GGLSSSNLMGVPGTVFYHQADDPQSGFYHQGTVGLALSGN--------IAIDTIVAQGCR 215
Query: 283 PFGPELKAVSVK-----------EHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKY 331
P G + V + E + L MEG + + + +H
Sbjct: 216 PIGEPYQVVKGQRNIILELSQDGETKSPLEWLRQLMEGLN------DSDRSLAQH----- 264
Query: 332 PYLYIGVIHQ--RGSLQFGS---RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDS 386
L++GV+ + LQ G R+++ + LGA I G ++PG F+ D+
Sbjct: 265 -SLFVGVVRDEFKQELQPGDFLIRNFLGVDPQLGA-----ISIGDRVRPGQRLQFHLRDA 318
Query: 387 DTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDD 446
T S D LD L +N + G L+FSC R L + +
Sbjct: 319 QT----SADDLDTLL-------------KQSNPPTPIQGALMFSCLGRGQTLYQIPN--- 358
Query: 447 DEDDNDVYFESYPFCRNFPETPLAGIFCYGEIG 479
F+S FP P+ G FC GEIG
Sbjct: 359 --------FDSQLLANYFPGVPIGGFFCNGEIG 383
>gi|416388015|ref|ZP_11685105.1| hypothetical protein CWATWH0003_1932 [Crocosphaera watsonii WH
0003]
gi|357264500|gb|EHJ13384.1| hypothetical protein CWATWH0003_1932 [Crocosphaera watsonii WH
0003]
Length = 406
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 99/453 (21%), Positives = 165/453 (36%), Gaps = 96/453 (21%)
Query: 50 ASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGM---QSKLAATHQLITARLGSRTP 106
+ALS PSL AV+EV +LS P A VG+ S A+ + + + + P
Sbjct: 4 TNALSTQPSLETAVTEVSQTILSRLSSP----ADVGIFWISSAYASDYSRLIPLILEKFP 59
Query: 107 VITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLR 166
L + E+ ++ ++ + L VG +P +K+ L
Sbjct: 60 -----------LPILIGCGGAGIIGQNENKESSEIENNPALSLTVGCLPNVKINPF-FLN 107
Query: 167 SKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVI 226
+ P+ + ++ +S PN I+L + I +LA +D+ P+ +
Sbjct: 108 PESLPDLDSSPEIWQEL----MGVSPQEQPNFILLSDPFSTPINDLLAGLDFAYPQSVKV 163
Query: 227 VGDATSCFLFKTG----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
G ++S + G + + Y V L S + I D ++ G
Sbjct: 164 GGLSSSNLMGVPGTVFYHQADDPQSGFYHQGTVGLALSGN--------IAIDTIVAQGCR 215
Query: 283 PFGPELKAVSVK-----------EHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKY 331
P G + V + E + L MEG + + + +H
Sbjct: 216 PIGESYQVVKGQRNIILELSQDGETKSPLEWLRQLMEGLN------DSDRSLAQH----- 264
Query: 332 PYLYIGVIHQ--RGSLQFGS---RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDS 386
L++GV+ + LQ G R+++ + LGA I G ++PG F+ D+
Sbjct: 265 -SLFVGVVRDEFKQELQPGDFLIRNFLGVDPQLGA-----ISIGDRVRPGQRLQFHLRDA 318
Query: 387 DTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDD 446
T S D LD L +N + G L+FSC R L + +
Sbjct: 319 QT----SADDLDTLL-------------KQSNPPTPIQGALMFSCLGRGQTLYQIPN--- 358
Query: 447 DEDDNDVYFESYPFCRNFPETPLAGIFCYGEIG 479
F+S FP P+ G FC GEIG
Sbjct: 359 --------FDSQLLANYFPGVPIGGFFCNGEIG 383
>gi|218245775|ref|YP_002371146.1| hypothetical protein PCC8801_0913 [Cyanothece sp. PCC 8801]
gi|257058821|ref|YP_003136709.1| hypothetical protein Cyan8802_0940 [Cyanothece sp. PCC 8802]
gi|218166253|gb|ACK64990.1| domain of unknown function DUF1745 [Cyanothece sp. PCC 8801]
gi|256588987|gb|ACU99873.1| domain of unknown function DUF1745 [Cyanothece sp. PCC 8802]
Length = 414
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 105/487 (21%), Positives = 187/487 (38%), Gaps = 109/487 (22%)
Query: 48 KLASALSLSPSLHVAVSEVLDKV-LSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTP 106
+ +ALS PSL AV+EV+D + S P I + S A+ + + + + P
Sbjct: 6 QWTNALSTRPSLEAAVTEVVDGIKRSLSHAPDLGILFI--SSAYASDYPRLIPLILEKLP 63
Query: 107 V---ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKV---- 159
+ + + GIIG++ + + + + + + L V +P + +
Sbjct: 64 LPILVGCSGGGIIGMN--------------DPSQIEEIEGKPALSLTVASLPNVNIQPFY 109
Query: 160 ---ETIPLLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEM 216
ET+P L S + ++ +S P+ I+L + I +LA +
Sbjct: 110 LTPETLPDLDSPPDTWSELI------------GVSPADQPDFILLSSPFSPGITDLLAGL 157
Query: 217 DYGLPEETVIVGDAT-------SCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVP 269
D+ P + G A+ S + T E + +++ V + + +
Sbjct: 158 DFAYPGAVKVGGLASTSIIGVQSALFYHTPETADQ---GVFYQGTVGIALTGN------- 207
Query: 270 EIQFDITMSTGVLPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDD 329
I+ + ++ G P G + +S E N L + L E+++ + E
Sbjct: 208 -IRVESIVAQGCRPIG-QPYEISQGERNVILQLRDHNQQIRPPLELLRELIQTLGEKDRQ 265
Query: 330 KYPY-LYIGVIHQ--RGSLQFGS---RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYH 383
+ L++G++ + +LQ G R+ + + LGA I G I+PG F+
Sbjct: 266 LAEHSLFVGIVSDEFKQTLQPGDFLIRNLLGVDPRLGA-----IAIGDRIRPGQRIQFHL 320
Query: 384 SDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDD 443
D+ T++ D L+LL S R + V G L+FSC R L
Sbjct: 321 RDAQTSA-------DDLQLLLESY------RASVGSTNPVEGALMFSCLGRGEGLY---- 363
Query: 444 GDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLL 503
N F+S F R FP+ L G FC GEIG + G+ L
Sbjct: 364 -------NQPNFDSGLFSRFFPKLSLGGFFCNGEIG----------------PVGGQTFL 400
Query: 504 HHYSTVY 510
H Y++ +
Sbjct: 401 HGYTSAF 407
>gi|17230343|ref|NP_486891.1| hypothetical protein alr2851 [Nostoc sp. PCC 7120]
gi|17131945|dbj|BAB74550.1| alr2851 [Nostoc sp. PCC 7120]
Length = 406
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 185/472 (39%), Gaps = 86/472 (18%)
Query: 46 KPKLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSR- 104
+ + A+ALS PSL AV++V+ + +S P + V + S A+ + + L +
Sbjct: 4 RMQWANALSTRPSLEAAVTDVVQRAVSTLTAPA-DLGLVFISSAFASEYSRVLPLLAEQL 62
Query: 105 -TPVITN-AVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETI 162
PV+ + G+IG A + + E L H G+ L V +V G E +
Sbjct: 63 SVPVMIGCSGGGVIGTAAS----GQTQELEAEAALSLTLAHLPGVNLQVFHVLG---EEL 115
Query: 163 PLLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPE 222
P L S + +++ + +P+ I+L + I +L +D+ P
Sbjct: 116 PDLDSPPDTWINLI------------GVPPSPTPHFILLSSAFSSGINDLLQGLDFAYPG 163
Query: 223 ETVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
++ G A+ + G + NG+L+ V L S + I + ++ G
Sbjct: 164 SVILGGQASVGGM--GGRLALFCNGSLHREGTVGLALSGN--------IVLEPIVAQGCR 213
Query: 283 PFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY-LYIGVIHQ 341
P G L+ V+ E N + ++ L+ +++ + EH + L++GV
Sbjct: 214 PIGEPLQ-VTKAERN-----IILELDEKAPLVVLRDLIASLSEHERALAQHSLFVGVAMD 267
Query: 342 RGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGL 400
L ++ + +LG + I G ++PG F+ DS AS+ ++ L
Sbjct: 268 EFKLSLQQGDFL-IRSILGVDPSGGAIAIGDLVRPGQRLQFHLRDSQ-ASAEELEFL--- 322
Query: 401 RLLNASSCCGTIGRNVTNA--NKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESY 458
+ R T A + G L+FSC R L + F+S
Sbjct: 323 -----------LERYQTKAEFDNAAVGALMFSCVGRGEGLYGKPN-----------FDSE 360
Query: 459 PFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
F R + P+ G FC GEIG + GR LH Y++V+
Sbjct: 361 LFKRYIQDVPVGGFFCGGEIG----------------PVGGRTFLHGYTSVF 396
>gi|414075569|ref|YP_006994887.1| hypothetical protein ANA_C10267 [Anabaena sp. 90]
gi|413968985|gb|AFW93074.1| hypothetical protein ANA_C10267 [Anabaena sp. 90]
Length = 407
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 95/471 (20%), Positives = 177/471 (37%), Gaps = 84/471 (17%)
Query: 48 KLASALSLSPSLHVAVSEVLDKV---LSEPIRPHFAIASVGMQSKLAATHQLITARLGSR 104
+ +ALS PSL A+++V+++ L+ P S S+ + L+ +L
Sbjct: 6 QWTNALSTRPSLEAAINDVVEQAVASLTAPAHLGLVFISSAFMSEYSRLLPLLAEKL--S 63
Query: 105 TPVITN-AVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIP 163
PV+ + G+IG E E++ + L + ++PG+++
Sbjct: 64 VPVLIGCSAGGVIG-KKQAGETEEIEAE-------------PALSLTLAHLPGVEIRPFH 109
Query: 164 LLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEE 223
++ ++E P+ +D+ S+ P I+L +L +D+ P
Sbjct: 110 IV-AEELPDSDSSPMAWIDLLGVPPSVV----PQFILLSSPFASGTNDLLQGLDFAYPGS 164
Query: 224 TVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLP 283
V+ G A+S F+ G N LY V + S + I D ++ G P
Sbjct: 165 VVVGGQASSGFM--NGRVGLFCNDKLYREGTVGIALSGN--------IVLDTIVAQGCRP 214
Query: 284 FGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY-LYIGVIHQR 342
G L+ K + + ++ L+ ++ + E + L++G+
Sbjct: 215 IGEPLQVTKAKRN------IIVELDEKVPLVVLRNLISSLSEEDRTLAQHSLFVGLAMDE 268
Query: 343 GSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLR 401
L S ++ + +LG + + I G I+ G F+ D++ ++ +L +
Sbjct: 269 FRLNLHSGDFL-IRNILGVDPNAGAIAIGDRIRAGQRLQFHLRDAEASAQDLEILLQEYQ 327
Query: 402 LLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFC 461
NAS G L+F+C R L + F+S F
Sbjct: 328 SQNAS-------------EPSPVGALMFTCLGRGTGLYGKPN-----------FDSQLFS 363
Query: 462 RNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLV 512
R + P+ G FC GEIG +SGR LH Y++V+ +
Sbjct: 364 RYLHDLPMGGFFCGGEIG----------------PVSGRTFLHGYTSVFAI 398
>gi|428208504|ref|YP_007092857.1| hypothetical protein Chro_3531 [Chroococcidiopsis thermalis PCC
7203]
gi|428010425|gb|AFY88988.1| protein of unknown function DUF1745 [Chroococcidiopsis thermalis
PCC 7203]
Length = 406
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 142/368 (38%), Gaps = 68/368 (18%)
Query: 149 LIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNID 208
L + ++PG++V ++ S+E P+ ++ AS +P I+L +
Sbjct: 99 LTLAHLPGVQVTPFHIV-SEELPDLDSSPNTWEELLGVPAS----PTPQFILLAEPFSSQ 153
Query: 209 IKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNV 268
I +LA +D+ P + G A+S + G + +N +Y AV + S +
Sbjct: 154 INDLLAGLDFAYPGSVTVGGLASSSQM--GGRINLFFNDKVYREGAVGVALSGN------ 205
Query: 269 PEIQFDITMSTGVLPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHID 328
+ + ++ G P G + + + N + +E L +ILED+ E D
Sbjct: 206 --VVLETIVAQGCRPIGKPYQ-IGACDRN-----IVLELEAQPPLTVLRDILEDLSE--D 255
Query: 329 DK---YPYLYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHS 384
D+ L+IGV ++ + +LG + +F I G I+PG F+
Sbjct: 256 DRELAQNSLFIGVARDEFKQDLEQGDFL-IRNLLGVDPKFGAIAIGDRIRPGQRIQFHLR 314
Query: 385 DSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDG 444
D++T++ +L ++ SS G L+FSC R L
Sbjct: 315 DANTSAEDLEYLLQRYQIQTQSSPAAA-------------GALMFSCLGRGEGLY----- 356
Query: 445 DDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLH 504
F+S F + P PL G FC GEIG + LH
Sbjct: 357 ------GKANFDSQLFRQYLPGLPLGGFFCNGEIG----------------PVGNSTFLH 394
Query: 505 HYSTVYLV 512
Y++V+ +
Sbjct: 395 GYTSVFAI 402
>gi|428204250|ref|YP_007082839.1| hypothetical protein Ple7327_4151 [Pleurocapsa sp. PCC 7327]
gi|427981682|gb|AFY79282.1| hypothetical protein Ple7327_4151 [Pleurocapsa sp. PCC 7327]
Length = 420
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 100/251 (39%), Gaps = 53/251 (21%)
Query: 273 FDITMSTGVLPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGE----EILEDIKEHID 328
+ ++ G P G + VS E N L T +G +G+ E+L D+ + +D
Sbjct: 210 LETIVAQGCRPIGKPYR-VSQGERNIILEL-TEIEDGNEGIASSSRPPLEVLRDLLQTLD 267
Query: 329 DK-----YPYLYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFY 382
DK L+IG+ + G ++ + +LG + + + G I+PG F+
Sbjct: 268 DKDRELAQHSLFIGIARDEFKARLGRGDFL-IRNLLGVDPRKGAMAVGDRIRPGQRVQFH 326
Query: 383 HSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDD 442
DS+T++ +L+ + +S G G L+FSC R L
Sbjct: 327 LRDSETSAEDLEFLLEAYQKEKGTSSPGA-------------GALMFSCLGRGEALYGVP 373
Query: 443 DGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCL 502
+ F+S F R + PL G FC GEIG +SGR
Sbjct: 374 N-----------FDSKLFGRYLHDIPLGGFFCNGEIG----------------PVSGRTF 406
Query: 503 LHHYSTVYLVM 513
LH Y++ + ++
Sbjct: 407 LHGYTSAFAIL 417
>gi|409991855|ref|ZP_11275082.1| hypothetical protein APPUASWS_12371 [Arthrospira platensis str.
Paraca]
gi|291571759|dbj|BAI94031.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409937289|gb|EKN78726.1| hypothetical protein APPUASWS_12371 [Arthrospira platensis str.
Paraca]
Length = 413
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 104/455 (22%), Positives = 181/455 (39%), Gaps = 87/455 (19%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV 107
+ SA+S PSL AV+EV+ K E ++ + V + S ++ + + L + V
Sbjct: 2 EWVSAISTRPSLEAAVTEVVKKC-QESLKSSPDLGLVFISSAFSSDYPRLMPLLAEQLSV 60
Query: 108 ---ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL 164
I GI+G+ + + + + + L + ++PG V+ P
Sbjct: 61 RVLIGCTGGGIVGMQT--------------ETQVKEIEGKPALGLCLAHLPG--VDICPF 104
Query: 165 -LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEE 223
L + + P+ + +++ + + P I+L I +L +DY PE
Sbjct: 105 HLTTNDLPDLDNPPEDWVEL----IGVHPQNDPQFIVLIDPFYGKINDLLQGLDYAYPES 160
Query: 224 TVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLP 283
+ G A+S + + + G +Y +AV + + + I + ++ G P
Sbjct: 161 PKVGGLASSGMMGRA--TAVFCKGEMYSAEAVGVALTGN--------IVLETIVAQGCRP 210
Query: 284 FGPELKAVSVKEHNADCSLLTARMEGYDGLL--HGEEI--LEDIKEHIDD--------KY 331
G + V E + L + DGL G+E+ LE ++E I +
Sbjct: 211 IG---EPYRVSEGERNIILTVQKCSETDGLNINCGDEVAPLEALQELIAELGEEDRQLAQ 267
Query: 332 PYLYIGVIHQ--RGSLQFGS---RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDS 386
L++GV + +L+ G R+ M + +GA + G ++PG F+ DS
Sbjct: 268 NSLFVGVARDEFKANLESGDFLIRNLMGVDPRVGA-----MAIGDRVRPGQRIQFHLRDS 322
Query: 387 DTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVF--GGLIFSCFSRSVPLSEDDDG 444
T++ L GL T + +T A EV G L+FSC R E+ G
Sbjct: 323 RTSAED----LKGLL---------TRHQKLTEATTEVAKEGALMFSCLGRG----ENLYG 365
Query: 445 DDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIG 479
D D N R + + PL G FC GEIG
Sbjct: 366 QPDFDSN--------LFREYFKIPLTGFFCNGEIG 392
>gi|427728722|ref|YP_007074959.1| hypothetical protein Nos7524_1486 [Nostoc sp. PCC 7524]
gi|427364641|gb|AFY47362.1| hypothetical protein Nos7524_1486 [Nostoc sp. PCC 7524]
Length = 403
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 109/476 (22%), Positives = 179/476 (37%), Gaps = 100/476 (21%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSR--T 105
+ A+ALS PSL AV+EV+ + +S P + V + S A+ + + L +
Sbjct: 6 QWANALSTRPSLEAAVAEVVQRTVSLLTAPA-DLGLVFISSAFASEYSRVLPLLAEKLSV 64
Query: 106 PVITN-AVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL 164
PV+ + G+IG A + + E L H G+ L V +V E +P
Sbjct: 65 PVLIGCSGGGVIGTGAS----GQTQELEAEAALSLTLAHLPGVDLQVFHV---VAEDLPD 117
Query: 165 LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEET 224
L S + ++D + + P I+L + I +L +D+ P
Sbjct: 118 LDSPPDAWIDLID------------VEPSAKPQFILLSSAFSSGINDLLQGLDFAYPGSV 165
Query: 225 VIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPF 284
++ G A++ L G + N LY V L S + I + ++ G P
Sbjct: 166 IVGGQASAGGL--GGRLALFCNDTLYRDGTVGLALSGN--------IVLETIVAQGCKPI 215
Query: 285 GPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPYL-----YIGVI 339
G L+ AD +++ E + +L D+ + +K L ++GV
Sbjct: 216 GEPLQVT-----KADRNIILEIDEKVPLV-----VLRDLIASLSEKERMLAQHSLFVGVA 265
Query: 340 HQ--RGSLQFGS---RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSI 394
+ SLQ G RS + + GA I G ++PG F+ D+ ++
Sbjct: 266 MDEFKLSLQQGDFLIRSILGVDPAGGA-----IAIGDLVRPGQRLQFHLRDAQASA---- 316
Query: 395 DVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVY 454
D L+ L + R + L+FSC R L + D +
Sbjct: 317 ---DDLKFL--------LERYQQQGSPSAAAALMFSCVGRGEGLYGKPNFDSE------L 359
Query: 455 FESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
F SY P P+ G FC GEIG + GR LH Y++V+
Sbjct: 360 FNSY-----LPAIPVGGFFCGGEIG----------------PVGGRTFLHGYTSVF 394
>gi|220909604|ref|YP_002484915.1| hypothetical protein Cyan7425_4241 [Cyanothece sp. PCC 7425]
gi|219866215|gb|ACL46554.1| domain of unknown function DUF1745 [Cyanothece sp. PCC 7425]
Length = 411
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 111/491 (22%), Positives = 191/491 (38%), Gaps = 115/491 (23%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV 107
K ASA+S SL AV EV + +P +A V + S A+ + + L V
Sbjct: 2 KWASAISTRYSLEAAVKEVTRQTQHALGQPA-DLALVFISSSFASEYSRLLPLLQDHLTV 60
Query: 108 ---ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL 164
I G+IG++ D + + + L++G++PG+ + +
Sbjct: 61 PHLIGCGGEGVIGMN--------------RDGEPEEVESEPALALMLGHLPGVNLHPFHI 106
Query: 165 LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEET 224
+ +++ P+ +D+ ++ + P+ ++L I +L +DY P
Sbjct: 107 V-AEDLPDLDSPPDQWVDL----IGVAVETQPHFVLLADALTAKINDLLQGLDYAYPGAI 161
Query: 225 VIVG-----DATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMST 279
I G ++ C LF N LY + + S + + + ++
Sbjct: 162 KIGGLTNSNNSRGCGLF--------CNSTLYREGCIGIALSGN--------LVLETIVAQ 205
Query: 280 GVLPFGPELKAVSVKEHN-----------ADCSLLTARM----EGYDGLLHGEEILEDIK 324
G P G E V+ E N A S +AR+ E + L+ E+ E +
Sbjct: 206 GCRPIG-EPYRVAEAERNIVLKLEPLTVEASPSSASARLQTPLEALEDLIR--ELSESDR 262
Query: 325 EHIDDKYPYLYIGVIHQ--RGSLQFGS---RSYMSLYEVLGAEDQFFIVNGVGIKPGDSF 379
+ D L++GV+ + +L+ G R+ + L GA I G ++PG
Sbjct: 263 QLAQDS---LFVGVVRDEFKQTLEPGDFLIRNLIGLDPRAGA-----IAIGDRVRPGQRI 314
Query: 380 IFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLS 439
F+ DS+ AS++ +++L L+ N + G AN G L+FSC R V L
Sbjct: 315 QFHLRDSE-ASAAELELL--LQRYNQNLEVG--------ANP--IGALLFSCLGRGVGLY 361
Query: 440 EDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISG 499
+F+S F R PL+G FCYGEIG + G
Sbjct: 362 -----------GKPHFDSRLFSRYLNTIPLSGFFCYGEIG----------------PLGG 394
Query: 500 RCLLHHYSTVY 510
LH Y++ +
Sbjct: 395 ETFLHGYTSAF 405
>gi|427715630|ref|YP_007063624.1| hypothetical protein Cal7507_0291 [Calothrix sp. PCC 7507]
gi|427348066|gb|AFY30790.1| protein of unknown function DUF1745 [Calothrix sp. PCC 7507]
Length = 410
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 99/472 (20%), Positives = 183/472 (38%), Gaps = 82/472 (17%)
Query: 46 KPKLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSR- 104
K + A+ALS PSL AV++V+++ +S P + V + S A+ + + L +
Sbjct: 4 KMQWANALSTRPSLEAAVTDVVERAVSSLTAPA-DLGLVFISSAFASEYSRLLPLLSEKL 62
Query: 105 -TPVITN-AVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETI 162
PV+ + G+IG + + + + L + ++PG+ +E
Sbjct: 63 SVPVLIGCSGGGVIGTTT--------------EGQTQELEAEAALSLTLAHLPGVDLEVF 108
Query: 163 PLLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPE 222
++ S+E P+ +D+ S +P I+L + I +L +D+ P
Sbjct: 109 HVV-SEELPDLDSSPDAWIDLIGVPPS----PTPQFILLCSSFSSGINDLLQGLDFAYPG 163
Query: 223 ETVIVGDATSCFLF-KTGENSQNYNG-ALYFFDAVALVFSRDSDNSNVPEIQFDITMSTG 280
+ G A++ + +T + G LY + L S + I + ++ G
Sbjct: 164 SVTLGGQASAGGMSGRTALFCHDAGGDRLYREGTLGLALSGN--------IAVETIVAQG 215
Query: 281 VLPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY-LYIGVI 339
P G L+ V+ E N + ++ L+ E++ ++ E + L++GV
Sbjct: 216 CRPIGKPLQ-VTKAERN-----IILELDEQVPLVVLREVIANLSEQERMLAQHSLFVGVA 269
Query: 340 HQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLD 398
L ++ + +LG + I G ++PG F+ D++ ++ +
Sbjct: 270 MDGFKLTLQQGDFL-IRGILGVDPSAGAIAIGDRVRPGQRLQFHLRDAEASA-------E 321
Query: 399 GLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESY 458
L LL +N NA L+FSC R L + F+S
Sbjct: 322 DLELLLQQY------QNQRNAEPAAIAALMFSCVGRGEGLYGQPN-----------FDSD 364
Query: 459 PFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
F R + P+ G FC GEIG I G LH Y++V+
Sbjct: 365 LFRRYIKDIPVGGFFCGGEIG----------------PIGGSTFLHGYTSVF 400
>gi|428775879|ref|YP_007167666.1| hypothetical protein PCC7418_1252 [Halothece sp. PCC 7418]
gi|428690158|gb|AFZ43452.1| protein of unknown function DUF1745 [Halothece sp. PCC 7418]
Length = 416
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 72/359 (20%), Positives = 146/359 (40%), Gaps = 56/359 (15%)
Query: 134 EDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHYSASISGC 193
E+N + + + L V ++P + V+ + + + P+ D+ +S
Sbjct: 80 EENQAQEIEGNPALSLTVAHLPRVNVQGFHI-SADQIPDLDSAATAWTDL----TGVSPD 134
Query: 194 SSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFL-FKTG---ENSQNYNGAL 249
P+ I+L + +L +D+ P + G A++ + +TG E++ NG L
Sbjct: 135 QDPDFILLADPFFSKVNDLLEGLDFAYPSAKKVGGLASAMAMGMQTGLFYESASEKNGGL 194
Query: 250 YFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCSLLTARMEG 309
V + S + I+ D ++ G P GP+ + ++ E N + A+++G
Sbjct: 195 LREGIVGVALSGN--------IKMDTIVAQGCRPIGPQYQ-ITQGERNVLAEV--AQVKG 243
Query: 310 YDGLLHGEEILEDIKEHIDDKYP--------YLYIGVIHQRGSLQFGSRSYMSLYEVLGA 361
+G + L+ ++E +++ P L++G+ L+ ++ + +LG
Sbjct: 244 -NGTESAKPPLQALRELMNELSPEDQQLAQDSLFLGIARDEFKLELQQGDFL-IRNLLGV 301
Query: 362 EDQF-FIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNAN 420
+ + I G ++PG F+ D +T++ +L+ + +S
Sbjct: 302 DPKVGAIAVGDKLRPGQRIQFHLRDGNTSAEDLQVLLEQYQQDEQTSTP----------- 350
Query: 421 KEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIG 479
G L+FSC R L + F+S F ++ + PL G FC GEIG
Sbjct: 351 ---VGALLFSCLGRGKELYGKPN-----------FDSELFRQSMSQIPLGGFFCNGEIG 395
>gi|411118139|ref|ZP_11390520.1| hypothetical protein OsccyDRAFT_1996 [Oscillatoriales
cyanobacterium JSC-12]
gi|410711863|gb|EKQ69369.1| hypothetical protein OsccyDRAFT_1996 [Oscillatoriales
cyanobacterium JSC-12]
Length = 425
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 106/486 (21%), Positives = 181/486 (37%), Gaps = 93/486 (19%)
Query: 45 SKPKLASALSLSPSLHVAVSEVLDKVLSEPIRPHFA-----IASVGMQSKLAATHQLITA 99
S + SALS PSL A++EV+++ L+ + H A S S+ L+
Sbjct: 7 SSMRWVSALSTRPSLESAIAEVVERTLA--VLQHSADLGLIFISSAFTSEYPRLMPLLQE 64
Query: 100 RLGSRTPVITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKV 159
+L P+I G+IG+ + E++ E L H G+++ ++P
Sbjct: 65 KL-PNVPMIGCGGAGVIGM-GNYSRAREIEG---EPALSLTLAHLPGVLVHPFHLP---P 116
Query: 160 ETIPLLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYG 219
E++P L S + +V ++ P ++L I +L +D+
Sbjct: 117 ESLPDLDSSPDTWVDLV------------GVAPTQQPQFVLLAEPAFGRINDLLQGLDFA 164
Query: 220 LPEETVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMST 279
P + G ++ F G NY VAL+ + I D ++
Sbjct: 165 YPGSAKVGGLSSGGFGSNAGALFCNYQLHREGTVGVALMGN----------IVLDAIVAQ 214
Query: 280 GVLPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGE---------EILEDIKEHIDDK 330
G P G VS E N +L + G E+L+D+ +++ ++
Sbjct: 215 GCRPIGQPF-LVSEGERNIMLALEAQGEVTSNSFAGGAAVSQKGTPLEMLQDLIQNLSEE 273
Query: 331 -----YPYLYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHS 384
L++GV ++ + +++G + + I G I+PG F+
Sbjct: 274 DRLLAQHSLFVGVAQSEFKQTLEQGDFL-IRQLIGVDPRVGAIAIGDRIRPGQRIQFHLR 332
Query: 385 DSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDG 444
D+ T S+ D+ LR N N++ G L+FSC R L D
Sbjct: 333 DAKT---SAEDLEAMLRRYQV---------NNPNSSSTAIGALMFSCTGRGEGLYGQSD- 379
Query: 445 DDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLH 504
F+S F FP PL+G FC GEIG + G LH
Sbjct: 380 ----------FDSQLFTNYFPGVPLSGFFCNGEIG----------------PVGGNTFLH 413
Query: 505 HYSTVY 510
+++V+
Sbjct: 414 GFTSVF 419
>gi|75907272|ref|YP_321568.1| hypothetical protein Ava_1049 [Anabaena variabilis ATCC 29413]
gi|75700997|gb|ABA20673.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 406
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 104/474 (21%), Positives = 179/474 (37%), Gaps = 94/474 (19%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSR--T 105
+ A+ALS PSL AV++V+ + +S P + V + S A+ + + L +
Sbjct: 6 QWANALSTRPSLEAAVTDVVQRAVSTLTAPA-DLGLVFISSAFASEYSRVLPLLAEQLSV 64
Query: 106 PVITN-AVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL 164
PV+ + G+IG A + + E L H G+ L V +V G E +P
Sbjct: 65 PVMIGCSGGGVIGTAAS----GQTQELEAEAALSLTLAHLPGVNLQVFHVLG---EELPD 117
Query: 165 LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEET 224
L S + +++ + +P+ I+L + I +L +D+ P
Sbjct: 118 LDSPPDTWINLI------------GVPPSPTPHFILLSSAFSSGINDLLQGLDFAYPGSV 165
Query: 225 VIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPF 284
++ G A+ + G + NG+L+ V L S + I + ++ G P
Sbjct: 166 ILGGQASVGGM--GGRLALFCNGSLHREGTVGLALSGN--------IVLEPIVAQGCRPI 215
Query: 285 GPELKAVSVK-----EHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPYLYIGVI 339
G L+ + E + L+ R D + E + +H L++GV
Sbjct: 216 GEPLQVTKAERNIILELDEKVPLVVLR----DLIASLSEKERALAQH------SLFVGVA 265
Query: 340 HQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLD 398
L ++ + +LG + I G ++PG F+ DS AS+ ++ L
Sbjct: 266 MDEFKLSLQQGDFL-IRSILGVDPSGGAIAIGDLVRPGQRLQFHLRDSQ-ASAEELEFL- 322
Query: 399 GLRLLNASSCCGTIGRNVTNA--NKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFE 456
+ R T + G L+FSC R L + F+
Sbjct: 323 -------------LERYQTKPEFDNSAVGALMFSCVGRGEGLYGKPN-----------FD 358
Query: 457 SYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
S F R + P+ G FC GEIG + GR LH Y++V+
Sbjct: 359 SELFKRYIQDVPVGGFFCGGEIG----------------PVGGRTFLHGYTSVF 396
>gi|170077800|ref|YP_001734438.1| hypothetical protein SYNPCC7002_A1183 [Synechococcus sp. PCC 7002]
gi|169885469|gb|ACA99182.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 432
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 106/481 (22%), Positives = 186/481 (38%), Gaps = 97/481 (20%)
Query: 51 SALSLSPSLHVAVSEVLDKV---LSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV 107
+ALS SL A+ E + ++ LS S S+ A L+T +L + +
Sbjct: 27 NALSTQASLEGAIDEAVAQIQENLSGSADLAILFISAAFASEYARILPLLTEKLACKVVI 86
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYG---IVLIVGYVPGLKVETIPL 164
V+ IIG DA + C G + L V ++P VE +P
Sbjct: 87 GCGGVS-IIGTDAT-----------------GETQECEGQPALSLTVAHLP--DVEVVPF 126
Query: 165 -LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEE 223
+ K+ P+ +DI +S + PN I+L + I +LA +D+ P
Sbjct: 127 HVTEKDLPDLDSAPDPWIDIF----GVSPEAEPNFILLADPFSSSITDLLAGLDFAYPNA 182
Query: 224 TVIVGDATSCFLFKTGENSQNYNGALYFFDA---VALVFSRDSDNSNVPEIQFDITMSTG 280
+ G +S +G L++++A L+ S + IQ + ++ G
Sbjct: 183 AKVGGLTSSGGRTASG---------LFYYEADQEPTLLRSGTVGVALAGNIQMETVVAQG 233
Query: 281 VLPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEI--LEDIKEHIDDK-----YPY 333
P G E+ ++ + N L A E L HG + L+++ +D++
Sbjct: 234 CRPIG-EVYQITQCDRNIITELSVAEGEQ---LRHGSPLRFLQELIAELDEEDQALAQDS 289
Query: 334 LYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSS 392
L+IG+ + ++ + +LG + + I G I+ G F+ D++T S+
Sbjct: 290 LFIGIAMDAFKQKLIHGDFL-IRNLLGVDPRAGAIAVGDRIRAGQRVQFHLRDAET-SAE 347
Query: 393 SIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDND 452
+ VL L+ A+ + + FG L+F+C R L + +
Sbjct: 348 DLSVL--LQQFQAN-----------DPLEPPFGALMFACLGRGKGLYGEPN--------- 385
Query: 453 VYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLV 512
F+S F P L G FC GEIG + R LH Y++V+ +
Sbjct: 386 --FDSTLFSTALPTPNLGGFFCNGEIG----------------PVGDRTFLHGYTSVFGI 427
Query: 513 M 513
+
Sbjct: 428 L 428
>gi|224168137|ref|XP_002339115.1| predicted protein [Populus trichocarpa]
gi|222874431|gb|EEF11562.1| predicted protein [Populus trichocarpa]
Length = 70
Score = 47.8 bits (112), Expect = 0.013, Method: Composition-based stats.
Identities = 26/57 (45%), Positives = 36/57 (63%), Gaps = 3/57 (5%)
Query: 270 EIQFDITMSTGVLPFGPELKAVSVKEHNAD---CSLLTARMEGYDGLLHGEEILEDI 323
EIQF +S+GV GP KAVSV++ ++ +LLTAR EG + G+ IL+DI
Sbjct: 8 EIQFHAALSSGVSAIGPRYKAVSVRKIGSETGCTTLLTARREGEQEIQDGQRILDDI 64
>gi|300867474|ref|ZP_07112126.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300334529|emb|CBN57294.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 419
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 100/479 (20%), Positives = 175/479 (36%), Gaps = 83/479 (17%)
Query: 45 SKPKLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSR 104
S+ + SALS PSL A+ EV+++ A G+Q I++ S
Sbjct: 5 SEMQWVSALSTRPSLEAALKEVVEQ------------AQQGLQGPADLGLVFISSAFASE 52
Query: 105 TPVITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL 164
+ + + A + + + + + L + +PG+KV+ +
Sbjct: 53 YSRLMPLLQEYLPAAAIAGCGGGGVIGMNRGGITEEVEGTPALSLSLARLPGVKVKAFHI 112
Query: 165 LRSKEEPEF-SMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEE 223
++E P+ S D ++ I +S P I+L + I +L +DY P
Sbjct: 113 A-AEELPDMDSPPDTWVEQI-----GVSAQEQPQFILLADPFSSKINDLLQGLDYAYPGS 166
Query: 224 TVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLP 283
+ G A+ + ++ NY LY V + S + I + ++ G P
Sbjct: 167 AKVGGLASGNGMGRSAALFCNYR--LYREGTVGVALSGN--------IVLETIVAQGCRP 216
Query: 284 FGPELKAVSVKEHNADCSLLTARMEGYD---GLLHGEEILEDIKEHI-----DDKY---P 332
G + V E + L E D G + LE +++ I +D+
Sbjct: 217 IG---QPYRVTEGERNILLGLEEQESLDQRTGSGRKQSPLEALRDLISTLSEEDRQLAQH 273
Query: 333 YLYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASS 391
L++GV+ L+ ++ + +LG + + I G ++PG F+ D+ T S+
Sbjct: 274 SLFVGVVRDEFKLKLDQGDFL-IRNLLGVDPKVGAIAIGDRVRPGQRIQFHLRDART-SA 331
Query: 392 SSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDN 451
+++L A N G L+FSC R L
Sbjct: 332 EDLEMLLARYQREAP----------FNPVAARAGALMFSCMGRGEGLY-----------G 370
Query: 452 DVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
+ F+S F R PL G FC GEIG + G LH Y++V+
Sbjct: 371 EPSFDSRLFSRYLNNIPLTGFFCNGEIG----------------PVGGSTFLHGYTSVF 413
>gi|307152903|ref|YP_003888287.1| hypothetical protein Cyan7822_3057 [Cyanothece sp. PCC 7822]
gi|306983131|gb|ADN15012.1| domain of unknown function DUF1745 [Cyanothece sp. PCC 7822]
Length = 409
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 98/477 (20%), Positives = 179/477 (37%), Gaps = 83/477 (17%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV 107
+ A+ALS PSL A++EV++KV P+ I I++ S P
Sbjct: 2 QWANALSTRPSLEAAITEVVEKVTQSITNPNIGIV-------------FISSAYASDYPR 48
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLRS 167
+ + + L + + ++L + + + L V +PG++++ L
Sbjct: 49 VLPLILDKLPLPILIGCGGGGIIGVNSGDVL-EIEGTPALSLSVAALPGVQMQPF-YLEG 106
Query: 168 KEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIV 227
+ P+ K +D+ +S +P I+L I +L +D+ P I
Sbjct: 107 ENLPDLDAPPKAWIDL----IGVSPEQNPQFILLCDPFTSKISDLLEGLDFAYPGAIKIG 162
Query: 228 GDATSCFLFKTG----ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLP 283
G A+S + +S++ N ++Y V L D I + ++ G P
Sbjct: 163 GLASSATMGLQQALFYHHSKDNNTSVYREGTVGLALWGD--------IILETIVAQGCRP 214
Query: 284 FGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDK-----YPYLYIGV 338
G + +S E N L E IL+D+ + + ++ L+IG+
Sbjct: 215 IGSPYR-ISQCERNIIVELTDTDDESTS--RPPLAILQDLIQTLSEQDRLLAQQSLFIGI 271
Query: 339 IHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVL 397
L+ ++ + +LG + Q I ++ G F+ D+ T S++ +++
Sbjct: 272 ARDEFKLKLNHGDFL-IRNLLGVDPRQGAIAIADRVRNGQRIQFHLRDAQT-SATDLEI- 328
Query: 398 DGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFES 457
LL A R ++ V G L+FSC R L + D
Sbjct: 329 ----LLQAYQ------REAAQSSPAV-GALMFSCLGRGEGLYGKPNFDSR---------- 367
Query: 458 YPFCRNF-PETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLVM 513
RN+ P+ + G FC GEIG + GR LH Y++ + +
Sbjct: 368 --LLRNYLPKISIGGFFCNGEIG----------------PVGGRTFLHGYTSAFAIF 406
>gi|86609276|ref|YP_478038.1| hypothetical protein CYB_1819 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86557818|gb|ABD02775.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 441
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 78/359 (21%), Positives = 133/359 (37%), Gaps = 70/359 (19%)
Query: 165 LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEET 224
LR + P+ +D +S S P+ ++L + I +L +D+ P
Sbjct: 133 LRGNQLPDLDAAPSAWVD----CVGVSPQSKPHFLLLADGFSSGISELLQGLDFAYPGSV 188
Query: 225 VIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNV------PEIQFDITMS 278
+ G A+ G N AL+ DA L R+ + D ++
Sbjct: 189 KVGGLAS-------GGRGPRGN-ALFLLDARTLTPRRELYREGTVGLALYGNVVLDAVVA 240
Query: 279 TGVLPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEH---IDDKYPY-- 333
G P G L+ V+ E N L EG L +L+D+ E +D +
Sbjct: 241 QGCRPIGDPLR-VTEAEGNVILGL-----EGRPPL----AVLQDLAERLSPVDQRLARHS 290
Query: 334 LYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSS 392
L+IG++ + ++ + +LG + + + G ++PG + F+ D+ T++
Sbjct: 291 LFIGLLMDEFKSEPTPGDFL-IRVILGVDPRVGALAIGDQVRPGQTVQFHLRDAQTSA-- 347
Query: 393 SIDVLDGLRLLNASSCC-GTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDN 451
+ LR + C + ++ + E G L+FSC R L D
Sbjct: 348 -----EDLRWALSRYCAERNLRQSPSQPRPEPCGALMFSCLGRGKGLYGTPD-------- 394
Query: 452 DVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
F+S F E PL G FC GEIG + G LH Y++ +
Sbjct: 395 ---FDSQRFRELLGELPLGGFFCNGEIG----------------PVGGSTFLHGYTSCF 434
>gi|367467911|ref|ZP_09467822.1| hypothetical protein PAI11_11030 [Patulibacter sp. I11]
gi|365817029|gb|EHN12016.1| hypothetical protein PAI11_11030 [Patulibacter sp. I11]
Length = 388
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 66/270 (24%), Positives = 106/270 (39%), Gaps = 61/270 (22%)
Query: 253 DAVALVFSRDSDNSNVPEIQFDIT-----MSTGVLPFGPELKAVS----VKEHNADCSLL 303
D L+ R + ++ I+F +S G P GPEL + V + A +
Sbjct: 167 DGRPLLHDRGATDAAAIGIRFSGVEVLPCVSQGARPIGPELAVTAGEGGVIQELAGRPAI 226
Query: 304 TARMEGYDGLLHGEEILEDIKEHIDDKYPYLYIGVIHQRGSLQFGSRSYMSLYEVLGAED 363
A E DGL G E I+ L +G++ G ++G+ ++ + + GA+
Sbjct: 227 EALRETVDGL--GAEDQARIRH-------GLLLGLVVGPGRPEYGAGDFV-VRGLAGADP 276
Query: 364 QF-FIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKE 422
+ +V G ++ G + D +TA+ D+ D LRL A++ G +
Sbjct: 277 EAGAVVVGAPVEVGQIAQLHVRDPETATR---DLEDALRLRRAAAGSG-----------Q 322
Query: 423 VFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGR 482
V G L F+C R D G D D + + E P PLAG+F GEIG
Sbjct: 323 VAGALAFTCNGRG----HDMFGHDHHDADAIQRELGPL-------PLAGMFSAGEIG--- 368
Query: 483 GLTRLINQEEEDCSISGRCLLHHYSTVYLV 512
+ GR +H ++ V
Sbjct: 369 -------------PVGGRPFVHGFTATVAV 385
>gi|254412137|ref|ZP_05025912.1| conserved domain protein [Coleofasciculus chthonoplastes PCC 7420]
gi|196181103|gb|EDX76092.1| conserved domain protein [Coleofasciculus chthonoplastes PCC 7420]
Length = 416
Score = 46.6 bits (109), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 71/334 (21%), Positives = 133/334 (39%), Gaps = 63/334 (18%)
Query: 190 ISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFL-FKTG---ENSQNY 245
+S P+ I+L + I +L +D+ P + G A++ + ++G +S+ Y
Sbjct: 127 VSPQDQPHFILLADPFSSKINDLLQGLDFAYPGSVKVGGLASASAMGVQSGLFYRDSERY 186
Query: 246 NGA-LYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPFGPELKAVSVKEHN-----AD 299
+G L+ + + S + + D +S G P G + ++ E N AD
Sbjct: 187 SGGTLHREGTIGVALSGN--------VVLDPIVSQGCRPIGQPYQ-ITKGERNIVLELAD 237
Query: 300 CSLLT-ARMEGYDGLLHGEEILEDIKEHIDDKYPY-LYIGVIHQRGSLQFGSRSYMSLYE 357
+ ++ + +E L ++++++ E + + L+IG+ G ++ +
Sbjct: 238 SNGMSFSEVESQPPLAVLRDVIQNLSESDRELAQHSLFIGIARDEFKQSLGQGDFL-IRN 296
Query: 358 VLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNV 416
+LG + + I G ++PG F+ D+ T+ + L LL + +N
Sbjct: 297 LLGVDPRLGAIAIGDRVRPGQRIQFHLRDARTSE-------EDLELLLQNY------QNQ 343
Query: 417 TNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYG 476
N+ E G L+FSC R L D F+S CR + G FC G
Sbjct: 344 VNSTPETAGALMFSCLGRGQGLYGKPD-----------FDSQLLCRYINNISVGGFFCNG 392
Query: 477 EIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
EIG + G LH Y++V+
Sbjct: 393 EIG----------------PVGGSTFLHGYTSVF 410
>gi|440683596|ref|YP_007158391.1| protein of unknown function DUF1745 [Anabaena cylindrica PCC 7122]
gi|428680715|gb|AFZ59481.1| protein of unknown function DUF1745 [Anabaena cylindrica PCC 7122]
Length = 408
Score = 45.8 bits (107), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 97/471 (20%), Positives = 180/471 (38%), Gaps = 84/471 (17%)
Query: 48 KLASALSLSPSLHVAVSEVLDKV---LSEPIRPHFAIASVGMQSKLAATHQLITARLGSR 104
+ A+ALS SL AV++V+ + L+ P S S+ + L+ +L
Sbjct: 6 QWANALSTHHSLEAAVADVVQQAVSSLTAPANLGLVFISSAFTSEYSRLLPLLAEKL--S 63
Query: 105 TPVITN-AVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIP 163
PV+ + G+IG + + + + + L + ++PG+ ++
Sbjct: 64 VPVLIGCSAAGVIGTTSK--------------SQTQEIEAEPALSLTLAHLPGVNLQAFH 109
Query: 164 LLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEE 223
+L + S D ++ + + I P I+L + +L +D+ P
Sbjct: 110 VLADQLPDLDSSPDAWINLLGVPPSPI-----PQFILLSSAFSSGTNDLLQGLDFAYPGS 164
Query: 224 TVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLP 283
V+ G A+ F+ + + N LY V L S D I + ++ G P
Sbjct: 165 VVVGGQASGGFV--SDRIALFCNNRLYRQGTVGLALSGD--------IVLETIVAQGCRP 214
Query: 284 FGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY-LYIGVIHQR 342
G L+ V+ E N + ++ L+ +++ ++ E + L++G+
Sbjct: 215 IGEPLQ-VTKAERN-----IILELDEKVPLVVLRDLISNLSEEEKMLAQHSLFVGLAMNE 268
Query: 343 GSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLR 401
L ++ + +LG + I G ++PG F+ D+ AS+ ++ L
Sbjct: 269 FKLSLKQGDFL-IRNLLGVDPSAGAIAIGDRVRPGQRLQFHLRDAQ-ASAEDLEFL---- 322
Query: 402 LLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFC 461
L ++ ++ F L+FSC R L + F+S F
Sbjct: 323 -LQEY-------QDQSSNESSPFAALMFSCVGRGAGLYGKSN-----------FDSELFQ 363
Query: 462 RNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLV 512
R PL G FC GEIG +SGR LLH Y++V+ +
Sbjct: 364 RYLHNIPLGGCFCGGEIG----------------PVSGRTLLHGYTSVFAI 398
>gi|434390944|ref|YP_007125891.1| protein of unknown function DUF1745 [Gloeocapsa sp. PCC 7428]
gi|428262785|gb|AFZ28731.1| protein of unknown function DUF1745 [Gloeocapsa sp. PCC 7428]
Length = 400
Score = 45.8 bits (107), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 105/479 (21%), Positives = 170/479 (35%), Gaps = 101/479 (21%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV 107
+ A+A+S SL AV+EV+DK AS +QS I+A S P
Sbjct: 5 QWANAVSTRASLEAAVAEVVDK------------ASALLQSPADLGLVFISAAFTSEYPR 52
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL--- 164
+ + + L + + +F+ + L + +PG+ V +
Sbjct: 53 LLPLLQEKLRNIKVLIGCGGGGIIGTNQHAVQEFEGVPALSLSLAQLPGVTVTPFHIAAE 112
Query: 165 -LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEE 223
L + P + VD F +S P I+L + + +L +DY P
Sbjct: 113 QLPDPDSPPKAWVDLF---------GVSPAEQPQFILLSDPFSSGVNDLLQGIDYAYPGS 163
Query: 224 TVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLP 283
+ G A+ G N LY V + S + I D ++ G P
Sbjct: 164 ITVGGLASGSQ--TPGRIGLFCNDKLYRSGTVGVALSGN--------IVLDTIVAQGCRP 213
Query: 284 FGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPYL-----YIGV 338
G E VS E N +L E L E+L D+ + D L +IGV
Sbjct: 214 IG-EPYRVSASERNILLAL-----EEQPPL----EVLRDLISSLSDADRQLAEHSLFIGV 263
Query: 339 IHQ--RGSLQFGS---RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSS 393
+ + +L+ G R+ + + +GA I ++PG F+ D++T++
Sbjct: 264 VRDEFKQNLEHGDFLIRNLLGVDPKVGA-----IAVADLVRPGQRIQFHLRDAETSA--- 315
Query: 394 IDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDV 453
D L L T+++ G L+FSC R L +
Sbjct: 316 ----DDLEWLLQR-------YQQTHSHVSPTGALMFSCLGRGEMLYGKPN---------- 354
Query: 454 YFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLV 512
F+S F P P+ G F GEIG + G LH Y++V+ +
Sbjct: 355 -FDSQLFSSYMPNIPMGGFFGNGEIG----------------PVGGSTFLHGYTSVFAI 396
>gi|443312374|ref|ZP_21041992.1| hypothetical protein Syn7509DRAFT_00015920 [Synechocystis sp. PCC
7509]
gi|442777612|gb|ELR87887.1| hypothetical protein Syn7509DRAFT_00015920 [Synechocystis sp. PCC
7509]
Length = 402
Score = 45.4 bits (106), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 100/473 (21%), Positives = 178/473 (37%), Gaps = 83/473 (17%)
Query: 44 LSKPKLASALSLSPSLHVAVSEVLDK---VLSEPIRPHFAIASVGMQSKLAATHQLITAR 100
++K + A+ALS SL AV+EV+++ +L P S S L+ +
Sbjct: 1 MTKMQWANALSTHSSLESAVAEVVERATSILQAPADLAIVFISAAFTSDYPRLLPLLHEK 60
Query: 101 LGSRTPVITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVE 160
L + V+ G I + E+ E LED+ + L + ++P ++++
Sbjct: 61 L-TDIKVLIGCGGGGIVGVSQWGEMQE-----LEDS--------PALSLSLAHLPDVEIQ 106
Query: 161 TIPLLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGL 220
++ +E P+ +D+ + P I+L + I +L +D+
Sbjct: 107 AFHIV-PEELPDLDSPPHTWVDVIGVEPDLM----PQFILLSDPFSSKINDLLQGLDFAY 161
Query: 221 PEETVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTG 280
P V+ G A+ G NG LY V + S + I + ++ G
Sbjct: 162 PGSVVVGGLASGGS--NPGVTGLFCNGCLYREGTVGVALSGN--------IVLETIVAQG 211
Query: 281 VLPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY--LYIGV 338
P G + VS + N + ++ L +++E + E D + L++GV
Sbjct: 212 CRPIGKPYQ-VSSSDRN-----IILELDEKPPLTQLRQLIESLNEE-DQRLAQTALFVGV 264
Query: 339 IHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSSSIDVL 397
G ++ + +LG + + I G I+PG F+ D+ T++ D+
Sbjct: 265 TRDEFKQNLGQGDFL-IRNLLGVDPNAGAIAIGDRIRPGQRIQFHLRDAQTSAE---DLE 320
Query: 398 DGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFES 457
L+ S + G L+F+C R L + F+S
Sbjct: 321 WWLQKYQKSH----------QSQPSEAGALMFACLGRGEGLYGKPN-----------FDS 359
Query: 458 YPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
F R + PL G FC GEIG +SG LH Y++V+
Sbjct: 360 GLFQRYLSDIPLGGFFCSGEIG----------------PVSGNTFLHGYTSVF 396
>gi|428316599|ref|YP_007114481.1| protein of unknown function DUF1745 [Oscillatoria nigro-viridis PCC
7112]
gi|428240279|gb|AFZ06065.1| protein of unknown function DUF1745 [Oscillatoria nigro-viridis PCC
7112]
Length = 417
Score = 45.4 bits (106), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 102/478 (21%), Positives = 175/478 (36%), Gaps = 84/478 (17%)
Query: 45 SKPKLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSR 104
+K + SALS PSL A+ EV+++ + P +A + + S A+ + + L
Sbjct: 5 NKMQWVSALSTRPSLESALKEVVEQADRDLEGPA-DLALIFISSAFASEYSRLMPLLREL 63
Query: 105 TPVITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL 164
PV A L L + + + L + +PG+ V T
Sbjct: 64 LPV-----------PAILGCGGGGVIGTNRGGLTEEVEGSPAVSLSLARLPGVNVRTFH- 111
Query: 165 LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEET 224
+ ++E P+ +D+ + P+ IIL + I +L +DY P
Sbjct: 112 VGAEELPDLDSPPDTWVDL----LGVPAREEPHFIILADPFSAKINDLLQGLDYAYPGAA 167
Query: 225 VIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPF 284
+ G A + NY LY V + S + I + ++ G P
Sbjct: 168 KVGGLAGGDGAGRGAALFCNYQ--LYREGTVGVALSGN--------IVLETIVAQGCRPI 217
Query: 285 GPELKAVSVKEHNADCSLLTARMEGYDGLLHG----------EEILEDIKEHIDDKYPY- 333
G + V+ E N +L E D + G E+++ + E + +
Sbjct: 218 GQPYR-VTEGERN----ILLKLEEQTDDIGSGGIERSPLEALRELVQTLSEQDRELAQHS 272
Query: 334 LYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSS 392
L++G++ L ++ + +LG + + I G ++PG F+ D+ T++
Sbjct: 273 LFVGLVSDEFKLTLEPGDFL-IRNLLGVDPKVGAIAIGDRVRPGQRIQFHLRDARTSAED 331
Query: 393 SIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDND 452
+LD R A+ GT G L+FSC R L +
Sbjct: 332 LEMLLD--RYQRAAEYSGT----------SSAGALMFSCLGRGEGLYGQSN--------- 370
Query: 453 VYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
F+S F R PL+G FC GEIG + G LH Y++V+
Sbjct: 371 --FDSRLFGRYLKNIPLSGFFCNGEIG----------------PVGGSTFLHGYTSVF 410
>gi|309810993|ref|ZP_07704791.1| CBS domain protein [Dermacoccus sp. Ellin185]
gi|308434957|gb|EFP58791.1| CBS domain protein [Dermacoccus sp. Ellin185]
Length = 441
Score = 45.4 bits (106), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 51/105 (48%), Gaps = 12/105 (11%)
Query: 239 GENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPFGPELKAVS--VKEH 296
GENS + G LYF D V + D+D + E+ D L F PE K V +KE
Sbjct: 238 GENSDDLVGLLYFKDVVRRTLTGDADAIAISEVMRD-------LAFVPESKPVDALLKEM 290
Query: 297 NAD---CSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPYLYIGV 338
D +++ G G++ E+I+E+I IDD+Y + GV
Sbjct: 291 QRDRVHFAVVIDEYGGTAGIVTMEDIVEEIVGEIDDEYDRVSPGV 335
>gi|428771682|ref|YP_007163472.1| hypothetical protein Cyan10605_3386 [Cyanobacterium aponinum PCC
10605]
gi|428685961|gb|AFZ55428.1| protein of unknown function DUF1745 [Cyanobacterium aponinum PCC
10605]
Length = 414
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 100/489 (20%), Positives = 181/489 (37%), Gaps = 111/489 (22%)
Query: 51 SALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQL----ITARLGSRTP 106
+ALS++PSL A+ EV+DK+ +SKL L I++ S P
Sbjct: 5 NALSINPSLEKAIDEVVDKI----------------KSKLDGNADLGIIFISSAFASDYP 48
Query: 107 VITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKV------- 159
+ + + L + + D + + + L V +P +++
Sbjct: 49 RLMPILLDKLPLPCVIGCGGGGIVGMKNDYQPQEIEGNPALSLTVASLPDVEITPFHIIP 108
Query: 160 ETIPLLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYG 219
+ +P L S SM+ + PN I+L + I +L +D+
Sbjct: 109 DDLPDLDSPPSAWCSMI------------GVEIQKEPNFILLSDPFSAKINELLEGLDFA 156
Query: 220 LPEETVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPE--------- 270
P I G A++ ++ L+++D +SD E
Sbjct: 157 YPGAVKIGGLAST--------STMGVGSGLFYYDG------SNSDQLFRTEGTVGIALTG 202
Query: 271 -IQFDITMSTGVLPFGPELKAVSVKEHNA--DCSLLTARMEGYDGLLHGEEILEDIKEHI 327
IQ + ++ G P G E V+ + N + S +++ LL E++ +
Sbjct: 203 NIQVESIVAQGCRPIG-ETYQVTKGQRNVILEMSDREGKIDSPLNLLR--ELINSLSGED 259
Query: 328 DDKYPY-LYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSD 385
+ Y L++G+ L+ G+ ++ + ++G + ++ I G I+ G F+ D
Sbjct: 260 QELAQYALFMGIARDEFKLELGAGDFL-IRNLVGVDPKYGAIAVGDKIRTGQRIKFHLRD 318
Query: 386 SDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGD 445
+ AS+ ++ L N S TIG L+FSC R L +
Sbjct: 319 A-KASADDLETLLATYYNNKQSLDQTIG------------ALMFSCLGRGEGLYGKPN-- 363
Query: 446 DDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHH 505
F+S F + P+AG FC GEIG ++G LH
Sbjct: 364 ---------FDSQLFLDYVTDIPIAGFFCNGEIG----------------PVAGNTFLHG 398
Query: 506 YSTVYLVMS 514
Y++V+ + S
Sbjct: 399 YTSVFGIFS 407
>gi|427724044|ref|YP_007071321.1| hypothetical protein Lepto7376_2196 [Leptolyngbya sp. PCC 7376]
gi|427355764|gb|AFY38487.1| protein of unknown function DUF1745 [Leptolyngbya sp. PCC 7376]
Length = 417
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 101/493 (20%), Positives = 181/493 (36%), Gaps = 124/493 (25%)
Query: 51 SALSLSPSLHVAVSEVLDKVLSEPIRPHFA----IASVGMQSKLAATHQLITARLGSRTP 106
+ALS SL A++EV+D+ I+P + V + S + + + L P
Sbjct: 9 NALSTRASLEGAIAEVVDQ-----IKPKLTGTADLGIVFISSAFTSEYSRVVPLLTEMLP 63
Query: 107 VITNAVTG---IIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIP 163
+ G IIG D ++N + + + L V ++P + ++
Sbjct: 64 MKVLVGCGGASIIGTD--------------DNNQPQELEDRPALSLTVAHLPDVTIKP-- 107
Query: 164 LLRSKEEPEFSMVDKFLMDI-----RHYSA-SISGCSSPNGIILFGDQNIDIKPILAEMD 217
F ++DK + D+ R S S+ P+ II + +I +LA +D
Sbjct: 108 ---------FQLIDKDIPDLDSSPDRWTSIFSVESSEEPDFIIFADPFSSNINDLLAGLD 158
Query: 218 YGLPEETVIVGDATS----------CFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSN 267
+ P + G A+S C++ G + G + VA+ +
Sbjct: 159 FAYPNAVKVGGLASSGGMGRTNGLFCYVQDQGRSPMFREGTV----GVAITGN------- 207
Query: 268 VPEIQFDITMSTGVLPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHI 327
I+ D ++ G P G E+ VS E N + L+ E ++L+ + +
Sbjct: 208 ---IKIDAIVAQGCRPIG-EVYQVSECERNI-ITELSLESENETTTKSPLKMLQSLIASL 262
Query: 328 DD-----KYPYLYIGVIHQRGSLQ-----FGSRSYMSLYEVLGAEDQFFIVNGVGIKPGD 377
DD L+IG+ + F R+ + + +GA + G I+PG
Sbjct: 263 DDDDQVLAQDSLFIGIAMDAFKQKLIHGDFLVRNLLGVDPKVGA-----MAIGDRIRPGQ 317
Query: 378 SFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVP 437
F+ D++T++ +L R ++ +E FG L+FSC R
Sbjct: 318 RIQFHLRDAETSAEDLTVLLKQYREQESNF-------------QEPFGVLMFSCMGRGKG 364
Query: 438 LSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSI 497
L ++ F++ P + G FC GEIG +
Sbjct: 365 LY-----------GELNFDANKLASYLPNPNIGGFFCNGEIG----------------PV 397
Query: 498 SGRCLLHHYSTVY 510
R LH Y++V+
Sbjct: 398 GDRTFLHGYTSVF 410
>gi|119484398|ref|ZP_01619015.1| hypothetical protein L8106_01732 [Lyngbya sp. PCC 8106]
gi|119457872|gb|EAW38995.1| hypothetical protein L8106_01732 [Lyngbya sp. PCC 8106]
Length = 422
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 92/447 (20%), Positives = 175/447 (39%), Gaps = 70/447 (15%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSE-PIRPHFAIASVGMQSKLAATHQLITARLGSRTP 106
+ +ALS PSL A+ EV++++ P P+F + V + S A+ + + L R
Sbjct: 10 QWVNALSKRPSLEAAIDEVVEQIQQALPASPNFGL--VFISSAFASEYSRLMPLLQER-- 65
Query: 107 VITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLLR 166
+ L A + + + + + + L + ++PG++V L
Sbjct: 66 ---------LNLSAIIGCGGGGIVGVNSNRQAEEIEGEPALSLSLAHLPGVQVHPF-HLT 115
Query: 167 SKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETVI 226
++ P+ + +D+ + P I+L I +L +DY P +
Sbjct: 116 AEGLPDLDSPPQAWIDL----IGVDPSQQPQFILLVDPMYNRINDLLQGLDYAYPGSVKV 171
Query: 227 VGDATSCFLFKTG--ENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPF 284
G A+ T +SQ+Y G + V + + I + ++ G P
Sbjct: 172 GGLASGGLNNITAVFYDSQHYQG-----EVVGMALCGN--------IVLETIVAQGCRPV 218
Query: 285 GPELKAVSVKEHNADCSL---LTARMEGYDGLLHGEEILEDIKEHIDD--------KYPY 333
G + V+ E N +L L++ Y G LE ++E + D
Sbjct: 219 GLPYR-VTKGERNIILALEEELSSDELSYGGSGESLSPLEALQELVQDLSEPDRQLAQNA 277
Query: 334 LYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSS 392
L++GV+ + ++ + +LG + + I G ++PG F+ D++T+
Sbjct: 278 LFVGVVRDEFKQKLEPGDFL-IRNLLGVDPRIGAIAIGDRVRPGQRIQFHLRDAETSKED 336
Query: 393 SIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDND 452
+L + + +S + +AN G L+FSC R L + +
Sbjct: 337 LQALLTRYQQQHPNS-------SQLSANA---GALMFSCLGRGEGLYGEPN--------- 377
Query: 453 VYFESYPFCRNFPETPLAGIFCYGEIG 479
F+S F R + P++G+FC GEIG
Sbjct: 378 --FDSSQFQR-YLNIPVSGLFCNGEIG 401
>gi|428310675|ref|YP_007121652.1| hypothetical protein Mic7113_2444 [Microcoleus sp. PCC 7113]
gi|428252287|gb|AFZ18246.1| hypothetical protein Mic7113_2444 [Microcoleus sp. PCC 7113]
Length = 415
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 101/475 (21%), Positives = 174/475 (36%), Gaps = 84/475 (17%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSR--T 105
+ A+ALS PSL A++EV+D+ + + + V + S A+ + + A L +
Sbjct: 6 QWANALSTCPSLEAAIAEVVDRA-QKSLTATPDLGLVFISSAYASEYSRLMALLQKQLSV 64
Query: 106 PVITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPLL 165
PVI I E E++ E L H + + ++PG + +P L
Sbjct: 65 PVIIGCGGSGIIGMNSQGETQEIEG---EPALSLSLAHLPDVQVQAFHIPG---DGLPDL 118
Query: 166 RSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEETV 225
S ++ + H+ IL D I +L +D+ P +
Sbjct: 119 DSPPNTWVDLIGVPPQEEPHF-------------ILLADPFSAINDLLQGLDFAYPGSSK 165
Query: 226 IVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPFG 285
+ G ++ + NY LY V + S + I + ++ G P G
Sbjct: 166 VGGMTSAGAMGVQSALFCNYK--LYREGTVGVALSGN--------IVLETIVAQGCRPIG 215
Query: 286 PELKAVSVKEHNADCSLLTA-RMEGYDGLLHGE---EILEDIKEHIDDK-----YPYLYI 336
+ V+ E N L T +E + E+L D+ + + ++ L++
Sbjct: 216 -QTYQVTACERNIVLELATQDNVEKTSSEIESRRPLEVLRDLLQSLSEEDRQLAQHSLFV 274
Query: 337 GVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
GV Q G ++ + +LG + + I G ++PG F+ D+ T+
Sbjct: 275 GVARDEFKQQLGHGDFL-IRNLLGVDPRVGAIAIGDRVRPGQRIQFHLRDARTSE----- 328
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
+ L LL + T+ E G L+F+C R L D F
Sbjct: 329 --EDLELLLHRY------QKDTSGTTEAAGALMFACLGRGKGLYGKPD-----------F 369
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
+S F R L+G FC GEIG + G LH Y++V+
Sbjct: 370 DSQLFGRYLSNIQLSGFFCNGEIG----------------PVGGSTFLHGYTSVF 408
>gi|37520395|ref|NP_923772.1| hypothetical protein gll0826 [Gloeobacter violaceus PCC 7421]
gi|35211388|dbj|BAC88767.1| gll0826 [Gloeobacter violaceus PCC 7421]
Length = 407
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 71/347 (20%), Positives = 135/347 (38%), Gaps = 72/347 (20%)
Query: 147 IVLIVGYVPGLKVETIPLLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQN 206
+ L+ ++PG+++ L+++E P+ K ++ SA ++P+ +++ +
Sbjct: 93 LSLLAAHLPGVELRPF-WLKAEELPDLDSSPKTWENLMEISAG----AAPHFVLMVDGSS 147
Query: 207 IDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNS 266
+ ++ +D+ P+ + G A+ + G+N + AV +V + D
Sbjct: 148 FPVDVLIGGLDFAFPKAIKVGGLASGGN--RPGQNRLFFGDQAVGSGAVGVVLAGD---- 201
Query: 267 NVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEH 326
I + ++ G P G + ++ E N L ++G L ++L+ + +
Sbjct: 202 ----IAVEAAVAQGCRPVGETFQ-ITRAEGN-----LLWELDGQPAL----QVLQTVLQQ 247
Query: 327 IDDKYPYLYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQFFIVNGVGI------------- 373
+D+ L R +L G R MS + + F + N +G+
Sbjct: 248 LDENDQRLA------RNALFVGVR--MSEFHSGSEQGDFLVRNLMGVDSRTGGLAVGEWL 299
Query: 374 KPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFS 433
+ G + F+ D+ T+ VL RL ++ G L+FSC
Sbjct: 300 RTGQTVRFHLRDAATSRDDLQLVLQRHRL--------------EHSGAPPAGALLFSCLG 345
Query: 434 RSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPE-TPLAGIFCYGEIG 479
R L + D D S F + E PLAG FC GEIG
Sbjct: 346 RGESLYGEPDVD-----------STLFAQVLGEGVPLAGFFCNGEIG 381
>gi|334121517|ref|ZP_08495584.1| domain of unknown function DUF1745 [Microcoleus vaginatus FGP-2]
gi|333454957|gb|EGK83627.1| domain of unknown function DUF1745 [Microcoleus vaginatus FGP-2]
Length = 417
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 43/178 (24%), Positives = 69/178 (38%), Gaps = 41/178 (23%)
Query: 334 LYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSS 392
L++G++ L ++ + +LG + + I G ++PG F+ D+ T++
Sbjct: 273 LFVGLVSDEFKLTLEPGDFL-IRNLLGVDPKVGAIAIGDRVRPGQRIQFHLRDARTSAED 331
Query: 393 SIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDND 452
+LD R A+ GT G L+FSC R L + +
Sbjct: 332 LEMLLD--RYQRAAEYTGT----------SSAGALMFSCLGRGEGLYGESN--------- 370
Query: 453 VYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
F+S F R PL G FC GEIG + G LH Y++V+
Sbjct: 371 --FDSRLFGRYLKNIPLGGFFCNGEIG----------------PVGGSTFLHGYTSVF 410
>gi|322788471|gb|EFZ14140.1| hypothetical protein SINV_16325 [Solenopsis invicta]
Length = 167
Score = 42.4 bits (98), Expect = 0.66, Method: Composition-based stats.
Identities = 28/92 (30%), Positives = 44/92 (47%), Gaps = 13/92 (14%)
Query: 426 GLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGR--G 483
G +F C +R G + D+N + ES F + FP+ PLAG F YGE G+
Sbjct: 86 GFMFVCNAR---------GSNLYDEN--HIESTIFKKLFPKVPLAGCFGYGEFGKNTFDE 134
Query: 484 LTRLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
N EE + + +S+V+L+++Y
Sbjct: 135 TNEEKNSEEGQRPKRSKSWYNEFSSVFLILTY 166
>gi|443314762|ref|ZP_21044296.1| hypothetical protein Lep6406DRAFT_00044800 [Leptolyngbya sp. PCC
6406]
gi|442785639|gb|ELR95445.1| hypothetical protein Lep6406DRAFT_00044800 [Leptolyngbya sp. PCC
6406]
Length = 418
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 99/480 (20%), Positives = 177/480 (36%), Gaps = 99/480 (20%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSE-PIRPHFAIA--SVGMQSKLAATHQLITARLGSR 104
+ SA+S SL AV EV+ + + P + S S+ + L+ L
Sbjct: 9 QWVSAISTQVSLEAAVEEVVARTQRQLTAAPDLGLVFISSAFASEFSRVLPLLQTALEIP 68
Query: 105 TPVITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL 164
T +I + G++G+D+ D + + + + L + +PG+ V++ L
Sbjct: 69 T-LIGCSGGGVVGMDSEEDAL--------------EVEDAPALSLCLATLPGVTVQSFYL 113
Query: 165 ----LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGL 220
L + P + V+ +D R P I+L + +L +D+
Sbjct: 114 KETDLPDLDSPPDAWVEAVGVDPR---------VDPQFIVLADPFFSGVNDLLQGLDFAY 164
Query: 221 PEETVIVGDATSCFLFKTGENSQNYNGAL-----YFFDAVALVFSRDSDNSNVPEIQFDI 275
P+ + G +G + Q + G Y V L S + +
Sbjct: 165 PKAVKVGG-------LASGSSWQRHGGLFCDRTYYTEGVVGLALSG--------HVVLEA 209
Query: 276 TMSTGVLPFGPELKAVSVKEHNADCSLLTARMEGYD---GLLHGEEILEDIKEHIDDKYP 332
++ G P G + V+ E N L + D L + +L D+ +
Sbjct: 210 IVAQGCRPIGQHYR-VAAAERNILLELEPPDILDRDPQPALASLKAMLNDLSDRDRQLAQ 268
Query: 333 Y-LYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTAS 390
+ L+IG+ + G ++ + +LG + + I G ++PG F+ D++T S
Sbjct: 269 HSLFIGIAQDGFKITLGPGDFL-IRNLLGVDPRVGAIAIGDRVRPGQRIQFHLRDAET-S 326
Query: 391 SSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDD 450
+ ++VL L + T V G L+FSC R L ++ +
Sbjct: 327 AEDLEVL-----LQQYAHQSTASAPV--------GALMFSCVGRGHGLYQEAN------- 366
Query: 451 NDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
F+S F + PL+G FC GEIG I G LH Y++V+
Sbjct: 367 ----FDSRLFHHHLGPVPLSGFFCNGEIG----------------PIGGTTFLHGYTSVF 406
>gi|16332081|ref|NP_442809.1| hypothetical protein sll0524 [Synechocystis sp. PCC 6803]
gi|383323824|ref|YP_005384678.1| hypothetical protein SYNGTI_2916 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383326993|ref|YP_005387847.1| hypothetical protein SYNPCCP_2915 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383492877|ref|YP_005410554.1| hypothetical protein SYNPCCN_2915 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384438145|ref|YP_005652870.1| hypothetical protein SYNGTS_2917 [Synechocystis sp. PCC 6803]
gi|451816233|ref|YP_007452685.1| hypothetical protein MYO_129450 [Synechocystis sp. PCC 6803]
gi|1001390|dbj|BAA10880.1| sll0524 [Synechocystis sp. PCC 6803]
gi|339275178|dbj|BAK51665.1| hypothetical protein SYNGTS_2917 [Synechocystis sp. PCC 6803]
gi|359273144|dbj|BAL30663.1| hypothetical protein SYNGTI_2916 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359276314|dbj|BAL33832.1| hypothetical protein SYNPCCN_2915 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359279484|dbj|BAL37001.1| hypothetical protein SYNPCCP_2915 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451782202|gb|AGF53171.1| hypothetical protein MYO_129450 [Synechocystis sp. PCC 6803]
Length = 447
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 104/493 (21%), Positives = 178/493 (36%), Gaps = 111/493 (22%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV 107
+ +ALS PSL A+ E AIA VG Q + +++ S
Sbjct: 35 RWVNALSTRPSLERALDE--------------AIAKVGGQGQWDLAIIFLSSSFASDAAR 80
Query: 108 ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKV-------E 160
+ V + + + + + + + + + + L V +PG+ V E
Sbjct: 81 LMPLVQEKLSVPNLIGCLGGGIIGMENNATVQEVEGEVALSLTVAQLPGVTVTPFYVHGE 140
Query: 161 TIPLLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGL 220
+P L + P S VD +D S + IIL I LA +D+
Sbjct: 141 GMPDL---DAPPQSWVDLMGVDP---------ASGADFIILADPMTGGITDFLAGLDFAY 188
Query: 221 PEETVIVGDAT--------SCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQ 272
P+ T + G A+ S FL + + Y F VAL + I+
Sbjct: 189 PQATKVGGLASGENVAMGGSLFLQSSDHPAGRYGDG---FIGVALAGN----------IR 235
Query: 273 FDITMSTGVLPFGPELKAVSVKEHNADCSLLTARMEGYD---GLLHGEEILEDIKEHIDD 329
++ G P G E V+ + N + ++G D G + E L +++ I
Sbjct: 236 LGSVVAQGCRPIG-EPFVVNQGQRN-----IITEIQGKDADSGTVVLETPLASLQKLIPT 289
Query: 330 KYPY--------LYIGVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFI 380
P L++GV L ++ + +LG + Q I G ++ G
Sbjct: 290 LSPKDQELAQSSLFVGVASDEFKLTLQPGDFL-IRNLLGVDPRQGAIAIGDRVRKGQRLQ 348
Query: 381 FYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSE 440
F+ D +T++ D L++L G ++ + A G L+FSC R L
Sbjct: 349 FHLRDRETSA-------DDLQILLRHYTEGEANQSSSTA----IGALMFSCLGRGYGLYG 397
Query: 441 DDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGR 500
+ F+S F + FP L G FC GEIG+ + +
Sbjct: 398 TPN-----------FDSQMFGQYFPGVALGGFFCNGEIGQ----------------VGAQ 430
Query: 501 CLLHHYSTVYLVM 513
LH Y++ + ++
Sbjct: 431 TFLHGYTSAFAIV 443
>gi|119510716|ref|ZP_01629844.1| hypothetical protein N9414_22128 [Nodularia spumigena CCY9414]
gi|119464670|gb|EAW45579.1| hypothetical protein N9414_22128 [Nodularia spumigena CCY9414]
Length = 412
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 102/478 (21%), Positives = 176/478 (36%), Gaps = 95/478 (19%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV 107
+ A+ALS PSL AV++V+++ +S P + V + S A+ + + L + V
Sbjct: 6 QWANALSTRPSLEAAVTDVVEQAVSSLTVPA-DLGLVFISSAFASEYSRLLPLLAEKLAV 64
Query: 108 ---ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL 164
I + G+IG A + + + I L +GY+PG+ ++ +
Sbjct: 65 PVMIGCSGGGVIGTTATGEP--------------QELESQPAISLTLGYLPGVNIQVFHV 110
Query: 165 LRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEET 224
L S+E P+ +D+ S +P I+L + I +L +D+ P
Sbjct: 111 L-SEELPDLDSSPDAWIDVLGVEPS----PAPQFILLSSAFSSRINDLLQGLDFAYPGSV 165
Query: 225 VIVGDATSCF------LFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMS 278
VI G A+S LF + G LY V L S + I + ++
Sbjct: 166 VIGGQASSGGSSSQISLFCHDPEGGLHQG-LYREGTVGLALSGN--------IVLETIVA 216
Query: 279 TGVLPFGPELKAVSVKEHNADCSL-----LTARMEGYDGLLHGEEILEDIKEHIDDKYPY 333
G P G ++ V+ E N L L + L E +L
Sbjct: 217 QGCRPIGQPMQ-VTKAERNIILELDEKVPLVVLRDLISSLSEEERMLAQQS--------- 266
Query: 334 LYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSS 392
L++G++ + L ++ + +LG + I G +PG F D++ ++
Sbjct: 267 LFVGIVMDQFKLSLQQGDFL-IRNILGVDPSVGAIAIGDVARPGQRLQFQLRDAEASA-- 323
Query: 393 SIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDND 452
+ L LL +N + +FSC R L +
Sbjct: 324 -----EDLELLLERY------QNQQGSQPSAAAAFMFSCVGRGQGLYGKPN--------- 363
Query: 453 VYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
F+S F R + P+ G FC GEIG + G +H Y++V+
Sbjct: 364 --FDSELFRRYIHDIPIGGFFCNGEIG----------------PVGGSTFVHGYTSVF 403
>gi|33864822|ref|NP_896381.1| hypothetical protein SYNW0286 [Synechococcus sp. WH 8102]
gi|33632345|emb|CAE06801.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 423
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 71/308 (23%), Positives = 121/308 (39%), Gaps = 72/308 (23%)
Query: 199 IILFGDQNIDIKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQNYNGALYFFDAV--- 255
I+L + +I +++ MDY P I G A +G+L F D V
Sbjct: 150 ILLIDPTSSNINDLISGMDYAFPGAEKIGGIACP---------HNAPHGSLLFDDRVVTG 200
Query: 256 ALVFSRDSDNSNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCSLLTARMEGYDGLLH 315
A++ S D + D ++ G P GP ++++ + L + E D +
Sbjct: 201 AVICSIGGD------WRLDSVVAQGCRPIGP---VFAIEQVQRNVVLELSDGERRDTPVA 251
Query: 316 G-EEILEDIKEHIDDKYPY-LYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQFFIVNGVGI 373
+ IL D+ E D+ + L++G+ +R +LQ ++ + A F + N +G+
Sbjct: 252 CLQRILADLSEEERDQVRHSLFLGI--ERRNLQ------LTPNRLDAAGGAFLVRNLIGV 303
Query: 374 -------------KPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNAN 420
+PG + F ++D + + + L LL +S+ +A
Sbjct: 304 DPNNGAVAVADRVRPGMNVQFQLREADASRN------EALSLLRSST---------ESAG 348
Query: 421 KEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRN-FPETPLAGIFCYGEIG 479
GL+ +C R L DGD + R P+ P+AG FC GEIG
Sbjct: 349 SAPLFGLLMACLGRGQGLFGQPDGDVN------------LGRTVMPDLPMAGAFCNGEIG 396
Query: 480 RGRGLTRL 487
G T L
Sbjct: 397 PVAGSTHL 404
>gi|87301924|ref|ZP_01084758.1| hypothetical protein WH5701_01325 [Synechococcus sp. WH 5701]
gi|87283492|gb|EAQ75447.1| hypothetical protein WH5701_01325 [Synechococcus sp. WH 5701]
Length = 436
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 33/74 (44%), Gaps = 11/74 (14%)
Query: 414 RNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIF 473
RN A++ GL+F+C R L + DGD + FP P+AG F
Sbjct: 348 RNQAKASEPPLAGLLFACLGRGKGLYGEADGDVRIAQGE-----------FPGLPMAGAF 396
Query: 474 CYGEIGRGRGLTRL 487
C GEIG G T L
Sbjct: 397 CNGEIGPVGGSTYL 410
>gi|427711276|ref|YP_007059900.1| hypothetical protein Syn6312_0103 [Synechococcus sp. PCC 6312]
gi|427375405|gb|AFY59357.1| hypothetical protein Syn6312_0103 [Synechococcus sp. PCC 6312]
Length = 409
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 93/439 (21%), Positives = 163/439 (37%), Gaps = 65/439 (14%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSE--PIRPHFAIA--SVGMQSKLAATHQLITARLGS 103
K A+ALS SL AV EV D+ ++P I S S+ + L+ L
Sbjct: 2 KWATALSTHFSLEQAVKEVTDQARERLGNLQPDLGIVFISQAFASEFSRVIPLLQTTL-- 59
Query: 104 RTPVITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIP 163
PV+ G + + D+ EV+ T G+ L + ++P +K++
Sbjct: 60 HLPVLIGCGGGGVIGRNYRDQPQEVEET-------------AGLSLTLAHLPDVKIKPFS 106
Query: 164 LLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEE 223
L+ + I + P ++L + I +L +DY P+
Sbjct: 107 LVADDLPDPDDPPQAWWRLI-----GVDPEQQPEFLLLADTSSARINDLLRGLDYAYPQA 161
Query: 224 TVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLP 283
+ G S + G Y + V L + + I+ D ++ G P
Sbjct: 162 VKVGGITGSNQGW--GRTGLFYQTKVQREGTVGLALTGN--------IRIDPIVAQGCRP 211
Query: 284 FGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY--LYIGVIHQ 341
GP L ++ E N SL + + L E ++E + E +D + L+IG++H
Sbjct: 212 IGP-LYRIASGEKNIILSLEDDQAKVAPPLDMLENLVERLPE-VDRELARSALFIGLVHN 269
Query: 342 RGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGL 400
G ++ + ++G + Q + G ++ G F+ D+ +++ L
Sbjct: 270 EFKPNLGPGDFL-IRNLIGIDPQTGALALGDRVRVGQRIQFHLRDAAASAAD-------L 321
Query: 401 RLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPF 460
+ + C G G ++FSC R V L D D D
Sbjct: 322 QAWLETYCHSYPGPPPE-------GTMMFSCLGRGVNLYGQPDFDATMVD---------- 364
Query: 461 CRNFPETPLAGIFCYGEIG 479
+ P P++G FC+GEIG
Sbjct: 365 -KFLPNIPVSGFFCFGEIG 382
>gi|317968653|ref|ZP_07970043.1| hypothetical protein SCB02_03858 [Synechococcus sp. CB0205]
Length = 424
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 30/69 (43%), Gaps = 13/69 (18%)
Query: 420 NKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRN-FPETPLAGIFCYGEI 478
N E L+F+C R L +GD D CR FP P++G FC GEI
Sbjct: 353 NAEPMAALLFACLGRGEGLYGSPNGDVDG------------CRQQFPAVPISGAFCNGEI 400
Query: 479 GRGRGLTRL 487
G G T L
Sbjct: 401 GPVAGATHL 409
>gi|47217857|emb|CAG02350.1| unnamed protein product [Tetraodon nigroviridis]
Length = 374
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%), Gaps = 15/90 (16%)
Query: 426 GLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLT 485
GL+F+C R G + +DV ES F + FP PL G+F GEI G
Sbjct: 299 GLMFACVGR---------GRSYYNQSDV--ESSAFRKVFPTVPLFGLFGNGEI----GCD 343
Query: 486 RLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
R++ + C + L H Y+TV ++ +
Sbjct: 344 RIVKDDYTLCDSDRKSLQHQYTTVMTLVHF 373
>gi|47203449|emb|CAF91713.1| unnamed protein product [Tetraodon nigroviridis]
Length = 185
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%), Gaps = 15/90 (16%)
Query: 426 GLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLT 485
GL+F+C R G + +DV ES F + FP PL G+F GEI G
Sbjct: 110 GLMFACVGR---------GRSYYNQSDV--ESSAFRKVFPTVPLFGLFGNGEI----GCD 154
Query: 486 RLINQEEEDCSISGRCLLHHYSTVYLVMSY 515
R++ + C + L H Y+TV ++ +
Sbjct: 155 RIVKDDYTLCDSDRKSLQHQYTTVMTLVHF 184
>gi|427710471|ref|YP_007052848.1| hypothetical protein Nos7107_5188 [Nostoc sp. PCC 7107]
gi|427362976|gb|AFY45698.1| protein of unknown function DUF1745 [Nostoc sp. PCC 7107]
Length = 402
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 98/475 (20%), Positives = 177/475 (37%), Gaps = 97/475 (20%)
Query: 48 KLASALSLSPSLHVAVSEVLDKVLSEPIRPHFAIASVGMQSKLAATHQLITARLGSRTPV 107
+ A+ALS PSL AV+EV+ K S + + V + S A+ + + L + V
Sbjct: 6 QWANALSTRPSLEAAVTEVVQKTTSL-LTASADLGLVFISSAFASEYSRLLPLLAEKLSV 64
Query: 108 ---ITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIPL 164
I + G+IG D + + + L + ++PG+ ++ +
Sbjct: 65 PVMIGCSGGGVIGTTT--------------DGQTQEIEAEAALSLTLAHLPGVNLQVSHI 110
Query: 165 LRSKEEPEF-SMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEE 223
L ++E P+ S D ++ I + ++P I+L + I +L +D+ P
Sbjct: 111 L-AEELPDLDSSPDAWVNLI-----GVQPSATPQFILLSSSFSSGINELLQGLDFAYPGS 164
Query: 224 TVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLP 283
++ G A+ + + N LY V L S + I + ++ G P
Sbjct: 165 VIVGGQASGGMGGR---QTLFCNDRLYREGTVGLALSGN--------IILETIVAQGCRP 213
Query: 284 FGPELKAVSVK-----EHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPYLYIGV 338
G ++ + E N L+ R L++ L D + + ++ + +
Sbjct: 214 IGQPMQITKAERNIILELNEQVPLVVLR-----DLINS---LSDQERTLAQHSLFVGVAM 265
Query: 339 IHQRGSLQFGS---RSYMSLYEVLGAEDQFFIVNGVGIKPGDSFIFYHSDSDTASSSSID 395
+ SLQ G RS + + GA I G ++PG F+ D++ ++
Sbjct: 266 DEFKLSLQQGDFLIRSILGVDPAGGA-----IAIGDRVRPGQRLQFHLRDAEASAEDLEF 320
Query: 396 VLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYF 455
+L+ N ++ L+FSC R L + F
Sbjct: 321 LLEKYH-------------NQEASSSSAIAALMFSCVGRGEGLYGKPN-----------F 356
Query: 456 ESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
+S F R F P+ G FC GEIG + G LH Y++V+
Sbjct: 357 DSSLFRRYFQNIPIGGFFCGGEIG----------------PVGGNTFLHGYTSVF 395
>gi|148243314|ref|YP_001228471.1| hypothetical protein SynRCC307_2215 [Synechococcus sp. RCC307]
gi|147851624|emb|CAK29118.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 413
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 74/313 (23%), Positives = 118/313 (37%), Gaps = 84/313 (26%)
Query: 199 IILFGDQNID-IKPILAEMDYGLPEETVIVGDATSCFLFKTGENSQNYNGALYFFD---- 253
++L+ D +I I +++ +DY P + G A G +S N+ G+L D
Sbjct: 146 MLLWLDPSISGINDLISGLDYAYPAMAKLGGIA--------GNHSANH-GSLLLADQVHH 196
Query: 254 -AVALVFSRDSDNSNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCSLLTARMEGYDG 312
AV V S + V ++ G P GP + K + + + D
Sbjct: 197 SAVGCVISGAWTLAPV--------VAQGCRPIGPIFEVEQAKRN------VVLELRQGDE 242
Query: 313 LLHGEEILEDIKEHIDDKYPYLYIGVIHQRGSLQFG-SRSYMSLYEVLGAEDQFFIVNGV 371
L + L+ + E + D L R SL G +R+ SL + + + F + N +
Sbjct: 243 LANPVTALQQVIETLPDPDKELL------RHSLFVGLARNSFSLQQDR-SSNPFLVRNLM 295
Query: 372 GIKP-------------GDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTN 418
G+ P G F D T+ LDGL + SC
Sbjct: 296 GVDPRHGAMAVADSLQVGQRLQFQLRDGATSRQE----LDGLLAASEQSC---------- 341
Query: 419 ANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESY---PFCR-NFPETPLAGIFC 474
+ L+F+C R +Y E++ CR +FPE P++G+FC
Sbjct: 342 -QQPPVAALLFACLGRG---------------QGLYGEAHVDTGLCRKHFPELPISGLFC 385
Query: 475 YGEIGRGRGLTRL 487
GEIG G T+L
Sbjct: 386 NGEIGPVDGSTQL 398
>gi|186681216|ref|YP_001864412.1| hypothetical protein Npun_F0717 [Nostoc punctiforme PCC 73102]
gi|186463668|gb|ACC79469.1| domain of unknown function DUF1745 [Nostoc punctiforme PCC 73102]
Length = 411
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 96/478 (20%), Positives = 182/478 (38%), Gaps = 95/478 (19%)
Query: 48 KLASALSLSPSLHVAVSEVLDK---VLSEPIRPHFAIASVGMQSKLAATHQLITARLGSR 104
+ A+ALS SL AV++V+++ +L+ P S S+ + L+ +L
Sbjct: 6 QWANALSTRHSLEAAVTDVVERAVSLLTAPADLGLVFISSAFTSEYSRLLPLLAEKLS-- 63
Query: 105 TPVITN-AVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETIP 163
PV+ + G+IG + + + + + I L + ++PG+KV+
Sbjct: 64 VPVLIGCSGGGVIGTTVNGE--------------IQELEAEPAISLTLAHLPGVKVQVFH 109
Query: 164 LLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPEE 223
++ ++E P+ +D+ AS +P I+L I +L +D+ P
Sbjct: 110 VV-AEELPDLDSSPDAWVDLIGVPAS----PTPQFILLSSSFASGINDLLQGLDFAYPGS 164
Query: 224 TVIVGDAT------SCFLF---KTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFD 274
++ G A+ LF GE Q+ LY V + + + I +
Sbjct: 165 VIVGGQASGGGMGGRVALFCNESDGEECQS----LYREGTVGIALTGN--------IVLE 212
Query: 275 ITMSTGVLPFGPELKAVSVKEHNADCSLLTARMEGYDGLLHGEEILEDIKEHIDDKYPY- 333
++ G P G L+ V+ + N + ++ L+ +++ + EH +
Sbjct: 213 TIVAQGCRPIGKPLQ-VTKADRN-----IILELDEQVPLVVLRDLIASLSEHERTLAQHS 266
Query: 334 LYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTASSS 392
L++GV L ++ + +LG + I G ++PG F+ D+ ++
Sbjct: 267 LFVGVAMDEFKLALQQGDFL-IRGILGVDPTAGAIAIGDRVRPGQRLQFHLRDAQASA-- 323
Query: 393 SIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDND 452
+ L LL S + + L+F+C R L +
Sbjct: 324 -----EDLELLLQSY------QTQRESEPSAVAALMFACLGRGEGLYGKPN--------- 363
Query: 453 VYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
F+S F R + P+ G FC GEIG + GR LH Y++ +
Sbjct: 364 --FDSELFRRYLSDIPVGGFFCGGEIG----------------PVGGRTFLHAYTSAF 403
>gi|86606541|ref|YP_475304.1| hypothetical protein CYA_1894 [Synechococcus sp. JA-3-3Ab]
gi|86555083|gb|ABD00041.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 446
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 75/338 (22%), Positives = 125/338 (36%), Gaps = 77/338 (22%)
Query: 194 SSPNGIILFGDQNIDIKPILAEMDYGLPEETVIVGDATSC------FLFKTGENSQNYNG 247
S P+ ++L + I +L +D+ P + G A+ LF +
Sbjct: 158 SKPHFLLLADGFSSRISELLQGLDFAYPGAVKVGGLASGGRGPRGNALFLLDARTPTPRR 217
Query: 248 ALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCSLLTARM 307
LY V L S + + D ++ G P G L+ V+ E N SL
Sbjct: 218 ELYREGTVGLALSGN--------VVLDAVVAQGCRPIGDPLR-VTEAEGNVILSL----- 263
Query: 308 EGYDGLLHGEEILEDIKEHI---DDKYPY--LYIGVIHQRGSLQFGSRSYMSLYEVLGAE 362
EG L +L+D+ E + D + L+IG++ + S ++ + +LG +
Sbjct: 264 EGRPPL----AVLQDLAERLSPSDQRLARQALFIGLLMDEFKSEPTSGDFL-IRVILGID 318
Query: 363 DQF-FIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNV----- 416
+ I G ++PG + F+ D+ T++ + LR + C RN+
Sbjct: 319 PRVGAIAIGDRVRPGQTVQFHLRDAQTSA-------EDLRWALSRYCAE---RNLQQSYP 368
Query: 417 ----TNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGI 472
+ + G L+FSC R L + F+S F E PL G
Sbjct: 369 AERSSQPKPDPCGALMFSCLGRGKGLYGTPN-----------FDSQRFRELLGELPLGGF 417
Query: 473 FCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVY 510
FC GEIG + G LH Y++ +
Sbjct: 418 FCNGEIG----------------PVGGSTFLHGYTSCF 439
>gi|297835084|ref|XP_002885424.1| hypothetical protein ARALYDRAFT_898555 [Arabidopsis lyrata subsp.
lyrata]
gi|297331264|gb|EFH61683.1| hypothetical protein ARALYDRAFT_898555 [Arabidopsis lyrata subsp.
lyrata]
Length = 404
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 11 EDLLQHILSRLPALSFASAACVNKSWNKVCN--QILSKPKLASALSLSPSL 59
EDLL ILS++PALS A +K WN + ++L+K A+AL S SL
Sbjct: 8 EDLLVEILSKVPALSLARLRSTSKKWNALIKYERLLAKKHSANALMHSSSL 58
>gi|332708603|ref|ZP_08428577.1| hypothetical protein LYNGBM3L_26790 [Moorea producens 3L]
gi|332352700|gb|EGJ32266.1| hypothetical protein LYNGBM3L_26790 [Moorea producens 3L]
Length = 422
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 69/298 (23%), Positives = 113/298 (37%), Gaps = 55/298 (18%)
Query: 200 ILFGDQ-NIDIKPILAEMDYGLPEETVIVGDATS-CFLFKTG----ENSQNYNGALYFFD 253
ILF D + I +L +DY P + G A+ + K G ++ +LY
Sbjct: 140 ILFSDPLSSKINDLLQGLDYAYPGSAKVGGQASGGSMMVKNGLFCLKDQTQLAKSLYHEG 199
Query: 254 AVALVFSRDSDNSNVPEIQFDITMSTGVLPFGPELKAVSVKEHNADCSLLTARMEG---- 309
V + S + + + ++ G P G E V+ + N L TA +G
Sbjct: 200 TVGVALSGN--------VVLETIVAQGCRPIG-ETYQVAKADQNILLEL-TALDQGKITS 249
Query: 310 -----YDGLLHGEEILEDIKEHIDDKYPY--LYIGVIHQRGSLQFGSRSYMSLYEVLGAE 362
+ L+ E+++ + E D K L++GV Q G ++ + +LG +
Sbjct: 250 GEPASHPPLMVLRELIQSMDE-ADRKLAQHSLFVGVARDEFKQQLGQGDFL-IRNLLGVD 307
Query: 363 DQF-FIVNGVGIKPGDSFIFYHSDSDTASSSSIDVLDGLRLLNASSCCGTIGRNVTNANK 421
+ I I+PG F+ D+ T+ +L+ + TN +
Sbjct: 308 PRIGAIAIADRIRPGQRIQFHLRDAQTSEEDLALLLEDYQ-------------KQTNTTQ 354
Query: 422 EVFGGLIFSCFSRSVPLSEDDDGDDDEDDNDVYFESYPFCRNFPETPLAGIFCYGEIG 479
G L+FSC R L + F+S F R + LAG FC GEIG
Sbjct: 355 AA-GALMFSCLGRGEGLYGKPN-----------FDSQLFHRYIKDIQLAGFFCNGEIG 400
>gi|407960279|dbj|BAM53519.1| hypothetical protein BEST7613_4588 [Synechocystis sp. PCC 6803]
Length = 398
Score = 38.9 bits (89), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 41/181 (22%), Positives = 71/181 (39%), Gaps = 40/181 (22%)
Query: 334 LYIGVIHQRGSLQFGSRSYMSLYEVLGAE-DQFFIVNGVGIKPGDSFIFYHSDSDTASSS 392
L++GV L ++ + +LG + Q I G ++ G F+ D +T++
Sbjct: 253 LFVGVASDEFKLTLQPGDFL-IRNLLGVDPRQGAIAIGDRVRKGQRLQFHLRDRETSA-- 309
Query: 393 SIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDEDDND 452
D L++L G ++ + A G L+FSC R L +
Sbjct: 310 -----DDLQILLRHYTEGEANQSSSTA----IGALMFSCLGRGYGLYGTPN--------- 351
Query: 453 VYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTVYLV 512
F+S F + FP L G FC GEIG+ + + LH Y++ + +
Sbjct: 352 --FDSQMFGQYFPGVALGGFFCNGEIGQ----------------VGAQTFLHGYTSAFAI 393
Query: 513 M 513
+
Sbjct: 394 V 394
>gi|358346280|ref|XP_003637197.1| F-box/kelch-repeat protein [Medicago truncatula]
gi|355503132|gb|AES84335.1| F-box/kelch-repeat protein [Medicago truncatula]
Length = 399
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 24/37 (64%)
Query: 11 EDLLQHILSRLPALSFASAACVNKSWNKVCNQILSKP 47
+DLL+ ILS LP +S A+CV K W+ V + LS P
Sbjct: 46 DDLLERILSYLPIVSIFRASCVCKRWHTVFERFLSNP 82
>gi|428227046|ref|YP_007111143.1| hypothetical protein GEI7407_3624 [Geitlerinema sp. PCC 7407]
gi|427986947|gb|AFY68091.1| protein of unknown function DUF1745 [Geitlerinema sp. PCC 7407]
Length = 420
Score = 38.5 bits (88), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 100/480 (20%), Positives = 175/480 (36%), Gaps = 96/480 (20%)
Query: 48 KLASALSLSPSLHVAVSEVLDKV---LSEPIRPHFAI--ASVGMQSKLAATHQLITARLG 102
+ +ALS PSL AV EV+ +V L P P AI S S L+ +L
Sbjct: 10 QWVNALSTRPSLEAAVDEVVTQVHHKLQGP--PDLAILFISATFSSDFPRLMPLLREKLD 67
Query: 103 SRTPVITNAVTGIIGLDAHLDEICEVKWTLLEDNLLNDFDHCYGIVLIVGYVPGLKVETI 162
+I + GI+G DA + + + L V +PG+ +
Sbjct: 68 VPH-LIGCSGGGIVGCDAA--------------GQTQEIEETPALALTVAKLPGVSIRAF 112
Query: 163 PLLRSKEEPEFSMVDKFLMDIRHYSASISGCSSPNGIILFGDQNIDIKPILAEMDYGLPE 222
L + E P+ +D+ + + P ++L + I +L +D+
Sbjct: 113 -HLSADEIPDLDSPPSAWIDL----IGVPPEAQPKFVLLADPFSAKINDLLQGLDFAYAG 167
Query: 223 ETVIVGDATSCFLFKTGENSQNYNGALYFFDAVALVFSRDSDNSNVPEIQFDITMSTGVL 282
+ G A G NY L+ +V L S + I + ++ G
Sbjct: 168 SPTVGGLAGGGSGTNNGLFC-NYQ--LHREGSVGLALSGN--------IVLETIVAQGCR 216
Query: 283 PFGPELKAVSVKEH-------NADCSLLTARMEGYDGLLHGEEILEDIKEHIDDK----- 330
P G + V+ H D LL++ E E+L++I + + D+
Sbjct: 217 PIGKPYRVAEVERHIMLKLSEQDDADLLSSPKERTP-----LEVLQEIIQDLSDEDRALA 271
Query: 331 YPYLYIGVIHQRGSLQFGSRSYMSLYEVLGAEDQF-FIVNGVGIKPGDSFIFYHSDSDTA 389
L++G+ ++ + +LG + + I G ++PG F+ D++T
Sbjct: 272 QHSLFVGLARNEFKRNLEQGDFL-IRNLLGVDPRVGAIAIGDRVRPGQRIQFHLRDANT- 329
Query: 390 SSSSIDVLDGLRLLNASSCCGTIGRNVTNANKEVFGGLIFSCFSRSVPLSEDDDGDDDED 449
S+ +++L L+ ++ + + + G L+FSC R L E +
Sbjct: 330 SAEDLEIL--LQRYQST---------LDDQATQPVGALMFSCMGRGEGLYEQPN------ 372
Query: 450 DNDVYFESYPFCRNFPETPLAGIFCYGEIGRGRGLTRLINQEEEDCSISGRCLLHHYSTV 509
F+S R L+G FC GEIG + G LH Y++V
Sbjct: 373 -----FDSQLVHRYLGPLALSGFFCNGEIG----------------PVGGGTFLHGYTSV 411
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.138 0.411
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,512,956,081
Number of Sequences: 23463169
Number of extensions: 377465056
Number of successful extensions: 1303341
Number of sequences better than 100.0: 97
Number of HSP's better than 100.0 without gapping: 22
Number of HSP's successfully gapped in prelim test: 75
Number of HSP's that attempted gapping in prelim test: 1303068
Number of HSP's gapped (non-prelim): 115
length of query: 522
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 375
effective length of database: 8,910,109,524
effective search space: 3341291071500
effective search space used: 3341291071500
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)