BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 003012
(857 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255573883|ref|XP_002527860.1| conserved hypothetical protein [Ricinus communis]
gi|223532711|gb|EEF34491.1| conserved hypothetical protein [Ricinus communis]
Length = 868
Score = 1365 bits (3533), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 682/866 (78%), Positives = 753/866 (86%), Gaps = 54/866 (6%)
Query: 25 GDNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADIN 84
G+ S +NKFR+REATDD LG P+IDE AL+NTQCP+NLELRWQTEVSSSIYA+PLIADIN
Sbjct: 24 GEESSKNKFREREATDDALGYPEIDETALLNTQCPRNLELRWQTEVSSSIYASPLIADIN 83
Query: 85 SDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATY 144
SDGKLDIVVPSF+HYLEVLEGSDGDKMPGWPAFHQS+VH+SPLLYDIDKDGVREIALATY
Sbjct: 84 SDGKLDIVVPSFVHYLEVLEGSDGDKMPGWPAFHQSTVHASPLLYDIDKDGVREIALATY 143
Query: 145 NGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIV---QESEAA 201
NGEVLFFRVSGYMMT+KL +PRR+VRKDW+VGL+ DPVDRS PDVHDD +V E ++
Sbjct: 144 NGEVLFFRVSGYMMTEKLVVPRRRVRKDWHVGLNPDPVDRSQPDVHDDQLVFEAMEKKSE 203
Query: 202 RMKSMLE----------TKKSTPETNATVTTSTESNPAPATVSNPDVKKVNESLVNVSNP 251
+ +++E T STPE N+ ++ STES P +V+ P
Sbjct: 204 SLDNIIEYCYSVETTGSTHGSTPEKNSAISASTEST-IPQSVTVP--------------- 247
Query: 252 SEERKVNESHTEMNIKLPTSVDNSSTTTVSGG--------------TNSSENGTNTGRRL 297
VNE+ T+ IKLP ++DNSS T+S G TN++E GT TGRRL
Sbjct: 248 -----VNENQTDPIIKLPINMDNSSKDTMSAGLNNPENGNNTESVGTNTTEKGTKTGRRL 302
Query: 298 LEDNNSKGSQEG------NDKEDVPVATAENDQALDENADSSFELFRDTDELADEYNYDY 351
LED+ +K SQEG N+ E+V AT END+ L+ +ADSSFELFRDTDELADEY+YDY
Sbjct: 303 LEDDKTKDSQEGSLESGENNSENVHEATVENDEGLEADADSSFELFRDTDELADEYSYDY 362
Query: 352 DDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHE 411
DDYVDD MWGDEEWTEE+HEK+EDYVN+DSHIL TPVIADIDNDGVSE+I+AVSYFFDHE
Sbjct: 363 DDYVDDTMWGDEEWTEEKHEKLEDYVNIDSHILCTPVIADIDNDGVSEIIVAVSYFFDHE 422
Query: 412 YYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVV 471
YYDNPEHLKELGGIDIGKYVAG+IVVFNLDTKQVKWT +LDLSTD ++FRAYIYSSPTVV
Sbjct: 423 YYDNPEHLKELGGIDIGKYVAGSIVVFNLDTKQVKWTKELDLSTDTSTFRAYIYSSPTVV 482
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTD 531
DLDGDGNLDILVGTSFGLFYVLDHHG IREKFPLEMAEIQGAVVAADINDDGKIELVTTD
Sbjct: 483 DLDGDGNLDILVGTSFGLFYVLDHHGNIREKFPLEMAEIQGAVVAADINDDGKIELVTTD 542
Query: 532 THGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSK 591
THGNVAAWT++GK IWE+HLKSLV+QGP++GDVDGDG +DVVVPT+SGNIYVLSGKDGS
Sbjct: 543 THGNVAAWTSQGKEIWERHLKSLVSQGPTVGDVDGDGRTDVVVPTISGNIYVLSGKDGSI 602
Query: 592 VRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGET 651
VRPYPYRTHGRVMNQVLLVDL+KRGEKSKGL++VTTSFDGYLYLIDGPTSCADVVDIGET
Sbjct: 603 VRPYPYRTHGRVMNQVLLVDLSKRGEKSKGLSLVTTSFDGYLYLIDGPTSCADVVDIGET 662
Query: 652 SYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAG 711
SYS VLADNVDGGDDLDLIVTTMNGNVFCFSTP PHHPLKAWRS NQGRNNVA RYNR G
Sbjct: 663 SYSTVLADNVDGGDDLDLIVTTMNGNVFCFSTPVPHHPLKAWRSANQGRNNVANRYNREG 722
Query: 712 IYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQ 771
+Y+T SRAFRDEEG+NFW+EIEIVD+YR+PSGSQAPY V+TTLLVPGNYQGERRIKQ++
Sbjct: 723 VYITPSSRAFRDEEGKNFWLEIEIVDKYRYPSGSQAPYKVSTTLLVPGNYQGERRIKQNE 782
Query: 772 IFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPML 831
F R GKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPML
Sbjct: 783 TFDRPGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPML 842
Query: 832 GMFGVLVILRPQEAMPLPSFSRNTDL 857
GMFGVLVILRPQEAMPLPSFSRNTDL
Sbjct: 843 GMFGVLVILRPQEAMPLPSFSRNTDL 868
>gi|297740250|emb|CBI30432.3| unnamed protein product [Vitis vinifera]
Length = 847
Score = 1365 bits (3532), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 686/860 (79%), Positives = 741/860 (86%), Gaps = 27/860 (3%)
Query: 8 ANCVLLICLLLFNSAR----GGDNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPKNLE 63
A V ICLLL + S +NKFR+REA+DD LG P +DEDAL+NT+CP+NLE
Sbjct: 5 AARVFFICLLLCTRSSFIQSDQQESNKNKFREREASDDALGYPNLDEDALLNTRCPRNLE 64
Query: 64 LRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVH 123
LRWQTEVSSSIYATPLIADINSDGKLDIVVPSF+HYLEVLEGSDGDKMPGWPAFHQS+VH
Sbjct: 65 LRWQTEVSSSIYATPLIADINSDGKLDIVVPSFVHYLEVLEGSDGDKMPGWPAFHQSTVH 124
Query: 124 SSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVD 183
SSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLE+PRR+VRKDWYVGL+ DPVD
Sbjct: 125 SSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEVPRRRVRKDWYVGLNPDPVD 184
Query: 184 RSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATVSNPDVKKVNE 243
RSHPDV DD +VQE AA MK + ST +N +V TS ES+ A
Sbjct: 185 RSHPDVKDDQLVQE--AADMKLFSQMNGSTSGSNTSVLTSAESHLGTA------------ 230
Query: 244 SLVNVSNPSEERKVNESHTEMNIKLPTSVDNSSTTTVSGGTNSSENGTNTGRRLLEDNNS 303
N SN K N + TE NIKLPTS NSS S T+++ENGTNTGRRLLEDN+S
Sbjct: 231 ---NASNLENNGKTNGNETETNIKLPTSTHNSSEDIGSVRTSNAENGTNTGRRLLEDNDS 287
Query: 304 KGSQEG------NDKEDVPVATAENDQALDENADSSFELFRDTDELADEYNYDYDDYVDD 357
KGSQ G N D +ND+AL+ ADSSFELFR+ DELADEY+YDYDDYVD+
Sbjct: 288 KGSQGGHSQSKDNSSGDAQAVNVQNDEALEAEADSSFELFRENDELADEYSYDYDDYVDE 347
Query: 358 AMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPE 417
+MWGDE WTE QHEK+EDYVN+DSHIL TPVIADIDNDGVSEM++AVSYFFDHEYYDN E
Sbjct: 348 SMWGDEGWTEGQHEKMEDYVNIDSHILCTPVIADIDNDGVSEMVVAVSYFFDHEYYDNQE 407
Query: 418 HLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDG 477
HLKELG IDIGKYVAGAIVVFNLDTKQVKWTT LDLSTD +FRAYIYSSPTVVDLDGDG
Sbjct: 408 HLKELGDIDIGKYVAGAIVVFNLDTKQVKWTTPLDLSTDAGNFRAYIYSSPTVVDLDGDG 467
Query: 478 NLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVA 537
NLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG VVAADINDDGKIELVT DTHGN+A
Sbjct: 468 NLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGGVVAADINDDGKIELVTADTHGNIA 527
Query: 538 AWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPY 597
AWTA+GK IW H+KSLV Q P+IGDVDGDGH+DVVVPTLSGNIYVL+GKDG +VRPYPY
Sbjct: 528 AWTAQGKEIWVTHVKSLVPQAPTIGDVDGDGHTDVVVPTLSGNIYVLNGKDGLQVRPYPY 587
Query: 598 RTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVL 657
RTHGRVMNQVLLVDL+KRGEK KGLT+VTTSFDGYLYLIDGPTSCADVVDIGETSYSMVL
Sbjct: 588 RTHGRVMNQVLLVDLSKRGEKKKGLTLVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVL 647
Query: 658 ADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHP 717
ADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRS NQGRNNVA R++R GIY++
Sbjct: 648 ADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSPNQGRNNVANRHSREGIYISQS 707
Query: 718 SRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRG 777
SRAFRDEEG++FWVEIEIVD+YRFPSGSQAPYNVTTTLLVPGNYQGERRIKQ+Q F G
Sbjct: 708 SRAFRDEEGKSFWVEIEIVDKYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQNQTFDCAG 767
Query: 778 KYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVL 837
K+RIKLPTVGVRTTGTVLVEMVDKNGLYFSD+FSLTFHM+YYKLLKWLLVLPML MFGVL
Sbjct: 768 KHRIKLPTVGVRTTGTVLVEMVDKNGLYFSDDFSLTFHMHYYKLLKWLLVLPMLAMFGVL 827
Query: 838 VILRPQEAMPLPSFSRNTDL 857
VILRPQEAMPLPSFSRNTDL
Sbjct: 828 VILRPQEAMPLPSFSRNTDL 847
>gi|225440608|ref|XP_002273795.1| PREDICTED: uncharacterized protein LOC100251006 [Vitis vinifera]
Length = 857
Score = 1363 bits (3529), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 686/868 (79%), Positives = 745/868 (85%), Gaps = 33/868 (3%)
Query: 8 ANCVLLICLLLFNSAR----GGDNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPKNLE 63
A V ICLLL + S +NKFR+REA+DD LG P +DEDAL+NT+CP+NLE
Sbjct: 5 AARVFFICLLLCTRSSFIQSDQQESNKNKFREREASDDALGYPNLDEDALLNTRCPRNLE 64
Query: 64 LRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVH 123
LRWQTEVSSSIYATPLIADINSDGKLDIVVPSF+HYLEVLEGSDGDKMPGWPAFHQS+VH
Sbjct: 65 LRWQTEVSSSIYATPLIADINSDGKLDIVVPSFVHYLEVLEGSDGDKMPGWPAFHQSTVH 124
Query: 124 SSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVD 183
SSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLE+PRR+VRKDWYVGL+ DPVD
Sbjct: 125 SSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEVPRRRVRKDWYVGLNPDPVD 184
Query: 184 RSHPDVHDDLIVQESEAARM-------KSMLETK-KSTPETNATVTTSTESNPAPATVSN 235
RSHPDV DD +VQE+ ++ KS+ E K +ST +N +V TS ES+ A
Sbjct: 185 RSHPDVKDDQLVQEAADMKLFSRKLINKSLQEVKTRSTSGSNTSVLTSAESHLGTA---- 240
Query: 236 PDVKKVNESLVNVSNPSEERKVNESHTEMNIKLPTSVDNSSTTTVSGGTNSSENGTNTGR 295
N SN K N + TE NIKLPTS NSS S T+++ENGTNTGR
Sbjct: 241 -----------NASNLENNGKTNGNETETNIKLPTSTHNSSEDIGSVRTSNAENGTNTGR 289
Query: 296 RLLEDNNSKGSQEG------NDKEDVPVATAENDQALDENADSSFELFRDTDELADEYNY 349
RLLEDN+SKGSQ G N D +ND+AL+ ADSSFELFR+ DELADEY+Y
Sbjct: 290 RLLEDNDSKGSQGGHSQSKDNSSGDAQAVNVQNDEALEAEADSSFELFRENDELADEYSY 349
Query: 350 DYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFD 409
DYDDYVD++MWGDE WTE QHEK+EDYVN+DSHIL TPVIADIDNDGVSEM++AVSYFFD
Sbjct: 350 DYDDYVDESMWGDEGWTEGQHEKMEDYVNIDSHILCTPVIADIDNDGVSEMVVAVSYFFD 409
Query: 410 HEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPT 469
HEYYDN EHLKELG IDIGKYVAGAIVVFNLDTKQVKWTT LDLSTD +FRAYIYSSPT
Sbjct: 410 HEYYDNQEHLKELGDIDIGKYVAGAIVVFNLDTKQVKWTTPLDLSTDAGNFRAYIYSSPT 469
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVT 529
VVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG VVAADINDDGKIELVT
Sbjct: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGGVVAADINDDGKIELVT 529
Query: 530 TDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDG 589
DTHGN+AAWTA+GK IW H+KSLV Q P+IGDVDGDGH+DVVVPTLSGNIYVL+GKDG
Sbjct: 530 ADTHGNIAAWTAQGKEIWVTHVKSLVPQAPTIGDVDGDGHTDVVVPTLSGNIYVLNGKDG 589
Query: 590 SKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIG 649
+VRPYPYRTHGRVMNQVLLVDL+KRGEK KGLT+VTTSFDGYLYLIDGPTSCADVVDIG
Sbjct: 590 LQVRPYPYRTHGRVMNQVLLVDLSKRGEKKKGLTLVTTSFDGYLYLIDGPTSCADVVDIG 649
Query: 650 ETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNR 709
ETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRS NQGRNNVA R++R
Sbjct: 650 ETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSPNQGRNNVANRHSR 709
Query: 710 AGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQ 769
GIY++ SRAFRDEEG++FWVEIEIVD+YRFPSGSQAPYNVTTTLLVPGNYQGERRIKQ
Sbjct: 710 EGIYISQSSRAFRDEEGKSFWVEIEIVDKYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQ 769
Query: 770 SQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLP 829
+Q F GK+RIKLPTVGVRTTGTVLVEMVDKNGLYFSD+FSLTFHM+YYKLLKWLLVLP
Sbjct: 770 NQTFDCAGKHRIKLPTVGVRTTGTVLVEMVDKNGLYFSDDFSLTFHMHYYKLLKWLLVLP 829
Query: 830 MLGMFGVLVILRPQEAMPLPSFSRNTDL 857
ML MFGVLVILRPQEAMPLPSFSRNTDL
Sbjct: 830 MLAMFGVLVILRPQEAMPLPSFSRNTDL 857
>gi|356506784|ref|XP_003522156.1| PREDICTED: uncharacterized protein LOC100805038 [Glycine max]
Length = 886
Score = 1297 bits (3356), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 647/860 (75%), Positives = 729/860 (84%), Gaps = 31/860 (3%)
Query: 26 DNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINS 85
D+S +N FR+REA+DD LG P+IDEDALVN++CPKNLELRWQTEVSSSIYA PLIADINS
Sbjct: 29 DSSRKNTFREREASDDSLGYPEIDEDALVNSKCPKNLELRWQTEVSSSIYANPLIADINS 88
Query: 86 DGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYN 145
DGKL+IVVPSF+HYLEVLEG+DGDKMPGWPAFHQS+VHSSPLLYDIDKDGVREIALATYN
Sbjct: 89 DGKLEIVVPSFVHYLEVLEGADGDKMPGWPAFHQSTVHSSPLLYDIDKDGVREIALATYN 148
Query: 146 GEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKS 205
GEVLFFRVSGYMM+DKLE+PRRKV K W+VGL DPVDRSHPDVHDD +VQ+ A S
Sbjct: 149 GEVLFFRVSGYMMSDKLEVPRRKVLKKWFVGLDPDPVDRSHPDVHDDQLVQD--ATIKNS 206
Query: 206 MLETKKSTPETNATVTTSTE--------SNPAPAT------------VSNPDV-KKVNES 244
M + S E ++ TSTE SNP P V NP+ KK+N S
Sbjct: 207 MSQMNGSRHEAKSSAATSTENHLETKNLSNPEPEKKINGSQVDESIKVPNPEPEKKINGS 266
Query: 245 ----LVNVSNPSEERKVNESHTEMNIKLPTSVDNSSTTTVSGGTNSSENGTNTGRRLLED 300
++ V NP E+K+N S + +IK+PT VDNSS S T ++N T+TGRRLLED
Sbjct: 267 QIDEIIKVPNPEPEKKINGSQVDESIKVPTVVDNSSVNAGSLETVHADNKTSTGRRLLED 326
Query: 301 NNSKGSQEGND----KEDVPVATAENDQALDENADSSFELFRDTDELADEYNYDYDDYVD 356
NNSKG+ +G+ KE + AT END+ LD +ADSSFELFR++++LADEY+YDYDDYVD
Sbjct: 327 NNSKGAVQGSSESKVKEGIHAATVENDEGLDADADSSFELFRNSEDLADEYSYDYDDYVD 386
Query: 357 DAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNP 416
+ MWGDEEWTE +HEK+EDYVNVDSHIL TPVIADIDNDGVSEMI+AVSYFFDHEYYDN
Sbjct: 387 ETMWGDEEWTEVKHEKLEDYVNVDSHILCTPVIADIDNDGVSEMIVAVSYFFDHEYYDNQ 446
Query: 417 EHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGD 476
EH KELG IDIGKYVAG IVVFNLDTKQVKWT +LDLSTD ++FRAYIYSSPTVVDLDGD
Sbjct: 447 EHRKELGDIDIGKYVAGGIVVFNLDTKQVKWTAELDLSTDTSNFRAYIYSSPTVVDLDGD 506
Query: 477 GNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNV 536
GNLDILVGTS+GLFYVLDHHG +R+KFPLEMAEIQGAVVAAD+NDDGKIELVT DTHGNV
Sbjct: 507 GNLDILVGTSYGLFYVLDHHGNVRQKFPLEMAEIQGAVVAADVNDDGKIELVTADTHGNV 566
Query: 537 AAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYP 596
A WT +G IWE+HLKSL+ QGP++GDVDGDGH+++VVPTLSG I+VL G+DGS + YP
Sbjct: 567 AVWTPKGDLIWEKHLKSLIPQGPTVGDVDGDGHTELVVPTLSGKIHVLDGRDGSSIGRYP 626
Query: 597 YRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMV 656
Y+THGR+MNQVLLVDL+K EK KGLTIVTTSFDGYLYLIDGPT CAD VDIGETSYSMV
Sbjct: 627 YQTHGRIMNQVLLVDLSKDKEKKKGLTIVTTSFDGYLYLIDGPTGCADAVDIGETSYSMV 686
Query: 657 LADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTH 716
LADNVDGGDDLDLIVTTMNGNVFCFSTP+PHHPLKAWR +QGRNN+A RY+R GIYVTH
Sbjct: 687 LADNVDGGDDLDLIVTTMNGNVFCFSTPSPHHPLKAWRLPSQGRNNLANRYSREGIYVTH 746
Query: 717 PSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARR 776
PSRAFRDEEG++FWVEIEIVD YR+PSG Q PY VTT+LLVPGNYQGER IK + + +
Sbjct: 747 PSRAFRDEEGKSFWVEIEIVDNYRYPSGHQGPYKVTTSLLVPGNYQGERTIKLNNTYGQP 806
Query: 777 GKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGV 836
GKYRIKLPTV VRT GTVLVEMVD+NGLYFSD+FSLTFHM+YYKLLKWLLVLPMLGMFGV
Sbjct: 807 GKYRIKLPTVSVRTMGTVLVEMVDRNGLYFSDDFSLTFHMHYYKLLKWLLVLPMLGMFGV 866
Query: 837 LVILRPQEAMPLPSFSRNTD 856
LVILRPQ +MPLPSFSRN D
Sbjct: 867 LVILRPQGSMPLPSFSRNND 886
>gi|357468639|ref|XP_003604604.1| Defective in exine formation [Medicago truncatula]
gi|355505659|gb|AES86801.1| Defective in exine formation [Medicago truncatula]
Length = 890
Score = 1280 bits (3313), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 647/891 (72%), Positives = 740/891 (83%), Gaps = 42/891 (4%)
Query: 3 SSTTSANCVLLICLLLFNSARGGDNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPKNL 62
SST +LL+C + D + N FR+REATDD LG P+IDEDALVN++CP NL
Sbjct: 5 SSTNLLLFLLLLCTFSSSVFAEEDAKKNNTFREREATDDALGYPEIDEDALVNSKCPMNL 64
Query: 63 ELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSV 122
ELRWQTEVSSS+YA PLIADINSDGKLDIVVPSF+HYLEVLEG+DGDKMPGWPAFHQS+V
Sbjct: 65 ELRWQTEVSSSVYANPLIADINSDGKLDIVVPSFVHYLEVLEGTDGDKMPGWPAFHQSTV 124
Query: 123 HSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPV 182
HSSPLLYDIDKDGVREIALATYNGEVLFFRVSGY+M+DKLE+PRRKV K+W+VGL+ DPV
Sbjct: 125 HSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYIMSDKLEVPRRKVLKNWHVGLNKDPV 184
Query: 183 DRSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATVSNPDV-KKV 241
DR+HPDVHDD +VQE+ A SM + S E N++ +TSTES+P +VSNP+ KK+
Sbjct: 185 DRTHPDVHDDQLVQEATIA--NSMSQMNGSRHEVNSSASTSTESHPDTKSVSNPEPEKKI 242
Query: 242 NES---------------LVNVSNPSEERKVNESHTEMNIKLPTSVDNSSTTTVSGGTNS 286
N S NVSNP E+KVNES +E IK+PT NSS + S T +
Sbjct: 243 NGSQSEESINTSTESHPDTKNVSNPEPEKKVNESQSEEGIKMPT---NSSVSAGSVETVN 299
Query: 287 SENGTNTGRRLLEDNNSKGSQ----EGNDKEDVPVATAENDQALDENADSSFELFRDTDE 342
++N T+TGRRLLEDNN KG++ E KE+V AT EN++ L+ +ADSSFELFR++D+
Sbjct: 300 ADNKTSTGRRLLEDNNLKGAEQVGSESKGKEEVHAATVENEEGLEADADSSFELFRNSDD 359
Query: 343 LADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMII 402
LADEYNYDYDDYVD+++WGDEEW E +HEK+EDYVNVDSHILSTPVIADIDNDGV EM++
Sbjct: 360 LADEYNYDYDDYVDESLWGDEEWIEGKHEKLEDYVNVDSHILSTPVIADIDNDGVMEMVV 419
Query: 403 AVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRA 462
AVSYFFD EYYDN EH+KELG IDIGKYVAG IVVFNLDTKQVKWT +LD+STD A+FRA
Sbjct: 420 AVSYFFDQEYYDNQEHMKELGDIDIGKYVAGGIVVFNLDTKQVKWTAELDMSTDTANFRA 479
Query: 463 YIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDD 522
Y+YSSPTVVDLDGDG LDILVGTS+GLFYVLDHHGK+REKFPLEMAEIQ VVAADINDD
Sbjct: 480 YVYSSPTVVDLDGDGYLDILVGTSYGLFYVLDHHGKVREKFPLEMAEIQAGVVAADINDD 539
Query: 523 GKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQ-----------------GPSIGDVD 565
GKIELVT DTHGNV AWT +G IWE+HLKSL+ P+IGD+D
Sbjct: 540 GKIELVTADTHGNVVAWTPKGDMIWEKHLKSLIPHVMYYLNLPWHVNECSMIAPTIGDID 599
Query: 566 GDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIV 625
GDG +++VVPTLSG I+VL G+DGS + YP+ THGR+MNQ+LLVDL+K+ EK KGLT+V
Sbjct: 600 GDGRTELVVPTLSGKIHVLDGRDGSPIGRYPFITHGRIMNQILLVDLSKQKEKKKGLTLV 659
Query: 626 TTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPA 685
T+SFDGYLYLIDGPT CADVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFSTP+
Sbjct: 660 TSSFDGYLYLIDGPTGCADVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFSTPS 719
Query: 686 PHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGS 745
PHHPLKAWR NQGRNNVA RY R GIYVTHPSRAFRDEEG++F+VEIEIVD YR+PSG
Sbjct: 720 PHHPLKAWRLPNQGRNNVANRYGREGIYVTHPSRAFRDEEGKSFFVEIEIVDNYRYPSGH 779
Query: 746 QAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLY 805
Q PY+VTT+LLVPGNYQGER IKQ+Q + + GK+RIKLPTVGVRTTGTVLVEMVDKNGLY
Sbjct: 780 QGPYHVTTSLLVPGNYQGERTIKQNQTYYQPGKHRIKLPTVGVRTTGTVLVEMVDKNGLY 839
Query: 806 FSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTD 856
FSDEFSLTFHM+YYKLLKWLLVLPMLGMFGVLVILRPQ +PLPSFSRN D
Sbjct: 840 FSDEFSLTFHMHYYKLLKWLLVLPMLGMFGVLVILRPQGPVPLPSFSRNND 890
>gi|297833660|ref|XP_002884712.1| hypothetical protein ARALYDRAFT_897057 [Arabidopsis lyrata subsp.
lyrata]
gi|297330552|gb|EFH60971.1| hypothetical protein ARALYDRAFT_897057 [Arabidopsis lyrata subsp.
lyrata]
Length = 897
Score = 1276 bits (3301), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 638/877 (72%), Positives = 725/877 (82%), Gaps = 53/877 (6%)
Query: 30 QNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKL 89
+NKFR+R+ATDD LG P+IDED+L+NTQCPK LELRWQTEV+SS+YATPLIADINSDGKL
Sbjct: 25 ENKFRERKATDDDLGYPEIDEDSLLNTQCPKKLELRWQTEVTSSVYATPLIADINSDGKL 84
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVL 149
DIVVPSF+HYLEVLEG+DGDKMPGWPAFHQS+VHSSPLL+DIDKDGVREIALATYNGEVL
Sbjct: 85 DIVVPSFVHYLEVLEGADGDKMPGWPAFHQSNVHSSPLLFDIDKDGVREIALATYNGEVL 144
Query: 150 FFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKSMLET 209
FFRVSG++M+DKLE+PRRKV K+W+VGL+ DPVDRSHPDVHDD++ E EA MKS
Sbjct: 145 FFRVSGFLMSDKLEVPRRKVHKNWHVGLNPDPVDRSHPDVHDDVL--EEEAMAMKSSATR 202
Query: 210 K-------------------------------------------KSTPETNATVTTSTES 226
K K TPE + + + +
Sbjct: 203 KSECSNHNTKCYTLDVQRVHGGDSNVSSQEDQKRLENNQTEAIVKPTPELHNSSMGAGAN 262
Query: 227 NPAPATVSNPDVKKVNESLVNVSNPSEERKVNESHTEMNIKLPTSVDNSSTTT-VSGGTN 285
N + + +K+N ++ +N ++ K++E E IKL TS NSS + +G ++
Sbjct: 263 NSSANVTTAGSTEKLNGNVT--TNEVDQSKISEDKNETVIKLNTSTGNSSESLGTTGNSS 320
Query: 286 SSENGTNTGRRLLEDNNSKGSQEG-----NDKEDVPVATAENDQALDENADSSFELFRDT 340
++E T +GRRLLE++ SK S +G ++KE V +AT END L+ +ADSSFEL R+
Sbjct: 321 TTETVTKSGRRLLEEDGSKESVDGHSDNKDNKEGVRMATVENDGGLEADADSSFELLREN 380
Query: 341 DELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEM 400
DELADEY+YDYDDYVD+ MWGDEEW E QHE EDYVN+D+HIL TPVIADID DGV EM
Sbjct: 381 DELADEYSYDYDDYVDEKMWGDEEWVEGQHENSEDYVNIDAHILCTPVIADIDKDGVQEM 440
Query: 401 IIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASF 460
I+AVSYFFD EYYDNPEHLKELGGIDI Y+A +IVVFNL+TKQVKW +LDLSTD A+F
Sbjct: 441 IVAVSYFFDPEYYDNPEHLKELGGIDIKNYIASSIVVFNLETKQVKWIKELDLSTDKANF 500
Query: 461 RAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIN 520
RAYIYSSPTVVDLDGDG LDILVGTSFGLFY +DH G IREKFPLEMAEIQGAVVAADIN
Sbjct: 501 RAYIYSSPTVVDLDGDGYLDILVGTSFGLFYAMDHRGNIREKFPLEMAEIQGAVVAADIN 560
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGN 580
DDGKIELVTTD+HGN+AAWT +G IWE HLKSLV QGPSIGDVDGDGH+DVVVPT SGN
Sbjct: 561 DDGKIELVTTDSHGNIAAWTTQGVEIWEAHLKSLVPQGPSIGDVDGDGHTDVVVPTSSGN 620
Query: 581 IYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPT 640
IYVLSGKDGS VRPYPYRTHGRVMNQ+LLVDL KRGEK KGLTIVTTSFDGYLYLIDGPT
Sbjct: 621 IYVLSGKDGSIVRPYPYRTHGRVMNQLLLVDLNKRGEKKKGLTIVTTSFDGYLYLIDGPT 680
Query: 641 SCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGR 700
SC DVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFSTP+P+HPLKAWRS +QGR
Sbjct: 681 SCTDVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFSTPSPYHPLKAWRSTDQGR 740
Query: 701 NNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGN 760
NN A RY+R G++V+H +R FRDEEG+NFW EIEIVD+YR+PSGSQAPYNVTTTLLVPGN
Sbjct: 741 NNKANRYDREGVFVSHSTRGFRDEEGKNFWAEIEIVDKYRYPSGSQAPYNVTTTLLVPGN 800
Query: 761 YQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYK 820
YQG+RRI QSQI+ R GKYRIKLPTVGVRTTGTV+VEMVD+NGL+FSDEFSLTFHMYYYK
Sbjct: 801 YQGDRRITQSQIYDRPGKYRIKLPTVGVRTTGTVMVEMVDRNGLHFSDEFSLTFHMYYYK 860
Query: 821 LLKWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTDL 857
LLKWLLVLPMLGMFG+LVILRPQEA+PLPSFSRNTDL
Sbjct: 861 LLKWLLVLPMLGMFGLLVILRPQEAVPLPSFSRNTDL 897
>gi|11138669|gb|AAG31444.1|AF257186_1 defective in exine formation [Arabidopsis thaliana]
Length = 896
Score = 1268 bits (3282), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 645/875 (73%), Positives = 721/875 (82%), Gaps = 50/875 (5%)
Query: 30 QNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKL 89
+NKFR+R+ATDD+LG P IDEDAL+NTQCPK LELRWQTEV+SS+YATPLIADINSDGKL
Sbjct: 25 ENKFRERKATDDELGYPDIDEDALLNTQCPKKLELRWQTEVTSSVYATPLIADINSDGKL 84
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVL 149
DIVVPSF+HYLEVLEG+DGDKMPGWPAFHQS+VHSSPLL+DIDKDGVREIALATYN EVL
Sbjct: 85 DIVVPSFVHYLEVLEGADGDKMPGWPAFHQSNVHSSPLLFDIDKDGVREIALATYNAEVL 144
Query: 150 FFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKS---- 205
FFRVSG++M+DKLE+PRRKV K+W+VGL+ DPVDRSHPDVHDD++ E EA MKS
Sbjct: 145 FFRVSGFLMSDKLEVPRRKVHKNWHVGLNPDPVDRSHPDVHDDVL--EEEAMAMKSSTTQ 202
Query: 206 ----------MLETKKSTPETNATVTTST-----ESNPAPATVSNPDVKKVNESL----- 245
+ K N+ V+T E+N A V P + N S+
Sbjct: 203 TNATTTTPNVTVSMTKEVHGANSYVSTQEDQKRPENNQTEAIVK-PTPELHNSSMDAGAN 261
Query: 246 ----------------VNVS-NPSEERKVNESHTEMNIKLPTSVDNSSTTT-VSGGTNSS 287
NV+ N ++ K++ E IKL TS NSS T SG ++++
Sbjct: 262 NLAANATTAGSRENLNRNVTTNEVDQSKISGDKNETVIKLNTSTGNSSETLGTSGNSSTA 321
Query: 288 ENGTNTGRRLLEDNNSKGSQEG-----NDKEDVPVATAENDQALDENADSSFELFRDTDE 342
E T +GRRLLE++ SK S + ++ E V +AT END L+ +ADSSFEL R+ DE
Sbjct: 322 ETVTKSGRRLLEEDGSKESVDSHSDSKDNSEGVRMATVENDGGLERDADSSFELLRENDE 381
Query: 343 LADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMII 402
LADEY+YDYDDYVD+ MWGDEEW E QHE EDYVN+D+HIL TPVIADID DGV EMI+
Sbjct: 382 LADEYSYDYDDYVDEKMWGDEEWVEGQHENSEDYVNIDAHILCTPVIADIDKDGVQEMIV 441
Query: 403 AVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRA 462
AVSYFFD EYYDNPEHLKELGGIDI Y+A +IVVFNLDTKQVKW +LDLSTD A+FRA
Sbjct: 442 AVSYFFDPEYYDNPEHLKELGGIDIKNYIASSIVVFNLDTKQVKWIKELDLSTDKANFRA 501
Query: 463 YIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDD 522
YIYSSPTVVDLDGDG LDILVGTSFGLFY +DH G IREKFPLEMAEIQGAVVAADINDD
Sbjct: 502 YIYSSPTVVDLDGDGYLDILVGTSFGLFYAMDHRGNIREKFPLEMAEIQGAVVAADINDD 561
Query: 523 GKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIY 582
GKIELVTTD+HGN+AAWT +G IWE HLKSLV QGPSIGDVDGDGH++VVVPT SGNIY
Sbjct: 562 GKIELVTTDSHGNIAAWTTQGVEIWEAHLKSLVPQGPSIGDVDGDGHTEVVVPTSSGNIY 621
Query: 583 VLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSC 642
VLSGKDGS VRPYPYRTHGRVMNQ+LLVDL KRGEK KGLTIVTTSFDGYLYLIDGPTSC
Sbjct: 622 VLSGKDGSIVRPYPYRTHGRVMNQLLLVDLNKRGEKKKGLTIVTTSFDGYLYLIDGPTSC 681
Query: 643 ADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNN 702
DVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFSTP+PHHPLKAWRS +QGRNN
Sbjct: 682 TDVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFSTPSPHHPLKAWRSSDQGRNN 741
Query: 703 VAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQ 762
A RY+R G++VTH +R FRDEEG+NFW EIEIVD+YR+PSGSQAPYNVTTTLLVPGNYQ
Sbjct: 742 KANRYDREGVFVTHSTRGFRDEEGKNFWAEIEIVDKYRYPSGSQAPYNVTTTLLVPGNYQ 801
Query: 763 GERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLL 822
GERRI QSQI+ R GKYRIKLPTVGVRTTGTV+VEM DKNGL+FSDEFSLTFHMYYYKLL
Sbjct: 802 GERRITQSQIYDRPGKYRIKLPTVGVRTTGTVMVEMADKNGLHFSDEFSLTFHMYYYKLL 861
Query: 823 KWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTDL 857
KWLLVLPMLGMFG+LVILRPQEA+PLPSFSRNTDL
Sbjct: 862 KWLLVLPMLGMFGLLVILRPQEAVPLPSFSRNTDL 896
>gi|18398367|ref|NP_566343.1| defective in exine formation protein (DEX1) [Arabidopsis thaliana]
gi|332641198|gb|AEE74719.1| defective in exine formation protein (DEX1) [Arabidopsis thaliana]
Length = 896
Score = 1268 bits (3281), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 645/875 (73%), Positives = 721/875 (82%), Gaps = 50/875 (5%)
Query: 30 QNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKL 89
+NKFR+R+ATDD+LG P IDEDAL+NTQCPK LELRWQTEV+SS+YATPLIADINSDGKL
Sbjct: 25 ENKFRERKATDDELGYPDIDEDALLNTQCPKKLELRWQTEVTSSVYATPLIADINSDGKL 84
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVL 149
DIVVPSF+HYLEVLEG+DGDKMPGWPAFHQS+VHSSPLL+DIDKDGVREIALATYN EVL
Sbjct: 85 DIVVPSFVHYLEVLEGADGDKMPGWPAFHQSNVHSSPLLFDIDKDGVREIALATYNAEVL 144
Query: 150 FFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKS---- 205
FFRVSG++M+DKLE+PRRKV K+W+VGL+ DPVDRSHPDVHDD++ E EA MKS
Sbjct: 145 FFRVSGFLMSDKLEVPRRKVHKNWHVGLNPDPVDRSHPDVHDDVL--EEEAMAMKSSTTQ 202
Query: 206 ----------MLETKKSTPETNATVTTST-----ESNPAPATVSNPDVKKVNESL----- 245
+ K N+ V+T E+N A V P + N S+
Sbjct: 203 TNATTTTPNVTVSMTKEVHGANSYVSTQEDQKRPENNQTEAIVK-PTPELHNSSMDAGAN 261
Query: 246 ----------------VNVS-NPSEERKVNESHTEMNIKLPTSVDNSSTTT-VSGGTNSS 287
NV+ N ++ K++ E IKL TS NSS T SG ++++
Sbjct: 262 NLAANATTAGSRENLNRNVTTNEVDQSKISGDKNETVIKLNTSTGNSSETLGTSGNSSTA 321
Query: 288 ENGTNTGRRLLEDNNSKGSQEG-----NDKEDVPVATAENDQALDENADSSFELFRDTDE 342
E T +GRRLLE++ SK S + ++ E V +AT END L+ +ADSSFEL R+ DE
Sbjct: 322 ETVTKSGRRLLEEDGSKESVDSHSDSKDNSEGVRMATVENDGGLEADADSSFELLRENDE 381
Query: 343 LADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMII 402
LADEY+YDYDDYVD+ MWGDEEW E QHE EDYVN+D+HIL TPVIADID DGV EMI+
Sbjct: 382 LADEYSYDYDDYVDEKMWGDEEWVEGQHENSEDYVNIDAHILCTPVIADIDKDGVQEMIV 441
Query: 403 AVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRA 462
AVSYFFD EYYDNPEHLKELGGIDI Y+A +IVVFNLDTKQVKW +LDLSTD A+FRA
Sbjct: 442 AVSYFFDPEYYDNPEHLKELGGIDIKNYIASSIVVFNLDTKQVKWIKELDLSTDKANFRA 501
Query: 463 YIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDD 522
YIYSSPTVVDLDGDG LDILVGTSFGLFY +DH G IREKFPLEMAEIQGAVVAADINDD
Sbjct: 502 YIYSSPTVVDLDGDGYLDILVGTSFGLFYAMDHRGNIREKFPLEMAEIQGAVVAADINDD 561
Query: 523 GKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIY 582
GKIELVTTD+HGN+AAWT +G IWE HLKSLV QGPSIGDVDGDGH++VVVPT SGNIY
Sbjct: 562 GKIELVTTDSHGNIAAWTTQGVEIWEAHLKSLVPQGPSIGDVDGDGHTEVVVPTSSGNIY 621
Query: 583 VLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSC 642
VLSGKDGS VRPYPYRTHGRVMNQ+LLVDL KRGEK KGLTIVTTSFDGYLYLIDGPTSC
Sbjct: 622 VLSGKDGSIVRPYPYRTHGRVMNQLLLVDLNKRGEKKKGLTIVTTSFDGYLYLIDGPTSC 681
Query: 643 ADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNN 702
DVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFSTP+PHHPLKAWRS +QGRNN
Sbjct: 682 TDVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFSTPSPHHPLKAWRSSDQGRNN 741
Query: 703 VAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQ 762
A RY+R G++VTH +R FRDEEG+NFW EIEIVD+YR+PSGSQAPYNVTTTLLVPGNYQ
Sbjct: 742 KANRYDREGVFVTHSTRGFRDEEGKNFWAEIEIVDKYRYPSGSQAPYNVTTTLLVPGNYQ 801
Query: 763 GERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLL 822
GERRI QSQI+ R GKYRIKLPTVGVRTTGTV+VEM DKNGL+FSDEFSLTFHMYYYKLL
Sbjct: 802 GERRITQSQIYDRPGKYRIKLPTVGVRTTGTVMVEMADKNGLHFSDEFSLTFHMYYYKLL 861
Query: 823 KWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTDL 857
KWLLVLPMLGMFG+LVILRPQEA+PLPSFSRNTDL
Sbjct: 862 KWLLVLPMLGMFGLLVILRPQEAVPLPSFSRNTDL 896
>gi|28973744|gb|AAO64188.1| unknown protein [Arabidopsis thaliana]
gi|110737094|dbj|BAF00499.1| defective in exine formation [Arabidopsis thaliana]
Length = 891
Score = 1268 bits (3280), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 645/875 (73%), Positives = 721/875 (82%), Gaps = 50/875 (5%)
Query: 30 QNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKL 89
+NKFR+R+ATDD+LG P IDEDAL+NTQCPK LELRWQTEV+SS+YATPLIADINSDGKL
Sbjct: 20 ENKFRERKATDDELGYPDIDEDALLNTQCPKKLELRWQTEVTSSVYATPLIADINSDGKL 79
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVL 149
DIVVPSF+HYLEVLEG+DGDKMPGWPAFHQS+VHSSPLL+DIDKDGVREIALATYN EVL
Sbjct: 80 DIVVPSFVHYLEVLEGADGDKMPGWPAFHQSNVHSSPLLFDIDKDGVREIALATYNAEVL 139
Query: 150 FFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKS---- 205
FFRVSG++M+DKLE+PRRKV K+W+VGL+ DPVDRSHPDVHDD++ E EA MKS
Sbjct: 140 FFRVSGFLMSDKLEVPRRKVHKNWHVGLNPDPVDRSHPDVHDDVL--EEEAMAMKSSTTQ 197
Query: 206 ----------MLETKKSTPETNATVTTST-----ESNPAPATVSNPDVKKVNESL----- 245
+ K N+ V+T E+N A V P + N S+
Sbjct: 198 TNATTTTPNVTVSMTKEVHGANSYVSTQEDQKRPENNQTEAIVK-PTPELHNSSMDAGAN 256
Query: 246 ----------------VNVS-NPSEERKVNESHTEMNIKLPTSVDNSSTTT-VSGGTNSS 287
NV+ N ++ K++ E IKL TS NSS T SG ++++
Sbjct: 257 NLAANATTAGSRENLNRNVTTNEVDQSKISGDKNETVIKLNTSTGNSSETLGTSGNSSTA 316
Query: 288 ENGTNTGRRLLEDNNSKGSQEG-----NDKEDVPVATAENDQALDENADSSFELFRDTDE 342
E T +GRRLLE++ SK S + ++ E V +AT END L+ +ADSSFEL R+ DE
Sbjct: 317 ETVTKSGRRLLEEDGSKESVDSHSDSKDNSEGVRMATVENDGGLEADADSSFELLRENDE 376
Query: 343 LADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMII 402
LADEY+YDYDDYVD+ MWGDEEW E QHE EDYVN+D+HIL TPVIADID DGV EMI+
Sbjct: 377 LADEYSYDYDDYVDEKMWGDEEWVEGQHENSEDYVNIDAHILCTPVIADIDKDGVQEMIV 436
Query: 403 AVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRA 462
AVSYFFD EYYDNPEHLKELGGIDI Y+A +IVVFNLDTKQVKW +LDLSTD A+FRA
Sbjct: 437 AVSYFFDPEYYDNPEHLKELGGIDIKNYIASSIVVFNLDTKQVKWIKELDLSTDKANFRA 496
Query: 463 YIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDD 522
YIYSSPTVVDLDGDG LDILVGTSFGLFY +DH G IREKFPLEMAEIQGAVVAADINDD
Sbjct: 497 YIYSSPTVVDLDGDGYLDILVGTSFGLFYAMDHRGNIREKFPLEMAEIQGAVVAADINDD 556
Query: 523 GKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIY 582
GKIELVTTD+HGN+AAWT +G IWE HLKSLV QGPSIGDVDGDGH++VVVPT SGNIY
Sbjct: 557 GKIELVTTDSHGNIAAWTTQGVEIWEAHLKSLVPQGPSIGDVDGDGHTEVVVPTSSGNIY 616
Query: 583 VLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSC 642
VLSGKDGS VRPYPYRTHGRVMNQ+LLVDL KRGEK KGLTIVTTSFDGYLYLIDGPTSC
Sbjct: 617 VLSGKDGSIVRPYPYRTHGRVMNQLLLVDLNKRGEKKKGLTIVTTSFDGYLYLIDGPTSC 676
Query: 643 ADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNN 702
DVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFSTP+PHHPLKAWRS +QGRNN
Sbjct: 677 TDVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFSTPSPHHPLKAWRSSDQGRNN 736
Query: 703 VAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQ 762
A RY+R G++VTH +R FRDEEG+NFW EIEIVD+YR+PSGSQAPYNVTTTLLVPGNYQ
Sbjct: 737 KANRYDREGVFVTHSTRGFRDEEGKNFWAEIEIVDKYRYPSGSQAPYNVTTTLLVPGNYQ 796
Query: 763 GERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLL 822
GERRI QSQI+ R GKYRIKLPTVGVRTTGTV+VEM DKNGL+FSDEFSLTFHMYYYKLL
Sbjct: 797 GERRITQSQIYDRPGKYRIKLPTVGVRTTGTVMVEMADKNGLHFSDEFSLTFHMYYYKLL 856
Query: 823 KWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTDL 857
KWLLVLPMLGMFG+LVILRPQEA+PLPSFSRNTDL
Sbjct: 857 KWLLVLPMLGMFGLLVILRPQEAVPLPSFSRNTDL 891
>gi|11138671|gb|AAG31445.1|AF257187_1 defective in exine formation [Arabidopsis thaliana]
Length = 896
Score = 1266 bits (3276), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 644/875 (73%), Positives = 720/875 (82%), Gaps = 50/875 (5%)
Query: 30 QNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKL 89
+NKFR+R+ATDD+LG P IDEDAL+NTQCPK LELRWQTEV+SS+YATPLIADINSDGKL
Sbjct: 25 ENKFRERKATDDELGYPDIDEDALLNTQCPKKLELRWQTEVTSSVYATPLIADINSDGKL 84
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVL 149
DIVVPSF+HYLEVLEG+DGDKMPGWPAFHQS+VHSSPLL+DIDKDGVREIALATYN EVL
Sbjct: 85 DIVVPSFVHYLEVLEGADGDKMPGWPAFHQSNVHSSPLLFDIDKDGVREIALATYNAEVL 144
Query: 150 FFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKS---- 205
FFRVSG++M+DKLE+PRRKV K+W+VGL+ DPVDRSHPDVHDD++ E EA MKS
Sbjct: 145 FFRVSGFLMSDKLEVPRRKVHKNWHVGLNPDPVDRSHPDVHDDVL--EEEAMAMKSSTTQ 202
Query: 206 ----------MLETKKSTPETNATVTTST-----ESNPAPATVSNPDVKKVNESL----- 245
+ K N+ V+ E+N A V P + N S+
Sbjct: 203 TNATTTTPNVTVSMTKEVHGANSYVSIQEDQKRPENNQTEAIVK-PTPELHNSSMDAGAN 261
Query: 246 ----------------VNVS-NPSEERKVNESHTEMNIKLPTSVDNSSTTT-VSGGTNSS 287
NV+ N ++ K++ E IKL TS NSS T SG ++++
Sbjct: 262 NLAANATTAGSRENLNRNVTTNEVDQSKISGDKNETVIKLNTSTGNSSETLGTSGNSSTA 321
Query: 288 ENGTNTGRRLLEDNNSKGSQEG-----NDKEDVPVATAENDQALDENADSSFELFRDTDE 342
E T +GRRLLE++ SK S + ++ E V +AT END L+ +ADSSFEL R+ DE
Sbjct: 322 ETVTKSGRRLLEEDGSKESVDSHSDSKDNSEGVRMATVENDGGLERDADSSFELLRENDE 381
Query: 343 LADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMII 402
LADEY+YDYDDYVD+ MWGDEEW E QHE EDYVN+D+HIL TPVIADID DGV EMI+
Sbjct: 382 LADEYSYDYDDYVDEKMWGDEEWVEGQHENSEDYVNIDAHILCTPVIADIDKDGVQEMIV 441
Query: 403 AVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRA 462
AVSYFFD EYYDNPEHLKELGGIDI Y+A +IVVFNLDTKQVKW +LDLSTD A+FRA
Sbjct: 442 AVSYFFDPEYYDNPEHLKELGGIDIKNYIASSIVVFNLDTKQVKWIKELDLSTDKANFRA 501
Query: 463 YIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDD 522
YIYSSPTVVDLDGDG LDILVGTSFGLFY +DH G IREKFPLEMAEIQGAVVAADINDD
Sbjct: 502 YIYSSPTVVDLDGDGYLDILVGTSFGLFYAMDHRGNIREKFPLEMAEIQGAVVAADINDD 561
Query: 523 GKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIY 582
GKIELVTTD+HGN+AAWT +G IWE HLKSLV QGPSIGDVDGDGH++VVVPT SGNIY
Sbjct: 562 GKIELVTTDSHGNIAAWTTQGVEIWEAHLKSLVPQGPSIGDVDGDGHTEVVVPTSSGNIY 621
Query: 583 VLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSC 642
VLSGKDGS VRPYPYRTHGRVMNQ+LLVDL KRGEK KGLTIVTTSFDGYLYLIDGPTSC
Sbjct: 622 VLSGKDGSIVRPYPYRTHGRVMNQLLLVDLNKRGEKKKGLTIVTTSFDGYLYLIDGPTSC 681
Query: 643 ADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNN 702
DVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFSTP+PHHPLKAWRS +QGRNN
Sbjct: 682 TDVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFSTPSPHHPLKAWRSSDQGRNN 741
Query: 703 VAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQ 762
A RY+R G++VTH +R FRDEEG+NFW EIEIVD+YR+PSGSQAPYNVTTTLLVPGNYQ
Sbjct: 742 KANRYDREGVFVTHSTRGFRDEEGKNFWAEIEIVDKYRYPSGSQAPYNVTTTLLVPGNYQ 801
Query: 763 GERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLL 822
GERRI QSQI+ R GKYRIKLPTVGVRTTGTV+VEM DKNGL+FSDEFSLTFHMYYYKLL
Sbjct: 802 GERRITQSQIYDRPGKYRIKLPTVGVRTTGTVMVEMADKNGLHFSDEFSLTFHMYYYKLL 861
Query: 823 KWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTDL 857
KWLLVLPMLGMFG+LVILRPQEA+PLPSFSRNTDL
Sbjct: 862 KWLLVLPMLGMFGLLVILRPQEAVPLPSFSRNTDL 896
>gi|356496156|ref|XP_003516936.1| PREDICTED: uncharacterized protein LOC100500591 [Glycine max]
Length = 858
Score = 1261 bits (3262), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 631/836 (75%), Positives = 710/836 (84%), Gaps = 31/836 (3%)
Query: 50 EDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGD 109
EDALVN++CPKNLELRWQTEVSSSIYA PLIADINSDGKL+IVVPSF+HYLEVLEG+DGD
Sbjct: 25 EDALVNSKCPKNLELRWQTEVSSSIYANPLIADINSDGKLEIVVPSFVHYLEVLEGADGD 84
Query: 110 KMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKV 169
KMPGWPAFHQS+VHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMM+DKLE+PRR+V
Sbjct: 85 KMPGWPAFHQSTVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMSDKLEVPRRRV 144
Query: 170 RKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTES--- 226
K W+VGL DPVDRSHPDVHDD ++Q+ A SM + S E ++ STE+
Sbjct: 145 LKKWFVGLDPDPVDRSHPDVHDDQLIQD--ATIKNSMSQMNGSRHEARSSAAISTENHLD 202
Query: 227 -----NPAPAT------------VSNPDV-KKVNESLVN----VSNPSEERKVNESHTEM 264
NP P V NP+ KK+N S V+ V NP E+K+N S +
Sbjct: 203 SKKLPNPEPEKKINGSQADESIKVPNPEPEKKINGSQVDESIKVPNPEPEKKINGSQVDE 262
Query: 265 NIKLPTSVDNSSTTTVSGGTNSSENGTNTGRRLLEDNNSKGSQEG----NDKEDVPVATA 320
+IK+PT VDNSS S T ++N T+TGRRLLEDNNSKG+++G DKE + AT
Sbjct: 263 SIKVPTIVDNSSVNAGSLETVHADNKTSTGRRLLEDNNSKGAEQGGSESKDKEGIHAATV 322
Query: 321 ENDQALDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVD 380
END+ L+ +ADSSFELFR++++LADEY+YDYDDYVD++MWGDEEWTE +HEK+ED+VNVD
Sbjct: 323 ENDEGLEADADSSFELFRNSEDLADEYSYDYDDYVDESMWGDEEWTEVKHEKLEDFVNVD 382
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNL 440
SHIL TPVIADIDNDGVSEMI+AVSYFFDHEYYDN EH KELG IDIGKYVAG IVVFNL
Sbjct: 383 SHILCTPVIADIDNDGVSEMIVAVSYFFDHEYYDNQEHRKELGDIDIGKYVAGGIVVFNL 442
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIR 500
DTKQVKWT +LDLSTD ++FRAYIYSSPTVVDLDGDGNLDILVGTS+GLFYVLDHHGK+R
Sbjct: 443 DTKQVKWTAELDLSTDTSNFRAYIYSSPTVVDLDGDGNLDILVGTSYGLFYVLDHHGKVR 502
Query: 501 EKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS 560
+KFPLEMAEIQGAVVAAD+NDDGKIELVT DTHGNVA WT +G IWE+HLKSL+ QGP+
Sbjct: 503 QKFPLEMAEIQGAVVAADVNDDGKIELVTADTHGNVAVWTPKGDLIWEKHLKSLIPQGPT 562
Query: 561 IGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSK 620
+GDVDGDGH+++VVPTLSG I+VL G+DGS + YPY THGR+MNQVLLVDL+K EK K
Sbjct: 563 VGDVDGDGHTELVVPTLSGKIHVLDGRDGSSIGRYPYPTHGRIMNQVLLVDLSKHKEKRK 622
Query: 621 GLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFC 680
GLTIVTTSFDGYLYLIDGPT CADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFC
Sbjct: 623 GLTIVTTSFDGYLYLIDGPTGCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFC 682
Query: 681 FSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYR 740
FSTP+PHHPLKAWR +QGRNNVA RYNR GIYVTHPSRAF DEEG++FWVEIEIVD YR
Sbjct: 683 FSTPSPHHPLKAWRLPSQGRNNVANRYNREGIYVTHPSRAFHDEEGKSFWVEIEIVDNYR 742
Query: 741 FPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVD 800
+PSG Q PY VTT+LLVPGNYQGER IK + + + GKYRIKLPTV VRTTGTVLVEMVD
Sbjct: 743 YPSGHQGPYKVTTSLLVPGNYQGERTIKLNNTYDQPGKYRIKLPTVSVRTTGTVLVEMVD 802
Query: 801 KNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTD 856
+NGLYFSD+FSLTFHM+YYKLLKWLLVLPMLGMFGVLVIL PQ +MPLPSFSRN D
Sbjct: 803 RNGLYFSDDFSLTFHMHYYKLLKWLLVLPMLGMFGVLVILHPQGSMPLPSFSRNID 858
>gi|5923674|gb|AAD56325.1|AC009326_12 unknown protein [Arabidopsis thaliana]
Length = 907
Score = 1260 bits (3260), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 645/886 (72%), Positives = 721/886 (81%), Gaps = 61/886 (6%)
Query: 30 QNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKL 89
+NKFR+R+ATDD+LG P IDEDAL+NTQCPK LELRWQTEV+SS+YATPLIADINSDGKL
Sbjct: 25 ENKFRERKATDDELGYPDIDEDALLNTQCPKKLELRWQTEVTSSVYATPLIADINSDGKL 84
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVL 149
DIVVPSF+HYLEVLEG+DGDKMPGWPAFHQS+VHSSPLL+DIDKDGVREIALATYN EVL
Sbjct: 85 DIVVPSFVHYLEVLEGADGDKMPGWPAFHQSNVHSSPLLFDIDKDGVREIALATYNAEVL 144
Query: 150 FFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKS---- 205
FFRVSG++M+DKLE+PRRKV K+W+VGL+ DPVDRSHPDVHDD++ E EA MKS
Sbjct: 145 FFRVSGFLMSDKLEVPRRKVHKNWHVGLNPDPVDRSHPDVHDDVL--EEEAMAMKSSTTQ 202
Query: 206 ----------MLETKKSTPETNATVTTST-----ESNPAPATVSNPDVKKVNESL----- 245
+ K N+ V+T E+N A V P + N S+
Sbjct: 203 TNATTTTPNVTVSMTKEVHGANSYVSTQEDQKRPENNQTEAIVK-PTPELHNSSMDAGAN 261
Query: 246 ----------------VNVS-NPSEERKVNESHTEMNIKLPTSVDNSSTTT-VSGGTNSS 287
NV+ N ++ K++ E IKL TS NSS T SG ++++
Sbjct: 262 NLAANATTAGSRENLNRNVTTNEVDQSKISGDKNETVIKLNTSTGNSSETLGTSGNSSTA 321
Query: 288 ENGTNTGRRLLEDNNSKGSQEG-----NDKEDVPVATAENDQALDENADSSFELFRDTDE 342
E T +GRRLLE++ SK S + ++ E V +AT END L+ +ADSSFEL R+ DE
Sbjct: 322 ETVTKSGRRLLEEDGSKESVDSHSDSKDNSEGVRMATVENDGGLEADADSSFELLRENDE 381
Query: 343 LADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMII 402
LADEY+YDYDDYVD+ MWGDEEW E QHE EDYVN+D+HIL TPVIADID DGV EMI+
Sbjct: 382 LADEYSYDYDDYVDEKMWGDEEWVEGQHENSEDYVNIDAHILCTPVIADIDKDGVQEMIV 441
Query: 403 AVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRA 462
AVSYFFD EYYDNPEHLKELGGIDI Y+A +IVVFNLDTKQVKW +LDLSTD A+FRA
Sbjct: 442 AVSYFFDPEYYDNPEHLKELGGIDIKNYIASSIVVFNLDTKQVKWIKELDLSTDKANFRA 501
Query: 463 YIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDD 522
YIYSSPTVVDLDGDG LDILVGTSFGLFY +DH G IREKFPLEMAEIQGAVVAADINDD
Sbjct: 502 YIYSSPTVVDLDGDGYLDILVGTSFGLFYAMDHRGNIREKFPLEMAEIQGAVVAADINDD 561
Query: 523 GKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIY 582
GKIELVTTD+HGN+AAWT +G IWE HLKSLV QGPSIGDVDGDGH++VVVPT SGNIY
Sbjct: 562 GKIELVTTDSHGNIAAWTTQGVEIWEAHLKSLVPQGPSIGDVDGDGHTEVVVPTSSGNIY 621
Query: 583 VLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSC 642
VLSGKDGS VRPYPYRTHGRVMNQ+LLVDL KRGEK KGLTIVTTSFDGYLYLIDGPTSC
Sbjct: 622 VLSGKDGSIVRPYPYRTHGRVMNQLLLVDLNKRGEKKKGLTIVTTSFDGYLYLIDGPTSC 681
Query: 643 ADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLK----------- 691
DVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFSTP+PHHPLK
Sbjct: 682 TDVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFSTPSPHHPLKARSTPNSICER 741
Query: 692 AWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNV 751
AWRS +QGRNN A RY+R G++VTH +R FRDEEG+NFW EIEIVD+YR+PSGSQAPYNV
Sbjct: 742 AWRSSDQGRNNKANRYDREGVFVTHSTRGFRDEEGKNFWAEIEIVDKYRYPSGSQAPYNV 801
Query: 752 TTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFS 811
TTTLLVPGNYQGERRI QSQI+ R GKYRIKLPTVGVRTTGTV+VEM DKNGL+FSDEFS
Sbjct: 802 TTTLLVPGNYQGERRITQSQIYDRPGKYRIKLPTVGVRTTGTVMVEMADKNGLHFSDEFS 861
Query: 812 LTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTDL 857
LTFHMYYYKLLKWLLVLPMLGMFG+LVILRPQEA+PLPSFSRNTDL
Sbjct: 862 LTFHMYYYKLLKWLLVLPMLGMFGLLVILRPQEAVPLPSFSRNTDL 907
>gi|334185182|ref|NP_001189844.1| defective in exine formation protein (DEX1) [Arabidopsis thaliana]
gi|332641200|gb|AEE74721.1| defective in exine formation protein (DEX1) [Arabidopsis thaliana]
Length = 891
Score = 1226 bits (3172), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 625/853 (73%), Positives = 699/853 (81%), Gaps = 50/853 (5%)
Query: 30 QNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKL 89
+NKFR+R+ATDD+LG P IDEDAL+NTQCPK LELRWQTEV+SS+YATPLIADINSDGKL
Sbjct: 25 ENKFRERKATDDELGYPDIDEDALLNTQCPKKLELRWQTEVTSSVYATPLIADINSDGKL 84
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVL 149
DIVVPSF+HYLEVLEG+DGDKMPGWPAFHQS+VHSSPLL+DIDKDGVREIALATYN EVL
Sbjct: 85 DIVVPSFVHYLEVLEGADGDKMPGWPAFHQSNVHSSPLLFDIDKDGVREIALATYNAEVL 144
Query: 150 FFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKS---- 205
FFRVSG++M+DKLE+PRRKV K+W+VGL+ DPVDRSHPDVHDD++ E EA MKS
Sbjct: 145 FFRVSGFLMSDKLEVPRRKVHKNWHVGLNPDPVDRSHPDVHDDVL--EEEAMAMKSSTTQ 202
Query: 206 ----------MLETKKSTPETNATVTTST-----ESNPAPATVSNPDVKKVNESL----- 245
+ K N+ V+T E+N A V P + N S+
Sbjct: 203 TNATTTTPNVTVSMTKEVHGANSYVSTQEDQKRPENNQTEAIVK-PTPELHNSSMDAGAN 261
Query: 246 ----------------VNVS-NPSEERKVNESHTEMNIKLPTSVDNSSTTT-VSGGTNSS 287
NV+ N ++ K++ E IKL TS NSS T SG ++++
Sbjct: 262 NLAANATTAGSRENLNRNVTTNEVDQSKISGDKNETVIKLNTSTGNSSETLGTSGNSSTA 321
Query: 288 ENGTNTGRRLLEDNNSKGSQEG-----NDKEDVPVATAENDQALDENADSSFELFRDTDE 342
E T +GRRLLE++ SK S + ++ E V +AT END L+ +ADSSFEL R+ DE
Sbjct: 322 ETVTKSGRRLLEEDGSKESVDSHSDSKDNSEGVRMATVENDGGLEADADSSFELLRENDE 381
Query: 343 LADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMII 402
LADEY+YDYDDYVD+ MWGDEEW E QHE EDYVN+D+HIL TPVIADID DGV EMI+
Sbjct: 382 LADEYSYDYDDYVDEKMWGDEEWVEGQHENSEDYVNIDAHILCTPVIADIDKDGVQEMIV 441
Query: 403 AVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRA 462
AVSYFFD EYYDNPEHLKELGGIDI Y+A +IVVFNLDTKQVKW +LDLSTD A+FRA
Sbjct: 442 AVSYFFDPEYYDNPEHLKELGGIDIKNYIASSIVVFNLDTKQVKWIKELDLSTDKANFRA 501
Query: 463 YIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDD 522
YIYSSPTVVDLDGDG LDILVGTSFGLFY +DH G IREKFPLEMAEIQGAVVAADINDD
Sbjct: 502 YIYSSPTVVDLDGDGYLDILVGTSFGLFYAMDHRGNIREKFPLEMAEIQGAVVAADINDD 561
Query: 523 GKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIY 582
GKIELVTTD+HGN+AAWT +G IWE HLKSLV QGPSIGDVDGDGH++VVVPT SGNIY
Sbjct: 562 GKIELVTTDSHGNIAAWTTQGVEIWEAHLKSLVPQGPSIGDVDGDGHTEVVVPTSSGNIY 621
Query: 583 VLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSC 642
VLSGKDGS VRPYPYRTHGRVMNQ+LLVDL KRGEK KGLTIVTTSFDGYLYLIDGPTSC
Sbjct: 622 VLSGKDGSIVRPYPYRTHGRVMNQLLLVDLNKRGEKKKGLTIVTTSFDGYLYLIDGPTSC 681
Query: 643 ADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNN 702
DVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFSTP+PHHPLKAWRS +QGRNN
Sbjct: 682 TDVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFSTPSPHHPLKAWRSSDQGRNN 741
Query: 703 VAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQ 762
A RY+R G++VTH +R FRDEEG+NFW EIEIVD+YR+PSGSQAPYNVTTTLLVPGNYQ
Sbjct: 742 KANRYDREGVFVTHSTRGFRDEEGKNFWAEIEIVDKYRYPSGSQAPYNVTTTLLVPGNYQ 801
Query: 763 GERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLL 822
GERRI QSQI+ R GKYRIKLPTVGVRTTGTV+VEM DKNGL+FSDEFSLTFHMYYYKLL
Sbjct: 802 GERRITQSQIYDRPGKYRIKLPTVGVRTTGTVMVEMADKNGLHFSDEFSLTFHMYYYKLL 861
Query: 823 KWLLVLPMLGMFG 835
KWLLVLPMLGMFG
Sbjct: 862 KWLLVLPMLGMFG 874
>gi|326506216|dbj|BAJ86426.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 841
Score = 1196 bits (3094), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/831 (69%), Positives = 681/831 (81%), Gaps = 28/831 (3%)
Query: 31 NKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKLD 90
NKFRQREA+DD LG P +DEDAL++T+CPK++ELRWQTEVSSSIYA+PLIADINSDGKL+
Sbjct: 29 NKFRQREASDDMLGYPHLDEDALLSTKCPKHVELRWQTEVSSSIYASPLIADINSDGKLE 88
Query: 91 IVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLF 150
+VVPSF+HYLEVLEGSDGDK+PGWPAFHQS+VHSSPLLYDIDKDG REI LATYNG V F
Sbjct: 89 VVVPSFVHYLEVLEGSDGDKLPGWPAFHQSNVHSSPLLYDIDKDGTREIVLATYNGVVNF 148
Query: 151 FRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKSMLETK 210
FR+SGY+M DKLE+PRRKV KDW+VGL+ DPVDRSHPDV+D I +++ + S ++ K
Sbjct: 149 FRISGYLMMDKLEVPRRKVHKDWHVGLNPDPVDRSHPDVNDSSIAKQAASEESHSDIQHK 208
Query: 211 KSTPETNATVTTSTESNPAPATVSNPDVKKVNESLVNVSN-PSEERKVNESHTEMNIKLP 269
+ ES+ P + S + + + L + S S E K N + + N +LP
Sbjct: 209 ----------SVGDESSKEPQSRSTTNTPQGADPLKHPSEVQSAETKPNSTAEKENPELP 258
Query: 270 TSVDNSSTTTVSGGTNSSENGTNTGRRLLE------DNNSKGSQEGNDKEDVPVATAEND 323
N TTT E+ ++ RRLL+ D G+D AT END
Sbjct: 259 K---NPKTTT--------ESASHAQRRLLQTADKSDDQTGSAETHGSDAGTTGKATVEND 307
Query: 324 QALDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHI 383
+ L++ A++SF+LFRD ++L +EY+YDYDDYV+D WGDE+WTE++HEK EDYV++D+HI
Sbjct: 308 EPLEDEANASFDLFRDAEDLPEEYSYDYDDYVNDTWWGDEDWTEQEHEKAEDYVSIDAHI 367
Query: 384 LSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTK 443
LSTPVIADIDNDGV EM+IAVSYFFD EYYDN EHLKELGGID GKY+A +IVVFNLDTK
Sbjct: 368 LSTPVIADIDNDGVQEMVIAVSYFFDREYYDNAEHLKELGGIDTGKYIASSIVVFNLDTK 427
Query: 444 QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKF 503
QVKWT +LDLST++ F A+ YSSPTVVDLDGDGNLDILVGTS+GLFYVLDHHGK R+ F
Sbjct: 428 QVKWTAELDLSTESGKFLAHAYSSPTVVDLDGDGNLDILVGTSYGLFYVLDHHGKTRKNF 487
Query: 504 PLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGD 563
PLEMAEI V+AADINDDGKIE+VT D HGNVAAWTAEG IWE HLKSLV Q P++GD
Sbjct: 488 PLEMAEIHAPVIAADINDDGKIEMVTADVHGNVAAWTAEGDEIWEVHLKSLVPQRPTVGD 547
Query: 564 VDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLT 623
V+GDGH+DVVVPT+SGNIYVLSGKDGSKV+P+PYRTHGR+M+ VLLVD++KRGEK++GLT
Sbjct: 548 VNGDGHTDVVVPTVSGNIYVLSGKDGSKVQPFPYRTHGRIMSPVLLVDMSKRGEKTQGLT 607
Query: 624 IVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFST 683
+ TTSFDGYLYLI+G + CADVVDIGETSY+MVLADNVDGGDDLDLIVTTMNGNVFCFST
Sbjct: 608 LATTSFDGYLYLIEGSSGCADVVDIGETSYTMVLADNVDGGDDLDLIVTTMNGNVFCFST 667
Query: 684 PAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPS 743
P+PHHPLK WRS NQGRNN A R+NR GIYV H SRAFRDEEG++FWVE EIVD+YR P
Sbjct: 668 PSPHHPLKEWRSSNQGRNNAAYRHNRQGIYVKHGSRAFRDEEGKHFWVEFEIVDKYRVPY 727
Query: 744 GSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNG 803
G+QAPYNVT TLLVPGNYQG+RRI SQ++ G R++LPTV VRTTGTVLVEMVDK+G
Sbjct: 728 GNQAPYNVTVTLLVPGNYQGDRRIVVSQVYNEPGHKRMQLPTVPVRTTGTVLVEMVDKHG 787
Query: 804 LYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSFSRN 854
L+F+DE+SLTFH++Y+KLLKWL+VLPMLGMF VLVILRPQE PLPSFSRN
Sbjct: 788 LHFADEYSLTFHVHYFKLLKWLVVLPMLGMFLVLVILRPQEGAPLPSFSRN 838
>gi|326488411|dbj|BAJ93874.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 841
Score = 1195 bits (3091), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 576/831 (69%), Positives = 681/831 (81%), Gaps = 28/831 (3%)
Query: 31 NKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKLD 90
NKFRQREA+DD LG P +DEDAL++T+CPK++ELRWQTEVSSSIYA+PLIADINSDGKL+
Sbjct: 29 NKFRQREASDDMLGYPHLDEDALLSTKCPKHVELRWQTEVSSSIYASPLIADINSDGKLE 88
Query: 91 IVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLF 150
+VVPSF+HYLEVLEGSDGDK+PGWPAFHQS+VHSSPLLYDIDKDG REI LATYNG V F
Sbjct: 89 VVVPSFVHYLEVLEGSDGDKLPGWPAFHQSNVHSSPLLYDIDKDGTREIVLATYNGVVNF 148
Query: 151 FRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKSMLETK 210
FR+SGY+M DKLE+PRRKV KDW+VGL+ DPVDRSHPDV+D I +++ + S ++ K
Sbjct: 149 FRISGYLMMDKLEVPRRKVHKDWHVGLNPDPVDRSHPDVNDSSIAKQAASEESHSDIQHK 208
Query: 211 KSTPETNATVTTSTESNPAPATVSNPDVKKVNESLVNVSN-PSEERKVNESHTEMNIKLP 269
+ ES+ P + S + + + L + S S E K N + + N +LP
Sbjct: 209 ----------SVGDESSKEPQSRSTTNTPQGADPLKHPSEVQSAETKPNSTAEKENPELP 258
Query: 270 TSVDNSSTTTVSGGTNSSENGTNTGRRLLE------DNNSKGSQEGNDKEDVPVATAEND 323
N TTT E+ ++ RRLL+ D G+D AT END
Sbjct: 259 K---NPKTTT--------ESASHAQRRLLQTADKSDDQTGSAETHGSDAGTTGKATVEND 307
Query: 324 QALDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHI 383
+ L++ A++SF+LFRD ++L +EY+YDYDDYV+D WGDE+WTE++HEK EDYV++D+HI
Sbjct: 308 EPLEDEANASFDLFRDAEDLPEEYSYDYDDYVNDTWWGDEDWTEQEHEKAEDYVSIDAHI 367
Query: 384 LSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTK 443
LSTPVIADIDNDGV EM+IAVSYFFD EYYDN EHLKELGGID GKY+A +IVVFNLDTK
Sbjct: 368 LSTPVIADIDNDGVQEMVIAVSYFFDREYYDNAEHLKELGGIDTGKYIASSIVVFNLDTK 427
Query: 444 QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKF 503
QVKWT +LDLST++ F A+ YSSPTVVDLDGDGNLDILVGTS+GLFYVLDHHGK R+ F
Sbjct: 428 QVKWTAELDLSTESGKFLAHAYSSPTVVDLDGDGNLDILVGTSYGLFYVLDHHGKTRKNF 487
Query: 504 PLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGD 563
PLEMAEI V+AADINDDGKIE+VT D HGNVAAWTAEG IWE HLKSLV Q P++GD
Sbjct: 488 PLEMAEIHAPVIAADINDDGKIEMVTADVHGNVAAWTAEGDEIWEVHLKSLVPQRPTVGD 547
Query: 564 VDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLT 623
V+GDGH+DVVVPT+SGNIYVLSGKDGSKV+P+PYRTHGR+M+ VLLVD++KRGEK++GLT
Sbjct: 548 VNGDGHTDVVVPTVSGNIYVLSGKDGSKVQPFPYRTHGRIMSPVLLVDMSKRGEKTQGLT 607
Query: 624 IVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFST 683
+ TTSFDGYLYLI+G + CADVVDIGETSY+MVLADNVDGGDDLDLIVTT+NGNVFCFST
Sbjct: 608 LATTSFDGYLYLIEGSSGCADVVDIGETSYTMVLADNVDGGDDLDLIVTTLNGNVFCFST 667
Query: 684 PAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPS 743
P+PHHPLK WRS NQGRNN A R+NR GIYV H SRAFRDEEG++FWVE EIVD+YR P
Sbjct: 668 PSPHHPLKEWRSSNQGRNNAAYRHNRQGIYVKHGSRAFRDEEGKHFWVEFEIVDKYRVPY 727
Query: 744 GSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNG 803
G+QAPYNVT TLLVPGNYQG+RRI SQ++ G R++LPTV VRTTGTVLVEMVDK+G
Sbjct: 728 GNQAPYNVTVTLLVPGNYQGDRRIVVSQVYNEPGHKRMQLPTVPVRTTGTVLVEMVDKHG 787
Query: 804 LYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSFSRN 854
L+F+DE+SLTFH++Y+KLLKWL+VLPMLGMF VLVILRPQE PLPSFSRN
Sbjct: 788 LHFADEYSLTFHVHYFKLLKWLVVLPMLGMFLVLVILRPQEGAPLPSFSRN 838
>gi|115456315|ref|NP_001051758.1| Os03g0825700 [Oryza sativa Japonica Group]
gi|15042825|gb|AAK82448.1|AC091247_15 putative dex1 protein [Oryza sativa Japonica Group]
gi|108711842|gb|ABF99637.1| defective in exine formation protein, putative, expressed [Oryza
sativa Japonica Group]
gi|113550229|dbj|BAF13672.1| Os03g0825700 [Oryza sativa Japonica Group]
Length = 851
Score = 1186 bits (3067), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 583/832 (70%), Positives = 682/832 (81%), Gaps = 15/832 (1%)
Query: 31 NKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKLD 90
NKFRQREATDD LG P +DEDAL+ T+CPK++ELRWQTEVSSSIYATPLIADINSDGKL+
Sbjct: 29 NKFRQREATDDMLGYPHLDEDALLKTKCPKHVELRWQTEVSSSIYATPLIADINSDGKLE 88
Query: 91 IVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLF 150
+VVPSF+HYLEVLEGSDGDK+PGWPAFHQS+VHSSPLLYDIDKDG REI LATYNG V F
Sbjct: 89 VVVPSFVHYLEVLEGSDGDKLPGWPAFHQSNVHSSPLLYDIDKDGTREIVLATYNGVVNF 148
Query: 151 FRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKSMLETK 210
FRVSGYMM DKLE+PRRKV KDWYVGL++DPVDRSHPDVHD I +++ ++
Sbjct: 149 FRVSGYMMMDKLEVPRRKVHKDWYVGLNTDPVDRSHPDVHDSSIAKKAA---------SE 199
Query: 211 KSTPETNATVTTSTESNPAPATVSNPDVKKVNESLVNVSNPSE-ERKVNESHTEMNIKLP 269
+S P + S + + +N + +S+ + S E K N + + N+ +
Sbjct: 200 ESHPNIQDKPVVNESSKESQSRSTNDSTTRGVDSMKHASKEEPVESKPNSTRGQENMDVL 259
Query: 270 TSVDNSSTTTVSGGTNSSENGTNTGRRLL---EDNNSKGSQE--GNDKEDVPVATAENDQ 324
+++++ S + ++EN ++ RRLL E +N GS E +D AT EN +
Sbjct: 260 NNLNSTDAGNNSSLSTTTENASHVQRRLLQTDEKSNQAGSSETDASDTGTAKAATVENSE 319
Query: 325 ALDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHIL 384
L+ +AD+SF LFRD ++L DEYNYDYDDYVD+ MWGDE+W E+QHEK EDYV++D+HIL
Sbjct: 320 PLEADADASFNLFRDVEDLPDEYNYDYDDYVDETMWGDEDWKEQQHEKAEDYVSIDAHIL 379
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
STPVIADID DG+ EM+I+VSYFFDHEYYD PEHLKELGGIDIGKY+A +IVVFNLDT+Q
Sbjct: 380 STPVIADIDRDGIQEMVISVSYFFDHEYYDKPEHLKELGGIDIGKYIASSIVVFNLDTRQ 439
Query: 445 VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP 504
VKWT +LDLSTD+ +F A+ YSSPTVVDLDGDGNLDILVGTSFGLFYV+DH GK+R KFP
Sbjct: 440 VKWTAELDLSTDSGNFTAHAYSSPTVVDLDGDGNLDILVGTSFGLFYVIDHRGKVRNKFP 499
Query: 505 LEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDV 564
LEMAEI V+AADINDDGKIE+VT D HGNVAAWTAEG+ IWE HLKSL+ Q P++GDV
Sbjct: 500 LEMAEIHAPVIAADINDDGKIEMVTADVHGNVAAWTAEGEEIWEVHLKSLIPQRPTVGDV 559
Query: 565 DGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTI 624
+GDG ++VVVPT+SGNIYVLSGKDGSK++P+PYRTHGR+M+ VLL+D++K EKSKGLT+
Sbjct: 560 NGDGRTEVVVPTVSGNIYVLSGKDGSKIQPFPYRTHGRIMSPVLLLDMSKHDEKSKGLTL 619
Query: 625 VTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTP 684
TTSFDGYLYLI+G + CADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTP
Sbjct: 620 ATTSFDGYLYLIEGSSGCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTP 679
Query: 685 APHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSG 744
+PHHPLK WRS NQGRNN A RYNR GIYV H SR FRDEEG++FWVE EIVD+YR P G
Sbjct: 680 SPHHPLKEWRSSNQGRNNAAYRYNREGIYVKHGSRTFRDEEGKHFWVEFEIVDKYRVPYG 739
Query: 745 SQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGL 804
+QAPYNVT TLLVPGNYQGERRI + + GK R+KLPTV VRTTGTVLVEMVDKNG
Sbjct: 740 NQAPYNVTVTLLVPGNYQGERRIVVNAAYNEPGKQRMKLPTVPVRTTGTVLVEMVDKNGF 799
Query: 805 YFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTD 856
YFSDEFSLTFHM+YYKLLKWL++LPMLGMF VLVILRPQE PLPSFSRN D
Sbjct: 800 YFSDEFSLTFHMHYYKLLKWLVLLPMLGMFSVLVILRPQEGAPLPSFSRNID 851
>gi|357123438|ref|XP_003563417.1| PREDICTED: uncharacterized protein LOC100842013 [Brachypodium
distachyon]
Length = 854
Score = 1172 bits (3033), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/833 (69%), Positives = 682/833 (81%), Gaps = 16/833 (1%)
Query: 31 NKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKLD 90
NKFRQREA+DD LG P +DEDAL+NT+CPK++ELRWQTEVSSSIYATPLIADINSDGKL+
Sbjct: 31 NKFRQREASDDLLGYPHLDEDALLNTKCPKHVELRWQTEVSSSIYATPLIADINSDGKLE 90
Query: 91 IVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLF 150
+VVPSF+HYLEVLEGSDGDK+PGWPAFHQS+VHSSPLLYDIDKDG REI LATYNG V F
Sbjct: 91 VVVPSFVHYLEVLEGSDGDKLPGWPAFHQSNVHSSPLLYDIDKDGTREIVLATYNGVVNF 150
Query: 151 FRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKSMLETK 210
FR+SGYMM DKLE+PRRKVRKDW+VGL+ DPVDRSHPDVHD I +++ + + K
Sbjct: 151 FRISGYMMMDKLEVPRRKVRKDWHVGLNPDPVDRSHPDVHDSSIAKKTASEESHPDIHDK 210
Query: 211 KSTPETNATVTTSTESNPAPATVSNPDVKKVNESLVNVSN-PSEERKVNESHTEMNIKLP 269
+++ + + +N A V +SL + S S E+K N + N++LP
Sbjct: 211 PVVEKSSEETKSRSAANTATQEV---------DSLKHASELQSTEKKPNSTPGNENMELP 261
Query: 270 TSVDNSSTTTVSGGTNSSENGTNTGRRLLE------DNNSKGSQEGNDKEDVPVATAEND 323
+ +N+++ S +++N ++ RRLL+ D GND T END
Sbjct: 262 NNPNNTNSGNTSSLYTTTDNASHAQRRLLQTADKSDDQTGNAEIHGNDAGTTGEMTVEND 321
Query: 324 QALDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHI 383
+ L+E+A++SF+LFRD ++L DEYNYDYDDYVD++MWGDE+WTE++HEK +DYV++D+HI
Sbjct: 322 EPLEEDANASFDLFRDAEDLPDEYNYDYDDYVDESMWGDEDWTEQEHEKADDYVSIDAHI 381
Query: 384 LSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTK 443
LSTPVIADID DGV EM+IAVSYFFD EYYDNP+H+KELGGIDIGKY+A IVVF+LDTK
Sbjct: 382 LSTPVIADIDKDGVQEMVIAVSYFFDREYYDNPDHIKELGGIDIGKYIASGIVVFDLDTK 441
Query: 444 QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKF 503
QVKWT DLDLST+N FRA+ YSSP VVDLDGDG LDILVGTS+GLFYV+DH GKIR F
Sbjct: 442 QVKWTADLDLSTENGIFRAHAYSSPAVVDLDGDGYLDILVGTSYGLFYVIDHRGKIRSNF 501
Query: 504 PLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGD 563
PLEMAEI V+AADINDDGKIE+VT D HGNVAAWTAEGK IWE HLKSLV Q P++GD
Sbjct: 502 PLEMAEIHAPVIAADINDDGKIEMVTADVHGNVAAWTAEGKEIWEVHLKSLVPQRPTVGD 561
Query: 564 VDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLT 623
VDGDGH+D+VVPT+SGNIYVL GKDG KV+P+PYR HGR+M+ VLL+D++KR E S+GLT
Sbjct: 562 VDGDGHTDIVVPTVSGNIYVLRGKDGLKVQPFPYRAHGRIMSPVLLLDMSKREENSRGLT 621
Query: 624 IVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFST 683
+ TTSFDGYLYLI+G + CADVVDIGETSY+MVLADNVDGGDDLDLIVTTMNGNVFCFST
Sbjct: 622 LATTSFDGYLYLIEGSSGCADVVDIGETSYTMVLADNVDGGDDLDLIVTTMNGNVFCFST 681
Query: 684 PAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPS 743
P+PHHPLK WRS NQGRNN A RYNR GIYV H SRAFRDEEG++FWVE EIVD+YR P
Sbjct: 682 PSPHHPLKEWRSSNQGRNNAAYRYNRQGIYVKHGSRAFRDEEGKHFWVEFEIVDKYRVPY 741
Query: 744 GSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNG 803
G+Q PYNVT TLLVPGNYQG+RRI SQI+ G R++LPTV VRTTGTVLVEMVDK+G
Sbjct: 742 GNQGPYNVTVTLLVPGNYQGDRRIVVSQIYHEPGSQRMQLPTVPVRTTGTVLVEMVDKHG 801
Query: 804 LYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTD 856
++FSDE+SLTFH +YYKLLKWL+VLPMLGMF VLVILRPQE PLPSFSRN D
Sbjct: 802 IHFSDEYSLTFHTHYYKLLKWLVVLPMLGMFCVLVILRPQEGAPLPSFSRNID 854
>gi|222626087|gb|EEE60219.1| hypothetical protein OsJ_13190 [Oryza sativa Japonica Group]
Length = 842
Score = 1167 bits (3020), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/832 (69%), Positives = 676/832 (81%), Gaps = 24/832 (2%)
Query: 31 NKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKLD 90
NKFRQREATDD LG P +DEDAL+ T+CPK++ELRWQTEVSSSIYATPLIADINSDGKL+
Sbjct: 29 NKFRQREATDDMLGYPHLDEDALLKTKCPKHVELRWQTEVSSSIYATPLIADINSDGKLE 88
Query: 91 IVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLF 150
+VVPSF+HYLEVLEGSDGDK+PGWPAFHQS+VHSSPLLYDIDKDG REI LATYNG V F
Sbjct: 89 VVVPSFVHYLEVLEGSDGDKLPGWPAFHQSNVHSSPLLYDIDKDGTREIVLATYNGVVNF 148
Query: 151 FRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKSMLETK 210
FRVSGYMM DKLE+PRRKV KDWYVGL++DPVDRSHPDVHD I +++ ++
Sbjct: 149 FRVSGYMMMDKLEVPRRKVHKDWYVGLNTDPVDRSHPDVHDSSIAKKAA---------SE 199
Query: 211 KSTPETNATVTTSTESNPAPATVSNPDVKKVNESLVNVSNPSE-ERKVNESHTEMNIKLP 269
+S P + S + + +N + +S+ + S E K N + + N+ +
Sbjct: 200 ESHPNIQDKPVVNESSKESQSRSTNDSTTRGVDSMKHASKEEPVESKPNSTRGQENMDVL 259
Query: 270 TSVDNSSTTTVSGGTNSSENGTNTGRRLL---EDNNSKGSQE--GNDKEDVPVATAENDQ 324
+++++ S + ++EN ++ RRLL E +N GS E +D AT EN +
Sbjct: 260 NNLNSTDAGNNSSLSTTTENASHVQRRLLQTDEKSNQAGSSETDASDTGTAKAATVENSE 319
Query: 325 ALDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHIL 384
L+ +AD+SF LFRD ++L DEYNYDYDDYVD+ MWGDE+W E+QHEK EDYV++D+HIL
Sbjct: 320 PLEADADASFNLFRDVEDLPDEYNYDYDDYVDETMWGDEDWKEQQHEKAEDYVSIDAHIL 379
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
STPVIADID DG+ EM+I+VSYFFDHEYYD PEHLKELGGIDIGKY+A +IVVFNLDT+Q
Sbjct: 380 STPVIADIDRDGIQEMVISVSYFFDHEYYDKPEHLKELGGIDIGKYIASSIVVFNLDTRQ 439
Query: 445 VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP 504
VKWT +LDLSTD+ +F A+ YSSPTVVDLDGDGNLDILVGTSFGLFYV+DH GK+R KFP
Sbjct: 440 VKWTAELDLSTDSGNFTAHAYSSPTVVDLDGDGNLDILVGTSFGLFYVIDHRGKVRNKFP 499
Query: 505 LEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDV 564
LEMAEI V+AADINDDGKIE+VT D HGNVAAWTAEG+ IWE+ P++GDV
Sbjct: 500 LEMAEIHAPVIAADINDDGKIEMVTADVHGNVAAWTAEGEEIWER---------PTVGDV 550
Query: 565 DGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTI 624
+GDG ++VVVPT+SGNIYVLSGKDGSK++P+PYRTHGR+M+ VLL+D++K EKSKGLT+
Sbjct: 551 NGDGRTEVVVPTVSGNIYVLSGKDGSKIQPFPYRTHGRIMSPVLLLDMSKHDEKSKGLTL 610
Query: 625 VTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTP 684
TTSFDGYLYLI+G + CADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTP
Sbjct: 611 ATTSFDGYLYLIEGSSGCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTP 670
Query: 685 APHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSG 744
+PHHPLK WRS NQGRNN A RYNR GIYV H SR FRDEEG++FWVE EIVD+YR P G
Sbjct: 671 SPHHPLKEWRSSNQGRNNAAYRYNREGIYVKHGSRTFRDEEGKHFWVEFEIVDKYRVPYG 730
Query: 745 SQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGL 804
+QAPYNVT TLLVPGNYQGERRI + + GK R+KLPTV VRTTGTVLVEMVDKNG
Sbjct: 731 NQAPYNVTVTLLVPGNYQGERRIVVNAAYNEPGKQRMKLPTVPVRTTGTVLVEMVDKNGF 790
Query: 805 YFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTD 856
YFSDEFSLTFHM+YYKLLKWL++LPMLGMF VLVILRPQE PLPSFSRN D
Sbjct: 791 YFSDEFSLTFHMHYYKLLKWLVLLPMLGMFSVLVILRPQEGAPLPSFSRNID 842
>gi|218194018|gb|EEC76445.1| hypothetical protein OsI_14141 [Oryza sativa Indica Group]
Length = 842
Score = 1163 bits (3009), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/831 (69%), Positives = 671/831 (80%), Gaps = 22/831 (2%)
Query: 31 NKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKLD 90
NKFRQREATDD LG P +DEDAL+ T+CPK++ELRWQTEVSSSIYATPLIADINSDGKL+
Sbjct: 29 NKFRQREATDDMLGYPHLDEDALLKTKCPKHVELRWQTEVSSSIYATPLIADINSDGKLE 88
Query: 91 IVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLF 150
+VVPSF+HYLEVLEGSDGDK+PGWPAFHQS+VHSSPLLYDIDKDG REI LATYNG V F
Sbjct: 89 VVVPSFVHYLEVLEGSDGDKLPGWPAFHQSNVHSSPLLYDIDKDGTREIVLATYNGVVNF 148
Query: 151 FRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKSMLETK 210
FRVSGYMM DKLE+PRRKV KDWYVGL+ DPVDRSHPDVHD I +++ + ++ K
Sbjct: 149 FRVSGYMMMDKLEVPRRKVHKDWYVGLNPDPVDRSHPDVHDSSIAKKAASEESHPNIQDK 208
Query: 211 KSTPETNATVTTSTESNPAPATVSNPDVKKVNESLVNVSNPSEERKVNESHTEMNIKLPT 270
P N + S + +T D K V E K N + + N+ +
Sbjct: 209 ---PVVNESSKESQSRSTNDSTTQGVDSMKHASKEEPV-----ESKPNSTRGQENMDVLN 260
Query: 271 SVDNSSTTTVSGGTNSSENGTNTGRRLL---EDNNSKGSQE--GNDKEDVPVATAENDQA 325
+++++ S + ++EN ++ RRLL E +N GS E +D AT EN +
Sbjct: 261 NLNSTDAGNNSSLSTTTENASHVQRRLLQTDEKSNQAGSSETDASDTGTAKAATVENSEP 320
Query: 326 LDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILS 385
L+ +AD+SF LFRD ++L DEYNYDYDDYVD+ MWGDE+W E+QHEK EDYV++D+HILS
Sbjct: 321 LEADADASFNLFRDVEDLPDEYNYDYDDYVDETMWGDEDWKEQQHEKAEDYVSIDAHILS 380
Query: 386 TPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQV 445
TPVIADID DG+ EM+I+VSYFFDHEYYD PEHLKELGGIDIGKY+A +IVVFNLDT+QV
Sbjct: 381 TPVIADIDRDGIQEMVISVSYFFDHEYYDKPEHLKELGGIDIGKYIASSIVVFNLDTRQV 440
Query: 446 KWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPL 505
KWT +LDLSTD+ +F A+ YSSPTVVDLDGDGNLDILVGTSFGLFYV+DH GK+R KFPL
Sbjct: 441 KWTAELDLSTDSGNFTAHAYSSPTVVDLDGDGNLDILVGTSFGLFYVIDHRGKVRNKFPL 500
Query: 506 EMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVD 565
EMAEI V+AADINDDGKIE+VT D HGNVAAWTA G+ IWE+ P++GDV+
Sbjct: 501 EMAEIHAPVIAADINDDGKIEMVTADVHGNVAAWTAGGEEIWER---------PTVGDVN 551
Query: 566 GDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIV 625
GDG ++VVVPT+SGNIYVLSGKDGSK++P+PYRTHGR+M+ VLL+D++K EKSKGLT+
Sbjct: 552 GDGRTEVVVPTVSGNIYVLSGKDGSKIQPFPYRTHGRIMSPVLLLDMSKHDEKSKGLTLA 611
Query: 626 TTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPA 685
TTSFDGYLYLI+G + CADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTP+
Sbjct: 612 TTSFDGYLYLIEGSSGCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPS 671
Query: 686 PHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGS 745
PHHPLK WRS NQGRNN A RYNR GIYV H SR FRDEEG++FWVE EIVD+YR P G+
Sbjct: 672 PHHPLKEWRSSNQGRNNAAYRYNREGIYVKHGSRTFRDEEGKHFWVEFEIVDKYRVPYGN 731
Query: 746 QAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLY 805
QAPYNVT TLLVPGNYQGERRI + + GK R+KLPTV VRTTGTVLVEMVDKNG Y
Sbjct: 732 QAPYNVTVTLLVPGNYQGERRIVVNAAYNEPGKQRMKLPTVPVRTTGTVLVEMVDKNGFY 791
Query: 806 FSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTD 856
FSDEFSLTFHM+YYKLLKWL++LPMLGMF VLVILRPQE PLPSFSRN D
Sbjct: 792 FSDEFSLTFHMHYYKLLKWLVLLPMLGMFSVLVILRPQEGAPLPSFSRNID 842
>gi|224139008|ref|XP_002322957.1| predicted protein [Populus trichocarpa]
gi|222867587|gb|EEF04718.1| predicted protein [Populus trichocarpa]
Length = 701
Score = 1066 bits (2758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 536/712 (75%), Positives = 605/712 (84%), Gaps = 48/712 (6%)
Query: 11 VLLICLLLFNSARGGDNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEV 70
V LIC LLF ++ GD S++NKFR REATDD LG P IDE+AL+NTQCPKNLELRWQTEV
Sbjct: 8 VFLICFLLFTTSIHGDESKKNKFRDREATDDALGYPDIDENALLNTQCPKNLELRWQTEV 67
Query: 71 SSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYD 130
SSS+YATPLIADINSDGKLD+VVPSF+HYLE LEGSDGDK+PGWPAFHQS+VH+SPLLYD
Sbjct: 68 SSSVYATPLIADINSDGKLDVVVPSFVHYLEALEGSDGDKIPGWPAFHQSTVHASPLLYD 127
Query: 131 IDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVH 190
IDKDGVREIALATYNGEVLFFRVSGYMMTDKLE+PRR+V+K+WYVGL DPVDRSHPDVH
Sbjct: 128 IDKDGVREIALATYNGEVLFFRVSGYMMTDKLEVPRRRVKKNWYVGLDPDPVDRSHPDVH 187
Query: 191 DDLIVQESEAARMKSML--ETKKSTPETNATVTTSTE-SNPAPATVSNPDVKKVNESLVN 247
DD +V E+ + +S T ++TPETN++++TSTE S+PA A++
Sbjct: 188 DDQLVLEATENKSQSHTTGNTHQNTPETNSSISTSTENSHPANASIETG----------- 236
Query: 248 VSNPSEERKVNESHTEMNIKLPTSVDNSS---------------------TTTVSGGTNS 286
+K++E+ T+ IKL + VDNSS T TV TN+
Sbjct: 237 -------KKMSENQTKTMIKLSSQVDNSSVGAGSNGTDNAQNGTNKTQNGTITVEKETNN 289
Query: 287 SENGTNTGRRLLEDNNSKGSQEG------NDKEDVPVATAENDQALDENADSSFELFRDT 340
+ENGTNTGRRLLED+NSKGS EG ND E+V AT END+ L+ +ADSSFELFR++
Sbjct: 290 AENGTNTGRRLLEDDNSKGSHEGGSESKENDHENVHAATVENDEGLEADADSSFELFRES 349
Query: 341 DELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEM 400
DELADEY+YDYDDYVD++MWGDEEW E +HE++EDYVN+DSHIL TPVIADIDNDGV+EM
Sbjct: 350 DELADEYSYDYDDYVDESMWGDEEWKERKHERLEDYVNIDSHILCTPVIADIDNDGVTEM 409
Query: 401 IIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASF 460
I+AVSYFFDHEYYDNPEHLKELG ID+GKYVA ++VVFNLDTKQVKWT +LDLST A F
Sbjct: 410 IVAVSYFFDHEYYDNPEHLKELGDIDVGKYVASSVVVFNLDTKQVKWTRELDLSTSTAKF 469
Query: 461 RAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIN 520
RAYIYSSP+VVDLDGDGNLDILVGTSFGLFYVLDHHG IRE FPLEMAEIQGAVVAADIN
Sbjct: 470 RAYIYSSPSVVDLDGDGNLDILVGTSFGLFYVLDHHGNIRENFPLEMAEIQGAVVAADIN 529
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGN 580
DDGK ELVTTD HGNVAAWT++GK IWE+HLKSLV+QGP+IGDVDGDGH+DVVVPTLSGN
Sbjct: 530 DDGKTELVTTDVHGNVAAWTSQGKEIWEKHLKSLVSQGPTIGDVDGDGHTDVVVPTLSGN 589
Query: 581 IYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPT 640
IYVLSGKDGS VRPYPYRTHGRVMNQVLL+DL+KRGEK+KGLT+VTTSFDGYLYLIDGPT
Sbjct: 590 IYVLSGKDGSIVRPYPYRTHGRVMNQVLLLDLSKRGEKNKGLTLVTTSFDGYLYLIDGPT 649
Query: 641 SCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKA 692
SCADVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFSTP PHHPLKA
Sbjct: 650 SCADVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFSTPVPHHPLKA 701
>gi|186509913|ref|NP_001118602.1| defective in exine formation protein (DEX1) [Arabidopsis thaliana]
gi|332641199|gb|AEE74720.1| defective in exine formation protein (DEX1) [Arabidopsis thaliana]
Length = 817
Score = 1065 bits (2753), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 549/770 (71%), Positives = 620/770 (80%), Gaps = 50/770 (6%)
Query: 30 QNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKL 89
+NKFR+R+ATDD+LG P IDEDAL+NTQCPK LELRWQTEV+SS+YATPLIADINSDGKL
Sbjct: 25 ENKFRERKATDDELGYPDIDEDALLNTQCPKKLELRWQTEVTSSVYATPLIADINSDGKL 84
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVL 149
DIVVPSF+HYLEVLEG+DGDKMPGWPAFHQS+VHSSPLL+DIDKDGVREIALATYN EVL
Sbjct: 85 DIVVPSFVHYLEVLEGADGDKMPGWPAFHQSNVHSSPLLFDIDKDGVREIALATYNAEVL 144
Query: 150 FFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKS---- 205
FFRVSG++M+DKLE+PRRKV K+W+VGL+ DPVDRSHPDVHDD++ E EA MKS
Sbjct: 145 FFRVSGFLMSDKLEVPRRKVHKNWHVGLNPDPVDRSHPDVHDDVL--EEEAMAMKSSTTQ 202
Query: 206 ----------MLETKKSTPETNATVTTST-----ESNPAPATVSNPDVKKVNESL----- 245
+ K N+ V+T E+N A V P + N S+
Sbjct: 203 TNATTTTPNVTVSMTKEVHGANSYVSTQEDQKRPENNQTEAIVK-PTPELHNSSMDAGAN 261
Query: 246 ----------------VNVS-NPSEERKVNESHTEMNIKLPTSVDNSSTTT-VSGGTNSS 287
NV+ N ++ K++ E IKL TS NSS T SG ++++
Sbjct: 262 NLAANATTAGSRENLNRNVTTNEVDQSKISGDKNETVIKLNTSTGNSSETLGTSGNSSTA 321
Query: 288 ENGTNTGRRLLEDNNSKGSQEG-----NDKEDVPVATAENDQALDENADSSFELFRDTDE 342
E T +GRRLLE++ SK S + ++ E V +AT END L+ +ADSSFEL R+ DE
Sbjct: 322 ETVTKSGRRLLEEDGSKESVDSHSDSKDNSEGVRMATVENDGGLEADADSSFELLRENDE 381
Query: 343 LADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMII 402
LADEY+YDYDDYVD+ MWGDEEW E QHE EDYVN+D+HIL TPVIADID DGV EMI+
Sbjct: 382 LADEYSYDYDDYVDEKMWGDEEWVEGQHENSEDYVNIDAHILCTPVIADIDKDGVQEMIV 441
Query: 403 AVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRA 462
AVSYFFD EYYDNPEHLKELGGIDI Y+A +IVVFNLDTKQVKW +LDLSTD A+FRA
Sbjct: 442 AVSYFFDPEYYDNPEHLKELGGIDIKNYIASSIVVFNLDTKQVKWIKELDLSTDKANFRA 501
Query: 463 YIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDD 522
YIYSSPTVVDLDGDG LDILVGTSFGLFY +DH G IREKFPLEMAEIQGAVVAADINDD
Sbjct: 502 YIYSSPTVVDLDGDGYLDILVGTSFGLFYAMDHRGNIREKFPLEMAEIQGAVVAADINDD 561
Query: 523 GKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIY 582
GKIELVTTD+HGN+AAWT +G IWE HLKSLV QGPSIGDVDGDGH++VVVPT SGNIY
Sbjct: 562 GKIELVTTDSHGNIAAWTTQGVEIWEAHLKSLVPQGPSIGDVDGDGHTEVVVPTSSGNIY 621
Query: 583 VLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSC 642
VLSGKDGS VRPYPYRTHGRVMNQ+LLVDL KRGEK KGLTIVTTSFDGYLYLIDGPTSC
Sbjct: 622 VLSGKDGSIVRPYPYRTHGRVMNQLLLVDLNKRGEKKKGLTIVTTSFDGYLYLIDGPTSC 681
Query: 643 ADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNN 702
DVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFSTP+PHHPLKAWRS +QGRNN
Sbjct: 682 TDVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFSTPSPHHPLKAWRSSDQGRNN 741
Query: 703 VAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVT 752
A RY+R G++VTH +R FRDEEG+NFW EIEIVD+YR+PSGSQAPYNVT
Sbjct: 742 KANRYDREGVFVTHSTRGFRDEEGKNFWAEIEIVDKYRYPSGSQAPYNVT 791
>gi|242037555|ref|XP_002466172.1| hypothetical protein SORBIDRAFT_01g002810 [Sorghum bicolor]
gi|241920026|gb|EER93170.1| hypothetical protein SORBIDRAFT_01g002810 [Sorghum bicolor]
Length = 738
Score = 1011 bits (2615), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 519/753 (68%), Positives = 608/753 (80%), Gaps = 34/753 (4%)
Query: 113 GWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKD 172
GWPAFHQS+VHSSPLLYDIDKDGVREIALATYNG V FFRVSGYMM DKLE+PRRKVRKD
Sbjct: 11 GWPAFHQSNVHSSPLLYDIDKDGVREIALATYNGVVNFFRVSGYMMMDKLEVPRRKVRKD 70
Query: 173 WYVGLHSDPVDRSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPAT 232
WYVGL+ DPVDRSHPDVHD I +E+ T K TP + + S E
Sbjct: 71 WYVGLNPDPVDRSHPDVHDSSIAKEA----------TSKETPVIDQNKSGSMEG------ 114
Query: 233 VSNPDVKKVNESLVNVSNP-SEERKVNESHTEMNIKLPTSVDNSSTTTVSGGTNSSEN-- 289
E+L N S S E K+N + + N++LP +V+++ + ++S T +EN
Sbjct: 115 ---------GETLKNTSEQHSVETKLNSTQAQENVQLPNNVNDTHSGSISNVTTKAENTS 165
Query: 290 GTNTGRRLLEDNNSKGSQEGNDKEDVP------VATAENDQALDENADSSFELFRDTDEL 343
++T RRLL+ + Q G+ K AT EN + LDE+AD+SF+LFRD ++L
Sbjct: 166 QSHTQRRLLQTADKSDEQTGSSKTHESDSGAKVAATVENGEPLDEDADASFDLFRDPEDL 225
Query: 344 ADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIA 403
DEYNYDYDDYVDD +WGDE+W E++HEK EDYV++D+HILSTPVIADID DGV EM+IA
Sbjct: 226 PDEYNYDYDDYVDDRLWGDEDWKEQEHEKAEDYVSIDAHILSTPVIADIDKDGVQEMVIA 285
Query: 404 VSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAY 463
VSYFFDHEYY + +H KEL GIDIGKYVA +IVVFNLDT+QVKW +LDLSTD FRA+
Sbjct: 286 VSYFFDHEYYRDSDHAKELEGIDIGKYVASSIVVFNLDTRQVKWAAELDLSTDTVHFRAH 345
Query: 464 IYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDG 523
++SSPTVVDLDGDG LDIL+GT++G FYV+DH GK+R KFPLEMAEI V+AADINDDG
Sbjct: 346 VFSSPTVVDLDGDGYLDILIGTAYGYFYVIDHRGKVRNKFPLEMAEIHAPVIAADINDDG 405
Query: 524 KIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYV 583
KIE+VTTD+HGNVAAWTAEG+ IWE HLKSL+ Q P++GDV+GDGH+DVVVPT+SGNIYV
Sbjct: 406 KIEMVTTDSHGNVAAWTAEGEEIWEVHLKSLIPQRPTVGDVNGDGHTDVVVPTVSGNIYV 465
Query: 584 LSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCA 643
LSGKDGSK++P+PYRTHGR+M+ VLL+D++K GE +KGLT+ TTSFDGYLYLI+G + CA
Sbjct: 466 LSGKDGSKIQPFPYRTHGRIMSPVLLLDMSKHGENAKGLTLATTSFDGYLYLIEGSSGCA 525
Query: 644 DVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNV 703
DVVDIGETSY+MVLADNVDGGDDLDLIVTTMNGNVFCFST +PHHPLK WRS NQGRNN
Sbjct: 526 DVVDIGETSYTMVLADNVDGGDDLDLIVTTMNGNVFCFSTQSPHHPLKEWRSSNQGRNNA 585
Query: 704 AIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQG 763
A +YNR GIYV H SR FRDEEG+NFWVE EIVD+YR P G+QAPYNVT TLLVPGNYQG
Sbjct: 586 AYQYNREGIYVKHGSRTFRDEEGKNFWVEFEIVDKYRVPYGNQAPYNVTVTLLVPGNYQG 645
Query: 764 ERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLK 823
+RRI S ++ + GK R+ LPTV VRTTGTV+VEMVDKNGLYFSDEFSLTFHM+YYKLLK
Sbjct: 646 DRRIVVSGLYHQPGKQRMMLPTVPVRTTGTVVVEMVDKNGLYFSDEFSLTFHMHYYKLLK 705
Query: 824 WLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTD 856
WLL+LPMLGMFGVLVILRPQE PLPSFSRN D
Sbjct: 706 WLLLLPMLGMFGVLVILRPQEGAPLPSFSRNVD 738
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 46/92 (50%), Gaps = 2/92 (2%)
Query: 66 WQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSS 125
W+ + S I P + D+N DG D+VVP+ + VL G DG K+ +P + S
Sbjct: 429 WEVHLKSLIPQRPTVGDVNGDGHTDVVVPTVSGNIYVLSGKDGSKIQPFPYRTHGRIMSP 488
Query: 126 PLLYDIDKDG--VREIALATYNGEVLFFRVSG 155
LL D+ K G + + LAT + + + + G
Sbjct: 489 VLLLDMSKHGENAKGLTLATTSFDGYLYLIEG 520
>gi|168061167|ref|XP_001782562.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162665969|gb|EDQ52637.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 867
Score = 949 bits (2453), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 494/849 (58%), Positives = 600/849 (70%), Gaps = 106/849 (12%)
Query: 26 DNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINS 85
++ + NKF EA+DD LG P ++ +AL NT+CPK +ELRWQTEVSSSIYATPLI D+N+
Sbjct: 41 EHVQSNKFLANEASDDSLGFPTLNAEALANTRCPKQVELRWQTEVSSSIYATPLITDLNT 100
Query: 86 DGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYN 145
DGKL+IVVPSF+HYLEVLEGSDG+K+PGWPA HQS+VHSS L+YDIDKDG+RE+ALAT+N
Sbjct: 101 DGKLEIVVPSFVHYLEVLEGSDGEKLPGWPASHQSTVHSSALMYDIDKDGIREVALATFN 160
Query: 146 GEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKS 205
GEVLFFR SG++M +KL +PRR+VRKDWYVGL D DRS DVHDD ++ + A
Sbjct: 161 GEVLFFRPSGFLMGEKLVVPRRRVRKDWYVGLSPDHADRSKNDVHDDSLIDLNTIAP--- 217
Query: 206 MLETKKSTPETNATVT---TSTESNPAPATVSNPDVKKVNESLVNVSNPSEERKVNESHT 262
L S TNA++ T +S PAPA
Sbjct: 218 -LNNTYSIHGTNASLAGNNTLQQSGPAPA------------------------------- 245
Query: 263 EMNIKLPTSVDNSSTTTVSGGTNSSENGTNTGRRLLEDNNSKGSQEGNDKEDVPVATAEN 322
VS G+++S T RRLLE+ +KE AT EN
Sbjct: 246 -----------------VSTGSHASSQNV-TKRRLLEE---------PEKE----ATVEN 274
Query: 323 DQA--LDENADSSFELFRDTDELADEYN---------YDYDDYVDDAMWGDEEWTEEQHE 371
D + L+++ADSSF++FR+ +E+A+++ YDYDDYVD++MW D+ W+E QH
Sbjct: 275 DDSNVLEDDADSSFDVFRNAEEIAEDHGDQGLLDHDTYDYDDYVDESMWSDDSWSEAQHL 334
Query: 372 KIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHE-------------------- 411
K D++++D+HIL TPVIADID DGV E+I+A SYFFD E
Sbjct: 335 KEADFIDIDAHILCTPVIADIDKDGVDELIVAASYFFDREIDRKDCKQHVDGSVGRRIEG 394
Query: 412 -----YYDNPEHLKEL-GGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIY 465
YYD PEH KEL G+DI KYV G+IVVFNLDTKQVKWT +LDLSTD+ S+RAYIY
Sbjct: 395 RAKFLYYDLPEHAKELPDGLDISKYVGGSIVVFNLDTKQVKWTVELDLSTDSVSYRAYIY 454
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKI 525
SSPTVVD+D DG +I+VGTSFG YVL H+G ++E FPL M EIQG VVA DINDDGKI
Sbjct: 455 SSPTVVDVDEDGFSEIVVGTSFGFLYVLQHNGTLKEPFPLLMGEIQGQVVAGDINDDGKI 514
Query: 526 ELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLS 585
E+V D GNVAA+T +GK +W HLKS+V QGP+IGDVDGDGH+DVVVPT SG I+VL
Sbjct: 515 EIVGADVRGNVAAFTGDGKELWTVHLKSIVAQGPTIGDVDGDGHTDVVVPTASGKIFVLR 574
Query: 586 GKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADV 645
G DG V P+P+RTHGRVM LLVDL KR + KGLTI TSFDGY YLI+G + CA+
Sbjct: 575 GSDGVFVAPFPFRTHGRVMAPALLVDLNKRKAERKGLTIAVTSFDGYFYLIEGSSGCAEA 634
Query: 646 VDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAI 705
+DIGETSYSMVLADNVDGGDDLDLIVTTMNGNV+CF TPAPHHPLKAW S QGRN VA
Sbjct: 635 IDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVYCFQTPAPHHPLKAWPSQLQGRNVVAS 694
Query: 706 RYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGER 765
+ NR GIYV SR+FRDE G NFWVE +IVD++R +GS YNVT TLLVPGNYQG +
Sbjct: 695 KINRQGIYVLPKSRSFRDEAGTNFWVEFKIVDQHRPMAGSPGIYNVTVTLLVPGNYQGLK 754
Query: 766 RIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWL 825
R+ Q++I+ G Y++K+P V VRTTG+V+VEM DKNGLYFSDEF+LTFHM YY+LLKWL
Sbjct: 755 RLVQNKIYTSPGVYQVKVPCVSVRTTGSVVVEMEDKNGLYFSDEFALTFHMRYYRLLKWL 814
Query: 826 LVLPMLGMF 834
+ LP +GM
Sbjct: 815 ICLPFVGML 823
>gi|302787711|ref|XP_002975625.1| hypothetical protein SELMODRAFT_50714 [Selaginella moellendorffii]
gi|300156626|gb|EFJ23254.1| hypothetical protein SELMODRAFT_50714 [Selaginella moellendorffii]
Length = 822
Score = 933 bits (2411), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/845 (55%), Positives = 597/845 (70%), Gaps = 43/845 (5%)
Query: 30 QNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKL 89
QNKFR REA+DD LG P +DE+AL NTQCP+NLELRWQ E S+SIYA PLIADIN DGKL
Sbjct: 1 QNKFRSREASDDLLGYPNLDEEALANTQCPRNLELRWQAEASASIYAAPLIADINGDGKL 60
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVL 149
DIVVPSF+HYL+VLEG+DG+KM GWPA+HQS+VHSSPLLYDIDKDG +EI L TYNGEVL
Sbjct: 61 DIVVPSFVHYLDVLEGADGEKMQGWPAYHQSTVHSSPLLYDIDKDGYKEILLPTYNGEVL 120
Query: 150 FFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAA-------- 201
FR SGY + +K+ +PR +VRKDWYVGL D VDRSHPDVHDD +V S
Sbjct: 121 MFRSSGYPINEKITVPRLRVRKDWYVGLSPDHVDRSHPDVHDDGLVIPSPPGTYQCLCVK 180
Query: 202 RMKSMLETKKSTPETNATVTTSTESNPAPATVSNPDVKKVNESLVNVSNPSEERKVNESH 261
++S + + + ++T+ T N +T+ D + S NVS + ++ +
Sbjct: 181 SIRSCVSSDHAVANASSTIATKDGQN-GSSTIETKDGQN-GSSPGNVSGKLDNSTLSNTP 238
Query: 262 TEMNIKLPTSVDNSSTTT-----------VSGGTNSSENGTNTGRRLLEDNNSKGSQEGN 310
+N+ + N S T + GT S N T RRLL+ + G
Sbjct: 239 ETLNVSATATAGNGSATLKVSDVAEAGSELRMGTTSPNNTQGTQRRLLQQEATAG----- 293
Query: 311 DKEDVPVATAENDQALDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQH 370
+N + L+ +A+++F++FRD L+DEY YDYDDYVD++MW D+ W + H
Sbjct: 294 ----------DNREKLEADAEATFDVFRDKGGLSDEYAYDYDDYVDESMWTDDHWVQAVH 343
Query: 371 EKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYD-NPEHLKELGGIDIGK 429
E+ ED++++D+HIL TPVIADID DGV E+I+A S+FFD EYY+ NP+ L + +DI K
Sbjct: 344 EREEDFIDIDAHILCTPVIADIDKDGVDELIVAASFFFDREYYEANPDLLSK--DVDISK 401
Query: 430 YVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGL 489
YVAGAI VF+LDTKQVKW DLSTD+ +FRAYIYSSPTVVDLDGDG L I+VGTSFG
Sbjct: 402 YVAGAIFVFSLDTKQVKWKIHFDLSTDSVAFRAYIYSSPTVVDLDGDGYLSIVVGTSFGF 461
Query: 490 FYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQ 549
Y L ++G + FPL+M EI VVAAD+NDDGK+E+VT DT GNV AW +GK +WE
Sbjct: 462 VYALHYNGNTWKNFPLQMGEIHAQVVAADVNDDGKVEIVTADTRGNVVAWETDGKLLWEV 521
Query: 550 HLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLL 609
HLKS++ Q P++GDV+GDG +DVVVPT SGNIYVL G DGS V P+P+RTHGR+M VLL
Sbjct: 522 HLKSMIAQAPTVGDVNGDGITDVVVPTASGNIYVLKGTDGSYVPPFPFRTHGRIMASVLL 581
Query: 610 VDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDL 669
+D + + + +V TSFDGYLY+IDG T CAD DIGETSY+MVL +NVDGGDDLDL
Sbjct: 582 LDFGNKESEQADVVLVATSFDGYLYIIDGKTGCADATDIGETSYTMVLVENVDGGDDLDL 641
Query: 670 IVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNF 729
IVTTMNGNV+CF TP HHPLKAW + NQGRN ++ RYNR G+Y SR FRDE G +F
Sbjct: 642 IVTTMNGNVYCFQTPVKHHPLKAWPTANQGRNVLSPRYNREGVYALPSSRKFRDEAGESF 701
Query: 730 WVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVR 789
WV +IVD++ PYNVT TLLVPGNY RRI Q + + G ++K+P V +R
Sbjct: 702 WVVFKIVDQHNL----HGPYNVTLTLLVPGNYHEPRRITQWHRYDQPGVQKLKVPCVPIR 757
Query: 790 TTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLP 849
+TGTV +EMVD++GL F+DEFSLTFHM+YY+LLKW +VLP+LGM +V L P++ + LP
Sbjct: 758 STGTVTLEMVDRHGLAFTDEFSLTFHMHYYRLLKWFVVLPLLGMLVAIVGLHPEDHVSLP 817
Query: 850 SFSRN 854
SFS N
Sbjct: 818 SFSSN 822
>gi|302783661|ref|XP_002973603.1| hypothetical protein SELMODRAFT_50757 [Selaginella moellendorffii]
gi|300158641|gb|EFJ25263.1| hypothetical protein SELMODRAFT_50757 [Selaginella moellendorffii]
Length = 825
Score = 933 bits (2411), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/848 (55%), Positives = 598/848 (70%), Gaps = 46/848 (5%)
Query: 30 QNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKL 89
QNKFR REA+DD LG P +DE+AL NTQCP+NLELRWQ E S+SIYA PLIADIN DGKL
Sbjct: 1 QNKFRSREASDDLLGYPNLDEEALANTQCPRNLELRWQAEASASIYAAPLIADINGDGKL 60
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVL 149
DIVVPSF+HYL+VLEG+DG+KM GWPA+HQS+VHSSPLLYDIDKDG +EI L TYNGEVL
Sbjct: 61 DIVVPSFVHYLDVLEGADGEKMQGWPAYHQSTVHSSPLLYDIDKDGYKEILLPTYNGEVL 120
Query: 150 FFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAA-------- 201
FR SGY + +K+ +PR +VRKDWYVGL D VDRSHPDVHDD +V S
Sbjct: 121 MFRSSGYQINEKITVPRLRVRKDWYVGLSPDHVDRSHPDVHDDGLVIPSPPGTYQCLCVK 180
Query: 202 RMKSMLETKKSTPETNATVTTSTESNPAPATVSNPDVKKVNESLVNVSNPSEERKVNESH 261
++S + + + ++T+ T N +T+ D + S NVS + ++ +
Sbjct: 181 SIRSCVSSDHAVANASSTIATKDGQN-GSSTIETKDGQN-GSSPGNVSAKLDNSTLSNTP 238
Query: 262 TEMNIKLPTSVDNSSTTT-----------VSGGTNSSENGTNTGRRLLEDNNSKGSQEGN 310
+N+ + N S T + GT S N T RRLL+ + G
Sbjct: 239 ETLNVSATATAGNGSATLKVSDVAEAGSELRMGTTSPNNTQGTQRRLLQQEATAG----- 293
Query: 311 DKEDVPVATAENDQALDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQH 370
+N + L+ +A+++F++FRD L+DEY YDYDDYVD++MW D+ W H
Sbjct: 294 ----------DNREKLEADAEATFDVFRDKGGLSDEYAYDYDDYVDESMWTDDHWVRAVH 343
Query: 371 EKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYD-NPEHLKELGGIDIGK 429
E+ ED++++D+HIL TPVIADID DGV E+I+A S+FFD EYY+ NP+ L + +DI K
Sbjct: 344 EREEDFIDIDAHILCTPVIADIDKDGVDELIVAASFFFDREYYEANPDLLSK--DVDISK 401
Query: 430 YVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGL 489
YVAGAI VF+LDTKQVKW DLSTD+ +FRAYIYSSPTVVDLDGDG L I+VGTSFG
Sbjct: 402 YVAGAIFVFSLDTKQVKWKIHFDLSTDSVAFRAYIYSSPTVVDLDGDGYLSIVVGTSFGF 461
Query: 490 FYVLDHHGK---IREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGI 546
Y L ++G +R+ FPL+M EI VVAAD+NDDGK+E+VT DT GNV AW +GK +
Sbjct: 462 VYALHYNGNTCIVRKNFPLQMGEIHAQVVAADVNDDGKVEIVTADTRGNVVAWETDGKLL 521
Query: 547 WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
WE HLKS++ Q P++GDV+GDG +DVVVPT SGNIYVL G DGS V P+P+RTHGR+M
Sbjct: 522 WEVHLKSMIAQAPTVGDVNGDGITDVVVPTASGNIYVLKGTDGSYVPPFPFRTHGRIMAS 581
Query: 607 VLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDD 666
VLL+D + + + +V TSFDGYLY+IDG T CAD DIGETSY+MVLA+NVDGGDD
Sbjct: 582 VLLLDFGNKESEQADVVLVATSFDGYLYIIDGKTGCADATDIGETSYTMVLAENVDGGDD 641
Query: 667 LDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEG 726
LDLIVTTMNGNV+CF TP HHPLKAW + N GRN ++ RYNR G+Y SR FRDE G
Sbjct: 642 LDLIVTTMNGNVYCFQTPVKHHPLKAWPTANHGRNVLSPRYNREGVYALPSSRKFRDEAG 701
Query: 727 RNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTV 786
+FWV +IVD++ PYNVT TLLVPGNY RRI Q + + G ++K+P V
Sbjct: 702 ESFWVVFKIVDQHNL----HGPYNVTLTLLVPGNYHEPRRIMQWHRYDQPGVQKLKVPCV 757
Query: 787 GVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAM 846
+R+TGTV +EMVD++GL F+DEFSLTFHM+YY+LLKW +VLP+LGM +V L P++ +
Sbjct: 758 PIRSTGTVTLEMVDRHGLAFTDEFSLTFHMHYYRLLKWFVVLPLLGMLVAIVGLHPEDHV 817
Query: 847 PLPSFSRN 854
LPSFS N
Sbjct: 818 SLPSFSSN 825
>gi|414873719|tpg|DAA52276.1| TPA: hypothetical protein ZEAMMB73_385672 [Zea mays]
Length = 627
Score = 866 bits (2238), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 440/626 (70%), Positives = 523/626 (83%), Gaps = 9/626 (1%)
Query: 240 KVNESLVNVSNP-SEERKVNESHTEMNIKLPTSVDNSSTTTVSGGTNSSEN--GTNTGRR 296
K E+L N S S E K+N + + N++LP +V+++ + ++S T +EN T+T RR
Sbjct: 2 KGGEALKNSSEQHSAETKLNSTQVQENVQLPNNVNDTHSESISSVTTKAENTSQTHTQRR 61
Query: 297 LLEDNNSKGSQEG------NDKEDVPVATAENDQALDENADSSFELFRDTDELADEYNYD 350
LLE + Q G +D AT EN + LDE+AD+SF+LFRD ++L DEYNYD
Sbjct: 62 LLETADKSDDQTGSSETHESDSGAKAAATVENSEPLDEDADASFDLFRDPEDLPDEYNYD 121
Query: 351 YDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDH 410
YDDYVDD +WGDE+W E++HEK EDYV++D+HILSTPVIADID DGV EM+IAVSYFFDH
Sbjct: 122 YDDYVDDRLWGDEDWKEQEHEKEEDYVSIDAHILSTPVIADIDKDGVQEMVIAVSYFFDH 181
Query: 411 EYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTV 470
EYY + +H KEL GID+GKYVA +IVVFNLDT+QVKW DLDLSTD +FRA+++SSPTV
Sbjct: 182 EYYRDSDHAKELEGIDMGKYVASSIVVFNLDTRQVKWAADLDLSTDAVNFRAHVFSSPTV 241
Query: 471 VDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTT 530
VDLDGDG LDIL+GT++GLFYV+DH GK+R KFPLEMAEI V+AADINDDGKIE+VTT
Sbjct: 242 VDLDGDGYLDILIGTAYGLFYVIDHRGKVRNKFPLEMAEIHAPVIAADINDDGKIEMVTT 301
Query: 531 DTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGS 590
D+HGNVAAWTAEG+ IWE HLKSL+ Q P++GDV+GDGH+DVVVPT+SGNIYVLSGKDGS
Sbjct: 302 DSHGNVAAWTAEGEEIWEVHLKSLIPQRPTVGDVNGDGHTDVVVPTVSGNIYVLSGKDGS 361
Query: 591 KVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGE 650
K++P+PYRTHGR+M+ VLL+D++K G+ +KGLT+ TTSFDGYLYLI+G + CADVVDIGE
Sbjct: 362 KIQPFPYRTHGRIMSPVLLLDMSKHGDNAKGLTLATTSFDGYLYLIEGSSGCADVVDIGE 421
Query: 651 TSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRA 710
TSY+MVLADNVDGGDDLDLIVTTMNGNVFCFST +PHHPLK WRS NQGRNN A +YNR
Sbjct: 422 TSYTMVLADNVDGGDDLDLIVTTMNGNVFCFSTQSPHHPLKEWRSSNQGRNNAAYQYNRE 481
Query: 711 GIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQS 770
GIYV H SR FRDEEG+NFW+E EIVD+YR P G QAPYNVT TLLVPGNYQG+RRI S
Sbjct: 482 GIYVKHGSRTFRDEEGKNFWIEFEIVDKYRVPYGKQAPYNVTVTLLVPGNYQGDRRIVVS 541
Query: 771 QIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPM 830
++ + GK R+ LPTV VRTTGTV+VEM+DKNGLYFSDEFSLTFHM+YYKLLKWLL+LPM
Sbjct: 542 GVYHQPGKQRMMLPTVPVRTTGTVVVEMIDKNGLYFSDEFSLTFHMHYYKLLKWLLLLPM 601
Query: 831 LGMFGVLVILRPQEAMPLPSFSRNTD 856
LGMFGVLVILRPQE PLPSFSRN D
Sbjct: 602 LGMFGVLVILRPQEGAPLPSFSRNID 627
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 46/92 (50%), Gaps = 2/92 (2%)
Query: 66 WQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSS 125
W+ + S I P + D+N DG D+VVP+ + VL G DG K+ +P + S
Sbjct: 318 WEVHLKSLIPQRPTVGDVNGDGHTDVVVPTVSGNIYVLSGKDGSKIQPFPYRTHGRIMSP 377
Query: 126 PLLYDIDKDG--VREIALATYNGEVLFFRVSG 155
LL D+ K G + + LAT + + + + G
Sbjct: 378 VLLLDMSKHGDNAKGLTLATTSFDGYLYLIEG 409
>gi|357468637|ref|XP_003604603.1| Defective in exine formation [Medicago truncatula]
gi|355505658|gb|AES86800.1| Defective in exine formation [Medicago truncatula]
Length = 521
Score = 761 bits (1965), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/463 (78%), Positives = 405/463 (87%), Gaps = 17/463 (3%)
Query: 411 EYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTV 470
+YYDN EH+KELG IDIGKYVAG IVVFNLDTKQVKWT +LD+STD A+FRAY+YSSPTV
Sbjct: 59 QYYDNQEHMKELGDIDIGKYVAGGIVVFNLDTKQVKWTAELDMSTDTANFRAYVYSSPTV 118
Query: 471 VDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTT 530
VDLDGDG LDILVGTS+GLFYVLDHHGK+REKFPLEMAEIQ VVAADINDDGKIELVT
Sbjct: 119 VDLDGDGYLDILVGTSYGLFYVLDHHGKVREKFPLEMAEIQAGVVAADINDDGKIELVTA 178
Query: 531 DTHGNVAAWTAEGKGIWEQHLKSLVTQ-----------------GPSIGDVDGDGHSDVV 573
DTHGNV AWT +G IWE+HLKSL+ P+IGD+DGDG +++V
Sbjct: 179 DTHGNVVAWTPKGDMIWEKHLKSLIPHVMYYLNLPWHVNECSMIAPTIGDIDGDGRTELV 238
Query: 574 VPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYL 633
VPTLSG I+VL G+DGS + YP+ THGR+MNQ+LLVDL+K+ EK KGLT+VT+SFDGYL
Sbjct: 239 VPTLSGKIHVLDGRDGSPIGRYPFITHGRIMNQILLVDLSKQKEKKKGLTLVTSSFDGYL 298
Query: 634 YLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAW 693
YLIDGPT CADVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFSTP+PHHPLKAW
Sbjct: 299 YLIDGPTGCADVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFSTPSPHHPLKAW 358
Query: 694 RSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTT 753
R NQGRNNVA RY R GIYVTHPSRAFRDEEG++F+VEIEIVD YR+PSG Q PY+VTT
Sbjct: 359 RLPNQGRNNVANRYGREGIYVTHPSRAFRDEEGKSFFVEIEIVDNYRYPSGHQGPYHVTT 418
Query: 754 TLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLT 813
+LLVPGNYQGER IKQ+Q + + GK+RIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLT
Sbjct: 419 SLLVPGNYQGERTIKQNQTYYQPGKHRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLT 478
Query: 814 FHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTD 856
FHM+YYKLLKWLLVLPMLGMFGVLVILRPQ +PLPSFSRN D
Sbjct: 479 FHMHYYKLLKWLLVLPMLGMFGVLVILRPQGPVPLPSFSRNND 521
Score = 46.6 bits (109), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 61/260 (23%), Positives = 108/260 (41%), Gaps = 44/260 (16%)
Query: 364 EWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVS----YFFDHEYYDNPEHL 419
+WT E + D N +++ S+P + D+D DG ++++ S Y DH +
Sbjct: 94 KWTAEL-DMSTDTANFRAYVYSSPTVVDLDGDGYLDILVGTSYGLFYVLDHHGKVREKFP 152
Query: 420 KELGGIDIGKYVA-----GAIVVFNLDTKQ--VKWTTDLDLSTDN--ASFRAYIYS---- 466
E+ I G A G I + DT V WT D+ + S ++
Sbjct: 153 LEMAEIQAGVVAADINDDGKIELVTADTHGNVVAWTPKGDMIWEKHLKSLIPHVMYYLNL 212
Query: 467 ----------SPTVVDLDGDGNLDILVGTSFGLFYVLD-HHGKIREKFP-LEMAEIQGAV 514
+PT+ D+DGDG +++V T G +VLD G ++P + I +
Sbjct: 213 PWHVNECSMIAPTIGDIDGDGRTELVVPTLSGKIHVLDGRDGSPIGRYPFITHGRIMNQI 272
Query: 515 VAADINDDGK----IELVTTDTHGNV-----AAWTAEGKGIWEQHLKSLVTQGPSIGDVD 565
+ D++ + + LVT+ G + A+ I E ++ +VD
Sbjct: 273 LLVDLSKQKEKKKGLTLVTSSFDGYLYLIDGPTGCADVVDIGETSYSMVLAD-----NVD 327
Query: 566 GDGHSDVVVPTLSGNIYVLS 585
G D++V T++GN++ S
Sbjct: 328 GGDDLDLIVSTMNGNVFCFS 347
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 39/81 (48%), Gaps = 2/81 (2%)
Query: 62 LELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSS 121
L L W S I P I DI+ DG+ ++VVP+ + VL+G DG + +P
Sbjct: 210 LNLPWHVNECSMI--APTIGDIDGDGRTELVVPTLSGKIHVLDGRDGSPIGRYPFITHGR 267
Query: 122 VHSSPLLYDIDKDGVREIALA 142
+ + LL D+ K ++ L
Sbjct: 268 IMNQILLVDLSKQKEKKKGLT 288
>gi|147769687|emb|CAN65534.1| hypothetical protein VITISV_018283 [Vitis vinifera]
Length = 375
Score = 636 bits (1640), Expect = e-179, Method: Compositional matrix adjust.
Identities = 314/381 (82%), Positives = 336/381 (88%), Gaps = 20/381 (5%)
Query: 477 GNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNV 536
GNL I V F+ + GKIREKFPLEMAEIQG VVAADINDDGKIELVT DTHGN+
Sbjct: 15 GNLYIYV------FFFNIYPGKIREKFPLEMAEIQGGVVAADINDDGKIELVTADTHGNI 68
Query: 537 AAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYP 596
AAWTA+GK IW H+KSLV Q P+IGDVDGDGH+DVVVPTLSGNIYVL+GKDG +VRPYP
Sbjct: 69 AAWTAQGKEIWVTHVKSLVPQAPTIGDVDGDGHTDVVVPTLSGNIYVLNGKDGLQVRPYP 128
Query: 597 YRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMV 656
YRTHGRVMNQVLLVDL+KRGEK KGLT+VTTSFDGYLYLIDGPTSCADVVDIGETSYSMV
Sbjct: 129 YRTHGRVMNQVLLVDLSKRGEKKKGLTLVTTSFDGYLYLIDGPTSCADVVDIGETSYSMV 188
Query: 657 LADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTH 716
LADNVDGGDDLDLIVTTMNGN AWRS NQGRNNVA R++R GIY++
Sbjct: 189 LADNVDGGDDLDLIVTTMNGN--------------AWRSPNQGRNNVANRHSREGIYISQ 234
Query: 717 PSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARR 776
SRAFRDEEG++FWVEIEIVD+YRFPSGSQAPYNVTTTLLVPGNYQGERRIKQ+Q F
Sbjct: 235 SSRAFRDEEGKSFWVEIEIVDKYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQNQTFDXA 294
Query: 777 GKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGV 836
GK+RIKLPTVGVRTTGTVLVEMVDKNGLYFSD+FSLTFHM+YYKLLKWLLVLPML MFGV
Sbjct: 295 GKHRIKLPTVGVRTTGTVLVEMVDKNGLYFSDDFSLTFHMHYYKLLKWLLVLPMLAMFGV 354
Query: 837 LVILRPQEAMPLPSFSRNTDL 857
LVILRPQEAMPLPSFSRNTDL
Sbjct: 355 LVILRPQEAMPLPSFSRNTDL 375
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 39/76 (51%)
Query: 66 WQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSS 125
W T V S + P I D++ DG D+VVP+ + VL G DG ++ +P V +
Sbjct: 79 WVTHVKSLVPQAPTIGDVDGDGHTDVVVPTLSGNIYVLNGKDGLQVRPYPYRTHGRVMNQ 138
Query: 126 PLLYDIDKDGVREIAL 141
LL D+ K G ++ L
Sbjct: 139 VLLVDLSKRGEKKKGL 154
>gi|384250657|gb|EIE24136.1| integrin alpha N-terminal domain-containing protein [Coccomyxa
subellipsoidea C-169]
Length = 790
Score = 627 bits (1616), Expect = e-176, Method: Compositional matrix adjust.
Identities = 358/828 (43%), Positives = 487/828 (58%), Gaps = 86/828 (10%)
Query: 58 CPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMP--GWP 115
C N+ L+W TEVSSS+YATP+I D+ SDG DI+VPSF+HY+EVL+G DG K WP
Sbjct: 18 CDTNIRLKWMTEVSSSVYATPIIHDLLSDGHKDIIVPSFVHYVEVLDGVDGAKASDDSWP 77
Query: 116 AFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYV 175
AFH+S VH+SP+L+D D+DGV +I +ATYNGE+L + +G ++ +KL +PR +VRKDWY
Sbjct: 78 AFHKSFVHASPVLFDFDEDGVLDILIATYNGEILVVKDTGEVLAEKLVVPRLRVRKDWYK 137
Query: 176 GLHSDPVDRSHPDV------HDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPA 229
GL DP D SHPDV D V+ + A M+ L ES A
Sbjct: 138 GLDDDPTDHSHPDVGIRPDAQDGSTVEANLTAGMRRQL-------------LAVNESGSA 184
Query: 230 PATVSN--PDVKKVNESLVNVSNPSEERKV-----NESHTEMNIKLPTSVDNSSTTTVSG 282
P S+ +++ ++ V P E + + H E N +L + D++++
Sbjct: 185 PGGQSDLRAGQRRLLQAESKVDIPEEAAQTYKELFEDPHNEWNAELES--DDAASYM--- 239
Query: 283 GTNSSENGTNTGRRLLEDNNSKGSQEGNDKEDVPVATAENDQALDENADSSFELFRDTDE 342
+ N G R L D + G D DV ++ Q D
Sbjct: 240 -----HDMDNPGIRGLIDGD------GFDSGDVADWHSKYVQHYD--------------- 273
Query: 343 LADEYNYDYD---DYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSE 399
D Y D+ D +D MWGDE + E HE+ + ++ +DSHI+STP IADID DG +E
Sbjct: 274 --DHYVDDWKPPPDMSEDEMWGDEWFFEAPHEREKGFLRLDSHIMSTPAIADIDGDGQAE 331
Query: 400 MIIAVSYFFDHEYYDNPEHLKELG-GIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNA 458
++IA SYFFD +YYD+PEH KELG +D+ KYVAG IVVFN T+ +KW LDLSTD
Sbjct: 332 IVIAASYFFDPDYYDDPEHKKELGEDVDVSKYVAGGIVVFNSRTRTIKWQQHLDLSTDRT 391
Query: 459 SFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAAD 518
+F+AYI +SPT+ D++GDG L+I++GT+ G YVLD G+ E +PL+M +Q V AD
Sbjct: 392 AFKAYIQASPTLADINGDGKLEIIIGTNMGFVYVLDCEGRTVEGWPLQMGAVQAQVAVAD 451
Query: 519 INDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLS 578
IN DG +E+V D GNVAA++A+G +WE+H+KSLV Q PS+GD++GDG +VV + S
Sbjct: 452 INGDGALEIVAADARGNVAAFSADGSELWERHVKSLVAQAPSLGDINGDGVLEVVFASAS 511
Query: 579 GNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDG 638
G +Y LSG G V P+P+RT GR+ VLLV L G L V S+DG+LY IDG
Sbjct: 512 GAVYALSGVSGHDVAPFPFRTRGRITAPVLLVRLRDSG---PALHAVVQSYDGHLYAIDG 568
Query: 639 PTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQ 698
T CAD VDIGET+YSMVLAD++DG +DL++ TMNGNV+CF T A +HPLKAW S
Sbjct: 569 VTGCADTVDIGETAYSMVLADDMDGNGRMDLVLATMNGNVYCFETAASYHPLKAWPSQVL 628
Query: 699 GRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEY-RFPSGSQ----APYNVTT 753
+ RYN G+Y SRA RD G+ V +I+D+ R G + PYNV
Sbjct: 629 EHSGFVARYNWEGVYAVAASRAPRDVRGQTLAVRFQIIDKRPRSTVGGRNATGGPYNVAV 688
Query: 754 TLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLT 813
L G E + + +P R+T TV +EMVD+N F D F+L+
Sbjct: 689 RL--QGVGVEEMKAGDQPVIG-----TTSVPCPRTRSTATVRIEMVDENKQMFVDTFALS 741
Query: 814 FHMYYYKLLKWLLVLPMLGMFGVLVILR----PQEAMPLPSFSRNTDL 857
FHM++++LLKWLLV P L ++ L+ Q+A LPSF+R +
Sbjct: 742 FHMHFHRLLKWLLVAPFLACVAAVLSLQFDIDLQDA--LPSFNRRVSM 787
>gi|255087997|ref|XP_002505921.1| predicted protein [Micromonas sp. RCC299]
gi|226521192|gb|ACO67179.1| predicted protein [Micromonas sp. RCC299]
Length = 855
Score = 565 bits (1456), Expect = e-158, Method: Compositional matrix adjust.
Identities = 342/843 (40%), Positives = 464/843 (55%), Gaps = 58/843 (6%)
Query: 27 NSEQNKFRQREATDDQLGLPQIDEDALV-NTQCPKNLELRWQTEVSSSIYATPLIADINS 85
+ + NKF R+++ D DE+A+ +CP NL LRW TEV+SS+Y+TP+IAD+ S
Sbjct: 38 SGKGNKFLDRKSSPDSFD----DEEAMRPQDECPANLRLRWMTEVTSSVYSTPVIADLFS 93
Query: 86 DGKLDIVVPSFLHYLEVLEGSDGDKMPG-WPAFHQSSVHSSPLLYDIDKDGVREIALATY 144
DG ++VVPSF+HYLEVLEG DG K G WPAFH+S+VH+SPLL + + EI L Y
Sbjct: 94 DGHKEVVVPSFVHYLEVLEGEDGAKAGGDWPAFHKSTVHASPLLRESGEG--TEILLPMY 151
Query: 145 NGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMK 204
+GEV FF G + +L +PR +VRKDW+VGL D VD PDV D E+ + +
Sbjct: 152 DGEVHFFNDRGQALDKRLYVPRLRVRKDWHVGLAPDHVDHRSPDVGAD--STENFESGFR 209
Query: 205 SMLETKKSTPETNATVTTSTESNPAPATVSNPDVKKVNESLVNVSNPSEER------KVN 258
+ PE E+ +N L++ S PS ER +
Sbjct: 210 EPDPVTRGAPEHAHHALRRQEARSG----ANGGKGGKGRRLLSESEPSAERGETLTEEAA 265
Query: 259 ESHTEMNIKLPTSVDNSSTTTVSGGTNSSENGTNTGRRLLEDNNSKGSQEGNDKEDVPVA 318
S + D++ V G + E R E + EG V
Sbjct: 266 ASFKVFDNDDGDDTDDAGEERVGGLSTEDEKILGDYRAFWEGTDG---DEGTQGVGGSVN 322
Query: 319 TAENDQALDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIED-YV 377
+ + +NA S RD A W DE + + E +D +V
Sbjct: 323 EGDEGREASKNARPSLRTHRDAH----------------AGWEDESFHQRPRESGDDTHV 366
Query: 378 NVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKEL-GGIDIGKYVAGAIV 436
VD+H+L TP +ADID DG E++++VSYFFD EYYDNP H EL ID+ KYVAG +
Sbjct: 367 YVDAHLLCTPSVADIDGDGRDELVLSVSYFFDREYYDNPTHSNELDASIDVSKYVAGGVY 426
Query: 437 VFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHH 496
V +L T ++KW T LDLSTD S+RAYIYSSPT+VDLD DG ++I+VGTS G YVL
Sbjct: 427 VVDLKTLELKWHTHLDLSTDTVSYRAYIYSSPTLVDLDRDGKMEIVVGTSVGFLYVLRAD 486
Query: 497 GKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVT 556
G FP++M EIQG V A D++ DG EL+ DT G+VAA+ +G +WE+HL SL+
Sbjct: 487 GTTMRGFPIQMGEIQGQVAAVDLDGDGYPELIAADTRGSVAAFRRDGSELWERHLASLIA 546
Query: 557 QGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRG 616
QG S+GDVDGDG +VVV T SG I+VL G G V P+P+ T+GRVM VLL L RG
Sbjct: 547 QGASVGDVDGDGSLEVVVGTSSGAIHVLRGATGEPVHPFPFYTNGRVMAPVLLTKL--RG 604
Query: 617 EKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNG 676
++S +T+V SFDG++YL+DG +C DV+D+GETSYSM L D++ G +DL++ TMNG
Sbjct: 605 DES-AMTLVAVSFDGFVYLVDGKRACRDVIDLGETSYSMPLVDDLTGNGKMDLVLATMNG 663
Query: 677 NVFCF-STPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEI 735
V+ + S P+ PL AW S NN+A R A V R + D G V EI
Sbjct: 664 VVYAYESLDTPYDPLHAWTSQVHSVNNMAARCGVA-FGVRGKDRGYHDVRGERIDVPFEI 722
Query: 736 VDEYRFP-------SGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGV 788
VD PY VT T+ PG R + + G Y++ P
Sbjct: 723 VDTRVVVPVVETKGGAPHGPYKVTVTVTSPGFSAVAR-----GSYDKPGAYKLSAPIPNW 777
Query: 789 RTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPL 848
R G V V++ D + L+ D +S++FHM YY++LKW+LVLP + L+ + E L
Sbjct: 778 RARGRVTVKVSDASALHVEDSYSVSFHMRYYRVLKWVLVLPFIAATTALMQMTRGEGNAL 837
Query: 849 PSF 851
P++
Sbjct: 838 PTW 840
>gi|357517647|ref|XP_003629112.1| Defective in exine formation [Medicago truncatula]
gi|355523134|gb|AET03588.1| Defective in exine formation [Medicago truncatula]
Length = 335
Score = 563 bits (1451), Expect = e-157, Method: Compositional matrix adjust.
Identities = 271/350 (77%), Positives = 300/350 (85%), Gaps = 15/350 (4%)
Query: 507 MAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDG 566
MAEIQ VVAADINDDGKIELVT DTHGNV AWT +G IWE+HLKSL+ P+IGD+DG
Sbjct: 1 MAEIQAGVVAADINDDGKIELVTADTHGNVVAWTPKGDMIWEKHLKSLIPHAPTIGDIDG 60
Query: 567 DGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVT 626
DG +++VVPTLSG I+VL G+DGS + YP+ THGR+MNQ+LLVDL+K+ EK KGLT+VT
Sbjct: 61 DGRTELVVPTLSGKIHVLDGRDGSPIGRYPFITHGRIMNQILLVDLSKQKEKKKGLTLVT 120
Query: 627 TSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAP 686
+SFDGYLYLIDGPT CADVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFSTP+P
Sbjct: 121 SSFDGYLYLIDGPTGCADVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFSTPSP 180
Query: 687 HHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQ 746
HHPLKAWR NQGRNNVA RY R GIYVTHPSR IVD YR+PSG Q
Sbjct: 181 HHPLKAWRLPNQGRNNVANRYGREGIYVTHPSR---------------IVDNYRYPSGHQ 225
Query: 747 APYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYF 806
PY+VTT+LLVPGNYQGER IKQ+Q + + GK+RIKLPTVGVRTTGTVLVEMVDKNGLYF
Sbjct: 226 GPYHVTTSLLVPGNYQGERTIKQNQTYYQPGKHRIKLPTVGVRTTGTVLVEMVDKNGLYF 285
Query: 807 SDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSFSRNTD 856
SDEFSLTFHM+YYKLLKWLLVLPMLGMFGVLVILRPQ +PLPSFSRN D
Sbjct: 286 SDEFSLTFHMHYYKLLKWLLVLPMLGMFGVLVILRPQGPVPLPSFSRNND 335
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 49/103 (47%), Gaps = 1/103 (0%)
Query: 40 DDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHY 99
D ++ L D V PK ++ W+ + S I P I DI+ DG+ ++VVP+
Sbjct: 16 DGKIELVTADTHGNVVAWTPKG-DMIWEKHLKSLIPHAPTIGDIDGDGRTELVVPTLSGK 74
Query: 100 LEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALA 142
+ VL+G DG + +P + + LL D+ K ++ L
Sbjct: 75 IHVLDGRDGSPIGRYPFITHGRIMNQILLVDLSKQKEKKKGLT 117
>gi|357468635|ref|XP_003604602.1| Defective in exine formation [Medicago truncatula]
gi|355505657|gb|AES86799.1| Defective in exine formation [Medicago truncatula]
Length = 432
Score = 517 bits (1331), Expect = e-143, Method: Compositional matrix adjust.
Identities = 282/429 (65%), Positives = 336/429 (78%), Gaps = 25/429 (5%)
Query: 3 SSTTSANCVLLICLLLFNSARGGDNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPKNL 62
SST +LL+C + D + N FR+REATDD LG P+IDEDALVN++CP NL
Sbjct: 5 SSTNLLLFLLLLCTFSSSVFAEEDAKKNNTFREREATDDALGYPEIDEDALVNSKCPMNL 64
Query: 63 ELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSV 122
ELRWQTEVSSS+YA PLIADINSDGKLDIVVPSF+HYLEVLEG+DGDKMPGWPAFHQS+V
Sbjct: 65 ELRWQTEVSSSVYANPLIADINSDGKLDIVVPSFVHYLEVLEGTDGDKMPGWPAFHQSTV 124
Query: 123 HSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPV 182
HSSPLLYDIDKDGVREIALATYNGEVLFFRVSGY+M+DKLE+PRRKV K+W+VGL+ DPV
Sbjct: 125 HSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYIMSDKLEVPRRKVLKNWHVGLNKDPV 184
Query: 183 DRSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATVSNPD-VKKV 241
DR+HPDVHDD +VQE+ A SM + S E N++ +TSTES+P +VSNP+ KK+
Sbjct: 185 DRTHPDVHDDQLVQEATIAN--SMSQMNGSRHEVNSSASTSTESHPDTKSVSNPEPEKKI 242
Query: 242 NES---------------LVNVSNPSEERKVNESHTEMNIKLPTSVDNSSTTTVSGGTNS 286
N S NVSNP E+KVNES +E IK+PT NSS + S T +
Sbjct: 243 NGSQSEESINTSTESHPDTKNVSNPEPEKKVNESQSEEGIKMPT---NSSVSAGSVETVN 299
Query: 287 SENGTNTGRRLLEDNNSKGSQ----EGNDKEDVPVATAENDQALDENADSSFELFRDTDE 342
++N T+TGRRLLEDNN KG++ E KE+V AT EN++ L+ +ADSSFELFR++D+
Sbjct: 300 ADNKTSTGRRLLEDNNLKGAEQVGSESKGKEEVHAATVENEEGLEADADSSFELFRNSDD 359
Query: 343 LADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMII 402
LADEYNYDYDDYVD+++WGDEEW E +HEK+EDYVNVDSHILSTPVIADIDNDGV EM++
Sbjct: 360 LADEYNYDYDDYVDESLWGDEEWIEGKHEKLEDYVNVDSHILSTPVIADIDNDGVMEMVV 419
Query: 403 AVSYFFDHE 411
AVSYFFD E
Sbjct: 420 AVSYFFDQE 428
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 38/76 (50%)
Query: 547 WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
W+ + S V P I D++ DG D+VVP+ + VL G DG K+ +P V +
Sbjct: 68 WQTEVSSSVYANPLIADINSDGKLDIVVPSFVHYLEVLEGTDGDKMPGWPAFHQSTVHSS 127
Query: 607 VLLVDLTKRGEKSKGL 622
LL D+ K G + L
Sbjct: 128 PLLYDIDKDGVREIAL 143
>gi|302816207|ref|XP_002989783.1| hypothetical protein SELMODRAFT_451591 [Selaginella moellendorffii]
gi|300142560|gb|EFJ09260.1| hypothetical protein SELMODRAFT_451591 [Selaginella moellendorffii]
Length = 851
Score = 502 bits (1293), Expect = e-139, Method: Compositional matrix adjust.
Identities = 325/780 (41%), Positives = 418/780 (53%), Gaps = 130/780 (16%)
Query: 74 IYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDK 133
+A A++ DGKLD+VV L+VLEG+DG+K+ GWPAFHQ+ VHSSPL+YDIDK
Sbjct: 37 CFACESFANLYRDGKLDVVV------LDVLEGADGEKLSGWPAFHQAIVHSSPLIYDIDK 90
Query: 134 DGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDVHDDL 193
D +EI L GEVL FR SG+ DK +P+ +V KDWYVGL D DRSHPDVHDD
Sbjct: 91 DRYKEILL----GEVLIFRSSGFSSDDKFVVPKLRVGKDWYVGLDPDHDDRSHPDVHDD- 145
Query: 194 IVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATVSNPDVKKVNESLVNVSNPSE 253
L + P+ N ++T+ S ES +V+N SE
Sbjct: 146 ------------GLASSVPIPDRNMNMSTTEHS----------------ESASHVANSSE 177
Query: 254 ERKVNESH-TEMNIKLPTSVDNSSTTTVSGGT----NSSENGTNTGRRLLEDNNSKGSQE 308
++N S + +N+ P V+ SST VS ++ E GT N S+ S
Sbjct: 178 TAQINASADSSVNVSKPVEVNVSSTGNVSVTRKVVDDTEEPGTGFNMAAASLNESRSSIS 237
Query: 309 GNDKEDVPVATAENDQALDE-------NADSSFELFR----DTDELADEYNYDYD-DYVD 356
K +A D+ + S E+ R D +E DEY D D +
Sbjct: 238 AVRKLLDWTVIGPEVRAFDDAFLDRGDDDLSDDEMSRWPLDDKEEEFDEYFIDEAVDVAN 297
Query: 357 DAMWGDEEWTEEQHEK-IEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDN 415
+ D + E+ + + ++ S V D NDG+S+ + F +
Sbjct: 298 KPVEADTGYVSSVGERTVTEEGVLEEAAASFDVFRDHGNDGLSDERRTCACFRAGQ---G 354
Query: 416 PEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDG 475
H K + V T QV+ T LDLSTD+ SFRAYI PTVVDLDG
Sbjct: 355 CRHWKIC-------FWGDCCVQHGYKTSQVE--THLDLSTDSVSFRAYI---PTVVDLDG 402
Query: 476 DGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGN 535
+G L ILVGTSFG Y L ++G +R FPL+M EI VVAAD+NDDGK+E++T DT GN
Sbjct: 403 EGYLSILVGTSFGFVYALHYNGSVRANFPLQMGEIHAQVVAADVNDDGKVEILTADTRGN 462
Query: 536 VAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPY 595
VAAW GK +WE HLKSLV Q P++GDV+GDG S++VVPT SGNIYVL G DGS V P+
Sbjct: 463 VAAWAPTGKLLWEVHLKSLVAQAPTVGDVNGDGSSNIVVPTASGNIYVLRGSDGSYVPPF 522
Query: 596 PYRTHGRVMNQVLLVDLTKRGE--KSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSY 653
P+RTHGRVM+ VLL+D + ++ GLT+VTTSFDGYLY+IDG + CAD +DIGE S
Sbjct: 523 PFRTHGRVMSSVLLLDFGSSEDTVQTGGLTMVTTSFDGYLYMIDGKSGCADTIDIGEKS- 581
Query: 654 SMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIY 713
NV+ RYNR GIY
Sbjct: 582 ------------------------------------------------NVSSRYNREGIY 593
Query: 714 VTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIF 773
SRAFRDE G FWV +IVD++ Q PYNVT TLLVPGN+QG R+ +QSQI+
Sbjct: 594 ALPSSRAFRDEAGSRFWVIFKIVDQHSL----QGPYNVTVTLLVPGNFQGPRQTRQSQIY 649
Query: 774 ARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGM 833
+ G +++LP R+TGTV +EMVDKNG YF+ F Y LL V M+G
Sbjct: 650 DQPGVQKLQLPCASSRSTGTVRLEMVDKNGFYFAGS---VFPHVSYALLSSSQVESMVGF 706
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 37/69 (53%)
Query: 63 ELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSV 122
+L W+ + S + P + D+N DG +IVVP+ + VL GSDG +P +P V
Sbjct: 471 KLLWEVHLKSLVAQAPTVGDVNGDGSSNIVVPTASGNIYVLRGSDGSYVPPFPFRTHGRV 530
Query: 123 HSSPLLYDI 131
SS LL D
Sbjct: 531 MSSVLLLDF 539
>gi|302839107|ref|XP_002951111.1| hypothetical protein VOLCADRAFT_117761 [Volvox carteri f.
nagariensis]
gi|300263806|gb|EFJ48005.1| hypothetical protein VOLCADRAFT_117761 [Volvox carteri f.
nagariensis]
Length = 1088
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 239/587 (40%), Positives = 341/587 (58%), Gaps = 68/587 (11%)
Query: 321 ENDQALDENADSSFELFRDTDEL-ADEYNYDY-DDYVDDAM----WGDEEWTEEQH-EKI 373
E+D AL E+ D + DEL D Y + Y DY + WG+E++ + H +
Sbjct: 505 ESDAALHEHGDDPRAA--NPDELQVDSYRHLYLGDYHGGSRAHGGWGEEDFVQSGHPDAA 562
Query: 374 EDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAG 433
YV VD H+++TP IADID DG E+++AVSYF+D EYYD+P+H K+L GID+ KYVA
Sbjct: 563 GGYVYVDPHVMTTPAIADIDGDGHDELVLAVSYFYDREYYDDPDHAKDLKGIDLSKYVAS 622
Query: 434 AIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVL 493
+VV +L T+ KW LDLSTD A+++AY YSSPT+VD++GDG L+++VGTS G YVL
Sbjct: 623 GVVVLDLRTRSEKWVQHLDLSTDTATYKAYAYSSPTLVDINGDGKLEVVVGTSMGFLYVL 682
Query: 494 DHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKS 553
DH G +P++M EIQ + ADIN+DG +E+ +D GN+AA+ +G+ +WE+H++S
Sbjct: 683 DHKGDPVPGWPIQMGEIQAQPLVADINNDGDLEIFISDMRGNMAAFNVKGEEVWERHVRS 742
Query: 554 LVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLT 613
V+QG GD+DGDG +VVV T SG IY L+G G + +PYR GR+ + L
Sbjct: 743 AVSQGAVAGDIDGDGQLEVVVGTASGYIYALAGSTGVPIPNWPYRARGRIQAAPTITHLV 802
Query: 614 KRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTT 673
+ G+ +V +FDG+LY++DG CADVVD+GETSY+ VL D++DG L+L+ TT
Sbjct: 803 ----EGTGMQVVVPAFDGFLYVVDGLQGCADVVDVGETSYAAVLVDDIDGDGALELVATT 858
Query: 674 MNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEI 733
MNGNV+ F T +P+HPLK W S G N RY AGI+ T SR RD G V+
Sbjct: 859 MNGNVYAFETGSPYHPLKTWTSQVMGPNGQVARYGYAGIFATPSSRRPRDVAGERLQVQF 918
Query: 734 EIVDEY-------RFPSGSQAPYNVTTTLL----------------------VPGNYQGE 764
E+VD+ G + PYNVT L PG Y +
Sbjct: 919 EVVDKRVAFADNGTLLPGGRGPYNVTVVLKGVGVREMAAGEAPVVGVADSFPGPGRYSVD 978
Query: 765 RRIKQSQ--------------------IFARRGKYRIKLPTVGVRTT-----GTVLVEMV 799
++Q + + K +I+ V+ T V +E+V
Sbjct: 979 IPCPKTQQSSIGFSVVYRTSCITNPNLVHISKSKVKIRFSNSNVQITSLQASAVVRLELV 1038
Query: 800 DKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILR-PQEA 845
D +GL +SDEF+L+FHM++++LLKW++ LP+ M VL+ + PQ +
Sbjct: 1039 DGSGLLYSDEFALSFHMHFHRLLKWVIALPLGLMMAVLLAVSGPQRS 1085
Score = 192 bits (488), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 93/163 (57%), Positives = 122/163 (74%), Gaps = 2/163 (1%)
Query: 31 NKFRQREATDDQLGLPQIDEDALVNTQCPKN-LELRWQTEVSSSIYATPLIADINSDGKL 89
NK+R R+A D+ G Q + ++C ++ L+L W +E +SS+YATPLI D++ DG+
Sbjct: 40 NKYRSRKADMDKEGDIQHEGPVPGASRCGRHRLDLTWMSEATSSVYATPLITDLHGDGRR 99
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVL 149
DIVVPSF+HYLEVLEG +G + GWPAFH SSVH+SPLLYDID DGVR+I LATY+G+++
Sbjct: 100 DIVVPSFVHYLEVLEGPNGGQAVGWPAFHASSVHASPLLYDIDFDGVRDIMLATYDGQIM 159
Query: 150 FFRVSGYMMTDKLEIPRRKVRKDWYVGLH-SDPVDRSHPDVHD 191
FF+ +G M + L+I R +VRKDWYVGL DP D SHPDV D
Sbjct: 160 FFKDTGEKMLEGLQISRLRVRKDWYVGLDPKDPFDHSHPDVSD 202
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 49/81 (60%), Gaps = 2/81 (2%)
Query: 75 YATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKD 134
Y++P + DIN DGKL++VV + + +L VL+ GD +PGWP + + PL+ DI+ D
Sbjct: 654 YSSPTLVDINGDGKLEVVVGTSMGFLYVLD-HKGDPVPGWP-IQMGEIQAQPLVADINND 711
Query: 135 GVREIALATYNGEVLFFRVSG 155
G EI ++ G + F V G
Sbjct: 712 GDLEIFISDMRGNMAAFNVKG 732
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 40/174 (22%), Positives = 63/174 (36%), Gaps = 50/174 (28%)
Query: 66 WQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSS 125
W+ V S++ + DI+ DG+L++VV + Y+ L GS G +P WP + + ++
Sbjct: 736 WERHVRSAVSQGAVAGDIDGDGQLEVVVGTASGYIYALAGSTGVPIPNWPYRARGRIQAA 795
Query: 126 P---------------------------------------------LLYDIDKDGVREIA 140
P L+ DID DG E+
Sbjct: 796 PTITHLVEGTGMQVVVPAFDGFLYVVDGLQGCADVVDVGETSYAAVLVDDIDGDGALELV 855
Query: 141 LATYNGEVLFFRVSG-----YMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDV 189
T NG V F T ++ P +V + Y G+ + P R DV
Sbjct: 856 ATTMNGNVYAFETGSPYHPLKTWTSQVMGPNGQVARYGYAGIFATPSSRRPRDV 909
>gi|428163594|gb|EKX32657.1| hypothetical protein GUITHDRAFT_166656 [Guillardia theta CCMP2712]
Length = 1207
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 211/484 (43%), Positives = 310/484 (64%), Gaps = 15/484 (3%)
Query: 372 KIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELG-GIDIGKY 430
K E YV VD+H+L TP+IAD+D DG E+I AVSY+FD E Y +P ++L ++ KY
Sbjct: 732 KEEGYVFVDAHVLCTPIIADLDRDGHDEIIFAVSYYFDKEQYSDPSAYEDLDVDVNTKKY 791
Query: 431 VAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLF 490
VAG +VV++ T+++KW LDL+TD ++RAYIYSSPTV DLD DG L+I++GTS G
Sbjct: 792 VAGGVVVYDALTRKLKWNIHLDLTTDETAYRAYIYSSPTVADLDADGKLEIILGTSLGFV 851
Query: 491 YVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQH 550
YV++H G++++ FP+ MAEIQG V AD+NDDG++E++ TDT NVA + ++GK +WE H
Sbjct: 852 YVIEHDGRVKDNFPVTMAEIQGQVAVADVNDDGQLEIIATDTRHNVAVFNSKGKEVWETH 911
Query: 551 LKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLV 610
+ TQGP +GDV+GDG D+V+ T SG+I+ L G G + +P +T G +M+ LL+
Sbjct: 912 ISGFSTQGPVVGDVNGDGLVDIVLATTSGHIWALQGSSGRTLENFPVKTGGPIMSLPLLL 971
Query: 611 DLTKRGEKSKGLT---IVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDL 667
+ K G+ GL I+ S DG+LY+++G T C+ VD+GE SYSMVLAD++ G +
Sbjct: 972 QM-KGGKGRAGLPVRHIIVPSHDGFLYIVNGATGCSFKVDVGENSYSMVLADDITGNGKM 1030
Query: 668 DLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGR 727
DL+VTTMNGNV C T +HP+KAW S QG NNV +R R GI++ R F D G
Sbjct: 1031 DLLVTTMNGNVICLGTDVDYHPMKAWTSKEQGNNNVELRDGRQGIFILDRFRHFHDHNGP 1090
Query: 728 NFWVEIEIVDE--YRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPT 785
+ EIVD+ + S Y+V +L G ++ + + + G+Y ++P
Sbjct: 1091 SMTFGFEIVDKRPVKGLGASGGEYHVRISL------GGSTKLFE-KTYTHPGRYLEEIPC 1143
Query: 786 VGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGM-FGVLVILRPQE 844
R +V VEM ++ G +F D S++F+M++Y+ LKWLL++P M G++ I
Sbjct: 1144 PDRRQFTSVFVEMTNEYGQHFEDRVSMSFNMHFYRALKWLLMVPFTLMSLGIVFIKEVNT 1203
Query: 845 AMPL 848
+P+
Sbjct: 1204 VLPV 1207
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 97/159 (61%), Gaps = 3/159 (1%)
Query: 31 NKFRQREATDDQLGLPQIDEDALV-NTQCPKNLELRWQTEVSSSIYATPLIADINSDGKL 89
NKF+QR+ + + +D N+ CP+ + +W E+ SS Y TP I D+ SDG
Sbjct: 81 NKFKQRDIQGIENFIFGMDASNFAPNSSCPQTIIQQWAMEIESSSYVTPTIFDVASDGVK 140
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPGWP-AFHQSSVHSSPLLYDIDKDGVREIALATYNGEV 148
DIV+P+F+ ++EV+ G G + PG+P F S+ ++S L+YDI+KDG +I + T+NGE+
Sbjct: 141 DIVIPTFVRHIEVINGPHGHRSPGFPFTFPNSAFYASALIYDINKDGEPDIGVTTFNGEL 200
Query: 149 LFFRVSGY-MMTDKLEIPRRKVRKDWYVGLHSDPVDRSH 186
++ +G + ++IP +++K WY GL DP D H
Sbjct: 201 IWLTENGIPIFGWSIKIPHLRIKKKWYEGLKPDPQDMEH 239
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 45/88 (51%), Gaps = 8/88 (9%)
Query: 66 WQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSS 125
W+T +S P++ D+N DG +DIV+ + ++ L+GS G + +P + S
Sbjct: 908 WETHISGFSTQGPVVGDVNGDGLVDIVLATTSGHIWALQGSSGRTLENFPVKTGGPIMSL 967
Query: 126 PLLYDIDKDG-------VREIALATYNG 146
PLL + K G VR I + +++G
Sbjct: 968 PLLLQM-KGGKGRAGLPVRHIIVPSHDG 994
Score = 42.7 bits (99), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 5/88 (5%)
Query: 547 WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYR-THGRVMN 605
W ++S P+I DV DG D+V+PT +I V++G G + +P+ +
Sbjct: 117 WAMEIESSSYVTPTIFDVASDGVKDIVIPTFVRHIEVINGPHGHRSPGFPFTFPNSAFYA 176
Query: 606 QVLLVDLTKRGEKSKGLTIVTTSFDGYL 633
L+ D+ K GE G+ T+F+G L
Sbjct: 177 SALIYDINKDGEPDIGV----TTFNGEL 200
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 2/89 (2%)
Query: 67 QTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSP 126
+T + IY++P +AD+++DGKL+I++ + L ++ V+E DG +P + +
Sbjct: 818 ETAYRAYIYSSPTVADLDADGKLEIILGTSLGFVYVIE-HDGRVKDNFPV-TMAEIQGQV 875
Query: 127 LLYDIDKDGVREIALATYNGEVLFFRVSG 155
+ D++ DG EI V F G
Sbjct: 876 AVADVNDDGQLEIIATDTRHNVAVFNSKG 904
>gi|303275874|ref|XP_003057231.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461583|gb|EEH58876.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 896
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 230/521 (44%), Positives = 319/521 (61%), Gaps = 36/521 (6%)
Query: 358 AMWGDEEWTE--EQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDN 415
A W DE + + +HE ++YV VD+H+L TP IAD+D DG E+I++VSYF+D EYYDN
Sbjct: 371 AGWDDEAFVQPAHRHEHEDEYVFVDAHLLCTPAIADLDGDGRDEIILSVSYFYDKEYYDN 430
Query: 416 PEHLKELG-GIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLD 474
P H ELG +DIGKYVAG + V +L+T +++W T LDLSTD S+RAY+YS+PTVVD+D
Sbjct: 431 PAHAAELGVNVDIGKYVAGGVYVADLETLEMRWQTHLDLSTDAVSYRAYMYSAPTVVDID 490
Query: 475 GDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHG 534
DG L+I +GTS G YVL H G ++++P+ M EIQG V AAD++ DG +ELV DT G
Sbjct: 491 RDGYLEIALGTSVGFLYVLRHDGTTQKRWPILMGEIQGQVAAADVDGDGYLELVAADTRG 550
Query: 535 NVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRP 594
+VAA+ + +WE HL SL++QG + GDVDGDG ++ V T SG ++VLS G P
Sbjct: 551 SVAAFRRDATEVWETHLASLISQGVTFGDVDGDGELELAVGTSSGAVHVLSAATGKPKPP 610
Query: 595 YPYRTHGRVMNQVLLVDL---------TKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADV 645
+P+ T GRVM+ VLL L RG GL++V SFDGY+Y+IDG +C DV
Sbjct: 611 FPFYTRGRVMSPVLLAKLGSASASDPARARGSPG-GLSLVVNSFDGYVYVIDGARACVDV 669
Query: 646 VDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCF-STPAPHHPLKAWRS-INQGRNNV 703
+D+GETSY+M L D++ G DLI+ +MNG V+ + S +P+ P+ AW S ++ G N V
Sbjct: 670 IDVGETSYAMPLLDDLTGDGKNDLIIASMNGVVYAYESLNSPYDPMNAWPSQVHAGNNMV 729
Query: 704 AIRYNRAGIY-VTHPSRAFRDEEGRNFWVEIEIVD----------EYRFPSGSQ-APYNV 751
A RAG Y V R + D G+ V EIVD + R P+ + PY V
Sbjct: 730 A----RAGWYGVRALDRGYHDVRGKAMDVAFEIVDVREGRGRGGRKARTPAVYKYGPYAV 785
Query: 752 TTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFS 811
++ P N+ RR+ + F G Y +++ R G V V + D L D +S
Sbjct: 786 VVSITAP-NFS--RRVAAT--FETPGTYSLRVDVPNARGRGRVTVRVADATRLNAEDSYS 840
Query: 812 LTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSFS 852
++FHM YY++LKW++VLP M G L +P+FS
Sbjct: 841 VSFHMRYYRVLKWIVVLPFAFMVGAFSALTASSLGAMPTFS 881
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 81/168 (48%), Positives = 110/168 (65%), Gaps = 13/168 (7%)
Query: 27 NSEQNKFRQREAT----DDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIAD 82
+++ NK+ REA DD+L A+ CP NL LRWQTEVSSS+YATP++ D
Sbjct: 41 DADANKYLDREAKPDAHDDELA-------AIPRDVCPANLRLRWQTEVSSSVYATPVVTD 93
Query: 83 INSDGKLDIVVPSFLHYLEVLEGSDGDKMPG-WPAFHQSSVHSSPLLYDIDKDGVREIAL 141
+ DG +IVVPSF+HYLEVLEG DG + G WP+FH+S+ H+SPL++ G I L
Sbjct: 94 LFDDGHKEIVVPSFVHYLEVLEGEDGARAGGRWPSFHESTAHASPLVHHSSSAGT-TILL 152
Query: 142 ATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHPDV 189
TY+GEV F+ +G + L +PR +VR+DW+VGL +D VD + DV
Sbjct: 153 PTYDGEVRFYDHAGERLEKTLRVPRLRVRRDWHVGLAADHVDHARADV 200
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 11/102 (10%)
Query: 60 KNLELRWQTEVSSS---------IYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDK 110
+ LE+RWQT + S +Y+ P + DI+ DG L+I + + + +L VL DG
Sbjct: 457 ETLEMRWQTHLDLSTDAVSYRAYMYSAPTVVDIDRDGYLEIALGTSVGFLYVLR-HDGTT 515
Query: 111 MPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFR 152
WP + D+D DG E+ A G V FR
Sbjct: 516 QKRWPIL-MGEIQGQVAAADVDGDGYLELVAADTRGSVAAFR 556
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 4/91 (4%)
Query: 547 WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
W+ + S V P + D+ DGH ++VVP+ + VL G+DG++ H +
Sbjct: 77 WQTEVSSSVYATPVVTDLFDDGHKEIVVPSFVHYLEVLEGEDGARAGGRWPSFHESTAHA 136
Query: 607 VLLVDLTKRGEKSKGLTIVTTSFDGYLYLID 637
LV + S G TI+ ++DG + D
Sbjct: 137 SPLVHHS----SSAGTTILLPTYDGEVRFYD 163
>gi|290989517|ref|XP_002677384.1| predicted protein [Naegleria gruberi]
gi|284090991|gb|EFC44640.1| predicted protein [Naegleria gruberi]
Length = 760
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 201/473 (42%), Positives = 294/473 (62%), Gaps = 17/473 (3%)
Query: 374 EDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELG-GIDIGKYVA 432
++++ VD+HILSTPVIADIDNDG E+I +VSYF+D EYY +++ EL ID+ KYVA
Sbjct: 297 KEFITVDAHILSTPVIADIDNDGKEELIASVSYFYDREYYH--QNMFELDVDIDMDKYVA 354
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYV 492
G I V++L T+++KW LD++TDN +AYIY +PTV D+DGDG+LD++VGT G Y
Sbjct: 355 GGIAVYDLSTRKIKWHAHLDMTTDNVKQKAYIYGNPTVADIDGDGSLDVIVGTGLGWIYA 414
Query: 493 LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK 552
++ GK+ + FP+ M EIQG V+A DIN DG IE+ D + N+ + +GK IW Q L
Sbjct: 415 YNNQGKLLDGFPVLMGEIQGQVIAEDINQDGYIEICAVDFNSNLVCFDGKGKEIWSQRLS 474
Query: 553 SLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
Q P++GDVDGDG D+VV T +G+I+ +SG+ G + +P +T + + LL+DL
Sbjct: 475 GSAAQAPTVGDVDGDGKLDLVVGTSTGHIWAVSGETGKVLPHFPVKTGSSIYSPALLIDL 534
Query: 613 TKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVT 672
+ L IV SFDGYLY+++G + C D VDIGE SYS VLAD++ G LDL+V+
Sbjct: 535 SNGTRSDNSLDIVLPSFDGYLYIVNGKSGCVDKVDIGEKSYSQVLADDLTGNGKLDLLVS 594
Query: 673 TMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVE 732
TMNGN+ C ST +P+HPLK Q N R GIY+ +R ++D G F +E
Sbjct: 595 TMNGNLICLSTESPYHPLKTTAYQMQFENGFKYRNGLYGIYIK--NREYKDIVGSKFAIE 652
Query: 733 IEIVDEYRFPSGSQAP--YNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRT 790
EI+D R S P Y+V L G ++ Q + + G YR++L + R
Sbjct: 653 FEIIDN-RKNSKDHLPITYDVRVVL-------GASKVLFEQKYTQPGSYRVELTSPEDRL 704
Query: 791 TGTVL-VEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLG-MFGVLVILR 841
T++ VEM ++ ++D +L+F++ ++ KW++VLP + + +L LR
Sbjct: 705 HNTMVSVEMRNEFSQLYTDHITLSFNISFFSFFKWMVVLPFIATVCAILYSLR 757
Score = 119 bits (298), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 150/582 (25%), Positives = 257/582 (44%), Gaps = 84/582 (14%)
Query: 57 QCPKNLELRWQTEVSSSIYATPLIADI-----NSDG-KLDIVVPSFLHYLEVLEGSDGDK 110
+C ++ + W++ V SS+Y TP+I ++ N G K +VVP+F+ Y++VL G G +
Sbjct: 54 KCKLDVGITWRSNVDSSVYQTPIITNLMNVKSNQHGDKKFVVVPTFVRYIQVLNGDSGAE 113
Query: 111 MP----GWPAFH-QSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDK-LEI 164
WP H Q + HSS +L+DID D +++I + T NGE+LFF G +++DK +
Sbjct: 114 HKEGNSHWPYAHSQLASHSSIVLHDIDGDNIKDIVVTTANGEILFFNWKGELLSDKTFAV 173
Query: 165 PRRKVRKDWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTST 224
P V K WY GL ++ DV L +ESE E K+ P+ +A T
Sbjct: 174 PGLMVDKFWYKGLENEKF-----DVSISLSQKESE-----KHAEVAKNKPK-DAPKETPK 222
Query: 225 ESNPAPATVSNPDVKKVNESLVNVSNP-SEERKVNESHTEMNIKLPTSVDNSSTTTVSGG 283
E NP + + +V ++ + S E + + S + + P S+ +
Sbjct: 223 E-NPRTRSKQKRKLLQVAKAPTSTEGSLSSEARASLSLVQQKVHTPQSL----RPFIGEK 277
Query: 284 TNSSENGTNTGRRLLEDNNSKGSQEGNDKEDVPVATAENDQALDENADSSFELFRDTDEL 343
S E+ + ++ + + PV A+ D E +S F D +
Sbjct: 278 IESVEHKHFSHHTTVKKDGKEFITVDAHILSTPV-IADIDNDGKEELIASVSYFYDREYY 336
Query: 344 -ADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVD---------SHILSTPVIADID 393
+ + D D +D + G + KI+ + ++D ++I P +ADID
Sbjct: 337 HQNMFELDVDIDMDKYVAGGIAVYDLSTRKIKWHAHLDMTTDNVKQKAYIYGNPTVADID 396
Query: 394 NDGVSEMIIAVSYFFDHEYYDN-------PEHLKELGGIDIGK------YVAGAIVVFN- 439
DG ++I+ + + Y + P + E+ G I + Y+ V FN
Sbjct: 397 GDGSLDVIVGTGLGWIYAYNNQGKLLDGFPVLMGEIQGQVIAEDINQDGYIEICAVDFNS 456
Query: 440 ----LDTKQVK-WTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG-LFYVL 493
D K + W+ L S A PTV D+DGDG LD++VGTS G ++ V
Sbjct: 457 NLVCFDGKGKEIWSQRLSGSAAQA---------PTVGDVDGDGKLDLVVGTSTGHIWAVS 507
Query: 494 DHHGKIREKFPLEM-AEIQGAVVAADIND----DGKIELVTTDTHG-----NVAAWTAEG 543
GK+ FP++ + I + D+++ D +++V G N + +
Sbjct: 508 GETGKVLPHFPVKTGSSIYSPALLIDLSNGTRSDNSLDIVLPSFDGYLYIVNGKSGCVDK 567
Query: 544 KGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLS 585
I E+ ++ D+ G+G D++V T++GN+ LS
Sbjct: 568 VDIGEKSYSQVLAD-----DLTGNGKLDLLVSTMNGNLICLS 604
>gi|388521975|gb|AFK49049.1| unknown [Lotus japonicus]
Length = 202
Score = 366 bits (939), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 168/201 (83%), Positives = 186/201 (92%)
Query: 655 MVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYV 714
MV+ADNVDGGDDLDLIV+TMNGNVFCFSTP+PHHPLK W+ QGRNNVA RYNR GIY+
Sbjct: 1 MVMADNVDGGDDLDLIVSTMNGNVFCFSTPSPHHPLKTWKMPTQGRNNVANRYNREGIYI 60
Query: 715 THPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFA 774
THPSRAFRDEEG++FWVEIEIVD YR+PSG Q PY+VT TLLVPGNYQGER IKQ+Q +
Sbjct: 61 THPSRAFRDEEGKSFWVEIEIVDNYRYPSGHQGPYHVTITLLVPGNYQGERTIKQNQTYY 120
Query: 775 RRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMF 834
+ GK+RIKLPTVGVRT+GTVLVEMVDKNGLYF D+FSLTFHM+YYKLLKWLLVLPMLGMF
Sbjct: 121 QPGKHRIKLPTVGVRTSGTVLVEMVDKNGLYFYDDFSLTFHMHYYKLLKWLLVLPMLGMF 180
Query: 835 GVLVILRPQEAMPLPSFSRNT 855
G+LVILRPQE+MPLPSFSRNT
Sbjct: 181 GMLVILRPQESMPLPSFSRNT 201
>gi|340055012|emb|CCC49320.1| putative FG-GAP repeat protein [Trypanosoma vivax Y486]
Length = 865
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 254/872 (29%), Positives = 404/872 (46%), Gaps = 82/872 (9%)
Query: 13 LICLLLFNSARGGDNSEQNKFRQREATDDQLGLPQIDEDALVN------------TQCPK 60
L+ ++L AR S Q R L ED + N + C
Sbjct: 31 LLAIMLIAVARAAPWSPQEPLAHRLLRSQVL-----QEDMMANEEPVTPISADAASACRS 85
Query: 61 NLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQS 120
+ L W V SS ATP + D+N+DG +I+VP++ YLE ++G G + G+P H +
Sbjct: 86 GVHLEWTAHVGSSALATPRVVDLNNDGNKEILVPTYSQYLEAIDGMSGGDVAGFPFVHPN 145
Query: 121 -SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHS 179
++SPL D+ DG + +A Y GE++ F G + + ++IP V+K W +H
Sbjct: 146 FKSYASPLPVDMSGDGKTDWLVAMYTGELIVFSDDG-QIEETIQIPHLPVKKKW---VHE 201
Query: 180 D-----PVDRSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATVS 234
+ V+ + H+ + + E A RM+ + + + P + S+ N S
Sbjct: 202 NVTTGHAVNVTFKMGHEPIKLLE-HALRMRHIQDWTRFQPRRSGHTDGSSLRN----HTS 256
Query: 235 NPDVKKVNESLVNVSNPSEERKVNESHTEMNIKLPTSV--------DNSSTTT---VSGG 283
+ ++K+N + RK EM++ P + DN T GG
Sbjct: 257 DYKLEKMNLQM---------RK------EMDMPAPPARLSAPADPGDNDRTKRQGDTEGG 301
Query: 284 TNSSENGTNTGRRLLEDNNSKGSQEGNDKEDVPVATAENDQALDENADSSFELFRDTDEL 343
S + +T +ED G K + T D L + A +S +L +
Sbjct: 302 AEESADSFDTVHDDIEDALRYEELSGEMKRPHTIGT---DGWLSDEAKASMDLLYHPELY 358
Query: 344 ADEYNY--DYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMI 401
N+ D + +M G E ED V VD HI+S VIAD D DG +++
Sbjct: 359 KSSVNHGQQKDAFSFQSMVGRSEAVVA-----EDEVAVDPHIISNLVIADADGDGYLDIV 413
Query: 402 IAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFR 461
+ VSYFFD + Y+ + YVA A+VV +L T VKW L+++T +
Sbjct: 414 LHVSYFFDPKDYEGDRADILTKDTAMNNYVADAVVVIDLITGGVKWMKTLNITTKKDAVP 473
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAE-IQGAVVAADIN 520
AY +S+P VV+ + +G I V T+ G + G + +P+ M + +V D+N
Sbjct: 474 AYAFSTPAVVNYNENGGSGIFVTTTAGAIFGFTGEGGMINGWPVWMDHPVIASVSVEDVN 533
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGN 580
DG +++ T DT G VA ++A G+ +W + V+ GDVDGDG D+ T SG
Sbjct: 534 GDGVLDVCTGDTSGVVACFSANGRRLWSTRVSGAVSDHIVFGDVDGDGLMDLAFGTTSGL 593
Query: 581 IYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL--TKRGEKS--KGLTIVTTSFDGYLYLI 636
IY L G DG + +P T G ++ LLV+L T G +GL IV S DG LY++
Sbjct: 594 IYALRGHDGLLLPHFPIATGGTILAPPLLVNLNDTVVGAHGSERGLHIVIPSHDGVLYIV 653
Query: 637 DGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSI 696
G T C D +DIGE + +MVLAD+V G LDL+V+T+ G++ T A HPLKAW S
Sbjct: 654 SGATGCVDGIDIGEKASAMVLADDVTGNGKLDLVVSTLAGSIMVLETSATFHPLKAWPSR 713
Query: 697 NQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLL 756
+ N G+++ SR RD G F + + I D+ R + Y + T+
Sbjct: 714 VKSVNGFTAGEGHVGVFIDPSSRVVRDVRGETFSLLVAIHDQ-RNIDARKRRYRLVITI- 771
Query: 757 VPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHM 816
G R + +++ G Y I++ R +V V MV NG + D +++F+M
Sbjct: 772 ------GPRVLVHDEVYTDPGTYSIQMHAPLERMYASVHVVMVLPNGQRYEDSLAMSFNM 825
Query: 817 YYYKLLKWLLVLP-MLGMFGVLVILRPQEAMP 847
++ + +K+ ++P L + ++ + E +P
Sbjct: 826 HFLESVKYTFLMPFFLACLAISLVDKRHEVVP 857
>gi|328868410|gb|EGG16788.1| hypothetical protein DFA_07766 [Dictyostelium fasciculatum]
Length = 907
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 194/528 (36%), Positives = 306/528 (57%), Gaps = 26/528 (4%)
Query: 321 ENDQALDENA-DSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNV 379
E+D DE+ + L +D ++L Y+ YDDY+ + K D+V V
Sbjct: 384 EDDDIFDESKYKLNINLLQDKEDLL--YSKQYDDYI--------RMNKRYSNKSMDHVWV 433
Query: 380 DSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELG-GIDIGKYVAGAIVVF 438
D HILSTPVI DID DG E+I++VSY+FDHEYY +P H ++ + + KYVAG IV F
Sbjct: 434 DGHILSTPVITDIDLDGHYELIVSVSYYFDHEYYSDPIHRSKIDRDVQLDKYVAGGIVCF 493
Query: 439 NLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGK 498
+LD Q+KW T DL+ D +F Y+Y++PTV+D++GD L+ ++GTS G YVLD GK
Sbjct: 494 DLDKLQIKWQTHFDLTVDFKAFSGYVYNTPTVIDVNGDSMLETVIGTSIGYVYVLDVRGK 553
Query: 499 IREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQG 558
L + I VV D+N DG +E++ D++GN+ ++ G +WE +
Sbjct: 554 PITSLTLPLDSIYSQVVVEDLNKDGNVEIIVADSNGNLVCFSNTGDELWENRFAGMTESH 613
Query: 559 PSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEK 618
SIGD++GDG DVV+ ++SG IY SG G ++ +P +T V++ +LLVDL+ +
Sbjct: 614 VSIGDINGDGILDVVLGSMSGAIYAWSGDSGKELVDFPIKTKSLVLSPLLLVDLSLDIDL 673
Query: 619 S--KGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNG 676
+ GL+I+T + DG L+ IDG SCA+ +DIG++S SMVLAD++ G ++L+VTT G
Sbjct: 674 NGRNGLSIITHTSDGVLFSIDGKQSCANKLDIGDSSVSMVLADDLLGDGKINLLVTTYYG 733
Query: 677 NVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSR-AFRDEEGRNFWVEIEI 735
N++ F+T +P+HPLKA +S N+G NV ++ + V P + D G F +++ I
Sbjct: 734 NIYIFTTNSPYHPLKAQQSFNKGL-NVFTSGSQGVVIVKQPGQPETMDILGSEFSIDVMI 792
Query: 736 VDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVL 795
VD +F Y ++ Y R I S + G ++ +PT G +
Sbjct: 793 VDT-QFNESIGNSYKLSV-------YFANRLISVSN-YHTPGLKQLTMPTPGRAYPNLLR 843
Query: 796 VEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQ 843
VE+ +K G + D S +F++Y+ + +KW++++P + + +L+ L +
Sbjct: 844 VELTNKFGQGYQDTVSYSFNLYFARTIKWVIIIPFV-LSSILIYLNTK 890
Score = 102 bits (254), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 58/138 (42%), Positives = 85/138 (61%), Gaps = 4/138 (2%)
Query: 57 QCP-KNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWP 115
+C ++++L+W + SSI +TPLIAD+ DGK IV+ + Y+EVL G G+ GWP
Sbjct: 237 ECKEQSIDLQWSLDTHSSISSTPLIADLFGDGKKYIVLVNSNSYVEVLSGRTGEMAIGWP 296
Query: 116 AFH-QSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGY-MMTDKLEIPRRKVRKDW 173
S+ SSPLL+DID DG E+ ++T GE+LF G+ ++ + L+IP KV KDW
Sbjct: 297 FISPDSTFASSPLLFDIDGDGKNEVVVSTREGEILFIGRDGFPLLNNTLKIPPLKVLKDW 356
Query: 174 YVGLHSDPVDRSHPDVHD 191
Y G+ VD + +HD
Sbjct: 357 YEGISGKHVDSAFS-LHD 373
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 54/107 (50%), Gaps = 19/107 (17%)
Query: 62 LELRWQTEV---------SSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMP 112
L+++WQT S +Y TP + D+N D L+ V+ + + Y+ VL+ +
Sbjct: 498 LQIKWQTHFDLTVDFKAFSGYVYNTPTVIDVNGDSMLETVIGTSIGYVYVLD------VR 551
Query: 113 GWP----AFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSG 155
G P S++S ++ D++KDG EI +A NG ++ F +G
Sbjct: 552 GKPITSLTLPLDSIYSQVVVEDLNKDGNVEIIVADSNGNLVCFSNTG 598
>gi|281208597|gb|EFA82773.1| hypothetical protein PPL_04468 [Polysphondylium pallidum PN500]
Length = 763
Score = 306 bits (785), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 199/560 (35%), Positives = 301/560 (53%), Gaps = 51/560 (9%)
Query: 321 ENDQALDENADSSFELFRDTDELADEYNY----DYDDYVDDAMWGDEEWTEEQHEKIEDY 376
+ND + D S++L + DE+++ Y Y+DD + +++ ++ Y
Sbjct: 223 DNDNDERKKKDDSYKLSPILKQKYDEFDFLYSVQYQRYLDDNI----QYSNYSNK----Y 274
Query: 377 VNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKEL-GGIDIGKYVAGAI 435
+ VD+HIL+TPVIADID DG E+I++ SY++D EYY P H L I I Y AG I
Sbjct: 275 IWVDAHILATPVIADIDQDGSMELIVSASYYYDSEYYSVPIHNATLERDISIENYAAGGI 334
Query: 436 VVFNLDTKQVKWTT--------------------------DLDLSTDNASFRAYIYSSPT 469
+ FNL +KW T DLDL+TDN F Y+ SPT
Sbjct: 335 ICFNLKDMSIKWQTRKYINNNNNNNNNNNNSNKLTINNIVDLDLTTDNKVFNGYVLGSPT 394
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVT 529
VVD++ DG L+ ++GT+ G YVLD+ G + PL M I +VVA D+N DGK+EL+
Sbjct: 395 VVDINNDGYLETVIGTNLGFLYVLDYQGNPLFE-PLFMDSIYASVVAQDVNGDGKVELIV 453
Query: 530 TDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDG 589
+D +GN+ G+ +WE + + SIGD++GDG DVVV + SG IY G G
Sbjct: 454 SDVNGNLVCLNNIGEELWESRIIGMTEHPVSIGDINGDGILDVVVGSFSGAIYAFRGDTG 513
Query: 590 SKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIG 649
++ +P + +M+ +LLV+L+ ++ KGL+IV + DG L I+ SC + +DIG
Sbjct: 514 EELVGFPIKLKASIMSPLLLVNLSPNQQEQKGLSIVVHTSDGVLLSINTRESCTNRLDIG 573
Query: 650 ETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNR 709
+ S SMVLA ++ G LDL+V TM G V+ +T +P P+K + NQG+N +
Sbjct: 574 DNSVSMVLASDLTGDGYLDLLVNTMYGVVYTLTTNSPFTPMKERVAFNQGKN--VYTFQH 631
Query: 710 AGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQ 769
GIY++ +R D G F + +I+D F S YNV T Y G++
Sbjct: 632 EGIYISQENRDTTDILGSEFTLAFDIIDN-GFNRSSPNNYNVRAT------YGGKQIFTM 684
Query: 770 SQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLP 829
+ F G+ I +P + +EM DK GL FSD+ SL+F++++ +LLKW++VLP
Sbjct: 685 N--FEYPGRKTITIPVPERSHMRVLTLEMKDKQGLQFSDQISLSFNIHFSRLLKWIIVLP 742
Query: 830 MLGMFGVLVILRPQEAMPLP 849
+++I + LP
Sbjct: 743 FFIASTIIIIHQQNTKSSLP 762
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 83/157 (52%), Gaps = 21/157 (13%)
Query: 31 NKFRQREATDDQLGLPQIDEDALVNT----------QCP-KNLELRWQTEVSSSIYATPL 79
NK+R P + E+ L NT +C + ++ W T+ +++ TPL
Sbjct: 42 NKYRNNH--------PIVLEEVLFNTDLTQGGNSRLECKDEQTDMIWSTQTHTTLINTPL 93
Query: 80 IADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFH-QSSVHSSPLLYDIDKDGVRE 138
I D+ +G DIV+ +Y++VL+GS G + GWP +SS +SPL++DID DG E
Sbjct: 94 ITDLFGEGSKDIVLVGTQNYVDVLKGSTGQRALGWPFIADRSSFSASPLVHDIDGDGTNE 153
Query: 139 IALATYNGEVLFFRVSGY-MMTDKLEIPRRKVRKDWY 174
+ +AT + E++F G + L +P KV K+W+
Sbjct: 154 VIVATRDAEIIFLNSLGMPLHFTTLRVPPLKVAKNWF 190
Score = 42.7 bits (99), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 1/78 (1%)
Query: 55 NTQCPKNL-ELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPG 113
N C N+ E W++ + I DIN DG LD+VV SF + G G+++ G
Sbjct: 459 NLVCLNNIGEELWESRIIGMTEHPVSIGDINGDGILDVVVGSFSGAIYAFRGDTGEELVG 518
Query: 114 WPAFHQSSVHSSPLLYDI 131
+P ++S+ S LL ++
Sbjct: 519 FPIKLKASIMSPLLLVNL 536
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 42/76 (55%), Gaps = 3/76 (3%)
Query: 74 IYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDK 133
+ +P + DIN+DG L+ V+ + L +L VL D P + S+++S + D++
Sbjct: 389 VLGSPTVVDINNDGYLETVIGTNLGFLYVL---DYQGNPLFEPLFMDSIYASVVAQDVNG 445
Query: 134 DGVREIALATYNGEVL 149
DG E+ ++ NG ++
Sbjct: 446 DGKVELIVSDVNGNLV 461
>gi|412990709|emb|CCO18081.1| predicted protein [Bathycoccus prasinos]
Length = 835
Score = 305 bits (782), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 178/474 (37%), Positives = 277/474 (58%), Gaps = 25/474 (5%)
Query: 377 VNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYD-NPEHLKELGGIDIGKYVAGAI 435
V +D+H+L +P+IAD+DNDG E+I+AVSYFFD E D + ++ I+ KY+A I
Sbjct: 353 VYIDAHVLCSPIIADVDNDGRKELILAVSYFFDEEREDLSSMQYRKRNKIEAEKYLATGI 412
Query: 436 VVFNLDTK-QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLD 494
V+ ++ + +VK + LD+S + +FRA +S PTV+D+D D +++V TS G YVL+
Sbjct: 413 VLLDIANEYRVKNSVILDVSVEATAFRARAFSQPTVLDIDKDSTNELIVATSTGNVYVLN 472
Query: 495 HHGKIRE-----KFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQ 549
GK E +FP +M EIQ +V AD ++DG E + D GNVA +T +G +WE+
Sbjct: 473 --GKTLENKNPRQFPKQMGEIQAPIVVADFDNDGYYEFIACDLRGNVAVFTKDGDILWEK 530
Query: 550 HLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLL 609
+L+S ++ P++GDVDGD ++VV T SG I+VLS + G + P+P+ T GR++ V L
Sbjct: 531 NLQSAISANPTLGDVDGDHFLEIVVGTSSGAIHVLSAETGEEKYPFPFYTGGRILAPVAL 590
Query: 610 VDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDL 669
+ K +S L++V TSFDGY+Y+IDG T C DV+D+GETSY++ L D++ L +
Sbjct: 591 GRVRKASPES--LSLVATSFDGYVYIIDGKTGCRDVIDVGETSYALPLIDDLTNSGYLSI 648
Query: 670 IVTTMNGNVFCFST--PAPH-HPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEG 726
I+ TMNG V CF PH H L+ +S N + R GIY R + D G
Sbjct: 649 ILATMNGEVHCFEATHSGPHAHALEGQQSQFLSANLMTHRNKYFGIY--GEDRGYFDARG 706
Query: 727 RNFWVEIEIVD-------EYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKY 779
++ + D + + + + PY VT TL E+ K + +
Sbjct: 707 EFMTIKYRMRDNRSIFGSKMKKKASNFGPYVVTVTLNALDVNFMEKASKYYEDVMEIDQI 766
Query: 780 RIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGM 833
+++P V R G + +EM D +G+ +D +S++FH +Y++LKW++ LP +GM
Sbjct: 767 TLRVPKVKSR--GEIFIEMKDGSGVVATDTYSVSFHERHYRILKWIVALPFIGM 818
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/189 (29%), Positives = 98/189 (51%), Gaps = 29/189 (15%)
Query: 15 CLLLFNSARGGDNSEQNKFRQREATDD-----QLGLPQIDEDA-------LVNTQCPKNL 62
L+ ++S N++ + F QR+A D ++G + +D+ V CP N+
Sbjct: 24 VLMKYSSDNTNTNNDGSIFLQRKAKPDASFKEKIGRGILTDDSKKTKKKKFVGGYCPSNV 83
Query: 63 ELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPG--WPAFHQS 120
L+W+ VSSS+YATPL+ D+++DG +++V +F+H +E+LE G + +P F +S
Sbjct: 84 HLKWRQRVSSSVYATPLLVDVHNDGHKEVLVNTFVHAIELLEPETGQPVANGRFPIFRES 143
Query: 121 SVHSSPLLY---------------DIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIP 165
+H S +Y + + ++ +ATY+GE+L F G + +P
Sbjct: 144 KLHGSTAVYVNNDDTKNNNKSNNNNNRNNNEKKFIVATYDGEILEFNARGDVDVKVARLP 203
Query: 166 RRKVRKDWY 174
R KV KDW+
Sbjct: 204 RAKVTKDWF 212
>gi|66828943|ref|XP_647825.1| hypothetical protein DDB_G0278923 [Dictyostelium discoideum AX4]
gi|60470082|gb|EAL68063.1| hypothetical protein DDB_G0278923 [Dictyostelium discoideum AX4]
Length = 792
Score = 288 bits (738), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 176/534 (32%), Positives = 290/534 (54%), Gaps = 58/534 (10%)
Query: 329 NADSSFELFRDTD---ELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILS 385
N +S F++ + E Y+ YD Y+ D W + +E Y+ VDSHILS
Sbjct: 247 NDNSKFQILKQISLDIENDPLYSVLYDKYLQDNKWFSRKTSE--------YIWVDSHILS 298
Query: 386 TPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELG-GIDIGKYVAGAIVVFNLDTKQ 444
TPVIADID DG++E++++VSY++D E Y+NP + +L G+ + YVAG I VFNL T +
Sbjct: 299 TPVIADIDGDGINELVVSVSYYYDPEKYENPHYKSQLDDGVQVENYVAGGISVFNLATGE 358
Query: 445 VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHG-KIREKF 503
+KW T LDL+TD F+ +Y SPTVVDL+ DG L+I++GT+ G YVLD+ G + +
Sbjct: 359 IKWQTHLDLTTDKTDFKGLVYGSPTVVDLNNDGKLEIIIGTALGFLYVLDYQGVPLTGNY 418
Query: 504 PLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGD 563
PL + I +V D++ DG++EL+ DT GNV + ++G IW Q + PSIGD
Sbjct: 419 PLVLESIFTQIVTEDLDSDGRLELIVIDTKGNVVCFNSDGDEIWSQQVTGKTEFPPSIGD 478
Query: 564 VDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLT 623
+DGDG DVV+ T + I+ +GK G ++ YP + +++ +LL++ + + GLT
Sbjct: 479 LDGDGILDVVMSTFTTGIFAWNGKTGKPLQGYPLKFEPSIISPLLLLE---TNDNNGGLT 535
Query: 624 IVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFST 683
I + DG LY I+ C +DIG+ S + +L +++ +L+L+++T + V+C S
Sbjct: 536 IFVHANDGILYSINDNGKCIGRLDIGDFSITKILTNDLTADGNLNLLLSTFDQKVYCLSI 595
Query: 684 PA-----PHHPLKAWRSINQGRNNVAIRYNRA---GIYVTHP------------------ 717
+ +PLK S N N + Y+++ G+ +T P
Sbjct: 596 TSGGGNVKFNPLKVENSYNINGNKFS-SYSKSTPMGVLITQPTTTTTSGSGGIDLHYNFQ 654
Query: 718 -SRAFRDEEGRNFWVEIEIVDEYRFPSGS---QAPYNVTTTLLVPGNYQGERRIKQSQIF 773
SR D G +F +EI I+D+ + + Y ++ Y GE R+K+ +I+
Sbjct: 655 YSRNSIDISGSDFNLEILILDQKNISLSTIEIKKKYTISV-------YFGEIRLKK-EIY 706
Query: 774 ARRGKYRIKL---PTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKW 824
G ++ + P+V ++V++ + N + D ++F+ Y+Y+ LK+
Sbjct: 707 YSTGLKQLLITIPPSVSRSDVKLLIVKLENINSQIYQDSIPISFNFYFYRFLKF 760
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 59/189 (31%), Positives = 100/189 (52%), Gaps = 25/189 (13%)
Query: 22 ARGGDNSEQ-----NKFRQRE---------ATDDQL---GLPQIDEDALVNTQCPK-NLE 63
G N +Q NK+R++E A ++ + GL ++ + ++ K L+
Sbjct: 22 VEGNSNDKQLLLLDNKYREKEPIGLENILFANNEDVINRGLGKLKDQGIICKGGNKVELD 81
Query: 64 LRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKM----PGWP-AFH 118
+ W+++++S+I TPLI D+ G I+ + Y++V+ G DGD++ GWP
Sbjct: 82 IVWESQLASTISNTPLIVDLFGTGDKQIIYTNSHTYIDVINGIDGDRLINGGGGWPYILP 141
Query: 119 QSSVHSSPLLYDIDKDGVREIALATYNGEVLFF--RVSGYMMTDKLEIPRRKVRKDWYVG 176
SS +SPL+YDID +G +I +AT + E+LF + + + L++P KV+K WY G
Sbjct: 142 YSSFSTSPLVYDIDGNGDLDIMVATTDAEILFIDSKKAVAIKEKTLKVPPLKVKKLWYDG 201
Query: 177 LHSDPVDRS 185
L VD S
Sbjct: 202 LDDKHVDAS 210
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 57/117 (48%), Gaps = 15/117 (12%)
Query: 63 ELRWQTEVSSS---------IYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPG 113
E++WQT + + +Y +P + D+N+DGKL+I++ + L +L VL+ G + G
Sbjct: 358 EIKWQTHLDLTTDKTDFKGLVYGSPTVVDLNNDGKLEIIIGTALGFLYVLD-YQGVPLTG 416
Query: 114 WPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSG-----YMMTDKLEIP 165
S+ + + D+D DG E+ + G V+ F G +T K E P
Sbjct: 417 NYPLVLESIFTQIVTEDLDSDGRLELIVIDTKGNVVCFNSDGDEIWSQQVTGKTEFP 473
>gi|330804242|ref|XP_003290106.1| hypothetical protein DICPUDRAFT_56468 [Dictyostelium purpureum]
gi|325079771|gb|EGC33355.1| hypothetical protein DICPUDRAFT_56468 [Dictyostelium purpureum]
Length = 762
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 177/530 (33%), Positives = 288/530 (54%), Gaps = 35/530 (6%)
Query: 304 KGSQEGNDKEDVPVATAENDQALDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDE 363
+ ++E N KED + L++ D DE Y+ YD +++ W
Sbjct: 233 RQNEEANKKEDQQQQQQQKLTVLEQGKD---------DEYDPLYSLAYDKWLNQNKW--- 280
Query: 364 EWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKEL- 422
K + Y VDSH+L+TPVIADID+DG+ E+I++VSY++D E Y NP + +L
Sbjct: 281 -----FSRKDDSYTWVDSHVLATPVIADIDDDGIKELIVSVSYYYDQEKYANPIYRSKLE 335
Query: 423 GGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
I I Y+AG I VF+L T +KW T LDL+TD +F+ IY SPTVVDLDGDG L+I+
Sbjct: 336 SDILIENYIAGGITVFDLSTGDIKWQTHLDLTTDKTAFKGSIYGSPTVVDLDGDGKLEII 395
Query: 483 VGTSFGLFYVLDHHGKIRE-KFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTA 541
+GT+ G YVL+H G E +PL + + +V D+N DGK+EL+ D +GN+ ++
Sbjct: 396 IGTALGFVYVLNHLGNPWEGNYPLVLDSVFTQIVTEDLNADGKLELIVFDNNGNIVCFSY 455
Query: 542 EGKGIWEQHLKSLVTQGP-SIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTH 600
G IW+ + S ++ P SIGD+DGDG DVV+ T + I+ +G G + +P +
Sbjct: 456 NGDQIWDNKVSSGKSEFPASIGDIDGDGILDVVLSTFNAGIFAWNGMTGKSLEGFPIKFE 515
Query: 601 GRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADN 660
+++ +LL+D + G+TI + DG +Y I+G C + +DIG+ S + +L+ +
Sbjct: 516 SSIISPLLLLD---TDNSNGGMTIFVHADDGMVYSINGKDRCVNRIDIGDFSITKILSSD 572
Query: 661 VDGGDDLDLIVTTMNGNVFCFSTPA-PHHPLKAWRSINQGRNNVAIRYNR---AGIYVTH 716
+ G +L+L+++T + ++C + P+ K+ + N N+ Y+R G+ +
Sbjct: 573 LTGDGNLNLLLSTYDHKMYCLNVNGVPYSSRKSVEAYNPNGNSFT-SYSREAPTGVMILP 631
Query: 717 PSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARR 776
R RD G +E I+D+ + T ++ Y G+ R+ + F
Sbjct: 632 SYRNLRDITGSELNLEFLIMDQKGLVVNNAKNIKYTVSI-----YFGQIRLFRDVYFTTG 686
Query: 777 GK-YRIKLPTVGVRT-TGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKW 824
K I +P+ RT ++ VEMV+ NG + D ++F+ Y+Y+LLK+
Sbjct: 687 IKQVSITIPSALSRTDPNSLRVEMVNSNGQSYHDSIIISFNHYFYRLLKF 736
Score = 92.4 bits (228), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 119/222 (53%), Gaps = 22/222 (9%)
Query: 13 LICLLLFNSARGGDNSEQNKFRQREATD-DQLGLPQIDEDALV------NTQCPK-NLEL 64
+I + F +G N++R RE + + + E+ LV N +C K +L++
Sbjct: 23 IIYSIEFKGEKGDIALLDNEYRNREPIKLEDILFGKTIENGLVKELNVGNEKCQKVDLDV 82
Query: 65 RWQTEVSSSIYATPLIAD---INSDGKLD----IVVPSFLHYLEVLEGSDGDKMPGWP-A 116
W+T ++S+I TPLI D ++ +G+L IVV + Y++V+ G DGD++ GWP
Sbjct: 83 VWETVLASTIPNTPLITDQLSVSDNGQLTGDKYIVVANSHTYIDVINGRDGDRLIGWPFI 142
Query: 117 FHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDK-LEIPRRKVRKDWYV 175
SS SSPLLYDID D +I + T + E++F +G + +K L++P K++K W+
Sbjct: 143 IPDSSFASSPLLYDIDGDQSLDIMVTTTDAEMVFISSNGMAIREKSLKVPPLKIKKLWHE 202
Query: 176 GLHSDPVDRSHPDVHDDLIVQESEAARMKSMLETKKSTPETN 217
G+ + VD + L ++++ ++ +E ++ E N
Sbjct: 203 GIGGNHVDAAFS-----LYNKDNKKFAIQKQIELQRQNEEAN 239
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 52/102 (50%), Gaps = 10/102 (9%)
Query: 63 ELRWQTEV---------SSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPG 113
+++WQT + SIY +P + D++ DGKL+I++ + L ++ VL G+ G
Sbjct: 357 DIKWQTHLDLTTDKTAFKGSIYGSPTVVDLDGDGKLEIIIGTALGFVYVLNHL-GNPWEG 415
Query: 114 WPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSG 155
SV + + D++ DG E+ + NG ++ F +G
Sbjct: 416 NYPLVLDSVFTQIVTEDLNADGKLELIVFDNNGNIVCFSYNG 457
Score = 40.0 bits (92), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 35/68 (51%), Gaps = 1/68 (1%)
Query: 66 WQTEVSSSIYATPL-IADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHS 124
W +VSS P I DI+ DG LD+V+ +F + G G + G+P +SS+ S
Sbjct: 461 WDNKVSSGKSEFPASIGDIDGDGILDVVLSTFNAGIFAWNGMTGKSLEGFPIKFESSIIS 520
Query: 125 SPLLYDID 132
LL D D
Sbjct: 521 PLLLLDTD 528
>gi|407404443|gb|EKF29894.1| FG-GAP repeat protein, putative,intergrin alpha chain protein,
putative [Trypanosoma cruzi marinkellei]
Length = 860
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 226/796 (28%), Positives = 358/796 (44%), Gaps = 88/796 (11%)
Query: 114 WPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDW 173
W A SSV ++P + D++ DG +EI + TY F + G D++ P V ++
Sbjct: 90 WTARVGSSVFATPRIVDLNHDGNKEILVPTYTQ--YFEALDGVNGEDEIGFPF--VHPNF 145
Query: 174 YVGLHSDPVDRSHPDVHDDLIVQ------------ESEAARMKSMLETKKSTPETNATVT 221
PVD + LI ++ A L KK+ + N T
Sbjct: 146 KSYCSPIPVDMDGDGKTEWLIAMYTGELMVFGDDGKARGAIQVPSLPIKKNWMKRNLTGE 205
Query: 222 TSTESNPAPATVSNPDVKKVNESLVNVSNPSEERKVNESHTEMNIKLPTSVDNSSTTTVS 281
+ P P S ++ + R V+ + ++ TS +N +
Sbjct: 206 EAVNLKPQPVRPSEEYFDRILRA---------RRMVDMTPLSQKMRQKTSDENRGDAATA 256
Query: 282 GGTNSSENGTNTGRRLLED---NNSKGSQEGNDKEDV-PVATAENDQALDENADSSFELF 337
G S RR L D + ++ G+++ DV V + D+ ELF
Sbjct: 257 LG--ESRPPAAPPRRHLADFDVEEEENNELGDERHDVGTVPRVQEDE-------RELELF 307
Query: 338 RDTDELADEYNYDYDDYVDDAMWGDEEWTEEQ-------------------HEKIEDYVN 378
D DE ++ D G E W + +E +D N
Sbjct: 308 LDEDEDINDELALLDSMEKQHGIGAEGWLSAEAKASMEMLYHPELYKSSVNYEGEKDAFN 367
Query: 379 VDS------------------HILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLK 420
+ HI+STPVI D D DG ++++ +SYFFD + Y+ E
Sbjct: 368 FRNIRNPTTVVVADDEVAVDPHIMSTPVIVDADGDGDLDIVLHLSYFFDMKAYEGEEAEL 427
Query: 421 ELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLD 480
IDI YVA ++V NL T ++KW L L+ N + AY SSP + + D D D
Sbjct: 428 LPSDIDINDYVADVLMVLNLVTGEMKWMKLLHLTKKNDTIPAYALSSPVIANPDEDNQQD 487
Query: 481 ILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAA-DINDDGKIELVTTDTHGNVAAW 539
I V T+ G + + HGK +P+ M+ + + D+N DG I+L DT GNVA +
Sbjct: 488 IYVTTTAGAIFGFNGHGKQLSGWPVWMSSSITSSPSMEDVNADGIIDLCAGDTGGNVACF 547
Query: 540 TAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRT 599
TA+GK +W + VT+ + GDVDGDG D+ T SG +Y + G+DG+ + +P
Sbjct: 548 TAKGKQLWSKTFSGGVTEQITYGDVDGDGSIDLAFGTTSGLLYAVRGRDGALLPHFPIAA 607
Query: 600 HGRVMNQVLLVDLTKRG----EKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSM 655
G ++ LLV+L ++++GL ++ S DG LY++ G T C D +DI E S +M
Sbjct: 608 EGPILAPPLLVNLNNTQPLTQQETEGLHLIVPSHDGVLYIVSGTTGCVDGIDINEKSSTM 667
Query: 656 VLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVT 715
VLAD++ G LDL+V+T++G+V F T A HPLKAW S + N + G+++
Sbjct: 668 VLADDMTGNGKLDLVVSTLSGSVMVFETQAAFHPLKAWISRVKSTNGLTASEGYVGVFIH 727
Query: 716 HPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFAR 775
SR RD G +F + + I D+ R + Y +T T+ G R + ++
Sbjct: 728 PSSRVPRDIRGEHFVLLVTIYDQRR-GDAIKRRYGLTITI-------GPRILVHHMVYGA 779
Query: 776 RGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFG 835
G Y I L T R ++ V M+ NG + D +L+F+M++ + +K + ++P L
Sbjct: 780 PGTYAITLRTPLERMYASLFVVMLLPNGQRYEDSVALSFNMHFLESIKIVFLVPFLLACL 839
Query: 836 VLVILRPQEAMPLPSF 851
+ +R Q + P F
Sbjct: 840 AISFVRKQHEVQPPPF 855
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 66/117 (56%), Gaps = 2/117 (1%)
Query: 58 CPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAF 117
C + L W V SS++ATP I D+N DG +I+VP++ Y E L+G +G+ G+P
Sbjct: 82 CRSGVYLEWTARVGSSVFATPRIVDLNHDGNKEILVPTYTQYFEALDGVNGEDEIGFPFV 141
Query: 118 HQS-SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDW 173
H + + SP+ D+D DG E +A Y GE++ F G +++P ++K+W
Sbjct: 142 HPNFKSYCSPIPVDMDGDGKTEWLIAMYTGELMVFGDDG-KARGAIQVPSLPIKKNW 197
>gi|71422190|ref|XP_812058.1| FG-GAP repeat protein [Trypanosoma cruzi strain CL Brener]
gi|70876794|gb|EAN90207.1| FG-GAP repeat protein, putative [Trypanosoma cruzi]
Length = 863
Score = 280 bits (716), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 166/475 (34%), Positives = 255/475 (53%), Gaps = 13/475 (2%)
Query: 382 HILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLD 441
HI+STPVI D D DG ++++ +SYFFD + Y+ + IDI YVA ++V NL
Sbjct: 392 HIMSTPVIVDADGDGDLDIVLHLSYFFDMKAYEGEKAELLPSDIDINDYVADVLMVLNLV 451
Query: 442 TKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIRE 501
T ++KW L LS N + AY SSP + + D D DI V T+ G + + HGK
Sbjct: 452 TGEMKWMKVLHLSKKNDTIPAYALSSPVIANPDEDYQQDIYVTTTAGAIFGFNGHGKQLS 511
Query: 502 KFPLEMAEIQGAVVAA-DINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS 560
+P+ M+ + + D+N DG I++ DT GNVA +TA GK +W + VT+ +
Sbjct: 512 GWPVWMSSSITSSPSMEDVNADGLIDVCAGDTEGNVACFTANGKQLWSKTFSGAVTKQIT 571
Query: 561 IGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTK----RG 616
GDVDGDG D+ T SG +Y + G+DG+ + +P T G ++ LLV+L
Sbjct: 572 YGDVDGDGSIDLAFGTTSGLLYAVRGRDGALLPHFPIATEGPILAPPLLVNLNNTQPLSQ 631
Query: 617 EKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNG 676
++++GL I+ S DG LY++ G T C D +DI E S +MVLAD++ G LDL+V+T++G
Sbjct: 632 QETEGLHIIVPSHDGVLYIVSGTTGCVDGIDINEKSSTMVLADDMTGNGKLDLVVSTLSG 691
Query: 677 NVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIV 736
+V F T A HPLKAW S + N + G+++ SR RD G F + + I
Sbjct: 692 SVMVFETQAAFHPLKAWISRVKSTNGLTASEGYVGVFIHPSSRVPRDIRGEQFVLLVTIY 751
Query: 737 DEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLV 796
D+ R + Y +T T+ G R + ++ G Y I L T R ++ V
Sbjct: 752 DQRR-GDAIKRRYGLTITI-------GPRILVHHMVYGAPGTYAITLRTPLERMYASLFV 803
Query: 797 EMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSF 851
M+ NG + D +L+F+M++ + +K + ++P L + +R Q + P F
Sbjct: 804 VMLLPNGQRYEDSVALSFNMHFLESIKIIFLVPFLLACLAISFVRKQHEVQPPPF 858
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 66/117 (56%), Gaps = 2/117 (1%)
Query: 58 CPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAF 117
C + L W V SS++ATP I D+N DG +I+VP++ Y E L+G +G+ G+P
Sbjct: 82 CRSGVYLEWTARVGSSVFATPRIVDLNHDGNKEILVPTYTQYFEALDGVNGEDEIGFPFV 141
Query: 118 HQS-SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDW 173
H + + SP+ D+D DG E +A Y GE++ F G +++P ++K+W
Sbjct: 142 HPNFKSYCSPIPVDMDGDGKTEWLVAMYTGELMVFGDDG-KARGAIQVPALPIKKNW 197
>gi|407838162|gb|EKF99973.1| FG-GAP repeat protein, putative,intergrin alpha chain protein,
putative [Trypanosoma cruzi]
Length = 861
Score = 278 bits (712), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 166/475 (34%), Positives = 255/475 (53%), Gaps = 13/475 (2%)
Query: 382 HILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLD 441
HI+STPVI D D DG ++++ +SYFFD + Y+ + IDI YVA ++V NL
Sbjct: 390 HIMSTPVIVDADGDGDLDIVLHLSYFFDMKAYEGEKAELLPSDIDINDYVADVLMVLNLV 449
Query: 442 TKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIRE 501
T ++KW L LS N + AY SSP + + D D DI V T+ G + + HGK
Sbjct: 450 TGEMKWMKVLHLSKKNDTIPAYALSSPVIANPDEDYQQDIYVTTTAGAIFGFNGHGKQLR 509
Query: 502 KFPLEMAEIQGAVVAA-DINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS 560
+P+ M+ + + D+N DG I++ DT GNVA +TA GK +W + VT+ +
Sbjct: 510 GWPVWMSSSITSSPSMEDVNADGLIDVCAGDTEGNVACFTANGKQLWSKTFSGAVTKQIT 569
Query: 561 IGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTK----RG 616
GDVDGDG D+ T SG +Y + G+DG+ + +P T G ++ LLV+L
Sbjct: 570 YGDVDGDGSIDLAFGTTSGLLYAVRGRDGALLPHFPIATEGPILAPPLLVNLNNTQPLAQ 629
Query: 617 EKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNG 676
++++GL I+ S DG LY++ G T C D +DI E S +MVLAD++ G LDL+V+T++G
Sbjct: 630 KETEGLHIIVPSHDGVLYIVSGTTGCVDGIDINEKSSTMVLADDMTGNGKLDLVVSTLSG 689
Query: 677 NVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIV 736
+V F T A HPLKAW S + N + G+++ SR RD G F + + I
Sbjct: 690 SVMVFETQAAFHPLKAWISRVKSTNGLTASEGYVGVFIHPSSRVPRDIRGEQFVLLVTIY 749
Query: 737 DEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLV 796
D+ R + Y +T T+ G R + ++ G Y I L T R ++ V
Sbjct: 750 DQRR-GDAIKRRYGLTITI-------GPRILVHHMVYGAPGTYAITLRTPLERMYASLFV 801
Query: 797 EMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSF 851
M+ NG + D +L+F+M++ + +K + ++P L + +R Q + P F
Sbjct: 802 VMLLPNGQRYEDSVALSFNMHFLESIKIVFLVPFLLACLAISFVRKQHEVQPPPF 856
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 66/117 (56%), Gaps = 2/117 (1%)
Query: 58 CPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAF 117
C + L W V SS++ATP I D+N DG +I+VP++ Y E L+G +G+ G+P
Sbjct: 82 CRSGVYLEWTARVGSSVFATPRIVDLNHDGNKEILVPTYTQYFEALDGVNGEDEIGFPFV 141
Query: 118 HQS-SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDW 173
H + + SP+ D+D DG E +A Y GE++ F G +++P ++K+W
Sbjct: 142 HPNFKSYCSPIPVDMDGDGKTEWLVAMYTGELMVFGDDG-KARGAIQVPALPIKKNW 197
>gi|71402411|ref|XP_804122.1| FG-GAP repeat protein [Trypanosoma cruzi strain CL Brener]
gi|70866923|gb|EAN82271.1| FG-GAP repeat protein, putative [Trypanosoma cruzi]
Length = 499
Score = 276 bits (706), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 165/475 (34%), Positives = 254/475 (53%), Gaps = 13/475 (2%)
Query: 382 HILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLD 441
HI+STPVI D D DG ++++ +SYFFD + Y+ + IDI YVA ++V NL
Sbjct: 28 HIMSTPVIVDADGDGDLDIVLHLSYFFDMKAYEGEKAELLPSDIDINDYVADVLMVLNLV 87
Query: 442 TKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIRE 501
T ++KW L LS N + AY SSP + + D D DI V T+ G + + HGK
Sbjct: 88 TGEMKWMKVLHLSKKNDTIPAYALSSPVIANPDEDNQQDIYVTTTAGAIFGFNGHGKQLS 147
Query: 502 KFPLEMAEIQGAVVAA-DINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS 560
+P+ M+ + + D+N DG I++ DT GNVA +TA GK +W + VT+ +
Sbjct: 148 GWPVWMSSSITSSPSMEDVNADGLIDVCAGDTEGNVACFTANGKQLWSKTFSGAVTKQIT 207
Query: 561 IGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLT----KRG 616
GDVDGDG D+ T SG +Y + G+DG+ + +P G ++ LLV+L
Sbjct: 208 YGDVDGDGSIDLAFGTTSGLLYAVRGRDGALLPHFPIAAEGPILAPPLLVNLNITQPLTQ 267
Query: 617 EKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNG 676
++++GL I+ S DG LY++ G T C D +DI E S +MVLAD++ G LDL+V+T++G
Sbjct: 268 QETEGLHIIVPSHDGVLYIVSGTTGCVDGIDINEKSSTMVLADDMTGNGKLDLVVSTLSG 327
Query: 677 NVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIV 736
+V F T A HPLKAW S + N + G+++ SR RD G F + + I
Sbjct: 328 SVMVFETQAAFHPLKAWISRVKSTNGLTASEGYVGVFIHPSSRVPRDIRGEQFVLLVTIY 387
Query: 737 DEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLV 796
D+ R + Y +T T+ G R + ++ G Y I L T R ++ V
Sbjct: 388 DQRR-GDAIKRRYGLTITI-------GPRILVHHMVYGAPGTYAITLRTPLERMYASLFV 439
Query: 797 EMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSF 851
M+ NG + D +L+F+M++ + +K + ++P L + +R Q + P F
Sbjct: 440 VMLLPNGQRYEDSVALSFNMHFLESIKIVFLVPFLLACLAISFVRKQHEVQPPPF 494
>gi|157872947|ref|XP_001684995.1| putative intergrin alpha chain protein [Leishmania major strain
Friedlin]
gi|68128066|emb|CAJ08162.1| putative intergrin alpha chain protein [Leishmania major strain
Friedlin]
Length = 785
Score = 270 bits (690), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 163/468 (34%), Positives = 247/468 (52%), Gaps = 13/468 (2%)
Query: 374 EDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYY-DNPEHLKELGGIDIGKYVA 432
ED + VD+HILSTPV+ D+D +G ++I+ VSYFF + + PE GG+D +VA
Sbjct: 314 EDEIKVDAHILSTPVVTDVDQNGDLDVILHVSYFFSEDVIAEQPERFA--GGVDPHDFVA 371
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYV 492
A+V NL T + +W L ST N++ AY S+P V++ D D ++++ V TS G +
Sbjct: 372 TAVVCLNLVTGETRWARLLHTSTKNSTNAAYALSTPLVINADEDRSMEVFVSTSLGFVFG 431
Query: 493 LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK 552
G I + +P+ + + + A DI DG +++ DT G V + A G E+ +
Sbjct: 432 FTADGNILDGWPVLLGPLSASPTAEDITGDGTLDICIGDTEGKVRCYNASGSVTLEKDVI 491
Query: 553 SLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
V + GDVDGDG D+V T SG IY L G DG+ + +P T G ++ VLL++L
Sbjct: 492 GGVADRLTFGDVDGDGAVDIVFGTTSGLIYALRGTDGAVLPSFPIVTGGSIVAPVLLLNL 551
Query: 613 TKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVT 672
S L IV S +G +YL+ G + C +DI E S SMVLAD+V G +DL+VT
Sbjct: 552 NGM-LYSNNLDIVVPSHNGCVYLVRGSSGCIQTIDIDEKSSSMVLADDVLGNGQMDLVVT 610
Query: 673 TMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVE 732
T+ G V+ F T P+ + AW S +G N + N GI ++ R FRD G F VE
Sbjct: 611 TIKGGVYVFQTSTPYTAMNAWPSKTKGVNGFSASGNYIGIVISPNFRTFRDIHGDTFNVE 670
Query: 733 IEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTG 792
I D P + Y V T+ G R F + G + +K+ R
Sbjct: 671 FTIYDSR--PVSPKRSYTVEITV-------GTRIRLYRGRFHKPGTHIVKVRAPLERMYS 721
Query: 793 TVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVIL 840
++ V + NG F+D SL+F+M++ +K+ L++P L + L+++
Sbjct: 722 SLNVALTLPNGQSFTDSISLSFNMHFMAAIKYTLIVPFLAVSAALMLV 769
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 36/73 (49%), Gaps = 3/73 (4%)
Query: 547 WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
W H+ S V P I D+ DGH +V++PT S + L G G + +P+ H R+
Sbjct: 42 WSAHVGSSVYATPRIVDLAHDGHKEVLIPTFSQFLEALDGPTGEDIPGFPF-MHPRMKTY 100
Query: 607 V--LLVDLTKRGE 617
L VDL G+
Sbjct: 101 ASPLPVDLNGDGQ 113
>gi|401426092|ref|XP_003877530.1| putative FG-GAP repeat protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322493776|emb|CBZ29065.1| putative FG-GAP repeat protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 793
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 162/468 (34%), Positives = 246/468 (52%), Gaps = 13/468 (2%)
Query: 374 EDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYY-DNPEHLKELGGIDIGKYVA 432
+D + VD+HILSTPV+ D+D +G ++I+ VSYFF + D PE GG+D +VA
Sbjct: 322 DDEIKVDAHILSTPVVTDVDQNGDLDVILHVSYFFSEDVITDQPERFA--GGVDPHDFVA 379
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYV 492
A+V NL T + +W L ST N+S AY S+P V++ D D ++++ V TS G +
Sbjct: 380 TAVVCLNLVTGETRWARLLHASTRNSSNAAYALSTPLVINADEDRSMEVFVSTSLGFVFG 439
Query: 493 LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK 552
G E +P+ + + + A DI DG +++ DT G V + A G E+ +
Sbjct: 440 FSADGNALEGWPVLLGPLSASPTAEDITGDGTLDICIGDTEGKVRCYNASGSVTLEKDVI 499
Query: 553 SLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
V + GDVDGDG D+V T SG IY L G +G+ + +P T G ++ LL++L
Sbjct: 500 GGVADRLTFGDVDGDGAVDIVFGTTSGLIYALRGTNGAVLPSFPIITGGSIVAPALLLNL 559
Query: 613 TKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVT 672
S L IV S +G +YL+ G + C +DI E S SMVLAD+V G +DL+VT
Sbjct: 560 NGM-MYSNDLDIVVPSHNGRVYLVRGSSGCVQTIDIDEKSSSMVLADDVLGNGQMDLVVT 618
Query: 673 TMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVE 732
T+ G V+ F T P+ + AW S +G N + N GI ++ R FRD +G F VE
Sbjct: 619 TIKGGVYVFQTSTPYTAMNAWPSKTKGVNGFSASGNYIGIAISPNLRTFRDIQGDTFNVE 678
Query: 733 IEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTG 792
I D P Y V T+ G R + F + G + +K+ R
Sbjct: 679 FTIYDSR--PVSPNRSYTVEITV-------GTRIRLYRERFNKPGTHIVKVRAPLERMYS 729
Query: 793 TVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVIL 840
++ V + NG F+D SL+F+M++ +K+ L++P L + L+++
Sbjct: 730 SLNVALTLPNGQSFTDSISLSFNMHFMAAIKYTLLVPFLAVSAALMLV 777
Score = 92.8 bits (229), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 79/133 (59%), Gaps = 13/133 (9%)
Query: 49 DEDALV---NTQCPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEG 105
D D LV +QC +++L W V SS+YATP I D+ DG ++++P+F +LE L+G
Sbjct: 22 DADQLVPNSTSQCYPSIDLGWSAHVGSSVYATPRIVDLAHDGHKEVLIPTFSQFLEALDG 81
Query: 106 SDGDKMPGWPAFH-QSSVHSSPLLYDIDKDGVREIALATYNGEVLFF----RVSGYMMTD 160
G+ +PG+P H + ++SPL D++ DG + +A Y GE++ F +V GY
Sbjct: 82 PTGEDIPGFPFMHPRMKTYASPLPVDLNGDGQVDWLVALYTGELVVFSQDGKVQGY---- 137
Query: 161 KLEIPRRKVRKDW 173
L++P ++++W
Sbjct: 138 -LQVPPLAMKRNW 149
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 36/73 (49%), Gaps = 3/73 (4%)
Query: 547 WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
W H+ S V P I D+ DGH +V++PT S + L G G + +P+ H R+
Sbjct: 42 WSAHVGSSVYATPRIVDLAHDGHKEVLIPTFSQFLEALDGPTGEDIPGFPF-MHPRMKTY 100
Query: 607 V--LLVDLTKRGE 617
L VDL G+
Sbjct: 101 ASPLPVDLNGDGQ 113
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 48/93 (51%), Gaps = 4/93 (4%)
Query: 71 SSSIYA--TPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLL 128
S++ YA TPL+ + + D +++ V + L ++ +DG+ + GWP + +SP
Sbjct: 406 SNAAYALSTPLVINADEDRSMEVFVSTSLGFVFGFS-ADGNALEGWPVL-LGPLSASPTA 463
Query: 129 YDIDKDGVREIALATYNGEVLFFRVSGYMMTDK 161
DI DG +I + G+V + SG + +K
Sbjct: 464 EDITGDGTLDICIGDTEGKVRCYNASGSVTLEK 496
>gi|72391950|ref|XP_846269.1| FG-GAP repeat protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62359889|gb|AAX80316.1| FG-GAP repeat protein, putative [Trypanosoma brucei]
gi|70802805|gb|AAZ12710.1| FG-GAP repeat protein, putative [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 845
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 160/480 (33%), Positives = 246/480 (51%), Gaps = 13/480 (2%)
Query: 374 EDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAG 433
ED V VD HI+STPV+A +D G +++++ VSY+FD Y+ G + YVA
Sbjct: 372 EDEVAVDPHIMSTPVVASVDGRGETDVVLHVSYYFDPRDYERRGVELSQDGAQVDNYVAD 431
Query: 434 AIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVL 493
++V NL +VKW L L+T N + A+ +SSPTV ++ DI V T+ G +
Sbjct: 432 VLMVVNLARGEVKWLKVLHLTTKNDTAPAFAFSSPTVTSINDRKQRDIFVTTTAGAIFGF 491
Query: 494 DHHGKIREKFPLEM-AEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK 552
G + +P+ M + I +V D++ DG ++ T D G VA +T EGK IW +
Sbjct: 492 QGDGMQLKGWPVWMNSSITASVSVEDVDGDGVADVCTGDALGYVACFTKEGKKIWSKRFS 551
Query: 553 SLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
V + GD++GDG D+ T +G IY + G DGS + +P T G ++ LLV L
Sbjct: 552 GAVGDHIAFGDINGDGDMDMAFGTATGLIYAVKGSDGSLLPQFPIATEGPIVASPLLVRL 611
Query: 613 TKRGEKS----KGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLD 668
+ KGL I+ S DG LY++ G T C D +D E S MVLAD+V G LD
Sbjct: 612 DNASDPKQVDEKGLHIIVPSHDGVLYIVKGTTGCVDAIDTDEKSSVMVLADDVTGNGMLD 671
Query: 669 LIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRN 728
L+V+T++G++ F T HPLKAW S + N G+++ SR RD G
Sbjct: 672 LVVSTLSGSIMVFGTATKFHPLKAWPSRVKSLNGFTAAEGHIGVFIHPSSRVPRDIHGDK 731
Query: 729 FWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGV 788
F + + I D+ + Q YN+ TL G +F G Y I++ + V
Sbjct: 732 FSLTVTIYDDRKGKDTGQL-YNLAFTL-------GPHVSLGRYLFDTPGTYAIQMHSPLV 783
Query: 789 RTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPL 848
R GTV V + +G + D +L+F+M++ + +K+ ++P L ++ + + +PL
Sbjct: 784 RMYGTVSVVLTLPDGQMYEDTLALSFNMHFLESVKYTFLIPFLAVWLAHSFVMEKHQVPL 843
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 87/180 (48%), Gaps = 7/180 (3%)
Query: 1 MKSSTTSANCVLLI-CLLLFNSA--RGGDNSEQNKFRQREATDDQLGLPQIDEDALVNTQ 57
+ + T+ CVL+I C + N+ + QR +D L + T
Sbjct: 3 LTGNITTIFCVLVILCTTVVNAIPLEWSEPLAHKSPHQRVLQEDLLPHARFAPTNASTTA 62
Query: 58 --CPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWP 115
C + L W T V SS+ ATP I D+N DG +I+VP++ Y+E ++G+ G + G+P
Sbjct: 63 HTCRSGVYLEWSTRVGSSVLATPRIVDLNYDGNKEILVPTYSQYIEAVDGASGADIAGFP 122
Query: 116 AFHQS-SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWY 174
H + ++SP+ DI DG E +A Y GE+L G +++P ++K W+
Sbjct: 123 FVHPNIKTYASPIPVDILDDGATEWLVAMYTGELLLIGGEGE-TRGGVKVPSLPIKKRWF 181
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 1/87 (1%)
Query: 75 YATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKD 134
+++P + IN + DI V + + +G DG ++ GWP + SS+ +S + D+D D
Sbjct: 463 FSSPTVTSINDRKQRDIFVTTTAGAIFGFQG-DGMQLKGWPVWMNSSITASVSVEDVDGD 521
Query: 135 GVREIALATYNGEVLFFRVSGYMMTDK 161
GV ++ G V F G + K
Sbjct: 522 GVADVCTGDALGYVACFTKEGKKIWSK 548
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 1/83 (1%)
Query: 51 DALVNTQC-PKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGD 109
DAL C K + W S ++ DIN DG +D+ + + ++GSDG
Sbjct: 530 DALGYVACFTKEGKKIWSKRFSGAVGDHIAFGDINGDGDMDMAFGTATGLIYAVKGSDGS 589
Query: 110 KMPGWPAFHQSSVHSSPLLYDID 132
+P +P + + +SPLL +D
Sbjct: 590 LLPQFPIATEGPIVASPLLVRLD 612
>gi|261329883|emb|CBH12866.1| FG-GAP repeat protein, putative [Trypanosoma brucei gambiense
DAL972]
Length = 845
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 160/480 (33%), Positives = 246/480 (51%), Gaps = 13/480 (2%)
Query: 374 EDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAG 433
ED V VD HI+STPV+A +D G +++++ VSY+FD Y+ G + YVA
Sbjct: 372 EDEVAVDPHIMSTPVVASVDGRGETDVVLHVSYYFDPRDYERRGVELSQDGAQVDNYVAD 431
Query: 434 AIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVL 493
++V NL +VKW L L+T N + A+ +SSPTV ++ DI V T+ G +
Sbjct: 432 VLMVVNLARGEVKWLKVLHLTTKNDTAPAFAFSSPTVTSINDRKQRDIFVTTTAGAIFGF 491
Query: 494 DHHGKIREKFPLEM-AEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK 552
G + +P+ M + I +V D++ DG ++ T D G VA +T EGK IW +
Sbjct: 492 QGDGTQLKGWPVWMNSSITTSVSVEDVDGDGVADVCTGDALGYVACFTKEGKKIWSKRFS 551
Query: 553 SLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
V + GD++GDG D+ T +G IY + G DGS + +P T G ++ LLV L
Sbjct: 552 GAVGDHIAFGDINGDGDMDMAFGTATGLIYAVKGSDGSLLPQFPIATEGPIVASPLLVRL 611
Query: 613 TKRGEKS----KGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLD 668
+ KGL I+ S DG LY++ G T C D +D E S MVLAD+V G LD
Sbjct: 612 DNASDPKQVDEKGLHIIVPSHDGVLYIVKGTTGCVDAIDTDEKSSVMVLADDVTGNGMLD 671
Query: 669 LIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRN 728
L+V+T++G++ F T HPLKAW S + N G+++ SR RD G
Sbjct: 672 LVVSTLSGSIMVFGTATKFHPLKAWPSRVKSLNGFTAAEGHIGVFIHPSSRVPRDIHGDK 731
Query: 729 FWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGV 788
F + + I D+ + Q YN+ TL G +F G Y I++ + V
Sbjct: 732 FSLTVTIYDDRKGKDTGQL-YNLAFTL-------GPHVSLGRYLFDTPGTYAIQMHSPLV 783
Query: 789 RTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPL 848
R GTV V + +G + D +L+F+M++ + +K+ ++P L ++ + + +PL
Sbjct: 784 RMYGTVSVVLTLPDGQMYEDTLALSFNMHFLESVKYTFLIPFLAVWLAHSFVMEKHQVPL 843
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 87/180 (48%), Gaps = 7/180 (3%)
Query: 1 MKSSTTSANCVLLI-CLLLFNSA--RGGDNSEQNKFRQREATDDQLGLPQIDEDALVNTQ 57
+ + T+ CVL+I C + N+ + QR +D L + T
Sbjct: 3 LTGNITTIFCVLVILCTTVVNAIPLEWSEPLAHKSPHQRVLQEDLLPHARFAPTNASTTA 62
Query: 58 --CPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWP 115
C + L W T V SS+ ATP I D+N DG +I+VP++ Y+E ++G+ G + G+P
Sbjct: 63 HTCRSGVYLEWSTRVGSSVLATPRIVDLNYDGNKEILVPTYSQYIEAVDGASGADIAGFP 122
Query: 116 AFHQS-SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWY 174
H + ++SP+ DI DG E +A Y GE+L G +++P ++K W+
Sbjct: 123 FVHPNIKTYASPIPVDILDDGATEWLVAMYTGELLLIGGEGETR-GGVKVPSLPIKKRWF 181
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 1/87 (1%)
Query: 75 YATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKD 134
+++P + IN + DI V + + +G DG ++ GWP + SS+ +S + D+D D
Sbjct: 463 FSSPTVTSINDRKQRDIFVTTTAGAIFGFQG-DGTQLKGWPVWMNSSITTSVSVEDVDGD 521
Query: 135 GVREIALATYNGEVLFFRVSGYMMTDK 161
GV ++ G V F G + K
Sbjct: 522 GVADVCTGDALGYVACFTKEGKKIWSK 548
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 1/83 (1%)
Query: 51 DALVNTQC-PKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGD 109
DAL C K + W S ++ DIN DG +D+ + + ++GSDG
Sbjct: 530 DALGYVACFTKEGKKIWSKRFSGAVGDHIAFGDINGDGDMDMAFGTATGLIYAVKGSDGS 589
Query: 110 KMPGWPAFHQSSVHSSPLLYDID 132
+P +P + + +SPLL +D
Sbjct: 590 LLPQFPIATEGPIVASPLLVRLD 612
>gi|342182243|emb|CCC91722.1| putative intergrin alpha chain protein [Trypanosoma congolense
IL3000]
Length = 854
Score = 250 bits (639), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 153/471 (32%), Positives = 244/471 (51%), Gaps = 16/471 (3%)
Query: 382 HILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEH-LKELGGIDIGKYVAGAIVVFNL 440
HI+STP IAD+D + ++++ VSYFFD YD E + G++ +YVA +V +L
Sbjct: 385 HIMSTPTIADVDGNSNVDVVVHVSYFFDLRDYDTSERDERFFEGVNFDEYVADVLVSIDL 444
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIR 500
+ ++KW L L+T + A+ SSP + D G D+ V T+ G + G+
Sbjct: 445 VSGELKWLKVLHLTTQSDRTPAFALSSPVIGGSDKMGKRDVFVTTTAGAIFGFKGDGRPM 504
Query: 501 EKFPLEM-AEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGP 559
+ +P+ M + + + D+++DG I++ D G VA +T +GK IW + +
Sbjct: 505 KGWPVWMNSSMTASPSLEDVDNDGVIDVCAGDALGYVACFTNDGKQIWSKRFSGAMGDHI 564
Query: 560 SIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL-TKRGEK 618
+ GDVDGDG D+ T +G IY + G DG+ + +P T G ++ LLV+L GE+
Sbjct: 565 TFGDVDGDGKIDMTFGTAAGLIYAVRGHDGTILPHFPIATGGSILASPLLVNLNATSGEE 624
Query: 619 -----SKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTT 673
S+GL +V S DG LY+ G + C D +DI E S +MVLAD++ G LDL+V+T
Sbjct: 625 YDEVDSRGLHVVVPSHDGVLYIASGTSGCIDAIDIDEKSSAMVLADDITGNGMLDLVVST 684
Query: 674 MNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEI 733
++G++ F T A HPLKAW S + N R G+++ SR RD G F + +
Sbjct: 685 LSGSIMVFETSAKFHPLKAWPSRVKSLNGFTASEGRVGVFIHPSSRVPRDIRGETFALLV 744
Query: 734 EIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGT 793
I D+ P G Y++ T+ G R + ++ G Y I + R G
Sbjct: 745 SIHDKRTGP-GVARVYDIVITI-------GPRILVHRNVYTAPGTYAITMHAPLERMYGI 796
Query: 794 VLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQE 844
V V M NG + D +L+F+M++ + +K+ ++P V+ +R Q
Sbjct: 797 VSVVMTLPNGQQYEDTLALSFNMHFLEGVKYTFIIPFFFACLVISFVRKQH 847
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 65/118 (55%), Gaps = 2/118 (1%)
Query: 57 QCPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPA 116
C + L W T V SS ATP + D+N DG +I+VP++ YLE ++G+ G + G+P
Sbjct: 82 MCRTGVHLEWTTHVGSSALATPRVVDLNHDGNKEILVPTYSQYLEAIDGARGGDVAGFPF 141
Query: 117 FH-QSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDW 173
H + ++SP+ D+ DG + +A Y GE++ F G + +P V++DW
Sbjct: 142 AHPRFKTYASPVPVDVIGDGTTKWLVAMYTGELMVFSEDGDAQGAAM-VPPLPVKRDW 198
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 51/103 (49%), Gaps = 2/103 (1%)
Query: 60 KNLELRWQTEVSSSI-YATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFH 118
K L L Q++ + + ++P+I + GK D+ V + + +G DG M GWP +
Sbjct: 453 KVLHLTTQSDRTPAFALSSPVIGGSDKMGKRDVFVTTTAGAIFGFKG-DGRPMKGWPVWM 511
Query: 119 QSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDK 161
SS+ +SP L D+D DGV ++ G V F G + K
Sbjct: 512 NSSMTASPSLEDVDNDGVIDVCAGDALGYVACFTNDGKQIWSK 554
>gi|405965032|gb|EKC30460.1| hypothetical protein CGI_10016522 [Crassostrea gigas]
Length = 660
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 159/499 (31%), Positives = 256/499 (51%), Gaps = 34/499 (6%)
Query: 363 EEWTEEQHEKIEDYVNVDSHILSTPVIADIDND-GVSEMIIAVSYFFDHEYYDNPEHLKE 421
E++T K Y+ +D HIL+TP++AD++ND E++I VSYF++ + Y E LK
Sbjct: 159 EKYTSFSQTKDSKYLPIDPHILATPILADLNNDFREEELVIPVSYFYEEDDYRVKEKLKT 218
Query: 422 LGGI---DIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDG- 477
LGG+ DI KY+ I V NL T Q W+ LDL+ +A+F AY SPTV+DLD DG
Sbjct: 219 LGGLEFSDIDKYLVAGITVLNLTTGQTIWSKLLDLTQVDATFPAYNIFSPTVIDLDVDGG 278
Query: 478 NLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVA 537
L++++GTS G ++ + G +R +P+ I G + AAD+ DG ++++T DT NV
Sbjct: 279 QLEVIMGTSAGSLFIFNQDGSVRTGWPVSQNTIHGQITAADLAGDGVVKIITIDTSANVI 338
Query: 538 AWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPY 597
+ EG +WE + + G + DVD DG D+++ T G IY L+G +G+ + +P
Sbjct: 339 CYNKEGSKLWESVISGTSSPGSRLYDVDRDGKMDIIITTDDGFIYALNGVNGTVLPDWPV 398
Query: 598 RTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVL 657
R+ VL +T+ +V + DG ++LI C + +GETS VL
Sbjct: 399 SVGARMTANVL---ITRVSTTHTTPDMVVMADDGSIHLISMDLKCKSQIPLGETSLVQVL 455
Query: 658 A-DNVDGGDDLDLIVTTMNGNVFCFSTPA--------------PHHPLKAWRSINQGRNN 702
+ D + ++L+V T +G + C +T +H L + S + N+
Sbjct: 456 SHDLIKWFSGMELLVATSDGTLMCLATGQELAEFDDARDETLRSNHML-SLTSETKTAND 514
Query: 703 VAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQ 762
+ + G+Y+T ++ + G F VE EI+D P S+ Y V Y
Sbjct: 515 FSFSEKKTGVYITTTNKIETEVTGETFPVEFEIIDPLYKPGKSK--YFVAI-------YF 565
Query: 763 GERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLL 822
G R + Q + F Y +++P + G V V + ++ G F D FSL F+ + L
Sbjct: 566 GNRLLMQGE-FPEPKTYNVQVPAGEEPSQGHVTVRLTNQYGQIFEDRFSLRFNKLIMEDL 624
Query: 823 KWLLVLPMLGMFGVLVILR 841
+WLL+ P + M +L++L
Sbjct: 625 QWLLLAPFVAMIIILLVLH 643
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 79/161 (49%), Gaps = 20/161 (12%)
Query: 57 QCPKNLELRWQTEVSSSIYAT-PLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMP--G 113
+C L+ W E++ + +A PL+ ++ D KLDIV F VL G +
Sbjct: 25 KCGYELKTLWTAELAHAPFAAAPLVTLVDDDQKLDIVAAPFSESFSVLNSETGKILSQTA 84
Query: 114 WPAFH-QSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDK-LEIPRRKVRK 171
WPA + +SVH+SP+ +DI+ DG +I T + E+ F+ +G+ + ++ +++P V
Sbjct: 85 WPALNLDNSVHASPIQFDINNDGQLDILFTTSSAELRFYTPAGHFLRNQTVQLPPAYVAS 144
Query: 172 DWYVGLHSDPVDRSHPDVHDDLIVQESEAARMKSMLETKKS 212
DWY DL+V+ + + S +TK S
Sbjct: 145 DWY---------------KSDLVVKFEDIEKYTSFSQTKDS 170
>gi|146094431|ref|XP_001467273.1| putative FG-GAP repeat protein [Leishmania infantum JPCM5]
gi|134071638|emb|CAM70327.1| putative FG-GAP repeat protein [Leishmania infantum JPCM5]
Length = 785
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 161/468 (34%), Positives = 247/468 (52%), Gaps = 13/468 (2%)
Query: 374 EDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYY-DNPEHLKELGGIDIGKYVA 432
ED + VD+HILSTPV+ D+D +G ++I+ VSYFF + + PE GG+D +VA
Sbjct: 314 EDEIKVDAHILSTPVVTDVDQNGDLDVILHVSYFFGEDVIAEQPERFA--GGVDPHDFVA 371
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYV 492
A+V NL T + +W L ST N++ AY S+P V++ D D ++++ V TS G +
Sbjct: 372 TAVVCLNLVTGETRWARLLHASTKNSTNAAYALSTPLVINADEDRSMEVFVSTSLGFVFG 431
Query: 493 LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK 552
G + +P+ + + + A DI DG +++ DT G V + A G E+ +
Sbjct: 432 FTADGNTLDGWPVLLGPLSASPTAEDITGDGTLDICIGDTEGKVRCYNASGSVTLEKDVI 491
Query: 553 SLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
V + GDVDGDG D+V T SG IY L G DG+ + +P T G ++ LL++L
Sbjct: 492 GGVADRLTFGDVDGDGVVDIVFGTTSGLIYALRGTDGAVLPSFPIVTGGSIVAPALLLNL 551
Query: 613 TKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVT 672
S L IV S +G +YL+ G + C +DI E S SMVLAD+V G +DL+VT
Sbjct: 552 NGM-LYSNDLDIVVPSHNGRVYLLRGSSGCVQTIDIDEKSSSMVLADDVLGNGQMDLVVT 610
Query: 673 TMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVE 732
T+ G V+ F T P+ + AW S +G N + N GI ++ R FRD +G F VE
Sbjct: 611 TIKGGVYVFQTSTPYTAMNAWPSKTKGVNGFSASGNYIGIAISPNFRTFRDIQGDTFNVE 670
Query: 733 IEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTG 792
I D P + Y V T+ G R + F + G + +K+ R
Sbjct: 671 FTIYDSR--PVSPKRSYTVEITV-------GTRIRLYRERFHKPGTHIVKVRAPLERMYS 721
Query: 793 TVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVIL 840
++ V + NG F+D SL+F+M++ +K+ L++P L + L+++
Sbjct: 722 SLNVALTLPNGQSFTDSISLSFNMHFMAAIKYTLIVPFLAVSAALMLV 769
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 36/73 (49%), Gaps = 3/73 (4%)
Query: 547 WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
W H+ S V P I D+ DGH +V++PT S + L G G + +P+ H R+
Sbjct: 42 WSAHVGSSVYATPRIVDLAHDGHKEVLIPTFSQFLEALDGPTGEDIPGFPF-MHPRMKTY 100
Query: 607 V--LLVDLTKRGE 617
L VDL G+
Sbjct: 101 ASPLPVDLNGDGQ 113
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 51/108 (47%), Gaps = 11/108 (10%)
Query: 63 ELRWQTEVSSSI-------YA--TPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPG 113
E RW + +S YA TPL+ + + D +++ V + L ++ +DG+ + G
Sbjct: 383 ETRWARLLHASTKNSTNAAYALSTPLVINADEDRSMEVFVSTSLGFVFGFT-ADGNTLDG 441
Query: 114 WPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDK 161
WP + +SP DI DG +I + G+V + SG + +K
Sbjct: 442 WPVL-LGPLSASPTAEDITGDGTLDICIGDTEGKVRCYNASGSVTLEK 488
>gi|398019877|ref|XP_003863102.1| FG-GAP repeat protein, putative [Leishmania donovani]
gi|322501334|emb|CBZ36412.1| FG-GAP repeat protein, putative [Leishmania donovani]
Length = 785
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 161/468 (34%), Positives = 247/468 (52%), Gaps = 13/468 (2%)
Query: 374 EDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYY-DNPEHLKELGGIDIGKYVA 432
ED + VD+HILSTPV+ D+D +G ++I+ VSYFF + + PE GG+D +VA
Sbjct: 314 EDEIKVDAHILSTPVVTDVDQNGDLDVILHVSYFFGEDVIAEQPERFA--GGVDPHDFVA 371
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYV 492
A+V NL T + +W L ST N++ AY S+P V++ D D ++++ V TS G +
Sbjct: 372 TAVVCLNLVTGETRWARLLHASTKNSTNAAYALSTPLVINADEDRSMEVFVSTSLGFVFG 431
Query: 493 LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK 552
G + +P+ + + + A DI DG +++ DT G V + A G E+ +
Sbjct: 432 FTADGNTLDGWPVLLGPLSASPTAEDITGDGTLDICIGDTEGKVRCYNASGSVTLEKDVI 491
Query: 553 SLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
V + GDVDGDG D+V T SG IY L G DG+ + +P T G ++ LL++L
Sbjct: 492 GGVADRLTFGDVDGDGVVDIVFGTTSGLIYALRGTDGAVLPSFPIVTGGSIVAPALLLNL 551
Query: 613 TKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVT 672
S L IV S +G +YL+ G + C +DI E S SMVLAD+V G +DL+VT
Sbjct: 552 NGM-LYSNDLDIVVPSHNGRVYLLRGSSGCVQTIDIDEKSSSMVLADDVLGNGQMDLVVT 610
Query: 673 TMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVE 732
T+ G V+ F T P+ + AW S +G N + N GI ++ R FRD +G F VE
Sbjct: 611 TIKGGVYVFQTSTPYTAMNAWPSKTKGVNGFSASGNYIGIAISPNFRTFRDIQGDTFNVE 670
Query: 733 IEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTG 792
I D P + Y V T+ G R + F + G + +K+ R
Sbjct: 671 FTIYDSR--PVSPKRSYTVEITV-------GTRIRLYRERFHKPGTHIVKVRAPLERMYS 721
Query: 793 TVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVIL 840
++ V + NG F+D SL+F+M++ +K+ L++P L + L+++
Sbjct: 722 SLNVALTLPNGQSFTDSISLSFNMHFMAAIKYTLIVPFLAVSAALMLV 769
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 36/73 (49%), Gaps = 3/73 (4%)
Query: 547 WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
W H+ S V P I D+ DGH +V++PT S + L G G + +P+ H R+
Sbjct: 42 WSAHVGSSVYATPRIVDLAHDGHKEVLIPTFSQFLEALDGPTGEDIPGFPF-MHPRMKTY 100
Query: 607 V--LLVDLTKRGE 617
L VDL G+
Sbjct: 101 ASPLPVDLNGDGQ 113
Score = 39.3 bits (90), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 51/108 (47%), Gaps = 11/108 (10%)
Query: 63 ELRWQTEVSSSI-------YA--TPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPG 113
E RW + +S YA TPL+ + + D +++ V + L ++ +DG+ + G
Sbjct: 383 ETRWARLLHASTKNSTNAAYALSTPLVINADEDRSMEVFVSTSLGFVFGFT-ADGNTLDG 441
Query: 114 WPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDK 161
WP + +SP DI DG +I + G+V + SG + +K
Sbjct: 442 WPVL-LGPLSASPTAEDITGDGTLDICIGDTEGKVRCYNASGSVTLEK 488
>gi|154342128|ref|XP_001567012.1| putative FG-GAP repeat protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134064341|emb|CAM42431.1| putative FG-GAP repeat protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 785
Score = 245 bits (626), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 163/468 (34%), Positives = 250/468 (53%), Gaps = 13/468 (2%)
Query: 374 EDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYD-NPEHLKELGGIDIGKYVA 432
ED + VD+HIL+TPV+ D+D +G ++I+ VSYFF ++ + +PE + G+D +VA
Sbjct: 314 EDEIKVDAHILATPVVTDVDQNGDLDVILHVSYFFSNDVINKHPERFPD--GVDPHDFVA 371
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYV 492
A+V NL T + +W L ST N+S AY S+P +++ D D ++++ V TS G +
Sbjct: 372 TAVVCLNLVTGETRWAHLLHASTKNSSNAAYALSTPLIINADEDRSMEVFVSTSPGFVFG 431
Query: 493 LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK 552
GK +P+ + + + A D+ DG +++ DT G V + A G +WE+ +
Sbjct: 432 FTADGKPLGGWPVLLGPLSASPTAEDVTGDGTLDICIGDTEGKVRCYNASGTIMWEKDVI 491
Query: 553 SLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
V + GDVDGDG DVV T SG IY L G DG+ + +P T G ++ VLL++L
Sbjct: 492 GGVADRLTFGDVDGDGVVDVVFGTTSGLIYALRGTDGTVLPSFPIVTGGSIVAPVLLLNL 551
Query: 613 TKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVT 672
+GL IV S +G +YL+ + C +DI E S SMVLAD+V G +DL+VT
Sbjct: 552 NGM-MFGEGLDIVVPSHNGRVYLVRASSGCVQTIDIDEKSSSMVLADDVLGNGQMDLVVT 610
Query: 673 TMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVE 732
T+NG V+ F T P+ + AW S +G N + N GI ++ R FRD G F VE
Sbjct: 611 TINGGVYVFETATPYTAMNAWPSKTKGVNGCSASGNHVGISISPNFRTFRDIRGDTFNVE 670
Query: 733 IEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTG 792
I D P Y V T+ G R F + G + +K+ R
Sbjct: 671 FTIYDAR--PVIMNRSYAVDITV-------GTRIRLYRMRFDKPGTHNVKVRAPLERMYS 721
Query: 793 TVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVIL 840
++ V + NG F+D SL+F+MY+ +K+ L++P L + L+++
Sbjct: 722 SLNVVLTLPNGQSFTDSISLSFNMYFMHAIKYTLIVPFLVVSAALMLV 769
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 69/120 (57%), Gaps = 5/120 (4%)
Query: 56 TQCPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWP 115
+QC L L W V SS+YATP I D+ DG ++++P+F +LE L+G G+ +PG+P
Sbjct: 32 SQCYSGLNLGWSARVGSSVYATPRIVDLAHDGHKEVLIPTFSQFLEALDGPTGEDIPGFP 91
Query: 116 AFH-QSSVHSSPLLYDIDKDGVREIALATYNGEVLFF----RVSGYMMTDKLEIPRRKVR 170
H + ++SPL D++ DG + +A Y GE++ F + GY+ L + R VR
Sbjct: 92 FMHPRMKTYASPLPVDLNGDGQVDWLVALYTGELVIFGSDGKAQGYLQVPPLTMKRNWVR 151
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 47/96 (48%), Gaps = 10/96 (10%)
Query: 71 SSSIYA--TPLIADINSDGKLDIVV---PSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSS 125
S++ YA TPLI + + D +++ V P F+ +DG + GWP + +S
Sbjct: 398 SNAAYALSTPLIINADEDRSMEVFVSTSPGFVFGFT----ADGKPLGGWPVL-LGPLSAS 452
Query: 126 PLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDK 161
P D+ DG +I + G+V + SG +M +K
Sbjct: 453 PTAEDVTGDGTLDICIGDTEGKVRCYNASGTIMWEK 488
>gi|145350159|ref|XP_001419484.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579716|gb|ABO97777.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 742
Score = 241 bits (614), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 149/472 (31%), Positives = 244/472 (51%), Gaps = 31/472 (6%)
Query: 368 EQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDH--EYYDNPEHLKELGGI 425
E ++ V VD+H L TP + D+D DG E+++AVSY+FD + D+ +
Sbjct: 258 EARRAVDGRVFVDAHALCTPAVGDVDGDGEPELVLAVSYYFDSSVRFEDD---------V 308
Query: 426 DIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGT 485
D +Y A A+VV N + K + LD S A F+A Y+ PT+VD+DGDG LDI++GT
Sbjct: 309 DPKQYAATALVVLNGEDLSTKRSIALDQSAATALFKARAYAPPTLVDVDGDGRLDIVIGT 368
Query: 486 SFGLFYVLDH-HGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGK 544
G+ +V+D G +P + +++ V AAD++ DG IEL+ D G VA + + G
Sbjct: 369 YAGVLHVVDGVSGNPLPGWPRRLGQMEAQVTAADVDSDGDIELIACDVRGTVAVFKSNGV 428
Query: 545 GIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVM 604
+W +H++S + S+GDVDGD ++VV SG ++ KDG+ +P +++
Sbjct: 429 ELWNKHVESRIAVAASVGDVDGDDEIEIVVGDTSGAVHAFRAKDGTAREHWPVYVGDKIL 488
Query: 605 NQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGG 664
++ LTK + +GL ++ + DG +++++G + C DV DI E Y+ + G
Sbjct: 489 APIV---LTKLRQTKRGLDVIVATHDGVVHVLEGRSRCRDVFDIAEKIYAAPFVTSFAGF 545
Query: 665 DDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDE 724
+LD+I++TM G+V F A S + RY+ G+ + R +R
Sbjct: 546 GELDVIISTMEGHVRSFKAKGSKFNALALTSSDH-----VSRYDYFGVALY--DRTYRVI 598
Query: 725 EGRNFWVEIEIVDEY-----RFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKY 779
G V EI+D + S APY V T+ + G ++I ++ R G+Y
Sbjct: 599 RGTYVDVVYEIIDRRVLDHAKARKSSLAPYKVAITVT---SLDGFKKIVSAR-HDRAGRY 654
Query: 780 RIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPML 831
+++ +T G + V + D N + D +S++FH Y LKWL+ LP L
Sbjct: 655 TLRVGVPSTKTRGEIRVRVQDVNLIADEDAYSVSFHENYEIALKWLVALPFL 706
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 47/87 (54%), Gaps = 1/87 (1%)
Query: 75 YATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKD 134
YA P + D++ DG+LDIV+ ++ L V++G G+ +PGWP + + D+D D
Sbjct: 348 YAPPTLVDVDGDGRLDIVIGTYAGVLHVVDGVSGNPLPGWPR-RLGQMEAQVTAADVDSD 406
Query: 135 GVREIALATYNGEVLFFRVSGYMMTDK 161
G E+ G V F+ +G + +K
Sbjct: 407 GDIELIACDVRGTVAVFKSNGVELWNK 433
>gi|298705814|emb|CBJ28983.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 757
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 190/637 (29%), Positives = 285/637 (44%), Gaps = 96/637 (15%)
Query: 269 PTSVDNSSTTTVSGGTNSSENGTNTGRRLLEDNNSKGSQEGNDKEDVPVATAENDQALDE 328
P S+ N T SG ++ RRLL+DN ++ + + A N Q +
Sbjct: 121 PRSLKNVQETEDSGDADAGLRSPGRRRRLLDDNEAQEQHAASPNQGGDSAALGNGQIEHQ 180
Query: 329 NADSSFE--LFRDT--DELADEYNYDYDDYVDDAM--WGD-------------------- 362
SFE +F++ L E YD +AM WG+
Sbjct: 181 WTGDSFEGGMFQEGVDSGLDGESAYDTAAAYAEAMRRWGEYGTGPLPNDDGIGWWANYGD 240
Query: 363 -------EEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYY-- 413
EE + + + V + HI + PV D ++ MI+ VSYF Y
Sbjct: 241 DFDLIGLEEGEDFGKRQAGEVVEIQPHISAAPVFLDAEDGQGKRMIVTVSYFKSKRRYWG 300
Query: 414 -DNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFR-AYIYSSPTVV 471
P E K VAG++ ++ + + W DL +T R A + +SPTV
Sbjct: 301 SGTPATAAE-------KVVAGSLACWDFEDSKWLWIHDLGETTSGEKKRGAPVDASPTVG 353
Query: 472 DLDGDGNLDILVGTSFGLFYVLD-HHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTT 530
DLDGDG L+++VGT+ G +V+D G RE FPL I+G VV AD++ D ++E++
Sbjct: 354 DLDGDGKLEVVVGTARGGLHVVDASSGLAREGFPLTYEPIRGQVVIADVHGDERLEMLLA 413
Query: 531 DTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVV--VPTLSGN--IYVLSG 586
D+ G V +GK W + L T ++GDVDGDG DVV V T G+ ++ LS
Sbjct: 414 DSSGAVVCVKHDGKECWNKALSGSTTSMLTLGDVDGDGALDVVAGVTTAEGSAEVWALSA 473
Query: 587 KDGSKVRPYPYRTHGR--VMNQVLLVDLTKR--------------------------GEK 618
+ G + YP R + + LVDL +R G
Sbjct: 474 ESGLPLPNYPVALRNRKGISAPITLVDLHQRSAGSMGGGGPADASSQAKRRGNVAPIGGW 533
Query: 619 SKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNV 678
GL ++ S DG++Y+I+G T+C + +D+GE SMVLAD+VDG LDLIV TM+G V
Sbjct: 534 GAGLHLLAPSEDGHVYVIEGATACVNKIDLGERVRSMVLADDVDGDGTLDLIVGTMSGEV 593
Query: 679 FCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDE 738
S P+HPL AW S +G N G+Y + + + GR+F + EIVD
Sbjct: 594 VALSANVPYHPLNAWTSQVRGPTNGFTHGGYQGVYFVGAASDYGEMVGRHFALTFEIVDR 653
Query: 739 YRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKY-RIKLPTVGVRTTGTVL-- 795
+ + P+ +T + G +IF R Y + TV + VL
Sbjct: 654 ---STRTATPFQITI-------FAG-----TEEIFRRDNMYGGTHVATVSLEAPRRVLLR 698
Query: 796 VEMVDK-NGLYFSDEFSLTFHMYYYKLLKWLLVLPML 831
+EM +K L F D + ++ ++ LKW+ +PM+
Sbjct: 699 IEMTNKAQRLVFEDHVFVGYNTKFHVALKWMTAIPMV 735
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 58/104 (55%), Gaps = 6/104 (5%)
Query: 52 ALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKM 111
L + +CP ++ + W EV S +Y+TP+I ++ G IV+ + L +E+++G DG
Sbjct: 2 GLGSRECPLDISVLWTAEVDSPVYSTPVILPSSAGGYKQIVLAT-LRNVELIDG-DGSVP 59
Query: 112 PGWPA----FHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFF 151
WP S+ +SP L+DID DGV+++ +A G + +
Sbjct: 60 YPWPIEMDDHENSAFLASPALHDIDGDGVQDVCVADTMGWLRWL 103
>gi|440803167|gb|ELR24077.1| FGGAP repeat domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 786
Score = 222 bits (565), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 158/474 (33%), Positives = 253/474 (53%), Gaps = 41/474 (8%)
Query: 371 EKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKY 430
E ++ V +D H+L+ PV+ D+D DG+ ++++AVSY+ Y E G D KY
Sbjct: 315 EPDDNVVVLDPHVLAEPVVRDVDGDGLDDLVVAVSYYL---YAARAEEQ----GADDSKY 367
Query: 431 VAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAY--------IYSSPTVVDLDGDGNLDIL 482
+ +V FNL +KW T LDLS + ++ Y IY SP+V D++GDGN++IL
Sbjct: 368 ASSGVVCFNLPNGTLKWHTPLDLSFADRAYSPYLTRRNHHQIYESPSVGDIEGDGNVEIL 427
Query: 483 VGTSFGLFYVLDHH-GKIRE--KFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAW 539
VGT G+ Y+L+ GK+R+ +FP+ M I G V D++ +GK++++ D +GN+A +
Sbjct: 428 VGTGLGVLYLLEGKTGKVRDDDQFPISMDSIHGRVAMHDVDGNGKLDIIANDINGNLAVF 487
Query: 540 TAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRT 599
+GK +W GPS+GD+DGDG D+V+ T G+++ + K G ++ +P R
Sbjct: 488 DHQGKELWSAQFTPSSHVGPSVGDIDGDGQIDIVLATELGHVWAWNAKTGKVLKNFPMRL 547
Query: 600 HGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLAD 659
+ ++ LL+ T+ + GL I+ T +G LYLI T + +DIG S MVLAD
Sbjct: 548 EAGIHDRPLLLPRTR--DADDGLLIIVTGDNGILYLIHPLTPAIERIDIGHYSLGMVLAD 605
Query: 660 NVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSR 719
++ G ++ L+V T +G + +T P PL A R QGR GIYV
Sbjct: 606 DLLGNGEVHLVVATTSGRLLALTTSLPLLPLAAVRGNGQGRMG-----GEYGIYVE--GE 658
Query: 720 AFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKY 779
+R GR F V I+D P APY++T Y G +++ ++ +A G Y
Sbjct: 659 RYRQVTGRTFTVPFRIIDNAETP---LAPYHLTF-------YVGGKKVLETT-YADPGAY 707
Query: 780 RIKLPTVGVRTTGTVLVEMVDKNGLY--FSDEFSLTFHMYYYKLLKWLLVLPML 831
+ L V + ++ VE+V N + + D F++ + + + LKW +V P L
Sbjct: 708 SVAL-DVPATVSNSIGVELVGMNKVSEEWHDSFTIQCNANFARTLKWFIVGPFL 760
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 41/83 (49%), Gaps = 9/83 (10%)
Query: 74 IYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDG-----DKMPGWPAFHQSSVHSSPLL 128
IY +P + DI DG ++I+V + L L +LEG G D+ P S+H +
Sbjct: 409 IYESPSVGDIEGDGNVEILVGTGLGVLYLLEGKTGKVRDDDQFP----ISMDSIHGRVAM 464
Query: 129 YDIDKDGVREIALATYNGEVLFF 151
+D+D +G +I NG + F
Sbjct: 465 HDVDGNGKLDIIANDINGNLAVF 487
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 18/69 (26%), Positives = 35/69 (50%)
Query: 66 WQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSS 125
W + + S + P + DI+ DG++DIV+ + L ++ G + +P ++ +H
Sbjct: 495 WSAQFTPSSHVGPSVGDIDGDGQIDIVLATELGHVWAWNAKTGKVLKNFPMRLEAGIHDR 554
Query: 126 PLLYDIDKD 134
PLL +D
Sbjct: 555 PLLLPRTRD 563
Score = 39.7 bits (91), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 30/118 (25%), Positives = 50/118 (42%), Gaps = 20/118 (16%)
Query: 63 ELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSV 122
+L W+ +S +TP I D+ DG ++V + ++ G G P F
Sbjct: 88 DLLWEVSLSGPSRSTPSIVDLYRDGTKQVLVSAAFNH-----GGTG------PVFMA--- 133
Query: 123 HSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMT-DKLEIPRRKVRKDWYVGLHS 179
++DI DG E+ +G + + SG+ + EIP V + W+ GL S
Sbjct: 134 -----MHDISGDGREEVVTVGEDGNIRVWHESGWSLAYPPFEIPHLPVPRGWFDGLES 186
>gi|361067539|gb|AEW08081.1| Pinus taeda anonymous locus 0_18399_01 genomic sequence
gi|383132240|gb|AFG46971.1| Pinus taeda anonymous locus 0_18399_01 genomic sequence
Length = 140
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 102/136 (75%), Positives = 117/136 (86%)
Query: 719 RAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGK 778
R RDEEG +FW++ +I+D++R PSGS PYNVT +LLVPGNYQG RRI Q QI+ R
Sbjct: 1 RVLRDEEGDSFWLQFKILDQHRVPSGSHGPYNVTVSLLVPGNYQGPRRILQHQIYDRPDT 60
Query: 779 YRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLV 838
Y++KLPTV VRTTGTV+VEMVDKNGLYFSDEFSLTFHM+YYKLLKWLLVLPM+GMFG+LV
Sbjct: 61 YKMKLPTVPVRTTGTVIVEMVDKNGLYFSDEFSLTFHMHYYKLLKWLLVLPMVGMFGLLV 120
Query: 839 ILRPQEAMPLPSFSRN 854
I RPQE PLPSFSRN
Sbjct: 121 IFRPQEGAPLPSFSRN 136
>gi|383132242|gb|AFG46972.1| Pinus taeda anonymous locus 0_18399_01 genomic sequence
Length = 140
Score = 218 bits (555), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 101/136 (74%), Positives = 116/136 (85%)
Query: 719 RAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGK 778
R RDEEG +FW++ +I+D++R PSGS PYNVT +LLVPGNYQG RRI Q QI+ R
Sbjct: 1 RVLRDEEGDSFWLQFKILDQHRVPSGSHGPYNVTVSLLVPGNYQGPRRILQHQIYDRPDT 60
Query: 779 YRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLV 838
Y++KLPTV VRTTGTV+VEMVDKNGLYFSDEFSLTFHM+YYKLLKWLLVLPM+GMFG+LV
Sbjct: 61 YKMKLPTVPVRTTGTVIVEMVDKNGLYFSDEFSLTFHMHYYKLLKWLLVLPMVGMFGLLV 120
Query: 839 ILRPQEAMPLPSFSRN 854
I R QE PLPSFSRN
Sbjct: 121 IFRAQEGAPLPSFSRN 136
>gi|390331642|ref|XP_780158.2| PREDICTED: uncharacterized protein LOC574700 [Strongylocentrotus
purpuratus]
Length = 665
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 141/488 (28%), Positives = 249/488 (51%), Gaps = 39/488 (7%)
Query: 372 KIEDYVNVDSHILSTPVIADIDNDGVSE-MIIAVSYFFDHEYYDNPEHLK-ELGGIDIGK 429
+ +++ D+HIL+TP + D+D DG E + + VSY YD E + E G +I
Sbjct: 180 QTNNFIACDAHILATPALTDLDGDGREEELALPVSY------YDQLESPRDEESGENIQD 233
Query: 430 YVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGD-GNLDILVGTSFG 488
V G +V+ NL + + L LS +A AY+ PTVVDLDG G ++++ T+ G
Sbjct: 234 IVVGGLVILNLTSGHLIKEVPLHLSKQSAPSPAYLLYPPTVVDLDGHHGPPEVIIATASG 293
Query: 489 LFYVLDHHGKIREKFP--LEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGI 546
L + +D G+ R FP LE I G + D+++DG +E++ T G+V+ ++ G +
Sbjct: 294 LLHAMDSMGRARSGFPVRLEGGAITGQITVDDLDNDGNLEMIVLQTSGDVSCFSNHGNKM 353
Query: 547 WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
W + G + D+D DG DV++PT GN++ LSGK G+++ +P R+++
Sbjct: 354 WSSKVSGGSMPGSRLADLDDDGVMDVIIPTRDGNVWALSGKTGTRLPHWPVHVGSRLVSN 413
Query: 607 VLLVDLTKRGEKSKGLTIVTTSFDGYLYLI---------DGPTSCADVVDIGETSYSMV- 656
V++ L S+G +++TT +DG LY+I GP C + +++GE+S + V
Sbjct: 414 VVITSLIN---SSQGPSVITTGYDGNLYIISSSSDKGKEQGP--CIEAIELGESSLTPVH 468
Query: 657 LADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSIN--QGRNNVAIRYNRA--GI 712
+AD + +++I T +G++ ST I N+ Y +A G+
Sbjct: 469 MADMIPSMPGVEMIAATTHGSIMLMSTQKQEMESVELEDIRWMDATNDKQFAYKQAKYGV 528
Query: 713 YVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQI 772
+V +R G +F +++ I DE R PSG + Y + + G ++
Sbjct: 529 FVRRETRQRSYISGTSFTIDLHIEDENR-PSGGHS-YVIQV-------FAGSICLQSPVT 579
Query: 773 FARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLG 832
+++ G Y +++ T T V V M +++G F D F ++F+ ++W+LV+P +
Sbjct: 580 YSQPGLYSLQVATPNHPITVLVEVHMTNQHGQIFVDSFPVSFNTKVTDDIQWMLVIPTIA 639
Query: 833 MFGVLVIL 840
M +L++L
Sbjct: 640 MAILLLVL 647
Score = 80.5 bits (197), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 70/106 (66%), Gaps = 4/106 (3%)
Query: 72 SSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSD--GDKMPGWP-AFHQSSVHSSPLL 128
S I ++PL+ADI++DG LDI+ +F ++V++G + WP F S+VH+SP+L
Sbjct: 49 SPIVSSPLVADIDADGTLDIITTTFDGTVDVVDGENMKPHSRSQWPYQFPNSTVHASPML 108
Query: 129 YDIDKDGVREIALATYNGEVLFFRVSGYMMTDK-LEIPRRKVRKDW 173
+D+D DG+RE+ + T +GE+ +F+ G + +K ++IP V K+W
Sbjct: 109 FDVDGDGLREVLVFTTDGEINYFKSDGTFLPEKTVQIPPLWVSKEW 154
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 27/108 (25%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 450 DLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDH-----HGKIREKFP 504
D+D+ + + I SSP V D+D DG LDI+ T G V+D H + + +
Sbjct: 37 DVDIIQSHHVGNSPIVSSPLVADIDADGTLDIITTTFDGTVDVVDGENMKPHSRSQWPYQ 96
Query: 505 LEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK 552
+ + + + D++ DG E++ T G + + ++G + E+ ++
Sbjct: 97 FPNSTVHASPMLFDVDGDGLREVLVFTTDGEINYFKSDGTFLPEKTVQ 144
>gi|156400118|ref|XP_001638847.1| predicted protein [Nematostella vectensis]
gi|156225971|gb|EDO46784.1| predicted protein [Nematostella vectensis]
Length = 628
Score = 196 bits (498), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 149/501 (29%), Positives = 240/501 (47%), Gaps = 48/501 (9%)
Query: 361 GDEEWTEEQHEKIED-------YVNVDSHILSTPVIADIDNDG-VSEMIIAVSYFFDHEY 412
GD W++ + +V VD H+LSTPVI D +NDG +E++ AV+++F HE
Sbjct: 138 GDPLWSQRHPGRCPSGVSTDGVWVYVDPHVLSTPVITDFNNDGHENELVAAVNFYFGHER 197
Query: 413 YDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVD 472
L L + Y+ +VVF+LD+ ++ + + L+++ ++ + Y SSPT+VD
Sbjct: 198 MSFTPPLTALSEEEAENYLGCGLVVFDLDSGKIIFHSILEVTKRSSPYPGYCLSSPTIVD 257
Query: 473 LDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDT 532
N +++GTS G +++ E M +QG V AD+NDDG +ELV+ D
Sbjct: 258 -----NTSVVIGTSTGNLFLVSSRKSQPETL-TSMDSLQGQVTVADVNDDGAVELVSIDD 311
Query: 533 HGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKV 592
GNVA GK +WE + G + D++GDG +VV T G I+ LSG G V
Sbjct: 312 SGNVACTDLSGKMVWEGRVSGSAAAGIRVADINGDGPLEVVFATYDGYIWALSGDQGKVV 371
Query: 593 RPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETS 652
+P + + VL+ L GE S GL IV DG L +I G C +++ G S
Sbjct: 372 PGWPVKMPDDLRAAVLITKLLP-GE-SSGLDIVVPLLDGTLAIIRGTDRCTEIIRTGVDS 429
Query: 653 Y-SMVLADNVDGGDDLDLIVTTMNGNVFCFSTPA-----PHHPLKAWRSINQGRNNVAIR 706
S V AD + ++LIV +G + C P H + W R AI
Sbjct: 430 LASAVTADLLPWKPGMELIVGGTDGTLLCLGYPGNTTKDGHVHEQLWL-----RAEFAIF 484
Query: 707 Y-NRAGIYVTHPSRAFRDEEGRNFWVEIEIVD----EYRFPSGSQAPYNVTTTLLVPGNY 761
+ ++ G+ +T +R G ++ +E EI+D + R+ N T +L
Sbjct: 485 FLSKVGVSLTKATRLLTQFSGSSYPLEFEIIDIQPNDLRW-------NNYKTKVLF---- 533
Query: 762 QGERRIKQSQIFARRGKY--RIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYY 819
G R++ ++ F G Y + LP+ +R T+ V N F D F ++ + +Y
Sbjct: 534 -GNRQVALNKSFDAPGVYMENLVLPSEPLRAVLTIRVTT--PNNQCFEDSFHVSLNTHYL 590
Query: 820 KLLKWLLVLPMLGMFGVLVIL 840
+ + W L+ P L + +L++L
Sbjct: 591 EDIHWYLLTPFLTVACLLLVL 611
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 68/109 (62%), Gaps = 2/109 (1%)
Query: 52 ALVNTQCPKNLELRWQTEVSSS-IYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDK 110
++++ C +L W +E ++ I ++PLI D+N D D+ V +F + V++GS G
Sbjct: 11 SVLSHDCHYSLNQAWLSEAGAAPIVSSPLIVDVNGDNIKDVAVTTFDGQVSVIDGSTGQC 70
Query: 111 MPGWP-AFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMM 158
+ GWP S+ H++PL+YDIDKDG+ E+ + T + +V+F + G+++
Sbjct: 71 LSGWPITITGSNFHAAPLMYDIDKDGLMEMVVTTSDSKVVFIDMDGHLL 119
Score = 47.0 bits (110), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 4/92 (4%)
Query: 27 NSEQNKFRQREATDD-QLGLPQIDEDALVNTQCPK-NLELRWQTEVSSSIYATPLIADIN 84
+S Q + + DD + L ID+ N C + ++ W+ VS S A +ADIN
Sbjct: 287 DSLQGQVTVADVNDDGAVELVSIDDSG--NVACTDLSGKMVWEGRVSGSAAAGIRVADIN 344
Query: 85 SDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPA 116
DG L++V ++ Y+ L G G +PGWP
Sbjct: 345 GDGPLEVVFATYDGYIWALSGDQGKVVPGWPV 376
Score = 39.3 bits (90), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 6/108 (5%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLD-HHGKIREKFPLEM--AEIQGAVVAAD 518
A I SSP +VD++GD D+ V T G V+D G+ +P+ + + A + D
Sbjct: 32 APIVSSPLIVDVNGDNIKDVAVTTFDGQVSVIDGSTGQCLSGWPITITGSNFHAAPLMYD 91
Query: 519 INDDGKIELVTTDTHGNVAAWTAEGKGIWE---QHLKSLVTQGPSIGD 563
I+ DG +E+V T + V +G ++ Q L +L P+ GD
Sbjct: 92 IDKDGLMEMVVTTSDSKVVFIDMDGHLLYGRTLQVLYALQQNSPTQGD 139
>gi|443693748|gb|ELT95035.1| hypothetical protein CAPTEDRAFT_206740 [Capitella teleta]
Length = 596
Score = 196 bits (497), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 148/528 (28%), Positives = 247/528 (46%), Gaps = 44/528 (8%)
Query: 298 LEDNNSKGSQEGNDKEDVPVATA-ENDQALDENADSSFELFRDTDELADEYNYDYDDYVD 356
+ N + +EG +P +T + D N D + TD + Y + ++
Sbjct: 87 IHGRNGRDMEEGQWSTKIPHSTVLSSPLQFDINGDGILDTVIITDGGQIFFLYKDGNVIE 146
Query: 357 DAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDG-VSEMIIAVSYFFDHEYYDN 415
D + + Y+ VD H+L+TPVI D++ DG + EM+I VSYFFD Y+
Sbjct: 147 DTTF---------QSDLTSYIQVDPHVLATPVIVDLNQDGSIEEMVIPVSYFFDQREYEL 197
Query: 416 PEHLKELG-GID---IGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVV 471
P+ KEL G+D + Y+A ++VV NL + Q ++L+ A F YI +PT+
Sbjct: 198 PD--KELPLGLDFDSLNNYLASSLVVLNLTSLQPIKALHMELTKVTAEFPGYILFTPTLA 255
Query: 472 DLDGD-GNLDILVGTSFGLFYVLDHH-GKIREKFPLEMAEIQGAVVAADINDDGKIELVT 529
D+D +L+I+VGTS G + +D GK R+ FP++ I G +V D+N DG +E++
Sbjct: 256 DIDSSFSSLEIVVGTSAGQIHCIDSDTGKNRDGFPIQTDTIFGQLVVEDVNQDGSLEIIA 315
Query: 530 TDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDG 589
D GNV +++ GK IWE ++ G + D++ DG +V++ GN+YVLSG G
Sbjct: 316 QDVSGNVVCYSSSGKVIWEMQIEMTRPTGTRVVDINADGLLEVMLCADDGNVYVLSGSSG 375
Query: 590 SKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTS-CADVVDI 648
V +P++ L++ + + L VT +GYL +I G S C D V++
Sbjct: 376 EAVEKWPFKVVDGTQAPPLVMPILPQNV----LAFVTLGNNGYLNVISGDRSTCRDTVNL 431
Query: 649 GETSYSMVLA-DNVDGGDDLDLIVTTMNGNVFCFSTPAPHHP---------LKAWRSINQ 698
ETS +L+ D V L++++ T +G + C + + + + AW S N+
Sbjct: 432 AETSLVQILSHDLVPQAPGLEMLIATEDGTLLCVGSGSNYTTQHSRLDSADMIAWPSENR 491
Query: 699 GRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVP 758
+ + G+ + + G +F +E E +D Y V
Sbjct: 492 ALTDGTYHGQKFGLLLKAEPTQVLEVSGSSFNLEFEFIDVYSQLRRKGFKLQV------- 544
Query: 759 GNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYF 806
Y G + + S+ + G Y IK+P ++V+ NG F
Sbjct: 545 --YAGVKLL-HSERYLTAGLYTIKIPCAREPQLAHLVVQASTANGKVF 589
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 62/108 (57%), Gaps = 4/108 (3%)
Query: 57 QCPKNLELRWQTEVSSSIYA-TPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMP--G 113
+C +L + W E+SS+ +A +PLIADI++DG LDIV S+ + + G +G M
Sbjct: 40 RCIHDLSIIWSRELSSAPFAASPLIADIDADGDLDIVSASYAGDIHAIHGRNGRDMEEGQ 99
Query: 114 WPA-FHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTD 160
W S+V SSPL +DI+ DG+ + + T G++ F G ++ D
Sbjct: 100 WSTKIPHSTVLSSPLQFDINGDGILDTVIITDGGQIFFLYKDGNVIED 147
>gi|307111515|gb|EFN59749.1| hypothetical protein CHLNCDRAFT_133372 [Chlorella variabilis]
Length = 193
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 84/162 (51%), Positives = 113/162 (69%), Gaps = 1/162 (0%)
Query: 417 EHLKELGG-IDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDG 475
EH ELG +++G YVA +VVF+L + VKW+ LDLSTD SF+AY YS+PT+ D+D
Sbjct: 18 EHRLELGKDVELGLYVASGVVVFDLHRRSVKWSQHLDLSTDYTSFKAYAYSAPTLADIDN 77
Query: 476 DGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGN 535
+G ++I++GTS G YVLD +G R FPL+M +IQ V ADIN DG++E+V D+ GN
Sbjct: 78 NGKMEIIMGTSMGFLYVLDCYGVTRPGFPLQMGDIQAQVAMADINADGQLEMVAADSRGN 137
Query: 536 VAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTL 577
VAA+TA GK +WE HL S + Q GD+DGDG +VV+ T
Sbjct: 138 VAAFTAAGKEVWETHLNSQIHQNAVFGDIDGDGELEVVLATF 179
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 47/88 (53%), Gaps = 2/88 (2%)
Query: 68 TEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPL 127
T + Y+ P +ADI+++GK++I++ + + +L VL+ G PG+P + +
Sbjct: 60 TSFKAYAYSAPTLADIDNNGKMEIIMGTSMGFLYVLD-CYGVTRPGFP-LQMGDIQAQVA 117
Query: 128 LYDIDKDGVREIALATYNGEVLFFRVSG 155
+ DI+ DG E+ A G V F +G
Sbjct: 118 MADINADGQLEMVAADSRGNVAAFTAAG 145
Score = 40.4 bits (93), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 61/143 (42%), Gaps = 33/143 (23%)
Query: 364 EWTEEQHEKIE-DYVNVDSHILSTPVIADIDNDGVSEMIIAVS----YFFDHEYYDNPEH 418
+W+ QH + DY + ++ S P +ADIDN+G E+I+ S Y D P
Sbjct: 48 KWS--QHLDLSTDYTSFKAYAYSAPTLADIDNNGKMEIIMGTSMGFLYVLDCYGVTRPGF 105
Query: 419 LKELGGIDIGKYVA----------------GAIVVFNLDTKQVKWTTDLDLSTDNASFRA 462
++G I +A G + F K+V W T L+ +
Sbjct: 106 PLQMGDIQAQVAMADINADGQLEMVAADSRGNVAAFTAAGKEV-WETHLN---------S 155
Query: 463 YIYSSPTVVDLDGDGNLDILVGT 485
I+ + D+DGDG L++++ T
Sbjct: 156 QIHQNAVFGDIDGDGELEVVLAT 178
>gi|307111514|gb|EFN59748.1| hypothetical protein CHLNCDRAFT_133371 [Chlorella variabilis]
Length = 470
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 86/177 (48%), Positives = 120/177 (67%), Gaps = 14/177 (7%)
Query: 27 NSEQNKFRQREATDD-QLGLPQID-EDALVNTQCPKNLELRWQTEVSSSIYATPLIADIN 84
N+ NKF +REA+ D ++G + ED ++ CP+N+ L+W E SS+IYATPLI D+
Sbjct: 33 NTTGNKFLKREASADVRIGTGTHEAEDEELDDVCPQNIALKWHAEASSAIYATPLITDLF 92
Query: 85 SDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATY 144
SDG+ D+VVP+FLH LEV EG DG K P + AFH S+ H+SPL++DID DGV +I +ATY
Sbjct: 93 SDGRRDVVVPAFLHQLEVFEGRDGAKAPDFEAFHHSTTHTSPLMHDIDLDGVPDIVVATY 152
Query: 145 NGEVLFFR---VSGYMMTD---------KLEIPRRKVRKDWYVGLHSDPVDRSHPDV 189
+GE+LFF+ G+ + +L IPR +VR+DW+ GL+ DP D S PD+
Sbjct: 153 DGEILFFKDTVRQGWCLVAAAAGEEAATRLTIPRLRVRRDWHKGLNPDPNDHSQPDI 209
>gi|159490276|ref|XP_001703106.1| hypothetical protein CHLREDRAFT_195069 [Chlamydomonas reinhardtii]
gi|158270802|gb|EDO96636.1| predicted protein [Chlamydomonas reinhardtii]
Length = 626
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 109/300 (36%), Positives = 158/300 (52%), Gaps = 40/300 (13%)
Query: 542 EGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHG 601
+G+ +WE+H+ S V+QG GDVDGDG +V G+ + +PYR G
Sbjct: 351 KGEEVWERHVHSAVSQGAVAGDVDGDGQLEV---------------GGADIPKWPYRARG 395
Query: 602 RVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNV 661
RV V L L + GL ++ +FDG+LY+IDG +TSY+ VL D++
Sbjct: 396 RVQAPVTLSHLME----GSGLQVLVPAFDGFLYVIDGL----------QTSYAAVLVDDL 441
Query: 662 DGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAF 721
DG L+L+ TTMNGNV+ F T AP+HPLK W S G N RY GI + +R
Sbjct: 442 DGDGLLELLATTMNGNVYAFETGAPYHPLKTWTSQVLGPNGQVARYGYVGIAASPATRQA 501
Query: 722 RDEEGRNFWVEIEIVDEY-------RFPSGSQAPYNVTTTLLVPGNYQGERR----IKQS 770
RD G V E++D+ G + PYNVT L G + + +
Sbjct: 502 RDVAGERLQVAFEVLDKRVAFADNGTLLPGGRGPYNVTVVLKGVGVREMASGDAPVVGVA 561
Query: 771 QIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPM 830
F G+Y + +P + + V +EMVD +GL +SDEF+L+FHM++++LLKW + LP+
Sbjct: 562 DSFPGPGRYVVDIPCPKSQASAVVRLEMVDSSGLLYSDEFALSFHMHFHRLLKWAVALPL 621
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/199 (45%), Positives = 126/199 (63%), Gaps = 38/199 (19%)
Query: 31 NKFRQREATDDQLGLPQIDEDALVNTQCPKN-LELRWQTEVSSSIYATPLIADINSDGKL 89
NK+R+R+A D+ G Q + ++C ++ L+L W +E +SS+YA+PLI D+++DG+
Sbjct: 39 NKYRERKADMDKEGDIQHEGPVPGASRCGRHRLDLSWMSEATSSVYASPLITDLHADGRR 98
Query: 90 DIVVPSFLHYLEVLEGSDGDKMPG-WPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEV 148
DI+VP+F+HYLEVLEG +G + PG WPAFH SSVH+SPLLYDID DGVR++ LATY+G+V
Sbjct: 99 DIIVPAFVHYLEVLEGPNGGQAPGGWPAFHASSVHASPLLYDIDFDGVRDVLLATYDGQV 158
Query: 149 LFFRVSGYMMTDKLEIPRRKVRK-----------------------------------DW 173
LF++ +G M + L+I R +VRK DW
Sbjct: 159 LFYKDTGDKMHEGLQISRLRVRKAGWLPADPAGPHVALPVPPAPPYVPAPLVPTRCRLDW 218
Query: 174 YVGLH-SDPVDRSHPDVHD 191
YVGL +DP D SHPDV D
Sbjct: 219 YVGLDPNDPFDHSHPDVGD 237
>gi|196006714|ref|XP_002113223.1| hypothetical protein TRIADDRAFT_57158 [Trichoplax adhaerens]
gi|190583627|gb|EDV23697.1| hypothetical protein TRIADDRAFT_57158 [Trichoplax adhaerens]
Length = 594
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 123/445 (27%), Positives = 215/445 (48%), Gaps = 37/445 (8%)
Query: 366 TEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSE-MIIAVSYFFDHEYYDNPEHLKELGG 424
T ++ E ++ VD HIL+TP IADI+ DGV E ++I V+++FD+ + HL
Sbjct: 142 TNQEKSSKEKFLKVDPHILATPAIADINGDGVEEEIVIIVNWYFDNG--NVKHHLNSRSN 199
Query: 425 I---DIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLD--GDGNL 479
I D+ + VVFNL ++ L+ S + ++ Y ++PT+++L +L
Sbjct: 200 ITSGDVRDGIHCGAVVFNLTNGKLIKKIVLESSKMTSLYKPYAVATPTLLNLQPQTKSDL 259
Query: 480 DILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAW 539
DILVGT G Y+++ H K + + IQG + AD+ D+ +E++ TD+ NV
Sbjct: 260 DILVGTLSGNLYIVNLHSKDYPEPVTIVDNIQGQITVADLTDNEGLEIIVTDSSANVNCL 319
Query: 540 TAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRT 599
+++G+ IW+ + G DV+ D + DV+ + G ++ L GK G ++ +P
Sbjct: 320 SSKGEVIWDSMISGTFIAGSRTADVNNDANLDVITASNDGYVWALDGKSGKTLKHWPINL 379
Query: 600 HGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYS---MV 656
H R+ + VL+ ++ L +V +S DGYL++ID + C +V+ G + +
Sbjct: 380 HHRLHSVVLMTYF-----RNNSLHLVISSIDGYLFIIDLKSRCLEVLRFGTHAVGATVQI 434
Query: 657 LADN-VDGGDDLDLIVTTMNGNVFCF-------STPAPHHPLKAWRSINQGRNNVAIRYN 708
LAD+ V L+LI+ T++GN+ C P + L +W S N R
Sbjct: 435 LADDIVPEVPGLELILATLDGNIICLGEKENGHKLQTPSNDLNSWPSETLSYNGFTPRMQ 494
Query: 709 RAGIYVTHPSRAFRDEEGRNFWVEIEIVDE--YRFPSGSQAPYNV-----TTTLLVPGNY 761
G+ + + F ++ EI D R S + PY+V + +L+ +Y
Sbjct: 495 NIGVIIKECPHEVNQMQ---FAIKFEIYDNSMSRRMSNNNPPYHVKIYIGSNHVLLNTSY 551
Query: 762 QGERRIKQ---SQIFARRGKYRIKL 783
Q ++ S + +R I+L
Sbjct: 552 QNAGLFEENLNSLTYPKRTLLTIRL 576
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 63/105 (60%), Gaps = 3/105 (2%)
Query: 54 VNTQCPKNLELRWQTEVSSS-IYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMP 112
V C + ++ W + S + + P+I DI+++G D++ S+ +L+G+ G +
Sbjct: 6 VRHSCHYDFKVLWSYRLGHSPVVSPPVIFDIDANGIKDVIATSYADQFTILQGNSGKHLE 65
Query: 113 G-WPA-FHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSG 155
G WP + +++SSPLLYDID D + L+TY+GE++FFR +G
Sbjct: 66 GVWPLHMKEVTMYSSPLLYDIDHDLEMDSLLSTYDGEIIFFRNNG 110
Score = 46.6 bits (109), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 2/100 (2%)
Query: 30 QNKFRQREATDDQLGLPQIDEDALVNTQCPKNL-ELRWQTEVSSSIYATPLIADINSDGK 88
Q + + TD++ GL I D+ N C + E+ W + +S + A AD+N+D
Sbjct: 291 QGQITVADLTDNE-GLEIIVTDSSANVNCLSSKGEVIWDSMISGTFIAGSRTADVNNDAN 349
Query: 89 LDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLL 128
LD++ S Y+ L+G G + WP +HS L+
Sbjct: 350 LDVITASNDGYVWALDGKSGKTLKHWPINLHHRLHSVVLM 389
>gi|308807463|ref|XP_003081042.1| putative dex1 protein (ISS) [Ostreococcus tauri]
gi|116059504|emb|CAL55211.1| putative dex1 protein (ISS) [Ostreococcus tauri]
Length = 324
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 133/291 (45%), Gaps = 17/291 (5%)
Query: 565 DGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTI 624
+GDG DVVV T SG ++ +DGS + +P R +++ ++L TK GL +
Sbjct: 31 NGDGALDVVVGTTSGAVHAYRAEDGSALGGWPVRASDKILAPIVL---TKIRPNKLGLDV 87
Query: 625 VTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTP 684
+ + DG + DG +C DVVD+ E Y+ + + G LD++V+TM GNV F
Sbjct: 88 IVAAHDGVVNFFDGKATCRDVVDVAEKIYAAPVVTSFAGSGALDVVVSTMQGNVHAFRAK 147
Query: 685 APHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRF--- 741
A S ++ A R G+ + RA R G V EIVD
Sbjct: 148 GSKFDALAVSS----SDSHAARVGYFGVVLR--DRAHRVVRGTRINVAYEIVDRRVLNVA 201
Query: 742 -PSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVD 800
+ PY +T TL+ + ER + +A GK+ + + +T G + +VD
Sbjct: 202 KSKRTHEPYQITITLIALDGF--ERSVTAK--YAHAGKFTLWVDVPATKTRGEIRARVVD 257
Query: 801 KNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVILRPQEAMPLPSF 851
D +SL+FH Y +LKWL+ +P L V + +EA+ L F
Sbjct: 258 ATLNDAEDAYSLSFHDNYESILKWLVAVPFLLASFVAIRRASEEALELDVF 308
>gi|298204950|emb|CBI34257.3| unnamed protein product [Vitis vinifera]
Length = 79
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/65 (75%), Positives = 57/65 (87%)
Query: 111 MPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVR 170
M GWPAFH S+VHS P L+DIDKDGVREIALATYNGEVLFF VSGYM+ +KLE+ R +V+
Sbjct: 1 MSGWPAFHWSTVHSGPPLHDIDKDGVREIALATYNGEVLFFGVSGYMVINKLEVLRMRVQ 60
Query: 171 KDWYV 175
KDWY+
Sbjct: 61 KDWYM 65
>gi|300176158|emb|CBK23469.2| unnamed protein product [Blastocystis hominis]
Length = 330
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 105/215 (48%), Gaps = 32/215 (14%)
Query: 513 AVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDV 572
VV AD+ +D +E++ D G +A +G+ IW + + DV GDG+ DV
Sbjct: 2 GVVCADVANDAYLEIIVGDMSGILAVLDPKGEEIWSSVMTA---------DVSGDGNLDV 52
Query: 573 VVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGY 632
+ T +G+I++L G+ G + YP RT + + L+D++ G+ GL +V TS +G
Sbjct: 53 IAVTDNGDIWILDGQTGKSLPNYPIRTGYQFRSTPTLIDVSPSGD---GLHMVFTSKEGE 109
Query: 633 LYLIDGPT-----------SCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCF 681
+ +++ T C +D+GE + VLA D L LIV+TMNGN+ F
Sbjct: 110 VGIVNLRTFFRFWFFVTSRGCVRRIDVGEELRTRVLASKEMESDMLALIVSTMNGNLLQF 169
Query: 682 STPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTH 716
+T W +N+ N YN GIY H
Sbjct: 170 TTSG------KWSQVNRFNNR---GYNSDGIYPPH 195
>gi|255630710|gb|ACU15716.1| unknown [Glycine max]
Length = 55
Score = 88.6 bits (218), Expect = 1e-14, Method: Composition-based stats.
Identities = 42/47 (89%), Positives = 43/47 (91%)
Query: 604 MNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGE 650
MNQVLLVDL+K EK KGLTIVTTSFDGYLYLIDGPT CADVVDIGE
Sbjct: 1 MNQVLLVDLSKHKEKRKGLTIVTTSFDGYLYLIDGPTGCADVVDIGE 47
>gi|390370660|ref|XP_003731864.1| PREDICTED: lambda-carrageenase-like, partial [Strongylocentrotus
purpuratus]
Length = 192
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/167 (28%), Positives = 90/167 (53%), Gaps = 15/167 (8%)
Query: 527 LVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSG 586
++ T G+V+ ++ G +W + G + D+D DG DV++PT GN++ LSG
Sbjct: 1 MIVLQTSGDVSCFSNHGNKMWSSKVSGGSMPGSRLTDLDDDGVMDVIIPTSDGNVWALSG 60
Query: 587 KDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLI---------D 637
K G+ + +P R+++ V++ L ++G +++TT +DG LY+I
Sbjct: 61 KTGTHLPHWPVHVGSRLVSNVVITSLIN---STRGPSVITTGYDGNLYIISSSSDKGKEQ 117
Query: 638 GPTSCADVVDIGETSYSMV-LADNVDGGDDLDLIVTTMNGNVFCFST 683
GP C + +++GE+S + V +AD + +++I T +G++ ST
Sbjct: 118 GP--CIEAIELGESSLTPVHMADMIPSMPGVEMIAATTHGSIMLMST 162
>gi|52352425|gb|AAU43714.1| cell surface protein [uncultured archaeon GZfos26D8]
Length = 613
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 94/356 (26%), Positives = 154/356 (43%), Gaps = 31/356 (8%)
Query: 383 ILSTPVIADIDNDGVSEMIIAV----SYFFDHEYYDNPEHLKELGG--------IDIGKY 430
+ S+P + DID DG E+++ Y + H+ K GG DI +
Sbjct: 145 VRSSPALGDIDRDGDMEIVVGSFDDKVYAWHHDGTLVTGWPKITGGDVRSSPALGDIDRD 204
Query: 431 VAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAY-IYSSPTVVDLDGDGNLDILVGTSFGL 489
IVV + D K W D L T ++SSP + D+DG+G+++I+VG+
Sbjct: 205 GDMEIVVGSFDDKVYAWHHDGTLVTGWPKITGGDVWSSPALGDVDGNGDIEIVVGSDDDR 264
Query: 490 FYVLDHHGKIREKFPLEM-AEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGI-- 546
Y H G + +P ++ + D++ D IE+V V AW +G +
Sbjct: 265 VYAWHHDGTLVTGWPKATGGDVWSSPALGDVDGDSDIEIVVGSQDDKVYAWHHDGTLVTG 324
Query: 547 WEQHLKSLV-TQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMN 605
W Q V P++GD+DGDG ++VV + +Y DG+ V +P T V +
Sbjct: 325 WPQATGDEVYPSSPALGDIDGDGDIEIVVGSDDNKVYAWH-HDGTLVTGWPQTTGADVDS 383
Query: 606 QVLLVDLTKRGEKSKGLTIVTTSFDGYLYLI--DGPTSCADVVDIGETSYSMVLA-DNVD 662
L D+ G+ + IV S D +Y DG G YS A ++D
Sbjct: 384 SPALGDIDGDGD----IEIVVGSLDDKVYAWHHDGALVTGWPKATGGNVYSSSPALGDID 439
Query: 663 GGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPS 718
G D++++V + +G V+ + + P +I G + ++ R G+Y PS
Sbjct: 440 GDGDIEVVVGSRDGKVYAWDCSGTYDP----GNIEWGTFHHDVK--RTGLYAPLPS 489
Score = 76.6 bits (187), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 62/221 (28%), Positives = 105/221 (47%), Gaps = 10/221 (4%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEM-AEIQGAVVAADINDDGK 524
SSP + D+DGDG+++I+VG++ Y H G I +P +++ + DI+ DG
Sbjct: 100 SSPALGDIDGDGDMEIVVGSADDKVYAWHHDGTIVTGWPKATGGDVRSSPALGDIDRDGD 159
Query: 525 IELVTTDTHGNVAAWTAEGKGI--WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIY 582
+E+V V AW +G + W + V P++GD+D DG ++VV + +Y
Sbjct: 160 MEIVVGSFDDKVYAWHHDGTLVTGWPKITGGDVRSSPALGDIDRDGDMEIVVGSFDDKVY 219
Query: 583 VLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLI--DGPT 640
DG+ V +P T G V + L D+ G+ + IV S D +Y DG
Sbjct: 220 AWH-HDGTLVTGWPKITGGDVWSSPALGDVDGNGD----IEIVVGSDDDRVYAWHHDGTL 274
Query: 641 SCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCF 681
G +S +VDG D++++V + + V+ +
Sbjct: 275 VTGWPKATGGDVWSSPALGDVDGDSDIEIVVGSQDDKVYAW 315
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 115/466 (24%), Positives = 193/466 (41%), Gaps = 80/466 (17%)
Query: 385 STPVIADIDNDGVSEMIIAVS----YFFDHEYYDNPEHLKELGG--------IDIGKYVA 432
S+P + DID DG E+++ + Y + H+ K GG DI +
Sbjct: 100 SSPALGDIDGDGDMEIVVGSADDKVYAWHHDGTIVTGWPKATGGDVRSSPALGDIDRDGD 159
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAY-IYSSPTVVDLDGDGNLDILVGTSFGLFY 491
IVV + D K W D L T + SSP + D+D DG+++I+VG+ Y
Sbjct: 160 MEIVVGSFDDKVYAWHHDGTLVTGWPKITGGDVRSSPALGDIDRDGDMEIVVGSFDDKVY 219
Query: 492 VLDHHGKIREKFP-LEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGI--WE 548
H G + +P + ++ + D++ +G IE+V V AW +G + W
Sbjct: 220 AWHHDGTLVTGWPKITGGDVWSSPALGDVDGNGDIEIVVGSDDDRVYAWHHDGTLVTGWP 279
Query: 549 QHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVM-NQV 607
+ V P++GDVDGD ++VV + +Y DG+ V +P T V +
Sbjct: 280 KATGGDVWSSPALGDVDGDSDIEIVVGSQDDKVYAWH-HDGTLVTGWPQATGDEVYPSSP 338
Query: 608 LLVDLTKRGEKSKGLTIVTTSFDGYLY-------LIDG--PTSCADVVDIGETSYSMVLA 658
L D+ G+ + IV S D +Y L+ G T+ ADV S L
Sbjct: 339 ALGDIDGDGD----IEIVVGSDDNKVYAWHHDGTLVTGWPQTTGADV------DSSPALG 388
Query: 659 DNVDGGDDLDLIVTTMNGNVFCFSTPAPHHP---LKAWRSINQGRNNVAIRYNRAGIYVT 715
D +DG D++++V +++ V+ + HH + W G +Y +
Sbjct: 389 D-IDGDGDIEIVVGSLDDKVYAW-----HHDGALVTGWPKATGGN-----------VYSS 431
Query: 716 HPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQG---ERRIKQSQI 772
P+ D +G +IE+V R G ++ + T PGN + +K++ +
Sbjct: 432 SPALGDIDGDG-----DIEVVVGSR--DGKVYAWDCSGTYD-PGNIEWGTFHHDVKRTGL 483
Query: 773 FARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYY 818
+A LP+V V+ + DK S+ +T + Y
Sbjct: 484 YA-------PLPSV-----PKVISVVTDKTNYTLSETVHVTLEINY 517
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 49/94 (52%), Gaps = 1/94 (1%)
Query: 66 WQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSS 125
W + ++P + DI+ DG ++IVV SF + DG + GWP V SS
Sbjct: 137 WPKATGGDVRSSPALGDIDRDGDMEIVVGSFDDKVYAWH-HDGTLVTGWPKITGGDVRSS 195
Query: 126 PLLYDIDKDGVREIALATYNGEVLFFRVSGYMMT 159
P L DID+DG EI + +++ +V + G ++T
Sbjct: 196 PALGDIDRDGDMEIVVGSFDDKVYAWHHDGTLVT 229
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 61/133 (45%), Gaps = 8/133 (6%)
Query: 46 PQIDEDALVNTQCPKNLELR-------WQTEVSSSIYATPLIADINSDGKLDIVVPSFLH 98
P IDE+ + + + + W + ++P + DI+ DG ++IVV S
Sbjct: 63 PTIDEEKITDGAISMGISIASTALQPGWPRITGGDVDSSPALGDIDGDGDMEIVVGSADD 122
Query: 99 YLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMM 158
+ DG + GWP V SSP L DID+DG EI + +++ +V + G ++
Sbjct: 123 KVYAWH-HDGTIVTGWPKATGGDVRSSPALGDIDRDGDMEIVVGSFDDKVYAWHHDGTLV 181
Query: 159 TDKLEIPRRKVRK 171
T +I VR
Sbjct: 182 TGWPKITGGDVRS 194
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 81/178 (45%), Gaps = 9/178 (5%)
Query: 508 AEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGI--WEQHLKSLVTQGPSIGDVD 565
++ + DI+ DG +E+V V AW +G + W + V P++GD+D
Sbjct: 96 GDVDSSPALGDIDGDGDMEIVVGSADDKVYAWHHDGTIVTGWPKATGGDVRSSPALGDID 155
Query: 566 GDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIV 625
DG ++VV + +Y DG+ V +P T G V + L D+ + G+ + IV
Sbjct: 156 RDGDMEIVVGSFDDKVYAWH-HDGTLVTGWPKITGGDVRSSPALGDIDRDGD----MEIV 210
Query: 626 TTSFDGYLYLI--DGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCF 681
SFD +Y DG G +S +VDG D++++V + + V+ +
Sbjct: 211 VGSFDDKVYAWHHDGTLVTGWPKITGGDVWSSPALGDVDGNGDIEIVVGSDDDRVYAW 268
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 2/96 (2%)
Query: 61 NLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQS 120
L W + + ++P + DI+ DG ++IVV S + DG + GWP
Sbjct: 368 TLVTGWPQTTGADVDSSPALGDIDGDGDIEIVVGSLDDKVYAWH-HDGALVTGWPKATGG 426
Query: 121 SVHSS-PLLYDIDKDGVREIALATYNGEVLFFRVSG 155
+V+SS P L DID DG E+ + + +G+V + SG
Sbjct: 427 NVYSSSPALGDIDGDGDIEVVVGSRDGKVYAWDCSG 462
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 1/99 (1%)
Query: 61 NLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQS 120
L W + ++P + DI+ DG ++IVV SF + DG + GWP
Sbjct: 179 TLVTGWPKITGGDVRSSPALGDIDRDGDMEIVVGSFDDKVYAWH-HDGTLVTGWPKITGG 237
Query: 121 SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMT 159
V SSP L D+D +G EI + + + V + G ++T
Sbjct: 238 DVWSSPALGDVDGNGDIEIVVGSDDDRVYAWHHDGTLVT 276
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 50/100 (50%), Gaps = 2/100 (2%)
Query: 61 NLELRWQTEVSSSIY-ATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQ 119
L W +Y ++P + DI+ DG ++IVV S + + DG + GWP
Sbjct: 320 TLVTGWPQATGDEVYPSSPALGDIDGDGDIEIVVGSDDNKVYAWH-HDGTLVTGWPQTTG 378
Query: 120 SSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMT 159
+ V SSP L DID DG EI + + + +V + G ++T
Sbjct: 379 ADVDSSPALGDIDGDGDIEIVVGSLDDKVYAWHHDGALVT 418
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 72/159 (45%), Gaps = 17/159 (10%)
Query: 547 WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
W + V P++GD+DGDG ++VV + +Y DG+ V +P T G V +
Sbjct: 90 WPRITGGDVDSSPALGDIDGDGDMEIVVGSADDKVYAWH-HDGTIVTGWPKATGGDVRSS 148
Query: 607 VLLVDLTKRGEKSKGLTIVTTSFDGYLYLI--DGP-TSCADVVDIGETSYSMVLADNVDG 663
L D+ + G+ + IV SFD +Y DG + + G+ S L D +D
Sbjct: 149 PALGDIDRDGD----MEIVVGSFDDKVYAWHHDGTLVTGWPKITGGDVRSSPALGD-IDR 203
Query: 664 GDDLDLIVTTMNGNVFCFSTPAPHHP---LKAWRSINQG 699
D++++V + + V+ + HH + W I G
Sbjct: 204 DGDMEIVVGSFDDKVYAW-----HHDGTLVTGWPKITGG 237
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 47/99 (47%), Gaps = 1/99 (1%)
Query: 61 NLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQS 120
L W ++++P + D++ +G ++IVV S + DG + GWP
Sbjct: 226 TLVTGWPKITGGDVWSSPALGDVDGNGDIEIVVGSDDDRVYAWH-HDGTLVTGWPKATGG 284
Query: 121 SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMT 159
V SSP L D+D D EI + + + +V + G ++T
Sbjct: 285 DVWSSPALGDVDGDSDIEIVVGSQDDKVYAWHHDGTLVT 323
>gi|383764384|ref|YP_005443366.1| hypothetical protein CLDAP_34290 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381384652|dbj|BAM01469.1| hypothetical protein CLDAP_34290 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 2073
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 123/274 (44%), Gaps = 44/274 (16%)
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDN--------PEHLKELGGIDIGKYVA 432
SH +TPV+ D+D DG E+II + E YD+ P+ L+ G+ V
Sbjct: 1685 SHYYATPVVVDLDGDGEREIIIGNLTRWQLEVYDSRGKMRSGWPQLLRS--GVKAAATVG 1742
Query: 433 ----------------GAIVVFNLDTKQVKWTTDLDLSTD---NASFRAYIYSSPTVVDL 473
G + FN D +++ + L T +A +R I + P + DL
Sbjct: 1743 DLNGDGKLDILVGDLRGYLHAFNADGTRLEGWSSQGLKTGLHPDAEYR--ILAQPAIADL 1800
Query: 474 DGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAE---------IQGAVVAADINDDGK 524
DGDG +I++ S YV + +G +R +P+ + + I V ADIN DG+
Sbjct: 1801 DGDGRNEIVLALSDSRLYVYEANGTLRNGWPVSLGDAPDIYGSHVIDSTPVVADINGDGQ 1860
Query: 525 IELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGD-VDGDGHSDVVVPTLSGNIYV 583
E+V + + A+G+ +W + ++ P+IGD V G ++V+ + +Y
Sbjct: 1861 QEIVVGSYDHRLYVYRADGRLLWTYETRDIIMSTPAIGDLVPGSPGLEIVIGSGDRFVYT 1920
Query: 584 LSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGE 617
L+ G + +P T V + LLVDL G+
Sbjct: 1921 LT-SSGQLLWKHP--TGWIVRSSPLLVDLDGDGQ 1951
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 76/300 (25%), Positives = 131/300 (43%), Gaps = 31/300 (10%)
Query: 363 EEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDN------- 415
E W+ Q K + + + IL+ P IAD+D DG +E+++A+S + Y N
Sbjct: 1772 EGWSS-QGLKTGLHPDAEYRILAQPAIADLDGDGRNEIVLALSDSRLYVYEANGTLRNGW 1830
Query: 416 PEHLKELGGI-------------DIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRA 462
P L + I DI IVV + D + + D L + R
Sbjct: 1831 PVSLGDAPDIYGSHVIDSTPVVADINGDGQQEIVVGSYDHRLYVYRADGRLLWTYET-RD 1889
Query: 463 YIYSSPTVVDL-DGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIND 521
I S+P + DL G L+I++G+ Y L G++ K P ++ + + D++
Sbjct: 1890 IIMSTPAIGDLVPGSPGLEIVIGSGDRFVYTLTSSGQLLWKHPTGWI-VRSSPLLVDLDG 1948
Query: 522 DGKIELVTTDTHGNVAAWTAEGKGI--WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSG 579
DG+ E+V V AW + G + W Q + V P D+D DG ++V + +G
Sbjct: 1949 DGQFEIVIGSDDKKVWAWHSNGAPVAGWPQATGAPVASSPIAVDLDKDGAPEIVAGSDNG 2008
Query: 580 NIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGP 639
+Y +DG+ ++ +P T + L+ R + IV +++G L ++ GP
Sbjct: 2009 AVYAWR-RDGTLLKGWPKLTGCPIKGAPALL----RNDGVFEPEIVVANYEGTLRVVGGP 2063
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 52/96 (54%), Gaps = 1/96 (1%)
Query: 63 ELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSV 122
+L W+ + ++PL+ D++ DG+ +IV+ S + S+G + GWP + V
Sbjct: 1926 QLLWKHPTGWIVRSSPLLVDLDGDGQFEIVIGSDDKKVWAWH-SNGAPVAGWPQATGAPV 1984
Query: 123 HSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMM 158
SSP+ D+DKDG EI + NG V +R G ++
Sbjct: 1985 ASSPIAVDLDKDGAPEIVAGSDNGAVYAWRRDGTLL 2020
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 53/125 (42%), Gaps = 22/125 (17%)
Query: 66 WQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPA------FHQ 119
W + S + A + D+N DGKLDI+V YL +DG ++ GW + H
Sbjct: 1727 WPQLLRSGVKAAATVGDLNGDGKLDILVGDLRGYLHAFN-ADGTRLEGWSSQGLKTGLHP 1785
Query: 120 SS---VHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVG 176
+ + + P + D+D DG EI LA + + + +G + R W V
Sbjct: 1786 DAEYRILAQPAIADLDGDGRNEIVLALSDSRLYVYEANGTL------------RNGWPVS 1833
Query: 177 LHSDP 181
L P
Sbjct: 1834 LGDAP 1838
Score = 46.2 bits (108), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 49/93 (52%), Gaps = 9/93 (9%)
Query: 74 IYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPA--------FHQSSVHSS 125
I A P IAD++ DG+ +IV+ L V E ++G GWP + + S+
Sbjct: 1791 ILAQPAIADLDGDGRNEIVLALSDSRLYVYE-ANGTLRNGWPVSLGDAPDIYGSHVIDST 1849
Query: 126 PLLYDIDKDGVREIALATYNGEVLFFRVSGYMM 158
P++ DI+ DG +EI + +Y+ + +R G ++
Sbjct: 1850 PVVADINGDGQQEIVVGSYDHRLYVYRADGRLL 1882
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 46/85 (54%), Gaps = 2/85 (2%)
Query: 72 SSIYATPLIADINSDGKLDIVVPSFLHY-LEVLEGSDGDKMPGWPAFHQSSVHSSPLLYD 130
S YATP++ D++ DG+ +I++ + + LEV + S G GWP +S V ++ + D
Sbjct: 1685 SHYYATPVVVDLDGDGEREIIIGNLTRWQLEVYD-SRGKMRSGWPQLLRSGVKAAATVGD 1743
Query: 131 IDKDGVREIALATYNGEVLFFRVSG 155
++ DG +I + G + F G
Sbjct: 1744 LNGDGKLDILVGDLRGYLHAFNADG 1768
>gi|167540101|ref|XP_001741552.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165893860|gb|EDR21979.1| hypothetical protein EDI_020530 [Entamoeba dispar SAW760]
Length = 627
Score = 79.3 bits (194), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 75/277 (27%), Positives = 127/277 (45%), Gaps = 20/277 (7%)
Query: 328 ENADSSFELFRDT----DELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHI 383
E S ELF +T DE+ D N + D+ D +++ Y + +HI
Sbjct: 32 EGQRSLAELFHNTKQQKDEIDDHINCYHAAIYDNYAGKDGDFS---------YARLRAHI 82
Query: 384 LSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTK 443
LS P I I+ +++ V+YF + + K ID KY AI + T
Sbjct: 83 LSDPEI--IEGKEGKLLVVPVTYFINPTPSIDQRGKKIEKMIDEKKYCVSAICILEPSTG 140
Query: 444 QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLD-HHGKIR-- 500
++K T +DL+ N+ A + S + G + I+VG + G + + GK++
Sbjct: 141 KIKSLTTIDLTVMNSRESAALVSGIESIGKYGIPDT-IIVGNAAGRIHQVSLDDGKVKDP 199
Query: 501 EKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS 560
+ +PL++ I G +V D+ DG+I +VT D +GNV + G+ WE + + QG
Sbjct: 200 QGWPLQVGPIMGRIVVEDVTRDGEINIVTGDINGNVVCFDITGRVRWEMNGGGGINQGLE 259
Query: 561 IGD-VDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYP 596
I V G + ++ T +G I+ L G +G + +P
Sbjct: 260 IVRLVLGRKITLIIFGTNNGKIHCLDGSNGKNIDIFP 296
>gi|386001518|ref|YP_005919817.1| Cell surface protein [Methanosaeta harundinacea 6Ac]
gi|357209574|gb|AET64194.1| Cell surface protein [Methanosaeta harundinacea 6Ac]
Length = 590
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 77/143 (53%), Gaps = 4/143 (2%)
Query: 464 IYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAE-IQGAVVAADINDD 522
++SSP + D+DGDG L+I+VGT+ G YV + G FP + I+ + D++ D
Sbjct: 182 VWSSPALADIDGDGELEIVVGTNEGRLYVFNRDGSKVTGFPKRAEDYIRSSPAVEDLDGD 241
Query: 523 GKIELVTTDTHGNVAAWTAEGKGI--WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGN 580
G +++V G + AW G+ + + + + V P+IGD+DGD +VVV +
Sbjct: 242 GLLDIVVGSDDGRLYAWNGRGEELSGFPRLTSNSVWSSPAIGDLDGDSGLEVVVGSTDRG 301
Query: 581 IYVLSGKDGSKVRPYPYRTHGRV 603
IY +G DGS V YP T V
Sbjct: 302 IYAFNG-DGSPVCGYPLITSNSV 323
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 74/171 (43%), Gaps = 35/171 (20%)
Query: 375 DYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGA 434
+++ S + S+P +ADID DG E+++ + G
Sbjct: 174 EWIQTSSPVWSSPALADIDGDGELEIVVGTN--------------------------EGR 207
Query: 435 IVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLD 494
+ VFN D +V T ++ YI SSP V DLDGDG LDI+VG+ G Y +
Sbjct: 208 LYVFNRDGSKV---TGFPKRAED-----YIRSSPAVEDLDGDGLLDIVVGSDDGRLYAWN 259
Query: 495 HHGKIREKFP-LEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGK 544
G+ FP L + + D++ D +E+V T + A+ +G
Sbjct: 260 GRGEELSGFPRLTSNSVWSSPAIGDLDGDSGLEVVVGSTDRGIYAFNGDGS 310
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 91/215 (42%), Gaps = 40/215 (18%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S+P++AD+D DG E++I S D Y A+ D +
Sbjct: 396 SSPILADLDGDGSLEIVIGAS---DGLY---------------------ALTRVGEDYRG 431
Query: 445 VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP 504
T L SF A D+DGDG ++I+ G + G Y+ G FP
Sbjct: 432 FPRKTTGSLQD---SFIA-------AGDVDGDGEVEIVGGATDGRLYLWRSDGSDHPGFP 481
Query: 505 LEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGK---GIWEQHLKSLVTQGPSI 561
++ + ADI+ DG+ E++ V AW +G G + L ++ T P+I
Sbjct: 482 IQTGGYVKNIALADIDGDGQQEIIGGSLDNRVHAWRLDGTEVDGFPKVTLGNVDTT-PAI 540
Query: 562 GDVDGDGHSDVVVPTLSGNIYV--LSGKDGSKVRP 594
D++GDG D+V + G IY+ +SG G P
Sbjct: 541 ADLEGDGVLDMVAGSDDGRIYIWEISGAFGDHEWP 575
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 57/105 (54%), Gaps = 4/105 (3%)
Query: 69 EVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLL 128
+ SS ++++P +ADI+ DG+L+IVV + L V DG K+ G+P + + SSP +
Sbjct: 177 QTSSPVWSSPALADIDGDGELEIVVGTNEGRLYVFN-RDGSKVTGFPKRAEDYIRSSPAV 235
Query: 129 YDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDW 173
D+D DG+ +I + + +G + + G ++ PR W
Sbjct: 236 EDLDGDGLLDIVVGSDDGRLYAWNGRGEELSG---FPRLTSNSVW 277
Score = 42.7 bits (99), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 44/82 (53%), Gaps = 1/82 (1%)
Query: 74 IYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDK 133
I ++P + D++ DG LDIVV S L G G+++ G+P +SV SSP + D+D
Sbjct: 229 IRSSPAVEDLDGDGLLDIVVGSDDGRLYAWNGR-GEELSGFPRLTSNSVWSSPAIGDLDG 287
Query: 134 DGVREIALATYNGEVLFFRVSG 155
D E+ + + + + F G
Sbjct: 288 DSGLEVVVGSTDRGIYAFNGDG 309
Score = 40.0 bits (92), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 22/91 (24%), Positives = 49/91 (53%), Gaps = 1/91 (1%)
Query: 80 IADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREI 139
+ADI+ DG+ +I+ S + + DG ++ G+P +V ++P + D++ DGV ++
Sbjct: 493 LADIDGDGQQEIIGGSLDNRVHAWR-LDGTEVDGFPKVTLGNVDTTPAIADLEGDGVLDM 551
Query: 140 ALATYNGEVLFFRVSGYMMTDKLEIPRRKVR 170
+ +G + + +SG + + R+ +R
Sbjct: 552 VAGSDDGRIYIWEISGAFGDHEWPMVRQNLR 582
>gi|374851757|dbj|BAL54708.1| cell surface protein, partial [uncultured candidate division OP1
bacterium]
Length = 282
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 65/256 (25%), Positives = 116/256 (45%), Gaps = 46/256 (17%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S+ + D+D DG+ E+++ YD G + ++ T
Sbjct: 71 SSAAVGDVDGDGILEVVVG--------SYD------------------GNLYAYHGRTGA 104
Query: 445 VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHH-GKIREKF 503
+KW + T N ++YSSP + D DGDG L++ +G+ F +LD G+ + F
Sbjct: 105 LKWRYE----TKN-----WVYSSPAIADADGDGRLEVAIGSQDKSFALLDGRTGRPKWAF 155
Query: 504 PLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAE-GKGIWEQHLKSLVTQGPSIG 562
E + +A D++ DGK E++ V A+ G W+ + P++
Sbjct: 156 LTEDGILSSPAIA-DVDGDGKSEVIVGSER--VYAFDGRTGNVKWKFLPEFPARSSPAVA 212
Query: 563 DVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGL 622
D+DGDG ++VV+ +L G +Y L G+ G+ + + T+G V + + DL G +
Sbjct: 213 DIDGDGKAEVVIGSLDGRLYALDGRTGTA--KWAFHTNGPVESSPAIGDLNGDGL----V 266
Query: 623 TIVTTSFDGYLYLIDG 638
+V S ++ +DG
Sbjct: 267 EVVVGSSKRGVFCLDG 282
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 68/234 (29%), Positives = 108/234 (46%), Gaps = 24/234 (10%)
Query: 457 NASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIRE-KFPLEMAE-IQGAV 514
NA R + SS V D+DGDG L+++VG+ G Y +HG+ K+ E + +
Sbjct: 62 NAFTRGLVESSAAVGDVDGDGILEVVVGSYDGNLYA--YHGRTGALKWRYETKNWVYSSP 119
Query: 515 VAADINDDGKIELVTTDTHGNVAAWTAE-GKGIWEQHLKSLVTQGPSIGDVDGDGHSDVV 573
AD + DG++E+ + A G+ W + + P+I DVDGDG S+V+
Sbjct: 120 AIADADGDGRLEVAIGSQDKSFALLDGRTGRPKWAFLTEDGILSSPAIADVDGDGKSEVI 179
Query: 574 VPTLSGNIYVLSGKDGS---KVRP-YPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSF 629
V S +Y G+ G+ K P +P R+ V + +D + E +V S
Sbjct: 180 VG--SERVYAFDGRTGNVKWKFLPEFPARSSPAVAD----IDGDGKAE------VVIGSL 227
Query: 630 DGYLYLIDGPTSCADVV--DIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCF 681
DG LY +DG T A G S + D ++G ++++V + VFC
Sbjct: 228 DGRLYALDGRTGTAKWAFHTNGPVESSPAIGD-LNGDGLVEVVVGSSKRGVFCL 280
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 86/206 (41%), Gaps = 40/206 (19%)
Query: 383 ILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDT 442
+ S+P IAD D DG E+ I D + + + T
Sbjct: 115 VYSSPAIADADGDGRLEVAIGSQ---DKSF-----------------------ALLDGRT 148
Query: 443 KQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHH-GKIRE 501
+ KW L+ D I SSP + D+DGDG +++VG+ Y D G ++
Sbjct: 149 GRPKWAF---LTEDG------ILSSPAIADVDGDGKSEVIVGSE--RVYAFDGRTGNVKW 197
Query: 502 KFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAE-GKGIWEQHLKSLVTQGPS 560
KF E V ADI+ DGK E+V G + A G W H V P+
Sbjct: 198 KFLPEFPARSSPAV-ADIDGDGKAEVVIGSLDGRLYALDGRTGTAKWAFHTNGPVESSPA 256
Query: 561 IGDVDGDGHSDVVVPTLSGNIYVLSG 586
IGD++GDG +VVV + ++ L G
Sbjct: 257 IGDLNGDGLVEVVVGSSKRGVFCLDG 282
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 64 LRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVH 123
L+W+ E + +Y++P IAD + DG+L++ + S +L+G G P W + +
Sbjct: 105 LKWRYETKNWVYSSPAIADADGDGRLEVAIGSQDKSFALLDGRTG--RPKWAFLTEDGIL 162
Query: 124 SSPLLYDIDKDGVREIALAT 143
SSP + D+D DG E+ + +
Sbjct: 163 SSPAIADVDGDGKSEVIVGS 182
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 43/94 (45%), Gaps = 6/94 (6%)
Query: 547 WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
W + LV ++GDVDGDG +VVV + GN+Y G+ G+ + Y T V +
Sbjct: 61 WNAFTRGLVESSAAVGDVDGDGILEVVVGSYDGNLYAYHGRTGAL--KWRYETKNWVYSS 118
Query: 607 VLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPT 640
+ D G L + S D L+DG T
Sbjct: 119 PAIADADGDGR----LEVAIGSQDKSFALLDGRT 148
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 65 RWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHS 124
+W I ++P IAD++ DGK +++V S Y +G G+ W + S
Sbjct: 152 KWAFLTEDGILSSPAIADVDGDGKSEVIVGSERVY--AFDGRTGNVK--WKFLPEFPARS 207
Query: 125 SPLLYDIDKDGVREIALATYNGEV 148
SP + DID DG E+ + + +G +
Sbjct: 208 SPAVADIDGDGKAEVVIGSLDGRL 231
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 45/81 (55%), Gaps = 4/81 (4%)
Query: 64 LRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSS-V 122
++W+ ++P +ADI+ DGK ++V+ S L L+G G W AFH + V
Sbjct: 195 VKWKFLPEFPARSSPAVADIDGDGKAEVVIGSLDGRLYALDGRTGTAK--W-AFHTNGPV 251
Query: 123 HSSPLLYDIDKDGVREIALAT 143
SSP + D++ DG+ E+ + +
Sbjct: 252 ESSPAIGDLNGDGLVEVVVGS 272
Score = 40.0 bits (92), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 27/103 (26%), Positives = 46/103 (44%), Gaps = 7/103 (6%)
Query: 46 PQIDEDALVNTQCPKNLEL-----RWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYL 100
P D L + K ++ RW + ++ + D++ DG L++VV S+ L
Sbjct: 36 PMFRRDLLHTGRATKTSKIIAPKERWNAFTRGLVESSAAVGDVDGDGILEVVVGSYDGNL 95
Query: 101 EVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALAT 143
G G W ++ V+SSP + D D DG E+A+ +
Sbjct: 96 YAYHGRTGALK--WRYETKNWVYSSPAIADADGDGRLEVAIGS 136
>gi|440295199|gb|ELP88112.1| hypothetical protein EIN_222750 [Entamoeba invadens IP1]
Length = 852
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 125/279 (44%), Gaps = 16/279 (5%)
Query: 324 QALDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHI 383
Q +E S ELF + DE + D++D A++ + + +Y + SHI
Sbjct: 253 QLTEEGRRSLAELFHNAACSKDELD-DHNDCYHAAIYDNYAGLDGNF----NYARLRSHI 307
Query: 384 LSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTK 443
LS P + D DG +++ V+Y+ + K ++ +Y AI + +
Sbjct: 308 LSDPQLVDT-KDG-KRLVVPVTYYIKQTPDVDKSGKKIEKMLNDKRYCVSAICILDPQNG 365
Query: 444 QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNL--DILVGTSFGLFYVLD-HHGKI- 499
++ T LDL+ N A + V GD I+VG S G ++L GKI
Sbjct: 366 KIISLTTLDLTVMNTKESAALLGGFEVF---GDKGYPEKIVVGNSAGRVHMLSLQDGKII 422
Query: 500 -REKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQG 558
++ +PL++ + G VV D+ G + +VT D +GNV + GK IWE ++ V Q
Sbjct: 423 KQDGWPLQVGPMMGKVVVEDVQRSGDLSIVTADVNGNVVCFNLLGKVIWETNVGGPVLQN 482
Query: 559 PS-IGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYP 596
+ V + + + T +G I+ L G +G + +P
Sbjct: 483 IEVVKTVLVKKIAMLFIATSNGMIHCLDGINGKNIDIFP 521
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 65/119 (54%), Gaps = 8/119 (6%)
Query: 64 LRWQTEVS--SSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQ-S 120
+RW ++ ++I + P++ D N DG+ +++ P+ HY+E ++ + G+K+PG+P + S
Sbjct: 73 VRWTQSITGHATIVSEPIVMDFNHDGRKEVIFPTQSHYMEAIDVNTGNKVPGFPVIVEDS 132
Query: 121 SVHSSPLLYDIDKDGVREIALAT-YNGEVLFFRVSG-YMMTDKLEIPRRKVRKDWYVGL 177
S + Y ++G E ++ + NG + F G Y T ++IP V +W GL
Sbjct: 133 GFVSKAIRY---QEGDEEFSIVSAANGFIYFVNKKGSYNKTKTIKIPPVYVPANWDEGL 188
>gi|322421781|ref|YP_004201004.1| peptidase S8 and S53 subtilisin kexin sedolisin [Geobacter sp. M18]
gi|320128168|gb|ADW15728.1| peptidase S8 and S53 subtilisin kexin sedolisin [Geobacter sp. M18]
Length = 1732
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 93/218 (42%), Gaps = 43/218 (19%)
Query: 387 PVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVK 446
P++AD+D DG+ E+I A + VV +
Sbjct: 598 PILADMDGDGIPEVIAAADQLY---------------------------VVKGDGSIMSG 630
Query: 447 WTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLE 506
W ++ F Y S+ V DLDGDG +++ T G Y G FP+
Sbjct: 631 WPKEV-------PFDPYFKSTLVVGDLDGDGKPEVVANTRGGTIYAFRGDGSRMPGFPMV 683
Query: 507 MAEIQG-AVVAADINDDGKIELVTTDT-HGNVAAWTAEGKGI--W----EQHLKSLVTQG 558
++V AD++ DG +E+V+ D G V AW +G + W E + +
Sbjct: 684 PPGYGCLSLVLADLDGDGGMEIVSHDQWSGQVYAWRYDGTTLPGWPISIEPTYYAYNSMS 743
Query: 559 PSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYP 596
P++GD+DGDG +V+VP +S +Y DG+ V +P
Sbjct: 744 PAVGDIDGDGSPEVLVPAMSNKVYAWH-HDGTPVAGWP 780
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 82/354 (23%), Positives = 150/354 (42%), Gaps = 48/354 (13%)
Query: 377 VNVDSHILSTPVIADIDNDGVSEMII----AVSYFFDHEYYDNPEH-------------L 419
V D + ST V+ D+D DG E++ Y F + P L
Sbjct: 635 VPFDPYFKSTLVVGDLDGDGKPEVVANTRGGTIYAFRGDGSRMPGFPMVPPGYGCLSLVL 694
Query: 420 KEL---GGIDIGKY--VAGAIVVFNLD-TKQVKWTTDLDLSTDNASFRAYIYSSPTVVDL 473
+L GG++I + +G + + D T W ++ ++ AY SP V D+
Sbjct: 695 ADLDGDGGMEIVSHDQWSGQVYAWRYDGTTLPGWPISIE-----PTYYAYNSMSPAVGDI 749
Query: 474 DGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTH 533
DGDG+ ++LV Y H G +P + + ++ D+ +G+ ++
Sbjct: 750 DGDGSPEVLVPAMSNKVYAWHHDGTPVAGWPASLTDAPHVILLGDVEGNGQYKIFAGTAG 809
Query: 534 GNVAAWTAEGKGI--WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSK 591
GN+ + A+G + W Q P++GD+DGDG ++V + ++ V SG DG+
Sbjct: 810 GNIYGFNADGSPLTGWPQPGYM-----PALGDIDGDGRLELVTDAVW-HLAVYSG-DGTM 862
Query: 592 VRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDG-YLYLIDGPTSCADVVDI-- 648
+P + +L D+ G G+ +V + G Y + DG +D+
Sbjct: 863 QPGWPQDKESNTVLSPVLGDIDGDG----GIEVVERTLAGIYAWHADGTPLPGFPIDMNS 918
Query: 649 --GETSYSMVLADNVDGGDDLDLIVTTM--NGNVFCFSTPAPHHPLKAWRSINQ 698
GE V ++ G + L+ ++ +G+++ FS P+ PL+ ++ Q
Sbjct: 919 ITGEWHDIPVALGDLTGSGQVSLVTGSIVGDGSLYAFSIPSAPAPLRLPWAMQQ 972
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 39/69 (56%), Gaps = 2/69 (2%)
Query: 78 PLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVR 137
P + DI+ DG+L++V + H L V G DG PGWP +S+ SP+L DID DG
Sbjct: 832 PALGDIDGDGRLELVTDAVWH-LAVYSG-DGTMQPGWPQDKESNTVLSPVLGDIDGDGGI 889
Query: 138 EIALATYNG 146
E+ T G
Sbjct: 890 EVVERTLAG 898
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 59/237 (24%), Positives = 93/237 (39%), Gaps = 64/237 (27%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIE 526
+P + D+DGDG +++ YV+ G I +P E+
Sbjct: 597 APILADMDGDGIPEVIAAAD--QLYVVKGDGSIMSGWPKEVP------------------ 636
Query: 527 LVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSG 586
++ + KS + +GD+DGDG +VV T G IY G
Sbjct: 637 --------------------FDPYFKSTLV----VGDLDGDGKPEVVANTRGGTIYAFRG 672
Query: 587 KDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVT---TSFDGYLYLIDGPTSCA 643
DGS++ +P G ++L DL G G+ IV+ S Y + DG T
Sbjct: 673 -DGSRMPGFPMVPPGYGCLSLVLADLDGDG----GMEIVSHDQWSGQVYAWRYDGTTLPG 727
Query: 644 DVVDIGETSY---SMVLA-DNVDGGDDLDLIVTTMNGNVFCFSTPAPHH---PLKAW 693
+ I T Y SM A ++DG +++V M+ V+ + HH P+ W
Sbjct: 728 WPISIEPTYYAYNSMSPAVGDIDGDGSPEVLVPAMSNKVYAW-----HHDGTPVAGW 779
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 6/98 (6%)
Query: 66 WQTEVSSSIYA----TPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSS 121
W + + YA +P + DI+ DG +++VP+ + + DG + GWPA +
Sbjct: 728 WPISIEPTYYAYNSMSPAVGDIDGDGSPEVLVPAMSNKVYAWH-HDGTPVAGWPASLTDA 786
Query: 122 VHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMT 159
H LL D++ +G +I T G + F G +T
Sbjct: 787 PHVI-LLGDVEGNGQYKIFAGTAGGNIYGFNADGSPLT 823
Score = 39.7 bits (91), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 57/117 (48%), Gaps = 6/117 (5%)
Query: 44 GLPQI--DEDALVNTQCPKNLELRWQTEVSSSIY--ATPLIADINSDGKLDIVVPSFLHY 99
G+P++ D L + ++ W EV Y +T ++ D++ DGK ++V +
Sbjct: 607 GIPEVIAAADQLYVVKGDGSIMSGWPKEVPFDPYFKSTLVVGDLDGDGKPEVVANTRGGT 666
Query: 100 LEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREI-ALATYNGEVLFFRVSG 155
+ G DG +MPG+P S +L D+D DG EI + ++G+V +R G
Sbjct: 667 IYAFRG-DGSRMPGFPMVPPGYGCLSLVLADLDGDGGMEIVSHDQWSGQVYAWRYDG 722
>gi|373458246|ref|ZP_09550013.1| peptidase S8 and S53 subtilisin kexin sedolisin [Caldithrix abyssi
DSM 13497]
gi|371719910|gb|EHO41681.1| peptidase S8 and S53 subtilisin kexin sedolisin [Caldithrix abyssi
DSM 13497]
Length = 1095
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 115/456 (25%), Positives = 180/456 (39%), Gaps = 68/456 (14%)
Query: 385 STPVIADIDNDGVSEMI------IAVSYFFDHEY-----YDNPEHLKELGGIDIGKYVAG 433
+ PVIAD+D DG E+I I FD + D P + G + +G + A
Sbjct: 656 AAPVIADLDLDGQKEIITVNRRGILRVVGFDGKIKKEFKLDEPVY----GDLVVGNFDAD 711
Query: 434 A------------IVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDI 481
I +FNLD+ Q+ + + + V DL+GD D
Sbjct: 712 PELEMAFGTMHRNIHIFNLDSTQINGFP--------RAMSSLVNLGGAVADLNGDEMDDF 763
Query: 482 LVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGN-VAAWT 540
++GT G +V+ +G FP+ ++ VV I G + T N +
Sbjct: 764 VIGTFDGKLHVILSNGDSLPGFPVNLST--RVVVNPVIGKYGDSLFIAVATLDNKIVVLN 821
Query: 541 AEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTH 600
G+ +E+ L VT G + DV+ DG ++ T G +Y+L S +P +
Sbjct: 822 KNGQIKFERALSESVTGGLMLADVNQDGAPEICAITADGRLYLLEADGASFNDVFPLQLD 881
Query: 601 GRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLI--DGPTSCADVVDIGETSYSMVLA 658
G + D+ G+ L I+T G L+LI DG VD+GE S+
Sbjct: 882 GTPQTAPVSFDVDNDGQ----LEILTVVNQGVLHLIKLDGREVTNFPVDLGEGVSSVPAL 937
Query: 659 DNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPS 718
++DG D +++V+T + + P + AW + QG N+ R G+ P
Sbjct: 938 GDLDGDGDTEVLVSTQSA-LLAIDLPTDAGFVDAWNTY-QGNNH---RTGSFGV----PV 988
Query: 719 RAFRDEEGRNFWVEIEIVDEYRFPSGSQA--PYNVTTTLLVPG------NYQGERRIKQS 770
A D+E R E++ Y P Q NV L G N GE ++S
Sbjct: 989 SAIADKEKRPLPDSFELLANYPNPFNHQTVIQVNVPGQLTASGLQLYIYNVLGELVFEKS 1048
Query: 771 QIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYF 806
G RI+ G + G +L +G+YF
Sbjct: 1049 IKRLNAGLNRIRWD--GRDSAGKIL-----SSGIYF 1077
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 68/145 (46%), Gaps = 5/145 (3%)
Query: 492 VLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHL 551
V D +G+ FPLE+ A V AD++ DG+ E++T + G + +GK E L
Sbjct: 636 VYDQNGRKVNGFPLEVDATSAAPVIADLDLDGQKEIITVNRRGILRVVGFDGKIKKEFKL 695
Query: 552 KSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVD 611
V +G+ D D ++ T+ NI++ + D +++ +P V + D
Sbjct: 696 DEPVYGDLVVGNFDADPELEMAFGTMHRNIHIFN-LDSTQINGFPRAMSSLVNLGGAVAD 754
Query: 612 LTKRGEKSKGLTIVTTSFDGYLYLI 636
L G++ I T FDG L++I
Sbjct: 755 LN--GDEMDDFVIGT--FDGKLHVI 775
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 24/101 (23%), Positives = 49/101 (48%)
Query: 60 KNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQ 119
KN +++++ +S S+ ++AD+N DG +I + L +LE +P
Sbjct: 822 KNGQIKFERALSESVTGGLMLADVNQDGAPEICAITADGRLYLLEADGASFNDVFPLQLD 881
Query: 120 SSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTD 160
+ ++P+ +D+D DG EI G + ++ G +T+
Sbjct: 882 GTPQTAPVSFDVDNDGQLEILTVVNQGVLHLIKLDGREVTN 922
>gi|116751095|ref|YP_847782.1| FG-GAP repeat-containing protein [Syntrophobacter fumaroxidans
MPOB]
gi|116700159|gb|ABK19347.1| FG-GAP repeat protein [Syntrophobacter fumaroxidans MPOB]
Length = 639
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 72/144 (50%), Gaps = 16/144 (11%)
Query: 465 YSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMA---EIQGAVVAADIND 521
YSSP V DLDGDG +++L G ++ LF + G +KF + + VV ADI+
Sbjct: 53 YSSPAVADLDGDGKMEVL-GAAYSLFVLNGEDGS--QKFSVTAGSGGRVWPGVVVADIDR 109
Query: 522 DGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNI 581
DGK E+V G V GK +W + + +G S D+D DG ++VV
Sbjct: 110 DGKPEIVLAQGGGYVTVLDHAGKTVWSRQPTTSELRGLSAYDLDRDGFLEIVV------- 162
Query: 582 YVLSGKDGSKVRPYPYRTHGRVMN 605
+G GSK + Y +GR+++
Sbjct: 163 ---TGAVGSKTNTWVYNHNGRLLD 183
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 81/203 (39%), Gaps = 47/203 (23%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S+P +AD+D DG E++ A F V N +
Sbjct: 54 SSPAVADLDGDGKMEVLGAAYSLF----------------------------VLNGEDGS 85
Query: 445 VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP 504
K++ A ++ V D+D DG +I++ G VLDH GK
Sbjct: 86 QKFSV-------TAGSGGRVWPGVVVADIDRDGKPEIVLAQGGGYVTVLDHAGKTVWSRQ 138
Query: 505 LEMAEIQGAVVAADINDDGKIELVTTDTHG---NVAAWTAEGKGI--WEQ------HLKS 553
+E++G + A D++ DG +E+V T G N + G+ + W Q +
Sbjct: 139 PTTSELRG-LSAYDLDRDGFLEIVVTGAVGSKTNTWVYNHNGRLLDGWPQLANDNGYAWG 197
Query: 554 LVTQGPSIGDVDGDGHSDVVVPT 576
+ ++ D+D DG ++VVP+
Sbjct: 198 VYNSNAAVYDLDKDGQGEIVVPS 220
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 50/101 (49%), Gaps = 8/101 (7%)
Query: 61 NLELRWQ------TEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGW 114
N L+W+ + + Y++P +AD++ DGK++++ ++ L VL G DG +
Sbjct: 33 NPVLKWKHGGCYSSWCETGWYSSPAVADLDGDGKMEVLGAAY--SLFVLNGEDGSQKFSV 90
Query: 115 PAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSG 155
A V ++ DID+DG EI LA G V +G
Sbjct: 91 TAGSGGRVWPGVVVADIDRDGKPEIVLAQGGGYVTVLDHAG 131
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 44/79 (55%), Gaps = 6/79 (7%)
Query: 559 PSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEK 618
P++ D+DGDG +V+ S ++VL+G+DGS+ + GRV V++ D+ + G+
Sbjct: 56 PAVADLDGDGKMEVLGAAYS--LFVLNGEDGSQKFSVTAGSGGRVWPGVVVADIDRDGKP 113
Query: 619 SKGLTIVTTSFDGYLYLID 637
IV GY+ ++D
Sbjct: 114 E----IVLAQGGGYVTVLD 128
>gi|383761157|ref|YP_005440139.1| hypothetical protein CLDAP_02020 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381381425|dbj|BAL98241.1| hypothetical protein CLDAP_02020 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 643
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 72/289 (24%), Positives = 124/289 (42%), Gaps = 45/289 (15%)
Query: 380 DSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFN 439
D++I S+PVIAD+D DG E++I V + + + + GGI + +Y F+
Sbjct: 98 DNNIRSSPVIADLDKDGHLEIVIGVG----GDVHSPNRNERWNGGILVYRYNGSNPWGFS 153
Query: 440 LDTK---------------QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVG 484
L +V W + S D + ++++P V D+DGDG+L+I+
Sbjct: 154 LIQPRSPDGTSGWPQPCKDEVGWPPEKGYS-DPDGYWDGVWATPAVGDIDGDGDLEIVYL 212
Query: 485 TSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAA--DINDDGKIELVT--------TDTHG 534
Y H G++ +P+E+ G A D++ DG E+V + +
Sbjct: 213 GIDRFLYAWHHDGRLVSGWPIEIPVDGGLSSPALGDLDLDGLPEIVVGTMSPASLSQSWK 272
Query: 535 NVAAWTAEGKGI----WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSG-----NIYVLS 585
W +G + + ++ P+ GD+DGDG+ ++V+ + G N+
Sbjct: 273 EATLWAVKGDATIVPGFPVRTEQILLSSPASGDIDGDGYLEIVIASGYGTPGRQNLVYAW 332
Query: 586 GKDGSKVRPYPYRTHGR--VMNQVLLVDLTKRGEKSKGLTIVTTSFDGY 632
DG+ V +P G VM L D+ GE + IV +GY
Sbjct: 333 NHDGTPVPGWPVEAPGASVVMAPPALGDIDNDGE----IEIVVGCGNGY 377
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 52/128 (40%), Gaps = 29/128 (22%)
Query: 44 GLPQIDEDALVNTQCPKNLELRWQ------------------TEVSSSIYATPLIADINS 85
GLP+I +V T P +L W+ + ++P DI+
Sbjct: 253 GLPEI----VVGTMSPASLSQSWKEATLWAVKGDATIVPGFPVRTEQILLSSPASGDIDG 308
Query: 86 DGKLDIVVPSFL-----HYLEVLEGSDGDKMPGWP--AFHQSSVHSSPLLYDIDKDGVRE 138
DG L+IV+ S L DG +PGWP A S V + P L DID DG E
Sbjct: 309 DGYLEIVIASGYGTPGRQNLVYAWNHDGTPVPGWPVEAPGASVVMAPPALGDIDNDGEIE 368
Query: 139 IALATYNG 146
I + NG
Sbjct: 369 IVVGCGNG 376
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 52/128 (40%), Gaps = 21/128 (16%)
Query: 25 GDNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPKNLELRWQTEVSSS--------IYA 76
G N Q + D G PQ +D E+ W E S ++A
Sbjct: 146 GSNPWGFSLIQPRSPDGTSGWPQPCKD-----------EVGWPPEKGYSDPDGYWDGVWA 194
Query: 77 TPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPA-FHQSSVHSSPLLYDIDKDG 135
TP + DI+ DG L+IV +L DG + GWP SSP L D+D DG
Sbjct: 195 TPAVGDIDGDGDLEIVYLGIDRFLYAWH-HDGRLVSGWPIEIPVDGGLSSPALGDLDLDG 253
Query: 136 VREIALAT 143
+ EI + T
Sbjct: 254 LPEIVVGT 261
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 90/222 (40%), Gaps = 57/222 (25%)
Query: 377 VNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIV 436
V + +LS+P DID DG E++IA Y ++L +
Sbjct: 291 VRTEQILLSSPASGDIDGDGYLEIVIASGYGTPGR-----QNL---------------VY 330
Query: 437 VFNLDTKQVK-WTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGL------ 489
+N D V W + A + + + P + D+D DG ++I+VG G
Sbjct: 331 AWNHDGTPVPGWPVE-------APGASVVMAPPALGDIDNDGEIEIVVGCGNGYQADSCD 383
Query: 490 -FYVLDHHGKIREKFPLEM-------AEIQGAVVAADINDDGKIELVTTDTHGNVAAW-- 539
Y + G + FP+ + + + V ADI+ DG++E++ G + W
Sbjct: 384 KLYAWNPDGTLVNGFPIVLPSRMNSYTSMPYSPVLADIDGDGEVEILM----GRLGDWGI 439
Query: 540 --TAEGKGIWEQHLKSLVTQG-----PSIGDVDGDGHSDVVV 574
+G ++ L S T G P + D+DGDG + V+
Sbjct: 440 TVVNPIQGTYD--LSSHQTPGGLLASPVVDDIDGDGFLETVI 479
Score = 40.0 bits (92), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 73/169 (43%), Gaps = 44/169 (26%)
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFF-----DHEYYDNPEHLKELGGIDIGKYVAGAI 435
S +++ P + DIDNDG E+++ + D Y NP+ G V G
Sbjct: 350 SVVMAPPALGDIDNDGEIEIVVGCGNGYQADSCDKLYAWNPD----------GTLVNGFP 399
Query: 436 VVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGT--SFGLFYVL 493
+V L + S+ + YS P + D+DGDG ++IL+G +G+ V
Sbjct: 400 IV---------------LPSRMNSYTSMPYS-PVLADIDGDGEVEILMGRLGDWGITVV- 442
Query: 494 DHHGKIREKFPLEMAEIQGAVVAA----DINDDGKIELVTTD---THGN 535
I+ + L + G ++A+ DI+ DG +E V HGN
Sbjct: 443 ---NPIQGTYDLSSHQTPGGLLASPVVDDIDGDGFLETVIGGIDYNHGN 488
>gi|162454086|ref|YP_001616453.1| alkaline serine protease [Sorangium cellulosum So ce56]
gi|161164668|emb|CAN95973.1| probable alkaline serine protease [Sorangium cellulosum So ce56]
Length = 1211
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 103/250 (41%), Gaps = 48/250 (19%)
Query: 386 TPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQV 445
+P +ADID DG+ E++ + H P+ + L V +
Sbjct: 613 SPKMADIDGDGIRELVYPTAGGALHVLKMTPKGPQPLPRFPFMTGVTDGLRALPASPATP 672
Query: 446 KWTTDLDLSTDNASF-RAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP 504
+ + S N+ R I S+P + DLDGDG +I++ T G YV+ G R +P
Sbjct: 673 DYLSAPAYSKVNSELGREPILSAPAIADLDGDGRQEIVISTWPGTIYVIGADGGARRGWP 732
Query: 505 LEMAEI-----------------------QGAV---VAADINDDGKIELVTTDTHGNVAA 538
+ + EI +GA V AD++ DG+++++ G V A
Sbjct: 733 IRLPEIPSCPLDPGAPASAPCMSAAARIARGAFASPVLADLDGDGRLDVIQAAFDGTVHA 792
Query: 539 WTAEGKG-------------IWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTL-------- 577
+ +G + E+ +S + P++ D +GDG D++V +
Sbjct: 793 FDRDGNALRGFPVEVHYAGPLAEEPARSRLLATPAVADFNGDGLPDLLVASSERLGEDGE 852
Query: 578 SGNIYVLSGK 587
+G +YV+ G+
Sbjct: 853 AGAVYVIDGR 862
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 10/132 (7%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIE 526
SP + D+DGDG +++ T+ G +VL K + P + G DG
Sbjct: 613 SPKMADIDGDGIRELVYPTAGGALHVLKMTPKGPQPLP-RFPFMTGVT-------DGLRA 664
Query: 527 LVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSG 586
L + + + A K + + + + P+I D+DGDG ++V+ T G IYV+ G
Sbjct: 665 LPASPATPDYLSAPAYSK-VNSELGREPILSAPAIADLDGDGRQEIVISTWPGTIYVI-G 722
Query: 587 KDGSKVRPYPYR 598
DG R +P R
Sbjct: 723 ADGGARRGWPIR 734
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 42/74 (56%), Gaps = 1/74 (1%)
Query: 73 SIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDID 132
+I + + D+N DG +++ S ++L +G G + G+P F S+ ++P + D+D
Sbjct: 1033 AISGSQAVVDLNGDGYPEVLSGSSGYFLHAFDGC-GREPRGFPKFTGQSITTTPAVGDLD 1091
Query: 133 KDGVREIALATYNG 146
DG EIA+ T +G
Sbjct: 1092 GDGTLEIAVGTRDG 1105
Score = 40.0 bits (92), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 36/80 (45%), Gaps = 1/80 (1%)
Query: 464 IYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP-LEMAEIQGAVVAADINDD 522
I S VVDL+GDG ++L G+S + D G+ FP I D++ D
Sbjct: 1034 ISGSQAVVDLNGDGYPEVLSGSSGYFLHAFDGCGREPRGFPKFTGQSITTTPAVGDLDGD 1093
Query: 523 GKIELVTTDTHGNVAAWTAE 542
G +E+ G + AW E
Sbjct: 1094 GTLEIAVGTRDGWLFAWHTE 1113
>gi|71414127|ref|XP_809177.1| FG-GAP repeat protein [Trypanosoma cruzi strain CL Brener]
gi|70873518|gb|EAN87326.1| FG-GAP repeat protein, putative [Trypanosoma cruzi]
Length = 162
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 58 CPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAF 117
C + L W V SS++ATP I D+N DG +I+VP++ Y E L+G +G+ G+P
Sbjct: 82 CRSGVYLEWTARVGSSVFATPRIVDLNHDGNKEILVPTYTQYFEALDGVNGEDEIGFPFV 141
Query: 118 HQS-SVHSSPLLYDIDKDG 135
H + + SP+ D+D DG
Sbjct: 142 HPNFKSYCSPIPVDMDGDG 160
>gi|307111516|gb|EFN59750.1| hypothetical protein CHLNCDRAFT_133373 [Chlorella variabilis]
Length = 91
Score = 65.1 bits (157), Expect = 2e-07, Method: Composition-based stats.
Identities = 32/90 (35%), Positives = 50/90 (55%), Gaps = 11/90 (12%)
Query: 767 IKQSQIFARRGKYRIKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLL 826
I + + G + +++P R+T T+ VEM D+ L F DEFSL+FH+++Y+LLKWL+
Sbjct: 9 IGMADVINSTGAFTMEIPVPRSRSTATIRVEMRDETKLLFVDEFSLSFHIHFYRLLKWLI 68
Query: 827 VLPMLGMFGVLVILRPQEAMPLPSFSRNTD 856
P + AM L +F +TD
Sbjct: 69 AGPFVA-----------AAMALLAFRADTD 87
>gi|397567685|gb|EJK45723.1| hypothetical protein THAOC_35647, partial [Thalassiosira oceanica]
Length = 840
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 112/291 (38%), Gaps = 82/291 (28%)
Query: 340 TDELADEYNYDYDDYVDDAMWGDE----------------EWTEEQHEKIED--YVNVDS 381
T+ AD++ Y DDY +GD+ E+ Q D Y+ +
Sbjct: 412 TEFAADDHPYPVDDYYGRYGYGDDYHGYQPEPPDGWDSYDEYQNAQDAYYHDSNYLRLPP 471
Query: 382 HILSTPVIADIDN----------DGVSEMII-AVSYFFDHEYYDNPEHLKELGGID---- 426
H+LST +A++ D + EM++ AVSY+FD + D
Sbjct: 472 HLLSTCTLAELPRAYAGSSAEAIDRIDEMLLCAVSYYFDEDECSAGRSFGRQANADGGDE 531
Query: 427 ----IGKYVAGAIVVFNLDTKQVKWTTD--LDLSTD------------------------ 456
G+YVA AI+ +N+ K W+T LDLSTD
Sbjct: 532 DEGQRGRYVASAILGYNMRWKY--WSTQEVLDLSTDWSAPLGDIVQGGTASVHSDSYNGS 589
Query: 457 ----------------NASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIR 500
+ A+ +SPT LDGD N IL+GTS GL Y L+
Sbjct: 590 ECCPALSHFSSRHILTDIEVGAWAVASPTTAKLDGDKN-HILLGTSMGLVYALEVQWHAS 648
Query: 501 EKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHL 551
+ ++ ++ D+ D +E+ D G+VA A GK +W + L
Sbjct: 649 KWVAQMRHPVERKIIVEDVVGDTNLEVFIVDGGGDVACLDANGKVLWARKL 699
>gi|406899277|gb|EKD42591.1| BNR repeat protein, partial [uncultured bacterium]
Length = 1010
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 103/226 (45%), Gaps = 50/226 (22%)
Query: 378 NVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVV 437
+ D+H +TPVIADI++DG +++ SY +L G G +
Sbjct: 358 DADTHTFATPVIADINSDGNLDLLFG-SYM---------NNLTLYTGDGSGAFT------ 401
Query: 438 FNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG---LF---- 490
LD ST+ AS +YS P D +GD N+DILVG + G LF
Sbjct: 402 -------------LDASTNVASLSTGLYSRPITGDFNGDENIDILVGNNAGELILFLGDG 448
Query: 491 ---YVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIW 547
+ LD G + +++ Q A A INDD I+L+ + G++ + +G G +
Sbjct: 449 ANTFTLDATGSLSS---VDVGS-QSAPCAGYINDDEDIDLIVGTSFGDIQIFLGDGSGNF 504
Query: 548 EQH----LKSLVTQGPSI---GDVDGDGHSDVVVPTLSGNIYVLSG 586
+ L S+ T P+I GDV+GD D+++ S ++ V G
Sbjct: 505 VEDTTNPLASIYTGVPTIPVMGDVNGDAQDDLLLTAESTSLLVYLG 550
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 92/220 (41%), Gaps = 43/220 (19%)
Query: 379 VDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVF 438
VD S PVIADIDND +++I+ D +L G G Y A
Sbjct: 258 VDVGTYSVPVIADIDNDSDNDVIVGAE--------DGKVYLYR--GDGTGNYTA------ 301
Query: 439 NLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGK 498
D + +S S P V D++ DG++D++VG SFG V +G
Sbjct: 302 -------------DTAAGLSSIAFGTKSRPAVSDINSDGDVDVIVGNSFGTTNVA--YGD 346
Query: 499 IREKFPLEMA-----EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIW----EQ 549
F + + V ADIN DG ++L+ N+ +T +G G +
Sbjct: 347 GTGNFTIASSGDADTHTFATPVIADINSDGNLDLLFGSYMNNLTLYTGDGSGAFTLDAST 406
Query: 550 HLKSLVT---QGPSIGDVDGDGHSDVVVPTLSGNIYVLSG 586
++ SL T P GD +GD + D++V +G + + G
Sbjct: 407 NVASLSTGLYSRPITGDFNGDENIDILVGNNAGELILFLG 446
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 76/169 (44%), Gaps = 35/169 (20%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S P IAD++ DG+++++I G Y+ + +N +
Sbjct: 57 SRPCIADLNGDGLNDIVI-------------------------GSYIGKVNIYYNDGASE 91
Query: 445 VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG--LFYVLDHHGKIREK 502
T ++TD +F Y ++P V DL GDG+LDI+VG G ++ D G E
Sbjct: 92 FIQDTTSAITTD--TFGPY--ATPFVGDLSGDGHLDIIVGDQRGNVNYFTGDGAGNFTEV 147
Query: 503 FPLEMAEIQGAVVA----ADINDDGKIELVTTDTHGNVAAWTAEGKGIW 547
++A + A DINDDG +L+ + G + + ++G G++
Sbjct: 148 IAGDLAGLDIGSYAEPLVTDINDDGFNDLILGENGGQLKCYLSDGAGLF 196
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 49/105 (46%), Gaps = 16/105 (15%)
Query: 61 NLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFH-- 118
N + + + +ATP+IADINSDG LD++ S+++ L + G G AF
Sbjct: 350 NFTIASSGDADTHTFATPVIADINSDGNLDLLFGSYMNNLTLYTGD------GSGAFTLD 403
Query: 119 --------QSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSG 155
+ ++S P+ D + D +I + GE++ F G
Sbjct: 404 ASTNVASLSTGLYSRPITGDFNGDENIDILVGNNAGELILFLGDG 448
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 33/144 (22%), Positives = 67/144 (46%), Gaps = 24/144 (16%)
Query: 465 YSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGK---IREKFPLEMAEIQGAVVA---AD 518
YS P + DL+GDG DI++G+ G + + G I++ + G D
Sbjct: 56 YSRPCIADLNGDGLNDIVIGSYIGKVNIYYNDGASEFIQDTTSAITTDTFGPYATPFVGD 115
Query: 519 INDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVT-------QGPSIGDVDGDGHSD 571
++ DG ++++ D GNV +T +G G + + + + P + D++ DG +D
Sbjct: 116 LSGDGHLDIIVGDQRGNVNYFTGDGAGNFTEVIAGDLAGLDIGSYAEPLVTDINDDGFND 175
Query: 572 VVVPTLSGNIYVLSGKDGSKVRPY 595
+++ G++G +++ Y
Sbjct: 176 LIL-----------GENGGQLKCY 188
>gi|297181018|gb|ADI17219.1| subtilisin-like serine proteases [uncultured delta proteobacterium
HF0070_10I02]
Length = 1127
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 72/170 (42%), Gaps = 40/170 (23%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGA------------ 513
SSP +VDLD DG L++L+G + G L G E +P+ + + G
Sbjct: 584 SSPILVDLDDDGILEVLIGNADGDVLALHGDGTAVEGWPVRIDDRSGVHRESEGYASGDV 643
Query: 514 -----------VVAADINDDGKIELVTTDTHGNVAAWT-----AEGKGIW---------- 547
V A DI+ DG++E+V G + AW EG W
Sbjct: 644 DPGTGDAFTATVAAGDIDGDGEVEIVGATLWGGIYAWNRDGSRVEGFPFWSIGRTPEEFD 703
Query: 548 -EQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYP 596
E T P++ D+DGDG +++V G +YV+ G GS PYP
Sbjct: 704 TEHTYDQGFTGAPTLRDLDGDGAAEIVAAGGDGRLYVVDGT-GSDWGPYP 752
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 69/143 (48%), Gaps = 21/143 (14%)
Query: 464 IYSSPTVVDLDGDGNLDILVGTSFGLF---------YVLDHH--------GKIREKFPLE 506
+ ++P+ DLDGDG D++ G + L+ Y H G+I E +P +
Sbjct: 905 MINNPSFGDLDGDGLPDVMQGGAGALWVASLAMTQHYDFQHAVIAWSGVTGEILEGWPRQ 964
Query: 507 MAEIQGAVV--AADINDDGKIELVTTDTHGNVAAWTAEGKGI--WEQHLKSLVTQGPSIG 562
+ +IQ V ADI+ DGK E + V AW G+ W + + P++G
Sbjct: 965 IEDIQFLVAPAVADISGDGKNEAIYGSGGYMVYAWDGTGQVAEGWPKFTGHWILGSPAVG 1024
Query: 563 DVDGDGHSDVVVPTLSGNIYVLS 585
D+DGDG+ DVV+ T G I+ S
Sbjct: 1025 DIDGDGYLDVVISTREGWIHAWS 1047
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 81/188 (43%), Gaps = 37/188 (19%)
Query: 489 LFYVLDHHGKIREKFPLEM-AEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGI- 546
+FYV I FP +M + + + + D++DDG +E++ + G+V A +G +
Sbjct: 561 VFYVTPDE-DILPGFPYDMKSSGESSPILVDLDDDGILEVLIGNADGDVLALHGDGTAVE 619
Query: 547 -W--------------EQHLKSLVTQGP--------SIGDVDGDGHSDVVVPTLSGNIYV 583
W E + V G + GD+DGDG ++V TL G IY
Sbjct: 620 GWPVRIDDRSGVHRESEGYASGDVDPGTGDAFTATVAAGDIDGDGEVEIVGATLWGGIYA 679
Query: 584 LSGKDGSKVRPYPYRTHGR---------VMNQVLLVDLTKRGEKSKGLT-IVTTSFDGYL 633
+ +DGS+V +P+ + GR +Q T R G IV DG L
Sbjct: 680 WN-RDGSRVEGFPFWSIGRTPEEFDTEHTYDQGFTGAPTLRDLDGDGAAEIVAAGGDGRL 738
Query: 634 YLIDGPTS 641
Y++DG S
Sbjct: 739 YVVDGTGS 746
Score = 46.6 bits (109), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 58/221 (26%), Positives = 91/221 (41%), Gaps = 51/221 (23%)
Query: 427 IGKYVAGAIVVFNLDTKQVK----WT---TDLDLSTDNASFRAYIYSSPTVVDLDGDGNL 479
+G + G I +N D +V+ W+ T + T++ + + +PT+ DLDGDG
Sbjct: 669 VGATLWGGIYAWNRDGSRVEGFPFWSIGRTPEEFDTEHTYDQGFT-GAPTLRDLDGDGAA 727
Query: 480 DILVGTSFGLFYVLDHHGKIREKFPLE--------------MAEIQG-----AVVAADIN 520
+I+ G YV+D G +P+E M I G + + AD++
Sbjct: 728 EIVAAGGDGRLYVVDGTGSDWGPYPIEMCFPGTAERDFEDRMCGIGGDRSIVSPMVADVD 787
Query: 521 DDGKIELVTTDTHGN----------VAAWTAEGKGIW---------EQHLKSLVTQGP-- 559
DG +E+ + A TAE + W E L L+ QG
Sbjct: 788 ADGALEIGFGTNEAVDDGRYSASYLIDAATAEPEPGWPLLESGLINEAALLPLIGQGHPS 847
Query: 560 --SIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYR 598
S DVDGDG +++ P + G +LS DG+ PY
Sbjct: 848 SMSAADVDGDGDLEIMNPVMLGQTELLS-HDGTVHLDIPYH 887
Score = 46.2 bits (108), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 53/114 (46%), Gaps = 7/114 (6%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAE-IQGAVVAADINDDGKI 525
+P V D+ GDG + + G+ + Y D G++ E +P I G+ DI+ DG +
Sbjct: 973 APAVADISGDGKNEAIYGSGGYMVYAWDGTGQVAEGWPKFTGHWILGSPAVGDIDGDGYL 1032
Query: 526 ELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSG 579
++V + G + AW+ G SL + GD HS V +PT +G
Sbjct: 1033 DVVISTREGWIHAWSTNGHADQTIEWASLHHDAANTGD-----HS-VPIPTQAG 1080
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 1/80 (1%)
Query: 77 TPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGV 136
P +ADI+ DGK + + S + + +G+ G GWP F + SP + DID DG
Sbjct: 973 APAVADISGDGKNEAIYGSGGYMVYAWDGT-GQVAEGWPKFTGHWILGSPAVGDIDGDGY 1031
Query: 137 REIALATYNGEVLFFRVSGY 156
++ ++T G + + +G+
Sbjct: 1032 LDVVISTREGWIHAWSTNGH 1051
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 30/49 (61%)
Query: 107 DGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSG 155
D D +PG+P +SS SSP+L D+D DG+ E+ + +G+VL G
Sbjct: 567 DEDILPGFPYDMKSSGESSPILVDLDDDGILEVLIGNADGDVLALHGDG 615
>gi|149371696|ref|ZP_01891112.1| hypothetical protein SCB49_09835 [unidentified eubacterium SCB49]
gi|149355323|gb|EDM43883.1| hypothetical protein SCB49_09835 [unidentified eubacterium SCB49]
Length = 583
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 65/239 (27%), Positives = 103/239 (43%), Gaps = 53/239 (22%)
Query: 389 IADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWT 448
+AD+DNDGV E+I V + D G I K V G +++
Sbjct: 76 LADLDNDGVEEIIFGVDTTLFALHGD--------GTIFFEKAVEGPVLL----------- 116
Query: 449 TDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSF----GLFYVLDHHGKIREKFP 504
P++ DLD DG+ +I++ T + G YV D +G+ +P
Sbjct: 117 ------------------PPSIADLDDDGSPEIIINTGYPTTVGRIYVTDSNGEDLPGWP 158
Query: 505 LEMAE--IQGAVVAADINDDGKIELVTTD----THGNVAAWTAEGKGI---WEQHLKSLV 555
L + + A AD++ DG +++VT + T G + A +G I W + +
Sbjct: 159 LNFDDKWMINAPAIADVDGDGSLDIVTGERFGSTQGFIHALNIDGTEINANWPVEVPATP 218
Query: 556 TQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHG--RVMNQVLLVDL 612
PSIGD+D DG +DVV+ SG +Y + + G + +P G +LVDL
Sbjct: 219 AFTPSIGDIDNDGSNDVVIAASSGGMYAFNSQ-GEILDGFPLVETGVSYSYQSPMLVDL 276
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 70/260 (26%), Positives = 103/260 (39%), Gaps = 70/260 (26%)
Query: 386 TPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQV 445
TP I DIDNDG ++++IA S +G + FN Q
Sbjct: 221 TPSIGDIDNDGSNDVVIAAS--------------------------SGGMYAFN---SQG 251
Query: 446 KWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREK 502
+ L S Y Y SP +VDLD D L+I VG + G FYVL+H
Sbjct: 252 EILDGFPLVETGVS---YSYQSPMLVDLDDDETLEI-VGANHGDNAAFYVLNHDATYAPG 307
Query: 503 FPLEMAEIQGA-VVAADINDDGKIELVTTDTH------------------GNVAAWTAEG 543
+P+ + A AD+NDDG E+ D + N+A + E
Sbjct: 308 WPIALGGWTYAPSTVADLNDDGTYEIFMGDRNTSNDGSDLPTIYGLAPDGTNIANFPIEK 367
Query: 544 KGIWEQHLKSLVTQGPSIGDVDGDGHSDVVV-PTLS-----GNIYVLSGKDGSKVRPYPY 597
G E + +IGD++ DG D++ TL+ G I+ S ++ +P
Sbjct: 368 YGGNEGVI--------TIGDINDDGVWDIIFSSTLTDAAGFGYIHAYSSDGSGEIDGFPL 419
Query: 598 RTHG-RVMNQVLLVDLTKRG 616
R G +N +L D+ G
Sbjct: 420 RPEGFTFLNGAVLGDVDNDG 439
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 65/288 (22%), Positives = 119/288 (41%), Gaps = 63/288 (21%)
Query: 379 VDSHILSTPVIADIDNDGVSEMIIAVSY----------------------FFDHEYYDNP 416
V+ +L P IAD+D+DG E+II Y FD ++ N
Sbjct: 110 VEGPVLLPPSIADLDDDGSPEIIINTGYPTTVGRIYVTDSNGEDLPGWPLNFDDKWMINA 169
Query: 417 EHLKEL---GGIDI--GKYVA---GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSP 468
+ ++ G +DI G+ G I N+D ++ +++ A +P
Sbjct: 170 PAIADVDGDGSLDIVTGERFGSTQGFIHALNIDGTEINANWPVEVPATPAF-------TP 222
Query: 469 TVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQ---GAVVAADINDDGKI 525
++ D+D DG+ D+++ S G Y + G+I + FPL + + + D++DD +
Sbjct: 223 SIGDIDNDGSNDVVIAASSGGMYAFNSQGEILDGFPLVETGVSYSYQSPMLVDLDDDETL 282
Query: 526 ELVTTDTHGNVAAW-----TAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVV------- 573
E+V + HG+ AA+ A W L ++ D++ DG ++
Sbjct: 283 EIVGAN-HGDNAAFYVLNHDATYAPGWPIALGGWTYAPSTVADLNDDGTYEIFMGDRNTS 341
Query: 574 -----VPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRG 616
+PT IY L+ DG+ + +P +G + + D+ G
Sbjct: 342 NDGSDLPT----IYGLA-PDGTNIANFPIEKYGGNEGVITIGDINDDG 384
Score = 44.7 bits (104), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 49/108 (45%), Gaps = 19/108 (17%)
Query: 59 PKNLELRWQTEVSSSIYATPLIADINSDGKLDIVV-------PSFLHYLEVLEGSDGDKM 111
P N + +W P IAD++ DG LDIV F+H L + DG ++
Sbjct: 158 PLNFDDKWMINA-------PAIADVDGDGSLDIVTGERFGSTQGFIHALNI----DGTEI 206
Query: 112 -PGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMM 158
WP ++ +P + DID DG ++ +A +G + F G ++
Sbjct: 207 NANWPVEVPATPAFTPSIGDIDNDGSNDVVIAASSGGMYAFNSQGEIL 254
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 44/92 (47%), Gaps = 3/92 (3%)
Query: 62 LELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSS 121
+ W EV ++ TP I DI++DG D+V+ + + S G+ + G+P
Sbjct: 206 INANWPVEVPATPAFTPSIGDIDNDGSNDVVIAASSGGMYAFN-SQGEILDGFPLVETGV 264
Query: 122 VHS--SPLLYDIDKDGVREIALATYNGEVLFF 151
+S SP+L D+D D EI A + F+
Sbjct: 265 SYSYQSPMLVDLDDDETLEIVGANHGDNAAFY 296
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 61/123 (49%), Gaps = 13/123 (10%)
Query: 64 LRWQTEVSSSIYATPLIADINSDGKLDIVV----PSFLHYLEVLEGSDGDKMPGWPA-FH 118
+ ++ V + P IAD++ DG +I++ P+ + + V + S+G+ +PGWP F
Sbjct: 104 IFFEKAVEGPVLLPPSIADLDDDGSPEIIINTGYPTTVGRIYVTD-SNGEDLPGWPLNFD 162
Query: 119 QSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLH 178
+ ++P + D+D DG +I GE F G++ L I ++ +W V +
Sbjct: 163 DKWMINAPAIADVDGDGSLDIV----TGE-RFGSTQGFIH--ALNIDGTEINANWPVEVP 215
Query: 179 SDP 181
+ P
Sbjct: 216 ATP 218
>gi|408492498|ref|YP_006868867.1| VCBS repeat domain protein () [Psychroflexus torquis ATCC 700755]
gi|408469773|gb|AFU70117.1| VCBS repeat domain protein () [Psychroflexus torquis ATCC 700755]
Length = 488
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 68/239 (28%), Positives = 115/239 (48%), Gaps = 32/239 (13%)
Query: 461 RAYIYSSPTVVDLDGDGNLDILVGTSF----GLFYVLDHHGKIREKFPLEMAE--IQGAV 514
I P++ DLDGDG+ +I+V T + G YV+D++G +P+ + + A
Sbjct: 111 EGLILLPPSIADLDGDGSPEIIVNTGYPTTVGRVYVIDNNGDDLPGWPITFNDKWMINAP 170
Query: 515 VAADINDDGKIELVTTD----THGNVAAWTAEGKGI---WEQHLKSLVTQGPSIGDVDGD 567
AD++ DG ++++T + + G V A +G I W + + PSIGD+D D
Sbjct: 171 AIADLDGDGTLDIITGERFGSSQGFVHALNIDGTEINENWPVEVPATPAFTPSIGDIDND 230
Query: 568 GHSDVVVPTLSGNIYVLSGK----DG-----SKVRPYPYRTHGRV---MNQVLLVDLTKR 615
G +DVV+ S +Y + + DG + VR Y Y++ V ++ L +
Sbjct: 231 GSNDVVIAASSAGMYAFNSQGVILDGFPLLDAAVR-YSYQSPMLVDLDGDETLEIVGANH 289
Query: 616 GEKSKGLTIV---TTSFDGYLYLIDGPT-SCADVVDI-GETSYSMVLADNVDGGDDLDL 669
G+ S G ++ T F G+ +G S VVD+ G+ +Y + +AD D+ DL
Sbjct: 290 GD-SPGFYVLNHDATYFPGWPIATNGWIYSTPTVVDLDGDETYEIFMADRNTSSDETDL 347
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 97/223 (43%), Gaps = 36/223 (16%)
Query: 386 TPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIG-KYVAGAIVVFNLD--- 441
TP I DIDNDG ++++IA S + + L +D +Y + ++ +LD
Sbjct: 221 TPSIGDIDNDGSNDVVIAASSAGMYAFNSQGVILDGFPLLDAAVRYSYQSPMLVDLDGDE 280
Query: 442 TKQVKWTTDLD------LSTDNASF-------RAYIYSSPTVVDLDGDGNLDILVG---- 484
T ++ D L+ D F +IYS+PTVVDLDGD +I +
Sbjct: 281 TLEIVGANHGDSPGFYVLNHDATYFPGWPIATNGWIYSTPTVVDLDGDETYEIFMADRNT 340
Query: 485 ----TSFGLFYVLDHHGKIREKFPLE-MAEIQGAVVAADINDDGKIELVTTDT------H 533
T Y L G FP+E +G + D+NDDG +++ + T
Sbjct: 341 SSDETDLPTIYGLTPSGGNLSFFPIEKYGGNEGVLTIGDVNDDGVWDIIFSSTMTDAEGF 400
Query: 534 GNVAAWTAEGKGIWEQH-LKS---LVTQGPSIGDVDGDGHSDV 572
G + A++ +G G E L+ G +GD+D DG D+
Sbjct: 401 GYIHAYSTDGSGEIEGFPLRPEGFTFLNGAVLGDIDNDGMMDL 443
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 70/148 (47%), Gaps = 18/148 (12%)
Query: 64 LRWQTEVSSSIYATPLIADINSDGKLDIVV----PSFLHYLEVLEGSDGDKMPGWP-AFH 118
+ ++ EV I P IAD++ DG +I+V P+ + + V++ ++GD +PGWP F+
Sbjct: 104 ILFEKEVEGLILLPPSIADLDGDGSPEIIVNTGYPTTVGRVYVID-NNGDDLPGWPITFN 162
Query: 119 QSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLH 178
+ ++P + D+D DG +I G F L I ++ ++W V +
Sbjct: 163 DKWMINAPAIADLDGDGTLDIITGERFGSSQGF-------VHALNIDGTEINENWPVEVP 215
Query: 179 SDP-----VDRSHPDVHDDLIVQESEAA 201
+ P + D +D+++ S A
Sbjct: 216 ATPAFTPSIGDIDNDGSNDVVIAASSAG 243
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 40/217 (18%)
Query: 513 AVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDV 572
V AD+N DG E++ + + A + +G ++E+ ++ L+ PSI D+DGDG ++
Sbjct: 73 GVTLADLNGDGTDEVIY-GINTTLFAISGDGSILFEKEVEGLILLPPSIADLDGDGSPEI 131
Query: 573 VV----PTLSGNIYVLSGKDGSKVRPYPYRTHGRVM-NQVLLVDLTKRGEKSKGLTIVTT 627
+V PT G +YV+ +G + +P + + M N + DL G L I+T
Sbjct: 132 IVNTGYPTTVGRVYVID-NNGDDLPGWPITFNDKWMINAPAIADLDGDGT----LDIITG 186
Query: 628 SFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPH 687
G +S V A N+DG T +N N + PA
Sbjct: 187 ERFG-------------------SSQGFVHALNIDG--------TEINEN-WPVEVPATP 218
Query: 688 HPLKAWRSI-NQGRNNVAIRYNRAGIYVTHPSRAFRD 723
+ I N G N+V I + AG+Y + D
Sbjct: 219 AFTPSIGDIDNDGSNDVVIAASSAGMYAFNSQGVILD 255
>gi|116625217|ref|YP_827373.1| FG-GAP repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116228379|gb|ABJ87088.1| FG-GAP repeat protein [Candidatus Solibacter usitatus Ellin6076]
Length = 431
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 75/158 (47%), Gaps = 15/158 (9%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEI------QGAVVAADINDDGKI 525
D DGDG+LD+ + + +V+ GK KF ++ + G + AAD+N DGK
Sbjct: 102 DFDGDGHLDLAIAMNNSPLHVMKGDGKGGFKFFSQLERVGPIVDCPGPLFAADLNHDGKT 161
Query: 526 ELVTTDTHGNVAAWTAEGKGIWEQ-------HLKSLVTQGPSIGDVDGDGHSDVVVPTLS 578
++V G +A + +G G + + V Q + D + DG+ D+ V T++
Sbjct: 162 DIVANQCGGGIAIFPGKGDGTFSSPRKIEVPEINRSVDQSMVLADFNNDGYIDIAVGTMT 221
Query: 579 GNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRG 616
G +L+ K+G+ P T G M ++ DL + G
Sbjct: 222 GAAVLLADKEGNFAAPLSLAT-GFPM-KIATADLNRDG 257
Score = 43.1 bits (100), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 63/140 (45%), Gaps = 16/140 (11%)
Query: 470 VVDLDGDGNLDILV---GTSFGLFYVLDHHGKIREKFPLEMAEI---QGAVVAADINDDG 523
V D++GDG D++V GT ++ D GK +++ E+ + G A D + DG
Sbjct: 48 VADVNGDGFADVIVVAPGTMAITVWLGDGSGKFKDRVDSELGGVVYSGGNAAAGDFDGDG 107
Query: 524 KIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSI--------GDVDGDGHSDVVVP 575
++L + + +GKG + + L GP + D++ DG +D+V
Sbjct: 108 HLDLAIAMNNSPLHVMKGDGKGGF-KFFSQLERVGPIVDCPGPLFAADLNHDGKTDIVAN 166
Query: 576 TLSGNIYVLSGK-DGSKVRP 594
G I + GK DG+ P
Sbjct: 167 QCGGGIAIFPGKGDGTFSSP 186
>gi|407044977|gb|EKE42933.1| hypothetical protein ENU1_004350, partial [Entamoeba nuttalli P19]
Length = 543
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 95/201 (47%), Gaps = 5/201 (2%)
Query: 400 MIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNAS 459
+++ V+YF + + + K ID KY AI + T ++ T +DL+ N+
Sbjct: 10 LVVPVTYFINPKPSVDQSGQKIEKMIDEKKYCVSAICIIEPSTGKIISLTTIDLTVMNSR 69
Query: 460 FRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLD-HHGKIR--EKFPLEMAEIQGAVVA 516
A + S + G + I+VG + G + + GK++ + +PL++ I G +V
Sbjct: 70 ESAALISGIESIGKYGIPD-TIIVGNTAGRIHQVSLEDGKVKDPQGWPLQVGPIMGRIVV 128
Query: 517 ADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGD-VDGDGHSDVVVP 575
D+ DG++ +VT D +GNV + G+ WE + + G + V G + ++
Sbjct: 129 EDVTRDGELNIVTGDINGNVVCFDMTGRVRWEMNGGGGINHGIEVLRLVLGRKITMLMFG 188
Query: 576 TLSGNIYVLSGKDGSKVRPYP 596
T +G I+ L G G + +P
Sbjct: 189 TNNGKIHCLDGSTGKNIDIFP 209
>gi|390954683|ref|YP_006418441.1| FG-GAP repeat-containing protein [Aequorivita sublithincola DSM
14238]
gi|390420669|gb|AFL81426.1| FG-GAP repeat protein [Aequorivita sublithincola DSM 14238]
Length = 585
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 91/209 (43%), Gaps = 50/209 (23%)
Query: 389 IADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWT 448
+ADIDNDGV+E++ ++ LK G + K V+G I++
Sbjct: 78 LADIDNDGVAEILYGINKIL--------YALKGDGTVLWEKPVSGTILL----------- 118
Query: 449 TDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGT----SFGLFYVLDHHGKIREKFP 504
PTV DLD DG L+I++ T + G Y+LD G +P
Sbjct: 119 ------------------PPTVADLDSDGTLEIILNTGGVPNAGRVYLLDPSGNNLPGWP 160
Query: 505 LEMAE--IQGAVVAADINDDGKIELVT----TDTHGNVAAWTAEGKGI---WEQHLKSLV 555
L + + A AD++ DG +++VT + G V +G + W + +
Sbjct: 161 LNFDDHWMINAPAVADVDGDGVLDIVTGERVSGAVGFVHVIKMDGTPLNANWPVEVAATP 220
Query: 556 TQGPSIGDVDGDGHSDVVVPTLSGNIYVL 584
PSI DVD DG++DVV+ S +Y+
Sbjct: 221 AFTPSIADVDNDGNADVVIAASSAGMYIF 249
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 99/235 (42%), Gaps = 59/235 (25%)
Query: 386 TPVIADIDNDGVSEMIIAVS----YFFDHE----------------YYDNP-------EH 418
TP IAD+DNDG ++++IA S Y FD++ Y +P +
Sbjct: 223 TPSIADVDNDGNADVVIAASSAGMYIFDNQGQVFPNFPVFDPNVKYSYQSPILADLDGDD 282
Query: 419 LKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGN 478
E+ G + G G V+ + T + W L + YS TV+D DGDG
Sbjct: 283 DLEIIGSNHGD-APGFYVMEHDGTYKAGWPITL---------GGWTYSPATVLDFDGDGV 332
Query: 479 LDILV---------GTSFGLFYVLDHHGKIREKFPLE-MAEIQGAVVAADINDDGKIEL- 527
DI + G Y LD G FP+E +G + AD+NDDG +++
Sbjct: 333 FDIFMSDRNTSGTAGDPLPTIYGLDPAGNNLPNFPIEKYGGTEGVLSIADVNDDGTLDII 392
Query: 528 ---VTTDTHGN--VAAWTAEGKG-----IWEQHLKSLVTQGPSIGDVDGDGHSDV 572
V TD G + A++ +G G H + + G +GDVD DG D+
Sbjct: 393 FPSVLTDAAGEGYIHAFSLDGSGEIPGFPLRPHGFTFL-NGAVLGDVDNDGLLDL 446
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 65/123 (52%), Gaps = 17/123 (13%)
Query: 66 WQTEVSSSIYATPLIADINSDGKLDIV-----VPSFLHYLEVLEGSDGDKMPGWPA-FHQ 119
W+ VS +I P +AD++SDG L+I+ VP+ + +L+ S G+ +PGWP F
Sbjct: 108 WEKPVSGTILLPPTVADLDSDGTLEIILNTGGVPN-AGRVYLLDPS-GNNLPGWPLNFDD 165
Query: 120 SSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYM-MTDKLEIPRRKVRKDWYVGLH 178
+ ++P + D+D DGV +I GE RVSG + +++ + +W V +
Sbjct: 166 HWMINAPAVADVDGDGVLDIV----TGE----RVSGAVGFVHVIKMDGTPLNANWPVEVA 217
Query: 179 SDP 181
+ P
Sbjct: 218 ATP 220
Score = 43.1 bits (100), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 30/134 (22%), Positives = 62/134 (46%), Gaps = 38/134 (28%)
Query: 377 VNVDSH-ILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAI 435
+N D H +++ P +AD+D DGV +++ G+ V+GA+
Sbjct: 161 LNFDDHWMINAPAVADVDGDGVLDIV-------------------------TGERVSGAV 195
Query: 436 VVFNLDTKQVKWTTDLDLSTDNASFRAYIYS----SPTVVDLDGDGNLDILVGTSFGLFY 491
++ +D + NA++ + + +P++ D+D DGN D+++ S Y
Sbjct: 196 GFVHV--------IKMDGTPLNANWPVEVAATPAFTPSIADVDNDGNADVVIAASSAGMY 247
Query: 492 VLDHHGKIREKFPL 505
+ D+ G++ FP+
Sbjct: 248 IFDNQGQVFPNFPV 261
Score = 39.3 bits (90), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 37/72 (51%), Gaps = 7/72 (9%)
Query: 80 IADINSDGKLDIVVPSFL------HYLEVLEGSDGDKMPGWPAF-HQSSVHSSPLLYDID 132
IAD+N DG LDI+ PS L Y+ ++PG+P H + + +L D+D
Sbjct: 380 IADVNDDGTLDIIFPSVLTDAAGEGYIHAFSLDGSGEIPGFPLRPHGFTFLNGAVLGDVD 439
Query: 133 KDGVREIALATY 144
DG+ ++ +Y
Sbjct: 440 NDGLLDLTANSY 451
>gi|399579038|ref|ZP_10772782.1| FG-GAP repeat protein [Halogranum salarium B-1]
gi|399236064|gb|EJN57004.1| FG-GAP repeat protein [Halogranum salarium B-1]
Length = 421
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 152/362 (41%), Gaps = 76/362 (20%)
Query: 343 LADEYNYDYD-DYVDDAMWGDEEWT----EEQHEKIEDYVNVDSHILSTPVIADIDNDGV 397
+AD DYD D VD+ E T Q ++E N+ S+ + P++ D DG
Sbjct: 110 VADPTIADYDSDGVDEVFAATTEQTVIGLHPQTGEVEFRQNLTSYGYTQPLVTDFTGDGT 169
Query: 398 SEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDN 457
+E+I+ +D V G + VF + V WT L
Sbjct: 170 NEVIV----------------------VD----VKGTVFVFRPNGTAV-WTKQL------ 196
Query: 458 ASFRAYIYSSPTVVDLDGDGNLDILVGTSF-GLFYVLDHHGKIR-EKFPLEMAEIQGAVV 515
+Y + PTV D DGD + +++VG G Y+ + +G IR ++ L + I +
Sbjct: 197 ---SSYTWGQPTVADFDGDDSAELVVGLGGNGSLYMFEQNGSIRWQRTNLFDSSIT-WMT 252
Query: 516 AADINDDGKIELVTTDTHGNVAAWTAE-GKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVV 574
+DD +E+V T G VA G+ W+++ ++ + GD DGDG +V
Sbjct: 253 TGQADDDPGVEVVAATTDGIVALVDGRTGQIQWQRNFEAFAAV-HAFGDGDGDGEPEVYA 311
Query: 575 PTLSGNIYVLSGKDG----------SKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTI 624
GN+ L+ DG V+ P + G + G+++ L
Sbjct: 312 VARDGNLRSLAASDGRVEWTTTLTTEDVQMTPPPSMGDI-----------DGDRNPELVA 360
Query: 625 VTTSFDGYLYLIDGPTSCADVVDIGETSYSMV----LADNVDGGDDLDLIVTTMNGNVFC 680
VT +G + L+D P S +++D E ++ LAD VDG ++ VT +G V
Sbjct: 361 VTN--NGIVSLVD-PQS-GEIIDSYEREVPILTQPTLAD-VDGDGVTEIFVTYGDGRVVA 415
Query: 681 FS 682
FS
Sbjct: 416 FS 417
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 49/208 (23%), Positives = 87/208 (41%), Gaps = 11/208 (5%)
Query: 435 IVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLD 494
+V N T +W + A+ + + PT+ D D DG ++ T+ L
Sbjct: 84 LVALNATTGNRRW----NYPIPRANCTIHAVADPTIADYDSDGVDEVFAATTEQTVIGLH 139
Query: 495 -HHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKS 553
G++ + L +V D DG E++ D G V + G +W + L S
Sbjct: 140 PQTGEVEFRQNLTSYGYTQPLVT-DFTGDGTNEVIVVDVKGTVFVFRPNGTAVWTKQLSS 198
Query: 554 LVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLT 613
P++ D DGD +++VV L GN + + +R RT+ + + + T
Sbjct: 199 YTWGQPTVADFDGDDSAELVV-GLGGNGSLYMFEQNGSIR--WQRTN--LFDSSITWMTT 253
Query: 614 KRGEKSKGLTIVTTSFDGYLYLIDGPTS 641
+ + G+ +V + DG + L+DG T
Sbjct: 254 GQADDDPGVEVVAATTDGIVALVDGRTG 281
>gi|373458248|ref|ZP_09550015.1| peptidase S8 and S53 subtilisin kexin sedolisin [Caldithrix abyssi
DSM 13497]
gi|371719912|gb|EHO41683.1| peptidase S8 and S53 subtilisin kexin sedolisin [Caldithrix abyssi
DSM 13497]
Length = 1114
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/277 (23%), Positives = 123/277 (44%), Gaps = 43/277 (15%)
Query: 387 PVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGG-------IDIGKYVAGAIVVFN 439
P+IAD++NDG +E++I+ +LK + G I +G+ + + V +
Sbjct: 668 PIIADMNNDGQNEIVISERV---------NGYLKIINGDGSLMLDITVGEQIRSELTVAD 718
Query: 440 LDTK---QVKWTT-----------DLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGT 485
L+ ++ + T +LS S+ A + V D+DGDG +++ G
Sbjct: 719 LNGDGNLEIIFGTMSRNVHAIQVDGSELSGFPLSYSAPLDKGVAVGDVDGDGQPELVFGL 778
Query: 486 SFGLFYVLDHHGKIREKFPLEM-AEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGK 544
+ FYV++ G + FP+++ A + V AD+N G I V T T + + G
Sbjct: 779 TNTDFYVVNTDGSVLSGFPVDLPARVANKPVIADLN--GTIYYVLTTTGRELKILNSAGA 836
Query: 545 GIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLS-----GNIYVLSGKDGSKVRPYPYRT 599
+ V PS+ DV+ DG ++ T + G +Y+++ +G + P+P +
Sbjct: 837 EVLTYDTVDPVNATPSLCDVNNDGELEIAFGTNNSTGNYGKLYLIN-FNGDTLAPFPKKL 895
Query: 600 HGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLI 636
+ + DL G ++TT+ GY+Y +
Sbjct: 896 DAPINTSPVFADLDNDGL----FEVMTTTEGGYVYCL 928
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/179 (22%), Positives = 89/179 (49%), Gaps = 5/179 (2%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEM-AEIQGAVVAADINDDGKIELVTT 530
D+ GDGN++++ + YV D +G ++ FP+ + + + AD+N+DG+ E+V +
Sbjct: 625 DVTGDGNMEVVAVSEGDSLYVYDSNGNLQNGFPMYLGGTVSMGPIIADMNNDGQNEIVIS 684
Query: 531 D-THGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDG 589
+ +G + +G + + + + ++ D++GDG+ +++ T+S N++ + DG
Sbjct: 685 ERVNGYLKIINGDGSLMLDITVGEQIRSELTVADLNGDGNLEIIFGTMSRNVHAIQ-VDG 743
Query: 590 SKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDI 648
S++ +P + V + D+ G+ + T F Y+ DG VD+
Sbjct: 744 SELSGFPLSYSAPLDKGVAVGDVDGDGQPELVFGLTNTDF--YVVNTDGSVLSGFPVDL 800
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 84/175 (48%), Gaps = 21/175 (12%)
Query: 388 VIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGG--IDIGKYVAGAIVVFNLDTK-- 443
+ D+D DG E++ ++ + ++Y L G +D+ VA V+ +L+
Sbjct: 762 AVGDVDGDGQPELVFGLT---NTDFYVVNTDGSVLSGFPVDLPARVANKPVIADLNGTIY 818
Query: 444 QVKWTTDLDLSTDN---ASFRAY-----IYSSPTVVDLDGDGNLDILVGTS-----FGLF 490
V TT +L N A Y + ++P++ D++ DG L+I GT+ +G
Sbjct: 819 YVLTTTGRELKILNSAGAEVLTYDTVDPVNATPSLCDVNNDGELEIAFGTNNSTGNYGKL 878
Query: 491 YVLDHHGKIREKFPLEM-AEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGK 544
Y+++ +G FP ++ A I + V AD+++DG E++TT G V +G
Sbjct: 879 YLINFNGDTLAPFPKKLDAPINTSPVFADLDNDGLFEVMTTTEGGYVYCLRPDGS 933
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 63/124 (50%), Gaps = 4/124 (3%)
Query: 555 VTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTK 614
V+ + GDV GDG+ +VV + ++YV +G+ +P G V ++ D+
Sbjct: 617 VSDAVAAGDVTGDGNMEVVAVSEGDSLYVYD-SNGNLQNGFPMYLGGTVSMGPIIADMNN 675
Query: 615 RGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTM 674
G+ +++ +GYL +I+G S + +GE S + +++G +L++I TM
Sbjct: 676 DGQNE---IVISERVNGYLKIINGDGSLMLDITVGEQIRSELTVADLNGDGNLEIIFGTM 732
Query: 675 NGNV 678
+ NV
Sbjct: 733 SRNV 736
Score = 44.3 bits (103), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 36/145 (24%), Positives = 69/145 (47%), Gaps = 21/145 (14%)
Query: 500 REKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEG--KGIWEQHLKSLVTQ 557
+ FP+ + AV A D+ DG +E+V ++ + + G + + +L V+
Sbjct: 607 QTGFPITNIAVSDAVAAGDVTGDGNMEVVAVSEGDSLYVYDSNGNLQNGFPMYLGGTVSM 666
Query: 558 GPSIGDVDGDGHSDVVVP-TLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRG 616
GP I D++ DG +++V+ ++G + +++G DGS L++D+T
Sbjct: 667 GPIIADMNNDGQNEIVISERVNGYLKIING-DGS-----------------LMLDITVGE 708
Query: 617 EKSKGLTIVTTSFDGYLYLIDGPTS 641
+ LT+ + DG L +I G S
Sbjct: 709 QIRSELTVADLNGDGNLEIIFGTMS 733
Score = 43.1 bits (100), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 47/86 (54%), Gaps = 4/86 (4%)
Query: 74 IYATPLIADINSDGKLDIVVP---SFLHYLEV-LEGSDGDKMPGWPAFHQSSVHSSPLLY 129
+ ATP + D+N+DG+L+I S +Y ++ L +GD + +P + +++SP+
Sbjct: 847 VNATPSLCDVNNDGELEIAFGTNNSTGNYGKLYLINFNGDTLAPFPKKLDAPINTSPVFA 906
Query: 130 DIDKDGVREIALATYNGEVLFFRVSG 155
D+D DG+ E+ T G V R G
Sbjct: 907 DLDNDGLFEVMTTTEGGYVYCLRPDG 932
>gi|334365833|ref|ZP_08514782.1| FG-GAP repeat protein [Alistipes sp. HGB5]
gi|313157939|gb|EFR57345.1| FG-GAP repeat protein [Alistipes sp. HGB5]
Length = 606
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/261 (24%), Positives = 106/261 (40%), Gaps = 70/261 (26%)
Query: 379 VDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVF 438
+D + P IAD D DG ++I G++V G
Sbjct: 169 IDVYGAPNPCIADFDGDGDPDLIC-------------------------GEFVDGLTWFE 203
Query: 439 NLDTK-QVKWTTDLDLSTDNASFRAYI-YSSPTVVDLDGDGNLDILVGTSFGLFYVLDHH 496
N+ T+ + ++ L+ + R ++ P V D DGDG+ D++VG G + H
Sbjct: 204 NVGTRTEPRFAAGRPLANKHGEIRLHLEMIVPVVSDFDGDGHPDLIVGDEDGRVAWVRHT 263
Query: 497 GKIREKFPLEMAEIQ----------GAV---VAADINDDGKIELVTTDTHGNVA------ 537
GK+++ P + + GA+ A D + DGK +++ ++ G +A
Sbjct: 264 GKVKKGMPQFESPVYFTQQADPVKFGALSTPCAFDWDGDGKQDIIAGNSAGEIAFIRNLS 323
Query: 538 -----AW------TAEGKGIWEQHLKSLVTQGP----------SIGDVDGDGHSDVVVPT 576
W T G+ I Q ++ QGP S+ D DGDG D++V +
Sbjct: 324 GGENPVWDAPRLFTVNGRPIRIQAGENGSIQGPAERKWGYTVLSVADWDGDGLPDIIVNS 383
Query: 577 LSGNIYV---LSGKDGSKVRP 594
+ G I L GKDG ++ P
Sbjct: 384 IWGKIEWFRNLGGKDGLRLAP 404
Score = 43.1 bits (100), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 54/236 (22%), Positives = 91/236 (38%), Gaps = 48/236 (20%)
Query: 390 ADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAI--VVFNLDTKQVKW 447
AD D DG ++++ + + D+ + + + L G++ G + V+ L+ +
Sbjct: 105 ADWDGDGDKDIVVGIDTWDDYGWDNAFDSL--------GRWTRGPLHGYVYLLENTGRGY 156
Query: 448 TTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSF-GLFY--------------- 491
+ A Y +P + D DGDG+ D++ G GL +
Sbjct: 157 VNRGKVEAAGAPIDVYGAPNPCIADFDGDGDPDLICGEFVDGLTWFENVGTRTEPRFAAG 216
Query: 492 --VLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEG---KGI 546
+ + HG+IR LEM V +D + DG +L+ D G VA G KG+
Sbjct: 217 RPLANKHGEIR--LHLEMI----VPVVSDFDGDGHPDLIVGDEDGRVAWVRHTGKVKKGM 270
Query: 547 WEQHLKSLVTQG-----------PSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSK 591
+ TQ P D DGDG D++ +G I + G +
Sbjct: 271 PQFESPVYFTQQADPVKFGALSTPCAFDWDGDGKQDIIAGNSAGEIAFIRNLSGGE 326
>gi|159900232|ref|YP_001546479.1| FG-GAP repeat-containing protein [Herpetosiphon aurantiacus DSM
785]
gi|159893271|gb|ABX06351.1| FG-GAP repeat protein [Herpetosiphon aurantiacus DSM 785]
Length = 514
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 72/289 (24%), Positives = 110/289 (38%), Gaps = 63/289 (21%)
Query: 390 ADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTT 449
ADIDNDG +E+++ + + Y G+ ++G + LD + W
Sbjct: 225 ADIDNDGYAEILMGRDELYFYAYNGK------------GQLLSGWPLQTYLDESKRTWGK 272
Query: 450 DLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVG-----------TSFGLFYVLDHHGK 498
D + Y +P + DLD DG +I+ + V+ GK
Sbjct: 273 D--------KYIEYTRVAPAIGDLDNDGKQEIVFAGKVRDPLSNHEQTTSALLVVGADGK 324
Query: 499 IREKF--------PLEMAEI--QGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWE 548
+ + PL+ V AD +DGK+E+V T G + A+ +G +W
Sbjct: 325 RKAGWSVGKTVGRPLKANYFTPNNPVALADFENDGKLEIVVTYDDGTIRAYKHDGTQLWS 384
Query: 549 QHL---KSLVTQGPSIGDVDGDGHSDVVVPTLS----GNIYV---LSGKDGSKVRPYPYR 598
+ K L +IGDV GDG D+V T S N YV G+ PYP +
Sbjct: 385 YNYTSGKKLYASEVAIGDVTGDGKVDIVFGTYSFDTTANTYVRLYALNTSGANQAPYPLK 444
Query: 599 --------THGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGP 639
+ L DL + G+ I+ S G LY+ D P
Sbjct: 445 FTYEPNSTNFKGISAGPTLADLDRDGDTE----ILAHSKGGVLYVWDTP 489
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 102/251 (40%), Gaps = 65/251 (25%)
Query: 383 ILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDT 442
++P + D++NDG E++ A S G I ++ D
Sbjct: 44 FFASPSVVDLNNDGKLEVLTADS--------------------------TGCIWGWDQDG 77
Query: 443 KQVK---WTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGT--------SFGLFY 491
K++ W T D I SS + D+D DG L+++ GT G +
Sbjct: 78 KELAGFPWKTGTVCDDD-----PRINSSLAIGDIDNDGKLEVVAGTRGSAATSGKRGKVF 132
Query: 492 VLDHHGKIREKFPLEMAEIQ---------GAVVAADINDDGKIEL--VTTDTHG------ 534
V D+ G++R +P EMA +V A+I D ++E+ +TT+ G
Sbjct: 133 VFDNQGRVRSGWPREMAWANITNGNEPEFMSVTIANIVGDSRMEVIGITTNEAGSDTNYA 192
Query: 535 -NVAAWTAEGKGI----WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDG 589
NV AW+ G + H S S D+D DG++++++ Y +GK G
Sbjct: 193 PNVYAWSHNGATVSGFPTSSHKGSGAFGQVSAADIDNDGYAEILMGRDELYFYAYNGK-G 251
Query: 590 SKVRPYPYRTH 600
+ +P +T+
Sbjct: 252 QLLSGWPLQTY 262
>gi|218960506|ref|YP_001740281.1| putative Gingipain R [Candidatus Cloacamonas acidaminovorans]
gi|167729163|emb|CAO80074.1| putative Gingipain R [Candidatus Cloacamonas acidaminovorans str.
Evry]
Length = 1206
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/239 (25%), Positives = 107/239 (44%), Gaps = 25/239 (10%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAE-IQGAVVAADINDDGKI 525
+P + D+DG+G L+ +VG G Y + G++ FP E+ I + AAD + +G
Sbjct: 821 TPVLADIDGNGILETIVGGINGKLYAVSSSGQLMNGFPFELGGIINSELAAADFDSNGSF 880
Query: 526 ELVTTDTHGNVAAWTAEG--KGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYV 583
E+V T+G + A G + + L +T P+I + ++ +V + +IY+
Sbjct: 881 EIVAGSTNGLLTVVGANGIIRNGFPVQLNGPITGFPTITN------TNRIVCSTPTDIYI 934
Query: 584 LSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYL-----IDG 638
+S +G+ + TH + + ++T + G I + +G LY ID
Sbjct: 935 IS-PEGNIIATRNINTH--ITGGFAVGNITA---DNTGTDIAGVTLNGILYAFTDSGIDL 988
Query: 639 PTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSIN 697
P V+IGE L N+D L++IV +V+ + A PL + IN
Sbjct: 989 P---GFPVNIGENFVCPPLLVNLDDDPQLEIIVHNYMNSVYVYK--ADGSPLSGFPFIN 1042
Score = 42.7 bits (99), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 43/183 (23%), Positives = 79/183 (43%), Gaps = 12/183 (6%)
Query: 499 IREKFPLE-MAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGK--GIWEQHLKSLV 555
I +FP E + + A + AD+N +E++ D +GN + G+ E L+
Sbjct: 714 IDNRFPWETIVNGKSAPLVADLNSSPGLEIIYADVYGNSYIIGSNGELLNTLEAPSGMLI 773
Query: 556 TQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKR 615
+ + G +D D +D+V T SGNI+ L +G + + Y+T + +L D+
Sbjct: 774 NRSFASGPIDSDDQNDLVFCTRSGNIFALR-LNGDII--FNYQTDTSFLFTPVLADIDGN 830
Query: 616 GEKSKGLTIVTTSFDGYLYLI--DGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTT 673
G L + +G LY + G ++G S + A + D +++ +
Sbjct: 831 GI----LETIVGGINGKLYAVSSSGQLMNGFPFELGGIINSELAAADFDSNGSFEIVAGS 886
Query: 674 MNG 676
NG
Sbjct: 887 TNG 889
>gi|373458247|ref|ZP_09550014.1| peptidase S8 and S53 subtilisin kexin sedolisin [Caldithrix abyssi
DSM 13497]
gi|371719911|gb|EHO41682.1| peptidase S8 and S53 subtilisin kexin sedolisin [Caldithrix abyssi
DSM 13497]
Length = 1138
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 74/147 (50%), Gaps = 6/147 (4%)
Query: 492 VLDHHGKIREKFPLEMAEIQGA-VVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQH 550
+L+ G++ FP+++ E V AD++ DG+ E+ +G + + +G + ++
Sbjct: 683 LLNDAGQVVAPFPIDLGEYHRVPQVVADLDGDGQQEIAVLSNYGKLKVFKTDGSLLLDRD 742
Query: 551 LKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLV 610
L +V + D+DGDG ++++ T+ ++VL+ G ++ +P V +
Sbjct: 743 LNEVVYGSFAADDLDGDGQPEIIIATMRKKLHVLA-LSGEELPGFPVALSSIAPEGVAIG 801
Query: 611 DLTKRGEKSKGLTIVTTSFDGYLYLID 637
D+ + G K IV ++FD L+L D
Sbjct: 802 DVNEDGRKE----IVVSAFDNTLHLFD 824
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 69/169 (40%), Gaps = 47/169 (27%)
Query: 388 VIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKW 447
V+AD+D DG E+ + +Y G + VF D +
Sbjct: 707 VVADLDGDGQQEIAVLSNY--------------------------GKLKVFKTDGSLL-- 738
Query: 448 TTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEM 507
D DL+ +Y S DLDGDG +I++ T +VL G+ FP+ +
Sbjct: 739 -LDRDLNE-------VVYGSFAADDLDGDGQPEIIIATMRKKLHVLALSGEELPGFPVAL 790
Query: 508 AEIQGAVVA-ADINDDGKIELVTT---------DTHG-NVAAWTAEGKG 545
+ I VA D+N+DG+ E+V + D +G +A W E G
Sbjct: 791 SSIAPEGVAIGDVNEDGRKEIVVSAFDNTLHLFDRNGAELAGWPVELPG 839
>gi|82523679|emb|CAI78461.1| hypothetical protein [uncultured candidate division WS3 bacterium]
Length = 1063
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 95/204 (46%), Gaps = 35/204 (17%)
Query: 498 KIREKFPLEMAEIQGAVV--AADINDDGKIELVTTDTHGNVAAWTAEGK------GIWEQ 549
+ ++ +P+++++ G + ADI+ DG E++ G V AW +G G++ +
Sbjct: 522 RWKDGWPVDLSKSLGTMTPRVADIDGDGDGEVIVASRDGEVYAWHHDGTPIVPMCGVFGR 581
Query: 550 H--LKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSK-------VRPYPYRTH 600
+ +T+ PS D+DGDG ++V+V + +G++YV G D +P
Sbjct: 582 FAAVPGGITRSPSAADIDGDGDAEVIVASAAGSLYVWDGTDEDADGYADLHSAGFPVALD 641
Query: 601 GRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYS---MVL 657
G + ++ D +++ GL I S G L ++DG D +G + YS +VL
Sbjct: 642 GPASSTPVVSDF----DEAPGLEIAVASAGGDLTIVDG-----DGSHLGSSPYSFGHLVL 692
Query: 658 ADNVDGGDDLD------LIVTTMN 675
D DLD ++ TT N
Sbjct: 693 EDVCLAAGDLDGDGFSEIVTTTTN 716
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 83/384 (21%), Positives = 138/384 (35%), Gaps = 109/384 (28%)
Query: 376 YVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAI 435
+ V I +P ADID DG +E+I+A + AG++
Sbjct: 582 FAAVPGGITRSPSAADIDGDGDAEVIVASA--------------------------AGSL 615
Query: 436 VVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDH 495
V++ + DL + + S+P V D D L+I V ++ G ++D
Sbjct: 616 YVWDGTDEDADGYADLHSAGFPVALDGPASSTPVVSDFDEAPGLEIAVASAGGDLTIVDG 675
Query: 496 HGKIREKFPLEMAEI---QGAVVAADINDDGKIELVTTDTH-GNVAAWTAEGKGI----- 546
G P + + A D++ DG E+VTT T+ G VAA A+G +
Sbjct: 676 DGSHLGSSPYSFGHLVLEDVCLAAGDLDGDGFSEIVTTTTNRGWVAALNADGSSVGGWPV 735
Query: 547 ----WEQH--------------------------------------------LKSLVTQG 558
WEQ L+ V
Sbjct: 736 LVDSWEQETVNVAVGDIDRAPGGSPEVVAVGSGGVVHVWDGSGRELPGWPVDLRRGVDAR 795
Query: 559 PSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGR-----VMNQVLLVDLT 613
PS+GD+DGDG+ ++V+P+ + + L +G++V +P G + L+ D+
Sbjct: 796 PSLGDLDGDGYLEIVIPSGTSTVEGLR-ANGTRVENWPLAAAGGDSTRPIGASALIGDVD 854
Query: 614 KRGEKSKGLTIVTTSFDGYLYLIDG---------PTSCADVVDIGETSYSMVLADNVDGG 664
G L ++T G ++L D P S + S A ++D
Sbjct: 855 GDGA----LNVLTAGAGGSMFLHDAVSGRPVPGWPYSS-------DPSLGTPWAGDIDND 903
Query: 665 DDLDLIVTTMNGNVFCFSTPAPHH 688
+LD++ +G V P H
Sbjct: 904 GELDVLFAGSSGRVLLMGLPYGHE 927
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 43/178 (24%), Positives = 73/178 (41%), Gaps = 18/178 (10%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVLDHHGK-------IREKFPLEMAEIQGAVVAADI 519
+P V D+DGDG+ +++V + G Y H G + +F I + AADI
Sbjct: 539 TPRVADIDGDGDGEVIVASRDGEVYAWHHDGTPIVPMCGVFGRFAAVPGGITRSPSAADI 598
Query: 520 NDDGKIELVTTDTHGNVAAWTA---EGKGIWEQH-------LKSLVTQGPSIGDVDGDGH 569
+ DG E++ G++ W + G + H L + P + D D
Sbjct: 599 DGDGDAEVIVASAAGSLYVWDGTDEDADGYADLHSAGFPVALDGPASSTPVVSDFDEAPG 658
Query: 570 SDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTT 627
++ V + G++ ++ G DGS + PY V+ V L G+ + TT
Sbjct: 659 LEIAVASAGGDLTIVDG-DGSHLGSSPYSFGHLVLEDVCLAAGDLDGDGFSEIVTTTT 715
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 68/150 (45%), Gaps = 8/150 (5%)
Query: 460 FRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMA------EIQGA 513
R + + P++ DLDGDG L+I++ + L +G E +PL A I +
Sbjct: 788 LRRGVDARPSLGDLDGDGYLEIVIPSGTSTVEGLRANGTRVENWPLAAAGGDSTRPIGAS 847
Query: 514 VVAADINDDGKIELVTTDTHGNVAAWTA-EGKGIWEQHLKSLVTQG-PSIGDVDGDGHSD 571
+ D++ DG + ++T G++ A G+ + S + G P GD+D DG D
Sbjct: 848 ALIGDVDGDGALNVLTAGAGGSMFLHDAVSGRPVPGWPYSSDPSLGTPWAGDIDNDGELD 907
Query: 572 VVVPTLSGNIYVLSGKDGSKVRPYPYRTHG 601
V+ SG + ++ G + + T G
Sbjct: 908 VLFAGSSGRVLLMGLPYGHEAGAMVWSTEG 937
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 1/78 (1%)
Query: 74 IYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDK 133
I A+ LI D++ DG L+++ + + + G +PGWP S+ +P DID
Sbjct: 844 IGASALIGDVDGDGALNVLTAGAGGSMFLHDAVSGRPVPGWPYSSDPSL-GTPWAGDIDN 902
Query: 134 DGVREIALATYNGEVLFF 151
DG ++ A +G VL
Sbjct: 903 DGELDVLFAGSSGRVLLM 920
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 43/90 (47%), Gaps = 6/90 (6%)
Query: 66 WQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSS---- 121
W ++ + A P + D++ DG L+IV+PS +E L ++G ++ WP
Sbjct: 784 WPVDLRRGVDARPSLGDLDGDGYLEIVIPSGTSTVEGLR-ANGTRVENWPLAAAGGDSTR 842
Query: 122 -VHSSPLLYDIDKDGVREIALATYNGEVLF 150
+ +S L+ D+D DG + A G +
Sbjct: 843 PIGASALIGDVDGDGALNVLTAGAGGSMFL 872
>gi|386722376|ref|YP_006188702.1| hypothetical protein B2K_09425 [Paenibacillus mucilaginosus K02]
gi|384089501|gb|AFH60937.1| hypothetical protein B2K_09425 [Paenibacillus mucilaginosus K02]
Length = 1258
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 105/243 (43%), Gaps = 42/243 (17%)
Query: 463 YIYSSPTVVDLDGDGNLDI------------LVGTSFGLFYVLDHHGKIREKFPLEMAEI 510
Y +S D +GDG+ DI L+G G F LDH ++
Sbjct: 30 YYPASAASGDFNGDGHEDITVANLGGHSVSVLLGDGTGRFTTLDHDTPVQSPR------- 82
Query: 511 QGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG---IWEQHLKSLVTQGPSIGDVDGD 567
A+ D+N+DGK+++VTT+ V+ +G+G + + + + + +IGD +GD
Sbjct: 83 --AISVGDLNEDGKLDVVTTNVQNTVSVLLGDGEGGLALPDDYSTAGSPRYTAIGDFNGD 140
Query: 568 GHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLL--------VDLTKRGEKS 619
GH D+ V + SG+I V+ G+ + G V V+ +DL S
Sbjct: 141 GHQDLAVASESGHISVMLGQGNGQFGGRTDYHAGAVGRSVVAGNFTGDANLDLAVTNSYS 200
Query: 620 KGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLAD-NVDGGDDLDLIVTTMNGNV 678
+TI+T + G L D AD+ + M AD N DG DL ++NGNV
Sbjct: 201 NDVTILTGTGTGAFVLSD--FYFADIHPM-----IMTTADFNRDG--LADLAAASLNGNV 251
Query: 679 FCF 681
Sbjct: 252 SVL 254
>gi|297526457|ref|YP_003668481.1| FG-GAP repeat protein [Staphylothermus hellenicus DSM 12710]
gi|297255373|gb|ADI31582.1| FG-GAP repeat protein [Staphylothermus hellenicus DSM 12710]
Length = 526
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 56/222 (25%), Positives = 104/222 (46%), Gaps = 22/222 (9%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTK- 443
+TP ++DID DG+ E+II + Y E + +D+G Y+ V+++++
Sbjct: 212 TTPSLSDIDGDGIDEIIIVEGTYLHVIDYSVNEEKYVVYSLDLGSYLVPPPVLYDVNNDG 271
Query: 444 ---QVKWTTD-----LDLSTDNASFRAYI-----YSSPTVVDLDGDGNLDILVGTSFGLF 490
V + D +DL + ++SP++ ++D D +I+VGT GL
Sbjct: 272 SDDAVIVSKDAHVFIVDLLNGGIIGETKLNATDTFASPSIGNVDDDPEPEIVVGTMDGL- 330
Query: 491 YVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTA-EGKGI--- 546
Y+L+ +++ FP ++ Q + + ADI+ DGK E++ G A + G
Sbjct: 331 YILNMSLNVKKYFP-DLQTYQSSPIIADIDGDGKNEIIFGLHTGEFVIVNATKPSGFFRE 389
Query: 547 --WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSG 586
W + + P+I DVD DG ++++ + N+Y G
Sbjct: 390 VEWLYSTDAPIMASPAISDVDNDGLPEILIGSRDYNMYCFKG 431
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 33/51 (64%)
Query: 546 IWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYP 596
+W+ H + V P+IGD++ DG+ +VV + GN+YV++G +G + YP
Sbjct: 67 LWKTHSELCVASSPAIGDINNDGYKEVVYSSCDGNLYVVNGSNGEVLWKYP 117
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 59/124 (47%), Gaps = 6/124 (4%)
Query: 42 QLGLPQIDEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLE 101
+G I E NT +L W+T + ++P I DIN+DG ++V S L
Sbjct: 44 HVGYVNIREIIGNNTYRFYAFDLLWKTHSELCVASSPAIGDINNDGYKEVVYSSCDGNLY 103
Query: 102 VLEGSDGDKMPGWPAFHQSSVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSG---YMM 158
V+ GS+G+ + +P + + P +YD+D DG +I +G + G +++
Sbjct: 104 VVNGSNGEVLWKYPT---GGMFADPTIYDVDGDGYYDILTVGASGNLYCLNYDGSEKWVI 160
Query: 159 TDKL 162
D++
Sbjct: 161 NDRM 164
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 78/192 (40%), Gaps = 38/192 (19%)
Query: 383 ILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDT 442
+ S+P I DI+NDG E++ + G + V N
Sbjct: 76 VASSPAIGDINNDGYKEVVYSSC--------------------------DGNLYVVNGSN 109
Query: 443 KQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREK 502
+V W +++ PT+ D+DGDG DIL + G Y L++ G EK
Sbjct: 110 GEVLWKYP----------TGGMFADPTIYDVDGDGYYDILTVGASGNLYCLNYDGS--EK 157
Query: 503 FPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG 562
+ + G+ D++ D +E+V G V G + ++ + PS+
Sbjct: 158 WVINDRMFSGSPAVGDVDGDDDVEVVMGSNDGYVYVINRNGTVEAKIYVGDMPATTPSLS 217
Query: 563 DVDGDGHSDVVV 574
D+DGDG ++++
Sbjct: 218 DIDGDGIDEIII 229
>gi|223939963|ref|ZP_03631830.1| FG-GAP repeat protein [bacterium Ellin514]
gi|223891377|gb|EEF57871.1| FG-GAP repeat protein [bacterium Ellin514]
Length = 938
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 70/236 (29%), Positives = 104/236 (44%), Gaps = 32/236 (13%)
Query: 472 DLDGDGNLDILVGTSFG-LFYVLDHHGKIREKFPLEMAEIQGA----VVAADINDDGKIE 526
D++GD +D++ S G VL + G F + + G+ V AAD+N DGK +
Sbjct: 326 DINGDSKVDLISANSGGNSLTVLTNDGS--GHFAVSASPGAGSGTFGVTAADVNGDGKAD 383
Query: 527 LVTTDTHGN-VAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVV-----VPTL 577
L++ + GN + T +G G + V GP + DV+GDG D++ V +L
Sbjct: 384 LISANFSGNSLTVLTNDGTGHFVLSASPGVGFGPGSVTAADVNGDGKVDLISVNMSVSSL 443
Query: 578 SGNI------YVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDG 631
S I + LS G+ P PY +N VDL S LT++T + G
Sbjct: 444 SVLINDGAGHFALSSTLGAG--PTPYSVAAADVNGDGQVDLICADSGSTILTVLTNNGTG 501
Query: 632 YLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPH 687
+ L + D+GE S+V A NV+G +DLI N N T P
Sbjct: 502 HFGLFN-------TFDVGEAPVSVV-ATNVNGDGLVDLICANHNANTLTVLTNTPR 549
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 59/224 (26%), Positives = 95/224 (42%), Gaps = 27/224 (12%)
Query: 467 SPTVV---DLDGDGNLDILVGTSF-GLFYVLDHHGKIREKFP--LEMAEIQGAVVAADIN 520
+PT V D++GDG +D++ VL + G + F L + V+A DIN
Sbjct: 269 APTSVAAADVNGDGKVDLISANQNDSTLTVLTNTGTGQFIFSALLSVGSFPHCVIATDIN 328
Query: 521 DDGKIELVTTDTHGN-VAAWTAEGKG---IWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT 576
D K++L++ ++ GN + T +G G + T G + DV+GDG +D++
Sbjct: 329 GDSKVDLISANSGGNSLTVLTNDGSGHFAVSASPGAGSGTFGVTAADVNGDGKADLISAN 388
Query: 577 LSGNIYVLSGKDGS---------KVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTT 627
SGN + DG+ V P +N VDL L+++
Sbjct: 389 FSGNSLTVLTNDGTGHFVLSASPGVGFGPGSVTAADVNGDGKVDLISVNMSVSSLSVLIN 448
Query: 628 SFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIV 671
G+ L + + G T YS+ AD V+G +DLI
Sbjct: 449 DGAGHFAL-------SSTLGAGPTPYSVAAAD-VNGDGQVDLIC 484
>gi|183230389|ref|XP_657167.2| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|169802957|gb|EAL51781.2| hypothetical protein EHI_049950 [Entamoeba histolytica HM-1:IMSS]
gi|449708166|gb|EMD47680.1| Hypothetical protein EHI5A_157270 [Entamoeba histolytica KU27]
Length = 543
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/201 (24%), Positives = 94/201 (46%), Gaps = 5/201 (2%)
Query: 400 MIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNAS 459
+++ V+YF + + + K ID KY I + T ++ T +DL+ N+
Sbjct: 10 LVVPVTYFINPKPSVDRSGQKIEKMIDEKKYCVSGICIIEPSTGKIISLTIIDLTVMNSR 69
Query: 460 FRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLD-HHGKIR--EKFPLEMAEIQGAVVA 516
A + S + G + I+VG + G + + GK++ + +PL++ I G +V
Sbjct: 70 ESAALISGIESIGKYGIPD-TIIVGNTAGRIHQVSLEDGKVKDPQGWPLQVGPIMGRIVV 128
Query: 517 ADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGD-VDGDGHSDVVVP 575
D+ DG++ +VT D +GNV + G+ WE + + G + V G + ++
Sbjct: 129 EDVTRDGELNIVTGDINGNVVCFDMTGRVRWEMNGGGGINHGIEVLRLVLGRKITMLMFG 188
Query: 576 TLSGNIYVLSGKDGSKVRPYP 596
T +G I+ L G G + +P
Sbjct: 189 TNNGKIHCLDGSTGKNIDIFP 209
>gi|330998153|ref|ZP_08321980.1| FG-GAP repeat protein [Paraprevotella xylaniphila YIT 11841]
gi|329569062|gb|EGG50856.1| FG-GAP repeat protein [Paraprevotella xylaniphila YIT 11841]
Length = 1580
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 67/141 (47%), Gaps = 13/141 (9%)
Query: 464 IYSSPTVVDLD---GDGNLDILVGTSF--GLFYVLDHHGKIREKFPLEMAEIQGAVVAAD 518
Y P V D+D G G +I+ + F ++D HG I+ F + G AD
Sbjct: 1161 CYRPPVVTDVDAPDGKGEKEIVFRSEFEKKPIVIMDAHGNIKASFGNLSGDFFGEPAVAD 1220
Query: 519 INDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKS----LVTQGPSIGDVDGDGHSDVVV 574
++ DG E++ + G+V W +GK S ++ P+I D+DGDG +++V
Sbjct: 1221 LDGDGYKEIICGSSDGHVYVWQHDGKPYLRSPFFSRPGQMLNCSPTICDLDGDGEKEILV 1280
Query: 575 PTLSGN---IYVLSGKDGSKV 592
T + N IY + +DGS V
Sbjct: 1281 TTRNTNLSYIYAIR-QDGSCV 1300
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/263 (24%), Positives = 101/263 (38%), Gaps = 62/263 (23%)
Query: 378 NVDSHILSTPVIADIDNDGVSEMIIAVS----YFFDHEYYDNPEHLKELGGIDIGKYVAG 433
N+ P +AD+D DG E+I S Y + H D +L+
Sbjct: 1207 NLSGDFFGEPAVADLDGDGYKEIICGSSDGHVYVWQH---DGKPYLRS------------ 1251
Query: 434 AIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILV---GTSFGLF 490
F+ + + SPT+ DLDGDG +ILV T+
Sbjct: 1252 --PFFSRPGQMLN-------------------CSPTICDLDGDGEKEILVTTRNTNLSYI 1290
Query: 491 YVLDHHGKIREKF----------PLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWT 540
Y + G F P I+ + D+N DG++E+V + V W+
Sbjct: 1291 YAIRQDGSCVGNFDSNASTPACIPYVSNGIEHPLSVGDVNGDGRLEVVALG-YDCVRIWS 1349
Query: 541 AEGKGIWEQHLKSLVTQG------PSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRP 594
G+ + + L L+T+ P + DVDGD D+V IY L DG+ +
Sbjct: 1350 DAGELLINRSLPGLLTESYINLTCPLLADVDGDDAIDIVFHQ-DNLIYALH-NDGTDITG 1407
Query: 595 YPYRTHGRVMNQVLLVDLTKRGE 617
+P T ++ N V + D+ G+
Sbjct: 1408 FPLSTADKMDNGVCVSDVDGDGK 1430
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 42/80 (52%), Gaps = 2/80 (2%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAE-IQGAVVAADINDDGK 524
+ P + D+DGD +DI+ L Y L + G FPL A+ + V +D++ DGK
Sbjct: 1372 TCPLLADVDGDDAIDIVFHQD-NLIYALHNDGTDITGFPLSTADKMDNGVCVSDVDGDGK 1430
Query: 525 IELVTTDTHGNVAAWTAEGK 544
E++ D GN+ AW GK
Sbjct: 1431 NEIIAADKSGNIYAWKTNGK 1450
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 55/125 (44%), Gaps = 10/125 (8%)
Query: 524 KIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYV 583
K +V D HGN+ A G + P++ D+DGDG+ +++ + G++YV
Sbjct: 1189 KKPIVIMDAHGNIKASFGNLSGDFFGE--------PAVADLDGDGYKEIICGSSDGHVYV 1240
Query: 584 LSGKDGSKVRPYPYRTHGRVMN-QVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSC 642
+R + G+++N + DL GEK +T T+ Y+Y I SC
Sbjct: 1241 WQHDGKPYLRSPFFSRPGQMLNCSPTICDLDGDGEKEILVTTRNTNLS-YIYAIRQDGSC 1299
Query: 643 ADVVD 647
D
Sbjct: 1300 VGNFD 1304
>gi|167392631|ref|XP_001740233.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165895745|gb|EDR23365.1| hypothetical protein EDI_257980 [Entamoeba dispar SAW760]
Length = 325
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 65/121 (53%), Gaps = 4/121 (3%)
Query: 63 ELRWQTEVS--SSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQS 120
++RW S S I A+P++ D N DG+ ++++P+ HY+E ++ + G K+PG+P +
Sbjct: 95 KIRWTQSTSGHSPILASPVVTDYNHDGRKEVLIPTQSHYIESIDINTGTKIPGFPVIIED 154
Query: 121 SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDK-LEIPRRKVRKDWYVGLHS 179
S + + ++ I A+ NG + F +G T K ++IP V +W GL +
Sbjct: 155 SGFITQAIPYQEESEEYSIVSAS-NGFIYFINKNGEYNTSKTIKIPPVYVPANWNDGLET 213
Query: 180 D 180
D
Sbjct: 214 D 214
>gi|305667423|ref|YP_003863710.1| hypothetical protein FB2170_14288 [Maribacter sp. HTCC2170]
gi|88709471|gb|EAR01704.1| hypothetical protein FB2170_14288 [Maribacter sp. HTCC2170]
Length = 474
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 103/473 (21%), Positives = 186/473 (39%), Gaps = 74/473 (15%)
Query: 75 YATPLIADINSDGKLDIVVPSFLHYLEV-------LEGSDGDKMPGWPAFHQSSVHSSPL 127
+++P I D+N DG DIV+ + + L+G G + W + + S
Sbjct: 25 FSSPRITDLNKDGIGDIVLGAGRKEFQACDSAVIALDGRSGKML--WKVGAKDQIFGSAA 82
Query: 128 LYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPVDRSHP 187
L+DI DG+ ++ + + E++ + + E P + K+W+ + + +
Sbjct: 83 LHDITNDGIMDVFINGRSAELIAINGKTGEVIWRFEKPIGQ-EKEWFNFYNPQFIPDQNG 141
Query: 188 DVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATVSNPDVKKVNESLVN 247
D DD+++ + E V +++ + PD ++ S+
Sbjct: 142 DFQDDILISNGGDV----LAEPHDPNRAPGHLVIIDSKNGKILSRAKMPDEGEIYMSVSV 197
Query: 248 VSN-PSEERKV---NESHTEMNIKLPTSVDNSSTTTVSGGTN--SSENGTNTGRRLLEDN 301
+ N PS+ + V T TS+D + +S SE G + D
Sbjct: 198 LPNMPSDYKNVIFGTGGETIGGHLYVTSIDEIYSGDLSNSIQLARSEKKGFVGPGVWVDI 257
Query: 302 NSKGSQE--------------GNDKEDVPVATAENDQALDENADSSFELFRDTDELADEY 347
NS G+ + G E + + N +A SS + R TD+ ++
Sbjct: 258 NSDGTYDIVTNAVDGRLLAFDGKSYEPIWSVSVPNTEAY-----SSVAVGRFTDDDIPDF 312
Query: 348 NYDYDDYVDDAMWGDEEWTEE-----QHEKIEDYVNVDSHILSTPVIADIDNDGVSEMII 402
Y +W D EW+++ + K+E + + LSTPV+ D+ DG E ++
Sbjct: 313 FVSYAQ----GVWPDLEWSKQVMVNGSNGKVEYEDTLGFYQLSTPVVVDLTGDGKDEALL 368
Query: 403 AVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRA 462
++Y +EY K+ I+ T++V DL +N S
Sbjct: 369 PMNYQITNEYQ--------------MKFFHNNIMAVEF-TEKVSAELDLQNEGNNLS--- 410
Query: 463 YIYSSPTVVDLDGDGNLDILV--GTSFGLFYVLD--HHGKIREKFPLEMAEIQ 511
S+P + DLD +G LDI+ GT+ Y D +I KFP+ + EI+
Sbjct: 411 ---STPWIGDLDDNGYLDIIYVHGTNVKQTYTFDGLQINRIDTKFPI-VGEIK 459
>gi|409721644|ref|ZP_11269810.1| hypothetical protein Hham1_03495 [Halococcus hamelinensis 100A6]
gi|448723995|ref|ZP_21706509.1| hypothetical protein C447_12622 [Halococcus hamelinensis 100A6]
gi|445786801|gb|EMA37564.1| hypothetical protein C447_12622 [Halococcus hamelinensis 100A6]
Length = 420
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 126/320 (39%), Gaps = 50/320 (15%)
Query: 382 HILSTPVIADIDNDGVSEMIIAV-------------SYFFDHEY----YDNP---EHLKE 421
H +S P IAD D+DG E+I A S F H+ Y P + L
Sbjct: 107 HSISDPTIADFDDDGDREVIAATSAEEVVAYDLRNGSVEFRHDLSSYGYSKPLVGDLLPG 166
Query: 422 LGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDI 481
G + + G + F V W SF P + D+DGDG +I
Sbjct: 167 AGNETVVTDLGGGVFTF-AGNGSVAW---------QRSFGDARVRQPAIGDVDGDGEPEI 216
Query: 482 LVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAW-T 540
+G G VL+ + + + A + +DD +EL G V
Sbjct: 217 AIGQLGGEAIVLERDASVAWRQAIPNASATKWMTTGQADDDAALELTFATFFGEVVTLDG 276
Query: 541 AEGKGIWEQHLKSLVTQGPSI---GDVDGDGHSDVVVPTLSGNIYVLSGKDGS---KVRP 594
A G W ++L + QG S+ GD DGDG ++V V G + +GS + R
Sbjct: 277 ANGSVEWRRNLSA---QGASVRVMGDGDGDGRTEVYVVARDGVLRSFDAANGSLDWRTR- 332
Query: 595 YPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADV----VDIGE 650
T+ RVM L DL G+ +V + G + ++D PT+ V D+
Sbjct: 333 LTAETNARVMPPPSLGDLDGDGDPE----LVAVTAAGRVLVVD-PTNGETVDSYERDVPI 387
Query: 651 TSYSMVLADNVDGGDDLDLI 670
++S V + DG D++ +I
Sbjct: 388 NTFSRVADVDGDGNDEIFVI 407
>gi|337745819|ref|YP_004639981.1| hypothetical protein KNP414_01547 [Paenibacillus mucilaginosus
KNP414]
gi|336297008|gb|AEI40111.1| FG-GAP repeat- calx-beta domain-containing protein [Paenibacillus
mucilaginosus KNP414]
Length = 662
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 50/180 (27%), Positives = 85/180 (47%), Gaps = 21/180 (11%)
Query: 469 TVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELV 528
TV +L GD ++ +L+G G F LDH ++ A+ D+N+DGK+++V
Sbjct: 65 TVANL-GDHSVSVLLGDGTGRFTTLDHDTPVQSPR---------AISVGDLNEDGKLDVV 114
Query: 529 TTDTHGNVAAWTAEGKG---IWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLS 585
TT+ V+ +G+G + + + + + +IGD +GDGH D+ V + SG+I V+
Sbjct: 115 TTNVQNTVSVLLGDGEGGLALPDDYSTAGSPRYTAIGDFNGDGHQDLAVASESGHISVML 174
Query: 586 GKDGSKVRPYPYRTHGRVMNQVLL--------VDLTKRGEKSKGLTIVTTSFDGYLYLID 637
G+ + G V V+ +DL S +TI+T + G L D
Sbjct: 175 GQGNGQFGGRTDYHAGAVGRSVVAGNFTGDANLDLAVTNSYSNDVTILTGTGTGAFVLSD 234
>gi|448734572|ref|ZP_21716796.1| hypothetical protein C450_14938 [Halococcus salifodinae DSM 8989]
gi|445800127|gb|EMA50490.1| hypothetical protein C450_14938 [Halococcus salifodinae DSM 8989]
Length = 414
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 70/167 (41%), Gaps = 12/167 (7%)
Query: 515 VAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVV 574
V AD+ DG ELV D G V G +W + L S P++ D D D +VVV
Sbjct: 155 VVADLTGDGGNELVVVDVQGTVFVLRPNGTAVWTETLSSYTWGQPAVEDFDADSQREVVV 214
Query: 575 PTLSGNIYVLSGKDGSKV--RPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGY 632
T SG + + K G+K R P+ + M T + + + IVT + DG
Sbjct: 215 GTSSGQLVLFDHK-GTKAWNRTRPFESSITWMT-------TGQADDDRATEIVTATVDGR 266
Query: 633 LYLIDGPTSCAD-VVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNV 678
+ IDG D GE + L D DG D ++ +G +
Sbjct: 267 VTAIDGERGAVQWQRDFGEFAAVHTLRDG-DGDGDPEVYAVAKDGKI 312
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 79/207 (38%), Gaps = 42/207 (20%)
Query: 387 PVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVK 446
PV+AD+ DG +E+++ +D V G + V + V
Sbjct: 154 PVVADLTGDGGNELVV----------------------VD----VQGTVFVLRPNGTAV- 186
Query: 447 WTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHG--KIREKFP 504
WT L +Y + P V D D D +++VGTS G + DH G P
Sbjct: 187 WTETL---------SSYTWGQPAVEDFDADSQREVVVGTSSGQLVLFDHKGTKAWNRTRP 237
Query: 505 LEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAE-GKGIWEQHLKSLVTQGPSIGD 563
E + AD DD E+VT G V A E G W++ ++ D
Sbjct: 238 FESSITWMTTGQAD--DDRATEIVTATVDGRVTAIDGERGAVQWQRDFGEFAAV-HTLRD 294
Query: 564 VDGDGHSDVVVPTLSGNIYVLSGKDGS 590
DGDG +V G I L+ DGS
Sbjct: 295 GDGDGDPEVYAVAKDGKIRSLNASDGS 321
Score = 39.7 bits (91), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 25/93 (26%), Positives = 45/93 (48%), Gaps = 3/93 (3%)
Query: 59 PKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFH 118
P+ E +Q +++ Y P++AD+ DG ++VV + VL +G + W
Sbjct: 135 PRTGEQVFQYDLTDYGYTQPVVADLTGDGGNELVVVDVQGTVFVLR-PNGTAV--WTETL 191
Query: 119 QSSVHSSPLLYDIDKDGVREIALATYNGEVLFF 151
S P + D D D RE+ + T +G+++ F
Sbjct: 192 SSYTWGQPAVEDFDADSQREVVVGTSSGQLVLF 224
>gi|322417945|ref|YP_004197168.1| FG-GAP repeat-containing protein [Geobacter sp. M18]
gi|320124332|gb|ADW11892.1| FG-GAP repeat protein [Geobacter sp. M18]
Length = 1091
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 76/162 (46%), Gaps = 11/162 (6%)
Query: 460 FRAYIYSSPTV----VDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVV 515
+R+ Y SP V D++ DG DI+V G ++ D ++E A +
Sbjct: 492 WRSSSYGSPLVRLQLADVNHDGRNDIVVALEDGTIHIYDGS-TLKEIRSFPTAASLRDMA 550
Query: 516 AADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVP 575
AAD++ DG +E+VT+D G + +A+ G + + ++GDVDGD H +++
Sbjct: 551 AADVDGDGNVEIVTSDGIG-IRVHSAD-AGALKWSFANAGGNSLAVGDVDGDAHQEIITS 608
Query: 576 TLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGE 617
T G YV+ G D R ++ +V + DL + G
Sbjct: 609 TYGGKGYVIDGSD----RAVKWQNDASFGARVRVADLNRDGR 646
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 107/256 (41%), Gaps = 54/256 (21%)
Query: 389 IADIDNDGVSEMIIAVSYFFDH--EYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVK 446
IAD+D DG +E+ + + +D YD HL
Sbjct: 835 IADVDGDGRNELAMTGAILYDGLIRIYDGTSHL--------------------------- 867
Query: 447 WTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLE 506
+ +A++ I+ + + D+DGDG ++I+ G F VLD ++EK+
Sbjct: 868 ------VKRQSATYSNTIFQTMAIGDVDGDGRVEIVTGRDF--LIVLDGT-TLQEKW--R 916
Query: 507 MAEIQGA---VVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGD 563
A I G + AD++ DG +E+V T G+ + G + +S + + D
Sbjct: 917 SASIGGTLYDIKLADLDRDGHVEMVAI-TGGSALVFDGV-TGTLKVSFQSPASV-LELAD 973
Query: 564 VDGDGHSDVVVPTLSGNIYVLSGKDGSKVRP-YPYRTHGRVMNQVLLVDLTKRGEKSKGL 622
VD DG+ +V++ G I V G R Y ++ + V+ + VDL G K L
Sbjct: 974 VDADGNLEVLIGRNDGKIDVFDGGTFQLKRTLYSFKPY--VIQALKAVDLDGSGYKK--L 1029
Query: 623 TIVTTSFDGYLYLIDG 638
I + G L ++DG
Sbjct: 1030 LIAS---GGILTILDG 1042
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 64/253 (25%), Positives = 108/253 (42%), Gaps = 46/253 (18%)
Query: 351 YDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDH 410
YD + A+W E T EQ + + V+ D D DGVSE++ +
Sbjct: 661 YDAVLQTAVW--EIVTAEQVDAL--------------VVDDTDGDGVSEIVYGDGQWGKL 704
Query: 411 EYYDNPEHLKELGGIDIGKYVAG-AIVVFNLD-TKQVKWTTD-----------LDLSTDN 457
D HL++ + ++G A+ +LD K+V W D ST
Sbjct: 705 HGIDVQTHLEKWAVNNTEDGISGVAVGDVDLDGRKEVVWGNGGYSSGPDYLNISDPSTGT 764
Query: 458 ASFRAYIYSSPTVV---DLDGDGNLDILVGT--------SFGLFYVLDHHGKIREKFPL- 505
+R+ + T + DLDGDG ++++ + SF +F +H K ++ P
Sbjct: 765 TKWRSTVTMGMTPLALHDLDGDGATELVLVSQSYSSGDNSFQIFDARNHDLKAVQQIPTN 824
Query: 506 EMAEIQGAVVAADINDDGKIELVTTDT---HGNVAAWTAEGKGIWEQHL--KSLVTQGPS 560
++++ AV AD++ DG+ EL T G + + + Q + + Q +
Sbjct: 825 DLSQYSHAVRIADVDGDGRNELAMTGAILYDGLIRIYDGTSHLVKRQSATYSNTIFQTMA 884
Query: 561 IGDVDGDGHSDVV 573
IGDVDGDG ++V
Sbjct: 885 IGDVDGDGRVEIV 897
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 90/210 (42%), Gaps = 45/210 (21%)
Query: 389 IADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKEL--------------------GGIDIG 428
+AD+++DG +++++A+ H Y + LKE+ G ++I
Sbjct: 506 LADVNHDGRNDIVVALEDGTIHIY--DGSTLKEIRSFPTAASLRDMAAADVDGDGNVEIV 563
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
I V + D +KW SF +S V D+DGD + +I+ T G
Sbjct: 564 TSDGIGIRVHSADAGALKW-----------SFANAGGNSLAVGDVDGDAHQEIITSTYGG 612
Query: 489 LFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGK-GIW 547
YV+D G R A V AD+N DG++E++ + + A + +W
Sbjct: 613 KGYVID--GSDRAVKWQNDASFGARVRVADLNRDGRVEIIGASAWQKITIYDAVLQTAVW 670
Query: 548 E----QHLKSLVTQGPSIGDVDGDGHSDVV 573
E + + +LV + D DGDG S++V
Sbjct: 671 EIVTAEQVDALV-----VDDTDGDGVSEIV 695
>gi|379719779|ref|YP_005311910.1| FG-GAP repeat- calx-beta domain-containing protein [Paenibacillus
mucilaginosus 3016]
gi|378568451|gb|AFC28761.1| FG-GAP repeat- calx-beta domain-containing protein [Paenibacillus
mucilaginosus 3016]
Length = 399
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/180 (28%), Positives = 86/180 (47%), Gaps = 21/180 (11%)
Query: 469 TVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELV 528
TV +L GD ++ +L+G G F LDH P++ A+ D+N+DGK+++V
Sbjct: 65 TVANL-GDHSVSVLLGDGTGRFTTLDH------DTPVQSPR---AISVGDLNEDGKLDVV 114
Query: 529 TTDTHGNVAAWTAEGKG---IWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLS 585
TT+ V+ +G+G + + + + + +IGD +GDGH D+ V + SG+I V+
Sbjct: 115 TTNVQNTVSVLLGDGEGGLALPDDYSTAGSPRYTAIGDFNGDGHQDLAVASESGHISVML 174
Query: 586 GKDGSKVRPYPYRTHGRVMNQVLL--------VDLTKRGEKSKGLTIVTTSFDGYLYLID 637
G+ + G V V+ +DL S +TI+T + G L D
Sbjct: 175 GQGNGQFGGRTDYHAGAVGRSVVAGNFTGDANLDLAVTNSYSNDVTILTGTGTGAFVLSD 234
>gi|83945730|ref|ZP_00958074.1| VCBS [Oceanicaulis sp. HTCC2633]
gi|83850820|gb|EAP88681.1| VCBS [Oceanicaulis alexandrii HTCC2633]
Length = 473
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 96/236 (40%), Gaps = 45/236 (19%)
Query: 364 EWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIA-----VSYFFDHE--YYDNP 416
EW E E V + + P ADID DG +++ + Y+F + +
Sbjct: 122 EWIEAAGETA--IVRNWCCVAAGPSFADIDGDGHADLTAGSYSPGLIYWFQGQDGGFRAR 179
Query: 417 EHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGD 476
+ L + GG+D+ I N D AY + P +D D D
Sbjct: 180 QSLTDWGGLDV-------ITRLN--------------EVDEGPHLAYA-AKPAWLDWDDD 217
Query: 477 GNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNV 536
G LD+++G + G V +HG + + ++G V A + G +L +
Sbjct: 218 GRLDLIIGNAQGDLVVRRNHGPAHQP---GITPVEGQPVFATFSWGGSHQLDVFEV---- 270
Query: 537 AAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKV 592
EG Q + LV P++GD DGDGH D++V T SG +Y L G ++
Sbjct: 271 ----VEGGAGPMQDEEYLV---PAVGDWDGDGHPDLIVGTQSGAVYFLRNLGGEEI 319
>gi|332706595|ref|ZP_08426656.1| hypothetical protein LYNGBM3L_22560 [Moorea producens 3L]
gi|332354479|gb|EGJ33958.1| hypothetical protein LYNGBM3L_22560 [Moorea producens 3L]
Length = 450
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 68/138 (49%), Gaps = 22/138 (15%)
Query: 512 GAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVT-----------QGPS 560
G ++AADINDDG+ +L+ T G +AA++ GK +W + + VT Q P
Sbjct: 38 GGLIAADINDDGQKDLIITKP-GYIAAYSHSGKALWSKSINIQVTGKSESQGLPGNQAPG 96
Query: 561 I--GDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVR----PYPYRTHGRVMNQVLLVDLTK 614
+ DVDGD ++V+ T ++++ G +G + P P T G LV
Sbjct: 97 VQAADVDGDQKTEVLFLTKDNTLHIVEGFNGETQKTIKLPSPEGTEGWEH----LVVANF 152
Query: 615 RGEKSKGLTIVTTSFDGY 632
RG+ + L + T+ +GY
Sbjct: 153 RGKGDRDLLLQATNAEGY 170
Score = 42.7 bits (99), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 47/206 (22%), Positives = 83/206 (40%), Gaps = 27/206 (13%)
Query: 424 GIDIGKYVAGAIV--VFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDI 481
G +G+Y+A + + + Q W+ D F A ++ V DLDGDG ++
Sbjct: 169 GYRMGRYLAAYPLDNLLKGENLQPLWSRD--------DFLANAHNGARVADLDGDGKDEV 220
Query: 482 LVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDT-HGNVAAWT 540
L GT ++ G++ K P++ I VA D +E+V + GN
Sbjct: 221 LGGT------IISPDGEMLLKIPIK-GHIDSLFVADVRPDIPGLEVVALEEGGGNRVFLY 273
Query: 541 AEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTH 600
+ IW+ H K Q +IGD D P G + +P+ +
Sbjct: 274 NRDRVIWKTHYKHQEPQNAAIGDFD---------PQRPGLEVWCRSRYQKHQKPFVFDAQ 324
Query: 601 GRVMNQVLLVDLTKRGEKSKGLTIVT 626
G+++ + D+ +G KG+ ++
Sbjct: 325 GQLIANYQMDDVAPKGWTDKGVEVIV 350
>gi|119493896|ref|ZP_01624460.1| putative hemagglutinin/hemolysin-related protein [Lyngbya sp. PCC
8106]
gi|119452373|gb|EAW33565.1| putative hemagglutinin/hemolysin-related protein [Lyngbya sp. PCC
8106]
Length = 2003
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 86/214 (40%), Gaps = 47/214 (21%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKY--VAGAIVVFNLDT 442
STP AD+D DG + I +Y + Y++N D G + + GA N
Sbjct: 24 STPTFADVDGDGDLDAFIGDNYG-NINYFEN----------DGGTFTEITGAANPLN--- 69
Query: 443 KQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG-LFYVLDHHGKIRE 501
F S+PT+ D+DGDG+LD +G SFG +FY + G E
Sbjct: 70 ----------------GFDVGFNSTPTLADVDGDGDLDAFIGQSFGNIFYFQNDGGTFTE 113
Query: 502 ----KFPLEMAEI--QGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLV 555
P ++ AD++ DG ++ + GN+ + +G E +
Sbjct: 114 ITGAANPFNGVDVGFSSTPTLADVDGDGDLDAFIGERDGNINYFENDGGTFTEITGAANP 173
Query: 556 TQG--------PSIGDVDGDGHSDVVVPTLSGNI 581
G P++ DVDGDG D + GNI
Sbjct: 174 FNGFDVGSASTPTLADVDGDGDLDAFIGEFDGNI 207
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 81/212 (38%), Gaps = 43/212 (20%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S+P AD+D DG + I +YY N + + GG F T
Sbjct: 402 SSPTFADVDGDGDLDAFIG-------QYYGNINYFENDGG------------TFTEITGA 442
Query: 445 VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP 504
+D+ ++ SSPT D+DGDG+LD +G S G ++ G +
Sbjct: 443 ANPLNGVDVGSN---------SSPTFADVDGDGDLDAFIGESLGNINYFENDGGTFTEIT 493
Query: 505 LEMAEIQGAVVA-------ADINDDGKIELVTTDTHGNVAAWTAEGKGIWE--------Q 549
G V AD++ DG ++ + GN+ + +G E
Sbjct: 494 GAANPFNGVDVGFSSTPTLADVDGDGDLDAFIGERDGNINYFENDGGTFTEITGAANPFN 553
Query: 550 HLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNI 581
+ + P++ DVDGDG D + GNI
Sbjct: 554 GVDVGFSSTPTLADVDGDGDLDAFIGEFDGNI 585
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 64/253 (25%), Positives = 99/253 (39%), Gaps = 38/253 (15%)
Query: 465 YSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVA-------A 517
YSSPT D+DGDG+LD +G +G ++ G + + G V A
Sbjct: 401 YSSPTFADVDGDGDLDAFIGQYYGNINYFENDGGTFTEITGAANPLNGVDVGSNSSPTFA 460
Query: 518 DINDDGKIELVTTDTHGNVAAWTAEGKGIWE--------QHLKSLVTQGPSIGDVDGDGH 569
D++ DG ++ ++ GN+ + +G E + + P++ DVDGDG
Sbjct: 461 DVDGDGDLDAFIGESLGNINYFENDGGTFTEITGAANPFNGVDVGFSSTPTLADVDGDGD 520
Query: 570 SDVVVPTLSGNIYVLSGKDGS------KVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLT 623
D + GNI G+ P+ G + L D+ G+ L
Sbjct: 521 LDAFIGERDGNINYFENDGGTFTEITGAANPFNGVDVG-FSSTPTLADVDGDGD----LD 575
Query: 624 IVTTSFDGYL-YL---------IDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTT 673
FDG + Y I G + + D+G S + AD VDG DLD +
Sbjct: 576 AFIGEFDGNINYFENDGGTFTEITGAANPLNGFDVGYNS-TPTFAD-VDGDGDLDAFIGE 633
Query: 674 MNGNVFCFSTPAP 686
+GN+ F P
Sbjct: 634 RDGNILYFENDTP 646
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 56/220 (25%), Positives = 84/220 (38%), Gaps = 47/220 (21%)
Query: 379 VDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKY--VAGAIV 436
VD S+P AD+D DG + I S + Y++N D G + + GA
Sbjct: 449 VDVGSNSSPTFADVDGDGDLDAFIGES-LGNINYFEN----------DGGTFTEITGAAN 497
Query: 437 VFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHH 496
FN D+ S S+PT+ D+DGDG+LD +G G ++
Sbjct: 498 PFN--------GVDVGFS-----------STPTLADVDGDGDLDAFIGERDGNINYFEND 538
Query: 497 GKIREKFPLEMAEIQGAVVA-------ADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQ 549
G + G V AD++ DG ++ + GN+ + +G E
Sbjct: 539 GGTFTEITGAANPFNGVDVGFSSTPTLADVDGDGDLDAFIGEFDGNINYFENDGGTFTEI 598
Query: 550 HLKSLVTQG--------PSIGDVDGDGHSDVVVPTLSGNI 581
+ G P+ DVDGDG D + GNI
Sbjct: 599 TGAANPLNGFDVGYNSTPTFADVDGDGDLDAFIGERDGNI 638
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 85/214 (39%), Gaps = 47/214 (21%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKY--VAGAIVVFNLDT 442
STP +AD+D DG + I S F + Y+ N D G + + GA FN
Sbjct: 77 STPTLADVDGDGDLDAFIGQS-FGNIFYFQN----------DGGTFTEITGAANPFN--- 122
Query: 443 KQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG-LFYVLDHHGKIRE 501
D+ S S+PT+ D+DGDG+LD +G G + Y + G E
Sbjct: 123 -----GVDVGFS-----------STPTLADVDGDGDLDAFIGERDGNINYFENDGGTFTE 166
Query: 502 ----KFPLEMAEIQGAVVA--ADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLV 555
P ++ A AD++ DG ++ + GN+ + +G E +
Sbjct: 167 ITGAANPFNGFDVGSASTPTLADVDGDGDLDAFIGEFDGNINYFENDGGTFTEITGAANP 226
Query: 556 TQG--------PSIGDVDGDGHSDVVVPTLSGNI 581
G P+ DVDGDG D + G I
Sbjct: 227 LNGFDVGSDSSPTFADVDGDGDLDAFIGERDGTI 260
>gi|357011528|ref|ZP_09076527.1| hypothetical protein PelgB_18842 [Paenibacillus elgii B69]
Length = 1083
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 61/223 (27%), Positives = 91/223 (40%), Gaps = 25/223 (11%)
Query: 387 PVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVK 446
P +AD+ DG ++I+ + D ++ G G Y +
Sbjct: 427 PALADLTGDGRPDLIVGSA--------DGGLYVYRNAGETEGAYAGQQLPAELKAPAAFA 478
Query: 447 WTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVL--DHHGKIREKFP 504
L L T +A Y+S VDL GDG D+L+G G V + G + P
Sbjct: 479 RREPLRLPTGE-PLKAGSYASVAAVDLKGDGRPDLLIGQPDGTLAVAYGEAGGAFKAPVP 537
Query: 505 L--EMAEIQGAVVAA----DINDDGKIELVTTDTHGNVAAWT----AEGKGIWE--QHLK 552
L + I+GA A D+ DG +LV D G V + A G W + L
Sbjct: 538 LLADGKPIRGAAPIAPAVGDVTGDGIPDLVIGDADGQVTLYRGTRGANGAIAWHAGEALF 597
Query: 553 SLVTQ--GPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVR 593
L + PS+ D++GDG +D+VV T G++ V +D R
Sbjct: 598 KLSARFAAPSVRDMNGDGKADLVVGTGEGDLRVYVQEDSGTGR 640
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 70/166 (42%), Gaps = 39/166 (23%)
Query: 460 FRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIR-----EKFPLEM------- 507
+RAY P + DL GDG D++VG++ G YV + G+ ++ P E+
Sbjct: 423 YRAY----PALADLTGDGRPDLIVGSADGGLYVYRNAGETEGAYAGQQLPAELKAPAAFA 478
Query: 508 --------------AEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWE----- 548
A +V A D+ DG+ +L+ G +A E G ++
Sbjct: 479 RREPLRLPTGEPLKAGSYASVAAVDLKGDGRPDLLIGQPDGTLAVAYGEAGGAFKAPVPL 538
Query: 549 ----QHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGS 590
+ ++ P++GDV GDG D+V+ G + + G G+
Sbjct: 539 LADGKPIRGAAPIAPAVGDVTGDGIPDLVIGDADGQVTLYRGTRGA 584
>gi|399574930|ref|ZP_10768688.1| hypothetical protein HSB1_07270 [Halogranum salarium B-1]
gi|399239198|gb|EJN60124.1| hypothetical protein HSB1_07270 [Halogranum salarium B-1]
Length = 404
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 126/305 (41%), Gaps = 48/305 (15%)
Query: 382 HILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLD 441
H ++ P IAD D+DGV+E++ A + ++E + F+ D
Sbjct: 96 HSVADPTIADYDDDGVNEVLAATT---ENE-----------------------VAAFDPD 129
Query: 442 TKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIRE 501
+ + ++ N + AY Y+ P V DL +++V + G+ +VL+ G
Sbjct: 130 SGEKEF---------NRTLSAYGYTQPVVADLTPASGHELVVVDASGVVFVLNSDGDALW 180
Query: 502 KFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSI 561
L+ A + AD +DDG+ EL GN + A G W++ S + S
Sbjct: 181 TQDLD-ARVFAQPTVADYDDDGEPELFAAGDDGNATLFAANGSVEWQRQAGSSILWATS- 238
Query: 562 GDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKG 621
GD D D + V T G++ G++G R GR L D + G++
Sbjct: 239 GDADDDAAYEAFVATSGGDVVAFDGREG---REEWRLDAGRYTAVHALEDTDEDGQRE-- 293
Query: 622 LTIVTTSFDGYLYLIDGPTS-CADVVDIGETSYSMV---LADNVDGGDDLDLIVTTMNGN 677
+ T+ DG L +D T A +D+ S M+ + +VDG D++ +G
Sbjct: 294 --LYVTARDGTLRRVDAATGDVAWKIDLTVESVQMMPPPVVGDVDGDGASDIVAAGNDGT 351
Query: 678 VFCFS 682
V S
Sbjct: 352 VSVVS 356
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 62/157 (39%), Gaps = 4/157 (2%)
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
K A+V + D W D + T+N + + PT+ D D DG ++L T+
Sbjct: 66 KSAGCALVALSADDGTTVW--DYQIPTENCIIHSV--ADPTIADYDDDGVNEVLAATTEN 121
Query: 489 LFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWE 548
D +E A V AD+ ELV D G V ++G +W
Sbjct: 122 EVAAFDPDSGEKEFNRTLSAYGYTQPVVADLTPASGHELVVVDASGVVFVLNSDGDALWT 181
Query: 549 QHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLS 585
Q L + V P++ D D DG ++ GN + +
Sbjct: 182 QDLDARVFAQPTVADYDDDGEPELFAAGDDGNATLFA 218
>gi|404404925|ref|ZP_10996509.1| hypothetical protein AJC13_05803 [Alistipes sp. JC136]
Length = 686
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 92/213 (43%), Gaps = 45/213 (21%)
Query: 427 IGKYVAGAIVVFNLDTK-QVKWTTDLDLSTDNASFRAYI-YSSPTVVDLDGDGNLDILVG 484
G++V G N+ T+ + ++ L+ + R ++ P V D DGDG+ D++VG
Sbjct: 274 CGEFVDGLTWFENIGTRTEPRFAAGRPLANKHGEIRLHLEMIVPVVSDFDGDGHPDLIVG 333
Query: 485 TSFGLFYVLDHHGKIREKFPLEMAEIQ----------GAV---VAADINDDGKIELVTTD 531
G + H GK+++ P + + GA+ A D + DGK +++ +
Sbjct: 334 DEDGRVAWVRHTGKVKKGMPQFESPVYFTQQADLVKFGALSTPCAFDWDGDGKQDIIAGN 393
Query: 532 THGNVA-----------AW------TAEGKGIWEQHLKSLVTQGP----------SIGDV 564
+ G +A W T GK I Q ++ QGP S+ D
Sbjct: 394 SAGEIAFIRNLSGGENPVWDAPRLFTVNGKPIRIQAGENGSIQGPAERKWGYTVLSVADW 453
Query: 565 DGDGHSDVVVPTLSGNIYV---LSGKDGSKVRP 594
DGDG D++V ++ G I L KDG K+ P
Sbjct: 454 DGDGLPDIIVNSIWGKIEWFRNLGSKDGLKLAP 486
>gi|116619760|ref|YP_821916.1| hypothetical protein Acid_0626 [Candidatus Solibacter usitatus
Ellin6076]
gi|116222922|gb|ABJ81631.1| conserved repeat domain [Candidatus Solibacter usitatus Ellin6076]
Length = 4666
Score = 53.9 bits (128), Expect = 4e-04, Method: Composition-based stats.
Identities = 60/240 (25%), Positives = 96/240 (40%), Gaps = 17/240 (7%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVL---DHHGKIREKFPLEMAEIQGAVVAADINDDG 523
S TV D +GDG D+ V + + G +R + A+ D N DG
Sbjct: 2084 SVTVGDFNGDGKPDVAVADGPDRLVTILLGNGDGTLRPGGTFAIGAFGSAIATGDFNGDG 2143
Query: 524 KIELVTTD-THGNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTLS- 578
K +LV T+ N++ + G G ++ + P +GD +GDG +D+ V + S
Sbjct: 2144 KPDLVVTNGGDDNLSIFLGNGDGTFQAGVNYATIVEPVGVVVGDFNGDGKADLAVSSFSC 2203
Query: 579 -GNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLT-IVTTSFDGYLYLI 636
N+ VL G P Y G+ + + D + G+ + S + L
Sbjct: 2204 CNNVSVLLGNGDGTFHPAIYYAAGQAPWHLAIGDFNRDGKLDLAVADYAGNSVNVLLGNG 2263
Query: 637 DGPTSCADVVDIGETSYSMVLAD-NVDGGDDLDLI------VTTMNGNVFCFSTPAPHHP 689
DG +G YS+ + D N DG DL + V+ ++GN PA +P
Sbjct: 2264 DGTFQTQAAYPVGSHPYSVAVGDFNGDGKPDLAVPGVNTNDVSILSGNGDGTFQPAVSYP 2323
Score = 50.8 bits (120), Expect = 0.003, Method: Composition-based stats.
Identities = 41/126 (32%), Positives = 62/126 (49%), Gaps = 11/126 (8%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG---AVVAADINDDGKIE 526
V D +GDG D+ VG SF V GK F + G +V AD+N DG+ +
Sbjct: 2830 VADFNGDGKADVAVG-SFSTAGVNILLGKGDGSFLAPLTNNSGWANFMVLADLNGDGRPD 2888
Query: 527 LVTTDTHGNVAAWTAEGKGIWE--QHLKSLVTQGP----SIGDVDGDGHSDVVVPTLSGN 580
LV D + + G G ++ QHL +L + GP ++GD +GDG +D+ V G
Sbjct: 2889 LVGMDGAASAYILLSNGNGTFQPVQHL-TLNSVGPPDGLAVGDFNGDGRADLAVSNFGGG 2947
Query: 581 IYVLSG 586
+ + +G
Sbjct: 2948 VQLFAG 2953
Score = 50.4 bits (119), Expect = 0.004, Method: Composition-based stats.
Identities = 49/177 (27%), Positives = 73/177 (41%), Gaps = 20/177 (11%)
Query: 513 AVVAADINDDGKIELVTTD-THGNVAAWTAEGKGIWEQHLKSLVTQGPS---IGDVDGDG 568
+V D+N DGK +LV + NV+ G G ++ + P I D D DG
Sbjct: 3484 SVTVGDLNRDGKTDLVIANFASNNVSVLLGSGDGTFQPPVNYAAGTSPQQVEISDFDSDG 3543
Query: 569 HSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEK--------SK 620
D+ V L+G++ +L G + G ++N + + D G SK
Sbjct: 3544 KPDLAVIGLTGSMSILLGNGDGTFQSAMISNVGALLNGMTIGDFNGDGNADIALADTVSK 3603
Query: 621 GLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGN 677
++I+ + DG S A +V G Y V A + DG DL VTT N N
Sbjct: 3604 AVSILLGNGDGTF------QSGASLVPGGNPEY--VAAGDFDGDGRPDLAVTTQNPN 3652
Score = 48.9 bits (115), Expect = 0.011, Method: Composition-based stats.
Identities = 31/122 (25%), Positives = 52/122 (42%), Gaps = 10/122 (8%)
Query: 468 PTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIEL 527
P +V +DG + IL+ G F + H L + D N DG+ +L
Sbjct: 2887 PDLVGMDGAASAYILLSNGNGTFQPVQH-------LTLNSVGPPDGLAVGDFNGDGRADL 2939
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS---IGDVDGDGHSDVVVPTLSGNIYVL 584
++ G V + G G ++Q + + GP +GD +GDG +D+ V + +L
Sbjct: 2940 AVSNFGGGVQLFAGNGNGTFQQAVTYMAGAGPGRLIVGDFNGDGRADIAVANSDNKVSLL 2999
Query: 585 SG 586
G
Sbjct: 3000 LG 3001
Score = 42.7 bits (99), Expect = 0.89, Method: Composition-based stats.
Identities = 59/214 (27%), Positives = 91/214 (42%), Gaps = 17/214 (7%)
Query: 470 VVDLDGDGNLDILVGTSFGLFY---VLDHHGKIREKFPLEMAEIQGA----VVAADINDD 522
V D +GDG D+ TS G FY V +G + P+ +A A + D N D
Sbjct: 2677 VADFNGDGKADLAFITS-GAFYGITVQLGNGDGTFRAPVNIAFAPSAQLFGLAVGDFNHD 2735
Query: 523 GKIELVTTDTHGN-VAAWTAEGKGIWEQHLKSLVTQGPS---IGDVDGDGHSDVVVPTLS 578
GK +L +D+ + VA T G G ++ + PS +GD + DG +D+ + +
Sbjct: 2736 GKPDLTVSDSANSFVAILTGNGDGTFQPEVDYAAGTQPSAIVVGDFNFDGMADIATVSGA 2795
Query: 579 ---GNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLV-DLTKRGEKSKGLTIVTTSFDGYLY 634
G++ VL GK P G + L+V D G+ + +T+ L
Sbjct: 2796 GSDGSVSVLLGKADGSFAPAISSATGTGADVTLMVADFNGDGKADVAVGSFSTAGVNILL 2855
Query: 635 LIDGPTSCADVVDIGETSYSMVLAD-NVDGGDDL 667
+ A + + + MVLAD N DG DL
Sbjct: 2856 GKGDGSFLAPLTNNSGWANFMVLADLNGDGRPDL 2889
>gi|163847306|ref|YP_001635350.1| Pyrrolo-quinoline quinone [Chloroflexus aurantiacus J-10-fl]
gi|222525150|ref|YP_002569621.1| Pyrrolo-quinoline quinone [Chloroflexus sp. Y-400-fl]
gi|163668595|gb|ABY34961.1| Pyrrolo-quinoline quinone [Chloroflexus aurantiacus J-10-fl]
gi|222449029|gb|ACM53295.1| Pyrrolo-quinoline quinone [Chloroflexus sp. Y-400-fl]
Length = 685
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 99/260 (38%), Gaps = 80/260 (30%)
Query: 376 YVNVDSHILSTPVIADIDNDGVSEMIIAV--------------------SYF-----FDH 410
Y +DS I S+P AD++ DG ++I+ SY+ F +
Sbjct: 213 YFAMDS-IWSSPAFADVNRDGRKDIIVGTDFTPGMVCNPNSITPFPETNSYYESAKGFLY 271
Query: 411 EYYDNPEHLKELGGIDIGKYVA---GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSS 467
+ NP + + Y A G ++ F K W D S IYSS
Sbjct: 272 AFPANPTFVADP------IYCARDGGNVIGFG---KGFLWAVRFDQS---------IYSS 313
Query: 468 PTVVDLDGDGNLDILVGTSF--------GLFYVLDHHGKIREKFPLEMAEIQGAVVA-AD 518
P V DL+ DG L+++VG+S G + + + EK L E + A D
Sbjct: 314 PAVADLENDGQLEVIVGSSCFYGGNPKPGRWVKIFNAANGTEKMTLNAPECVASSPAIGD 373
Query: 519 INDDGKIELV--------------TTDTHGNVAAWTAEGKG-IW---------EQHLKSL 554
I DGK E+V T G + AWT + IW EQ S
Sbjct: 374 ITGDGKPEIVAAVASGGSINPPVDTPREGGRLVAWTYDNPNPIWNIVVRGSTQEQADMSE 433
Query: 555 VTQGPSIGDVDGDGHSDVVV 574
P I D+DG+G +V++
Sbjct: 434 AFNNPLIADIDGNGSHEVII 453
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 36/173 (20%), Positives = 69/173 (39%), Gaps = 29/173 (16%)
Query: 377 VNVDSHILSTPVIADIDNDGVSEMIIAVSYFFD-HEYYDNPEHLKELGGIDIGKYVAGAI 435
+N + S+P I DI DG E++ AV+ + D P G +
Sbjct: 359 LNAPECVASSPAIGDITGDGKPEIVAAVASGGSINPPVDTPRE-------------GGRL 405
Query: 436 VVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDH 495
V + D W + ST + + +++P + D+DG+G+ ++++ +
Sbjct: 406 VAWTYDNPNPIWNIVVRGSTQEQADMSEAFNNPLIADIDGNGSHEVIIVVQNSVMIYNGA 465
Query: 496 HGKIREKFPLEMAEIQGAV---------------VAADINDDGKIELVTTDTH 533
++ P+ E Q + ADI++DGK+E+V +H
Sbjct: 466 GQELTPSCPIGGLEAQACMPRKSMFMWMPIRNTPAIADIDNDGKLEIVAAGSH 518
Score = 44.7 bits (104), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 39/72 (54%), Gaps = 3/72 (4%)
Query: 72 SSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPLLYDI 131
S +++TP + D+N G + IV S YL VL + D W F S+ SSP D+
Sbjct: 172 SGVFSTPSVTDVNGAGDIIIVAGSADLYLHVL---NKDGTLRWKYFAMDSIWSSPAFADV 228
Query: 132 DKDGVREIALAT 143
++DG ++I + T
Sbjct: 229 NRDGRKDIIVGT 240
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 51/193 (26%), Positives = 86/193 (44%), Gaps = 35/193 (18%)
Query: 464 IYSSPTVVDLDGD--GNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEI-------QGAV 514
I SSP V DLDG+ L+I++G G + L ++G++ + + GA
Sbjct: 58 IRSSPVVSDLDGNPSNGLEIVIGDRNGKLFALRNNGQVMWTTQVASCSVTGDDSLLNGAP 117
Query: 515 VAADINDD----------GKIELVTTDTHGNVAAWTAEGKGIWEQHL-------KSLVTQ 557
D+ D GKI L G + A++A G+ +W + +S V
Sbjct: 118 SLFDLRPDLPGLEVIVGYGKI-LADPSCPGGIRAYSATGQPLWNYQIPISSISGQSGVFS 176
Query: 558 GPSIGDVDGDGHSDVVVPTLSGNIYV-LSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRG 616
PS+ DV+G G D+++ S ++Y+ + KDG+ +R + Y + + D+ + G
Sbjct: 177 TPSVTDVNGAG--DIIIVAGSADLYLHVLNKDGT-LR-WKYFAMDSIWSSPAFADVNRDG 232
Query: 617 EKSKGLTIVTTSF 629
K IV T F
Sbjct: 233 RKD---IIVGTDF 242
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 43/95 (45%), Gaps = 21/95 (22%)
Query: 66 WQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPG-WPAFHQSS--- 121
W SIY++P +AD+ +DG+L+++V S Y G+ PG W ++
Sbjct: 302 WAVRFDQSIYSSPAVADLENDGQLEVIVGSSCFY-------GGNPKPGRWVKIFNAANGT 354
Query: 122 ----------VHSSPLLYDIDKDGVREIALATYNG 146
V SSP + DI DG EI A +G
Sbjct: 355 EKMTLNAPECVASSPAIGDITGDGKPEIVAAVASG 389
>gi|392555249|ref|ZP_10302386.1| hemolysin [Pseudoalteromonas undina NCIMB 2128]
Length = 742
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 60/229 (26%), Positives = 96/229 (41%), Gaps = 29/229 (12%)
Query: 361 GDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHL- 419
G+E W E + +Y + + + AD++NDG+ E+I+ S + Y N L
Sbjct: 366 GEELWKTENLDGTFNYKTIKATSTYSISAADVNNDGLVEVIVRDSTDHSFKIYSNSGELI 425
Query: 420 --------------KELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIY 465
+L G +++ G +V +D VK + L+ D +F+
Sbjct: 426 KAIPKGFAYGNSSISDLDGDGYAEFIIGNSIVNLIDGSIVK--LEKPLNYDKGNFKVI-- 481
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKI 525
D D G ++L G+ + D G + + QG + +DD
Sbjct: 482 -DSIAFDSDLGGQQEVLAN---GVLFTKD--GSVLWNKNTQ----QGFSAIGNFDDDEYP 531
Query: 526 ELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVV 574
E+V TD G+V +G IWE + L PSIGDVDG+G D+VV
Sbjct: 532 EIVLTDLDGSVKLLEHDGSIIWEYQSEDLGGGVPSIGDVDGNGEPDIVV 580
>gi|448417007|ref|ZP_21579110.1| hypothetical protein C474_10104 [Halosarcina pallida JCM 14848]
gi|445678690|gb|ELZ31178.1| hypothetical protein C474_10104 [Halosarcina pallida JCM 14848]
Length = 411
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 60/126 (47%), Gaps = 3/126 (2%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDH-HGKIREKFPLEMAEIQGAVVAADINDDGK 524
+ PTV DLDGDG L++L T+ V D G +R + L +VA D+
Sbjct: 101 ADPTVADLDGDGRLEVLAATTEREVAVFDAASGDVRTRLGLSNYGYTKPLVA-DLAPSPG 159
Query: 525 IELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVL 584
E+V D G + + G +W +L V P++ D D DG +++V +G +
Sbjct: 160 REIVAVDVTGTAFVFASNGTELWTANLDGYVWAQPTVADFDADGSPELLVGRRTGEAVLF 219
Query: 585 SGKDGS 590
+G DGS
Sbjct: 220 AG-DGS 224
>gi|354611490|ref|ZP_09029446.1| FG-GAP repeat protein [Halobacterium sp. DL1]
gi|353196310|gb|EHB61812.1| FG-GAP repeat protein [Halobacterium sp. DL1]
Length = 378
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 57/131 (43%), Gaps = 2/131 (1%)
Query: 463 YIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHH-GKIREKFPLEMAEIQGAVVAADIND 521
+ + P V DLDGDG L++ V T+ D G + L G ++ D
Sbjct: 67 HAVADPVVGDLDGDGTLEVFVATTERRVTGFDARTGDAEFSYELSSYGYSGPLLT-DFVG 125
Query: 522 DGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNI 581
G+ E+V D G V+ A G W + L + PS+ D D DG ++VV G
Sbjct: 126 GGEREVVVVDARGEVSVVHANGTAAWTRQLDAYTWGQPSVADFDADGDRELVVGVAGGGE 185
Query: 582 YVLSGKDGSKV 592
L +DGS V
Sbjct: 186 LNLFEQDGSTV 196
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 60/252 (23%), Positives = 100/252 (39%), Gaps = 40/252 (15%)
Query: 361 GDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAV---------------- 404
GD EW+ + + H ++ PV+ D+D DG E+ +A
Sbjct: 51 GDREWSHPVAPE-----DCAIHAVADPVVGDLDGDGTLEVFVATTERRVTGFDARTGDAE 105
Query: 405 -SYFFDHEYYDNPEHLKELGGIDIGKYVA---GAIVVFNLDTKQVKWTTDLDLSTDNASF 460
SY Y P +GG + V G + V + + WT LD
Sbjct: 106 FSYELSSYGYSGPLLTDFVGGGEREVVVVDARGEVSVVHAN-GTAAWTRQLD-------- 156
Query: 461 RAYIYSSPTVVDLDGDGNLDILVGTS-FGLFYVLDHHGK-IREKFPLEMAEIQGAVVAAD 518
AY + P+V D D DG+ +++VG + G + + G + + L I + A+
Sbjct: 157 -AYTWGQPSVADFDADGDRELVVGVAGGGELNLFEQDGSTVWHRANLTETSIT-WMTTAN 214
Query: 519 INDDGKIELVTTDTHGNVAAWTAE-GKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTL 577
++DD E+V G+V A G+ W + ++L ++GD D DG +V
Sbjct: 215 VDDDPAREIVVATRGGSVVAVDGRTGRAEWTREFEALAAV-RAVGDGDDDGTLEVYATAA 273
Query: 578 SGNIYVLSGKDG 589
G + L + G
Sbjct: 274 DGVLRNLDAESG 285
>gi|393785384|ref|ZP_10373536.1| hypothetical protein HMPREF1071_04404 [Bacteroides salyersiae
CL02T12C01]
gi|392662620|gb|EIY56178.1| hypothetical protein HMPREF1071_04404 [Bacteroides salyersiae
CL02T12C01]
Length = 1628
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 135/342 (39%), Gaps = 78/342 (22%)
Query: 326 LDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILS 385
LD +AD + E+ + + +D + D+D G+E + + N H S
Sbjct: 1213 LDGSADGTMEVIVKSHQTSDIFILDHD--------GNEVRRLNPNVHV---TNGKDHNRS 1261
Query: 386 TPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQV 445
+AD+D DG E+I + YD+ +G Y+
Sbjct: 1262 ALTVADLDGDGQMEIIAS---------YDS-----------LGIYI-------------- 1287
Query: 446 KWTTDLDLSTDNASFRAYI---YSSPTVVDLDGDGNLDIL-----VGTSFGLFYVLDHHG 497
W D T N + A I S+P V DL+ DG +IL + S LD
Sbjct: 1288 -WRQDGTPFTTNPFWGAGIPRLASAPVVCDLNEDGKKEILFSQRQMAESKVFAISLDGDK 1346
Query: 498 KI-----REKFPLEM----AEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWE 548
+ + P + + + + DIN+DG +E+V H V AWT GK I+
Sbjct: 1347 TVAGWDGSQTIPYTVNAVGSTLDHTLSVGDINNDGHLEVVILG-HETVKAWTHTGKLIFS 1405
Query: 549 QHLKSLVTQ--------GPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTH 600
+ +K L Q P + DVDGD D+V + IY L DGS + +P +
Sbjct: 1406 KSIKGLFPQENYASNMNTPILADVDGDAVPDIVF-CCNNYIYALH-NDGSDIIGFPIISD 1463
Query: 601 GRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDG-PTS 641
+ ++ + D+ G K I + Y++ DG PT+
Sbjct: 1464 EKFLDTPCVADIDNDG---KSELIAGNEHELYVWKTDGVPTA 1502
Score = 44.3 bits (103), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 41/173 (23%), Positives = 75/173 (43%), Gaps = 28/173 (16%)
Query: 464 IYSSPTVVDLDG--DGNLDILVGT-SFGLFYVLDHHGKIREKFPLEMAEIQG------AV 514
++ SP + +LDG DG ++++V + ++LDH G + + G A+
Sbjct: 1204 MFQSPVIANLDGSADGTMEVIVKSHQTSDIFILDHDGNEVRRLNPNVHVTNGKDHNRSAL 1263
Query: 515 VAADINDDGKIELVTTDTHGNVAAWTAEG-----KGIWEQHLKSLVTQGPSIGDVDGDG- 568
AD++ DG++E++ + + W +G W + L + P + D++ DG
Sbjct: 1264 TVADLDGDGQMEIIASYDSLGIYIWRQDGTPFTTNPFWGAGIPRLAS-APVVCDLNEDGK 1322
Query: 569 -----------HSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLV 610
S V +L G+ V +G DGS+ PY G ++ L V
Sbjct: 1323 KEILFSQRQMAESKVFAISLDGDKTV-AGWDGSQTIPYTVNAVGSTLDHTLSV 1374
>gi|159896865|ref|YP_001543112.1| hypothetical protein Haur_0332 [Herpetosiphon aurantiacus DSM 785]
gi|159889904|gb|ABX02984.1| hypothetical protein Haur_0332 [Herpetosiphon aurantiacus DSM 785]
Length = 580
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/296 (25%), Positives = 113/296 (38%), Gaps = 83/296 (28%)
Query: 326 LDENADSSFELFRDTDELADEYNYDY-DDYVDDAMWGDEEWTE--EQHEKIEDYVNVDSH 382
LD N S E+F DEL Y Y Y +D A W + +T + + DY+
Sbjct: 244 LDNNGKS--EIFVGRDEL---YFYRYKNDGTQYAGWPIQTYTHVNQTTWNVHDYIEFTR- 297
Query: 383 ILSTPVIADIDNDGVSEMIIAVSYFF----DHEYYD----------NPEHLKELGGIDIG 428
S P IAD+DNDG E+I A H YD P+ + G D
Sbjct: 298 --SGPAIADLDNDGSYEVIAAGKVRTPDGDPHNNYDPQSASAVFVTEPDGTRRPGWTDAK 355
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLD--------LSTDNASFRAY----------------I 464
+ AGA + N DLD ++ D+ + RAY +
Sbjct: 356 R--AGAPLDVNFTPNNPIVVADLDGNGEKEFVVTFDDGTIRAYRENGTQIWSYFYAEEPV 413
Query: 465 YSSPT---------VVDLDGDGNLDILVGT-SF--------GLFYVLDHHGKIREKFPLE 506
+PT + D+ GD LDI+ GT SF GL+ + +G + FPL
Sbjct: 414 QGAPTRKVFGSEVVIADVTGDRKLDIVFGTYSFDRLYAGIVGLYALDARNGALHSGFPLS 473
Query: 507 MAE--------------IQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWE 548
+ IQ A +DI+++ +E++ GN+ W G+ + E
Sbjct: 474 LPNEGTASDNSAGSQKGIQAAPTISDIDNNCYVEILAHSRAGNIYVWETLGRNLPE 529
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 8/102 (7%)
Query: 465 YSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPL-------EMAEIQGAVVAA 517
+SSPTVVDL DG+ +IL + G + + +G++ FP I G++V A
Sbjct: 62 FSSPTVVDLFKDGSPEILSADATGCIWGYNIYGQLLPGFPWMTGGACANTPRINGSLVVA 121
Query: 518 DINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGP 559
DIN+DG +E+V T G + GK I Q ++++ P
Sbjct: 122 DINEDGLLEIV-VGTRGKGTSVGQRGKVIVYQRNGAILSGWP 162
Score = 46.2 bits (108), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 56/259 (21%), Positives = 92/259 (35%), Gaps = 63/259 (24%)
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNL 440
S I P +AD+DN+G SE+ + + + Y ++ G AG +
Sbjct: 233 SGIWGHPALADLDNNGKSEIFVGRDELYFYRYKND------------GTQYAGWPIQTYT 280
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVG---------------- 484
Q W + + S P + DLD DG+ +++
Sbjct: 281 HVNQTTWNVH--------DYIEFTRSGPAIADLDNDGSYEVIAAGKVRTPDGDPHNNYDP 332
Query: 485 TSFGLFYVLDHHGKIREKF--------PLEMA-EIQGAVVAADINDDGKIELVTTDTHGN 535
S +V + G R + PL++ +V AD++ +G+ E V T G
Sbjct: 333 QSASAVFVTEPDGTRRPGWTDAKRAGAPLDVNFTPNNPIVVADLDGNGEKEFVVTFDDGT 392
Query: 536 VAAWTAEGKGIWEQHLKSLVTQGPS----------IGDVDGDGHSDVVVPTLSGN----- 580
+ A+ G IW QG I DV GD D+V T S +
Sbjct: 393 IRAYRENGTQIWSYFYAEEPVQGAPTRKVFGSEVVIADVTGDRKLDIVFGTYSFDRLYAG 452
Query: 581 ---IYVLSGKDGSKVRPYP 596
+Y L ++G+ +P
Sbjct: 453 IVGLYALDARNGALHSGFP 471
>gi|229523907|ref|ZP_04413312.1| hemolysin-related protein Vcp [Vibrio cholerae bv. albensis VL426]
gi|229337488|gb|EEO02505.1| hemolysin-related protein Vcp [Vibrio cholerae bv. albensis VL426]
Length = 682
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/156 (23%), Positives = 77/156 (49%), Gaps = 34/156 (21%)
Query: 382 HILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGID---IGKYVAGAIVVF 438
+++ PV+ +++D V+++I+ F+ Y +++ L G+D + Y G ++
Sbjct: 51 QVMAAPVVVQLNDDNVADIIVVT---FEGNQYTQGGYIRALSGVDGSELWSYSNGGVI-- 105
Query: 439 NLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFY-VLDHHG 497
A R + ++ DLDGDGN++I+ ++ + +LD+HG
Sbjct: 106 -------------------ADARYPVAAA----DLDGDGNIEIVSTSTLTPYINILDNHG 142
Query: 498 KIREKFPLEMAEIQ--GAVVAADINDDGKIELVTTD 531
I+++ + + GA+ AD+ DGK+E+++ D
Sbjct: 143 NIKKQILKSASGWRSVGAITLADVTGDGKLEILSAD 178
>gi|424659593|ref|ZP_18096842.1| FG-GAP repeat family protein [Vibrio cholerae HE-16]
gi|408052148|gb|EKG87207.1| FG-GAP repeat family protein [Vibrio cholerae HE-16]
Length = 664
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/156 (23%), Positives = 77/156 (49%), Gaps = 34/156 (21%)
Query: 382 HILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGID---IGKYVAGAIVVF 438
+++ PV+ +++D V+++I+ F+ Y +++ L G+D + Y G ++
Sbjct: 33 QVMAAPVVVQLNDDNVADIIVVT---FEGNQYTQGGYIRALSGVDGSELWSYSNGGVI-- 87
Query: 439 NLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFY-VLDHHG 497
A R + ++ DLDGDGN++I+ ++ + +LD+HG
Sbjct: 88 -------------------ADARYPVAAA----DLDGDGNIEIVSTSTLTPYINILDNHG 124
Query: 498 KIREKFPLEMAEIQ--GAVVAADINDDGKIELVTTD 531
I+++ + + GA+ AD+ DGK+E+++ D
Sbjct: 125 NIKKQILKSASGWRSVGAITLADVTGDGKLEILSAD 160
>gi|430749476|ref|YP_007212384.1| hypothetical protein Theco_1220 [Thermobacillus composti KWC4]
gi|430733441|gb|AGA57386.1| hypothetical protein Theco_1220 [Thermobacillus composti KWC4]
Length = 420
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 81/191 (42%), Gaps = 27/191 (14%)
Query: 388 VIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKW 447
++ D+D DG EM++A DN + ++ + + A+ F+LD + + W
Sbjct: 26 LLGDLDGDGRMEMVLAQP--------DNRQDVRYI------PHQVQALTAFDLDGRLL-W 70
Query: 448 TT---DLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP 504
D + + A I VD DGDG L++L L + G+ + P
Sbjct: 71 QVGRPDPGAGGPGSDYPAQI------VDCDGDGRLEVLCVMDGRLLVLDGRSGRAKASLP 124
Query: 505 LEMAEIQGAVVAADINDDGKI-ELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGD 563
L ++ AD++ G E + D + NV W A+G G H + P + D
Sbjct: 125 LPAPHAHDCIIPADLSGRGYCGEWLLKDRYRNV--WAADGSGRVLWHHEGNPGHYPWVRD 182
Query: 564 VDGDGHSDVVV 574
+DGDG +V+
Sbjct: 183 LDGDGKDEVMA 193
>gi|322371362|ref|ZP_08045913.1| FG-GAP repeat-containing protein [Haladaptatus paucihalophilus
DX253]
gi|320549060|gb|EFW90723.1| FG-GAP repeat-containing protein [Haladaptatus paucihalophilus
DX253]
Length = 417
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 86/192 (44%), Gaps = 15/192 (7%)
Query: 452 DLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQ 511
++ T+N + Y P D DGDG D+LV T+ GK L
Sbjct: 92 EMPTENCNIHGY--GDPIFADYDGDGTQDVLVATTEDKVLGFTSAGKQEFAGNLSWWGYT 149
Query: 512 GAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKS--LVTQGPSIGDVDGDGH 569
+V D GK E+V TD +G V ++A+G +W +H+++ V P++ D DGDG
Sbjct: 150 KPIVTDFTTDPGK-EIVVTDLNGTVFVYSADGSLVWRKHIENSATVVAAPAVADFDGDGA 208
Query: 570 SDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSF 629
++ V ++ +DGS + T G V T + + + IV +F
Sbjct: 209 PELAVGEKKVTVFE---RDGS--VKWQTGTAGSVNWMT-----TAQTDGDPAVEIVAGTF 258
Query: 630 DGYLYLIDGPTS 641
DG++ +IDG T
Sbjct: 259 DGHVTVIDGKTG 270
>gi|149920255|ref|ZP_01908726.1| cell surface protein [Plesiocystis pacifica SIR-1]
gi|149818842|gb|EDM78282.1| cell surface protein [Plesiocystis pacifica SIR-1]
Length = 538
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 95/217 (43%), Gaps = 52/217 (23%)
Query: 380 DSHILSTPVIADIDNDGVSEMII---AVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIV 436
D + L++P + D++ DG+ +++ +V Y +DH AGA++
Sbjct: 58 DENWLASPALVDVNGDGLLDIVAPRHSVLYVYDH---------------------AGALL 96
Query: 437 VFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGT---SFGL-FYV 492
Q W+ S ++ + R ++ S V DLDGDG+++I V GL V
Sbjct: 97 ------WQTAWSHSASDSPEHGTVR--MWPSAAVGDLDGDGDVEIAVSAHPDDAGLNVAV 148
Query: 493 LDHHGKIREKFPLEMAEIQ-GAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGI----W 547
DH G++ +P A+ + ++ AAD++ DG E++ T A E G W
Sbjct: 149 YDHGGELLPGWPQAYADAEVRSIAAADVDGDGAHEILITKQASGPATNVFELDGTHASGW 208
Query: 548 EQHLKSLVTQGPSI-----------GDVDGDGHSDVV 573
Q + +G I GD+DGDG DVV
Sbjct: 209 PQVGECTAPEGDCIDYGGFNQNIGAGDLDGDGVLDVV 245
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 90/210 (42%), Gaps = 36/210 (17%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLD-TK 443
S PVIADID DG E+++ DHE+ + ++ G+ + V N D T+
Sbjct: 301 SPPVIADIDADGEHEIVLG----GDHEHSASTDNQ----GV--------STWVLNADMTR 344
Query: 444 QVKWTTDLD----LSTDNASFRAY-IYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGK 498
W D L DN Y +P+V DLDG+ L+IL+ GL Y G+
Sbjct: 345 PEGWAWPKDSGPPLVYDNLGHNIVPTYPAPSVGDLDGEPGLEILIPAYDGLLYAYRSGGE 404
Query: 499 IREKFPL-EMAEIQGAVVA--ADINDDGKIE-LVTTDTHGNVAAWTAEG------KGIWE 548
+ + + GA A D++ DG E L TT G A G E
Sbjct: 405 LMWTYGFGSQSPFVGASEALIVDLSGDGSPEVLFTTYASGEPRMPEAPAHLIVLDAGGNE 464
Query: 549 QHLKSLVTQG----PSIGDVDGDGHSDVVV 574
H L +G PS+ D+DGDG ++V+
Sbjct: 465 LHEIELAHRGSMAAPSVADLDGDGQLELVI 494
>gi|422909656|ref|ZP_16944299.1| FG-GAP repeat family protein, partial [Vibrio cholerae HE-09]
gi|341634641|gb|EGS59394.1| FG-GAP repeat family protein [Vibrio cholerae HE-09]
Length = 661
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/156 (23%), Positives = 77/156 (49%), Gaps = 34/156 (21%)
Query: 382 HILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGID---IGKYVAGAIVVF 438
+++ PV+ +++D V+++I+ F+ Y +++ L G+D + Y G ++
Sbjct: 51 QVMAAPVVVQLNDDNVADIIVVT---FEGNQYTQGGYIRALSGVDGSELWSYSNGGVI-- 105
Query: 439 NLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFY-VLDHHG 497
A R + ++ DLDGDGN++I+ ++ + +LD+HG
Sbjct: 106 -------------------ADARYPVAAA----DLDGDGNIEIVSTSTLTPYINILDNHG 142
Query: 498 KIREKFPLEMAEIQ--GAVVAADINDDGKIELVTTD 531
I+++ + + GA+ AD+ DGK+E+++ D
Sbjct: 143 NIKKQILKSASGWRSVGAITLADVTGDGKLEILSAD 178
>gi|149922364|ref|ZP_01910799.1| Integrins alpha chain [Plesiocystis pacifica SIR-1]
gi|149816814|gb|EDM76303.1| Integrins alpha chain [Plesiocystis pacifica SIR-1]
Length = 425
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 93/209 (44%), Gaps = 46/209 (22%)
Query: 383 ILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDT 442
+ S+ DIDNDGV ++++ +G ++ N
Sbjct: 194 VASSISAGDIDNDGVDDLVVTTP--------------------------SGVALIQN-GP 226
Query: 443 KQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGT--SFGLF-YVLDHHGKI 499
+W T LD+ A+ A + VD+D DG LD+LV SFG YV +HG
Sbjct: 227 GGAEWMTTLDVG--GAAMHAEL------VDMDHDGRLDMLVAVMDSFGDDDYVRIYHGFG 278
Query: 500 REKFP----LEMAEIQGAVVAADINDDGKIELVTTDTHGNVAA-WTAEGKGIWEQHLKSL 554
FP ++ VV D+N DG ++LVT D NV + G+G + + +
Sbjct: 279 DGSFPEWVDHPLSGNPWWVVPGDLNMDGDLDLVTADYGSNVVSILLGNGQGGFSERTEID 338
Query: 555 VTQGP---SIGDVDGDGHSDVVVPTLSGN 580
V GP ++GD++ DG +D+VV + G+
Sbjct: 339 VCAGPQSVAVGDMNNDGANDIVVGCMDGD 367
>gi|441499033|ref|ZP_20981223.1| hypothetical protein C900_03613 [Fulvivirga imtechensis AK7]
gi|441437278|gb|ELR70632.1| hypothetical protein C900_03613 [Fulvivirga imtechensis AK7]
Length = 3840
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 60/115 (52%), Gaps = 7/115 (6%)
Query: 466 SSPTVVDLDGDGNLDI--LVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAA---DIN 520
S P + D+DGDG L+I + G V + G++ EK+ + + + +VA D +
Sbjct: 413 SRPNLGDIDGDGQLEITFIAGNQLFALEVDPNTGELSEKWRRTINDSRSGIVATTVFDFD 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGI-WEQHLKS-LVTQGPSIGDVDGDGHSDVV 573
+DG E+V D+ V G+ + W +S +T+GP I DVDGDG +D+
Sbjct: 473 NDGSPEIVYRDSQQLVVIDGETGQNVLWSSSCQSHTMTEGPIIADVDGDGATDLC 527
>gi|310818872|ref|YP_003951230.1| FG-GAP repeat/HVR domain-containing protein [Stigmatella aurantiaca
DW4/3-1]
gi|309391944|gb|ADO69403.1| FG-GAP repeat/HVR domain protein [Stigmatella aurantiaca DW4/3-1]
Length = 969
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/235 (25%), Positives = 100/235 (42%), Gaps = 33/235 (14%)
Query: 390 ADIDNDGVSEMIIAVSYFFDHE------YYDNPEHLKELGGIDIGK-----YVAGAIVVF 438
AD+D DG+ E+I S ++H+ + D + L +G D V+G IV
Sbjct: 227 ADLDGDGLQEVINGRS-IYEHDGALRCTHPDMGQGLAGVGNFDADPAGEVVIVSGGIVAL 285
Query: 439 NLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGK 498
KW L +PT+ D D DG +I V S + V++ G
Sbjct: 286 MDSDCTPKWRVSLQ--------GGGAGGAPTIADFDHDGQPEIGVAGS-ARYTVIETDGT 336
Query: 499 IREKFPLEMAEIQGAVVAA-DINDDGKIELVTTD-THGNVAAWTAEGKGIWEQHLKSLVT 556
++ P++ + + A +A D + DGK E++ D T + + H ++
Sbjct: 337 VKWTSPIQGSNLGAASSSAFDFDGDGKTEIIAADQTRLRIYDGQTGAVRVSLTHSSAIAF 396
Query: 557 QGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHG-----RVMNQ 606
+ DVDGD H+++VVP+ + I ++G +R Y +T R+ NQ
Sbjct: 397 ENAIAVDVDGDNHAELVVPSNNSTISGVAG-----IRVYREKTGAWVNTRRIWNQ 446
>gi|119486609|ref|ZP_01620659.1| hypothetical protein L8106_12700 [Lyngbya sp. PCC 8106]
gi|119456226|gb|EAW37358.1| hypothetical protein L8106_12700 [Lyngbya sp. PCC 8106]
Length = 1213
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 61/265 (23%), Positives = 99/265 (37%), Gaps = 54/265 (20%)
Query: 355 VDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYD 414
VDD + + EQ + +D S+P AD+D DG + ++ + +
Sbjct: 289 VDDGFLSETNF-NEQTGTNNPFNGIDVGYSSSPTFADVDGDGDLDAVVGGA-------FG 340
Query: 415 NPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYS--SPTVVD 472
N + K G YV Q T N F + S +PT+ D
Sbjct: 341 NLNYFKNTGSTTAPIYV------------QQTGT--------NNPFDGIVGSRKTPTLAD 380
Query: 473 LDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQ--------GAVVA---ADIND 521
+DGDG+LD +VG FG + G ++ G+ ++ AD++
Sbjct: 381 VDGDGDLDAVVGDRFGNLQYFKNTGSTTAPIYVQQTGTNNPFDGIAVGSFISPTFADVDG 440
Query: 522 DGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQ------------GPSIGDVDGDGH 569
DG ++ + D GN+ + G +++ T P+ DVDGDG
Sbjct: 441 DGDLDALVGDRFGNLNYFENTGTTTAPIYVEQTGTNNPFDGIDVGSFSSPTFADVDGDGD 500
Query: 570 SDVVVPTLSGNIYVLSGKDGSKVRP 594
D +V +GN+ GS P
Sbjct: 501 LDALVGEFNGNLNYFE-NTGSTTAP 524
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 95/249 (38%), Gaps = 66/249 (26%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMA----EIQGAVVA----- 516
SSPT D+DGDG+LD +VG +FG + G ++ G V +
Sbjct: 318 SSPTFADVDGDGDLDAVVGGAFGNLNYFKNTGSTTAPIYVQQTGTNNPFDGIVGSRKTPT 377
Query: 517 -ADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQ------------GPSIGD 563
AD++ DG ++ V D GN+ + G +++ T P+ D
Sbjct: 378 LADVDGDGDLDAVVGDRFGNLQYFKNTGSTTAPIYVQQTGTNNPFDGIAVGSFISPTFAD 437
Query: 564 VDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLT 623
VDGDG D +V GN+ G+ P + V+ T
Sbjct: 438 VDGDGDLDALVGDRFGNLNYFE-NTGTTTAP-------------IYVEQTGTNNP----- 478
Query: 624 IVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCF-- 681
FDG +D+G S S AD VDG DLD +V NGN+ F
Sbjct: 479 -----FDG--------------IDVGSFS-SPTFAD-VDGDGDLDALVGEFNGNLNYFEN 517
Query: 682 --STPAPHH 688
ST AP +
Sbjct: 518 TGSTTAPSY 526
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 83/219 (37%), Gaps = 47/219 (21%)
Query: 386 TPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQV 445
TP +AD+D DG + ++ + + N ++ K G YV +Q
Sbjct: 375 TPTLADVDGDGDLDAVVG-------DRFGNLQYFKNTGSTTAPIYV-----------QQT 416
Query: 446 KWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPL 505
D + SPT D+DGDG+LD LVG FG ++ G +
Sbjct: 417 GTNNPFD------GIAVGSFISPTFADVDGDGDLDALVGDRFGNLNYFENTGTTTAPIYV 470
Query: 506 EMA----EIQGAVVA-------ADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSL 554
E G V AD++ DG ++ + + +GN+ + G +++
Sbjct: 471 EQTGTNNPFDGIDVGSFSSPTFADVDGDGDLDALVGEFNGNLNYFENTGSTTAPSYVEQT 530
Query: 555 VTQG------------PSIGDVDGDGHSDVVVPTLSGNI 581
T P+ DVDGDG D +V GN+
Sbjct: 531 GTNNPFDGIDVGSISTPTFADVDGDGDLDALVGERYGNL 569
Score = 40.8 bits (94), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 56/133 (42%), Gaps = 19/133 (14%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVLDHHG-------KIREKFPLEMAEIQGAVVA--A 517
+PT D+D DG+LD +G S G + G + P + ++ + V A
Sbjct: 29 TPTFADIDNDGDLDAFIGNSAGNTLFFRNTGTAAAPTFTLEATNPFGLTDVGFSAVPTFA 88
Query: 518 DINDDGKIELVTTDTHGNVAAWTAEGKG------IWEQHLKSLVTQG----PSIGDVDGD 567
DI++DG ++ +T GN + G + + L G P+ D+DGD
Sbjct: 89 DIDNDGDLDAFVGNTTGNTLFFRKTGTATAPTFTLEATNPFGLTNAGSFAAPTFADIDGD 148
Query: 568 GHSDVVVPTLSGN 580
G D V +GN
Sbjct: 149 GDLDAFVSERNGN 161
Score = 39.7 bits (91), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 103/315 (32%), Gaps = 87/315 (27%)
Query: 386 TPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQV 445
TP ADIDNDG + I S N + G A F L+
Sbjct: 29 TPTFADIDNDGDLDAFIGNS-------AGNTLFFRNTG--------TAAAPTFTLEATNP 73
Query: 446 KWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHG-------- 497
TD+ S + PT D+D DG+LD VG + G G
Sbjct: 74 FGLTDVGFS-----------AVPTFADIDNDGDLDAFVGNTTGNTLFFRKTGTATAPTFT 122
Query: 498 -KIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK---- 552
+ F L A A ADI+ DG ++ ++ +GN + G +
Sbjct: 123 LEATNPFGLTNAGSFAAPTFADIDGDGDLDAFVSERNGNTLFFRNTGTAAAPTFTQEATN 182
Query: 553 --SLVTQG----PSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
L G P+ D+D DG D V GK+G+ + YR G
Sbjct: 183 PFGLTDVGSFVKPTFADIDNDGDLDAFV----------GGKNGNTLF---YRNTGTDTAP 229
Query: 607 VLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDD 666
T+ GLT DV D +++ ++DG D
Sbjct: 230 T----FTQEATNPFGLT--------------------DVGDFAAPTFT-----DIDGDGD 260
Query: 667 LDLIVTTMNGNVFCF 681
LD V NGN F
Sbjct: 261 LDAFVGNSNGNTLFF 275
>gi|307151429|ref|YP_003886813.1| peptidase M23 [Cyanothece sp. PCC 7822]
gi|306981657|gb|ADN13538.1| Peptidase M23 [Cyanothece sp. PCC 7822]
Length = 655
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 97/217 (44%), Gaps = 12/217 (5%)
Query: 472 DLDGDGNLDILVGTSFGLF--YVLDHHGKIREKFPLEMAEIQGAVVA-ADINDDGKIELV 528
DL+GD D+++ G ++L+ GK + A+ +VV D+N DG+ +LV
Sbjct: 230 DLNGDRRDDLVLRAPDGNIDGWLLNSQGKATSPHRIGFADNSWSVVGIGDLNGDGRDDLV 289
Query: 529 TTDTHGNVAAW--TAEGKGIWEQHLKSLVTQGPS---IGDVDGDGHSDVVVPTLSGNI-- 581
GN+ W ++GK H S IGD++GDG D+V+ GNI
Sbjct: 290 LRAPDGNIDGWLLNSQGKAT-SPHRIGFADNSWSVVGIGDLNGDGRDDLVLRAPDGNIDG 348
Query: 582 YVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTS 641
++L+ + G P+ + V + DL G L + DG+L G +
Sbjct: 349 WLLNSQ-GKATSPHRIGFADNSWSVVGIGDLNGDGRDDLVLRAPDGNIDGWLLNSQGKAT 407
Query: 642 CADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNV 678
+ + S+S+V +++G DL++ +GN+
Sbjct: 408 SPHRIGFADNSWSVVGIGDLNGDGRDDLVLRAPDGNI 444
Score = 43.1 bits (100), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 60/125 (48%), Gaps = 9/125 (7%)
Query: 465 YSSPTVVDLDGDGNLDILVGTSFGLF--YVLDHHGKIREKFPLEMAEIQGAVVA-ADIND 521
+S + DL+GDG D+++ G ++L+ GK + A+ +VV D+N
Sbjct: 321 WSVVGIGDLNGDGRDDLVLRAPDGNIDGWLLNSQGKATSPHRIGFADNSWSVVGIGDLNG 380
Query: 522 DGKIELVTTDTHGNVAAW--TAEGKGIWEQHLKSLVTQGPS---IGDVDGDGHSDVVVPT 576
DG+ +LV GN+ W ++GK H S IGD++GDG D+V+
Sbjct: 381 DGRDDLVLRAPDGNIDGWLLNSQGKAT-SPHRIGFADNSWSVVGIGDLNGDGRDDLVLRA 439
Query: 577 LSGNI 581
GNI
Sbjct: 440 PDGNI 444
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 76/170 (44%), Gaps = 9/170 (5%)
Query: 516 AADINDDGKIELVTTDTHGNVAAW--TAEGKGIWEQHLKSLVTQGPS---IGDVDGDGHS 570
++D+N D + +LV GN+ W ++GK H S IGD++GDG
Sbjct: 228 SSDLNGDRRDDLVLRAPDGNIDGWLLNSQGKAT-SPHRIGFADNSWSVVGIGDLNGDGRD 286
Query: 571 DVVVPTLSGNI--YVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTS 628
D+V+ GNI ++L+ + G P+ + V + DL G L +
Sbjct: 287 DLVLRAPDGNIDGWLLNSQ-GKATSPHRIGFADNSWSVVGIGDLNGDGRDDLVLRAPDGN 345
Query: 629 FDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNV 678
DG+L G + + + S+S+V +++G DL++ +GN+
Sbjct: 346 IDGWLLNSQGKATSPHRIGFADNSWSVVGIGDLNGDGRDDLVLRAPDGNI 395
>gi|115372123|ref|ZP_01459434.1| hemolysin-related protein [Stigmatella aurantiaca DW4/3-1]
gi|115370825|gb|EAU69749.1| hemolysin-related protein [Stigmatella aurantiaca DW4/3-1]
Length = 925
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/235 (25%), Positives = 100/235 (42%), Gaps = 33/235 (14%)
Query: 390 ADIDNDGVSEMIIAVSYFFDHE------YYDNPEHLKELGGIDIGK-----YVAGAIVVF 438
AD+D DG+ E+I S ++H+ + D + L +G D V+G IV
Sbjct: 183 ADLDGDGLQEVINGRS-IYEHDGALRCTHPDMGQGLAGVGNFDADPAGEVVIVSGGIVAL 241
Query: 439 NLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGK 498
KW L +PT+ D D DG +I V S + V++ G
Sbjct: 242 MDSDCTPKWRVSLQ--------GGGAGGAPTIADFDHDGQPEIGVAGS-ARYTVIETDGT 292
Query: 499 IREKFPLEMAEIQGAVVAA-DINDDGKIELVTTD-THGNVAAWTAEGKGIWEQHLKSLVT 556
++ P++ + + A +A D + DGK E++ D T + + H ++
Sbjct: 293 VKWTSPIQGSNLGAASSSAFDFDGDGKTEIIAADQTRLRIYDGQTGAVRVSLTHSSAIAF 352
Query: 557 QGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHG-----RVMNQ 606
+ DVDGD H+++VVP+ + I ++G +R Y +T R+ NQ
Sbjct: 353 ENAIAVDVDGDNHAELVVPSNNSTISGVAG-----IRVYREKTGAWVNTRRIWNQ 402
>gi|373457583|ref|ZP_09549350.1| hypothetical protein Calab_1395 [Caldithrix abyssi DSM 13497]
gi|371719247|gb|EHO41018.1| hypothetical protein Calab_1395 [Caldithrix abyssi DSM 13497]
Length = 990
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 54/122 (44%), Gaps = 10/122 (8%)
Query: 436 VVFNLDTKQVKWTTDLDLSTDNASFRAY-----IYSSPTVVDLDGDGNLDILVGTSFGLF 490
+VFN + K T+ TD A+ + +P +VDLDGDG +I+V + G
Sbjct: 748 IVFNSENKIFALNTNGTFVTDFPIEVAFDHADRLIGTPLIVDLDGDGEAEIVVASQNGGL 807
Query: 491 YVLDHHGKIREKFPLEM-AEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG---- 545
GK+ FPL + ++ G+ ++D +EL+ + G V W G
Sbjct: 808 LAFSRQGKLLVDFPLSIGGQMSGSPALVQWDEDAPLELLAVNDQGLVTVWELNYNGTPET 867
Query: 546 IW 547
IW
Sbjct: 868 IW 869
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 46/81 (56%), Gaps = 6/81 (7%)
Query: 469 TVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMA-----EIQGAVVAADINDDG 523
++ D+DG+G +DI+ + +F L+ +G FP+E+A + G + D++ DG
Sbjct: 736 SLADMDGNGRVDIVFNSENKIF-ALNTNGTFVTDFPIEVAFDHADRLIGTPLIVDLDGDG 794
Query: 524 KIELVTTDTHGNVAAWTAEGK 544
+ E+V +G + A++ +GK
Sbjct: 795 EAEIVVASQNGGLLAFSRQGK 815
Score = 40.0 bits (92), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 6/85 (7%)
Query: 80 IADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWP---AF-HQSSVHSSPLLYDIDKDG 135
+AD++ +G++DIV S + ++G + +P AF H + +PL+ D+D DG
Sbjct: 737 LADMDGNGRVDIVFNSENKIFAL--NTNGTFVTDFPIEVAFDHADRLIGTPLIVDLDGDG 794
Query: 136 VREIALATYNGEVLFFRVSGYMMTD 160
EI +A+ NG +L F G ++ D
Sbjct: 795 EAEIVVASQNGGLLAFSRQGKLLVD 819
>gi|444911914|ref|ZP_21232084.1| hypothetical protein D187_03785 [Cystobacter fuscus DSM 2262]
gi|444717561|gb|ELW58388.1| hypothetical protein D187_03785 [Cystobacter fuscus DSM 2262]
Length = 827
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 66/231 (28%), Positives = 99/231 (42%), Gaps = 27/231 (11%)
Query: 390 ADIDNDGVSEMIIAVS-YFFDHEYYDNPEHLKELGGIDIGKYVA---GAIVVFNLD---- 441
ADID DG E+I S Y D P + G +G + A G IVV D
Sbjct: 233 ADIDQDGRQELINGRSVYRADGSLLCAPSTVPH-GFAAVGNFDADDRGEIVVAGRDQVSL 291
Query: 442 ---TKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGK 498
T V W+ + P + D DGDG +I + + LF VL G
Sbjct: 292 VDDTCTVLWSVAVPGGG--------HGGVPNIADFDGDGQPEIGIAGNR-LFSVLKADGS 342
Query: 499 IREKFPL-EMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVT- 556
+ P+ +++ + + D + DGK+E+V TD A G ++E S T
Sbjct: 343 VLWSSPIRDLSSGKNSSTTFDFDGDGKLEVVFTDETYLRIYDGATGTVLFETKNSSGTTH 402
Query: 557 QGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKV-RPYPYRTHGRVMNQ 606
+GP + DVDGD +D+VV N + G +G +V +++ R+ NQ
Sbjct: 403 EGPVVADVDGDYQADIVV---GANNHAYPGFNGIRVFHGEGWKSARRIWNQ 450
>gi|406952333|gb|EKD81976.1| Cell surface protein, partial [uncultured bacterium]
Length = 160
Score = 50.8 bits (120), Expect = 0.003, Method: Composition-based stats.
Identities = 32/117 (27%), Positives = 61/117 (52%), Gaps = 5/117 (4%)
Query: 464 IYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDG 523
+ S+P + D+DG G L++ +G+ G+ +D G++ + + + + AVVA D++ G
Sbjct: 43 LISAPEIADIDGSGILNLFIGSKSGMVSRIDDSGRLIWEVRMNTS-VSAAVVARDLDGSG 101
Query: 524 KIELVTTDTHGNVAAWTAEG--KGIWEQHLKSLVTQGPSI--GDVDGDGHSDVVVPT 576
E++ D G V + G K W Q + +T ++ GD++GDG ++ T
Sbjct: 102 IKEIIAKDDGGKVMVFGVTGVPKQGWPQETAANMTWPFNVDAGDINGDGIKEIFTTT 158
Score = 41.6 bits (96), Expect = 2.1, Method: Composition-based stats.
Identities = 36/136 (26%), Positives = 57/136 (41%), Gaps = 13/136 (9%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGK------IREKFPLEMAEIQGAVVAADINDDG 523
+ DL DG +I+V + G Y L+ G+ R K + EI ADI+ G
Sbjct: 3 LADLTKDGATEIIVASQDGKVYCLNALGRENWVLNTRGKILISAPEI------ADIDGSG 56
Query: 524 KIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYV 583
+ L G V+ G+ IWE + + V+ D+DG G +++ G + V
Sbjct: 57 ILNLFIGSKSGMVSRIDDSGRLIWEVRMNTSVSAAVVARDLDGSGIKEIIAKDDGGKVMV 116
Query: 584 LSGKDGSKVRPYPYRT 599
G G + +P T
Sbjct: 117 F-GVTGVPKQGWPQET 131
>gi|145593177|ref|YP_001157474.1| FG-GAP repeat-containing protein [Salinispora tropica CNB-440]
gi|145302514|gb|ABP53096.1| FG-GAP repeat protein [Salinispora tropica CNB-440]
Length = 386
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 52/99 (52%), Gaps = 8/99 (8%)
Query: 504 PLEMAEIQGA--VVAADINDDGKIELVTTDTHG-NVAAWTAEGKGIWEQHLKSLVTQGP- 559
P+ +A GA +VAAD+N DG+I+LVT + H +V+ +G G + S GP
Sbjct: 195 PVHVAVGDGAADIVAADVNGDGQIDLVTGNHHSDDVSVLLGDGAGGVAPAISSAAGDGPH 254
Query: 560 --SIGDVDGDGHSDVVVPTLSGNI--YVLSGKDGSKVRP 594
+ GD DGDGH D+ P G+ +L DG+ P
Sbjct: 255 GIAAGDFDGDGHVDLATPDYEGDTTSVLLGSGDGTFAAP 293
>gi|158334417|ref|YP_001515589.1| FG-GAP repeat- calx-beta domain-containing protein [Acaryochloris
marina MBIC11017]
gi|158304658|gb|ABW26275.1| FG-GAP repeat/calx-beta domain protein [Acaryochloris marina
MBIC11017]
Length = 1543
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 65/133 (48%), Gaps = 9/133 (6%)
Query: 466 SSPTVVDLDGDGNLDILV--GTSFGLFYVL-DHHGKIREKFPLEMAEIQGAVVAADINDD 522
SS TV D +GD + D++ GTS + +L D G + + +V D N D
Sbjct: 352 SSVTVGDFNGDSHADLVTTNGTSDNVSVLLGDGSGGFSSQTTFAAGDSPSSVTVGDFNGD 411
Query: 523 GKIELVTTD-THGNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTL- 577
+LVTT+ T NV+ +G G + P +GD +GDGH+D+V+ L
Sbjct: 412 SHADLVTTNGTSDNVSVLLGDGSGGFSSQTTFAAGNTPRSIKVGDFNGDGHADLVMANLN 471
Query: 578 SGNIYVLSGKDGS 590
S N+ VL G DGS
Sbjct: 472 SNNVSVLVG-DGS 483
>gi|427727319|ref|YP_007073556.1| VCBS repeat-containing protein [Nostoc sp. PCC 7524]
gi|427363238|gb|AFY45959.1| VCBS repeat-containing protein [Nostoc sp. PCC 7524]
Length = 1829
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 59/140 (42%), Gaps = 25/140 (17%)
Query: 464 IYSSPTVVDLDGDGNLDILVGTSFG--LFYVLDHHGKI---------REKFPLEMAEIQG 512
+ ++PT D+DGDG+LD VG + G LFY + G + F L +
Sbjct: 21 LVAAPTFADIDGDGDLDAFVGNNSGNTLFY--QNTGTVTNPQFAAPSTNPFGLTNVGLYA 78
Query: 513 AVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQ-----------GPSI 561
A ADI+ DG ++ + GN + G Q +L T P+
Sbjct: 79 APTFADIDGDGDLDAFVGNRSGNTLFYQNTGTATNPQ-FAALTTNPFGLTNVGLYAAPTF 137
Query: 562 GDVDGDGHSDVVVPTLSGNI 581
D+DGDG D V L GNI
Sbjct: 138 ADIDGDGDLDAFVGNLDGNI 157
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 58/136 (42%), Gaps = 19/136 (13%)
Query: 464 IYSSPTVVDLDGDGNLDILVGTSFG--LFY-----VLDHHGKIREKFPLEMAEIQ--GAV 514
+Y+SPT+VD+DGDG+LD VG G LFY + P + ++ +
Sbjct: 297 LYASPTLVDIDGDGDLDAFVGNFDGNTLFYRNIGTATNPQFNTPTTNPFGLTDVGYLASP 356
Query: 515 VAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVT----------QGPSIGDV 564
ADI+ DG ++ + GN+ + G Q P++ D+
Sbjct: 357 TFADIDGDGDLDAFVGNFDGNILFYRNIGTATNPQFASPTTNPFGLTDVGYFAAPTLADI 416
Query: 565 DGDGHSDVVVPTLSGN 580
DGDG D V L GN
Sbjct: 417 DGDGDLDAFVGNLDGN 432
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 56/137 (40%), Gaps = 19/137 (13%)
Query: 464 IYSSPTVVDLDGDGNLDILVGTSFG--LFY-----VLDHHGKIREKFPLEMAEIQ--GAV 514
Y++PT+VD+DGDG+LD VG S G LFY + P + ++
Sbjct: 628 FYAAPTLVDIDGDGDLDAFVGNSEGNTLFYRNIGTATNPQFAAPTTNPFGLTDVGYLAKP 687
Query: 515 VAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVT----------QGPSIGDV 564
ADI+ DG ++ + GN+ + G Q P+ D+
Sbjct: 688 TFADIDGDGDLDAFVGNRDGNILFYRNIGTATNPQFASPTTNPFGLTGVGSFAAPTFADI 747
Query: 565 DGDGHSDVVVPTLSGNI 581
DGDG D V + NI
Sbjct: 748 DGDGDLDAFVINVDSNI 764
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 54/136 (39%), Gaps = 19/136 (13%)
Query: 464 IYSSPTVVDLDGDGNLDILVGTSFG--LFYV-------LDHHGKIREKFPLEMAEIQGAV 514
+Y++PT D+DGDG+LD VG G LFY F L + A
Sbjct: 76 LYAAPTFADIDGDGDLDAFVGNRSGNTLFYQNTGTATNPQFAALTTNPFGLTNVGLYAAP 135
Query: 515 VAADINDDGKIELVTTDTHGNVAAW----TAEGKGIWEQHLKSL------VTQGPSIGDV 564
ADI+ DG ++ + GN+ + TA + P+ D+
Sbjct: 136 TFADIDGDGDLDAFVGNLDGNILFYRNIGTASNTRFADPSTNPFGLTDVGFNAKPTFADI 195
Query: 565 DGDGHSDVVVPTLSGN 580
DGDG D V L GN
Sbjct: 196 DGDGDLDAFVGNLDGN 211
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 66/257 (25%), Positives = 97/257 (37%), Gaps = 45/257 (17%)
Query: 464 IYSSPTVVDLDGDGNLDILVGTSFG--LFYV-LDHHGKIR------EKFPLEMAEIQGAV 514
+Y++PT D+DGDG+LD VG G LFY + R F L
Sbjct: 131 LYAAPTFADIDGDGDLDAFVGNLDGNILFYRNIGTASNTRFADPSTNPFGLTDVGFNAKP 190
Query: 515 VAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSL----------VTQGPSIGDV 564
ADI+ DG ++ + GN + G Q + P+ D+
Sbjct: 191 TFADIDGDGDLDAFVGNLDGNTLFYRNIGTATNPQFASPTTNPFGLTDVGLVAAPTFADI 250
Query: 565 DGDGHSDVVVPTLSGNIYVLSGKDGS--KVRPYPYRTHGRVMNQV------LLVDLTKRG 616
DGDG D V L GN + S + +P T+ + V LVD+ G
Sbjct: 251 DGDGDLDAFVGNLDGNTNFSRNQPFSSPNINFHPPTTYPFGVTDVGLYASPTLVDIDGDG 310
Query: 617 EKSKGLTIVTTSFDGYLYL-----------IDGPTSCA-DVVDIGETSYSMVLADNVDGG 664
+ L +FDG + PT+ + D+G + S AD +DG
Sbjct: 311 D----LDAFVGNFDGNTLFYRNIGTATNPQFNTPTTNPFGLTDVGYLA-SPTFAD-IDGD 364
Query: 665 DDLDLIVTTMNGNVFCF 681
DLD V +GN+ +
Sbjct: 365 GDLDAFVGNFDGNILFY 381
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 50/117 (42%), Gaps = 7/117 (5%)
Query: 73 SIYATPLIADINSDGKLDIVVPSF----LHYLEVLEGSDGD-KMPGWPAFHQSSVH--SS 125
+YA+P + DI+ DG LD V +F L Y + ++ P F + V +S
Sbjct: 296 GLYASPTLVDIDGDGDLDAFVGNFDGNTLFYRNIGTATNPQFNTPTTNPFGLTDVGYLAS 355
Query: 126 PLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPV 182
P DID DG + + ++G +LF+R G + P VG + P
Sbjct: 356 PTFADIDGDGDLDAFVGNFDGNILFYRNIGTATNPQFASPTTNPFGLTDVGYFAAPT 412
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 98/246 (39%), Gaps = 42/246 (17%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFG--LFY-----VLDHHGKIREKFPLEMAEIQGAVVA-- 516
++PT D+DGDG+LD VG G +FY + P ++++ V
Sbjct: 906 AAPTFADIDGDGDLDAFVGHFDGNTMFYRNTGTATNPQFAAPTNNPFGLSDVGFNAVPTF 965
Query: 517 ADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK------SLVTQG----PSIGDVDG 566
ADI+ DG +++ + GN+ + G Q L G P+ D+DG
Sbjct: 966 ADIDGDGDLDVFVGNFDGNILLYRNTGTATNPQFAAPTTNPFGLTNVGTFAKPTFADIDG 1025
Query: 567 DGHSDVVVPTLSGNIYVLSGKDGSKVRPY-------PYR-THGRVMNQVLLVDLTKRGEK 618
DG D V GN G+ P P+ T R LVD+ G+
Sbjct: 1026 DGDLDAFVGNGDGNTLFYR-NTGTATNPQFNTPTTNPFGLTDVRAGAAPTLVDIDGDGDL 1084
Query: 619 SKGLTIVTTSFDGYLYLID----GPTSCADVVDIGETSYSMVLADNVDGGD-DLD---LI 670
T V S L+ ++ PT+ D V I E + +L NV D DLD L
Sbjct: 1085 D---TFVGNSDGNTLFFLNIPNTPPTAANDAVSINENT---LLNGNVLTNDSDLDGSPLT 1138
Query: 671 VTTMNG 676
VT +NG
Sbjct: 1139 VTQVNG 1144
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 34/132 (25%), Positives = 48/132 (36%), Gaps = 20/132 (15%)
Query: 469 TVVDLDGDGNLDILVGTSFG--LFY--------VLDHHGKIREKFPLEMAEIQGAVVAAD 518
T D+DGDG+LD VG +G FY ++ F L AD
Sbjct: 798 TFADIDGDGDLDAFVGERYGDTAFYQNQPAPPTPINFRPPSTNSFGLTDVGFSAVATLAD 857
Query: 519 INDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSL----------VTQGPSIGDVDGDG 568
I+ DG ++ + GN+ + G Q + P+ D+DGDG
Sbjct: 858 IDGDGDLDAFVGNFDGNILLYRNTGTATNPQFATPINNPFGLTDVGFNAAPTFADIDGDG 917
Query: 569 HSDVVVPTLSGN 580
D V GN
Sbjct: 918 DLDAFVGHFDGN 929
Score = 40.0 bits (92), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 45/114 (39%), Gaps = 7/114 (6%)
Query: 76 ATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPA-----FHQSSV--HSSPLL 128
A P ADI+ DG LD V F + P + A F S V ++ P
Sbjct: 906 AAPTFADIDGDGDLDAFVGHFDGNTMFYRNTGTATNPQFAAPTNNPFGLSDVGFNAVPTF 965
Query: 129 YDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDPV 182
DID DG ++ + ++G +L +R +G + P VG + P
Sbjct: 966 ADIDGDGDLDVFVGNFDGNILLYRNTGTATNPQFAAPTTNPFGLTNVGTFAKPT 1019
>gi|424808153|ref|ZP_18233555.1| hypothetical protein SX4_2071 [Vibrio mimicus SX-4]
gi|342324690|gb|EGU20471.1| hypothetical protein SX4_2071 [Vibrio mimicus SX-4]
Length = 880
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 78/166 (46%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++++ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPVVAQLNDDNGDGKIDEKDVADLVVVT---FEGNKYTNGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
YV G I+ +A + SP + DLDGDG ++I+ +
Sbjct: 299 NYVNGGIIA-------------------DARY------SPAIGDLDGDGIVEIVTANNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD+++DG IE++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDNDGSIEILAAD 379
>gi|167537350|ref|XP_001750344.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771172|gb|EDQ84843.1| predicted protein [Monosiga brevicollis MX1]
Length = 428
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 66/260 (25%), Positives = 111/260 (42%), Gaps = 29/260 (11%)
Query: 455 TDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLE---MAEIQ 511
T NA+ A +++ VDLD DG +D+L S+G V +H F + A
Sbjct: 180 TANAARAASVFA----VDLDNDGKMDVL-SASYGDDKVAWYHNTGNGTFSDQKVITAAAD 234
Query: 512 GA--VVAADINDDGKIELVTTDTHGNVAAWTAE-GKGIWEQHLKSLVTQGPS------IG 562
GA V AAD++ DG++++++ + AW G G + +S++T
Sbjct: 235 GARSVFAADLDRDGRVDVLSASFEDDTIAWYRNLGNGAFSA--RSVITSNADGATSVYAA 292
Query: 563 DVDGDGHSDVVVPTLSGNI--YVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSK 620
D+DGDGH DV+ +L+ N + + +GS P V DL + G
Sbjct: 293 DLDGDGHPDVLSSSLADNTIAWYQNHGNGSFSPPKIITNTASQAVSVYAADLDRDGHVD- 351
Query: 621 GLTIVTTSFDGYLYLI----DGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNG 676
I +S D + DG S ++ + V A ++DG D+D++ +
Sbjct: 352 --VISGSSADNKVAWFRNEGDGTFSVEIIISTAANVVTFVRAADLDGDGDMDVLSASSGD 409
Query: 677 NVFCFSTPAPHHPLKAWRSI 696
N + PA H ++ R +
Sbjct: 410 NKIAW-YPANTHLIEVLRRL 428
>gi|258621950|ref|ZP_05716979.1| conserved hypothetical protein [Vibrio mimicus VM573]
gi|258585703|gb|EEW10423.1| conserved hypothetical protein [Vibrio mimicus VM573]
Length = 957
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 78/166 (46%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++++ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPVVAQLNDDNGDGKIDEKDVADLVVVT---FEGNKYTNGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
YV G I+ +A + SP + DLDGDG ++I+ +
Sbjct: 299 NYVNGGIIA-------------------DARY------SPAIGDLDGDGIVEIVTANNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD+++DG IE++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDNDGSIEILAAD 379
>gi|334139398|ref|ZP_08512790.1| FG-GAP repeat protein [Paenibacillus sp. HGF7]
gi|333602211|gb|EGL13642.1| FG-GAP repeat protein [Paenibacillus sp. HGF7]
Length = 1073
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 66/161 (40%), Gaps = 35/161 (21%)
Query: 468 PTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP-------LEMAEIQG-------- 512
P D++GDG D+L G+S G YV + G E + L++ ++ G
Sbjct: 428 PAFADINGDGKEDLLAGSSDGYVYVYPNLGPKPEAYERDPAPGGLKLPDVYGKPEKLLLT 487
Query: 513 -----------AVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIW------EQHLKSLV 555
+V A D N DGKI+L+ +D G V +G G + + +
Sbjct: 488 SGQPLKTGPHASVHAVDWNADGKIDLLVSDATGAVQIALGQGGGRFAPLTALQDPSGPIK 547
Query: 556 TQGPSI---GDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVR 593
GP+I GD+ G G D+VV G + V +R
Sbjct: 548 VPGPAIAAAGDLRGQGFPDLVVGDADGKVRVYRAPQAGALR 588
Score = 46.2 bits (108), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 65/152 (42%), Gaps = 16/152 (10%)
Query: 453 LSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAE--- 509
L T + ++S VD + DG +D+LV + G + G R PL +
Sbjct: 485 LLTSGQPLKTGPHASVHAVDWNADGKIDLLVSDATGAVQIALGQGGGRFA-PLTALQDPS 543
Query: 510 ----IQGAVVAA--DINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVT------Q 557
+ G +AA D+ G +LV D G V + A G ++V
Sbjct: 544 GPIKVPGPAIAAAGDLRGQGFPDLVVGDADGKVRVYRAPQAGALRLSAGAVVADVGAKYA 603
Query: 558 GPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDG 589
PS+ D+DGDG +D+VV + G++ V + G
Sbjct: 604 APSVRDMDGDGRADLVVGSNEGDLRVFRQESG 635
>gi|94970007|ref|YP_592055.1| integrin-like protein [Candidatus Koribacter versatilis Ellin345]
gi|94552057|gb|ABF41981.1| Integrin-like protein [Candidatus Koribacter versatilis Ellin345]
Length = 1126
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 64/132 (48%), Gaps = 4/132 (3%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGA--VVAADINDDGKIEL 527
V D +GDG D+ V S G VL +G + P I+ + + D+N DGK+++
Sbjct: 590 VGDFNGDGAPDVAVSNSDGGIAVLLGNGDGTLRAPQLFPAIKSSQSLAIGDLNRDGKLDI 649
Query: 528 VTTDTHGNVAAWTAEGKGIWEQH--LKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLS 585
V +D G+++ + G G ++ ++ +Q ++GD +GDG DV T +L
Sbjct: 650 VASDGSGSISIFLGNGDGTFQTSKVYAAVGSQSVTVGDFNGDGILDVASGTGHTVSLLLG 709
Query: 586 GKDGSKVRPYPY 597
DGS P Y
Sbjct: 710 NGDGSLQPPVNY 721
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 74/271 (27%), Positives = 112/271 (41%), Gaps = 47/271 (17%)
Query: 384 LSTPV---IADIDNDGVSEMIIA------VSYFFDHEYYDNPEHLKELGGIDIGKYVAGA 434
LS+PV AD++ DG ++ +A +S F + H + +G V
Sbjct: 100 LSSPVSIAAADLNGDGKLDLAVANSGSGSISVFLGNGDGTFQSHTD----VAVGTSVQML 155
Query: 435 IVV-FNLDTKQ-----VKWTTDLDLSTDNASFRAYI-YSSPTVV-----DLDGDGNLDIL 482
V FN D K V + + +A+F A Y+ P D +GDG DI+
Sbjct: 156 TVADFNGDGKPDLAVLVDGMVSVLIGKGDATFNAIGEYAKPCATYLATGDFNGDGKTDIV 215
Query: 483 VGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAA--DINDDGKIELV---TTDTHGN-- 535
G VL +G + P+ +I V A DIN DGK++L+ D+ G
Sbjct: 216 AGRQC----VLLGNGDGTFQPPVGSQKIGNTVSTAVGDINGDGKLDLIEGGIGDSDGTPR 271
Query: 536 --VAAWTAEGKGIWE--QHLKSLVT--QGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDG 589
V G G ++ Q + QG + DV+GD H D+V+ + S N+ V++GK
Sbjct: 272 ALVVVLLGNGDGTFQPPQGFFGYGSGVQGLLLADVNGDSHPDIVLSS-SENVEVVNGKGD 330
Query: 590 SKVRP---YPYRTHGRVMNQVLLVDLTKRGE 617
P YP V ++L D T G
Sbjct: 331 GTFEPGVLYPVGNR-PVAGGLVLSDFTGSGR 360
>gi|86610223|ref|YP_478985.1| FG-GAP repeat- ASPIC/UnbV domain-containing protein [Synechococcus
sp. JA-2-3B'a(2-13)]
gi|86558765|gb|ABD03722.1| FG-GAP repeat/ASPIC/UnbV domain protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 454
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 78/169 (46%), Gaps = 27/169 (15%)
Query: 469 TVVDLDGDGNLDILVGTSFG--LFYVLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGK 524
VDLDGDG LD+L G G ++ G ++ P E+A V+AAD ++DG
Sbjct: 233 AAVDLDGDGRLDLLYGNWEGPHRLWIQTPDGFFKDVAPPELARPSRIRTVIAADFDNDGY 292
Query: 525 IELVTTDTH--GNVAAWTAEGKGIWE-------QHLKSLVTQGPSIGDVDGDGHSDVVVP 575
EL + + AW + G W+ + L T G ++ DVDGDG +++
Sbjct: 293 PELFFNNIGEPNRLFAWRS---GRWQAIDIGDAAEPQGLGT-GAAVADVDGDGRLELL-- 346
Query: 576 TLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTI 624
+ G+ G++ + YR N + ++ LT++G ++G +
Sbjct: 347 -------IAHGESGAQPLSF-YRPQANANNWLRVLPLTRQGAPARGALV 387
>gi|294507154|ref|YP_003571212.1| FG-GAP repeat domain-containing protein [Salinibacter ruber M8]
gi|294343482|emb|CBH24260.1| FG-GAP repeat domain protein [Salinibacter ruber M8]
Length = 612
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 70/278 (25%), Positives = 118/278 (42%), Gaps = 54/278 (19%)
Query: 466 SSPTVVDLDGDGNLDILVGTSF-------GLFYVLDHHG-----KIREKFPLEMAEI-QG 512
S+ + D+ GDG D+LV L Y L G ++R++ PL++ + +
Sbjct: 364 STVALGDIQGDGQTDVLVANKIDPTQGETSLAYPLTPTGDKGPPRLRKRSPLDLPDAYRY 423
Query: 513 AVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSL--VTQG----PSIGDVDG 566
A D+N DG +L+ G ++ +G G ++ +L ++ G P++GDV G
Sbjct: 424 APALGDLNGDGADDLILGTWQGALSYHENQGDGTFDAVDGALDGLSGGSNVVPALGDVTG 483
Query: 567 DGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVT 626
+G D+V+ T SG + L G + P + + G V L +L R T+
Sbjct: 484 NGAPDLVLGTASGTL-TLHRNTGGEASPN-FASEG-----VTLAELNGRAAP----TLHD 532
Query: 627 TSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTT---MNGNVFCFST 683
+ DG L L+ G T +VL N G D L+ T ++G
Sbjct: 533 VTGDGRLDLLVG------------TKTGLVLLRNQGGTDGLEFGPPTSVNLDG------- 573
Query: 684 PAPHHPLKAWRSIN-QGRNNVAIRYNRAGIYVTHPSRA 720
P + AW ++ GR ++ + R G+ + PS A
Sbjct: 574 -IPRNATPAWGDLDGDGRTDLVVGGKRGGLVLFQPSAA 610
>gi|449146859|ref|ZP_21777610.1| hemolysin-related protein [Vibrio mimicus CAIM 602]
gi|449077353|gb|EMB48336.1| hemolysin-related protein [Vibrio mimicus CAIM 602]
Length = 957
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 77/166 (46%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++++ F+ Y N ++ L GID +
Sbjct: 242 NQVMVTPVVAQLNDDNGDGKIDEKDVADLVVVT---FEGNKYANGGLVRALSGIDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G I+ +A + SP + DLDGDG ++I+ +
Sbjct: 299 NYANGGIIA-------------------DARY------SPAIGDLDGDGIVEIVTANNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD+++DG IE++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDNDGSIEILAAD 379
>gi|262165178|ref|ZP_06032915.1| hemolysin-related protein RbmC [Vibrio mimicus VM223]
gi|262024894|gb|EEY43562.1| hemolysin-related protein RbmC [Vibrio mimicus VM223]
Length = 957
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 77/166 (46%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++++ F+ Y N ++ L GID +
Sbjct: 242 NQVMVTPVVAQLNDDNGDGKIDEKDVADLVVVT---FEGNKYANGGLVRALSGIDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G I+ +A + SP + DLDGDG ++I+ +
Sbjct: 299 NYANGGIIA-------------------DARY------SPAIGDLDGDGIVEIVTANNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD+++DG IE++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDNDGSIEILAAD 379
>gi|448727188|ref|ZP_21709557.1| hypothetical protein C448_10961 [Halococcus morrhuae DSM 1307]
gi|445791716|gb|EMA42347.1| hypothetical protein C448_10961 [Halococcus morrhuae DSM 1307]
Length = 414
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 44/176 (25%), Positives = 74/176 (42%), Gaps = 13/176 (7%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQ-GAVVAADINDDGK 524
+ PT+ D D DG +++ T+ L LD +E F + + + AD+ DG
Sbjct: 106 ADPTLADFDDDGTKEVVATTTEDLVTALDPQTGEQE-FTYNLTDYGYTQPLVADVTGDGA 164
Query: 525 IELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVL 584
E V D G G +W + L + P++ D GD S++ V +G VL
Sbjct: 165 NETVVVDFRGTAFVLRPNGSAVWTEKLDAKTEAQPAVADFTGDNESEIAVG--AGQKVVL 222
Query: 585 SGKDGSKV--RPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDG 638
++G+ R P+ + M T + ++ + IV +F G + IDG
Sbjct: 223 LDRNGTVRWNRTQPFDSSITWMT-------TGQADEDAAIEIVAATFGGRVVAIDG 271
>gi|444909700|ref|ZP_21229890.1| hypothetical protein D187_03592 [Cystobacter fuscus DSM 2262]
gi|444720072|gb|ELW60859.1| hypothetical protein D187_03592 [Cystobacter fuscus DSM 2262]
Length = 806
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 64/139 (46%), Gaps = 15/139 (10%)
Query: 456 DNASFRAYIYSSPTVVDLDGDGNLDIL--VGTSFG--LFYVLDHHGKIREKFPLEMAEIQ 511
D+A F +S VD DGDG LD++ V S FY D HG +R F + +
Sbjct: 672 DSAQFWNEAWS----VDADGDGRLDVVNRVRPSVAELRFYPGDGHGTLRAPFTCALPPLG 727
Query: 512 GAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG-----IWEQHLKSLVTQGPSIGDVDG 566
D+N DG+++++ +T W A G+G W L VT ++ D+ G
Sbjct: 728 RLFAWEDVNGDGRVDVI-AETQDGQGLWLALGQGQGKWSTWSYPLDGAVTWARAV-DLLG 785
Query: 567 DGHSDVVVPTLSGNIYVLS 585
D ++VV SG + V S
Sbjct: 786 DDRPELVVLMSSGELRVFS 804
>gi|448729502|ref|ZP_21711817.1| hypothetical protein C449_06945 [Halococcus saccharolyticus DSM
5350]
gi|445794804|gb|EMA45342.1| hypothetical protein C449_06945 [Halococcus saccharolyticus DSM
5350]
Length = 414
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 54/126 (42%), Gaps = 10/126 (7%)
Query: 515 VAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVV 574
V AD+ ELV D G V G +W + L S P++ D D D ++VV
Sbjct: 155 VVADLTGKETNELVVVDVQGTVFVLRPNGTAVWTETLSSYTWGQPTVEDFDADNRREIVV 214
Query: 575 PTLSGNIYVLSGKDGSKV--RPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGY 632
T SG + VL DG+ R P+ + M T + + + IVT + DG
Sbjct: 215 GTSSGQL-VLFEHDGTTAWNRTRPFESSITWMT-------TGQADDDRATEIVTATVDGR 266
Query: 633 LYLIDG 638
+ IDG
Sbjct: 267 VTAIDG 272
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 73/189 (38%), Gaps = 19/189 (10%)
Query: 378 NVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVV 437
N H ++ P AD D E +IA S + Y P +++ D+ Y VV
Sbjct: 99 NCTIHAVADPTFADFVGDDTKE-VIATSTEQEVAAY-RPRTGEQVFQYDLTDYGYTQPVV 156
Query: 438 FNLDTKQVKWTTDLDL------------STDNASFRAYIYSSPTVVDLDGDGNLDILVGT 485
+L K+ +D+ + + +Y + PTV D D D +I+VGT
Sbjct: 157 ADLTGKETNELVVVDVQGTVFVLRPNGTAVWTETLSSYTWGQPTVEDFDADNRREIVVGT 216
Query: 486 SFGLFYVLDHHGKI--REKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAE- 542
S G + +H G P E + AD DD E+VT G V A E
Sbjct: 217 SSGQLVLFEHDGTTAWNRTRPFESSITWMTTGQAD--DDRATEIVTATVDGRVTAIDGER 274
Query: 543 GKGIWEQHL 551
G W++
Sbjct: 275 GAVQWQRDF 283
>gi|158521764|ref|YP_001529634.1| FG-GAP repeat-containing protein [Desulfococcus oleovorans Hxd3]
gi|158510590|gb|ABW67557.1| FG-GAP repeat protein [Desulfococcus oleovorans Hxd3]
Length = 768
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 80/211 (37%), Gaps = 48/211 (22%)
Query: 377 VNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIV 436
+NVD S P ADID DG + + Y + YD + LG ++ + AG
Sbjct: 249 LNVDVGDSSAPAFADIDGDGDLDAFVGSKY----QSYDVSSGI--LG--NVYSFTAGIAY 300
Query: 437 VFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHH 496
NL + T + YS+P D+DGDG+LD VG +G ++
Sbjct: 301 FENLGSVSAPMFTSCNPDNPFDGLETDKYSAPAFADIDGDGDLDAFVGGMYGNLEFFENT 360
Query: 497 GKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHG-NVAAWTAEGKGIWEQHLKSLV 555
G + V T+ HG N AW A+ G + +
Sbjct: 361 GTVDSP-------------------------VMTERHGKNNPAWGAD-VGFYSK------ 388
Query: 556 TQGPSIGDVDGDGHSDVVVPTLSGNIYVLSG 586
P+ D+DGDG D V G +L+G
Sbjct: 389 ---PTFVDIDGDGDLDAFV----GEFGILAG 412
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 56/236 (23%), Positives = 90/236 (38%), Gaps = 63/236 (26%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDH-EYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTK 443
+ P DIDNDG + + Y + EY++N IG
Sbjct: 143 AAPAFVDIDNDGDMDAFVGSGYGYGSIEYFEN-----------IGT------------AS 179
Query: 444 QVKWTTDLDLSTDNASFRAYI--YSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIRE 501
++T T N F + S+P+ VD+DGDG++D+ +G+ G ++ G
Sbjct: 180 APEFTPYAGTETFNNPFGIWSPSESTPSFVDIDGDGDMDLFIGSESGGITYFENIGTASA 239
Query: 502 KF------PLEM-AEIQGAVVAADINDDGKIEL----------VTTDTHGNVAAWTAEGK 544
PL + A ADI+ DG ++ V++ GNV ++TA G
Sbjct: 240 AMFTGTQNPLNVDVGDSSAPAFADIDGDGDLDAFVGSKYQSYDVSSGILGNVYSFTA-GI 298
Query: 545 GIWEQ-------------------HLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNI 581
+E L++ P+ D+DGDG D V + GN+
Sbjct: 299 AYFENLGSVSAPMFTSCNPDNPFDGLETDKYSAPAFADIDGDGDLDAFVGGMYGNL 354
>gi|290956379|ref|YP_003487561.1| hypothetical protein SCAB_18711 [Streptomyces scabiei 87.22]
gi|260645905|emb|CBG68996.1| putative secreted protein [Streptomyces scabiei 87.22]
Length = 457
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 54/179 (30%), Positives = 81/179 (45%), Gaps = 19/179 (10%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG-----AVVAADINDDGKIE 526
+ DG G+L + S LF L GK F + QG +VVA D DGK +
Sbjct: 266 NADGKGDLVAVNKDSGALFLYL---GKGDGHFSAAVQIGQGWQSMSSVVAGDFTGDGKAD 322
Query: 527 LVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS------IGDVDGDGHSDVVVPT-LSG 579
+V D + N+ ++ G + H K + QG S GDV+GDG +D+V +G
Sbjct: 323 IVAVDANSNLQLYSGTGTDV--NHTKQ-IGQGWSGMTNLAAGDVNGDGRADIVAMNRTTG 379
Query: 580 NIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDG 638
+I++ G T+ MN++ + D+ G K+ +T TS D YLY G
Sbjct: 380 DIHLYKGDSSGVTSAGVIGTNFTAMNRLTVADINDDG-KADVVTTHATSGDLYLYTSSG 437
>gi|223940606|ref|ZP_03632450.1| FG-GAP repeat protein [bacterium Ellin514]
gi|223890724|gb|EEF57241.1| FG-GAP repeat protein [bacterium Ellin514]
Length = 443
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 76/166 (45%), Gaps = 20/166 (12%)
Query: 440 LDTKQVKWTTDLDLSTDNASFRAYIYSSPTV---------VDLDGDGNLDILVGTSF-GL 489
+D +W L + T+N + S+P V D++GDG +D++ +
Sbjct: 200 MDLICARWDNHLLVFTNNGMGGFAVSSTPVVGANPVSVVAADVNGDGRMDLITANLWDNT 259
Query: 490 FYVLDHHGKIREKFPLEMAEIQG----AVVAADINDDGKIELVTT-DTHGNVAAWTAEGK 544
VL + G F L + G +V AAD+N DG+++L++ D+ G + T +G
Sbjct: 260 LSVLTNDGS--GGFTLAASPSVGVNPYSVAAADVNGDGRLDLISANDSDGTLTVLTNDGA 317
Query: 545 GIWEQHLKSLVTQGPS---IGDVDGDGHSDVVVPTLSGNIYVLSGK 587
G + V+ GP + DVDGDG D++ + I V+ K
Sbjct: 318 GGFVIATNITVSSGPDSLVVADVDGDGRLDLISANIDHGISVVLNK 363
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 59/120 (49%), Gaps = 10/120 (8%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGA----VVAA 517
Y +S T D++GDG +D++ V ++G F + + GA VVAA
Sbjct: 183 GYFPTSVTAADVNGDGRMDLICARWDNHLLVFTNNG--MGGFAVSSTPVVGANPVSVVAA 240
Query: 518 DINDDGKIELVTTDTHGN-VAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVV 573
D+N DG+++L+T + N ++ T +G G + V P + DV+GDG D++
Sbjct: 241 DVNGDGRMDLITANLWDNTLSVLTNDGSGGFTLAASPSVGVNPYSVAAADVNGDGRLDLI 300
>gi|258625400|ref|ZP_05720294.1| conserved hypothetical protein [Vibrio mimicus VM603]
gi|258582311|gb|EEW07166.1| conserved hypothetical protein [Vibrio mimicus VM603]
Length = 957
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 77/166 (46%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++++ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPVVAQLNDDNGDGKIDEKDVADLVVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G I+ +A + SP + DLDGDG ++I+ +
Sbjct: 299 NYANGGIIA-------------------DARY------SPAIGDLDGDGIVEIVTANNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD+++DG IE++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDNDGSIEILAAD 379
>gi|262172057|ref|ZP_06039735.1| hemolysin-related protein RbmC [Vibrio mimicus MB-451]
gi|261893133|gb|EEY39119.1| hemolysin-related protein RbmC [Vibrio mimicus MB-451]
Length = 957
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 77/166 (46%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++++ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPVVAQLNDDNGDGKIDEKDVADLVVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G I+ +A + SP + DLDGDG ++I+ +
Sbjct: 299 NYANGGIIA-------------------DARY------SPAIGDLDGDGIVEIVTANNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD+++DG IE++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDNDGSIEILAAD 379
>gi|119493454|ref|ZP_01624123.1| hypothetical protein L8106_08556 [Lyngbya sp. PCC 8106]
gi|119452698|gb|EAW33877.1| hypothetical protein L8106_08556 [Lyngbya sp. PCC 8106]
Length = 492
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 66/149 (44%), Gaps = 17/149 (11%)
Query: 498 KIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGN-VAAWTAEGKGIW--------- 547
KI+ P + E +G ++ AD+N+D + + T H N +A + G+ +W
Sbjct: 53 KIKLDLP-PLKEGKGGLIVADVNNDKHKDFLVT--HPNKIAVYDHSGQKLWVKSVNLQLT 109
Query: 548 ----EQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRV 603
Q L L G GD++ DG ++V+ + +++L GK+G +R ++
Sbjct: 110 NKSENQGLPGLHAPGIQAGDINQDGQAEVLFLSSDHKLHILEGKNGKTIREIQLKSPNGS 169
Query: 604 MNQVLLVDLTKRGEKSKGLTIVTTSFDGY 632
LV RG+ L + T+ GY
Sbjct: 170 QGWEHLVIANFRGKGDSDLLLQATNAKGY 198
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 87/222 (39%), Gaps = 43/222 (19%)
Query: 424 GIDIGKYVAGAIV--VFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDI 481
G +G+Y+A + D Q W D +F A ++ V DL+GD NLD
Sbjct: 197 GYRMGRYLAAYATDELLKTDNPQPLWERD--------NFVANAHNGVRVADLNGD-NLDE 247
Query: 482 LVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHG------- 534
++G + ++ G+I K PL+ I VA D +E+V + G
Sbjct: 248 VLGGT-----IISSQGEILLKLPLK-GHIDSLFVADVRPDIPGLEVVVLEEGGEQKKDSG 301
Query: 535 ---------NVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLS 585
V + AE K IWE K+ Q +IGD D P G
Sbjct: 302 NSKNQLKRNRVFLYNAE-KIIWETDYKNWEPQNAAIGDFD---------PNRPGLEIWCR 351
Query: 586 GKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTT 627
+ + +P+ + G V++ L ++ G +G+ ++ T
Sbjct: 352 SRFNTNQKPFVFDAKGEVISNYTLNEIAPPGWTKEGVDVIFT 393
>gi|83816441|ref|YP_445284.1| FG-GAP repeat-containing protein [Salinibacter ruber DSM 13855]
gi|83757835|gb|ABC45948.1| FG-GAP repeat domain protein [Salinibacter ruber DSM 13855]
Length = 619
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 69/278 (24%), Positives = 117/278 (42%), Gaps = 54/278 (19%)
Query: 466 SSPTVVDLDGDGNLDILVGTSF-------GLFYVLDHHG-----KIREKFPLEMAEI-QG 512
S+ + D+ GDG D+LV L Y L G ++R++ PL++ + +
Sbjct: 371 STVALGDIQGDGQTDVLVANKIDPTQGETSLVYPLTPTGDKGPPRLRKRSPLDLPDAYRY 430
Query: 513 AVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSL--VTQG----PSIGDVDG 566
A D+N DG +L+ G ++ +G G ++ +L ++ G P++GDV G
Sbjct: 431 APALGDLNGDGADDLILGTWQGALSYHENQGDGTFDAVDGALDGLSGGSNVVPALGDVTG 490
Query: 567 DGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVT 626
+G D+V+ T SG + L G + P + + G V L +L R T+
Sbjct: 491 NGAPDLVLGTASGTL-TLHRNTGGEASPN-FASEG-----VTLAELNGRAAP----TLHD 539
Query: 627 TSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTT---MNGNVFCFST 683
+ DG L L+ G T +VL N G D L+ T ++G
Sbjct: 540 VTGDGRLDLLVG------------TKTGLVLLRNQGGTDGLEFGPPTSVNLDG------- 580
Query: 684 PAPHHPLKAWRSIN-QGRNNVAIRYNRAGIYVTHPSRA 720
P + W ++ GR ++ + R G+ + PS A
Sbjct: 581 -IPRNATPVWGDLDGDGRTDLVVGGKRGGLVLFQPSAA 617
>gi|115372719|ref|ZP_01460025.1| vcbs [Stigmatella aurantiaca DW4/3-1]
gi|310823451|ref|YP_003955809.1| FG-GAP repeat protein [Stigmatella aurantiaca DW4/3-1]
gi|115370200|gb|EAU69129.1| vcbs [Stigmatella aurantiaca DW4/3-1]
gi|309396523|gb|ADO73982.1| FG-GAP repeat protein [Stigmatella aurantiaca DW4/3-1]
Length = 803
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 61/115 (53%), Gaps = 7/115 (6%)
Query: 466 SSPTVVDLDGDGNLDILVGTSF-GLFYVLDHHGK--IREKFPLEMAEIQGAVVAADINDD 522
+S DL+GDG D+ V ++ G VL ++G+ P + E +VVAA++N D
Sbjct: 524 TSVAAADLNGDGRPDLAVTSALSGTVEVLLNYGQGTFAAAVPYNVGEYPSSVVAAELNGD 583
Query: 523 GKIEL-VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVV 573
GK +L VTT +V+ +G G + + V GP + D++GDG +D+V
Sbjct: 584 GKPDLAVTTQESHHVSVLLNQGTGTFAAAVPYDVGAGPKAVTAADLNGDGVADLV 638
Score = 46.2 bits (108), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 82/191 (42%), Gaps = 15/191 (7%)
Query: 442 TKQVKWTTDLDLSTDNASFRAYIY-------SSPTVVDLDGDGNLDILVGT--SFGLFYV 492
T + T ++ L+ +F A + SS +L+GDG D+ V T S + +
Sbjct: 542 TSALSGTVEVLLNYGQGTFAAAVPYNVGEYPSSVVAAELNGDGKPDLAVTTQESHHVSVL 601
Query: 493 LDHH-GKIREKFPLEMAEIQGAVVAADINDDGKIELVTT-DTHGNVAAWTAEGKGIWEQH 550
L+ G P ++ AV AAD+N DG +LVT DT V+ +G G +
Sbjct: 602 LNQGTGTFAAAVPYDVGAGPKAVTAADLNGDGVADLVTANDTSATVSVLLNQGNGTFAAA 661
Query: 551 LKSLVTQGPS---IGDVDGDGHSDVVVPT-LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
+ V PS D++GDG D+ V S ++ VL +D + P + G
Sbjct: 662 VPFSVGLRPSSVTAADLNGDGKVDLAVANKASHDVSVLLNQDRAAFAPAVHYQAGTEPQG 721
Query: 607 VLLVDLTKRGE 617
V DL G+
Sbjct: 722 VTAADLNGDGK 732
Score = 45.8 bits (107), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 74/160 (46%), Gaps = 8/160 (5%)
Query: 466 SSPTVVDLDGDGNLDILV--GTSFGLFYVLDH-HGKIREKFPLEMAEIQGAVVAADINDD 522
+S V DL+ DG D+ V TS + +L+ +G+ P + G+V AAD+N D
Sbjct: 182 ASLAVADLNQDGKPDLAVSNATSGDVSVLLNQGNGRFTPALPANVGAKLGSVAAADLNGD 241
Query: 523 GKIELVTTDTHG-NVAAWTAEGKGIWEQHLKSLVTQGPS---IGDVDGDGHSDVVVP-TL 577
GK +L +T V+ +G G + V + PS + D++GDG D+V+ T
Sbjct: 242 GKPDLAIANTGSRTVSILLNQGSGTFAPADFYTVDEDPSSVAVADLNGDGKPDLVIANTG 301
Query: 578 SGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGE 617
S ++ VL + + G + V VDL G+
Sbjct: 302 SASVSVLLNQGNGTFSAGGRASTGAFPSSVTAVDLNGDGK 341
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 83/191 (43%), Gaps = 37/191 (19%)
Query: 453 LSTDNASFRAYIY-------SSPTVVDLDGDGNLDILVGTSF-------------GLFYV 492
L+ N SF I+ + +DL+GDG LD L SF G F
Sbjct: 406 LNRGNGSFENAIHYGGGKRLCAVAALDLNGDGKLD-LAALSFARGSVSTFLNQGDGTFAA 464
Query: 493 LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTH-GNVAAWTAEGKGIWEQHL 551
HH ++ P +A A D+N DG+I+LVT + + N++ G+G + +
Sbjct: 465 AVHH-SVKTN-PQALA-------ATDLNHDGQIDLVTVNYNLSNLSVLMNRGQGSFAPAI 515
Query: 552 KSLVT---QGPSIGDVDGDGHSDVVVPT-LSGNIYVLSG-KDGSKVRPYPYRTHGRVMNQ 606
+S V + D++GDG D+ V + LSG + VL G+ PY G +
Sbjct: 516 QSKVGTNLTSVAAADLNGDGRPDLAVTSALSGTVEVLLNYGQGTFAAAVPYNV-GEYPSS 574
Query: 607 VLLVDLTKRGE 617
V+ +L G+
Sbjct: 575 VVAAELNGDGK 585
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 65/255 (25%), Positives = 99/255 (38%), Gaps = 32/255 (12%)
Query: 470 VVDLDGDGNLDILVGTSFG--LFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIEL 527
DL+GDG D++ S + +L+ G ++ AD+N DGK +L
Sbjct: 138 AADLNGDGQTDLVAANSGSNNVSVLLNQGGTFSPATHYSAGTSPASLAVADLNQDGKPDL 197
Query: 528 -VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----DVDGDGHSDVVVP-TLSGN 580
V+ T G+V+ +G G + L + V G +G D++GDG D+ + T S
Sbjct: 198 AVSNATSGDVSVLLNQGNGRFTPALPANV--GAKLGSVAAADLNGDGKPDLAIANTGSRT 255
Query: 581 IYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLI---D 637
+ +L + P + T + V + DL G+ L I T L+ +
Sbjct: 256 VSILLNQGSGTFAPADFYTVDEDPSSVAVADLNGDGKPD--LVIANTGSASVSVLLNQGN 313
Query: 638 GPTSCADVVDIGETSYSMVLAD-NVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSI 696
G S G S+ D N DG DL NGN + S I
Sbjct: 314 GTFSAGGRASTGAFPSSVTAVDLNGDGKPDL----AVANGNSYDVSV-----------LI 358
Query: 697 NQGRNNVAIRYNRAG 711
NQG A+ + AG
Sbjct: 359 NQGDGTFAVAHYGAG 373
>gi|403278960|ref|XP_003931047.1| PREDICTED: integrin alpha-9 [Saimiri boliviensis boliviensis]
Length = 991
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 79/203 (38%), Gaps = 40/203 (19%)
Query: 412 YYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVV 471
++ P +GG K + G + +F D + T T +Y SS V
Sbjct: 211 HFSQPSTTDVVGGAPQDKGI-GKVYIFRADRRS--GTLIKIFETSGKKMGSYFGSSLCAV 267
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTD 531
DL+ DG D+LV GA + ++I D+G++ +
Sbjct: 268 DLNEDGLSDLLV----------------------------GAPMFSEIRDEGQVTVYINR 299
Query: 532 THGNV-AAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT-----LSGNIYVLS 585
+G + T G G + H + S+GD+D DG SDV + +G +Y+
Sbjct: 300 GNGALEEQLTLTGDGAYNAHFGESIA---SLGDLDDDGFSDVAIGAPKEDDFAGAVYIYH 356
Query: 586 GKDGSKVRPYPYRTHGRVMNQVL 608
G G V Y + GR ++ VL
Sbjct: 357 GDAGGIVPQYSMKLSGRKISPVL 379
>gi|122945904|sp|Q0JRK4.1|CGLA_ALTCA RecName: Full=Lambda-carrageenase; Flags: Precursor
gi|113927303|emb|CAL37005.1| pre-lambda-carrageenase [Pseudoalteromonas carrageenovora]
Length = 942
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 60/140 (42%), Gaps = 11/140 (7%)
Query: 523 GKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIY 582
G ++TT G VAA+ G+ +WE L + + D++GDG ++ GN+Y
Sbjct: 49 GGSSIITTSYEGTVAAYKFNGEKLWENELSGFMNHDIWVQDINGDGLVEIFAANADGNVY 108
Query: 583 VLSGKDGSKVRPYPYRTHGRVMNQVLLV-DLTKRGEKSKGLTIVTTSFDGYLYLIDGPTS 641
++ DGS + + + MN V ++ D K+ +V +D LY I
Sbjct: 109 CINS-DGS--LKWTFGLNEVPMNSVTVISDADKK-------YVVAGGYDKNLYYISTNGE 158
Query: 642 CADVVDIGETSYSMVLADNV 661
++ G S V D V
Sbjct: 159 LLKTIESGTYSEEGVFGDGV 178
>gi|186680837|ref|YP_001864033.1| Na-Ca exchanger/integrin-beta4 [Nostoc punctiforme PCC 73102]
gi|186463289|gb|ACC79090.1| Na-Ca exchanger/integrin-beta4 [Nostoc punctiforme PCC 73102]
Length = 1241
Score = 47.0 bits (110), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 97/221 (43%), Gaps = 30/221 (13%)
Query: 470 VVDLDGDGNLDILVG--TSFGLFYVLDHH-GKIREKFPLEMAEIQG----AVVAADINDD 522
V DL+ DGN D++ +S + +L G + + G +V AD N D
Sbjct: 166 VADLNKDGNTDLVTANKSSQSVSVLLGKGDGTFKPATTFSVVGFNGLNPYSVAVADFNKD 225
Query: 523 GKIELVTT-DTHGNVAAWTAEGKGIWEQHLKSLVTQGP-----SIGDVDGDGHSDVVVP- 575
GK++LVT + N++ +G G ++ + + G ++GD + DG SD+V
Sbjct: 226 GKLDLVTANNVSNNISVLLGKGDGSFQAAVNFDLPSGSAPISIAVGDFNKDGKSDIVTAN 285
Query: 576 TLSGNIYVL----SGKDGSKVR-PYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFD 630
S NI VL +G GS + P R ++ G+ L + ++ F+
Sbjct: 286 NASQNISVLLGNGTGGFGSAINFKVPSRPTSVIVGNF-------NGDGISDLAVTSSYFN 338
Query: 631 GYLYLI---DGPTSCADVVDIGETSYSMVLAD-NVDGGDDL 667
L+ DG + A D+G +S+V+ D N DG DL
Sbjct: 339 NVSILLGNGDGTFNSATQFDVGTNPHSVVIGDFNKDGKSDL 379
>gi|320160191|ref|YP_004173415.1| hypothetical protein ANT_07810 [Anaerolinea thermophila UNI-1]
gi|319994044|dbj|BAJ62815.1| hypothetical protein ANT_07810 [Anaerolinea thermophila UNI-1]
Length = 551
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 39/153 (25%), Positives = 72/153 (47%), Gaps = 24/153 (15%)
Query: 465 YSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG------------ 512
YSSP VD++GDG LDI V +++ ++ + G I K +G
Sbjct: 69 YSSPAAVDVNGDGRLDI-VASAYSIWAIDGETGAILWKTRSGHDVTEGFDGVSNVGRTWP 127
Query: 513 AVVAADINDDGKIELVTTDTHGNVAAWTAEG--KGIWEQHLKSLVTQGPSIGDVDGDGHS 570
+ AD++ DG E++T + G ++ +T +G K W Q + +G + D+DG+G +
Sbjct: 128 GIAVADVDRDGTQEIITAHSGGVISVYTLQGRFKPGWYQKPHTSEWRGLLVADLDGNGST 187
Query: 571 DVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRV 603
++ T + GS V + + + G++
Sbjct: 188 MEILATRA---------YGSAVNTWVFSSSGQI 211
Score = 39.7 bits (91), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 76/173 (43%), Gaps = 37/173 (21%)
Query: 388 VIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVK- 446
VIAD++ DGV E ++AV +D +P + +Y A +FN D + +
Sbjct: 314 VIADVNGDGVRE-VVAVGNMYDCS--QSPY---------LSRYYA--PFIFNADRSRFRT 359
Query: 447 ----WTTD-LDLSTDNASFRAYIYSS---PTVVDLDGDGNLDILVGTSFGLFYVLDHHGK 498
W T+ LD + I S+ P V DLDGDG +IL + G +
Sbjct: 360 GAWDWRTNPLDTGAPISEDYNVIESAEPNPVVADLDGDGVREILFASYDGRVHAFWLDKT 419
Query: 499 IREKFPLEMAEIQGAV-------VAADINDDGKIELVTTDTHGNVAAWTAEGK 544
+P E+ + V V AD++++GK E++ A+WTA+G
Sbjct: 420 EHGNWPFEVYQPAEGVLRFASEPVVADLDNNGKAEVL-------FASWTAKGS 465
>gi|309790342|ref|ZP_07684908.1| FG-GAP repeat protein [Oscillochloris trichoides DG-6]
gi|308227608|gb|EFO81270.1| FG-GAP repeat protein [Oscillochloris trichoides DG6]
Length = 698
Score = 46.6 bits (109), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 46/184 (25%), Positives = 75/184 (40%), Gaps = 38/184 (20%)
Query: 377 VNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIV 436
+N + + S+P + DID+DG+ E++ VS G IG IV
Sbjct: 387 LNAPACVQSSPAVGDIDDDGLLEIVATVS-----------------GDASIGGDGKSDIV 429
Query: 437 VFNLDTKQVKWTTDLD--LSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYV-- 492
++ KWT S N +F + SP + DLDG+G+L+++ + + +
Sbjct: 430 AWDPTNANPKWTMSPGDPNSGSNDAFGGDL-QSPVIADLDGNGSLEVIAANFWSVHILRG 488
Query: 493 -----LDHHGKIREKFPL-EMAEIQGAVVAADINDDGKIELVTTDT----------HGNV 536
L + G L ++ DIN+DGK++LV HG +
Sbjct: 489 SNGQALTYQGTSGPSLSLFAWGTLKSTPAVGDINNDGKLDLVIGGMNVNNDSRVIDHGLL 548
Query: 537 AAWT 540
AWT
Sbjct: 549 YAWT 552
>gi|167515828|ref|XP_001742255.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778879|gb|EDQ92493.1| predicted protein [Monosiga brevicollis MX1]
Length = 399
Score = 46.6 bits (109), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 66/129 (51%), Gaps = 9/129 (6%)
Query: 470 VVDLDGDGNLDILVGT---SFGLFYVLDHHGK-IREKFPLEMAEIQGAVVAADINDDGKI 525
DLD DG+LD+L G+ + +++ + +G + E + + V AAD+N+DG++
Sbjct: 110 AADLDNDGSLDVLSGSINDNNVVWWRNNGNGTFMNEMLISDAVDFTSMVYAADLNNDGRL 169
Query: 526 ELVTTDTHGNVAAWTA-EGKGIW-EQHLKSLVTQGPS---IGDVDGDGHSDVVVPTLSGN 580
++++ N AW G+G + +Q + +L G S D+DGDGH DV+ + N
Sbjct: 170 DVLSASRDDNKVAWYPNNGEGSFSDQRIITLNALGASSVYAADLDGDGHLDVLSASSGDN 229
Query: 581 IYVLSGKDG 589
DG
Sbjct: 230 KLAWYRNDG 238
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 68/156 (43%), Gaps = 18/156 (11%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG---AVVAAD 518
A SS DL+ DG LDIL T G +H P+ ++ I +V AAD
Sbjct: 53 AQFASSVFAADLNNDGYLDILSATVRGKVEWYRNHADGTFSNPISISTIMSRTQSVYAAD 112
Query: 519 INDDGKIELVTTDTH-GNVAAWTAEGKGIW--EQHLKSLV--TQGPSIGDVDGDGHSDVV 573
+++DG +++++ + NV W G G + E + V T D++ DG DV+
Sbjct: 113 LDNDGSLDVLSGSINDNNVVWWRNNGNGTFMNEMLISDAVDFTSMVYAADLNNDGRLDVL 172
Query: 574 VPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLL 609
+ +D +KV YP G +Q ++
Sbjct: 173 SAS----------RDDNKVAWYPNNGEGSFSDQRII 198
>gi|68299557|dbj|BAE02738.1| hemolysin [Vibrio anguillarum]
Length = 690
Score = 46.6 bits (109), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 38/162 (23%), Positives = 74/162 (45%), Gaps = 37/162 (22%)
Query: 382 HILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVA 432
+++ PVIA +++D ++++I+ F++ Y ++ L GID +
Sbjct: 52 QVMAAPVIAQLNDDNGDGKIDENDIADIIVVT---FENNKYTQGGLVRALSGIDGSEL-- 106
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFY- 491
S DN A SP V DLDG+G +DI++ ++ +
Sbjct: 107 --------------------WSYDNGGIIADARYSPAVADLDGNGVVDIVITSASSPYIT 146
Query: 492 VLDHHGKIREKFPLEM--AEIQGAVVAADINDDGKIELVTTD 531
+LD+ G I+++ + G + +D+N+DG IE+++ D
Sbjct: 147 ILDNEGNIKKQILKHVTGGRSVGTISISDLNNDGSIEIISAD 188
>gi|336123844|ref|YP_004565892.1| Leukocidin S subunit [Vibrio anguillarum 775]
gi|335341567|gb|AEH32850.1| Leukocidin S subunit [Vibrio anguillarum 775]
Length = 689
Score = 46.6 bits (109), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 38/162 (23%), Positives = 74/162 (45%), Gaps = 37/162 (22%)
Query: 382 HILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVA 432
+++ PVIA +++D ++++I+ F++ Y ++ L GID +
Sbjct: 51 QVMAAPVIAQLNDDNGDGKIDENDIADIIVVT---FENNKYTQGGLVRALSGIDGSEL-- 105
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFY- 491
S DN A SP V DLDG+G +DI++ ++ +
Sbjct: 106 --------------------WSYDNGGIIADARYSPAVADLDGNGVVDIVITSASSPYIT 145
Query: 492 VLDHHGKIREKFPLEM--AEIQGAVVAADINDDGKIELVTTD 531
+LD+ G I+++ + G + +D+N+DG IE+++ D
Sbjct: 146 ILDNEGNIKKQILKHVTGGRSVGTISISDLNNDGSIEIISAD 187
>gi|162457451|ref|YP_001619819.1| hypothetical protein sce9166 [Sorangium cellulosum So ce56]
gi|161168033|emb|CAN99338.1| hypothetical protein sce9166 [Sorangium cellulosum So ce56]
Length = 579
Score = 46.6 bits (109), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 64/137 (46%), Gaps = 24/137 (17%)
Query: 451 LDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKF--PLEMA 508
L LS D+ S A++ SP V DLDGDG +ILV T G V G+ + ++ +
Sbjct: 91 LQLSDDDES--AWL-GSPVVADLDGDGEQEILV-THNGNVVVWGPDGEQKWRYDESRQNG 146
Query: 509 EIQGAVVAADINDDGKIELV--------TTDTHGN-VAAWTAEGKGIWEQHLKSLVTQGP 559
I + + AD DG++E+V D G V+ W W ++SL
Sbjct: 147 RIWASPIVADFRGDGQLEVVFAARTTIYMLDASGQLVSGWPV----TWNDEIRSLAA--- 199
Query: 560 SIGDVDGDGHSDVVVPT 576
GD+DGDG DVV +
Sbjct: 200 --GDIDGDGQLDVVAAS 214
>gi|392379919|ref|YP_004987077.1| putative hemagglutinin/hemolysin-related protein [Azospirillum
brasilense Sp245]
gi|356882286|emb|CCD03292.1| putative hemagglutinin/hemolysin-related protein [Azospirillum
brasilense Sp245]
Length = 2998
Score = 46.6 bits (109), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 69/166 (41%), Gaps = 36/166 (21%)
Query: 378 NVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVV 437
NV S+ + P ADID DG ++ I +Y + +Y N +G A + +
Sbjct: 1040 NVGSYANAVPTFADIDGDGDLDVFIGNAYG-NTIFYRN-----------VGTATAPSFTL 1087
Query: 438 FNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHG 497
+ + T+ Y++PT D+DGDG+LD VG FG V + G
Sbjct: 1088 VGTNLFGIDQTS---------------YATPTFADIDGDGDLDFFVGNMFGNVNVYRNVG 1132
Query: 498 KI---------REKFPLEMAEIQGAVVAADINDDGKIELVTTDTHG 534
F LE Q V ADI+ DG ++ + +T+G
Sbjct: 1133 TAAAPSFTLVGNNSFGLENIGGQVRPVFADIDGDGDLDAIIGNTNG 1178
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 64/145 (44%), Gaps = 24/145 (16%)
Query: 461 RAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIRE-KFPLE---------MAEI 510
A ++PT D+DGDG+LD+L+G G + + G F LE +
Sbjct: 985 NAGANATPTFADIDGDGDLDMLIGIQSGDSILYRNVGTATAPSFTLEGTNPFGLGNVGSY 1044
Query: 511 QGAVVA-ADINDDGKIELVTTDTHGN-----------VAAWTAEGKGIWEQHLKSLVTQG 558
AV ADI+ DG +++ + +GN ++T G ++ S T
Sbjct: 1045 ANAVPTFADIDGDGDLDVFIGNAYGNTIFYRNVGTATAPSFTLVGTNLFGIDQTSYAT-- 1102
Query: 559 PSIGDVDGDGHSDVVVPTLSGNIYV 583
P+ D+DGDG D V + GN+ V
Sbjct: 1103 PTFADIDGDGDLDFFVGNMFGNVNV 1127
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 65/146 (44%), Gaps = 20/146 (13%)
Query: 468 PTVVDLDGDGNLDILVGTSFG--LFY-----VLDHHGKIREKFPLEMAEI--QGAVVAAD 518
P V DLDGDG+ D+LVG + G L+Y + + P + + + AD
Sbjct: 520 PAVADLDGDGDPDVLVGNAAGDTLYYRNIGTATNPTFTLAGTNPFGLGRVGTSASTSFAD 579
Query: 519 INDDGKIELVTTDTHGNVAAWTAEGK---------GIWEQHLKSLVT-QGPSIGDVDGDG 568
I+ DG ++ + + +G + + G G L + T P+ D+DGDG
Sbjct: 580 IDGDGDLDALIGNQNGGIVVYRNVGTRTSPSFMLVGTNPFGLGGVGTAAAPTFADIDGDG 639
Query: 569 HSDVVVPTLSGNIYVLSGKDGSKVRP 594
DV++ +GN+ VL G+ P
Sbjct: 640 DLDVLIGNGAGNL-VLYWNTGTSAAP 664
>gi|262189665|ref|ZP_06048042.1| hemolysin-related protein Vcp [Vibrio cholerae CT 5369-93]
gi|262034455|gb|EEY52818.1| hemolysin-related protein Vcp [Vibrio cholerae CT 5369-93]
Length = 415
Score = 46.6 bits (109), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 37/165 (22%), Positives = 74/165 (44%), Gaps = 43/165 (26%)
Query: 382 HILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IGK 429
+++ P++ +++D V+++I+ F+ Y N +++ L G+D +
Sbjct: 51 QVMAAPIVVQLNDDNGDGKIDGKDVADIIVVT---FEGNKYANGGYIRALSGVDGSELWS 107
Query: 430 YVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGL 489
Y G ++ +A + SP DLDGDG ++I+ ++
Sbjct: 108 YSNGGVIA-------------------DARY------SPAAADLDGDGLIEIVSTSALTP 142
Query: 490 FY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELVTTD 531
+ +LDH G I+++ + + G + ADIN DG IE++ D
Sbjct: 143 YINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEILAAD 187
>gi|229529971|ref|ZP_04419361.1| hemolysin-related protein RbmC [Vibrio cholerae 12129(1)]
gi|229333745|gb|EEN99231.1| hemolysin-related protein RbmC [Vibrio cholerae 12129(1)]
Length = 957
Score = 46.6 bits (109), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPVVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYTNGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 380
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 381 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 411
>gi|424659834|ref|ZP_18097083.1| FG-GAP repeat family protein [Vibrio cholerae HE-16]
gi|408051402|gb|EKG86493.1| FG-GAP repeat family protein [Vibrio cholerae HE-16]
Length = 939
Score = 46.2 bits (108), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 224 NQVMVTPVVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYTNGGLVRALSGVDGSELW 280
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 281 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 315
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 316 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 361
>gi|417824038|ref|ZP_12470629.1| FG-GAP repeat family protein [Vibrio cholerae HE48]
gi|340047723|gb|EGR08646.1| FG-GAP repeat family protein [Vibrio cholerae HE48]
Length = 957
Score = 46.2 bits (108), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPVVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYTNGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 380
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 381 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 411
>gi|229525543|ref|ZP_04414948.1| hemolysin-related protein RbmC [Vibrio cholerae bv. albensis VL426]
gi|229339124|gb|EEO04141.1| hemolysin-related protein RbmC [Vibrio cholerae bv. albensis VL426]
Length = 957
Score = 46.2 bits (108), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPVVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYTNGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 379
>gi|427706535|ref|YP_007048912.1| FG-GAP repeat-containing protein [Nostoc sp. PCC 7107]
gi|427359040|gb|AFY41762.1| FG-GAP repeat protein [Nostoc sp. PCC 7107]
Length = 415
Score = 46.2 bits (108), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 51/113 (45%), Gaps = 11/113 (9%)
Query: 467 SPTVVDLDGDGNLDIL-----VGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIND 521
+PTVVD DG+G DI GT+ F G KF L+ + D N
Sbjct: 221 TPTVVDFDGNGKSDIFWRNANTGTNSAWFM----DGTKASKFNLQSQSSSSTAILGDFNG 276
Query: 522 DGKIELVTTDTH-GNVAAWTAEGKGIWEQHLKSL-VTQGPSIGDVDGDGHSDV 572
D K +++ +T G WT G + E K+L + +IGD D DG +D+
Sbjct: 277 DLKTDILWRNTSTGQNKIWTMNGIVVTEAAAKTLDSSWQANIGDFDSDGKTDI 329
>gi|395826664|ref|XP_003786536.1| PREDICTED: integrin alpha-3 [Otolemur garnettii]
Length = 1051
Score = 46.2 bits (108), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 34/133 (25%), Positives = 60/133 (45%), Gaps = 19/133 (14%)
Query: 369 QHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIG 428
+ ++ + V ++ S +AD++NDG ++++ Y+FD +
Sbjct: 293 RRRQVLEGTQVGAYFGSAIALADLNNDGWQDLLVGAPYYFDRK----------------- 335
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSF- 487
+ V GAI VF ++ + T L S A+ +S ++ D++ DG DI VG F
Sbjct: 336 EEVGGAIYVF-MNQAGTSFPTHPSLLLHGPSRSAFGFSVASIGDINQDGFQDIAVGAPFE 394
Query: 488 GLFYVLDHHGKIR 500
GL V +H R
Sbjct: 395 GLGRVYIYHSSSR 407
>gi|422909535|ref|ZP_16944181.1| FG-GAP repeat family protein [Vibrio cholerae HE-09]
gi|341634817|gb|EGS59552.1| FG-GAP repeat family protein [Vibrio cholerae HE-09]
Length = 886
Score = 46.2 bits (108), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 171 NQVMVTPVVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYTNGGLVRALSGVDGSELW 227
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 228 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 262
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 263 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 308
>gi|433456433|ref|ZP_20414478.1| FG-GAP repeat-containing protein [Arthrobacter crystallopoietes
BAB-32]
gi|432196269|gb|ELK52738.1| FG-GAP repeat-containing protein [Arthrobacter crystallopoietes
BAB-32]
Length = 1167
Score = 46.2 bits (108), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 61/127 (48%), Gaps = 14/127 (11%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVV----AADINDDGKIEL 527
D DGD N D+L G ++ +G+ R +++ A+ + + DG ++L
Sbjct: 917 DFDGDNNPDVLATDRSGRLWLYPGNGEGRMMKRVQIGTGWNAMSQVFSPGNFDGDGGMDL 976
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGP--------SIGDVDGDGHSDVVVPTLSG 579
+ D GN+ + +G+G W + K LV +G GD DGDG +DV+ +G
Sbjct: 977 LARDRGGNLWLYPGDGEGGWLK--KVLVGKGGWNSMTEIFGAGDFDGDGSADVLARHKNG 1034
Query: 580 NIYVLSG 586
N+++ G
Sbjct: 1035 NLWLYPG 1041
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 59/131 (45%), Gaps = 21/131 (16%)
Query: 472 DLDGDGNLDILVGTSFG--LFYVLDHHGKIREKFPL------EMAEIQGAVVAADINDDG 523
+ DGDG +D+L G Y D G +K + M EI GA D + DG
Sbjct: 967 NFDGDGGMDLLARDRGGNLWLYPGDGEGGWLKKVLVGKGGWNSMTEIFGA---GDFDGDG 1023
Query: 524 KIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGP--------SIGDVDGDGHSDVVVP 575
+++ +GN+ + +G+G W K +V G S GD D DG +DV+
Sbjct: 1024 SADVLARHKNGNLWLYPGDGEGGW--LAKRIVGHGGWNSMTEIFSPGDFDDDGSADVLAR 1081
Query: 576 TLSGNIYVLSG 586
+GN+++ G
Sbjct: 1082 HKNGNLWLYPG 1092
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 57/131 (43%), Gaps = 21/131 (16%)
Query: 472 DLDGDGNLDILVGTSFG--LFYVLDHHGKIREKFPL------EMAEIQGAVVAADINDDG 523
D DGDG+ D+L G Y D G K + M EI D +DDG
Sbjct: 1018 DFDGDGSADVLARHKNGNLWLYPGDGEGGWLAKRIVGHGGWNSMTEI---FSPGDFDDDG 1074
Query: 524 KIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGP--------SIGDVDGDGHSDVVVP 575
+++ +GN+ + +G G W K +V G S GD DGDG+ DV+
Sbjct: 1075 SADVLARHKNGNLWLYPGDGDGGW--LAKRIVGHGGWNSMTTIFSSGDFDGDGNVDVLAR 1132
Query: 576 TLSGNIYVLSG 586
+GN+++ G
Sbjct: 1133 HKNGNLWLYPG 1143
>gi|167520794|ref|XP_001744736.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777067|gb|EDQ90685.1| predicted protein [Monosiga brevicollis MX1]
Length = 789
Score = 46.2 bits (108), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 82/321 (25%), Positives = 136/321 (42%), Gaps = 50/321 (15%)
Query: 388 VIADIDNDGVSEMIIAVSYFFDHEYYDN-------PEHL---------KELGGIDIGKYV 431
+ AD+DNDG ++I SY +Y N P+ L +G D +V
Sbjct: 50 LTADLDNDGWVDVISVASYNDQVTWYRNLGNGTFGPQRLVGVIHAAFQGAVGDFDHDGWV 109
Query: 432 ---AGAI----VVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVG 484
AG+ +V+ +T ++ + +ST + R + DLD DG LD++V
Sbjct: 110 DVAAGSTNDDSLVWFRNTGGGNFSAGIIVSTVHDEVRGLLAE-----DLDNDGFLDLVV- 163
Query: 485 TSFG---LFYVLDHHGKIR----EKFPLEMAEIQGAVVAADINDDGKIELVT-TDTHGNV 536
+ +G + + + HG + P + + + A D+N DG +++V G V
Sbjct: 164 SDYGNDRIVWYPNAHGAFNASSVQILP-GLCDGPRQLAAGDLNGDGWLDIVAPCINEGRV 222
Query: 537 AAWTAEGKGI---WEQHLKSL-VTQGPSIGDVDGDGHSDVVVPTLSGNIYV---LSGKDG 589
W ++G G W+ ++ L Q S+ DVDGD DVVV + ++V L+ G
Sbjct: 223 LWWPSQGNGSFSSWQLIVQGLDKAQAVSVRDVDGDSAVDVVVGGDNSALFVQNRLASGHG 282
Query: 590 SKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIG 649
R H + Q++L D T+ G F +L L G S + + I
Sbjct: 283 FSTN-LTIRLHD--VRQIILEDFTQSGAPDMLFITGNEFFTWFLNL--GNASFSSPLQIS 337
Query: 650 ETSYSMVLADNVDGGDDLDLI 670
+ + A +VDG LDL+
Sbjct: 338 TGDLNSIAALDVDGDGWLDLV 358
>gi|419832488|ref|ZP_14355950.1| hypothetical protein VCHC61A2_1128 [Vibrio cholerae HC-61A2]
gi|422916700|ref|ZP_16951029.1| FG-GAP repeat family protein [Vibrio cholerae HC-02A1]
gi|423851529|ref|ZP_17719420.1| hypothetical protein VCHC59A1_1058 [Vibrio cholerae HC-59A1]
gi|423880129|ref|ZP_17723025.1| hypothetical protein VCHC60A1_1010 [Vibrio cholerae HC-60A1]
gi|424015823|ref|ZP_17755664.1| hemolysin [Vibrio cholerae HC-55B2]
gi|424635923|ref|ZP_18073938.1| FG-GAP repeat family protein [Vibrio cholerae HC-55A1]
gi|341639123|gb|EGS63752.1| FG-GAP repeat family protein [Vibrio cholerae HC-02A1]
gi|408026254|gb|EKG63271.1| FG-GAP repeat family protein [Vibrio cholerae HC-55A1]
gi|408642466|gb|EKL14210.1| hypothetical protein VCHC60A1_1010 [Vibrio cholerae HC-60A1]
gi|408644053|gb|EKL15763.1| hypothetical protein VCHC59A1_1058 [Vibrio cholerae HC-59A1]
gi|408651132|gb|EKL22388.1| hypothetical protein VCHC61A2_1128 [Vibrio cholerae HC-61A2]
gi|408861497|gb|EKM01087.1| hemolysin [Vibrio cholerae HC-55B2]
Length = 939
Score = 46.2 bits (108), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 77/166 (46%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 224 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 280
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 281 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 315
Query: 489 LFY-VLDHHGKIREKFPLEMAE--IQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P ++ I G + AD++ DG +E++ D
Sbjct: 316 QFITILDNQGNIKKQIPTTESDWRIVGDITLADLDHDGSVEILAAD 361
>gi|429886181|ref|ZP_19367745.1| Hemolysin-related protein RbmC [Vibrio cholerae PS15]
gi|429226971|gb|EKY33034.1| Hemolysin-related protein RbmC [Vibrio cholerae PS15]
Length = 886
Score = 46.2 bits (108), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 171 NQVMVTPVVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYTNGGLVRALSGVDGSELW 227
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 228 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 262
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 263 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 308
>gi|149922946|ref|ZP_01911366.1| Rhs family protein [Plesiocystis pacifica SIR-1]
gi|149816197|gb|EDM75704.1| Rhs family protein [Plesiocystis pacifica SIR-1]
Length = 570
Score = 46.2 bits (108), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 387 PVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKEL---GGID-IGKYVA-----GAIVV 437
P IAD+DNDG E++ Y + Y HL G + G+YVA GAI V
Sbjct: 191 PAIADLDNDGSPEILALEQYGTNSPYQFADRHLVAFDAEGNLKWSGEYVAQSRGGGAIAV 250
Query: 438 FNLDTKQVKWTTDLDLSTDNASFRAYI-------YSSPTVVDLDGDGNLDILVGTSFGLF 490
+LD + TD + + YS P D D DG+L++L G G
Sbjct: 251 ADLDHDGSPEILAPEYVTDASGQLLWALDDPPTQYSMPVAADFDLDGDLEVLFG---GTA 307
Query: 491 YVLDHHGKIREKFPLEMAEI-QGAVVAADINDDGKIELVTTDTHGNVAAWTAEGK--GIW 547
Y H G + F ++ A +G+ A+ +DD E G A + +G +
Sbjct: 308 Y--HHDGSLY--FDVQGAPTNRGSAAVANFDDDPNPEFYAQ-HDGAHAVFDHDGSLIAVC 362
Query: 548 EQHLKSLVTQGP-SIGDVDGDGHSDVVVPTLSGNIYVL 584
HL P SIGD++GDG ++++ + +Y L
Sbjct: 363 PAHLVMGEAGNPTSIGDLNGDGKAEILF-AIGQKVYAL 399
>gi|311745390|ref|ZP_07719175.1| hypothetical protein ALPR1_02905 [Algoriphagus sp. PR1]
gi|126577936|gb|EAZ82156.1| hypothetical protein ALPR1_02905 [Algoriphagus sp. PR1]
Length = 397
Score = 46.2 bits (108), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 30/122 (24%), Positives = 57/122 (46%), Gaps = 7/122 (5%)
Query: 460 FRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVA--A 517
F ++ V D++ DG LD++ G + + H K + E +++
Sbjct: 42 FSDFVSEGVAVGDVNNDGKLDVMAGPFWFEAPTWEKHEIYEPKLFVAEKEWSNSMLNFDL 101
Query: 518 DINDDGKIELVTTDTHGNVAAWTAEGKGI---WEQHL--KSLVTQGPSIGDVDGDGHSDV 572
D+N DG ++L+ D G W KG+ W+ H+ +++ + P+ D+DGDG D+
Sbjct: 102 DVNLDGWVDLIRVDFPGKAVYWHENPKGMEGHWKVHMIHETVGNESPNFVDIDGDGRKDI 161
Query: 573 VV 574
+
Sbjct: 162 LC 163
>gi|430375928|ref|ZP_19430331.1| FG-GAP repeat-containing protein [Moraxella macacae 0408225]
gi|429541159|gb|ELA09187.1| FG-GAP repeat-containing protein [Moraxella macacae 0408225]
Length = 1986
Score = 46.2 bits (108), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 2/123 (1%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIE 526
SP + D DGDG DI + S+ + + G + P+ + D ++DGK E
Sbjct: 874 SPVIADFDGDGVADIGIARSYS-YVAMRGDGSVIWDTPINDSSGGTGSTVFDFDNDGKSE 932
Query: 527 LVTTDTHGNVAAWTAEGK-GIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLS 585
V D A GK I + + + P +GD DGDGH+D+++ S + ++S
Sbjct: 933 AVHFDEQYLRIYDAATGKERIKIANNTATAHEYPIVGDFDGDGHADIIMTASSQGVRMVS 992
Query: 586 GKD 588
K+
Sbjct: 993 SKN 995
>gi|423819369|ref|ZP_17715627.1| hypothetical protein VCHC55C2_1014 [Vibrio cholerae HC-55C2]
gi|423997114|ref|ZP_17740373.1| hemolysin [Vibrio cholerae HC-02C1]
gi|424018758|ref|ZP_17758554.1| hemolysin [Vibrio cholerae HC-59B1]
gi|424624301|ref|ZP_18062773.1| FG-GAP repeat family protein [Vibrio cholerae HC-50A1]
gi|424632833|ref|ZP_18070943.1| FG-GAP repeat family protein [Vibrio cholerae HC-52A1]
gi|424639863|ref|ZP_18077753.1| FG-GAP repeat family protein [Vibrio cholerae HC-56A1]
gi|424647897|ref|ZP_18085567.1| FG-GAP repeat family protein [Vibrio cholerae HC-57A1]
gi|443526719|ref|ZP_21092788.1| FG-GAP repeat family protein [Vibrio cholerae HC-78A1]
gi|408014784|gb|EKG52406.1| FG-GAP repeat family protein [Vibrio cholerae HC-50A1]
gi|408020332|gb|EKG57663.1| FG-GAP repeat family protein [Vibrio cholerae HC-52A1]
gi|408025709|gb|EKG62757.1| FG-GAP repeat family protein [Vibrio cholerae HC-56A1]
gi|408035743|gb|EKG72199.1| FG-GAP repeat family protein [Vibrio cholerae HC-57A1]
gi|408636014|gb|EKL08181.1| hypothetical protein VCHC55C2_1014 [Vibrio cholerae HC-55C2]
gi|408853821|gb|EKL93600.1| hemolysin [Vibrio cholerae HC-02C1]
gi|408869065|gb|EKM08369.1| hemolysin [Vibrio cholerae HC-59B1]
gi|443454928|gb|ELT18726.1| FG-GAP repeat family protein [Vibrio cholerae HC-78A1]
Length = 957
Score = 46.2 bits (108), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 77/166 (46%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMAE--IQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P ++ I G + AD++ DG +E++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESDWRIVGDITLADLDHDGSVEILAAD 379
>gi|403528997|ref|YP_006663884.1| FG-GAP repeat domain-containing protein [Arthrobacter sp. Rue61a]
gi|403231424|gb|AFR30846.1| FG-GAP repeat domain protein [Arthrobacter sp. Rue61a]
Length = 337
Score = 46.2 bits (108), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 46/194 (23%), Positives = 84/194 (43%), Gaps = 29/194 (14%)
Query: 464 IYSSPTVVDLDGDGNLDILVGTSFG--LFYVLDHHGKIREKFPLEMA-EIQGAVVA-ADI 519
I ++P DGDG D+ + G L Y D G R + I +V+ D
Sbjct: 68 IEAAPRSRGFDGDGTFDLFARDAAGRLLLYPTDGRGNWRAPQVIGNGWHIYNLLVSPGDF 127
Query: 520 NDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGP-------SIGDVDGDGHSDV 572
+ DG ++++ D +G + + G G W+Q + + QG + GD +GDG +D+
Sbjct: 128 DGDGTVDVMGRDGYGRLFLYQGNGAGAWKQAFQ--IGQGWQGFVNLIAPGDFNGDGTNDI 185
Query: 573 VVPTLSGNIYVLSGKD-GSKVRPYPYRT-----------HGRVMNQVLLVDLTKRGEKSK 620
+ +GN+++ G G + P T H ++ +++ T RG+
Sbjct: 186 LAQDAAGNVFLYPGNGRGGWLAPSKVTTGWQGLDSAGAGHFYGTSEAVVLSRTWRGD--- 242
Query: 621 GLTIVTTSFDGYLY 634
L +++ +GY Y
Sbjct: 243 -LQARSSTRNGYFY 255
>gi|385803985|ref|YP_005840385.1| hypothetical protein Hqrw_2940 [Haloquadratum walsbyi C23]
gi|339729477|emb|CCC40738.1| conserved hypothetical protein [Haloquadratum walsbyi C23]
Length = 408
Score = 46.2 bits (108), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 93/248 (37%), Gaps = 43/248 (17%)
Query: 361 GDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSY-------------- 406
GD WT + N H ++ P + D+D+DG +E++ A +
Sbjct: 82 GDTRWTYGIPSE-----NCTIHAVADPTVTDVDSDGQNEVLAATTEKEVAILDTNSGDIE 136
Query: 407 ----FFDHEYYD------NPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTD 456
D+ Y P +E+ +D V G + VF D + WT
Sbjct: 137 ARLNLSDYGYTQPIVADLAPTTGQEIIAVD----VTGTVFVFGSDGTTL-WT-------- 183
Query: 457 NASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVA 516
AS Y+++ PTV D D D ++LVG G + H+G + +
Sbjct: 184 -ASLNEYVWAQPTVEDFDRDSRPELLVGGQNGNAVLFGHNGTVEWNTTVSDKGSITWGTH 242
Query: 517 ADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT 576
++D +E T G++ A + + + + GD D DG S+V T
Sbjct: 243 GQADEDSALESFFATTGGDIVAVDGRNGSVEWRTMIGEFAAVRAFGDADSDGSSEVYATT 302
Query: 577 LSGNIYVL 584
+G ++ L
Sbjct: 303 QNGRVHAL 310
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 74/158 (46%), Gaps = 7/158 (4%)
Query: 434 AIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVL 493
A++ ++ +WT + ++N + A + PTV D+D DG ++L T+ +L
Sbjct: 73 ALISMDMVDGDTRWT--YGIPSENCTIHAV--ADPTVTDVDSDGQNEVLAATTEKEVAIL 128
Query: 494 D-HHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK 552
D + G I + L +VA D+ E++ D G V + ++G +W L
Sbjct: 129 DTNSGDIEARLNLSDYGYTQPIVA-DLAPTTGQEIIAVDVTGTVFVFGSDGTTLWTASLN 187
Query: 553 SLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGS 590
V P++ D D D +++V +GN VL G +G+
Sbjct: 188 EYVWAQPTVEDFDRDSRPELLVGGQNGNA-VLFGHNGT 224
>gi|419829517|ref|ZP_14353003.1| hypothetical protein VCHC1A2_1896 [Vibrio cholerae HC-1A2]
gi|424628799|ref|ZP_18067097.1| FG-GAP repeat family protein [Vibrio cholerae HC-51A1]
gi|408058091|gb|EKG92912.1| FG-GAP repeat family protein [Vibrio cholerae HC-51A1]
gi|408621102|gb|EKK94105.1| hypothetical protein VCHC1A2_1896 [Vibrio cholerae HC-1A2]
Length = 886
Score = 46.2 bits (108), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 77/166 (46%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 171 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 227
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 228 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 262
Query: 489 LFY-VLDHHGKIREKFPLEMAE--IQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P ++ I G + AD++ DG +E++ D
Sbjct: 263 QFITILDNQGNIKKQIPTTESDWRIVGDITLADLDHDGSVEILAAD 308
>gi|397606319|gb|EJK59276.1| hypothetical protein THAOC_20524, partial [Thalassiosira oceanica]
Length = 312
Score = 46.2 bits (108), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 66/154 (42%), Gaps = 32/154 (20%)
Query: 616 GEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDL------ 669
G +GL IV + +I+G TSC +D+G+T SMV AD++ G +DL
Sbjct: 17 GGTGRGLHIVQ-PVGSTINIIEGSTSCEQRLDLGDTIPSMVQADDIHGTGGVDLGAWCVS 75
Query: 670 --------------------IVTTMNGNVFCF-STPAPHHPL---KAWRSINQGRNNVAI 705
+VTT G + S P HPL A + + G N+ +
Sbjct: 76 SSHLLQFFIIFPSADVRFCEVVTTAQGEIITLESDSVPFHPLNTHSAGAARSPGSNSQSH 135
Query: 706 RYNRA-GIYVTHPSRAFRDEEGRNFWVEIEIVDE 738
++ + GI+V SR +RD G V EI D
Sbjct: 136 GFSASQGIFVHPISRQYRDVLGVYLPVTFEIFDR 169
>gi|424023705|ref|ZP_17763365.1| hemolysin, partial [Vibrio cholerae HC-62B1]
gi|408871919|gb|EKM11146.1| hemolysin, partial [Vibrio cholerae HC-62B1]
Length = 949
Score = 45.8 bits (107), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 234 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 290
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 291 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 325
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 326 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 372
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 373 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 403
>gi|153817236|ref|ZP_01969903.1| hypothetical protein A5C_0944 [Vibrio cholerae NCTC 8457]
gi|126512270|gb|EAZ74864.1| hypothetical protein A5C_0944 [Vibrio cholerae NCTC 8457]
Length = 957
Score = 45.8 bits (107), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 380
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 381 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 411
>gi|422306474|ref|ZP_16393650.1| hypothetical protein VCCP1035_1092 [Vibrio cholerae CP1035(8)]
gi|408626365|gb|EKK99230.1| hypothetical protein VCCP1035_1092 [Vibrio cholerae CP1035(8)]
Length = 886
Score = 45.8 bits (107), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 171 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYTNGGLVRALSGVDGSELW 227
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 228 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 262
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 263 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 308
>gi|417812966|ref|ZP_12459623.1| FG-GAP repeat family protein [Vibrio cholerae HC-49A2]
gi|340042270|gb|EGR03235.1| FG-GAP repeat family protein [Vibrio cholerae HC-49A2]
Length = 945
Score = 45.8 bits (107), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 230 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 286
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 287 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 321
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 322 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 368
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 369 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 399
>gi|297200724|ref|ZP_06918121.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|197712293|gb|EDY56327.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length = 748
Score = 45.8 bits (107), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 59/121 (48%), Gaps = 10/121 (8%)
Query: 472 DLDGDGNLDILV-GTSFGLFYVLDHHGKIREKFPLEM---AEIQGAVVAADINDDGKIEL 527
DL GDG D+L S G Y+ + G + K +++ A + V A D+N DG ++
Sbjct: 601 DLTGDGRTDLLARKASTGDVYLFANDGASKLKPGVKIRSWATYKKIVGAGDLNGDGFGDV 660
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS------IGDVDGDGHSDVVVPTLSGNI 581
+ D G + + G G ++ +K T G S +GD+ GDG +D+V GN+
Sbjct: 661 LVQDKAGALWRYDGTGTGQVKERVKVFSTWGTSYNVVVGVGDITGDGKNDLVSRDTGGNL 720
Query: 582 Y 582
Y
Sbjct: 721 Y 721
>gi|153802429|ref|ZP_01957015.1| hemolysin [Vibrio cholerae MZO-3]
gi|124122045|gb|EAY40788.1| hemolysin [Vibrio cholerae MZO-3]
Length = 940
Score = 45.8 bits (107), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 224 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYTNGGLVRALSGVDGSELW 280
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 281 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 315
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 316 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 361
>gi|153826116|ref|ZP_01978783.1| hypothetical protein A5A_0906 [Vibrio cholerae MZO-2]
gi|149740139|gb|EDM54298.1| hypothetical protein A5A_0906 [Vibrio cholerae MZO-2]
Length = 957
Score = 45.8 bits (107), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 380
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 381 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 411
>gi|366164076|ref|ZP_09463831.1| FG-GAP/YD repeat-containing protein [Acetivibrio cellulolyticus
CD2]
Length = 1973
Score = 45.8 bits (107), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 62/125 (49%), Gaps = 18/125 (14%)
Query: 466 SSPTVVDLDGDGNLDIL-VGTSFGLFYVLDHH-GKIRE--KFPLEM---AEIQGA-VVAA 517
S+ ++DL+GDG D+L +G F Y D G E PL + +I A V A
Sbjct: 412 SNTRLIDLNGDGITDLLSIGKDFYCLYYRDKEKGWTTEPVTIPLNLFPPVDINDAHVYFA 471
Query: 518 DINDDGKIELVTTDTHGNVAAWTAEGKGIW--EQHLKS-------LVTQGPSIGDVDGDG 568
D+ DG ++V D+ G V W G GIW ++ LK+ + Q + D+DGDG
Sbjct: 472 DMTGDGLQDIVKVDSCG-VKFWPYLGNGIWGEKRELKNVNSIPLNFIPQRMFLADIDGDG 530
Query: 569 HSDVV 573
SD++
Sbjct: 531 CSDLI 535
>gi|121587697|ref|ZP_01677459.1| hypothetical protein VC274080_1002 [Vibrio cholerae 2740-80]
gi|147674173|ref|YP_001216406.1| hypothetical protein VC0395_A0455 [Vibrio cholerae O395]
gi|227081105|ref|YP_002809656.1| hemolysin-like protein [Vibrio cholerae M66-2]
gi|227117298|ref|YP_002819194.1| hemolysin-related protein [Vibrio cholerae O395]
gi|229505467|ref|ZP_04394977.1| hemolysin-related protein RbmC [Vibrio cholerae BX 330286]
gi|262168653|ref|ZP_06036348.1| hemolysin-related protein RbmC [Vibrio cholerae RC27]
gi|298498951|ref|ZP_07008758.1| conserved hypothetical protein [Vibrio cholerae MAK 757]
gi|121548069|gb|EAX58145.1| hypothetical protein VC274080_1002 [Vibrio cholerae 2740-80]
gi|146316056|gb|ABQ20595.1| hypothetical protein VC0395_A0455 [Vibrio cholerae O395]
gi|227008993|gb|ACP05205.1| hemolysin-related protein [Vibrio cholerae M66-2]
gi|227012748|gb|ACP08958.1| hemolysin-related protein [Vibrio cholerae O395]
gi|229357690|gb|EEO22607.1| hemolysin-related protein RbmC [Vibrio cholerae BX 330286]
gi|262022771|gb|EEY41477.1| hemolysin-related protein RbmC [Vibrio cholerae RC27]
gi|297543284|gb|EFH79334.1| conserved hypothetical protein [Vibrio cholerae MAK 757]
Length = 957
Score = 45.8 bits (107), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 380
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 381 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 411
>gi|385810616|ref|YP_005847012.1| hypothetical protein IALB_2039 [Ignavibacterium album JCM 16511]
gi|383802664|gb|AFH49744.1| Hypothetical protein IALB_2039 [Ignavibacterium album JCM 16511]
Length = 1083
Score = 45.8 bits (107), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 49/214 (22%), Positives = 98/214 (45%), Gaps = 47/214 (21%)
Query: 493 LDHHGKIREKFPL--------EMAEIQGAVVAADI---NDDGKIELVTTDTHGNVAAWTA 541
+ G IR FP E+++I G + +D+ G++ +V +T
Sbjct: 691 FNQTGGIRWVFPTTGTPWTVKEISDI-GGLPGSDVVIGTTTGRVYVVDGNT--------- 740
Query: 542 EGKGIWEQHLKSLVTQGPSIG-DVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTH 600
G+ +W+ ++ ++ + + D++ G+ DV+V ++ NI++L G++G+ + + T
Sbjct: 741 -GQQLWQANIGNVFIEDLRVTPDMNNSGYPDVLVSGINPNIFLLEGQNGNVI--WQNNTG 797
Query: 601 GRVMNQVLLVDLTKRGE---KSKGLTIVTTSFDG------YLYLIDG---PTSCADVVDI 648
G ++ + +L DLT G S L + +DG + Y G T+ VVD+
Sbjct: 798 GNILGKDVLSDLTGDGYPEFGSASLNNIVHIYDGRTGQIKFQYAFGGGGNSTAAEHVVDL 857
Query: 649 GETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFS 682
D++DG ++ + + +G V CFS
Sbjct: 858 ----------DDIDGNLSVEFVACSRDGRVICFS 881
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 43/206 (20%), Positives = 94/206 (45%), Gaps = 11/206 (5%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHH-GKIREKFPLEMAEIQGAVVAADINDDGKIELV 528
+ D+ G D+++GT+ G YV+D + G+ + + I+ V D+N+ G +++
Sbjct: 713 ISDIGGLPGSDVVIGTTTGRVYVVDGNTGQQLWQANIGNVFIEDLRVTPDMNNSGYPDVL 772
Query: 529 TTDTHGNVAAWTAE-GKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGK 587
+ + N+ + G IW+ + + + D+ GDG+ + +L+ +++ G+
Sbjct: 773 VSGINPNIFLLEGQNGNVIWQNNTGGNILGKDVLSDLTGDGYPEFGSASLNNIVHIYDGR 832
Query: 588 DGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVD 647
G Y + G +VDL + + + V S DG + G T D++
Sbjct: 833 TGQIKFQYAFGGGGNSTAAEHVVDLDDI-DGNLSVEFVACSRDGRVICFSGGT---DIIP 888
Query: 648 IGETSYSMVLADNVDGGDDLDLIVTT 673
+ T++ + G+D++L+ +T
Sbjct: 889 VEMTTFYATV-----NGNDVNLVWST 909
>gi|417815832|ref|ZP_12462464.1| FG-GAP repeat family protein [Vibrio cholerae HCUF01]
gi|418331978|ref|ZP_12942914.1| FG-GAP repeat family protein [Vibrio cholerae HC-06A1]
gi|419825379|ref|ZP_14348884.1| hypothetical protein VCCP10336_0988 [Vibrio cholerae CP1033(6)]
gi|421324555|ref|ZP_15775081.1| FG-GAP repeat family protein [Vibrio cholerae CP1041(14)]
gi|421334809|ref|ZP_15785276.1| FG-GAP repeat family protein [Vibrio cholerae CP1048(21)]
gi|423164373|ref|ZP_17151141.1| FG-GAP repeat family protein [Vibrio cholerae HC-48B2]
gi|423751722|ref|ZP_17711831.1| hypothetical protein VCHC50A2_1005 [Vibrio cholerae HC-50A2]
gi|423892206|ref|ZP_17725889.1| hypothetical protein VCHC62A1_1033 [Vibrio cholerae HC-62A1]
gi|424001526|ref|ZP_17744612.1| hemolysin [Vibrio cholerae HC-17A2]
gi|424026498|ref|ZP_17766111.1| hemolysin [Vibrio cholerae HC-69A1]
gi|424612755|ref|ZP_18051558.1| FG-GAP repeat family protein [Vibrio cholerae HC-41A1]
gi|424644496|ref|ZP_18082244.1| FG-GAP repeat family protein [Vibrio cholerae HC-56A2]
gi|340041558|gb|EGR02524.1| FG-GAP repeat family protein [Vibrio cholerae HCUF01]
gi|356419390|gb|EHH72937.1| FG-GAP repeat family protein [Vibrio cholerae HC-06A1]
gi|356455912|gb|EHI08541.1| FG-GAP repeat family protein [Vibrio cholerae HC-48B2]
gi|395920225|gb|EJH31047.1| FG-GAP repeat family protein [Vibrio cholerae CP1041(14)]
gi|395936670|gb|EJH47393.1| FG-GAP repeat family protein [Vibrio cholerae CP1048(21)]
gi|395961300|gb|EJH71635.1| FG-GAP repeat family protein [Vibrio cholerae HC-56A2]
gi|408015493|gb|EKG53075.1| FG-GAP repeat family protein [Vibrio cholerae HC-41A1]
gi|408610916|gb|EKK84281.1| hypothetical protein VCCP10336_0988 [Vibrio cholerae CP1033(6)]
gi|408639069|gb|EKL10914.1| hypothetical protein VCHC50A2_1005 [Vibrio cholerae HC-50A2]
gi|408658249|gb|EKL29319.1| hypothetical protein VCHC62A1_1033 [Vibrio cholerae HC-62A1]
gi|408848662|gb|EKL88707.1| hemolysin [Vibrio cholerae HC-17A2]
gi|408880279|gb|EKM19204.1| hemolysin [Vibrio cholerae HC-69A1]
Length = 939
Score = 45.8 bits (107), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 224 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 280
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 281 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 315
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 316 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 362
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 363 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 393
>gi|421328216|ref|ZP_15778730.1| FG-GAP repeat family protein [Vibrio cholerae CP1042(15)]
gi|395929722|gb|EJH40471.1| FG-GAP repeat family protein [Vibrio cholerae CP1042(15)]
Length = 957
Score = 45.8 bits (107), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 380
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 381 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 411
>gi|121607456|ref|YP_995263.1| FG-GAP repeat-containing protein [Verminephrobacter eiseniae EF01-2]
gi|121552096|gb|ABM56245.1| FG-GAP repeat protein [Verminephrobacter eiseniae EF01-2]
Length = 2474
Score = 45.8 bits (107), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 60/118 (50%), Gaps = 10/118 (8%)
Query: 467 SPTVVDLDGDGNLDI-LVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKI 525
+PTV D DGDG DI + G S + V G + + + A A D + DG++
Sbjct: 1129 APTVADFDGDGVPDIGVAGASH--YSVFRADGSLLWRTRIYDASANTGSTAFDFDGDGRV 1186
Query: 526 ELVTTDTHGNVAAW-TAEGKGIWEQHLKSLVTQG---PSIGDVDGDGHSDVVVPTLSG 579
E++ D G + + A G+ + H S + G P + DVDGDGH+D+VVP +G
Sbjct: 1187 EVIYRD-QGTLRIFDGATGQVLL--HTDSSSSTGAEYPLVADVDGDGHADLVVPGDAG 1241
>gi|302412583|ref|XP_003004124.1| FG-GAP repeat domain-containing protein [Verticillium albo-atrum
VaMs.102]
gi|261356700|gb|EEY19128.1| FG-GAP repeat domain-containing protein [Verticillium albo-atrum
VaMs.102]
Length = 1130
Score = 45.8 bits (107), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 57/228 (25%), Positives = 90/228 (39%), Gaps = 39/228 (17%)
Query: 430 YVAGAIVVF------NLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILV 483
Y GA+ F N + +Q W LDL T S D+DGDG D
Sbjct: 457 YDGGAVKSFLNTGNLNSNGEQRNW---LDLGTIVPGVSGVTGSMIRFADMDGDGQSD--- 510
Query: 484 GTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVA-ADINDDGKIELVTTDTHGNVAAWTAE 542
F + G IR L + +GA + AD+ DGK ++V+ + G AW +
Sbjct: 511 ------FLAVADDGSIRMWKNLGIVGTKGASLRFADLTGDGKADIVSVNHKGRARAWLNK 564
Query: 543 GKGIW----------EQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNI--YVLSGKDGS 590
G G+W ++ L S DV+GD +D +V G + Y+ +G
Sbjct: 565 GLGVWDDIGEIAPGLDEDLSSATIH---FADVNGDKRADFLVVYGEGGVKAYLNNGNLPD 621
Query: 591 KVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDG 638
K + ++T Q + + + G K + + + YL + DG
Sbjct: 622 KGKDRIWQT-----GQTISEGVGEPGRKVRFADLNGDGYADYLVVFDG 664
>gi|15640946|ref|NP_230577.1| hemolysin-like protein [Vibrio cholerae O1 biovar El Tor str.
N16961]
gi|153822095|ref|ZP_01974762.1| hypothetical protein A5E_1102 [Vibrio cholerae B33]
gi|229510863|ref|ZP_04400342.1| hemolysin-related protein RbmC [Vibrio cholerae B33]
gi|229517984|ref|ZP_04407428.1| hemolysin-related protein RbmC [Vibrio cholerae RC9]
gi|229608486|ref|YP_002879134.1| hemolysin-related protein RbmC [Vibrio cholerae MJ-1236]
gi|254848063|ref|ZP_05237413.1| hemolysin-related protein [Vibrio cholerae MO10]
gi|255744717|ref|ZP_05418668.1| hemolysin-related protein RbmC [Vibrio cholera CIRS 101]
gi|262161149|ref|ZP_06030260.1| hemolysin-related protein RbmC [Vibrio cholerae INDRE 91/1]
gi|360034836|ref|YP_004936599.1| hemolysin-related protein [Vibrio cholerae O1 str. 2010EL-1786]
gi|379740764|ref|YP_005332733.1| hemolysin-like protein [Vibrio cholerae IEC224]
gi|418336725|ref|ZP_12945623.1| FG-GAP repeat family protein [Vibrio cholerae HC-23A1]
gi|418348392|ref|ZP_12953126.1| FG-GAP repeat family protein [Vibrio cholerae HC-43A1]
gi|418355056|ref|ZP_12957777.1| FG-GAP repeat family protein [Vibrio cholerae HC-61A1]
gi|421316145|ref|ZP_15766716.1| FG-GAP repeat family protein [Vibrio cholerae CP1032(5)]
gi|421331235|ref|ZP_15781715.1| FG-GAP repeat family protein [Vibrio cholerae CP1046(19)]
gi|421338704|ref|ZP_15789139.1| FG-GAP repeat family protein [Vibrio cholerae HC-20A2]
gi|421346791|ref|ZP_15797173.1| FG-GAP repeat family protein [Vibrio cholerae HC-46A1]
gi|422891045|ref|ZP_16933435.1| FG-GAP repeat family protein [Vibrio cholerae HC-40A1]
gi|422901922|ref|ZP_16937262.1| FG-GAP repeat family protein [Vibrio cholerae HC-48A1]
gi|422906142|ref|ZP_16940976.1| FG-GAP repeat family protein [Vibrio cholerae HC-70A1]
gi|422912732|ref|ZP_16947251.1| FG-GAP repeat family protein [Vibrio cholerae HFU-02]
gi|422925213|ref|ZP_16958238.1| FG-GAP repeat family protein [Vibrio cholerae HC-38A1]
gi|423144531|ref|ZP_17132140.1| FG-GAP repeat family protein [Vibrio cholerae HC-19A1]
gi|423149211|ref|ZP_17136539.1| FG-GAP repeat family protein [Vibrio cholerae HC-21A1]
gi|423153027|ref|ZP_17140221.1| FG-GAP repeat family protein [Vibrio cholerae HC-22A1]
gi|423155839|ref|ZP_17142943.1| FG-GAP repeat family protein [Vibrio cholerae HC-32A1]
gi|423730498|ref|ZP_17703812.1| hypothetical protein VCHC17A1_1173 [Vibrio cholerae HC-17A1]
gi|423926983|ref|ZP_17730505.1| hypothetical protein VCHC77A1_1037 [Vibrio cholerae HC-77A1]
gi|424005687|ref|ZP_17748667.1| hemolysin [Vibrio cholerae HC-37A1]
gi|424585824|ref|ZP_18025414.1| FG-GAP repeat family protein [Vibrio cholerae CP1030(3)]
gi|424594521|ref|ZP_18033854.1| FG-GAP repeat family protein [Vibrio cholerae CP1040(13)]
gi|424598387|ref|ZP_18037581.1| FG-GAP repeat family protein [Vibrio Cholerae CP1044(17)]
gi|424606119|ref|ZP_18045079.1| FG-GAP repeat family protein [Vibrio cholerae CP1050(23)]
gi|424609953|ref|ZP_18048807.1| FG-GAP repeat family protein [Vibrio cholerae HC-39A1]
gi|424616572|ref|ZP_18055259.1| FG-GAP repeat family protein [Vibrio cholerae HC-42A1]
gi|424652175|ref|ZP_18089651.1| FG-GAP repeat family protein [Vibrio cholerae HC-57A2]
gi|440709208|ref|ZP_20889865.1| hemolysin-related protein RbmC [Vibrio cholerae 4260B]
gi|443503034|ref|ZP_21070016.1| FG-GAP repeat family protein [Vibrio cholerae HC-64A1]
gi|443506942|ref|ZP_21073726.1| FG-GAP repeat family protein [Vibrio cholerae HC-65A1]
gi|443511059|ref|ZP_21077716.1| FG-GAP repeat family protein [Vibrio cholerae HC-67A1]
gi|443514616|ref|ZP_21081147.1| FG-GAP repeat family protein [Vibrio cholerae HC-68A1]
gi|443518421|ref|ZP_21084831.1| FG-GAP repeat family protein [Vibrio cholerae HC-71A1]
gi|443523309|ref|ZP_21089538.1| FG-GAP repeat family protein [Vibrio cholerae HC-72A2]
gi|443530919|ref|ZP_21096934.1| FG-GAP repeat family protein [Vibrio cholerae HC-7A1]
gi|443538265|ref|ZP_21104120.1| FG-GAP repeat family protein [Vibrio cholerae HC-81A1]
gi|9655388|gb|AAF94092.1| hemolysin-related protein [Vibrio cholerae O1 biovar El Tor str.
N16961]
gi|126520367|gb|EAZ77590.1| hypothetical protein A5E_1102 [Vibrio cholerae B33]
gi|229344699|gb|EEO09673.1| hemolysin-related protein RbmC [Vibrio cholerae RC9]
gi|229350828|gb|EEO15769.1| hemolysin-related protein RbmC [Vibrio cholerae B33]
gi|229371141|gb|ACQ61564.1| hemolysin-related protein RbmC [Vibrio cholerae MJ-1236]
gi|254843768|gb|EET22182.1| hemolysin-related protein [Vibrio cholerae MO10]
gi|255737748|gb|EET93142.1| hemolysin-related protein RbmC [Vibrio cholera CIRS 101]
gi|262028899|gb|EEY47552.1| hemolysin-related protein RbmC [Vibrio cholerae INDRE 91/1]
gi|341624090|gb|EGS49600.1| FG-GAP repeat family protein [Vibrio cholerae HC-70A1]
gi|341624853|gb|EGS50332.1| FG-GAP repeat family protein [Vibrio cholerae HC-48A1]
gi|341625516|gb|EGS50962.1| FG-GAP repeat family protein [Vibrio cholerae HC-40A1]
gi|341639916|gb|EGS64521.1| FG-GAP repeat family protein [Vibrio cholerae HFU-02]
gi|341647526|gb|EGS71603.1| FG-GAP repeat family protein [Vibrio cholerae HC-38A1]
gi|356420128|gb|EHH73656.1| FG-GAP repeat family protein [Vibrio cholerae HC-21A1]
gi|356425389|gb|EHH78759.1| FG-GAP repeat family protein [Vibrio cholerae HC-19A1]
gi|356431827|gb|EHH85026.1| FG-GAP repeat family protein [Vibrio cholerae HC-22A1]
gi|356432303|gb|EHH85500.1| FG-GAP repeat family protein [Vibrio cholerae HC-23A1]
gi|356442139|gb|EHH95001.1| FG-GAP repeat family protein [Vibrio cholerae HC-32A1]
gi|356447131|gb|EHH99921.1| FG-GAP repeat family protein [Vibrio cholerae HC-43A1]
gi|356453458|gb|EHI06121.1| FG-GAP repeat family protein [Vibrio cholerae HC-61A1]
gi|356645990|gb|AET26045.1| hemolysin-related protein [Vibrio cholerae O1 str. 2010EL-1786]
gi|378794274|gb|AFC57745.1| hemolysin-related protein [Vibrio cholerae IEC224]
gi|395921102|gb|EJH31922.1| FG-GAP repeat family protein [Vibrio cholerae CP1032(5)]
gi|395932499|gb|EJH43242.1| FG-GAP repeat family protein [Vibrio cholerae CP1046(19)]
gi|395943652|gb|EJH54326.1| FG-GAP repeat family protein [Vibrio cholerae HC-20A2]
gi|395945851|gb|EJH56515.1| FG-GAP repeat family protein [Vibrio cholerae HC-46A1]
gi|395962767|gb|EJH73058.1| FG-GAP repeat family protein [Vibrio cholerae HC-57A2]
gi|395965697|gb|EJH75858.1| FG-GAP repeat family protein [Vibrio cholerae HC-42A1]
gi|395976787|gb|EJH86227.1| FG-GAP repeat family protein [Vibrio cholerae CP1030(3)]
gi|408008915|gb|EKG46860.1| FG-GAP repeat family protein [Vibrio cholerae HC-39A1]
gi|408036169|gb|EKG72615.1| FG-GAP repeat family protein [Vibrio cholerae CP1040(13)]
gi|408044258|gb|EKG80197.1| FG-GAP repeat family protein [Vibrio Cholerae CP1044(17)]
gi|408045569|gb|EKG81382.1| FG-GAP repeat family protein [Vibrio cholerae CP1050(23)]
gi|408626339|gb|EKK99208.1| hypothetical protein VCHC17A1_1173 [Vibrio cholerae HC-17A1]
gi|408656894|gb|EKL27985.1| hypothetical protein VCHC77A1_1037 [Vibrio cholerae HC-77A1]
gi|408847086|gb|EKL87157.1| hemolysin [Vibrio cholerae HC-37A1]
gi|439974797|gb|ELP50933.1| hemolysin-related protein RbmC [Vibrio cholerae 4260B]
gi|443432345|gb|ELS74873.1| FG-GAP repeat family protein [Vibrio cholerae HC-64A1]
gi|443435975|gb|ELS82098.1| FG-GAP repeat family protein [Vibrio cholerae HC-65A1]
gi|443439763|gb|ELS89459.1| FG-GAP repeat family protein [Vibrio cholerae HC-67A1]
gi|443443860|gb|ELS97142.1| FG-GAP repeat family protein [Vibrio cholerae HC-68A1]
gi|443447470|gb|ELT04112.1| FG-GAP repeat family protein [Vibrio cholerae HC-71A1]
gi|443450409|gb|ELT10684.1| FG-GAP repeat family protein [Vibrio cholerae HC-72A2]
gi|443458002|gb|ELT25398.1| FG-GAP repeat family protein [Vibrio cholerae HC-7A1]
gi|443465854|gb|ELT40513.1| FG-GAP repeat family protein [Vibrio cholerae HC-81A1]
Length = 957
Score = 45.8 bits (107), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 380
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 381 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 411
>gi|121728035|ref|ZP_01681073.1| hypothetical protein VCV52_0890 [Vibrio cholerae V52]
gi|121629664|gb|EAX62084.1| hypothetical protein VCV52_0890 [Vibrio cholerae V52]
Length = 957
Score = 45.8 bits (107), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 380
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 381 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 411
>gi|254225743|ref|ZP_04919349.1| hypothetical protein VCV51_0642 [Vibrio cholerae V51]
gi|125621751|gb|EAZ50079.1| hypothetical protein VCV51_0642 [Vibrio cholerae V51]
Length = 957
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 379
>gi|297581312|ref|ZP_06943236.1| conserved hypothetical protein [Vibrio cholerae RC385]
gi|297534628|gb|EFH73465.1| conserved hypothetical protein [Vibrio cholerae RC385]
Length = 957
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 379
>gi|423951815|ref|ZP_17733833.1| hypothetical protein VCHE40_0917 [Vibrio cholerae HE-40]
gi|408660655|gb|EKL31665.1| hypothetical protein VCHE40_0917 [Vibrio cholerae HE-40]
Length = 957
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 379
>gi|346975254|gb|EGY18706.1| FG-GAP repeat domain-containing protein [Verticillium dahliae
VdLs.17]
Length = 1158
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 57/228 (25%), Positives = 90/228 (39%), Gaps = 39/228 (17%)
Query: 430 YVAGAIVVF------NLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILV 483
Y GA+ F N + +Q W LDL T S D+DGDG D
Sbjct: 476 YDGGAVKSFLNTGNLNSNGEQRNW---LDLGTIVPGVSGVTGSMIRFADMDGDGQSD--- 529
Query: 484 GTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVA-ADINDDGKIELVTTDTHGNVAAWTAE 542
F + G IR L + +GA + AD+ DGK ++V+ + G AW +
Sbjct: 530 ------FLAVADDGSIRMWKNLGIVGTKGASLRFADLTGDGKADIVSVNHKGRARAWLNK 583
Query: 543 GKGIW----------EQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNI--YVLSGKDGS 590
G G+W ++ L S DV+GD +D +V G + Y+ +G
Sbjct: 584 GLGVWDDIGEIAPGLDEDLSSATIH---FADVNGDKRADFLVVYGEGGVKAYLNNGNLPD 640
Query: 591 KVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDG 638
K + ++T Q + + + G K + + + YL + DG
Sbjct: 641 KGKDRIWQT-----GQTISEGVGEPGRKVRFADLNGDGYADYLVVFDG 683
>gi|229523275|ref|ZP_04412682.1| hemolysin-related protein RbmC [Vibrio cholerae TM 11079-80]
gi|229339638|gb|EEO04653.1| hemolysin-related protein RbmC [Vibrio cholerae TM 11079-80]
Length = 957
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 379
>gi|384424105|ref|YP_005633463.1| Hemolysin-like protein RbmC [Vibrio cholerae LMA3984-4]
gi|327483658|gb|AEA78065.1| Hemolysin-like protein RbmC [Vibrio cholerae LMA3984-4]
Length = 886
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 171 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 227
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 228 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 262
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 263 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 308
>gi|424590172|ref|ZP_18029610.1| FG-GAP repeat family protein [Vibrio cholerae CP1037(10)]
gi|408035209|gb|EKG71684.1| FG-GAP repeat family protein [Vibrio cholerae CP1037(10)]
Length = 886
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 171 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 227
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 228 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 262
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 263 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 308
>gi|229512975|ref|ZP_04402441.1| hemolysin-related protein RbmC [Vibrio cholerae TMA 21]
gi|229349868|gb|EEO14822.1| hemolysin-related protein RbmC [Vibrio cholerae TMA 21]
Length = 957
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 379
>gi|153828960|ref|ZP_01981627.1| hemolysin [Vibrio cholerae 623-39]
gi|148875576|gb|EDL73711.1| hemolysin [Vibrio cholerae 623-39]
Length = 886
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 171 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 227
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 228 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 262
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 263 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 309
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 310 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 340
>gi|421350712|ref|ZP_15801077.1| FG-GAP repeat family protein [Vibrio cholerae HE-25]
gi|395951157|gb|EJH61771.1| FG-GAP repeat family protein [Vibrio cholerae HE-25]
Length = 886
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 171 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 227
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 228 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 262
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 263 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 309
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 310 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 340
>gi|254285602|ref|ZP_04960566.1| hypothetical protein A33_0870 [Vibrio cholerae AM-19226]
gi|150424464|gb|EDN16401.1| hypothetical protein A33_0870 [Vibrio cholerae AM-19226]
Length = 957
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRE 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 380
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 381 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 411
>gi|373457062|ref|ZP_09548829.1| integral membrane sensor signal transduction histidine kinase
[Caldithrix abyssi DSM 13497]
gi|371718726|gb|EHO40497.1| integral membrane sensor signal transduction histidine kinase
[Caldithrix abyssi DSM 13497]
Length = 1046
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 65/236 (27%), Positives = 93/236 (39%), Gaps = 33/236 (13%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAV-----VAADIND 521
+ +V D D DG D+ + F + + G +R K + + G V + D N+
Sbjct: 106 AASVADYDNDGLPDLFLAGWGKTFRLFKNLGNVRFKDVTPLLNLHGVVDANQGIWFDANN 165
Query: 522 DGKIELVTTDTHGNVAAWTAEGKG-----IW-EQHLKSLVTQGPSIGDVDGDGHSDVVVP 575
DG ++L TD H + G IW E+ L S +QG D D DG D+ V
Sbjct: 166 DGFLDLYITDEHHANRLLLNKKDGTFREQIWSEEFLDSATSQGALAADFDADGDVDLYVC 225
Query: 576 T-LSGNIYVLSGKDGSKVR---PYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDG 631
G+ +L+ G R P N + DL G+ L I+ + DG
Sbjct: 226 NWFQGDYLLLNDGTGLFHRWQLNLPTLQKAYNSNSAVAADLDNDGD----LDILVATRDG 281
Query: 632 YLYLI------DGPTSCADVVD----IGETSYSMVLAD-NVDGGDDLDLIVTTMNG 676
++ DG AD + IG Y MV D N DG D I T+NG
Sbjct: 282 LVFYYENQSAGDGLLFSADTLKPFYRIGRQVYGMVSEDFNNDGWLD---IFFTVNG 334
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 61/132 (46%), Gaps = 15/132 (11%)
Query: 461 RAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGK---------IREKFPLEMAEIQ 511
+AY +S DLD DG+LDILV T GL + ++ + F ++
Sbjct: 254 KAYNSNSAVAADLDNDGDLDILVATRDGLVFYYENQSAGDGLLFSADTLKPFYRIGRQVY 313
Query: 512 GAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHL----KSLVTQGPSIGDVDGD 567
G +V+ D N+DG +++ T +G + +G G + + + ++ G S D D D
Sbjct: 314 G-MVSEDFNNDGWLDIFFT-VNGPNRLYLNQGGGRFGESYDTDRRDNISTGASAADFDRD 371
Query: 568 GHSDVVVPTLSG 579
G D++V G
Sbjct: 372 GDLDLLVGNKKG 383
>gi|421342487|ref|ZP_15792893.1| FG-GAP repeat family protein [Vibrio cholerae HC-43B1]
gi|423734156|ref|ZP_17707370.1| hypothetical protein VCHC41B1_0943 [Vibrio cholerae HC-41B1]
gi|424008440|ref|ZP_17751389.1| hemolysin [Vibrio cholerae HC-44C1]
gi|395945238|gb|EJH55908.1| FG-GAP repeat family protein [Vibrio cholerae HC-43B1]
gi|408631276|gb|EKL03827.1| hypothetical protein VCHC41B1_0943 [Vibrio cholerae HC-41B1]
gi|408865624|gb|EKM05019.1| hemolysin [Vibrio cholerae HC-44C1]
Length = 886
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 171 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 227
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 228 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRE 262
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 263 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 308
>gi|418343234|ref|ZP_12950023.1| FG-GAP repeat family protein [Vibrio cholerae HC-28A1]
gi|421320561|ref|ZP_15771118.1| FG-GAP repeat family protein [Vibrio cholerae CP1038(11)]
gi|423159665|ref|ZP_17146633.1| FG-GAP repeat family protein [Vibrio cholerae HC-33A2]
gi|424601135|ref|ZP_18040288.1| FG-GAP repeat family protein [Vibrio cholerae CP1047(20)]
gi|424621523|ref|ZP_18060046.1| FG-GAP repeat family protein [Vibrio cholerae HC-47A1]
gi|424656079|ref|ZP_18093377.1| FG-GAP repeat family protein [Vibrio cholerae HC-81A2]
gi|443534692|ref|ZP_21100591.1| FG-GAP repeat family protein [Vibrio cholerae HC-80A1]
gi|449056562|ref|ZP_21735230.1| Hemolysin-related protein RbmC [Vibrio cholerae O1 str. Inaba
G4222]
gi|356437081|gb|EHH90189.1| FG-GAP repeat family protein [Vibrio cholerae HC-28A1]
gi|356449263|gb|EHI02017.1| FG-GAP repeat family protein [Vibrio cholerae HC-33A2]
gi|395923543|gb|EJH34354.1| FG-GAP repeat family protein [Vibrio cholerae CP1038(11)]
gi|395973332|gb|EJH82894.1| FG-GAP repeat family protein [Vibrio cholerae HC-47A1]
gi|395978077|gb|EJH87467.1| FG-GAP repeat family protein [Vibrio cholerae CP1047(20)]
gi|408056224|gb|EKG91116.1| FG-GAP repeat family protein [Vibrio cholerae HC-81A2]
gi|443462131|gb|ELT33183.1| FG-GAP repeat family protein [Vibrio cholerae HC-80A1]
gi|448264385|gb|EMB01624.1| Hemolysin-related protein RbmC [Vibrio cholerae O1 str. Inaba
G4222]
Length = 886
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 92/212 (43%), Gaps = 62/212 (29%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 171 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 227
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 228 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 262
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG 545
F +LD+ G I+++ P + I G + AD++ DG +E++ D G
Sbjct: 263 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD-------------G 309
Query: 546 IWEQHLKSLVTQGP----SIG-DVDGDGHSDV 572
++ H LV P SI DVDGD +V
Sbjct: 310 VYNYH-SGLVFNHPWAPSSINVDVDGDQQQEV 340
>gi|417820358|ref|ZP_12466972.1| FG-GAP repeat family protein [Vibrio cholerae HE39]
gi|423979085|ref|ZP_17737382.1| hypothetical protein VCHE46_0919 [Vibrio cholerae HE-46]
gi|340037989|gb|EGQ98963.1| FG-GAP repeat family protein [Vibrio cholerae HE39]
gi|408665690|gb|EKL36500.1| hypothetical protein VCHE46_0919 [Vibrio cholerae HE-46]
Length = 886
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 171 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 227
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 228 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 262
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 263 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 308
>gi|261212021|ref|ZP_05926307.1| hemolysin-related protein RbmC [Vibrio sp. RC341]
gi|260838629|gb|EEX65280.1| hemolysin-related protein RbmC [Vibrio sp. RC341]
Length = 957
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 75/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++II F+ Y ++ L G+D +
Sbjct: 242 NQVMVTPVVAQLNDDNGDGKIDEKDVADLIIVT---FEGNKYTTGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVADLDGDGIVEIVTTNNRE 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F VLD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 334 QFITVLDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAPD 379
>gi|419835795|ref|ZP_14359239.1| hemolysin [Vibrio cholerae HC-46B1]
gi|408858549|gb|EKL98223.1| hemolysin [Vibrio cholerae HC-46B1]
Length = 957
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRE 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 379
>gi|153213864|ref|ZP_01949070.1| hypothetical protein A55_1020 [Vibrio cholerae 1587]
gi|124115698|gb|EAY34518.1| hypothetical protein A55_1020 [Vibrio cholerae 1587]
Length = 957
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRE 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 379
>gi|421353712|ref|ZP_15804044.1| FG-GAP repeat family protein [Vibrio cholerae HE-45]
gi|395952837|gb|EJH63450.1| FG-GAP repeat family protein [Vibrio cholerae HE-45]
Length = 957
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TP++A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 242 NQVMVTPIVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYANGGLVRALSGVDGSELW 298
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 299 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRE 333
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 334 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 379
>gi|32470725|ref|NP_863718.1| cysteine proteinase [Rhodopirellula baltica SH 1]
gi|32442870|emb|CAD71389.1| probable cysteine proteinase [Rhodopirellula baltica SH 1]
Length = 383
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 55/230 (23%), Positives = 94/230 (40%), Gaps = 32/230 (13%)
Query: 471 VDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG--AVVAADINDDGKIELV 528
D DGDG+LDI G V + G + + E ++ +++A DI+ DG I+L+
Sbjct: 111 ADFDGDGDLDIASGNDMAKGQVFLNDGSGQFRLHGEYGDVSSLRSLLAHDIDQDGDIDLI 170
Query: 529 TTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG------------DVDGDGHSDVVVPT 576
T I+ ++ GPS G D++GDGH D+V+
Sbjct: 171 AT--------CRGRPNRIYVNDGEANFNSGPSFGRNTDSTIDVAAGDLNGDGHMDLVLAN 222
Query: 577 LSG--NIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLY 634
N +L + R P+ V + DL G + ++
Sbjct: 223 RDHQPNEILLGDAELQFSRSVPFGIGDEQSRAVAVADLNTDGHLD--WVVANIGRANQVF 280
Query: 635 LIDGPTSCADVVDIGET---SYSMVLADNVDGGDDLDLIVTTMN--GNVF 679
DG A + G + +Y++V+AD ++ + LD++V +N G+VF
Sbjct: 281 FGDGAGGVAGSAEFGASDSQTYALVIAD-MNNDEVLDIVVGNLNQPGSVF 329
>gi|422922183|ref|ZP_16955376.1| FG-GAP repeat family protein [Vibrio cholerae BJG-01]
gi|341646808|gb|EGS70913.1| FG-GAP repeat family protein [Vibrio cholerae BJG-01]
Length = 886
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 75/166 (45%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V++ I+ F+ Y N ++ L G+D +
Sbjct: 171 NQVMVTPVVAQLNDDNGDGKIDEKDVADFIVVT---FEGNKYTNGGLVRALSGVDGSELW 227
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP V DLDGDG ++I+ +
Sbjct: 228 SYANGGVIA-------------------DARY------SPAVGDLDGDGIVEIVTTNNRD 262
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + I G + AD++ DG +E++ D
Sbjct: 263 QFITILDNQGNIKKQIPTTESGWRIVGDITLADLDHDGSVEILAAD 308
>gi|262402723|ref|ZP_06079284.1| hemolysin-related protein RbmC [Vibrio sp. RC586]
gi|262351505|gb|EEZ00638.1| hemolysin-related protein RbmC [Vibrio sp. RC586]
Length = 897
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 38/166 (22%), Positives = 77/166 (46%), Gaps = 43/166 (25%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGID---IG 428
+ ++ TPV+A +++D V+++++ F+ Y N ++ L G+D +
Sbjct: 182 NQVMVTPVVAQLNDDNRDGKIDEKDVADLVVVT---FEGNKYTNGGLVRALSGVDGSELW 238
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
Y G ++ +A + SP + DLDGDG ++I+ +
Sbjct: 239 NYANGGVIA-------------------DARY------SPAIGDLDGDGIVEIVTTNNRD 273
Query: 489 LFY-VLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKIELVTTD 531
F +LD+ G I+++ P + +I G + AD++ DG +E++ D
Sbjct: 274 QFITILDNRGNIKKQIPTTESGWKIVGDITLADLDHDGSVEILAAD 319
>gi|427736296|ref|YP_007055840.1| VCBS repeat-containing protein [Rivularia sp. PCC 7116]
gi|427371337|gb|AFY55293.1| VCBS repeat-containing protein [Rivularia sp. PCC 7116]
Length = 345
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 58/123 (47%), Gaps = 7/123 (5%)
Query: 469 TVVDLDGDGNLDILV--GTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIE 526
+V D +GDGN DI V +S + +L + +V AD N D K++
Sbjct: 20 SVADFNGDGNSDIAVVNESSDNVLVLLGDDNNFAAPQSFAVGNSPRSVTVADFNGDEKLD 79
Query: 527 LVTTD-THGNVAAWTAEGKGIWEQHLKSLVTQGPS---IGDVDGDGHSDVVVPTL-SGNI 581
L+T + +V+ +G G ++ V P +GD +GDG+SD+ L SGN+
Sbjct: 80 LITANYLSSDVSVLLGDGNGNFDAAQNFAVGNNPDSLIVGDFNGDGNSDIAAINLSSGNV 139
Query: 582 YVL 584
VL
Sbjct: 140 SVL 142
>gi|78188714|ref|YP_379052.1| hypothetical protein Cag_0738 [Chlorobium chlorochromatii CaD3]
gi|78170913|gb|ABB28009.1| VCBS [Chlorobium chlorochromatii CaD3]
Length = 8871
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 72/326 (22%), Positives = 113/326 (34%), Gaps = 79/326 (24%)
Query: 464 IYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAA------ 517
Y+SP +VD+DGDG+LD+ VG + G + G + + V +
Sbjct: 1800 FYASPRLVDIDGDGDLDVFVGNTDGNTLFFRNTGNATSAAFVSASNFGITDVGSSASPRL 1859
Query: 518 -DINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSL---VT-----QGPSIGDVDGDG 568
DI++DG +++ ++ GN + G + + +T P+ D+DGDG
Sbjct: 1860 VDIDNDGDLDVFVGNSDGNTLFFRNTGNATSAAFVSASNFGITDVGGYAAPTFADIDGDG 1919
Query: 569 HSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTS 628
D V GN +R G + + S I
Sbjct: 1920 DLDAFVGNYEGNTLF-------------FRNTGNATSAAFV--------SSGNFGITDVG 1958
Query: 629 FDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNG-NVFCFSTPAPH 687
+D S LAD +D DLD+ V +G +F +TPAP
Sbjct: 1959 YDA----------------------SPTLAD-IDADGDLDVFVGNYDGVTLFFLNTPAPA 1995
Query: 688 HPLKA-WRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPSGSQ 746
A + NNVA + A DE G ++V + D P+ +Q
Sbjct: 1996 GDTTAPTFDVAPATNNVATT--------SFDLSASIDEAGTIYYV--VVADGATAPTAAQ 2045
Query: 747 APYNVTTTLLVPGNYQGERRIKQSQI 772
+ P Y GE S +
Sbjct: 2046 --------VFDPTTYTGEIASSSSAV 2063
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 44/101 (43%), Gaps = 11/101 (10%)
Query: 76 ATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAF-----------HQSSVHS 124
A P ADI+ DG LD V ++ + + + +K P F ++ S
Sbjct: 4180 AAPTFADIDGDGDLDAFVGNYEGSILFYQNWEQEKYASQPIFVSVETNPFGLTRADNIFS 4239
Query: 125 SPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIP 165
+P DID DG ++ + Y G++LFF G + + P
Sbjct: 4240 APAFADIDNDGDLDLFVGNYKGDMLFFENIGTVSSASFAAP 4280
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 5/85 (5%)
Query: 76 ATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPA---FHQSSV--HSSPLLYD 130
A+P + DI+SDG LD V +F + + + + F + V ++SP L D
Sbjct: 1749 ASPTLVDIDSDGDLDAFVGNFDGFTRFFRNTANATSAAFVSSGNFGITDVGFYASPRLVD 1808
Query: 131 IDKDGVREIALATYNGEVLFFRVSG 155
ID DG ++ + +G LFFR +G
Sbjct: 1809 IDGDGDLDVFVGNTDGNTLFFRNTG 1833
Score = 40.8 bits (94), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 56/134 (41%), Gaps = 18/134 (13%)
Query: 466 SSPTVVDLDGDGNLDILVGT-SFGLFYVLDHHGKIREKFPLEMAEIQ------GAVVA-- 516
+ PT D+DGDG+LD VG+ G+ + + F + + Q GA VA
Sbjct: 1177 AQPTFADIDGDGDLDAFVGSYENGILFFRNTGNASMAAFAVSVDATQFGLTNVGAYVAPT 1236
Query: 517 -ADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSL---VT-----QGPSIGDVDGD 567
ADI+ DG + + GN + G + S +T P+ D+DGD
Sbjct: 1237 FADIDGDGDFDAFVGNKDGNTLFFRNTGNVTSAAFVSSANFGITDIGSYAAPTFADIDGD 1296
Query: 568 GHSDVVVPTLSGNI 581
G DV V + I
Sbjct: 1297 GDLDVFVGSYKSGI 1310
>gi|433458880|ref|ZP_20416759.1| heme peroxidase [Arthrobacter crystallopoietes BAB-32]
gi|432192635|gb|ELK49479.1| heme peroxidase [Arthrobacter crystallopoietes BAB-32]
Length = 1950
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 48/201 (23%), Positives = 83/201 (41%), Gaps = 20/201 (9%)
Query: 472 DLDGDGNLDIL----VGTSFGLFYVLDHHGKIREKFPLEMA--EIQGAVVAADINDDGKI 525
D +GD D++ VGT Y D G + + + + + + D+N DG
Sbjct: 1750 DFNGDDKADLIAKDRVGTGQLWLYPGDGAGGLDRRIRIGASGWNAMNHLFSGDMNGDGTA 1809
Query: 526 ELVTTDTHGNVAAWTAEGKGIWEQHLKSL--------VTQGPSIGDVDGDGHSDVVVPTL 577
+L+ D +G + WT G G + L +T S GD D DG+SDV+ T
Sbjct: 1810 DLMARDVNGRL--WTYPGNGSGRFEPRELTGSRGWNSITALLSPGDFDDDGNSDVIARTT 1867
Query: 578 SGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLI- 636
G +++ G + +RP G Q + G+ + ++ + DG L+L
Sbjct: 1868 DGRLWLYPGNGAAGLRPRSVIGTGWNGMQAIFSPGDPSGDGNP--DVIASDSDGRLWLYP 1925
Query: 637 -DGPTSCADVVDIGETSYSMV 656
+G D IG + ++ +
Sbjct: 1926 GNGTGGWQDRQQIGRSGWTAM 1946
>gi|108757481|ref|YP_635562.1| hypothetical protein MXAN_7458 [Myxococcus xanthus DK 1622]
gi|108461361|gb|ABF86546.1| FG-GAP repeat/fibronectin type III domain protein [Myxococcus
xanthus DK 1622]
Length = 1267
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 68/148 (45%), Gaps = 17/148 (11%)
Query: 463 YIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPL-EMAEIQGAVVAADIND 521
Y + P V+D+DGDG +++V L VL H G R E +G+VV D +
Sbjct: 681 YDITPPAVLDIDGDGAPEVVVSQRDSLL-VLRHTGGGRLAVAFREPFAPRGSVVTGDFDQ 739
Query: 522 DGKIEL--VTTDTHG----NVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDV--- 572
DG ++L V HG + +T E G + + VT D +GDGH D+
Sbjct: 740 DGLVDLMVVGPSLHGFYGRGLLTFTKEFLGTLDGPVPHAVTT-----DFNGDGHLDIVAL 794
Query: 573 VVPTLSGNIYVLSGKDGSKVRPYPYRTH 600
VV +I++L+G DG P R H
Sbjct: 795 VVGNRESSIHLLTG-DGHGRFTAPVRLH 821
>gi|294633243|ref|ZP_06711802.1| secreted protein [Streptomyces sp. e14]
gi|292831024|gb|EFF89374.1| secreted protein [Streptomyces sp. e14]
Length = 571
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 69/250 (27%), Positives = 101/250 (40%), Gaps = 55/250 (22%)
Query: 432 AGAIVVFNLD--------TKQVKWTTDLDLSTDNASFRAYIYSS--PTV--------VDL 473
AGA F+LD + + TT + LST+ +F Y + P+V DL
Sbjct: 72 AGANPQFDLDGDGIDETLVRLMDGTTGVWLSTEGGAFSTYTIGADDPSVRQKDIITPGDL 131
Query: 474 DGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMA---EIQGAVVA-ADINDDGKIELVT 529
DG G D+L ++ G + G + + +I V A D+ DGK +L+
Sbjct: 132 DGTGGPDVLTLSTTGTLSLYHSAGTTGTGYAVWSGGGWQIYNKVFAPGDVTGDGKPDLLA 191
Query: 530 TDTHGNVAAWTAEGK------------GIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTL 577
G++ + G G W + LV +GD++ DG DVV
Sbjct: 192 RTYAGDLYLYAGTGSESAPFRGRVKVGGGWGAY-DQLV----GVGDMNKDGLGDVVARNA 246
Query: 578 SGNIYVLSGKDGSKVRPYPYRTHGRVM--------NQVLLV-DLTKRGEKSKGLTIVTTS 628
SG++Y +GK GS P G+V+ NQ+ D G+ L TTS
Sbjct: 247 SGDLYYYAGK-GSTTAPL----AGKVLIGGGWNTYNQIAAAYDWNSDGQND--LVARTTS 299
Query: 629 FDGYLYLIDG 638
D Y YL DG
Sbjct: 300 GDMYFYLSDG 309
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 68/297 (22%), Positives = 119/297 (40%), Gaps = 51/297 (17%)
Query: 472 DLDGDGNLDILV--------------GTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAA 517
DLDGDG + LV G +F + + +R+K +I +
Sbjct: 79 DLDGDGIDETLVRLMDGTTGVWLSTEGGAFSTYTIGADDPSVRQK------DI---ITPG 129
Query: 518 DINDDGKIELVTTDTHGNV-----AAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDV 572
D++ G +++T T G + A T G +W + + + GDV GDG D+
Sbjct: 130 DLDGTGGPDVLTLSTTGTLSLYHSAGTTGTGYAVWSGGGWQIYNKVFAPGDVTGDGKPDL 189
Query: 573 VVPTLSGNIYVLSGKDGSKVRPYPYRTH-----GRVMNQVLLVDLTKRGEKSKGLTIVTT 627
+ T +G++Y+ +G GS+ P+ R G V + D+ K G +V
Sbjct: 190 LARTYAGDLYLYAGT-GSESAPFRGRVKVGGGWGAYDQLVGVGDMNKDGLGD----VVAR 244
Query: 628 SFDGYLYLIDG------PTSCADVVDIGETSYSMVLAD---NVDGGDDLDLIVTTMNGNV 678
+ G LY G P + ++ G +Y+ + A N DG + DL+ T +G++
Sbjct: 245 NASGDLYYYAGKGSTTAPLAGKVLIGGGWNTYNQIAAAYDWNSDGQN--DLVARTTSGDM 302
Query: 679 FCFSTPAPHHPLKAWRSINQGRNNVAIRYNR-AGIYVTHPSRAFRDEEGRNFWVEIE 734
+ + + L R + G N + + A S D +G FW ++
Sbjct: 303 YFYLSDGKGKFLPRQR-VGSGFNGIDLLVGSGANPRFGKTSLIGMDTKGTLFWYGVQ 358
>gi|149917525|ref|ZP_01906022.1| FG-GAP repeat/HVR domain protein [Plesiocystis pacifica SIR-1]
gi|149821588|gb|EDM80986.1| FG-GAP repeat/HVR domain protein [Plesiocystis pacifica SIR-1]
Length = 584
Score = 45.4 bits (106), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 79/168 (47%), Gaps = 24/168 (14%)
Query: 386 TPVIADIDNDGVSEMIIAVSYFFDHE----YYDNPEHLKELGGIDIGKYVAGAIVVFNLD 441
TP + DID DG+ E++ A + +++ LK L + GAI + ++D
Sbjct: 214 TPALGDIDGDGLVEVVAAAYAMGTSQPRLAAFEHDGALKWLSDTTWDSHQDGAIALADVD 273
Query: 442 TK-QVKWTTDLDL-STDNASFRA-YI--------YSSPTVVDLDGDGNLDILVGTSFGLF 490
V+ + L + A A Y+ YS+ T+ DLDGD +L+ILVG +
Sbjct: 274 NDGDVEIAAGVQLFDHEGAQLWANYVNGNPLYGHYSATTMADLDGDEDLEILVGHA---- 329
Query: 491 YVLDHHGKIREKFP-LEMAEIQGAVV---AADINDDGKIELVTTDTHG 534
+H G + +FP E++ GA AD++DDG E++ + G
Sbjct: 330 -AFEHDGALIWEFPEYEVSSQWGASAHAQVADLDDDGAPEVIIAASQG 376
>gi|148266000|ref|YP_001232706.1| hypothetical protein Gura_3989 [Geobacter uraniireducens Rf4]
gi|146399500|gb|ABQ28133.1| conserved repeat domain protein [Geobacter uraniireducens Rf4]
Length = 2528
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 64/240 (26%), Positives = 106/240 (44%), Gaps = 47/240 (19%)
Query: 387 PVIADIDNDGVSEMIIAVSYF-FDHEYYDNPEH-LKELGGIDIGKYVAGAIVVFNLDTKQ 444
PV+AD+D DG SE+I + F +D P++ L+ G+ G + ++ ++D KQ
Sbjct: 1023 PVLADLDGDGHSEIIYGPTIFNWDGSVRCAPDYRLRGGAGMSNGFGYSVSVADLDMDGKQ 1082
Query: 445 --------------VKW--TTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
+KW ++ D T +F Y P VV L + N G G
Sbjct: 1083 EIIAGYTAYNNDCTIKWINSSLTDGFTAVGNFDNDPY--PEVVLLT-NLNPPPFAGGVGG 1139
Query: 489 LFYVLDHHGKIREKFPLEMAEIQ---------GAVVAADINDDGKIEL--------VTTD 531
Y+LDH G+++ P+ + ++ G V AD + DGK E+ + D
Sbjct: 1140 RVYLLDHTGQVKWG-PVFLKNLEPTAKYDMTGGHTVIADFDGDGKPEIGIRGESKYLILD 1198
Query: 532 THGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIY-VLSGKDGS 590
+ G + A W+ + + PS+ D++GDG V++ SG + V GKDG+
Sbjct: 1199 SEGRLKMTLAIPNSGWQMDVPA-----PSVFDLNGDGRPKVLI--HSGPYFRVFEGKDGT 1251
>gi|444526300|gb|ELV14287.1| Integrin alpha-9 [Tupaia chinensis]
Length = 508
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 63/153 (41%), Gaps = 37/153 (24%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIND 521
+Y SS VDL+GDG D+LVG A + ++I D
Sbjct: 174 SYFGSSLCAVDLNGDGLSDLLVG----------------------------APMFSEIRD 205
Query: 522 DGKIELVTTDTHGNVAAWTA-EGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT---- 576
+G++ + +G + A G G + H + S+GD+D DG DV +
Sbjct: 206 EGQVTVYINRGNGALEEQLALAGDGAYNAHFGESIA---SLGDLDDDGFPDVAIGAPKED 262
Query: 577 -LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVL 608
+G +Y+ G G V Y + G+ +N VL
Sbjct: 263 DFAGAVYIYHGDAGGIVPQYSMKLSGQKINPVL 295
>gi|322436622|ref|YP_004218834.1| hypothetical protein AciX9_3034 [Granulicella tundricola MP5ACTX9]
gi|321164349|gb|ADW70054.1| FG-GAP repeat protein [Granulicella tundricola MP5ACTX9]
Length = 1344
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 67/133 (50%), Gaps = 18/133 (13%)
Query: 472 DLDGDGNLDILVGTSFGL-FYVL--DHHGKIREKFPLEMAEIQGAVVAADINDDGKIELV 528
DL+GDG D++V + G+ YVL D G+++ L+ + + D+N DG+I+LV
Sbjct: 242 DLNGDGVNDLMVAGADGISLYVLMSDATGQLKAPALLQPPHGARSFLLKDLNGDGRIDLV 301
Query: 529 TTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG------------DVDGDGHSDVVVPT 576
T+G + + G G + S++ G S G +VDGDG D++ T
Sbjct: 302 VEGTNGEIDIYPGAGDGTFGT--TSIIGSGASDGTTGHGGHLIAAANVDGDGVLDLLTVT 359
Query: 577 LSGNIYVLSGKDG 589
+G + VL G+ G
Sbjct: 360 PAG-LSVLKGQPG 371
>gi|301757685|ref|XP_002914697.1| PREDICTED: integrin alpha-9-like [Ailuropoda melanoleuca]
Length = 976
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 62/153 (40%), Gaps = 37/153 (24%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIND 521
+Y SS VDL+ DG D+LVG A + ++I D
Sbjct: 243 SYFGSSLCAVDLNADGLSDLLVG----------------------------APMFSEIRD 274
Query: 522 DGKIELVTTDTHGNVAAWTA-EGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT---- 576
+G++ + +G + A G G + H + S+GD+D DG DV +
Sbjct: 275 EGQVTVYINRGNGALEEQLALSGDGAYNAHFGESIA---SLGDLDDDGFPDVAIGAPKED 331
Query: 577 -LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVL 608
SG +Y+ G G V Y + GR ++ VL
Sbjct: 332 DFSGAVYIYHGDAGGIVPQYSMKLLGRKLSPVL 364
>gi|430751533|ref|YP_007214441.1| FG-GAP repeat-containing protein [Thermobacillus composti KWC4]
gi|430735498|gb|AGA59443.1| FG-GAP repeat protein [Thermobacillus composti KWC4]
Length = 1015
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 70/160 (43%), Gaps = 20/160 (12%)
Query: 453 LSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIRE---KFPLEMAE 509
LS + S RAY PT VD + DG D ++GT G Y ++ G+ + K P+++
Sbjct: 395 LSQLDTSCRAY----PTAVDWNRDGATDFIIGTHDGYVYFAENDGEATDLVFKKPVKLKG 450
Query: 510 IQ----GAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWE---------QHLKSLVT 556
++ A D + DG +LV +G V + GK + Q L
Sbjct: 451 VRTDSDAAPFVVDWDGDGHPDLVLGSGNGEVVLYRNNGKNKLKLAGTFTAGGQALSVAGA 510
Query: 557 QGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYP 596
P+ D DGDG D++V G++++ G + +P
Sbjct: 511 AAPAAADWDGDGIPDLIVGDAEGSLHLFRGTASGSLELHP 550
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 63/142 (44%), Gaps = 19/142 (13%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAV--------VAA 517
++P VVD DGDG+ D+++G+ G + ++GK + K A+ AA
Sbjct: 457 AAPFVVDWDGDGHPDLVLGSGNGEVVLYRNNGKNKLKLAGTFTAGGQALSVAGAAAPAAA 516
Query: 518 DINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVT-----------QGPSIGDVDG 566
D + DG +L+ D G++ + G E H + P D +
Sbjct: 517 DWDGDGIPDLIVGDAEGSLHLFRGTASGSLELHPAEPIRADGEPVNVGRYAAPFATDWNS 576
Query: 567 DGHSDVVVPTLSGNIYVLSGKD 588
DG D++V T SG I +L G++
Sbjct: 577 DGIPDLLVGTESGEIVLLEGRE 598
>gi|281353949|gb|EFB29533.1| hypothetical protein PANDA_002600 [Ailuropoda melanoleuca]
Length = 973
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 62/153 (40%), Gaps = 37/153 (24%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIND 521
+Y SS VDL+ DG D+LVG A + ++I D
Sbjct: 240 SYFGSSLCAVDLNADGLSDLLVG----------------------------APMFSEIRD 271
Query: 522 DGKIELVTTDTHGNVAAWTA-EGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT---- 576
+G++ + +G + A G G + H + S+GD+D DG DV +
Sbjct: 272 EGQVTVYINRGNGALEEQLALSGDGAYNAHFGESIA---SLGDLDDDGFPDVAIGAPKED 328
Query: 577 -LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVL 608
SG +Y+ G G V Y + GR ++ VL
Sbjct: 329 DFSGAVYIYHGDAGGIVPQYSMKLLGRKLSPVL 361
>gi|123320866|sp|Q05JY7.1|CGLA_PSEAS RecName: Full=Lambda-carrageenase; Flags: Precursor
gi|116292126|dbj|BAF35571.1| lambda-carrageenase [Pseudoalteromonas sp. CL19]
Length = 942
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 59/139 (42%), Gaps = 9/139 (6%)
Query: 523 GKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIY 582
G ++TT G VAA+ G+ +WE L + + D++GDG ++ GN+Y
Sbjct: 49 GGSSIITTSYEGTVAAYKFNGEKLWENELSGFMNHDIWVQDINGDGLVEIFAANADGNVY 108
Query: 583 VLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSC 642
++ DGS + + + MN V ++ + EK +V +D LY I
Sbjct: 109 CINS-DGS--LKWTFGLNEVPMNSVTVI--SDADEK----YVVAGGYDKNLYYISANGEL 159
Query: 643 ADVVDIGETSYSMVLADNV 661
++ S V D V
Sbjct: 160 LKTIESSAYSEEGVFGDGV 178
>gi|406881054|gb|EKD29216.1| Vcbs [uncultured bacterium]
Length = 3908
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 56/118 (47%), Gaps = 15/118 (12%)
Query: 474 DGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTH 533
DG N + L+ G FY +EK L E + AD+N DGK ELV D
Sbjct: 7 DGVTNFENLIA---GHFY---SGALFKEKTKLYFGENASCLAQADLNSDGKNELVIADYK 60
Query: 534 GN---VAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSD-VVVPTLSGNIYVL 584
N +A++ E I E+ + S Q P + D+DGDG D V+V + S NI VL
Sbjct: 61 NNKIIIASFNYEKGIITEKSIAS--GQSPVDLKVSDMDGDGDLDIVIVNSGSKNITVL 116
>gi|334118850|ref|ZP_08492938.1| peptidase C11 clostripain [Microcoleus vaginatus FGP-2]
gi|333459080|gb|EGK87695.1| peptidase C11 clostripain [Microcoleus vaginatus FGP-2]
Length = 2325
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 67/154 (43%), Gaps = 12/154 (7%)
Query: 470 VVDLDGDGNLDILV-----GTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGK 524
+ D + D NLD++ S +F+ G L + A+ D N DG+
Sbjct: 351 IGDFNADNNLDVVATNRNPANSVSIFFGT-GQGTFGNPTNLNVGTSPSAIAVGDFNADGR 409
Query: 525 IELVTTDT-HGNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTLSGN 580
+L T+ + NV+ + G+G + V P ++GD + DG D+ V +S N
Sbjct: 410 SDLAVTNAGNQNVSILSGTGQGTFSNPANFNVGLSPEDIAVGDFNADGQPDLAVANVSSN 469
Query: 581 -IYVLSGK-DGSKVRPYPYRTHGRVMNQVLLVDL 612
+ +L+G G+ P+ Y T G + + + D
Sbjct: 470 DVSILTGTGQGTFNGPFNYNTGGTTPSAIAVGDF 503
>gi|124007628|ref|ZP_01692332.1| vcbs, putative [Microscilla marina ATCC 23134]
gi|123986926|gb|EAY26691.1| vcbs, putative [Microscilla marina ATCC 23134]
Length = 751
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 63/256 (24%), Positives = 106/256 (41%), Gaps = 58/256 (22%)
Query: 372 KIEDYVNVDSHILS--TPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGK 429
K DY+ + + +S PV+ D + DGVS++ + + D+ + L I
Sbjct: 437 KDTDYLQLSALKVSRLQPVVQDFNQDGVSDLGLTWTK-------DSKVLFQYLPNIATN- 488
Query: 430 YVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG- 488
G V F+L ++ +K+ DL LS + P ++D+D D + D+LVG G
Sbjct: 489 ---GQAVSFDL-SQAIKY--DLPLSGSD---------QPLLLDVDKDADFDLLVGKGNGQ 533
Query: 489 LFYVLDHHGKIREKFPLEMAEIQGA----------VVAADINDDGKIELVTTDTHG---- 534
L Y L+ + F L+ + G + AD + DGK +L+T D G
Sbjct: 534 LEYYLNQGNNLSPDFQLQTTTLGGVAGSSSTRNLRLTVADFDADGKPDLLTGDNSGKLNI 593
Query: 535 --------NVAAWTAEGKGIWEQ--------HLKSLVTQGPSIGDVDGDGHSDVVVPTLS 578
N A WT I + +VT P D++ D + D++V T
Sbjct: 594 YAGFLEQLNNATWTPATDLILNELSREYAPYRFGIMVT--PVAADLNNDQNPDILVGTNG 651
Query: 579 GNIYVLSGKDGSKVRP 594
G ++ + + S+ P
Sbjct: 652 GGVFYVRNDETSQPPP 667
>gi|116013406|dbj|BAF34526.1| hemolysin [Vibrio ordalii]
Length = 689
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 38/163 (23%), Positives = 75/163 (46%), Gaps = 37/163 (22%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYV 431
+ ++ PVIA +++D ++++I+ F++ Y ++ L GID +
Sbjct: 51 NQVMVAPVIAHVNDDNGDGKIDENDIADIIVVT---FENNKYTQGGLVRALSGIDGSEL- 106
Query: 432 AGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFY 491
S DN A SP V DLDG+G +DI+ ++ +
Sbjct: 107 ---------------------WSYDNGGIIADARYSPAVADLDGNGIVDIVTTSASSPYI 145
Query: 492 -VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELVTTD 531
+LD+ G I+++ ++ + G + +D+N+DG IE+++ D
Sbjct: 146 TILDNKGNIKKQILKHVSGGRSIGTISISDLNNDGSIEIISAD 188
>gi|385809954|ref|YP_005846350.1| hypothetical protein IALB_1372 [Ignavibacterium album JCM 16511]
gi|383802002|gb|AFH49082.1| Hypothetical protein IALB_1372 [Ignavibacterium album JCM 16511]
Length = 1011
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 42/76 (55%), Gaps = 3/76 (3%)
Query: 465 YSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMA--EIQGAVVAADINDD 522
YS+ T+ DL DG+ +++ LF V + G I + FP ++ E G+V+AADI D
Sbjct: 761 YSTITLADLKNDGDNYLIIPAKEKLF-VYNLRGAIADNFPYSLSDDEFNGSVLAADIEGD 819
Query: 523 GKIELVTTDTHGNVAA 538
K E++ GN+ A
Sbjct: 820 SKSEIIAFTKKGNIYA 835
>gi|255036903|ref|YP_003087524.1| FG-GAP repeat-containing protein [Dyadobacter fermentans DSM 18053]
gi|254949659|gb|ACT94359.1| FG-GAP repeat protein [Dyadobacter fermentans DSM 18053]
Length = 1337
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 57/224 (25%), Positives = 96/224 (42%), Gaps = 64/224 (28%)
Query: 390 ADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTT 449
D+D DG S++I+ S + D G+ GA V++ +
Sbjct: 442 GDVDGDGYSDVIVGASMY------------------DKGESNEGAAFVYHGSINGIATAA 483
Query: 450 DLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGT--------SFGLFYVLDHHGK--- 498
L + A+ + +S+ T D++GDG D++VG+ + G +V +HG
Sbjct: 484 KTTLENNQAA-AFFGFSASTAGDVNGDGFSDVIVGSFHFDNGQNNEGGAFV--YHGSANG 540
Query: 499 ----IREKFPLEMAEIQ-GAVVAA--DINDDGKIELV---------------------TT 530
I ++ A Q GA VA+ D+N DG +++ ++
Sbjct: 541 ISTTIAKQLECNQAGAQYGASVASAGDVNGDGYADVIVGANLFDNGQGNEGGAFVYEGSS 600
Query: 531 DTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVV 574
GN +A + EG + +L S V+ S GDV+GDG+SD+VV
Sbjct: 601 SGLGNTSASSQEGNQL-NAYLGSSVS---SAGDVNGDGYSDIVV 640
>gi|372210221|ref|ZP_09498023.1| hypothetical protein FbacS_08885 [Flavobacteriaceae bacterium S85]
Length = 940
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 54/119 (45%), Gaps = 23/119 (19%)
Query: 527 LVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSG 586
L+ + G + A++ EGK +WEQ L + GD+DGD +VVV +G IY ++
Sbjct: 54 LIASSYEGTLLAYSDEGKLLWEQPLSGFMNHDLWSGDLDGDAIDEVVVANANGTIYCVN- 112
Query: 587 KDGSKVRPYPYRTHGRVM-----NQVLL--VDLTKRGEKSKGLTIVTTSFDGYLYLIDG 638
+ G VM NQ + V + K+ +K+ +V FD LY +
Sbjct: 113 ------------SFGEVMWSFQKNQAPMYAVTMVKKEDKT---YVVAGGFDTSLYYLSA 156
>gi|365540055|ref|ZP_09365230.1| Leukocidin S subunit [Vibrio ordalii ATCC 33509]
Length = 688
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 38/163 (23%), Positives = 75/163 (46%), Gaps = 37/163 (22%)
Query: 381 SHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYV 431
+ ++ PVIA +++D ++++I+ F++ Y ++ L GID +
Sbjct: 50 NQVMVAPVIAQLNDDNGDGKIDENDIADIIVVT---FENNKYTQGGLVRALSGIDGSEL- 105
Query: 432 AGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFY 491
S DN A SP V DLDG+G +DI+ ++ +
Sbjct: 106 ---------------------WSYDNGGIIADARYSPAVADLDGNGIVDIVTTSASSPYI 144
Query: 492 -VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELVTTD 531
+LD+ G I+++ ++ + G + +D+N+DG IE+++ D
Sbjct: 145 TILDNKGNIKKQILKHVSGGRSIGTISISDLNNDGSIEIISAD 187
>gi|163789067|ref|ZP_02183511.1| hypothetical protein FBALC1_09677 [Flavobacteriales bacterium
ALC-1]
gi|159875731|gb|EDP69791.1| hypothetical protein FBALC1_09677 [Flavobacteriales bacterium
ALC-1]
Length = 1187
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 59/122 (48%), Gaps = 11/122 (9%)
Query: 463 YIYSSPTV--VDLDGDGNLDILVGTSFG----LFYVLDH-HGKIREKFPLEMAEIQGAVV 515
Y+ SS V DLDGDG+ D+L + F F LD G + E V
Sbjct: 34 YLGSSLGVFSADLDGDGDKDVLASSYFDDKLVWFENLDGVAGDFAQHTITTSIETPWGVH 93
Query: 516 AADINDDGKIELVTTDTHGNVAAW--TAEGKG--IWEQHLKSLVTQGPSIGDVDGDGHSD 571
AADI+ DG +++++T GN W +G G + +Q++ + S D+DGDG D
Sbjct: 94 AADIDADGDMDILSTALFGNHIIWYENTDGNGDFVLKQYISAYRVNTVSTADMDGDGDLD 153
Query: 572 VV 573
V+
Sbjct: 154 VI 155
>gi|188501671|gb|ACD54788.1| vcbs-like protein [Philodina roseola]
Length = 1460
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 117/565 (20%), Positives = 211/565 (37%), Gaps = 106/565 (18%)
Query: 79 LIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQSSVHSSPL---LYDIDKDG 135
++ D N DG DIV + + +GD G+P F+ + S P + ++DK+
Sbjct: 664 VVIDFNLDGHYDIVGTNIGDNEVYIFFGNGDGTFGFPKFYATGSDSRPYAIAVANVDKEE 723
Query: 136 VREIALATYN-GEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSDP----VDRSHPDVH 190
EI + + GEV + + + + G S P + + D
Sbjct: 724 QLEIVITLWGTGEVAVLK--------EYDAAAFYHHTSYPAGSSSHPSSVAIGDFNQDNQ 775
Query: 191 DDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATV------SNPDVKKVN-- 242
D++V + ++ + + T +T+S+P A + N D+ VN
Sbjct: 776 PDIVVTNAGGDNIEILFGSGNGTFNKTVNYQLTTDSSPQNAIIVDINSDQNLDIVVVNAM 835
Query: 243 ESLVNVSNPSEERKVNESHTEMNIKLPTSVDNSSTTTVSGGTNSSENGTNTGRRLLEDNN 302
++ +N+ S++ + E + ++ DNS + T
Sbjct: 836 QNTINIFMGSDDGRFAEQ------AIYSTGDNSHPSAAXAAT------------------ 871
Query: 303 SKGSQEGNDKEDVPVATAENDQALDENADSSFELFRDTDELADEYNYDYDDYVDDAMWGD 362
G +D+ D+ +A TD + + Y+Y + D ++
Sbjct: 872 --GYFNDDDRLDIVIANTG------------------TDNIGIYFGYNYRSFHDQEIYES 911
Query: 363 EEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGV---------SEMIIAVSYFFDHEYY 413
++D+ N DS++ A DN G+ ++MI+ Y Y
Sbjct: 912 NRTLGPSSVVVDDFNN-DSYLDIAATFATSDNFGILLGQGDGLFADMIV---YAVKPGSY 967
Query: 414 DNPEHLKELGG-----IDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSP 468
N + ++ I I + I +F L + + ST NAS ++
Sbjct: 968 PNNLVVSDINNDGNKDIIIANFGTSDIGLF-LGYGNGTFASMTTFSTGNASKPRFV---- 1022
Query: 469 TVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG------AVVAADINDD 522
TV D + D +DI+V S G + ++ + G F M G A +D N+D
Sbjct: 1023 TVGDFNNDHRIDIVVVDS-GRYSIIVYLGYGNGSFVHTMTYSIGSSSNPYAAAISDFNND 1081
Query: 523 GKIELVTTDTHGN-VAAWTAEGKGIWEQHLKSLVTQG--PS---IGDVDGDGHSDVVVPT 576
++++ + + V + G GI+ + + PS I D++GD SD+ V T
Sbjct: 1082 SRLDIAVANAGSSEVIIFLGYGNGIFASPVAYSASYASDPSCITIADLNGDNRSDIAVST 1141
Query: 577 LSGN-IYVLSG-KDGSKVRPYPYRT 599
SGN I VL G +GS Y T
Sbjct: 1142 FSGNSIAVLFGYGNGSFAAAVTYST 1166
>gi|421615157|ref|ZP_16056190.1| hypothetical protein RBSH_06003 [Rhodopirellula baltica SH28]
gi|408494065|gb|EKJ98690.1| hypothetical protein RBSH_06003 [Rhodopirellula baltica SH28]
Length = 383
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 55/230 (23%), Positives = 92/230 (40%), Gaps = 32/230 (13%)
Query: 471 VDLDGDGNLDILVGTSF--GLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELV 528
D DGDG+LDI G G ++ D G+ R +++A DI+ DG I+L+
Sbjct: 111 ADFDGDGDLDIASGNDMAKGQVFLNDGSGQFRSHGEYGDVSSLRSLLAHDIDQDGDIDLI 170
Query: 529 TTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG------------DVDGDGHSDVVVPT 576
T I+ ++ GPS G D++GDGH D+V+
Sbjct: 171 AT--------CRGRPNRIYVNGGEANFNSGPSFGRNTDSTIDVAAGDLNGDGHMDLVLAN 222
Query: 577 LSG--NIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLY 634
N +L + R P+ V + DL G + ++
Sbjct: 223 RDHQPNEILLGDAELQFSRSVPFGIGDEHSRAVAVADLNTDGHLD--WVVANIGRANQVF 280
Query: 635 LIDGPTSCADVVDIGET---SYSMVLADNVDGGDDLDLIVTTMN--GNVF 679
DG A + G + +Y++ +AD ++ + LD++V +N G+VF
Sbjct: 281 FGDGAGGVAGSAEFGASDSQTYALAIAD-MNNDEVLDIVVGNLNQPGSVF 329
>gi|410896069|ref|XP_003961522.1| PREDICTED: integrin alpha-3-like [Takifugu rubripes]
Length = 1065
Score = 44.7 bits (104), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 72/172 (41%), Gaps = 38/172 (22%)
Query: 451 LDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSF---------GLFYV-LDHHGKIR 500
LD + +Y +S V DL+ D D++VG F G Y+ ++ +G R
Sbjct: 282 LDFTIPGEQVGSYFGNSLAVTDLNNDDWNDLIVGAPFYFDRMKDQGGAVYIFMNENGSFR 341
Query: 501 EKFPLEMA----EIQGAVVAA--DINDDGKIELVT----TDTHGNVAAWTAEGKGIWEQH 550
E L + G VAA DIN DG + DT G V W GI EQH
Sbjct: 342 ETATLVLKGPSNSAFGLAVAAVGDINQDGFQDFAVGAPFQDT-GRVYIWLGSDTGISEQH 400
Query: 551 LKSLVTQGPSIG---------------DVDGDGHSDVVVPTLSGNIYVLSGK 587
S V +G ++G D+DG+ + D++V +L I +L +
Sbjct: 401 --SQVIEGKTLGNGGFKTFGYSINGGMDMDGNSYPDILVGSLDDRIALLRAR 450
>gi|345004742|ref|YP_004807595.1| FG-GAP repeat-containing protein [halophilic archaeon DL31]
gi|344320368|gb|AEN05222.1| FG-GAP repeat protein [halophilic archaeon DL31]
Length = 426
Score = 44.3 bits (103), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 66/138 (47%), Gaps = 16/138 (11%)
Query: 472 DLDGDGNLDILVGTSF-GLFYVLDHHG-KIREKFPLEMAEIQGAVVAADINDDGKIELVT 529
+LD DG + VG SF G +D G ++ E+ +A + +I + G+ +
Sbjct: 262 ELDSDGRTET-VGASFDGQVVAVDDDGSRLWEREFGRLAAVH------EIVESGEAAVYV 314
Query: 530 TDTHGNVAAWTAEGKGIWEQHL---KSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSG 586
T+ G V A EG+ +W + + + +T P GD+DGDG ++VV +G + VL
Sbjct: 315 TNKSGTVFALDEEGETVWRREVVEEDTQMTPPPVAGDLDGDGSQELVVAANTGEVTVLDA 374
Query: 587 KDGSKVRPY----PYRTH 600
+ G + Y P TH
Sbjct: 375 ETGDPLASYSRDVPVWTH 392
>gi|409721643|ref|ZP_11269809.1| hypothetical protein Hham1_03490 [Halococcus hamelinensis 100A6]
gi|448723994|ref|ZP_21706508.1| hypothetical protein C447_12617 [Halococcus hamelinensis 100A6]
gi|445786800|gb|EMA37563.1| hypothetical protein C447_12617 [Halococcus hamelinensis 100A6]
Length = 404
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 60/136 (44%), Gaps = 8/136 (5%)
Query: 448 TTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHH-GKIREKFPL- 505
TTD + + N S + PT+ D DG+G LD+L+ T+ + Y D + G +F L
Sbjct: 89 TTDWERTIRNRSCATHGIGDPTIADFDGNGELDVLIPTTENVLYGYDANDGTETLRFDLT 148
Query: 506 EMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVD 565
AV A + E V D G+V A W+ + + VT D D
Sbjct: 149 SFGYSAPAVFAEPVR-----ETVVADFDGSVFAVRPNETVAWQDQVAAGVTADTKKADFD 203
Query: 566 GDGHSDVVVPTLSGNI 581
GDG +V V + GN+
Sbjct: 204 GDGGPEVAV-SAPGNV 218
>gi|425453498|ref|ZP_18833255.1| FG-GAP repeat/calx-beta domain protein (modular protein)
[Microcystis aeruginosa PCC 9807]
gi|389802933|emb|CCI18085.1| FG-GAP repeat/calx-beta domain protein (modular protein)
[Microcystis aeruginosa PCC 9807]
Length = 2199
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 56/136 (41%), Gaps = 8/136 (5%)
Query: 467 SPTVVDLDGDGNLDILVGTSF-GLFYVL--DHHGKIREKFPLEMAEIQGAVVAADINDDG 523
S TV D +GDG D+ V +F G VL G + + +V D N DG
Sbjct: 227 SVTVGDFNGDGKSDLAVANNFNGTVSVLLGTGTGSFGTATNFIVGSVPRSVTVGDFNGDG 286
Query: 524 KIELVTTD-THGNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTL-S 578
K +L + T NV+ G G + V P +GD +GDG SD+ V S
Sbjct: 287 KSDLAVANYTSNNVSVLLGTGTGSFGTATNFSVGNAPLSVRVGDFNGDGKSDLAVANAGS 346
Query: 579 GNIYVLSGKDGSKVRP 594
N+ VL G P
Sbjct: 347 SNVSVLLGTGTGSFGP 362
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 58/129 (44%), Gaps = 9/129 (6%)
Query: 467 SPTVVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDG 523
S TV D +GDG LD+ V G + G + +V D N DG
Sbjct: 1215 SVTVGDFNGDGKLDLAVANHGGNNVSVLLGTGTGSFGTATNFTVGTNPDSVTVGDFNGDG 1274
Query: 524 KIELVTTDTH--GNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTLS 578
K++LVT +++ G+V+ G G + V P ++GD +GDG SD+ V S
Sbjct: 1275 KLDLVTANSNAGGSVSILLGTGTGSFGTATNFSVGGFPTSVTVGDFNGDGKSDLAVANNS 1334
Query: 579 G-NIYVLSG 586
N+ VL G
Sbjct: 1335 SQNVSVLLG 1343
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 62/143 (43%), Gaps = 29/143 (20%)
Query: 464 IYSSP---TVVDLDGDG------------NLDILVGTSFGLFYVLDHHGKIREKFPLEMA 508
+ SSP TV D +GDG N+ +++GT G F + E F
Sbjct: 172 VGSSPYSVTVGDFNGDGKSDLAVANRISQNVSVVLGTGTGSFGTPTNFTVGNEPF----- 226
Query: 509 EIQGAVVAADINDDGKIEL-VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDV 564
+V D N DGK +L V + +G V+ G G + +V P ++GD
Sbjct: 227 ----SVTVGDFNGDGKSDLAVANNFNGTVSVLLGTGTGSFGTATNFIVGSVPRSVTVGDF 282
Query: 565 DGDGHSDVVVPT-LSGNIYVLSG 586
+GDG SD+ V S N+ VL G
Sbjct: 283 NGDGKSDLAVANYTSNNVSVLLG 305
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 49/116 (42%), Gaps = 19/116 (16%)
Query: 467 SPTVVDLDGDGNLD------ILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIN 520
S TV D +GDG LD +L+GT G F + +V D N
Sbjct: 1410 SVTVGDFNGDGKLDLATTVSVLLGTGTGSFGAPTN---------FSAGSSANSVTVGDFN 1460
Query: 521 DDGKIEL-VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDV 572
DG+ +L V D G+V+ G G + GP ++GD +GDG SD+
Sbjct: 1461 GDGQSDLAVANDGSGDVSVLLGTGTGSFGTATNFTAGDGPWSVTVGDFNGDGKSDL 1516
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 56/131 (42%), Gaps = 25/131 (19%)
Query: 466 SSPTVVDLDGDG------------NLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGA 513
+S TV D +GDG N+ +L+GT G F + +PL +
Sbjct: 373 TSVTVGDFNGDGKSDLATANFNSGNVSVLLGTGTGSFGTATNF--TVGSYPL-------S 423
Query: 514 VVAADINDDGKIELVTTDTHGN-VAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGH 569
V D N DGK +L T + GN V+ G G + V P ++GD +GDG
Sbjct: 424 VTVGDFNSDGKSDLATGNYFGNSVSVLLGTGTGSFGTATDFTVGSFPRSVTVGDFNGDGK 483
Query: 570 SDVVVPTLSGN 580
SD+ T + N
Sbjct: 484 SDLATATSNSN 494
>gi|262196413|ref|YP_003267622.1| hypothetical protein [Haliangium ochraceum DSM 14365]
gi|262079760|gb|ACY15729.1| conserved hypothetical protein [Haliangium ochraceum DSM 14365]
Length = 547
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 96/226 (42%), Gaps = 37/226 (16%)
Query: 374 EDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAG 433
ED N S+P IADID DG +E++ F + + +E G +
Sbjct: 298 EDSANQAHFTNSSPAIADIDGDGEAELV------FTGSVQNASQDDRERGVV-------- 343
Query: 434 AIVVFNLDTKQVKWTTDLDLSTDNASFRAY-------IYSSPTVVDLDGD-GNLDILVGT 485
VV T+ W + A + + +S TVVD+D + L+++
Sbjct: 344 LFVVRPDGTRPDDWVEPYYVPEYRAGLNDFDGTNVVGLNNSVTVVDIDPERAGLEMIFAD 403
Query: 486 SFGLFYVLDHHGK--IREKFPLEMAEIQGAVVAADINDDGKIELVTT-----DTHGNVAA 538
G + +D G+ ++ + + G VV +D++ DG E+V G +
Sbjct: 404 YDGRIHAVDARGEQLWTYRYTSDARVLTGGVVVSDLSRDGVPEVVFATYSPDQNKGELFV 463
Query: 539 WTAEGKGIWEQHLKSLVTQG----PSIGDVDGDGHSDVVVPTLSGN 580
A G E H SL T+G P+I DVDGDG ++VV +L GN
Sbjct: 464 LDAGGN---ELHTLSLPTRGAMGIPTIADVDGDGTLEIVV-SLKGN 505
Score = 42.7 bits (99), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 75/161 (46%), Gaps = 29/161 (18%)
Query: 447 WTTDLDLSTDNASFRAYIYSSPTVVDL--DGDGNLDILVGTSFGLFYVLDHHGKIREKFP 504
W+ D +L + S I+++P VVDL + DG L+I V + Y D G+ FP
Sbjct: 101 WSADGELLFSH-SVEGRIWAAPVVVDLLPERDG-LEIAV-AARSQVYAWDAKGEELPGFP 157
Query: 505 LEMAEIQGAVVAADINDDGKIELVTTDT-----HGN---VAAWTAEG----------KGI 546
+E + ++ A DI+ DG+ ELVT T +G + A A+G G
Sbjct: 158 VEWRDELRSLAAGDIDGDGRFELVTATTRRLEDNGQRDLLMAIEADGSFARGFPPNTSGA 217
Query: 547 WEQHLKSLVTQG----PSIGDVDGDGHSDVVVPTLSGNIYV 583
E VT G +IGD+DGDG +DV + N Y+
Sbjct: 218 SECDDACYVTGGFDQNLAIGDLDGDGVADVA--AVQDNAYI 256
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 20/148 (13%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIE 526
SP V DLDGDG+++I V + G++ +E I A V D+ +
Sbjct: 76 SPAVADLDGDGSVEIAV-PRHNRLNIWSADGELLFSHSVE-GRIWAAPVVVDLLPERDGL 133
Query: 527 LVTTDTHGNVAAWTAEGKGI------WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTL--- 577
+ V AW A+G+ + W L+SL + GD+DGDG ++V T
Sbjct: 134 EIAVAARSQVYAWDAKGEELPGFPVEWRDELRSL-----AAGDIDGDGRFELVTATTRRL 188
Query: 578 ----SGNIYVLSGKDGSKVRPYPYRTHG 601
++ + DGS R +P T G
Sbjct: 189 EDNGQRDLLMAIEADGSFARGFPPNTSG 216
>gi|149173961|ref|ZP_01852590.1| hypothetical protein PM8797T_05970 [Planctomyces maris DSM 8797]
gi|148847491|gb|EDL61825.1| hypothetical protein PM8797T_05970 [Planctomyces maris DSM 8797]
Length = 645
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 57/238 (23%), Positives = 91/238 (38%), Gaps = 44/238 (18%)
Query: 391 DIDNDGVSEMIIAVSYFFDHEYYD--NPEHLKELGGIDIGKYVAGAI-----VVFNLDTK 443
D D DG ++++AV + D+ + D N E G++ G + V+ N T
Sbjct: 156 DYDGDGDQDLVVAVGDWGDYGWDDAYNSE----------GQWTNGPLHGFVYVLNNAGTD 205
Query: 444 QV-KWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREK 502
Q ++ + + + + SP D D DG+LD++ G ++ G +
Sbjct: 206 QKPEYEKPAKIKVGDQPLEVFGWPSPNFADFDHDGDLDLICGEFLDQLTYFENTGSRTQP 265
Query: 503 F------------PLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAW--TAEGKGIWE 548
PL+M A D + DG ++LV D G VA T E G
Sbjct: 266 HYVAGRRLHNNGQPLQMDLQMIVPTAIDWDQDGDLDLVIGDEDGRVAFMENTGELLGGLP 325
Query: 549 QHLKSLVTQGPSIG------------DVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRP 594
Q L + + G D DGDG D++V +G I ++ DG P
Sbjct: 326 QFLPPRYFRQKAAGVKFGALATPYAYDWDGDGDEDLIVGNTAGYIGLIENLDGDPHSP 383
>gi|344285885|ref|XP_003414690.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-3-like [Loxodonta
africana]
Length = 1055
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 59/133 (44%), Gaps = 19/133 (14%)
Query: 369 QHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIG 428
+ ++ + V ++ S +AD++NDG ++++ Y+FD +
Sbjct: 293 RRRQVLEGTQVGAYFGSAIALADLNNDGWQDLLVGAPYYFDRK----------------- 335
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSF- 487
+ V GAI VF ++ + L S A+ +S ++ D++ DG DI VG F
Sbjct: 336 EEVGGAIYVF-MNQAGNSFPAQPSLLLHGPSRSAFGFSMASIGDINQDGFQDIAVGAPFE 394
Query: 488 GLFYVLDHHGKIR 500
GL V +H R
Sbjct: 395 GLGKVYIYHSSSR 407
>gi|351697454|gb|EHB00373.1| Integrin alpha-9, partial [Heterocephalus glaber]
Length = 973
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 62/153 (40%), Gaps = 37/153 (24%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIND 521
+Y SS VDL+ DG D+LVG A + ++I D
Sbjct: 240 SYFGSSLCAVDLNRDGLSDLLVG----------------------------APMFSEIRD 271
Query: 522 DGKIELVTTDTHGNVAAWTA-EGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT---- 576
+G++ + +G + A G + H + S+GD+D DG SDV +
Sbjct: 272 EGQVTVYINRGNGALEEQLALTGDRAYNAHFGESIA---SLGDLDNDGFSDVAIGAPEED 328
Query: 577 -LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVL 608
+G +Y+ G G V Y + GR +N VL
Sbjct: 329 DFAGAVYIYHGDAGGVVPQYSMKLSGRRINPVL 361
>gi|261210531|ref|ZP_05924824.1| hemolysin-related protein Vcp [Vibrio sp. RC341]
gi|260840316|gb|EEX66887.1| hemolysin-related protein Vcp [Vibrio sp. RC341]
Length = 690
Score = 44.3 bits (103), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDG 523
SP DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG
Sbjct: 118 SPAAADLDGDGLIEIVSTSALSPYITILDHQGNIKKQLLKSASGWRSVGDIALADINGDG 177
Query: 524 KIELVTTD 531
+E++ D
Sbjct: 178 HLEILAAD 185
>gi|162451750|ref|YP_001614117.1| hypothetical protein sce3477 [Sorangium cellulosum So ce56]
gi|161162332|emb|CAN93637.1| hypothetical protein sce3477 [Sorangium cellulosum So ce56]
Length = 486
Score = 44.3 bits (103), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 50/108 (46%), Gaps = 3/108 (2%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVT 529
++D D DG LD++V GL ++ + G E + + A D + DG +++V
Sbjct: 192 LLDADHDGALDVVVAQQEGLAFLRNQGGGAFEPPRMIAGPTTFSPTAVDHDGDGVLDIVG 251
Query: 530 TDTHGNVAAWTAEGKGIWE---QHLKSLVTQGPSIGDVDGDGHSDVVV 574
H + + +G G +E + + G GDV GDGH D V+
Sbjct: 252 LGYHSKILVYRGDGGGGFELAHELPAGVFMSGIVTGDVTGDGHPDAVI 299
>gi|115372566|ref|ZP_01459874.1| FG-GAP repeat domain protein [Stigmatella aurantiaca DW4/3-1]
gi|115370528|gb|EAU69455.1| FG-GAP repeat domain protein [Stigmatella aurantiaca DW4/3-1]
Length = 516
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 59/136 (43%), Gaps = 17/136 (12%)
Query: 471 VDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVA-------ADINDDG 523
D DGDG +D L S G V + G + +I + AD N DG
Sbjct: 257 ADFDGDGRMDYLTVASSGAVSVFLNRGGDGRGGWAPIGQIATGLTTDSNRVRFADFNGDG 316
Query: 524 KIELVTTDTHGNVAAW---TAEGKGIWE--QHLKSLVTQGPS---IGDVDGDGHSDVVVP 575
K + + DT+G+V + +G+G WE + VT S D DGDG +D +V
Sbjct: 317 KADYLLIDTNGSVQVYLNRGGDGRGGWEPIGQIAGGVTTDASRVRFADHDGDGKTDYLVI 376
Query: 576 TLSGNI--YVLSGKDG 589
+G + Y+ G DG
Sbjct: 377 DATGEVQAYLNRGGDG 392
>gi|310819111|ref|YP_003951469.1| esterase [Stigmatella aurantiaca DW4/3-1]
gi|309392183|gb|ADO69642.1| Esterase [Stigmatella aurantiaca DW4/3-1]
Length = 530
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 59/136 (43%), Gaps = 17/136 (12%)
Query: 471 VDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVA-------ADINDDG 523
D DGDG +D L S G V + G + +I + AD N DG
Sbjct: 271 ADFDGDGRMDYLTVASSGAVSVFLNRGGDGRGGWAPIGQIATGLTTDSNRVRFADFNGDG 330
Query: 524 KIELVTTDTHGNVAAW---TAEGKGIWE--QHLKSLVTQGPS---IGDVDGDGHSDVVVP 575
K + + DT+G+V + +G+G WE + VT S D DGDG +D +V
Sbjct: 331 KADYLLIDTNGSVQVYLNRGGDGRGGWEPIGQIAGGVTTDASRVRFADHDGDGKTDYLVI 390
Query: 576 TLSGNI--YVLSGKDG 589
+G + Y+ G DG
Sbjct: 391 DATGEVQAYLNRGGDG 406
>gi|260777124|ref|ZP_05886018.1| hemolysin-related protein RbmC [Vibrio coralliilyticus ATCC
BAA-450]
gi|260606790|gb|EEX33064.1| hemolysin-related protein RbmC [Vibrio coralliilyticus ATCC
BAA-450]
Length = 949
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 90/213 (42%), Gaps = 65/213 (30%)
Query: 382 HILSTPVIAD---------IDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVA 432
++ TPV A IDN V+++I+ F+ Y + ++ L G+D +
Sbjct: 243 QVMVTPVTAQLNDDNGDGRIDNQDVADVIVVT---FEGRNYSSGGLVRALSGVDGSEL-- 297
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG--LF 490
W S DN A SP V DLD DG ++I V TS G
Sbjct: 298 --------------W------SYDNGGVIADARYSPAVADLDHDGVVEI-VTTSTGSDFI 336
Query: 491 YVLDHHGKIREKFPLEMAEIQ------GAVVAADINDDGKIELVTTDTHGNVAAWTAEGK 544
VLD+HG +++ ++A I+ G + AD++ DG IE+++ D
Sbjct: 337 KVLDNHGNVKK----QIATIESGWRTVGDIALADLDGDGDIEILSAD------------- 379
Query: 545 GIWEQHLKSLV---TQGPS--IGDVDGDGHSDV 572
G+++ LV T P+ GD DGD +V
Sbjct: 380 GVYDYQSGQLVFSHTWSPASISGDFDGDNRQEV 412
>gi|347602428|sp|F1MMS9.1|ITA3_BOVIN RecName: Full=Integrin alpha-3; AltName: Full=CD49 antigen-like
family member C; AltName: Full=Galactoprotein B3;
Short=GAPB3; AltName: Full=VLA-3 subunit alpha; AltName:
CD_antigen=CD49c; Contains: RecName: Full=Integrin
alpha-3 heavy chain; Contains: RecName: Full=Integrin
alpha-3 light chain; Flags: Precursor
Length = 1050
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 32/133 (24%), Positives = 60/133 (45%), Gaps = 19/133 (14%)
Query: 369 QHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIG 428
+ ++ + V ++ S +AD++NDG ++++ Y+F+ +
Sbjct: 292 RRRQVLEGTQVGAYFGSAIALADLNNDGWQDLLVGAPYYFERK----------------- 334
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSF- 487
+ V GAI +F ++ + L S A+ +S ++ D++ DG DI VG F
Sbjct: 335 EEVGGAIYIF-MNQAGTSFPDHPSLLLHGPSRSAFGFSVASIGDVNQDGFQDIAVGAPFE 393
Query: 488 GLFYVLDHHGKIR 500
GL V +HG R
Sbjct: 394 GLGKVYIYHGSSR 406
>gi|417405699|gb|JAA49553.1| Putative vitronectin receptor alpha subunit [Desmodus rotundus]
Length = 1051
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 59/133 (44%), Gaps = 19/133 (14%)
Query: 369 QHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIG 428
+ ++ + V ++ S +AD++NDG ++++ Y+F+ +
Sbjct: 293 RRRQVLEGTQVGAYFGSAIALADLNNDGWQDLLVGAPYYFERK----------------- 335
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSF- 487
K V GAI VF ++ + L S A+ +S ++ D++ DG DI VG F
Sbjct: 336 KEVGGAIYVF-MNQAGTSFPAHPSLLLHGPSRSAFGFSVASIGDINQDGFQDIAVGAPFE 394
Query: 488 GLFYVLDHHGKIR 500
GL V +H R
Sbjct: 395 GLGTVYIYHSSSR 407
>gi|397617553|gb|EJK64493.1| hypothetical protein THAOC_14768 [Thalassiosira oceanica]
Length = 613
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 83/215 (38%), Gaps = 42/215 (19%)
Query: 388 VIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKW 447
+AD + DG + II F+ + Y N LGG W
Sbjct: 170 AVADFNGDGFVDFIIGHEQFYKAKVYLN----DGLGGF--------------------PW 205
Query: 448 TTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVG---TSFGLFYVLDHHGKIRE-KF 503
+L+ D+ RA + D D DGN+DI V Y+ D G
Sbjct: 206 NGSYELAADSHLSRAI-----STCDFDQDGNMDIAVSHHVNQNSRVYMNDGLGNFTAVDL 260
Query: 504 PLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWE--QHLKSLVTQGP-- 559
P + AD+N DG ++V TD+ + + +G G + HL S G
Sbjct: 261 PSTLTTDSRKNACADLNGDGYPDVVMTDSDRH-QVFLNDGAGNFPTMAHLHSSHVGGRIV 319
Query: 560 SIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRP 594
++GD DGDGH DV V T + L+ DG+ P
Sbjct: 320 ALGDFDGDGHVDVFVRT----VIFLNKGDGTFGNP 350
>gi|86605626|ref|YP_474389.1| FG-GAP repeat- ASPIC/UnbV domain-containing protein [Synechococcus
sp. JA-3-3Ab]
gi|86554168|gb|ABC99126.1| FG-GAP repeat/ASPIC/UnbV domain protein [Synechococcus sp.
JA-3-3Ab]
Length = 454
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 76/169 (44%), Gaps = 27/169 (15%)
Query: 469 TVVDLDGDGNLDILVGTSFG--LFYVLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGK 524
VDLDGDG LD++ G G ++ G + P E+A V+AAD ++DG
Sbjct: 233 AAVDLDGDGRLDLVYGNWEGPHRLWIQTSSGVFKNVAPPELARPSRIRTVIAADFDNDGY 292
Query: 525 IELVTTDTH--GNVAAWTAEGKGIWEQ-------HLKSLVTQGPSIGDVDGDGHSDVVVP 575
EL + + W + G W+ + L T G ++ DVDGDG +++
Sbjct: 293 PELFFNNIGEPNRLFGWRS---GCWQAIDMGDAVEPQGLGT-GAAVADVDGDGCLELL-- 346
Query: 576 TLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTI 624
V G+ G++ + YR N + ++ LT++G ++G +
Sbjct: 347 -------VAHGESGAQPLSF-YRPLAPGNNWLRVLPLTRQGSPARGALV 387
>gi|440910497|gb|ELR60291.1| Integrin alpha-3 [Bos grunniens mutus]
Length = 1065
Score = 43.9 bits (102), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 32/133 (24%), Positives = 60/133 (45%), Gaps = 19/133 (14%)
Query: 369 QHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIG 428
+ ++ + V ++ S +AD++NDG ++++ Y+F+ +
Sbjct: 292 RRRQVLEGTQVGAYFGSAIALADLNNDGWQDLLVGAPYYFERK----------------- 334
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSF- 487
+ V GAI +F ++ + L S A+ +S ++ D++ DG DI VG F
Sbjct: 335 EEVGGAIYIF-MNQAGTSFPDHPSLLLHGPSRSAFGFSVASIGDVNQDGFQDIAVGAPFE 393
Query: 488 GLFYVLDHHGKIR 500
GL V +HG R
Sbjct: 394 GLGKVYIYHGSSR 406
>gi|77164246|ref|YP_342771.1| Integrins alpha chain [Nitrosococcus oceani ATCC 19707]
gi|254435922|ref|ZP_05049429.1| FG-GAP repeat domain protein [Nitrosococcus oceani AFC27]
gi|76882560|gb|ABA57241.1| Integrins alpha chain [Nitrosococcus oceani ATCC 19707]
gi|207089033|gb|EDZ66305.1| FG-GAP repeat domain protein [Nitrosococcus oceani AFC27]
Length = 447
Score = 43.9 bits (102), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 62/116 (53%), Gaps = 12/116 (10%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIR-----EKFPLEM-AEIQGAVVAADINDDG 523
++D+DGDG+LD++ G +V +GK + E P M + + DIN+DG
Sbjct: 237 LLDIDGDGHLDVVANYYLGP-WVWRGNGKGQWEDSSEGLPRPMIGGLFRGLAVGDINEDG 295
Query: 524 KIELVTTDTHGNVAAWTAEGKGIWEQH---LKSLV--TQGPSIGDVDGDGHSDVVV 574
+++L ++ + KG W++ + S++ +G ++GD+D DGH D+VV
Sbjct: 296 RLDLAVANSINGPEVFLQTQKGSWQRTPDVMPSMLGGAEGIALGDLDRDGHLDLVV 351
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 53/190 (27%), Positives = 77/190 (40%), Gaps = 39/190 (20%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFP-LEMAEIQGAVVA-ADIN 520
S+P +VD +GDG LD+ + G ++ D G+ ++ L A+ G VA DIN
Sbjct: 55 STPVIVDSNGDGFLDLAAISRVGDGAQVWLGDGKGRWQDSSQGLGPADSCGGGVAFGDIN 114
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWE----------QHLKSLVTQGP---------SI 561
DG ++L D G V + GKG W H +L +G ++
Sbjct: 115 RDGMLDLAVADHCGGVFVYLGNGKGEWTASTERLNPALSHQINLEEEGENTLTGAEDLAL 174
Query: 562 GDVDGDGHSDVVVPTLSGNIYVLSGKDGSK-------VRPYPYRTH--------GRVMNQ 606
GDV+ DG D+VV + + DGS P H G NQ
Sbjct: 175 GDVNEDGFLDLVVAASDEGGFAVYLGDGSGKSWQESDAEGLPTVKHLGPGAEENGGWANQ 234
Query: 607 VLLVDLTKRG 616
V L+D+ G
Sbjct: 235 VQLLDIDGDG 244
>gi|358417374|ref|XP_003583622.1| PREDICTED: integrin alpha-3-like [Bos taurus]
gi|359076806|ref|XP_003587466.1| PREDICTED: integrin alpha-3-like [Bos taurus]
Length = 1065
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 32/133 (24%), Positives = 60/133 (45%), Gaps = 19/133 (14%)
Query: 369 QHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIG 428
+ ++ + V ++ S +AD++NDG ++++ Y+F+ +
Sbjct: 292 RRRQVLEGTQVGAYFGSAIALADLNNDGWQDLLVGAPYYFERK----------------- 334
Query: 429 KYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSF- 487
+ V GAI +F ++ + L S A+ +S ++ D++ DG DI VG F
Sbjct: 335 EEVGGAIYIF-MNQAGTSFPDHPSLLLHGPSRSAFGFSVASIGDVNQDGFQDIAVGAPFE 393
Query: 488 GLFYVLDHHGKIR 500
GL V +HG R
Sbjct: 394 GLGKVYIYHGSSR 406
>gi|194741308|ref|XP_001953131.1| GF17361 [Drosophila ananassae]
gi|190626190|gb|EDV41714.1| GF17361 [Drosophila ananassae]
Length = 980
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 45/187 (24%), Positives = 74/187 (39%), Gaps = 31/187 (16%)
Query: 410 HEYYDNPEHLKELGG----IDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIY 465
H + +P L + I G + V + Q +W+ L + RA
Sbjct: 123 HWWRTDPAQLAAISDSASRIGYGSSLGSEFVPCTQISVQKRWSRSLARMNSESPLRA--- 179
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKI 525
DL+GDG DI+ G +G+ + + G + + Q A+ ++ +G
Sbjct: 180 -----ADLNGDGIKDIVFG--YGVDDNIHYEG-------IPLPRCQAAMQGEEVPCEG-- 223
Query: 526 ELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLS 585
G VA +G +W+ + V I DVDGDG +D V G IY ++
Sbjct: 224 --------GVVALNGIDGSIMWQSWSVANVFSLHCIADVDGDGGTDCVAAGRLGMIYAIN 275
Query: 586 GKDGSKV 592
G+ GS +
Sbjct: 276 GRTGSVI 282
>gi|297193924|ref|ZP_06911322.1| secreted protein [Streptomyces pristinaespiralis ATCC 25486]
gi|297152002|gb|EDY64195.2| secreted protein [Streptomyces pristinaespiralis ATCC 25486]
Length = 586
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 39/162 (24%), Positives = 71/162 (43%), Gaps = 23/162 (14%)
Query: 454 STDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPL----EMAE 509
+ D+ SF A I + D+DGDG+ D+L G ++V ++PL EM +
Sbjct: 73 AADSPSFGAEIAVARPRSDVDGDGSSDLLHRAWDGQYWVSASGSADTYQYPLGNSSEMYK 132
Query: 510 IQGAVVAAD-----------INDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQG 558
++ D +++ G++ + +G W+ G W+Q K
Sbjct: 133 DAFSIAGLDPAAPDTPTHFTLSETGRLSSYASTPYGGDLVWSGTG---WQQFNKVF---- 185
Query: 559 PSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTH 600
S GD+ GDG DV+ +G +++ K G+ V P+ + +
Sbjct: 186 -SPGDLTGDGVGDVLARNPAGELFLYRAKGGTAVEPFAAKVN 226
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 53/123 (43%), Gaps = 12/123 (9%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHG-----KIREKFPLEMAEIQGAVVAADINDDGKIE 526
DL GDG D++ S G+ ++ +G R K + A D++ DG+ +
Sbjct: 437 DLSGDGKGDLIARDSAGVLWLYRGYGTGAGFASRTKIGSGWGQFSVLTGAGDVDGDGRAD 496
Query: 527 LVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSI-------GDVDGDGHSDVVVPTLSG 579
L+ T G + ++ G K L+ G + GD++GDG D+V G
Sbjct: 497 LIARATDGKLYLYSGTGVASSPFKAKKLIGSGWNTYNKLVAPGDMNGDGRGDLVGRNSDG 556
Query: 580 NIY 582
N+Y
Sbjct: 557 NLY 559
>gi|350590446|ref|XP_003483061.1| PREDICTED: integrin alpha-3-like [Sus scrofa]
Length = 265
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 33/136 (24%), Positives = 62/136 (45%), Gaps = 19/136 (13%)
Query: 363 EEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKEL 422
E + + ++ + V ++ S +AD++NDG ++++ Y+F+ +
Sbjct: 62 EAGGDLRRRQVLEGTQVGAYFGSAIALADLNNDGWQDLLVGAPYYFERK----------- 110
Query: 423 GGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ V GA+ VF ++ + D L S A+ +S ++ D++ DG DI
Sbjct: 111 ------EEVGGAVYVF-MNQAGTSFPDDPSLLLHGPSRSAFGFSVASIGDINQDGFQDIA 163
Query: 483 VGTSF-GLFYVLDHHG 497
VG F GL V +HG
Sbjct: 164 VGAPFEGLGQVYIYHG 179
>gi|345788880|ref|XP_534221.3| PREDICTED: integrin alpha-9 [Canis lupus familiaris]
Length = 1067
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 43/182 (23%), Positives = 69/182 (37%), Gaps = 39/182 (21%)
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYV 492
G + +F D + S +Y SS VDL+ DG D+LVG
Sbjct: 307 GKVYIFRADRRSGTLIKIFQASGKKMG--SYFGSSLCAVDLNADGLSDLLVG-------- 356
Query: 493 LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTA-EGKGIWEQHL 551
A + ++I D+G++ + +G + A G G + H
Sbjct: 357 --------------------APMFSEIRDEGQVTVYINRGNGALEEQLALSGDGAYNAHF 396
Query: 552 KSLVTQGPSIGDVDGDGHSDVVVPT-----LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
+ S+GD+D DG DV + SG +Y+ G G V Y + G+ ++
Sbjct: 397 GESIA---SLGDLDDDGFPDVAIGAPKEDDFSGAVYIYHGDAGGIVPQYSMKLSGQKISP 453
Query: 607 VL 608
VL
Sbjct: 454 VL 455
>gi|428214060|ref|YP_007087204.1| putative Ig domain-containing protein,putative calcium-binding
protein,FG-GAP repeat protein [Oscillatoria acuminata
PCC 6304]
gi|428002441|gb|AFY83284.1| putative Ig domain-containing protein,putative calcium-binding
protein,FG-GAP repeat protein [Oscillatoria acuminata
PCC 6304]
Length = 2110
Score = 43.5 bits (101), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 58/129 (44%), Gaps = 8/129 (6%)
Query: 469 TVVDLDGDGNLDILVG--TSFGLFYVLDH-HGKIREKFPLEMAEIQGAVVAADINDDGKI 525
TV D +GDG+ D+ TS + +L + G + +V D N DG
Sbjct: 360 TVGDFNGDGDPDLATANVTSNNISVLLGNGSGSFSTQTTFAAGSGPVSVTVGDFNGDGDP 419
Query: 526 ELVTTD-THGNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTLSGNI 581
+L T + T N++ G G + V GP ++GD +GDG+ D+ V S +
Sbjct: 420 DLATANVTSNNISVLLGNGSGSFSTQTTFAVGSGPRSLTVGDFNGDGNPDLAVANSSNTV 479
Query: 582 YVLSGKDGS 590
VL G DGS
Sbjct: 480 SVLLG-DGS 487
>gi|393785938|ref|ZP_10374082.1| hypothetical protein HMPREF1068_00362 [Bacteroides nordii CL02T12C05]
gi|392661052|gb|EIY54649.1| hypothetical protein HMPREF1068_00362 [Bacteroides nordii CL02T12C05]
Length = 1633
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 92/366 (25%), Positives = 143/366 (39%), Gaps = 89/366 (24%)
Query: 278 TTVSGGTNSSENGTNTGRRLLEDNNSKGSQEGNDKEDVPVATAENDQALDENADSSFELF 337
TTVSG +S +L + G GN ++++ V+T + D+ D+NA + F
Sbjct: 1141 TTVSGFAKTS--------TMLTGQVAFGDLAGNGEQNIVVSTWD-DKYGDKNAVYCYSPF 1191
Query: 338 RDTDELADEYNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADID--ND 395
D D D+ D +W EK Y + +PVIA++D +D
Sbjct: 1192 -DKD----------GDHKPDLLW----------EKKIPY-----SMFQSPVIANLDGSSD 1225
Query: 396 GVSEMIIAVS-----YFFDHEYYDNPEHLKELG-GIDIGKYVA-GAIVVFNLDTK-QVK- 446
G E++I + DH N L+ + ++ G+ A+ V +LD Q++
Sbjct: 1226 GTMEVVIKSHQTSDIFILDH----NGNELRRINPNVNDGRSCNYSALTVADLDNDGQMEI 1281
Query: 447 -----------WTTDLDLSTDNASFRAYIY---SSPTVVDLDGDGNLDILVG----TSFG 488
W D T N + A I S+P V DL+ DG +IL
Sbjct: 1282 IASYDSLGIYIWRQDGTPFTTNPFWGAGILRLASAPVVCDLNEDGKKEILFSQRQMAESH 1341
Query: 489 LFYV----------LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAA 538
+F + D I + + + + DIN+DG +E+V H V A
Sbjct: 1342 IFAISLEGDKTVAGWDGTQTIPYTVNVVGSTLDHTLSVGDINNDGHLEVVILG-HEMVKA 1400
Query: 539 WTAEGKGIWEQHLKSLVTQ--------GPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGS 590
W G ++ + + L Q P + DVDGD DVV IY L DGS
Sbjct: 1401 WKHTGDFLFSKSINGLFPQENYAANIDAPVLADVDGDAIPDVVF-CCKNFIYALH-NDGS 1458
Query: 591 KVRPYP 596
+ +P
Sbjct: 1459 DIVGFP 1464
>gi|162448741|ref|YP_001611108.1| hypothetical protein sce0471 [Sorangium cellulosum So ce56]
gi|161159323|emb|CAN90628.1| hypothetical protein predicted by Glimmer/Critica [Sorangium
cellulosum So ce56]
Length = 612
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 66/145 (45%), Gaps = 18/145 (12%)
Query: 465 YSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGK 524
Y+SP V+DLDGDG + L+ + LF V D +GK ++ + V AD+ DG
Sbjct: 103 YASPVVMDLDGDGRNE-LIAAYYALF-VFDSNGKEIDRAEAGEDRVYAPHVVADLEGDGA 160
Query: 525 IELVTTDTHGNVAAWTAEG---KGIWEQHLKSLVT----QGPSIGDVDGDGHSDVVVPTL 577
I++V + H A G K W + + +G + D+DGDG +VV T
Sbjct: 161 IDIVYGNGHQVFAFEWNNGLALKAGWPADTTTAGSAPEVRGLAAADLDGDGTIEVVATTT 220
Query: 578 --------SGNIYVLSGKDGSKVRP 594
++V S DGS RP
Sbjct: 221 QTATTEDGGAQVFVFS-ADGSLYRP 244
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 70/152 (46%), Gaps = 47/152 (30%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
++PV+ D+D DG +E+I A YY A+ VF+ + K+
Sbjct: 104 ASPVVMDLDGDGRNELIAA--------YY--------------------ALFVFDSNGKE 135
Query: 445 VKWTTDLDLSTDNA-SFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHG-KIREK 502
+ D A + +Y+ V DL+GDG +DI+ G +F ++G ++
Sbjct: 136 I----------DRAEAGEDRVYAPHVVADLEGDGAIDIVYGNGHQVFAFEWNNGLALKAG 185
Query: 503 FPLE------MAEIQGAVVAADINDDGKIELV 528
+P + E++G + AAD++ DG IE+V
Sbjct: 186 WPADTTTAGSAPEVRG-LAAADLDGDGTIEVV 216
>gi|255038282|ref|YP_003088903.1| FG-GAP repeat-containing protein [Dyadobacter fermentans DSM 18053]
gi|254951038|gb|ACT95738.1| FG-GAP repeat-containing protein [Dyadobacter fermentans DSM 18053]
Length = 402
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 60/141 (42%), Gaps = 27/141 (19%)
Query: 461 RAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVV----- 515
+ +I + D++ DG DIL G + H E+A+ Q +V
Sbjct: 45 KRFIAEGAAMADVNKDGKKDILSGAYWFEAPDWKQH---------ELAKPQEFIVNGSYS 95
Query: 516 ------AADINDDGKIELVTTDTHGNVAAWTAEGK---GIWEQH--LKSLVTQGPSIGDV 564
A D+N DG ++L+ D G AAW + G W H S+ + P + D+
Sbjct: 96 DSFLDFAMDVNQDGWVDLIRIDWPGKAAAWHENPQNKPGHWPMHTIYSSVGNESPQLVDI 155
Query: 565 DGDGHSDVVV--PTLSGNIYV 583
DGDG D++ PT I++
Sbjct: 156 DGDGRLDLLCNDPTAKKVIWL 176
>gi|403382554|ref|ZP_10924611.1| FG-GAP repeat-containing protein [Paenibacillus sp. JC66]
Length = 1060
Score = 43.5 bits (101), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 64/150 (42%), Gaps = 43/150 (28%)
Query: 480 DILVGTSFGLFYVLDHHGKIREKF-------------------PL-----EMAEIQG--A 513
D+L G++ GL Y + G++ E + PL E+ G A
Sbjct: 420 DLLAGSADGLIYHYANRGQLPEDYASQPLPEGISIPYSFESPEPLLLSSGELLRTSGYSA 479
Query: 514 VVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWE-------QHLKSLVTQG-------- 558
+ AAD+ +G +L+ D G + A + G E Q L L+ G
Sbjct: 480 IAAADLTGNGLDDLLIGDEEGRLWAALRQEAGAAEEQARPVFQSLVPLLADGVQIEVDSY 539
Query: 559 --PSIGDVDGDGHSDVVVPTLSGNIYVLSG 586
P+ GD+DGDG +D++V T SG +Y+ G
Sbjct: 540 AAPAWGDLDGDGLADLIVGTGSGELYLYRG 569
>gi|373852052|ref|ZP_09594852.1| FG-GAP repeat protein [Opitutaceae bacterium TAV5]
gi|372474281|gb|EHP34291.1| FG-GAP repeat protein [Opitutaceae bacterium TAV5]
Length = 710
Score = 43.5 bits (101), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 51/236 (21%), Positives = 84/236 (35%), Gaps = 30/236 (12%)
Query: 388 VIADIDNDGVSEMIIAVSYFFDHEY-------YDNPEHLKELGGIDIGKYVAGAIVVFNL 440
++AD D DG ++I+ Y+ +Y Y E G G ++ N
Sbjct: 172 LLADYDGDGALDLIVGNDYWGIQKYPHGSTKTYGWHAAFDEKGKWTNGPLHGYVYLLRNT 231
Query: 441 DTKQVK-WTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKI 499
T + + + + Y SP + D G G LDI+ G F ++ G
Sbjct: 232 GTTEAPLYAAPVMIEAGGEPIDTYGMPSPMLGDFRGTGKLDIICGEFLDGFTFYENIGTR 291
Query: 500 RE-------KFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGK-------- 544
E + + M VA D N DG++++V D G VA GK
Sbjct: 292 TEPRYAAGRRLGVRMDLCMTKPVAVDFNGDGRLDIVCGDEDGRVALLENTGKVDKDGAPQ 351
Query: 545 -------GIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVR 593
+ +++ + P D DGDG D++ G + + G+ R
Sbjct: 352 FLPPRYFRQFAEYVNFGILSTPYAYDWDGDGLVDIITGCSGGYLGFIKNLGGNPTR 407
>gi|47228962|emb|CAG09477.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2502
Score = 43.5 bits (101), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 79/212 (37%), Gaps = 45/212 (21%)
Query: 430 YVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYI-----YSSPTVVDL-------DGDG 477
Y G I V+NL +K D+ + S+ Y +S+P +D+ G G
Sbjct: 259 YWTGTIKVYNLTSKSFYSPNKEDVDSHRYSYLGYSVTAGHFSAPNTIDIAAGAPQHSGSG 318
Query: 478 NLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIEL-VTTDTH--- 533
+ I L GK+ + ++ A D+N DG +L V H
Sbjct: 319 KVYIFKIDGSSLVKSFQASGKMMGSY------FGSSLCAVDLNQDGLSDLLVGAPMHSQL 372
Query: 534 ---GNVAAWTAEGKGIWEQHLKSLVTQG------------PSIGDVDGDGHSDVVVPT-- 576
G V + ++G G+ E K+ V G +IGD+D DG DV V
Sbjct: 373 RDEGQVTVYLSKGNGVME---KNAVLSGDNAFNAHFGECIAAIGDIDDDGFQDVAVGAPK 429
Query: 577 ---LSGNIYVLSGKDGSKVRPYPYRTHGRVMN 605
G +Y+ G + Y + GR +N
Sbjct: 430 EDDYGGAVYIYHGDATGIISTYSLKLTGRSVN 461
>gi|297201169|ref|ZP_06918566.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|197713719|gb|EDY57753.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length = 605
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 40/79 (50%), Gaps = 6/79 (7%)
Query: 514 VVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS------IGDVDGD 567
V A D+N DG +L+ DT N+ + G G + +K G S +GD+ GD
Sbjct: 504 VGAGDLNGDGIGDLLAQDTSNNLYRYDGTGSGTFRARVKVFSNWGASCNAVVGVGDITGD 563
Query: 568 GHSDVVVPTLSGNIYVLSG 586
G +D+V SGN+Y SG
Sbjct: 564 GRTDLVSRDTSGNLYRNSG 582
>gi|167537600|ref|XP_001750468.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771008|gb|EDQ84682.1| predicted protein [Monosiga brevicollis MX1]
Length = 437
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 95/234 (40%), Gaps = 48/234 (20%)
Query: 467 SPTVVDLDGDGNLDILVGT----SFGLFYVLDHHGKIREKFPLEMA-EIQGA--VVAADI 519
S D DGDG D+L + S F+ +HG P+ +A + GA + AAD+
Sbjct: 145 SVVTADFDGDGIADVLSASFKDDSIIWFH---NHGNGSFSAPIVLASDAFGAFGIHAADL 201
Query: 520 NDDGKIELVTTDTHGNVAAWTAE-GKGIWEQHLKSLVTQGPS-----IGDVDGDGHSDVV 573
N DG +++++ +V AW G G + + L Q D+D D DV+
Sbjct: 202 NSDGHVDVISASVQDHVVAWYQNYGNGTFSSR-RVLSNQANEARFVFAADLDNDLRLDVL 260
Query: 574 VPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLL------------VDLTKRGEKSKG 621
+ S N K+ Y R +G Q+++ DL G
Sbjct: 261 SASYSDN----------KIAWYRNRGNGTFSAQIIISTECLGATAVHAADLDADGN---- 306
Query: 622 LTIVTTSFDG-----YLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLI 670
L ++ F G Y L + S A+ +D+ T S +LA +VD LD++
Sbjct: 307 LDVLCACFSGHELAWYRNLGNSSFSAANPIDMAATGASTILAADVDKDGRLDVL 360
>gi|431799797|ref|YP_007226701.1| hypothetical protein Echvi_4518 [Echinicola vietnamensis DSM 17526]
gi|430790562|gb|AGA80691.1| hypothetical protein Echvi_4518 [Echinicola vietnamensis DSM 17526]
Length = 655
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 42/171 (24%), Positives = 68/171 (39%), Gaps = 40/171 (23%)
Query: 468 PTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQ----------GAV--- 514
P VD D DGN+D++VG G ++ H GK PL + GA+
Sbjct: 295 PVAVDWDKDGNMDLVVGDEDGRVALIKHTGKTHNGAPLFESPKYFRQKADNLKFGALATP 354
Query: 515 VAADINDDGKIELVTTDTHGNVA-----------AWT------AEGKGIWEQHLKSLVTQ 557
V D ++DG +++ ++ G +A W +G+ + Q Q
Sbjct: 355 VGVDWDNDGDEDIIAGNSAGYIAFIENLNGKALPKWAEPKLLEVDGQTLRIQAGGKGSIQ 414
Query: 558 GP----------SIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYR 598
GP S+ D DGDG+ D++ ++ G + + S P P R
Sbjct: 415 GPAEAKWGYTTLSVADWDGDGNKDIIFNSIWGKVQWIRNNGKSLEGPLPIR 465
>gi|344238376|gb|EGV94479.1| Integrin alpha-9 [Cricetulus griseus]
Length = 976
Score = 43.5 bits (101), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 86/210 (40%), Gaps = 33/210 (15%)
Query: 430 YVAGAIVVFNL-DTKQVKWTTDLDLSTD----NASFRAYIYSSPTVVDLDGDGNLDILVG 484
Y AG + V NL D K D+ ++ + A +S P++ D+ G D +G
Sbjct: 157 YWAGTVKVLNLTDNTYFKLNDDVIMNRRYTYLGYAVTAGHFSHPSITDVVGGAPQDEGIG 216
Query: 485 TSFGLFYVLDHHG---KIREKFPLEMAEIQGA-VVAADINDDGKIELVT-------TDTH 533
+ +F G KI + +M G+ + A D+N DG +L+
Sbjct: 217 KVY-IFRADRRSGTLVKIFQASGKKMGSYFGSSLCAVDLNMDGLSDLLVGAPMFSEVRDE 275
Query: 534 GNVAAWTAEGKGIWEQHLKSLVTQGP----------SIGDVDGDGHSDVVVPT-----LS 578
G V + ++G G E+ L SL G S+GD+D DG DV V +
Sbjct: 276 GQVTVYLSKGNGALEEQL-SLTGDGAYNAHFGESIASLGDLDDDGFPDVAVGAPKEDDFA 334
Query: 579 GNIYVLSGKDGSKVRPYPYRTHGRVMNQVL 608
G +Y+ G V Y + GR +N +L
Sbjct: 335 GAVYIYHGDANGIVPQYSMKLSGRKINPML 364
>gi|222053915|ref|YP_002536277.1| hypothetical protein Geob_0814 [Geobacter daltonii FRC-32]
gi|221563204|gb|ACM19176.1| conserved repeat domain protein [Geobacter daltonii FRC-32]
Length = 2673
Score = 43.5 bits (101), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 60/230 (26%), Positives = 105/230 (45%), Gaps = 35/230 (15%)
Query: 387 PVIADIDNDGVSEMIIAVSYF-FDHEYYDNPEHLKELG-GIDIGKYVAGAIVVFNLDTKQ 444
PV++DIDNDG +E+I A + +D P++ + G G G + ++ ++D KQ
Sbjct: 1027 PVLSDIDNDGHAEIIDARTVINWDGSVRCAPDYRQSSGLGWTTGFGYSNSVADLDMDGKQ 1086
Query: 445 VKWTTDLDLSTDNASFRAYIYSSP----TVVDLDGDGNLDILV----GTSFGLFYVLDHH 496
+ ++D + + Y S P ++ + D D +I++ G+ Y+L+H
Sbjct: 1087 EIIAGNNAFNSD-CTTKWYNASLPDGLTSIGNFDDDPYPEIVLLNVNGSPGTALYLLEHD 1145
Query: 497 GKIREKFPLEMAEIQ--------GAVVAADINDDGKIELVTT--------DTHGNVAAWT 540
GKI+ P+ + +++ G V AD++ DGK E+ + D G V
Sbjct: 1146 GKIKWG-PVVIRQLEETSYSGHGGHPVIADLDGDGKAEIGISGLNKYFILDRDGRVK--- 1201
Query: 541 AEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGS 590
I S P++ D+DGDG +V+ S N + GKDG+
Sbjct: 1202 ---TTISYPLSGSSSAVAPAVFDLDGDGRPEVLFSDNS-NFMIFDGKDGT 1247
>gi|327404924|ref|YP_004345762.1| FG-GAP repeat-containing protein [Fluviicola taffensis DSM 16823]
gi|327320432|gb|AEA44924.1| FG-GAP repeat protein [Fluviicola taffensis DSM 16823]
Length = 4065
Score = 43.1 bits (100), Expect = 0.58, Method: Composition-based stats.
Identities = 59/243 (24%), Positives = 109/243 (44%), Gaps = 26/243 (10%)
Query: 450 DLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAE 509
D+ + N + R T D D DG +D++V + G G KF ++
Sbjct: 20 DISFTKSNGNSRGM-----TKGDFDNDGLMDVVVAHNSGPTNYSFLKGLGNGKFGAPISG 74
Query: 510 IQGA-----VVAADINDDGKIEL-VTTDTHGNVAAWTAEGKGIWEQHLKSLVT--QGPSI 561
G ++++D N+DG +++ + +++ V+ G G + +T + +I
Sbjct: 75 NTGGTQTTHILSSDFNNDGNLDIAIASNSTNYVSVLLGNGNGTFATATNYTITLPKYLAI 134
Query: 562 GDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEK--S 619
GD++ DG SD+VV + + ++ +L GS P + + G V N +V + G+
Sbjct: 135 GDINEDGASDIVVSSGNAHVSLL----GSVGAPGTF-SLGTVFNVAGMVSDLRLGDMDGD 189
Query: 620 KGLTIVTTSFDGYLYLIDGP----TSCADVVDI-GETSYSMVLADNVDGGDDLDLIVTTM 674
L +VT +G ++ P T V+ + G +S+ L D VD DLD++ T
Sbjct: 190 NHLDVVTAGTNGIGISLNTPGTLGTFATAVLSLAGSGPFSLTLCD-VDNDADLDVVNTNK 248
Query: 675 NGN 677
+ N
Sbjct: 249 SSN 251
>gi|167525190|ref|XP_001746930.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774710|gb|EDQ88337.1| predicted protein [Monosiga brevicollis MX1]
Length = 877
Score = 43.1 bits (100), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 74/154 (48%), Gaps = 19/154 (12%)
Query: 435 IVVFNLDTKQVKWTTDLD---LSTDNA-SFRAYIYSSPTVVDLDGDGNLDILVG--TSFG 488
+V N D ++ W +L S+++ S + S+ TV DLD DG DI+ G F
Sbjct: 587 VVSANSDDAKIAWYRNLGRGRFSSEHIISADIKLASAVTVADLDDDGWPDIICGGIEGFA 646
Query: 489 LFYVLDH----HGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGK 544
++ D G I L++ I AD+N DG++++VT+++ V + G+
Sbjct: 647 QWFKNDQGAFGRGHIISTVSLDLRTI----AVADMNKDGRLDIVTSNSFA-VVWYRNAGQ 701
Query: 545 GIWEQ----HLKSLVTQGPSIGDVDGDGHSDVVV 574
G+++Q +S + D+ GDG+ DVV
Sbjct: 702 GVFDQDYILAAQSFDQTRAIVADLSGDGYPDVVC 735
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 41/168 (24%), Positives = 77/168 (45%), Gaps = 14/168 (8%)
Query: 471 VDLDGDGNLDILV---GTSFGLFYVLDHHGKIREKFPLEMAEIQGA--VVAADINDDGKI 525
VDLDGD ++D++ G + +Y +G + GA V AAD+++DG +
Sbjct: 526 VDLDGDADIDVITAGPGDATIAWYANIGNGSFSSSPIPISSPAVGASAVYAADLDNDGLV 585
Query: 526 ELVTTDTHGNVAAWTAE-GKGIW-EQHLKSL---VTQGPSIGDVDGDGHSDVVVPTLSGN 580
++V+ ++ AW G+G + +H+ S + ++ D+D DG D++ + G
Sbjct: 586 DVVSANSDDAKIAWYRNLGRGRFSSEHIISADIKLASAVTVADLDDDGWPDIICGGIEGF 645
Query: 581 IYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTS 628
G+ R + T + + + D+ K G L IVT++
Sbjct: 646 AQWFKNDQGAFGRGHIISTVSLDLRTIAVADMNKDGR----LDIVTSN 689
>gi|326799846|ref|YP_004317665.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326550610|gb|ADZ78995.1| FG-GAP repeat protein [Sphingobacterium sp. 21]
Length = 446
Score = 43.1 bits (100), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 68/174 (39%), Gaps = 10/174 (5%)
Query: 468 PTVVDLDGDGNLDILVGTSFGLF---YVLDHHGKIREKFPLEMAEIQGAVVAADINDDGK 524
P D DGDG D+ + G++ Y ++ G FP V D + DGK
Sbjct: 260 PIPEDYDGDGKADLATKDNNGIWRIDYAVNGFGSFDVSFPGYGNTSDAKPVPRDYDGDGK 319
Query: 525 IELVTTDTHGNVAA-WTAEGKGIWEQHLKS--LVTQGPSIGDVDGDGHSDVVVPTLSGNI 581
+L D +G ++A G G +++ L L P D DGDG +D+ V +G
Sbjct: 320 ADLSIKDINGFWWIDYSANGFGSFDKKLPGYGLADAIPVPADYDGDGKADLAVKNFNGYW 379
Query: 582 YVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYL 635
Y+ +G Y +G N + D G+ L GY Y+
Sbjct: 380 YIDYAANGFGSFDADYAGYGGKDNIAVPGDYDGDGKADLALKYSY----GYWYI 429
>gi|444918602|ref|ZP_21238668.1| hypothetical protein D187_01414 [Cystobacter fuscus DSM 2262]
gi|444709650|gb|ELW50655.1| hypothetical protein D187_01414 [Cystobacter fuscus DSM 2262]
Length = 1008
Score = 43.1 bits (100), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 94/214 (43%), Gaps = 19/214 (8%)
Query: 391 DIDNDGVSEMIIAVS-YFFDHEY-YDNPEHLKELGGI-DIGKYVAGAIVVFNL------- 440
D+D DG E+I S Y FD +NP G+ + AG IVV
Sbjct: 228 DLDQDGKLEVINGRSVYNFDGTLRCNNPNVPHGFAGVANFDADAAGEIVVAGRGKVSLLD 287
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIR 500
D + WT D+ ++ +A+ PT+ D DGDG L+I + + + V G ++
Sbjct: 288 DDCSLLWTRDVYVTGHSAAGHG---GPPTIADFDGDGQLEIGLPGEWN-YTVYGSDGSVK 343
Query: 501 EKFPLE-MAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVT-QG 558
++ + + A D DDGK+E+V D G W+ S T +
Sbjct: 344 WSSSIQDYSSGRTASTTFDFEDDGKLEVVFADEAYLRIYDGVTGAVRWQTRNSSGTTHEY 403
Query: 559 PSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKV 592
P + DVDGD ++++V + N + G++G +V
Sbjct: 404 PLVVDVDGDNAAEIIVVS---NNHAYPGQNGIRV 434
>gi|149018292|gb|EDL76933.1| rCG25760 [Rattus norvegicus]
Length = 905
Score = 43.1 bits (100), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 39/153 (25%), Positives = 60/153 (39%), Gaps = 37/153 (24%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIND 521
+Y SS VDL+ DG D+LVG A + ++I D
Sbjct: 201 SYFGSSLCAVDLNMDGLSDLLVG----------------------------APMFSEIRD 232
Query: 522 DGKIELVTTDTHGNVA-AWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT---- 576
+G++ + HG + T G + H + S+GD+D DG DV V
Sbjct: 233 EGQVTVYLNQGHGVLEEQLTLTGDAAYNAHFGESIA---SLGDLDDDGFPDVAVGAPMED 289
Query: 577 -LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVL 608
+G +Y+ G V Y + GR +N +L
Sbjct: 290 DFAGTVYIYHGDAQGIVPQYSMKLSGRKINPIL 322
>gi|87309456|ref|ZP_01091591.1| hypothetical protein DSM3645_24180 [Blastopirellula marina DSM
3645]
gi|87287764|gb|EAQ79663.1| hypothetical protein DSM3645_24180 [Blastopirellula marina DSM
3645]
Length = 644
Score = 43.1 bits (100), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 72/156 (46%), Gaps = 42/156 (26%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQ-----------GAVV 515
+PT +D D DG++D++VG G +++H GKI + P + + Q GA+V
Sbjct: 286 TPTAIDWDADGDVDLIVGDEDGRVALVEHTGKIVDGAP-QFRQPQYFQQQADRLKFGALV 344
Query: 516 ---AADINDDGKIELVTTDTHGNVA-----------AWTA------EGKGIWEQHLKSLV 555
+ D ++DG +L+ ++ G +A W A +G+ I Q ++
Sbjct: 345 TPFSVDWDNDGDEDLIAGNSAGYIAFIENIDGGNPPKWAAPRKLEVDGQPIRVQAGENGS 404
Query: 556 TQGPS----------IGDVDGDGHSDVVVPTLSGNI 581
QGP+ + D D DG D++V ++ G I
Sbjct: 405 IQGPAEAKWGYTTLNVADWDHDGRLDLIVNSIWGKI 440
>gi|332876538|ref|ZP_08444300.1| FG-GAP repeat protein [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|332685505|gb|EGJ58340.1| FG-GAP repeat protein [Capnocytophaga sp. oral taxon 329 str.
F0087]
Length = 1607
Score = 43.1 bits (100), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 62/145 (42%), Gaps = 19/145 (13%)
Query: 450 DLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAE 509
++ L+T + + Y +P + D+D DG +D GT L Y +G EK E E
Sbjct: 732 EICLNTPSNPIGEHDYLTP-IGDMDNDGRVDFASGT---LMYKNTANGTWEEKRLFEGEE 787
Query: 510 IQGAVVAADINDDGKIELVTT------------DTHGNVAAWTAEGKGIWEQHLKSLVTQ 557
V AD N DG +++V + N W E + + Q+ SL
Sbjct: 788 YIEPVAVADFNRDGYLDIVAAYRWDRGTPIYRQRLYLNQGGWKFEARELPAQN-SSLTLV 846
Query: 558 GPSIGDVDGDGHSDVVVPTLSGNIY 582
G I DV+GDG D++ NIY
Sbjct: 847 G--IADVNGDGRPDLIYTDYRKNIY 869
>gi|357046890|ref|ZP_09108507.1| FG-GAP repeat protein [Paraprevotella clara YIT 11840]
gi|355530173|gb|EHG99588.1| FG-GAP repeat protein [Paraprevotella clara YIT 11840]
Length = 1607
Score = 43.1 bits (100), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 55/130 (42%), Gaps = 19/130 (14%)
Query: 465 YSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGK 524
Y +P + D+D DG +D GT L Y +G EK E E V AD N DG
Sbjct: 747 YLTP-IGDMDNDGRVDFASGT---LMYKNTANGTWEEKRLFEGEEYIEPVAVADFNRDGY 802
Query: 525 IELVTT------------DTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDV 572
+++V + N W E + + Q+ SL G I DV+GDG D+
Sbjct: 803 LDIVAAYRWDRGTPIYRQRLYLNQGGWKFEARELPAQN-SSLTLVG--IADVNGDGRPDL 859
Query: 573 VVPTLSGNIY 582
+ NIY
Sbjct: 860 IYTDYRKNIY 869
>gi|428316007|ref|YP_007113889.1| peptidase C11 clostripain [Oscillatoria nigro-viridis PCC 7112]
gi|428239687|gb|AFZ05473.1| peptidase C11 clostripain [Oscillatoria nigro-viridis PCC 7112]
Length = 2267
Score = 43.1 bits (100), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 37/152 (24%), Positives = 63/152 (41%), Gaps = 12/152 (7%)
Query: 472 DLDGDGNLDILV-----GTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIE 526
D +GD NLD++ S +F+ G + L + A+ D N DG +
Sbjct: 353 DFNGDNNLDVVATNGNPANSVSIFFGT-GQGTFGNQTNLNVGTSPSAIAVGDFNADGSSD 411
Query: 527 LVTTDT-HGNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTLSGN-I 581
L + + NV+ + G+G + V P ++GD + DG D+ V N +
Sbjct: 412 LAVANADNQNVSILSGTGQGTFSNPANFNVGLSPDDIAVGDFNADGQPDLAVANAGSNDV 471
Query: 582 YVLSGK-DGSKVRPYPYRTHGRVMNQVLLVDL 612
+L+G G+ P+ Y T G + + D
Sbjct: 472 SILTGTGQGTFNGPFNYNTGGTTPRAIAVGDF 503
>gi|444914729|ref|ZP_21234870.1| hypothetical protein D187_07144 [Cystobacter fuscus DSM 2262]
gi|444714345|gb|ELW55228.1| hypothetical protein D187_07144 [Cystobacter fuscus DSM 2262]
Length = 810
Score = 43.1 bits (100), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 4/124 (3%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIE-LVTT 530
DL+ DG LD+ V + G+ +L + + + + AD+N DG+++ L T
Sbjct: 687 DLNKDGFLDLAVAGTGGVQVLLGSASGFQAQAIQALGATAFELALADLNGDGRLDILAAT 746
Query: 531 DTHGNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTLSGNIYVLSGK 587
T V+ G G +E+ V P ++ DVDGDG D+ + G++ +S
Sbjct: 747 RTSDFVSILAGRGDGTFERPTNHGVLDSPGSLAVADVDGDGALDLFTASFHGDVLQVSRN 806
Query: 588 DGSK 591
G +
Sbjct: 807 QGCR 810
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 59/129 (45%), Gaps = 7/129 (5%)
Query: 472 DLDGDGNLDILVGTSFGLFYVL-DHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTT 530
DL+ DG LD++ S + +L + G+ + A+ AD N DG +++VTT
Sbjct: 146 DLNRDGRLDLVFARSGQVAVLLGEGTGRFTAAVAYTVPFSPYAIAVADFNRDGSVDVVTT 205
Query: 531 DTHGNVAAWTAEGKGIWEQHLKSLVTQGPS-----IGDVDGDGHSDVVVPTLSGNIYVLS 585
+ N A+ G+G L QG S GD +GDG D++ P SG VL
Sbjct: 206 GSELNSASLLL-GRGDGTFVLAETRPQGLSGYTLRAGDFNGDGIQDLMGPYASGLAVVLG 264
Query: 586 GKDGSKVRP 594
DG+ P
Sbjct: 265 KGDGTFAEP 273
>gi|432906532|ref|XP_004077576.1| PREDICTED: integrin alpha-9-like [Oryzias latipes]
Length = 1009
Score = 43.1 bits (100), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 49/207 (23%), Positives = 80/207 (38%), Gaps = 36/207 (17%)
Query: 430 YVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYI-----YSSPTVVDLDGDGNLDILVG 484
Y G I V+NL T ++ + + S+ Y +SSP VVD+
Sbjct: 193 YWTGTIKVYNL-TSDSFYSPRESIDSHGYSYLGYAVAAGHFSSPNVVDIAAGAPQH---- 247
Query: 485 TSFGLFYVLDHHGKIREKFPLEMAEIQGA-----VVAADINDDGKIEL-VTTDTH----- 533
+ G Y+ G K ++ G+ + A D+N DG +L V H
Sbjct: 248 SGVGKVYIFKIEGSSLVKSFQATGKMMGSYFGSSLCAVDLNQDGLSDLLVGAPMHSQLRD 307
Query: 534 -GNVAAWTAEGKGIWEQHL---------KSLVTQGPSIGDVDGDGHSDVVVPT-----LS 578
G V+ + +G G+ E+H +IGD+D DG+ DV + +
Sbjct: 308 EGQVSVYLGKGNGVMEEHAVLTGDNAFNAHFGESIAAIGDIDDDGYQDVAIGAPKEDDYA 367
Query: 579 GNIYVLSGKDGSKVRPYPYRTHGRVMN 605
G +Y+ G + Y + GR +N
Sbjct: 368 GAVYIYHGGATGITQIYSMKLSGRSLN 394
>gi|426249068|ref|XP_004018274.1| PREDICTED: integrin alpha-9 [Ovis aries]
Length = 1035
Score = 43.1 bits (100), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 56/241 (23%), Positives = 87/241 (36%), Gaps = 49/241 (20%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIND 521
+Y SS VDL+ DG D+LV GA + ++I D
Sbjct: 302 SYFGSSLCAVDLNTDGLSDLLV----------------------------GAPMFSEIRD 333
Query: 522 DGKIELVTTDTHGNV-AAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT---- 576
+G++ + +G + T G G + H + S+GD+D DG DV +
Sbjct: 334 EGQVTVYINKGNGVLEEQLTLSGDGAYNAHFGESIA---SLGDLDDDGFPDVAIGAPKED 390
Query: 577 -LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLV-------DLTKRGEKSKGLTIVTTS 628
SG +Y+ G V Y + GR ++ VL + + G +TI
Sbjct: 391 DFSGAVYIYHGDARGMVPQYSMKLSGRKISPVLRMFGQSISGGIDMDGNSYPDVTIGAFM 450
Query: 629 FDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHH 688
D + L P DV S ++ DG ++ + T CFS H
Sbjct: 451 SDSVVLLRARPVITVDVSIFLPASINITAPQCHDGQQPVNCLNVTA-----CFSFHGKHV 505
Query: 689 P 689
P
Sbjct: 506 P 506
>gi|354467727|ref|XP_003496320.1| PREDICTED: integrin alpha-9-like [Cricetulus griseus]
Length = 1018
Score = 43.1 bits (100), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 86/210 (40%), Gaps = 33/210 (15%)
Query: 430 YVAGAIVVFNL-DTKQVKWTTDLDLSTD----NASFRAYIYSSPTVVDLDGDGNLDILVG 484
Y AG + V NL D K D+ ++ + A +S P++ D+ G D +G
Sbjct: 199 YWAGTVKVLNLTDNTYFKLNDDVIMNRRYTYLGYAVTAGHFSHPSITDVVGGAPQDEGIG 258
Query: 485 TSFGLFYVLDHHG---KIREKFPLEMAEIQGA-VVAADINDDGKIELVT-------TDTH 533
+ +F G KI + +M G+ + A D+N DG +L+
Sbjct: 259 KVY-IFRADRRSGTLVKIFQASGKKMGSYFGSSLCAVDLNMDGLSDLLVGAPMFSEVRDE 317
Query: 534 GNVAAWTAEGKGIWEQHLKSLVTQGP----------SIGDVDGDGHSDVVVPT-----LS 578
G V + ++G G E+ L SL G S+GD+D DG DV V +
Sbjct: 318 GQVTVYLSKGNGALEEQL-SLTGDGAYNAHFGESIASLGDLDDDGFPDVAVGAPKEDDFA 376
Query: 579 GNIYVLSGKDGSKVRPYPYRTHGRVMNQVL 608
G +Y+ G V Y + GR +N +L
Sbjct: 377 GAVYIYHGDANGIVPQYSMKLSGRKINPML 406
>gi|167533817|ref|XP_001748587.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772828|gb|EDQ86474.1| predicted protein [Monosiga brevicollis MX1]
Length = 440
Score = 43.1 bits (100), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 61/125 (48%), Gaps = 23/125 (18%)
Query: 470 VVDLDGDGNLDILVGTSFG---LFYVLDHHGKIRE--------KFPLEMAEIQGAVVAAD 518
V DL+GD +LDI+V ++ L++ D GK + P +A + D
Sbjct: 300 VADLNGDASLDIVVCNAYDNTVLWFANDGTGKFGPAQAVSSVLRSPTSLATV-------D 352
Query: 519 INDDGKIELVTTDTHGNVAAWTAE-GKGIWEQHLKSLV-TQGP---SIGDVDGDGHSDVV 573
+++DG +++V + + +W G G + + + Q P + GDVDGDGH DVV
Sbjct: 353 LDEDGDLDIVCASGNDGLVSWLRNNGDGSFSAAMTLMSGAQLPYDVAAGDVDGDGHPDVV 412
Query: 574 VPTLS 578
V + S
Sbjct: 413 VASYS 417
Score = 39.3 bits (90), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 58/120 (48%), Gaps = 13/120 (10%)
Query: 472 DLDGDGNLDIL-VGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVA-------ADINDDG 523
DLD DG +D++ G+S G Y L + G F LE I A +A AD+N D
Sbjct: 250 DLDQDGFVDVITAGSSDGTLYWLRNQGG--SSFALETDGIIHAQLASPRDVQVADLNGDA 307
Query: 524 KIELVTTDTHGNVAAWTA-EGKGIW--EQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGN 580
+++V + + N W A +G G + Q + S++ S+ VD D D+ + SGN
Sbjct: 308 SLDIVVCNAYDNTVLWFANDGTGKFGPAQAVSSVLRSPTSLATVDLDEDGDLDIVCASGN 367
>gi|115378782|ref|ZP_01465925.1| integrins alpha chain [Stigmatella aurantiaca DW4/3-1]
gi|115364202|gb|EAU63294.1| integrins alpha chain [Stigmatella aurantiaca DW4/3-1]
Length = 2144
Score = 43.1 bits (100), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 14/131 (10%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIEL-- 527
+VDLD DG+ D++VG + Y+ D G A + A VAAD N DGK+++
Sbjct: 1917 LVDLDLDGDQDMVVGGNSVSVYLNDGSGTFVNGLSPPAAGVPRASVAADFNQDGKVDVAF 1976
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQG-----PSIGDVDGDGHSDV---VVPTLSG 579
+ T +V+ G G + +K+ V G + GDVD DG D+ P+ +G
Sbjct: 1977 ASNSTAISVSVLLGNGTGSFAGSVKT-VNLGVYCYDVAAGDVDRDGKLDLAAACAPSGTG 2035
Query: 580 ---NIYVLSGK 587
N+ +L G+
Sbjct: 2036 VNVNVRILKGQ 2046
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 60/128 (46%), Gaps = 15/128 (11%)
Query: 474 DGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTT-DT 532
D G++ +++G G F PL + A+ A++N DG++++V T
Sbjct: 1276 DPAGDVQVMLGMGTGTF---------GAPTPLGVGAGTVALAVAELNRDGQLDVVAARGT 1326
Query: 533 HGNVAAWTAEGKGIWEQHLKSLVTQG-PS---IGDVDGDGHSDVVVPTLSGNIYVLSGKD 588
G V+ G G + L + T G PS + D+DG+GH DV+V + + VL G+
Sbjct: 1327 QGLVSVARGNGDGTFFV-LPPVNTGGTPSDVAVADLDGNGHPDVMVAGVGNAVLVLKGRG 1385
Query: 589 GSKVRPYP 596
+ P P
Sbjct: 1386 DGSLSPTP 1393
>gi|310825483|ref|YP_003957841.1| invasin/intimin cell-adhesion domain-containing protein [Stigmatella
aurantiaca DW4/3-1]
gi|309398555|gb|ADO76014.1| Invasin/intimin cell-adhesion domain protein [Stigmatella aurantiaca
DW4/3-1]
Length = 2219
Score = 43.1 bits (100), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 14/131 (10%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIEL-- 527
+VDLD DG+ D++VG + Y+ D G A + A VAAD N DGK+++
Sbjct: 1992 LVDLDLDGDQDMVVGGNSVSVYLNDGSGTFVNGLSPPAAGVPRASVAADFNQDGKVDVAF 2051
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQG-----PSIGDVDGDGHSDV---VVPTLSG 579
+ T +V+ G G + +K+ V G + GDVD DG D+ P+ +G
Sbjct: 2052 ASNSTAISVSVLLGNGTGSFAGSVKT-VNLGVYCYDVAAGDVDRDGKLDLAAACAPSGTG 2110
Query: 580 ---NIYVLSGK 587
N+ +L G+
Sbjct: 2111 VNVNVRILKGQ 2121
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 60/128 (46%), Gaps = 15/128 (11%)
Query: 474 DGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTT-DT 532
D G++ +++G G F PL + A+ A++N DG++++V T
Sbjct: 1351 DPAGDVQVMLGMGTGTF---------GAPTPLGVGAGTVALAVAELNRDGQLDVVAARGT 1401
Query: 533 HGNVAAWTAEGKGIWEQHLKSLVTQG-PS---IGDVDGDGHSDVVVPTLSGNIYVLSGKD 588
G V+ G G + L + T G PS + D+DG+GH DV+V + + VL G+
Sbjct: 1402 QGLVSVARGNGDGTFFV-LPPVNTGGTPSDVAVADLDGNGHPDVMVAGVGNAVLVLKGRG 1460
Query: 589 GSKVRPYP 596
+ P P
Sbjct: 1461 DGSLSPTP 1468
>gi|149920910|ref|ZP_01909371.1| FG-GAP repeat/HVR domain protein [Plesiocystis pacifica SIR-1]
gi|149818182|gb|EDM77637.1| FG-GAP repeat/HVR domain protein [Plesiocystis pacifica SIR-1]
Length = 773
Score = 43.1 bits (100), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 68/300 (22%), Positives = 113/300 (37%), Gaps = 71/300 (23%)
Query: 382 HILSTPVIADIDNDGVSEMII-----------------------AVSYFFDHEYYDNPEH 418
H+ S PV+AD+D D V E++ V D YY +
Sbjct: 212 HVSSIPVVADLDADCVPEIVFNTYQYGVATSEGVIRAIRGDDGSQVWSMTDPTYYSDSTA 271
Query: 419 LKELGGIDI-----------GKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSS 467
+G ID GKYV V + D + WT+D + +N+ +
Sbjct: 272 NPAVGDIDEDGLPEVVVQGEGKYV----VAIDDDGTGL-WTSDPFVGGENSG-------A 319
Query: 468 PTVVDLDGDGNLDILVGTSF----GLFYVLDHHGKIREKFPLEMAEIQGAV-VAADINDD 522
++ ++D G +I+ G + G G R+ QG + AD++ D
Sbjct: 320 VSIANMDNQGAPEIVFGAAVYANDGTLLWEGGAGNGRDG--------QGPISCVADLDGD 371
Query: 523 GKIELVTTDTH----GNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLS 578
+ EL+ +T G+V +G W+ + + D D DG ++V++
Sbjct: 372 FRPELIGGNTAYKTTGSVLGGDFDGSIWWQAEVGDGRC---GVADFDDDGMAEVIL-VRG 427
Query: 579 GNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLT----IVTTSFDGYLY 634
GNIY L+G+DGS + +P + D G+ G V +FDG +
Sbjct: 428 GNIYALNGQDGSLLATFPIPGSNDRGGAPNIADFNGDGQPDIGTAGSTRYVVVTFDGVEF 487
>gi|365539997|ref|ZP_09365172.1| Leukocidin S subunit [Vibrio ordalii ATCC 33509]
Length = 899
Score = 42.7 bits (99), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 37/165 (22%), Positives = 77/165 (46%), Gaps = 37/165 (22%)
Query: 379 VDSHILSTPVIADIDNDG---------VSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGK 429
+ + +++TPV+A +++D V+++I+ F+ Y N ++ L G+D +
Sbjct: 238 LSNQVMATPVVAQLNDDNGDGKIDEKDVADLIVVT---FEGNKYVNGGLVRALSGVDGSE 294
Query: 430 YVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVG-TSFG 488
W+ + + +A + SP V DLDGDG ++I+ TS
Sbjct: 295 L----------------WSYEQGEAIADARY------SPAVADLDGDGVVEIVTANTSSP 332
Query: 489 LFYVLDHHGKIREKF--PLEMAEIQGAVVAADINDDGKIELVTTD 531
+LD++G ++++ G++ AD++ DG +E++ D
Sbjct: 333 YLNILDNNGHLKKQILKAQTGGRSVGSISLADLDGDGSVEILAAD 377
>gi|293349536|ref|XP_002727153.1| PREDICTED: integrin alpha-9 [Rattus norvegicus]
gi|392350430|ref|XP_001061864.3| PREDICTED: integrin alpha-9 [Rattus norvegicus]
Length = 1007
Score = 42.7 bits (99), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 43/182 (23%), Positives = 67/182 (36%), Gaps = 39/182 (21%)
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYV 492
G + +F D + S +Y SS VDL+ DG D+LVG
Sbjct: 276 GKVYIFRADRRSGTLIKIFQASGKKMG--SYFGSSLCAVDLNMDGLSDLLVG-------- 325
Query: 493 LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVA-AWTAEGKGIWEQHL 551
A + ++I D+G++ + HG + T G + H
Sbjct: 326 --------------------APMFSEIRDEGQVTVYLNQGHGVLEEQLTLTGDAAYNAHF 365
Query: 552 KSLVTQGPSIGDVDGDGHSDVVVPT-----LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
+ S+GD+D DG DV V +G +Y+ G V Y + GR +N
Sbjct: 366 GESIA---SLGDLDDDGFPDVAVGAPMEDDFAGTVYIYHGDAQGIVPQYSMKLSGRKINP 422
Query: 607 VL 608
+L
Sbjct: 423 IL 424
>gi|94968415|ref|YP_590463.1| integrin-like protein [Candidatus Koribacter versatilis Ellin345]
gi|94550465|gb|ABF40389.1| Integrin-like protein [Candidatus Koribacter versatilis Ellin345]
Length = 1490
Score = 42.7 bits (99), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 52/108 (48%), Gaps = 18/108 (16%)
Query: 476 DGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGA------VVAADINDDGKIELVT 529
D NL +L+G G F V K L + + G +V AD+N DG I++
Sbjct: 1116 DNNLWVLLGNGDGTFTV---------KPSLPLVTLSGPDFRNHPLVVADLNGDGNIDVAV 1166
Query: 530 TDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS---IGDVDGDGHSDVVV 574
T+ +G +A + G G ++ L + + G S DV+GDG D+V+
Sbjct: 1167 TNRYGKLAVFLGNGDGTFQSPLTTNLPGGTSQMISADVNGDGKPDLVL 1214
Score = 39.3 bits (90), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 68/181 (37%), Gaps = 39/181 (21%)
Query: 472 DLDGDGNLDILVGTSFGL----FYVL------DHHGKIREKFPLEMAEI-QGAVVAADIN 520
D + DGNLDI TS L VL H +R F + + + +V D+N
Sbjct: 335 DFNHDGNLDIAAATSGNLANSIMVVLIGNGDGTFHTPVRYNFTGSCSGVSEHYLVGGDLN 394
Query: 521 DDGKIELVTTDTHGNVAAWT--AEGKGIWEQHLKSLVTQGPSI------GDVDGDGHSDV 572
DG ++LV D + W G G ++ VT + GD DGDG D+
Sbjct: 395 GDGNVDLVMNDDECSNLIWVFIGNGDGTFQTPYSVTVTTASEVESPIVAGDFDGDGVPDI 454
Query: 573 VVPTLSGNI--YVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFD 630
V + +LS DG+ P Y G GL I TT F+
Sbjct: 455 AVIGADEQVVHVLLSKGDGTFKTPASY------------------GIGQVGLKIATTDFN 496
Query: 631 G 631
G
Sbjct: 497 G 497
Score = 39.3 bits (90), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 54/113 (47%), Gaps = 10/113 (8%)
Query: 471 VDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIEL 527
DL+GDG+ D++V S + + G + P + V AD+N DGK +L
Sbjct: 1309 ADLNGDGHADLVVTDSIANTVSVSLGNGDGTFQTAVPYAVGLNPEHVTIADLNQDGKPDL 1368
Query: 528 VTTDT--HGNVAAWTAEGKGIWEQHLKSLVTQGPSI----GDVDGDGHSDVVV 574
+ T+ HG V+ G G ++ +++ G SI D++GDG D+ V
Sbjct: 1369 IVTNDNYHGTVSILLGNGNGTFQPQ-QTINAGGNSIFTALTDMNGDGKPDIAV 1420
>gi|167537274|ref|XP_001750306.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771134|gb|EDQ84805.1| predicted protein [Monosiga brevicollis MX1]
Length = 425
Score = 42.7 bits (99), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 59/125 (47%), Gaps = 11/125 (8%)
Query: 467 SPTVVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFPLEM-AEIQGAVVAADINDD 522
S DLD DG+LD+L + + +Y + +G + + A +V AAD++ D
Sbjct: 290 SVYAADLDKDGDLDVLSASMYDDKIAWYRNNGNGTFSAQIVVTTNASGANSVYAADLDKD 349
Query: 523 GKIELVTTDTHGNVAAW-TAEGKGIWEQHLKSLVTQGP-----SIGDVDGDGHSDVVVPT 576
G +++++ + AW G G++ + + T P D+D DGH DV+
Sbjct: 350 GYLDILSASISDDKVAWYRNNGNGMFSTQIV-ITTAAPFARSVYAADLDNDGHLDVLSAN 408
Query: 577 LSGNI 581
L GN+
Sbjct: 409 LDGNV 413
>gi|300794559|ref|NP_001179647.1| integrin alpha-9 precursor [Bos taurus]
Length = 1035
Score = 42.7 bits (99), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 55/241 (22%), Positives = 87/241 (36%), Gaps = 49/241 (20%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIND 521
+Y SS VDL+ DG D+LV GA + ++I D
Sbjct: 302 SYFGSSLCAVDLNTDGLSDLLV----------------------------GAPMFSEIRD 333
Query: 522 DGKIELVTTDTHGNV-AAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT---- 576
+G++ + +G + T G G + H + S+GD+D DG DV +
Sbjct: 334 EGQVTVYINKGNGVLEEQLTLSGDGAYNAHFGESIA---SLGDLDDDGFPDVAIGAPKED 390
Query: 577 -LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLV-------DLTKRGEKSKGLTIVTTS 628
SG +Y+ G V Y + GR ++ VL + + G +T+
Sbjct: 391 DFSGAVYIYHGDARGMVPQYSMKLSGRKISPVLRMFGQSISGGIDMDGNSYPDMTVGAFM 450
Query: 629 FDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHH 688
D + L P DV S ++ DG ++ + T CFS H
Sbjct: 451 SDSVVLLRARPVITVDVSIFLPASINITAPQCHDGQQPVNCLNVTA-----CFSFHGKHV 505
Query: 689 P 689
P
Sbjct: 506 P 506
>gi|296475138|tpg|DAA17253.1| TPA: integrin, alpha 9 [Bos taurus]
Length = 1037
Score = 42.7 bits (99), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 59/270 (21%), Positives = 94/270 (34%), Gaps = 51/270 (18%)
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYV 492
G + +F D + S +Y SS VDL+ DG D+LV
Sbjct: 275 GKVYIFRADRRSGTLIKIFQASGKKMG--SYFGSSLCAVDLNTDGLSDLLV--------- 323
Query: 493 LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNV-AAWTAEGKGIWEQHL 551
GA + ++I D+G++ + +G + T G G + H
Sbjct: 324 -------------------GAPMFSEIRDEGQVTVYINKGNGVLEEQLTLSGDGAYNAHF 364
Query: 552 KSLVTQGPSIGDVDGDGHSDVVVPT-----LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
+ S+GD+D DG DV + SG +Y+ G V Y + GR ++
Sbjct: 365 GESIA---SLGDLDDDGFPDVAIGAPKEDDFSGAVYIYHGDARGMVPQYSMKLSGRKISP 421
Query: 607 VLLV-------DLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLAD 659
VL + + G +T+ D + L P DV S ++
Sbjct: 422 VLRMFGQSISGGIDMDGNSYPDMTVGAFMSDSVVLLRARPVITVDVSIFLPASINITAPQ 481
Query: 660 NVDGGDDLDLIVTTMNGNVFCFSTPAPHHP 689
DG ++ + T CFS H P
Sbjct: 482 CHDGQQPVNCLNVTA-----CFSFHGKHVP 506
>gi|417305718|ref|ZP_12092667.1| FG-GAP repeat protein [Rhodopirellula baltica WH47]
gi|327537987|gb|EGF24682.1| FG-GAP repeat protein [Rhodopirellula baltica WH47]
Length = 372
Score = 42.7 bits (99), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 86/209 (41%), Gaps = 41/209 (19%)
Query: 470 VVDLDGDGNLDILVGTS---------FGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIN 520
V D++GDG DILVG +G ++ K+ P+ + DI+
Sbjct: 183 VGDINGDGRNDILVGWGWYEQPAENPWGQPWIAHRDWKLHASLPM---------LIEDID 233
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGI-------WEQH-LKSLVTQGPSI--GDVDGDGHS 570
DG +L+ + H W E G+ W++H + +Q S+ D+DGDG+
Sbjct: 234 QDGDQDLIYGEGHNYGLQWW-ENTGVNDSGSIEWKEHEIDRRFSQPHSMVWVDLDGDGNK 292
Query: 571 DVVVPTLSGNIYVLSGKD-GSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSF 629
D+V Y +G D G K P Y V ++ + G GL IV F
Sbjct: 293 DLVTGK---RYYAHNGNDPGGKEVPCLYWYSFDVASKSFQRHVIDEGRVGTGLQIVAEDF 349
Query: 630 DGYLYLIDGPTSCADVVDIGETSYSMVLA 658
DG DG T D+ G++ ++LA
Sbjct: 350 DG-----DGKT---DLAVAGKSGTYLLLA 370
>gi|393789632|ref|ZP_10377752.1| hypothetical protein HMPREF1068_04032 [Bacteroides nordii CL02T12C05]
gi|392650348|gb|EIY44017.1| hypothetical protein HMPREF1068_04032 [Bacteroides nordii CL02T12C05]
Length = 1443
Score = 42.7 bits (99), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 117/288 (40%), Gaps = 62/288 (21%)
Query: 380 DSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKEL-----GGIDIGKYVAGA 434
D + PV D +NDG+ ++ V Y +N E + G GKY+
Sbjct: 878 DVCYYACPVWLDYNNDGLMDLF--VPGLKSMNYTNNLEDIAAFLYENKGKGQDGKYLFEE 935
Query: 435 IVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLD 494
+ NL ++ T + + D R ++ +V D D DG LD+++ GL LD
Sbjct: 936 VNAANLAENKMG-ITPIYNTMDGGRSRQWV----SVGDYDKDGYLDLVIT---GLDDYLD 987
Query: 495 -------------HH--------------GKIREKFPLE-----MAEIQGAVVAADINDD 522
HH G IR++ PL+ + +G+V D+++D
Sbjct: 988 PDGKVDENGIPTVHHDRRVVYLYKNNKGKGFIRQETPLDGTKPFLGLSRGSVHFGDMDND 1047
Query: 523 GKIELVTTDT---HGNVAAWTAEGKGIW---EQHLKSLVTQGPSIGDVDGDGHSDVVVPT 576
G +++V++ GN+ + G G + EQ+L S+ D+D DG+ D+VV
Sbjct: 1048 GWLDIVSSGYGPDEGNLHVYWNNGDGTFSENEQYLYGANDSSCSLCDLDADGYMDIVVTG 1107
Query: 577 LSGNIYVLSGKDGSKVRP-YPYRTHGRVMNQVLLVDLTKRGEKSKGLT 623
S N GS + Y YR G N +L DL E G T
Sbjct: 1108 YSRN-------KGSGAKSFYVYRNCGN-RNFEMLNDLFCGFEGVDGAT 1147
>gi|167526930|ref|XP_001747798.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773902|gb|EDQ87538.1| predicted protein [Monosiga brevicollis MX1]
Length = 470
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 59/122 (48%), Gaps = 9/122 (7%)
Query: 467 SPTVVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFPLE-MAEIQGAVVAADINDD 522
S DLDGDGNLD+L + F +Y + G + + A+ +V AAD++ D
Sbjct: 328 SVYAADLDGDGNLDVLSASRFDNKFAWYRNNGDGTFSTQIVISTAADYAYSVYAADLDGD 387
Query: 523 GKIELVTTDTHGNVAAW-TAEGKGIWE-QHLKSLVTQGPS---IGDVDGDGHSDVVVPTL 577
G ++++T + + AW G G + Q + S G D+D DG DV+ ++
Sbjct: 388 GDFDVLSTSVNDDKIAWYRNNGDGAFSAQIIISTAADGACSVYAADLDSDGDLDVLSASV 447
Query: 578 SG 579
+G
Sbjct: 448 TG 449
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 59/121 (48%), Gaps = 11/121 (9%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEM-----AEIQGAVVAADIN 520
SS DLDGDGNLD+L SF + + E F E+ A+ +V AAD++
Sbjct: 22 SSVYAADLDGDGNLDVL-SASFLDDKIAWYRNNGDETFSAEIVISTVADYATSVYAADLD 80
Query: 521 DDGKIELVTTDTHGNVAAW-TAEGKG-IWEQHLKSLVTQGPS---IGDVDGDGHSDVVVP 575
D +++++ + N AW G G + Q + + G + D+DGDGH DV+
Sbjct: 81 GDSDLDVLSASINDNKIAWYRNNGDGTLSAQIIITTSANGAASVYAADLDGDGHLDVLSA 140
Query: 576 T 576
+
Sbjct: 141 S 141
>gi|338212573|ref|YP_004656628.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336306394|gb|AEI49496.1| FG-GAP repeat-containing protein [Runella slithyformis DSM 19594]
Length = 390
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 54/124 (43%), Gaps = 15/124 (12%)
Query: 461 RAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAV------ 514
+ + V D++ DG D++ G F+ + + E + + G
Sbjct: 38 KMFFSEGVAVGDVNRDGKKDVMAGA----FWFQAPNWERHEIAKGDTFTVNGGYSNSFLN 93
Query: 515 VAADINDDGKIELVTTDTHGNVAAWTAEGK---GIWEQHL--KSLVTQGPSIGDVDGDGH 569
+ D+N DG I+L+ D G W + G W+ H KSL + P GD+DGDG
Sbjct: 94 FSMDVNQDGWIDLIRVDYPGIPVVWHENPQNKPGHWKVHTISKSLGNESPYFGDIDGDGR 153
Query: 570 SDVV 573
+D+V
Sbjct: 154 ADIV 157
>gi|75908990|ref|YP_323286.1| Integrins subunit alpha [Anabaena variabilis ATCC 29413]
gi|75702715|gb|ABA22391.1| Integrins alpha chain [Anabaena variabilis ATCC 29413]
Length = 409
Score = 42.7 bits (99), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 63/141 (44%), Gaps = 6/141 (4%)
Query: 467 SPTVVDLDGDGNLDILV-GTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKI 525
S T+VD DG+G DI T+ G V G K+ L+ + + D N D
Sbjct: 217 SATIVDFDGNGKSDIFWRDTTTGANSVWFMDGIQATKYDLQAQDASWSYSLGDFNGDFTT 276
Query: 526 ELVTTDT-HGNVAAWTAEGKGIWEQHLKSLVTQGPS-IGDVDGDGHSDVV---VPTLSGN 580
+L+ +T G WT G + E L +L + + IGD +GDG +D+ T +
Sbjct: 277 DLLWRNTVTGENKIWTMNGIFVTEGTLNTLSSDWTANIGDFNGDGRTDIFWNNTTTGANT 336
Query: 581 IYVLSGKDGSKVRPYPYRTHG 601
++++G + P R+ G
Sbjct: 337 AWLMNGTSVTSEAFLPSRSPG 357
>gi|383453712|ref|YP_005367701.1| FG-GAP repeat-containing protein [Corallococcus coralloides DSM
2259]
gi|380728227|gb|AFE04229.1| FG-GAP repeat protein [Corallococcus coralloides DSM 2259]
Length = 501
Score = 42.4 bits (98), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 41/79 (51%), Gaps = 4/79 (5%)
Query: 513 AVVAADINDDGKIELVTTD-THGNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDG 568
AV D N+DG+++++T + T G V+ + G G + L+ P ++GD D DG
Sbjct: 50 AVALGDFNEDGRLDVITANHTAGTVSVLRSLGDGTFAPGLERPTGALPGAVAVGDFDVDG 109
Query: 569 HSDVVVPTLSGNIYVLSGK 587
H D V G + VL G+
Sbjct: 110 HLDAVTSHFGGTVSVLLGR 128
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 48/108 (44%), Gaps = 6/108 (5%)
Query: 472 DLDGDGNLDILVGT-SFGLFYVLDHHGKIREKFPLE--MAEIQGAVVAADINDDGKIELV 528
D + DG LD++ + G VL G LE + GAV D + DG ++ V
Sbjct: 55 DFNEDGRLDVITANHTAGTVSVLRSLGDGTFAPGLERPTGALPGAVAVGDFDVDGHLDAV 114
Query: 529 TTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS---IGDVDGDGHSDVV 573
T+ G V+ G G + V PS +GD+DGDG DVV
Sbjct: 115 TSHFGGTVSVLLGRGDGTFGARTDHSVGAEPSAVDVGDLDGDGRLDVV 162
>gi|156365496|ref|XP_001626681.1| predicted protein [Nematostella vectensis]
gi|156213567|gb|EDO34581.1| predicted protein [Nematostella vectensis]
Length = 425
Score = 42.4 bits (98), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 33/135 (24%), Positives = 62/135 (45%), Gaps = 19/135 (14%)
Query: 553 SLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
+L +G + D+D DG +++V+ ++G I+V G+ +P+ ++ V DL
Sbjct: 17 NLFNEGLLLEDIDNDGSNELVLGNVNGEIFVFK---GTASKPWRVSKQNGMITCVGSGDL 73
Query: 613 TKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVD-------IGETSYSMVLADNVDGGD 665
RG+K +V + DG+ +L D TS ++ SYS +L N
Sbjct: 74 WNRGKK----VLVAITADGWCHLFDVKTSAQKSIERKPEDGYTMRPSYSQLLPSNAK--- 126
Query: 666 DLDLIVTTMNGNVFC 680
L++ ++G+ C
Sbjct: 127 --TLLIADIDGDGLC 139
>gi|443326549|ref|ZP_21055200.1| putative calcium-binding protein,FG-GAP repeat protein [Xenococcus
sp. PCC 7305]
gi|442793871|gb|ELS03307.1| putative calcium-binding protein,FG-GAP repeat protein [Xenococcus
sp. PCC 7305]
Length = 1379
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 62/143 (43%), Gaps = 20/143 (13%)
Query: 468 PTVVDLDGDGNLDILVGTSFGLFY-----------VLDHHGKIREKFPLEMAEIQGAVVA 516
PT+ D +GDG LDI VG ++ L Y + + G + ++ E +
Sbjct: 303 PTIGDFNGDGVLDIAVG-NYALPYGSIVPSTVSILLGNGDGSFADSVEYDVGERPNEIAI 361
Query: 517 ADINDDGKIELVTTDTHGN-VAAWTAEGKGIWEQHLKSLVTQGP-------SIGDVDGDG 568
D N DG +L + N ++ G + +++ P S+ D+DGDG
Sbjct: 362 GDFNGDGVSDLAVANVGSNFISVLEGNNDGTFNAAKNIPISEEPYGGAYNVSVEDIDGDG 421
Query: 569 HSDVVVPTLSGNIYVLSGKDGSK 591
+D+VV T S + VL G+ S
Sbjct: 422 INDLVVGTTSSEVVVLQGEGNSN 444
>gi|442317612|ref|YP_007357633.1| hypothetical protein MYSTI_00599 [Myxococcus stipitatus DSM 14675]
gi|441485254|gb|AGC41949.1| hypothetical protein MYSTI_00599 [Myxococcus stipitatus DSM 14675]
Length = 386
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 63/147 (42%), Gaps = 16/147 (10%)
Query: 469 TVVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKI 525
V D++GDG+ D+L+ F + D G RE P + V AD++ DG +
Sbjct: 102 AVSDVNGDGHQDLLIVGHFSNAMTVRLGDGRGGFREGIPYSLGNHSQQVRVADLDGDGHL 161
Query: 526 ELVTTDTHG----NVAAWTAEGKGIWEQHLKSLVTQGPS---IGDVDGDGHSDVVVPTLS 578
+ +T + N+ G G + + VT P + DV DG DV+V L+
Sbjct: 162 DAITKNAGSGGFFNITTLKGRGDGTFGTAVPHAVTGLPRDLMLADVSADGLPDVLV--LN 219
Query: 579 GNIY----VLSGKDGSKVRPYPYRTHG 601
N + +LS KDG P R G
Sbjct: 220 TNSFTVDILLSKKDGVLASIAPLRLSG 246
>gi|455647730|gb|EMF26662.1| hypothetical protein H114_23197 [Streptomyces gancidicus BKS 13-15]
Length = 1077
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 61/126 (48%), Gaps = 20/126 (15%)
Query: 472 DLDGDGNLDILV-GTSFGLFY----VLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIE 526
DL+GDG++D++ T+ G Y DHH K R K + V A D+N DG+ +
Sbjct: 929 DLNGDGHVDLIARQTTTGDIYFYAGTADHHLKARVKIQTNWKLYKWIVGAGDLNGDGRGD 988
Query: 527 LVTTDTHGNVAAW--TAEGK--------GIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT 576
L+ D+ G + + TA G G W + +LV GD+ GDG +D++
Sbjct: 989 LLGQDSAGVLWRYYGTAAGSVTRRERVGGGWNAY-SALV----GTGDLSGDGRTDLLARD 1043
Query: 577 LSGNIY 582
+G ++
Sbjct: 1044 TAGKLW 1049
>gi|301511150|ref|ZP_07236387.1| putative competence factor involved in DNA binding and uptake
(ComC) [Acinetobacter baumannii AB058]
Length = 475
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 39 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 97
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 98 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 136
>gi|390957791|ref|YP_006421548.1| FG-GAP repeat-containing protein [Terriglobus roseus DSM 18391]
gi|390412709|gb|AFL88213.1| FG-GAP repeat protein [Terriglobus roseus DSM 18391]
Length = 416
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 59/115 (51%), Gaps = 12/115 (10%)
Query: 471 VDLDGDGNLDILVGTSFGLFYV-LDHHGKIREKFPLEMAEIQGA-----VVAADINDDGK 524
+D+DGDG DI+ + F V L + GK P + EI + D+N+D K
Sbjct: 92 MDVDGDGWTDIIQFSYFAHNIVWLKNPGKTGG--PWKQTEIDNSGPTEFAFLVDLNNDSK 149
Query: 525 I-ELVTTDTHGNV-AAWTAEGKGIWEQHLKSLVTQGPSIG--DVDGDGHSDVVVP 575
EL+ NV AAW G W +H+ + ++ G IG DV+GDG +D+++P
Sbjct: 150 ANELLPEFDRPNVPAAWFELANGKWVKHVVAKMSYGHGIGAGDVNGDGRNDILIP 204
>gi|392953373|ref|ZP_10318927.1| hypothetical protein WQQ_29990 [Hydrocarboniphaga effusa AP103]
gi|391858888|gb|EIT69417.1| hypothetical protein WQQ_29990 [Hydrocarboniphaga effusa AP103]
Length = 935
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 62/225 (27%), Positives = 101/225 (44%), Gaps = 21/225 (9%)
Query: 466 SSPTVVDLDGDGNLDIL--VGTSFGLFYVLDHHG--KIREKFPLEMAEIQGAVVAADIND 521
SS D DGDG++D+ VG LF + D+ G + +++ L A + G + A+ +
Sbjct: 387 SSIQAADFDGDGDIDLAASVGGVIALF-IGDNTGGFSVGDRY-LAGAAVSGMTL-ANFDS 443
Query: 522 DGKIELVTTDTHGN-VAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGN 580
DG +++V +DT V+ +G + L V +G + GD D DG D+ V GN
Sbjct: 444 DGNLDIVASDTKSRAVSVLRGNDEGRFSAALDFRVGRGVAAGDFDRDGKQDIAV-IRGGN 502
Query: 581 IYVLSGK-DGSKVRPYPY---RTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLI 636
+ + G+ G P + R N + VDL G+ ++T + Y+ ++
Sbjct: 503 VEIRMGEGQGRFGNPTSFLGGRPENPSPNSIRSVDLDADGKLD---LVLTNNVGNYVSIL 559
Query: 637 ----DGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGN 677
+G A +GET AD D G DLIV ++ N
Sbjct: 560 FGKANGTFQKAITYSVGETPIDSATADFNDDGFQ-DLIVANVDSN 603
>gi|167536035|ref|XP_001749690.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771838|gb|EDQ85499.1| predicted protein [Monosiga brevicollis MX1]
Length = 1367
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 52/181 (28%), Positives = 77/181 (42%), Gaps = 35/181 (19%)
Query: 424 GIDIGKYVAGA---------IVVFNLDTKQVKWTTDLDLSTDNASF--RAYIYSS----- 467
G D +Y+A A IVV + D K + W +L N +F R I S
Sbjct: 268 GFDSPRYLAAADLDGNGWLDIVVGDRDAKHLLWLPNLG----NGAFGPRRLIQGSLNTLR 323
Query: 468 -PTVVDLDGDGNLDILV--GTSFGLFYVLDHHGKIREKFPLEMAEIQGA----VVAADIN 520
T DLDGDG D+L G L++ + G F + M I + A D+N
Sbjct: 324 MVTTGDLDGDGQADVLCTKGEDEALWFRNEDSGA---SFAVRMLGISAVGSYTMAAGDLN 380
Query: 521 DDGKIELVTTDTHGNVAAWTAE-GKGIWEQ-HLKSLVTQGP---SIGDVDGDGHSDVVVP 575
DG ++++T + + AW G G + + S +T P D+D DGH DV+
Sbjct: 381 GDGLLDVITASSTNDFLAWFPNLGAGRFATLDIISTLTLNPLSVHAADLDLDGHVDVLCA 440
Query: 576 T 576
+
Sbjct: 441 S 441
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 76/168 (45%), Gaps = 19/168 (11%)
Query: 464 IYSSPT---VVDLDGDGNLDILVG-TSFGLFYVLDHHGKIR-EKFPLEMAEIQGA--VVA 516
+ SPT + D+DGDG DI+ T+ G + G + + + G V
Sbjct: 570 VLQSPTGVVLADVDGDGLRDIICAETAAGRIIWFRNKGAFSFSRLRILANPVAGPSLVAT 629
Query: 517 ADINDDGKIELVTTDTHGNVAAWTA-EGKGIWEQHLK---SLVT-QGPSIGDVDGDGHSD 571
AD+N DG ++ V+T+ + AW A G G + +K S+ + QG ++GD+D DG D
Sbjct: 630 ADLNRDGSLDFVSTNDLDDSVAWYANSGGGEFMPAVKLNGSISSPQGLALGDLDNDGDID 689
Query: 572 VVVPTLSGNIYVLSGKDGSKVRPY--PYRTHG-----RVMNQVLLVDL 612
V + T + G + PY P R + RV +L + L
Sbjct: 690 VTISTYETSFAGWYHLPGYQTEPYLVPERANASFWGLRVETHLLFLGL 737
>gi|386381832|ref|ZP_10067529.1| endonuclease/exonuclease/phosphatase [Streptomyces tsukubaensis
NRRL18488]
gi|385670711|gb|EIF93757.1| endonuclease/exonuclease/phosphatase [Streptomyces tsukubaensis
NRRL18488]
Length = 581
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 57/126 (45%), Gaps = 13/126 (10%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG----AVVAADINDDGKIEL 527
D+DGDG D+LV G Y++ +G + + + G + D+ DGK ++
Sbjct: 431 DMDGDGLADLLVAFDNGDLYLVPGNGDGTVDAKIRIGQGWGVYDTVLTPGDVTGDGKADV 490
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSI-------GDVDGDGHSDVVVPTLSGN 580
V DT G + + G G E ++ + G + GDV+GDG D++ +G
Sbjct: 491 VARDTAGALFVFPGTGTGTLEA--RTQIGHGWQVYNTVFAPGDVNGDGKPDLLARDNAGT 548
Query: 581 IYVLSG 586
+ +G
Sbjct: 549 LQFYAG 554
>gi|293610735|ref|ZP_06693035.1| conserved hypothetical protein [Acinetobacter sp. SH024]
gi|292827079|gb|EFF85444.1| conserved hypothetical protein [Acinetobacter sp. SH024]
Length = 1284
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 52/99 (52%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDT-K 443
S PV+A+I +GV + ++ V +D + Y+NP GK A+ + + T +
Sbjct: 848 SKPVLANIRYNGVIKRVMIVGGGYD-QCYENPNITLSNSCFTNGKAKGNAVYIIDAKTGE 906
Query: 444 QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
++ WT+D + DN++ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNIDNSNMKHSIVSRISTLDRDADGLVDHL 945
>gi|424913174|ref|ZP_18336548.1| FG-GAP repeat protein [Rhizobium leguminosarum bv. viciae USDA
2370]
gi|392844331|gb|EJA96854.1| FG-GAP repeat protein [Rhizobium leguminosarum bv. viciae USDA
2370]
Length = 415
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 9/114 (7%)
Query: 470 VVDLDGDGNLDILVGTSFGL--FYVLDHHGKIREKFPLEMAEIQGA--VVAADINDDGKI 525
+ D+D DG+LD++V G Y+ D GK+ K EI V D N DG
Sbjct: 53 MADVDSDGDLDVIVAVEMGASRLYLNDGKGKLTLKSDAFGGEIHDNEHVRVTDFNGDGNA 112
Query: 526 ELV-TTDTHGNVAAWTAEGKGIWEQHLKSL--VTQGPS--IGDVDGDGHSDVVV 574
+ + + G + +GKG + L ++QG + IGDV+GDG +D+VV
Sbjct: 113 DAIFVAEEDGEHQLYFGDGKGNFTNATDRLPGMSQGNALAIGDVNGDGLTDIVV 166
>gi|297193753|ref|ZP_06911151.1| secreted protein [Streptomyces pristinaespiralis ATCC 25486]
gi|297151929|gb|EDY67070.2| secreted protein [Streptomyces pristinaespiralis ATCC 25486]
Length = 587
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/123 (24%), Positives = 51/123 (41%), Gaps = 12/123 (9%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGK-----IREKFPLEMAEIQGAVVAADINDDGKIE 526
DL DG D++ G+ Y+ +G +++ + + A DI+ DG+ +
Sbjct: 438 DLSSDGKGDLIARDGSGVLYLYRGNGNGTGFAAKQRIGSGWGQFNALLGAGDISGDGRAD 497
Query: 527 LVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQG-------PSIGDVDGDGHSDVVVPTLSG 579
LV G + + G K L+ G + GD+ GDG +D+V T G
Sbjct: 498 LVARAKDGKLYLYEGTGAAAAPFKAKRLIGSGWGAYTKLAAPGDMTGDGRADIVARTSGG 557
Query: 580 NIY 582
+Y
Sbjct: 558 TLY 560
>gi|417821233|ref|ZP_12467847.1| FG-GAP repeat family protein [Vibrio cholerae HE39]
gi|419830357|ref|ZP_14353842.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-1A2]
gi|419834036|ref|ZP_14357491.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-61A2]
gi|422917737|ref|ZP_16952055.1| FG-GAP repeat family protein [Vibrio cholerae HC-02A1]
gi|423822643|ref|ZP_17716653.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-55C2]
gi|423856034|ref|ZP_17720458.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-59A1]
gi|423882908|ref|ZP_17724045.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-60A1]
gi|423956394|ref|ZP_17734948.1| PQQ enzyme repeat family protein [Vibrio cholerae HE-40]
gi|423985180|ref|ZP_17738497.1| PQQ enzyme repeat family protein [Vibrio cholerae HE-46]
gi|423998167|ref|ZP_17741419.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-02C1]
gi|424017061|ref|ZP_17756890.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-55B2]
gi|424019985|ref|ZP_17759771.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-59B1]
gi|424625352|ref|ZP_18063813.1| FG-GAP repeat family protein [Vibrio cholerae HC-50A1]
gi|424629839|ref|ZP_18068126.1| FG-GAP repeat family protein [Vibrio cholerae HC-51A1]
gi|424633882|ref|ZP_18071982.1| FG-GAP repeat family protein [Vibrio cholerae HC-52A1]
gi|424636963|ref|ZP_18074971.1| FG-GAP repeat family protein [Vibrio cholerae HC-55A1]
gi|424640873|ref|ZP_18078756.1| FG-GAP repeat family protein [Vibrio cholerae HC-56A1]
gi|424648941|ref|ZP_18086604.1| FG-GAP repeat family protein [Vibrio cholerae HC-57A1]
gi|443527860|ref|ZP_21093909.1| FG-GAP repeat family protein [Vibrio cholerae HC-78A1]
gi|340038864|gb|EGQ99838.1| FG-GAP repeat family protein [Vibrio cholerae HE39]
gi|341636619|gb|EGS61313.1| FG-GAP repeat family protein [Vibrio cholerae HC-02A1]
gi|408012650|gb|EKG50423.1| FG-GAP repeat family protein [Vibrio cholerae HC-50A1]
gi|408018187|gb|EKG55646.1| FG-GAP repeat family protein [Vibrio cholerae HC-52A1]
gi|408023591|gb|EKG60752.1| FG-GAP repeat family protein [Vibrio cholerae HC-56A1]
gi|408024071|gb|EKG61204.1| FG-GAP repeat family protein [Vibrio cholerae HC-55A1]
gi|408032890|gb|EKG69459.1| FG-GAP repeat family protein [Vibrio cholerae HC-57A1]
gi|408055192|gb|EKG90131.1| FG-GAP repeat family protein [Vibrio cholerae HC-51A1]
gi|408620130|gb|EKK93142.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-1A2]
gi|408634619|gb|EKL06854.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-55C2]
gi|408640937|gb|EKL12719.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-59A1]
gi|408641032|gb|EKL12813.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-60A1]
gi|408648858|gb|EKL20175.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-61A2]
gi|408657583|gb|EKL28661.1| PQQ enzyme repeat family protein [Vibrio cholerae HE-40]
gi|408664411|gb|EKL35248.1| PQQ enzyme repeat family protein [Vibrio cholerae HE-46]
gi|408852522|gb|EKL92344.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-02C1]
gi|408859953|gb|EKL99607.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-55B2]
gi|408867079|gb|EKM06441.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-59B1]
gi|443453732|gb|ELT17550.1| FG-GAP repeat family protein [Vibrio cholerae HC-78A1]
Length = 691
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 38/63 (60%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH GKI+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGKIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|375136249|ref|YP_004996899.1| putative competence factor involved in DNA binding and uptake
(ComC) [Acinetobacter calcoaceticus PHEA-2]
gi|325123694|gb|ADY83217.1| putative competence factor involved in DNA binding and uptake
(ComC) [Acinetobacter calcoaceticus PHEA-2]
Length = 1252
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 52/99 (52%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDT-K 443
S PV+A+I +GV + ++ V +D + Y+NP GK A+ + + T +
Sbjct: 816 SKPVLANIRYNGVIKRVMIVGGGYD-QCYENPNITLSNSCFTNGKAKGNAVYIIDAKTGE 874
Query: 444 QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
++ WT+D + DN++ + I S + +D D DG +D L
Sbjct: 875 RLWWTSDTGSNIDNSNMKHSIVSRISTLDRDADGLVDHL 913
>gi|262280315|ref|ZP_06058099.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
calcoaceticus RUH2202]
gi|262258093|gb|EEY76827.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
calcoaceticus RUH2202]
Length = 1284
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 52/99 (52%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDT-K 443
S PV+A+I +GV + ++ V +D + Y+NP GK A+ + + T +
Sbjct: 848 SKPVLANIRYNGVIKRVMIVGGGYD-QCYENPNITLSNSCFTNGKAKGNAVYIIDAKTGE 906
Query: 444 QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
++ WT+D + DN++ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNIDNSNMKHSIVSRISTLDRDADGLVDHL 945
>gi|255036905|ref|YP_003087526.1| FG-GAP repeat-containing protein [Dyadobacter fermentans DSM 18053]
gi|254949661|gb|ACT94361.1| FG-GAP repeat protein [Dyadobacter fermentans DSM 18053]
Length = 1348
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 95/225 (42%), Gaps = 66/225 (29%)
Query: 390 ADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTT 449
D+D DG S++I+ S + D G+ GA V++ +
Sbjct: 463 GDVDGDGYSDVIVGASMY------------------DKGESNEGAAFVYHGSINGIATAA 504
Query: 450 DLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSF---------GLFYVLDHHGK-- 498
L + A+ + +S+ T D++GDG D++VG SF G +V +HG
Sbjct: 505 KTTLENNQAA-AFFGFSASTAGDVNGDGFSDVIVG-SFHFDNGQNNEGGAFV--YHGSAN 560
Query: 499 -----IREKFPLEMAEIQ-GAVVAA--DINDDGKIELV---------------------T 529
I ++ A Q GA VA+ D+N DG +++ +
Sbjct: 561 GISTTIAKQLECNQAGAQYGASVASAGDVNGDGYADVIVGANLFDNGQGNEGGAFVYEGS 620
Query: 530 TDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVV 574
+ N +A + EG + +L S V+ S GDV+GDG+SD+VV
Sbjct: 621 SSGLANTSASSQEGNQL-NAYLGSSVS---SAGDVNGDGYSDIVV 661
>gi|427424023|ref|ZP_18914160.1| PilC beta-propeller domain protein [Acinetobacter baumannii WC-136]
gi|425699131|gb|EKU68750.1| PilC beta-propeller domain protein [Acinetobacter baumannii WC-136]
Length = 1284
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 52/99 (52%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDT-K 443
S PV+A+I +GV + ++ V +D + Y+NP GK A+ + + T +
Sbjct: 848 SKPVLANIRYNGVIKRVMIVGGGYD-QCYENPNITLSNSCFTNGKAKGNAVYIIDAKTGE 906
Query: 444 QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
++ WT+D + DN++ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNIDNSNMKHSIVSRISTLDRDADGLVDHL 945
>gi|307592322|ref|YP_003899913.1| hypothetical protein Cyan7822_6000 [Cyanothece sp. PCC 7822]
gi|306985967|gb|ADN17847.1| hypothetical protein Cyan7822_6000 [Cyanothece sp. PCC 7822]
Length = 5687
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 80/172 (46%), Gaps = 35/172 (20%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHG-----KIREKFPL----EMAEIQGAVVA--ADIN 520
D+D DG+LD+++G S G +++++G R+ L + + G V AD++
Sbjct: 2597 DMDTDGDLDLIIGKSSGDVQIIENNGSRLFPSFRQPRALIGDGKALRMSGPVAPTLADLD 2656
Query: 521 DDGKIELVTTDTHGNVAAW---TAEGKGIWEQHLKSLVTQGPSIG-----------DVDG 566
DG ++L++ D+ G ++ + TAEGK + +S+ + P +G D+D
Sbjct: 2657 GDGILDLISIDSTGQMSEFTVGTAEGKMSLQ---RSVAHRLPKLGANEQGYFVQFFDLDQ 2713
Query: 567 DGHSDVVV------PTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
DG D +V P GN Y G+ PY + +N V L DL
Sbjct: 2714 DGDLDALVDNTRNRPGQLGNGYHYYVNYGTTESPYFVESFHSSVN-VFLRDL 2764
>gi|397616116|gb|EJK63834.1| hypothetical protein THAOC_15488 [Thalassiosira oceanica]
Length = 634
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 60/122 (49%), Gaps = 8/122 (6%)
Query: 470 VVDLDGDGNLDILVGT-SFGLFYVLDH-HGKIREKFPLEMAEIQGAVVAADINDDGKIEL 527
+ D++GDG D +VG+ SF L++ G + L A+ A D N DG ++L
Sbjct: 183 LADINGDGYPDYIVGSDSFNAQVFLNNGSGGFDSPYDLTTQTASRAICACDFNQDGNMDL 242
Query: 528 VTTDTHG-NVAAWTAEGKGIWEQHLKSLV-TQGPSIG----DVDGDGHSDVVVPTLSGNI 581
V + G N + +G + L + G S+G D++GDG+ DV+V SG+
Sbjct: 243 VVSFGQGKNNPVYMNDGSDPPVFAVTDLPDSLGNSVGNACADLNGDGYPDVIVSDYSGDT 302
Query: 582 YV 583
YV
Sbjct: 303 YV 304
>gi|373457298|ref|ZP_09549065.1| FG-GAP repeat protein [Caldithrix abyssi DSM 13497]
gi|371718962|gb|EHO40733.1| FG-GAP repeat protein [Caldithrix abyssi DSM 13497]
Length = 1556
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 50/197 (25%), Positives = 87/197 (44%), Gaps = 30/197 (15%)
Query: 508 AEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWE-QHLKSLVTQGPSIGDVDG 566
+E+ G VVA D+N D + EL+ + +G + + +G ++ + L+ +T P+IG V G
Sbjct: 1012 SEVNG-VVAYDLNGDRRKELIVSGGNGQLHVYDWQGGLLFSVEGLEGDLTV-PAIGQVTG 1069
Query: 567 DGHSDVVVP------TLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRG---- 616
D ++VV TL+ N+YV+ K+G + H V + +L DL G
Sbjct: 1070 DAQVEIVVAGSKEGLTLN-NVYVIDSKNGQLLA--SANLHYNVPSSAVLADLDADGYDEI 1126
Query: 617 ------------EKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGG 664
++ + I T S D P +V G S + A ++D
Sbjct: 1127 LLLTHGNNAPEPPQNSRIFIFTDSSGVLTGFKDWPDDGYALV--GSASVGNLAAADLDQS 1184
Query: 665 DDLDLIVTTMNGNVFCF 681
L ++V T+ ++CF
Sbjct: 1185 GQLSVVVPTVESKLYCF 1201
>gi|434404233|ref|YP_007147118.1| FG-GAP repeat protein [Cylindrospermum stagnale PCC 7417]
gi|428258488|gb|AFZ24438.1| FG-GAP repeat protein [Cylindrospermum stagnale PCC 7417]
Length = 422
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 51/109 (46%), Gaps = 3/109 (2%)
Query: 467 SPTVVDLDGDGNLDILV-GTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKI 525
+PT+VD DG+G D+ T+ G V G + L+ + D N D K
Sbjct: 228 APTIVDFDGNGKSDVFWRNTTTGENSVWFMDGTQATNYALQSQDAAWTYKLGDFNGDFKT 287
Query: 526 ELVTTDTH-GNVAAWTAEGKGIWEQHLKSL-VTQGPSIGDVDGDGHSDV 572
+++ ++ G W G + E L++L T SIGD +GDG +D+
Sbjct: 288 DILWNNSQTGENKIWNMNGIFVTEGALQTLDSTWTSSIGDYNGDGKTDI 336
>gi|359150135|ref|ZP_09182996.1| hypothetical protein StrS4_26121 [Streptomyces sp. S4]
Length = 440
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 35/65 (53%), Gaps = 3/65 (4%)
Query: 468 PTVVDLDGDGNLDILVGTSFGLFYVLDHHGKI---REKFPLEMAEIQGAVVAADINDDGK 524
PTV DLDGDG+LD++V + + + + R + PL GA VA D++ DG
Sbjct: 115 PTVADLDGDGHLDLVVSGAAQVVWGGPDGPRADGPRGRVPLPGGPYAGAPVAGDVDGDGH 174
Query: 525 IELVT 529
+LV
Sbjct: 175 TDLVA 179
>gi|348507312|ref|XP_003441200.1| PREDICTED: integrin alpha-9 [Oreochromis niloticus]
Length = 1030
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 46/212 (21%), Positives = 80/212 (37%), Gaps = 45/212 (21%)
Query: 430 YVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYI-----YSSPTVVDLDG-------DG 477
Y G + V+NL ++ ++ + S+ Y +SSP V+D+ +G
Sbjct: 217 YWTGTVKVYNLTSESFYSPNKDNIDSHRYSYLGYAVTAGHFSSPNVIDIAAGAPQHSTNG 276
Query: 478 NLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIEL-VTTDTH--- 533
+ I L GK+ + ++ A D+N DG +L V H
Sbjct: 277 KVYIFKIDGASLVKSFQASGKMMGSY------FGSSLCAVDLNQDGLSDLLVGAPMHSQL 330
Query: 534 ---GNVAAWTAEGKGIWEQ------------HLKSLVTQGPSIGDVDGDGHSDVVVPT-- 576
G V+ + + G G+ E+ H + +IGD+D DG+ DV +
Sbjct: 331 RDEGQVSVYLSRGNGVMEEDGVLTGDSAFNAHFGECIA---AIGDIDDDGYQDVAIGAPK 387
Query: 577 ---LSGNIYVLSGKDGSKVRPYPYRTHGRVMN 605
G +Y+ G + Y + GR +N
Sbjct: 388 EDEYGGAVYIYHGDATGITKKYSMKLSGRSVN 419
>gi|374984425|ref|YP_004959920.1| hypothetical protein SBI_01668 [Streptomyces bingchenggensis BCW-1]
gi|297155077|gb|ADI04789.1| hypothetical protein SBI_01668 [Streptomyces bingchenggensis BCW-1]
Length = 478
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 55/121 (45%), Gaps = 11/121 (9%)
Query: 472 DLDGDGNLDILV-GTSFGLFYVL----DHHGKIREKFPLEMAEIQGA--VVAADINDDGK 524
D +GDG+ D++ GT G VL G L G +VA DIN DG
Sbjct: 170 DFNGDGHADLVAEGTGEGELRVLYGPFTRQGTAATANTLNSGRTFGPTEMVAGDINGDGS 229
Query: 525 IELVTTDTHGNVAA----WTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGN 580
+LVTT ++ W G+ + +IGDVD DG++D+++ T+ GN
Sbjct: 230 DDLVTTHAFEEMSEPSLYWKGGASGLGSAPTRIDHAAAGTIGDVDKDGYADLIIRTVPGN 289
Query: 581 I 581
+
Sbjct: 290 V 290
>gi|408530574|emb|CCK28748.1| hypothetical protein BN159_4369 [Streptomyces davawensis JCM 4913]
Length = 1025
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 57/126 (45%), Gaps = 11/126 (8%)
Query: 472 DLDGDGNLDILV-GTSFGLFYVLD--HHGKIREKFPL--EMAEIQGAVVAADINDDGKIE 526
DL GDG D+LV S G Y+ GK+ + L + + V DI DGK +
Sbjct: 877 DLSGDGRPDLLVRNASTGAVYLYKGTSTGKLSARVKLYDDWKTYKKIVGVGDITGDGKAD 936
Query: 527 LVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS------IGDVDGDGHSDVVVPTLSGN 580
L+ D + + +G G + +K G + +GD+ DG +D+V SGN
Sbjct: 937 LLAQDKANTLYRYDGKGDGTFAARVKLFSDWGGTYNAVVGVGDITDDGRADLVSRDTSGN 996
Query: 581 IYVLSG 586
++ SG
Sbjct: 997 LWRNSG 1002
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 40/88 (45%), Gaps = 5/88 (5%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHG----KIREKFPLEMAEIQGAVVA-ADINDDGK 524
V D+ GDG D+L Y D G R K + AVV DI DDG+
Sbjct: 926 VGDITGDGKADLLAQDKANTLYRYDGKGDGTFAARVKLFSDWGGTYNAVVGVGDITDDGR 985
Query: 525 IELVTTDTHGNVAAWTAEGKGIWEQHLK 552
+LV+ DT GN+ + +GKG + K
Sbjct: 986 ADLVSRDTSGNLWRNSGDGKGSFGSRTK 1013
>gi|195498004|ref|XP_002096340.1| GE25117 [Drosophila yakuba]
gi|194182441|gb|EDW96052.1| GE25117 [Drosophila yakuba]
Length = 1028
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 81/207 (39%), Gaps = 33/207 (15%)
Query: 410 HEYYDNPEHLKELGG----IDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIY 465
H + +P L + I G + V + Q +W+ L + RA
Sbjct: 163 HWWRPDPAQLAAISAGSSRISYGSSLGSEFVPCTQISVQKRWSRSLARMNSESPLRA--- 219
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKI 525
DL+GDG D++ G +G+ + + G + + Q A ++ +G
Sbjct: 220 -----ADLNGDGIRDVIFG--YGVDDNIPYEG-------IPLPRCQSAQQGEEVPCEG-- 263
Query: 526 ELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLS 585
G VA +G +W+ + V DVDGDG +D V G IY ++
Sbjct: 264 --------GVVALNGLDGSILWQSWSVANVFSLHCSADVDGDGGTDCVAAGRLGMIYAIN 315
Query: 586 GKDGSKVRPYPYRTHGRVMNQVLLVDL 612
G+ GS + + +R N +++DL
Sbjct: 316 GRTGSVI--WSFRELEVETNSPIVMDL 340
>gi|194899929|ref|XP_001979510.1| GG15875 [Drosophila erecta]
gi|190651213|gb|EDV48468.1| GG15875 [Drosophila erecta]
Length = 1033
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 45/188 (23%), Positives = 75/188 (39%), Gaps = 29/188 (15%)
Query: 425 IDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVG 484
I G + V + Q +WT L + RA DL+GDG D++ G
Sbjct: 187 ISYGSSLGSEFVPCTQISVQKRWTRSLARMNSESPLRA--------ADLNGDGIRDVVFG 238
Query: 485 TSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGK 544
+G+ + + G + + Q A ++ +G G VA +G
Sbjct: 239 --YGVDDNIPYEG-------IPLPRCQSAQQGEEVPCEG----------GVVALNGLDGS 279
Query: 545 GIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVM 604
+W+ + V DVDGDG +D V G IY ++G+ GS + + +R
Sbjct: 280 ILWQSWSVANVFSLHCSADVDGDGGTDCVAAGRLGMIYAINGRTGSVI--WSFRELEVET 337
Query: 605 NQVLLVDL 612
N +++DL
Sbjct: 338 NSPIVMDL 345
>gi|424744602|ref|ZP_18172893.1| PilC beta-propeller domain protein [Acinetobacter baumannii WC-141]
gi|422942815|gb|EKU37851.1| PilC beta-propeller domain protein [Acinetobacter baumannii WC-141]
Length = 1284
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 52/99 (52%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDT-K 443
S PV+A+I +GV + ++ V +D + Y+NP GK A+ + + T +
Sbjct: 848 SKPVLANIRYNGVIKRVMIVGGGYD-QCYENPTITLSNSCFTNGKAKGNAVYIIDAKTGE 906
Query: 444 QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
++ WT+D + DN++ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNIDNSNMKHSIVSRISTLDRDADGLVDHL 945
>gi|359777241|ref|ZP_09280531.1| subtilisin family peptidase [Arthrobacter globiformis NBRC 12137]
gi|359305573|dbj|GAB14360.1| subtilisin family peptidase [Arthrobacter globiformis NBRC 12137]
Length = 725
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/124 (24%), Positives = 54/124 (43%), Gaps = 9/124 (7%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGK----IREKFPLEMAEIQGAVVAADINDDGKIEL 527
D +GDG D+L + G ++ +G R++ V D N D + ++
Sbjct: 485 DFNGDGKADVLARENSGFLWLYPGNGSGGWLARKQMGSGWNGFTSLVAPGDFNGDRRADV 544
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLK-----SLVTQGPSIGDVDGDGHSDVVVPTLSGNIY 582
+ DT G + + G G W +K + T GD++GDG +DV+ +G ++
Sbjct: 545 LARDTTGAMWLYPGNGSGGWLARVKVGSGWNGFTAIVGAGDINGDGKADVLARDTTGALW 604
Query: 583 VLSG 586
+ G
Sbjct: 605 LYPG 608
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 51/122 (41%), Gaps = 13/122 (10%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGK----IREKFPLEMAEIQGAVVAADINDDGKIEL 527
D +GD D+L + G ++ +G R K V A DIN DGK ++
Sbjct: 535 DFNGDRRADVLARDTTGAMWLYPGNGSGGWLARVKVGSGWNGFTAIVGAGDINGDGKADV 594
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLK-------SLVTQGPSIGDVDGDGHSDVVVPTLSGN 580
+ DT G + + G+G W K GP GDV GD +DV+ +G
Sbjct: 595 LARDTTGALWLYPGNGRGGWLARTKVGSGWNSMTAIVGP--GDVTGDRKADVLARDTAGV 652
Query: 581 IY 582
++
Sbjct: 653 LW 654
>gi|119486148|ref|ZP_01620208.1| hypothetical protein L8106_17432 [Lyngbya sp. PCC 8106]
gi|119456639|gb|EAW37768.1| hypothetical protein L8106_17432 [Lyngbya sp. PCC 8106]
Length = 1010
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 15/129 (11%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIRE---KFPLEMAEI--QGAVVAADIN 520
S+PT VD +GDG+LD+ +G G + + G P+ + ++ + DI+
Sbjct: 434 SAPTFVDENGDGDLDLFIGAGDGTIAIARNEGTATNPNFSAPIVIRDVGNRSTPTFVDID 493
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVT----------QGPSIGDVDGDGHS 570
+DG +L + DT G + + G ++ P+ D+DGDG
Sbjct: 494 NDGDQDLFSGDTFGRLHYYRNRGNVNGASFDPPIINPFGFSGAGGLSNPAFADIDGDGDH 553
Query: 571 DVVVPTLSG 579
D+ V T +G
Sbjct: 554 DLFVGTATG 562
>gi|11499528|ref|NP_070770.1| cysteine proteinase [Archaeoglobus fulgidus DSM 4304]
gi|2648597|gb|AAB89309.1| cysteine proteinase, putative [Archaeoglobus fulgidus DSM 4304]
Length = 1088
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 53/119 (44%), Gaps = 1/119 (0%)
Query: 469 TVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELV 528
V D++GDG +I+ G V D +G +F L+ E A+ D+N DG E+V
Sbjct: 967 CVGDVNGDGKDEIVHGDRGDWVRVYDMYGTKLSEFQLDF-EQGDALQCGDLNGDGVDEIV 1025
Query: 529 TTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGK 587
D V + G + E G ++ D +GDG D++ + +V+ G+
Sbjct: 1026 HGDRDDWVRVYNMHGALLGEMRFNFEFGDGFAVADFNGDGKDDIIHGDRGDSFHVVKGE 1084
>gi|113474508|ref|YP_720569.1| Na-Ca exchanger/integrin-beta4 [Trichodesmium erythraeum IMS101]
gi|110165556|gb|ABG50096.1| Na-Ca exchanger/integrin-beta4 [Trichodesmium erythraeum IMS101]
Length = 1113
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 77/177 (43%), Gaps = 21/177 (11%)
Query: 505 LEMAEIQGAVVAADINDDGKIELVTTD-THGNVAAWTAEGKGIWEQHLKSLVTQGP---S 560
++ +I +V D N DG +L + + NV+ + +G G + V GP +
Sbjct: 208 FKVGDIARSVAVGDFNGDGIPDLAVANFSSDNVSVLSGKGDGSFGSATNFPVGDGPISVA 267
Query: 561 IGDVDGDGHSDVVVPTL-SGNIYVLSGK-DGS-------KVRPYPYRTHGRVMNQVLLVD 611
+GD +GDG D+ V L S N+ VLSGK DGS V P N + D
Sbjct: 268 VGDFNGDGIPDLAVANLWSDNVSVLSGKGDGSFGSATNFPVGDGPISVAVGDFNGDGIPD 327
Query: 612 LTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLAD-NVDGGDDL 667
L + S +++++ DG A +G+ +S+ + D N DG DL
Sbjct: 328 LAVANDGSDNVSVLSGKGDGSF-------GSATSFPVGDGPFSVAVGDFNGDGIPDL 377
>gi|406909050|gb|EKD49391.1| Cadherin [uncultured bacterium]
Length = 222
Score = 42.0 bits (97), Expect = 1.6, Method: Composition-based stats.
Identities = 40/134 (29%), Positives = 62/134 (46%), Gaps = 13/134 (9%)
Query: 472 DLDGDGNLDILVGTSFG--LFYVLDHHG-KIREKF---PLEMAEIQGAVVAADINDDGKI 525
D+DGDG +I + + G + D G K ++K+ + + ++ DI+ DGK
Sbjct: 13 DVDGDGKDEIGIARASGKSTVRIFDFDGGKAKKKYEWNAFGKSHVGASIAFGDIDKDGKA 72
Query: 526 ELVTTDTHGN---VAAWTAEG--KGIWEQHLKSLVTQGPSI--GDVDGDGHSDVVVPTLS 578
E++ G V + A G K + ++ T G + D DGDG DVVV LS
Sbjct: 73 EIIAGQGKGGASLVRTFEASGSPKSLKFTAFEAAYTGGIDVDSADFDGDGKDDVVVSKLS 132
Query: 579 GNIYVLSGKDGSKV 592
G+ V + KD V
Sbjct: 133 GSSDVKTYKDSGTV 146
>gi|167538463|ref|XP_001750895.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770579|gb|EDQ84265.1| predicted protein [Monosiga brevicollis MX1]
Length = 572
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 61/124 (49%), Gaps = 10/124 (8%)
Query: 459 SFRAYIYSSPTVVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFPLEMAEIQGA-- 513
S A S DLD DGNLD+L + +Y + +G ++ + M ++ GA
Sbjct: 225 SMAALSAESVYAADLDNDGNLDVLSASQQQKKIAWYRNNGNGTFSDENIISM-QVAGARF 283
Query: 514 VVAADINDDGKIELVTTDTHGNVAAW-TAEGKGIWEQHLKSLVTQGPS---IGDVDGDGH 569
V AAD+++D ++++ N +W + G + L + +GP+ + D+D DGH
Sbjct: 284 VYAADLDNDNFVDVLCAGISQNEISWYRNDHNGTFTPVLLTTAVKGPTSVYVADLDNDGH 343
Query: 570 SDVV 573
DV+
Sbjct: 344 GDVL 347
>gi|417550405|ref|ZP_12201484.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-18]
gi|400386230|gb|EJP49304.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-18]
Length = 1284
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|421663008|ref|ZP_16103162.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC110]
gi|408714036|gb|EKL59191.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC110]
Length = 1284
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|445442403|ref|ZP_21442350.1| PilC beta-propeller domain protein [Acinetobacter baumannii
WC-A-92]
gi|444763923|gb|ELW88257.1| PilC beta-propeller domain protein [Acinetobacter baumannii
WC-A-92]
Length = 1284
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|421693620|ref|ZP_16133253.1| PilC beta-propeller domain protein [Acinetobacter baumannii WC-692]
gi|404570257|gb|EKA75334.1| PilC beta-propeller domain protein [Acinetobacter baumannii WC-692]
Length = 1284
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|417543868|ref|ZP_12194954.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC032]
gi|417882864|ref|ZP_12527139.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ABNIH4]
gi|421668184|ref|ZP_16108224.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC087]
gi|421669097|ref|ZP_16109125.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC099]
gi|421685832|ref|ZP_16125598.1| PilC beta-propeller domain protein [Acinetobacter baumannii IS-143]
gi|445478634|ref|ZP_21454757.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-78]
gi|342236935|gb|EGU01433.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ABNIH4]
gi|400381756|gb|EJP40434.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC032]
gi|404570859|gb|EKA75931.1| PilC beta-propeller domain protein [Acinetobacter baumannii IS-143]
gi|410380622|gb|EKP33202.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC087]
gi|410389224|gb|EKP41639.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC099]
gi|444774707|gb|ELW98783.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-78]
Length = 1284
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|445460846|ref|ZP_21448445.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC047]
gi|444772310|gb|ELW96428.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC047]
Length = 1284
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|424061940|ref|ZP_17799427.1| hypothetical protein W9M_02141 [Acinetobacter baumannii Ab44444]
gi|404674352|gb|EKB42100.1| hypothetical protein W9M_02141 [Acinetobacter baumannii Ab44444]
Length = 1288
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 852 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 910
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 911 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 949
>gi|417566295|ref|ZP_12217169.1| hypothetical protein ACIN5143_A3359 [Acinetobacter baumannii
OIFC143]
gi|395558051|gb|EJG24052.1| hypothetical protein ACIN5143_A3359 [Acinetobacter baumannii
OIFC143]
Length = 1284
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|284045245|ref|YP_003395585.1| FG-GAP repeat-containing protein [Conexibacter woesei DSM 14684]
gi|283949466|gb|ADB52210.1| FG-GAP repeat protein [Conexibacter woesei DSM 14684]
Length = 616
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 58/133 (43%), Gaps = 16/133 (12%)
Query: 457 NASFRAYIYSSPTVV--------DLDGDGNLDILVGTSFGL-FYVLDHHGKIREKFPLEM 507
+ S RA I S P V D DGDGN+D + G +FG V+ G F
Sbjct: 87 DGSLRAAI-SQPIVSGGSAVATGDFDGDGNVDAISGGTFGFSLDVMLGRGDTTFAFSSYN 145
Query: 508 AEIQG-AVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSI----- 561
+ +Q A+ AAD + DGK++ +A + +G G + + V G S+
Sbjct: 146 SGLQTEALAAADFDTDGKLDFAVGSGGNEIATFRGDGAGGFSAGSVTEVDPGASVIGLAA 205
Query: 562 GDVDGDGHSDVVV 574
D D DG D+ V
Sbjct: 206 ADFDRDGDPDLAV 218
>gi|425748473|ref|ZP_18866460.1| von Willebrand factor type A domain protein [Acinetobacter
baumannii WC-348]
gi|193078544|gb|ABO13564.2| possible pilus assembly protein tip-associated adhesin PilY1
[Acinetobacter baumannii ATCC 17978]
gi|425491354|gb|EKU57639.1| von Willebrand factor type A domain protein [Acinetobacter
baumannii WC-348]
Length = 1284
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|260556942|ref|ZP_05829159.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ATCC 19606 = CIP 70.34]
gi|260409548|gb|EEX02849.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ATCC 19606 = CIP 70.34]
gi|452949438|gb|EME54906.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii MSP4-16]
Length = 1284
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|223939996|ref|ZP_03631862.1| FG-GAP repeat protein [bacterium Ellin514]
gi|223891340|gb|EEF57835.1| FG-GAP repeat protein [bacterium Ellin514]
Length = 677
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 60/121 (49%), Gaps = 19/121 (15%)
Query: 472 DLDGDGNLDILVGTSF--GLFYVLDHHGKIREKFPLEMAEIQGA--VVAADINDDGKIEL 527
D++GDG +D L+ T + +L ++G + F + G VV ADIN DGK +L
Sbjct: 425 DINGDGKVD-LICTEYNSAALTILTNNGSGQFVFNATVPAGSGPTYVVTADINGDGKTDL 483
Query: 528 VTTDTHGN-VAAWTAEGKGIWEQ--------HLKSLVTQGPSIGDVDGDGHSDVVVPTLS 578
V+ + + V+ T G G++ HL+S+ S D++GDG D+V S
Sbjct: 484 VSANYGASTVSVLTNNGSGVFGSLATLNALAHLQSV-----SAADLNGDGKVDLVTANYS 538
Query: 579 G 579
G
Sbjct: 539 G 539
Score = 40.0 bits (92), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 52/111 (46%), Gaps = 11/111 (9%)
Query: 472 DLDGDGNLDILVGTSFGLFYVL---DHHGKIREKFPLEMAEIQGAVVAADINDDGKIELV 528
D++GDG D++ FG + + G + + +V A DIN DGK++L+
Sbjct: 376 DVNGDGKPDLITANLFGASLTILTNNGSGTFGSNATVSAGDQTYSVAAGDINGDGKVDLI 435
Query: 529 TTDTHGNVAAWTA---EGKGIWEQHLKSLVTQGPS---IGDVDGDGHSDVV 573
T+ N AA T G G + + GP+ D++GDG +D+V
Sbjct: 436 CTEY--NSAALTILTNNGSGQFVFNATVPAGSGPTYVVTADINGDGKTDLV 484
>gi|421790933|ref|ZP_16227121.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-2]
gi|410403811|gb|EKP55885.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-2]
Length = 1284
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|385239114|ref|YP_005800453.1| putative competence factor involved in DNA binding and uptake
(ComC) [Acinetobacter baumannii TCDC-AB0715]
gi|421708725|ref|ZP_16148098.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ZWS1219]
gi|345447599|gb|ADX93996.2| putative competence factor involved in DNA binding and uptake
(ComC) [Acinetobacter baumannii TCDC-AB0715]
gi|407189453|gb|EKE60679.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ZWS1219]
Length = 1257
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 821 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 879
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 880 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 918
>gi|424058469|ref|ZP_17795966.1| hypothetical protein W9K_02797 [Acinetobacter baumannii Ab33333]
gi|404665711|gb|EKB33673.1| hypothetical protein W9K_02797 [Acinetobacter baumannii Ab33333]
Length = 1288
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 852 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 910
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 911 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 949
>gi|421673557|ref|ZP_16113494.1| von Willebrand factor type A domain protein [Acinetobacter
baumannii OIFC065]
gi|421690351|ref|ZP_16130022.1| von Willebrand factor type A domain protein [Acinetobacter
baumannii IS-116]
gi|404564623|gb|EKA69802.1| von Willebrand factor type A domain protein [Acinetobacter
baumannii IS-116]
gi|410385775|gb|EKP38259.1| von Willebrand factor type A domain protein [Acinetobacter
baumannii OIFC065]
Length = 1284
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|409721642|ref|ZP_11269808.1| hypothetical protein Hham1_03485 [Halococcus hamelinensis 100A6]
gi|448723993|ref|ZP_21706507.1| hypothetical protein C447_12612 [Halococcus hamelinensis 100A6]
gi|445786799|gb|EMA37562.1| hypothetical protein C447_12612 [Halococcus hamelinensis 100A6]
Length = 416
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 57/126 (45%), Gaps = 3/126 (2%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDH-HGKIREKFPLEMAEIQGAVVAADINDDGK 524
+ P + D++ DG ++LV T+ + + G+ + L +VA + +GK
Sbjct: 105 ADPAIADVNQDGTKEVLVETTENRVHGFEGLSGQRLFSYNLTSYGYTKPIVADFVEGEGK 164
Query: 525 IELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVL 584
E++ DT+G G W QH+ V P++ D GDG +++ V N V+
Sbjct: 165 -EIIAVDTNGTAIVIHPNGTVAWRQHVSGYVNSQPALADFSGDGTTELAV-GFGANALVV 222
Query: 585 SGKDGS 590
DG+
Sbjct: 223 FDADGA 228
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 68/312 (21%), Positives = 103/312 (33%), Gaps = 92/312 (29%)
Query: 357 DAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAV------------ 404
D GD+ W K N H ++ P IAD++ DG E+++
Sbjct: 82 DGTTGDQIWNYSIPLK-----NCTIHSVADPAIADVNQDGTKEVLVETTENRVHGFEGLS 136
Query: 405 -----SYFFDHEYYDNP-------EHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLD 452
SY Y P KE+ +D AIV+ T V W +
Sbjct: 137 GQRLFSYNLTSYGYTKPIVADFVEGEGKEIIAVDTNGT---AIVIHPNGT--VAWRQHV- 190
Query: 453 LSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG 512
Y+ S P + D GDG ++ VG V D G + +
Sbjct: 191 --------SGYVNSQPALADFSGDGTTELAVGFGANALVVFDADGAVEWSRSSPLNSTIT 242
Query: 513 AVVAADINDDGKIELVTTDTHGNVAAW-TAEGKGIWEQ---------------------- 549
+ ++D E+V +T G V + AEG+ W++
Sbjct: 243 WMATGQTDEDPPTEIVVANTKGQVVVFDGAEGEIEWKRDFGDFSAVRSLEDGDRDGQPEL 302
Query: 550 -------HLKSL-------------------VTQGPSIGDVDGDGHSDVVVPTLSGNIYV 583
L+SL V PS+GDVDGDG ++V T G + V
Sbjct: 303 YVVARDGKLRSLNATTGATEWTTTLDTGDIQVVPAPSLGDVDGDGRPELVSVTNGGVVSV 362
Query: 584 LSGKDGSKVRPY 595
+ + G + Y
Sbjct: 363 IDPRTGDVLGSY 374
>gi|416146985|ref|ZP_11601532.1| Tfp pilus assembly protein, tip-associated adhesin PilY1
[Acinetobacter baumannii AB210]
gi|417571257|ref|ZP_12222114.1| hypothetical protein ACIN5189_A2435 [Acinetobacter baumannii
OIFC189]
gi|417576526|ref|ZP_12227371.1| hypothetical protein ACINNAV7_A1496 [Acinetobacter baumannii
Naval-17]
gi|421628714|ref|ZP_16069480.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC180]
gi|425753829|ref|ZP_18871696.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-113]
gi|445470663|ref|ZP_21451595.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC338]
gi|333365941|gb|EGK47955.1| Tfp pilus assembly protein, tip-associated adhesin PilY1
[Acinetobacter baumannii AB210]
gi|395551705|gb|EJG17714.1| hypothetical protein ACIN5189_A2435 [Acinetobacter baumannii
OIFC189]
gi|395569747|gb|EJG30409.1| hypothetical protein ACINNAV7_A1496 [Acinetobacter baumannii
Naval-17]
gi|408705705|gb|EKL51039.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC180]
gi|425497222|gb|EKU63328.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-113]
gi|444772617|gb|ELW96732.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC338]
Length = 1284
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|115371868|ref|ZP_01459181.1| HYR domain protein [Stigmatella aurantiaca DW4/3-1]
gi|115371103|gb|EAU70025.1| HYR domain protein [Stigmatella aurantiaca DW4/3-1]
Length = 1057
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 58/200 (29%), Positives = 83/200 (41%), Gaps = 23/200 (11%)
Query: 390 ADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKY---------VAGAIVVFNL 440
AD+D DG E+I S + + P G + + V+G V L
Sbjct: 352 ADLDGDGKQEVINGRSIYKHNGTPYCPTAPIPHGFAAVANFDEDSKAEIVVSGYGQVSLL 411
Query: 441 D-TKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDI-LVGTSFGLFYVLDHHGK 498
D QV WT + SP + D D DG +I L GT+ + VLD G
Sbjct: 412 DDNCQVLWTVPVPGGG--------HGGSPNIGDFDNDGVPEIGLPGTN--AYSVLDSDGT 461
Query: 499 IREKFPL-EMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWE-QHLKSLVT 556
+ K P E++ + A D DDGK+E++ D A G+ +E H S
Sbjct: 462 LLWKRPTQELSSGKTGSTAFDFEDDGKLEIIYADEVRLRIYDGATGQVRFETAHSSSTTH 521
Query: 557 QGPSIGDVDGDGHSDVVVPT 576
+ P I DVD D +++VV T
Sbjct: 522 ENPVIADVDSDFAAELVVAT 541
>gi|421624264|ref|ZP_16065137.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC098]
gi|408701832|gb|EKL47254.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC098]
Length = 1284
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|417553911|ref|ZP_12204980.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-81]
gi|417560173|ref|ZP_12211052.1| hypothetical protein ACIN3137_A2434 [Acinetobacter baumannii
OIFC137]
gi|421199681|ref|ZP_15656842.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC109]
gi|421455180|ref|ZP_15904524.1| PilC beta-propeller domain protein [Acinetobacter baumannii IS-123]
gi|421635235|ref|ZP_16075838.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-13]
gi|421803836|ref|ZP_16239748.1| PilC beta-propeller domain protein [Acinetobacter baumannii
WC-A-694]
gi|395522755|gb|EJG10844.1| hypothetical protein ACIN3137_A2434 [Acinetobacter baumannii
OIFC137]
gi|395564678|gb|EJG26329.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC109]
gi|400211418|gb|EJO42380.1| PilC beta-propeller domain protein [Acinetobacter baumannii IS-123]
gi|400390328|gb|EJP57375.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-81]
gi|408702787|gb|EKL48195.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-13]
gi|410412302|gb|EKP64161.1| PilC beta-propeller domain protein [Acinetobacter baumannii
WC-A-694]
Length = 1284
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|169794508|ref|YP_001712301.1| competence factor involved in DNA binding and uptake (ComC)
[Acinetobacter baumannii AYE]
gi|332851328|ref|ZP_08433380.1| hypothetical protein HMPREF0021_00951 [Acinetobacter baumannii
6013150]
gi|332868739|ref|ZP_08438362.1| hypothetical protein HMPREF0020_01999 [Acinetobacter baumannii
6013113]
gi|169147435|emb|CAM85296.1| putative competence factor involved in DNA binding and uptake
(ComC) [Acinetobacter baumannii AYE]
gi|332730044|gb|EGJ61372.1| hypothetical protein HMPREF0021_00951 [Acinetobacter baumannii
6013150]
gi|332733168|gb|EGJ64365.1| hypothetical protein HMPREF0020_01999 [Acinetobacter baumannii
6013113]
Length = 1288
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 852 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 910
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 911 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 949
>gi|445490019|ref|ZP_21459027.1| PilC beta-propeller domain protein [Acinetobacter baumannii AA-014]
gi|444766461|gb|ELW90736.1| PilC beta-propeller domain protein [Acinetobacter baumannii AA-014]
Length = 1284
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|332873801|ref|ZP_08441743.1| hypothetical protein HMPREF0022_01353 [Acinetobacter baumannii
6014059]
gi|384144797|ref|YP_005527507.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii MDR-ZJ06]
gi|387122399|ref|YP_006288281.1| hypothetical protein ABTJ_00326 [Acinetobacter baumannii MDR-TJ]
gi|407934268|ref|YP_006849911.1| DNA binding and uptake competence factor ComC [Acinetobacter
baumannii TYTH-1]
gi|417875199|ref|ZP_12520020.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ABNIH2]
gi|417880196|ref|ZP_12524731.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ABNIH3]
gi|421202111|ref|ZP_15659263.1| pilus assembly protein tip-associated adhesin 1 [Acinetobacter
baumannii AC12]
gi|421535108|ref|ZP_15981372.1| protein PilY1 [Acinetobacter baumannii AC30]
gi|421704946|ref|ZP_16144387.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ZWS1122]
gi|332738024|gb|EGJ68909.1| hypothetical protein HMPREF0022_01353 [Acinetobacter baumannii
6014059]
gi|342225733|gb|EGT90718.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ABNIH3]
gi|342226728|gb|EGT91690.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ABNIH2]
gi|347595290|gb|AEP08011.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii MDR-ZJ06]
gi|385876891|gb|AFI93986.1| hypothetical protein ABTJ_00326 [Acinetobacter baumannii MDR-TJ]
gi|398328417|gb|EJN44543.1| pilus assembly protein tip-associated adhesin 1 [Acinetobacter
baumannii AC12]
gi|407189039|gb|EKE60267.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ZWS1122]
gi|407902849|gb|AFU39680.1| putative competence factor involved in DNA binding and uptake
(ComC) [Acinetobacter baumannii TYTH-1]
gi|409986988|gb|EKO43177.1| protein PilY1 [Acinetobacter baumannii AC30]
Length = 1288
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 852 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 910
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 911 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 949
>gi|301344871|ref|ZP_07225612.1| putative competence factor involved in DNA binding and uptake
(ComC) [Acinetobacter baumannii AB056]
Length = 1257
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 821 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 879
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 880 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 918
>gi|213158920|ref|YP_002320918.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii AB0057]
gi|215482096|ref|YP_002324278.1| hypothetical protein ABBFA_000345 [Acinetobacter baumannii
AB307-0294]
gi|417573561|ref|ZP_12224415.1| PilC beta-propeller domain protein [Acinetobacter baumannii Canada
BC-5]
gi|421620810|ref|ZP_16061738.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC074]
gi|421641630|ref|ZP_16082161.1| PilC beta-propeller domain protein [Acinetobacter baumannii IS-235]
gi|421647972|ref|ZP_16088383.1| PilC beta-propeller domain protein [Acinetobacter baumannii IS-251]
gi|421659711|ref|ZP_16099927.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-83]
gi|421698443|ref|ZP_16137985.1| PilC beta-propeller domain protein [Acinetobacter baumannii IS-58]
gi|421796451|ref|ZP_16232514.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-21]
gi|421800251|ref|ZP_16236230.1| PilC beta-propeller domain protein [Acinetobacter baumannii Canada
BC1]
gi|213058080|gb|ACJ42982.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii AB0057]
gi|213986455|gb|ACJ56754.1| hypothetical protein ABBFA_000345 [Acinetobacter baumannii
AB307-0294]
gi|400209129|gb|EJO40099.1| PilC beta-propeller domain protein [Acinetobacter baumannii Canada
BC-5]
gi|404572743|gb|EKA77785.1| PilC beta-propeller domain protein [Acinetobacter baumannii IS-58]
gi|408514382|gb|EKK15988.1| PilC beta-propeller domain protein [Acinetobacter baumannii IS-235]
gi|408516166|gb|EKK17745.1| PilC beta-propeller domain protein [Acinetobacter baumannii IS-251]
gi|408699670|gb|EKL45145.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC074]
gi|408707044|gb|EKL52338.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-83]
gi|410399105|gb|EKP51303.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-21]
gi|410408459|gb|EKP60427.1| PilC beta-propeller domain protein [Acinetobacter baumannii Canada
BC1]
Length = 1284
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|421788977|ref|ZP_16225245.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-82]
gi|410399863|gb|EKP52044.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-82]
Length = 1284
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|421654613|ref|ZP_16094940.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-72]
gi|408510384|gb|EKK12046.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-72]
Length = 1281
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 845 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 903
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 904 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 942
>gi|417872092|ref|ZP_12517003.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ABNIH1]
gi|342223943|gb|EGT89021.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ABNIH1]
Length = 1257
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 821 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 879
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 880 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 918
>gi|184159684|ref|YP_001848023.1| Tfp pilus assembly protein, tip-associated adhesin PilY1
[Acinetobacter baumannii ACICU]
gi|183211278|gb|ACC58676.1| Tfp pilus assembly protein, tip-associated adhesin PilY1
[Acinetobacter baumannii ACICU]
Length = 1257
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 821 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 879
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 880 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 918
>gi|421807354|ref|ZP_16243215.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC035]
gi|410416996|gb|EKP68767.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC035]
Length = 1283
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 847 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 905
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 906 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 944
>gi|421609575|ref|ZP_16050765.1| FG-GAP repeat-containing protein [Rhodopirellula baltica SH28]
gi|408499671|gb|EKK04140.1| FG-GAP repeat-containing protein [Rhodopirellula baltica SH28]
Length = 404
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 55/209 (26%), Positives = 86/209 (41%), Gaps = 41/209 (19%)
Query: 470 VVDLDGDGNLDILVGTS---------FGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIN 520
V D++GDG DILVG +G ++ K+ P+ + DI+
Sbjct: 215 VGDINGDGRNDILVGWGWYEQPAENPWGQPWIAHRDWKLHASLPM---------LIEDID 265
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGI-------WEQH-LKSLVTQGPSI--GDVDGDGHS 570
DG +L+ + H W E G+ W++H + +Q S+ D+DGDG+
Sbjct: 266 QDGDQDLIYGEGHNYGLQWW-ENTGVNDSGSIEWKEHEIDRRFSQPHSMVWVDLDGDGNK 324
Query: 571 DVVVPTLSGNIYVLSGKD-GSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSF 629
D+V Y +G D G K P Y V ++ + G GL IV F
Sbjct: 325 DLVTGK---RYYAHNGNDPGGKEVPCLYWYSFDVASKSFQRHVIDEGRVGTGLQIVAEDF 381
Query: 630 DGYLYLIDGPTSCADVVDIGETSYSMVLA 658
+G DG T D+ G++ ++LA
Sbjct: 382 NG-----DGKT---DLAVAGKSGTYLLLA 402
>gi|384133375|ref|YP_005515987.1| protein PilY1 [Acinetobacter baumannii 1656-2]
gi|322509595|gb|ADX05049.1| PilY1 [Acinetobacter baumannii 1656-2]
Length = 1288
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 852 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 910
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 911 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 949
>gi|301596519|ref|ZP_07241527.1| hypothetical protein AbauAB059_11912 [Acinetobacter baumannii
AB059]
Length = 661
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 225 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 283
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 284 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 322
>gi|445410695|ref|ZP_21433011.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-57]
gi|444779868|gb|ELX03841.1| PilC beta-propeller domain protein [Acinetobacter baumannii
Naval-57]
Length = 1281
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 845 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 903
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 904 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 942
>gi|403674480|ref|ZP_10936735.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
sp. NCTC 10304]
gi|421649961|ref|ZP_16090343.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC0162]
gi|408512360|gb|EKK14005.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC0162]
Length = 1284
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 945
>gi|357401753|ref|YP_004913678.1| hypothetical protein SCAT_4179 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|386357814|ref|YP_006056060.1| FG-GAP repeat domain-containing protein [Streptomyces cattleya NRRL
8057 = DSM 46488]
gi|337768162|emb|CCB76875.1| FG-GAP repeat domain-containing protein [Streptomyces cattleya NRRL
8057 = DSM 46488]
gi|365808322|gb|AEW96538.1| FG-GAP repeat domain-containing protein [Streptomyces cattleya NRRL
8057 = DSM 46488]
Length = 609
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 63/145 (43%), Gaps = 24/145 (16%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG-------AVVAADINDDGK 524
DL GD D++ G S G +V + GK F G V D+ DG+
Sbjct: 459 DLTGDALPDLVAGDSSGNVWV--YPGKGNGTFGARTKAGYGWNVYGANVVGRGDLTRDGR 516
Query: 525 IELVTTDTHGNV----------AAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVV 574
+L+ D GN+ AA+ A K + H + V ++GD+ GDG +D+V
Sbjct: 517 PDLLAQDASGNLWLHPGTGTTPAAFGARIKAGYGYHYTAYV----AVGDLTGDGKADLVT 572
Query: 575 PTLSGNIYVLSGKDGSKVRPYPYRT 599
SGN+++ G GS P RT
Sbjct: 573 RDPSGNLWLYRGT-GSAKAPLAART 596
>gi|182412311|ref|YP_001817377.1| hypothetical protein Oter_0487 [Opitutus terrae PB90-1]
gi|177839525|gb|ACB73777.1| hypothetical protein Oter_0487 [Opitutus terrae PB90-1]
Length = 438
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 69/154 (44%), Gaps = 27/154 (17%)
Query: 444 QVKWTTDLDLSTDNASFRAYIYSSPTVV---DLDGDGNLDILV---GTSF------GLFY 491
QV W A S+P V D+DGDG+LD++V G F G +
Sbjct: 92 QVLWIRQSPRGVFTEKLLASDMSAPVHVESADMDGDGDLDVIVSSMGVVFPDNDRIGTVF 151
Query: 492 VLDHHGKIREKF-PLEMAEIQGAVV---AADINDDGKIELVTTD---THGNVAAWTAEGK 544
VL++ G R+ F P + E VV AAD+N DGK++LV G + +W E
Sbjct: 152 VLENDG--RQNFTPHAVLEHTARVVDARAADLNRDGKMDLVLGQFGYDQGEI-SWL-ERT 207
Query: 545 GIWEQHLKSLVTQGPSIG----DVDGDGHSDVVV 574
G WE L+ +I D + DGH D+V
Sbjct: 208 GPWEFQRHVLLDLSGAINVCVDDYNADGHLDIVA 241
>gi|126643182|ref|YP_001086166.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii ATCC 17978]
Length = 1263
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 827 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 885
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDNA+ + I S + +D D DG +D L
Sbjct: 886 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHL 924
>gi|418392479|ref|ZP_12968250.1| FG-GAP/YD repeat-containing protein, partial [Burkholderia
pseudomallei 354a]
gi|385375324|gb|EIF80106.1| FG-GAP/YD repeat-containing protein, partial [Burkholderia
pseudomallei 354a]
Length = 1009
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|218962046|ref|YP_001741821.1| hypothetical protein; putative signal peptide [Candidatus
Cloacamonas acidaminovorans]
gi|167730703|emb|CAO81615.1| hypothetical protein; putative signal peptide [Candidatus
Cloacamonas acidaminovorans str. Evry]
Length = 735
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 81/361 (22%), Positives = 130/361 (36%), Gaps = 102/361 (28%)
Query: 387 PVIADIDNDGVSEMIIAVSYFFDHEYYDN---------PEHLKELGGIDIGKYVAGAIVV 437
P +AD+DNDG ++++ +S Y N E+ +L G DIG Y
Sbjct: 149 PDVADVDNDGDLDLVVGLSEDGSVIIYFNTGSAAAGNFSENNCQLLG-DIGLYAYPVFCD 207
Query: 438 FNLDTKQ----------------VKWTTDLDLSTDNASFRAY----IYSSPTVVDLDGDG 477
F+ D KQ + T+ +N F ++SP +VDL+GDG
Sbjct: 208 FDSDGKQDILCGRDAFGFVYYQNIGTATNPVWEENNTLFAGLGMSTYWNSPDLVDLNGDG 267
Query: 478 NLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVV--------------------AA 517
D++ GT+ G H+G + + G V+
Sbjct: 268 LYDLVYGTAAGPLQYYVHNGTAENPSWQQNTSLFGGVLDVGGASNPVFYDFDGDGDLDLI 327
Query: 518 DINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTL 577
N G ++ AW + + ++ + ++GDVD DG DV++ L
Sbjct: 328 SGNQLGYVKFYRNTGTAYAPAWQEDNS--YFANIHHSIYSAVTVGDVDADGLPDVILGDL 385
Query: 578 SGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLID 637
+G +Y Y G T E+S L V+
Sbjct: 386 NGGLYF-------------YHNTG-----------TGLIEQSGVLPAVSV---------- 411
Query: 638 GPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCF-STPAPHHPLKAWRSI 696
G SC ++D+ + DG DLDL+V GN+F + + P+ PL W +
Sbjct: 412 GGWSCPRLIDM-----------DFDG--DLDLVVGNEAGNLFYYQNNGTPYSPL--WELV 456
Query: 697 N 697
N
Sbjct: 457 N 457
>gi|167718061|ref|ZP_02401297.1| Rhs family protein [Burkholderia pseudomallei DM98]
Length = 1057
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|440714890|ref|ZP_20895459.1| hypothetical protein RBSWK_02516 [Rhodopirellula baltica SWK14]
gi|436440262|gb|ELP33614.1| hypothetical protein RBSWK_02516 [Rhodopirellula baltica SWK14]
Length = 394
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 55/209 (26%), Positives = 86/209 (41%), Gaps = 41/209 (19%)
Query: 470 VVDLDGDGNLDILVGTS---------FGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIN 520
V D++GDG DILVG +G ++ K+ P+ + DI+
Sbjct: 205 VGDINGDGRNDILVGWGWYEQPAENPWGQPWIAHRDWKLHASLPM---------LIEDID 255
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGI-------WEQH-LKSLVTQGPSI--GDVDGDGHS 570
DG +L+ + H W E G+ W++H + +Q S+ D+DGDG+
Sbjct: 256 QDGDQDLIYGEGHNYGLQWW-ENTGVNDSGSIEWKEHEIDRRFSQPHSMVWVDLDGDGNK 314
Query: 571 DVVVPTLSGNIYVLSGKD-GSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSF 629
D+V Y +G D G K P Y V ++ + G GL IV F
Sbjct: 315 DLVTGK---RYYAHNGNDPGGKEVPCLYWYSFDVASKSFQRHVVDEGRVGTGLQIVAEDF 371
Query: 630 DGYLYLIDGPTSCADVVDIGETSYSMVLA 658
+G DG T D+ G++ ++LA
Sbjct: 372 NG-----DGKT---DLAVAGKSGTYLLLA 392
>gi|78189520|ref|YP_379858.1| hypothetical protein Cag_1560 [Chlorobium chlorochromatii CaD3]
gi|78171719|gb|ABB28815.1| VCBS [Chlorobium chlorochromatii CaD3]
Length = 1838
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 65/239 (27%), Positives = 106/239 (44%), Gaps = 34/239 (14%)
Query: 391 DIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIG-KYVAGAIVVFNLDTKQVKWTT 449
DIDNDG+ ++I+ +D P D+G + AI VF + + +
Sbjct: 1029 DIDNDGMPDLIVGT------NGHDMPW--------DMGLAQICNAISVFK-NNGDGSFAS 1073
Query: 450 DLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSF--GLFYVLDHH--GKIREKFPL 505
+D + +NA F S D++GDG D L+G ++ G +L ++ G K
Sbjct: 1074 KVDYAIENAFF------SVASADVNGDGQTD-LIGANWTTGGLSILQNNGDGSFASKVDY 1126
Query: 506 EMAEIQGAVVAADINDDGKIELVTTDTHGN-VAAWTAEGKGIWEQHLKSLVTQGPSI--- 561
+ +V + D+N DGK ++ ++ N V+ G G + + P I
Sbjct: 1127 AIPNSFYSVASTDLNSDGKPDIFGSNLAVNGVSILQNNGDGTFASKVDYATGSNPWIVNS 1186
Query: 562 GDVDGDGHSDV-VVPTLSGNIYVLSGK-DGSKVRPYPYRTHGRVMNQVLLVDLTKRGEK 618
D++GDG SD+ VV T S + VL K DG+ + Y T G + + DL K G+
Sbjct: 1187 CDINGDGFSDISVVNTGSNTVSVLINKGDGTFLDKKDYST-GNMPFGLSSADLNKDGKS 1244
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 96/220 (43%), Gaps = 26/220 (11%)
Query: 469 TVVDLDGDGNLDILVGT-SFGLFYVLDHHGK--IREKFPLEMAEIQGAVVAADINDDGKI 525
T D++GDG LD++V VL ++G K +V ++D+N DGK+
Sbjct: 322 TSADVNGDGELDLIVANFQSDTVSVLKNNGDGIFATKVDYPTGSCPQSVTSSDVNGDGKL 381
Query: 526 ELVTTDTHGN-VAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTL-SGN 580
+L+ T+ + V+ G+G + + P + DV+GDG D++V S
Sbjct: 382 DLIATNWGSDTVSVLENNGEGTFATKVDYATGYSPWPVTSADVNGDGKFDLIVANFYSNT 441
Query: 581 IYVLSGK-DGSKVRPYPYRTH-------GRVMNQVLLVDLTKRGEKSKGLTIVTTSFDG- 631
+ VL DG+ V Y T +N +DL S ++++ + DG
Sbjct: 442 VSVLKNNGDGTFVTKVDYPTGLSPLSVTSADVNGDSELDLIVANMYSDTISVLNNNGDGT 501
Query: 632 YLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIV 671
+ +D PT G YS+ +D V+G LDLIV
Sbjct: 502 FATQVDYPT--------GSFPYSVTSSD-VNGDGKLDLIV 532
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 61/223 (27%), Positives = 99/223 (44%), Gaps = 28/223 (12%)
Query: 467 SPTVVDLDGDGNLDILVGTSFG--LFYVLDHHGK--IREKFPLEMAEIQGAVVAADINDD 522
S T D++GDG LD L+ T++G VL+++G+ K V +AD+N D
Sbjct: 369 SVTSSDVNGDGKLD-LIATNWGSDTVSVLENNGEGTFATKVDYATGYSPWPVTSADVNGD 427
Query: 523 GKIELVTTDTHGN-VAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTL- 577
GK +L+ + + N V+ G G + + P + DV+GD D++V +
Sbjct: 428 GKFDLIVANFYSNTVSVLKNNGDGTFVTKVDYPTGLSPLSVTSADVNGDSELDLIVANMY 487
Query: 578 SGNIYVLSGK-DGS-------KVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSF 629
S I VL+ DG+ +PY +N +DL S ++++ +
Sbjct: 488 SDTISVLNNNGDGTFATQVDYPTGSFPYSVTSSDVNGDGKLDLIVVNYYSNTVSVLNNNG 547
Query: 630 DG-YLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIV 671
DG + +D PT T S V + +V+G LDLIV
Sbjct: 548 DGTFATQVDYPTG---------TWPSSVTSADVNGDGKLDLIV 581
>gi|373456335|ref|ZP_09548102.1| integral membrane sensor signal transduction histidine kinase
[Caldithrix abyssi DSM 13497]
gi|371717999|gb|EHO39770.1| integral membrane sensor signal transduction histidine kinase
[Caldithrix abyssi DSM 13497]
Length = 841
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 88/205 (42%), Gaps = 30/205 (14%)
Query: 391 DIDNDGVSEMIIAVSYFFDHEYYDNPEHL---KELGGIDIGKYVAGAIVVFNLDTKQVKW 447
D D DG SE+I A+ + N H+ K G I I ++V F D ++ +W
Sbjct: 57 DFDKDGYSEVIQAL-----YNPQINHSHIIVIKHTGAI-INEWVITGFCSFERDLRK-QW 109
Query: 448 TTDLDLSTDNA-SFRAYIYSSP----TVVDLDGDGNL--DILVGTSFGLFYVLDHHGKIR 500
D++ DN +I + TV+D+ N ++ SF L L G
Sbjct: 110 IDFGDVNRDNFDELILFIQQNDSLFLTVIDVKNSENNHGKFIIQRSFLLKRSLQSIG--- 166
Query: 501 EKFPLEMAEIQGAVVAADINDDGKIELVTTDTHG------NVAAWTAEGKGIWEQHLKSL 554
P ++ AVV D+N+DG++ELV T G + + + K I Q+ +
Sbjct: 167 --LPWDIYYNHHAVV--DLNNDGQVELVFTLVSGYSLSPRGIYVYDFKQKRIVHQYETNA 222
Query: 555 VTQGPSIGDVDGDGHSDVVVPTLSG 579
G D+DGDG +++ T +G
Sbjct: 223 ALMGMDFFDLDGDGKKEIIASTYAG 247
>gi|424050774|ref|ZP_17788310.1| hypothetical protein W9G_02666 [Acinetobacter baumannii Ab11111]
gi|404669527|gb|EKB37420.1| hypothetical protein W9G_02666 [Acinetobacter baumannii Ab11111]
Length = 845
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 54/111 (48%), Gaps = 4/111 (3%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 409 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 467
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG--LFYV 492
+ WT+D +TDNA+ + I S + +D D DG +D L G +F+V
Sbjct: 468 RLWWTSDTGSNTDNANMKHSIVSRISTLDRDADGLVDHLYFGDLGGQIFHV 518
>gi|403738960|ref|ZP_10951561.1| hypothetical protein AUCHE_08_05780 [Austwickia chelonae NBRC
105200]
gi|403191610|dbj|GAB78331.1| hypothetical protein AUCHE_08_05780 [Austwickia chelonae NBRC
105200]
Length = 614
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 69/165 (41%), Gaps = 15/165 (9%)
Query: 436 VVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDH 495
+ F + WT + A FR + SP D +GDG D++ + VL
Sbjct: 436 LFFYIGDGNGGWTGQRQIGHGWAGFRHLV--SPG--DWNGDGVPDLVAVHAQTGALVLST 491
Query: 496 HGKIREKFPLEMAE----IQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHL 551
P+ + ++G + D + DGK++L+ T G + ++ +G+G +
Sbjct: 492 GAGAGLSNPVPIGSGWQLMEGLIAPGDFDGDGKVDLIGRTTAGGLFLYSGDGQGGFRAQR 551
Query: 552 KSLVTQGP-----SIGDVDGDGHSDVVVPTLSG--NIYVLSGKDG 589
+ G SIGD DGDG DV+ SG +Y GK G
Sbjct: 552 QIGWGWGDMSAVWSIGDADGDGRPDVLALDRSGALRLYPGDGKGG 596
>gi|167814225|ref|ZP_02445905.1| hypothetical protein Bpse9_03723 [Burkholderia pseudomallei 91]
Length = 1347
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|365121915|ref|ZP_09338825.1| hypothetical protein HMPREF1033_02171 [Tannerella sp. 6_1_58FAA_CT1]
gi|363643628|gb|EHL82935.1| hypothetical protein HMPREF1033_02171 [Tannerella sp. 6_1_58FAA_CT1]
Length = 1397
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 72/161 (44%), Gaps = 39/161 (24%)
Query: 455 TDNASFRAYIYSSPTVVDLDGDGNLDILVGT------SFGLFY----VLDHHGK------ 498
TD R ++ +V D D DG LDI+V + G+ VL HH +
Sbjct: 959 TDGGRSRQWV----SVGDYDKDGYLDIVVTGRDDYEDADGMLAEDGNVLVHHDRRAVYLY 1014
Query: 499 --------IREKFPLE-----MAEIQGAVVAADINDDGKIELVTTD---THGNVAAWTAE 542
IR++ PL + +G+V AD+++DG +++V++ GN+ +
Sbjct: 1015 KNDKGKRFIRQETPLNGTDPFLGLARGSVYFADMDNDGWLDIVSSGYGPNEGNLHVYWNN 1074
Query: 543 GKGIWE---QHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGN 580
G G + QH ++GD++ DG+ D+VV S N
Sbjct: 1075 GDGTFSETGQHFYGSYDSSCALGDLNADGYMDIVVTGYSRN 1115
>gi|17227763|ref|NP_484311.1| hypothetical protein alr0267 [Nostoc sp. PCC 7120]
gi|17135245|dbj|BAB77791.1| alr0267 [Nostoc sp. PCC 7120]
Length = 408
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 61/141 (43%), Gaps = 6/141 (4%)
Query: 467 SPTVVDLDGDGNLDILV-GTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKI 525
+ T+VD DG+G D+ T+ G V G K+ L+ + D N D
Sbjct: 216 TATIVDFDGNGKSDVFWRDTTTGANSVWFMDGIQATKYDLQAQDASWTYSLGDFNGDFTT 275
Query: 526 ELVTTDT-HGNVAAWTAEGKGIWEQHLKSLVTQG-PSIGDVDGDGHSDVV---VPTLSGN 580
+L+ +T G WT G + E L +L + SIGD +GDG +D+ T
Sbjct: 276 DLLWRNTVTGENKIWTMNGIFVTEGTLNTLGSDWTASIGDFNGDGRTDIFWNNTTTGENT 335
Query: 581 IYVLSGKDGSKVRPYPYRTHG 601
++++G + P RT G
Sbjct: 336 AWLMNGTSVTSEAFLPSRTPG 356
>gi|451981012|ref|ZP_21929392.1| putative ASPIC/UnbV domain protein [Nitrospina gracilis 3/211]
gi|451761775|emb|CCQ90639.1| putative ASPIC/UnbV domain protein [Nitrospina gracilis 3/211]
Length = 1032
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 54/111 (48%), Gaps = 10/111 (9%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG----AVVAADINDDGKIEL 527
DL+ DG +D++ T G + G+ EK + + EI G +V D+++DG +++
Sbjct: 470 DLNRDGWVDVVAATDNGFEAYQNVEGRRFEKLDVPLGEIAGKEAISVAVVDLDNDGWLDV 529
Query: 528 VTTDTHGNVAAWTAEGK-GIWEQHLK-----SLVTQGPSIGDVDGDGHSDV 572
T H W GK G Q ++ +++T + DV+GDG D+
Sbjct: 530 FVTTFHEGNYLWHNPGKSGKTPQVVRVPNGDTVLTSALAFADVNGDGFLDI 580
>gi|167909509|ref|ZP_02496600.1| hypothetical protein Bpse112_03384 [Burkholderia pseudomallei 112]
Length = 1632
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|148676091|gb|EDL08038.1| integrin alpha 8, isoform CRA_b [Mus musculus]
Length = 1043
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 70/153 (45%), Gaps = 19/153 (12%)
Query: 347 YNYDYDDYVDDAMWGDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSY 406
Y+ ++ D+ GD E + I + + S+ T V++D++NDG+ ++++
Sbjct: 268 YSVAAGEFTGDSQQGDYELGFLNTKSI-IVLQMASYFGYTVVVSDVNNDGMDDILVGAPL 326
Query: 407 FFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYS 466
F + E+ NP +E+G + + Y+ + ++F D + T +F + S
Sbjct: 327 FMEREFESNP---REVGQVYL--YLQASALLFQ----------DPQVLTGTETFGRFGSS 371
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKI 499
+ DL+ DG DI +G F D GK+
Sbjct: 372 VAHLGDLNQDGYNDIAIGVPFA---GKDQRGKV 401
>gi|300114955|ref|YP_003761530.1| FG-GAP repeat-containing protein [Nitrosococcus watsonii C-113]
gi|299540892|gb|ADJ29209.1| FG-GAP repeat protein [Nitrosococcus watsonii C-113]
Length = 447
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 62/116 (53%), Gaps = 12/116 (10%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIR-----EKFPLEM-AEIQGAVVAADINDDG 523
++D+DGDG+LD++ G +V +GK + + P M + + DIN+DG
Sbjct: 237 LLDIDGDGHLDVVASYYLGP-WVWRGNGKGQWEDYSKGLPRPMIGGLFRGLAVGDINEDG 295
Query: 524 KIELVTTDTHGNVAAWTAEGKGIWEQH---LKSLV--TQGPSIGDVDGDGHSDVVV 574
+++L ++ + +G W++ + S++ +G ++GD+D DGH D+VV
Sbjct: 296 RLDLAVANSINGPEVFLQTQEGSWQRTPDVMPSMLGGAEGIALGDLDRDGHLDLVV 351
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 77/190 (40%), Gaps = 39/190 (20%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFP-LEMAEIQGAVVA-ADIN 520
S+P ++D +GDG LD+ + G ++ D G+ ++ L A+ G VA DIN
Sbjct: 55 STPVMIDSNGDGFLDLAAISRVGDGAQVWLGDGKGRWQDSSQGLGPADSCGGGVAFGDIN 114
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWE----------QHLKSLVTQGP---------SI 561
DG ++L D G + + GKG W H SL +G ++
Sbjct: 115 RDGMLDLAVADHCGGIFVYLGNGKGEWTASTERLNPALSHQISLEEEGENTLTGAEDLAL 174
Query: 562 GDVDGDGHSDVVVPTLSGNIYVLSGKDGSK-------VRPYPYRTH--------GRVMNQ 606
GD++ DG D+VV + + DGS P H G NQ
Sbjct: 175 GDINEDGFLDLVVAASDEGGFAVYLGDGSGKSWQESDAEGLPTVKHLGPGAEENGGWANQ 234
Query: 607 VLLVDLTKRG 616
V L+D+ G
Sbjct: 235 VRLLDIDGDG 244
>gi|254187764|ref|ZP_04894276.1| FG-GAP/YD repeat domain protein [Burkholderia pseudomallei Pasteur
52237]
gi|157935444|gb|EDO91114.1| FG-GAP/YD repeat domain protein [Burkholderia pseudomallei Pasteur
52237]
Length = 1374
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|403527864|ref|YP_006662751.1| FG-GAP repeat domain-containing protein [Arthrobacter sp. Rue61a]
gi|403230291|gb|AFR29713.1| FG-GAP repeat domain protein [Arthrobacter sp. Rue61a]
Length = 1027
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 56/129 (43%), Gaps = 18/129 (13%)
Query: 472 DLDGDGNLDILVGTSFGLFYVL--DHHGKI---REKFPLEMAEIQGAVVAADINDDGKIE 526
D GDG D+L S G+ ++ D GK R + + D N DGK +
Sbjct: 880 DFSGDGKSDVLARDSSGILWLYKGDGLGKFTAGRTQVGAGWNVMNAIFSTGDFNGDGKTD 939
Query: 527 LVTTDTHGNVAAWTAEGKGIWEQH---------LKSLVTQGPSIGDVDGDGHSDVVVPTL 577
++ DT G + + G W + +L+ GP GD +G+G +DV+
Sbjct: 940 VLARDTGGALWLYPGNGSSGWGNRSQIGSGWNGMTALL--GP--GDFNGNGKADVLARDA 995
Query: 578 SGNIYVLSG 586
SGN+++ G
Sbjct: 996 SGNLWLYPG 1004
>gi|32473887|ref|NP_866881.1| signal peptide [Rhodopirellula baltica SH 1]
gi|32444423|emb|CAD74422.1| hypothetical protein-signal peptide prediction [Rhodopirellula
baltica SH 1]
Length = 404
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 84/208 (40%), Gaps = 39/208 (18%)
Query: 470 VVDLDGDGNLDILVGTS---------FGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIN 520
V D++GDG DILVG +G ++ K+ P+ + DI+
Sbjct: 215 VGDINGDGRNDILVGWGWYEQPAENPWGQPWIAHRDWKLHASLPM---------LIEDID 265
Query: 521 DDGKIELVTTDTHGNVAAWTA------EGKGIWEQH-LKSLVTQGPSI--GDVDGDGHSD 571
DG +L+ + H W G W++H + +Q S+ D+DGDG+ D
Sbjct: 266 QDGDQDLIYGEGHNYGLQWWENTGANDSGSIEWKEHEIDRRFSQPHSMVWVDLDGDGNKD 325
Query: 572 VVVPTLSGNIYVLSGKD-GSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFD 630
+V Y +G D G K P Y V ++ + G GL IV F+
Sbjct: 326 LVTGK---RYYAHNGNDPGGKEVPCLYWYSFDVASKSFQRHVVDEGRVGTGLQIVAEDFN 382
Query: 631 GYLYLIDGPTSCADVVDIGETSYSMVLA 658
G DG T D+ G++ ++LA
Sbjct: 383 G-----DGKT---DLAVAGKSGTYLLLA 402
>gi|167822704|ref|ZP_02454175.1| hypothetical protein Bpseu9_03441 [Burkholderia pseudomallei 9]
Length = 1625
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|313677071|ref|YP_004055067.1| lipoprotein [Marivirga tractuosa DSM 4126]
gi|312943769|gb|ADR22959.1| lipoprotein [Marivirga tractuosa DSM 4126]
Length = 2607
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 51/117 (43%), Gaps = 12/117 (10%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP-----LEMAEIQGAVVAADINDDGK 524
V D DGDGNLDI G + GL + FP + + + ADIN DG
Sbjct: 645 VADFDGDGNLDIAAGKNGGLELFQGNGDGTFTAFPSPAFFSTIINDKAGLAIADINKDGN 704
Query: 525 IELVTTDTHGNVAAWTAEGKGIWEQHLKS-LVTQG---PS---IGDVDGDGHSDVVV 574
++++T + V+ +G G + S + G PS + D + DG D+ +
Sbjct: 705 MDIITANISDEVSVLLGDGSGNFAHASYSPFIPAGLGEPSALDVADFNNDGFLDIAL 761
>gi|239990739|ref|ZP_04711403.1| secreted trypsin-like protease [Streptomyces roseosporus NRRL
11379]
gi|291447752|ref|ZP_06587142.1| secreted protein [Streptomyces roseosporus NRRL 15998]
gi|291350699|gb|EFE77603.1| secreted protein [Streptomyces roseosporus NRRL 15998]
Length = 591
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 58/128 (45%), Gaps = 16/128 (12%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVV-------AADINDDGK 524
DL DG+ DIL S G F++ + GK F + +I G + + D DGK
Sbjct: 441 DLTSDGHPDILSTDSSGAFWL--YPGKGNGTFAARV-KIGGGGLNYNMVRGSGDFTGDGK 497
Query: 525 IELVTTDTHGNVAAWTAEGKGIWEQHLKSLV------TQGPSIGDVDGDGHSDVVVPTLS 578
+++ + G + + GK K LV T + GD GDGH+D++V +
Sbjct: 498 ADIIARGSGGKMYLFKGTGKASAPFAPKILVRTWDVATALVTTGDFTGDGHADIIVRVKT 557
Query: 579 GNIYVLSG 586
G +Y+ G
Sbjct: 558 GELYLYPG 565
>gi|374856885|dbj|BAL59738.1| hypothetical protein HGMM_OP4C374 [uncultured candidate division
OP1 bacterium]
Length = 337
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 60/131 (45%), Gaps = 20/131 (15%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGK-IREKFPL--EMAEIQGAVVAADINDDGKIELV 528
DLDGDG +ILVG + G Y+L+ GK I +F IQGA + + + L
Sbjct: 141 DLDGDGKQEILVGGT-GAIYILNQSGKLIGSEFRTTDTTIAIQGAQLWSGQRVIVAVHLA 199
Query: 529 TTDTHGNVAAWTAEGKGIWE---QHLKSLV------------TQGPSIGDVDGDGHSDVV 573
GN+ + EG+ +W H+ V P + D+D DG S+++
Sbjct: 200 QGVQDGNLFVLSPEGRLLWRFTNPHINRFVFIFLPSEAPYPILSAPLVVDLDHDGTSEIL 259
Query: 574 VPTLSGNIYVL 584
+ + G +Y+L
Sbjct: 260 LSS-GGRLYLL 269
>gi|344203029|ref|YP_004788172.1| FG-GAP repeat-containing protein [Muricauda ruestringensis DSM
13258]
gi|343954951|gb|AEM70750.1| FG-GAP repeat protein [Muricauda ruestringensis DSM 13258]
Length = 404
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 92/220 (41%), Gaps = 24/220 (10%)
Query: 466 SSPTVVDLDGDGNLDILVGTSF-----GLFYVLDHHGKIREKFPL-EMAEIQGAVVAADI 519
+S T+VD+D DG+LD LV Y D G + P+ + A+ +AD
Sbjct: 76 ASITLVDVDKDGDLDALVANGRHWAEQNYIYYNDGKGGFKMARPIGNFLDASYAIKSADF 135
Query: 520 NDDGKIEL-VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS----IGDVDGDGHSDVVV 574
N+DG +++ V D N + E E+ S T PS I DVD DG D+V+
Sbjct: 136 NNDGFMDIAVANDNIPNTLYFGTENNRFTER--TSFGTISPSRNLEIADVDNDGDMDLVI 193
Query: 575 PTLSG-NIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDG-- 631
N L+ G+ + Q L+ D+ G + ++T G
Sbjct: 194 SNRKAKNEICLNDGQGNFTTVLHFGDGSDQTIQTLVTDINNDGRQD----LITAERQGKN 249
Query: 632 YLYLIDGPTSCADVVDIGE---TSYSMVLAD-NVDGGDDL 667
+YL D S V++ G + S+ +AD N DG D+
Sbjct: 250 KIYLGDEKLSFTQVLEFGNENVETRSIGIADMNKDGFQDI 289
>gi|119962992|ref|YP_948385.1| FG-GAP repeat-containing protein [Arthrobacter aurescens TC1]
gi|119949851|gb|ABM08762.1| FG-GAP repeat domain protein [Arthrobacter aurescens TC1]
Length = 1034
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 56/129 (43%), Gaps = 18/129 (13%)
Query: 472 DLDGDGNLDILVGTSFGLFYVL--DHHGKI---REKFPLEMAEIQGAVVAADINDDGKIE 526
D GDG D+L S G+ ++ D GK R + + D N DGK +
Sbjct: 887 DFSGDGKSDVLARDSSGILWLYKGDGLGKFTAGRTQVGAGWNVMNAIFSTGDFNGDGKTD 946
Query: 527 LVTTDTHGNVAAWTAEGKGIWEQH---------LKSLVTQGPSIGDVDGDGHSDVVVPTL 577
++ DT G + + G W + +L+ GP GD +G+G +DV+
Sbjct: 947 VLARDTGGALWLYPGNGSSGWGNRSQIGSGWNGMTALL--GP--GDFNGNGKADVLARDA 1002
Query: 578 SGNIYVLSG 586
SGN+++ G
Sbjct: 1003 SGNLWLYPG 1011
>gi|149918367|ref|ZP_01906858.1| FG-GAP repeat/HVR domain protein [Plesiocystis pacifica SIR-1]
gi|149820893|gb|EDM80302.1| FG-GAP repeat/HVR domain protein [Plesiocystis pacifica SIR-1]
Length = 781
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 53/118 (44%), Gaps = 25/118 (21%)
Query: 382 HILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLD 441
++STP +AD+D D + E+I + + + EY N G + D
Sbjct: 216 QVVSTPSVADLDGDCIPEIIF--NTYRNSEYTTN-----------------GILRAIRGD 256
Query: 442 TKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKI 499
T + WT + +A++R+ ++P + DLD DGN +I+ Y LD+ G
Sbjct: 257 TGEKVWTFN------DANYRSDSGATPAIGDLDYDGNPEIVNPGQGQWLYALDYQGNF 308
>gi|148233336|ref|NP_001081444.1| integrin alpha-4 precursor [Xenopus laevis]
gi|3183042|sp|Q91687.1|ITA4_XENLA RecName: Full=Integrin alpha-4; AltName: Full=Integrin alpha-IV;
AltName: Full=VLA-4 subunit alpha; Flags: Precursor
gi|1293703|gb|AAA98673.1| integrin alpha 4 [Xenopus laevis]
Length = 1032
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 52/222 (23%), Positives = 84/222 (37%), Gaps = 41/222 (18%)
Query: 430 YVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYI-----YSSPTVVDLDGDGNLDILVG 484
Y G+I V+N +K DL+ + S+ Y + +P D+ G G
Sbjct: 220 YWTGSIFVYNTTENTIKSYVDLNNAVKFGSYLGYSVGAGHFRTPNGYDVIGGAPQQEQTG 279
Query: 485 TSFGLFYVLDHHGKIREKFPLEMAEIQGA-VVAADINDDGKIELV-------TTDTHGNV 536
+ Y + E ++ GA V AAD+N DG +L+ T G V
Sbjct: 280 RVYIFTYEEKQLTILFEAGGKKLGSYFGAAVCAADLNGDGLSDLLVGAPIQSTIREEGRV 339
Query: 537 AAWTAEGKGIWEQ----------HLKSLVTQGPSIGDVDGDGHSDVVVPT-----LSGNI 581
+ G G E+ + ++GD+D DG DV + L G +
Sbjct: 340 FVYMNTGSGAMEELKFELSGSDLYAARFGETIANLGDIDNDGFEDVAIAAPQEGDLEGAV 399
Query: 582 YVLSGKD------------GSKVRPYPYRTHGRVMNQVLLVD 611
Y+ +G++ GSK Y R G+ ++ VL +D
Sbjct: 400 YIYNGREKGITPSFSQRLQGSKF-GYGLRMFGQSLSNVLDID 440
>gi|385809667|ref|YP_005846063.1| PKD repeat protein [Ignavibacterium album JCM 16511]
gi|383801715|gb|AFH48795.1| PKD repeat protein [Ignavibacterium album JCM 16511]
Length = 669
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 43/86 (50%), Gaps = 6/86 (6%)
Query: 76 ATPLIADINSDGKLDIVVPSF---LHYLEVLEGSDGDKMPGWPAFHQ---SSVHSSPLLY 129
+TP + D +SD LD++ +F L+Y + S+ P F Q +S+P +
Sbjct: 428 STPFLFDFDSDNDLDLICGAFNGKLYYYKNTGTSENFSFEYVPTFFQIIDVGDNSTPEII 487
Query: 130 DIDKDGVREIALATYNGEVLFFRVSG 155
D D DG ++ + G++ +F+ +G
Sbjct: 488 DFDNDGTNDLFIGNREGKIFYFKNAG 513
>gi|172037455|ref|YP_001803956.1| hypothetical protein cce_2542 [Cyanothece sp. ATCC 51142]
gi|354553663|ref|ZP_08972969.1| FG-GAP repeat protein [Cyanothece sp. ATCC 51472]
gi|171698909|gb|ACB51890.1| hypothetical protein cce_2542 [Cyanothece sp. ATCC 51142]
gi|353554380|gb|EHC23770.1| FG-GAP repeat protein [Cyanothece sp. ATCC 51472]
Length = 1728
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 63/150 (42%), Gaps = 16/150 (10%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIRE---KFPLEMAEI--QGAVVAADIN 520
S PT +D + DG+LD VG G V + G + P+ + ++ A ADI+
Sbjct: 1206 SIPTFIDENNDGDLDFFVGAGDGTVAVAHNQGSATDPDFANPIIIRDVGNDSAPTFADID 1265
Query: 521 DDGKIELVTTDTHGNVAAW----TAEGKGIWEQHLKSLVTQG------PSIGDVDGDGHS 570
DG +L DT G + + + G + G P + D+DGDG
Sbjct: 1266 GDGDQDLFVGDTFGRIHYYRNRGSVNGASFDPPVINPFGIGGNGGLAKPDLADIDGDGDH 1325
Query: 571 DVVVPTLSGNIYVLSGKDGSKVRPYPYRTH 600
D+ V +GNI G++ P R++
Sbjct: 1326 DLFVGNATGNILFYR-NTGTRTNPIFTRSN 1354
>gi|94967416|ref|YP_589464.1| integrin like protein [Candidatus Koribacter versatilis Ellin345]
gi|94549466|gb|ABF39390.1| Integrin like protein [Candidatus Koribacter versatilis Ellin345]
Length = 974
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 54/131 (41%), Gaps = 14/131 (10%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQ----GAVVAADINDDGKI 525
V D +GDG D+ G + L L + + P + +VA D N DG
Sbjct: 335 VADFNGDGIPDLATGGAGSLSVFLANGAGAFTQVPTTSKTLILGNFATMVAGDFNGDGIT 394
Query: 526 ELVTTDT--HGNVAAWTAEGKGIWEQHLKSLVTQGPSIG--------DVDGDGHSDVVVP 575
++ D V + G G + ++V+ G S G D +GDG +DV VP
Sbjct: 395 DIAALDATFSETVRVYFGSGDGTFTTGPTNMVSPGGSAGAPMVMVTADFNGDGKADVAVP 454
Query: 576 TLSGNIYVLSG 586
+G + VL G
Sbjct: 455 LWNGGVAVLLG 465
>gi|213624000|gb|AAI70506.1| Integrin alpha 4 [Xenopus laevis]
Length = 1032
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 52/222 (23%), Positives = 84/222 (37%), Gaps = 41/222 (18%)
Query: 430 YVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYI-----YSSPTVVDLDGDGNLDILVG 484
Y G+I V+N +K DL+ + S+ Y + +P D+ G G
Sbjct: 220 YWTGSIFVYNTTENTIKSYVDLNNAVKFGSYLGYSVGAGHFRTPNGYDVIGGAPQQEQTG 279
Query: 485 TSFGLFYVLDHHGKIREKFPLEMAEIQGA-VVAADINDDGKIELV-------TTDTHGNV 536
+ Y + E ++ GA V AAD+N DG +L+ T G V
Sbjct: 280 RVYIFTYEEKQLTILFEAGGKKLGSYFGAAVCAADLNGDGLSDLLVGAPIQSTIREEGRV 339
Query: 537 AAWTAEGKGIWEQ----------HLKSLVTQGPSIGDVDGDGHSDVVVPT-----LSGNI 581
+ G G E+ + ++GD+D DG DV + L G +
Sbjct: 340 FVYMNTGSGAMEELKFELSGSDLYAARFGETIANLGDIDNDGFEDVAIAAPQEGDLEGAV 399
Query: 582 YVLSGKD------------GSKVRPYPYRTHGRVMNQVLLVD 611
Y+ +G++ GSK Y R G+ ++ VL +D
Sbjct: 400 YIYNGREKGITPSFSQRLQGSKF-GYGLRMFGQSLSNVLDID 440
>gi|167538461|ref|XP_001750894.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770578|gb|EDQ84264.1| predicted protein [Monosiga brevicollis MX1]
Length = 444
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 58/227 (25%), Positives = 95/227 (41%), Gaps = 58/227 (25%)
Query: 390 ADIDNDGVSEMIIA------VSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTK 443
AD+DNDG +++ A ++++ +H G + A IV NL T
Sbjct: 108 ADLDNDGYIDVLSAGFGDGTIAWYHNHGN---------------GTFSAPLIVAANLTTA 152
Query: 444 QVKWTTDLD-------LSTD-------------NASFRAYIYSSPTV--------VDLDG 475
+ + DLD LS N +F I + T+ DLD
Sbjct: 153 RSVYAADLDHDNYIDILSASEGDDTVAWYRNYGNGTFSNQIIINQTLDGPRSVYAADLDN 212
Query: 476 DGNLDIL---VGTSFGLFYVLDHHGKIREKFPLEMAEIQG-AVVAADINDDGKIELVTTD 531
DGN+D+L V + +Y + G P+ A AV AAD+++DG +++++
Sbjct: 213 DGNMDVLSASVNDNTIAWYQNNGAGVFSAPKPITTAANGSLAVYAADLDNDGHLDVLSAS 272
Query: 532 THGNVAAWTA-EGKGIWEQH--LKSLVTQGPSI--GDVDGDGHSDVV 573
+ N AW G G + + S + ++ D+D DGH+DVV
Sbjct: 273 YYDNKIAWYHNNGNGTFAPQDVITSAIDGAWTVYAADLDNDGHTDVV 319
>gi|186686591|ref|YP_001869787.1| FG-GAP repeat-containing protein [Nostoc punctiforme PCC 73102]
gi|186469043|gb|ACC84844.1| FG-GAP repeat protein [Nostoc punctiforme PCC 73102]
Length = 780
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 60/214 (28%), Positives = 87/214 (40%), Gaps = 16/214 (7%)
Query: 470 VVDLDGDGNLDILVGTS--FGLFYVL-DHHGKIREKFPLEMAEIQGAVVAADINDDGKIE 526
V D DGDG LD++V S + +L D G + AV AD + DGK++
Sbjct: 120 VEDFDGDGKLDLVVANSDDNNISVLLNDGSGGFGTLTNFAVGTNPNAVAVADFDGDGKLD 179
Query: 527 LVTTDTH-GNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTLSGNIY 582
LV ++ NV+ +G G + V P ++ D DGDG D+VV N
Sbjct: 180 LVVANSDDNNVSVLLNDGSGGFGTPTNFAVGTNPRSVAVADFDGDGKLDLVVANSDDNNV 239
Query: 583 VLSGKDGSKVRPYPYR-THGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLI----D 637
L DGS P G V + D G+ +V S D + L+
Sbjct: 240 SLLLNDGSGGFGAPTSFAVGTNPRSVAVADFDGDGKLD---LVVANSDDNNVSLLLNDGS 296
Query: 638 GPTSCADVVDIGETSYSMVLADNVDGGDDLDLIV 671
G + +G + S+ + D DG LDL+V
Sbjct: 297 GGFGASTSFAVGTSPRSVAVED-FDGDGKLDLVV 329
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 53/121 (43%), Gaps = 7/121 (5%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYVL---DHHGKIREKFPLEMAEIQGAVVAADINDDG 523
S V D DGDG LD++V S L D G + +V AD + DG
Sbjct: 215 SVAVADFDGDGKLDLVVANSDDNNVSLLLNDGSGGFGAPTSFAVGTNPRSVAVADFDGDG 274
Query: 524 KIELVTTDT-HGNVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTLSG 579
K++LV ++ NV+ +G G + V P ++ D DGDG D+VV +
Sbjct: 275 KLDLVVANSDDNNVSLLLNDGSGGFGASTSFAVGTSPRSVAVEDFDGDGKLDLVVANSTD 334
Query: 580 N 580
N
Sbjct: 335 N 335
>gi|158521952|ref|YP_001529822.1| hypothetical protein Dole_1941 [Desulfococcus oleovorans Hxd3]
gi|158510778|gb|ABW67745.1| conserved hypothetical protein [Desulfococcus oleovorans Hxd3]
Length = 758
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 52/240 (21%), Positives = 83/240 (34%), Gaps = 34/240 (14%)
Query: 383 ILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDT 442
+ +TP DID DG + + Y FD + E + DI N+ T
Sbjct: 481 VFTTPAFVDIDGDGDMDAFMGARYGFDLDVELAEETPDVVSVSDISDPSVFIQYFQNVGT 540
Query: 443 KQVKWTTDLDLSTDNASF---RAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKI 499
+ + + A+ A+ P +D DGDG+ D+ +G G Y ++ G
Sbjct: 541 PTAPYFAPMPSQQNPAATIGGTAFDDYKPAFMDADGDGDFDLFIGDRDGDIYYFENTGTA 600
Query: 500 REK-------------FPLEMAEIQGAVVAADINDDGKIELVTTDTHG------NVAAWT 540
F L + AD + DG ++ V + H N A T
Sbjct: 601 NAAEFNAALPEYEANPFGLTAVDHNATPTFADTDGDGDMDAVVGEYHSAIHYFENTADAT 660
Query: 541 AEGK------------GIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKD 588
A GI+ + + P+ D+D DG D+ V G I L ++
Sbjct: 661 ATEPIFVERTGTDNPFGIYGDEDGAYLFTSPAFADIDSDGDQDLFVGEWGGRILFLENRE 720
>gi|167533821|ref|XP_001748589.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772830|gb|EDQ86476.1| predicted protein [Monosiga brevicollis MX1]
Length = 435
Score = 40.8 bits (94), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 98/244 (40%), Gaps = 40/244 (16%)
Query: 389 IADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWT 448
+AD+DNDG ++I A +Y N LG G + A
Sbjct: 104 VADLDNDGWPDVIAACPVSRQLVWYRN------LGTRKFGPAIEVA-------------- 143
Query: 449 TDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFPL 505
L+L+T +A + A DLD DG +D+L + LF+ + L
Sbjct: 144 --LNLTTPSAVYAA---------DLDQDGWVDLLSASEVDGHVLFHKNNGDAGFAVPLVL 192
Query: 506 EMAEIQG-AVVAADINDDGKIELVTTDTHGNVAAWTA-EGKGIWEQHLKSLVTQGPSIG- 562
M+ I G ++ AD++ D ++++ N AW A G G + VT +
Sbjct: 193 SMSAIGGHSITVADLDQDQHLDVLVAGGSNNTVAWFANRGDGTFAAPSFLPVTNAACVVA 252
Query: 563 -DVDGDGHSDVVVPTLS-GNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSK 620
DVDG+G DVVV + S G + + +GS P T + Q+ DL G +
Sbjct: 253 VDVDGNGALDVVVGSSSIGARWFPNHGNGSFASPIAI-TANSAVQQLATGDLDADGHEDL 311
Query: 621 GLTI 624
L +
Sbjct: 312 ALLL 315
>gi|440891121|gb|ELR45029.1| Integrin alpha-9, partial [Bos grunniens mutus]
Length = 462
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 43/182 (23%), Positives = 68/182 (37%), Gaps = 39/182 (21%)
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYV 492
G + +F D + S +Y SS VDL+ DG D+LV
Sbjct: 99 GKVYIFRADRRSGTLIKIFQASGKKMG--SYFGSSLCAVDLNTDGLSDLLV--------- 147
Query: 493 LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNV-AAWTAEGKGIWEQHL 551
GA + ++I D+G++ + +G + T G G + H
Sbjct: 148 -------------------GAPMFSEIRDEGQVTVYINKGNGVLEEQLTLSGDGAYNAHF 188
Query: 552 KSLVTQGPSIGDVDGDGHSDVVVPT-----LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQ 606
+ S+GD+D DG DV + SG +Y+ G V Y + GR ++
Sbjct: 189 GESIA---SLGDLDDDGFPDVAIGAPKEDDFSGAVYIYHGDARGMVPQYSMKLSGRKISP 245
Query: 607 VL 608
VL
Sbjct: 246 VL 247
>gi|168702115|ref|ZP_02734392.1| Cadherin [Gemmata obscuriglobus UQM 2246]
Length = 1740
Score = 40.8 bits (94), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 65/142 (45%), Gaps = 20/142 (14%)
Query: 472 DLDGDGNLDILVGTSFGLF----YVLDHH-GKIREKFPLEMAEIQGAVVAA--DINDDGK 524
D+DGDG D++ G G V D G R F L G V A D+N DG
Sbjct: 1488 DVDGDGVADLITGAGAGSMGGHVKVFDGATGAQRFSF-LSFDGFAGGVFVATGDVNGDGV 1546
Query: 525 IELVTTDTHGNVAAWTAEGKGIWEQHLKSLVT-----QGP---SIGDVDGDGHSDVVVPT 576
++V + G A G L+S + +G ++GDVDGDG +D+V
Sbjct: 1547 ADIVVSADAG-AAPHVKVFSGADGSLLRSFLAYDAGFRGGVRVAVGDVDGDGFADIVTGN 1605
Query: 577 LSG---NIYVLSGKDGSKVRPY 595
+G N+ V SGKDG+ ++ +
Sbjct: 1606 GAGAAANVKVFSGKDGALLKSF 1627
>gi|156743066|ref|YP_001433195.1| ASPIC/UnbV domain-containing protein [Roseiflexus castenholzii DSM
13941]
gi|156234394|gb|ABU59177.1| ASPIC/UnbV domain protein [Roseiflexus castenholzii DSM 13941]
Length = 453
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 75/170 (44%), Gaps = 27/170 (15%)
Query: 472 DLDGDGNLDILVGTSFG--LFYVLDHHGKIREKFPLEMAEIQGA--VVAADINDDGKIEL 527
D DGDG DI+ G G ++ G + P +MAE V+AAD ++DG++E+
Sbjct: 236 DFDGDGRFDIVYGNWEGPHRLFLQKWEGLFVDAAPPDMAEPSRVRTVIAADFDNDGELEI 295
Query: 528 VTTDT---------HGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLS 578
+ G+V WT G + + + G ++GD DGDG ++++
Sbjct: 296 FFNNIGEPNRLFARRGDV--WTKIDPG--DALEPAGLGTGAAVGDFDGDGRLELLISHGE 351
Query: 579 GNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTS 628
++ LS YR + ++ LT+ G ++G +V ++
Sbjct: 352 ADLQPLS----------LYRPVSNEYAWLRVLPLTRAGAPARGAVVVLSA 391
>gi|162455599|ref|YP_001617966.1| hypothetical protein sce7317 [Sorangium cellulosum So ce56]
gi|161166181|emb|CAN97486.1| hypothetical protein predicted by Glimmer/Critica [Sorangium
cellulosum So ce56]
Length = 1219
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 83/183 (45%), Gaps = 35/183 (19%)
Query: 423 GGIDIGKYVAGAIVVFNLDTKQVKWTT-DLDLSTDNASFRAYIYSSPTVVDLDGDGNLDI 481
G ID+ +V+G VV + V+W+T L L T + +V DLDGDG L+I
Sbjct: 132 GDIDL-LFVSGGRVVLSDPLGFVRWSTPSLGLQTLH-----------SVDDLDGDGRLEI 179
Query: 482 LVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIEL---------VTTDT 532
+ + G + G I + GA AD N DG ++L V TD+
Sbjct: 180 VALSGAGAAVIDGSSGAILWREDASELGTLGAARLADFNGDGHLDLFLDECGCCTVRTDS 239
Query: 533 HGNVAAWTAEGKGIWEQ--------HLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVL 584
G + ++ A G G E+ H + + ++GD +GDG D++V + ++++
Sbjct: 240 PGVIYSF-ARGFGAVERLPPPPRRVHCNATID---TVGDWNGDGADDLLVSSYD-QLFLV 294
Query: 585 SGK 587
SG
Sbjct: 295 SGS 297
>gi|29829464|ref|NP_824098.1| hypothetical protein SAV_2922 [Streptomyces avermitilis MA-4680]
gi|29606572|dbj|BAC70633.1| putative integrin-like protein [Streptomyces avermitilis MA-4680]
Length = 470
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 60/149 (40%), Gaps = 22/149 (14%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVL-DHHGKIREKFPLEMAEIQGAVVAADINDDGKIELV 528
DLDGD D++V G VL + + E+ G V A D N DGK +LV
Sbjct: 106 AADLDGDRYTDLVVNGYDGKAVVLWGSAAGLSGQGSAELPWNGGHVTAGDFNGDGKRDLV 165
Query: 529 TT----------DTHG---NVAAWTAEGKGIWEQHLKSLVTQGPS---IGDVDGDGHSDV 572
T D G + +T EGK Q + + T GP GD+ GDG D+
Sbjct: 166 TVDFPRSDDPDNDDQGMLISYGPFTREGKAASTQAVATSRTFGPGGFVTGDITGDGADDI 225
Query: 573 VV-----PTLSGNIYVLSGKDGSKVRPYP 596
V G+ + GKDG P P
Sbjct: 226 VSNHGFEEMAYGSKFWAGGKDGVNTTPKP 254
>gi|430745373|ref|YP_007204502.1| FG-GAP repeat-containing protein [Singulisphaera acidiphila DSM
18658]
gi|430017093|gb|AGA28807.1| FG-GAP repeat protein [Singulisphaera acidiphila DSM 18658]
Length = 658
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 72/167 (43%), Gaps = 41/167 (24%)
Query: 468 PTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPL---------EMAEIQ-GAVV-- 515
P +D D DG++D++VG G +++H GK+ + P E A+++ GA+V
Sbjct: 290 PVAIDWDKDGDVDLVVGDEDGRVALIEHTGKVIDGLPRFQAPAYFRQEAADVKFGALVTP 349
Query: 516 -AADINDDGKIELVTTDTH-----------GNVAAWT------AEGKGIWEQHLKSLVTQ 557
+ D + DG +LV ++ GN W A G I + Q
Sbjct: 350 CSVDWDGDGDEDLVCGNSSGVIGLIENLDGGNPPRWAAPRLLEAAGSPIRIMAGPNGSIQ 409
Query: 558 GP----------SIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRP 594
GP S+ D D DG D+VV ++ G + V GS+ +P
Sbjct: 410 GPCEAKWGYTTLSVADWDRDGQPDLVVNSIWGKV-VWYRNAGSRSKP 455
>gi|343084914|ref|YP_004774209.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342353448|gb|AEL25978.1| FG-GAP repeat protein [Cyclobacterium marinum DSM 745]
Length = 668
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 34/145 (23%), Positives = 57/145 (39%), Gaps = 25/145 (17%)
Query: 454 STDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLF-YVLDHHGKIREKFP-LEMAEIQ 511
DN F A + +P VD D DG+ DI+ G S G F ++ + G ++ K+ ++ E
Sbjct: 354 QADNLKFGALV--TPVSVDWDNDGDEDIIAGNSAGHFAFIENLDGAVKPKWAEPKLLETD 411
Query: 512 GAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSD 571
G + D+G I+ G S+ D DGDG D
Sbjct: 412 GEPIRIQAGDNGSIQGPAEAKWGYTTL---------------------SVADWDGDGQKD 450
Query: 572 VVVPTLSGNIYVLSGKDGSKVRPYP 596
++ ++ G + + + P P
Sbjct: 451 IIFNSIWGRVEWIKNTGNGLLAPQP 475
>gi|397566667|gb|EJK45144.1| hypothetical protein THAOC_36251 [Thalassiosira oceanica]
Length = 1109
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 66/144 (45%), Gaps = 31/144 (21%)
Query: 469 TVVDLDGDGNLDILVGTS----FGLFYVLDHHGKIREKFPLEMAEIQGA--VVAADINDD 522
TV DLD DG+ DI+ +S +F +D G E + + +GA VVAAD+N D
Sbjct: 210 TVADLDNDGDPDIITASSGDNTIAVFKNIDK-GTFCEIKEVVDSNAKGARTVVAADLNGD 268
Query: 523 GKIELVTTDTHGNVAAW---------------TAEGKGIWEQHLKSLVTQGPS------- 560
G ++L + + AW T G ++ +K++++ G
Sbjct: 269 GLLDLASASKDDDTVAWYPNTMRNVVNHSTGATHRVGGYFDPSVKNVISNGTESTGAYSL 328
Query: 561 -IGDVDGDGHSDVVVPTLSGNIYV 583
DVD DG D+VV + +GN +V
Sbjct: 329 VAADVDLDGDQDLVVAS-NGNDHV 351
>gi|149918171|ref|ZP_01906663.1| Integrins alpha chain [Plesiocystis pacifica SIR-1]
gi|149820931|gb|EDM80338.1| Integrins alpha chain [Plesiocystis pacifica SIR-1]
Length = 411
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 55/132 (41%), Gaps = 19/132 (14%)
Query: 472 DLDGDGNLDILV-------GTSFGLFYVL---------DHHGKIREKFPLEMAEIQGAVV 515
D DGDGNLD+ V G L Y+ D G + L E + +
Sbjct: 276 DFDGDGNLDVAVFAPSRTTGGPGNLMYLTTPSVVVLYGDGAGGLTLGASLAAGESPVSGL 335
Query: 516 AADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS---IGDVDGDGHSDV 572
A DIN+DG I++V D + G + + V GP + D DGDG D+
Sbjct: 336 ATDINNDGAIDIVVADAGEDKLPVFLNRGGEFPDQINIDVAAGPQTLLLEDFDGDGVDDI 395
Query: 573 VVPTLSGNIYVL 584
VV +G + V+
Sbjct: 396 VVGNANGVVAVV 407
>gi|167737097|ref|ZP_02409871.1| FG-GAP/YD repeat domain protein [Burkholderia pseudomallei 14]
Length = 1060
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 56/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGFTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|440697334|ref|ZP_20879759.1| FG-GAP repeat protein [Streptomyces turgidiscabies Car8]
gi|440280380|gb|ELP68123.1| FG-GAP repeat protein [Streptomyces turgidiscabies Car8]
Length = 1025
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 58/125 (46%), Gaps = 12/125 (9%)
Query: 472 DLDGDGNLDILVGTSFGLF--YVLDHHGKIREKFPL--EMAEIQGAVVAADINDDGKIEL 527
DL GDG D++ G+ Y D G + E+ L + + A D+N DG ++
Sbjct: 880 DLTGDGLADLVARDKAGVLWRYSSDGDGGLTERVELVGGLGGYTRLIGAGDLNRDGIGDM 939
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSI------GDVDGDGHSDVVVPTLSGNI 581
V D+ G + W +GKG + + + G S+ GD+ GDG D+V +G +
Sbjct: 940 VGLDSAGALWRWLGDGKGGFGPRAR--IAGGISVKALAVPGDLTGDGRPDLVGLDSAGAL 997
Query: 582 YVLSG 586
+ +G
Sbjct: 998 WRWNG 1002
>gi|436833728|ref|YP_007318944.1| FG-GAP repeat protein [Fibrella aestuarina BUZ 2]
gi|384065141|emb|CCG98351.1| FG-GAP repeat protein [Fibrella aestuarina BUZ 2]
Length = 420
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 53/127 (41%), Gaps = 17/127 (13%)
Query: 461 RAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAV------ 514
R ++ V DL+ DG LDI+ GT ++ + RE P + +
Sbjct: 65 REFMSEGVAVADLNKDGKLDIIAGT----YWFEAPNWTRRELIPSRTYDPRQGYSDSFLN 120
Query: 515 VAADINDDGKIELVTTDTHGNVAAWTAEGK---GIWEQHL----KSLVTQGPSIGDVDGD 567
+ D+N DG ++V D G A W K G W + + + + P D+DGD
Sbjct: 121 LGMDVNLDGWDDVVIIDFPGRPAFWLENNKNKPGAWTKRMLVDSMGIANESPGFIDIDGD 180
Query: 568 GHSDVVV 574
G D++
Sbjct: 181 GRLDILC 187
>gi|167538111|ref|XP_001750721.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770745|gb|EDQ84426.1| predicted protein [Monosiga brevicollis MX1]
Length = 497
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 41/157 (26%), Positives = 66/157 (42%), Gaps = 11/157 (7%)
Query: 472 DLDGDGNLDILVGTSFG---LFYVLDHHGKIREK-FPLEMAEIQGAVVAADINDDGKIEL 527
DLD DG++D+L S G +Y D G + F + +V AAD+N DG +++
Sbjct: 309 DLDNDGDMDVLSANSVGNTITWYRNDGTGVFPQTLFVASSVDSPRSVYAADLNQDGFLDV 368
Query: 528 VTTDTHGNVAAW--TAEGKGIWEQHLKSLVTQ---GPSIGDVDGDGHSDVVVPTLSGN-- 580
++ ++ N AW G G H+ + D+D DG DV+ ++ N
Sbjct: 369 ISANSGDNSIAWYQNQGGTGFSAPHVITTTASYALAVYAADLDQDGDMDVLSASVVDNKI 428
Query: 581 IYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGE 617
+ + GS P+ T V DL G+
Sbjct: 429 AWYRNNGQGSFSSPFVISTSALYAYSVYAADLDHDGD 465
>gi|167525515|ref|XP_001747092.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774387|gb|EDQ88016.1| predicted protein [Monosiga brevicollis MX1]
Length = 422
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 8/92 (8%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGL----FYVLDHHGKIREKFPLEMAEIQGA--VVAADIN 520
S DLDGDG++D+L + +Y + G K + + I+GA V AAD++
Sbjct: 136 SVVAADLDGDGDMDVLSAIGYDADEVAWYQNNGTGGFSGKRVIA-SNIKGAHSVFAADLD 194
Query: 521 DDGKIELVTTDTHGNVAAW-TAEGKGIWEQHL 551
+DG +++++ D H N +W G G + Q L
Sbjct: 195 NDGDLDVLSADLHDNKVSWYRNNGSGQFSQRL 226
>gi|440715267|ref|ZP_20895814.1| hypothetical protein RBSWK_02871 [Rhodopirellula baltica SWK14]
gi|436439611|gb|ELP33025.1| hypothetical protein RBSWK_02871 [Rhodopirellula baltica SWK14]
Length = 402
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 90/230 (39%), Gaps = 32/230 (13%)
Query: 471 VDLDGDGNLDILVGTSF--GLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELV 528
D DGDG+LDI G G ++ D G R +++A DI+ DG I+L+
Sbjct: 130 ADFDGDGDLDIASGNDMAKGQIFLNDGSGLFRLHGEYGDVSSLRSLLAHDIDQDGDIDLI 189
Query: 529 TTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG------------DVDGDGHSDVVVPT 576
T I+ ++ GPS G D++GDGH D+V+
Sbjct: 190 AT--------CRGRPNRIYVNDGEADFNSGPSFGRNTDSTIDVAAGDLNGDGHMDLVLAN 241
Query: 577 L--SGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLY 634
N + + R P+ V + DL G + ++
Sbjct: 242 RDHQPNEILFGDAELQFSRSVPFGIGDEPSRAVAVADLNTDGHLD--WVVANIGRANQVF 299
Query: 635 LIDGPTSCADVVDIGET---SYSMVLADNVDGGDDLDLIVTTMN--GNVF 679
DG A + G + +Y++ +AD ++ + LD++V +N G+VF
Sbjct: 300 FGDGAGGVAGSAEFGASDSQTYALAIAD-MNNDEVLDIVVGNLNQPGSVF 348
>gi|126442186|ref|YP_001057658.1| Rhs family protein [Burkholderia pseudomallei 668]
gi|126221679|gb|ABN85185.1| Rhs family protein [Burkholderia pseudomallei 668]
Length = 2031
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|226199557|ref|ZP_03795114.1| FG-GAP repeat/YD repeat/RHS repeat protein [Burkholderia
pseudomallei Pakistan 9]
gi|225928438|gb|EEH24468.1| FG-GAP repeat/YD repeat/RHS repeat protein [Burkholderia
pseudomallei Pakistan 9]
Length = 2031
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|217419995|ref|ZP_03451501.1| FG-GAP repeat/YD repeat/RHS repeat protein [Burkholderia
pseudomallei 576]
gi|217397299|gb|EEC37315.1| FG-GAP repeat/YD repeat/RHS repeat protein [Burkholderia
pseudomallei 576]
Length = 2031
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|83648291|ref|YP_436726.1| Rhs family protein [Hahella chejuensis KCTC 2396]
gi|83636334|gb|ABC32301.1| Rhs family protein [Hahella chejuensis KCTC 2396]
Length = 3862
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 84/198 (42%), Gaps = 25/198 (12%)
Query: 390 ADIDNDGVSEMIIAVSYFFDH---EYYDNPEHLKELGGID------IGKYVAGAIVVFNL 440
ADID DG E+I A +DH N E LG D I G + ++N
Sbjct: 1299 ADIDLDGDMELI-AGPTIYDHTGRRIITNGEGTSALGNFDSDDFPEIVNVYDGYLSLYNH 1357
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVG--TSFGLFYVLDHHGK 498
D V W D P V DLDGDG +I V T +G++ + +G
Sbjct: 1358 D-GSVIWK-------DKKIEGGGRGGPPVVADLDGDGVPEIGVAGKTRYGVY---NANGD 1406
Query: 499 IREKFPL-EMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWE-QHLKSLVT 556
+ P + + A A D NDDG++E+V D + G I+E +++
Sbjct: 1407 VVWSQPTRDQSSGVTASSAFDFNDDGRMEIVYGDEYYLRVYDGVSGDVIYEIENVSVTAH 1466
Query: 557 QGPSIGDVDGDGHSDVVV 574
+ P I D+D D H++++
Sbjct: 1467 EYPVIADIDHDDHAEIIA 1484
>gi|53718233|ref|YP_107219.1| hypothetical protein BPSL0590 [Burkholderia pseudomallei K96243]
gi|52208647|emb|CAH34583.1| putative membrane protein [Burkholderia pseudomallei K96243]
Length = 2031
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|167537983|ref|XP_001750658.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770864|gb|EDQ84542.1| predicted protein [Monosiga brevicollis MX1]
Length = 2548
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 72/160 (45%), Gaps = 31/160 (19%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIR-----EKFPLEMAEIQGA--VVAADINDD 522
V D+DGDG DI+VG G V H + +++P+ A QG + AD N D
Sbjct: 1845 VGDVDGDGIADIVVG---GASRVSLLHASVAPTFATQRYPISTAA-QGVHDLRLADFNGD 1900
Query: 523 GKIELVTTDTHGNVAAWTAEGKGIWEQH--LKSLVTQ--------GPSIGDVDGDGHSDV 572
G++++VT++ G+V G+ + + +VT+ G I DV+ DG D
Sbjct: 1901 GRLDVVTSERFGSVVRVYLNQPGVGDASGPVSWVVTEVASVAGAFGLDIVDVENDGDMD- 1959
Query: 573 VVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
+V++ +DG VR + + N LL L
Sbjct: 1960 ---------FVVASRDGGDVRLFLNDGNATFTNSALLSGL 1990
>gi|254181833|ref|ZP_04888430.1| FG-GAP/YD repeat domain protein [Burkholderia pseudomallei 1655]
gi|184212371|gb|EDU09414.1| FG-GAP/YD repeat domain protein [Burkholderia pseudomallei 1655]
Length = 2031
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|456388856|gb|EMF54296.1| serine-type endopeptidase [Streptomyces bottropensis ATCC 25435]
Length = 569
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 64/158 (40%), Gaps = 23/158 (14%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGK----IREKFPLEMAEIQGAVVAADINDDGKIEL 527
D+ GD D+L S G ++ G R K ++ V D N DGK +L
Sbjct: 416 DVTGDALPDLLSVDSAGALWIYPGKGTGSFGTRVKVGTGWSQYNAVVGHGDFNGDGKADL 475
Query: 528 VT-TDTHGNVAAWTAEGK---GIWEQHLK---------SLVTQGPSIGDVDGDGHSDVVV 574
+ T T NV + GK G + +K +L+T G DV GDG +D++
Sbjct: 476 IARTKTGSNVYLYKGTGKSGTGAFATRIKVRSDWSAYNTLLTPG----DVSGDGRADLLA 531
Query: 575 PTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
T +G +Y+ G K + T G V DL
Sbjct: 532 RTPAGTLYLYKGT--GKATSEIFGTRGSVGTSYAQYDL 567
>gi|344288139|ref|XP_003415808.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-9-like [Loxodonta
africana]
Length = 1145
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 38/153 (24%), Positives = 60/153 (39%), Gaps = 37/153 (24%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIND 521
+Y SS VDL+ DG D+LVG A + ++I D
Sbjct: 434 SYFGSSLCAVDLNADGLSDLLVG----------------------------APMFSEIRD 465
Query: 522 DGKIELVTTDTHGNVAAWTA-EGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT---- 576
+G++ +G + A G + H + S+GD+D DG DV +
Sbjct: 466 EGQVTTYINRGNGALEEQVALTGDSAYNAHFGESIA---SLGDLDDDGFPDVAIGAPKED 522
Query: 577 -LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVL 608
+G +Y+ G G V Y + G+ +N VL
Sbjct: 523 DFAGAVYIYHGDAGGIVPQYSMKLSGQKINPVL 555
>gi|254196583|ref|ZP_04903007.1| FG-GAP/YD repeat domain protein [Burkholderia pseudomallei S13]
gi|169653326|gb|EDS86019.1| FG-GAP/YD repeat domain protein [Burkholderia pseudomallei S13]
Length = 2031
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|407703213|ref|YP_006816361.1| Na-Ca exchanger/integrin-beta4 [Bacillus thuringiensis MC28]
gi|407387628|gb|AFU18122.1| Na-Ca exchanger/integrin-beta4 [Bacillus thuringiensis MC28]
Length = 367
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 62/134 (46%), Gaps = 12/134 (8%)
Query: 472 DLDGDGNLDILVGT-SFGLFYVLDHHGKIREKFPLEMAE--IQGAVVAADINDDGKIEL- 527
D +GDGNLD+ V SF + G + + + + G ++AAD N DGK++L
Sbjct: 13 DFNGDGNLDLAVANNSFVSILLGVGIGTFLPAMNINIGDSPLLG-ILAADFNGDGKLDLA 71
Query: 528 VTTDTHGN-VAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTL---SGN 580
VT D GN V+ G G ++ P + GD +GDG D+ + + + N
Sbjct: 72 VTNDAPGNDVSVLLGNGDGTFQPSTFYPAGDAPIAITTGDFNGDGKLDLAITDVDVGANN 131
Query: 581 IYVLSGKDGSKVRP 594
+ VL G +P
Sbjct: 132 VSVLLGNGDGTFQP 145
>gi|373459465|ref|ZP_09551232.1| FG-GAP repeat protein [Caldithrix abyssi DSM 13497]
gi|371721129|gb|EHO42900.1| FG-GAP repeat protein [Caldithrix abyssi DSM 13497]
Length = 600
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 73/303 (24%), Positives = 116/303 (38%), Gaps = 51/303 (16%)
Query: 391 DIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWTTD 450
DID DGV E+ +A F D D A+++F+LD T
Sbjct: 227 DIDGDGVEEVAVAFRAFSDASTND-------------------ALMIFSLDGGFAGEFTQ 267
Query: 451 LDLST-DNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKF----PL 505
+ D +YS+ + D+D DGNL+ T FY + + + L
Sbjct: 268 FKIEMIDTTGDWGSVYSA-DITDIDNDGNLEAYFSTDNHTFYEATGADQYQLYYMDTPTL 326
Query: 506 EMAEIQGAVVAADINDDGKIELVTTDTHG---------NVAAWTAEGKGIWEQHLKSLVT 556
+ IQ AVV AD++ DG EL+ T G ++A+ + + + +
Sbjct: 327 DAWTIQ-AVVQADVDGDGTNELLWGHTSGALRMLRGVADLASMNSSNE-VEIAMVNPAGC 384
Query: 557 QGPSIGDVDGDGHSDVVV-PTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKR 615
+G + GD DGDG D+ + SG + + + T+ RV +
Sbjct: 385 RGLTAGDYDGDGKVDIFMGGNYSGAVDRIEYNGSGDIADSASYTYERVFQDTI------- 437
Query: 616 GEKSKGLTIVTTSFDGYLYLI-DGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTM 674
S G + + SF G + I G T+ D+ GE + D GD L + +
Sbjct: 438 --PSGGTRVYSVSFPGDNFCIKQGGTTSNDLNGNGEPELLIAYED----GDSLQNWIVMI 491
Query: 675 NGN 677
GN
Sbjct: 492 EGN 494
>gi|418557997|ref|ZP_13122571.1| FG-GAP/YD repeat-containing protein [Burkholderia pseudomallei
354e]
gi|385363759|gb|EIF69517.1| FG-GAP/YD repeat-containing protein [Burkholderia pseudomallei
354e]
Length = 2031
Score = 40.8 bits (94), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|298715382|emb|CBJ27993.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 412
Score = 40.8 bits (94), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 48/91 (52%), Gaps = 11/91 (12%)
Query: 539 WTAEGKGIWEQHLKSL---------VTQGPSIGDVDGDGHSDVVVPTLSGNIY-VLSGKD 588
WTA G G +S+ VT P I + D D SDV++ S ++ VLS +D
Sbjct: 298 WTASGDGPGLAMSRSIGDKMAHSVGVTYEPEITEFDLDKRSDVMLVFASDGVWDVLSAED 357
Query: 589 GSKVRPY-PYRTHGRVMNQVLLVDLTKRGEK 618
KVR + P R V N+VLL+ +++RG++
Sbjct: 358 IRKVRAFKPLRVRRNVFNEVLLLFVSQRGKR 388
>gi|319952539|ref|YP_004163806.1| aspic/unbv domain protein [Cellulophaga algicola DSM 14237]
gi|319421199|gb|ADV48308.1| ASPIC/UnbV domain protein [Cellulophaga algicola DSM 14237]
Length = 1126
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 19/123 (15%)
Query: 471 VDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG---AVVAADINDDGKIEL 527
VD+DGD D++ + +G+ + ++G+ KFP ++ ++G + ADI++DG ++L
Sbjct: 849 VDIDGDNKPDLITVSDWGVPNIYKNNGRRLAKFPTDLDNLKGFWNTIETADIDNDGDLDL 908
Query: 528 VTTDTHGNV--AAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGH-SDVVVPTLSGNIYVL 584
+ + NV A + +W I D D +G + + SGN Y L
Sbjct: 909 IIGNQGSNVPYKATNSNPMKMW-------------INDFDSNGTIEQIFTQSFSGNDYPL 955
Query: 585 SGK 587
K
Sbjct: 956 HQK 958
>gi|255035443|ref|YP_003086064.1| TonB-dependent receptor plug [Dyadobacter fermentans DSM 18053]
gi|254948199|gb|ACT92899.1| TonB-dependent receptor plug [Dyadobacter fermentans DSM 18053]
Length = 1122
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 27/106 (25%), Positives = 47/106 (44%), Gaps = 9/106 (8%)
Query: 439 NLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLD----ILVGTSFGLFYVLD 494
N+ K ++WTTD ++S D + + ++ G G + +VG G+ Y +D
Sbjct: 849 NIQRKALRWTTDFNISFDRNEVVSLGNKTSKLITGPGSGVIGGSHITIVGQPIGMLYGMD 908
Query: 495 HHGKIR-----EKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGN 535
H G + ++FP G V D+++DG I + GN
Sbjct: 909 HLGVYKDQADFDRFPKHTTSQVGTVKFRDVDNDGAITVNDATIIGN 954
>gi|167901282|ref|ZP_02488487.1| Rhs family protein [Burkholderia pseudomallei NCTC 13177]
Length = 2031
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|134279640|ref|ZP_01766352.1| FG-GAP repeat/YD repeat/RHS repeat protein [Burkholderia
pseudomallei 305]
gi|134248840|gb|EBA48922.1| FG-GAP repeat/YD repeat/RHS repeat protein [Burkholderia
pseudomallei 305]
Length = 2031
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|67642373|ref|ZP_00441130.1| FG-GAP/YD repeat domain protein [Burkholderia mallei GB8 horse 4]
gi|238523513|gb|EEP86951.1| FG-GAP/YD repeat domain protein [Burkholderia mallei GB8 horse 4]
Length = 1827
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 231 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 288
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 289 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 345
Query: 570 SDVV 573
+D+V
Sbjct: 346 ADLV 349
>gi|53724907|ref|YP_101869.1| hypothetical protein BMA0008 [Burkholderia mallei ATCC 23344]
gi|52428330|gb|AAU48923.1| FG-GAP/YD repeat domain protein [Burkholderia mallei ATCC 23344]
Length = 1806
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 210 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 267
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 268 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 324
Query: 570 SDVV 573
+D+V
Sbjct: 325 ADLV 328
>gi|333895469|ref|YP_004469344.1| FG-GAP repeat-containing protein [Alteromonas sp. SN2]
gi|332995487|gb|AEF05542.1| FG-GAP repeat-containing protein [Alteromonas sp. SN2]
Length = 379
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 51/96 (53%), Gaps = 6/96 (6%)
Query: 505 LEMAEIQGAVVAADIN-DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS--- 560
+E+ Q +++A+DIN DD + ++ + N+ + ++GKG + PS
Sbjct: 39 IEVGAGQPSIIASDINADDNQDVIIANASDNNIITYLSDGKGTLNRGGIFAAGDSPSGLA 98
Query: 561 IGDVDGDGHSDVVVPTL-SGNIYVLSGK-DGSKVRP 594
I D++ DGH DVVV + I VL GK DG+ V P
Sbjct: 99 ISDINADGHVDVVVANHETSYITVLFGKGDGTFVEP 134
>gi|351731714|ref|ZP_08949405.1| hypothetical protein AradN_18155 [Acidovorax radicis N35]
Length = 858
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 55/209 (26%), Positives = 91/209 (43%), Gaps = 46/209 (22%)
Query: 463 YIYSSPTVVDLDGDGNLDILV---------GTS---------FGLFYVLDHHGKIREKFP 504
Y S+P + DL+ DG LDI+V GTS FG F ++ H
Sbjct: 153 YGASNPMLADLNHDGRLDIVVAVYDPGNFGGTSYFEVLQNIGFGNFLLVGQHEVTYSGLS 212
Query: 505 LEMAEIQGAVVAADINDDGKIELVTTDT-HGNVAAW-TAEGKGIWEQHLKSLVTQGPSIG 562
+ + + V ADI+ DG+ +LV +D G +W G G + + ++ G S+G
Sbjct: 213 MNKSVREAQV--ADIDGDGEPDLVFSDAVSGGTVSWLKGAGNGSFAPQAQ-ILGAGGSVG 269
Query: 563 --------DVDGDGHSDVVVPTLSG-NIYVLSGKDGSKVRPYP--YRT-----HGRVMNQ 606
D+DGDG D+++ + G +++ G P + T GR M +
Sbjct: 270 NIGDIEVADIDGDGKLDLLMQNIYGQGVWLAKGVGNGIFLPSTKLFTTGVSNGGGRDMGR 329
Query: 607 VLLVDLTKRGEKSKGLTIVTTSFDGYLYL 635
V++ D+ G L +++ DGY+ L
Sbjct: 330 VIVADVNGDGR----LDVIS---DGYVGL 351
>gi|237810809|ref|YP_002895260.1| FG-GAP repeat/YD repeat/RHS repeat protein [Burkholderia
pseudomallei MSHR346]
gi|237503763|gb|ACQ96081.1| FG-GAP repeat/YD repeat/RHS repeat protein [Burkholderia
pseudomallei MSHR346]
Length = 2031
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|121598457|ref|YP_994149.1| FG-GAP/YD repeat-containing protein [Burkholderia mallei SAVP1]
gi|167001188|ref|ZP_02266987.1| FG-GAP/YD repeat domain protein [Burkholderia mallei PRL-20]
gi|121227267|gb|ABM49785.1| FG-GAP/YD repeat domain protein [Burkholderia mallei SAVP1]
gi|243063014|gb|EES45200.1| FG-GAP/YD repeat domain protein [Burkholderia mallei PRL-20]
Length = 1825
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 229 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 286
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 287 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 343
Query: 570 SDVV 573
+D+V
Sbjct: 344 ADLV 347
>gi|124383517|ref|YP_001028195.1| hypothetical protein BMA10229_A2231 [Burkholderia mallei NCTC
10229]
Length = 1825
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 229 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 286
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G + W A G W+ + +T P+ G DVDGDG
Sbjct: 287 GDGLMDIVRV-TGGGLTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 343
Query: 570 SDVV 573
+D+V
Sbjct: 344 ADLV 347
>gi|444706043|gb|ELW47406.1| Integrin alpha-8 [Tupaia chinensis]
Length = 865
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 53/108 (49%), Gaps = 15/108 (13%)
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNL 440
S+ T V++D++NDG+ +++I F + E+ NP +E+G I + Y+ + + F
Sbjct: 232 SYFGYTVVVSDVNNDGMDDILIGAPLFMEREFESNP---REVGQIYL--YLQVSSLFFR- 285
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
D + T +F + + + DL+ DG DI +G FG
Sbjct: 286 ---------DPQILTGTETFGRFGSAVAHLGDLNQDGYNDIAIGVPFG 324
>gi|359777240|ref|ZP_09280530.1| subtilisin family peptidase [Arthrobacter globiformis NBRC 12137]
gi|359305572|dbj|GAB14359.1| subtilisin family peptidase [Arthrobacter globiformis NBRC 12137]
Length = 735
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 52/122 (42%), Gaps = 13/122 (10%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGK----IREKFPLEMAEIQGAVVAADINDDGKIEL 527
D GDG D+L + G ++ +G R++ A D N DGK ++
Sbjct: 545 DFTGDGRADVLARDAAGALWLYPGNGAGGWLARKQVGSGWNVFSAVTGAGDFNVDGKADV 604
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLK-------SLVTQGPSIGDVDGDGHSDVVVPTLSGN 580
+ DT G + +T G+G W +K GP GDV GD +DV+ +G
Sbjct: 605 LARDTTGALFLYTGNGRGGWLAKIKVGSGWNGMSAIVGP--GDVTGDRKADVLARDSAGV 662
Query: 581 IY 582
++
Sbjct: 663 LW 664
>gi|343083886|ref|YP_004773181.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342352420|gb|AEL24950.1| FG-GAP repeat protein [Cyclobacterium marinum DSM 745]
Length = 376
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 57/115 (49%), Gaps = 10/115 (8%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREKFPLEMAEIQGA--VVAADINDDGKIEL 527
+D++GDG D + G FG Y +++ G +E E+ A D+++DG +++
Sbjct: 85 MDVNGDGKTDYVTGGWFGKTLYWVENPGSNKEWAVHEITNPGNVETTRAWDVDNDGHLDI 144
Query: 528 VTTDTHGNVAAWT----AEGKGIWEQHLKSLVTQ---GPSIGDVDGDGHSDVVVP 575
V + +A +T A GKG E S+ + G GD++GDG D ++P
Sbjct: 145 VPNTPNQPLAYYTLNRDANGKGTGEFEKHSVTEKHGHGLGFGDINGDGRGDFIIP 199
>gi|149922443|ref|ZP_01910876.1| Na-Ca exchanger/integrin-beta4 [Plesiocystis pacifica SIR-1]
gi|149816723|gb|EDM76214.1| Na-Ca exchanger/integrin-beta4 [Plesiocystis pacifica SIR-1]
Length = 415
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 57/139 (41%), Gaps = 13/139 (9%)
Query: 468 PTVV---DLDGDGNLDIL---VGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIND 521
PT V DL GDG +D++ + SF + G E + + A DIN
Sbjct: 87 PTTVVAADLSGDGEVDLVASSISASFTRVLLGQGGGAFSEPVSSTLGTGVRHLAAGDINL 146
Query: 522 DGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS-----IGDVDGDGHSDVVVPT 576
DG +LV G A G G + G + IGD+DGDG DVV
Sbjct: 147 DGHADLVVAKVLGATGTGVAPGDGTGSFGAAPIANTGNTPRAVVIGDLDGDGWLDVVTAD 206
Query: 577 LSG-NIYVLSGKDGSKVRP 594
+G +YVL DG+ P
Sbjct: 207 ETGATLYVLL-SDGAGAIP 224
>gi|333995433|ref|YP_004528046.1| hypothetical protein TREAZ_3422 [Treponema azotonutricium ZAS-9]
gi|333737117|gb|AEF83066.1| hypothetical protein TREAZ_3422 [Treponema azotonutricium ZAS-9]
Length = 877
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 37/81 (45%)
Query: 49 DEDALVNTQCPKNLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDG 108
DED V+T K +W T S+++ + P + + SFL + V + S G
Sbjct: 643 DEDGSVHTIDDKASAAKWGTAYSTALRSPPSFLTVQNKIYAASYPKSFLGEIWVQDASSG 702
Query: 109 DKMPGWPAFHQSSVHSSPLLY 129
+PGWPA+ SPL +
Sbjct: 703 AALPGWPAYVSGIAFGSPLPF 723
>gi|441496987|ref|ZP_20979212.1| hypothetical protein C900_01220 [Fulvivirga imtechensis AK7]
gi|441439227|gb|ELR72546.1| hypothetical protein C900_01220 [Fulvivirga imtechensis AK7]
Length = 1075
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 67/134 (50%), Gaps = 24/134 (17%)
Query: 470 VVDLDGDGNLDILVG----TSFGLFYVLDHH--GKI------REKFPLEMAEIQGAVVAA 517
V DL+ DG DI+V + G+F + G I ++ F ++ ++ AA
Sbjct: 310 VADLNDDGKPDIVVSEFNTNNAGIFMAENTSSLGAISFNSFHQKTFSGTLSNLK----AA 365
Query: 518 DINDDGKIELVTTDTHGNVAAW----TAEGKGIWEQHLKSLVTQGP----SIGDVDGDGH 569
D+N+DGK++++ T + A + +++G I S+ T G +IGD+DGDG
Sbjct: 366 DLNNDGKLDIIATRFLSSSAVFLLNESSKGGSITFGSPNSVPTDGLPWGLNIGDMDGDGK 425
Query: 570 SDVVVPTLSGNIYV 583
D++V TL + V
Sbjct: 426 EDILVATLGSGLAV 439
>gi|303286431|ref|XP_003062505.1| bardet-biedl syndrome 2 [Micromonas pusilla CCMP1545]
gi|226456022|gb|EEH53324.1| bardet-biedl syndrome 2 [Micromonas pusilla CCMP1545]
Length = 729
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 81/195 (41%), Gaps = 37/195 (18%)
Query: 370 HEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIA--------------VSYFFDHEYYDN 415
HE E Y V ++ +AD+D DG++E+ + VS + E N
Sbjct: 147 HEGEEAYWTVSGDNVNAMTLADVDGDGLNELFVCGDDFTIRLFRDEEVVSEITESERVIN 206
Query: 416 PEHLKELGGIDIGKYVA-GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLD 474
HLK G G +A G + V+ D Q +W + + N Y DLD
Sbjct: 207 VCHLK---GERFGYALANGTVGVY--DGTQRQWR----VKSKNQVTALCAY------DLD 251
Query: 475 GDGNLDILVGTSFGLFYVLDHH-GKI--REKFPLEMAEIQGAVVAADINDDGKIELVTTD 531
GDG ++++G S GL V + G++ ++ F +A A AD DG +L+
Sbjct: 252 GDGERELVIGWSSGLVAVRNESDGELVYKDTFDASIA----AFAMADYRMDGTTDLIACA 307
Query: 532 THGNVAAWTAEGKGI 546
G V + ++ I
Sbjct: 308 VDGEVRGYRSQAGQI 322
>gi|326444995|ref|ZP_08219729.1| putative secreted esterase [Streptomyces clavuligerus ATCC 27064]
Length = 464
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 45/93 (48%), Gaps = 19/93 (20%)
Query: 514 VVAADINDDGKIELVTTDTHGNVAAW--TAEGK-----------GIWEQHLKSLVTQGPS 560
+ AAD N DG+ ++ T T GN+ A+ TA+G G W + + L
Sbjct: 215 MAAADFNGDGRPDVATVLTDGNLHAYYTTADGTLQYGRELWKHDGSWGKKTQLLA----- 269
Query: 561 IGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVR 593
GD +GDGH D+ +G++++ G ++R
Sbjct: 270 -GDFNGDGHGDIAATNATGDLHLYPGTKSGQLR 301
>gi|410666280|ref|YP_006918651.1| FG-GAP repeat-containing protein [Simiduia agarivorans SA1 = DSM
21679]
gi|409028637|gb|AFV00922.1| FG-GAP repeat-containing protein [Simiduia agarivorans SA1 = DSM
21679]
Length = 379
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 74/161 (45%), Gaps = 12/161 (7%)
Query: 465 YSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP-LEMAEIQGAVVAADINDDG 523
Y V D++GDG DI+ + ++ + L +E A+ AD+N DG
Sbjct: 197 YRGFAVDDINGDGAPDIVTPNEANIGIAINAGADMAFSLTTLNQSEPPFAIALADMNGDG 256
Query: 524 KIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGP-----SIGDVDGDGHSDVVVPTLS 578
++V ++ +G+G + K+L+ +IGDV+GD D++V + S
Sbjct: 257 NPDVVAATNRDSLTVIPGDGQGTFITSKKTLIKTASGAKQIAIGDVNGDKIDDLLVSSWS 316
Query: 579 GN-IYVLSGKDGSKVRPYPYRTHGRVMNQ--VLLVDLTKRG 616
G + V+ GK + ++ Y ++ H + N + L DL G
Sbjct: 317 GEVVLVIGGK--AALQTYSFK-HSDIPNPWGIALDDLNDDG 354
>gi|148676090|gb|EDL08037.1| integrin alpha 8, isoform CRA_a [Mus musculus]
Length = 1024
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 57/119 (47%), Gaps = 18/119 (15%)
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNL 440
S+ T V++D++NDG+ ++++ F + E+ NP +E+G + + Y+ + ++F
Sbjct: 318 SYFGYTVVVSDVNNDGMDDILVGAPLFMEREFESNP---REVGQVYL--YLQASALLFQ- 371
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKI 499
D + T +F + S + DL+ DG DI +G F D GK+
Sbjct: 372 ---------DPQVLTGTETFGRFGSSVAHLGDLNQDGYNDIAIGVPFA---GKDQRGKV 418
>gi|310824207|ref|YP_003956565.1| FG-GAP repeat/HVR domain-containing protein [Stigmatella aurantiaca
DW4/3-1]
gi|309397279|gb|ADO74738.1| FG-GAP repeat/HVR domain protein [Stigmatella aurantiaca DW4/3-1]
Length = 950
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 62/139 (44%), Gaps = 13/139 (9%)
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDI-LVGTSFGLFYVLDHHGKI 499
D QV WT + SP + D D DG +I L GT+ + VLD G +
Sbjct: 306 DNCQVLWTVPVPGGG--------HGGSPNIGDFDNDGVPEIGLPGTN--AYSVLDSDGTL 355
Query: 500 REKFPL-EMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWE-QHLKSLVTQ 557
K P E++ + A D DDGK+E++ D A G+ +E H S +
Sbjct: 356 LWKRPTQELSSGKTGSTAFDFEDDGKLEIIYADEVRLRIYDGATGQVRFETAHSSSTTHE 415
Query: 558 GPSIGDVDGDGHSDVVVPT 576
P I DVD D +++VV T
Sbjct: 416 NPVIADVDSDFAAELVVAT 434
>gi|47717127|ref|NP_001001309.1| integrin alpha-8 precursor [Mus musculus]
gi|156630937|sp|A2ARA8.1|ITA8_MOUSE RecName: Full=Integrin alpha-8; Contains: RecName: Full=Integrin
alpha-8 heavy chain; Contains: RecName: Full=Integrin
alpha-8 light chain; Flags: Precursor
Length = 1062
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 57/119 (47%), Gaps = 18/119 (15%)
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNL 440
S+ T V++D++NDG+ ++++ F + E+ NP +E+G + + Y+ + ++F
Sbjct: 318 SYFGYTVVVSDVNNDGMDDILVGAPLFMEREFESNP---REVGQVYL--YLQASALLFQ- 371
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKI 499
D + T +F + S + DL+ DG DI +G F D GK+
Sbjct: 372 ---------DPQVLTGTETFGRFGSSVAHLGDLNQDGYNDIAIGVPFA---GKDQRGKV 418
>gi|3098480|gb|AAC15665.1| integrin alpha8 [Mus musculus]
Length = 1012
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 57/119 (47%), Gaps = 18/119 (15%)
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNL 440
S+ T V++D++NDG+ ++++ F + E+ NP +E+G + + Y+ + ++F
Sbjct: 274 SYFGYTVVVSDVNNDGMDDILVGAPLFMEREFESNP---REVGQVYL--YLQASALLFQ- 327
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKI 499
D + T +F + S + DL+ DG DI +G F D GK+
Sbjct: 328 ---------DPQVLTGTETFGRFGSSVAHLGDLNQDGYNDIAIGVPFA---GKDQRGKV 374
>gi|254389161|ref|ZP_05004390.1| esterase [Streptomyces clavuligerus ATCC 27064]
gi|197702877|gb|EDY48689.1| esterase [Streptomyces clavuligerus ATCC 27064]
Length = 516
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 45/93 (48%), Gaps = 19/93 (20%)
Query: 514 VVAADINDDGKIELVTTDTHGNVAAW--TAEGK-----------GIWEQHLKSLVTQGPS 560
+ AAD N DG+ ++ T T GN+ A+ TA+G G W + + L
Sbjct: 267 MAAADFNGDGRPDVATVLTDGNLHAYYTTADGTLQYGRELWKHDGSWGKKTQLLA----- 321
Query: 561 IGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVR 593
GD +GDGH D+ +G++++ G ++R
Sbjct: 322 -GDFNGDGHGDIAATNATGDLHLYPGTKSGQLR 353
>gi|453051603|gb|EME99105.1| N-acetylmuramoyl-L-alanine amidase [Streptomyces mobaraensis NBRC
13819 = DSM 40847]
Length = 310
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 62/162 (38%), Gaps = 29/162 (17%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKI---------REKFPLEMAEIQGAVVAADINDD 522
+L G DIL G+ +V GK+ R++ + D+ D
Sbjct: 154 ELGGTYGSDILTRDKQGVLWVHQADGKLGRDYAKLAPRKRVGGGWGQFTAITGRGDLTGD 213
Query: 523 GKIELVTTDTHGNVAAWTAEG------------KGIWEQHLKSLVTQGPSIGDVDGDGHS 570
G+ ++V D G + + G G W Q K + T GDV+GDGH+
Sbjct: 214 GRTDIVARDKSGTLWLYKGTGDINKPFASRTKVGGGWNQFTKLVAT-----GDVNGDGHA 268
Query: 571 DVVVPTLSGNIYVLSGKDGSKVRPYPYRTH--GRVMNQVLLV 610
D++ G +++ G G PY R G NQ LV
Sbjct: 269 DLLAVDGKGALWLYKGT-GKASSPYQSRVKIDGSGWNQYRLV 309
>gi|416408859|ref|ZP_11688406.1| hypothetical protein CWATWH0003_5159t1, partial [Crocosphaera
watsonii WH 0003]
gi|357260714|gb|EHJ10083.1| hypothetical protein CWATWH0003_5159t1, partial [Crocosphaera
watsonii WH 0003]
Length = 2699
Score = 40.4 bits (93), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 58/139 (41%), Gaps = 17/139 (12%)
Query: 384 LSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTK 443
+TP D+DNDG +++++ YY+N + G VVF T
Sbjct: 347 FNTPTFGDVDNDGDADLLVGNPSGI-IRYYENTGTIDTNGN-----------VVFEEKTG 394
Query: 444 QVKWTTDLDLSTDNASF----RAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKI 499
D+++ DN S I S PT+ D+D DG+LD+ +G G+ ++ G
Sbjct: 395 TDNPFADINIEADNTSTNNDNEPIIDSVPTLADIDNDGDLDLFIGERGGVINYFENLG-F 453
Query: 500 REKFPLEMAEIQGAVVAAD 518
E L G VA D
Sbjct: 454 DENGQLTFIAQNGPDVALD 472
>gi|167536015|ref|XP_001749680.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771828|gb|EDQ85489.1| predicted protein [Monosiga brevicollis MX1]
Length = 1811
Score = 40.4 bits (93), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 58/121 (47%), Gaps = 11/121 (9%)
Query: 470 VVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFPL--EMAEIQGAVVAADINDDGK 524
+ DL+GDG LD++ ++ +++ H + + E ++G V AAD+N DG
Sbjct: 235 LADLNGDGALDVIAASTNNNAVVWFQNQGHAQFSSARTIDNEATLVEG-VAAADLNGDGA 293
Query: 525 IELVTTDTHGNVAAWTAE-GKGIW--EQHLKSLVTQGPSI--GDVDGDGHSDVVVPTLSG 579
+++V+ AW G G + Q + +T S+ D+D DGH D++ +
Sbjct: 294 LDIVSAHQGSGTVAWYQNLGGGAFGPAQIVTQSLTGASSVAAADLDADGHVDLLAASYQS 353
Query: 580 N 580
N
Sbjct: 354 N 354
>gi|153213067|ref|ZP_01948605.1| hypothetical protein A55_2093 [Vibrio cholerae 1587]
gi|124116114|gb|EAY34934.1| hypothetical protein A55_2093 [Vibrio cholerae 1587]
Length = 691
Score = 40.4 bits (93), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|403526072|ref|YP_006660959.1| FG-GAP repeat domain-containing protein [Arthrobacter sp. Rue61a]
gi|403228499|gb|AFR27921.1| FG-GAP repeat domain protein [Arthrobacter sp. Rue61a]
Length = 972
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 74/186 (39%), Gaps = 21/186 (11%)
Query: 468 PTVVDLDGDGNLDILVGTSFGLFYVLDHHGK---IREKFPLEM---AEIQGAVVAADIND 521
P+ D +GDG D+L + G Y+ G + + L + + A+ D N
Sbjct: 719 PSAHDFNGDGKADVLASDTAGNLYLYPGDGAGGVLTRQVALAAPAWSTVNEAITPGDFNR 778
Query: 522 DGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSI-------GDVDGDGHSDVVV 574
DGK +L+ G+V W G G+ + V+ G S GD GDG D +
Sbjct: 779 DGKPDLLA--RAGDV-LWFYPGDGVGSFGARVQVSTGWSTMSQLFSPGDFSGDGIPDFIG 835
Query: 575 PTLSGNIYVLSGKD-GSKVRPYPYRTHGRVMNQVLLV-DLTKRGEKSKGLTIVTTSFDGY 632
SG + + G G + P T VMN +L D G KG + S G
Sbjct: 836 RLSSGELRLYEGNGTGGQRTPTTIGTGWNVMNAILSTGDFNGDG---KGDVLARRSDTGA 892
Query: 633 LYLIDG 638
L+L G
Sbjct: 893 LWLYPG 898
>gi|297579405|ref|ZP_06941333.1| conserved hypothetical protein [Vibrio cholerae RC385]
gi|297536999|gb|EFH75832.1| conserved hypothetical protein [Vibrio cholerae RC385]
Length = 691
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|421329120|ref|ZP_15779630.1| FG-GAP repeat family protein [Vibrio cholerae CP1042(15)]
gi|395927654|gb|EJH38417.1| FG-GAP repeat family protein [Vibrio cholerae CP1042(15)]
Length = 685
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 119 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 178
Query: 529 TTD 531
D
Sbjct: 179 AAD 181
>gi|15641890|ref|NP_231522.1| hemolysin-like protein [Vibrio cholerae O1 biovar El Tor str.
N16961]
gi|121587293|ref|ZP_01677065.1| hypothetical protein VC274080_1930 [Vibrio cholerae 2740-80]
gi|147675607|ref|YP_001217421.1| hypothetical protein VC0395_A1478 [Vibrio cholerae O395]
gi|227082018|ref|YP_002810569.1| hemolysin-like protein [Vibrio cholerae M66-2]
gi|227118336|ref|YP_002820232.1| hemolysin-related protein [Vibrio cholerae O395]
gi|229508014|ref|ZP_04397519.1| hemolysin-related protein Vcp [Vibrio cholerae BX 330286]
gi|229511747|ref|ZP_04401226.1| hemolysin-related protein Vcp [Vibrio cholerae B33]
gi|229518885|ref|ZP_04408328.1| hemolysin-related protein Vcp [Vibrio cholerae RC9]
gi|229607561|ref|YP_002878209.1| hemolysin-related protein Vcp [Vibrio cholerae MJ-1236]
gi|254848975|ref|ZP_05238325.1| hemolysin-related protein [Vibrio cholerae MO10]
gi|255745352|ref|ZP_05419301.1| hemolysin-related protein Vcp [Vibrio cholera CIRS 101]
gi|262153544|ref|ZP_06028673.1| hemolysin-related protein Vcp [Vibrio cholerae INDRE 91/1]
gi|262167444|ref|ZP_06035151.1| hemolysin-related protein Vcp [Vibrio cholerae RC27]
gi|298498073|ref|ZP_07007880.1| conserved hypothetical protein [Vibrio cholerae MAK 757]
gi|360035771|ref|YP_004937534.1| hemolysin-related protein [Vibrio cholerae O1 str. 2010EL-1786]
gi|379741723|ref|YP_005333692.1| hemolysin-like protein [Vibrio cholerae IEC224]
gi|417813932|ref|ZP_12460585.1| FG-GAP repeat family protein [Vibrio cholerae HC-49A2]
gi|417817669|ref|ZP_12464298.1| FG-GAP repeat family protein [Vibrio cholerae HCUF01]
gi|418334907|ref|ZP_12943821.1| FG-GAP repeat family protein [Vibrio cholerae HC-06A1]
gi|418338524|ref|ZP_12947418.1| FG-GAP repeat family protein [Vibrio cholerae HC-23A1]
gi|418346442|ref|ZP_12951204.1| FG-GAP repeat family protein [Vibrio cholerae HC-28A1]
gi|418350204|ref|ZP_12954935.1| FG-GAP repeat family protein [Vibrio cholerae HC-43A1]
gi|418356111|ref|ZP_12958830.1| FG-GAP repeat family protein [Vibrio cholerae HC-61A1]
gi|419826867|ref|ZP_14350366.1| PQQ enzyme repeat family protein [Vibrio cholerae CP1033(6)]
gi|421317790|ref|ZP_15768358.1| FG-GAP repeat family protein [Vibrio cholerae CP1032(5)]
gi|421321659|ref|ZP_15772212.1| FG-GAP repeat family protein [Vibrio cholerae CP1038(11)]
gi|421325458|ref|ZP_15775982.1| FG-GAP repeat family protein [Vibrio cholerae CP1041(14)]
gi|421333027|ref|ZP_15783504.1| FG-GAP repeat family protein [Vibrio cholerae CP1046(19)]
gi|421340047|ref|ZP_15790479.1| FG-GAP repeat family protein [Vibrio cholerae HC-20A2]
gi|421348161|ref|ZP_15798538.1| FG-GAP repeat family protein [Vibrio cholerae HC-46A1]
gi|422896994|ref|ZP_16934444.1| FG-GAP repeat family protein [Vibrio cholerae HC-40A1]
gi|422903196|ref|ZP_16938172.1| FG-GAP repeat family protein [Vibrio cholerae HC-48A1]
gi|422907080|ref|ZP_16941884.1| FG-GAP repeat family protein [Vibrio cholerae HC-70A1]
gi|422913927|ref|ZP_16948433.1| FG-GAP repeat family protein [Vibrio cholerae HFU-02]
gi|422926133|ref|ZP_16959147.1| FG-GAP repeat family protein [Vibrio cholerae HC-38A1]
gi|423145452|ref|ZP_17133046.1| FG-GAP repeat family protein [Vibrio cholerae HC-19A1]
gi|423150128|ref|ZP_17137442.1| FG-GAP repeat family protein [Vibrio cholerae HC-21A1]
gi|423153948|ref|ZP_17141129.1| FG-GAP repeat family protein [Vibrio cholerae HC-22A1]
gi|423157032|ref|ZP_17144125.1| FG-GAP repeat family protein [Vibrio cholerae HC-32A1]
gi|423160602|ref|ZP_17147542.1| FG-GAP repeat family protein [Vibrio cholerae HC-33A2]
gi|423165422|ref|ZP_17152151.1| FG-GAP repeat family protein [Vibrio cholerae HC-48B2]
gi|423731439|ref|ZP_17704742.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-17A1]
gi|423768162|ref|ZP_17712864.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-50A2]
gi|423895330|ref|ZP_17727077.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-62A1]
gi|423930768|ref|ZP_17731471.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-77A1]
gi|424002883|ref|ZP_17745958.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-17A2]
gi|424006672|ref|ZP_17749642.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-37A1]
gi|424024653|ref|ZP_17764304.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-62B1]
gi|424027538|ref|ZP_17767141.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-69A1]
gi|424586811|ref|ZP_18026390.1| FG-GAP repeat family protein [Vibrio cholerae CP1030(3)]
gi|424595457|ref|ZP_18034778.1| FG-GAP repeat family protein [Vibrio cholerae CP1040(13)]
gi|424599374|ref|ZP_18038555.1| FG-GAP repeat family protein [Vibrio Cholerae CP1044(17)]
gi|424602095|ref|ZP_18041237.1| FG-GAP repeat family protein [Vibrio cholerae CP1047(20)]
gi|424607065|ref|ZP_18046009.1| FG-GAP repeat family protein [Vibrio cholerae CP1050(23)]
gi|424610888|ref|ZP_18049727.1| FG-GAP repeat family protein [Vibrio cholerae HC-39A1]
gi|424613701|ref|ZP_18052489.1| FG-GAP repeat family protein [Vibrio cholerae HC-41A1]
gi|424617681|ref|ZP_18056353.1| FG-GAP repeat family protein [Vibrio cholerae HC-42A1]
gi|424622461|ref|ZP_18060969.1| FG-GAP repeat family protein [Vibrio cholerae HC-47A1]
gi|424645424|ref|ZP_18083160.1| FG-GAP repeat family protein [Vibrio cholerae HC-56A2]
gi|424653193|ref|ZP_18090573.1| FG-GAP repeat family protein [Vibrio cholerae HC-57A2]
gi|424657014|ref|ZP_18094299.1| FG-GAP repeat family protein [Vibrio cholerae HC-81A2]
gi|440710090|ref|ZP_20890741.1| hemolysin-related protein Vcp [Vibrio cholerae 4260B]
gi|443504250|ref|ZP_21071208.1| FG-GAP repeat family protein [Vibrio cholerae HC-64A1]
gi|443508148|ref|ZP_21074911.1| FG-GAP repeat family protein [Vibrio cholerae HC-65A1]
gi|443511990|ref|ZP_21078628.1| FG-GAP repeat family protein [Vibrio cholerae HC-67A1]
gi|443515548|ref|ZP_21082059.1| FG-GAP repeat family protein [Vibrio cholerae HC-68A1]
gi|443519342|ref|ZP_21085738.1| FG-GAP repeat family protein [Vibrio cholerae HC-71A1]
gi|443524232|ref|ZP_21090445.1| FG-GAP repeat family protein [Vibrio cholerae HC-72A2]
gi|443531829|ref|ZP_21097843.1| FG-GAP repeat family protein [Vibrio cholerae HC-7A1]
gi|443535627|ref|ZP_21101505.1| FG-GAP repeat family protein [Vibrio cholerae HC-80A1]
gi|443539173|ref|ZP_21105027.1| FG-GAP repeat family protein [Vibrio cholerae HC-81A1]
gi|449055683|ref|ZP_21734351.1| Hemolysin-related protein Vcp [Vibrio cholerae O1 str. Inaba G4222]
gi|9656420|gb|AAF95036.1| hemolysin-related protein [Vibrio cholerae O1 biovar El Tor str.
N16961]
gi|121548453|gb|EAX58511.1| hypothetical protein VC274080_1930 [Vibrio cholerae 2740-80]
gi|146317490|gb|ABQ22029.1| hypothetical protein VC0395_A1478 [Vibrio cholerae O395]
gi|227009906|gb|ACP06118.1| hemolysin-related protein [Vibrio cholerae M66-2]
gi|227013786|gb|ACP09996.1| hemolysin-related protein [Vibrio cholerae O395]
gi|229343574|gb|EEO08549.1| hemolysin-related protein Vcp [Vibrio cholerae RC9]
gi|229351712|gb|EEO16653.1| hemolysin-related protein Vcp [Vibrio cholerae B33]
gi|229355519|gb|EEO20440.1| hemolysin-related protein Vcp [Vibrio cholerae BX 330286]
gi|229370216|gb|ACQ60639.1| hemolysin-related protein Vcp [Vibrio cholerae MJ-1236]
gi|254844680|gb|EET23094.1| hemolysin-related protein [Vibrio cholerae MO10]
gi|255737182|gb|EET92578.1| hemolysin-related protein Vcp [Vibrio cholera CIRS 101]
gi|262024141|gb|EEY42835.1| hemolysin-related protein Vcp [Vibrio cholerae RC27]
gi|262030671|gb|EEY49306.1| hemolysin-related protein Vcp [Vibrio cholerae INDRE 91/1]
gi|297542406|gb|EFH78456.1| conserved hypothetical protein [Vibrio cholerae MAK 757]
gi|340036418|gb|EGQ97394.1| FG-GAP repeat family protein [Vibrio cholerae HC-49A2]
gi|340037392|gb|EGQ98367.1| FG-GAP repeat family protein [Vibrio cholerae HCUF01]
gi|341621287|gb|EGS47033.1| FG-GAP repeat family protein [Vibrio cholerae HC-70A1]
gi|341621430|gb|EGS47175.1| FG-GAP repeat family protein [Vibrio cholerae HC-48A1]
gi|341622355|gb|EGS48018.1| FG-GAP repeat family protein [Vibrio cholerae HC-40A1]
gi|341637588|gb|EGS62266.1| FG-GAP repeat family protein [Vibrio cholerae HFU-02]
gi|341646339|gb|EGS70453.1| FG-GAP repeat family protein [Vibrio cholerae HC-38A1]
gi|356417616|gb|EHH71231.1| FG-GAP repeat family protein [Vibrio cholerae HC-06A1]
gi|356418488|gb|EHH72085.1| FG-GAP repeat family protein [Vibrio cholerae HC-21A1]
gi|356423062|gb|EHH76523.1| FG-GAP repeat family protein [Vibrio cholerae HC-19A1]
gi|356428508|gb|EHH81734.1| FG-GAP repeat family protein [Vibrio cholerae HC-22A1]
gi|356430166|gb|EHH83375.1| FG-GAP repeat family protein [Vibrio cholerae HC-23A1]
gi|356433521|gb|EHH86710.1| FG-GAP repeat family protein [Vibrio cholerae HC-28A1]
gi|356439689|gb|EHH92654.1| FG-GAP repeat family protein [Vibrio cholerae HC-32A1]
gi|356444700|gb|EHH97509.1| FG-GAP repeat family protein [Vibrio cholerae HC-43A1]
gi|356445699|gb|EHH98501.1| FG-GAP repeat family protein [Vibrio cholerae HC-33A2]
gi|356450943|gb|EHI03648.1| FG-GAP repeat family protein [Vibrio cholerae HC-48B2]
gi|356452609|gb|EHI05288.1| FG-GAP repeat family protein [Vibrio cholerae HC-61A1]
gi|356646925|gb|AET26980.1| hemolysin-related protein [Vibrio cholerae O1 str. 2010EL-1786]
gi|378795233|gb|AFC58704.1| hemolysin-related protein [Vibrio cholerae IEC224]
gi|395916048|gb|EJH26878.1| FG-GAP repeat family protein [Vibrio cholerae CP1032(5)]
gi|395917296|gb|EJH28124.1| FG-GAP repeat family protein [Vibrio cholerae CP1041(14)]
gi|395918653|gb|EJH29477.1| FG-GAP repeat family protein [Vibrio cholerae CP1038(11)]
gi|395928429|gb|EJH39182.1| FG-GAP repeat family protein [Vibrio cholerae CP1046(19)]
gi|395939330|gb|EJH50012.1| FG-GAP repeat family protein [Vibrio cholerae HC-20A2]
gi|395942740|gb|EJH53416.1| FG-GAP repeat family protein [Vibrio cholerae HC-46A1]
gi|395958887|gb|EJH69342.1| FG-GAP repeat family protein [Vibrio cholerae HC-56A2]
gi|395959492|gb|EJH69920.1| FG-GAP repeat family protein [Vibrio cholerae HC-57A2]
gi|395962138|gb|EJH72439.1| FG-GAP repeat family protein [Vibrio cholerae HC-42A1]
gi|395970850|gb|EJH80567.1| FG-GAP repeat family protein [Vibrio cholerae HC-47A1]
gi|395973453|gb|EJH83012.1| FG-GAP repeat family protein [Vibrio cholerae CP1030(3)]
gi|395975836|gb|EJH85310.1| FG-GAP repeat family protein [Vibrio cholerae CP1047(20)]
gi|408007210|gb|EKG45303.1| FG-GAP repeat family protein [Vibrio cholerae HC-39A1]
gi|408013110|gb|EKG50855.1| FG-GAP repeat family protein [Vibrio cholerae HC-41A1]
gi|408032350|gb|EKG68936.1| FG-GAP repeat family protein [Vibrio cholerae CP1040(13)]
gi|408041757|gb|EKG77851.1| FG-GAP repeat family protein [Vibrio Cholerae CP1044(17)]
gi|408043240|gb|EKG79246.1| FG-GAP repeat family protein [Vibrio cholerae CP1050(23)]
gi|408053572|gb|EKG88577.1| FG-GAP repeat family protein [Vibrio cholerae HC-81A2]
gi|408607657|gb|EKK81060.1| PQQ enzyme repeat family protein [Vibrio cholerae CP1033(6)]
gi|408624061|gb|EKK97013.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-17A1]
gi|408633864|gb|EKL06156.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-50A2]
gi|408654200|gb|EKL25342.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-77A1]
gi|408655130|gb|EKL26255.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-62A1]
gi|408845280|gb|EKL85396.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-37A1]
gi|408846053|gb|EKL86165.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-17A2]
gi|408870152|gb|EKM09432.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-62B1]
gi|408878841|gb|EKM17834.1| PQQ enzyme repeat family protein [Vibrio cholerae HC-69A1]
gi|439974313|gb|ELP50490.1| hemolysin-related protein Vcp [Vibrio cholerae 4260B]
gi|443431195|gb|ELS73747.1| FG-GAP repeat family protein [Vibrio cholerae HC-64A1]
gi|443435090|gb|ELS81234.1| FG-GAP repeat family protein [Vibrio cholerae HC-65A1]
gi|443438973|gb|ELS88688.1| FG-GAP repeat family protein [Vibrio cholerae HC-67A1]
gi|443442958|gb|ELS96260.1| FG-GAP repeat family protein [Vibrio cholerae HC-68A1]
gi|443446760|gb|ELT03416.1| FG-GAP repeat family protein [Vibrio cholerae HC-71A1]
gi|443449566|gb|ELT09857.1| FG-GAP repeat family protein [Vibrio cholerae HC-72A2]
gi|443457219|gb|ELT24616.1| FG-GAP repeat family protein [Vibrio cholerae HC-7A1]
gi|443461167|gb|ELT32240.1| FG-GAP repeat family protein [Vibrio cholerae HC-80A1]
gi|443465273|gb|ELT39933.1| FG-GAP repeat family protein [Vibrio cholerae HC-81A1]
gi|448264722|gb|EMB01959.1| Hemolysin-related protein Vcp [Vibrio cholerae O1 str. Inaba G4222]
Length = 691
Score = 40.4 bits (93), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|397615138|gb|EJK63246.1| hypothetical protein THAOC_16112 [Thalassiosira oceanica]
Length = 870
Score = 40.4 bits (93), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 66/144 (45%), Gaps = 31/144 (21%)
Query: 469 TVVDLDGDGNLDILVGTS----FGLFYVLDHHGKIREKFPLEMAEIQGA--VVAADINDD 522
TV DLD DG+ DI+ +S +F +D G E + + +GA VVAAD+N D
Sbjct: 210 TVADLDNDGDPDIITASSGDNTIAVFKNIDK-GTFCEIKEVVDSNAKGARTVVAADLNGD 268
Query: 523 GKIELVTTDTHGNVAAW---------------TAEGKGIWEQHLKSLVTQGPS------- 560
G ++L + + AW T G ++ +K++++ G
Sbjct: 269 GLLDLASASKDDDTVAWYPNTMRNVVNHSTGATHRVGGYFDPSVKNVISNGTESTGAYSL 328
Query: 561 -IGDVDGDGHSDVVVPTLSGNIYV 583
DVD DG D+VV + +GN +V
Sbjct: 329 VAADVDLDGDQDLVVAS-NGNDHV 351
>gi|229520348|ref|ZP_04409774.1| hemolysin-related protein Vcp [Vibrio cholerae TM 11079-80]
gi|419837611|ref|ZP_14361049.1| FG-GAP repeat family protein [Vibrio cholerae HC-46B1]
gi|421344462|ref|ZP_15794865.1| FG-GAP repeat family protein [Vibrio cholerae HC-43B1]
gi|421354554|ref|ZP_15804886.1| FG-GAP repeat family protein [Vibrio cholerae HE-45]
gi|423735562|ref|ZP_17708759.1| FG-GAP repeat family protein [Vibrio cholerae HC-41B1]
gi|424009903|ref|ZP_17752840.1| FG-GAP repeat family protein [Vibrio cholerae HC-44C1]
gi|229342714|gb|EEO07706.1| hemolysin-related protein Vcp [Vibrio cholerae TM 11079-80]
gi|395940542|gb|EJH51223.1| FG-GAP repeat family protein [Vibrio cholerae HC-43B1]
gi|395953679|gb|EJH64292.1| FG-GAP repeat family protein [Vibrio cholerae HE-45]
gi|408629745|gb|EKL02414.1| FG-GAP repeat family protein [Vibrio cholerae HC-41B1]
gi|408856159|gb|EKL95854.1| FG-GAP repeat family protein [Vibrio cholerae HC-46B1]
gi|408863698|gb|EKM03172.1| FG-GAP repeat family protein [Vibrio cholerae HC-44C1]
Length = 691
Score = 40.4 bits (93), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|153830997|ref|ZP_01983664.1| hypothetical protein A59_1970 [Vibrio cholerae 623-39]
gi|229529090|ref|ZP_04418480.1| hemolysin-related protein Vcp [Vibrio cholerae 12129(1)]
gi|148873522|gb|EDL71657.1| hypothetical protein A59_1970 [Vibrio cholerae 623-39]
gi|229332864|gb|EEN98350.1| hemolysin-related protein Vcp [Vibrio cholerae 12129(1)]
Length = 691
Score = 40.4 bits (93), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|424591553|ref|ZP_18030980.1| FG-GAP repeat family protein [Vibrio cholerae CP1037(10)]
gi|408031433|gb|EKG68055.1| FG-GAP repeat family protein [Vibrio cholerae CP1037(10)]
Length = 691
Score = 40.0 bits (92), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|229515268|ref|ZP_04404728.1| hemolysin-related protein Vcp [Vibrio cholerae TMA 21]
gi|229347973|gb|EEO12932.1| hemolysin-related protein Vcp [Vibrio cholerae TMA 21]
Length = 691
Score = 40.0 bits (92), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|167523715|ref|XP_001746194.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775465|gb|EDQ89089.1| predicted protein [Monosiga brevicollis MX1]
Length = 1228
Score = 40.0 bits (92), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 82/368 (22%), Positives = 147/368 (39%), Gaps = 60/368 (16%)
Query: 365 WTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGG 424
WT+++ V ++H + V D+D DG +++ A S +Y N H G
Sbjct: 874 WTDQRR-----LVTYEAHGVLDVVTVDLDGDGHMDLLSANSGVPAASWYRNTGHGVFAGP 928
Query: 425 IDIGKYVAGAIVVFNLDTKQVKWTTDLD-LSTDNASFRAYI--------YSSPT------ 469
+ + + + + V LD T LD L D AS + + + +PT
Sbjct: 929 VHLTQALQQPMSVAVLDVNH---TGRLDVLVADRASDQVWWVKNLGQGRFGTPTSLLSGI 985
Query: 470 -------VVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFPLEMAEIQG--AVVAA 517
V DL+GD DI+ +++ + G + L A+ A+V A
Sbjct: 986 GQINLVRVADLNGDNMTDIICVDDRADAIVWFANEGAGAFGSRRTLSNADQSDIRALVLA 1045
Query: 518 DINDDGKIELVTTDTHG----------NVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGD 567
D++ DG +++++ T G N + + GI Q K + ++ D+DGD
Sbjct: 1046 DLDGDGAVDILSGSTSGSATVVLYHNLNNSGSFSMAHGISSQSEK---VRSLAVADIDGD 1102
Query: 568 GHSDVV--VPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIV 625
G DV+ +P ++ + +G+ T V DL G+ ++
Sbjct: 1103 GRQDVLSAIPDDDAVVWYRNQDNGTFAAFRAITTEAAYATSVATADLDGDGDPD----VL 1158
Query: 626 TTSFDG---YLYLIDGPTSCADVVDIGETS--YSMVLADNVDGGDDLDLIVTTMNGN-VF 679
+ S D Y DG S + +I T+ V A ++D DLD++ ++N N +
Sbjct: 1159 SASGDDRKIAWYQNDGQGSFSAQQEISLTADGAQQVTAADLDNDGDLDIVSASLNDNEIA 1218
Query: 680 CFSTPAPH 687
++ PH
Sbjct: 1219 WYNNDLPH 1226
>gi|162449506|ref|YP_001611873.1| hypothetical protein sce1236 [Sorangium cellulosum So ce56]
gi|161160088|emb|CAN91393.1| putative secreted protein [Sorangium cellulosum So ce56]
Length = 829
Score = 40.0 bits (92), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 41/81 (50%), Gaps = 2/81 (2%)
Query: 61 NLELRWQTEVSS--SIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFH 118
N + +Q +SS + Y +P++AD+N DGKLDIV + + +L G
Sbjct: 237 NGDGTFQAPISSPGADYVSPVLADVNGDGKLDIVASRNGNSITILAGDGAGAFTTSSTLT 296
Query: 119 QSSVHSSPLLYDIDKDGVREI 139
++ L D+D DGVR+I
Sbjct: 297 GTTNCRFITLADLDSDGVRDI 317
>gi|121727140|ref|ZP_01680314.1| hypothetical protein VCV52_1858 [Vibrio cholerae V52]
gi|153825176|ref|ZP_01977843.1| hypothetical protein A5A_2090 [Vibrio cholerae MZO-2]
gi|121630518|gb|EAX62910.1| hypothetical protein VCV52_1858 [Vibrio cholerae V52]
gi|149741155|gb|EDM55206.1| hypothetical protein A5A_2090 [Vibrio cholerae MZO-2]
Length = 691
Score = 40.0 bits (92), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|448312226|ref|ZP_21501976.1| cell surface protein/ lipoprotein [Natronolimnobius innermongolicus
JCM 12255]
gi|445602733|gb|ELY56705.1| cell surface protein/ lipoprotein [Natronolimnobius innermongolicus
JCM 12255]
Length = 452
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 98/241 (40%), Gaps = 54/241 (22%)
Query: 434 AIVVFNLDTKQ---VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLF 490
A V ++D+K+ V+W T++A + SSPTVVD + VG++ G
Sbjct: 2 ATTVDSVDSKESDRVRWQ----FETESA-----VRSSPTVVDGI------VFVGSNDGTL 46
Query: 491 YVLDHH-GKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKG---- 545
Y +D G R + E V AA DDG + + T+GN A G
Sbjct: 47 YAVDAETGTTRWTY-----EPDRDVEAAPAVDDGTVYV---GTNGNKIAAVDAATGDERW 98
Query: 546 ---IWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGR 602
W + +T P++GD V + G++Y L GS + + T G+
Sbjct: 99 EYSFWGETFVRGITSSPAVGD------GLVYIGMTDGHVYAL--DTGSGAEEWSFATDGQ 150
Query: 603 VMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCAD-VVDIGETSYSM-VLADN 660
V + + G T+ S D YLY IDG + D+G+ YS +AD
Sbjct: 151 VASSPTV----------DGGTVYVGSTDDYLYAIDGDSGDEQWAFDVGDPVYSSPTVADG 200
Query: 661 V 661
+
Sbjct: 201 I 201
>gi|417825108|ref|ZP_12471696.1| FG-GAP repeat family protein [Vibrio cholerae HE48]
gi|421351621|ref|ZP_15801986.1| FG-GAP repeat family protein [Vibrio cholerae HE-25]
gi|340046593|gb|EGR07523.1| FG-GAP repeat family protein [Vibrio cholerae HE48]
gi|395952066|gb|EJH62680.1| FG-GAP repeat family protein [Vibrio cholerae HE-25]
Length = 691
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|422307829|ref|ZP_16394984.1| PQQ enzyme repeat family protein [Vibrio cholerae CP1035(8)]
gi|408618895|gb|EKK91947.1| PQQ enzyme repeat family protein [Vibrio cholerae CP1035(8)]
Length = 691
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|387792601|ref|YP_006257666.1| hypothetical protein Solca_3485 [Solitalea canadensis DSM 3403]
gi|379655434|gb|AFD08490.1| hypothetical protein Solca_3485 [Solitalea canadensis DSM 3403]
Length = 469
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 52/129 (40%), Gaps = 8/129 (6%)
Query: 468 PTVVDLDGDGNLDILVGTSFGLF---YVLDHHGKIREKFPLEMAEIQGAVVAADINDDGK 524
P D DGDG D+ + G + Y + GK + Q V AD + DGK
Sbjct: 291 PCPADYDGDGKADLSIRNGQGKWNIDYASNGFGKWDINYSNYGFTGQEFPVPADYDGDGK 350
Query: 525 IELVTTDTHGNVAA--WTAEGKGIWEQHLK---SLVTQGPSIGDVDGDGHSDVVVPTLSG 579
+L ++ + + + G G W+ S + P D DGDG +D+ + G
Sbjct: 351 ADLSIKNSTNGIWYIDYASNGFGAWDMSFNGYGSSSSDKPCPADYDGDGKADLAIRNNKG 410
Query: 580 NIYVLSGKD 588
Y+ KD
Sbjct: 411 EWYIDYAKD 419
>gi|384424880|ref|YP_005634238.1| Hemolysin-like protein Vcp [Vibrio cholerae LMA3984-4]
gi|429886155|ref|ZP_19367721.1| Hemolysin-related protein Vcp [Vibrio cholerae PS15]
gi|327484433|gb|AEA78840.1| Hemolysin-like protein Vcp [Vibrio cholerae LMA3984-4]
gi|429227049|gb|EKY33106.1| Hemolysin-related protein Vcp [Vibrio cholerae PS15]
Length = 691
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|148676092|gb|EDL08039.1| integrin alpha 8, isoform CRA_c [Mus musculus]
Length = 981
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 57/119 (47%), Gaps = 18/119 (15%)
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNL 440
S+ T V++D++NDG+ ++++ F + E+ NP +E+G + + Y+ + ++F
Sbjct: 318 SYFGYTVVVSDVNNDGMDDILVGAPLFMEREFESNP---REVGQVYL--YLQASALLFQ- 371
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKI 499
D + T +F + S + DL+ DG DI +G F D GK+
Sbjct: 372 ---------DPQVLTGTETFGRFGSSVAHLGDLNQDGYNDIAIGVPFA---GKDQRGKV 418
>gi|153823442|ref|ZP_01976109.1| hypothetical protein A5E_2184, partial [Vibrio cholerae B33]
gi|126519031|gb|EAZ76254.1| hypothetical protein A5E_2184 [Vibrio cholerae B33]
Length = 659
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|94969301|ref|YP_591349.1| integrin-like protein [Candidatus Koribacter versatilis Ellin345]
gi|94551351|gb|ABF41275.1| Integrin-like protein [Candidatus Koribacter versatilis Ellin345]
Length = 469
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 57/135 (42%), Gaps = 10/135 (7%)
Query: 448 TTDLDLSTDNASFRAYI-YSSPT-----VVDLDGDGNLDILVGTSFGLFYVLDHHGKIRE 501
T D+ + + F AY Y+SP+ VD++GDG D+++ G + + G R
Sbjct: 57 TIDIFFNDHSGGFGAYTSYNSPSGGPIIAVDVNGDGWPDLVIAGGGGTVLLNNGDGTFRP 116
Query: 502 KFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS- 560
+ VA D N DGK++L + + G G + + G S
Sbjct: 117 GTAPTTKAPASSFVAGDFNKDGKVDLAAVEGT-QIEILLNNGSGTFHSGQVLAMAGGSSN 175
Query: 561 --IGDVDGDGHSDVV 573
+GD D DG+ D+
Sbjct: 176 AVVGDFDSDGNLDIA 190
>gi|421336616|ref|ZP_15787077.1| FG-GAP repeat family protein [Vibrio cholerae CP1048(21)]
gi|395931715|gb|EJH42459.1| FG-GAP repeat family protein [Vibrio cholerae CP1048(21)]
Length = 673
Score = 40.0 bits (92), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 107 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 166
Query: 529 TTD 531
D
Sbjct: 167 AAD 169
>gi|183179489|ref|ZP_02957700.1| hypothetical protein A51_B1961 [Vibrio cholerae MZO-3]
gi|183012900|gb|EDT88200.1| hypothetical protein A51_B1961 [Vibrio cholerae MZO-3]
Length = 691
Score = 40.0 bits (92), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|182437686|ref|YP_001825405.1| hypothetical protein SGR_3893 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|178466202|dbj|BAG20722.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350]
Length = 1049
Score = 40.0 bits (92), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 62/140 (44%), Gaps = 15/140 (10%)
Query: 459 SFRAYIYSSPTVVDLDGDGNLDILV---GTSFGLFYVLDHHGKIREKFPLEMAEIQGAVV 515
++AY SP DL+GDG D+L T Y D G+++ + VV
Sbjct: 892 GWKAYDLYSPG--DLNGDGVADVLARQKSTGSLYLYASDRAGRLKAGVLIGTGWKGLTVV 949
Query: 516 AA-DINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQG-------PSIGDVDGD 567
A D+ DG +++T D+ G + W G G + K + G ++GD+ GD
Sbjct: 950 GAGDLTGDGIGDVLTRDSSGVL--WRYAGTGAGKLAAKQRIGSGWGQYNAIAAVGDITGD 1007
Query: 568 GHSDVVVPTLSGNIYVLSGK 587
G +D+ SG ++ G+
Sbjct: 1008 GRNDLFARDKSGVLWRYEGR 1027
>gi|427401959|ref|ZP_18893031.1| hypothetical protein HMPREF9710_02627 [Massilia timonae CCUG 45783]
gi|425719150|gb|EKU82087.1| hypothetical protein HMPREF9710_02627 [Massilia timonae CCUG 45783]
Length = 452
Score = 40.0 bits (92), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 59/114 (51%), Gaps = 11/114 (9%)
Query: 471 VDLDGDGNLDILVGTSFGL--FYVLDHHGK---IREKFPLEMAEIQGAVVAADINDDGKI 525
+D+D DG+LDI+V G+ Y D GK + + F L M + + V AAD N DG +
Sbjct: 77 IDVDHDGDLDIVVSVEHGVNRLYRNDGGGKLTYVPDAFGLRMHDSE-HVRAADFNRDGHM 135
Query: 526 ELVTTDTHGNV-AAWTAEGKGIWEQHLKSLV--TQGP--SIGDVDGDGHSDVVV 574
++V V + +GKG + L +QG ++GDVDGDG D+ +
Sbjct: 136 DVVFVAEADEVHQLYLGDGKGGFVDASARLPAHSQGNALALGDVDGDGLPDIFI 189
>gi|410901543|ref|XP_003964255.1| PREDICTED: integrin alpha-9-like [Takifugu rubripes]
Length = 1014
Score = 40.0 bits (92), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 47/212 (22%), Positives = 77/212 (36%), Gaps = 45/212 (21%)
Query: 430 YVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYI-----YSSPTVVDL-------DGDG 477
Y G I V+NL + ++ + S+ Y +S+P V+D+ G G
Sbjct: 201 YWTGTIKVYNLTSDSFYSPNKENVDSHRYSYLGYAVAAGHFSAPNVIDIAAGAPQHSGSG 260
Query: 478 NLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIEL-VTTDTH--- 533
+ I L GK+ + ++ A D+N DG +L V H
Sbjct: 261 KVYIFKIDGSSLVKSFQASGKMMGSY------FGSSLCAVDLNQDGLSDLLVGAPMHSQL 314
Query: 534 ---GNVAAWTAEGKGIWEQ------------HLKSLVTQGPSIGDVDGDGHSDVVVPT-- 576
G V + + G G+ E+ H + +IGD+D DG DV +
Sbjct: 315 RDEGQVTVYLSIGNGVMEESAVLSGDNAFNAHFGECIA---AIGDIDDDGFQDVAIGAPK 371
Query: 577 ---LSGNIYVLSGKDGSKVRPYPYRTHGRVMN 605
G +Y+ G + Y + GR +N
Sbjct: 372 EDDYGGAVYIYHGDASGIISTYSMKLTGRSVN 403
>gi|393782946|ref|ZP_10371126.1| hypothetical protein HMPREF1071_01994 [Bacteroides salyersiae
CL02T12C01]
gi|392671304|gb|EIY64778.1| hypothetical protein HMPREF1071_01994 [Bacteroides salyersiae
CL02T12C01]
Length = 572
Score = 40.0 bits (92), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 66/130 (50%), Gaps = 25/130 (19%)
Query: 469 TVVDLDGDGNLDILVGTSFG--------LFY-VLDHHGKIREKFPLE-----MAEIQGAV 514
T+ D++GDG LDI + T G LFY +D GK ++ E + +
Sbjct: 225 TLADINGDGTLDI-ISTGNGDNWVCLASLFYNTVDKDGKCTYRYVSEKESNLLGVSRANP 283
Query: 515 VAADINDDGKIELVT----TDTHG---NVAAWTAEGKGIWEQHLKSL-VTQ--GPSIGDV 564
+ AD N DG++++V +D G + T EGK + ++ + V Q G ++GDV
Sbjct: 284 LVADFNGDGRMDMVINGEPSDGSGYRNRIYYQTPEGKFVMDKSYPVVPVNQDGGINMGDV 343
Query: 565 DGDGHSDVVV 574
+GDG+ D++V
Sbjct: 344 NGDGNMDLIV 353
>gi|422923192|ref|ZP_16956351.1| FG-GAP repeat family protein, partial [Vibrio cholerae BJG-01]
gi|341644365|gb|EGS68587.1| FG-GAP repeat family protein [Vibrio cholerae BJG-01]
Length = 669
Score = 40.0 bits (92), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|167523078|ref|XP_001745876.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775677|gb|EDQ89300.1| predicted protein [Monosiga brevicollis MX1]
Length = 436
Score = 40.0 bits (92), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 59/122 (48%), Gaps = 22/122 (18%)
Query: 435 IVVFNLDTKQVKWTTDLDLSTDNASFRAYI-----YSSPTVV---DLDGDGNLDILVGTS 486
IV N ++W+ +LD N SF + S+PT+V DLDGDG +D+L +
Sbjct: 261 IVSGNFQINGLRWSRNLD---GNGSFAQAVSLTTNASAPTLVAIGDLDGDGWVDVLSASG 317
Query: 487 F----GLFYVLDHHGKIREKFPLEMAEIQGA-----VVAADINDDGKIELVTTDTHGNVA 537
F F L H G I + P +A A +V AD++ DG +++++T +
Sbjct: 318 FVNAIQWFRNLGH-GSI-DPAPRPIAHAGAAPQLTSLVVADLDSDGDLDVLSTLPNAETV 375
Query: 538 AW 539
AW
Sbjct: 376 AW 377
Score = 39.3 bits (90), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 53/122 (43%), Gaps = 12/122 (9%)
Query: 470 VVDLDGDGNLDILVGTSFG--LFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIEL 527
+ D++ D NLD+ S G L YV G + P+ + A D+N+DG ++L
Sbjct: 151 IADVNLDTNLDVFTVDSVGNVLLYVNLGLGSFGSRVPVSDSTGITEAAATDLNNDGLLDL 210
Query: 528 VTTDTHGNVAAWTAEGKGI------WEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNI 581
V ++ W G+ + +L ++ D+D DGH D+V SGN
Sbjct: 211 VYASAVHSIIGWRRNSNGVSFAEDDYSIDSGALSVSSLALVDLDADGHIDIV----SGNF 266
Query: 582 YV 583
+
Sbjct: 267 QI 268
>gi|119963150|ref|YP_946834.1| FG-GAP repeat-containing protein [Arthrobacter aurescens TC1]
gi|119950009|gb|ABM08920.1| FG-GAP repeat domain protein [Arthrobacter aurescens TC1]
Length = 972
Score = 40.0 bits (92), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 74/186 (39%), Gaps = 21/186 (11%)
Query: 468 PTVVDLDGDGNLDILVGTSFGLFYVLDHHGK---IREKFPLEM---AEIQGAVVAADIND 521
P+ D +GDG D+L + G Y+ G + + L + + A+ D N
Sbjct: 719 PSAHDFNGDGKADVLASDTAGNLYLYPGDGAGGVLTRQVALAAPAWSTVNEAITPGDFNR 778
Query: 522 DGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSI-------GDVDGDGHSDVVV 574
DGK +L+ G+V W G G+ + V+ G S GD GDG D +
Sbjct: 779 DGKPDLLA--RAGDV-LWFYPGDGVGSFGARVQVSTGWSTMSQLFSPGDFSGDGIPDFIG 835
Query: 575 PTLSGNIYVLSGKD-GSKVRPYPYRTHGRVMNQVLLV-DLTKRGEKSKGLTIVTTSFDGY 632
SG + + G G + P T VMN +L D G KG + S G
Sbjct: 836 RLSSGELRLYEGNGTGGQRTPTTIGTGWNVMNAILSTGDFNGDG---KGDVLARRSDTGA 892
Query: 633 LYLIDG 638
L+L G
Sbjct: 893 LWLYPG 898
>gi|241764235|ref|ZP_04762267.1| FG-GAP repeat protein [Acidovorax delafieldii 2AN]
gi|241366404|gb|EER60921.1| FG-GAP repeat protein [Acidovorax delafieldii 2AN]
Length = 1778
Score = 40.0 bits (92), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 54/216 (25%), Positives = 85/216 (39%), Gaps = 55/216 (25%)
Query: 366 TEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGI 425
T E H I D + L+ +AD+D DG E I+AV+ + Y N
Sbjct: 861 TGEHHLSIRDPSIAAAGSLA---LADLDGDGQPE-IVAVTRSYQIVVYRN---------- 906
Query: 426 DIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGT 485
T W +++ + D + A+ +P V D+DGDG +IL G
Sbjct: 907 ----------------TGARWWMSEVVATADG--YPAW--GAPFVADIDGDGRPEILYGN 946
Query: 486 SFGLFYVLDHHGKIREK----FPLEMAEIQGAV---VAADINDDGKIELVTTDTHGNVAA 538
+ VL+H G I+ K F ++ G V ADI GK +V +
Sbjct: 947 A-----VLNHDGSIKWKTTGNFVGSSSQSNGRFSIPVVADIQGTGKANVVLGGSL----- 996
Query: 539 WTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVV 574
++A+G+ +W+ + IGD G + V
Sbjct: 997 YSADGQLVWQAPMDGFT----GIGDFGSKGEPSIAV 1028
>gi|254226052|ref|ZP_04919651.1| hemolysin [Vibrio cholerae V51]
gi|125621435|gb|EAZ49770.1| hemolysin [Vibrio cholerae V51]
Length = 539
Score = 40.0 bits (92), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|254443031|ref|ZP_05056507.1| FG-GAP repeat domain protein [Verrucomicrobiae bacterium DG1235]
gi|198257339|gb|EDY81647.1| FG-GAP repeat domain protein [Verrucomicrobiae bacterium DG1235]
Length = 1088
Score = 40.0 bits (92), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 35/145 (24%), Positives = 60/145 (41%), Gaps = 21/145 (14%)
Query: 451 LDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDH---------HGKIRE 501
L+ + F + SS ++D+D DG++D++ F D+ G E
Sbjct: 725 LEAKVEGLDFESRSTSSSLLIDVDADGDMDLVQAGGTRRFSYPDYEDNLVWKFEEGSFAE 784
Query: 502 KFPLEMAEI------QGAVVAADINDDGKIELVTTDTHGNVAAWTAEGK------GIWEQ 549
F E ++ ++A D + DGK++LV G W K G+ ++
Sbjct: 785 DFKSEFSKAFSRSGKTSDLLAVDWSGDGKLDLVQAVQWGAPIFWKQTNKGLVRFDGVLDR 844
Query: 550 HLKSLVTQGPSIGDVDGDGHSDVVV 574
+ S + GD DGDG D+V+
Sbjct: 845 NALSGAWSSLASGDFDGDGRLDIVI 869
>gi|163757214|ref|ZP_02164314.1| hypothetical protein KAOT1_02316 [Kordia algicida OT-1]
gi|161322843|gb|EDP94192.1| hypothetical protein KAOT1_02316 [Kordia algicida OT-1]
Length = 831
Score = 40.0 bits (92), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIR-----EKFPLEMAEIQGAVVAADINDDGK 524
+ D+DGD +LD++ G +++ G +F L + +VAADI++DG
Sbjct: 149 LADIDGDNDLDVITAYGSGGIAWIENLGNNTFSSGLNQFSLGFYGVD-KIVAADIDNDGD 207
Query: 525 IELVTTDTHGNVAAW--TAEGKGIWEQHLKSLVTQGP---------SIGDVDGDGHSDVV 573
++++ + N AW +G+G + + + GP + DVDGD + D+V
Sbjct: 208 MDIIGSRETDNRIAWFENYDGQGSFGYNNPQYIHTGPGSSQNKADFCVIDVDGDTYKDLV 267
Query: 574 VPTLSGN 580
+ + + N
Sbjct: 268 ISSSTQN 274
>gi|153820523|ref|ZP_01973190.1| hemolysin [Vibrio cholerae NCTC 8457]
gi|126508933|gb|EAZ71527.1| hemolysin [Vibrio cholerae NCTC 8457]
Length = 338
Score = 40.0 bits (92), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 472 DLDGDGNLDILVGTSFGLFY-VLDHHGKIREKFPLEMAEIQ--GAVVAADINDDGKIELV 528
DLDGDG ++I+ ++ + +LDH G I+++ + + G + ADIN DG IE++
Sbjct: 125 DLDGDGLIEIVSTSALTPYINILDHQGNIKKQLLKSASGWRSVGDIALADINGDGNIEIL 184
Query: 529 TTD 531
D
Sbjct: 185 AAD 187
>gi|254296125|ref|ZP_04963582.1| FG-GAP/YD repeat domain protein [Burkholderia pseudomallei 406e]
gi|157806038|gb|EDO83208.1| FG-GAP/YD repeat domain protein [Burkholderia pseudomallei 406e]
Length = 2031
Score = 40.0 bits (92), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 56/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGFTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|126453108|ref|YP_001064904.1| FG-GAP/YD repeat-containing protein [Burkholderia pseudomallei
1106a]
gi|242317436|ref|ZP_04816452.1| FG-GAP repeat/YD repeat/RHS repeat protein [Burkholderia
pseudomallei 1106b]
gi|403517277|ref|YP_006651410.1| FG-GAP/YD repeat-containing protein [Burkholderia pseudomallei
BPC006]
gi|126226750|gb|ABN90290.1| FG-GAP repeat/YD repeat/RHS repeat protein [Burkholderia
pseudomallei 1106a]
gi|242140675|gb|EES27077.1| FG-GAP repeat/YD repeat/RHS repeat protein [Burkholderia
pseudomallei 1106b]
gi|403072920|gb|AFR14500.1| FG-GAP/YD repeat-containing protein [Burkholderia pseudomallei
BPC006]
Length = 2031
Score = 40.0 bits (92), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 56/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGFTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|421680273|ref|ZP_16120128.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC111]
gi|410389642|gb|EKP42053.1| PilC beta-propeller domain protein [Acinetobacter baumannii
OIFC111]
Length = 1284
Score = 40.0 bits (92), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 48/99 (48%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 848 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 906
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDN + + I S + +D D DG +D L
Sbjct: 907 RLWWTSDTGSNTDNTNMKHSIVSRISTLDRDADGLVDHL 945
>gi|21221049|ref|NP_626828.1| hypothetical protein SCO2591 [Streptomyces coelicolor A3(2)]
gi|6983733|emb|CAB75372.1| putative secreted protein [Streptomyces coelicolor A3(2)]
Length = 576
Score = 40.0 bits (92), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 16/81 (19%)
Query: 517 ADINDDGKIELVTTDTHGNVAAWTAEGKGI-----------WEQHLKSLVTQGPSIGDVD 565
D+NDDGK +L+ DT G + + +G G W + K + GD
Sbjct: 426 GDLNDDGKGDLLARDTSGTLYLFRGKGHGASLSSPLKVGGGWNAYDKLV-----GAGDFT 480
Query: 566 GDGHSDVVVPTLSGNIYVLSG 586
GDG +D+V T +G +Y+ G
Sbjct: 481 GDGRTDIVARTPAGKLYLYRG 501
>gi|239501939|ref|ZP_04661249.1| pilus assembly protein tip-associated adhesin PilY1 [Acinetobacter
baumannii AB900]
Length = 1288
Score = 40.0 bits (92), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 48/99 (48%), Gaps = 2/99 (2%)
Query: 385 STPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQ 444
S PV+A+I +G ++ V +D + Y+ P GK A+ + + T Q
Sbjct: 852 SKPVLANIRYNGSIRRVLIVGGGYD-QCYEKPNITLTDACFTNGKAKGNAVYIIDAKTGQ 910
Query: 445 -VKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
+ WT+D +TDN + + I S + +D D DG +D L
Sbjct: 911 RLWWTSDTGSNTDNTNMKHSIVSRISTLDRDADGLVDHL 949
>gi|149922947|ref|ZP_01911367.1| FG-GAP repeat/HVR domain protein [Plesiocystis pacifica SIR-1]
gi|149816198|gb|EDM75705.1| FG-GAP repeat/HVR domain protein [Plesiocystis pacifica SIR-1]
Length = 550
Score = 40.0 bits (92), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 83/200 (41%), Gaps = 18/200 (9%)
Query: 385 STPVIADIDNDGVSEMIIAVS-YFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTK 443
S+PV D+D DG E++ S Y FD E + GG A+ F+ D
Sbjct: 273 SSPVAIDLDLDGAQEVLFGGSAYAFDGELRFHASPANPNGG-------HAAVANFDADPF 325
Query: 444 QVKWTT--DLDLSTDNASFRAYIYSS----PTVVDLDGDGNLDILVG--TSFGLFYVLDH 495
+ D + + S +A+ S + DLDGDG +IL G F L V
Sbjct: 326 PELYLQLDDHRVLEHDGSEKAHCSSGDRNHAAIADLDGDGMAEILYGYHDRFELLAVEGD 385
Query: 496 HGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLV 555
++ + + + A D+ DDG E++ D + + +G +E +
Sbjct: 386 TCSKELSILVDEGDARSSGTAFDLLDDGHAEMIYAD-RSRLRIFALDGSVRFESPRSARA 444
Query: 556 TQG-PSIGDVDGDGHSDVVV 574
+ P I D+D DG +++VV
Sbjct: 445 SSANPVIADIDNDGAAELVV 464
>gi|315648899|ref|ZP_07901993.1| S-layer domain protein [Paenibacillus vortex V453]
gi|315275580|gb|EFU38934.1| S-layer domain protein [Paenibacillus vortex V453]
Length = 993
Score = 40.0 bits (92), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 83/197 (42%), Gaps = 40/197 (20%)
Query: 472 DLDGDGNLDIL-VGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTT 530
D+DGDG++DI+ + G+F+ ++H + + V A D+N DG+ E++
Sbjct: 238 DIDGDGHIDIMALNDMLGVFFYMNHGDNTYNLQQQPLVTVPQKVYAEDVNGDGQSEIIVA 297
Query: 531 DTHGNVAAWTAEGKG-----IWEQHLKSLVTQGPS-----IGDVDGDGHSDVVVPT---- 576
G V ++ E +W+ LVTQG +GD DG D++V +
Sbjct: 298 TGKG-VDLYSKEYASGQLNLLWQTE---LVTQGNGTHRALVGDYAEDGTPDLIVSSNQAL 353
Query: 577 ----------------LSGNIYVLSGKDGSKVRPYPYR---THGRVMNQVLLVDLTKRGE 617
L Y +S +DG++VR R +HG V D + + +
Sbjct: 354 YIYKAIVVQPEPGTIRLEAADYAVS-EDGAEVRVKVMREGGSHGAVAVTYHTADGSAKAD 412
Query: 618 KSKGLTIVTTSF-DGYL 633
K T T SF DG L
Sbjct: 413 KHYKPTSGTLSFADGEL 429
>gi|76810483|ref|YP_332205.1| hypothetical protein BURPS1710b_0792 [Burkholderia pseudomallei
1710b]
gi|254258540|ref|ZP_04949594.1| FG-GAP/YD repeat domain protein [Burkholderia pseudomallei 1710a]
gi|76579936|gb|ABA49411.1| FG-GAP/YD repeat domain protein [Burkholderia pseudomallei 1710b]
gi|254217229|gb|EET06613.1| FG-GAP/YD repeat domain protein [Burkholderia pseudomallei 1710a]
Length = 2031
Score = 40.0 bits (92), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 56/124 (45%), Gaps = 26/124 (20%)
Query: 471 VDLDGDGNLDILVGTSFG-LFYVLDHHGKIREK---------FPLEMAEIQGAVVAADIN 520
VDL+GDG DIL+ T G L Y+ D + P+ +A+ V AD+
Sbjct: 415 VDLNGDGITDILLDTGRGWLAYLRDGPASWSDAPRVLPPARTPPVSLADPH--VHLADMT 472
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG-----------DVDGDGH 569
DG +++V T G W A G W+ + +T P+ G DVDGDG
Sbjct: 473 GDGLMDIVRV-TGGGFTYWPARADGGWDAAIA--MTPSPAFGRDFDPHRLALYDVDGDGC 529
Query: 570 SDVV 573
+D+V
Sbjct: 530 ADLV 533
>gi|167537628|ref|XP_001750482.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771022|gb|EDQ84696.1| predicted protein [Monosiga brevicollis MX1]
Length = 1454
Score = 40.0 bits (92), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 66/148 (44%), Gaps = 15/148 (10%)
Query: 466 SSPTVVDLDGDGNLDILVGTSF-GLFYVLDHHGKIREKFPLEMA---EIQGAVVAADIND 521
SS TV DLDGD D+ ++ G L + G P +A + V+AADIN
Sbjct: 458 SSITVADLDGDSLPDLAAVSAMQGSLMWLRNEGAGAFAQPTVIANDLDQPFVVLAADING 517
Query: 522 DGKIELVTTDTHGNVAAWTAE-GKGIWEQHLKSLV-TQGP---SIGDVDGDGHSDVVVPT 576
DG ++LV+ + + W+ G G + L GP + ++DGDG D+VV +
Sbjct: 518 DGALDLVSGSSGASTVFWSRNLGNGAFATTAILLANISGPYALAFSELDGDGRRDLVVSS 577
Query: 577 -LSGNIYVLSGKDGSKVRPYPYRTHGRV 603
+Y ++RP YR V
Sbjct: 578 FFQAAVYWY-----PQIRPGTYRPQSLV 600
>gi|324501096|gb|ADY40493.1| Eukaryotic translation initiation factor 2-alpha kinase [Ascaris
suum]
Length = 1075
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 69/161 (42%), Gaps = 22/161 (13%)
Query: 522 DGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNI 581
DGK+ + ++HG VA T + L SL + DGH ++VP+L G++
Sbjct: 75 DGKVTALDVNSHGKVAWSTGSDSPLLSGTLNSL--------QLMADGHPYMLVPSLDGSL 126
Query: 582 YVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTS----FDGYLYLID 637
Y+ + D S + P P T VM + D + G TIV+T+ G +
Sbjct: 127 YMFN-MDSSSLNPIPVSTDISVM----IGD-----DAVAGGTIVSTTGVDPITGQVRYHC 176
Query: 638 GPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNV 678
+CA + + Y++++ N +L+ + N+
Sbjct: 177 TTENCAQMATSNPSPYTLIIRRNTQVARAANLVTGSERWNL 217
>gi|167518736|ref|XP_001743708.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777670|gb|EDQ91286.1| predicted protein [Monosiga brevicollis MX1]
Length = 1091
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 63/131 (48%), Gaps = 13/131 (9%)
Query: 466 SSPTVVDLDGDGNLDILVGT---SFGLFYVLDHHGKI--REKFPLEMAEIQGAVVAADIN 520
S+ V D+D DG++DI+ + SF +Y + R KF + I + AD+N
Sbjct: 507 STFGVADMDNDGHIDIVGASETASFIAWYRNWGNATFDPRPKFVANNSAITYTLAMADLN 566
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS------IGDVDGDGHSDVVV 574
DG +++V + AW G L++LV G + GDV+GDG D+V
Sbjct: 567 GDGTVDVVAAHYGDDTVAWYRNVGGGRFSPLQNLVISGYNGATGVVAGDVNGDGLPDLV- 625
Query: 575 PTLSGNIYVLS 585
++S N YV S
Sbjct: 626 -SISLNNYVAS 635
>gi|126326699|ref|XP_001377736.1| PREDICTED: integrin alpha-4 [Monodelphis domestica]
Length = 1031
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 64/153 (41%), Gaps = 35/153 (22%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAV-VAADIN 520
+Y +S VDL+ DG D+LVG H IRE +G V V +
Sbjct: 304 SYFGASVCAVDLNADGLSDLLVGAPM--------HSIIRE---------EGRVFVYINSG 346
Query: 521 DDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPT---- 576
+GK++ + T+ G+ ++H ++GD+D DG DV +
Sbjct: 347 SEGKMKELETELVGS------------DKHAARFGESILNLGDIDNDGFEDVAIGAPQED 394
Query: 577 -LSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVL 608
L G IY+ +G+ + R GR ++Q L
Sbjct: 395 DLRGAIYIYNGRIDGISPSFSQRIEGRQISQSL 427
>gi|323456282|gb|EGB12149.1| hypothetical protein AURANDRAFT_70767, partial [Aureococcus
anophagefferens]
Length = 1641
Score = 40.0 bits (92), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 54/242 (22%), Positives = 95/242 (39%), Gaps = 67/242 (27%)
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNL 440
+ + P + D+D DG+ ++++ G +D G +V++
Sbjct: 372 TQAYAAPAVGDLDGDGLPDVLV--------------------GSVD------GLVVLYLT 405
Query: 441 DTKQVKWT-TDLDLSTDNASFRAYIYSSPT--VVDLDGDGNLDILVGTSF-GLFYVLDH- 495
+ + T DL D SPT +VD DGDG+LD+++ +++ G Y+L++
Sbjct: 406 TVSEAHYGETWEDLQAD----------SPTQSLVDADGDGDLDVILSSAYGGAVYLLEND 455
Query: 496 HGKIREKFPLEMAEIQG------------AVVAADINDDGKIELVTTDTHGNVAAWTAEG 543
G F L A+ D++ DG ++ D G VA + G
Sbjct: 456 GGAASPHFALHGDGGANAAADAFAGKEHMALAVGDVDGDGAFDVAFGDGDGAVAFFYNRG 515
Query: 544 KG---IWEQHLKSLVTQG--------PSIGDVDGDGHSDVV---VPTLSGNIYVLSGKDG 589
++ LV P +GDVDGD D+V P + +Y +G +
Sbjct: 516 SADAFRFDFADPDLVVDAPDGRTLAKPELGDVDGDADLDLVATYAPAFTPIVYFENGGNA 575
Query: 590 SK 591
S+
Sbjct: 576 SR 577
>gi|149196608|ref|ZP_01873662.1| hypothetical protein LNTAR_08959 [Lentisphaera araneosa HTCC2155]
gi|149140288|gb|EDM28687.1| hypothetical protein LNTAR_08959 [Lentisphaera araneosa HTCC2155]
Length = 380
Score = 40.0 bits (92), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 92/234 (39%), Gaps = 51/234 (21%)
Query: 363 EEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKEL 422
++WT Q + +D +IAD D DG ++ I V+ F PE K
Sbjct: 114 DQWTFHQ-------ITLDFSACHCVLIADADEDG--KLDIIVNNF-------RPEQTKAP 157
Query: 423 GGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSP---------TVVDL 473
G +IV + + K D +++AY + ++ D+
Sbjct: 158 KG-PTKNLFHSSIVCYPIPAK----------PRDPKAWKAYPLADKDAPGGSHYMSMADI 206
Query: 474 DGDGNLDILVGTS---------FGLFYVLDHHGKIREKFPLEMAEIQGA-VVAADINDDG 523
DGDG ++ VG F ++ ++ K EK + +I + ADIN DG
Sbjct: 207 DGDGKKEMFVGAKGEPFENGNYFAIWKAGNNRLKPWEKVKVFNEQIGATHMYGADINGDG 266
Query: 524 KIELVTTDTHGNVAAW--TAEGKGI-WEQHLKSLVTQGPSIGDVDGDGHSDVVV 574
K +L+ + HG W E K I +Q L T I DVD DG D+V
Sbjct: 267 KNDLIASRGHGKGLVWFKAPEFKAIEIDQELDRPHTM--DIADVDADGDIDLVA 318
>gi|420179809|ref|ZP_14686086.1| Gram-positive signal peptide protein, YSIRK family, partial
[Staphylococcus epidermidis NIHLM053]
gi|394252404|gb|EJD97439.1| Gram-positive signal peptide protein, YSIRK family, partial
[Staphylococcus epidermidis NIHLM053]
Length = 4737
Score = 40.0 bits (92), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 78/166 (46%), Gaps = 20/166 (12%)
Query: 222 TSTESNPAPATVSNPDVKKVNESLVNVSNPSEERKVNESHTEMNI-KLPTSVDNSSTTTV 280
+ST N A A +SN VK NE+ +S + +N+SHTE N+ P VDNS+ T
Sbjct: 82 SSTSHNQANAPLSN-QVKDPNETSQQLSTNVSDDSINQSHTETNMNNEPLQVDNSTMQTH 140
Query: 281 SGGTNSSENGTNTG---RRLLEDNNSKGSQEGNDKEDVPVATAENDQA----LDENA--- 330
S S NG +G +L E+ ++ NDKE + + Q LD+NA
Sbjct: 141 SKKV-SDSNGNTSGNEHHKLTENVLAESRASKNDKEKENLQEKDKPQQVQPPLDKNALQA 199
Query: 331 --DSSFELFRDTD-ELADEYNYDYD----DYVDDAMWGDEEWTEEQ 369
D+S+ +R D + AD Y DYV+D + ++ EQ
Sbjct: 200 FFDASYYNYRMIDRDRADTTEYQKVKSAFDYVNDLLGNNQNIPSEQ 245
>gi|395827293|ref|XP_003786839.1| PREDICTED: integrin alpha-8 [Otolemur garnettii]
Length = 1063
Score = 39.7 bits (91), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 28/108 (25%), Positives = 53/108 (49%), Gaps = 15/108 (13%)
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNL 440
S+ T +++D++NDG+ ++++ F D E+ NP +E+G I + Y+ + +VF
Sbjct: 319 SYFGYTVIVSDVNNDGMDDILVGAPLFMDREFESNP---REVGQIYL--YLQASALVFR- 372
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG 488
+ + T + F + S + DL+ DG DI +G F
Sbjct: 373 ---------NPQVLTGSEIFGRFGSSLAHLGDLNQDGYNDIAIGVPFA 411
>gi|326666079|ref|XP_696861.5| PREDICTED: integrin alpha-3 [Danio rerio]
Length = 1022
Score = 39.7 bits (91), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 65/158 (41%), Gaps = 32/158 (20%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSF---------GLFYV-LDHHGKIREKF------PL 505
+Y +S VVDL+ DG D++VG F G YV ++ +G R K P
Sbjct: 274 SYFGNSVAVVDLNNDGWKDLIVGAPFYFDRKMDKGGAVYVYMNQNGSFRNKSDVVLTGPK 333
Query: 506 EMAEIQGAVVAADINDDGKIELVTTDTH---GNVAAWTAEGKGIWEQHLK---------- 552
+ V D+N DG + + G V+ WT GI ++ +
Sbjct: 334 DSGFGMSVVAIGDVNQDGFQDFAVGAPYHSTGRVSIWTGSKTGISQEPSQVIDGKDIPGG 393
Query: 553 SLVTQGPSIG---DVDGDGHSDVVVPTLSGNIYVLSGK 587
T G SI DVD +G+ D+ V +L + +L +
Sbjct: 394 GFQTFGYSISRGLDVDRNGYPDIAVGSLDDRVVLLRSR 431
>gi|332668839|ref|YP_004451847.1| peptidase S8 and S53 subtilisin kexin sedolisin [Cellulomonas fimi
ATCC 484]
gi|332337877|gb|AEE44460.1| peptidase S8 and S53 subtilisin kexin sedolisin [Cellulomonas fimi
ATCC 484]
Length = 1448
Score = 39.7 bits (91), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 61/144 (42%), Gaps = 14/144 (9%)
Query: 435 IVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLD 494
VV D ++ T + + A F + + D DG LD++V TS V
Sbjct: 696 FVVLRQDAGRLTPWTQVATTFGGAPFAGQVVE---LASWDTDGRLDVVVSTSRPSVQVFR 752
Query: 495 HHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSL 554
+ G R E A + + D+ DG+ ++VTT H +A T G+ L
Sbjct: 753 NDGSGRFTAVAEAAA-EPRLAVGDVTGDGRPDVVTT--HAVLAQRT---DGLLAAPLALP 806
Query: 555 VTQGP-----SIGDVDGDGHSDVV 573
GP ++GDV GDG +DVV
Sbjct: 807 FAGGPLVTSVAVGDVTGDGRADVV 830
>gi|434402616|ref|YP_007145501.1| putative calcium-binding protein [Cylindrospermum stagnale PCC
7417]
gi|428256871|gb|AFZ22821.1| putative calcium-binding protein [Cylindrospermum stagnale PCC
7417]
Length = 1692
Score = 39.7 bits (91), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 54/136 (39%), Gaps = 21/136 (15%)
Query: 465 YSSPTVVDLDGDGNLDILVGTSFG--LFYVLDHHGKIREKFPLEMAEIQGAV-------- 514
Y++PT D+DGDG+LD VG G LFY + F LE G
Sbjct: 77 YATPTFADIDGDGDLDAFVGNVDGNTLFY-RNSGTATAPAFTLEATNPFGLTNVVYVAVP 135
Query: 515 VAADINDDGKIELVTTDTHGNVAAWTAEGKG------IWEQHLKSLVTQG----PSIGDV 564
ADI+ DG ++ + +GN + G + + L G P+ D+
Sbjct: 136 TLADIDGDGDLDAFVGNGYGNTVFYRNTGTSAAPSFTVEATNPFGLTDVGFSAKPTFADI 195
Query: 565 DGDGHSDVVVPTLSGN 580
D DG D V GN
Sbjct: 196 DADGDLDAFVGNFDGN 211
>gi|193215429|ref|YP_001996628.1| peptidase S8/S53 subtilisin kexin sedolisin [Chloroherpeton
thalassium ATCC 35110]
gi|193088906|gb|ACF14181.1| peptidase S8 and S53 subtilisin kexin sedolisin [Chloroherpeton
thalassium ATCC 35110]
Length = 1478
Score = 39.7 bits (91), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 70/175 (40%), Gaps = 29/175 (16%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGK--IREKFPLEMAEIQG---AVVAADINDDGK 524
V D D DG +ILVG FY+ ++ G I + LE G ++ D +GK
Sbjct: 896 VQDFDEDGLPEILVGDYDANFYIYEYSGSGNIYNQTWLERTRFIGGSNSLAEGDFLGNGK 955
Query: 525 IELVT---TDTHGN-----------VAAWTAEG----KGIWEQHLKS-----LVTQGPSI 561
+ V +D + N W A G + +WEQ + S
Sbjct: 956 KQFVVAAHSDNNQNDDREYAAPVWMYQCWQATGDDTYEKVWEQLFYNYKPAYYFESATSA 1015
Query: 562 GDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRG 616
GDVDGDG ++++ T N+YV D K + +++++ D+ G
Sbjct: 1016 GDVDGDGQDELLILTYP-NLYVFKWDDAEKTFKAIWHYPMSSASELIVADIDGNG 1069
>gi|313677272|ref|YP_004055268.1| fg-gap repeat protein [Marivirga tractuosa DSM 4126]
gi|312943970|gb|ADR23160.1| FG-GAP repeat protein [Marivirga tractuosa DSM 4126]
Length = 1682
Score = 39.7 bits (91), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 70/151 (46%), Gaps = 16/151 (10%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKI--REKFPLEMAEIQGAVVAADINDDGKIEL 527
+ DLDGDG L+ S Y LD + ++ R + E + I GA + D N DG+ E+
Sbjct: 347 IADLDGDGQLNAAF-VSGKYLYALDENWQLFWRTEVNEETSGITGATLF-DFNGDGQSEV 404
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLKS-LVTQGPSIGDVDGDGHSDVVVPTLSGNIYV--- 583
V D +G +H +S +T+ P + DVD DG +++ V ++ + V
Sbjct: 405 VYRDEDYLYILNGKDGTINPLKHCRSRTMTEYPIVADVDADGSTEICVVCVTEDHQVSTP 464
Query: 584 ---LSGKDGSKVRPY-----PYRTHGRVMNQ 606
LS ++VR Y P+ RV NQ
Sbjct: 465 GRNLSLDAPAEVRIYKSGAAPWVPARRVWNQ 495
>gi|195984459|gb|ACG63805.1| SxtP [Aphanizomenon sp. NH-5]
Length = 480
Score = 39.7 bits (91), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 79/191 (41%), Gaps = 39/191 (20%)
Query: 440 LDTKQVKWTTD---LDLSTDNASFRAYI------YS-SPTVVDLDGDGNLD--------- 480
LD W ++ + L N SF A YS S + D++GD NLD
Sbjct: 250 LDLAFTNWVSNKVSVLLGNGNGSFGAATNFPVGTYSISVAIGDVNGDSNLDLAVTNWVNN 309
Query: 481 ---ILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHG-NV 536
+L+G G F F + I +VV AD+N D ++L +T NV
Sbjct: 310 KVSVLLGNGNGSF-------GAATNFSVGTNPI--SVVIADVNGDSNLDLAVANTSSHNV 360
Query: 537 AAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTLSG-NIYVLSGK-DGSK 591
+ G G + V P +IGDV+GD + D+ V S N+ VL G +GS
Sbjct: 361 SVLLGNGNGSFGAATNFRVGTNPYSVAIGDVNGDSNLDLTVTNYSSNNVSVLLGNGNGSF 420
Query: 592 VRP--YPYRTH 600
P +P T+
Sbjct: 421 GAPTNFPVGTN 431
>gi|149200549|ref|ZP_01877559.1| hypothetical protein LNTAR_09946 [Lentisphaera araneosa HTCC2155]
gi|149136367|gb|EDM24810.1| hypothetical protein LNTAR_09946 [Lentisphaera araneosa HTCC2155]
Length = 377
Score = 39.7 bits (91), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 64/129 (49%), Gaps = 11/129 (8%)
Query: 456 DNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQG--A 513
+N + ++IY VDL+ D D++ GL VL G E ++ A+IQG A
Sbjct: 246 ENQTGASHIYG----VDLNSDNIKDLVAARGHGLG-VLWFKGPNYEAIEID-AKIQGPHA 299
Query: 514 VVAADINDDGKIELVTTDTHGNVAAWTA-EGKGIWEQHLKSLVTQG--PSIGDVDGDGHS 570
D++ DG I++V TD W +GK + +H+ S +G + D++GDG
Sbjct: 300 FDKVDLDGDGDIDMVATDNALKQLVWYENDGKANFTKHIISEKQKGYDLDLQDLNGDGKI 359
Query: 571 DVVVPTLSG 579
D+VV SG
Sbjct: 360 DIVVSGTSG 368
>gi|289771671|ref|ZP_06531049.1| secreted protein [Streptomyces lividans TK24]
gi|289701870|gb|EFD69299.1| secreted protein [Streptomyces lividans TK24]
Length = 576
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 16/81 (19%)
Query: 517 ADINDDGKIELVTTDTHGNVAAWTAEGKGI-----------WEQHLKSLVTQGPSIGDVD 565
D+NDDGK +L+ DT G + + +G G W + K + GD
Sbjct: 426 GDLNDDGKGDLLARDTSGTLYLFRGKGHGASLSSPLKVGGGWNAYDKLV-----GAGDFT 480
Query: 566 GDGHSDVVVPTLSGNIYVLSG 586
GDG +D+V T +G +Y+ G
Sbjct: 481 GDGRTDIVARTPAGELYLYRG 501
>gi|167517759|ref|XP_001743220.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778319|gb|EDQ91934.1| predicted protein [Monosiga brevicollis MX1]
Length = 430
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 80/175 (45%), Gaps = 17/175 (9%)
Query: 455 TDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFG---LFYVLDHHGKIR-EKFPLEMAEI 510
T NA + +Y++ DLD DG +D+L ++ +Y + G +K A+
Sbjct: 234 TTNAVYARSVYAA----DLDNDGLIDVLSASALDEKVAWYRNNGGGSFSTQKVITTDADD 289
Query: 511 QGAVVAADINDDGKIELVTTDTHGNVAAW-TAEGKGIWEQH--LKSLVTQGPSI--GDVD 565
+V AAD+++DG I++++ H N AW G + + S +Q ++ D+D
Sbjct: 290 ARSVYAADLDNDGYIDVLSASAHDNKIAWYRNNGDSSFSTQIVISSNASQADAVYAADLD 349
Query: 566 GDGHSDVVVPTLSGN---IYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGE 617
DG+ DV+ + N Y +G+ GS P T+ + V DL G+
Sbjct: 350 NDGYIDVLSASSKDNKIAWYHNNGR-GSFSTPIVITTNAEKASSVFAADLDNDGD 403
>gi|365122806|ref|ZP_09339702.1| hypothetical protein HMPREF1033_03048 [Tannerella sp. 6_1_58FAA_CT1]
gi|363641570|gb|EHL80959.1| hypothetical protein HMPREF1033_03048 [Tannerella sp. 6_1_58FAA_CT1]
Length = 1619
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 59/231 (25%), Positives = 99/231 (42%), Gaps = 46/231 (19%)
Query: 383 ILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDT 442
I S ++D+DNDG E+I +S ++D +L G D+ Y A++
Sbjct: 1251 IYSWLAVSDLDNDGDMEIIAGLSGKVKVWHHDGTPYL----GRDV-FYAESAVI------ 1299
Query: 443 KQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILV-----GTSFGLFYVLDHHG 497
N + Y + SP V DLD DG +I++ G +F +F VL G
Sbjct: 1300 --------------NGETKYYDFKSPIVADLDWDGQQEIVIPGTVGGDAFRIF-VLRADG 1344
Query: 498 KIR------EKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEG---KGIWE 548
++ E + + + V D+ ++G+ +V V W A+G K I
Sbjct: 1345 TLQPGWENPETIYNKNSSVGIDVAVGDLYNNGERNIVVLGKDC-VKIWNADGTLQKNISI 1403
Query: 549 QHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGN---IYVLSGKDGSKVRPYP 596
L+ V P + D+DGD +D+V + + IYV DG +++ +P
Sbjct: 1404 PGLEPDVF-APVLADMDGDEVADIVFGNTASSDNAIYVYR-PDGIQIKGFP 1452
>gi|392379244|ref|YP_004986403.1| protein of unknown function, partial [Azospirillum brasilense Sp245]
gi|356881611|emb|CCD02600.1| protein of unknown function, partial [Azospirillum brasilense Sp245]
Length = 3688
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 58/134 (43%), Gaps = 29/134 (21%)
Query: 465 YSSPTVVDLDGDGNLDILVGTSFG--LFYVLDHHGKIREKFPLEM--------AEIQGAV 514
++SP++VD+DGDG+LD L+G + G LFY + F LE A
Sbjct: 2078 FASPSLVDIDGDGDLDALIGNADGNTLFY-RNVGTAAAPTFALESTNPFGLGDAGSNATP 2136
Query: 515 VAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQG--------------PS 560
+ DI+ DG ++ + + GN + + I + +G P+
Sbjct: 2137 IFVDIDGDGDLDALIGNDSGNTVLY----RNIGTAAAPTFTLEGTNPFNLGDVGSNAKPT 2192
Query: 561 IGDVDGDGHSDVVV 574
D+DGDG D ++
Sbjct: 2193 FADIDGDGDLDALI 2206
>gi|418019726|ref|ZP_12659155.1| RTX toxin [Candidatus Regiella insecticola R5.15]
gi|347604920|gb|EGY29457.1| RTX toxin [Candidatus Regiella insecticola R5.15]
Length = 1823
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 62/145 (42%), Gaps = 23/145 (15%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLD-------HHGKIREKFP-LEMAEIQGAVVAADINDDG 523
D +GDG DI + G FY+L H K+ + P L + DIN DG
Sbjct: 745 DFNGDGRADIGYFSPQG-FYLLTAGENGAYHQAKLIKNIPFLAHNFAHPRRLVGDINRDG 803
Query: 524 KIELV-----TTDTHGNVAAWTAEGKGIW--------EQHLKSLVTQGPSIGDVDGDGHS 570
+ +++ T+ + A+ G + + L+ L GP GD+DGDGH+
Sbjct: 804 RDDIILFPAIRTEGETTLKIMLAQENGSFFPIRQRLPQNILEDLANPGPICGDIDGDGHA 863
Query: 571 DVVVPTLSGNIYVLSGKDGSKVRPY 595
D+V + G I++ G + Y
Sbjct: 864 DLVSFS-GGKIWLYYGTRSGRFSVY 887
>gi|354485839|ref|XP_003505089.1| PREDICTED: integrin alpha-8 [Cricetulus griseus]
Length = 1062
Score = 39.7 bits (91), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 56/119 (47%), Gaps = 18/119 (15%)
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNL 440
S+ T V++D++NDG+ ++++ F D E+ NP KE+G + + Y+ + ++F
Sbjct: 318 SYFGYTVVVSDVNNDGLDDILVGAPLFMDREFESNP---KEVGQVYL--YLQVSALLFQ- 371
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKI 499
D + T F + S + DL+ DG DI +G F D GK+
Sbjct: 372 ---------DPQVLTGTEIFGRFGSSVAHLGDLNQDGYNDIAIGVPFA---GKDQRGKV 418
>gi|433459214|ref|ZP_20417051.1| lysozyme M1 [Arthrobacter crystallopoietes BAB-32]
gi|432191692|gb|ELK48630.1| lysozyme M1 [Arthrobacter crystallopoietes BAB-32]
Length = 954
Score = 39.7 bits (91), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 57/136 (41%), Gaps = 21/136 (15%)
Query: 473 LDGDGNLDILVGTSFG--LFYVLDHHGKIREKFPLEMA---EIQGAVVAADINDDGKIEL 527
DGD D+L + G FY G+ + + + + + VV+ D NDDG+ +L
Sbjct: 491 FDGDRYPDLLARHTDGSLWFYAGTGDGRFKARVKVGASGWNSMTDVVVSGDYNDDGRQDL 550
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIG---------DVDGDGHSDVVVPTLS 578
+ T+G + G G + S T G G D DGDG DV
Sbjct: 551 LAQHTNGTAYVYPGLGTG----RVGSRSTIGTGWGKFDQLIATRDFDGDGREDVAATQPD 606
Query: 579 GNIYVLSGKDGSKVRP 594
G +++L G+ VRP
Sbjct: 607 GTLWLLR---GNGVRP 619
>gi|311746949|ref|ZP_07720734.1| hypothetical protein ALPR1_11335 [Algoriphagus sp. PR1]
gi|311302592|gb|EAZ82806.2| hypothetical protein ALPR1_11335 [Algoriphagus sp. PR1]
Length = 664
Score = 39.7 bits (91), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 59/235 (25%), Positives = 91/235 (38%), Gaps = 44/235 (18%)
Query: 389 IADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTKQVKWT 448
+ D + DG ++++ + + D+ + DN E G G I++ N+D K ++
Sbjct: 172 MVDYEGDGDLDIVVGMDDWTDYGW-DNA--YDESGNWTNGPLHGYLILLENVDGK---YS 225
Query: 449 TDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLF------------------ 490
+ N Y SP D DGDG+LDI+ G
Sbjct: 226 VKGKIQAANKDIDLYGAPSPNFADFDGDGDLDIICGEFLDRLTYFENVGTRSKPIYKEGS 285
Query: 491 YVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEG-----KG 545
++ +H G I K LEM V+ D + DG I+LV D G VA G K
Sbjct: 286 FLTNHEGTI--KMDLEMI----LPVSIDWDGDGFIDLVVGDEDGRVALVRNTGKLEGQKP 339
Query: 546 IWEQ---------HLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSK 591
++E +LK P D D DG D++ +G I + DG +
Sbjct: 340 VFESPFYFRQKADNLKFGALVTPVSVDWDRDGDEDLIAGNSAGYIAFIENLDGKE 394
>gi|167537545|ref|XP_001750441.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771119|gb|EDQ84791.1| predicted protein [Monosiga brevicollis MX1]
Length = 354
Score = 39.7 bits (91), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 59/117 (50%), Gaps = 11/117 (9%)
Query: 470 VVDLDGDGNLDIL--VGTS-FGLFYVLDHHGKIREKFPLEMAEIQGA--VVAADINDDGK 524
DLD DG LD+L G+S +Y + +G E+ + A GA V A D+++DG
Sbjct: 21 AADLDNDGYLDVLSACGSSDMIAWYRNNGNGTFSEQRVISTAA-DGAYSVFAVDLDNDGH 79
Query: 525 IELVTTDTHGNVAAW-TAEGKGIWE-QHLKSLVTQGP---SIGDVDGDGHSDVVVPT 576
I++++ + + AW G G++ Q + + GP D+D DGH DV+ +
Sbjct: 80 IDVLSASLYDDKIAWYRNSGGGVFSAQRVITTAANGPYSVYAADLDNDGHIDVLSAS 136
>gi|374600236|ref|ZP_09673238.1| Ig domain protein group 2 domain protein [Myroides odoratus DSM
2801]
gi|423325401|ref|ZP_17303242.1| hypothetical protein HMPREF9716_02599 [Myroides odoratimimus CIP
103059]
gi|373911706|gb|EHQ43555.1| Ig domain protein group 2 domain protein [Myroides odoratus DSM
2801]
gi|404606344|gb|EKB05890.1| hypothetical protein HMPREF9716_02599 [Myroides odoratimimus CIP
103059]
Length = 2024
Score = 39.7 bits (91), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 76/168 (45%), Gaps = 16/168 (9%)
Query: 465 YSSPTVVDLDGDGNLDILV-GTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAA----DI 519
Y S + DL+GDG D+LV G + + + + +F E+ G +A D
Sbjct: 150 YGSIKIADLNGDGLADLLVNGQTDAGYSSKVYFQQADGEFIDSNIELLGTYFSATEIFDA 209
Query: 520 NDDGKIE-LVTTDTHGNVAA---WTAEGKGIWEQH---LKSLVTQGPSIGDVDGDGHSDV 572
N DG + L+T ++ V A + +G G + H L ++ S+ D +GDG DV
Sbjct: 210 NGDGLPDILITGFSNSYVPATQLYLNQGNGNFSPHDSGLGAVYFSSISVADFEGDGDLDV 269
Query: 573 VVPTLSGNI---YVLSGKDGS-KVRPYPYRTHGRVMNQVLLVDLTKRG 616
++ ++G++ VL DG PY G +M LVD K G
Sbjct: 270 LLGGMNGSMTHSLVLYLNDGQGNFTAAPYAFEGIIMGSSALVDYNKDG 317
>gi|302035812|ref|YP_003796134.1| hypothetical protein NIDE0431 [Candidatus Nitrospira defluvii]
gi|300603876|emb|CBK40208.1| exported protein of unknown function, contains FG-GAP repeats
[Candidatus Nitrospira defluvii]
Length = 399
Score = 39.7 bits (91), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 31/125 (24%), Positives = 56/125 (44%), Gaps = 4/125 (3%)
Query: 497 GKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGN-VAAWTAEGKGIWEQHLKSLV 555
G R++ +++ + ++ D N D +++V + + V+ G G +E + V
Sbjct: 91 GTFRDQVQVKVCQEPRSLAINDFNHDDHLDVVLACSGSDQVSILFGRGNGKFEDGPQYPV 150
Query: 556 TQGP---SIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDL 612
+ P S DV+GDGH D+ V + + + G G RP +G V L DL
Sbjct: 151 HRTPVSVSSEDVNGDGHPDLAVALRNDKVKIFLGSAGGNFRPGVQYEYGDTPTSVALKDL 210
Query: 613 TKRGE 617
+ G+
Sbjct: 211 NQDGK 215
>gi|288919160|ref|ZP_06413498.1| FG-GAP repeat protein [Frankia sp. EUN1f]
gi|288349407|gb|EFC83646.1| FG-GAP repeat protein [Frankia sp. EUN1f]
Length = 432
Score = 39.7 bits (91), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 61/137 (44%), Gaps = 9/137 (6%)
Query: 472 DLDGDGNLDILVG-TSFGLFYVLDHHGKIREKFPLEMAEIQG--AVVAADINDDGKIELV 528
DL G G LD++V +S GL VL +G P+ G ++ D N D ++
Sbjct: 116 DLRGVGVLDVIVADSSSGLISVLLGNGDGTLGLPVPYPAGPGPSSLSVDDFNGDSVPDVA 175
Query: 529 TTDTH-GNVAAWTAEGKGIWEQHLKSLVTQGPSI---GDVDGDGHSDVVVPTLS-GNIYV 583
D + G V+ G G + + L PS GD +GDG +D+ V L + V
Sbjct: 176 AADQYSGTVSVLLGRGDGTFGPPVPQLAGSAPSSLDSGDFNGDGITDIAVVNLGDATVSV 235
Query: 584 LSGK-DGSKVRPYPYRT 599
L G DG+ P PY T
Sbjct: 236 LLGNGDGTLQPPRPYLT 252
>gi|393782324|ref|ZP_10370509.1| hypothetical protein HMPREF1071_01377 [Bacteroides salyersiae
CL02T12C01]
gi|392673595|gb|EIY67054.1| hypothetical protein HMPREF1071_01377 [Bacteroides salyersiae
CL02T12C01]
Length = 1434
Score = 39.7 bits (91), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 101/240 (42%), Gaps = 45/240 (18%)
Query: 380 DSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFN 439
D + PV D +NDG+ ++ + D+E +N E + + G G V
Sbjct: 869 DVCYYACPVWFDYNNDGLLDLFVPGLKSMDYE--NNLEEIAAFLYRNKGAEKGGQPVFEE 926
Query: 440 LDTKQVKWT----TDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGT---------- 485
++T + + + S D R ++ +V D D DG LD+++
Sbjct: 927 VNTATPAGSKMGISPIYSSKDGGRSRQWV----SVGDYDKDGYLDLIIAGLDDYEDPDGR 982
Query: 486 ------------SFGLFYVLDHHGK--IREKFPLEMAE-----IQGAVVAADINDDGKIE 526
L+ ++ GK I+++ PL E +G+V AD+++DG ++
Sbjct: 983 TDEKGNLIVHHDRRALYLYKNNKGKGFIQQETPLNGTEPFLGLSRGSVHFADMDNDGWLD 1042
Query: 527 LVTTD---THGNVAAWTAEGKGIW---EQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGN 580
+V++ GN+ + G G + +Q+L S+ D+D DG+ D+VV S N
Sbjct: 1043 IVSSGYGPGEGNLRIYWNNGDGTFSENQQYLYGSNDSSCSLCDLDADGYMDIVVTGFSSN 1102
>gi|47226031|emb|CAG04405.1| unnamed protein product [Tetraodon nigroviridis]
Length = 972
Score = 39.7 bits (91), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 74/172 (43%), Gaps = 38/172 (22%)
Query: 451 LDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSF---------GLFYV-LDHHGKIR 500
LD +Y +S V DL+ D D++VG F G Y+ ++ +G +
Sbjct: 212 LDFVIPGEQVGSYFGNSLAVTDLNNDDWNDLIVGAPFYFDRMKDQGGAVYIFMNENGSFQ 271
Query: 501 EKFPLEMAEIQGA----VVAA--DINDDGKIELVT----TDTHGNVAAWTAEGKGIWEQH 550
+K + + G+ VAA DIN DG +L DT G V W GI EQ+
Sbjct: 272 KKATVVLKGPSGSAFGFAVAAVGDINQDGFQDLAVGAPFQDT-GMVYIWLGSETGISEQY 330
Query: 551 LKSLVTQGPSIG---------------DVDGDGHSDVVVPTLSGNIYVLSGK 587
+++ QG S+ D+DG+ + D+VV +L I +L +
Sbjct: 331 SQAI--QGKSLSNGGFNTFGYSISGGMDMDGNSYPDIVVGSLDDRIALLRAR 380
>gi|29831019|ref|NP_825653.1| integrin-like protein [Streptomyces avermitilis MA-4680]
gi|29608133|dbj|BAC72188.1| putative integrin-like protein [Streptomyces avermitilis MA-4680]
Length = 481
Score = 39.7 bits (91), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 64/159 (40%), Gaps = 40/159 (25%)
Query: 469 TVVDLDGDGNLDILVGTSFGLFYVLDHHGKIR-------EKFPLEMAEIQG--AVVAADI 519
T D DGDG D+ GT+ YV+ G R ++ L G A+ A D
Sbjct: 163 TAGDFDGDGAADLASGTTASHAYVV-RGGFTRSGTTGSAQRIGLPQTAKYGIDAIKAGDT 221
Query: 520 NDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGP---------------SIGDV 564
N D K +LV T A +G G W + + L T G +IGD+
Sbjct: 222 NGDKKADLVLT----YRTALNEDGSGDWSKGVAYLGTSGGLETSVPRGLNGGTTIAIGDI 277
Query: 565 DGDGHSDVVV-----------PTLSGNIYVLSGKDGSKV 592
DGDG+ ++ + +L G + V+ G +G V
Sbjct: 278 DGDGYGEIALGNVFSTEDDHTGSLGGKVTVIRGSEGGPV 316
>gi|39996640|ref|NP_952591.1| lipoprotein [Geobacter sulfurreducens PCA]
gi|409912063|ref|YP_006890528.1| lipoprotein [Geobacter sulfurreducens KN400]
gi|39983521|gb|AAR34914.1| lipoprotein, putative [Geobacter sulfurreducens PCA]
gi|298505654|gb|ADI84377.1| lipoprotein, putative [Geobacter sulfurreducens KN400]
Length = 404
Score = 39.7 bits (91), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 98/223 (43%), Gaps = 42/223 (18%)
Query: 379 VDSHI-LSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVV 437
+DS + LS + D++NDG+S+++++ D P L EL G + +
Sbjct: 99 IDSSLQLSGITVGDVNNDGLSDLVVS---------GDVP-GLPELRG-------RTKVYI 141
Query: 438 FNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGT---SFGLFYVL- 493
N T + + LSTD A A + DL+ DG DI+V + S GL +
Sbjct: 142 QNRSTHALDPDQEYALSTDTAGTLA-------IADLNSDGRKDIVVASVQGSNGLLSIFF 194
Query: 494 -DHHGKIREKFPLEMAEIQGAVV-AADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHL 551
+ G E+ + + G V AD+N+DG+ +++ +A GI+
Sbjct: 195 QETGGLGAEQTYTSVPVVYGGEVHVADMNNDGRNDIIVQSDLKQLAVIKQSSPGIFNATP 254
Query: 552 KSLVTQGP--------SIGDVDGDGHSDVVV--PTLSG-NIYV 583
Q ++GD++GDG +D+V P +G NI+V
Sbjct: 255 DYYTVQTSYWPDFRSFAVGDINGDGLADIVAADPASNGLNIFV 297
>gi|301621924|ref|XP_002940293.1| PREDICTED: integrin alpha-9-like [Xenopus (Silurana) tropicalis]
Length = 1018
Score = 39.7 bits (91), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 58/280 (20%), Positives = 95/280 (33%), Gaps = 71/280 (25%)
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYV 492
G + +F D + V S +Y SS VDL+ DG D+LVG
Sbjct: 258 GKVFIFRTDKRSVSLVKIFQASGKKVG--SYFGSSLCAVDLNSDGLSDLLVG-------- 307
Query: 493 LDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLK 552
A + +++ D+G+ V + G G+ ++H+
Sbjct: 308 --------------------APMFSEVRDEGQ-----------VTVYINRGNGVMDEHIN 336
Query: 553 ---------SLVTQGPSIGDVDGDGHSDVVVPT-----LSGNIYVLSGKDGSKVRPYPYR 598
++GD+D DG DV + +G +Y+ G V Y R
Sbjct: 337 LNGDNAYNAHFGESMANLGDIDDDGFPDVAIGAPQEDDYAGAVYIYHGDTLGIVSKYSMR 396
Query: 599 THGRVMNQVLL---------VDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIG 649
+ + L+ +D+ K G LTI D + L P DV +
Sbjct: 397 ISSKSIGPGLMMFGQSVSGGIDMDKNGYLD--LTIGAFMSDTAVLLRTRPVITVDVTILL 454
Query: 650 ETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHP 689
TS ++ DG ++ + T+ CF H P
Sbjct: 455 PTSINITAPQCHDGTQQINCVNVTV-----CFRFRGKHVP 489
>gi|167538107|ref|XP_001750719.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770743|gb|EDQ84424.1| predicted protein [Monosiga brevicollis MX1]
Length = 840
Score = 39.7 bits (91), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 72/162 (44%), Gaps = 11/162 (6%)
Query: 467 SPTVVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFPLEM-AEIQGAVVAADINDD 522
S DL+ DG LDIL + +Y D G + + A+ +V AAD+++D
Sbjct: 654 SVFAADLNNDGALDILSASRDDNQIAWYRNDGDGNFSTQMVITTDADGAMSVFAADLDND 713
Query: 523 GKIELVTTDTHGNVAAW-TAEGKGIWE-QHLKSLVTQGPS---IGDVDGDGHSDVVVPTL 577
G +++++ H N AW G G + Q + + +G S D+D DGH DV+ +
Sbjct: 714 GAVDVLSASHHDNKIAWYRNNGDGNFSAQMVITTEAEGASSVYAVDLDNDGHLDVLSASS 773
Query: 578 S-GNI-YVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGE 617
S G I + + DG+ T G + V DL G
Sbjct: 774 SDGKIAWYRNNGDGTFSSQIAIVTEGGGAHSVFAADLDNDGH 815
>gi|441499032|ref|ZP_20981222.1| hypothetical protein C900_03612 [Fulvivirga imtechensis AK7]
gi|441437277|gb|ELR70631.1| hypothetical protein C900_03612 [Fulvivirga imtechensis AK7]
Length = 1035
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 78/178 (43%), Gaps = 25/178 (14%)
Query: 425 IDIGKYVAGAIVVF-NLDTKQVKWT-TDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDIL 482
I + AG +++F N T + + T + L+ AS + V D+DGDG LDI+
Sbjct: 238 IIVTNSAAGQVIIFENTSTATISFNMTPIVLNIAGASSTTGL----DVQDVDGDGRLDII 293
Query: 483 VGTSFGL-FYVLDHHGKIREKFP-LEMAEIQGA----VVAADINDDGKIELVTTDTHGNV 536
V FG +VL + G F E+ + G + D N +GK+++ GN
Sbjct: 294 VCQFFGNDVFVLRNTGTGSIAFAQAEVLSLSGKSLLNLTTGDFNGNGKLDIAVI---GNA 350
Query: 537 AAW-------TAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVVVPTLSGNIYVL 584
A + G + V GP S GDVDGDG D+V +++ N+ +
Sbjct: 351 ADQVFIFPNNSDPGNINFAAPQPFTVNDGPWGISTGDVDGDGSLDIVTTSVNQNLVAV 408
>gi|167525525|ref|XP_001747097.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774392|gb|EDQ88021.1| predicted protein [Monosiga brevicollis MX1]
Length = 436
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 5/86 (5%)
Query: 470 VVDLDGDGNLDILVGTSFG---LFYVLDHHGKIREKFPLE-MAEIQGAVVAADINDDGKI 525
+DLDGDG++D+L + F +Y D G+ + + AE AV A D++ DG
Sbjct: 302 AIDLDGDGDMDVLSASEFDNKIAWYRNDGTGRFSTQIIISTQAEGARAVYAVDLDGDGDP 361
Query: 526 ELVTTDTHGNVAAW-TAEGKGIWEQH 550
++++ + N AW EG G++
Sbjct: 362 DVLSASRYDNKIAWYRNEGAGVFSTQ 387
>gi|116619849|ref|YP_822005.1| FG-GAP repeat-containing protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116223011|gb|ABJ81720.1| FG-GAP repeat protein [Candidatus Solibacter usitatus Ellin6076]
Length = 2558
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 13/94 (13%)
Query: 514 VVAADINDDGKIELVTTDTHGNVAAWTAEGK--------GIWEQHLKSLVTQGPS----I 561
V AD+N DGK +L+ T V + ++ GI + T GPS I
Sbjct: 245 VFVADLNHDGKADLIVTTATNQVVVFISQSTLGTGSAFHGIAFAAPLIIPTSGPSEFVTI 304
Query: 562 GDVDGDGHSDVVVPTLSGNIYVLSGK-DGSKVRP 594
GDV+GDG D+VV T + + ++ G DGS P
Sbjct: 305 GDVNGDGKKDLVVSTGTPTLTIIPGNGDGSFGSP 338
>gi|78189852|ref|YP_380190.1| Nidogen, extracellular region [Chlorobium chlorochromatii CaD3]
gi|78172051|gb|ABB29147.1| Nidogen, extracellular region [Chlorobium chlorochromatii CaD3]
Length = 1557
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 72/166 (43%), Gaps = 14/166 (8%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSFGLF----YVLDHHGKIREKFPLEMAEIQGAVVAA 517
Y +S D +GD +D++V +FG + ++ + + + P + + +VA
Sbjct: 754 GYYATSVASGDFNGDQLMDVVV-ANFGTYSVSVFINNGNNTFTAQEPYLLDDSPYDLVAV 812
Query: 518 DINDDGKIELVTTDTHG-NVAAWTAEGKGIWEQHLKSLVTQGP---SIGDVDGDGHSDVV 573
D N+DG ++L+ +G N++ G G + V P + D +GDG SD+
Sbjct: 813 DYNEDGSLDLLAASNYGNNISVLKGNGNGGFTDCKNYQVGDNPKALATADFNGDGKSDIA 872
Query: 574 VPTLSGNIYV---LSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRG 616
V SGN V L + G + P Y G + + DL G
Sbjct: 873 VAN-SGNNSVSLLLQNEVGEFIAPATYEV-GNNPQAITVADLNGDG 916
>gi|436833792|ref|YP_007319008.1| hypothetical protein FAES_0403 [Fibrella aestuarina BUZ 2]
gi|384065205|emb|CCG98415.1| hypothetical protein FAES_0403 [Fibrella aestuarina BUZ 2]
Length = 1056
Score = 39.7 bits (91), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)
Query: 448 TTDLDLSTDNASFRAYIYSSPT-----VVDLDGDGNLDILV-----GTSFGLFYVLDHHG 497
T+ L++ST A+FR + P V++ G + V GT+ +F + G
Sbjct: 98 TSRLNVST-TANFRGTLVGDPATGVVQVMNAQPTGVYSVTVTGLGAGTAKRVFTLTVGRG 156
Query: 498 KIREKFPL-------EMAEIQGAVVAADINDDGKIELVTTDTHGN-VAAWTAEGKGIWEQ 549
+ + P+ + E+ A+ D+N+DG ++L+T + N V +G G +
Sbjct: 157 PVCSEAPMVVTAEDVGVGELASAIALGDVNNDGNLDLLTANILANTVTVRLGDGTGRFTG 216
Query: 550 HLKSLVTQGPS---IGDVDGDGHSDVVVPTLSGNIYVLSGKDGS 590
V GPS +GD++ DG D + GN + DG+
Sbjct: 217 TTDIAVGAGPSDVQVGDINNDGKLDFLAVNSFGNSVSVRLGDGT 260
>gi|330995512|ref|ZP_08319416.1| FG-GAP repeat protein [Paraprevotella xylaniphila YIT 11841]
gi|329575424|gb|EGG56966.1| FG-GAP repeat protein [Paraprevotella xylaniphila YIT 11841]
Length = 1606
Score = 39.7 bits (91), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 60/142 (42%), Gaps = 13/142 (9%)
Query: 450 DLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAE 509
++ L+T N+ Y Y +P + D+D DG +D + T L Y +G EK E
Sbjct: 732 EICLNTSNSPIGQYNYLTP-IGDMDNDGRVDFIENT---LMYKNTANGVWEEKKLYSGEE 787
Query: 510 IQGAVVAADINDDGKIELVTTDTHGNVAAWTAE----GKGIWEQHLKSLVTQGPS----- 560
V AD N DG +++V + +G W+ L TQ S
Sbjct: 788 YIEPVAVADFNRDGYLDIVAAYRWDRGTPIYRQRLYLNQGDWKFEAHELPTQSSSLTLVG 847
Query: 561 IGDVDGDGHSDVVVPTLSGNIY 582
I DV+GDG D++ +IY
Sbjct: 848 IADVNGDGRPDLIYNDYYKDIY 869
>gi|406910655|gb|EKD50620.1| hypothetical protein ACD_62C00479G0001 [uncultured bacterium]
Length = 702
Score = 39.7 bits (91), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 50/215 (23%), Positives = 83/215 (38%), Gaps = 62/215 (28%)
Query: 433 GAIVVFNLDTKQVKWTTDLDLSTDNASFRA--------YIYSSPTVVDLDGDGNLDILV- 483
GA+ VF+ + T D +S DNA F + Y + + D++GDG D+L
Sbjct: 231 GALEVFDFADNDCEGTVDQGISLDNAVFSITGEASGNYFGYRAVVLSDVNGDGIDDVLAS 290
Query: 484 -------GTSFGLFYVLDHHGKIREKFPLEMAEIQ----------GAVVAA--DINDDGK 524
GTS G Y++ G + E A + G +A+ D+N DG
Sbjct: 291 AYKNSEGGTSAGKVYLIYGAGNLSGDLSAENANVTFVGEAANDYLGLAIASAGDVNGDGL 350
Query: 525 IELVTTDTHGNVAAWTAEGK-----------------GIWEQHLKSLVTQG--------- 558
+ + GN A + GK G+++ + + +G
Sbjct: 351 GDFMIA-AKGNDAGGSNAGKVYLIYGRDFSTDDATVDGVFDLTQANFIIKGQAANNYFGH 409
Query: 559 --PSIGDVDGDGHSDVVVP-----TLSGNIYVLSG 586
S GDV+ DG+ D+++ + G IY+ G
Sbjct: 410 ALASAGDVNADGYGDIMISSKDYNSAQGRIYLFQG 444
>gi|322437491|ref|YP_004219581.1| FG-GAP repeat-containing protein [Granulicella tundricola MP5ACTX9]
gi|321165384|gb|ADW71087.1| FG-GAP repeat protein [Granulicella tundricola MP5ACTX9]
Length = 438
Score = 39.7 bits (91), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 73/171 (42%), Gaps = 19/171 (11%)
Query: 470 VVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP-LEMAEIQGAVVAADINDDGKIELV 528
+ D+DGDG DIL T+ G F +D P + + ++A D+N DG+++L+
Sbjct: 217 IADIDGDGKADIL--TTHGWFQQVDADADKWVWHPDWTLGDAGFPILAYDVNHDGRMDLI 274
Query: 529 TTDTHGNVAAWTAEG----KGIWEQH-LKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYV 583
HG W +G K W +H + +Q ++ +D +G V+ ++G Y
Sbjct: 275 YGQGHGYGLYWLEQGGTKEKPTWTRHTIDESFSQSHALALLDLNGGGPPVL--VTGKRYR 332
Query: 584 -LSGKDGSKVRP---YPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFD 630
SG D P Y YR G + + G + G IV FD
Sbjct: 333 GHSGADPGSYDPNVVYAYRLPGFERTAISV-----NGTATIGTQIVAGDFD 378
>gi|444913988|ref|ZP_21234134.1| hypothetical protein D187_06304 [Cystobacter fuscus DSM 2262]
gi|444715286|gb|ELW56157.1| hypothetical protein D187_06304 [Cystobacter fuscus DSM 2262]
Length = 441
Score = 39.7 bits (91), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 31/122 (25%), Positives = 62/122 (50%), Gaps = 9/122 (7%)
Query: 469 TVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAA-DINDDGKIEL 527
+ DL+ DG LD++VG G+ +L + + +A+ +AA D+N DGK+++
Sbjct: 157 ALADLNRDGCLDVVVGHGSGISVLLADKATGGLRPAVVVADRDADTLAATDVNRDGKLDI 216
Query: 528 VTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPS-----IGDVDGDGHSDVVVPTLSGNIY 582
++ + + +G G + + + T+ +GD++GDG D+ V +SG +Y
Sbjct: 217 ISLGWSRGASLFHGDGTGAFSR-IVPFATKAAGYNDHEVGDLNGDGIPDLAV--MSGQLY 273
Query: 583 VL 584
+
Sbjct: 274 AV 275
>gi|432952280|ref|XP_004085037.1| PREDICTED: integrin alpha-3-like [Oryzias latipes]
Length = 935
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 69/156 (44%), Gaps = 35/156 (22%)
Query: 462 AYIYSSPTVVDLDGDGNLDILVGTSF---------GLFYVLDHHGKIREKFPLEM----- 507
+Y ++ V DL+GDG D+LVG F G Y+ + G E P +
Sbjct: 314 SYFGNAVAVTDLNGDGWNDLLVGAPFYFERQQGKGGAVYIYMNAGAHFESRPTAVLTGPP 373
Query: 508 -AEIQGAVVAA-DINDDGKIEL-VTTDTH--GNVAAWTAEGKGIWEQHLKSLVTQGPSIG 562
+ AV AA D+N DG ++ V H G+V W KG++ + S V +G SI
Sbjct: 374 GSAFGIAVTAAGDLNQDGFQDIAVGAPFHETGSVMIWMGSEKGVFAE--PSQVIRGSSIS 431
Query: 563 --------------DVDGDGHSDVVVPTLSGNIYVL 584
DVDG+ + D++V +L + +L
Sbjct: 432 SRFRTFGYSLSAGVDVDGNRYPDLLVGSLDDTVALL 467
>gi|449135867|ref|ZP_21771296.1| FG-GAP repeat-containing protein [Rhodopirellula europaea 6C]
gi|448885482|gb|EMB15924.1| FG-GAP repeat-containing protein [Rhodopirellula europaea 6C]
Length = 372
Score = 39.3 bits (90), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 84/208 (40%), Gaps = 39/208 (18%)
Query: 470 VVDLDGDGNLDILVG---------TSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADIN 520
V D++GDG DILVG +G ++ K+ P+ + DI+
Sbjct: 183 VGDINGDGRNDILVGWGWYEQPTENPWGQPWIAHRDWKLHASLPM---------LIEDID 233
Query: 521 DDGKIELVTTDTHGNVAAW------TAEGKGIWEQH-LKSLVTQGPSI--GDVDGDGHSD 571
DG +L+ + H W T G W++H + +Q S+ D+DGDG+ D
Sbjct: 234 QDGDQDLIYGEGHNYGLQWWENTGVTDSGSIEWKEHEIDRRFSQPHSMVWVDLDGDGNKD 293
Query: 572 VVVPTLSGNIYVLSGKD-GSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFD 630
+V Y +G D G K P Y V ++ + G GL IV +
Sbjct: 294 LVT---GKRYYAHNGNDPGGKEVPCLYWYSFDVASKSFQRHVIDEGRVGTGLQIVAEDLN 350
Query: 631 GYLYLIDGPTSCADVVDIGETSYSMVLA 658
G DG T D+ G++ ++LA
Sbjct: 351 G-----DGKT---DLAVAGKSGTYLLLA 370
>gi|186681807|ref|YP_001865003.1| FG-GAP repeat-containing protein [Nostoc punctiforme PCC 73102]
gi|186464259|gb|ACC80060.1| FG-GAP repeat protein [Nostoc punctiforme PCC 73102]
Length = 407
Score = 39.3 bits (90), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 7/111 (6%)
Query: 467 SPTVVDLDGDGNLDIL---VGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDG 523
+PT+VD DG+G DI T + +D G + +F L+ + D N D
Sbjct: 213 TPTIVDFDGNGKSDIFWRNATTGDNSAWFMD--GAQKTEFALQSQDAAWTASVGDFNGDL 270
Query: 524 KIELVTTDTH-GNVAAWTAEGKGIWEQHLKSLVTQGPS-IGDVDGDGHSDV 572
+++ + G WT G + E L +L + S IGD +GDG +D+
Sbjct: 271 STDILWRNAQTGENKVWTMRGILVTEGALGTLDSSWTSKIGDFNGDGKTDI 321
>gi|74179537|dbj|BAE22455.1| unnamed protein product [Mus musculus]
Length = 675
Score = 39.3 bits (90), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 57/119 (47%), Gaps = 18/119 (15%)
Query: 381 SHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNL 440
S+ T V++D++NDG+ ++++ F + E+ NP +E+G + + Y+ + ++F
Sbjct: 318 SYFGYTVVVSDVNNDGMDDILVGAPLFMEREFESNP---REVGQVYL--YLQASALLFQ- 371
Query: 441 DTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKI 499
D + T +F + S + DL+ DG DI +G F D GK+
Sbjct: 372 ---------DPQVLTGTETFGRFGSSVAHLGDLNQDGYNDIAIGVPFA---GKDQRGKV 418
>gi|254432883|ref|ZP_05046585.1| FG-GAP repeat domain protein [Cyanobium sp. PCC 7001]
gi|197624563|gb|EDY37123.1| FG-GAP repeat domain protein [Cyanobium sp. PCC 7001]
Length = 613
Score = 39.3 bits (90), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 76/182 (41%), Gaps = 42/182 (23%)
Query: 458 ASFRAYIYSSPTVVDLDGDGNLDILVGTSF------GLFYVLDHHGKIREKFPLEMAEIQ 511
ASF YS + D++GDG D+L+G F G YV+ +GK P+E+A+I
Sbjct: 186 ASFNNSGYSVSSAGDVNGDGLADLLIGAPFAGPYSNGRSYVV--YGKSVNTNPVELADI- 242
Query: 512 GAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSD 571
AA I + + D + + W+ G GDV+GDG +D
Sbjct: 243 ----AAGIGGFVILGEQSADFSSDASGWSVAGA-----------------GDVNGDGLAD 281
Query: 572 VVVPTL--------SGNIYVLSGKDGSK----VRPYPYRTHGRVMNQVLLVDLTKRGEKS 619
++V + SG YV+ GK + + T G V+N D++ R S
Sbjct: 282 LLVGSYGNDANGNNSGRSYVVFGKSDNNNPVDLATLAGSTRGFVINGESDYDVSGRSVAS 341
Query: 620 KG 621
G
Sbjct: 342 AG 343
>gi|297182478|gb|ADI18641.1| rtx toxins and related ca2+-binding proteins [uncultured
Rhodospirillales bacterium HF4000_24M03]
Length = 4593
Score = 39.3 bits (90), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 62/235 (26%), Positives = 103/235 (43%), Gaps = 30/235 (12%)
Query: 470 VVDLDGDGNLDILVGTSF--GLFYVLDHHGKIREKFPLE--MAEIQGA--VVAADINDDG 523
V D+DGDG+LDI V SF G +++G + GA V AD++ DG
Sbjct: 1156 VADMDGDGDLDI-VSVSFLDGTIAWYENNGAADPTWTAANIATSADGAYGVHVADMDGDG 1214
Query: 524 KIELVTTDTHGNVAAWTAEGKG----IWEQHLKSLVTQGPS---IGDVDGDGHSDVVVPT 576
+++V+ + + AW E G W + G + + D+DGDG D+V +
Sbjct: 1215 DLDIVSASFNDDTIAWY-ENNGAADPTWTAANIATSANGATDVHVADMDGDGDLDIVSAS 1273
Query: 577 LS-GNI--YVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDG-- 631
+ G I Y +G T + V + D+ G+ L IV+ S+
Sbjct: 1274 YADGTIAWYENNGAADPTWTAADIATSADLAYDVHIADMDGDGD----LDIVSASYGDDT 1329
Query: 632 -YLYLIDG---PT-SCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCF 681
Y DG PT + AD+ + ++ + +AD +DG DLD++ + N + +
Sbjct: 1330 IAWYENDGAADPTWTAADIATSADGAFDVHVAD-MDGDGDLDIVSASRNDDTIAW 1383
>gi|149921581|ref|ZP_01910031.1| FG-GAP repeat/HVR domain protein [Plesiocystis pacifica SIR-1]
gi|149817533|gb|EDM77002.1| FG-GAP repeat/HVR domain protein [Plesiocystis pacifica SIR-1]
Length = 707
Score = 39.3 bits (90), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 53/124 (42%), Gaps = 13/124 (10%)
Query: 463 YIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFP-----------LEMAEIQ 511
Y PT+ D +GDG+ D+ + G + V+D + P + + Q
Sbjct: 462 YGGGPPTIADFNGDGSPDVAIAGGVG-YAVIDGTKIMDPMIPDVDTFLWIQQTTDCSSAQ 520
Query: 512 GAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQ-HLKSLVTQGPSIGDVDGDGHS 570
D + DG E+V +D A G +W+ + + + P + DVD DGH+
Sbjct: 521 TGSSVFDFDGDGAAEVVYSDQVYLRIYAGATGDVLWQTCNTTGTLRELPVVADVDNDGHA 580
Query: 571 DVVV 574
D+VV
Sbjct: 581 DIVV 584
>gi|420178147|ref|ZP_14684480.1| Gram-positive signal peptide protein, YSIRK family [Staphylococcus
epidermidis NIHLM057]
gi|394246773|gb|EJD92025.1| Gram-positive signal peptide protein, YSIRK family [Staphylococcus
epidermidis NIHLM057]
Length = 10140
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 78/166 (46%), Gaps = 20/166 (12%)
Query: 222 TSTESNPAPATVSNPDVKKVNESLVNVSNPSEERKVNESHTEMNI-KLPTSVDNSSTTTV 280
+ST N A A +SN VK NE+ +S + +N+SHTE N+ P VDNS+ T
Sbjct: 82 SSTSHNQANAPLSNQ-VKDPNETSQQLSTNVSDDSINQSHTETNMNNEPLQVDNSTMQTH 140
Query: 281 SGGTNSSENGTNTG---RRLLEDNNSKGSQEGNDKEDVPVATAENDQA----LDENA--- 330
S S NG +G +L E+ ++ NDKE + + Q LD+NA
Sbjct: 141 SKKV-SDSNGNTSGNEHHKLTENVLAESRASKNDKEKENLQEKDKPQQVQPPLDKNALQA 199
Query: 331 --DSSFELFRDTD-ELADEYNYDYD----DYVDDAMWGDEEWTEEQ 369
D+S+ +R D + AD Y DYV+D + ++ EQ
Sbjct: 200 FFDASYYNYRMIDRDRADTTEYQKVKSAFDYVNDLLGNNQNIPSEQ 245
>gi|149178817|ref|ZP_01857398.1| hypothetical protein PM8797T_06527 [Planctomyces maris DSM 8797]
gi|148842358|gb|EDL56740.1| hypothetical protein PM8797T_06527 [Planctomyces maris DSM 8797]
Length = 1742
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 70/173 (40%), Gaps = 24/173 (13%)
Query: 469 TVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGA-----VVAADINDDG 523
T DL+ DG D++ G +Y K E +P++ I G DIN D
Sbjct: 44 TFADLNQDGKTDVISGP---YWYEGPEFQKRHEFYPVKEWSINGYSDNFFAFTHDINKDK 100
Query: 524 KIELVTTDTHGNVAAWTAEGKGI---WEQHLK--SLVTQGPSIGDVDGDGHSDVVVPTLS 578
++V G A W A + W++HL ++ + P+ D+ GDG ++V T
Sbjct: 101 WPDIVIIGFPGKEAFWYANPQNKERHWKKHLAHPNVDNESPTYTDLTGDGEPELVFHTGG 160
Query: 579 GNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLTIVTTSFDG 631
Y GKD + P++ H N K G + GL + + DG
Sbjct: 161 QLGYAGPGKDPTA----PWQFHAISPN-------LKYGRFTHGLGVGDVNGDG 202
>gi|108763581|ref|YP_628389.1| hypothetical protein MXAN_0106 [Myxococcus xanthus DK 1622]
gi|108467461|gb|ABF92646.1| FG-GAP repeat domain protein [Myxococcus xanthus DK 1622]
Length = 667
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 54/141 (38%), Gaps = 27/141 (19%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDHHGKIREKFPLE---------MAEIQGAVVAADINDD 522
D+DGDG D+L+ S G Y L H K P + + + + D+N D
Sbjct: 245 DVDGDGAQDLLLLDSMGDVYSLLHGCKAGSPGPCDGGLAAQPFWTVDSRTLSLFPDVNGD 304
Query: 523 GKIELVTTDTHGNV--------------AAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDG 568
G E T G V WTA G ++ ++ ++GDVDGDG
Sbjct: 305 GVPERFTGPGGGTVRLHLSEAGGGYASAPTWTAMGDPVFANFGATIT----AVGDVDGDG 360
Query: 569 HSDVVVPTLSGNIYVLSGKDG 589
V G +Y S +G
Sbjct: 361 QRQDFVAGSEGRLYFYSPAEG 381
>gi|297199853|ref|ZP_06917250.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|297147514|gb|EFH28664.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length = 499
Score = 39.3 bits (90), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 56/128 (43%), Gaps = 28/128 (21%)
Query: 472 DLDGDGNLDILVGTSFGLFYVLDH---------HGKIREKFPLEMAEIQG------AVVA 516
D DGDG D+ VG+S YV G+ K P+ M G + A
Sbjct: 179 DFDGDGKADLAVGSSSATLYVYKGGFSSSTGLPGGRYTIKPPI-MPGSSGFPYGPEQLTA 237
Query: 517 ADINDDGKIELVT------TDTHGNVAAW---TAEGKGIWE-QHLKSLVTQGPSIGDVDG 566
D+N DG+ +LV TD N W TA G + + LK + G IGD++G
Sbjct: 238 GDVNGDGRTDLVVDGYETQTDNGWNTNYWVPGTASGLSVAGVKTLKPGIVTG--IGDING 295
Query: 567 DGHSDVVV 574
DG+ D+V
Sbjct: 296 DGYGDIVT 303
>gi|254442111|ref|ZP_05055587.1| FG-GAP repeat domain protein [Verrucomicrobiae bacterium DG1235]
gi|198256419|gb|EDY80727.1| FG-GAP repeat domain protein [Verrucomicrobiae bacterium DG1235]
Length = 421
Score = 39.3 bits (90), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 46/93 (49%), Gaps = 18/93 (19%)
Query: 467 SPTVVDLDGDGNLDILVGTSFG--------LFYVLDHHG-KIREKFPLEMAEIQGAVV-A 516
SP VVD+DGDG+LD + ++F V ++ G ++ EK L VV A
Sbjct: 328 SPRVVDIDGDGDLDFVTTSAFNDWTQEDAESLVVFENRGDQVFEKQVLANRPTHLIVVDA 387
Query: 517 ADINDDGKIELVTTDTH--------GNVAAWTA 541
AD++ DG++ELVT H VA W A
Sbjct: 388 ADMDGDGRVELVTGGLHFYPPFVHVSRVALWEA 420
>gi|325982593|ref|YP_004294995.1| hypothetical protein NAL212_1996 [Nitrosomonas sp. AL212]
gi|325532112|gb|ADZ26833.1| hypothetical protein NAL212_1996 [Nitrosomonas sp. AL212]
Length = 1231
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 38/160 (23%), Positives = 64/160 (40%), Gaps = 28/160 (17%)
Query: 466 SSPTVVDLDGDGNLDILVGTSFGLFYVLDHHG-------KIREKFPLEMAEIQGAV--VA 516
++PT VD+DGDG+ D VG G + G + P ++++ A
Sbjct: 134 ANPTFVDIDGDGDQDAFVGNGVGNTLFFQNTGTANNSVFAVASTNPFGLSDVGYAANPTF 193
Query: 517 ADINDDGKIELVTTDTHGNVAAWTAEGKG---IWEQHLKSLVTQG-------PSIGDVDG 566
DI+ DG + + GN + G ++ + + + G P I D+D
Sbjct: 194 GDIDSDGDYDAFVGNRDGNTLFFENTGTASNPVFTSAITNPLGLGDVGQSASPIIADIDS 253
Query: 567 DGHSDVVVPTLSGN-----IYVLSGKDGSKVRP----YPY 597
DG D+ + SGN I + +G+ P YP+
Sbjct: 254 DGDQDIFIGNGSGNSNIVAITLFFQNNGTASTPVFSAYPF 293
>gi|168335080|ref|ZP_02693190.1| S-layer domain protein [Epulopiscium sp. 'N.t. morphotype B']
Length = 1165
Score = 39.3 bits (90), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 30/62 (48%), Gaps = 1/62 (1%)
Query: 534 GNVAAWTAEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVR 593
G+V A+ GK IW + G + D+DGDG +V+ + G +Y DG+ +
Sbjct: 280 GSVVAYDNAGKFIWHAKAGETIINGLEVADIDGDGFDEVLAASTDGRLYAFD-HDGTALW 338
Query: 594 PY 595
Y
Sbjct: 339 TY 340
>gi|46123721|ref|XP_386414.1| hypothetical protein FG06238.1 [Gibberella zeae PH-1]
Length = 1461
Score = 39.3 bits (90), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 73/273 (26%), Positives = 110/273 (40%), Gaps = 70/273 (25%)
Query: 376 YVNVDSHILSTPVI--ADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAG 433
Y+N D + +I AD+DNDG +++I Y+N K +D GK
Sbjct: 333 YINPDWKSVKGNMIRLADVDNDGKADLI---------AIYEN-GAAKVWKNVDNGK---- 378
Query: 434 AIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLF-YV 492
++ KW T L+ S D SF+ D+DGDG D ++ S G +
Sbjct: 379 -----KFESMDSKWATGLE-SRDKVSFQ----------DIDGDGYADYVIVYSGGAVKWA 422
Query: 493 LDHH--GKIREKFPLEMAEI----------QGAVVAADINDDGKIELVTTDTHGNVAAWT 540
+ H GK K E AE+ +G+ DI+ DGK + + G V A+
Sbjct: 423 RNTHNNGKDSSKKNWE-AEVTIAPGPSGIPEGSTRIQDIDGDGKADYLVVYDGGAVKAFR 481
Query: 541 ---AEGKGIWEQHLKSLVTQGPSIG----------DVDGDGHSDVVVPTLSGNIYV---- 583
EG W+ L T P IG D+DGDG +D + G+I +
Sbjct: 482 NTLKEGGRNWDD----LGTVAPGIGGVTGDMIRFADMDGDGLADFLAVADDGSIRMWKNL 537
Query: 584 -LSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKR 615
++G G +R GR + ++ VD R
Sbjct: 538 GITGTKGQSIRFADLTGEGR--DDLISVDAKGR 568
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.134 0.391
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,627,967,786
Number of Sequences: 23463169
Number of extensions: 683678607
Number of successful extensions: 1988320
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 257
Number of HSP's successfully gapped in prelim test: 3788
Number of HSP's that attempted gapping in prelim test: 1950409
Number of HSP's gapped (non-prelim): 32905
length of query: 857
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 705
effective length of database: 8,792,793,679
effective search space: 6198919543695
effective search space used: 6198919543695
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 82 (36.2 bits)