BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 031186
(164 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297734936|emb|CBI17170.3| unnamed protein product [Vitis vinifera]
Length = 927
Score = 202 bits (515), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 117/163 (71%), Positives = 129/163 (79%), Gaps = 4/163 (2%)
Query: 1 MELALQSLSSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAAS 60
MELAL +L + +RD K + SS D+K Q Q RK+ VS ALPETAAS
Sbjct: 768 MELALHTLPLKT----SRDFKPHIHNAFDPDPISSTDVKLQAQRRKICRVSYALPETAAS 823
Query: 61 VAIAATVVGAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAE 120
VAIAATVVGAAATLLVRR++ SE TEIPLK CEDCGGSGIC EC GEGFVLKKLSE +AE
Sbjct: 824 VAIAATVVGAAATLLVRRSRPSEATEIPLKICEDCGGSGICSECNGEGFVLKKLSEASAE 883
Query: 121 RARLTAKNMATRYTAGLPKKWSYCTKCSSSRSCATCGGRGKLS 163
+ARLTAKNMATRYTAGLPKKWSYCTKCSS+RSC+TCGG GKLS
Sbjct: 884 KARLTAKNMATRYTAGLPKKWSYCTKCSSARSCSTCGGSGKLS 926
>gi|359479447|ref|XP_002273853.2| PREDICTED: uncharacterized protein LOC100258439 [Vitis vinifera]
Length = 176
Score = 201 bits (510), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 117/163 (71%), Positives = 129/163 (79%), Gaps = 4/163 (2%)
Query: 1 MELALQSLSSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAAS 60
MELAL +L + +RD K + SS D+K Q Q RK+ VS ALPETAAS
Sbjct: 17 MELALHTLPLKT----SRDFKPHIHNAFDPDPISSTDVKLQAQRRKICRVSYALPETAAS 72
Query: 61 VAIAATVVGAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAE 120
VAIAATVVGAAATLLVRR++ SE TEIPLK CEDCGGSGIC EC GEGFVLKKLSE +AE
Sbjct: 73 VAIAATVVGAAATLLVRRSRPSEATEIPLKICEDCGGSGICSECNGEGFVLKKLSEASAE 132
Query: 121 RARLTAKNMATRYTAGLPKKWSYCTKCSSSRSCATCGGRGKLS 163
+ARLTAKNMATRYTAGLPKKWSYCTKCSS+RSC+TCGG GKLS
Sbjct: 133 KARLTAKNMATRYTAGLPKKWSYCTKCSSARSCSTCGGSGKLS 175
>gi|224106816|ref|XP_002314295.1| predicted protein [Populus trichocarpa]
gi|222850703|gb|EEE88250.1| predicted protein [Populus trichocarpa]
Length = 163
Score = 194 bits (494), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 120/164 (73%), Positives = 131/164 (79%), Gaps = 1/164 (0%)
Query: 1 MELALQSLSSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAAS 60
MELA+ +LS SFS R+ KL+ T SSID+K QEQ RK TV +LPETAAS
Sbjct: 1 MELAVHALSFTSFSK-NRESKLQGRTIPDPATCSSIDVKLQEQGRKTSTVCYSLPETAAS 59
Query: 61 VAIAATVVGAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAE 120
VAIAAT VGAA TLLVRR K SE EIPLKTCEDCGGSGIC EC GEGFVLKKLSEE+AE
Sbjct: 60 VAIAATAVGAAITLLVRRNKPSEADEIPLKTCEDCGGSGICSECSGEGFVLKKLSEESAE 119
Query: 121 RARLTAKNMATRYTAGLPKKWSYCTKCSSSRSCATCGGRGKLSL 164
RARL+AKNMATRYTAGLPKKWSYCTKCSS+RSC+ CGG GKL+
Sbjct: 120 RARLSAKNMATRYTAGLPKKWSYCTKCSSARSCSACGGSGKLNY 163
>gi|255565399|ref|XP_002523690.1| conserved hypothetical protein [Ricinus communis]
gi|223536994|gb|EEF38630.1| conserved hypothetical protein [Ricinus communis]
Length = 213
Score = 186 bits (473), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 102/131 (77%), Positives = 113/131 (86%)
Query: 33 FSSIDIKSQEQERKLPTVSSALPETAASVAIAATVVGAAATLLVRRTKGSEETEIPLKTC 92
+SSI +K Q+ +R T+SSALPET AS+AIAA VVG AAT+LVRRTK +E EI LKTC
Sbjct: 82 WSSIGVKLQKHKRNASTISSALPETTASLAIAAAVVGTAATVLVRRTKATETNEIQLKTC 141
Query: 93 EDCGGSGICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYCTKCSSSRS 152
EDC GSG+C ECKGEGFVLKKLSEE+AERARL AKNMATRYTA LPKKWSYCTKCSS+RS
Sbjct: 142 EDCEGSGLCSECKGEGFVLKKLSEESAERARLNAKNMATRYTAALPKKWSYCTKCSSARS 201
Query: 153 CATCGGRGKLS 163
C TCGGRGKLS
Sbjct: 202 CLTCGGRGKLS 212
>gi|7671406|emb|CAB89320.1| putative protein [Arabidopsis thaliana]
Length = 148
Score = 185 bits (470), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 100/165 (60%), Positives = 113/165 (68%), Gaps = 18/165 (10%)
Query: 1 MELALQSLSSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAAS 60
MEL + SLSS + DLK S ++ ++ TV AL ETA S
Sbjct: 1 MELRVNSLSSQDVRFLNLDLK-----------------SSHQRAKRSSTVVPALAETAVS 43
Query: 61 VAIAATVVGAAATLLVRR-TKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETA 119
+AIAATVVG AAT+LVRR K SEE E +K CE C GSGICPECKGEGFVLKKLS+ A
Sbjct: 44 IAIAATVVGTAATILVRRNNKASEEAEASMKECEACLGSGICPECKGEGFVLKKLSDANA 103
Query: 120 ERARLTAKNMATRYTAGLPKKWSYCTKCSSSRSCATCGGRGKLSL 164
E+ARL AKNMATRYTAGLPKKWSYCTKCSS+RSC CGG GK S+
Sbjct: 104 EKARLAAKNMATRYTAGLPKKWSYCTKCSSTRSCMICGGSGKTSI 148
>gi|30692502|ref|NP_190091.2| uncharacterized protein [Arabidopsis thaliana]
gi|30692507|ref|NP_850657.1| uncharacterized protein [Arabidopsis thaliana]
gi|42572581|ref|NP_974386.1| uncharacterized protein [Arabidopsis thaliana]
gi|222423502|dbj|BAH19721.1| AT3G45050 [Arabidopsis thaliana]
gi|332644463|gb|AEE77984.1| uncharacterized protein [Arabidopsis thaliana]
gi|332644464|gb|AEE77985.1| uncharacterized protein [Arabidopsis thaliana]
gi|332644465|gb|AEE77986.1| uncharacterized protein [Arabidopsis thaliana]
Length = 158
Score = 184 bits (466), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 94/134 (70%), Positives = 106/134 (79%), Gaps = 2/134 (1%)
Query: 33 FSSIDIKSQEQE-RKLPTVSSALPETAASVAIAATVVGAAATLLVRRT-KGSEETEIPLK 90
F ++D+KS Q ++ TV AL ETA S+AIAATVVG AAT+LVRR K SEE E +K
Sbjct: 25 FLNLDLKSSHQRAKRSSTVVPALAETAVSIAIAATVVGTAATILVRRNNKASEEAEASMK 84
Query: 91 TCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYCTKCSSS 150
CE C GSGICPECKGEGFVLKKLS+ AE+ARL AKNMATRYTAGLPKKWSYCTKCSS+
Sbjct: 85 ECEACLGSGICPECKGEGFVLKKLSDANAEKARLAAKNMATRYTAGLPKKWSYCTKCSST 144
Query: 151 RSCATCGGRGKLSL 164
RSC CGG GK S+
Sbjct: 145 RSCMICGGSGKTSI 158
>gi|297815652|ref|XP_002875709.1| hypothetical protein ARALYDRAFT_484900 [Arabidopsis lyrata subsp.
lyrata]
gi|297321547|gb|EFH51968.1| hypothetical protein ARALYDRAFT_484900 [Arabidopsis lyrata subsp.
lyrata]
Length = 163
Score = 180 bits (457), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 101/165 (61%), Positives = 118/165 (71%), Gaps = 3/165 (1%)
Query: 1 MELALQSLSSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAAS 60
MEL+L+ L S SF L + + + F ++ + R L TV+ AL ETA S
Sbjct: 1 MELSLK-LQSFSFPSSVSTLTISLTFEVLFFVFEGVNNARFLKTRSL-TVTPALAETAVS 58
Query: 61 VAIAATVVGAAATLLVRRT-KGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETA 119
+AIAATVVG AAT+L RR+ K +EE E K CE C GSGICPECKGEGFVLKKLS+ A
Sbjct: 59 IAIAATVVGTAATILARRSSKAAEEAEASTKECEACLGSGICPECKGEGFVLKKLSDANA 118
Query: 120 ERARLTAKNMATRYTAGLPKKWSYCTKCSSSRSCATCGGRGKLSL 164
E+ARLTAKNMATRYTAGLPKKWSYCTKCSS+RSC TCGG GK S+
Sbjct: 119 EKARLTAKNMATRYTAGLPKKWSYCTKCSSTRSCITCGGSGKTSI 163
>gi|359806852|ref|NP_001241314.1| uncharacterized protein LOC100779715 [Glycine max]
gi|255640844|gb|ACU20705.1| unknown [Glycine max]
Length = 148
Score = 176 bits (447), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 102/146 (69%), Positives = 119/146 (81%), Gaps = 1/146 (0%)
Query: 19 DLKLRVCTGLHGKAFS-SIDIKSQEQERKLPTVSSALPETAASVAIAATVVGAAATLLVR 77
DL L V + FS S ++K Q Q R+L TVSSAL ETAAS+A+A TVVGAAATLLV+
Sbjct: 2 DLHLSVRLPVSLPCFSHSQNVKLQLQRRRLSTVSSALAETAASMAVAVTVVGAAATLLVK 61
Query: 78 RTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGL 137
R+K SE ++I K CEDCGGSGIC ECKGEGFVL+K S+E+AE+AR+ AKNMATR+TAGL
Sbjct: 62 RSKTSESSQIQFKACEDCGGSGICSECKGEGFVLRKRSDESAEKARVQAKNMATRFTAGL 121
Query: 138 PKKWSYCTKCSSSRSCATCGGRGKLS 163
PKKWSYCTKCSS RSC+TCGG GKLS
Sbjct: 122 PKKWSYCTKCSSGRSCSTCGGSGKLS 147
>gi|356542656|ref|XP_003539782.1| PREDICTED: uncharacterized protein LOC100778669 [Glycine max]
Length = 148
Score = 175 bits (443), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 100/147 (68%), Positives = 119/147 (80%), Gaps = 1/147 (0%)
Query: 19 DLKLRVCTGLHGKAFS-SIDIKSQEQERKLPTVSSALPETAASVAIAATVVGAAATLLVR 77
DL+++ + FS S ++K Q Q R+L TVS AL ETAAS+A+A TVVGAAATLLV+
Sbjct: 2 DLRVQHPVDVSLPCFSHSQNVKLQLQRRRLSTVSFALAETAASMAVAVTVVGAAATLLVK 61
Query: 78 RTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGL 137
R+K SE T+I K CEDCGGSGIC ECKGEGFVL+K S+E+AE+AR+ AKNMATR+TAGL
Sbjct: 62 RSKTSESTQIQFKVCEDCGGSGICSECKGEGFVLRKRSDESAEKARVQAKNMATRFTAGL 121
Query: 138 PKKWSYCTKCSSSRSCATCGGRGKLSL 164
PKKWSYCTKCSS RSC+TCGG GKLS
Sbjct: 122 PKKWSYCTKCSSGRSCSTCGGSGKLSY 148
>gi|449459098|ref|XP_004147283.1| PREDICTED: uncharacterized protein LOC101218484 [Cucumis sativus]
gi|449501230|ref|XP_004161313.1| PREDICTED: uncharacterized LOC101218484 [Cucumis sativus]
Length = 164
Score = 169 bits (429), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 107/166 (64%), Positives = 123/166 (74%), Gaps = 6/166 (3%)
Query: 1 MELALQSLSSASFSIITRDLKLR---VCTGLHGKAFSSIDIKSQEQERKLPTVSSALPET 57
MEL L + S I R+ K +C L S+ D+K Q R L T++S+LPET
Sbjct: 1 MELTLPTPLFVSIPKIVRNSKPNNNGLCL-LDSNPLSA-DVKLQGGRRTL-TIASSLPET 57
Query: 58 AASVAIAATVVGAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEE 117
AASVAIAATVVGAAAT L RR K SE E+PL TCEDCGGSG+C ECKGEGFVLKKLS+E
Sbjct: 58 AASVAIAATVVGAAATFLSRRNKNSEAVEVPLITCEDCGGSGLCSECKGEGFVLKKLSDE 117
Query: 118 TAERARLTAKNMATRYTAGLPKKWSYCTKCSSSRSCATCGGRGKLS 163
AERARL AKNMATR+TA LPKKWSYC+KCSS+RSC+TCGG G L+
Sbjct: 118 NAERARLAAKNMATRFTAALPKKWSYCSKCSSARSCSTCGGSGTLN 163
>gi|30692497|ref|NP_850656.1| uncharacterized protein [Arabidopsis thaliana]
gi|26452206|dbj|BAC43191.1| unknown protein [Arabidopsis thaliana]
gi|105829624|gb|ABF74699.1| At3g45050 [Arabidopsis thaliana]
gi|332644462|gb|AEE77983.1| uncharacterized protein [Arabidopsis thaliana]
Length = 130
Score = 136 bits (342), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 72/106 (67%), Positives = 82/106 (77%), Gaps = 2/106 (1%)
Query: 33 FSSIDIKSQEQE-RKLPTVSSALPETAASVAIAATVVGAAATLLVRRT-KGSEETEIPLK 90
F ++D+KS Q ++ TV AL ETA S+AIAATVVG AAT+LVRR K SEE E +K
Sbjct: 25 FLNLDLKSSHQRAKRSSTVVPALAETAVSIAIAATVVGTAATILVRRNNKASEEAEASMK 84
Query: 91 TCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAG 136
CE C GSGICPECKGEGFVLKKLS+ AE+ARL AKNMATRYTAG
Sbjct: 85 ECEACLGSGICPECKGEGFVLKKLSDANAEKARLAAKNMATRYTAG 130
>gi|413926809|gb|AFW66741.1| hypothetical protein ZEAMMB73_024799 [Zea mays]
Length = 190
Score = 121 bits (304), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 57/86 (66%), Positives = 69/86 (80%), Gaps = 1/86 (1%)
Query: 77 RRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAG 136
RR +G EE ++ + C DCGG+G+C CKGEGFV K+LSEETA RAR AKNMATRYT+G
Sbjct: 99 RRQEGREE-QVEGEECPDCGGTGLCGRCKGEGFVFKQLSEETATRARKAAKNMATRYTSG 157
Query: 137 LPKKWSYCTKCSSSRSCATCGGRGKL 162
LP KW+YC KCSS+RSC TCGG G++
Sbjct: 158 LPTKWTYCNKCSSTRSCTTCGGSGRI 183
>gi|242063828|ref|XP_002453203.1| hypothetical protein SORBIDRAFT_04g001625 [Sorghum bicolor]
gi|241933034|gb|EES06179.1| hypothetical protein SORBIDRAFT_04g001625 [Sorghum bicolor]
Length = 197
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 48/65 (73%), Positives = 55/65 (84%)
Query: 92 CEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYCTKCSSSR 151
C DCGG+G+C CKGEGFV K+LS+ETA +AR AKNMATRYTAGLP KW+YC KCSS+R
Sbjct: 133 CPDCGGTGLCGRCKGEGFVFKQLSDETATKARKAAKNMATRYTAGLPTKWTYCNKCSSTR 192
Query: 152 SCATC 156
SC TC
Sbjct: 193 SCTTC 197
>gi|116779021|gb|ABK21103.1| unknown [Picea sitchensis]
Length = 97
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/96 (61%), Positives = 63/96 (65%), Gaps = 1/96 (1%)
Query: 67 VVGAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTA 126
VVGAA T L R K S E K CE C GSGIC EC GEGF+LK LSE A +AR A
Sbjct: 2 VVGAAVTSLARMNK-STTIEPKQKICEVCNGSGICGECNGEGFILKNLSEAAAAKARQNA 60
Query: 127 KNMATRYTAGLPKKWSYCTKCSSSRSCATCGGRGKL 162
K ATRYTAGL KKW+YC KCS SR C C GRG +
Sbjct: 61 KTAATRYTAGLAKKWNYCPKCSGSRGCIVCDGRGTI 96
>gi|54291568|dbj|BAD62492.1| unknown protein [Oryza sativa Japonica Group]
Length = 162
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 67/114 (58%), Positives = 81/114 (71%), Gaps = 3/114 (2%)
Query: 52 SALPETAASVAIAATVVGAAATLLVRRTKGSEETEIPLKT---CEDCGGSGICPECKGEG 108
+ALPETAA+ AA +VG AA+LLV R E + C +CGG+G+CP CKGEG
Sbjct: 38 AALPETAAACVAAAGLVGVAASLLVGRAAEGARREEEEEEEEECSECGGTGLCPRCKGEG 97
Query: 109 FVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYCTKCSSSRSCATCGGRGKL 162
FV K+L EE A RAR AKNMATRYT+GLP KW+YC +CSS+RSC TCGG G +
Sbjct: 98 FVFKQLPEEAASRARKAAKNMATRYTSGLPTKWTYCNRCSSTRSCTTCGGSGAI 151
>gi|297606569|ref|NP_001058663.2| Os06g0731300 [Oryza sativa Japonica Group]
gi|255677429|dbj|BAF20577.2| Os06g0731300, partial [Oryza sativa Japonica Group]
Length = 134
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 67/114 (58%), Positives = 81/114 (71%), Gaps = 3/114 (2%)
Query: 52 SALPETAASVAIAATVVGAAATLLVRRTKGSEETEIPLKT---CEDCGGSGICPECKGEG 108
+ALPETAA+ AA +VG AA+LLV R E + C +CGG+G+CP CKGEG
Sbjct: 10 AALPETAAACVAAAGLVGVAASLLVGRAAEGARREEEEEEEEECSECGGTGLCPRCKGEG 69
Query: 109 FVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYCTKCSSSRSCATCGGRGKL 162
FV K+L EE A RAR AKNMATRYT+GLP KW+YC +CSS+RSC TCGG G +
Sbjct: 70 FVFKQLPEEAASRARKAAKNMATRYTSGLPTKWTYCNRCSSTRSCTTCGGSGAI 123
>gi|167999005|ref|XP_001752208.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696603|gb|EDQ82941.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 195
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 55/98 (56%), Positives = 67/98 (68%)
Query: 66 TVVGAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLT 125
VVGAA ++R+ K EI CE+C GSGICPEC G+GF++K LS++ A +AR +
Sbjct: 96 VVVGAAVVSILRKEKLPVLPEISRVECEECKGSGICPECNGDGFLMKNLSKDAAAKARAS 155
Query: 126 AKNMATRYTAGLPKKWSYCTKCSSSRSCATCGGRGKLS 163
AK+ ATRYTAGL KKWSYC CS R C C GRG LS
Sbjct: 156 AKDAATRYTAGLAKKWSYCATCSGGRGCPNCEGRGWLS 193
>gi|302804494|ref|XP_002983999.1| hypothetical protein SELMODRAFT_423125 [Selaginella moellendorffii]
gi|300148351|gb|EFJ15011.1| hypothetical protein SELMODRAFT_423125 [Selaginella moellendorffii]
Length = 167
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 50/115 (43%), Positives = 70/115 (60%), Gaps = 10/115 (8%)
Query: 49 TVSSALPETAASVAIAATVVGAAATLLVRRTKGSEETEIPLKTCEDCG---GSGICPECK 105
+ +AL + A+ +A+ +VGAA+ L R KG+ + +T EDCG G+G+CP+C
Sbjct: 54 VIVAALEDVASGIAVLG-IVGAASFALARLQKGAAD-----QTREDCGACSGTGLCPQCS 107
Query: 106 GEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYCTKCSSSRSCATCGGRG 160
G+GFV+K+L + A + N A R TAGLP KW YC CS SR+C C GRG
Sbjct: 108 GQGFVMKELLPDVAA-KAKKSANAAKRNTAGLPNKWKYCVACSGSRTCLECQGRG 161
>gi|302753432|ref|XP_002960140.1| hypothetical protein SELMODRAFT_437275 [Selaginella moellendorffii]
gi|300171079|gb|EFJ37679.1| hypothetical protein SELMODRAFT_437275 [Selaginella moellendorffii]
Length = 154
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/128 (39%), Positives = 77/128 (60%), Gaps = 13/128 (10%)
Query: 36 IDIKSQEQERKLPTVSSALPETAASVAIAATVVGAAATLLVRRTKGSEETEIPLKTCEDC 95
++ + +++ R + +AL + A+ +A+ +VGAA+ L R KG+ + +T +DC
Sbjct: 31 VEARPRKRHR---VIVAALEDVASGIAVLG-IVGAASFALARLQKGAAD-----QTRDDC 81
Query: 96 G---GSGICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYCTKCSSSRS 152
G G+G+CP+C G+GFV+K+L + A + N A R TAGLP KW YC CS SR+
Sbjct: 82 GACSGTGLCPQCNGQGFVMKELLPDVAA-KAKKSANAAKRNTAGLPNKWKYCVACSGSRT 140
Query: 153 CATCGGRG 160
C C GRG
Sbjct: 141 CLECQGRG 148
>gi|73670910|ref|YP_306925.1| hypothetical protein Mbar_A3472 [Methanosarcina barkeri str.
Fusaro]
gi|72398072|gb|AAZ72345.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 710
Score = 41.6 bits (96), Expect = 0.11, Method: Composition-based stats.
Identities = 26/87 (29%), Positives = 41/87 (47%), Gaps = 24/87 (27%)
Query: 89 LKTCEDCGGSG-------ICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKW 141
+K C DC G G ICP+CKG+G ++ + +++ KN+ + + K
Sbjct: 2 IKECPDCHGRGYEVVSTEICPQCKGKG------KSKSIDFMKMSEKNLDS-----ILKNG 50
Query: 142 SYCTKCSSSRS------CATCGGRGKL 162
+ C KC + S C TC G GK+
Sbjct: 51 AACEKCKGTGSIEVTTPCKTCNGLGKI 77
>gi|150400655|ref|YP_001324421.1| phosphoesterase domain-containing protein [Methanococcus aeolicus
Nankai-3]
gi|150013358|gb|ABR55809.1| phosphoesterase RecJ domain protein [Methanococcus aeolicus
Nankai-3]
Length = 758
Score = 40.0 bits (92), Expect = 0.29, Method: Composition-based stats.
Identities = 28/90 (31%), Positives = 40/90 (44%), Gaps = 26/90 (28%)
Query: 89 LKTCEDCGGSGI-------CPECKGEGFVLKKLSEETAERARLTAKNMATRY---TAGLP 138
+ CE C G+G CPEC+G GF+ EE +++ + ++Y A LP
Sbjct: 2 INKCEICNGTGKKIVKYEECPECEGTGFL-----EEFDTKSQFKRASKNSKYDFDDAELP 56
Query: 139 KKWSYCTKCSSS------RSCATCGGRGKL 162
C C+ S SC CGG GK+
Sbjct: 57 -----CPTCNGSGQIPIYDSCEYCGGSGKV 81
>gi|410671031|ref|YP_006923402.1| phosphoesterase, RecJ-like protein [Methanolobus psychrophilus R15]
gi|409170159|gb|AFV24034.1| phosphoesterase, RecJ-like protein [Methanolobus psychrophilus R15]
Length = 707
Score = 39.7 bits (91), Expect = 0.42, Method: Composition-based stats.
Identities = 25/84 (29%), Positives = 36/84 (42%), Gaps = 24/84 (28%)
Query: 90 KTCEDCGGSG-------ICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWS 142
+ C++CGG G CPECKG G + + +L+ K+M + G S
Sbjct: 3 EKCQECGGKGHKVVDSKKCPECKGAG------KSRSVDLMKLSEKDMDSFLKNG-----S 51
Query: 143 YCTKCSS------SRSCATCGGRG 160
C KC+ + C C GRG
Sbjct: 52 SCMKCNGTGEIEITEQCIACSGRG 75
>gi|298674036|ref|YP_003725786.1| phosphoesterase RecJ domain-containing protein [Methanohalobium
evestigatum Z-7303]
gi|298287024|gb|ADI72990.1| phosphoesterase RecJ domain protein [Methanohalobium evestigatum
Z-7303]
Length = 711
Score = 39.3 bits (90), Expect = 0.54, Method: Composition-based stats.
Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 24/82 (29%)
Query: 92 CEDCGGSGI-------CPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYC 144
C +C G G CPECKG G ++ + +LT +++ + G SYC
Sbjct: 5 CIECEGRGYQIISVEKCPECKGTG------KTKSVDLMKLTEQDVDDFLSDG-----SYC 53
Query: 145 TKCSSS------RSCATCGGRG 160
KC S + C TC GRG
Sbjct: 54 PKCEGSGQIETTKYCETCKGRG 75
>gi|15789721|ref|NP_279545.1| hypothetical protein VNG0489G [Halobacterium sp. NRC-1]
gi|169235436|ref|YP_001688636.1| molecular chaperone dnaJ [Halobacterium salinarum R1]
gi|18202992|sp|Q9HRY3.1|DNAJ_HALSA RecName: Full=Chaperone protein DnaJ
gi|226735573|sp|B0R3H3.1|DNAJ_HALS3 RecName: Full=Chaperone protein DnaJ
gi|10580095|gb|AAG19025.1| heat shock protein [Halobacterium sp. NRC-1]
gi|167726502|emb|CAP13287.1| molecular chaperone DnaJ [Halobacterium salinarum R1]
Length = 391
Score = 38.9 bits (89), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 49/118 (41%), Gaps = 40/118 (33%)
Query: 57 TAASVAIAATVVGAAATLLVRRTKGSEETEIPLKTCEDCGGSGI--------CPECKGEG 108
T +V ++ G + + VRR + +C DCGGSG CP+C G+G
Sbjct: 140 TDLTVTLSEAYRGVSKQVTVRRPE----------SCADCGGSGYPEDADVRTCPQCDGQG 189
Query: 109 FVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYCTKCS-----SSRSCATCGGRGK 161
V + R T G ++ C++C S +C+TCGG+G+
Sbjct: 190 VVTQ---------VRQTP--------LGRVQQRQECSRCGGEGELHSETCSTCGGQGQ 230
>gi|440751681|ref|ZP_20930884.1| ribonuclease, Rne/Rng family domain protein [Microcystis aeruginosa
TAIHU98]
gi|440176174|gb|ELP55447.1| ribonuclease, Rne/Rng family domain protein [Microcystis aeruginosa
TAIHU98]
Length = 723
Score = 38.9 bits (89), Expect = 0.74, Method: Composition-based stats.
Identities = 40/132 (30%), Positives = 58/132 (43%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC+ CGG G GEG + L T RA
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCDHCGGLGHLAHLPGEGNAI-ALETPTLSRAEKETLV 432
Query: 129 MATRYTAGLPKK 140
+AT T LP K
Sbjct: 433 IATTNTKVLPDK 444
>gi|425449735|ref|ZP_18829570.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 7941]
gi|389769736|emb|CCI05504.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 7941]
Length = 722
Score = 38.9 bits (89), Expect = 0.76, Method: Composition-based stats.
Identities = 40/132 (30%), Positives = 58/132 (43%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC+ CGG G GEG + L T RA
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCDHCGGLGHLAHLPGEGNAI-ALETPTLSRAEKETLV 432
Query: 129 MATRYTAGLPKK 140
+AT T LP K
Sbjct: 433 IATTNTKVLPDK 444
>gi|425461779|ref|ZP_18841253.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9808]
gi|389825301|emb|CCI25025.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9808]
Length = 723
Score = 38.9 bits (89), Expect = 0.80, Method: Composition-based stats.
Identities = 40/132 (30%), Positives = 58/132 (43%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC+ CGG G GEG + L T RA
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCDHCGGLGHLAHLPGEGNAI-ALETPTLSRAEKETLV 432
Query: 129 MATRYTAGLPKK 140
+AT T LP K
Sbjct: 433 IATTNTKVLPDK 444
>gi|425436380|ref|ZP_18816816.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9432]
gi|389678910|emb|CCH92272.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9432]
Length = 723
Score = 38.9 bits (89), Expect = 0.80, Method: Composition-based stats.
Identities = 40/132 (30%), Positives = 58/132 (43%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC+ CGG G GEG + L T RA
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCDHCGGLGHLAHLPGEGNAI-ALETPTLSRAEKETLV 432
Query: 129 MATRYTAGLPKK 140
+AT T LP K
Sbjct: 433 IATTNTKVLPDK 444
>gi|297618592|ref|YP_003706697.1| phosphoesterase RecJ domain-containing protein [Methanococcus
voltae A3]
gi|297618652|ref|YP_003706757.1| phosphoesterase RecJ domain-containing protein [Methanococcus
voltae A3]
gi|297377569|gb|ADI35724.1| phosphoesterase RecJ domain protein [Methanococcus voltae A3]
gi|297377629|gb|ADI35784.1| phosphoesterase RecJ domain protein [Methanococcus voltae A3]
Length = 757
Score = 38.1 bits (87), Expect = 1.3, Method: Composition-based stats.
Identities = 24/90 (26%), Positives = 40/90 (44%), Gaps = 26/90 (28%)
Query: 89 LKTCEDCGGSG-------ICPECKGEGFVLKKLSEETAERARLTAKNMATRY---TAGLP 138
+K C+ CGG+G +CP+C+G G+ EE + + + ++Y A +P
Sbjct: 2 IKDCKICGGTGKKVIKYDVCPDCEGTGY-----QEEFETKKHFKSASKNSKYDFDDAEIP 56
Query: 139 KKWSYCTKCSSSRS------CATCGGRGKL 162
C+KC + C C G GK+
Sbjct: 57 -----CSKCEGTGKVPVYGLCEFCEGTGKI 81
>gi|257460800|ref|ZP_05625901.1| hypothetical protein CAMGR0001_1569 [Campylobacter gracilis RM3268]
gi|257442131|gb|EEV17273.1| hypothetical protein CAMGR0001_1569 [Campylobacter gracilis RM3268]
Length = 879
Score = 38.1 bits (87), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 36/76 (47%), Gaps = 5/76 (6%)
Query: 92 CEDCGGSG--ICPECKGEG-FVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYCTKC- 147
C +CGG G IC CKG G + LS A L+ + A + + ++ C C
Sbjct: 149 CANCGGRGRLICSTCKGAGGYSRAALSSTDAHVRGLSGGDSAGSASNSMGYRFISCASCG 208
Query: 148 -SSSRSCATCGGRGKL 162
S SR C CGG G+L
Sbjct: 209 GSGSRICPACGGAGRL 224
>gi|2351849|gb|AAB96891.1| 40 kDa heat shock chaperone protein [Halobacterium salinarum]
Length = 389
Score = 37.7 bits (86), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 49/118 (41%), Gaps = 40/118 (33%)
Query: 57 TAASVAIAATVVGAAATLLVRRTKGSEETEIPLKTCEDCGGSGI--------CPECKGEG 108
T +V ++ G + + VRR + +C DCGGSG CP+C G+G
Sbjct: 140 TDLTVTLSEAYRGVSKQVTVRRPE----------SCADCGGSGYPEDADVRTCPQCGGQG 189
Query: 109 FVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYCTKCS-----SSRSCATCGGRGK 161
V + R T G ++ C++C S +C+TCGG+G+
Sbjct: 190 VVTQ---------VRQTP--------LGRVQQRQECSRCGGEGELHSETCSTCGGQGQ 230
>gi|419214903|ref|ZP_13757923.1| exonuclease family protein [Escherichia coli DEC8D]
gi|378066287|gb|EHW28424.1| exonuclease family protein [Escherichia coli DEC8D]
Length = 847
Score = 37.4 bits (85), Expect = 1.9, Method: Composition-based stats.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 7/58 (12%)
Query: 88 PLKTCEDCG--GSGICPEC---KGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKK 140
P K C DCG G G CP+C G+ + +EE+ + AR K+ +AGLP K
Sbjct: 571 PEKVCTDCGQTGGGNCPDCGAVMGDATYQETFNEESQDEAR--EKDPEEMESAGLPNK 626
>gi|419718884|ref|ZP_14246185.1| chaperone protein DnaJ [Lachnoanaerobaculum saburreum F0468]
gi|383304954|gb|EIC96338.1| chaperone protein DnaJ [Lachnoanaerobaculum saburreum F0468]
Length = 369
Score = 37.4 bits (85), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 21/34 (61%), Gaps = 5/34 (14%)
Query: 89 LKTCEDCGGSG-----ICPECKGEGFVLKKLSEE 117
+KTC DCGGSG CP+C G G+V K + E
Sbjct: 198 VKTCPDCGGSGKIIKDKCPDCHGNGYVQTKKNFE 231
>gi|315649939|ref|ZP_07903019.1| chaperone DnaJ [Lachnoanaerobaculum saburreum DSM 3986]
gi|315487709|gb|EFU78012.1| chaperone DnaJ [Lachnoanaerobaculum saburreum DSM 3986]
Length = 369
Score = 37.4 bits (85), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 21/34 (61%), Gaps = 5/34 (14%)
Query: 89 LKTCEDCGGSG-----ICPECKGEGFVLKKLSEE 117
+KTC DCGGSG CP+C G G+V K + E
Sbjct: 198 VKTCPDCGGSGKIIKDKCPDCHGNGYVQTKKNFE 231
>gi|167625293|ref|YP_001675587.1| hypothetical protein Shal_3382 [Shewanella halifaxensis HAW-EB4]
gi|167355315|gb|ABZ77928.1| conserved hypothetical protein [Shewanella halifaxensis HAW-EB4]
Length = 630
Score = 37.0 bits (84), Expect = 2.4, Method: Composition-based stats.
Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 14/76 (18%)
Query: 92 CEDCGGSGI--CPECKGEGFVLKKLSEETAER---ARLTAKNMATRYTAGLPKKWSYCTK 146
C C GSG C C+G G L LS + R +R + N T YT C+
Sbjct: 125 CNTCNGSGCVRCRSCRGSG-KLNCLSCSGSGRVSVSRYDSYNERTVYTT------ETCST 177
Query: 147 C--SSSRSCATCGGRG 160
C S +R+C +CGG G
Sbjct: 178 CYGSGNRTCTSCGGSG 193
>gi|425467152|ref|ZP_18846436.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9809]
gi|389830139|emb|CCI28074.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9809]
Length = 719
Score = 37.0 bits (84), Expect = 2.5, Method: Composition-based stats.
Identities = 39/132 (29%), Positives = 57/132 (43%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC+ CGG G GEG + L T RA
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCDHCGGLGHLAHLPGEGNAI-ALETPTVSRAEKETFI 432
Query: 129 MATRYTAGLPKK 140
+A T LP K
Sbjct: 433 VAPTNTKVLPDK 444
>gi|425471085|ref|ZP_18849945.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9701]
gi|389883067|emb|CCI36508.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9701]
Length = 723
Score = 37.0 bits (84), Expect = 2.6, Method: Composition-based stats.
Identities = 39/132 (29%), Positives = 57/132 (43%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC+ CGG G GEG + L T RA
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCDHCGGLGHLAHLPGEGNAI-ALETPTVSRAERETFV 432
Query: 129 MATRYTAGLPKK 140
+A T LP K
Sbjct: 433 VAPTNTKVLPDK 444
>gi|425441755|ref|ZP_18822022.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9717]
gi|389717454|emb|CCH98461.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9717]
Length = 719
Score = 37.0 bits (84), Expect = 2.7, Method: Composition-based stats.
Identities = 39/132 (29%), Positives = 57/132 (43%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC+ CGG G GEG + L T RA
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCDHCGGLGHLAHLPGEGNAI-ALETPTVSRAEKETFI 432
Query: 129 MATRYTAGLPKK 140
+A T LP K
Sbjct: 433 VAPTNTKVLPDK 444
>gi|166364261|ref|YP_001656534.1| ribonuclease E [Microcystis aeruginosa NIES-843]
gi|166086634|dbj|BAG01342.1| ribonuclease E [Microcystis aeruginosa NIES-843]
Length = 719
Score = 37.0 bits (84), Expect = 2.7, Method: Composition-based stats.
Identities = 39/132 (29%), Positives = 57/132 (43%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC+ CGG G GEG + L T RA
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCDHCGGLGHLAHLPGEGNAI-ALETPTVSRAEKETFI 432
Query: 129 MATRYTAGLPKK 140
+A T LP K
Sbjct: 433 VAPTNTKVLPDK 444
>gi|294496570|ref|YP_003543063.1| phosphoesterase RecJ domain protein [Methanohalophilus mahii DSM
5219]
gi|292667569|gb|ADE37418.1| phosphoesterase RecJ domain protein [Methanohalophilus mahii DSM
5219]
Length = 710
Score = 36.6 bits (83), Expect = 3.3, Method: Composition-based stats.
Identities = 28/86 (32%), Positives = 35/86 (40%), Gaps = 24/86 (27%)
Query: 90 KTCEDCGGSGI-------CPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWS 142
+ C +CGG+G CPECKG G K S E ++N + G S
Sbjct: 3 EKCSECGGTGARESSIQKCPECKGSG---KPKSMNLME----LSENDVNSFLKGS----S 51
Query: 143 YCTKCSSS------RSCATCGGRGKL 162
C KC S C +CGG G L
Sbjct: 52 VCEKCGGSGEVEIKDPCPSCGGDGFL 77
>gi|91773749|ref|YP_566441.1| phosphoesterase, RecJ-like protein [Methanococcoides burtonii DSM
6242]
gi|91712764|gb|ABE52691.1| Nucleic acid binding protein [Methanococcoides burtonii DSM 6242]
Length = 707
Score = 36.6 bits (83), Expect = 3.7, Method: Composition-based stats.
Identities = 23/82 (28%), Positives = 33/82 (40%), Gaps = 24/82 (29%)
Query: 92 CEDCGGSGI-------CPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYC 144
C +CGG G CPECKG G + ++ L+ ++ G S C
Sbjct: 5 CNECGGKGYEVLSTEKCPECKGSGKI------KSVNLMDLSQNDVKNFLNEG-----SVC 53
Query: 145 TKCSSS------RSCATCGGRG 160
+KC S + C C G+G
Sbjct: 54 SKCGGSGEIEVRKQCPACSGKG 75
>gi|384251275|gb|EIE24753.1| photosystem I reaction centre subunit VI [Coccomyxa subellipsoidea
C-169]
Length = 133
Score = 36.6 bits (83), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 21/69 (30%), Positives = 36/69 (52%), Gaps = 6/69 (8%)
Query: 35 SIDIKSQEQERKLPTVSS------ALPETAASVAIAATVVGAAATLLVRRTKGSEETEIP 88
S ++ QE E++ P++ + A P T +A T VG A +LL+ KGS++ +P
Sbjct: 57 SWELYGQEDEKRYPSLQAEFFNRAAAPLTRRESLLAFTFVGGAGSLLLWGAKGSKDVALP 116
Query: 89 LKTCEDCGG 97
++ GG
Sbjct: 117 IQQGPKKGG 125
>gi|367000299|ref|XP_003684885.1| hypothetical protein TPHA_0C02980 [Tetrapisispora phaffii CBS 4417]
gi|357523182|emb|CCE62451.1| hypothetical protein TPHA_0C02980 [Tetrapisispora phaffii CBS 4417]
Length = 554
Score = 36.6 bits (83), Expect = 3.8, Method: Composition-based stats.
Identities = 18/46 (39%), Positives = 26/46 (56%), Gaps = 7/46 (15%)
Query: 75 LVRRTKGSEETEIPLKTCEDCGGSG-------ICPECKGEGFVLKK 113
L + K T+ +TC+ CGG+G IC EC+GEGF+ +K
Sbjct: 283 LTQTKKFGPMTQTFTQTCDGCGGNGTFADAQDICNECQGEGFLNEK 328
>gi|54022766|ref|YP_117008.1| hypothetical protein nfa7990 [Nocardia farcinica IFM 10152]
gi|54014274|dbj|BAD55644.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length = 219
Score = 36.6 bits (83), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 20/73 (27%), Positives = 30/73 (41%), Gaps = 24/73 (32%)
Query: 92 CEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWSYCTKCSSSR 151
C +C G+G CP+C G G++ +E R R C +C R
Sbjct: 156 CYECAGTGWCPQCHGRGWI----PDEQRGRRR--------------------CPECFDRR 191
Query: 152 SCATCGGRGKLSL 164
C C G G+L++
Sbjct: 192 VCPVCEGAGQLAV 204
>gi|359491526|ref|XP_002279102.2| PREDICTED: chaperone protein DnaJ-like [Vitis vinifera]
Length = 497
Score = 36.6 bits (83), Expect = 4.0, Method: Composition-based stats.
Identities = 25/94 (26%), Positives = 39/94 (41%), Gaps = 31/94 (32%)
Query: 83 EETEIP-LKTCEDCGGSG--------ICPECKGEGFVLKKLSEETAERARLTAKNMATRY 133
EE E+ + C++CGG+G C C G+G V+K +
Sbjct: 181 EEIEVSCFEVCDNCGGTGAKSSSCIKTCTNCGGKGGVMK-----------------TQKT 223
Query: 134 TAGLPKKWSYCTKCSS-----SRSCATCGGRGKL 162
G+ + S C+KC + C +CGG GK+
Sbjct: 224 PFGIMSQVSTCSKCGGDGKIITSHCQSCGGHGKV 257
>gi|425444817|ref|ZP_18824858.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9443]
gi|389735354|emb|CCI01128.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9443]
Length = 719
Score = 36.2 bits (82), Expect = 4.5, Method: Composition-based stats.
Identities = 39/132 (29%), Positives = 57/132 (43%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC+ CGG G GEG + L T RA
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCDHCGGLGHLAHLPGEGNAI-ALETPTLSRAEKETFV 432
Query: 129 MATRYTAGLPKK 140
+A T LP K
Sbjct: 433 VAPTNTKVLPDK 444
>gi|159899682|ref|YP_001545929.1| serine/threonine protein kinase [Herpetosiphon aurantiacus DSM 785]
gi|159892721|gb|ABX05801.1| serine/threonine protein kinase [Herpetosiphon aurantiacus DSM 785]
Length = 682
Score = 36.2 bits (82), Expect = 4.8, Method: Composition-based stats.
Identities = 13/34 (38%), Positives = 23/34 (67%), Gaps = 2/34 (5%)
Query: 82 SEETEIPLKTCEDCGGSG--ICPECKGEGFVLKK 113
++ E ++TC DC G G +CP+C+G G ++K+
Sbjct: 474 TQVVENEVQTCSDCKGKGYLVCPDCQGIGMIVKR 507
>gi|297796439|ref|XP_002866104.1| hypothetical protein ARALYDRAFT_331885 [Arabidopsis lyrata subsp.
lyrata]
gi|297311939|gb|EFH42363.1| hypothetical protein ARALYDRAFT_331885 [Arabidopsis lyrata subsp.
lyrata]
Length = 141
Score = 36.2 bits (82), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 38/81 (46%), Gaps = 18/81 (22%)
Query: 44 ERKLPTVSSALPETAASVA-------IAATVVGAAATLLVRRTKGSEETE--IPLKTCED 94
R+L T SAL E A +VA +VG A ++ G E ++ I K CE+
Sbjct: 48 NRRLRTTPSALSELADTVAETGKSEITWQIIVGTIAGIIPFVVAGVEFSKRIIAQKRCEE 107
Query: 95 CGGSGI---------CPECKG 106
CGG+G+ CPEC G
Sbjct: 108 CGGTGLVFRDNKYFRCPECGG 128
>gi|425454336|ref|ZP_18834082.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9807]
gi|389805028|emb|CCI15494.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9807]
Length = 719
Score = 35.8 bits (81), Expect = 6.3, Method: Composition-based stats.
Identities = 39/132 (29%), Positives = 57/132 (43%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC+ CGG G GEG + L T RA
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCDHCGGLGHLAHLPGEGNAI-ALETPTLSRAEKETLV 432
Query: 129 MATRYTAGLPKK 140
+A T LP K
Sbjct: 433 VAPINTKVLPDK 444
>gi|422303790|ref|ZP_16391141.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9806]
gi|389791184|emb|CCI12972.1| Ribonuclease E homolog [Microcystis aeruginosa PCC 9806]
Length = 719
Score = 35.8 bits (81), Expect = 6.5, Method: Composition-based stats.
Identities = 39/132 (29%), Positives = 57/132 (43%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC+ CGG G GEG + L T RA
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCDHCGGLGHLAHLPGEGNAI-ALETPTLSRAEKETFV 432
Query: 129 MATRYTAGLPKK 140
+A T LP K
Sbjct: 433 VAPINTKVLPDK 444
>gi|386001335|ref|YP_005919634.1| Phosphoesterase, RecJ domain protein [Methanosaeta harundinacea
6Ac]
gi|357209391|gb|AET64011.1| Phosphoesterase, RecJ domain protein [Methanosaeta harundinacea
6Ac]
Length = 740
Score = 35.8 bits (81), Expect = 6.5, Method: Composition-based stats.
Identities = 24/86 (27%), Positives = 34/86 (39%), Gaps = 23/86 (26%)
Query: 89 LKTCEDCGGSGI-------CPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKW 141
+K C+ CGG G CP CKG G + E+ +A + T G +
Sbjct: 22 MKLCDRCGGKGYSVTGEKACPNCKGTGRI------ESVSLGEASASEIDELVTKGATR-- 73
Query: 142 SYCTKCSS------SRSCATCGGRGK 161
CT C + +C CGG G+
Sbjct: 74 --CTVCKGEGKIPVTETCEACGGLGR 97
>gi|443313094|ref|ZP_21042707.1| ribonuclease, Rne/Rng family [Synechocystis sp. PCC 7509]
gi|442776902|gb|ELR87182.1| ribonuclease, Rne/Rng family [Synechocystis sp. PCC 7509]
Length = 707
Score = 35.4 bits (80), Expect = 7.0, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 52/124 (41%), Gaps = 4/124 (3%)
Query: 16 ITRDLKLRVCTGLHGKAFSSIDIKSQ-EQERKLPTVSSALPETAASVAIAATVVGAAATL 74
I R L++R G+ F ID+ SQ +Q + L + AL A IA +
Sbjct: 323 IARQLRMRNIAGVIIVDF--IDMDSQKDQLQVLEHFNKALKADKARPQIA-QLSELGLVE 379
Query: 75 LVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKNMATRYT 134
L R+ +G E+ TC CGG G GEG +K T R R++ + R
Sbjct: 380 LTRKRQGQNIYELFGSTCPTCGGLGHIEHLPGEGDYREKEVAATPTRERISLAEVPERSP 439
Query: 135 AGLP 138
+P
Sbjct: 440 VVIP 443
>gi|221194742|ref|ZP_03567799.1| chaperone protein DnaJ [Atopobium rimae ATCC 49626]
gi|221185646|gb|EEE18036.1| chaperone protein DnaJ [Atopobium rimae ATCC 49626]
Length = 391
Score = 35.4 bits (80), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 35/86 (40%), Gaps = 30/86 (34%)
Query: 89 LKTCEDCGGSGI--------CPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKK 140
L C+DC GSG+ CP CKG G V++ + G +
Sbjct: 148 LSPCDDCNGSGVAEGGHITTCPRCKGTGAVVQ-----------------VQKTVFGQMQT 190
Query: 141 WSYCTKCSSS-----RSCATCGGRGK 161
+ C +C+ + + C TCGG G+
Sbjct: 191 QAVCPECAGTGKKIDKPCETCGGEGR 216
>gi|390440734|ref|ZP_10228942.1| Ribonuclease E homolog [Microcystis sp. T1-4]
gi|389835963|emb|CCI33068.1| Ribonuclease E homolog [Microcystis sp. T1-4]
Length = 723
Score = 35.4 bits (80), Expect = 8.1, Method: Composition-based stats.
Identities = 39/132 (29%), Positives = 56/132 (42%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC CGG G GEG + L T RA
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCSHCGGLGHLAHLPGEGNAI-ALETPTLSRAEKETFV 432
Query: 129 MATRYTAGLPKK 140
+A T LP K
Sbjct: 433 VAPTNTKVLPDK 444
>gi|452210595|ref|YP_007490709.1| Hypothetical protein MmTuc01_2107 [Methanosarcina mazei Tuc01]
gi|452100497|gb|AGF97437.1| Hypothetical protein MmTuc01_2107 [Methanosarcina mazei Tuc01]
Length = 710
Score = 35.4 bits (80), Expect = 8.7, Method: Composition-based stats.
Identities = 23/86 (26%), Positives = 37/86 (43%), Gaps = 24/86 (27%)
Query: 90 KTCEDCGGSG-------ICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWS 142
K C DC G G +CP CKG+G ++ + +++ K++ + G +
Sbjct: 3 KECPDCHGRGYEVISTEVCPLCKGKG------KSKSVDFMKISEKDIDSFLKNG-----A 51
Query: 143 YCTKCSS------SRSCATCGGRGKL 162
C KC +R C C G GK+
Sbjct: 52 VCEKCKGKGSIEITRPCEACEGLGKI 77
>gi|21228136|ref|NP_634058.1| hypothetical protein MM_2034 [Methanosarcina mazei Go1]
gi|20906581|gb|AAM31730.1| conserved protein [Methanosarcina mazei Go1]
Length = 710
Score = 35.4 bits (80), Expect = 8.7, Method: Composition-based stats.
Identities = 23/86 (26%), Positives = 37/86 (43%), Gaps = 24/86 (27%)
Query: 90 KTCEDCGGSG-------ICPECKGEGFVLKKLSEETAERARLTAKNMATRYTAGLPKKWS 142
K C DC G G +CP CKG+G ++ + +++ K++ + G +
Sbjct: 3 KECPDCHGRGYEVISTEVCPLCKGKG------KSKSVDFMKISEKDIDSFLKNG-----A 51
Query: 143 YCTKCSS------SRSCATCGGRGKL 162
C KC +R C C G GK+
Sbjct: 52 VCEKCKGKGSIEITRPCEACEGLGKI 77
>gi|443648945|ref|ZP_21130127.1| ribonuclease, Rne/Rng family domain protein [Microcystis aeruginosa
DIANCHI905]
gi|159028468|emb|CAO87275.1| rne [Microcystis aeruginosa PCC 7806]
gi|443335058|gb|ELS49541.1| ribonuclease, Rne/Rng family domain protein [Microcystis aeruginosa
DIANCHI905]
Length = 724
Score = 35.0 bits (79), Expect = 9.3, Method: Composition-based stats.
Identities = 38/132 (28%), Positives = 56/132 (42%), Gaps = 3/132 (2%)
Query: 9 SSASFSIITRDLKLRVCTGLHGKAFSSIDIKSQEQERKLPTVSSALPETAASVAIAATVV 68
+S + + I R L+LR G+ F +D + ++Q + L + AL A IA +
Sbjct: 316 NSEAATEIARQLRLRNIGGVVVVDFIDMDSR-RDQLKLLELFNKALKSDKARPQIA-QLS 373
Query: 69 GAAATLLVRRTKGSEETEIPLKTCEDCGGSGICPECKGEGFVLKKLSEETAERARLTAKN 128
L R+ +G E+ KTC+ CGG G GEG + L T R
Sbjct: 374 ELGLVELTRKRQGKNIYELFGKTCDHCGGLGHLAHLPGEGSAI-ALETPTLSRGEKETLV 432
Query: 129 MATRYTAGLPKK 140
+A T LP K
Sbjct: 433 IAPTNTKVLPDK 444
>gi|373470379|ref|ZP_09561514.1| chaperone protein DnaJ [Lachnospiraceae bacterium oral taxon 082
str. F0431]
gi|371762731|gb|EHO51256.1| chaperone protein DnaJ [Lachnospiraceae bacterium oral taxon 082
str. F0431]
Length = 369
Score = 35.0 bits (79), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 14/30 (46%), Positives = 19/30 (63%), Gaps = 5/30 (16%)
Query: 89 LKTCEDCGGSG-----ICPECKGEGFVLKK 113
+KTC DCGG+G CP+C G G++ K
Sbjct: 198 VKTCPDCGGTGKIIKDKCPDCHGNGYIQTK 227
>gi|225390170|ref|ZP_03759894.1| hypothetical protein CLOSTASPAR_03920 [Clostridium asparagiforme
DSM 15981]
gi|225043764|gb|EEG54010.1| hypothetical protein CLOSTASPAR_03920 [Clostridium asparagiforme
DSM 15981]
Length = 311
Score = 35.0 bits (79), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 39/89 (43%), Gaps = 12/89 (13%)
Query: 81 GSEETEIPLKTCEDCGGSGICPECKGEG-----FVLKKLSEET----AERARLTAKNMAT 131
G + T P C+ CGG G C +C G G F ++S ET + + +
Sbjct: 195 GKKSTSDPY--CDYCGGLGDCGQCLGLGSCDECFGNGEVSCETCFGGGDCPKCDGQGGEN 252
Query: 132 RYTAGLPKKWSYCTKCSSSRSCATCGGRG 160
YT GL +W C +CS C CGG G
Sbjct: 253 HYT-GLDVRWVKCARCSGLGICKRCGGSG 280
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.314 0.127 0.363
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,212,938,787
Number of Sequences: 23463169
Number of extensions: 76289939
Number of successful extensions: 311724
Number of sequences better than 100.0: 155
Number of HSP's better than 100.0 without gapping: 20
Number of HSP's successfully gapped in prelim test: 135
Number of HSP's that attempted gapping in prelim test: 311424
Number of HSP's gapped (non-prelim): 318
length of query: 164
length of database: 8,064,228,071
effective HSP length: 125
effective length of query: 39
effective length of database: 9,426,299,242
effective search space: 367625670438
effective search space used: 367625670438
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 71 (32.0 bits)