BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 048617
(281 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255566896|ref|XP_002524431.1| conserved hypothetical protein [Ricinus communis]
gi|223536315|gb|EEF37966.1| conserved hypothetical protein [Ricinus communis]
Length = 278
Score = 320 bits (821), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 166/227 (73%), Positives = 192/227 (84%), Gaps = 7/227 (3%)
Query: 48 KIPFLTFPSNPRRFLKIPIMSRTNHQLHHHSFATPQSLSDWLKPRLPSDSFASWGVKPGT 107
+IPF+ PSN RRFLK+P MS + + +H FATPQSLSDWLKPRLPSDSFASWGVKPGT
Sbjct: 50 RIPFICLPSNRRRFLKVPAMSISKN---NHPFATPQSLSDWLKPRLPSDSFASWGVKPGT 106
Query: 108 KNVHNLWLELSDGETCLADAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRP 167
KNV NLWLELS+GET LAD+ PP+RTV+VVTVR+IG+ N +LVESHQELSDGSVRNRCRP
Sbjct: 107 KNVQNLWLELSEGETSLADSTPPIRTVSVVTVRVIGEKNTVLVESHQELSDGSVRNRCRP 166
Query: 168 LAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARYV 227
L+EKMK NETPE AV RA+KEELGSILND VRIV GSY +K++ERNSLSYPGLPARYV
Sbjct: 167 LSEKMKPNETPEAAVFRAIKEELGSILNDDANVRIVDGSYKEKIEERNSLSYPGLPARYV 226
Query: 228 LHSVDAFVDGLPEEGEFCTEEAEEYADSS---AADKALFVKRHYWKW 271
L+S+D VDGLP + EF TEE EEY+DS ADKA+ V++H+WKW
Sbjct: 227 LYSMDVIVDGLPND-EFSTEEVEEYSDSEDKRTADKAVSVRKHFWKW 272
>gi|359490753|ref|XP_003634156.1| PREDICTED: uncharacterized protein LOC100854009 [Vitis vinifera]
Length = 292
Score = 316 bits (810), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 168/253 (66%), Positives = 195/253 (77%), Gaps = 6/253 (2%)
Query: 22 PASILAAFSVLCLFTCSKPYNTTSTLKIPFLTFPSNPRRFLKIPIMSRTNHQLHHHSFAT 81
P A S+L LF S P T KI L+FP NPRRFLK P M ++ F++
Sbjct: 37 PDLFFTALSLLFLFP-SSPKPQTLFSKISSLSFPQNPRRFLKTPTMISPSNPKPITQFSS 95
Query: 82 PQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTVRI 141
PQSL+DWLK RLPSDSFASWGVKPGTKNVHNLWLELS+GET LAD+IPP+RTV VVTVRI
Sbjct: 96 PQSLTDWLKHRLPSDSFASWGVKPGTKNVHNLWLELSEGETSLADSIPPIRTVRVVTVRI 155
Query: 142 IGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVR 201
I +DN+IL+ESHQELSDG++R RCRPL+EKMK ET E AV RAVKEELGSI+ + VR
Sbjct: 156 ISEDNRILLESHQELSDGTIRERCRPLSEKMKPGETLECAVHRAVKEELGSIIQGNGNVR 215
Query: 202 IVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADSSA---A 258
IVPGSY +K++ER S SYPGLPA YVLHSV+A+VDGLP +GEFCTEE EEY D + A
Sbjct: 216 IVPGSYEQKVEERVSASYPGLPACYVLHSVNAWVDGLP-DGEFCTEE-EEYRDWNGMGIA 273
Query: 259 DKALFVKRHYWKW 271
+ A+ VKRHYWKW
Sbjct: 274 EMAVSVKRHYWKW 286
>gi|224134953|ref|XP_002321946.1| predicted protein [Populus trichocarpa]
gi|224134957|ref|XP_002321947.1| predicted protein [Populus trichocarpa]
gi|222868942|gb|EEF06073.1| predicted protein [Populus trichocarpa]
gi|222868943|gb|EEF06074.1| predicted protein [Populus trichocarpa]
Length = 216
Score = 300 bits (767), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 145/210 (69%), Positives = 175/210 (83%), Gaps = 7/210 (3%)
Query: 67 MSRTNHQLHHHSFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLAD 126
MS + + H F+TPQSLSDWLKPRLPSDSFASWG+KPGTKN+HNLWLE+S GET LAD
Sbjct: 1 MSVSRKTKNSHHFSTPQSLSDWLKPRLPSDSFASWGIKPGTKNIHNLWLEISQGETFLAD 60
Query: 127 AIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAV 186
+ PP+RTVNVVTV+II K NQ L+ESHQELSDGSVRNRCRPL+EKMK NE+ ++A+ RA+
Sbjct: 61 STPPIRTVNVVTVKIINK-NQTLIESHQELSDGSVRNRCRPLSEKMKPNESFKDAIFRAI 119
Query: 187 KEELGSILNDS--ITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEF 244
EELGSIL D +++ IV GSY +K++ERNS+SYPGLPARYVL+S D V+GLP +GEF
Sbjct: 120 NEELGSILKDGNEVSINIVNGSYKEKVEERNSMSYPGLPARYVLYSADVEVNGLP-DGEF 178
Query: 245 CTEEAEEYADSS---AADKALFVKRHYWKW 271
CTEEAEEY DS A+KA+ VK+H+WKW
Sbjct: 179 CTEEAEEYPDSEEKRVAEKAVSVKKHFWKW 208
>gi|297808463|ref|XP_002872115.1| hypothetical protein ARALYDRAFT_489310 [Arabidopsis lyrata subsp.
lyrata]
gi|297317952|gb|EFH48374.1| hypothetical protein ARALYDRAFT_489310 [Arabidopsis lyrata subsp.
lyrata]
Length = 295
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 165/264 (62%), Positives = 200/264 (75%), Gaps = 22/264 (8%)
Query: 22 PASILAAFSVLCLFTCSKPYNTTSTLKIP--FLTFPSNPRRFLKIPIMSRTNHQLHHHSF 79
P LAA S+L L++ KP L +P +FP NPRR +I MSR + F
Sbjct: 32 PDLFLAAVSLLFLWSSPKPL-----LSLPPNRFSFPLNPRR--RITAMSRRSPPPPSQRF 84
Query: 80 ATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTV 139
A PQSLSDWL+ RLPSDSFA+WGVKPGTKNVHNLWLELSDGET LAD+ PPVRTVNVVTV
Sbjct: 85 ANPQSLSDWLESRLPSDSFAAWGVKPGTKNVHNLWLELSDGETSLADSTPPVRTVNVVTV 144
Query: 140 RIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN-DSI 198
R+IGK+ +ILVE+HQELSDGS+R+R RPL+EKMK E+P+EAV RA+KEELGSI N D +
Sbjct: 145 RVIGKNGRILVEAHQELSDGSIRDRFRPLSEKMKPEESPDEAVFRAIKEELGSIFNGDDV 204
Query: 199 T--VRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEE-------- 248
+RI+PG+YS++++ERNSLSYPGLPARY LHSVDA V+GLPEE +FCT+E
Sbjct: 205 GQRIRILPGTYSRRVEERNSLSYPGLPARYALHSVDATVEGLPEE-DFCTDEKEYDGGDS 263
Query: 249 -AEEYADSSAADKALFVKRHYWKW 271
E+ ++ AA KA+ VKRHYWKW
Sbjct: 264 TTEDSVETRAAGKAVTVKRHYWKW 287
>gi|357464217|ref|XP_003602390.1| hypothetical protein MTR_3g092810 [Medicago truncatula]
gi|355491438|gb|AES72641.1| hypothetical protein MTR_3g092810 [Medicago truncatula]
Length = 293
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 163/286 (56%), Positives = 196/286 (68%), Gaps = 21/286 (7%)
Query: 9 PIQSSGSIKIINPPASILAAFSVLCLFTCSKPYNTTSTLKIPFLTFPSNPRRFLKIPIMS 68
P+Q ++ + + P +L A SV LFT S+ +NT +T PFL+FP PRRFLKIP MS
Sbjct: 4 PLQPLTNLNLPSLPDLLLTALSVCFLFTSSRSHNTPTTTNCPFLSFPLKPRRFLKIPAMS 63
Query: 69 RTNHQLHHHS-------------FATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWL 115
S F +PQSLS+WLKPRLPSDSFA+WGVKPGTKNVHNLWL
Sbjct: 64 NNISTSSLSSSSSSNPNRNNRRNFVSPQSLSEWLKPRLPSDSFATWGVKPGTKNVHNLWL 123
Query: 116 ELSDGETCLADAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSN 175
ELS GET L D+ PPVRTV VVTVR++GKD +ILVESHQELSDG VR R RPL+EKMK N
Sbjct: 124 ELSQGETSLDDSEPPVRTVQVVTVRVVGKDGKILVESHQELSDGKVRERGRPLSEKMKPN 183
Query: 176 ETPEEAVIRAVKEELGSILNDSI----TVRIVPGSYSKKLDERNSLSYPGLPARYVLHSV 231
E PE A +R +KEELGS++N V I P SY +++ERNS SYPGLP YVLH +
Sbjct: 184 EDPESAAVRGIKEELGSVINGETEICDIVMIDPNSYEMRVEERNSGSYPGLPGCYVLHIL 243
Query: 232 DAFVDGLPEEGEFCTEEAEEYADSS---AADKALFVKRHYWKWRRA 274
A V+GLP EG+FCT E +EY DS A +A+ VK+H+WKW A
Sbjct: 244 SATVEGLP-EGDFCTYEVDEYDDSDDKKVAHQAVSVKKHFWKWVSA 288
>gi|356509151|ref|XP_003523315.1| PREDICTED: uncharacterized protein LOC100783319 [Glycine max]
Length = 292
Score = 280 bits (715), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 155/282 (54%), Positives = 184/282 (65%), Gaps = 37/282 (13%)
Query: 22 PASILAAFSVLCLFTCSKPYNTTSTLKIPFLTFPSNPRRFLKIPIMSRTNHQL------- 74
P +L A SV L + SKP+ ST + PF PRRFLKIP MS TN
Sbjct: 14 PDLLLTALSVCFLVSSSKPHIGGST-RCPF------PRRFLKIPAMSLTNTTTSKSSKNN 66
Query: 75 ---------HHHSFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLA 125
HSFA+PQSLS+WLKPRLPS+SFASWGVKPGTKNVHNLWLELS GET LA
Sbjct: 67 NLNNTLLPPRRHSFASPQSLSEWLKPRLPSESFASWGVKPGTKNVHNLWLELSQGETSLA 126
Query: 126 DAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRA 185
D+ PPVRTV+VV VR+ GK +ILVESHQELSDG+VR R RPL+EKMK NE PE A +R
Sbjct: 127 DSTPPVRTVHVVLVRVTGKHGKILVESHQELSDGNVRKRGRPLSEKMKPNEDPESAAVRG 186
Query: 186 VKEELGSILNDSI---------TVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVD 236
+ EELGS + V I P SY +++ER+S SYPGLP YVLH++ A V+
Sbjct: 187 IMEELGSAIGGGFRAANFEIDDIVTIDPNSYEMRVEERDSGSYPGLPGCYVLHTLSATVE 246
Query: 237 GLPEEGEFCTEEAEEYA----DSSAADKALFVKRHYWKWRRA 274
GLP EG+F T E +EY D AD+A+ VK+H+W W A
Sbjct: 247 GLP-EGDFSTYEVDEYGGVFQDKIVADEAVSVKKHHWTWVSA 287
>gi|15238541|ref|NP_197834.1| uncharacterized protein [Arabidopsis thaliana]
gi|9758534|dbj|BAB08929.1| unnamed protein product [Arabidopsis thaliana]
gi|17065180|gb|AAL32744.1| Unknown protein [Arabidopsis thaliana]
gi|20259932|gb|AAM13313.1| unknown protein [Arabidopsis thaliana]
gi|332005930|gb|AED93313.1| uncharacterized protein [Arabidopsis thaliana]
Length = 300
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 164/265 (61%), Positives = 199/265 (75%), Gaps = 20/265 (7%)
Query: 22 PASILAAFSVLCLFTCSKPYNTTSTLKIPFLTFPSNPRRFLKIPIMSRTNHQLHHHS--F 79
P LAA S+L L++ KP +L +FP NPRR I MSR + + F
Sbjct: 33 PDIFLAAVSLLFLWSSPKPL---VSLPPNRFSFPLNPRR-RSITAMSRRSPPPPPPTQRF 88
Query: 80 ATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTV 139
A PQSLSDWL+ RLPSDSFA+WGVKPGTKNVHNLWLELSDGET LAD+ PPVRTVNVVTV
Sbjct: 89 ANPQSLSDWLESRLPSDSFAAWGVKPGTKNVHNLWLELSDGETSLADSTPPVRTVNVVTV 148
Query: 140 RIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN---D 196
R+IGK+ +ILVE+HQELSDGS+R R RPL+EKMK E+P+EAV RA+KEELGSI N D
Sbjct: 149 RVIGKNGRILVEAHQELSDGSIRERFRPLSEKMKPEESPDEAVFRAIKEELGSIFNGDGD 208
Query: 197 SIT--VRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAE-EYA 253
+I ++I+PG+Y+++++ERNSLSYPGLPARY LHSV+A V+GLPE+ +FCTEE E E
Sbjct: 209 NIGQRIKILPGTYNRRVEERNSLSYPGLPARYALHSVNATVEGLPEQ-DFCTEEKEYEGG 267
Query: 254 DSS-------AADKALFVKRHYWKW 271
DS+ AA A+ VKRHYWKW
Sbjct: 268 DSTKDSVETLAAGNAVTVKRHYWKW 292
>gi|449456435|ref|XP_004145955.1| PREDICTED: uncharacterized protein LOC101208896 [Cucumis sativus]
Length = 294
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 140/230 (60%), Positives = 178/230 (77%), Gaps = 9/230 (3%)
Query: 50 PFLTFPSNPRRFLKIPIMSRT--NHQLHHHSFATPQSLSDWLKPRLPSDSFASWGVKPGT 107
P + NPRRF KIP +S N +F +PQSLS+WL+PRLPS SFASWGV PGT
Sbjct: 58 PAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQPRLPSHSFASWGVIPGT 117
Query: 108 KNVHNLWLELSDGETCLADAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRP 167
KN+HNLWLE+S GET LAD+ PP+RT++V+++RII +++L+ESHQ+LSDG++RNR RP
Sbjct: 118 KNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRP 177
Query: 168 LAEKMKSNETPEEAVIRAVKEELGSILND---SITVRIVPGSYSKKLDERNSLSYPGLPA 224
L+EKMK NETPE AV RAV+EELGSIL D S VRIVP SY K++ER+S+SYPGL A
Sbjct: 178 LSEKMKPNETPESAVYRAVQEELGSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSA 237
Query: 225 RYVLHSVDAFVDGLPEEGEFCTEEAEEYA---DSSAADKALFVKRHYWKW 271
YVLHS+D +V+GLP +G+FCT E EEY D++ AD A+ VK+H+WKW
Sbjct: 238 SYVLHSMDVWVEGLP-DGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKW 286
>gi|449497388|ref|XP_004160388.1| PREDICTED: uncharacterized protein LOC101231604 [Cucumis sativus]
Length = 294
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 140/230 (60%), Positives = 177/230 (76%), Gaps = 9/230 (3%)
Query: 50 PFLTFPSNPRRFLKIPIMSRT--NHQLHHHSFATPQSLSDWLKPRLPSDSFASWGVKPGT 107
P + NPRRF KIP +S N +F +PQSLS+WL+PRLPS SFASWGV PGT
Sbjct: 58 PAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQPRLPSHSFASWGVIPGT 117
Query: 108 KNVHNLWLELSDGETCLADAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRP 167
KN+HNLWLE+S GET LAD+ PP+RT++V+++RII +++L+ESHQ+LSDG++RNR RP
Sbjct: 118 KNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRP 177
Query: 168 LAEKMKSNETPEEAVIRAVKEELGSILND---SITVRIVPGSYSKKLDERNSLSYPGLPA 224
L+EKMK NETPE AV RAV+EELGSIL D S VRIVP SY K++ER+S+SYPGL A
Sbjct: 178 LSEKMKPNETPESAVYRAVQEELGSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSA 237
Query: 225 RYVLHSVDAFVDGLPEEGEFCTEEAEEYA---DSSAADKALFVKRHYWKW 271
YVLHS+D V+GLP +G+FCT E EEY D++ AD A+ VK+H+WKW
Sbjct: 238 SYVLHSMDVRVEGLP-DGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKW 286
>gi|356516158|ref|XP_003526763.1| PREDICTED: uncharacterized protein LOC100804392 [Glycine max]
Length = 291
Score = 275 bits (704), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 155/281 (55%), Positives = 183/281 (65%), Gaps = 36/281 (12%)
Query: 22 PASILAAFSVLCLFTCSKPYNTTSTLKIPFLTFPSNPRRFLKIPIMSRTNHQL------- 74
P +L A SV L + SKP+ ST + PF PRRFLKIP MS T
Sbjct: 14 PDLLLTALSVCFLVSSSKPHIGGST-RCPF------PRRFLKIPAMSLTTITTTSTSSKN 66
Query: 75 ----------HHHSFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCL 124
HSFA+PQSLS+WLKPRLPS+SFASWGVKPGTKNVHNLWLELS GET L
Sbjct: 67 NNNLNSHLPPRRHSFASPQSLSEWLKPRLPSESFASWGVKPGTKNVHNLWLELSQGETSL 126
Query: 125 ADAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIR 184
AD+ PPVRTV+VV VR+ GK +ILVESHQ+LSDGSVR R RPL+EKMK NE PE A +R
Sbjct: 127 ADSTPPVRTVHVVLVRVTGKHGRILVESHQDLSDGSVRKRGRPLSEKMKPNEDPESAAVR 186
Query: 185 AVKEELGSILNDSI-------TVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDG 237
+ EELGS + V I P SY +++ER+S SYPGLP YVLH++ A V+
Sbjct: 187 GIMEELGSAIGGGAKNFEIDDIVTIDPNSYEMRVEERDSGSYPGLPGCYVLHTLSATVEC 246
Query: 238 LPEEGEFCTEEAEEYA----DSSAADKALFVKRHYWKWRRA 274
LP EG+FCT E +EY D AD+A+ VK+HYW W A
Sbjct: 247 LP-EGDFCTYEVDEYGGVFEDKIVADEAVSVKKHYWTWVSA 286
>gi|294463008|gb|ADE77042.1| unknown [Picea sitchensis]
Length = 237
Score = 200 bits (508), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 135/204 (66%), Gaps = 11/204 (5%)
Query: 78 SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVV 137
+ + Q L+ WL+ R+ S WGV+PGTKNV NLW+EL++GE L D+ PP RTVNV
Sbjct: 25 AIGSAQGLARWLRTRMISAHLDMWGVEPGTKNVINLWVELAEGEITLEDSCPPKRTVNVA 84
Query: 138 TVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDS 197
+V I K Q+L+ESHQE+ DGS+R R RPL+EKMK+ ET EE R + EELG +L
Sbjct: 85 SVNIRNKSGQVLLESHQEMGDGSIRFRNRPLSEKMKAGETVEEGCKRGICEELGLVLGSP 144
Query: 198 ITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEE------ 251
+ ++ SY KK++ER+SLSYPGL YV+HSVDA V LPE +F TEE E+
Sbjct: 145 DKIELLMDSYEKKVEERDSLSYPGLLTCYVVHSVDAVVADLPET-DFFTEEDEDGRINGN 203
Query: 252 ----YADSSAADKALFVKRHYWKW 271
++ S +KA+ VK+H+WKW
Sbjct: 204 HGFSFSSSGTREKAIGVKKHFWKW 227
>gi|115469912|ref|NP_001058555.1| Os06g0712200 [Oryza sativa Japonica Group]
gi|53792644|dbj|BAD53658.1| unknown protein [Oryza sativa Japonica Group]
gi|53792882|dbj|BAD54059.1| unknown protein [Oryza sativa Japonica Group]
gi|113596595|dbj|BAF20469.1| Os06g0712200 [Oryza sativa Japonica Group]
gi|125598474|gb|EAZ38254.1| hypothetical protein OsJ_22631 [Oryza sativa Japonica Group]
gi|215692431|dbj|BAG87851.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 241
Score = 179 bits (455), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 104/213 (48%), Positives = 136/213 (63%), Gaps = 24/213 (11%)
Query: 78 SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAI--------- 128
+F T SL+ +L RLP+ + ASWG PGTK + NL+LELS GE L A
Sbjct: 11 AFPTLDSLAAFLGSRLPASALASWGTAPGTKTLLNLFLELSQGECVLISAAAAAAPPSQQ 70
Query: 129 ----PPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNR--CRPLAEKMKSNETPEEAV 182
P VR V+V +VRI +L+E+ Q LSDG++R+R RPL+EKM+ ETPE A
Sbjct: 71 QQQHPVVRAVHVASVRIRNGRGALLMETGQLLSDGTLRSRGGLRPLSEKMRPGETPEAAA 130
Query: 183 IRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGLPEE 241
+RAV+EELG + + VRI+ G +++ER+S SYPGL ARYVLH+VDA V+G+PE+
Sbjct: 131 VRAVREELG----ERVRVRIL-GGEEARVEERDSASYPGLHARYVLHAVDAEVVEGVPED 185
Query: 242 GEFCTEEAEEYADSSAAD---KALFVKRHYWKW 271
GEF TEE E+ D D A+ VKRHYWKW
Sbjct: 186 GEFDTEEGGEHEDEVVVDGAAAAITVKRHYWKW 218
>gi|125556726|gb|EAZ02332.1| hypothetical protein OsI_24435 [Oryza sativa Indica Group]
Length = 244
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/216 (48%), Positives = 136/216 (62%), Gaps = 27/216 (12%)
Query: 78 SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAI--------- 128
+F T SL+ +L RLP+ + ASWG PGTK + NL+LELS GE L A
Sbjct: 11 AFPTLDSLAAFLGSRLPASALASWGTAPGTKTLLNLFLELSQGECVLISAAAAAAPPSQQ 70
Query: 129 -------PPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNR--CRPLAEKMKSNETPE 179
P VR V+V +VRI +L+E+ Q LSDG++R+R RPL+EKM+ ETPE
Sbjct: 71 QQQQQQHPVVRAVHVASVRIRNGRGALLMETGQLLSDGTLRSRGGLRPLSEKMRPGETPE 130
Query: 180 EAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGL 238
A +RAV+EELG + + VRI+ G +++ER+S SYPGL ARYVLH+VDA V+G+
Sbjct: 131 AAAVRAVREELG----ERVRVRIL-GGEEARVEERDSASYPGLHARYVLHAVDAEVVEGV 185
Query: 239 PEEGEFCTEEAEEYADSSAAD---KALFVKRHYWKW 271
PE+GEF TEE E+ D D A+ VKRHYWKW
Sbjct: 186 PEDGEFDTEEGGEHEDEVVVDGAAAAITVKRHYWKW 221
>gi|226507796|ref|NP_001142985.1| uncharacterized protein LOC100275445 [Zea mays]
gi|195612554|gb|ACG28107.1| hypothetical protein [Zea mays]
Length = 225
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 107/198 (54%), Positives = 136/198 (68%), Gaps = 14/198 (7%)
Query: 78 SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPP--VRTVN 135
+F T SL+ +L+PRLP + +SWG PGTK + NL+LEL+ G+ C DA P VR V+
Sbjct: 16 AFPTLDSLAAFLRPRLPPGALSSWGAVPGTKTLANLFLELAVGD-CALDAASPRLVRAVH 74
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRC-RPLAEKMKSNETPEEAVIRAVKEELGSIL 194
V TVRI + LVE+ Q LSDG+VR R RPL+EKM+ E+PE A RAV+EELG+
Sbjct: 75 VATVRIRNRRGARLVETRQLLSDGTVRRRGPRPLSEKMRPGESPEAAAARAVREELGA-- 132
Query: 195 NDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGLPEEGEFCTEEAEEYA 253
VRI+ G+ +++ER S SYPGLPARYVLH+VDA VDG+PE+GEF TEEA E
Sbjct: 133 --RARVRILGGA-GARVEERESASYPGLPARYVLHAVDAEVVDGVPEDGEFETEEAGE-- 187
Query: 254 DSSAADKALFVKRHYWKW 271
D +A A+ VKRHYW W
Sbjct: 188 DDAAG--AITVKRHYWDW 203
>gi|326501076|dbj|BAJ98769.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326510433|dbj|BAJ87433.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326526951|dbj|BAK00864.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 226
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 106/201 (52%), Positives = 135/201 (67%), Gaps = 11/201 (5%)
Query: 78 SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCL--ADAIPP---VR 132
+F T SL+ +L+PRLP + ASWG PGTKN+ NL+LELS G+ L A + PP VR
Sbjct: 11 AFPTLDSLAAYLRPRLPGPALASWGSAPGTKNLLNLFLELSCGDCTLLPAASSPPALVVR 70
Query: 133 TVNVVTVRIIGKDNQILVESHQELSDGSVRNR-CRPLAEKMKSNETPEEAVIRAVKEELG 191
V+V TVRI + +LVE+ Q LSDG++R R RPL+EKM+ ETPE A RAV EELG
Sbjct: 71 AVHVATVRIRNRRGALLVETRQLLSDGTLRRRAARPLSEKMRPGETPEAAAARAVCEELG 130
Query: 192 SILNDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGLPEEGEFCTEEAE 250
+ VRI+ + +++E+ S+SYPGLPARYVLH+VDA V+G+PEEGEF TEEA
Sbjct: 131 ----ERARVRILGAAPEPRVEEQESVSYPGLPARYVLHAVDAELVEGVPEEGEFETEEAG 186
Query: 251 EYADSSAADKALFVKRHYWKW 271
E A+ VKRHYW W
Sbjct: 187 EGEGHDGGGAAITVKRHYWSW 207
>gi|168020019|ref|XP_001762541.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686274|gb|EDQ72664.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 184
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 86/177 (48%), Positives = 113/177 (63%), Gaps = 3/177 (1%)
Query: 97 SFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTVRIIGKDNQILVESHQEL 156
S +SWGV GTK + NLW EL DGE L D+ PP RTV+V V+I + L+ES QE+
Sbjct: 1 SVSSWGVDSGTKRIGNLWTELVDGEISLEDSKPPKRTVHVAVVKIRNEAGMFLIESSQEM 60
Query: 157 SDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNS 216
DG +++R RPL+EKM+ E E+A R V EELG + V ++ +Y ++ +ER+S
Sbjct: 61 DDGRMKSRNRPLSEKMRPGENVEDACRRGVFEELGPEMGARDRVEMILETYQREEEERDS 120
Query: 217 LSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADSSAADK--ALFVKRHYWKW 271
SYPGL RYVLHS+ A V GLP EFCTEE E + A+ A+ VK+HYWKW
Sbjct: 121 FSYPGLLTRYVLHSMVASVRGLP-SSEFCTEENEGKRNGHASGIRCAIGVKKHYWKW 176
>gi|413943158|gb|AFW75807.1| hypothetical protein ZEAMMB73_771450 [Zea mays]
Length = 252
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 105/195 (53%), Positives = 133/195 (68%), Gaps = 14/195 (7%)
Query: 78 SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPP--VRTVN 135
+F T SL+ +L+PRLP + +SWG PGTK + NL+LEL+ G+ C DA P VR V+
Sbjct: 43 AFPTLDSLAAFLRPRLPPGALSSWGAVPGTKTLANLFLELAVGD-CALDAASPRVVRAVH 101
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRC-RPLAEKMKSNETPEEAVIRAVKEELGSIL 194
V TVRI + LVE+ Q LSDG+VR R RPL+EKM+ E+PE A RAV+EELG+
Sbjct: 102 VATVRIRNRRGARLVETRQLLSDGTVRRRGPRPLSEKMRPGESPEAAAARAVREELGA-- 159
Query: 195 NDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGLPEEGEFCTEEAEEYA 253
VRI+ G +++ER S SYPGLPARYVLH+VDA VDG+PE+GEF TEEA E
Sbjct: 160 --RARVRIL-GGAGARVEERESASYPGLPARYVLHAVDAEVVDGVPEDGEFETEEAGE-- 214
Query: 254 DSSAADKALFVKRHY 268
D +A A+ VKRHY
Sbjct: 215 DDAAG--AITVKRHY 227
>gi|357117587|ref|XP_003560546.1| PREDICTED: uncharacterized protein LOC100835592 [Brachypodium
distachyon]
Length = 233
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 106/204 (51%), Positives = 136/204 (66%), Gaps = 14/204 (6%)
Query: 78 SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPP------- 130
+F T SL+ +LKPRLP + ASWG PGTKN+ NL+LELS G+ L A
Sbjct: 11 AFPTLDSLAAYLKPRLPGPALASWGTAPGTKNLLNLFLELSCGDCTLLPAAASSPPAALV 70
Query: 131 VRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRC-RPLAEKMKSNETPEEAVIRAVKEE 189
VR V+V TVRI + +LVE+ Q LSDG++R R RPL+EKM+ ETPE A RAV+EE
Sbjct: 71 VRAVHVATVRIRNRRGALLVETRQLLSDGTLRRRGPRPLSEKMRPGETPEAAAARAVREE 130
Query: 190 LGSILNDSITVRIV-PGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGLPEEGEFCTE 247
LG + + VRI+ P + ++++ER+S SYPGLPARYVLH+VDA V+G+PE+GEF TE
Sbjct: 131 LGEL----VRVRILGPEAEPRRVEERDSASYPGLPARYVLHAVDAEVVEGVPEDGEFDTE 186
Query: 248 EAEEYADSSAADKALFVKRHYWKW 271
EA E A+ VKRHYW W
Sbjct: 187 EAGEGEGHDGGGVAITVKRHYWAW 210
>gi|302822847|ref|XP_002993079.1| hypothetical protein SELMODRAFT_136514 [Selaginella moellendorffii]
gi|300139079|gb|EFJ05827.1| hypothetical protein SELMODRAFT_136514 [Selaginella moellendorffii]
Length = 237
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 85/207 (41%), Positives = 121/207 (58%), Gaps = 27/207 (13%)
Query: 88 WLKPRLPSDS--FASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTVRIIGKD 145
WLK RLP WG PGTK V NLW EL +GE L D+ PP RTV+V +V+I+ +
Sbjct: 30 WLKERLPQGGALLDLWGTAPGTKRVANLWRELLEGEISLEDSRPPKRTVHVASVQIVNES 89
Query: 146 NQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPG 205
++LVE++QE++DG +R R RPL+EKM+ E+ EEA +R + EELG ++ V ++
Sbjct: 90 GEMLVEAYQEMADGRIRPRNRPLSEKMRPGESVEEACLRGISEELGCAIDQ---VALLRE 146
Query: 206 SYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYAD----------- 254
SY + +ER S SYPGL RYV+H++ A V LP+ +F TEE E+
Sbjct: 147 SYQRVEEERESFSYPGLSTRYVIHTITAHVKQLPQT-DFDTEEDEDGNGGGGGGGGAAVL 205
Query: 255 ----------SSAADKALFVKRHYWKW 271
+S A+ V++H+WKW
Sbjct: 206 VAASGRTATATSCLGGAVGVRKHFWKW 232
>gi|302787050|ref|XP_002975295.1| hypothetical protein SELMODRAFT_103171 [Selaginella moellendorffii]
gi|300156869|gb|EFJ23496.1| hypothetical protein SELMODRAFT_103171 [Selaginella moellendorffii]
Length = 240
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 85/210 (40%), Positives = 121/210 (57%), Gaps = 30/210 (14%)
Query: 88 WLKPRLPSDS--FASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTVRIIGKD 145
WLK RLP WG PGTK V NLW EL +GE L D+ PP RTV+V +V+I+ +
Sbjct: 30 WLKERLPQGGAQLDLWGTAPGTKRVANLWRELLEGEISLEDSRPPKRTVHVASVQIVNES 89
Query: 146 NQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPG 205
++LVE++QE++DG +R R RPL+EKM+ E+ EEA +R + EELG ++ V ++
Sbjct: 90 GEMLVEAYQEMADGRIRPRNRPLSEKMRPGESVEEACLRGISEELGCAIDQ---VALLRE 146
Query: 206 SYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYAD----------- 254
SY + +ER S SYPGL RYV+H++ A V LP+ +F TEE E+
Sbjct: 147 SYQRVEEERESFSYPGLSTRYVIHTITAHVKQLPQT-DFDTEEDEDGNGGGGGSGGGGGA 205
Query: 255 -------------SSAADKALFVKRHYWKW 271
+S A+ V++H+WKW
Sbjct: 206 AVLVVASGRTATATSCLGGAVGVRKHFWKW 235
>gi|242094184|ref|XP_002437582.1| hypothetical protein SORBIDRAFT_10g029760 [Sorghum bicolor]
gi|241915805|gb|EER88949.1| hypothetical protein SORBIDRAFT_10g029760 [Sorghum bicolor]
Length = 260
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 97/202 (48%), Positives = 121/202 (59%), Gaps = 19/202 (9%)
Query: 78 SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPP------- 130
+F T SL+ +L+PRLP + ASWG PGTK + NL+LEL+ G+ L
Sbjct: 42 AFPTLDSLAAFLRPRLPPGALASWGAVPGTKTLSNLFLELAVGDCALHAPADADPPPAAP 101
Query: 131 --VRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRC-RPLAEKMKSNETPEEAVIRAVK 187
VR V+V TVRI LVE+ Q LSDG+VR R RPL+EKM+ E+PE A RAV+
Sbjct: 102 RVVRAVHVATVRIRNSRGARLVETRQLLSDGTVRRRGPRPLSEKMRPGESPEAAAARAVR 161
Query: 188 EELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGLPEEGEFCT 246
EELG+ VRI+ G +++ER S SYPGLPARYVLH+VDA VDGLPE+
Sbjct: 162 EELGA----RARVRIL-GGDGARVEERESASYPGLPARYVLHAVDAEVVDGLPED-GEFE 215
Query: 247 EEAEEYADSSAADKALFVKRHY 268
E D A A+ VKRHY
Sbjct: 216 TEETGEDDGDAG--AITVKRHY 235
>gi|361066237|gb|AEW07430.1| Pinus taeda anonymous locus 0_1427_02 genomic sequence
gi|383166917|gb|AFG66456.1| Pinus taeda anonymous locus 0_1427_02 genomic sequence
gi|383166918|gb|AFG66457.1| Pinus taeda anonymous locus 0_1427_02 genomic sequence
gi|383166919|gb|AFG66458.1| Pinus taeda anonymous locus 0_1427_02 genomic sequence
Length = 130
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 50/94 (53%), Positives = 66/94 (70%)
Query: 78 SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVV 137
+ + Q L+ WL R+ S WG++PGTKNV NLW+EL++GE L D+ PP RTVNVV
Sbjct: 37 AIGSAQGLARWLNTRMISTRLDMWGIEPGTKNVTNLWVELAEGEITLEDSCPPKRTVNVV 96
Query: 138 TVRIIGKDNQILVESHQELSDGSVRNRCRPLAEK 171
+V I K Q+L+ESHQE+ DGS R R RPL++K
Sbjct: 97 SVNIRNKSGQVLLESHQEMGDGSNRVRNRPLSQK 130
>gi|145341152|ref|XP_001415678.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575901|gb|ABO93970.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 224
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 73/195 (37%), Positives = 97/195 (49%), Gaps = 25/195 (12%)
Query: 77 HSFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLA----------- 125
H AT +L WL R + + GTKN+ +L +EL GE+ L
Sbjct: 6 HEGATTANLGAWLARR--GVAVEKFDRARGTKNLEDLLIELKSGESVLVSERERASGENG 63
Query: 126 -----DAIPP-VRTVNVVTVRII-----GKDNQILVESHQELSDGSV-RNRCRPLAEKMK 173
DA VR V+V+T+R+ +D L+E Q + R R RPL+EKM
Sbjct: 64 GEMNEDARETCVRYVDVLTLRVRRPGSNAEDGMCLIEKEQIFGKNELKRRRNRPLSEKMN 123
Query: 174 SNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA 233
E + V RAV+EELGS L D V + +Y + E S SYPGL +R+ LH VDA
Sbjct: 124 FGEHWRDCVERAVREELGSALGDDYVVETLEDTYKLCVSEEMSASYPGLRSRFALHRVDA 183
Query: 234 FVDGLPEEGEFCTEE 248
V GLP+E EF +EE
Sbjct: 184 IVHGLPDEDEFQSEE 198
>gi|255089965|ref|XP_002506904.1| predicted protein [Micromonas sp. RCC299]
gi|226522177|gb|ACO68162.1| predicted protein [Micromonas sp. RCC299]
Length = 234
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 70/199 (35%), Positives = 104/199 (52%), Gaps = 13/199 (6%)
Query: 84 SLSDWLKPR-LPSDSFASWGVKPGTKNVHNLWLELSDGETCLA---DAIPPVRTVNVVTV 139
SLS WL + +D+F + G K++ +L +E+ GET L P R V V+ +
Sbjct: 32 SLSAWLTANGVDADAFG----EDGAKSLVDLAVEIERGETTLVLDPQTSAPRRDVRVLRL 87
Query: 140 RIIGKD---NQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN- 195
+ +D +L+E+ QE G VR R PL+EK+ NE A RAV EELGS ++
Sbjct: 88 LVTDEDARPGMVLIEARQEWESGRVRTRRTPLSEKLLGNEDWRVACARAVNEELGSAIDA 147
Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGE-FCTEEAEEYAD 254
+ ++R+ S + + +R+SLSYPGL +RYV+++V V GLPE E E
Sbjct: 148 RTFSMRVDESSVKETMMDRDSLSYPGLRSRYVMYTVACDVKGLPEPTEQVFGSSTGEMGF 207
Query: 255 SSAADKALFVKRHYWKWRR 273
+S D + W WRR
Sbjct: 208 TSVEDTPAGWLKATWTWRR 226
>gi|307107732|gb|EFN55974.1| hypothetical protein CHLNCDRAFT_57724 [Chlorella variabilis]
Length = 201
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 101/195 (51%), Gaps = 16/195 (8%)
Query: 80 ATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCL-ADAIPPVRTVNVVT 138
A Q L+ WL S +G P K++ LW E+ GET L ++ P+R V+V+
Sbjct: 21 ADAQELAQWLAR--GGISLDRYGKGP-AKSLDQLWEEVEAGETQLFLESGRPLREVSVLN 77
Query: 139 VRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSI 198
V + Q L E+ Q L +GS R R PL+EK+ E +EA +R V EELG +L
Sbjct: 78 VLLKNSRGQTLYEAEQVLPNGSKRTRGLPLSEKLLPGERWQEAAVRGVLEELGPVLTADP 137
Query: 199 TVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFV-DGLPEEGEFCTEEAEEYADSSA 257
V + GS + ++ + S SYPGL ++YV V A V GLPE+ +F T+ EE +D
Sbjct: 138 QVEVDEGSMVESVEMKESQSYPGLLSKYVCKRVSARVLSGLPED-DFSTQ--EERSDG-- 192
Query: 258 ADKALFVKRHYWKWR 272
+ H W WR
Sbjct: 193 ------LLEHCWVWR 201
>gi|159463302|ref|XP_001689881.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283869|gb|EDP09619.1| predicted protein [Chlamydomonas reinhardtii]
Length = 203
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 82/142 (57%), Gaps = 2/142 (1%)
Query: 108 KNVHNLWLELSDGETCLA-DAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCR 166
K VH L+ E+S E+ L + +R VNV+++ I+ QIL E Q L DG R R
Sbjct: 42 KTVHELFDEVSKQESVLELEGGKALRIVNVLSLHILNSRGQILFEDEQVLPDGRSRRRNV 101
Query: 167 PLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARY 226
P++EKM +NE EA+ RAVKEEL S L TV ++ + + + +S+SYPGL +Y
Sbjct: 102 PVSEKMVANEPWHEALDRAVKEELSSALPLDYTVTLLEEPFMRT-EYSSSMSYPGLLTKY 160
Query: 227 VLHSVDAFVDGLPEEGEFCTEE 248
+ H V A V G+PE TEE
Sbjct: 161 IFHRVKARVSGIPENPFSTTEE 182
>gi|302854431|ref|XP_002958723.1| hypothetical protein VOLCADRAFT_100035 [Volvox carteri f.
nagariensis]
gi|300255898|gb|EFJ40179.1| hypothetical protein VOLCADRAFT_100035 [Volvox carteri f.
nagariensis]
Length = 210
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/151 (38%), Positives = 87/151 (57%), Gaps = 2/151 (1%)
Query: 108 KNVHNLWLELSDGETCLA-DAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCR 166
K VH L+ E++ E+ L + +R VNV+++ I+ QIL E Q L DG R R
Sbjct: 60 KTVHELFDEVTKQESVLELEGGKALRIVNVLSLHILNSRGQILFEDEQVLPDGRSRRRNV 119
Query: 167 PLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARY 226
P++EKM +E+ A+ RAV+EEL S L + V ++ SY + + +S+SYPGL +Y
Sbjct: 120 PVSEKMVGDESWLAALHRAVREELSSALPEDYKVTLLDDSYFLRTEYSSSMSYPGLLTKY 179
Query: 227 VLHSVDAFVDGLPEEGEFCTEEAEEYADSSA 257
+ H V A V G+PE F T E E +S++
Sbjct: 180 IFHRVRARVSGIPES-PFSTVEPNEGGESAS 209
>gi|224112030|ref|XP_002316059.1| predicted protein [Populus trichocarpa]
gi|222865099|gb|EEF02230.1| predicted protein [Populus trichocarpa]
Length = 117
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/138 (40%), Positives = 79/138 (57%), Gaps = 33/138 (23%)
Query: 137 VTVRIIGK--DNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSIL 194
+ VRI+ +IL+E+ QELSD S R + R EKMK+ E+ R+++E+
Sbjct: 1 MVVRILSSCSTKKILLEAQQELSDDSFRPQGREPGEKMKAGES------RSLREK----- 49
Query: 195 NDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYAD 254
VR GS SYPGLP+RY LH V+A+VDGLP+E +FCT+E +EY D
Sbjct: 50 -----VRGKSGS----------TSYPGLPSRYTLHWVNAYVDGLPDE-DFCTDE-DEYKD 92
Query: 255 SSAAD---KALFVKRHYW 269
A +A+ VK+H+W
Sbjct: 93 CGAGGYSSRAVTVKKHFW 110
>gi|428163378|gb|EKX32452.1| hypothetical protein GUITHDRAFT_121382 [Guillardia theta CCMP2712]
Length = 311
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 69/193 (35%), Positives = 92/193 (47%), Gaps = 14/193 (7%)
Query: 83 QSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTVRII 142
+SL+ WL+ R S WG + K V + EL GE C R ++VV VRI+
Sbjct: 114 ESLTAWLEIRDIDTSL--WG-QGEAKTVQEFFHELWAGE-CEPLTESAERVISVVKVRIL 169
Query: 143 GKDNQ---ILVESHQEL-SDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSI 198
D L E+ Q+L DG R R RPLAEK + E+ +A +R VKEELGS + +
Sbjct: 170 DHDEAGTWQLYETKQKLCRDGRTRPRNRPLAEKRRPEESSFDAAVRGVKEELGSSIKSTS 229
Query: 199 TVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADSSAA 258
+ + S + + R S S+PGL Y L VD V+GL T E EE + S
Sbjct: 230 RIVVKEDSLQEWTERRMSKSFPGLETVYKLSQVDVEVEGLSRAATGDTFETEETPELSNG 289
Query: 259 DKALFVKRHYWKW 271
K H W W
Sbjct: 290 ------KVHTWTW 296
>gi|412992265|emb|CCO19978.1| PREDICTED: hypothetical protein [Bathycoccus prasinos]
Length = 248
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 88/188 (46%), Gaps = 39/188 (20%)
Query: 106 GTKNVHNLWLELSDGETCLAD----------AIPPVRTVNVVTVRIIGKD--NQILVESH 153
G K++ +L E+ +GET L + + +R V+VV V I K N+ LVE
Sbjct: 77 GLKSIDDLLKEIENGETVLTEHETTHDDGTVKLSCIRRVSVVVVEITSKSKPNKKLVEYE 136
Query: 154 QELSDGSVRNRCRPLAEKMKS--NETPEEAVIRAVKEELGSILN-------DSITVRIVP 204
Q L G R R R L+EK+ S ETP EA R ++EELG L+ D ++R +P
Sbjct: 137 QTLPSGLARKRERFLSEKIMSGKGETPLEAAKRGIREELGDALSPKAVINIDETSLRSLP 196
Query: 205 GSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEE-GEFCTEEAEEYADSSAADKALF 263
S S SY LP RY + V+ +DGL E+ EF + E KA+
Sbjct: 197 LSSKPSF----SQSYKALPTRYRFYEVNCEIDGLAEDKDEFTSTE-------KCGTKAV- 244
Query: 264 VKRHYWKW 271
WKW
Sbjct: 245 -----WKW 247
>gi|428175671|gb|EKX44560.1| hypothetical protein GUITHDRAFT_139784 [Guillardia theta CCMP2712]
Length = 290
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 90/198 (45%), Gaps = 16/198 (8%)
Query: 79 FATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVT 138
A+ ++L WL D WG K+ L EL GE+ L R V V
Sbjct: 83 LASEEALERWLVEEAGCD-LNKWG-HGKAKSARELLEELRAGESFLRP--EGTRGVKVAK 138
Query: 139 VRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSI 198
VR+ D + L E+ Q L +G V+ R + LAEK + ETP A R V EELGSI+ +
Sbjct: 139 VRVFDGDYE-LFETSQILDNGVVKERNKQLAEKFRPGETPLHACGRGVSEELGSIVGSNP 197
Query: 199 TVRIVPGSYSKKLDE-RNSLSYPGLPARYVLHSVDAFVDGLPEEG---EFCTEEAEEYAD 254
+ + +E S SYPGL + Y L+ ++A + GL F T E+ E +D
Sbjct: 198 AIHFLRNGEVVAWEETETSSSYPGLRSVYELYWMEAEIQGLHRAATGMRFYTRESSE-SD 256
Query: 255 SSAADKALFVKRHYWKWR 272
K H W+W+
Sbjct: 257 GHCE------KIHVWEWK 268
>gi|428304217|ref|YP_007141042.1| hypothetical protein Cri9333_0586 [Crinalium epipsammum PCC 9333]
gi|428245752|gb|AFZ11532.1| hypothetical protein Cri9333_0586 [Crinalium epipsammum PCC 9333]
Length = 176
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 83/172 (48%), Gaps = 17/172 (9%)
Query: 104 KPGTKNVHNLWLELSDGET-CLADAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVR 162
K K+ +L E+ E+ + D P+R + V VR+I D Q L E Q SDG VR
Sbjct: 15 KGQAKDYSSLLQEIEKKESQIIWDNNRPIRLIKVCRVRVICGD-QELYEDRQVFSDGRVR 73
Query: 163 NR-CRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPG 221
R R LAEK+ +E E A RA+KEELG L+D + + +E+ S SYPG
Sbjct: 74 KRGIRGLAEKLFKDEKEEIAAKRALKEELG--LSDEASTLVSIHFNGIDNEEKESPSYPG 131
Query: 222 LPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADSSAADKALFVKRHYWKWRR 273
L RY+ F+ LP EE+ + D VK+ Y+ WR+
Sbjct: 132 LKTRYLFWD---FIAYLP-----VQYWQEEFVEGCEDD----VKQTYFVWRK 171
>gi|332662886|ref|YP_004445674.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332331700|gb|AEE48801.1| hypothetical protein Halhy_0900 [Haliscomenobacter hydrossis DSM
1100]
Length = 190
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 76/153 (49%), Gaps = 17/153 (11%)
Query: 87 DWLKPRLPSDSFAS--WGVKPGTKNVHNLWLELSDGETCLADAIPP--VRTVNVVTVRII 142
D L+ RL + ++ WG K K + +L E+ DGET L + +R V V I
Sbjct: 13 DDLRQRLENAGISTTDWG-KGTAKTLLHLVKEMEDGETFLEKSADGQLLRYVVVAAADIY 71
Query: 143 ----GKDNQILVESHQELSDGSVRNR--CRPLAEKMKSNETPEEAVIRAVKEELGSILND 196
+ L E Q DG R R R ++EK+K E P EA+IR ++EELG +
Sbjct: 72 YDAPSGERYRLKEDRQVFKDGRERKRNLGRAVSEKIKLAEIPAEAMIRGIREELG--IAG 129
Query: 197 SITVRIVPGSYSKKLDERNSLSYPGLPARYVLH 229
+I + G+ + +D S SYPGL +RYV H
Sbjct: 130 AINW-VEEGATEETVD---SPSYPGLTSRYVSH 158
>gi|294892790|ref|XP_002774235.1| hypothetical protein Pmar_PMAR028239 [Perkinsus marinus ATCC 50983]
gi|239879452|gb|EER06051.1| hypothetical protein Pmar_PMAR028239 [Perkinsus marinus ATCC 50983]
Length = 2924
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 90/196 (45%), Gaps = 38/196 (19%)
Query: 108 KNVHNLWLELSDGETCLA--DAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRC 165
KNV +L+ EL E C D R ++VV VR+ KD +LVESH+++SDG VR+R
Sbjct: 127 KNVSDLYFELQH-EICWLERDDRGLRRVLDVVIVRLSYKD-LVLVESHEQMSDGRVRSRN 184
Query: 166 RPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPAR 225
K + +ET A+ R + +ELG LN + SYS++ + +SLSYP +
Sbjct: 185 LLPGVKKRLSETIGAALERWIHKELG--LNFDCLKQAAAYSYSEQ-EADSSLSYP-IRVL 240
Query: 226 YVLHSV------DAFVD------GLPEEGEFCTEE--AEEYADSS--AADKALFVKR--- 266
+ LH + DA GLP+ F T E E D S AD + R
Sbjct: 241 FRLHEMSYQLIPDALSKEQLARIGLPKGHSFQTSEDLCEAVNDDSDNNADHGAVISRGGS 300
Query: 267 -----------HYWKW 271
H+W W
Sbjct: 301 SSCCGYLKSAVHFWGW 316
>gi|294887459|ref|XP_002772120.1| hypothetical protein Pmar_PMAR017350 [Perkinsus marinus ATCC 50983]
gi|239876058|gb|EER03936.1| hypothetical protein Pmar_PMAR017350 [Perkinsus marinus ATCC 50983]
Length = 2503
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 90/196 (45%), Gaps = 38/196 (19%)
Query: 108 KNVHNLWLELSDGETCLA--DAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRC 165
KNV +L+ EL E C D R ++VV VR+ KD +LVESH+++SDG VR+R
Sbjct: 127 KNVSDLYFELQH-EICWLERDDRGLRRVLDVVIVRLSYKD-LVLVESHEQMSDGRVRSRN 184
Query: 166 RPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPAR 225
K + +ET A+ R + +ELG LN + SYS++ + +SLSYP +
Sbjct: 185 LLPGVKKRLSETIGAALERWIHKELG--LNFDCLKQAAAYSYSEQ-EADSSLSYP-IRVL 240
Query: 226 YVLHSV------DAFVD------GLPEEGEFCTEE--AEEYADSS--AADKALFVKR--- 266
+ LH + DA GLP+ F T E E D S AD + R
Sbjct: 241 FRLHEMSYQLIPDALSKEQLARIGLPKGHSFQTSEDLCEAVNDDSDNNADHGAVISRGGS 300
Query: 267 -----------HYWKW 271
H+W W
Sbjct: 301 SSCCGYLKSAVHFWGW 316
>gi|406899724|gb|EKD42919.1| hypothetical protein ACD_72C00533G0001 [uncultured bacterium]
Length = 183
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 47/88 (53%), Gaps = 11/88 (12%)
Query: 149 LVESHQELSDGSVRNR--CRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGS 206
L E Q DG VR R R ++EKMK E +A++R ++EELG I+ +
Sbjct: 75 LKEEKQVFGDGRVRVRDFGRAVSEKMKFGEYSHDAMVRGIQEELG------ISSNVTLHE 128
Query: 207 YSKKLDERNSLSYPGLPARYVLHSVDAF 234
+D+++S SYPGL + YV VD F
Sbjct: 129 IETIVDKKDSPSYPGLMSNYV---VDVF 153
>gi|303287598|ref|XP_003063088.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455724|gb|EEH53027.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 250
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 87/212 (41%), Gaps = 59/212 (27%)
Query: 84 SLSDWLKPR-LPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTVRII 142
SL+ WLK + +D + K K + +L LE+ GE+ L V NV +R++
Sbjct: 26 SLATWLKTHGVDADRYG----KNNAKGLADLKLEVDLGESTLEVGPGGVPRRNVRVLRLL 81
Query: 143 GKDN--QILVESHQELSD---GSV---------------------RNRCR---------- 166
+D+ + LVE+ Q +D G V R R R
Sbjct: 82 IRDDAGRTLVEATQRWNDPNTGEVIAERMRGSPLRRVHYLSRRFPRYRVRASLRIGVHNA 141
Query: 167 -----------------PLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSK 209
P +EK+ E A RAV EELGS L T+ + S +
Sbjct: 142 NAVVWGPVCSDAFETHHPDSEKLIGAEDWRAAAPRAVAEELGSALAPGYTLAVDEASLVR 201
Query: 210 KLDERNSLSYPGLPARYVLHSVDAFVD-GLPE 240
+R SLSYP L +RY + SV A ++ GLPE
Sbjct: 202 ADVDRVSLSYPDLSSRYAMFSVAARIENGLPE 233
>gi|260495745|ref|ZP_05815867.1| mutator MutT protein [Fusobacterium sp. 3_1_33]
gi|260196703|gb|EEW94228.1| mutator MutT protein [Fusobacterium sp. 3_1_33]
Length = 252
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 41/73 (56%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ NETPE+ + R VKEE G IL
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLILI 61
Query: 196 DSITVRIVPGSYS 208
D I IV +++
Sbjct: 62 DYIHRGIVIFNFN 74
>gi|294867978|ref|XP_002765323.1| hypothetical protein Pmar_PMAR016118 [Perkinsus marinus ATCC 50983]
gi|239865336|gb|EEQ98040.1| hypothetical protein Pmar_PMAR016118 [Perkinsus marinus ATCC 50983]
Length = 1543
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 56/102 (54%), Gaps = 11/102 (10%)
Query: 180 EAVIRAVKEELGSILNDSITVRIVPGSYSKKLDE--RNSLSYPGLPARYVLHSVDAFVDG 237
E AVK+E+ ++D ++VR V K L E R+S +Y GL +YVL++ + +DG
Sbjct: 670 EVCSEAVKKEIAHSVSD-VSVRKVVMRNRKPLVELERSSPTYEGLATKYVLYTTEVLLDG 728
Query: 238 LPEE-GEFCTEEA-EEYADSSAADKALFV------KRHYWKW 271
LP EF T EA +++++D AL + H+W+W
Sbjct: 729 LPSGVEEFETTEAPVSGGNANSSDWALENGHKSSPRVHHWEW 770
>gi|237743627|ref|ZP_04574108.1| mutator mutT protein [Fusobacterium sp. 7_1]
gi|229432658|gb|EEO42870.1| mutator mutT protein [Fusobacterium sp. 7_1]
Length = 252
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 63/143 (44%), Gaps = 20/143 (13%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ NETPE+ + R VKEE G L
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61
Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADS 255
D I IV +++ DE P L++ F+ + E C+E D
Sbjct: 62 DYIHRGIVIFNFNN--DE---------PLYMYLYTSKNFLGEVQE----CSE-----GDL 101
Query: 256 SAADKALFVKRHYWKWRRAMREL 278
DK+ + WK + +L
Sbjct: 102 KWIDKSKIFNLNLWKGDKIFLDL 124
>gi|294784232|ref|ZP_06749527.1| mutator MutT protein [Fusobacterium sp. 3_1_27]
gi|294488098|gb|EFG35449.1| mutator MutT protein [Fusobacterium sp. 3_1_27]
Length = 252
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ NETPE+ + R VKEE G L
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61
Query: 196 DSITVRIVPGSYS 208
D I IV +++
Sbjct: 62 DYIHKGIVIFNFN 74
>gi|421527177|ref|ZP_15973781.1| Mutator MutT protein [Fusobacterium nucleatum ChDC F128]
gi|402256611|gb|EJU07089.1| Mutator MutT protein [Fusobacterium nucleatum ChDC F128]
Length = 252
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ NETPE+ + R VKEE G L
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLV 61
Query: 196 DSITVRIVPGSYS 208
D I IV +++
Sbjct: 62 DYIYRGIVIFNFN 74
>gi|284434558|gb|ADB85305.1| hypothetical protein [Phyllostachys edulis]
Length = 175
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 42/71 (59%), Gaps = 9/71 (12%)
Query: 158 DGSVRN-RCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYS----KKLD 212
DG++R RPL+EKM+ E+PE A++EE L++ + VRI+ G+ ++++
Sbjct: 82 DGTLRRWGQRPLSEKMRPGESPEATAACAIREE----LSERVRVRILGGAGGAEAKQRVE 137
Query: 213 ERNSLSYPGLP 223
E S SYP P
Sbjct: 138 EWESASYPASP 148
>gi|209965935|ref|YP_002298850.1| hydrolase [Rhodospirillum centenum SW]
gi|209959401|gb|ACJ00038.1| hydrolase, NUDIX family, putative [Rhodospirillum centenum SW]
Length = 151
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 37/65 (56%), Gaps = 2/65 (3%)
Query: 127 AIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAV 186
++PPVRTV VV ++ D ++L+ Q S+ K+ ETPE+A++R +
Sbjct: 16 SLPPVRTVLVVAAALVDVDGRVLLA--QRPPGKSLAGLWEFPGGKVHEGETPEQALVREL 73
Query: 187 KEELG 191
+EELG
Sbjct: 74 REELG 78
>gi|421144275|ref|ZP_15604191.1| Mutator MutT protein [Fusobacterium nucleatum subsp. fusiforme ATCC
51190]
gi|395489376|gb|EJG10215.1| Mutator MutT protein [Fusobacterium nucleatum subsp. fusiforme ATCC
51190]
Length = 252
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ NETPE+ + R VKEE G L
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61
Query: 196 DSITVRIVPGSYS 208
D I IV +++
Sbjct: 62 DYIHRGIVIFNFN 74
>gi|237742919|ref|ZP_04573400.1| mutator mutT protein [Fusobacterium sp. 4_1_13]
gi|256846891|ref|ZP_05552345.1| mutator mutT protein [Fusobacterium sp. 3_1_36A2]
gi|229430567|gb|EEO40779.1| mutator mutT protein [Fusobacterium sp. 4_1_13]
gi|256717689|gb|EEU31248.1| mutator mutT protein [Fusobacterium sp. 3_1_36A2]
Length = 252
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ NETPE+ + R VKEE G L
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61
Query: 196 DSITVRIVPGSYS 208
D I IV +++
Sbjct: 62 DYIHRGIVIFNFN 74
>gi|423137651|ref|ZP_17125294.1| hypothetical protein HMPREF9942_01432 [Fusobacterium nucleatum
subsp. animalis F0419]
gi|371959445|gb|EHO77131.1| hypothetical protein HMPREF9942_01432 [Fusobacterium nucleatum
subsp. animalis F0419]
Length = 252
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ NETPE+ + R VKEE G L
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61
Query: 196 DSITVRIVPGSYS 208
D I IV +++
Sbjct: 62 DYIHRGIVIFNFN 74
>gi|289765984|ref|ZP_06525362.1| mutator MutT protein [Fusobacterium sp. D11]
gi|289717539|gb|EFD81551.1| mutator MutT protein [Fusobacterium sp. D11]
Length = 252
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ NETPE+ + R VKEE G L
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61
Query: 196 DSITVRIVPGSYS 208
D I IV +++
Sbjct: 62 DYIHRGIVIFNFN 74
>gi|422315135|ref|ZP_16396574.1| hypothetical protein FPOG_00064 [Fusobacterium periodonticum D10]
gi|404592845|gb|EKA94571.1| hypothetical protein FPOG_00064 [Fusobacterium periodonticum D10]
Length = 153
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 20/143 (13%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + K+N+ L+ + + +N+ + K++ ETPE+ +IR VKEE G L
Sbjct: 2 ITTLCYLEKENKYLMLHRTKKENDINKNKWLGVGGKLEKGETPEQCLIREVKEETGLDLI 61
Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADS 255
D + IV +Y+ DE P L++ F + E C+E D
Sbjct: 62 DYVHRGIVIFNYND--DE---------PLDMYLYTSKNFFGEIQE----CSE-----GDL 101
Query: 256 SAADKALFVKRHYWKWRRAMREL 278
DK+ K + W+ R EL
Sbjct: 102 KWIDKSEIYKLNLWEGDRIFLEL 124
>gi|336420283|ref|ZP_08600519.1| mutator MutT protein [Fusobacterium sp. 11_3_2]
gi|336161324|gb|EGN64330.1| mutator MutT protein [Fusobacterium sp. 11_3_2]
Length = 252
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 64/143 (44%), Gaps = 20/143 (13%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ NETPE+ + R VKEE G IL
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLILI 61
Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADS 255
+ + IV +++ DE P L++ F+ + E C+E D
Sbjct: 62 NYVHRGIVIFNFND--DE---------PLYMYLYTSKNFLGEVQE----CSE-----GDL 101
Query: 256 SAADKALFVKRHYWKWRRAMREL 278
DK+ + WK + +L
Sbjct: 102 KWIDKSKIFNLNLWKGDKIFLDL 124
>gi|294783695|ref|ZP_06749019.1| mutator MutT protein [Fusobacterium sp. 1_1_41FAA]
gi|294480573|gb|EFG28350.1| mutator MutT protein [Fusobacterium sp. 1_1_41FAA]
Length = 153
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 64/143 (44%), Gaps = 20/143 (13%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + K+N+ L+ + + +N+ + K++ ETPE+ +IR VKEE G L
Sbjct: 2 ITTLCYLEKENKYLMLHRTKKENDINKNKWLGVGGKLEKGETPEQCLIREVKEETGLDLI 61
Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADS 255
D + IV +Y++ DE P L++ F + E C+E D
Sbjct: 62 DYVHRGIVIFNYNE--DE---------PLEMYLYTSKNFSGEIQE----CSE-----GDL 101
Query: 256 SAADKALFVKRHYWKWRRAMREL 278
DK+ K + W+ R EL
Sbjct: 102 KWIDKSEIYKLNLWEGDRIFLEL 124
>gi|428768996|ref|YP_007160786.1| hypothetical protein Cyan10605_0603 [Cyanobacterium aponinum PCC
10605]
gi|428683275|gb|AFZ52742.1| hypothetical protein Cyan10605_0603 [Cyanobacterium aponinum PCC
10605]
Length = 193
Score = 40.4 bits (93), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 46/97 (47%), Gaps = 5/97 (5%)
Query: 100 SWGVKPGTKNVHNLWLELSDGETCLADAIPP-VRTVNVVTVRII---GKDNQILVESHQE 155
W + G K + L+ E+ +GE+ L VR + V + +I G + L+E Q
Sbjct: 26 QWNHQQGNKTIEELYQEIKEGESQLKLIQGKLVRLMKVSAIEVIFNLGNNYFQLIEDKQI 85
Query: 156 LSDGSVRNRCRP-LAEKMKSNETPEEAVIRAVKEELG 191
G VR R + + EK+K E + R ++EE+G
Sbjct: 86 FLTGIVRRREQNFITEKIKGGENSLQGAYRGLEEEIG 122
>gi|389695973|ref|ZP_10183615.1| mutator mutT protein [Microvirga sp. WSM3557]
gi|388584779|gb|EIM25074.1| mutator mutT protein [Microvirga sp. WSM3557]
Length = 182
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 55/119 (46%), Gaps = 17/119 (14%)
Query: 128 IPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVK 187
+PP++ VV V +I DN+IL+ E + K++ E PEE +IR ++
Sbjct: 47 VPPLKLTVVVAVALIDTDNRILLAQRPE--GKQLAGLWEFPGGKLEPGERPEETLIRELR 104
Query: 188 EELGSILNDSITVR---IVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGE 243
EELG ITV+ + P +++ E L P Y+ + FV P EG+
Sbjct: 105 EELG------ITVKEPCLAPLTFASHAYETFHLLMP----LYICRRWEGFVQ--PLEGQ 151
>gi|422338999|ref|ZP_16419959.1| mutator MutT protein [Fusobacterium nucleatum subsp. polymorphum
F0401]
gi|355372126|gb|EHG19469.1| mutator MutT protein [Fusobacterium nucleatum subsp. polymorphum
F0401]
Length = 80
Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 37/68 (54%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ NETPE+ + R VKEE G L
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61
Query: 196 DSITVRIV 203
D I IV
Sbjct: 62 DYIHRGIV 69
>gi|340753337|ref|ZP_08690123.1| mutator mutT protein [Fusobacterium sp. 2_1_31]
gi|340566963|gb|EEO37977.2| mutator mutT protein [Fusobacterium sp. 2_1_31]
Length = 153
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 20/143 (13%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + K+N+ L+ + + +N+ + K++ ETPE+ +IR VKEE G L
Sbjct: 2 ITTLCYLEKENKYLMLHRTKKENDINKNKWLGVGGKLEKEETPEQCLIREVKEETGLDLI 61
Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADS 255
D + IV +Y+ DE P L++ F + E C+E D
Sbjct: 62 DYVHRGIVIFNYND--DE---------PLDMYLYTSKNFSGEIQE----CSE-----GDL 101
Query: 256 SAADKALFVKRHYWKWRRAMREL 278
DK+ K + W+ R EL
Sbjct: 102 KWIDKSEIYKLNLWEGDRIFLEL 124
>gi|262066430|ref|ZP_06026042.1| mutator MutT protein [Fusobacterium periodonticum ATCC 33693]
gi|291379857|gb|EFE87375.1| mutator MutT protein [Fusobacterium periodonticum ATCC 33693]
Length = 153
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 64/143 (44%), Gaps = 20/143 (13%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + K+++ L+ + + +N+ + K++ ETPE+ +IR VKEE G L
Sbjct: 2 ITTLCYLEKESKYLMLHRTKKENDINKNKWLGVGGKLEKGETPEQCLIREVKEETGLDLI 61
Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADS 255
D + IV +Y++ DE P L++ F + E C+E D
Sbjct: 62 DYVHRGIVIFNYNE--DE---------PLEMYLYTSKNFSGEIQE----CSE-----GDL 101
Query: 256 SAADKALFVKRHYWKWRRAMREL 278
DK+ K + W+ R EL
Sbjct: 102 KWIDKSQVYKLNLWEGDRIFLEL 124
>gi|34762713|ref|ZP_00143703.1| Mutator mutT protein (7,8-dihydro-8-oxoguanine-triphosphatase)
[Fusobacterium nucleatum subsp. vincentii ATCC 49256]
gi|27887612|gb|EAA24691.1| Mutator mutT protein (7,8-dihydro-8-oxoguanine-triphosphatase)
[Fusobacterium nucleatum subsp. vincentii ATCC 49256]
Length = 252
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 40/73 (54%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ +ETPE+ + R VKEE G L
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKSETPEQCLFREVKEETGLTLI 61
Query: 196 DSITVRIVPGSYS 208
D I IV +++
Sbjct: 62 DYIHRGIVIFNFN 74
>gi|19705096|ref|NP_602591.1| Mutator MutT protein [Fusobacterium nucleatum subsp. nucleatum ATCC
25586]
gi|19713020|gb|AAL93890.1| Mutator MutT protein [Fusobacterium nucleatum subsp. nucleatum ATCC
25586]
Length = 252
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 40/73 (54%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ +ETPE+ + R VKEE G L
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKSETPEQCLFREVKEETGLTLI 61
Query: 196 DSITVRIVPGSYS 208
D I IV +++
Sbjct: 62 DYIHRGIVIFNFN 74
>gi|296329356|ref|ZP_06871857.1| mutator MutT protein [Fusobacterium nucleatum subsp. nucleatum ATCC
23726]
gi|296153712|gb|EFG94529.1| mutator MutT protein [Fusobacterium nucleatum subsp. nucleatum ATCC
23726]
Length = 252
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 40/73 (54%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ +ETPE+ + R VKEE G L
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKRENDINKNKWLGVGGKLEKSETPEQCLFREVKEETGLTLI 61
Query: 196 DSITVRIVPGSYS 208
D I IV +++
Sbjct: 62 DYIHRGIVIFNFN 74
>gi|345297877|ref|YP_004827235.1| mutator MutT protein [Enterobacter asburiae LF7a]
gi|345091814|gb|AEN63450.1| mutator MutT protein [Enterobacter asburiae LF7a]
Length = 130
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 20/61 (32%), Positives = 35/61 (57%), Gaps = 2/61 (3%)
Query: 131 VRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEEL 190
++T+ + I NQI + Q +D + N+ K++S ETPE+A+IR ++EE+
Sbjct: 1 MKTLQIAVGIIRNPQNQIFIT--QRAADAHMANKWEFPGGKIESGETPEQALIRELQEEV 58
Query: 191 G 191
G
Sbjct: 59 G 59
>gi|336399973|ref|ZP_08580765.1| hypothetical protein HMPREF0404_00056 [Fusobacterium sp. 21_1A]
gi|336159973|gb|EGN63043.1| hypothetical protein HMPREF0404_00056 [Fusobacterium sp. 21_1A]
Length = 252
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 21/73 (28%), Positives = 40/73 (54%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + KDN+ L+ + + +N+ + K++ NETPE+ + R VKEE G L
Sbjct: 2 ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61
Query: 196 DSITVRIVPGSYS 208
+ + IV +++
Sbjct: 62 NYVHRGIVIFNFN 74
>gi|357011931|ref|ZP_09076930.1| MutT/NUDIX family protein [Paenibacillus elgii B69]
Length = 156
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 39/63 (61%), Gaps = 7/63 (11%)
Query: 131 VRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRC-RPLAEKMKSNETPEEAVIRAVKEE 189
+RTV++ +R +D+QILVE E+ D ++ R RP+ ++ E ++ VIR V EE
Sbjct: 3 IRTVSLCLIR---RDDQILVE---EIVDEGLKKRLYRPIGGTVEHGENSKQTVIREVMEE 56
Query: 190 LGS 192
+G+
Sbjct: 57 IGA 59
>gi|126728002|ref|ZP_01743818.1| mutator mutT protein [Sagittula stellata E-37]
gi|126710967|gb|EBA10017.1| mutator mutT protein [Sagittula stellata E-37]
Length = 142
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 2/80 (2%)
Query: 129 PPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKE 188
PP++ + V V +I D ++L+ E S+ K++S ETPE A+IR + E
Sbjct: 9 PPMKIILVSAVALIDPDGRVLLAQRPE--GKSMAGLWEFPGGKVESGETPEAALIRELHE 66
Query: 189 ELGSILNDSITVRIVPGSYS 208
ELG DS + S++
Sbjct: 67 ELGIETWDSCLAPLTFASHT 86
>gi|422939573|ref|ZP_16966951.1| mutator MutT protein [Fusobacterium nucleatum subsp. animalis ATCC
51191]
gi|339890380|gb|EGQ79513.1| mutator MutT protein [Fusobacterium nucleatum subsp. animalis ATCC
51191]
Length = 252
Score = 37.4 bits (85), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 21/73 (28%), Positives = 40/73 (54%)
Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
+ T+ + K N+ L+ + + +N+ + K++ NETPE+ + R VKEE G IL
Sbjct: 2 ITTLCYLEKANKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLILI 61
Query: 196 DSITVRIVPGSYS 208
+ + IV +++
Sbjct: 62 NYVHRGIVIFNFN 74
>gi|407771274|ref|ZP_11118634.1| hydrolase [Thalassospira xiamenensis M-5 = DSM 17429]
gi|407285720|gb|EKF11216.1| hydrolase [Thalassospira xiamenensis M-5 = DSM 17429]
Length = 143
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 41/79 (51%), Gaps = 2/79 (2%)
Query: 130 PVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEE 189
P +TV V V ++ D+++L+ E S+ K+++ ETPE A++R +KEE
Sbjct: 11 PEKTVFVSAVALVDADHRVLIAQRPE--GKSMAGLWEFPGGKVEAGETPEMALVRELKEE 68
Query: 190 LGSILNDSITVRIVPGSYS 208
LG + +S S++
Sbjct: 69 LGIDITESCLAPFTFASFT 87
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.133 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,724,982,507
Number of Sequences: 23463169
Number of extensions: 195756176
Number of successful extensions: 473240
Number of sequences better than 100.0: 72
Number of HSP's better than 100.0 without gapping: 51
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 473097
Number of HSP's gapped (non-prelim): 75
length of query: 281
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 140
effective length of database: 9,050,888,538
effective search space: 1267124395320
effective search space used: 1267124395320
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)