BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023448
(282 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255537559|ref|XP_002509846.1| conserved hypothetical protein [Ricinus communis]
gi|223549745|gb|EEF51233.1| conserved hypothetical protein [Ricinus communis]
Length = 326
Score = 312 bits (799), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 149/281 (53%), Positives = 197/281 (70%), Gaps = 9/281 (3%)
Query: 1 MSASINFSGQAQTQRLCFRALGGNGIVLNSLPSKISCQLMQNTSKIEHHGISIRSLAKCK 60
M++++NFSGQ + +CF+A +G++ S+P K S L+Q K E GI R K
Sbjct: 1 MASTVNFSGQIPQRTICFKAR--SGLMPGSIPFKRSSDLVQKFMKAERWGICFRKQTWNK 58
Query: 61 P------STLVPSASDD-SIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCS 113
+ ++ SA+DD +I + SA++ I++FY CINEK LE ++ YISD+CCFEDCS
Sbjct: 59 QFNYQRLNLVMSSAADDGNIPLNHISAASMIKKFYTCINEKRLEEIDKYISDNCCFEDCS 118
Query: 114 FPKPFQGKKEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGC 173
F P QGKKEVM F +QL T M QNVKFS+E VCE DEFT G+NWHLEWK +PFTRGC
Sbjct: 119 FYSPIQGKKEVMHFYQQLTTGMVQNVKFSIEHVCEDDEFTVGVNWHLEWKRTHIPFTRGC 178
Query: 174 SFYECSLEGETLLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPKATEWLLNSPHVIS 233
SFYECS EG+ ++IKK RVV ESPIKPGG+ + LLKN+T++ D+FP EW L SPH+I
Sbjct: 179 SFYECSAEGDRIVIKKVRVVTESPIKPGGLAMILLKNVTAIFDNFPNFAEWFLKSPHLIV 238
Query: 234 TFLLKAYTIFLAPFVRPILAGYINMWNFIARLLGLAFNILI 274
FL+K YT LAPFV P++ GY +W+F AR+ LA ++L+
Sbjct: 239 QFLMKIYTRLLAPFVNPLMGGYARIWSFTARVFALALSMLL 279
>gi|224071972|ref|XP_002303603.1| predicted protein [Populus trichocarpa]
gi|222841035|gb|EEE78582.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 306 bits (783), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 154/281 (54%), Positives = 190/281 (67%), Gaps = 7/281 (2%)
Query: 1 MSASINFSGQAQTQRLCFRALGGNGIVLNSLPSKISCQLMQNTSKIEHHGISIRS----- 55
M+A++NF GQ Q + + G G+ L S P K SCQLM + K+E +R
Sbjct: 1 MAAAVNFPGQLPRQTISLK--GVAGLTLASSPFKGSCQLMLSNVKMEQQVACLRKQVNNK 58
Query: 56 LAKCKPSTLVPSASDDSIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFP 115
L + LV SA+ D + S SN I++FY CIN+K L+ L+ YIS+DC FEDCSF
Sbjct: 59 LFNKRWVNLVVSAAADFDLKLGSSPSNMIKQFYTCINDKKLKELDGYISEDCHFEDCSFL 118
Query: 116 KPFQGKKEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSF 175
+P QGK+EVM F QL MG+N+KF +E VCE DE TAG+NWHLEWK Q+PFTRGCSF
Sbjct: 119 QPMQGKREVMHFFRQLTAGMGENMKFIIEHVCEDDEMTAGVNWHLEWKTIQIPFTRGCSF 178
Query: 176 YECSLEGETLLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPKATEWLLNSPHVISTF 235
YECS + + L+IKKA VVIESPIKPGGIVLTLLKN+T++ DDFP+A EWLL SPHVI F
Sbjct: 179 YECSHKDDRLVIKKALVVIESPIKPGGIVLTLLKNMTAIFDDFPRAAEWLLKSPHVIMQF 238
Query: 236 LLKAYTIFLAPFVRPILAGYINMWNFIARLLGLAFNILIYI 276
K Y+ LAP V P+LAGYI W IARL A NI + +
Sbjct: 239 CSKIYSRLLAPLVNPLLAGYIRAWKPIARLFAFALNIRLQL 279
>gi|224058447|ref|XP_002299514.1| predicted protein [Populus trichocarpa]
gi|222846772|gb|EEE84319.1| predicted protein [Populus trichocarpa]
Length = 276
Score = 273 bits (699), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 150/284 (52%), Positives = 183/284 (64%), Gaps = 11/284 (3%)
Query: 1 MSASINFSGQAQTQRLCFRALGGNGIVLNSLPSKISCQLMQNTSKIE--HHGISIRSLAK 58
M+A++NF GQ Q +C G G+ P K S Q MQN KI S +S K
Sbjct: 1 MAATVNFYGQLPRQTIC--PEGVAGLTFGFSPFKGSSQSMQNKMKITCLRKRASNKSFNK 58
Query: 59 CKPSTLVPSASDDSIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPF 118
+ +V +A D ++ P S S+ I++FY CINEK L+ L+ YISDDC FEDCSF +P
Sbjct: 59 RWANMVVSAADDFDFKLGP-SPSHMIKQFYTCINEKKLKELDGYISDDCFFEDCSFLQPM 117
Query: 119 QGKKEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYEC 178
QGKKEVM F QL MGQNVKF +E + + EWK Q+PFT+GCSFYEC
Sbjct: 118 QGKKEVMHFFGQLTAGMGQNVKFILEHSNKLMRIL-----YAEWKTIQIPFTKGCSFYEC 172
Query: 179 SLEGETLLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPKATEWLLNSPHVISTFLLK 238
S + + L+IKKA VVIESPIKPGGIVLTLLKN+T++ DDFPKA EWLL SPHVI F K
Sbjct: 173 SQKEDRLVIKKALVVIESPIKPGGIVLTLLKNVTAIFDDFPKAAEWLLKSPHVIMQFCSK 232
Query: 239 AYTIFLAPFVRPILAGYINMWNFIARLLGLAFN-ILIYILKIFS 281
Y+ LAPFV P+LAGYI WN IARL A N IL +++K F
Sbjct: 233 IYSRLLAPFVNPLLAGYIRAWNLIARLFAFALNTILHFLMKYFG 276
>gi|356495695|ref|XP_003516709.1| PREDICTED: uncharacterized protein LOC100779471 [Glycine max]
Length = 218
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 112/204 (54%), Positives = 153/204 (75%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
S ++T+ ++Y IN+K+L +L+ IS+D CF+D +F KPFQGKKEV++FLEQL MG+N
Sbjct: 15 SPADTVEQYYTSINDKDLRQLDECISEDACFDDYAFTKPFQGKKEVIRFLEQLTHCMGRN 74
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
V F ++ + EGD+ TA +WHLEW KQ+PFTRGC+F++ S G+ L+I A V+IESPI
Sbjct: 75 VTFRLKHIYEGDDLTAVASWHLEWNEKQIPFTRGCTFFKLSKLGKNLVIWNAEVLIESPI 134
Query: 199 KPGGIVLTLLKNLTSLSDDFPKATEWLLNSPHVISTFLLKAYTIFLAPFVRPILAGYINM 258
KPG IVLTLLKN+TS+ DDFPK TEW L SP+VI T++L+ Y IF+AP++ P+L GYI +
Sbjct: 135 KPGSIVLTLLKNVTSIFDDFPKVTEWFLRSPNVILTWILRVYNIFVAPWLHPLLEGYIKL 194
Query: 259 WNFIARLLGLAFNILIYILKIFSK 282
W F RLL ++I+I K F K
Sbjct: 195 WGFFVRLLNSVITVVIFISKTFFK 218
>gi|359474296|ref|XP_002273128.2| PREDICTED: uncharacterized protein LOC100251848 [Vitis vinifera]
Length = 228
Score = 240 bits (612), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 124/236 (52%), Positives = 158/236 (66%), Gaps = 13/236 (5%)
Query: 1 MSASINFSGQAQTQRLCFRALGGNGIVLNSLPSKISCQLMQNTSKIEHHGISIRSLAKCK 60
M+A I SGQ R + G + L+SLP K +CQL QN + GISI+S+ K K
Sbjct: 1 MAAPIRVSGQTP------RPIAG--LALSSLPMKTACQLKQNRAMTMQQGISIKSMKKNK 52
Query: 61 PS-----TLVPSASDDSIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFP 115
V S+ S TI FY CIN+KNL++L +SDDCCF D SFP
Sbjct: 53 TKFRGLYLAVQRRSNSYFGSSGPFPSETIELFYTCINDKNLKQLAKLVSDDCCFNDLSFP 112
Query: 116 KPFQGKKEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSF 175
+PF+GKKEV++F E+L MG+NVKF + VCEGD TA I+WHLEW+GKQ+PFTRGCSF
Sbjct: 113 QPFKGKKEVLRFFEELTAVMGKNVKFRILHVCEGDGLTAAIDWHLEWQGKQIPFTRGCSF 172
Query: 176 YECSLEGETLLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPKATEWLLNSPHV 231
YEC+ EG++L+IK+A++VIESPIKPG LTLLK +T+L DDFPKATE L S ++
Sbjct: 173 YECAEEGKSLIIKEAKIVIESPIKPGPFALTLLKIVTTLFDDFPKATESKLASINI 228
>gi|145358721|ref|NP_198962.2| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
thaliana]
gi|332007299|gb|AED94682.1| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
thaliana]
Length = 277
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 107/204 (52%), Positives = 144/204 (70%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
S + + +FY+ INEKN ++L + IS DC +D SFPKPF+GK+E M+F E+LV SMGQN
Sbjct: 74 SGYDAVMKFYSSINEKNQDQLSSCISSDCFIDDFSFPKPFRGKQEAMEFFEELVKSMGQN 133
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
VKF VE VCEGD +A +NWHLEWKG+++PFTRGCSFYE EG L+I+ AR++IESPI
Sbjct: 134 VKFCVENVCEGDGHSAAVNWHLEWKGRKIPFTRGCSFYEFIDEGGRLVIRNARILIESPI 193
Query: 199 KPGGIVLTLLKNLTSLSDDFPKATEWLLNSPHVISTFLLKAYTIFLAPFVRPILAGYINM 258
KPGGI L+LLKN+T L D+FPK E L P+ I ++ Y +FLAP + ++A Y+ +
Sbjct: 194 KPGGITLSLLKNITFLFDEFPKGAELFLEKPYAIIQAAIRIYGLFLAPLINHVMASYLKL 253
Query: 259 WNFIARLLGLAFNILIYILKIFSK 282
+ +A L I+I I +F K
Sbjct: 254 LSNMAEFFLLVVKIIIKIRNLFFK 277
>gi|297805502|ref|XP_002870635.1| hypothetical protein ARALYDRAFT_493835 [Arabidopsis lyrata subsp.
lyrata]
gi|297316471|gb|EFH46894.1| hypothetical protein ARALYDRAFT_493835 [Arabidopsis lyrata subsp.
lyrata]
Length = 277
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 107/204 (52%), Positives = 144/204 (70%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
S + + +FY+ INEKN ++L + IS DC +D SF KPF+G+KE M+F E+LV SMGQN
Sbjct: 74 SGYDAVMKFYSSINEKNQDQLSSCISSDCFIDDFSFSKPFRGRKEAMKFFEELVKSMGQN 133
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
VKF VE VCEGD +A +NWHLEWKG+++PFTRGCSFYE + EG L+I+ AR++IESPI
Sbjct: 134 VKFCVENVCEGDGHSAAVNWHLEWKGRKIPFTRGCSFYEFTDEGGKLVIRNARILIESPI 193
Query: 199 KPGGIVLTLLKNLTSLSDDFPKATEWLLNSPHVISTFLLKAYTIFLAPFVRPILAGYINM 258
KPGGI L+LLKN+T L D+FPK E L P+ I ++ Y +FLAP V ++A Y+ +
Sbjct: 194 KPGGITLSLLKNITFLFDEFPKGAELFLEKPYAIIQATIQIYGLFLAPLVNHVMASYLKL 253
Query: 259 WNFIARLLGLAFNILIYILKIFSK 282
+ +A L I+I I +F K
Sbjct: 254 LSNMAEFFLLVVKIIIKIRNLFFK 277
>gi|60547917|gb|AAX23922.1| hypothetical protein At5g41470 [Arabidopsis thaliana]
gi|71905583|gb|AAZ52769.1| hypothetical protein At5g41470 [Arabidopsis thaliana]
Length = 198
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 106/197 (53%), Positives = 141/197 (71%)
Query: 86 EFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKFSVEQ 145
+FY+ INEKN ++L + IS DC +D SFPKPF+GK+E M+F E+LV SMGQNVKF VE
Sbjct: 2 KFYSSINEKNQDQLSSCISSDCFIDDFSFPKPFRGKQEAMEFFEELVKSMGQNVKFCVEN 61
Query: 146 VCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIKPGGIVL 205
VCEGD +A +NWHLEWKG+++PFTRGCSFYE EG L+I+ AR++IESPIKPGGI L
Sbjct: 62 VCEGDGHSAAVNWHLEWKGRKIPFTRGCSFYEFIDEGGRLVIRNARILIESPIKPGGITL 121
Query: 206 TLLKNLTSLSDDFPKATEWLLNSPHVISTFLLKAYTIFLAPFVRPILAGYINMWNFIARL 265
+LLKN+T L D+FPK E L P+ I ++ Y +FLAP + ++A Y+ + + +A
Sbjct: 122 SLLKNITFLFDEFPKGAELFLEKPYAIIQAAIRIYGLFLAPLINHVMASYLKLLSNMAEF 181
Query: 266 LGLAFNILIYILKIFSK 282
L I+I I +F K
Sbjct: 182 FLLVVKIIIKIRNLFFK 198
>gi|414874058|tpg|DAA52615.1| TPA: hypothetical protein ZEAMMB73_891301 [Zea mays]
Length = 270
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 79/198 (39%), Positives = 124/198 (62%), Gaps = 1/198 (0%)
Query: 81 SNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVK 140
S+ ++EFY+ +NE+N +RL+ ++ DC ED ++ KP K + F ++L+ SMG+ VK
Sbjct: 71 SDVVQEFYSSLNERNSKRLDKLMAPDCIIEDTAYYKPLDAKCTHIYF-KRLMESMGEKVK 129
Query: 141 FSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIKP 200
F++++VC+G TA + WHLEW G +PF+RGCSFY CS+ G LLI+K + ESP+KP
Sbjct: 130 FAIDEVCQGAGRTAAVMWHLEWNGYIIPFSRGCSFYICSVNGAVLLIRKIHIFNESPLKP 189
Query: 201 GGIVLTLLKNLTSLSDDFPKATEWLLNSPHVISTFLLKAYTIFLAPFVRPILAGYINMWN 260
G L +L +T+L + FPK E L P ++ + Y ++ PF+ P LA Y + W
Sbjct: 190 GKWALEILNIVTNLFNVFPKPAEGFLKDPEAVAQPFINLYKFYVEPFIIPFLAYYTHFWT 249
Query: 261 FIARLLGLAFNILIYILK 278
++ R L + +IL + K
Sbjct: 250 YVGRGLSMVLHILYNLYK 267
>gi|357114599|ref|XP_003559086.1| PREDICTED: uncharacterized protein LOC100837010 [Brachypodium
distachyon]
Length = 270
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 80/202 (39%), Positives = 124/202 (61%), Gaps = 2/202 (0%)
Query: 81 SNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKK--EVMQFLEQLVTSMGQN 138
S+ I+EFY+ +N+K+ +L+ IS DC ED ++ KP K + +L+ +MG+N
Sbjct: 68 SDVIQEFYSSLNDKDSAQLKKLISPDCIIEDTAYYKPLDIKVLLNTHTYFTRLMEAMGKN 127
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
VKF++++VC+G E T + WHLEW GK +PF +GCSFY CS +GE LI+K + ESP+
Sbjct: 128 VKFAIDEVCQGVEPTVAVMWHLEWNGKTIPFAKGCSFYICSADGEAPLIRKVHIFDESPL 187
Query: 199 KPGGIVLTLLKNLTSLSDDFPKATEWLLNSPHVISTFLLKAYTIFLAPFVRPILAGYINM 258
KPG + L +L +T+L D P E L +P + + Y + PF+ P+LA Y +
Sbjct: 188 KPGKMALEILNLVTNLFDTLPNIAECFLKNPEALVQSFARFYKFCVKPFLVPLLAYYTHF 247
Query: 259 WNFIARLLGLAFNILIYILKIF 280
W+++A+ L + NIL I + F
Sbjct: 248 WSYVAQGLTMVLNILYNIFRRF 269
>gi|357481563|ref|XP_003611067.1| hypothetical protein MTR_5g010050 [Medicago truncatula]
gi|355512402|gb|AES94025.1| hypothetical protein MTR_5g010050 [Medicago truncatula]
Length = 249
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 86/213 (40%), Positives = 114/213 (53%), Gaps = 34/213 (15%)
Query: 26 IVLNSLPSKISCQLMQNTSKIEHHGISIRSLAKCKPSTLVPSASDDSIEIV-----PFSA 80
+ L+ P K +C L H I I+ + C + + + I + P S
Sbjct: 26 LTLSFSPRKTTCNL--------HKKIEIQQRSICVKNVKNGTWDSNRISAILSKRGPKSG 77
Query: 81 SNT------IREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTS 134
N+ + FY CINEK L++L+ YIS D CF D +F PFQGKKEVM FL+QL
Sbjct: 78 KNSMSPAEIVDHFYTCINEKELKQLDEYISQDACFYDYTFINPFQGKKEVMHFLQQLTAG 137
Query: 135 MGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVI 194
MGQNVKF V+ +CEGD+ T WHLEWK +Q+PFT GCSF++ E + I+
Sbjct: 138 MGQNVKFIVKNICEGDDLTVAAKWHLEWKKEQIPFTTGCSFFQLEKVEENMTIR------ 191
Query: 195 ESPIKPGGIVLTLLKNLTSLSDDFPKATEWLLN 227
+T+LK +TSL DDFP A E LN
Sbjct: 192 ---------YVTVLKTMTSLFDDFPNAAECELN 215
>gi|242032127|ref|XP_002463458.1| hypothetical protein SORBIDRAFT_01g000255 [Sorghum bicolor]
gi|241917312|gb|EER90456.1| hypothetical protein SORBIDRAFT_01g000255 [Sorghum bicolor]
Length = 226
Score = 140 bits (352), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 72/169 (42%), Positives = 106/169 (62%), Gaps = 8/169 (4%)
Query: 62 STLVPSA------SDDSIEIVPFSA-SNTIREFYACINEKNLERLETYISDDCCFEDCSF 114
+TL P A +D +P S S+ ++EFY+ +NEKN +RL+ ++ DC ED ++
Sbjct: 59 ATLGPDANGGLHNADRRGATLPSSPLSDVVQEFYSSLNEKNSKRLDKLMAPDCIVEDTAY 118
Query: 115 PKPFQGKKEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCS 174
KP K + F ++L+ SMG+ VKF++++VC+G TA + WHLEW G +PFT+GCS
Sbjct: 119 YKPLDAKCTRIYF-KRLMESMGKKVKFAIDEVCQGAGRTAAVMWHLEWNGYIIPFTKGCS 177
Query: 175 FYECSLEGETLLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPKATE 223
FY CS G LLI+K + ESP+KPG L +L +T+L + FPK E
Sbjct: 178 FYICSANGAALLIRKIHIFDESPLKPGKWALEILNIVTNLINMFPKLAE 226
>gi|294461593|gb|ADE76357.1| unknown [Picea sitchensis]
Length = 233
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 71/212 (33%), Positives = 114/212 (53%), Gaps = 16/212 (7%)
Query: 24 NGIVLNSLPSKISCQLMQNTSKIEHHGISIRSLAKCKP----------STLVPSASDDSI 73
NG+ + LP C Q ++ +H + L KCK S ++ +++ I
Sbjct: 28 NGLCIAPLP----CGAAQFSTLTTNH--ATWGLLKCKSMVQNGQPNRNSAVLGNSTRTEI 81
Query: 74 EIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVT 133
+ P SAS +REFY+ IN + LE + I+++C ++D FP+PF G+K+++ F ++
Sbjct: 82 VLSPRSASEIVREFYSRINSRELEFVGELIAENCVYDDLVFPQPFVGRKDIIDFFKKFTD 141
Query: 134 SMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVV 193
+ +++F ++ + D GI WHLEW+GK PF+RGCSFY C + I R
Sbjct: 142 ATSTDLQFVIDDITSEDSSAVGITWHLEWRGKPFPFSRGCSFYRCEIFNGKQQIIYGRDS 201
Query: 194 IESPIKPGGIVLTLLKNLTSLSDDFPKATEWL 225
+E KPG I L +K +TSL D FP+ + L
Sbjct: 202 VEPATKPGDIALVAIKAVTSLLDRFPQLADRL 233
>gi|225460141|ref|XP_002275923.1| PREDICTED: uncharacterized protein LOC100243104 isoform 1 [Vitis
vinifera]
gi|359495234|ref|XP_003634940.1| PREDICTED: uncharacterized protein LOC100243104 isoform 2 [Vitis
vinifera]
gi|297741015|emb|CBI31327.3| unnamed protein product [Vitis vinifera]
Length = 226
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 69/214 (32%), Positives = 107/214 (50%), Gaps = 20/214 (9%)
Query: 32 PSKISCQLMQ-NTSKIEHHGISIRS----LAKCKPSTLVPSAS---------------DD 71
PSK+S L+ +T+ I H RS L + S P+A+ D
Sbjct: 13 PSKLSPPLLHPHTASIATHFTQRRSRRCNLLRVSSSPQNPTATIAEPVTTTQSDEANGDG 72
Query: 72 SIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQL 131
E + S S+ +R FYA IN +L+ +E I+ +C +ED FP+PF G+K +M F +
Sbjct: 73 GDESILLSGSSVVRAFYAGINSHDLDSVEELIAHNCVYEDLIFPQPFVGRKAIMDFFNKF 132
Query: 132 VTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKAR 191
+ ++ ++F ++ + E D G+ WHLEWKG+ PF++GCSFY + I R
Sbjct: 133 IDTISMELQFVIDDISEADSSAVGVTWHLEWKGRPFPFSKGCSFYRLEVLNGKRQIIYGR 192
Query: 192 VVIESPIKPGGIVLTLLKNLTSLSDDFPKATEWL 225
+E KPG L ++ + L FPK +WL
Sbjct: 193 DSVEPFFKPGETALVAIRGVAWLLQRFPKLADWL 226
>gi|15217513|ref|NP_177304.1| nuclear transport factor 2-like protein [Arabidopsis thaliana]
gi|12323719|gb|AAG51815.1|AC016163_4 unknown protein; 43994-42987 [Arabidopsis thaliana]
gi|37202096|gb|AAQ89663.1| At1g71480 [Arabidopsis thaliana]
gi|51969762|dbj|BAD43573.1| unknown protein [Arabidopsis thaliana]
gi|51969880|dbj|BAD43632.1| unknown protein [Arabidopsis thaliana]
gi|51971487|dbj|BAD44408.1| unknown protein [Arabidopsis thaliana]
gi|51971841|dbj|BAD44585.1| unknown protein [Arabidopsis thaliana]
gi|51971939|dbj|BAD44634.1| unknown protein [Arabidopsis thaliana]
gi|222423003|dbj|BAH19485.1| AT1G71480 [Arabidopsis thaliana]
gi|332197085|gb|AEE35206.1| nuclear transport factor 2-like protein [Arabidopsis thaliana]
Length = 216
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 101/197 (51%), Gaps = 4/197 (2%)
Query: 26 IVLNSLPSK--ISCQLMQNTSKIEHHGISIRSLAKCKPSTLVPSASDDSIEIVPFSASNT 83
+ LN LP + +L++ TS H R A P+ +A+D E P SAS
Sbjct: 17 VSLNHLPPSFFLPTKLVKPTSLT--HSQPPRLSASYGPAAKAATANDVVPETAPTSASEV 74
Query: 84 IREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKFSV 143
+ FYA +N +L + I+ DC +ED F PF G+K ++ F + + S +++F +
Sbjct: 75 VSSFYAAVNVHDLSSVTDLIAQDCVYEDLVFSSPFVGRKAILDFFGKFIESTSTDLQFVI 134
Query: 144 EQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIKPGGI 203
+ + D G++WHLEWKGK PF++GCSFY + I R +E IKPG
Sbjct: 135 DDISTEDSSAVGVSWHLEWKGKNFPFSKGCSFYRLEVIDGKRQIVYGRDCVEPAIKPGET 194
Query: 204 VLTLLKNLTSLSDDFPK 220
VL +K +T L FP+
Sbjct: 195 VLAAIKGVTWLLQKFPQ 211
>gi|356539020|ref|XP_003537998.1| PREDICTED: uncharacterized protein LOC100813246 [Glycine max]
Length = 227
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 56/156 (35%), Positives = 93/156 (59%), Gaps = 1/156 (0%)
Query: 70 DDSIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLE 129
DD ++V SAS ++ FY IN +++ +E I+++C +ED FP+PF G+KE+++F +
Sbjct: 73 DDDNDVVD-SASAVLKNFYGGINAHDVDSVEYLIAENCVYEDLVFPRPFVGRKEILEFFK 131
Query: 130 QLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKK 189
+ S ++++F ++ + D + G+ WHLEWKGK PF++GCSFY + I
Sbjct: 132 KFTNSTSKDLQFVIDDLSTEDSSSVGVIWHLEWKGKPFPFSKGCSFYRLEVINGKRQITY 191
Query: 190 ARVVIESPIKPGGIVLTLLKNLTSLSDDFPKATEWL 225
R +E IKPG L ++++T L FP+ WL
Sbjct: 192 GRDCVEPAIKPGDATLAAIRSVTWLLQQFPQLANWL 227
>gi|449518407|ref|XP_004166233.1| PREDICTED: uncharacterized protein LOC101231762 [Cucumis sativus]
Length = 193
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 68/163 (41%), Positives = 97/163 (59%), Gaps = 6/163 (3%)
Query: 70 DDSIEIVPFSASNT----IREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVM 125
DDS P S+SN+ I FY CINEKNL+ + TYIS+DC ED F + F+GKK M
Sbjct: 2 DDSFS-CPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEKFKGKKAAM 60
Query: 126 QFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETL 185
F+E+L SMG +VKF + +V E AG WHLEW+ ++P T+GC+F + E E
Sbjct: 61 SFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDE-ERK 119
Query: 186 LIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPKATEWLLNS 228
I+K +++ E K G ++L ++K +T L EWL+ +
Sbjct: 120 TIQKIQIINEPQFKAGHLILDIMKLVTLLLAKNSAILEWLIKA 162
>gi|224146286|ref|XP_002325950.1| predicted protein [Populus trichocarpa]
gi|222862825|gb|EEF00332.1| predicted protein [Populus trichocarpa]
Length = 164
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 94/161 (58%), Gaps = 2/161 (1%)
Query: 67 SASDDSIEIVPF--SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEV 124
S+ + ++ + P SAS+ +R FY IN +L+ +E I+++C +ED FP+PF G+K +
Sbjct: 4 SSENQTVTVAPSVESASDVVRSFYEGINGHDLDSVEELIAENCVYEDLIFPRPFVGRKAI 63
Query: 125 MQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGET 184
++F + + ++ ++++F ++ + D F G+ WHLEWKGK PF++GCSFY
Sbjct: 64 LEFFNKFIDTVSKDLQFVIDDISNEDSFAVGVTWHLEWKGKSFPFSKGCSFYRLDAVNGK 123
Query: 185 LLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPKATEWL 225
I R +E +KPG L ++ +T L FP+ + L
Sbjct: 124 RQIIYGRDSVEPAVKPGEAALVAIRGVTWLLQRFPQLADQL 164
>gi|449450034|ref|XP_004142769.1| PREDICTED: uncharacterized protein LOC101216887 [Cucumis sativus]
Length = 218
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 58/164 (35%), Positives = 100/164 (60%), Gaps = 2/164 (1%)
Query: 61 PSTLVPSASDDSIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQG 120
P+ VPS D+ SASN +R+FY +N +L +E I+++C +ED F +PF G
Sbjct: 54 PAVTVPSPPTDAPLDTLRSASNVVRDFYDGVNRHDLASVEPLIAENCVYEDLIFSRPFVG 113
Query: 121 KKEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYE-CS 179
+K+++ F ++ S+ ++++F ++ + D G+ WHLEWKGK+ PF++GCSFY
Sbjct: 114 RKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLAD 173
Query: 180 LEGETLLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPKATE 223
++G+ +I AR +E KPG + LT ++ +T L + FP+ +
Sbjct: 174 VDGKKQII-YARDSVEPAFKPGEMALTAIRGVTWLLEQFPQLAD 216
>gi|255574237|ref|XP_002528033.1| protein transporter, putative [Ricinus communis]
gi|223532563|gb|EEF34351.1| protein transporter, putative [Ricinus communis]
Length = 218
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 93/163 (57%), Gaps = 2/163 (1%)
Query: 63 TLVPSASDDSIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKK 122
T P+ DD+ + AS+ +R FY IN+ +L +E I+++C +ED FP PF G+K
Sbjct: 57 TPSPATKDDAADTQ--LASDIVRRFYEGINDHDLASVEDLIAENCVYEDLIFPHPFVGRK 114
Query: 123 EVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEG 182
+++F ++ + S+ ++++F ++ + D G+ WHLEWKG+ PF++GCSFY +
Sbjct: 115 AILEFFKKFIDSISKDLQFVIDDISTEDSLAVGVTWHLEWKGRPFPFSKGCSFYRLQVLN 174
Query: 183 ETLLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPKATEWL 225
I R +E IKPG L +++ L L FP+ + L
Sbjct: 175 SKKQIIYGRDSVEPAIKPGEAALVIIRGLAWLLQRFPQLADQL 217
>gi|218194167|gb|EEC76594.1| hypothetical protein OsI_14445 [Oryza sativa Indica Group]
Length = 612
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 89/142 (62%), Gaps = 1/142 (0%)
Query: 82 NTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKF 141
+ I++FY+ +NEK+ +RLE I+ DC +D ++ K K + +L+ +MG+N KF
Sbjct: 437 DVIQQFYSSLNEKDSKRLENLIAPDCIIDDNAYYK-LLDIKSTQTYFRRLMDAMGKNFKF 495
Query: 142 SVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIKPG 201
++++V +G E T + WHLEW GK +PFT+GCSFY CS + L+I+K + ESP+KP
Sbjct: 496 AIDEVSQGVEPTFAVMWHLEWNGKTIPFTKGCSFYICSRKEAALVIRKIHIFQESPVKPC 555
Query: 202 GIVLTLLKNLTSLSDDFPKATE 223
L +L T+L D FP E
Sbjct: 556 KFSLEILNIATNLFDTFPNIAE 577
>gi|449483825|ref|XP_004156703.1| PREDICTED: uncharacterized LOC101216887 [Cucumis sativus]
Length = 218
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 99/164 (60%), Gaps = 2/164 (1%)
Query: 61 PSTLVPSASDDSIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQG 120
P+ VPS D+ SASN +R+FY +N +L +E I+++C +ED F +PF G
Sbjct: 54 PAVTVPSPPTDAPLDTLRSASNVVRDFYDGVNRHDLASVEPLIAENCVYEDLIFSRPFVG 113
Query: 121 KKEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYE-CS 179
+K+++ F ++ S+ ++++F ++ + D G+ WHLEWKGK+ PF++GC FY
Sbjct: 114 RKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCRFYRLAD 173
Query: 180 LEGETLLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPKATE 223
++G+ +I AR +E KPG + LT ++ +T L + FP+ +
Sbjct: 174 VDGKKQII-YARDSVEPAFKPGEMALTAIRGVTWLLEQFPQLAD 216
>gi|356495768|ref|XP_003516745.1| PREDICTED: uncharacterized protein LOC100802494 [Glycine max]
Length = 221
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 52/147 (35%), Positives = 86/147 (58%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
S S +R FY IN +++ +E ++++C +ED FP+PF G+KE+++F ++ S +
Sbjct: 75 SNSAVVRNFYGGINAHDVDSVEYLMAENCVYEDLVFPRPFVGRKEILEFFKKFTNSTSKG 134
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
++F ++ + D + G+ WHLEWKGK PF++GCSFY + I R +E I
Sbjct: 135 LQFVIDDLSTEDSSSVGVIWHLEWKGKPFPFSKGCSFYRLEVINGKRQITYGRDCVEPAI 194
Query: 199 KPGGIVLTLLKNLTSLSDDFPKATEWL 225
KPG L ++++T L FP+ WL
Sbjct: 195 KPGDAALAAIRSVTWLLQQFPQLANWL 221
>gi|168009022|ref|XP_001757205.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691703|gb|EDQ78064.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 148
Score = 123 bits (308), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 81/140 (57%)
Query: 80 ASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNV 139
A+ ++EFYA IN + + + +D+C +ED FP PF G++ ++ F ++ + S+G +
Sbjct: 3 AAVMVQEFYAAINRREITSIGDLFADNCVYEDLVFPTPFTGRQAILDFFKKFMDSVGSEL 62
Query: 140 KFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIK 199
+F ++ + GD AG+ WHLEW+GK +PF++GCSFY C + R +E K
Sbjct: 63 EFRIDDITTGDPNAAGVIWHLEWRGKPLPFSKGCSFYRCEVLNGKRQFVYGRDAVEPASK 122
Query: 200 PGGIVLTLLKNLTSLSDDFP 219
PG L LK L +L FP
Sbjct: 123 PGDFTLVALKGLAALLRTFP 142
>gi|51536248|dbj|BAD38417.1| nuclear transport factor 2 (NTF2)-like protein [Oryza sativa
Japonica Group]
gi|125604939|gb|EAZ43975.1| hypothetical protein OsJ_28594 [Oryza sativa Japonica Group]
gi|215695383|dbj|BAG90574.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 215
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 96/162 (59%), Gaps = 5/162 (3%)
Query: 62 STLVPSASDDSIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGK 121
S++V AS + E SA++ +R FY +N ++L +E I++ C +ED FP PF G+
Sbjct: 51 SSMVEPASGE--ETAARSAADVVRAFYDGVNRRDLAAVEPLIAEGCVYEDLVFPNPFVGR 108
Query: 122 KEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTA-GINWHLEWKGKQVPFTRGCSFYECSL 180
E++ F + S+ +++F ++ + GD+ A G+ WHL+WKG+ PF+RGCSFY L
Sbjct: 109 AEILGFFAGFMGSVSSDLRFVIDDISAGDDSRAVGVTWHLDWKGRPFPFSRGCSFYRLQL 168
Query: 181 E--GETLLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPK 220
+ + L I R +E +KPG L +++ +T + + FP+
Sbjct: 169 DEKQQQLQIVYGRDCVEPAVKPGESALLIIRAVTWIFERFPR 210
>gi|31193920|gb|AAP44755.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108712240|gb|ABG00035.1| expressed protein [Oryza sativa Japonica Group]
Length = 259
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 91/147 (61%), Gaps = 5/147 (3%)
Query: 82 NTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGK-----KEVMQFLEQLVTSMG 136
+ I++FY+ +NEK+ +RLE I+ DC +D ++ K K + L++L+ +MG
Sbjct: 78 DVIQQFYSSLNEKDSKRLENLIAPDCIIDDNAYYKLLDIKVLLVLHSHITDLQRLMDAMG 137
Query: 137 QNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIES 196
+N KF++++V +G E T + WHLEW GK +PFT+GCSFY CS + L+I+K + ES
Sbjct: 138 KNFKFAIDEVSQGVEPTFAVMWHLEWNGKTIPFTKGCSFYICSRKEAALVIRKIHIFQES 197
Query: 197 PIKPGGIVLTLLKNLTSLSDDFPKATE 223
P+KP L +L T+L D FP E
Sbjct: 198 PVKPCKFSLEILNIATNLFDTFPNIAE 224
>gi|125562981|gb|EAZ08361.1| hypothetical protein OsI_30617 [Oryza sativa Indica Group]
Length = 215
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 96/162 (59%), Gaps = 5/162 (3%)
Query: 62 STLVPSASDDSIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGK 121
S++V AS + E SA++ +R FY +N ++L +E I++ C +ED FP PF G+
Sbjct: 51 SSMVEPASGE--ETAARSAADVVRAFYDGLNRRDLAAVEPLIAEGCVYEDLVFPNPFVGR 108
Query: 122 KEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTA-GINWHLEWKGKQVPFTRGCSFYECSL 180
E++ F + S+ +++F ++ + GD+ A G+ WHL+WKG+ PF+RGCSFY L
Sbjct: 109 AEILGFFAGFMGSVSSDLRFVIDDISAGDDSRAVGVTWHLDWKGRPFPFSRGCSFYRLQL 168
Query: 181 E--GETLLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPK 220
+ + L I R +E +KPG L +++ +T + + FP+
Sbjct: 169 DEKQQQLQIVYGRDCVEPAVKPGESALLIIRAVTWIFERFPR 210
>gi|115478242|ref|NP_001062716.1| Os09g0266000 [Oryza sativa Japonica Group]
gi|51536247|dbj|BAD38416.1| nuclear transport factor 2 (NTF2)-like protein [Oryza sativa
Japonica Group]
gi|113630949|dbj|BAF24630.1| Os09g0266000 [Oryza sativa Japonica Group]
gi|215692578|dbj|BAG87998.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 331
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 96/162 (59%), Gaps = 5/162 (3%)
Query: 62 STLVPSASDDSIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGK 121
S++V AS + E SA++ +R FY +N ++L +E I++ C +ED FP PF G+
Sbjct: 51 SSMVEPASGE--ETAARSAADVVRAFYDGVNRRDLAAVEPLIAEGCVYEDLVFPNPFVGR 108
Query: 122 KEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTA-GINWHLEWKGKQVPFTRGCSFYECSL 180
E++ F + S+ +++F ++ + GD+ A G+ WHL+WKG+ PF+RGCSFY L
Sbjct: 109 AEILGFFAGFMGSVSSDLRFVIDDISAGDDSRAVGVTWHLDWKGRPFPFSRGCSFYRLQL 168
Query: 181 E--GETLLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPK 220
+ + L I R +E +KPG L +++ +T + + FP+
Sbjct: 169 DEKQQQLQIVYGRDCVEPAVKPGESALLIIRAVTWIFERFPR 210
>gi|449452454|ref|XP_004143974.1| PREDICTED: uncharacterized protein LOC101214565 [Cucumis sativus]
Length = 157
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/140 (44%), Positives = 86/140 (61%), Gaps = 6/140 (4%)
Query: 70 DDSIEIVPFSASNT----IREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVM 125
DDS P S+SN+ I FY CINEKNL+ + TYIS+DC ED F + F+GKK M
Sbjct: 2 DDSFS-CPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEKFKGKKAAM 60
Query: 126 QFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETL 185
F+E+L SMG +VKF + +V E AG WHLEW+ ++P T+GC+F + E E
Sbjct: 61 SFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDE-ERK 119
Query: 186 LIKKARVVIESPIKPGGIVL 205
I+K +++ E K G ++L
Sbjct: 120 TIQKIQIINEPQFKAGHLIL 139
>gi|357167107|ref|XP_003581007.1| PREDICTED: uncharacterized protein LOC100842516 [Brachypodium
distachyon]
Length = 196
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 83/147 (56%), Gaps = 3/147 (2%)
Query: 82 NTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKF 141
+REFY +N ++L +E I++ C +ED FP+P G+ +V+ F + + S+ +++F
Sbjct: 50 EVVREFYDGVNRRDLAAVEPLIAEGCVYEDLVFPRPMVGRDQVLGFFGEFMGSISPDLRF 109
Query: 142 SVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEG---ETLLIKKARVVIESPI 198
++ + D G+ WHLEWKG+ PF+RGCSFY C + + + I R +E
Sbjct: 110 VIDDISGLDPSAVGVTWHLEWKGRPFPFSRGCSFYRCQPDPQKQQQIQIVYGRDCVEPAT 169
Query: 199 KPGGIVLTLLKNLTSLSDDFPKATEWL 225
KPG + L +++ +T + + FP L
Sbjct: 170 KPGELALVIIRGVTWIFERFPSLARML 196
>gi|302796609|ref|XP_002980066.1| hypothetical protein SELMODRAFT_58583 [Selaginella moellendorffii]
gi|300152293|gb|EFJ18936.1| hypothetical protein SELMODRAFT_58583 [Selaginella moellendorffii]
Length = 143
Score = 114 bits (285), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 81/141 (57%), Gaps = 1/141 (0%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
+A +T+R+FY+ IN + LE I+ DC +ED F KPF G+K +++ E +S+G
Sbjct: 1 AAKDTVRQFYSAINHRQLEIAGDLIASDCVYEDLVFSKPFVGRKAIIELFETFTSSVGPE 60
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
V F ++++ +G+ + G+ WHL+W+G PF++GCSFY C I R ++E
Sbjct: 61 VSFVIDEISDGN-LSVGVTWHLDWRGNVFPFSKGCSFYRCEDVSGDYKIVYGRDIVEPAF 119
Query: 199 KPGGIVLTLLKNLTSLSDDFP 219
K + L + +T L + FP
Sbjct: 120 KINSVALKAIGAVTWLFERFP 140
>gi|224135761|ref|XP_002327297.1| predicted protein [Populus trichocarpa]
gi|222835667|gb|EEE74102.1| predicted protein [Populus trichocarpa]
Length = 150
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 55/149 (36%), Positives = 82/149 (55%)
Query: 77 PFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMG 136
P +ASN IR FY +N +L +E I+++C +ED FP P G KEV++F +Q S
Sbjct: 2 PATASNVIRRFYDGLNSHDLVAVEEIIAENCVYEDLIFPHPCVGHKEVIEFFQQFFDSTS 61
Query: 137 QNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIES 196
++ F ++ + D G+ WH+EWKGK +P ++GCSFY I R +ES
Sbjct: 62 SDLHFVIDDISGEDSQAVGVIWHIEWKGKPMPCSKGCSFYRLETVNGKRQIIYGRDSVES 121
Query: 197 PIKPGGIVLTLLKNLTSLSDDFPKATEWL 225
IKPG L L++++ L F + T L
Sbjct: 122 AIKPGKAALVLIRSVIWLLQKFDQLTNQL 150
>gi|326522692|dbj|BAJ88392.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 203
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 81/144 (56%), Gaps = 3/144 (2%)
Query: 85 REFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKFSVE 144
REFY+ +N ++L + I++ C +ED FP+P G+ V+ F + + S+ +++F ++
Sbjct: 60 REFYSGVNRRDLAAVAPLIAEGCVYEDLVFPRPMVGRDRVVGFFGEFMGSVSPDLRFVID 119
Query: 145 QVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLE---GETLLIKKARVVIESPIKPG 201
+ D G+ WHLEWKG+ PF+RGCSFY C + E + I R +E KPG
Sbjct: 120 DISGDDPSAVGVTWHLEWKGRPFPFSRGCSFYRCQPDPQRPEQIQIVYGRDCVEPATKPG 179
Query: 202 GIVLTLLKNLTSLSDDFPKATEWL 225
+ L ++ +T L + FP + L
Sbjct: 180 ELALVAIRGVTWLLERFPSLADRL 203
>gi|226509360|ref|NP_001141296.1| uncharacterized protein LOC100273387 [Zea mays]
gi|194703844|gb|ACF86006.1| unknown [Zea mays]
gi|414876798|tpg|DAA53929.1| TPA: hypothetical protein ZEAMMB73_536788 [Zea mays]
Length = 225
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 83/149 (55%), Gaps = 11/149 (7%)
Query: 82 NTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKF 141
+ +R+FY +N ++L +E I++ C +ED FP+PF G++ V+ F + + ++ +++F
Sbjct: 71 DVVRQFYDGVNRRDLAAVEPLIAEGCVYEDLVFPRPFVGRERVVGFFGEFMGTISPDLQF 130
Query: 142 SVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYEC-----------SLEGETLLIKKA 190
++ + D G+ WHLEW+G+ PF+RGCSFY E + L I
Sbjct: 131 VIDDISAEDSAAVGVTWHLEWRGRPFPFSRGCSFYRLLGSGSDSDADSDSEQQRLQIVYG 190
Query: 191 RVVIESPIKPGGIVLTLLKNLTSLSDDFP 219
R +E KPG + L +++ +T + + FP
Sbjct: 191 RDCVEPAAKPGDLALVIIRGVTWILERFP 219
>gi|9758055|dbj|BAB08518.1| unnamed protein product [Arabidopsis thaliana]
Length = 158
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/82 (58%), Positives = 62/82 (75%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
S + + +FY+ INEKN ++L + IS DC +D SFPKPF+GK+E M+F E+LV SMGQN
Sbjct: 74 SGYDAVMKFYSSINEKNQDQLSSCISSDCFIDDFSFPKPFRGKQEAMEFFEELVKSMGQN 133
Query: 139 VKFSVEQVCEGDEFTAGINWHL 160
VKF VE VCEGD +A +NWHL
Sbjct: 134 VKFCVENVCEGDGHSAAVNWHL 155
>gi|302811542|ref|XP_002987460.1| hypothetical protein SELMODRAFT_446976 [Selaginella moellendorffii]
gi|300144866|gb|EFJ11547.1| hypothetical protein SELMODRAFT_446976 [Selaginella moellendorffii]
Length = 181
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 77/135 (57%), Gaps = 3/135 (2%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
+A +T+R+FY+ IN + LE I+ DC +ED F KPF G+K +++ E +S+G
Sbjct: 29 AAKDTVRQFYSAINHRQLEIAGDLIASDCVYEDLVFSKPFVGRKAIIELFETFTSSVGPE 88
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
V F ++++ +G+ + G+ WHL+W+G PF++GCSFY C I R ++E
Sbjct: 89 VSFVIDEISDGN-LSVGVTWHLDWRGNIFPFSKGCSFYRCEDVSGDYKIVYGRDIVEPAF 147
Query: 199 KPGGIVLTLLKNLTS 213
K L+KN S
Sbjct: 148 KINS--FALVKNDIS 160
>gi|357463753|ref|XP_003602158.1| hypothetical protein MTR_3g090440 [Medicago truncatula]
gi|355491206|gb|AES72409.1| hypothetical protein MTR_3g090440 [Medicago truncatula]
Length = 255
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 54/163 (33%), Positives = 90/163 (55%), Gaps = 2/163 (1%)
Query: 68 ASDDSIEIVPFSAS--NTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVM 125
+ D + E +P S S + + EFY N+K+ E L+ +S +C ++D F ++G++ ++
Sbjct: 72 SGDVAPEFLPASPSPVHIVHEFYEAFNKKDTETLKQLLSPNCVYQDLLFYTAYEGQESII 131
Query: 126 QFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETL 185
+F + ++ +MG N+ VE V E + HL WK K++PFT GC F+
Sbjct: 132 KFWQSVMDAMGPNIHVFVEDVKESNHVMVTAFMHLVWKEKKLPFTNGCRFFTFEEVEGKF 191
Query: 186 LIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPKATEWLLNS 228
LI K + E P+KPG +VL LLK + + D +P A +L+S
Sbjct: 192 LISKITCMEEFPLKPGELVLKLLKGIGQILDSYPLAAAAMLDS 234
>gi|359461040|ref|ZP_09249603.1| hypothetical protein ACCM5_20105 [Acaryochloris sp. CCMEE 5410]
Length = 406
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/148 (35%), Positives = 76/148 (51%), Gaps = 2/148 (1%)
Query: 76 VPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSM 135
+P A I+ Y INE+N+ + Y+ DCC++D +FP PFQGK V + + M
Sbjct: 12 MPLPAHLLIQAMYEAINERNVAKALEYVDPDCCYQDLNFPTPFQGKAAVEELFTESCKGM 71
Query: 136 GQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIE 195
++ F V+ + EGD GI WH+E G +P RG SF C L +T + AR ++E
Sbjct: 72 PADLLFVVDDITEGDALAVGILWHVELGGIPLPNGRGVSF--CRLSEKTGKLIFARDLVE 129
Query: 196 SPIKPGGIVLTLLKNLTSLSDDFPKATE 223
IKPG +L+ + L K E
Sbjct: 130 PVIKPGKGAFLILRLIAPLVRLLLKPAE 157
>gi|332708365|ref|ZP_08428343.1| SnoaL-like polyketide cyclase [Moorea producens 3L]
gi|332352858|gb|EGJ32420.1| SnoaL-like polyketide cyclase [Moorea producens 3L]
Length = 394
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 76/136 (55%), Gaps = 4/136 (2%)
Query: 80 ASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNV 139
A N I+ Y IN +++ +I D C +ED +F +PF+GK+ V Q LE+ + ++
Sbjct: 5 AHNLIQSMYEAINRRDVNAAMEWIDDQCIYEDLNFSQPFKGKEAVRQLLEESCQGIPDDL 64
Query: 140 KFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECS-LEGETLLIKKARVVIESPI 198
KF ++ + GD G+ WH+E G P RG SFY S + G+ +L AR ++E PI
Sbjct: 65 KFVIDDITTGDPLAVGVLWHVELDGIPFPNGRGVSFYRFSEVTGKLVL---ARDLVEPPI 121
Query: 199 KPGGIVLTLLKNLTSL 214
KPG +++ ++ L
Sbjct: 122 KPGKAAFLIIRLVSPL 137
>gi|302852119|ref|XP_002957581.1| hypothetical protein VOLCADRAFT_98696 [Volvox carteri f.
nagariensis]
gi|300257098|gb|EFJ41351.1| hypothetical protein VOLCADRAFT_98696 [Volvox carteri f.
nagariensis]
Length = 479
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 97/187 (51%), Gaps = 11/187 (5%)
Query: 34 KISCQLMQNTSKIEHHGISIRSLAK-----CKPSTLVPSASDDSIEIVPFSASNTIREFY 88
++ C+ + ++ + + + A+ K + VP ++ ++ + + + I +Y
Sbjct: 13 RVKCKHSSSWTRTKEQSPRLWTYARIACYAAKTAEAVPPQAEGTVNGL--TGKDVIVRYY 70
Query: 89 ACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKFSVEQVCE 148
N +L+ + + ++ D + D + PF G+ EV+ +L+++ ++ +++F +E V +
Sbjct: 71 EAYNSGDLDTIASLLAPDVSYHDMIYEDPFVGRDEVLNYLKKVRRTVPADLQFVIEDVTD 130
Query: 149 GDEFTAGINWHLE-WKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIKPGGIVLTL 207
GD GI WH+E G PF+RGCSFY + G+ I AR ++ES KPG L L
Sbjct: 131 GDPRAVGITWHVECGDGVVFPFSRGCSFYTLNPRGQ---ITTARDLVESAAKPGSTALKL 187
Query: 208 LKNLTSL 214
L LT +
Sbjct: 188 LAALTPV 194
>gi|158335683|ref|YP_001516855.1| hypothetical protein AM1_2533 [Acaryochloris marina MBIC11017]
gi|158305924|gb|ABW27541.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 406
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 74/144 (51%), Gaps = 2/144 (1%)
Query: 80 ASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNV 139
A I+ Y INE+N+ + Y+ DCC++D +FP PFQGK V + + M ++
Sbjct: 16 AHLLIQAMYEAINERNVAKALEYVDPDCCYQDLNFPIPFQGKAAVGELFTESCKGMPADL 75
Query: 140 KFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIK 199
F V+ + EGD GI WH+E G +P RG SF C L +T + AR ++E IK
Sbjct: 76 LFVVDDITEGDALAVGILWHVELGGIPLPNGRGVSF--CRLSEKTGKLIFARDLVEPVIK 133
Query: 200 PGGIVLTLLKNLTSLSDDFPKATE 223
PG +L+ + L K E
Sbjct: 134 PGKGAFLILRLIAPLVRLLLKPAE 157
>gi|172035034|ref|YP_001801535.1| hypothetical protein cce_0117 [Cyanothece sp. ATCC 51142]
gi|354551958|ref|ZP_08971266.1| hypothetical protein Cy51472DRAFT_0062 [Cyanothece sp. ATCC 51472]
gi|171696488|gb|ACB49469.1| hypothetical protein cce_0117 [Cyanothece sp. ATCC 51142]
gi|353555280|gb|EHC24668.1| hypothetical protein Cy51472DRAFT_0062 [Cyanothece sp. ATCC 51472]
Length = 386
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 81/155 (52%), Gaps = 8/155 (5%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
+A N I Y IN + +++ ++ DDC +ED +F K FQGKK V ++ ++ +
Sbjct: 5 TAKNIIELIYQAINNREIDQAMQWVDDDCIYEDVNFSKTFQGKKAVKSLFQESCDNVPDD 64
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
+F V+++ GD G+ WH+E +P RG SFY S L+ AR ++E P+
Sbjct: 65 FRFVVDEITTGDPLKVGVLWHVELDNIPIPNGRGVSFYRISETTGKLIF--ARDIVEPPL 122
Query: 199 KPGGIVLTLLKNLTSL------SDDFPKATEWLLN 227
KPG + +++ +T L D P T+ +L+
Sbjct: 123 KPGKLSFIIIRLVTPLIRTILKQDSNPPETQLILS 157
>gi|126659484|ref|ZP_01730617.1| hypothetical protein CY0110_07154 [Cyanothece sp. CCY0110]
gi|126619218|gb|EAZ89954.1| hypothetical protein CY0110_07154 [Cyanothece sp. CCY0110]
Length = 386
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 80/155 (51%), Gaps = 8/155 (5%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
+A + I Y IN + +E+ ++ DDC +ED +F K FQGK+ V ++ + +
Sbjct: 5 TAKDIIESIYQAINNREIEQAMQWVDDDCMYEDVNFSKTFQGKEAVKHLFQESCDNAPND 64
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
KF V+++ GD G+ WH+E +P RG SFY S L+ AR ++E P+
Sbjct: 65 FKFVVDEITTGDPLKVGVLWHVELDNIPIPNGRGVSFYRISETTGKLIF--ARDIVEPPL 122
Query: 199 KPGGIVLTLLKNLTSL------SDDFPKATEWLLN 227
KPG + +++ +T L D P T+ +L+
Sbjct: 123 KPGKLSFVIIRLVTPLIKTLLKQDSNPPKTQLILS 157
>gi|254414879|ref|ZP_05028643.1| hypothetical protein MC7420_1164 [Coleofasciculus chthonoplastes
PCC 7420]
gi|196178368|gb|EDX73368.1| hypothetical protein MC7420_1164 [Coleofasciculus chthonoplastes
PCC 7420]
Length = 391
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 72/135 (53%), Gaps = 2/135 (1%)
Query: 80 ASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNV 139
A I+ Y IN +++ I D+C +ED +FP+PF+GK V Q E+ + ++
Sbjct: 5 AHLMIKSLYQAINRRDISAAIDCIDDECVYEDLNFPQPFRGKAAVQQLFEESYQGVPDDL 64
Query: 140 KFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIK 199
F ++ + GD + G+ WH+E G P RG SFY S L+ AR V+E PIK
Sbjct: 65 LFVIDDITTGDSSSTGVLWHVELDGIPFPNGRGVSFYRLSETTGKLI--YARDVVEPPIK 122
Query: 200 PGGIVLTLLKNLTSL 214
PG + +++ +T L
Sbjct: 123 PGKVAFWIIRLVTPL 137
>gi|67924420|ref|ZP_00517847.1| hypothetical protein CwatDRAFT_1831 [Crocosphaera watsonii WH 8501]
gi|416403736|ref|ZP_11687588.1| hypothetical protein CWATWH0003_4349 [Crocosphaera watsonii WH
0003]
gi|67853732|gb|EAM49064.1| hypothetical protein CwatDRAFT_1831 [Crocosphaera watsonii WH 8501]
gi|357261654|gb|EHJ10897.1| hypothetical protein CWATWH0003_4349 [Crocosphaera watsonii WH
0003]
Length = 386
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 82/155 (52%), Gaps = 8/155 (5%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
+A I Y IN +++E+ ++ DDC +ED +F +PFQGK+ V ++ ++ +
Sbjct: 5 TAKEIIESIYQAINNRDIEQAIQWVDDDCLYEDMNFSQPFQGKEAVKTLFQESCDNVPDD 64
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
+KF ++++ D GI WH+E +P RG SFY S L+ AR ++E P+
Sbjct: 65 LKFIIDEITTEDPLKVGILWHVELDDIPLPNGRGVSFYRISETTGKLIF--ARDIVEPPL 122
Query: 199 KPGGIVLTLLKNLTSL------SDDFPKATEWLLN 227
KPG + +++ +T L D P T+ +L+
Sbjct: 123 KPGKLSFLIIRLVTPLVRIFLAKDSNPPKTQLILS 157
>gi|159487887|ref|XP_001701954.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281173|gb|EDP06929.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1224
Score = 93.6 bits (231), Expect = 8e-17, Method: Composition-based stats.
Identities = 48/134 (35%), Positives = 77/134 (57%), Gaps = 3/134 (2%)
Query: 82 NTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKF 141
+ I +Y N +++ + ++ D + D + +PFQG++EV+ +L ++ ++ ++KF
Sbjct: 89 DVIVRYYEAYNAGDIDTIAGLLAPDVSYHDMIYEEPFQGREEVVAYLRKVRKTVPSDLKF 148
Query: 142 SVEQVCEGDEFTAGINWHLEW-KGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIKP 200
+E V GD GI WH+E G PF+RGCSFY +L+ T I AR ++ES +KP
Sbjct: 149 VIEDVTSGDPRAVGITWHVECGDGVVFPFSRGCSFY--TLDPATGQIVAARDLVESSVKP 206
Query: 201 GGIVLTLLKNLTSL 214
G L LL LT L
Sbjct: 207 GSSALKLLGALTPL 220
>gi|412993256|emb|CCO16789.1| predicted protein [Bathycoccus prasinos]
Length = 478
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 77/145 (53%), Gaps = 11/145 (7%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
A +R Y IN++++ + T++ DD +ED +FP+PF+GK V + E+ T + +
Sbjct: 28 DAEAVVRGMYDAINKRDVTKALTFVDDDIVYEDFNFPEPFRGKARVKKLFEESCTGIPDD 87
Query: 139 VKFSVEQVCE---------GDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKK 189
+ F +E+ G+ T G WH+E G+ P RG SFYE S E L+
Sbjct: 88 LDFIIERSTTSSSSEAGGLGESLTFGCTWHVEIAGEPFPNARGASFYEISKESGKLIY-- 145
Query: 190 ARVVIESPIKPGGIVLTLLKNLTSL 214
AR V+ESP+K G ++++ + L
Sbjct: 146 ARDVVESPMKLGEASFSIIRVVAPL 170
>gi|218246837|ref|YP_002372208.1| hypothetical protein PCC8801_2015 [Cyanothece sp. PCC 8801]
gi|257059878|ref|YP_003137766.1| hypothetical protein Cyan8802_2040 [Cyanothece sp. PCC 8802]
gi|218167315|gb|ACK66052.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
gi|256590044|gb|ACV00931.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
Length = 143
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 70/131 (53%), Gaps = 2/131 (1%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
+A I Y IN ++++ +I D C +ED +FP+PF GK++V + + + ++ +
Sbjct: 4 TAREVIESIYGAINRRDIDSAMEWIDDQCIYEDLNFPQPFIGKEKVKELFAESMQNIPDD 63
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
++F ++ + D G+ WHLE G P +RG SFY S L+ R ++E P+
Sbjct: 64 LQFVIDDITSEDSLAVGVLWHLELNGMTFPNSRGASFYRLSETTRKLVF--GRDLVEPPV 121
Query: 199 KPGGIVLTLLK 209
K G I +++
Sbjct: 122 KLGKIAFLIIR 132
>gi|255083853|ref|XP_002508501.1| predicted protein [Micromonas sp. RCC299]
gi|226523778|gb|ACO69759.1| predicted protein [Micromonas sp. RCC299]
Length = 428
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 73/134 (54%), Gaps = 2/134 (1%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
+A + +R Y IN +++E ++ DD +ED +FP PF+GK+ V + E+ + +
Sbjct: 1 TAESVVRGMYDAINRRDVEAALAFVDDDILYEDFNFPVPFKGKERVRKLFEESCDGIPDD 60
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
+ F V++ +G + G+ W++E +G+ P RG SFY L+ AR V+E P
Sbjct: 61 MLFIVDECTDGGGLSVGMTWYVELEGEPFPNARGASFYRIDPTSGKLVY--ARDVVEPPF 118
Query: 199 KPGGIVLTLLKNLT 212
K G + ++++ +
Sbjct: 119 KLGDVSFSIIRAVA 132
>gi|242042631|ref|XP_002468710.1| hypothetical protein SORBIDRAFT_01g050630 [Sorghum bicolor]
gi|241922564|gb|EER95708.1| hypothetical protein SORBIDRAFT_01g050630 [Sorghum bicolor]
Length = 98
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 55/86 (63%)
Query: 91 INEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKFSVEQVCEGD 150
+N + + +E I++ C +ED FP+PF G++ ++ F + + ++ +++F ++ + D
Sbjct: 1 MNRREVAAVEPLIAEGCVYEDLVFPRPFVGRERIIGFFGKFMGTISLDLQFVIDDISADD 60
Query: 151 EFTAGINWHLEWKGKQVPFTRGCSFY 176
G+ WHLEW+GK PF+RGCSFY
Sbjct: 61 SVAVGVTWHLEWRGKPFPFSRGCSFY 86
>gi|145346791|ref|XP_001417866.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578094|gb|ABO96159.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 429
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 71/137 (51%), Gaps = 5/137 (3%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
A T+RE Y IN ++++ + ++ +ED +FP PF+GK V + E+ + +
Sbjct: 21 DAERTVREMYERINARDVDGALACVDENVVYEDFNFPAPFRGKAAVKKLFEESCEGIPDD 80
Query: 139 VKFSVE-QVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESP 197
+ F ++ C GD G+ WH+E G+ P RGCSFY L+ AR +ESP
Sbjct: 81 LTFVIDASTCGGD--AVGLTWHVELMGEAFPNARGCSFYRVDETNGKLVY--ARDCVESP 136
Query: 198 IKPGGIVLTLLKNLTSL 214
+KPG +++ + L
Sbjct: 137 LKPGEASFGIIRLVAPL 153
>gi|303276012|ref|XP_003057300.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461652|gb|EEH58945.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 399
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 74/137 (54%), Gaps = 2/137 (1%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
+A +R Y IN +++ ++ DD +ED +FP PF+GK V + E+ + +
Sbjct: 1 TAETVVRGMYDAINRRDVAAALEFVDDDILYEDFNFPTPFKGKAAVKKLFEESCDGIPDD 60
Query: 139 VKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPI 198
+ F +++ +G + G+ W++E +G+ P RG S Y S T + AR V+ESP+
Sbjct: 61 MLFIIDECTDGGGKSVGMTWYVELEGEPFPNARGASLYRIS--ETTGKLAYARDVVESPL 118
Query: 199 KPGGIVLTLLKNLTSLS 215
K G + ++++ + LS
Sbjct: 119 KLGDVSFSIIRLVAPLS 135
>gi|307104859|gb|EFN53111.1| hypothetical protein CHLNCDRAFT_9491 [Chlorella variabilis]
Length = 391
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 88/195 (45%), Gaps = 31/195 (15%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
+A +Y N K + + I++DC +ED + PF G+ + + ++ + +
Sbjct: 2 TAREAALAYYDSYNSKQMGAVLELIAEDCVYEDLIYQDPFVGRAAIAAYFRKIEALVPPD 61
Query: 139 VKFSVEQVCEGDEFTAGINWHLE-----WKGKQVPFTRGCSFYECSLEGETLLIKKARVV 193
+KF VE + EGD G+ WH+E + + PF+RG SFYE + G+ + AR +
Sbjct: 62 IKFCVEDITEGDPRRCGVRWHVEIADDSGRATEFPFSRGVSFYEVNDLGQIVF---ARDI 118
Query: 194 IESPIKPGGIVLTLLKNLTSLSDDF-PKATEWLLNSPHVISTFLLKAYTIFLAPFVRPIL 252
+E IKPG L+ + + L P A +P V+ T P+
Sbjct: 119 VEPAIKPGAAALSGISVVAPLVRKLGPNA------NPAVLKTL--------------PLA 158
Query: 253 AGYINMWNFIARLLG 267
+G MW F A LG
Sbjct: 159 SG--AMWAFYAGYLG 171
>gi|427420214|ref|ZP_18910397.1| hypothetical protein Lepto7375DRAFT_6094 [Leptolyngbya sp. PCC
7375]
gi|425762927|gb|EKV03780.1| hypothetical protein Lepto7375DRAFT_6094 [Leptolyngbya sp. PCC
7375]
Length = 146
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 68/135 (50%), Gaps = 3/135 (2%)
Query: 80 ASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNV 139
A N+I+ Y IN +++ + I D CC+ED +F F GK V + ++ ++
Sbjct: 4 AVNSIQSIYEAINHRDVSQAVALIDDQCCYEDLNFSATFMGKAAVQGLFTESCQAVPADL 63
Query: 140 KFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIK 199
+F ++ + GD+ G+ WH+E G P RG SFY S L+ AR +E +K
Sbjct: 64 QFVIDDMA-GDDLAVGVTWHVELDGIAFPNGRGVSFYRFSPNSGKLIF--ARDCVEPALK 120
Query: 200 PGGIVLTLLKNLTSL 214
PG +++ +T L
Sbjct: 121 PGKAAFAIIRLVTPL 135
>gi|308804507|ref|XP_003079566.1| unnamed protein product [Ostreococcus tauri]
gi|116058021|emb|CAL54224.1| unnamed protein product [Ostreococcus tauri]
Length = 454
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 72/142 (50%), Gaps = 3/142 (2%)
Query: 73 IEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLV 132
+ V A TI Y IN +++ +++D +ED +FP+PF GK V E+
Sbjct: 37 VTAVDVDARATIATMYDRINARDVAGALECVAEDVVYEDFNFPEPFVGKAAVRALFEESC 96
Query: 133 TSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARV 192
+ + ++ F V++ G + G WH+E G+ P RGCS Y + +G+ + AR
Sbjct: 97 SGIPDDLAFVVDEWTSGSGASVGCTWHVEIMGEAFPNARGCSLYRVNGDGK---LTYARD 153
Query: 193 VIESPIKPGGIVLTLLKNLTSL 214
V+ESP K G ++++ + L
Sbjct: 154 VVESPAKLGEASFSIIRAVAPL 175
>gi|147794044|emb|CAN71166.1| hypothetical protein VITISV_035107 [Vitis vinifera]
Length = 203
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 74/149 (49%), Gaps = 20/149 (13%)
Query: 32 PSKISCQLMQ-NTSKIEHHGISIRS----LAKCKPSTLVPSAS---------------DD 71
PSK+S L+ +T+ I H RS L + S P+A+ D
Sbjct: 13 PSKLSPPLLHPHTASIATHFTQRRSRRCNLLRVSSSPQNPTATIAEPVTTTQSDEANGDG 72
Query: 72 SIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQL 131
E + S S+ +R FYA IN +L+ +E I+ +C +ED FP+PF G+K +M F +
Sbjct: 73 GDESILLSGSSVVRAFYAGINSHDLDSVEELIAHNCVYEDLIFPQPFVGRKAIMDFFNKF 132
Query: 132 VTSMGQNVKFSVEQVCEGDEFTAGINWHL 160
+ ++ ++F ++ + E D G+ WHL
Sbjct: 133 IDTISMELQFVIDBISEADSSAVGVTWHL 161
>gi|224066735|ref|XP_002302189.1| predicted protein [Populus trichocarpa]
gi|222843915|gb|EEE81462.1| predicted protein [Populus trichocarpa]
Length = 203
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 72/130 (55%), Gaps = 6/130 (4%)
Query: 58 KCKPSTLVPSASDDSIEIVPFSASNTIREFYACINEKNLERL-ETYISDDCCFEDCSFPK 116
K + VP + D ++ + I F+ IN +N E+L E +S DC F+D F
Sbjct: 78 KLRCGDYVPGFTSDFVDDIGI----LIHRFFYAINSRNDEQLLEDVLSYDCVFKDFIFQI 133
Query: 117 PFQGKKEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINW-HLEWKGKQVPFTRGCSF 175
F G++ ++QFL +++ +MG N++F +E V +E A + HLEW + +PFTR C+
Sbjct: 134 AFDGEQSIIQFLRKVMMAMGPNIRFKIESVQCKNELQAATAFLHLEWDNQVIPFTRFCTD 193
Query: 176 YECSLEGETL 185
+EC + + L
Sbjct: 194 FECEFDEKLL 203
>gi|147816094|emb|CAN72893.1| hypothetical protein VITISV_022313 [Vitis vinifera]
Length = 267
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 85/184 (46%), Gaps = 23/184 (12%)
Query: 46 IEHHGISIRS-----LAKCKPSTLVPSASDDSIEIVPFSASNTIREFYACINEKNLERLE 100
+ G+S+R + C + P D+ A +++ Y I KN++ +
Sbjct: 62 VVQRGLSLRRKRLPPVVSCSSESTGPREEDNR-------ALEAVQKLYTAIKNKNVKEVS 114
Query: 101 TYISDDC-CFEDCSF---PKPFQGKKEVMQFLEQLVTSMGQN--VKFSVEQVCEGDEFTA 154
I D+C CF C+F +PF GKK+ ++F + L+ +G + ++FS D
Sbjct: 115 DVIGDECRCF--CNFISASQPFHGKKQALEFFDHLMKILGDHASIRFSFTM---HDGMAV 169
Query: 155 GINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIKPGGIVLTLLKNLTSL 214
G+ W LEWK VP G S+Y C ++IK ++++ I G + L + +T +
Sbjct: 170 GVTWRLEWKNNPVPLGTGFSYYVCQEYRGRVVIKNVNMLMDPLIHLGPLRLRIAGIVTLV 229
Query: 215 SDDF 218
D F
Sbjct: 230 MDKF 233
>gi|219114851|ref|XP_002178221.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409956|gb|EEC49886.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 627
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 84/180 (46%), Gaps = 14/180 (7%)
Query: 36 SCQLMQNTSKIEHHGISIRSLAKCKPSTLVPSASDDSIEIVPFSASNTIREFYACINEKN 95
S Q + SK+ L +T +A+ ++ E++ S + +++A N ++
Sbjct: 201 SLQTLNRASKV-------MKLTGYNKATATSTATKEASEMIETSFLSLPEKYFAAWNRRD 253
Query: 96 LERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKFSVEQVCE-GDEFTA 154
+ +D ++D +FP+PF GK + L + + F V++V + GD +
Sbjct: 254 MSSAVALFADTVTYDDTAFPEPFSGKTNLSSHLYKCSNAFPSTFTFQVDKVADAGDRIS- 312
Query: 155 GINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIKPGG---IVLTLLKNL 211
+ WH+E G +PFTRGCSFY + +L V PIK GG +V T+ NL
Sbjct: 313 -VLWHVENDGDDLPFTRGCSFYNVDTKRNEIL-DGIDFVEPGPIKLGGFRLLVSTVRTNL 370
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 67/129 (51%), Gaps = 4/129 (3%)
Query: 81 SNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMG-QNV 139
+ T++ F + NE++ LE + D F+D ++PKP +G E+ + + G + +
Sbjct: 80 AQTVQNFISAYNERDYNTLENMVDPDIEFDDTAYPKPCRGLPELERRWRLTRNAQGDKRI 139
Query: 140 KFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIK 199
+ +V+ + T GI +HLE ++P RG +F++ S +G LL+KK V ES K
Sbjct: 140 QVAVDDIASSTT-TVGIRFHLENVEGEIPNGRGAAFFQLSNDG--LLVKKVFWVQESAQK 196
Query: 200 PGGIVLTLL 208
G L L
Sbjct: 197 GGEASLQTL 205
>gi|255559074|ref|XP_002520559.1| conserved hypothetical protein [Ricinus communis]
gi|223540219|gb|EEF41792.1| conserved hypothetical protein [Ricinus communis]
Length = 254
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 77/156 (49%), Gaps = 7/156 (4%)
Query: 63 TLVPSASDDSIEIV--PFSASNTIREFYACINEKNLERLETYISDDC---CFEDCSFPKP 117
+LVP +D S E A T+ + Y I +N+ + I D+C C SF +
Sbjct: 63 SLVPFNNDKSSEPREEDTPALETVLKLYTAIKNQNIHEVSNMIGDECRCVCNFFSSF-ES 121
Query: 118 FQGKKEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYE 177
FQGK++V+ F ++ ++G N++F V Q + D G++W LEW +P +G SFY
Sbjct: 122 FQGKQQVLDFFNYVMQTLGNNIEFVV-QPTKYDGMNVGVSWRLEWSKTHMPLGKGFSFYI 180
Query: 178 CSLEGETLLIKKARVVIESPIKPGGIVLTLLKNLTS 213
C + + I+ + +E + L ++ N+ S
Sbjct: 181 CQIYQGKVTIRNVEMFMEPLLHIEPFRLKIMGNVMS 216
>gi|87241143|gb|ABD33001.1| hypothetical protein MtrDRAFT_AC150798g6v2 [Medicago truncatula]
Length = 267
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 82/181 (45%), Gaps = 13/181 (7%)
Query: 19 RALGGNGIVLNSLPSKISCQ--LMQNTSKIEHHGISIRSLAKCKPSTLVPSASDDSIEIV 76
R L N + + S SCQ + +N + G+S + K S V +D +++
Sbjct: 39 RKLHRNVDIHYNSISSFSCQTSIHENLYPRKQRGLSFIAFDSKKNSEPVGEDNDQALD-- 96
Query: 77 PFSASNTIREFYACINEKNLERLETYISDDC-CFED-CSFPKPFQGKKEVMQFLEQLVTS 134
+ + Y+ KN++ L ++D+C C + SF + FQGK +V++F L+
Sbjct: 97 ------AVMKLYSAFKNKNIQELSEILADECRCVCNFLSFFQAFQGKTQVLEFFSHLIRL 150
Query: 135 MGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVI 194
G N++ V+ D G++W EW +P +G SF+ C +IK + +
Sbjct: 151 FGDNIQIVVKPTLH-DGMNVGVHWKFEWNTIHIPLGKGFSFHICQTYRGKAVIKNIDMFM 209
Query: 195 E 195
E
Sbjct: 210 E 210
>gi|357447893|ref|XP_003594222.1| hypothetical protein MTR_2g025760 [Medicago truncatula]
gi|355483270|gb|AES64473.1| hypothetical protein MTR_2g025760 [Medicago truncatula]
Length = 251
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 82/181 (45%), Gaps = 13/181 (7%)
Query: 19 RALGGNGIVLNSLPSKISCQ--LMQNTSKIEHHGISIRSLAKCKPSTLVPSASDDSIEIV 76
R L N + + S SCQ + +N + G+S + K S V +D +++
Sbjct: 23 RKLHRNVDIHYNSISSFSCQTSIHENLYPRKQRGLSFIAFDSKKNSEPVGEDNDQALD-- 80
Query: 77 PFSASNTIREFYACINEKNLERLETYISDDC-CFED-CSFPKPFQGKKEVMQFLEQLVTS 134
+ + Y+ KN++ L ++D+C C + SF + FQGK +V++F L+
Sbjct: 81 ------AVMKLYSAFKNKNIQELSEILADECRCVCNFLSFFQAFQGKTQVLEFFSHLIRL 134
Query: 135 MGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVI 194
G N++ V+ D G++W EW +P +G SF+ C +IK + +
Sbjct: 135 FGDNIQIVVKPTLH-DGMNVGVHWKFEWNTIHIPLGKGFSFHICQTYRGKAVIKNIDMFM 193
Query: 195 E 195
E
Sbjct: 194 E 194
>gi|428180781|gb|EKX49647.1| hypothetical protein GUITHDRAFT_151421 [Guillardia theta CCMP2712]
Length = 170
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 61/114 (53%), Gaps = 7/114 (6%)
Query: 102 YISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLE 161
Y +DD +ED + +PF GK V +FL + + F V++V +G + G+ WH+E
Sbjct: 52 YFADDVVYEDMVYSEPFVGKDAVREFLLKTKEMAPPDFVFVVDRVSDGVR-SCGLTWHIE 110
Query: 162 WKGKQVP--FTRGCSFYECSLEGETLLIKKARVVIESPIKPGGIVLTLLKNLTS 213
K + F GCSFYE + EG+ I R +ESP+K G L +L NL +
Sbjct: 111 LKTRPDAGKFANGCSFYELNSEGKICYI---RDTVESPLKIGSFGL-VLANLAA 160
>gi|397568380|gb|EJK46109.1| hypothetical protein THAOC_35242 [Thalassiosira oceanica]
Length = 607
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 50/95 (52%), Gaps = 1/95 (1%)
Query: 82 NTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKF 141
+ + ++ N+++++ + ++DC D + F+G+ E + L ++ + + +F
Sbjct: 227 SVVERYFEAWNKRDMKEAVSLFTEDCNMRDLQYDSEFKGRAEFERHLLRVKDCLPGSFEF 286
Query: 142 SVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFY 176
V+ V AG+ WH+E G + FTRGCSFY
Sbjct: 287 VVDDVALSPT-KAGVVWHVENDGSPLAFTRGCSFY 320
>gi|323451680|gb|EGB07556.1| hypothetical protein AURANDRAFT_64644 [Aureococcus anophagefferens]
Length = 235
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 68/138 (49%), Gaps = 16/138 (11%)
Query: 82 NTIREFYACINEKNLERLETYISDDCCFEDCSFPKPF--QGKKEVMQFLE--------QL 131
+ IR Y N+++ R+ + +++D +ED +G++ Q L QL
Sbjct: 82 DAIRGLYKAFNDRDAPRVASLLAEDVVYEDLLLGASTICRGRESFSQALAFHPAFLSAQL 141
Query: 132 VTSMGQNVKFSVEQV-CEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKA 190
MG+ ++ V+ V C+G E + G+ WH+E G+ P +RG S + GE +++
Sbjct: 142 GLPMGR-LELVVDDVACDG-ERSVGVEWHVELNGEPFPLSRGLSLATINAAGE---LERV 196
Query: 191 RVVIESPIKPGGIVLTLL 208
+ E+P + G+V+ L
Sbjct: 197 VDIAEAPWRVVGLVVRPL 214
>gi|449018192|dbj|BAM81594.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 271
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 66/135 (48%), Gaps = 6/135 (4%)
Query: 79 SASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQN 138
S + + ++ N++ + + + D +ED +P+ G++ + + L ++ +
Sbjct: 122 SLARVVFRYFDQWNKREIAGIIPLFAPDVFYEDALYPRALHGREALAEHLTKVARVLPPQ 181
Query: 139 VKFSVEQVCEG-DEFTAGINWHLEWK--GKQVPFTRGCSFYECSL---EGETLLIKKARV 192
++F ++ VCE + + +HLE + GKQVPF+RG SF+ E L+
Sbjct: 182 LEFVIDDVCEDLERGKCAVRFHLEDRRTGKQVPFSRGTSFFTAKRAPDAPEGWLLTSGWD 241
Query: 193 VIESPIKPGGIVLTL 207
+ E IK G ++L +
Sbjct: 242 LPEPLIKAGNVLLAM 256
>gi|224004978|ref|XP_002296140.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209586172|gb|ACI64857.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 373
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 54/114 (47%), Gaps = 12/114 (10%)
Query: 106 DCCFEDCSFPKPFQGKKEVMQFLEQLVTSM------GQNVKFSVEQVCEGDEFTAGINWH 159
D +EDC P+P G+ +V + L V S G++ KF V+++ +G G W
Sbjct: 105 DVVYEDCYEPQPIVGRADVAEHLRARVLSRTGGEKGGKDAKFRVDKISDGSS-ACGFAWT 163
Query: 160 LEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIESPIKPGGIVLTLLKNLTS 213
W + RG +F E + E I+ R + E KPG + L +LK +T+
Sbjct: 164 --WTSGNLEGLRGTTFVELN---EANQIQYVREIPEPLYKPGDLTLEILKAVTA 212
>gi|2832688|emb|CAA16787.1| hypothetical protein [Arabidopsis thaliana]
gi|7269083|emb|CAB79192.1| hypothetical protein [Arabidopsis thaliana]
Length = 205
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 50/99 (50%), Gaps = 3/99 (3%)
Query: 64 LVPSASDDSIEIVPFSASNTIREFYACINEKNLERLETYISDDCCFED--CSFPKPFQGK 121
L D +I + A T+ + Y+ I ++N+E + I D+C F S + QGK
Sbjct: 107 LYTDIKDRNIMEKDYRAEETVLKLYSDIKDRNIEGISEVIGDECQFFSNFLSKYRLLQGK 166
Query: 122 KEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHL 160
K+VM F L+ +G+++K V+ + D T G+ W L
Sbjct: 167 KQVMAFFYWLIMKLGKDIKIIVKPTFK-DGMTVGVQWQL 204
>gi|159485316|ref|XP_001700691.1| predicted protein [Chlamydomonas reinhardtii]
gi|158272033|gb|EDO97840.1| predicted protein [Chlamydomonas reinhardtii]
Length = 213
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 36/70 (51%)
Query: 86 EFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKFSVEQ 145
F + +N ++L + +SDDC D S K++V +F +V SM + V+ V+
Sbjct: 45 RFISALNARDLSGMLQLVSDDCHHVDLSHEAEGHCKEDVARFYADVVASMPEKVQVVVDD 104
Query: 146 VCEGDEFTAG 155
+ GD+ AG
Sbjct: 105 ITSGDDSRAG 114
>gi|219117710|ref|XP_002179645.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408698|gb|EEC48631.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 378
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 53/117 (45%), Gaps = 8/117 (6%)
Query: 106 DCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGK 165
D ED FP+P+ GK ++ + V + ++++ +GD G W W
Sbjct: 113 DVLIEDRFFPQPYTGKTNAAAYIMERVAQRKGKGEVRIDRISDGD-IACGFAWT--WTCG 169
Query: 166 QVPFTRGCSFYECSLEGETLLIKKARVVIESPIKPGGIVLTLLKNLTSLSDDFPKAT 222
Q RG +F E + G+ IK + + E KPG + LL+ +T ++ PK T
Sbjct: 170 QEEGLRGTTFVELNENGQ---IKYIQEIPEPLFKPGNLTKELLRAVTQGAE--PKPT 221
>gi|224162057|ref|XP_002338407.1| predicted protein [Populus trichocarpa]
gi|222872181|gb|EEF09312.1| predicted protein [Populus trichocarpa]
Length = 144
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 38/70 (54%), Gaps = 9/70 (12%)
Query: 84 IREFYACINEKN-LERLETYISDDCCFEDCSFPKPFQGKKEV--------MQFLEQLVTS 134
IR+ + N ++ ++ LE +S DC F+D +F P G++ V MQFL +L+ +
Sbjct: 72 IRKLFRAFNPRSEVQLLEDVLSHDCNFQDLNFRTPVVGEQNVYIDHVQNAMQFLRKLMKA 131
Query: 135 MGQNVKFSVE 144
MG +F +
Sbjct: 132 MGPGTRFKIN 141
>gi|42407773|dbj|BAD08919.1| hypothetical protein [Oryza sativa Japonica Group]
gi|42408132|dbj|BAD09271.1| hypothetical protein [Oryza sativa Japonica Group]
gi|125562081|gb|EAZ07529.1| hypothetical protein OsI_29785 [Oryza sativa Indica Group]
gi|125603929|gb|EAZ43254.1| hypothetical protein OsJ_27852 [Oryza sativa Japonica Group]
Length = 172
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 50/113 (44%), Gaps = 5/113 (4%)
Query: 78 FSASNTIREFYACINEKNLERLETYISDDCC--FEDCSFPKPFQGKKEVMQFLEQLVTSM 135
F A + + + + +++ L D+C F S Q K + L ++ +
Sbjct: 63 FRAVRRLVKLNSAVQNRSVRELLELAGDECLYFFGRISSIDVSQVSKNMFLLLHAMM--L 120
Query: 136 GQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIK 188
+V F V + E + F G+ W LEWKGK++P+ C+ + LLIK
Sbjct: 121 RHHVSF-VLKPTENEGFDLGVKWSLEWKGKKLPWDLDCNITTNHVYRGMLLIK 172
>gi|297799778|ref|XP_002867773.1| hypothetical protein ARALYDRAFT_329383 [Arabidopsis lyrata subsp.
lyrata]
gi|297313609|gb|EFH44032.1| hypothetical protein ARALYDRAFT_329383 [Arabidopsis lyrata subsp.
lyrata]
Length = 130
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 2/74 (2%)
Query: 125 MQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGET 184
M F L+ +G+++K V+ + D T G+ W LE + +G SF+ C +
Sbjct: 1 MAFFYWLIKKLGKDIKIIVKPTFK-DGMTVGVQWQLECDKSHIQLGKGFSFHICHMYQGK 59
Query: 185 LLIKKARVVIESPI 198
LLIK + +E PI
Sbjct: 60 LLIKNVEMFME-PI 72
>gi|356557961|ref|XP_003547278.1| PREDICTED: uncharacterized protein LOC100808156 [Glycine max]
Length = 262
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 26/130 (20%), Positives = 54/130 (41%), Gaps = 4/130 (3%)
Query: 80 ASNTIREFYACINEKNLERLETYISDDCCFEDCSFPKPFQGKKEVMQFLEQLVTSMGQNV 139
A +++ + Y+ KN L SF + FQG+ +V++F L + +G N+
Sbjct: 90 ALDSVMKLYSAFKNKNTHELSADERRRVS-NFLSFFETFQGRTQVLEFFSYLTSILGNNI 148
Query: 140 KFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGETLLIKKARVVIES--P 197
+ + + + G+ W EW +P +G + +I+ +E
Sbjct: 149 QI-IFKPTPHEGVNVGLQWKFEWNKIHLPLWKGFGLHISHTYHGRAVIRNIETFMEPLLH 207
Query: 198 IKPGGIVLTL 207
+KP G+ + +
Sbjct: 208 LKPFGLKMKI 217
>gi|145341508|ref|NP_193968.2| uncharacterized protein [Arabidopsis thaliana]
gi|60547863|gb|AAX23895.1| hypothetical protein At4g22370 [Arabidopsis thaliana]
gi|332659201|gb|AEE84601.1| uncharacterized protein [Arabidopsis thaliana]
Length = 130
Score = 40.4 bits (93), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 2/74 (2%)
Query: 125 MQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRGCSFYECSLEGET 184
M F L+ +G+++K V+ + D T G+ W LE + +G SF+ C +
Sbjct: 1 MAFFYWLIMKLGKDIKIIVKPTFK-DGMTVGVQWQLECDKSHIQLGKGFSFHICHMYQGK 59
Query: 185 LLIKKARVVIESPI 198
LLIK + +E PI
Sbjct: 60 LLIKNVEMFME-PI 72
>gi|320160208|ref|YP_004173432.1| hypothetical protein ANT_07980 [Anaerolinea thermophila UNI-1]
gi|319994061|dbj|BAJ62832.1| hypothetical protein ANT_07980 [Anaerolinea thermophila UNI-1]
Length = 125
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 52/114 (45%), Gaps = 20/114 (17%)
Query: 87 FYACINEKNLERLETYISDDCCFEDCSFPKP----FQGKKEVMQFLEQLVTSMGQNVKFS 142
F +N+ N++ + +S+DC FE+ S P P +QGK V F EQ + QN K
Sbjct: 17 FNEALNQGNVDGMMNLMSEDCIFENTS-PAPDGSRYQGKTSVRLFWEQFFKA-SQNPKIE 74
Query: 143 VEQVCE-GDEFTAGINWHLEWK-GKQVPF-TRGCSFYECSLEGETLLIKKARVV 193
+E++ GD A + W W+ G F RG Y LI+ R+
Sbjct: 75 MEEIFACGDH--AVMRWVYTWQDGTGTAFHIRGVDVY---------LIRDGRIA 117
>gi|356532339|ref|XP_003534731.1| PREDICTED: uncharacterized protein LOC100812303 [Glycine max]
Length = 399
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/64 (28%), Positives = 34/64 (53%), Gaps = 1/64 (1%)
Query: 113 SFPKPFQGKKEVMQFLEQLVTSMGQNVKFSVEQVCEGDEFTAGINWHLEWKGKQVPFTRG 172
SF + FQG+ +V++F L + +G N++ + + + AG+ W EW +P +G
Sbjct: 65 SFFETFQGRTQVLEFFSYLTSILGNNIQI-IFKPTPHEGVNAGLQWKFEWDKIHLPLWKG 123
Query: 173 CSFY 176
S +
Sbjct: 124 FSLH 127
>gi|320159801|ref|YP_004173025.1| hypothetical protein ANT_03910 [Anaerolinea thermophila UNI-1]
gi|319993654|dbj|BAJ62425.1| hypothetical protein ANT_03910 [Anaerolinea thermophila UNI-1]
Length = 126
Score = 37.7 bits (86), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 35/79 (44%), Gaps = 4/79 (5%)
Query: 87 FYACINEKNLERLETYISDDCCFEDCSFP--KPFQGKKEVMQFLEQLVTSMGQNVKFSVE 144
FY N + +E + T +S DC FE + P P +G+ + F + + +F V+
Sbjct: 18 FYEACNRQAIEEMLTLLSADCLFESPAAPCGSPLRGRDAIAAFWREFFAAH-PGARFEVQ 76
Query: 145 QVCEGDEFTAGINWHLEWK 163
V G F + W WK
Sbjct: 77 DVF-GAGFRSVARWTCTWK 94
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.137 0.416
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,232,664,891
Number of Sequences: 23463169
Number of extensions: 166214305
Number of successful extensions: 355223
Number of sequences better than 100.0: 78
Number of HSP's better than 100.0 without gapping: 62
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 355102
Number of HSP's gapped (non-prelim): 79
length of query: 282
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 141
effective length of database: 9,050,888,538
effective search space: 1276175283858
effective search space used: 1276175283858
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)