BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 041297
(379 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255556011|ref|XP_002519040.1| conserved hypothetical protein [Ricinus communis]
gi|223541703|gb|EEF43251.1| conserved hypothetical protein [Ricinus communis]
Length = 396
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 180/340 (52%), Positives = 233/340 (68%), Gaps = 31/340 (9%)
Query: 27 HALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLCP- 85
AL LKR ++ Y+++WS DH+ +K N + C CIGK+CMVDII+FLC
Sbjct: 31 EALSGLKRSDDPYLSVWSCDHNNKKNYINNNNNSNE-----CRCIGKICMVDIISFLCKE 85
Query: 86 --------------CFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVILPAG----- 126
+K G+VR L+P ASLLEA+DL+L G QNLVI P
Sbjct: 86 ENLKNLPRALQEPLSSVLVSKVYGLVRPLEPHASLLEAIDLILEGAQNLVI-PVHSPFTR 144
Query: 127 IKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAI 186
KL + S ST HN+ EYCWLTQED+IRY LN IGL +P PN + S NIID ILA+
Sbjct: 145 KKLIHRTSSYSTLHNNREYCWLTQEDIIRYLLNCIGLFSPIPNHTVESLNIIDTESILAV 204
Query: 187 HYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAGDL 246
YD+PA+ A+PLI+QS +KQTSVAI+D EG+L+G+ISP++LNSCDE VAAAIATL AG+L
Sbjct: 205 CYDEPASSALPLISQSLVKQTSVAILDIEGKLIGEISPYTLNSCDELVAAAIATLSAGEL 264
Query: 247 MAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEI---SSGSCSNSSSSDEESSTG 303
MAY+DCG P +DL+RLVK+RL+E+N+ +LELME++ I SS S SSSSDEE G
Sbjct: 265 MAYVDCGDPPEDLIRLVKERLEERNLEIVLELMEEESGISSSSSSFSSFSSSSDEEFGLG 324
Query: 304 SARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
+ S R G+S RV R++AIVC+PWSSL+AV++QA++ R
Sbjct: 325 KSGSFR--GHSTRVARRTDAIVCFPWSSLVAVMIQAISHR 362
>gi|224079247|ref|XP_002305808.1| predicted protein [Populus trichocarpa]
gi|222848772|gb|EEE86319.1| predicted protein [Populus trichocarpa]
Length = 394
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 173/337 (51%), Positives = 222/337 (65%), Gaps = 27/337 (8%)
Query: 27 HALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLCP- 85
AL ALKR + ++++WS D R + +D E C CIGKVC+ D+I FL
Sbjct: 31 EALSALKRFGDLFLSVWSCDQHHRCNSPRSIKVDFAE----CKCIGKVCLADVICFLSKE 86
Query: 86 --------------CFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVILP-----AG 126
+K SG+VRHL+P ASLLEA+DL+L G QNLVI P
Sbjct: 87 ENLKNPGKALQEPVSLLLNSKVSGLVRHLEPHASLLEAIDLILEGAQNLVI-PLHNPFTR 145
Query: 127 IKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAI 186
KL K + ST HN+ EYCWLTQED++RY LN IGL +PTPN I S NIID +
Sbjct: 146 KKLISKSTANSTLHNNREYCWLTQEDIVRYLLNSIGLFSPTPNHTIESLNIIDTESFFTV 205
Query: 187 HYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAGDL 246
HYDDPAA +PLI+QS +KQTSVAI+D +G+L+G+ISPF+LN CDETVAAAIATL AG+L
Sbjct: 206 HYDDPAA--LPLISQSLVKQTSVAILDADGKLIGEISPFTLNFCDETVAAAIATLSAGEL 263
Query: 247 MAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSAR 306
MAY++CG P +DL+ LVK+RL+E+N+ L+L+E++ I S S +S SS + G R
Sbjct: 264 MAYIECGDPPEDLIMLVKERLEERNLGPALDLIEEESGILSSSSDSSYSSSSDEEFGMVR 323
Query: 307 SARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S R G SARV +E IVCYPWSSL+AV++QAL+ R
Sbjct: 324 SGRIAGNSARVGRSTETIVCYPWSSLVAVMIQALSHR 360
>gi|225470342|ref|XP_002269338.1| PREDICTED: CBS domain-containing protein CBSX5 [Vitis vinifera]
Length = 384
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 171/335 (51%), Positives = 220/335 (65%), Gaps = 35/335 (10%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC--- 84
AL ALKR ++Y+++WS DH+++ K+ +++D C CIGK+CMVD++ FLC
Sbjct: 32 ALAALKRSGDAYLSVWSCDHTSKINKS---HLED------CRCIGKICMVDVVCFLCRED 82
Query: 85 -----------PCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVI-----LPAGIK 128
P K G+VRHLKP++ LLEA+DL+L G QN+VI K
Sbjct: 83 NLSCPSDALQSPLSLLLPKVPGLVRHLKPNSRLLEAIDLMLEGAQNIVIPIQSRTNPRKK 142
Query: 129 LQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHY 188
L PKPS ST HN E+CWLTQED++R+ LN IG +P P I S NIID I +++Y
Sbjct: 143 LVPKPSFNSTLHNGVEFCWLTQEDVVRFLLNSIGSFSPLPGLTIESLNIIDTENIPSVYY 202
Query: 189 DDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAGDLMA 248
DPA+ A+ I+QS I QTSVA++D+E +LVG+ISPF+L CDETVAAAIATL AGDLMA
Sbjct: 203 HDPASSALTAISQSLINQTSVAVLDQENKLVGEISPFTLACCDETVAAAIATLSAGDLMA 262
Query: 249 YMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARSA 308
Y+DCG P +DLV+LVK RL+E+ + L+LM S SSSS + G R
Sbjct: 263 YIDCGGPPEDLVQLVKARLEERKLGAFLDLM-------DEEFSYSSSSSSDEEFGFGRRG 315
Query: 309 RSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
SG YSAR+ RSEAIVCYPWSSLMAV++QALA R
Sbjct: 316 GSGKYSARMARRSEAIVCYPWSSLMAVMIQALAHR 350
>gi|449457825|ref|XP_004146648.1| PREDICTED: CBS domain-containing protein CBSX5-like [Cucumis
sativus]
gi|449527799|ref|XP_004170897.1| PREDICTED: CBS domain-containing protein CBSX5-like [Cucumis
sativus]
Length = 398
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 176/345 (51%), Positives = 226/345 (65%), Gaps = 40/345 (11%)
Query: 28 ALLALKRLNESYINIWS-SDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC-- 84
AL L +++E YI++WS DHS+ K A+ D H C C+GKVCMVDII FLC
Sbjct: 32 ALSILTKIDEGYISVWSCGDHSSSK-----ADSDLH-----CRCVGKVCMVDIICFLCRQ 81
Query: 85 ------------PCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPK 132
P + +VRHL+P ASL+EA+DL+ GV NLVI P + + +
Sbjct: 82 ENLLQPAIGLQSPISVLIPEGFELVRHLEPHASLMEAIDLIHDGVHNLVI-PIKMSISKR 140
Query: 133 PSLK--------STFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGIL 184
++ S+ HND EYCWL ED+IRY LN IGL + T PINS NIID IL
Sbjct: 141 KNILKKSLANSISSLHNDQEYCWLAPEDIIRYLLNSIGLFSTTAANPINSFNIIDTNNIL 200
Query: 185 AIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAG 244
A+ YD+ A +PLI+Q+ I Q+SVAIVD + +L+G+ISPF+LN CDETV AAIATL AG
Sbjct: 201 AVRYDESALSILPLISQALIHQSSVAIVDLDDKLIGEISPFTLNFCDETVVAAIATLTAG 260
Query: 245 DLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDD-LEISSGSCSNSSSSDEESSTG 303
+LM Y+DCG P DLV+LVK+RL+EKN+ +LE +E++ L ISS S S SSSD+E G
Sbjct: 261 ELMGYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESLTISSSSSSICSSSDDEFGCG 320
Query: 304 SARSARSG-----GYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S+ S GYSARV+ RSEAIVCYPW+SL+AV++QALA R
Sbjct: 321 SSSSGSGRSGRICGYSARVMRRSEAIVCYPWNSLVAVMIQALAHR 365
>gi|224125284|ref|XP_002329767.1| predicted protein [Populus trichocarpa]
gi|222870829|gb|EEF07960.1| predicted protein [Populus trichocarpa]
Length = 391
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 169/336 (50%), Positives = 218/336 (64%), Gaps = 30/336 (8%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC--- 84
AL ALKR + ++++WS DH + +D E C C+GKVC+VD+I FL
Sbjct: 32 ALSALKRSGDLFLSVWSCDHLHHCNSPISIQVDFEE----CKCVGKVCLVDVICFLSVEE 87
Query: 85 -----------PCFCSF-AKDSGIVRHLKPSASLLEAVDLLLGGVQNLVILP-----AGI 127
P +K G+VRHL+P A A+D +LGG NLVI P
Sbjct: 88 NLKNPGKALQEPVSVLLNSKVPGLVRHLEPHAR--HAIDAILGGALNLVI-PLRNPFTRK 144
Query: 128 KLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIH 187
KL K + ST HN+ EYCWL QED+IRY LN IGL +PTPN I S +I +H
Sbjct: 145 KLVYKSAANSTLHNNREYCWLAQEDIIRYLLNSIGLFSPTPNHTIESLGLIYSESFFTVH 204
Query: 188 YDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAGDLM 247
YDDPA+ A+PLI+QS IKQTSVAI+D +G+L+G+ISPF+LN CDETVAAAIATL AG+LM
Sbjct: 205 YDDPASSALPLISQSLIKQTSVAILDTDGKLIGEISPFTLNFCDETVAAAIATLSAGELM 264
Query: 248 AYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARS 307
AY+DC P +DL+RLVK+RL+E+N+ L+L+E++ ISS S S SSSSDEE G RS
Sbjct: 265 AYIDCRDPPEDLLRLVKERLEERNLGPALDLIEEESGISSLS-SYSSSSDEE--FGMGRS 321
Query: 308 ARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
G+SA V ++ VCYPWSSL+AV++QAL+ R
Sbjct: 322 GGVSGHSAGVRGTAQTTVCYPWSSLVAVMIQALSHR 357
>gi|356549353|ref|XP_003543058.1| PREDICTED: CBS domain-containing protein CBSX5-like [Glycine max]
Length = 398
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 167/338 (49%), Positives = 221/338 (65%), Gaps = 28/338 (8%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHED-SAGCSCIGKVCMVDII------ 80
AL ALKR++++Y+++W+ +HS +++ I E C+CIGKVCMVDII
Sbjct: 33 ALAALKRIDDTYVSVWNCNHSFIRKQQP--QIKSQEQLQCCCTCIGKVCMVDIICFLSKP 90
Query: 81 --------TFLCPCFCSFAKDSGI-VRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQP 131
FL P +S + VRHL P+ASLLEA+D++ GVQNLVI P I+ +
Sbjct: 91 QNLSSPSAAFLSPISALLHDNSAVLVRHLPPTASLLEAIDVMHEGVQNLVI-PIQIQFES 149
Query: 132 KPSLKSTFHND-SEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDD 190
S + HND + YCWLTQED+ RY LN IG+ +PTP PIN+ +ID + A+ YDD
Sbjct: 150 LNS-NNVHHNDNTTYCWLTQEDVFRYLLNSIGVFSPTPGNPINTLGVIDTQNLFAVCYDD 208
Query: 191 PAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAGDLMAYM 250
PA+ + L+A S + Q+S+AIVD G+ G+ISP LNSCDE+V AIATL AGDL AY+
Sbjct: 209 PASSILDLLALSLLYQSSIAIVDPNGKFGGEISPVMLNSCDESVVPAIATLSAGDLTAYI 268
Query: 251 DCGRPLKDLVRLVKQRLDEK-NMVGLLELMEDDLE----ISSGSCSNSSSSDEESSTGSA 305
DCG P +DLV+LVK+R+ EK LLEL+ D+ S S S+S SSDEE +G
Sbjct: 269 DCGGPPEDLVQLVKERVKEKVEEQNLLELLGDETTGTGLTSWSSFSSSCSSDEEFCSG-- 326
Query: 306 RSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
++ + GGYSARV RSEAIVCY WSSL+AV++QALA R
Sbjct: 327 KNWKLGGYSARVGRRSEAIVCYRWSSLVAVMIQALAHR 364
>gi|357446497|ref|XP_003593526.1| hypothetical protein MTR_2g013110 [Medicago truncatula]
gi|124360612|gb|ABN08611.1| CBS [Medicago truncatula]
gi|355482574|gb|AES63777.1| hypothetical protein MTR_2g013110 [Medicago truncatula]
Length = 396
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 156/334 (46%), Positives = 219/334 (65%), Gaps = 23/334 (6%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC--- 84
AL A+K+ ESYI++W+ HS ++ +D E C CIGKVCMVDII FLC
Sbjct: 34 ALNAIKKHAESYISVWNCHHSINRKPPQTLIKEDFE--FHCKCIGKVCMVDIICFLCRPE 91
Query: 85 -----------PCFCSFAKD-SGIVRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPK 132
P A D S +VRH++P+ASLLE +D++ GVQN V++P + + K
Sbjct: 92 NLSSPAAALRSPVPILLADDRSSLVRHIQPNASLLETIDVMDEGVQN-VVMPISDENKCK 150
Query: 133 PSLKSTFHNDSE-YCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDP 191
HND YCWL+QED++RY LN IG + TP Q I+ +IID + +++DDP
Sbjct: 151 KKENEILHNDKRAYCWLSQEDVMRYLLNSIGTFSDTPAQSIDKLDIIDTQNLYFLYFDDP 210
Query: 192 AAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAGDLMAYMD 251
A+ A+ L+ S + Q+SVA+VD +G+L+G+ISPF LNSCDE AIATL AGDL+AY+D
Sbjct: 211 ASSALELLTASIVHQSSVAVVDPQGKLIGEISPFMLNSCDEIDVPAIATLSAGDLLAYID 270
Query: 252 CGRPLKDLVRLVKQRLDEKNM-VGLLELM-EDDLEISSGSCSNSSSSDEESSTGSARSAR 309
CG P +DLV+LVK+RL E+N+ +EL+ E S S S++SS ++ S G ++ +
Sbjct: 271 CGGPPEDLVQLVKERLHEQNLDNAAVELLGEGSELSSWSSFSSTSSEEDICSLG--KNWK 328
Query: 310 SGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
GG+S+R++ RSEAIVCYPWSSL+AV++QAL+ R
Sbjct: 329 LGGFSSRIMRRSEAIVCYPWSSLVAVMIQALSHR 362
>gi|255644848|gb|ACU22924.1| unknown [Glycine max]
Length = 398
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 163/337 (48%), Positives = 218/337 (64%), Gaps = 26/337 (7%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHED-SAGCSCIGKVCMVDII------ 80
AL ALKR++++Y+++W+ +HS +++ I E C+CIGKVCMVDII
Sbjct: 33 ALAALKRIDDTYVSVWNCNHSFIRKQQP--QIKSQEQLQCCCTCIGKVCMVDIICFLSKP 90
Query: 81 --------TFLCPCFCSFAKDSGI-VRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQP 131
FL P +S + VRHL P+ SLLEA+D++ GVQNLVI P I+ +
Sbjct: 91 QNLSSPSAAFLSPISALLHDNSAVLVRHLPPTTSLLEAIDVMHEGVQNLVI-PIQIQFES 149
Query: 132 KPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDP 191
S +N++ YCWLTQED+ RY LN IG+ +PTP PIN+ +ID + A+ YDDP
Sbjct: 150 LNSNNVHHNNNTTYCWLTQEDVFRYLLNSIGVFSPTPGNPINTLGVIDTQNLFAVCYDDP 209
Query: 192 AAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAGDLMAYMD 251
A+ + L+A S + Q+S+AIVD G+ G+ISP LNSCDE+V AIATL AGDL AY+D
Sbjct: 210 ASSILDLLALSLLYQSSIAIVDPNGKFGGEISPVMLNSCDESVVPAIATLSAGDLTAYID 269
Query: 252 CGRPLKDLVRLVKQRLDEK-NMVGLLELMEDDLE----ISSGSCSNSSSSDEESSTGSAR 306
CG P +DLV+LVK+R+ EK LLEL+ D+ S S S+S SSDEE +G +
Sbjct: 270 CGGPPEDLVQLVKERVKEKVEEQNLLELLGDETTGTGLTSWSSFSSSCSSDEEFCSG--K 327
Query: 307 SARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
+ + GGYSA V RSEAIVCY WSSL+AV++QALA R
Sbjct: 328 NWKLGGYSAGVGRRSEAIVCYRWSSLVAVMIQALAHR 364
>gi|356555172|ref|XP_003545910.1| PREDICTED: LOW QUALITY PROTEIN: CBS domain-containing protein
CBSX5-like [Glycine max]
Length = 395
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 159/336 (47%), Positives = 220/336 (65%), Gaps = 26/336 (7%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC--- 84
AL ALKR++++Y+++W+ +HS +++ ++ C+CIGKVCM+DII FL
Sbjct: 32 ALAALKRIDDTYVSVWNCNHSFIRKQQPQIQ---SQNQCCCTCIGKVCMLDIICFLSKPQ 88
Query: 85 -----------PCFCSFAKDSGI-VRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPK 132
P + +S + V HL PSASLLEA+D++ GVQNLVI P +++
Sbjct: 89 SLSSPSAALHSPISAALQDNSAVLVLHLPPSASLLEAIDVMQEGVQNLVI-PIQNQVESL 147
Query: 133 PSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPA 192
S +N++ YCWLTQED++RY LN IG+ +PTP PIN+ +ID + A+ YDDPA
Sbjct: 148 DSNNVHHNNNTTYCWLTQEDVLRYLLNSIGVFSPTPGNPINTLGVIDTKNLFAVCYDDPA 207
Query: 193 AFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAGDLMAYMDC 252
+ + L+A S I Q+SVAIVD G+ G+ISP LNS DE+V AIATL AGDL AY+DC
Sbjct: 208 SSILDLLALSLIYQSSVAIVDPNGKFGGEISPVMLNSYDESVVPAIATLSAGDLTAYIDC 267
Query: 253 GRPLKDLVRLVKQRLDEK-NMVGLLELMEDDLE----ISSGSCSNSSSSDEESSTGSARS 307
G P +DLV+LVK+R+ EK +LEL+ D+ S S S+S SSDEE +G ++
Sbjct: 268 GGPPEDLVQLVKERVKEKVEEQNMLELLGDETTRTGLTSWSSFSSSCSSDEEFCSG--KN 325
Query: 308 ARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
+ GGYSARV RSEAIVC+ WSSL+AV++QAL+ R
Sbjct: 326 WKLGGYSARVGRRSEAIVCHRWSSLVAVMIQALSHR 361
>gi|388520909|gb|AFK48516.1| unknown [Medicago truncatula]
Length = 389
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 156/333 (46%), Positives = 206/333 (61%), Gaps = 42/333 (12%)
Query: 36 NESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLCP---CFCS--- 89
ES++++W+ DHS + C C+GK+CMVD+I FLC C
Sbjct: 41 GESFVSVWNCDHS---------------EFGQCQCVGKICMVDVIVFLCKQENLLCPSKA 85
Query: 90 --------FAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVI-----LPAGI---KLQPKP 133
F + G+V HL+PS+SLL+A+DL+L G QNLV+ G+ KLQ K
Sbjct: 86 LKASISNVFNEVDGLVVHLEPSSSLLDAIDLILEGAQNLVVPISQTKKGGLSRRKLQQK- 144
Query: 134 SLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAA 193
SL HN +E+CWLTQED+IR+ L IG + P Q I+ NII + +L+I Y PA+
Sbjct: 145 SLTINSHNGAEFCWLTQEDVIRFLLGSIGRFSALPAQSIDRLNIIS-SDVLSIDYSSPAS 203
Query: 194 FAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAGDLMAYMDCG 253
A+ I++S +QTSVAIVD +G +G+ISPF+L CDETVAAAI TL AG LMAY+DCG
Sbjct: 204 SAVEAISKSLTQQTSVAIVDGDGTFIGEISPFTLACCDETVAAAITTLSAGGLMAYIDCG 263
Query: 254 RPLKDLVRLVKQRLDEKNMVGLLE---LMEDDLEISSGSCSNSSSSDEESSTGSARSARS 310
RP +DLVR+VK RL EKN+ LL+ LM S S S+ S S T S + ARS
Sbjct: 264 RPPEDLVRVVKARLKEKNLEKLLQEFTLMTSLTGDMSSSSSSDEESPGRSLTRSGKYARS 323
Query: 311 GGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
YSA+ V ++EAIVC+P SSL+AV+MQA+A R
Sbjct: 324 SSYSAKYVRKAEAIVCHPKSSLIAVMMQAIANR 356
>gi|449434344|ref|XP_004134956.1| PREDICTED: CBS domain-containing protein CBSX5-like [Cucumis
sativus]
Length = 397
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 155/347 (44%), Positives = 209/347 (60%), Gaps = 46/347 (13%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC--- 84
AL ALK+L E+YI++W+ K + C CIGK+ ++D++ FLC
Sbjct: 32 ALSALKKLGENYISVWNCSSHYSKSSSHY----------DCRCIGKISVLDVVLFLCKEE 81
Query: 85 ----PCFCSFAKDSGI-------VRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPKP 133
P + S + VRHL+P ASL+EA+DLLL G QNLV+ +Q +
Sbjct: 82 NLSQPALALQSSVSVLIPPVPVLVRHLEPHASLMEAIDLLLEGAQNLVV-----PIQTRT 136
Query: 134 SLKS-------------TFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDD 180
S KS HN EYCW+TQED+IRY LN IGL +PT P+NS N ID
Sbjct: 137 SAKSREKVLEVVAPFDCPLHNGLEYCWITQEDIIRYLLNSIGLFSPTSITPVNSLNAIDT 196
Query: 181 AGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIAT 240
ILA+HYDDPA A+PL++Q+ I Q+S+AIVD +G+L+G+ISP +LNS DET+ AAI T
Sbjct: 197 VNILALHYDDPALSALPLLSQAIIHQSSIAIVDSDGKLIGEISPLTLNSFDETITAAIVT 256
Query: 241 LLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEES 300
L AG+LMAY++C P + LV+LVK RL+ +N+ GLLE +E++ +S+ S +S S
Sbjct: 257 LSAGELMAYVNCNDPPEYLVQLVKDRLEGRNLRGLLEWVEEESAMSAMSSCSSFCSSSSD 316
Query: 301 STGSARSARSGGY---SARVVHR-SEAIVCYPWSSLMAVIMQALARR 343
+ RSG S R V R SE VC P SSL+AV++QALA R
Sbjct: 317 DDSGSWWGRSGKLRKCSTRQVRRSSEVAVCNPRSSLVAVMIQALALR 363
>gi|356549749|ref|XP_003543253.1| PREDICTED: CBS domain-containing protein CBSX5-like [Glycine max]
Length = 389
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 153/309 (49%), Positives = 199/309 (64%), Gaps = 32/309 (10%)
Query: 63 EDSAGCSCIGKVCMVDIITFLC--------------PCFCSFAKDSGIVRHLKPSASLLE 108
E+ C+GK+CMVD+I +LC P KD +V HL+PS+SL E
Sbjct: 52 ENKNEVRCVGKLCMVDVICYLCREDNLLSPSKALKEPLSSILPKDQSLVVHLQPSSSLFE 111
Query: 109 AVDLLLGGVQNLV--ILP---AGI---KLQPKPSLKSTF--HNDSEYCWLTQEDLIRYFL 158
A+DL+L G QNLV ILP +G+ K Q ST H+ E+CWLTQED+IR+ L
Sbjct: 112 AIDLILQGAQNLVVPILPTKRSGVSRRKQQQHQKASSTINSHSSCEFCWLTQEDVIRFLL 171
Query: 159 NFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRL 218
IG+ P P I+S II + +LAI Y PA+ A+ I++S +QTSVAIVD +G
Sbjct: 172 GSIGVFTPLPALSIDSLGIIS-SDVLAIDYYSPASSAVGAISKSLTQQTSVAIVDSDGTF 230
Query: 219 VGDISPFSLNSCDETVAAAIATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLEL 278
+G+ISPF+L CDETVAAAIATL AGDLMAY+DCG P +DLVRLVK RL EKN ++
Sbjct: 231 IGEISPFTLACCDETVAAAIATLSAGDLMAYIDCGGPPEDLVRLVKARLKEKNFE---KM 287
Query: 279 MEDDLEISSGSCSNSSSSDEESST----GSARSARSGGYSARVVHRSEAIVCYPWSSLMA 334
+++ +SS S S+SSDEE T S R ARS YSAR+V ++EAIVC+P SSL+A
Sbjct: 288 LQEFTILSSCESSQSTSSDEELPTRTPARSGRLARSSSYSARMVRKAEAIVCHPKSSLVA 347
Query: 335 VIMQALARR 343
V++QA+A R
Sbjct: 348 VMIQAIAHR 356
>gi|225461389|ref|XP_002284800.1| PREDICTED: CBS domain-containing protein CBSX5 [Vitis vinifera]
gi|302143038|emb|CBI20333.3| unnamed protein product [Vitis vinifera]
Length = 394
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 155/344 (45%), Positives = 214/344 (62%), Gaps = 41/344 (11%)
Query: 27 HALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC-- 84
AL ALK +S+I++WS DHSA NI D C C+GK+CMVD++ +LC
Sbjct: 31 EALSALKTSEDSFISVWSCDHSA--------NIQDE-----CRCVGKICMVDVVCYLCKD 77
Query: 85 ------------PCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVI-------LPA 125
P G+V H++P +SLLEA+DL+L G QNLV+ +
Sbjct: 78 DNLLSPSSALKSPVSDLLPNIPGLVMHVEPHSSLLEAIDLILQGAQNLVVPIRSSISNSS 137
Query: 126 GIKLQPKPSLK-STFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGIL 184
KL KP +T H EYCWLTQED++RY L+ IGLL+P PI++ IID +L
Sbjct: 138 RRKLYQKPQTSPTTMHKGCEYCWLTQEDVVRYLLSSIGLLSPIAALPIDTLRIID-TDVL 196
Query: 185 AIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAG 244
AI+Y PA+ ++P I +S QTSVA+VDE G L+G+ISPF+L CDETVAAAI TL +G
Sbjct: 197 AINYHSPASSSLPAILRSLRDQTSVAVVDENGALIGEISPFTLACCDETVAAAITTLSSG 256
Query: 245 DLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGS 304
DLM+Y+DCG P +++V+ VK RL ++N+ G+LE D +S ++SS + S +
Sbjct: 257 DLMSYIDCGGPPEEIVKTVKTRLKQRNLEGMLEEFALDSSSTSSLSASSSDEESSPSPKT 316
Query: 305 A-----RSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
A + +RS YSAR+V R+EAIVC+P SSL+AV++QA+A R
Sbjct: 317 ALYRPGKYSRSSSYSARMVRRAEAIVCHPGSSLVAVMIQAIAHR 360
>gi|356544022|ref|XP_003540455.1| PREDICTED: CBS domain-containing protein CBSX5-like [Glycine max]
Length = 390
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 144/307 (46%), Positives = 192/307 (62%), Gaps = 39/307 (12%)
Query: 70 CIGKVCMVDIITFLC--------------PCFCSFAKDSGIVRHLKPSASLLEAVDLLLG 115
C+GK+CMVD+I +LC P KD +V HL+PS+SLLEA+DL+L
Sbjct: 57 CVGKLCMVDVICYLCREDNLLSPSKSLKEPLSSILPKDHNLVVHLQPSSSLLEAIDLILQ 116
Query: 116 GVQNLV--ILP---AGIKLQPKPSLKSTF----HNDSEYCWLTQEDLIRYFLNFIGLLNP 166
G QN V ILP +G+ + + K++ H+ E+CWLTQED+IR+ L IG+ P
Sbjct: 117 GAQNFVVPILPTKRSGVSRRKQQHQKASSTINSHSSCEFCWLTQEDVIRFLLGSIGVFTP 176
Query: 167 TPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFS 226
P I+S I+ + +LAI Y PA+ + I++S +QTSVAIVD +G +G+ISPF+
Sbjct: 177 LPALSIDSLGIVS-SDVLAIDYYSPASSTVGAISKSLAQQTSVAIVDSDGTFIGEISPFT 235
Query: 227 LNSCDETVAAAIATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEIS 286
L CDETVAAA+ATL AGDLMAY+DCG P +DLVR+VK RL EKN LE M + I
Sbjct: 236 LACCDETVAAAMATLSAGDLMAYIDCGGPPEDLVRVVKARLKEKN----LEKMLQEFTIL 291
Query: 287 SGSC----------SNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVI 336
S SC S+ S + S R ARS YSAR+V ++EAIVC+P SSL+AV+
Sbjct: 292 S-SCESSQLASSSSSSDEESTTRTPARSGRLARSSSYSARMVRKAEAIVCHPKSSLVAVM 350
Query: 337 MQALARR 343
+QA+A R
Sbjct: 351 IQAIAHR 357
>gi|224128366|ref|XP_002329144.1| predicted protein [Populus trichocarpa]
gi|222869813|gb|EEF06944.1| predicted protein [Populus trichocarpa]
Length = 411
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 141/348 (40%), Positives = 208/348 (59%), Gaps = 32/348 (9%)
Query: 27 HALLALKRLNESYINIWSSDHSARKRKAAAANIDDHE-DSAGCSCIGKVCMVDIITFLC- 84
AL ALK ++++I++W+ DH+A+ N ++ D C C+GKV MVD++ +LC
Sbjct: 31 EALFALKNSDDNFISVWNCDHAAKTNNDYKGNCEEEGCDVCECKCVGKVSMVDVVCYLCK 90
Query: 85 -------------PCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVI-----LPAG 126
P + G+V H++P++SLLEA+DL+L G +NLV+
Sbjct: 91 DENLLFPSDALKAPVSVLLPEIPGMVVHVEPTSSLLEAIDLILQGAKNLVVPIKTRYSTR 150
Query: 127 IKLQPKPSLKS-TFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILA 185
K K S+ S T HN E+CWLTQED+IR+FL IGL P P I++ II L
Sbjct: 151 RKQHQKLSITSPTIHNGREFCWLTQEDIIRFFLGSIGLFAPLPALSIDTLGIIS-TEFLT 209
Query: 186 IHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAGD 245
I Y PA + I++S +TSVA++D +G L+G++SPF+L CD++VAAAI TL +GD
Sbjct: 210 IDYHSPAISELEAISRSLADETSVAVIDSDGILIGELSPFTLACCDDSVAAAITTLSSGD 269
Query: 246 LMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTG-- 303
LMAY+DCG P +DLV++V +RL E+ + +L+ + S+ S S SS + +
Sbjct: 270 LMAYIDCGGPPEDLVKVVMERLKERGLEAMLQEFTNSSCYSTTSSCQSQSSSSDEESASS 329
Query: 304 --------SARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S + +RS YSAR+V R+EAIVC+P SSL+AV++QA+A R
Sbjct: 330 TPVSTLHRSGKYSRSMSYSARMVRRAEAIVCHPKSSLVAVMIQAIAHR 377
>gi|224117210|ref|XP_002317509.1| predicted protein [Populus trichocarpa]
gi|222860574|gb|EEE98121.1| predicted protein [Populus trichocarpa]
Length = 401
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 201/341 (58%), Gaps = 28/341 (8%)
Query: 27 HALLALKRLNESYINIWSSDHSARKRKAAAANID-DHEDSAGCSCIGKVCMVDIITFLC- 84
L ALK +++++++WS +H+A+ K N + D D C C+GKV MVD+I +LC
Sbjct: 31 EVLFALKNSDDNFLSVWSCEHTAKTNKDYRGNCEEDGCDVGECKCVGKVSMVDVICYLCK 90
Query: 85 -------------PCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVILPAGI---- 127
P + G+V H++P++SLL+A+DL+L G +NLV+ P
Sbjct: 91 DENLLSPSDALKAPVSVLLPEIPGMVVHVEPTSSLLDAIDLILQGAKNLVV-PIKTRYSS 149
Query: 128 ----KLQPKPSLKS-TFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAG 182
K K S+ S T HN E+CWLTQED+IR+FL IGL P P I++ II
Sbjct: 150 SSRRKQHQKLSITSPTIHNGREFCWLTQEDIIRFFLGSIGLFAPLPALSIDTLGIIS-TD 208
Query: 183 ILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLL 242
L I Y PA + I+ S + SVAI+D +G L+G++SPF+L CDE+VAAAI TL
Sbjct: 209 YLTIDYHSPAISELEAISGSLADENSVAIIDSDGILIGELSPFTLACCDESVAAAITTLS 268
Query: 243 AGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESST 302
+GDLMAY+DCG P DLV LV RL + + +L+ + S+ S +S+
Sbjct: 269 SGDLMAYIDCGGPPDDLVNLVMTRLKGRGLEAMLQEFTNSSCYSTTSSWSSTPFSALQRP 328
Query: 303 GSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
G + +RS YSAR+V R+EAIVC+P SSL+AV++QA+A R
Sbjct: 329 G--KYSRSMSYSARMVRRAEAIVCHPKSSLVAVMIQAIAHR 367
>gi|255575342|ref|XP_002528574.1| conserved hypothetical protein [Ricinus communis]
gi|223532018|gb|EEF33829.1| conserved hypothetical protein [Ricinus communis]
Length = 408
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 147/349 (42%), Positives = 209/349 (59%), Gaps = 37/349 (10%)
Query: 27 HALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC-- 84
AL ALK ++S++++W+ DH ++ D ED C C+GKV +VD+I +LC
Sbjct: 31 EALSALKNSDDSFLSVWNCDHITKRNSGFNC---DREDRDECKCVGKVSIVDVICYLCQD 87
Query: 85 ------------PCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPK 132
P K G+V H++PS+SL+EA+DL+L G QNLV+ P +L
Sbjct: 88 KNLVSPSDALKDPVSVLLPKIPGLVMHVEPSSSLVEAIDLILQGAQNLVV-PIKTRLSSS 146
Query: 133 PSLK-------------STFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIID 179
S + +T H E+CWL QED+IR+FL+ IGL +P P I+S II
Sbjct: 147 NSRRKQQQKLSATSTGLTTIHKGREFCWLAQEDIIRFFLSSIGLFSPVPALSIDSLGIIT 206
Query: 180 DAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVD-EEGRLVGDISPFSLNSCDETVAAAI 238
I+ I Y+ PA+ + I ++ QTSVA+VD +EG L+G++SPF+L CDETVAAAI
Sbjct: 207 TD-IITIDYNSPASATLGAINRALATQTSVAVVDGDEGILIGELSPFTLACCDETVAAAI 265
Query: 239 ATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDE 298
TL +GDLMAY+DCG P +DLVR+V RL + + +L+ + + SSSS +
Sbjct: 266 TTLSSGDLMAYIDCGGPPEDLVRVVMARLKHRGLEAMLQEFTNSTTSLVSFSTLSSSSSD 325
Query: 299 ESSTG----SARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
E ST S + +RS YSAR+V R+EAIVC+P SSL+AV++QA+A R
Sbjct: 326 EESTTTLHRSGKYSRSKSYSARMVRRAEAIVCHPKSSLVAVMIQAIAHR 374
>gi|449447217|ref|XP_004141365.1| PREDICTED: CBS domain-containing protein CBSX5-like [Cucumis
sativus]
Length = 401
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 141/298 (47%), Positives = 186/298 (62%), Gaps = 24/298 (8%)
Query: 68 CSCIGKVCMVDIITFLC-------PCFCSFAKDS-------GIVRHLKPSASLLEAVDLL 113
C C+GK+CMVD+I +LC P A S GIV HL+PSASLLEA+DL+
Sbjct: 72 CRCVGKLCMVDVICYLCKEENLLSPSSALQASVSEILPQIPGIVMHLEPSASLLEAIDLV 131
Query: 114 LGGVQNLVILPAGIKLQP---KPSLKST---FHNDSEYCWLTQEDLIRYFLNFIGLLNPT 167
L G QNLV+ P +L + LK++ H E+CWLTQED+IRY L IGL +P
Sbjct: 132 LQGAQNLVV-PIKTRLGSNSRRKQLKNSTNGIHGGHEFCWLTQEDIIRYLLGSIGLFSPI 190
Query: 168 PNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSL 227
++S II L+++Y PA+ AI I+ S QTSVA++D +G L+G+ISPF+L
Sbjct: 191 AALSLDSLGIIC-TNALSVNYHSPASSAIGAISHSITNQTSVAVIDGDGILIGEISPFAL 249
Query: 228 NSCDETVAAAIATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISS 287
CD+ VAAAI TL +GDLMAY+DCG P +DLV++VK RL + + G+LE +
Sbjct: 250 AGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKDSKLEGMLEEFTNSPSSIG 309
Query: 288 GSCSNSSSSDEE--SSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
SSSSDEE S S R RS YSAR+ R+EAIVC+P SSL+AV++QA+ R
Sbjct: 310 SPSFTSSSSDEEFSPSPSSRRYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAITHR 367
>gi|116787743|gb|ABK24626.1| unknown [Picea sitchensis]
Length = 432
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 137/374 (36%), Positives = 201/374 (53%), Gaps = 65/374 (17%)
Query: 28 ALLALKR-LNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC-- 84
AL ALK+ +E+ + +W+ +HS K +D CSC+GKVCMVDII FL
Sbjct: 32 ALKALKQSPHETELGVWNCNHSWLDHKKPTNEGQQLKD---CSCLGKVCMVDIICFLSRD 88
Query: 85 -----PCFCSFAKDSGI--------VRHLKPSASLLEAVDLLLGGVQNLVI--------- 122
P A S + VRH+ P +SLL+A+D +L G QNL++
Sbjct: 89 ESLYDPASALSAPVSSLFLPRIPSRVRHVDPGSSLLQALDFILEGAQNLIVPIENHKRLS 148
Query: 123 -LPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDA 181
G K+ + ST H E+CW+ QED++R+ L FIG+ +P P+ I I++
Sbjct: 149 FKKLGQKIASAGTASSTSHGGKEFCWINQEDVVRFLLGFIGVFSPLPSMTIEDLGIVNRE 208
Query: 182 GILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEG----RLVGDISPFSLNSCDETVAAA 237
+L + YD PA+ A+ +I + QT+VA+V+++ +L+G+ISP +L CDETVA A
Sbjct: 209 -VLMVEYDKPASSALQMIQLASNTQTAVAVVEQDPLQGPKLIGEISPSTLMYCDETVALA 267
Query: 238 IATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLE------------I 285
+ATL AGD MAY+DCG P + LV LV++R+++K +G + E D
Sbjct: 268 LATLSAGDFMAYVDCGGPPESLVDLVRRRINQK--MGPMGEGEQDQNPGNPIPGSDKPLT 325
Query: 286 SSGSCSNSS---SSDEE----SSTGSARSARSG---------GYSARVVHRSEAIVCYPW 329
+ S S S SSDEE S +G R++ GY++R R + C PW
Sbjct: 326 ETDSLSTDSWEESSDEEFSVQSPSGPLRNSNKWSRSCSFNKFGYASR-GRRVAPLTCKPW 384
Query: 330 SSLMAVIMQALARR 343
SSL+AV++QALA R
Sbjct: 385 SSLVAVMVQALAHR 398
>gi|444436437|gb|AGE09586.1| CBS-like protein, partial [Eucalyptus cladocalyx]
Length = 288
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 113/262 (43%), Positives = 161/262 (61%), Gaps = 28/262 (10%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAG--CSCIGKVCMVDIITFL-- 83
AL AL+ +S++++WS DH R AAA HE G C C+GK+CMVD+I +L
Sbjct: 32 ALSALRSSEDSFLSVWSCDH----RSKAAAAAASHEGGEGGECRCVGKLCMVDVICYLSS 87
Query: 84 ------------CPCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLV--ILPAGIKL 129
P K G V H++PS+ L+EA+DL+L G QNLV I +
Sbjct: 88 EDSLSSPSEALKAPVSALLPKIPGQVVHVEPSSRLVEAIDLILQGAQNLVVPIQTRSTRR 147
Query: 130 QPKPSLKS-----TFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGIL 184
+ L S T H E+CWLTQED+IR+ LN IG+ +P P + I++ +I +L
Sbjct: 148 KQHQKLSSSTGPITVHKGEEFCWLTQEDVIRFLLNSIGIFSPIPARTIDTLGLITTE-VL 206
Query: 185 AIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAG 244
A+ Y PA+ A+ I++S ++QTSVA+VD EG L+G+ISP +L C+ETVAAA+ATL G
Sbjct: 207 AVDYHSPASAALEAISRSLVEQTSVAVVDVEGVLIGEISPSTLACCEETVAAAVATLSCG 266
Query: 245 DLMAYMDCGRPLKDLVRLVKQR 266
DLM+Y+DCG P ++LVR+V++
Sbjct: 267 DLMSYIDCGGPPENLVRVVERE 288
>gi|27754524|gb|AAO22709.1| unknown protein [Arabidopsis thaliana]
Length = 391
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 128/346 (36%), Positives = 197/346 (56%), Gaps = 51/346 (14%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLCP-- 85
A+ ALK ++++++W+ +H D +++ C C+GK+ M D+I L
Sbjct: 33 AIAALKSSEDTFLSVWNCNH-------------DDDNNTECECLGKISMADVICHLSKDH 79
Query: 86 --CFCSF--------AKDSGIVRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPKPSL 135
C+ K IV H++PS SL+EA+DL++ G QNL++ + KP
Sbjct: 80 DHSLCALNSSVSVLLPKTRSIVLHVQPSCSLIEAIDLIIKGAQNLIV-----PIHTKPYT 134
Query: 136 KSTFHNDS------------EYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDA-G 182
K HND+ +CW+TQED+I++ L FI +P P ++ +I+
Sbjct: 135 KKKQHNDNVSVTTTTHSNGQRFCWITQEDIIQFLLGFIAAFSPLPAMSLSDLGVINSTHT 194
Query: 183 ILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEG-----RLVGDISPFSLNSCDETVAAA 237
++A+ Y A+ + ++ + QTSVA+VD EG L+G+ISP +L CDET AAA
Sbjct: 195 VVAVDYHSSASAVVSAVSNALAVQTSVAVVDGEGDDPFTSLIGEISPMTLTCCDETAAAA 254
Query: 238 IATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSD 297
+ATL AGDLMAY+D P + LV++V+ RL++K +VGL+ L + +SS S S+ SS+
Sbjct: 255 VATLSAGDLMAYIDGANPPESLVQIVRNRLEDKGLVGLMSLFD---SLSSYSTSSGYSSE 311
Query: 298 EESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
EE+ + RS SAR+ +SEAIVC P SSLMAV++QA+A R
Sbjct: 312 EEAPVRTTSYGRSMSSSARMARKSEAIVCNPKSSLMAVMIQAVAHR 357
>gi|57222158|gb|AAW38986.1| At4g27460 [Arabidopsis thaliana]
Length = 409
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 197/346 (56%), Gaps = 51/346 (14%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLCP-- 85
A+ ALK ++++++W+ +H D +++ C C+GK+ M D+I L
Sbjct: 51 AIAALKSSEDTFLSVWNCNH-------------DDDNNTECECLGKISMADVICHLSKDH 97
Query: 86 --CFCSF--------AKDSGIVRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPKPSL 135
C+ K IV H++PS SL+EA+DL++ G QNL++ + KP
Sbjct: 98 DHSLCALNSSVSVLLPKTRSIVLHVQPSCSLIEAIDLIIKGAQNLIV-----PIHTKPYT 152
Query: 136 KSTFHNDS------------EYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDA-G 182
K HND+ +CW+TQED+I++ L FI +P P ++ +I+
Sbjct: 153 KKKQHNDNVSVTTTTHSNGQRFCWITQEDIIQFLLGFIAAFSPLPAMSLSDLGVINSTHT 212
Query: 183 ILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEG-----RLVGDISPFSLNSCDETVAAA 237
++A+ Y A+ + ++ + QTSVA+VD EG L+G+ISP +L CDET AAA
Sbjct: 213 VVAVDYHSSASAVVSAVSNALAVQTSVAVVDGEGDDPFTSLIGEISPMTLTCCDETAAAA 272
Query: 238 IATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSD 297
+ATL AGDLMAY+D P + LV++V+ RL++K ++GL+ L + +SS S S+ SS+
Sbjct: 273 VATLSAGDLMAYIDGANPPESLVQIVRNRLEDKGLIGLMSLFD---SLSSYSTSSGYSSE 329
Query: 298 EESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
EE+ + RS SAR+ +SEAIVC P SSLMAV++QA+A R
Sbjct: 330 EEAPVRTTSYGRSMSSSARMARKSEAIVCNPKSSLMAVMIQAVAHR 375
>gi|30687603|ref|NP_194476.2| cystathionine beta-synthase domain-containing protein [Arabidopsis
thaliana]
gi|322518650|sp|Q84WQ5.2|CBSX5_ARATH RecName: Full=CBS domain-containing protein CBSX5
gi|111074322|gb|ABH04534.1| At4g27460 [Arabidopsis thaliana]
gi|332659945|gb|AEE85345.1| cystathionine beta-synthase domain-containing protein [Arabidopsis
thaliana]
Length = 391
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 197/346 (56%), Gaps = 51/346 (14%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLCP-- 85
A+ ALK ++++++W+ +H D +++ C C+GK+ M D+I L
Sbjct: 33 AIAALKSSEDTFLSVWNCNH-------------DDDNNTECECLGKISMADVICHLSKDH 79
Query: 86 --CFCSF--------AKDSGIVRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPKPSL 135
C+ K IV H++PS SL+EA+DL++ G QNL++ + KP
Sbjct: 80 DHSLCALNSSVSVLLPKTRSIVLHVQPSCSLIEAIDLIIKGAQNLIV-----PIHTKPYT 134
Query: 136 KSTFHNDS------------EYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDA-G 182
K HND+ +CW+TQED+I++ L FI +P P ++ +I+
Sbjct: 135 KKKQHNDNVSVTTTTHSNGQRFCWITQEDIIQFLLGFIAAFSPLPAMSLSDLGVINSTHT 194
Query: 183 ILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEG-----RLVGDISPFSLNSCDETVAAA 237
++A+ Y A+ + ++ + QTSVA+VD EG L+G+ISP +L CDET AAA
Sbjct: 195 VVAVDYHSSASAVVSAVSNALAVQTSVAVVDGEGDDPFTSLIGEISPMTLTCCDETAAAA 254
Query: 238 IATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSD 297
+ATL AGDLMAY+D P + LV++V+ RL++K ++GL+ L + +SS S S+ SS+
Sbjct: 255 VATLSAGDLMAYIDGANPPESLVQIVRNRLEDKGLIGLMSLFD---SLSSYSTSSGYSSE 311
Query: 298 EESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
EE+ + RS SAR+ +SEAIVC P SSLMAV++QA+A R
Sbjct: 312 EEAPVRTTSYGRSMSSSARMARKSEAIVCNPKSSLMAVMIQAVAHR 357
>gi|4972071|emb|CAB43878.1| hypothetical protein [Arabidopsis thaliana]
gi|7269600|emb|CAB81396.1| hypothetical protein [Arabidopsis thaliana]
Length = 433
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 197/346 (56%), Gaps = 51/346 (14%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLCP-- 85
A+ ALK ++++++W+ +H D +++ C C+GK+ M D+I L
Sbjct: 33 AIAALKSSEDTFLSVWNCNH-------------DDDNNTECECLGKISMADVICHLSKDH 79
Query: 86 --CFCSF--------AKDSGIVRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPKPSL 135
C+ K IV H++PS SL+EA+DL++ G QNL++ + KP
Sbjct: 80 DHSLCALNSSVSVLLPKTRSIVLHVQPSCSLIEAIDLIIKGAQNLIV-----PIHTKPYT 134
Query: 136 KSTFHNDS------------EYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDA-G 182
K HND+ +CW+TQED+I++ L FI +P P ++ +I+
Sbjct: 135 KKKQHNDNVSVTTTTHSNGQRFCWITQEDIIQFLLGFIAAFSPLPAMSLSDLGVINSTHT 194
Query: 183 ILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEG-----RLVGDISPFSLNSCDETVAAA 237
++A+ Y A+ + ++ + QTSVA+VD EG L+G+ISP +L CDET AAA
Sbjct: 195 VVAVDYHSSASAVVSAVSNALAVQTSVAVVDGEGDDPFTSLIGEISPMTLTCCDETAAAA 254
Query: 238 IATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSD 297
+ATL AGDLMAY+D P + LV++V+ RL++K ++GL+ L + +SS S S+ SS+
Sbjct: 255 VATLSAGDLMAYIDGANPPESLVQIVRNRLEDKGLIGLMSLFD---SLSSYSTSSGYSSE 311
Query: 298 EESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
EE+ + RS SAR+ +SEAIVC P SSLMAV++QA+A R
Sbjct: 312 EEAPVRTTSYGRSMSSSARMARKSEAIVCNPKSSLMAVMIQAVAHR 357
>gi|297803356|ref|XP_002869562.1| hypothetical protein ARALYDRAFT_328949 [Arabidopsis lyrata subsp.
lyrata]
gi|297315398|gb|EFH45821.1| hypothetical protein ARALYDRAFT_328949 [Arabidopsis lyrata subsp.
lyrata]
Length = 394
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 126/348 (36%), Positives = 189/348 (54%), Gaps = 53/348 (15%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFL---- 83
A+ ALK ++++++W+ +H + + C C+GK+ M DII L
Sbjct: 33 AITALKSSEDTFLSVWNCNH-------------NDDVVTECECLGKISMADIICHLSKDH 79
Query: 84 --------CPCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPKPSL 135
K IV H++PS SL+EA+DL++ G QNL++ +Q KP
Sbjct: 80 DHTLSALNASVSVLLPKTRSIVLHVQPSCSLIEAIDLIIQGAQNLIV-----PIQTKPFT 134
Query: 136 KSTFHNDS------------EYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDA-G 182
K H D+ +CW+TQED+I++ L I +P P ++ II+
Sbjct: 135 KKRQHKDNVSVTTTTHSNGRRFCWITQEDIIQFLLGSIAAFSPLPAMSLSDLGIINSTHT 194
Query: 183 ILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEG------RLVGDISPFSLNSCDETVAA 236
ILA+ Y A+ + I+ + QTSVA+VD EG L+G+ISP +L CDET AA
Sbjct: 195 ILAVDYHSSASAVVSAISNALAVQTSVAVVDGEGDDDPFTYLIGEISPMTLTCCDETAAA 254
Query: 237 AIATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSS 296
A+A L AG+L+AY+D P + V+ V+ RL++K ++GLL L + +S S S+ SS
Sbjct: 255 AVAMLSAGELVAYIDGANPPESFVQNVRNRLEDKGLMGLLSLFD---SLSPYSTSSGYSS 311
Query: 297 DEESSTGSARS-ARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
+EE+ + + RS SARV +SEAIVC P SSLMAV++QA+A R
Sbjct: 312 EEEAPARTTSTYGRSMSSSARVARKSEAIVCNPKSSLMAVMIQAVAHR 359
>gi|297792771|ref|XP_002864270.1| hypothetical protein ARALYDRAFT_495451 [Arabidopsis lyrata subsp.
lyrata]
gi|297310105|gb|EFH40529.1| hypothetical protein ARALYDRAFT_495451 [Arabidopsis lyrata subsp.
lyrata]
Length = 408
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 121/355 (34%), Positives = 187/355 (52%), Gaps = 52/355 (14%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLCP-- 85
A+ ALK +E ++ +WS +H + ED+ C C+GK+CM D+I +L
Sbjct: 33 AIAALKSSDEPFLTVWSCNHDEKT-----------EDNDKCECLGKICMADVICYLAKFD 81
Query: 86 -------------CFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVI---------- 122
K +V H++ S +L+EA+DL++ G QNL++
Sbjct: 82 NNVLSLSSAFDESVSVLLPKSRSLVVHVQSSCNLIEAIDLIIKGAQNLIVPIQTKSITKR 141
Query: 123 ------LPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHN 176
L + + + +T N ++CW+TQED+IR+ L+ I + +P P+ I+
Sbjct: 142 RQQQKLLTRNVVVSLTNTTSTTHKNSRQFCWITQEDIIRFLLDSISVFSPLPSLSISDLG 201
Query: 177 IIDDA-GILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEE-------GRLVGDISPFSLN 228
+I+ ILAI Y AA A+ I+++ + SVA+VD+ L+G+ISP +L
Sbjct: 202 VINSTHTILAIDYYSSAASAVSTISRAILDNVSVAVVDKGCDQEDPCMALIGEISPMTLA 261
Query: 229 SCDETVAAAIATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSG 288
CDET AA+ATL AGDLM+Y+D P + LV +V+ RL++K MVGL+ L++ S
Sbjct: 262 CCDETAVAAVATLSAGDLMSYIDGSGPPESLVGVVRNRLEDKGMVGLISLIDSLSLSSGS 321
Query: 289 SCSNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S S + T S RS +AR+ +S AIVC SSLMAV++QA+A R
Sbjct: 322 SSDEESPAGRTRMTSSY--GRSVSSAARMARKSVAIVCNRKSSLMAVMIQAIAHR 374
>gi|449531313|ref|XP_004172631.1| PREDICTED: CBS domain-containing protein CBSX5-like, partial
[Cucumis sativus]
Length = 283
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 107/246 (43%), Positives = 146/246 (59%), Gaps = 22/246 (8%)
Query: 36 NESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC-------PCFC 88
++ ++++W R A D C C+GK+CMVD+I +LC P
Sbjct: 40 HDYFVSVWDCRLPKRGCTGAVDGGAAGGDFECCRCVGKLCMVDVICYLCKEENLLSPSSA 99
Query: 89 SFAKDS-------GIVRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQP---KPSLKST 138
A S GIV HL+PSASLLEA+DL+L G QNLV+ P +L + LK++
Sbjct: 100 LQASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVV-PIKTRLGSNSRRKQLKNS 158
Query: 139 ---FHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFA 195
H E+CWLTQED+IRY L IGL +P ++S II L+++Y PA+ A
Sbjct: 159 TNGIHGGHEFCWLTQEDIIRYLLGSIGLFSPIAALSLDSLGIIC-TNALSVNYHSPASSA 217
Query: 196 IPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAGDLMAYMDCGRP 255
I I+ S QTSVA++D +G L+G+ISPF+L CD+ VAAAI TL +GDLMAY+DCG P
Sbjct: 218 IGAISHSITNQTSVAVIDGDGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGP 277
Query: 256 LKDLVR 261
+DLV+
Sbjct: 278 PEDLVK 283
>gi|9759069|dbj|BAB09547.1| unnamed protein product [Arabidopsis thaliana]
Length = 427
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 122/355 (34%), Positives = 189/355 (53%), Gaps = 52/355 (14%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFL---- 83
A+ ALK +E ++ +WS +H + +D+ C C+GK+CM D+I +L
Sbjct: 33 AIAALKSSDEPFLTVWSCNHDEKT-----------DDNDKCECLGKICMADVICYLSKFD 81
Query: 84 -----------CPCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVI---------- 122
K +V H++ S SL+EA+DL++ G QNL++
Sbjct: 82 NNVLSLSSAFDASVSVLLPKSRALVVHVQSSCSLIEAIDLIIKGAQNLIVPIHTKSITKR 141
Query: 123 ------LPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHN 176
L + + + +T N E+CW+TQED+IR+ L+ I + +P P+ I+
Sbjct: 142 RQQQKLLKRNVVVSLTNATSTTHKNSREFCWITQEDIIRFLLDSISVFSPLPSLSISDLG 201
Query: 177 IIDDA-GILAIHYDDPAAFAIPLIAQSHIKQTSVAIV----DEEG---RLVGDISPFSLN 228
+I+ ILA+ Y AA A+ I+++ + SVA+V D+E L+G+ISP +L
Sbjct: 202 VINSTHTILAVDYYSSAASAVSAISRAILDNVSVAVVGKGCDQEDPCMVLIGEISPMTLA 261
Query: 229 SCDETVAAAIATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSG 288
CDET AA+ATL AGDLM+Y+D P + LV +V+ RL++K MVGL+ L++ S
Sbjct: 262 CCDETAVAAVATLSAGDLMSYIDGSGPPESLVGVVRNRLEDKGMVGLISLIDSLSLSSGS 321
Query: 289 SCSNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S S + + T S RS +AR+ +S AIVC SSLMAV++QA+A R
Sbjct: 322 SSDEESPAGKTRMTSSY--GRSVSSAARMARKSVAIVCNRKSSLMAVMIQAIAHR 374
>gi|79537386|ref|NP_200186.2| CBS domain-containing protein [Arabidopsis thaliana]
gi|332009019|gb|AED96402.1| CBS domain-containing protein [Arabidopsis thaliana]
Length = 408
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 122/355 (34%), Positives = 189/355 (53%), Gaps = 52/355 (14%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFL---- 83
A+ ALK +E ++ +WS +H + +D+ C C+GK+CM D+I +L
Sbjct: 33 AIAALKSSDEPFLTVWSCNHDEKT-----------DDNDKCECLGKICMADVICYLSKFD 81
Query: 84 -----------CPCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVI---------- 122
K +V H++ S SL+EA+DL++ G QNL++
Sbjct: 82 NNVLSLSSAFDASVSVLLPKSRALVVHVQSSCSLIEAIDLIIKGAQNLIVPIHTKSITKR 141
Query: 123 ------LPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHN 176
L + + + +T N E+CW+TQED+IR+ L+ I + +P P+ I+
Sbjct: 142 RQQQKLLKRNVVVSLTNATSTTHKNSREFCWITQEDIIRFLLDSISVFSPLPSLSISDLG 201
Query: 177 IIDDA-GILAIHYDDPAAFAIPLIAQSHIKQTSVAIV----DEEG---RLVGDISPFSLN 228
+I+ ILA+ Y AA A+ I+++ + SVA+V D+E L+G+ISP +L
Sbjct: 202 VINSTHTILAVDYYSSAASAVSAISRAILDNVSVAVVGKGCDQEDPCMVLIGEISPMTLA 261
Query: 229 SCDETVAAAIATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSG 288
CDET AA+ATL AGDLM+Y+D P + LV +V+ RL++K MVGL+ L++ S
Sbjct: 262 CCDETAVAAVATLSAGDLMSYIDGSGPPESLVGVVRNRLEDKGMVGLISLIDSLSLSSGS 321
Query: 289 SCSNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S S + + T S RS +AR+ +S AIVC SSLMAV++QA+A R
Sbjct: 322 SSDEESPAGKTRMTSSY--GRSVSSAARMARKSVAIVCNRKSSLMAVMIQAIAHR 374
>gi|110741304|dbj|BAF02202.1| hypothetical protein [Arabidopsis thaliana]
Length = 408
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 122/355 (34%), Positives = 189/355 (53%), Gaps = 52/355 (14%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFL---- 83
A+ ALK +E ++ +WS +H + +D+ C C+GK+CM D+I +L
Sbjct: 33 AIAALKSSDEPFLTVWSCNHDEKT-----------DDNDKCECLGKICMADVICYLSKFD 81
Query: 84 -----------CPCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVI---------- 122
K +V H++ S SL+EA+DL++ G QNL++
Sbjct: 82 NNVLSLSSAFDASVSVLLPKSRALVVHVQSSCSLIEAIDLIIKGAQNLIVPIHTKSITKR 141
Query: 123 ------LPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHN 176
L + + + +T N E+CW+TQED+IR+ L+ I + +P P+ I+
Sbjct: 142 RQQQKLLKRNVVVSLTNATSTTHKNSREFCWITQEDIIRFLLDSISVFSPLPSLSISDLG 201
Query: 177 IIDDA-GILAIHYDDPAAFAIPLIAQSHIKQTSVAIV----DEEG---RLVGDISPFSLN 228
+I+ ILA+ Y AA A+ I+++ + SVA+V D+E L+G+ISP +L
Sbjct: 202 VINSTHTILAVDYYSSAASAVSAISRAILDNVSVAVVGKGCDQEDPCMVLIGEISPMTLA 261
Query: 229 SCDETVAAAIATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSG 288
CDET AA+ATL AGDLM+Y+D P + LV +V+ RL++K MVGL+ L++ S
Sbjct: 262 CCDETAVAAVATLSAGDLMSYIDGSGPPESLVGVVRNRLEDKGMVGLISLIDSLSLSSGS 321
Query: 289 SCSNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S S + + T S RS +AR+ +S AIVC SSLMAV++QA+A R
Sbjct: 322 SSDEESPAGKTRMTSSY--GRSVSSAARMARKSVAIVCNRKSSLMAVMIQAIAHR 374
>gi|22165070|gb|AAM93687.1| unknown protein [Oryza sativa Japonica Group]
gi|31432888|gb|AAP54464.1| CBS domain-containing protein, putative, expressed [Oryza sativa
Japonica Group]
gi|125532527|gb|EAY79092.1| hypothetical protein OsI_34199 [Oryza sativa Indica Group]
Length = 385
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 117/298 (39%), Positives = 167/298 (56%), Gaps = 23/298 (7%)
Query: 67 GCSCIGKVCMVDIITFLC---------------PCFCSFAKD-SGIVRHLKPSASLLEAV 110
G + G++ + D++ FLC P KD +G VR + P AS+LEA+
Sbjct: 57 GRAVAGRLGLADVLCFLCAAPGALAHPTAALSKPASALLPKDGAGEVRRVDPRASVLEAL 116
Query: 111 DLLLGGVQNLVILPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQ 170
D +L G Q L + + + +YCWLTQEDL+RYFLN I L + +
Sbjct: 117 DAVLSGAQVLAVPLRSGGRRKQLGGGGGGGGGGDYCWLTQEDLVRYFLNSISLFSHVAGR 176
Query: 171 PINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSC 230
++S ++ +L + + A A+PL+ ++ +T+VA+VD+ G LVG+ISP L SC
Sbjct: 177 SVSSLGLVRADDLLTVRPHEAALSAVPLLRRAIATETAVAVVDDGGHLVGEISPALLASC 236
Query: 231 DETVAAAIATLLAGDLMAYMD-CGRPLKDLVRLVKQRLDEKNMVGLLELMEDD--LEISS 287
DET AAAIATL DLMAY+D G P + ++R VK L K + +LEL+E++ +
Sbjct: 237 DETAAAAIATLSVADLMAYVDYFGAPPEHILRAVKAGLKSKGLDAMLELVENEAVSSFAF 296
Query: 288 GSCSNSSSSDEESSTGSARSAR--SGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S S SSSSD+E+ +AR R SG Y R E +VC P SSL+AV+MQALA R
Sbjct: 297 SSSSTSSSSDDEAHGRAARLRRPSSGSYGRRSTE--EPVVCSPASSLVAVMMQALAHR 352
>gi|449479605|ref|XP_004155649.1| PREDICTED: CBS domain-containing protein CBSX5-like [Cucumis
sativus]
Length = 369
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 124/334 (37%), Positives = 180/334 (53%), Gaps = 48/334 (14%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC--- 84
AL ALK+L E+YI++W+ K + C CIGK+ ++D++ FLC
Sbjct: 32 ALSALKKLGENYISVWNCSSHYSKSSSHY----------DCRCIGKISVLDVVLFLCKEE 81
Query: 85 ----PCFCSFAKDSGI-------VRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPKP 133
P + S + VRHL+P ASL+EA+DLLL G QNLV+ +Q +
Sbjct: 82 NLSQPALALQSSVSVLIPPVPVLVRHLEPHASLMEAIDLLLEGAQNLVV-----PIQTRT 136
Query: 134 SLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAA 193
S KS +E ++ F + P+ + + L
Sbjct: 137 SAKS------------REKVLEVVAPFDSPPP---SLPLIPLTPLTRSTFLLYTMTIQHL 181
Query: 194 FAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCDETVAAAIATLLAGDLMAYMDCG 253
A+PL++Q+ I Q+S+AIVD +G+L+G+ISP +LNS DET+ AAI TL AG+LMAY++C
Sbjct: 182 SALPLLSQAIIHQSSIAIVDSDGKLIGEISPLTLNSFDETITAAIVTLSAGELMAYVNCN 241
Query: 254 RPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGY 313
P + LV+LVK RL+ +N+ GLLE +E++ +S+ S +S S + RSG
Sbjct: 242 DPPEYLVQLVKDRLEGRNLRGLLEWVEEESAMSAMSSCSSFCSSSSDDDSGSWWGRSGKL 301
Query: 314 ---SARVVHR-SEAIVCYPWSSLMAVIMQALARR 343
S R V R SE VC P SSL+AV++QALA R
Sbjct: 302 RKCSTRQVRRSSEVAVCNPRSSLVAVMIQALALR 335
>gi|115482762|ref|NP_001064974.1| Os10g0499400 [Oryza sativa Japonica Group]
gi|113639583|dbj|BAF26888.1| Os10g0499400, partial [Oryza sativa Japonica Group]
Length = 282
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 145/244 (59%), Gaps = 7/244 (2%)
Query: 105 SLLEAVDLLLGGVQNLVILPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLL 164
S+LEA+D +L G Q L + + + +YCWLTQEDL+RYFLN I L
Sbjct: 8 SVLEALDAVLSGAQVLAVPLRSGGRRKQLGGGGGGGGGGDYCWLTQEDLVRYFLNSISLF 67
Query: 165 NPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISP 224
+ + ++S ++ +L + + A A+PL+ ++ +T+VA+VD+ G LVG+ISP
Sbjct: 68 SHVAGRSVSSLGLVRADDLLTVRPHEAALSAVPLLRRAIATETAVAVVDDGGHLVGEISP 127
Query: 225 FSLNSCDETVAAAIATLLAGDLMAYMD-CGRPLKDLVRLVKQRLDEKNMVGLLELMEDD- 282
L SCDET AAAIATL DLMAY+D G P + ++R VK L K + +LEL+E++
Sbjct: 128 ALLASCDETAAAAIATLSVADLMAYVDYFGAPPEHILRAVKAGLKSKGLDAMLELVENEA 187
Query: 283 -LEISSGSCSNSSSSDEESSTGSARSAR--SGGYSARVVHRSEAIVCYPWSSLMAVIMQA 339
+ S S SSSSD+E+ +AR R SG Y R E +VC P SSL+AV+MQA
Sbjct: 188 VSSFAFSSSSTSSSSDDEAHGRAARLRRPSSGSYGRRSTE--EPVVCSPASSLVAVMMQA 245
Query: 340 LARR 343
LA R
Sbjct: 246 LAHR 249
>gi|195644892|gb|ACG41914.1| hypothetical protein [Zea mays]
Length = 374
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 121/293 (41%), Positives = 172/293 (58%), Gaps = 25/293 (8%)
Query: 69 SCIGKVCMVDIITFLC---------------PCFCSFAKD-SGIVRHLKPSASLLEAVDL 112
+ +G+V + D++ FLC P KD +G VR + P +S+LEA+D
Sbjct: 56 AVVGRVGLADVLCFLCTDPEALARPAAVFAKPVSALLPKDGAGEVRRVDPRSSILEALDA 115
Query: 113 LLGGVQNLVI-LPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQP 171
+L G Q L + L AG + K L +D ++CWLTQEDL+RYFLN+I L+ +
Sbjct: 116 VLSGAQVLAVPLRAGGR---KKQLVGA-ADDGDFCWLTQEDLVRYFLNYICLVYNVAARS 171
Query: 172 INSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCD 231
++S ++ A L++ + + A+PLI ++ +T+VA+V E+G LVG+ISP L +CD
Sbjct: 172 VSSLGLV-RADFLSVRPGEASLSAVPLIRRAVATETAVAVVAEDGHLVGEISPALLAACD 230
Query: 232 ETVAAAIATLLAGDLMAYMD-CGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSC 290
ET AAAIATL A DLMAY+D P + ++R VK L +K + LL L+ED+ S S
Sbjct: 231 ETAAAAIATLSAADLMAYIDHYVSPPEHILRAVKAGLKDKGLDALLALVEDETLSSFSSL 290
Query: 291 SNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S SSSSDEE+ R SG Y R E +VC P SSL+AV++QALA R
Sbjct: 291 SASSSSDEEAGRAQLRRPSSGSYGRRAAD--EPVVCSPASSLVAVLVQALAHR 341
>gi|226528840|ref|NP_001141815.1| uncharacterized protein LOC100273954 [Zea mays]
gi|194706032|gb|ACF87100.1| unknown [Zea mays]
gi|413933920|gb|AFW68471.1| hypothetical protein ZEAMMB73_518907 [Zea mays]
Length = 374
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 169/294 (57%), Gaps = 27/294 (9%)
Query: 69 SCIGKVCMVDIITFLC---------------PCFCSFAKD-SGIVRHLKPSASLLEAVDL 112
+ +G+V D++ LC P KD +G VR + P +S+LEA+D
Sbjct: 56 AVVGRVGPADVLCLLCTDPEALARPAAVFSKPVSALLPKDGAGEVRRVDPRSSILEALDA 115
Query: 113 LLGGVQNLVI-LPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQP 171
+L G Q L + L AG + + + D ++CWLTQEDL+RYFLN+I L+ +
Sbjct: 116 ILSGAQVLAVPLRAGGRKK-----QLVGAADGDFCWLTQEDLVRYFLNYICLVYNVAARS 170
Query: 172 INSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCD 231
++S ++ A L++ + A A+PLI ++ +T+VA+V E+G LVG+ISP L +CD
Sbjct: 171 VSSLGLV-RADFLSVRPGEAALSAVPLIRRAVATETAVAVVAEDGHLVGEISPALLAACD 229
Query: 232 ETVAAAIATLLAGDLMAYMD-CGRPLKDLVRLVKQRLDEKNMVGLLELMEDD-LEISSGS 289
ET AAAIATL A DLMAY+D P + ++R VK L +K + LL L+ED+ L S
Sbjct: 230 ETAAAAIATLSAADLMAYIDHYVSPPEHILRAVKAGLKDKGLDALLALVEDETLSSFSSL 289
Query: 290 CSNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S SSSSDEE+ R SG Y R E +VC P SSL+AV++QALA R
Sbjct: 290 SSASSSSDEEAGRAQLRRPSSGSYGRRAAD--EPVVCSPASSLVAVLVQALAHR 341
>gi|226495341|ref|NP_001146653.1| uncharacterized protein LOC100280253 [Zea mays]
gi|219888199|gb|ACL54474.1| unknown [Zea mays]
Length = 375
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 117/293 (39%), Positives = 167/293 (56%), Gaps = 24/293 (8%)
Query: 69 SCIGKVCMVDIITFLC---------------PCFCSFAKD-SGIVRHLKPSASLLEAVDL 112
+ +G+V + D++ FLC P KD +G VR + P +S+LEA+D
Sbjct: 56 AVVGRVGLADVLCFLCTDPEALARPAVVFSKPVSALLPKDGAGEVRRVDPRSSILEALDA 115
Query: 113 LLGGVQNLVI-LPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQP 171
+L G Q L + L AG + + + ++CWLTQEDL+RYFLN IGL +
Sbjct: 116 VLSGAQVLAVPLRAGWR-KKQLGGGGGSAAAGDFCWLTQEDLVRYFLNSIGLFYHVAARS 174
Query: 172 INSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCD 231
++S ++ L++ + A A+PLI ++ +T+VA+V E+G L+G+ISP L +CD
Sbjct: 175 VSSLGLV-RTDFLSVRPGEAALSAVPLIRRAVATETAVAVVTEDGHLLGEISPALLAACD 233
Query: 232 ETVAAAIATLLAGDLMAYMD-CGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSC 290
ET AAAIATL A DLMAY+D G P + + R +K L +K + +L L+ED E S
Sbjct: 234 ETAAAAIATLSAADLMAYVDYFGSPPEHISRAIKAGLKDKGLDAMLALVED--ETLSSFS 291
Query: 291 SNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S SSSSDEE+ R SG Y R E +VC P SSL+AV++QALA R
Sbjct: 292 SASSSSDEEAGRTQLRRPSSGSYGRRSAE--EPVVCSPASSLVAVMVQALAHR 342
>gi|357146851|ref|XP_003574134.1| PREDICTED: CBS domain-containing protein CBSX5-like [Brachypodium
distachyon]
Length = 377
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 115/294 (39%), Positives = 166/294 (56%), Gaps = 25/294 (8%)
Query: 69 SCIGKVCMVDIITFLC---------------PCFCSFAKDS-GIVRHLKPSASLLEAVDL 112
+ +G+ + D++ LC P KD G VR + P +S+LEA+D
Sbjct: 57 AVVGRAGLADVLCLLCASPDALARPAAALAKPVSALLPKDGEGEVRRVDPRSSVLEALDA 116
Query: 113 LLGGVQNLVI--LPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQ 170
+L G Q L + G + + + + D +CWLTQEDL+RYFLN I L +
Sbjct: 117 VLNGAQVLAVPLRSGGGRKKQLGGVAAGVAGD--FCWLTQEDLVRYFLNSISLFYHVAAR 174
Query: 171 PINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSC 230
++S ++ A L++ D+ A A+PLI S +T+VA+V +G LVG+IS L +C
Sbjct: 175 SVSSLGLV-SADYLSVRPDEAALSAVPLIRASIAAETAVAVVSADGHLVGEISTAHLAAC 233
Query: 231 DETVAAAIATLLAGDLMAYMD-CGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGS 289
DET AAAIATL A DLMAY+D G P + ++R +K L K + +LELMED+ ++S S
Sbjct: 234 DETAAAAIATLSAADLMAYIDYFGSPPEHILRSIKAGLKAKGLDAMLELMEDE-TMTSFS 292
Query: 290 CSNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S+SSSSDE++ R SG + R E +VC P SSL+AV++QALA R
Sbjct: 293 FSSSSSSDEDTGRAHLRRPSSGSFGRRSTE--EPVVCSPASSLVAVMVQALAHR 344
>gi|168032312|ref|XP_001768663.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680162|gb|EDQ66601.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 108/353 (30%), Positives = 159/353 (45%), Gaps = 81/353 (22%)
Query: 70 CIGKVCMVDIITFLC--------------PCFCSFAKDSGIVRHLKPSASLLEAVDLLLG 115
C+GK+ MVDII FL P + V+H+ + L +A+ +
Sbjct: 101 CVGKLGMVDIICFLARDESLADQGAALRTPVSAIVPDSACSVQHVDSKSKLFDALAHVFD 160
Query: 116 GVQNLVI---------------LPAGIKLQPKPSL----------------KSTFHNDSE 144
GVQ+LV+ +P ++ P+ L K H E
Sbjct: 161 GVQHLVVSIDPSVLNRLARYNSMPKPKRVVPQTFLNAKQGKSLGSHLSLSPKIPPHVGQE 220
Query: 145 YCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHI 204
YCWLT ED++++ L+ IGL +P P I II+ A +L++ A A+PLI Q+
Sbjct: 221 YCWLTPEDILQFLLSCIGLFSPLPMMTIQQLGIINMA-VLSVKSTADAITALPLIQQAAR 279
Query: 205 KQTSVAIVDEEG------RLVGDISPFSLNSCDETVAAAIATLLAGDLMAY-MDCGRPLK 257
+ T+VA+V+ +G +LVG+ISPF++ SCDE A A++TL D +A+ DC P
Sbjct: 280 EMTAVAVVEADGENEEDLKLVGEISPFTMKSCDEKAALALSTLSVRDFLAFSRDCECPPN 339
Query: 258 DLVRLVKQRLDEK--NMVGLLELMEDDLEISSGSCSNSSSSDE----------------- 298
LV+LV+ R+ EK ++ L+ D S S S+S
Sbjct: 340 SLVQLVESRICEKLEHLQATESLLSDPDSPVSFPSSPSNSISSVGPLATIYSIDSGEESS 399
Query: 299 --------ESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
+S TG S S YS R +C PWSSL+AV+ QAL R
Sbjct: 400 SSDEDCIGQSPTGPFGSLHSPSYSYS-KGRLAPNMCRPWSSLVAVMAQALTHR 451
>gi|302800477|ref|XP_002981996.1| hypothetical protein SELMODRAFT_268522 [Selaginella moellendorffii]
gi|302808764|ref|XP_002986076.1| hypothetical protein SELMODRAFT_271819 [Selaginella moellendorffii]
gi|300146224|gb|EFJ12895.1| hypothetical protein SELMODRAFT_271819 [Selaginella moellendorffii]
gi|300150438|gb|EFJ17089.1| hypothetical protein SELMODRAFT_268522 [Selaginella moellendorffii]
Length = 427
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 116/379 (30%), Positives = 171/379 (45%), Gaps = 75/379 (19%)
Query: 28 ALLALKRLNESYINIW--SSDH--SARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFL 83
AL L+ L + I +W +SD AR RK C C+GKV +DI+ +
Sbjct: 32 ALCMLRDLGLAEITVWDCASDQRCEARIRKWECRE---------CQCVGKVNSLDILCYY 82
Query: 84 CP----------------CFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVI----- 122
+ AK S I RH+ A L +A +L+L G Q ++
Sbjct: 83 AAQDKVHSIEAAAKDPVSVLLTPAKRSQI-RHVDLHARLTDAFNLILDGAQCFIVPLDNR 141
Query: 123 LPAGIKLQPKPSLKST-----------FHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQP 171
+KL KST + ++CWLTQED++R+ L IG+ +P P
Sbjct: 142 RSKSLKLSSLALRKSTSTAAAAAVAIETYKRMDFCWLTQEDVLRFLLGCIGVFSPIPMMS 201
Query: 172 INSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIV--------DEEGRLVGDIS 223
I II D ++ + + A+ A+PL+ ++ + ++VA+V DE ++VGDIS
Sbjct: 202 IEELGII-DRDVMFVDANAKASDALPLMIRASQQMSAVAVVEADYSSDGDEGLKIVGDIS 260
Query: 224 PFSLNS-CDETVAAAIATLLAGDLMAYMD--CGRPLKDLVRLVKQRLDEKNMVGL-LELM 279
SL S CDET A A+A+L D M Y+ G P + L L+ L+ K L E +
Sbjct: 261 LASLGSCCDETAALALASLSVADFMTYVQDIAGAP-QSLKELILSGLEAKTSKDLSYERV 319
Query: 280 EDDLEI-------------SSGSCSNSSSSDEESSTGSARSAR--SGGYSARVVHRSEAI 324
+ + I S +S S DE +STGS R S + S +
Sbjct: 320 KLGMAIDSSFSDSDTSSSSSGSVLDSSLSDDETASTGSIPRGRTNSAKFQRACPPWSSPL 379
Query: 325 VCYPWSSLMAVIMQALARR 343
C+PWSSL+AV+ QALA R
Sbjct: 380 CCHPWSSLVAVMAQALAFR 398
>gi|413937961|gb|AFW72512.1| hypothetical protein ZEAMMB73_693015 [Zea mays]
Length = 414
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/273 (34%), Positives = 143/273 (52%), Gaps = 26/273 (9%)
Query: 95 GIVRHLKPSASLLEAVDLLLG-GVQNLVI-LPAGIKLQ--------PKPSLKSTFHNDSE 144
G+ + P LL+A+D+LL G Q L++ L A + + P + S+
Sbjct: 104 GVTHRVDPQTRLLDAIDVLLTDGCQGLLVPLSARTRKRHHQHQGQAPSSDAGALLATSSD 163
Query: 145 YCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHI 204
C LT+ED++R+ I +P + S ++ + A+H DD IPL+ ++
Sbjct: 164 CCVLTREDIVRHLFGSISHFSPVAALTVASLGLVRRDVVHAVHVDDDGLDVIPLLRRAVS 223
Query: 205 KQTSVAIVDEEGRLVGDISPFSLNSCD-ETVAAAIATLLAGDLMAYMDCG----RPLKDL 259
T+VA+V ++ LVG+I P L SCD E V+AA A L AGD MAY+DC P + L
Sbjct: 224 DGTAVAVVADDDALVGEICPSVLASCDVEAVSAAFAGLSAGDTMAYIDCSLSSHSPPEFL 283
Query: 260 VRLVKQRLDEKNMVGLLELME---DDLEI---SSGSCSNSSSSDEESSTGSARSAR---S 310
VR ++ +L K + + EL+E D + S S +S+SSDE+S +G AR R S
Sbjct: 284 VRAIRAQLAGKGLDAMAELVECAGKDTAVLPLSPSSSLSSTSSDEDSPSGRARRPRRMSS 343
Query: 311 GGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
G + R + + C+ SSL+AV+ QALA R
Sbjct: 344 GSFGWRSTE--DVVACHSESSLVAVMAQALAHR 374
>gi|90398987|emb|CAJ86259.1| H0801D08.17 [Oryza sativa Indica Group]
gi|125550246|gb|EAY96068.1| hypothetical protein OsI_17941 [Oryza sativa Indica Group]
Length = 392
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 99/298 (33%), Positives = 147/298 (49%), Gaps = 34/298 (11%)
Query: 72 GKVCMVDIITFLCPCFCS------------------FAKDSGIVRHLKPSASLLEAVDLL 113
G+VCM D+ FLC A + VR ++P AS++EAVD
Sbjct: 58 GRVCMADVHLFLCGGDGEAASLASPAAALQATLSDLLAAGAPPVRRIEPHASVVEAVDAF 117
Query: 114 LGGVQNLVILPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPIN 173
L G LV+ I+ + + + + CWLT ED++R+F+ IGL PT + ++
Sbjct: 118 LDGAHCLVV---PIR-ERRRRAAAAAAGEMCMCWLTVEDVVRFFVGCIGLFAPTASLSVS 173
Query: 174 SHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEG---RLVGDISPFSLNSC 230
I+ +A LA+ D A +PL+ + +SVA++ G RL G++SP +L SC
Sbjct: 174 QLGIVREA-TLAVAAGDRALSTVPLLRAALATHSSVAVITGAGIAPRLAGEVSPSALCSC 232
Query: 231 DETVAAAIATLLAGDLMAYMD-----CGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEI 285
D +VAAAIA L AGDL A++ C R L +V L+ D +
Sbjct: 233 DVSVAAAIAALSAGDLTAFLHRSDLRCRRNLPGMVDLLYAG-DPSSWPPSPSSSSSSSSS 291
Query: 286 SSGSCSNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
SS S SSSS++E+ G A + AR + + I C+P SSL+AV+ QA+A R
Sbjct: 292 SSSLSSFSSSSEDEAEDGYKHYAPA--PCARRDNNRQIIACHPGSSLVAVMAQAVAHR 347
>gi|357142952|ref|XP_003572749.1| PREDICTED: CBS domain-containing protein CBSX5-like [Brachypodium
distachyon]
Length = 423
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 104/197 (52%), Gaps = 21/197 (10%)
Query: 93 DSGIVRHLKPSASLLEAVDLLLGG-VQNLVILPAGIKLQPKPSLKSTFHN---------- 141
D + R + SLL+A+D+LL +LV+ G + K H+
Sbjct: 105 DHAVTRRVDAQTSLLDAIDVLLASNAHSLVVALHG---HARAGRKQKHHHLLLHNVSSSA 161
Query: 142 -DSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNII--DDAGILAIHYDDPAAFAIPL 198
D+ YC LTQED++R+ I L P ++S ++ DA A+H D A AIPL
Sbjct: 162 ADASYCVLTQEDIVRHLFGSISLFAPVACLTVSSLGLVRGRDAA-HAVHVDADALDAIPL 220
Query: 199 IAQSHIKQT-SVAIVDEEGRLVGDISPFSLNSCD-ETVAAAIATLLAGDLMAYMDC-GRP 255
+ +S T +VA+V + LVG+I P L +CD E+V+AA A L AGD+M Y+DC P
Sbjct: 221 LRRSMAHCTAAVAVVADGDALVGEICPGVLGACDVESVSAAFAALSAGDVMTYIDCYSSP 280
Query: 256 LKDLVRLVKQRLDEKNM 272
+ L+R ++ +L ++ +
Sbjct: 281 PEFLLRSIRAQLRDRGL 297
>gi|297603581|ref|NP_001054281.2| Os04g0679600 [Oryza sativa Japonica Group]
gi|32487397|emb|CAE05731.1| OSJNBb0017I01.11 [Oryza sativa Japonica Group]
gi|255675890|dbj|BAF16195.2| Os04g0679600 [Oryza sativa Japonica Group]
Length = 398
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 100/300 (33%), Positives = 139/300 (46%), Gaps = 39/300 (13%)
Query: 72 GKVCMVDIITFLCPCFCS------------------FAKDSGIVRHLKPSASLLEAVDLL 113
G+VCM D+ FLC A + VR ++P AS++EAVD
Sbjct: 58 GRVCMADVHLFLCGGDGEAASLASPAAALQATLSDLLAAGAPPVRRIEPHASVVEAVDAF 117
Query: 114 LGGVQNLVILPAGIKLQPKPSLKSTFHNDSEYC--WLTQEDLIRYFLNFIGLLNPTPNQP 171
L G LV+ P E C WLT ED++R+F+ IGL PT
Sbjct: 118 LDGAHCLVV--------PIRERWRRAAAAGEMCMCWLTVEDVVRFFVGCIGLFAPTGLAL 169
Query: 172 INSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEG---RLVGDISPFSLN 228
+ I G + D A A+PL++ + +SVA++ G RL G++SP +L
Sbjct: 170 RFLSSGISPRGHAPVAAGDRALSAVPLLSAALATHSSVAVITGAGIAPRLAGEVSPSALC 229
Query: 229 SCDETVAAAIATLLAGDLMAYMD-----CGRPLKDLVRLVKQRLDEKNMVGLLELMEDDL 283
SCD +VAAAIA L AGDL A++ C R L +V L+ D +
Sbjct: 230 SCDVSVAAAIAALSAGDLTAFLHRSDLRCRRNLPGMVDLLYAG-DPSSWPPSPSSSSSSS 288
Query: 284 EISSGSCSNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
SS S SSSSD+E+ G A + AR + + I C+P SSL+AV+ QA+A R
Sbjct: 289 SSSSSLSSFSSSSDDEAEDGYKHYAPA--PCARRDNNRQIIACHPGSSLVAVMAQAVAHR 346
>gi|242062444|ref|XP_002452511.1| hypothetical protein SORBIDRAFT_04g027220 [Sorghum bicolor]
gi|241932342|gb|EES05487.1| hypothetical protein SORBIDRAFT_04g027220 [Sorghum bicolor]
Length = 408
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 95/262 (36%), Positives = 136/262 (51%), Gaps = 22/262 (8%)
Query: 100 LKPSASLLEAVDLLLG-GVQNLVI-LPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYF 157
+ P LL+A+D+LL G Q L++ LPA + S+ D C LT+ED++R+
Sbjct: 111 VDPQTRLLDAIDVLLTDGCQGLLVPLPAARARKRHHQAPSSDAADC-CCVLTREDIVRHL 169
Query: 158 LNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIV-DEEG 216
I P + S ++ + A+H DD IPL+ ++ T+VA+V D++
Sbjct: 170 FGSISHFAPVAALTVTSLGLVRR-DVHAVHVDDDGLDVIPLLRRAVSDGTAVAVVADDDC 228
Query: 217 RLVGDISPFSLNSCD-ETVAAAIATLLAGDLMAYMDCG---RPLKDLVRLVKQRLDEKNM 272
LVG+I P L SCD ETV+AA A L A D MAY+DC P + LVR ++ +L K +
Sbjct: 229 ALVGEICPGVLASCDVETVSAAFAALSAADTMAYIDCSLSHSPPEFLVRAIRAQLAGKGL 288
Query: 273 VGLLELME--------DDLEISSGSCSNSSSSDEESSTGSARSAR---SGGYSARVVHRS 321
+ ELME L SS S SSS ++ S G AR R SG + R
Sbjct: 289 EAMAELMECAGNDAASIPLSSSSSLSSTSSSDEDSPSLGRARRPRRMSSGSFGWR--STE 346
Query: 322 EAIVCYPWSSLMAVIMQALARR 343
+ + C+ SSL+AV+ QALA R
Sbjct: 347 DVVACHSGSSLVAVMAQALAHR 368
>gi|168047643|ref|XP_001776279.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672374|gb|EDQ58912.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 641
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/253 (30%), Positives = 118/253 (46%), Gaps = 53/253 (20%)
Query: 70 CIGKVCMVDIITFLC--------------PCFCSFAKDSGIVRHLKPSASLLEAVDLLLG 115
C+GK+ MVDI FL P ++ + + H+ + L +A+ +L
Sbjct: 255 CVGKLSMVDITCFLARDESLADPSAALRTPVSAIVSESAFTIVHVDSKSKLFDALAHVLD 314
Query: 116 GVQNLVI---------------LPAGIK------LQPKPS--LKSTF--------HNDSE 144
GV +LV+ +P + L+P L S F E
Sbjct: 315 GVHHLVVSIDQSVSNRLARYNSMPRSARVAHHAILKPNEGKFLDSRFSLPTKIQAEGPQE 374
Query: 145 YCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHI 204
YCWLT ED++++ L+ I L +P P I II+ +L++ D A+PLI Q+
Sbjct: 375 YCWLTPEDILQFLLSCISLFSPLPMMTIQQLGIIN-MDVLSVKSDADVITALPLIQQAAR 433
Query: 205 KQTSVAIVDE------EGRLVGDISPFSLNSCDETVAAAIATLLAGDLMAYM-DCGRPLK 257
T+VA+V+ + +LVG+ISPF++ CDE A A+ATL D +A+ DC P
Sbjct: 434 NMTAVAVVEVEEENVVDLKLVGEISPFTMKGCDEKAALALATLSVRDFLAFSCDCECPPN 493
Query: 258 DLVRLVKQRLDEK 270
LV V+ R+ EK
Sbjct: 494 SLVERVESRICEK 506
>gi|242039205|ref|XP_002466997.1| hypothetical protein SORBIDRAFT_01g018095 [Sorghum bicolor]
gi|241920851|gb|EER93995.1| hypothetical protein SORBIDRAFT_01g018095 [Sorghum bicolor]
Length = 138
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/133 (38%), Positives = 77/133 (57%), Gaps = 6/133 (4%)
Query: 105 SLLEAVDLLLGGVQNLVI-LPAGIK----LQPKPSLKSTFHNDSEYCWLTQEDLIRYFLN 159
S+LEA+D +L G Q L + L AG + + ++CWLTQEDL+RYFLN
Sbjct: 7 SILEALDAVLSGAQVLAVPLRAGGRRKQLVGGGGGGGGGGAAAGDFCWLTQEDLVRYFLN 66
Query: 160 FIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLV 219
IGL + ++S ++ L++ + A A+PLI ++ +T+VA+V E+G LV
Sbjct: 67 SIGLFYHVAARSVSSLGLVR-TDYLSVRPGESALSAVPLIRRAVATETAVAVVTEDGHLV 125
Query: 220 GDISPFSLNSCDE 232
G+ISP L +CDE
Sbjct: 126 GEISPALLAACDE 138
>gi|302791375|ref|XP_002977454.1| hypothetical protein SELMODRAFT_443590 [Selaginella moellendorffii]
gi|300154824|gb|EFJ21458.1| hypothetical protein SELMODRAFT_443590 [Selaginella moellendorffii]
Length = 446
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 94/325 (28%), Positives = 156/325 (48%), Gaps = 47/325 (14%)
Query: 64 DSAGCSCIGKVCMVDII-------TFLCPCFCSFAKD-SGIV-----RHLKPSASLLEAV 110
D+ +C+G V +D++ T LC + A+ +G+V + + L +A+
Sbjct: 48 DANLANCLGVVNNLDVLCFLAADHTLLCDLEAALARPIAGLVHRSWIQRVDLHERLSKAL 107
Query: 111 DLLLGGVQNLVI-LPA------GIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGL 163
+L++ GVQ L++ LP ++ + S S+ E CW++QE ++R+ ++ I
Sbjct: 108 ELVIKGVQYLIVPLPKRSRSTRAMEFERNSSSGSSRWPRKEVCWISQEAVMRFLMSCIAA 167
Query: 164 LNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIV----------- 212
P P I II+ + +I ++DPA AI +I Q+ ++VAIV
Sbjct: 168 FCPLPLFTIEELGIINRS-FASIQHEDPAIEAIQIIHQASQAMSAVAIVVPATPSPSPSS 226
Query: 213 -------DEEGRLVGDISPFSLNSCDETV---AAAIATLLAGDLMAYMDCG--RPLKDLV 260
+ +LVGDIS +L DE AAA+ATL A D ++Y+ R K L
Sbjct: 227 ASSTTENQRKLKLVGDISSSTLRRHDENFAAVAAALATLSAADFLSYVRGSNKRSSKILG 286
Query: 261 RLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGYSARVVHR 320
+L++ RL +K + D+ ++ +G S EESS S RSGG ++ +
Sbjct: 287 KLIETRLAQKVAAAAAAAVADEDQV-AGKPPTPLDSCEESSDSEQDSHRSGGLTSSFQNS 345
Query: 321 SE--AIVCYPWSSLMAVIMQALARR 343
+ C PWSSL AV+ QAL+ +
Sbjct: 346 GSRGCLTCRPWSSLAAVMAQALSHQ 370
>gi|242077698|ref|XP_002448785.1| hypothetical protein SORBIDRAFT_06g033100 [Sorghum bicolor]
gi|241939968|gb|EES13113.1| hypothetical protein SORBIDRAFT_06g033100 [Sorghum bicolor]
Length = 407
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 96/184 (52%), Gaps = 18/184 (9%)
Query: 105 SLLEAVDLLLGGVQNLVILPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLL 164
++LEAVD LGG L + P + + CWLT ED++R+ L+ +G+
Sbjct: 104 TVLEAVDAFLGGAHTLAV--------PIRERWRAPADRGKLCWLTVEDVVRFLLSSVGVF 155
Query: 165 NPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGR------- 217
+ T ++ ++ + A LA+ D A A+PLI + SVA+V G
Sbjct: 156 SATASRSVSELGAVRPAA-LAVAAGDSALTAVPLIRAALASHASVAVVAGTGTGTGFPAR 214
Query: 218 -LVGDISPFSLNSCDETVAAAIATLLAGDLMAYMDCG-RPLKDLVRLVKQRLDEKNMVGL 275
LVG+ISP +L S AAIA L AG L++++D G P + +V+ RL +N++G+
Sbjct: 215 CLVGEISPSALCSGGVATVAAIAALSAGQLVSFLDWGGAPPAATLHIVRSRLRRRNLLGM 274
Query: 276 LELM 279
L+L+
Sbjct: 275 LDLL 278
>gi|302786526|ref|XP_002975034.1| hypothetical protein SELMODRAFT_442750 [Selaginella moellendorffii]
gi|300157193|gb|EFJ23819.1| hypothetical protein SELMODRAFT_442750 [Selaginella moellendorffii]
Length = 446
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 91/338 (26%), Positives = 157/338 (46%), Gaps = 47/338 (13%)
Query: 64 DSAGCSCIGKVCMVDII-------TFLCPCFCSFAKD-SGIV-----RHLKPSASLLEAV 110
D+ +C+G V +D++ T LC + A+ +G+V + + L +A+
Sbjct: 48 DANLANCLGVVNNLDVLCFLAADHTLLCDLEAALARPIAGLVHRSWIQRVDLHERLSKAL 107
Query: 111 DLLLGGVQNLVI-LPA------GIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGL 163
+L++ GVQ L++ LP ++ + S S+ E CW++QE ++R+ ++ I
Sbjct: 108 ELVIKGVQYLIVPLPKRSRSTRAMEFERNSSSASSRWPCKEVCWISQEAVMRFLMSCIAA 167
Query: 164 LNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIV----------- 212
P P I II+ + +I ++ PA AI +I Q+ ++VAIV
Sbjct: 168 FCPLPLFTIEELGIINRS-FASIQHEAPAIEAIQIIHQASQAMSAVAIVVPATPSPSPSS 226
Query: 213 -------DEEGRLVGDISPFSLNSCDETVAAAIATLLAG---DLMAYMDCG--RPLKDLV 260
+ +L+GDIS +L DE AA A L D ++Y+ R K L
Sbjct: 227 ASSTTENQRKLKLLGDISSSTLRRHDENFAAVAAALATLSAADFLSYVRGSNKRSSKILG 286
Query: 261 RLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGYSARVVHR 320
+L++ RL +K + D+ +++ G S EESS S RSGG ++ +
Sbjct: 287 KLIETRLAQKVAAAAAVAVADEDQVA-GKPPTPLDSCEESSDSEQDSHRSGGLTSSFQNS 345
Query: 321 SE--AIVCYPWSSLMAVIMQALARRDRLRSMAKAENPN 356
+ C PWSSL AV+ QAL+ + + ++E+ N
Sbjct: 346 GSRGCLTCRPWSSLAAVMAQALSHQCGYIWVTESESEN 383
>gi|296084704|emb|CBI25846.3| unnamed protein product [Vitis vinifera]
Length = 131
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 44/107 (41%), Positives = 65/107 (60%), Gaps = 23/107 (21%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC--- 84
AL ALKR ++Y+++WS DH+++ K+ +++D C CIGK+CMVD++ FLC
Sbjct: 32 ALAALKRSGDAYLSVWSCDHTSKINKS---HLED------CRCIGKICMVDVVCFLCRED 82
Query: 85 --PC---------FCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNL 120
C K G+VRHLKP++ LLEA+DL+L G QN+
Sbjct: 83 NLSCPSDALQSPLSLLLPKVPGLVRHLKPNSRLLEAIDLMLEGAQNI 129
>gi|222629779|gb|EEE61911.1| hypothetical protein OsJ_16641 [Oryza sativa Japonica Group]
Length = 335
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 123/269 (45%), Gaps = 31/269 (11%)
Query: 90 FAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPKPSLKSTFHNDSEYC--W 147
A + VR ++P AS++EAVD L G LV+ P E C W
Sbjct: 38 LAAGAPPVRRIEPHASVVEAVDAFLDGAHCLVV--------PIRERWRRAAAAGEMCMCW 89
Query: 148 LTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQT 207
LT ED++R+F+ IGL PT + ++ I+ +A LA+ D A A+PL++ + +
Sbjct: 90 LTVEDVVRFFVGCIGLFAPTASLSVSQLGIVREA-TLAVAAGDRALSAVPLLSAALATHS 148
Query: 208 SVAIVDEEGRLVGDISPFSLNSCDETVAA--------AIATLLAGDLMAYMD-----CGR 254
SVA++ GR P S C AIA L AGDL A++ C R
Sbjct: 149 SVAVI--TGRR--HRPPASPARCRRRRFGPGTYPSRRAIAALSAGDLTAFLHRSDLRCRR 204
Query: 255 PLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGYS 314
L +V L+ D + SS S SSSSD+E+ G A +
Sbjct: 205 NLPGMVDLLYAG-DPSSWPPSPSSSSSSSSSSSSLSSFSSSSDDEAEDGYKHYAPAP--C 261
Query: 315 ARVVHRSEAIVCYPWSSLMAVIMQALARR 343
AR + + I C+P SSL+AV+ QA+A R
Sbjct: 262 ARRDNNRQIIACHPGSSLVAVMAQAVAHR 290
>gi|224152524|ref|XP_002337247.1| predicted protein [Populus trichocarpa]
gi|222838552|gb|EEE76917.1| predicted protein [Populus trichocarpa]
Length = 145
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 64/111 (57%), Gaps = 15/111 (13%)
Query: 27 HALLALKRLNESYINIWSSDHSARKRKAAAANIDDHE-DSAGCSCIGKVCMVDIITFLC- 84
AL ALK ++++I++W+ DH+A+ N ++ D C C+GKV MVD++ +LC
Sbjct: 31 EALFALKNSDDNFISVWNCDHAAKTNNDYKGNCEEEGCDVCECKCVGKVSMVDVVCYLCK 90
Query: 85 -------------PCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVI 122
P + G+V H++P++SLLEA+DL+L G +NLV+
Sbjct: 91 DENLLFPSDALKAPVSVLLPEIPGMVVHVEPTSSLLEAIDLILQGAKNLVV 141
>gi|125540443|gb|EAY86838.1| hypothetical protein OsI_08221 [Oryza sativa Indica Group]
Length = 417
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/183 (32%), Positives = 98/183 (53%), Gaps = 21/183 (11%)
Query: 105 SLLEAVDLLLGGVQNLVILPAGIKLQPKPSLKSTFHN-------------DSEYCWLTQE 151
SLL+A+D LL +++P L + H+ ++YC LT+E
Sbjct: 123 SLLDAIDALLSNDAQTLLVP----LHAHAARSRKHHHVHVSGCSPANPAAATDYCVLTRE 178
Query: 152 DLIRYFLNF-IGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVA 210
D++R+ ++ I L P + + S ++ + A+H DD A AIPL+ +S T+VA
Sbjct: 179 DIVRHLFSYSISLFAPVAARTVASLGLVRR-DVHAVHADDDALDAIPLLRRSIADGTAVA 237
Query: 211 IVDEEGRLVGDISPFSLNSCD-ETVAAAIATLLAGDLMAYMDCG-RPLKDLVRLVKQRLD 268
+V ++ LVG+I P L SCD E+ +AA A L AGD+M Y+DC P + L+R ++ +L
Sbjct: 238 VVADDDALVGEICPGVLGSCDIESASAAFAALSAGDVMTYIDCSLSPPEFLLRSIRAQLK 297
Query: 269 EKN 271
+
Sbjct: 298 GRG 300
>gi|115447517|ref|NP_001047538.1| Os02g0639300 [Oryza sativa Japonica Group]
gi|113537069|dbj|BAF09452.1| Os02g0639300, partial [Oryza sativa Japonica Group]
Length = 304
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/183 (32%), Positives = 98/183 (53%), Gaps = 21/183 (11%)
Query: 105 SLLEAVDLLLGGVQNLVILPAGIKLQPKPSLKSTFHN-------------DSEYCWLTQE 151
SLL+A+D LL +++P L + H+ ++YC LT+E
Sbjct: 10 SLLDAIDALLSNDAQTLLVP----LHAHAARSRKHHHVHVSGCSPANPAAATDYCVLTRE 65
Query: 152 DLIRYFLNF-IGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVA 210
D++R+ ++ I L P + + S ++ + A+H DD A AIPL+ +S T+VA
Sbjct: 66 DIVRHLFSYSISLFAPVAARTVASLGLVR-RDVHAVHADDDALDAIPLLRRSIADGTAVA 124
Query: 211 IVDEEGRLVGDISPFSLNSCD-ETVAAAIATLLAGDLMAYMDCG-RPLKDLVRLVKQRLD 268
+V ++ LVG+I P L SCD E+ +AA A L AGD+M Y+DC P + L+R ++ +L
Sbjct: 125 VVADDDALVGEICPGVLGSCDIESASAAFAALSAGDVMTYIDCSLSPPEFLLRSIRAQLK 184
Query: 269 EKN 271
+
Sbjct: 185 GRG 187
>gi|242059697|ref|XP_002458994.1| hypothetical protein SORBIDRAFT_03g043980 [Sorghum bicolor]
gi|241930969|gb|EES04114.1| hypothetical protein SORBIDRAFT_03g043980 [Sorghum bicolor]
Length = 380
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 129/292 (44%), Gaps = 24/292 (8%)
Query: 93 DSGIVRHLKPSASLLEAVDLLLGGVQNLVILPAGIKLQPKPSLKSTFHNDSEYCWLTQED 152
+ ++R + P L++A++L+ GV+ ++ +G + S ND ++C L++ED
Sbjct: 96 NPALLREIDPGTRLIDALELMRHGVKRFLVRKSG-SWKGITKRFSVLFND-KFCCLSRED 153
Query: 153 LIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIV 212
++R+ + +G L P P I+S I+ Y + +A A+ I + +VA+V
Sbjct: 154 VLRFLIGCLGALAPIPLTQISSLGAINP----QYSYVEASAPAMEAIQKIPQDPCAVAVV 209
Query: 213 ----DEEGRLVGDISPFSLNSCDETVAA-AIATLLAGDLMAYMD--CGRPLKDLVRLVKQ 265
D +++GDIS + L CD AA A+A L AG + D P+
Sbjct: 210 ETAPDGTRKILGDISTYKLWKCDYVSAAWALANLSAGQFVLGADENGSMPISVFPEPPIS 269
Query: 266 RLD--EKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGYSARVVHRSEA 323
E+ G + S G SN+ +S + S RSA G RS
Sbjct: 270 PSSPVEEISPGRSPRAKKFSSRSIGFLSNTQAS--QMSAWRTRSAYHRG-------RSTP 320
Query: 324 IVCYPWSSLMAVIMQALARRDRLRSMAKAENPNFCPVRIQAKKQQLSRAVRS 375
++C S+L AV+ Q L+ R + AE+ + V + + + A RS
Sbjct: 321 LMCKTTSTLAAVMAQMLSHRATHVWVTDAESEDGVLVGVVGYTEIFNAATRS 372
>gi|297598199|ref|NP_001045216.2| Os01g0920000 [Oryza sativa Japonica Group]
gi|57899420|dbj|BAD88358.1| CBS domain containing protein-like [Oryza sativa Japonica Group]
gi|57899850|dbj|BAD87634.1| CBS domain containing protein-like [Oryza sativa Japonica Group]
gi|255674004|dbj|BAF07130.2| Os01g0920000 [Oryza sativa Japonica Group]
Length = 384
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 77/328 (23%), Positives = 131/328 (39%), Gaps = 64/328 (19%)
Query: 63 EDSAGCSCIGKVCMVDIITFLCPCFCS-----------FAKDSGIVRHLKPSASLLEAVD 111
E +G +G + +DI TF+ + G++R + P L++A+D
Sbjct: 35 EPPSGARFLGMISALDIATFVAASGVGDRAMAAVVGEVVQPNPGLLREVDPGTRLIDALD 94
Query: 112 LLLGGVQNLVILPAGIKLQPKPSLKSTFHND--------------------------SEY 145
L+ GV+ ++ G ++ ++
Sbjct: 95 LMKQGVKRFLVRKNGAWRGISKRFSVLYNGKWLKNMEATSPTSASSSRELSSSTSSTYKF 154
Query: 146 CWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHY--DDPAAFAIPLIAQSH 203
C L++ED++R+ + +G L P P PI+S I+ HY D + A+ I +
Sbjct: 155 CCLSREDILRFLIGCLGALAPIPLSPISSLGAINP------HYCHVDASVPAMEAIQKVP 208
Query: 204 IKQTSVAIV----DEEGRLVGDISPFSLNSCDETVAA-AIATLLAGDLMAYMDCGR--PL 256
++VA+V D +++GDIS + L CD AA A+ L AG + D P+
Sbjct: 209 PDPSAVAVVETTPDGTRKILGDISAYKLWKCDYVAAAWALINLSAGQFVIGADDNESTPI 268
Query: 257 KDL-VRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGYSA 315
+ V + L E+ G + + SS S +S + + G RS G
Sbjct: 269 SAIPVPPISSSLVEEIGPGRSPRAK---KFSSRSIGFLNSQAHQMAFGRMRSMYRG---- 321
Query: 316 RVVHRSEAIVCYPWSSLMAVIMQALARR 343
RS ++C SSL AV+ Q L+ R
Sbjct: 322 ----RSAPLMCKSTSSLAAVMAQMLSHR 345
>gi|125573128|gb|EAZ14643.1| hypothetical protein OsJ_04567 [Oryza sativa Japonica Group]
Length = 404
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 77/328 (23%), Positives = 131/328 (39%), Gaps = 64/328 (19%)
Query: 63 EDSAGCSCIGKVCMVDIITFLCPCFCS-----------FAKDSGIVRHLKPSASLLEAVD 111
E +G +G + +DI TF+ + G++R + P L++A+D
Sbjct: 55 EPPSGARFLGMISALDIATFVAASGVGDRAMAAVVGEVVQPNPGLLREVDPGTRLIDALD 114
Query: 112 LLLGGVQNLVILPAGIKLQPKPSLKSTFHND--------------------------SEY 145
L+ GV+ ++ G ++ ++
Sbjct: 115 LMKQGVKRFLVRKNGAWRGISKRFSVLYNGKWLKNMEATSPTSASSSRELSSSTSSTYKF 174
Query: 146 CWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHY--DDPAAFAIPLIAQSH 203
C L++ED++R+ + +G L P P PI+S I+ HY D + A+ I +
Sbjct: 175 CCLSREDILRFLIGCLGALAPIPLSPISSLGAINP------HYCHVDASVPAMEAIQKVP 228
Query: 204 IKQTSVAIV----DEEGRLVGDISPFSLNSCDETVAA-AIATLLAGDLMAYMDCGR--PL 256
++VA+V D +++GDIS + L CD AA A+ L AG + D P+
Sbjct: 229 PDPSAVAVVETTPDGTRKILGDISAYKLWKCDYVAAAWALINLSAGQFVIGADDNESTPI 288
Query: 257 KDL-VRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGYSA 315
+ V + L E+ G + + SS S +S + + G RS G
Sbjct: 289 SAIPVPPISSSLVEEIGPGRSPRAK---KFSSRSIGFLNSQAHQMAFGRMRSMYRG---- 341
Query: 316 RVVHRSEAIVCYPWSSLMAVIMQALARR 343
RS ++C SSL AV+ Q L+ R
Sbjct: 342 ----RSAPLMCKSTSSLAAVMAQMLSHR 365
>gi|219888727|gb|ACL54738.1| unknown [Zea mays]
gi|414878989|tpg|DAA56120.1| TPA: hypothetical protein ZEAMMB73_423536 [Zea mays]
Length = 378
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 90/199 (45%), Gaps = 16/199 (8%)
Query: 66 AGCSCIGKVCMVDIITFLCPCFCS-----------FAKDSGIVRHLKPSASLLEAVDLLL 114
+G IG + +DI F+ + ++R + P L++A++L+
Sbjct: 58 SGARFIGMISALDIAAFVASAGVGDRSMRAVVGEVVQPNPALLREIDPGTRLIDALELMR 117
Query: 115 GGVQNLVILPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINS 174
GV+ ++ +G + S +N+ ++C L++ED++R+ + +G L P P I+S
Sbjct: 118 HGVKRFLVRKSG-SWKGITKRFSVLYNE-KFCCLSREDVLRFLIGCLGALAPIPLTQISS 175
Query: 175 HNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGR-LVGDISPFSLNSCDET 233
I+ + PA AI I Q V + + R ++GDIS + L CD
Sbjct: 176 LGAINPQ-YSYVEASAPAMEAIQKIPQDPCAVAVVETMPDGTRSILGDISTYKLWKCDYV 234
Query: 234 VAA-AIATLLAGDLMAYMD 251
AA A+A L AG + D
Sbjct: 235 SAAWALANLSAGQFVIGAD 253
>gi|357131565|ref|XP_003567407.1| PREDICTED: CBS domain-containing protein CBSX6-like [Brachypodium
distachyon]
Length = 367
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 74/300 (24%), Positives = 117/300 (39%), Gaps = 40/300 (13%)
Query: 63 EDSAGCSCIGKVCMVDIITFLCPCFCS------------FAKDSGIVRHLKPSASLLEAV 110
E +G +G + VDI F+ + ++R + P L++A+
Sbjct: 55 EPPSGARFVGMISAVDIAAFVATAADGDRAMREAAVGEVVQPNPELLREVDPGTRLIDAL 114
Query: 111 DLLLGGVQNLVILPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQ 170
+L+ GV+ L++ G + D +C L +ED++R+ + +G L P P
Sbjct: 115 ELMRNGVKRLLLRKNGSWTGLTKRFSMLY--DDRFCCLAREDILRFLIGCLGALAPIPLS 172
Query: 171 PINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIV----DEEGRLVGDISPFS 226
I + I + + PA AI I + VA+V D +++GDIS +
Sbjct: 173 RICTLGAI-NPNYCHVEASAPAMEAIQKIPRD---PCGVAVVETMPDGVRKIIGDISAYK 228
Query: 227 LNSCDETVAA-AIATLLAGDLMAYMD--CGRPLKDLVRLVKQRLDEKNMVGLLELMEDDL 283
L CD AA A+A L AG + D P+ + L ++E E
Sbjct: 229 LWKCDYVAAAWALANLSAGQFVIGADENGSTPISAFLELPINS-------SIVEEAEPGR 281
Query: 284 EISSGSCSNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
S+ S S ARS G RS + C SSL AV+ Q L+ R
Sbjct: 282 SPRLKKFSSRSIGFLNSQANQARSMYRG--------RSAPLTCRSTSSLAAVMAQMLSHR 333
>gi|125528887|gb|EAY77001.1| hypothetical protein OsI_04957 [Oryza sativa Indica Group]
Length = 404
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 132/329 (40%), Gaps = 66/329 (20%)
Query: 63 EDSAGCSCIGKVCMVDIITFLCPCFCS-----------FAKDSGIVRHLKPSASLLEAVD 111
E +G +G + +DI F+ + G++R + P L++A+D
Sbjct: 55 EPPSGARFLGMISALDIAAFVAASGVGDRAMAAVVGEVVQPNPGLLREVDPGTRLIDALD 114
Query: 112 LLLGGVQNLVILPAGIKLQPKPSLKSTFHNDS---------------------------E 144
L+ GV+ ++ G + S +N +
Sbjct: 115 LMKQGVKRFLVRKNG-AWRGISKRFSVLYNGKWLKNMEATSPTSASSSRELSSSTSSTYK 173
Query: 145 YCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHY--DDPAAFAIPLIAQS 202
+C L++ED++R+ + +G L P P PI+S I+ HY D + A+ I +
Sbjct: 174 FCCLSREDILRFLIGCLGALAPIPLSPISSLGAINP------HYCHVDASVPAMEAIQKV 227
Query: 203 HIKQTSVAIV----DEEGRLVGDISPFSLNSCDETVAA-AIATLLAGDLMAYMDCGR--P 255
++VA+V D +++GDIS + L CD AA A+ L AG + D P
Sbjct: 228 PPDPSAVAVVETTPDGTRKILGDISAYKLWKCDYVAAAWALINLSAGQFVIGADDNESTP 287
Query: 256 LKDL-VRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGYS 314
+ + V + L E+ G + + SS S +S + + G RS G
Sbjct: 288 ISAIPVPPISSSLVEEIGPGRSPRAK---KFSSRSIGFLNSQAHQMAFGRMRSMYRG--- 341
Query: 315 ARVVHRSEAIVCYPWSSLMAVIMQALARR 343
RS ++C SSL AV+ Q L+ R
Sbjct: 342 -----RSAPLMCKSTSSLAAVMAQMLSHR 365
>gi|356531916|ref|XP_003534522.1| PREDICTED: CBS domain-containing protein CBSX6-like [Glycine max]
Length = 425
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 80/371 (21%), Positives = 146/371 (39%), Gaps = 83/371 (22%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLCPCF 87
A+ A+ +E I IW +KR ++ D +G + DI+ FL
Sbjct: 32 AIRAIGECHEGTIPIW------KKRSQLGI---ENSDMRQQRFVGILSSFDIVAFLAKSQ 82
Query: 88 CSFAKD--------------SGIVRHLKPSASLLEAVDLLLGGVQNLVI----------- 122
C +D + ++R + P+ L++A+D++ GV+ L++
Sbjct: 83 CLEDQDKALKTPVSEVVVHNNSLLRVVDPATRLIDALDMMKQGVKRLLVPKSVAWKGMSK 142
Query: 123 -----------------------LPAGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLN 159
LP + P S+ YC L++ED++R+ +
Sbjct: 143 RFSVIYYGKWLKNSESPGNSSNNLPLNMNRSPSTSITPI---RDRYCCLSREDVLRFIIG 199
Query: 160 FIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIV----DEE 215
+G L P P I S I+ +Y + + AI + ++VA++ D +
Sbjct: 200 CLGALAPLPLTSIASLGAINS----NYNYIESSTPAIEATQKLPQDPSAVAVIESTSDGQ 255
Query: 216 GRLVGDISPFSLNSCDETVAA-AIATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVG 274
+++G+IS L CD AA A+A L AG + ++ + L +
Sbjct: 256 CKIIGEISACKLWKCDYLSAAWALANLSAGQFVMGVEDNVTPRSLPQFS----------- 304
Query: 275 LLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGYSARVVH--RSEAIVCYPWSSL 332
L+L + ++++G S S G + S + +R ++ RS + C SSL
Sbjct: 305 -LDLASGENDLANGGGSRKPRKFSSRSVGFFSNTASHSFGSRSMYRGRSAPLTCKITSSL 363
Query: 333 MAVIMQALARR 343
AV+ Q L+ R
Sbjct: 364 AAVLAQMLSHR 374
>gi|414870881|tpg|DAA49438.1| TPA: hypothetical protein ZEAMMB73_283754 [Zea mays]
Length = 127
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/98 (43%), Positives = 56/98 (57%), Gaps = 5/98 (5%)
Query: 247 MAYMD-CGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSA 305
MAY+D G P + + R +K L +K + +L L+ED E S S SSSSDEE+
Sbjct: 1 MAYVDYFGSPPEHISRAIKAGLKDKGLDAMLALVED--ETLSSFSSASSSSDEEAGRTQL 58
Query: 306 RSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
R SG Y R E +VC P SSL+AV++QALA R
Sbjct: 59 RRPSSGSYGRRSAE--EPVVCSPASSLVAVMVQALAHR 94
>gi|356568477|ref|XP_003552437.1| PREDICTED: CBS domain-containing protein CBSX6-like [Glycine max]
Length = 425
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 76/329 (23%), Positives = 133/329 (40%), Gaps = 76/329 (23%)
Query: 71 IGKVCMVDIITFLCPCFCSFAKD--------------SGIVRHLKPSASLLEAVDLLLGG 116
+G + DI+ FL C +D + ++R + P+ L++A+D++ G
Sbjct: 66 VGILSSFDIVAFLAKSRCLEDQDKALKTPVSEVVVHNNSLLRVVDPATRLIDALDMMKQG 125
Query: 117 VQNLVI----------------------------------LPAGIKLQPKPSLKSTFHND 142
V+ L++ LP + P S+
Sbjct: 126 VKRLLVPKSIAWKGMSKRFSVIYYGKWLKNSESPGNSSNNLPLSMNRSPSTSVTPI---P 182
Query: 143 SEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQS 202
+YC L++ED++R+ + +G L P P I + I ++ I PA A + Q
Sbjct: 183 DKYCCLSREDVLRFIIGCLGALAPLPLTSIAALEAI-NSNYNYIESSTPAIEATQKLPQ- 240
Query: 203 HIKQTSVAIV----DEEGRLVGDISPFSLNSCDETVAA-AIATLLAGDL-MAYMDCGRPL 256
++VA++ D + +++G+IS L CD AA A+A L AG M D P
Sbjct: 241 --DPSAVAVIESASDGQCKIIGEISACKLWKCDYLSAAWALANLSAGQFVMGVEDNVTP- 297
Query: 257 KDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGYSAR 316
+ L E + L+ D+++ + S S G ++ S +S+R
Sbjct: 298 --------RSLPEFS----LDSPSGDIDLVNSGGSRKPRKFSSRSVGFFSNSASHNFSSR 345
Query: 317 VVH--RSEAIVCYPWSSLMAVIMQALARR 343
++ RS + C SSL AV+ Q L+ R
Sbjct: 346 SMYRGRSAPLTCKITSSLAAVLAQMLSHR 374
>gi|110289342|gb|ABG66170.1| CBS domain-containing protein, putative, expressed [Oryza sativa
Japonica Group]
gi|215686451|dbj|BAG87674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 133
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 61/102 (59%), Gaps = 7/102 (6%)
Query: 247 MAYMD-CGRPLKDLVRLVKQRLDEKNMVGLLELMEDD--LEISSGSCSNSSSSDEESSTG 303
MAY+D G P + ++R VK L K + +LEL+E++ + S S SSSSD+E+
Sbjct: 1 MAYVDYFGAPPEHILRAVKAGLKSKGLDAMLELVENEAVSSFAFSSSSTSSSSDDEAHGR 60
Query: 304 SARSAR--SGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
+AR R SG Y R E +VC P SSL+AV+MQALA R
Sbjct: 61 AARLRRPSSGSYGRRSTE--EPVVCSPASSLVAVMMQALAHR 100
>gi|242039203|ref|XP_002466996.1| hypothetical protein SORBIDRAFT_01g018090 [Sorghum bicolor]
gi|241920850|gb|EER93994.1| hypothetical protein SORBIDRAFT_01g018090 [Sorghum bicolor]
Length = 128
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 59/98 (60%), Gaps = 4/98 (4%)
Query: 247 MAYMD-CGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSA 305
MAY+D G P + ++R VK L +K + +L L+ED+ +SS S ++SSS +E +
Sbjct: 1 MAYVDYFGSPPEHILRAVKAGLKDKGLDAMLALIEDE-TLSSFSSASSSSDEEAAGRAQL 59
Query: 306 RSARSGGYSARVVHRSEAIVCYPWSSLMAVIMQALARR 343
R SG Y R E +VC P SSL+AV++QALA R
Sbjct: 60 RRPSSGSYGRRSAE--EPVVCSPASSLVAVMVQALAHR 95
>gi|414869635|tpg|DAA48192.1| TPA: hypothetical protein ZEAMMB73_584600 [Zea mays]
Length = 395
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 69/162 (42%), Gaps = 49/162 (30%)
Query: 66 AGCSCIGKVCMVDIITFLC------------PCFCSFAKDSGIVRHLKPSASLLEAVDLL 113
A + IG + +D++ FL P A + +VR ++P L+E V+L+
Sbjct: 58 AAATVIGLLSSLDVVAFLASHLGDAAAAMRTPAGDVVAHEPTLVREVEPHTRLIEIVELM 117
Query: 114 LGGVQNLVI-------------------LPAGIKL---------------QPKPSLKST- 138
G + +++ A +K+ Q PS+ +T
Sbjct: 118 KQGARRVLVRKNITEACTVVDKRPFAPFYKAVLKITGTPRLSASEKAVGRQSPPSISTTT 177
Query: 139 -FHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIID 179
F D YC LT+ED++R+ +N +G L PTP Q I+S ++
Sbjct: 178 AFGCD-RYCCLTREDIVRFLINCLGALAPTPLQSISSLGAVN 218
>gi|125562298|gb|EAZ07746.1| hypothetical protein OsI_30001 [Oryza sativa Indica Group]
Length = 398
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 67/159 (42%), Gaps = 42/159 (26%)
Query: 65 SAGCSCIGKVCMVDIITFLC-------PCFCSFAKD-----SGIVRHLKPSASLLEAVDL 112
+A + +G + +D++ FL F + A D +VR ++P L+E V+L
Sbjct: 43 TAAATVVGLLSSIDVVAFLANHPGGAAAAFMTPAGDVVPHEHALVRQVQPDTRLIEIVEL 102
Query: 113 LLGGVQNLVI------------------LPAGIKL----------QPKPSLKS--TFHND 142
+ G + +++ A +K+ P P+ +S T
Sbjct: 103 MKQGARRVLVGKNIKEGCAINKQPFAPFYKAVLKITGTPRRNPSPSPSPATRSPSTTLGR 162
Query: 143 SEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDA 181
YC LT+ED++R+ +N +G L P P Q I S I A
Sbjct: 163 DRYCCLTREDIVRFLINCLGALAPIPMQSIASLGAISRA 201
>gi|115477447|ref|NP_001062319.1| Os08g0529200 [Oryza sativa Japonica Group]
gi|42407969|dbj|BAD09107.1| CBS domain containing protein-like [Oryza sativa Japonica Group]
gi|113624288|dbj|BAF24233.1| Os08g0529200 [Oryza sativa Japonica Group]
Length = 418
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 67/159 (42%), Gaps = 42/159 (26%)
Query: 65 SAGCSCIGKVCMVDIITFLC-------PCFCSFAKD-----SGIVRHLKPSASLLEAVDL 112
+A + +G + +D++ FL F + A D +VR ++P L+E V+L
Sbjct: 63 TAAATVVGLLSSIDVVAFLANHPGGAAAAFMTPAGDVVPHEHALVRQVQPDTRLIEIVEL 122
Query: 113 LLGGVQNLVI------------------LPAGIKL----------QPKPSLKS--TFHND 142
+ G + +++ A +K+ P P+ +S T
Sbjct: 123 MKQGARRVLVGKNIKEGCAINKQPFAPFYKAVLKITGTPRRNPSPSPSPATRSPSTTLGR 182
Query: 143 SEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDA 181
YC LT+ED++R+ +N +G L P P Q I S I A
Sbjct: 183 DRYCCLTREDIVRFLINCLGALAPIPMQSIASLGAISRA 221
>gi|224128944|ref|XP_002329005.1| predicted protein [Populus trichocarpa]
gi|222839239|gb|EEE77590.1| predicted protein [Populus trichocarpa]
Length = 424
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/254 (22%), Positives = 106/254 (41%), Gaps = 62/254 (24%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLCPCF 87
A+ A+ E I +W KRK+ + I+ E +G + +DI+ FL
Sbjct: 32 AIRAIGESTECGIPVW-------KRKSHVSMIETSEMRQQ-RFVGILNSLDIVAFLASTE 83
Query: 88 CSFAKDSGI--------------VRHLKPSASLLEAVDLLLGGVQNLVI----------- 122
C +D I ++ + P+ L++A++++ GV+ L++
Sbjct: 84 CLEDQDKAIKTSVSQVVVPNASLLKQVDPATRLIDALEMMKQGVRRLLVPKSMVWKGMSK 143
Query: 123 ----LPAGIKLQP-----------------KPSLKSTFHNDSEYCWLTQEDLIRYFLNFI 161
L G L+ +PS S N +++C L++ED+IR+ + +
Sbjct: 144 RFSFLYNGKWLKNADASNNSSNNNLTINTNRPSSSSGTSNRNKFCCLSREDVIRFLIGCL 203
Query: 162 GLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIV----DEEGR 217
G L P P I+S +I + ++ PA A + H + VA+V D + +
Sbjct: 204 GALAPLPLSSISSLGVI-NPNYTSVEASLPAFEATRKL---HGDPSEVAVVEPIPDGQCK 259
Query: 218 LVGDISPFSLNSCD 231
++G+IS L CD
Sbjct: 260 IIGEISASRLWKCD 273
>gi|224144928|ref|XP_002325465.1| predicted protein [Populus trichocarpa]
gi|222862340|gb|EEE99846.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 106/261 (40%), Gaps = 76/261 (29%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLCPCF 87
A+ A+ E I +W KRK+ I++ E +G + +DI+ FL
Sbjct: 32 AIRAIGESTECGIPVW-------KRKSHVGMIENSETRLQ-RFVGILNSLDIVAFLASTE 83
Query: 88 CSFAKDSGI--------------VRHLKPSASLLEAVDLLLGGVQNLV------------ 121
C +D I ++ + P+ L++A++++ GV+ L+
Sbjct: 84 CLEDRDKAIKTPVSQVVVPNTSLLKQVDPATRLIDALEMMKQGVRRLIVPKSMGWKGMSK 143
Query: 122 ---ILPAG----------------IKLQP-KPSLKSTFHNDSEYCWLTQEDLIRYFLNFI 161
IL G + + P +PS S N ++C L++ED+IR+ + +
Sbjct: 144 RFSILYNGKWLKNADTSNSSSNNNLTINPNRPSSSSGTSNRDKFCCLSREDVIRFLIGCL 203
Query: 162 GLLNPTPNQPINSHNIID------DAGILAIHY-----DDPAAFAIPLIAQSHIKQTSVA 210
G L P P I+S I+ +A + AI +DP+A A+
Sbjct: 204 GALAPLPLSSISSLGAINTNYNSLEASLPAIEATRKLPEDPSAIAV-----------VEP 252
Query: 211 IVDEEGRLVGDISPFSLNSCD 231
I + + +++G+IS L CD
Sbjct: 253 IPNGQCKIIGEISASRLWKCD 273
>gi|195614938|gb|ACG29299.1| CBS domain containing protein [Zea mays]
Length = 407
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 51/232 (21%), Positives = 90/232 (38%), Gaps = 53/232 (22%)
Query: 66 AGCSCIGKVCMVDIITFLCPCFCS-----------FAKDSGIVRHLKPSASLLEAVDLLL 114
+G IG + +DI F+ + ++R + P L++A++L+
Sbjct: 58 SGARFIGMISALDIAAFVASAGVGDRSMRAVVGEVVQPNPALLREIDPGTRLIDALELMR 117
Query: 115 GGVQNLVILPAG---------------------------------IKLQPKPSLKSTFHN 141
GV+ ++ +G +L P+ +
Sbjct: 118 HGVKRFLVRKSGSWKGITKRFSVLYNGKWLKNMESTSPSAASSSSTQLSPRSG------S 171
Query: 142 DSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQ 201
++C L++ED++R+ + +G L P P I+S I+ + PA AI I Q
Sbjct: 172 AEKFCCLSREDVLRFLIGCLGALAPIPLTQISSLGAINPQ-YSYVEASAPAMEAIQKIPQ 230
Query: 202 SHIKQTSVAIVDEEGR-LVGDISPFSLNSCDETVAA-AIATLLAGDLMAYMD 251
V + + R ++GDIS + L CD AA A+A L AG + D
Sbjct: 231 DPCAVAVVETMPDGTRNILGDISTYKLWKCDYVSAAWALANLSAGQFVIGAD 282
>gi|212274837|ref|NP_001130724.1| CBS domain containing protein [Zea mays]
gi|194689952|gb|ACF79060.1| unknown [Zea mays]
gi|194691690|gb|ACF79929.1| unknown [Zea mays]
gi|223948199|gb|ACN28183.1| unknown [Zea mays]
gi|414878990|tpg|DAA56121.1| TPA: CBS domain containing protein [Zea mays]
Length = 407
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 51/232 (21%), Positives = 90/232 (38%), Gaps = 53/232 (22%)
Query: 66 AGCSCIGKVCMVDIITFLCPCFCS-----------FAKDSGIVRHLKPSASLLEAVDLLL 114
+G IG + +DI F+ + ++R + P L++A++L+
Sbjct: 58 SGARFIGMISALDIAAFVASAGVGDRSMRAVVGEVVQPNPALLREIDPGTRLIDALELMR 117
Query: 115 GGVQNLVILPAG---------------------------------IKLQPKPSLKSTFHN 141
GV+ ++ +G +L P+ +
Sbjct: 118 HGVKRFLVRKSGSWKGITKRFSVLYNGKWLKNMESTSPSAASSSSTQLSPRSG------S 171
Query: 142 DSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQ 201
++C L++ED++R+ + +G L P P I+S I+ + PA AI I Q
Sbjct: 172 AEKFCCLSREDVLRFLIGCLGALAPIPLTQISSLGAINPQ-YSYVEASAPAMEAIQKIPQ 230
Query: 202 SHIKQTSVAIVDEEGR-LVGDISPFSLNSCDETVAA-AIATLLAGDLMAYMD 251
V + + R ++GDIS + L CD AA A+A L AG + D
Sbjct: 231 DPCAVAVVETMPDGTRSILGDISTYKLWKCDYVSAAWALANLSAGQFVIGAD 282
>gi|242079871|ref|XP_002444704.1| hypothetical protein SORBIDRAFT_07g026350 [Sorghum bicolor]
gi|241941054|gb|EES14199.1| hypothetical protein SORBIDRAFT_07g026350 [Sorghum bicolor]
Length = 397
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 65/161 (40%), Gaps = 48/161 (29%)
Query: 66 AGCSCIGKVCMVDIITFLC------------PCFCSFAKDSGIVRHLKPSASLLEAVDLL 113
A + IG + +D++ FL P A + +VR ++P L+E V+L
Sbjct: 58 AEATVIGLLSSLDVVAFLASHLGDAAAAMRTPAGDVVAHEPALVREVEPHTRLIEIVELT 117
Query: 114 LGGVQNLVI-------------------------------LPAGIKLQPKPSLKST---- 138
G + +++ A K +PS ST
Sbjct: 118 KQGARRVLVSKNITEACTVVDKKPFAPFYKAVLKITGTPTAAASAKAVGRPSQPSTTPSA 177
Query: 139 FHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIID 179
F D YC +T+ED+IR+ +N +G L PTP Q I+S ++
Sbjct: 178 FGCD-RYCCVTREDIIRFLINCLGALAPTPLQSISSLGAVN 217
>gi|125604108|gb|EAZ43433.1| hypothetical protein OsJ_28038 [Oryza sativa Japonica Group]
Length = 419
Score = 46.6 bits (109), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 32/130 (24%), Positives = 52/130 (40%), Gaps = 30/130 (23%)
Query: 82 FLCPCFCSFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLVI------------------- 122
F+ P + +VR ++P L+E V+L+ G + +++
Sbjct: 93 FMTPAGDVVPHEHALVRQVQPDTRLIEIVELMKQGARRVLVGKNIKEGCAINKQPFAPFY 152
Query: 123 -----LPAGIKLQPKPSLK------STFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQP 171
+ + P PS ST YC LT+ED++R+ +N +G L P P Q
Sbjct: 153 KAVLKITGTPRRNPSPSPSPATRSPSTTLGRDRYCCLTREDIVRFLINCLGALAPIPMQS 212
Query: 172 INSHNIIDDA 181
I S I A
Sbjct: 213 IASLGAISRA 222
>gi|388508444|gb|AFK42288.1| unknown [Medicago truncatula]
Length = 422
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 79/366 (21%), Positives = 140/366 (38%), Gaps = 76/366 (20%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLCPCF 87
A+ A+ E I +W +KR + + ++ D +G + D++ FL
Sbjct: 32 AIRAIAESPEGSIPVW------KKR---SQGVIENSDMRQTRFVGILSSFDVVGFLAKSS 82
Query: 88 CSFAKDSG--------IVRH------LKPSASLLEAVDLLLGGVQNLVILPAGIKLQPKP 133
C +D +VR+ + P L++A+D++ GV+ L++ + +
Sbjct: 83 CLEDQDKALKTPVSEFVVRNNYLLKLVDPGTRLIDALDMMKQGVKRLLVPKSIVWKGMSK 142
Query: 134 SLKSTFHND----------------------------SEYCWLTQEDLIRYFLNFIGLLN 165
+H +YC L++ED++R+ + +G L
Sbjct: 143 RFSVIYHGKWLKNPESPSSSNNNLSVNLNGNTSASIRDKYCCLSREDVLRFIIGCLGALA 202
Query: 166 PTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSHIKQTSVAIV----DEEGRLVGD 221
P P I + I + I PA + + Q ++VA++ D + +++G+
Sbjct: 203 PIPLTSIAALGAI-NPNYSYIESSTPALESTQKVLQD---PSAVAVIESMSDGQCKIIGE 258
Query: 222 ISPFSLNSCDETVAA-AIATLLAGDL-MAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELM 279
IS L CD AA A+A L AG M D P + D
Sbjct: 259 ISAIKLWKCDYLSAAWALANLSAGQFVMGVEDNVTPGSPPDLCINPGAD----------- 307
Query: 280 EDDLEISSGSCSNSSSSDEESSTGSARSARSGGYSARVVH--RSEAIVCYPWSSLMAVIM 337
+DL ++ G S S G ++ S + +R + RS + C SSL AV+
Sbjct: 308 -NDL-VNGGGGSRKLKKFSSRSIGFFSNSPSNSFGSRSMFRGRSTPLTCKMTSSLAAVMA 365
Query: 338 QALARR 343
Q L+ R
Sbjct: 366 QMLSHR 371
>gi|125583012|gb|EAZ23943.1| hypothetical protein OsJ_07670 [Oryza sativa Japonica Group]
Length = 394
Score = 45.1 bits (105), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 56/88 (63%), Gaps = 2/88 (2%)
Query: 186 IHYDDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCD-ETVAAAIATLLAG 244
+H DD A AIPL+ +S T+VA+V ++ LVG+I P L SCD E+ +AA A L AG
Sbjct: 190 LHADDDALDAIPLLRRSIADGTAVAVVADDDALVGEICPGVLGSCDIESASAAFAALSAG 249
Query: 245 DLMAYMDCG-RPLKDLVRLVKQRLDEKN 271
D+M Y+DC P + L+R ++ +L +
Sbjct: 250 DVMTYIDCSLSPPEFLLRSIRAQLKGRG 277
>gi|357148565|ref|XP_003574815.1| PREDICTED: CBS domain-containing protein CBSX6-like [Brachypodium
distachyon]
Length = 410
Score = 45.1 bits (105), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 30/116 (25%), Positives = 52/116 (44%), Gaps = 29/116 (25%)
Query: 92 KDSGIVRHLKPSASLLEAVDLLLGGVQNLVI----------------------LP----- 124
++ +VR + P A L+E V+L+ G + +++ +P
Sbjct: 103 REQALVREVGPDARLIEIVELMKQGAKGVLVRKNLTEGCTVSSKQPFTPFYKAVPKITGT 162
Query: 125 --AGIKLQPKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNII 178
AG + S S+ +YC LT+ED++R+ +N +G L P P Q I+S I
Sbjct: 163 QRAGTGQTIRRSPSSSMFGCDKYCCLTREDIVRFLINCLGALAPIPLQSISSLGAI 218
>gi|15218643|ref|NP_176711.1| CBS domain-containing protein [Arabidopsis thaliana]
gi|75244462|sp|Q8GZA4.1|CBSX6_ARATH RecName: Full=CBS domain-containing protein CBSX6
gi|26449327|dbj|BAC41791.1| unknown protein [Arabidopsis thaliana]
gi|332196237|gb|AEE34358.1| CBS domain-containing protein [Arabidopsis thaliana]
Length = 425
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 56/260 (21%), Positives = 108/260 (41%), Gaps = 73/260 (28%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAA-ANIDDHEDSAGCSCIGKVCMVDIITFLCPC 86
A+ A+ E I +W RKR + ++ + +G + +DI+ FL
Sbjct: 32 AIRAIGESTECGIPVW------RKRTTPSLPGFVENSEMRQQRFVGILNSLDIVAFLAKT 85
Query: 87 FC-------------SFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLV------------ 121
C + D+ +++ + P L++A++++ GV+ L+
Sbjct: 86 ECLQEEKAMKIPVSEVVSPDNTLLKQVDPGTRLIDALEMMKQGVRRLLVPKSVVWRGMSK 145
Query: 122 ---ILPAGIKLQP----------------KPSLKSTFHNDSEYCWLTQEDLIRYFLNFIG 162
IL G L+ +P+ T D ++C L++ED+IR+ + +G
Sbjct: 146 RFSILYNGKWLKNSENSSSSSGLSADSTNRPTTSMTSSRD-KFCCLSREDVIRFLIGVLG 204
Query: 163 LLNPTPNQPINSHNIID------DAGILAIHYD-----DPAAFAIPLIAQSHIKQTSVAI 211
L P P I++ II+ +A + AI DP+A A+ ++QT
Sbjct: 205 ALAPLPLTSISTLGIINQNYNFIEASLPAIEATRRPLCDPSAIAV-------LEQTE--- 254
Query: 212 VDEEGRLVGDISPFSLNSCD 231
+++ +++G+IS L CD
Sbjct: 255 NEQQFKIIGEISASKLWKCD 274
>gi|357507705|ref|XP_003624141.1| hypothetical protein MTR_7g079680 [Medicago truncatula]
gi|355499156|gb|AES80359.1| hypothetical protein MTR_7g079680 [Medicago truncatula]
Length = 437
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 89/208 (42%), Gaps = 25/208 (12%)
Query: 144 EYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPLIAQSH 203
+YC L++ED++R+ + +G L P P I + I + I PA + + Q
Sbjct: 196 KYCCLSREDVLRFIIGCLGALAPIPLTSIAALGAI-NPNYSYIESSTPALESTQKVLQD- 253
Query: 204 IKQTSVAIV----DEEGRLVGDISPFSLNSCDETVAA-AIATLLAGDL-MAYMDCGRPLK 257
++VA++ D + +++G+IS L CD AA A+A L AG M D P
Sbjct: 254 --PSAVAVIESMSDGQCKIIGEISAIKLWKCDYLSAAWALANLSAGQFVMGVEDNVTPGS 311
Query: 258 DLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGYSARV 317
+ D +DL ++ G S S G ++ S + +R
Sbjct: 312 PPDLCINPGAD------------NDL-VNGGGGSRKLKKFSSRSIGFFSNSPSNSFGSRS 358
Query: 318 VH--RSEAIVCYPWSSLMAVIMQALARR 343
+ RS + C SSL AV+ Q L+ R
Sbjct: 359 MFRGRSTPLTCKMTSSLAAVMAQMLSHR 386
>gi|326498651|dbj|BAK02311.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 403
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 56/114 (49%), Gaps = 13/114 (11%)
Query: 145 YCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHY--DDPAAFAIPLIAQS 202
+C L++ED++R+ + + L P P PI + I+ HY + +A A+ I +
Sbjct: 175 FCCLSREDILRFLIGCLSALAPIPLSPICTLGAINP------HYCHVEASAPAMEAIQKI 228
Query: 203 HIKQTSVAIV----DEEGRLVGDISPFSLNSCDETVAA-AIATLLAGDLMAYMD 251
VA+V D +++GDIS + L CD AA A+A L AG + D
Sbjct: 229 PGDPCGVAVVETTPDGVRKIIGDISAYKLWKCDYVAAAWALANLSAGQFVIGAD 282
>gi|297840957|ref|XP_002888360.1| hypothetical protein ARALYDRAFT_475589 [Arabidopsis lyrata subsp. lyrata]
gi|297334201|gb|EFH64619.1| hypothetical protein ARALYDRAFT_475589 [Arabidopsis lyrata subsp. lyrata]
Length = 424
Score = 41.6 bits (96), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 56/262 (21%), Positives = 109/262 (41%), Gaps = 77/262 (29%)
Query: 28 ALLALKRLNESYINIWSSDHSARKRKAAAANID---DHEDSAGCSCIGKVCMVDIITFLC 84
A+ A+ E I +W RK + N+ ++ + +G + +DI+ FL
Sbjct: 32 AIRAIGESTECGIPVW--------RKRSTPNLPGFVENSEMRQQRFVGILNSLDIVAFLA 83
Query: 85 PCFC-------------SFAKDSGIVRHLKPSASLLEAVDLLLGGVQNLV---------- 121
C + D+ +++ + P L++A++++ GV+ L+
Sbjct: 84 KSECLQEEKAMKIPVSEVVSPDNTLLKQVDPGTRLIDALEMMKQGVRRLLVPKSVVWRGM 143
Query: 122 -----ILPAGIKLQP----------------KPSLKSTFHNDSEYCWLTQEDLIRYFLNF 160
IL G L+ +P+ T D ++C L++ED+IR+ +
Sbjct: 144 SKRFSILYNGKWLKNSENSSSSSGLAADSTNRPTTSMTSCRD-KFCCLSREDVIRFLIGV 202
Query: 161 IGLLNPTPNQPINSHNIID------DAGILAIHYD-----DPAAFAIPLIAQSHIKQTSV 209
+G L P P I++ II+ +A + AI DP+A A+ ++QT
Sbjct: 203 LGALAPLPLTSISTLGIINQNYNFIEAYLPAIEATRRPPCDPSAIAV-------LEQTE- 254
Query: 210 AIVDEEGRLVGDISPFSLNSCD 231
+++ +++G+IS L CD
Sbjct: 255 --NEQQFKIIGEISASKLWKCD 274
>gi|449470251|ref|XP_004152831.1| PREDICTED: CBS domain-containing protein CBSX6-like [Cucumis sativus]
gi|449477694|ref|XP_004155096.1| PREDICTED: CBS domain-containing protein CBSX6-like isoform 1 [Cucumis sativus]
gi|449477697|ref|XP_004155097.1| PREDICTED: CBS domain-containing protein CBSX6-like isoform 2 [Cucumis sativus]
Length = 425
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 85/360 (23%), Positives = 142/360 (39%), Gaps = 100/360 (27%)
Query: 51 KRKAAAANIDDHEDSAGCSCIGKVCMVDIITFLC--------------PCFCSFAKDSGI 96
KRK I++ E +G + +DI+ FL P + + +
Sbjct: 48 KRKTHVGIIENAEMKQQ-RFVGILSSLDIVAFLARSENLEDQERAMKAPVSEAVVPNYSL 106
Query: 97 VRHLKPSASLLEAVDLLLGGVQNLVI---------------LPAGIKLQ----------- 130
+R + P+ L++A++++ GV+ L+I L G L+
Sbjct: 107 LRQVDPATRLIDALEMMKQGVRRLLIRKSVVWKGMSKRFSILYNGKWLKNIDTPGNSSNN 166
Query: 131 -----PKPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIID------ 179
+PS ST + ++C L++ED+IR+ + +G L P P I++ I+
Sbjct: 167 LNLNPNRPSSSSTSTSHDKFCCLSREDVIRFLIGCLGALAPLPLSSISTLEAINPNYCSI 226
Query: 180 DAGILAIHY-----DDPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCD-ET 233
DA AI DDP A A+ + H D + R++G+IS L C+
Sbjct: 227 DASTPAIDISHKLPDDPVAVAV--VENIH---------DNQYRIIGEISASKLWKCNYLA 275
Query: 234 VAAAIATLLAGDLMAYMDCGRPLKDLVRLVKQRLDEKNMVGLLELMEDDLEISSGSCSNS 293
A A+A L AG + + E NM M DL + N
Sbjct: 276 AAWALANLSAGQFVMGV------------------EDNMT---SRMVPDLSTNGNVDEND 314
Query: 294 SSSDEESSTGSARSARSGGYSA--------RVVH--RSEAIVCYPWSSLMAVIMQALARR 343
S++ ++ S+RS G++ R ++ RS + C SSL AV+ Q L+ R
Sbjct: 315 SANGGGATRARKFSSRSIGFNPLSRAFRINRSMYRGRSAPLTCKVTSSLAAVMAQMLSHR 374
>gi|3335341|gb|AAC27143.1|AAC27143 T8F5.10 [Arabidopsis thaliana]
Length = 482
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 57/111 (51%), Gaps = 22/111 (19%)
Query: 132 KPSLKSTFHNDSEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIID------DAGILA 185
+P+ T D ++C L++ED+IR+ + +G L P P I++ II+ +A + A
Sbjct: 232 RPTTSMTSSRD-KFCCLSREDVIRFLIGVLGALAPLPLTSISTLGIINQNYNFIEASLPA 290
Query: 186 IHYD-----DPAAFAIPLIAQSHIKQTSVAIVDEEGRLVGDISPFSLNSCD 231
I DP+A A+ ++QT +++ +++G+IS L CD
Sbjct: 291 IEATRRPLCDPSAIAV-------LEQTE---NEQQFKIIGEISASKLWKCD 331
>gi|225431005|ref|XP_002279210.1| PREDICTED: CBS domain-containing protein CBSX6 [Vitis vinifera]
gi|297735292|emb|CBI17654.3| unnamed protein product [Vitis vinifera]
Length = 427
Score = 39.7 bits (91), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 94/210 (44%), Gaps = 26/210 (12%)
Query: 143 SEYCWLTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAF-AIPLIAQ 201
+++C L++ED+IR+ + +G L P P I+S I +Y A+F AI + +
Sbjct: 183 NKFCCLSREDVIRFVIGCLGALAPLPLSSISSLGAISPN-----YYSIEASFPAIQVTQK 237
Query: 202 SHIKQTSVAIV----DEEGRLVGDISPFSLNSCD-ETVAAAIATLLAGDLMAYMDCGRPL 256
++VA+V D + +++G+IS L CD A A+A L AG + ++
Sbjct: 238 LPQDPSAVAVVESTPDGQYKIIGEISACKLWKCDYLAAAWALANLSAGQFVMGVEDNVTS 297
Query: 257 KDLVRL-VKQRLDEKNMV--GLLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGY 313
+ L V E NM G S G SN +S S G++RS G
Sbjct: 298 RSLPHFSVNLTGGENNMANGGASTRQRKFSSRSIGFFSNPAS----PSFGASRSMYRG-- 351
Query: 314 SARVVHRSEAIVCYPWSSLMAVIMQALARR 343
RS + C SSL AV+ Q L+ R
Sbjct: 352 ------RSAPLTCKVTSSLAAVMAQMLSHR 375
>gi|355571203|ref|ZP_09042455.1| PHP domain protein [Methanolinea tarda NOBI-1]
gi|354825591|gb|EHF09813.1| PHP domain protein [Methanolinea tarda NOBI-1]
Length = 374
Score = 38.5 bits (88), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 37/73 (50%), Gaps = 5/73 (6%)
Query: 267 LDEKNMVGLLELMEDDLEISSGSCSNSSSSDEESSTGSARSARSGGYSARVVHRSEAIVC 326
+++KN V L LMED + + S+ STG R +GG AR+VHR + ++
Sbjct: 72 IEDKNRVHHLVLMEDFASFRDLALDLAPSAPGLGSTGRPRVPLTGGEIARLVHRRDGLIG 131
Query: 327 -----YPWSSLMA 334
PW+SL A
Sbjct: 132 PAHAFTPWTSLYA 144
>gi|124263029|ref|YP_001023499.1| hypothetical protein Mpe_B0492 [Methylibium petroleiphilum PM1]
gi|124262275|gb|ABM97264.1| hypothetical protein Mpe_B0492 [Methylibium petroleiphilum PM1]
Length = 121
Score = 37.7 bits (86), Expect = 8.8, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 34/73 (46%), Gaps = 3/73 (4%)
Query: 158 LNFIGLLNPTPNQ-PINSHNIIDDAGILAIHYDDPAAFAIPLIAQ--SHIKQTSVAIVDE 214
L GL PT P N ++ AG+ YD P FA P +A + + + D
Sbjct: 39 LALAGLCGPTAQAGPANVDGLLAIAGLQLQAYDPPTGFADPRLATVCKEAAKKGLIVTDS 98
Query: 215 EGRLVGDISPFSL 227
+GRLVGD+ S+
Sbjct: 99 QGRLVGDLRAVSV 111
>gi|373456383|ref|ZP_09548150.1| Methyltransferase type 11 [Caldithrix abyssi DSM 13497]
gi|371718047|gb|EHO39818.1| Methyltransferase type 11 [Caldithrix abyssi DSM 13497]
Length = 243
Score = 37.7 bits (86), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 52/119 (43%), Gaps = 5/119 (4%)
Query: 148 LTQEDLIRYFLNFIGLLNPTPNQPINSHNIIDDAGILAIHYDDPAAFAIPL-IAQSHIKQ 206
++++ R + + LL PT Q I I G L H F +PL +A S++K+
Sbjct: 27 FLEDEIARRYQAIVHLLKPTVGQKILE--IGSGGGQLLKHLPAADFFYVPLDLALSNLKK 84
Query: 207 TSVAIVDEEGRLVGDIS--PFSLNSCDETVAAAIATLLAGDLMAYMDCGRPLKDLVRLV 263
+ + GD+ PF S D + A + L L+A +C R LK RLV
Sbjct: 85 IKQQYTQKNLPVTGDVFALPFRAKSFDIVIMAEVLEHLDRPLIALKECHRVLKQGGRLV 143
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.134 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,586,978,429
Number of Sequences: 23463169
Number of extensions: 225139542
Number of successful extensions: 649290
Number of sequences better than 100.0: 91
Number of HSP's better than 100.0 without gapping: 59
Number of HSP's successfully gapped in prelim test: 32
Number of HSP's that attempted gapping in prelim test: 648886
Number of HSP's gapped (non-prelim): 111
length of query: 379
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 235
effective length of database: 8,980,499,031
effective search space: 2110417272285
effective search space used: 2110417272285
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)