BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 039969
(184 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359475374|ref|XP_003631671.1| PREDICTED: uncharacterized protein LOC100854865 [Vitis vinifera]
gi|297741288|emb|CBI32419.3| unnamed protein product [Vitis vinifera]
Length = 209
Score = 244 bits (622), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 129/201 (64%), Positives = 149/201 (74%), Gaps = 20/201 (9%)
Query: 1 MYSISHTSSISFPAKPATTFRCFAASDIPAGPNSLRWPHSSSAADIS--IRVGPD----- 53
+YS S+ S S P +T RC AA D+PAGP+ RW H + AD S +R D
Sbjct: 10 LYS-SNPSIFSLPKFSISTVRCRAA-DLPAGPSFPRWLHLFTTADASAGVRSKQDLDGVG 67
Query: 54 --------SEGTA---VGGSHGGVKVNAKEKKWSRNRESYLTDDSEPLPLPMTYPDSSPV 102
++G+A G + GVKVNA EK+WSRNRESYL DD + LPLPMTYPDSSPV
Sbjct: 68 VGGGVGSTADGSARRSSGNAGSGVKVNAWEKRWSRNRESYLADDDDALPLPMTYPDSSPV 127
Query: 103 PREEIDKRLRCDPEVQDCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGY 162
EEID+RLRCDP+V+DCKEVVYEWTGKCRSCQG+G+VSYYNKRGKE ICKCIPCLGIGY
Sbjct: 128 APEEIDRRLRCDPQVEDCKEVVYEWTGKCRSCQGTGFVSYYNKRGKETICKCIPCLGIGY 187
Query: 163 VQKITARKDIEVMEDLDNGKP 183
VQKIT+R DIEVMEDL+NGKP
Sbjct: 188 VQKITSRNDIEVMEDLENGKP 208
>gi|224105541|ref|XP_002313849.1| predicted protein [Populus trichocarpa]
gi|118489605|gb|ABK96604.1| unknown [Populus trichocarpa x Populus deltoides]
gi|222850257|gb|EEE87804.1| predicted protein [Populus trichocarpa]
Length = 198
Score = 241 bits (614), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 127/202 (62%), Positives = 146/202 (72%), Gaps = 24/202 (11%)
Query: 1 MYSISHTSSISFPAKPA----TTFRCFAASDIPAGPNSLRWPHSSSAADISI-----RVG 51
M SIS TS + P+KP +FRC AA D+ + RW +SAAD SI R+G
Sbjct: 1 MLSISPTS-LFLPSKPTFNRTRSFRCSAAGDL----SFPRWLQFTSAADASIGGGSIRIG 55
Query: 52 PDSEGTAVGGSH----------GGVKVNAKEKKWSRNRESYLTDDSEPLPLPMTYPDSSP 101
+ +G A V+VNA+EKKWSRNRESYL D+ + LPLPMTYPDS P
Sbjct: 56 QELDGAASADGERKGFGGNSGGSSVRVNAREKKWSRNRESYLADNEDALPLPMTYPDSKP 115
Query: 102 VPREEIDKRLRCDPEVQDCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIG 161
VP EEIDKRL CDPEV+DC+EVVYEWTGKCRSCQGSG+VSYYNKRGKE+ CKC+PCLGIG
Sbjct: 116 VPPEEIDKRLSCDPEVEDCREVVYEWTGKCRSCQGSGFVSYYNKRGKEVTCKCVPCLGIG 175
Query: 162 YVQKITARKDIEVMEDLDNGKP 183
YVQKITARKDI VME+LDNGKP
Sbjct: 176 YVQKITARKDIAVMEELDNGKP 197
>gi|363806768|ref|NP_001242534.1| uncharacterized protein LOC100819362 [Glycine max]
gi|255636117|gb|ACU18402.1| unknown [Glycine max]
Length = 180
Score = 240 bits (613), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 118/185 (63%), Positives = 142/185 (76%), Gaps = 6/185 (3%)
Query: 1 MYSISHTSSISFPAKPATTFRCFAAS-DIPAGPNSLRWPHSSSAADISIRVGPDSEGTAV 59
M++ + S++ FP P ++FRC AA+ ++ AGP+ + S A + I PD G V
Sbjct: 1 MFATNAASNLFFPGNP-SSFRCRAAAGNLSAGPSFFQ----SVAGGLRIGQFPDGAGLVV 55
Query: 60 GGSHGGVKVNAKEKKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQD 119
G GG + KKWSR ESYL +D+EPLPLPMTYPDSSPV +EEIDKRL+CDP++QD
Sbjct: 56 EGRRGGGNEPIRVKKWSRKNESYLDNDTEPLPLPMTYPDSSPVSQEEIDKRLQCDPQIQD 115
Query: 120 CKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGYVQKITARKDIEVMEDLD 179
CKEVVYEWTGKCRSCQGSGYVSYY KRGK + CKCIPC+GIGYVQKITARKDI+VMEDL+
Sbjct: 116 CKEVVYEWTGKCRSCQGSGYVSYYGKRGKAVTCKCIPCMGIGYVQKITARKDIDVMEDLE 175
Query: 180 NGKPP 184
NGKPP
Sbjct: 176 NGKPP 180
>gi|255555471|ref|XP_002518772.1| conserved hypothetical protein [Ricinus communis]
gi|223542153|gb|EEF43697.1| conserved hypothetical protein [Ricinus communis]
Length = 213
Score = 239 bits (611), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 128/206 (62%), Positives = 146/206 (70%), Gaps = 30/206 (14%)
Query: 8 SSISFPAKPA---------TTFRCFA-ASDIPAGPNSLRW-PHSSSAADI-SIRVGPDSE 55
+S+ FP+KP+ T+ RCFA A D+ P RW H +SAAD SIR+G +S+
Sbjct: 7 NSLFFPSKPSHLLSLPSITTSHRCFASAGDLRHIPTFPRWWFHFTSAADTGSIRIGQESD 66
Query: 56 GTAVGGSHGG------------------VKVNAKEKKWSRNRESYLTDDSEPLPLPMTYP 97
G ++VNAKEKKWSRNRESYL D+ + LPLPMTYP
Sbjct: 67 GATSSPVADSSSSAGPQRRGLGGSGSSSIRVNAKEKKWSRNRESYLVDNEDALPLPMTYP 126
Query: 98 DSSPVPREEIDKRLRCDPEVQDCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPC 157
DS PV EEIDKRLRCDPE+QDC+EVVYEWTGKCRSCQGSGYVSY+NKRGKE ICKC+PC
Sbjct: 127 DSKPVSPEEIDKRLRCDPEIQDCREVVYEWTGKCRSCQGSGYVSYFNKRGKETICKCVPC 186
Query: 158 LGIGYVQKITARKDIEVMEDLDNGKP 183
LGIGYVQKITARKDI VMEDLDNGKP
Sbjct: 187 LGIGYVQKITARKDIGVMEDLDNGKP 212
>gi|356500037|ref|XP_003518841.1| PREDICTED: uncharacterized protein LOC100795440 [Glycine max]
Length = 183
Score = 232 bits (591), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 121/186 (65%), Positives = 145/186 (77%), Gaps = 7/186 (3%)
Query: 1 MYSISHTSSISFPAKPATTFRCFAAS-DIPAGPNSLRWPHSSSAADISIRVGPDSEGTAV 59
M++ + TS++ FP P ++FRC AA+ D+ AGP+ ++ S A + I PD G AV
Sbjct: 3 MFATNATSNLFFPCSP-SSFRCRAAAGDLSAGPSFIQ----SVAGGLRIGQFPDGAGLAV 57
Query: 60 GGSHGGVKVN-AKEKKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQ 118
G GG + KKWSR ESYL +D+EPLPLPMTYPDSSPV +EEIDKRL+CDP++Q
Sbjct: 58 EGRRGGGGNEPIRVKKWSRKNESYLDNDAEPLPLPMTYPDSSPVSQEEIDKRLQCDPQIQ 117
Query: 119 DCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGYVQKITARKDIEVMEDL 178
DCKEVVYEWTGKCRSCQGSGYVSYY KRGK + CKCIPC+GIGYVQKITARKDI+VMEDL
Sbjct: 118 DCKEVVYEWTGKCRSCQGSGYVSYYGKRGKPVTCKCIPCMGIGYVQKITARKDIDVMEDL 177
Query: 179 DNGKPP 184
+NGKPP
Sbjct: 178 ENGKPP 183
>gi|449497625|ref|XP_004160454.1| PREDICTED: protein disulfide-isomerase SCO2-like isoform 2 [Cucumis
sativus]
Length = 212
Score = 227 bits (578), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 125/212 (58%), Positives = 146/212 (68%), Gaps = 30/212 (14%)
Query: 1 MYSISHTSSISFPAKPAT------------TFRCFAASDIPAGPNS-----LRWPH-SSS 42
M++I + SS FP+ P T +FR A+ A P+ +P S++
Sbjct: 1 MFAI-NPSSFFFPSNPTTATARQHSISPPSSFRLSLAATGDASPDPSISRLFNFPSLSAT 59
Query: 43 AADISIRVGPDSEGTA----------VGGSHGGVKVNAKEKKWSRNRESYLTDDSEPLPL 92
A R+G D + A +G VKVNA EKKWSR+RESYL DDS+ LPL
Sbjct: 60 DATGGFRLGQDYDAGASSPVIDGPGRFARGNGTVKVNAMEKKWSRDRESYLVDDSDVLPL 119
Query: 93 PMTYPDSSPVPREEIDKRLRCDPEVQDCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIIC 152
PMTYPDSSPV +EID+RLRCDP+V+DCKEVVYEWTGKCRSCQGSGYVSYYNKRG+EI C
Sbjct: 120 PMTYPDSSPVSPDEIDRRLRCDPQVEDCKEVVYEWTGKCRSCQGSGYVSYYNKRGREITC 179
Query: 153 KCIPCLGIGYVQKITARKDIEVMEDLD-NGKP 183
KCIPCLGIGYVQKITARKDIE+MEDLD NGKP
Sbjct: 180 KCIPCLGIGYVQKITARKDIELMEDLDNNGKP 211
>gi|18402160|ref|NP_566627.1| protein disulfide isomerase [Arabidopsis thaliana]
gi|75163632|sp|Q93YN0.1|SCO2_ARATH RecName: Full=Protein disulfide-isomerase SCO2; AltName:
Full=Protein SNOWY COTYLEDON 2; Flags: Precursor
gi|16649111|gb|AAL24407.1| Unknown protein [Arabidopsis thaliana]
gi|20148555|gb|AAM10168.1| unknown protein [Arabidopsis thaliana]
gi|166850620|gb|ABY91282.1| snowy cotyledon 2 [Arabidopsis thaliana]
gi|332642689|gb|AEE76210.1| protein disulfide isomerase [Arabidopsis thaliana]
Length = 187
Score = 207 bits (527), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 104/176 (59%), Positives = 126/176 (71%), Gaps = 3/176 (1%)
Query: 5 SHTSSISFPAKPATTFRCFAASDIPAGPNSLRWPHSSSAADISIRVGPDSEGTAVGGSHG 64
SH + P P+ + RC AA+DIP G + +R P + + + R S A GG+
Sbjct: 12 SHRPLVFLPRLPSRSLRCRAAADIPLG-DGIRLPREADSTSDTARSRDVS--VAAGGNGE 68
Query: 65 GVKVNAKEKKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQDCKEVV 124
G K + WS++ ESYL DD + LPLPMTYPD+SPV + ID+RL+CDP V+DC+EVV
Sbjct: 69 GAKWRKRRLLWSKSGESYLVDDGDALPLPMTYPDTSPVSPDVIDRRLQCDPVVEDCREVV 128
Query: 125 YEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGYVQKITARKDIEVMEDLDN 180
YEWTGKCRSCQGSG VSYY KRGKE+ICKCIPC GIGYVQKIT+R DIEVMEDLDN
Sbjct: 129 YEWTGKCRSCQGSGTVSYYKKRGKEVICKCIPCQGIGYVQKITSRTDIEVMEDLDN 184
>gi|449445880|ref|XP_004140700.1| PREDICTED: protein disulfide-isomerase SCO2-like isoform 2 [Cucumis
sativus]
Length = 115
Score = 206 bits (525), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 99/113 (87%), Positives = 107/113 (94%), Gaps = 1/113 (0%)
Query: 72 EKKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQDCKEVVYEWTGKC 131
EKKWSR+RESYL DDS+ LPLPMTYPDSSPV +EID+RLRCDP+V+DCKEVVYEWTGKC
Sbjct: 2 EKKWSRDRESYLVDDSDVLPLPMTYPDSSPVSPDEIDRRLRCDPQVEDCKEVVYEWTGKC 61
Query: 132 RSCQGSGYVSYYNKRGKEIICKCIPCLGIGYVQKITARKDIEVMEDLD-NGKP 183
RSCQGSGYVSYYNKRG+EI CKCIPCLGIGYVQKITARKDIE+MEDLD NGKP
Sbjct: 62 RSCQGSGYVSYYNKRGREITCKCIPCLGIGYVQKITARKDIELMEDLDNNGKP 114
>gi|297830570|ref|XP_002883167.1| hypothetical protein ARALYDRAFT_898297 [Arabidopsis lyrata subsp.
lyrata]
gi|297329007|gb|EFH59426.1| hypothetical protein ARALYDRAFT_898297 [Arabidopsis lyrata subsp.
lyrata]
Length = 187
Score = 204 bits (519), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 103/176 (58%), Positives = 125/176 (71%), Gaps = 3/176 (1%)
Query: 5 SHTSSISFPAKPATTFRCFAASDIPAGPNSLRWPHSSSAADISIRVGPDSEGTAVGGSHG 64
SH + P+ + RC AA+DIP G + +R P + + + R S A GG+
Sbjct: 12 SHRPLVFLHRLPSRSLRCRAAADIPLG-DGIRLPREADSTSDTARSRDVS--VAAGGNGE 68
Query: 65 GVKVNAKEKKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQDCKEVV 124
G K + WS++ ESYL DD + LPLPMTYPD+SPV + ID+RL+CDP V+DC+EVV
Sbjct: 69 GAKWRKRRLLWSKSGESYLVDDGDALPLPMTYPDTSPVSPDVIDRRLQCDPVVEDCREVV 128
Query: 125 YEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGYVQKITARKDIEVMEDLDN 180
YEWTGKCRSCQGSG VSYY KRGKE+ICKCIPC GIGYVQKIT+R DIEVMEDLDN
Sbjct: 129 YEWTGKCRSCQGSGTVSYYKKRGKEVICKCIPCQGIGYVQKITSRTDIEVMEDLDN 184
>gi|357475403|ref|XP_003607987.1| Calmodulin-like protein [Medicago truncatula]
gi|355509042|gb|AES90184.1| Calmodulin-like protein [Medicago truncatula]
Length = 388
Score = 190 bits (482), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 108/196 (55%), Positives = 131/196 (66%), Gaps = 25/196 (12%)
Query: 4 ISHTSSISFPAKPATTFRCFAASDIPAGPNSL-RWPHSS-SAADISIRV-GPDSEGTAV- 59
+S T++IS FR A++ P S R HSS S S R+ ++ G AV
Sbjct: 2 LSTTNAISL------IFRSRASA---GDPTSFPRLFHSSLSLPTTSGRLPAVETGGAAVR 52
Query: 60 -GGSHGGVKVNAKEKKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQ 118
G + +KV AKE+KWSRNRE Y+ D+ EPLPLPMTYPDSSPV +EID+RL+CDP+++
Sbjct: 53 RAGGNEWMKVKAKERKWSRNREGYMFDNGEPLPLPMTYPDSSPVSSQEIDQRLQCDPKLE 112
Query: 119 DCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLG-----------IGYVQKIT 167
DCKEVVYEWTGKCRSCQGSGYVSYYNKRGKE+ + G YVQKIT
Sbjct: 113 DCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEVTWQMHSLYGNCLMGWSGGRDFRYVQKIT 172
Query: 168 ARKDIEVMEDLDNGKP 183
ARKDI++MEDL NGKP
Sbjct: 173 ARKDIDLMEDLGNGKP 188
>gi|449497620|ref|XP_004160453.1| PREDICTED: protein disulfide-isomerase SCO2-like isoform 1 [Cucumis
sativus]
Length = 230
Score = 179 bits (453), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 102/186 (54%), Positives = 121/186 (65%), Gaps = 31/186 (16%)
Query: 1 MYSISHTSSISFPAKPAT------------TFRCFAASDIPAGPNS-----LRWPHSSSA 43
M++I + SS FP+ P T +FR A+ A P+ +P S SA
Sbjct: 1 MFAI-NPSSFFFPSNPTTATARQHSISPPSSFRLSLAATGDASPDPSISRLFNFP-SLSA 58
Query: 44 ADIS--IRVGPDSEGTA----------VGGSHGGVKVNAKEKKWSRNRESYLTDDSEPLP 91
D + R+G D + A +G VKVNA EKKWSR+RESYL DDS+ LP
Sbjct: 59 TDATGGFRLGQDYDAGASSPVIDGPGRFARGNGTVKVNAMEKKWSRDRESYLVDDSDVLP 118
Query: 92 LPMTYPDSSPVPREEIDKRLRCDPEVQDCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEII 151
LPMTYPDSSPV +EID+RLRCDP+V+DCKEVVYEWTGKCRSCQGSGYVSYYNKRG+EI
Sbjct: 119 LPMTYPDSSPVSPDEIDRRLRCDPQVEDCKEVVYEWTGKCRSCQGSGYVSYYNKRGREIT 178
Query: 152 CKCIPC 157
CKCIPC
Sbjct: 179 CKCIPC 184
>gi|115479565|ref|NP_001063376.1| Os09g0458400 [Oryza sativa Japonica Group]
gi|51536205|dbj|BAD38376.1| unknown protein [Oryza sativa Japonica Group]
gi|113631609|dbj|BAF25290.1| Os09g0458400 [Oryza sativa Japonica Group]
gi|215697752|dbj|BAG91746.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641721|gb|EEE69853.1| hypothetical protein OsJ_29636 [Oryza sativa Japonica Group]
Length = 205
Score = 173 bits (439), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 85/150 (56%), Positives = 105/150 (70%), Gaps = 17/150 (11%)
Query: 52 PDSEGTAVGGSHGGVKVNAKE-----------------KKWSRNRESYLTDDSEPLPLPM 94
P + G V GV+V A+E WS +RESYL DD EPLPLP+
Sbjct: 56 PSTSGGRVAARDPGVRVRAREGAEEEKKGRGRGRRRWWDWWSGDRESYLVDDVEPLPLPL 115
Query: 95 TYPDSSPVPREEIDKRLRCDPEVQDCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKC 154
T PD+ P+ REE+D+RL CD E++DCK V YEWTGKCRSCQG+G VSY+ K+G+E ICKC
Sbjct: 116 TVPDTEPMSREELDRRLSCDVEIEDCKTVSYEWTGKCRSCQGTGLVSYFRKKGRETICKC 175
Query: 155 IPCLGIGYVQKITARKDIEVMEDLDNGKPP 184
+PC GIGYV+KIT R+DIE M++LDNGKPP
Sbjct: 176 VPCAGIGYVRKITFRQDIENMDELDNGKPP 205
>gi|414589672|tpg|DAA40243.1| TPA: hypothetical protein ZEAMMB73_532617 [Zea mays]
Length = 793
Score = 172 bits (437), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 88/179 (49%), Positives = 118/179 (65%), Gaps = 30/179 (16%)
Query: 36 RWPHSSSAADIS----------IRVGPDSEGTAVGGS----HGGVKVNAKE--------- 72
R+PHS +AA+ + R PDS+ + GG GV+V AKE
Sbjct: 615 RFPHSPAAANTTGAASAPDWFRPRRTPDSDPSISGGRVTARDPGVRVKAKEGAEEEKDKK 674
Query: 73 -------KKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQDCKEVVY 125
++WS ++ESYL DD EPLP+P+T P++ P+ REE+D+RL CD E+ +CK V Y
Sbjct: 675 AKRRRWWERWSGDKESYLVDDVEPLPIPLTVPETEPMSREELDRRLSCDVEIDECKTVSY 734
Query: 126 EWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGYVQKITARKDIEVMEDLDNGKPP 184
EWTGKCRSCQG+G VSY+ K+GKE ICKC+PC GIGYV+KIT R+DI+ +++LDN K P
Sbjct: 735 EWTGKCRSCQGTGLVSYFRKKGKETICKCVPCAGIGYVRKITLREDIQKIDELDNEKLP 793
>gi|226503125|ref|NP_001144163.1| uncharacterized protein LOC100277018 [Zea mays]
gi|195637846|gb|ACG38391.1| hypothetical protein [Zea mays]
Length = 204
Score = 170 bits (430), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 88/179 (49%), Positives = 118/179 (65%), Gaps = 30/179 (16%)
Query: 36 RWPHSSSAADIS----------IRVGPDSEGTAVGGS----HGGVKVNAKE--------- 72
R+PHS +AA+ + R PDS+ + GG GV+V AKE
Sbjct: 26 RFPHSPAAANTTGAASAPDWFRPRRTPDSDPSISGGRVTARDPGVRVKAKEGAEEEKDKK 85
Query: 73 -------KKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQDCKEVVY 125
++WS ++ESYL DD EPLP+P+T P++ P+ REE+D+RL CD E+ +CK V Y
Sbjct: 86 AKRRRWWERWSGDKESYLVDDVEPLPIPLTVPETEPMSREELDRRLSCDVEIDECKTVSY 145
Query: 126 EWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGYVQKITARKDIEVMEDLDNGKPP 184
EWTGKCRSCQG+G VSY+ K+GKE ICKC+PC GIGYV+KIT R+DI+ +++LDN K P
Sbjct: 146 EWTGKCRSCQGTGLVSYFRKKGKETICKCVPCAGIGYVRKITLREDIQKIDELDNEKLP 204
>gi|242044930|ref|XP_002460336.1| hypothetical protein SORBIDRAFT_02g026650 [Sorghum bicolor]
gi|241923713|gb|EER96857.1| hypothetical protein SORBIDRAFT_02g026650 [Sorghum bicolor]
Length = 271
Score = 169 bits (429), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 87/180 (48%), Positives = 118/180 (65%), Gaps = 31/180 (17%)
Query: 36 RWPHSSSAADIS----------IRVGPDSE-----GTAVGGSHGGVKVNAKE-------- 72
R+PHS +AA+ + R PD++ G V GV+V AKE
Sbjct: 92 RFPHSPAAANTTGAASAPDWFRPRRPPDADPSTSPGGRVAARDPGVRVKAKEGAEEEKGK 151
Query: 73 --------KKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQDCKEVV 124
++WS ++ESYL DD EPLP+P+T P + P+ REE+++RL CD +++DCK V
Sbjct: 152 KGKRRRWWERWSGDKESYLVDDVEPLPIPLTVPGTEPMSREELNRRLSCDVKIEDCKTVS 211
Query: 125 YEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGYVQKITARKDIEVMEDLDNGKPP 184
YEWTGKCRSCQG+G VSY+ K+GKE ICKC+PC GIGYV+KIT R+DI+ +++LDNGK P
Sbjct: 212 YEWTGKCRSCQGTGLVSYFRKKGKETICKCVPCAGIGYVRKITYREDIQNIDELDNGKLP 271
>gi|357158765|ref|XP_003578233.1| PREDICTED: uncharacterized protein LOC100846859 [Brachypodium
distachyon]
Length = 204
Score = 166 bits (421), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 120/202 (59%), Gaps = 24/202 (11%)
Query: 1 MYSISHTSSISFPAK---PATTFRCFAASDIPAGPNSLRWPHSSSAADISIRVGPDSEGT 57
+ S+ S+ P + P T A + A P PH + +D P + G
Sbjct: 9 LLSLRSNPSLPPPGRHRRPHPTPPAAANTTGAASPQEWFRPHRPTDSD------PSTSGG 62
Query: 58 AVGGSHGGVKVNAKE---------------KKWSRNRESYLTDDSEPLPLPMTYPDSSPV 102
V GV+V AKE ++WS ++ESYL DD EPLP+P+T P + P+
Sbjct: 63 RVAARDPGVRVKAKEGSEDKKKGESKRRWWERWSGDKESYLVDDVEPLPIPLTIPGTEPM 122
Query: 103 PREEIDKRLRCDPEVQDCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGY 162
REE+D+RL CD ++ DCK V YEWTGKCRSCQG+G VSY+ K+G+E IC C+PC GIGY
Sbjct: 123 SREELDRRLSCDVKIDDCKIVSYEWTGKCRSCQGTGLVSYFRKKGRETICTCVPCSGIGY 182
Query: 163 VQKITARKDIEVMEDLDNGKPP 184
V+KIT R+ E M++LDNG+PP
Sbjct: 183 VRKITYREGAENMDELDNGRPP 204
>gi|326495712|dbj|BAJ85952.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 214
Score = 166 bits (420), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 77/147 (52%), Positives = 103/147 (70%), Gaps = 14/147 (9%)
Query: 52 PDSEGTAVGGSHGGVKVNAKE--------------KKWSRNRESYLTDDSEPLPLPMTYP 97
P + G V GV+V AK+ ++WSR++ESYL DD EPLP+P+T P
Sbjct: 68 PSTSGGRVAARDPGVRVKAKDGAEERKGERKRRWWERWSRDKESYLVDDVEPLPMPLTIP 127
Query: 98 DSSPVPREEIDKRLRCDPEVQDCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPC 157
+ P+ REE+D+RL CD ++ +CK V YEWTGKCRSCQGSG V+Y+ K+GKE IC C+PC
Sbjct: 128 GTEPMSREELDRRLSCDVKIDECKMVSYEWTGKCRSCQGSGLVTYFRKKGKETICTCVPC 187
Query: 158 LGIGYVQKITARKDIEVMEDLDNGKPP 184
GIGYV+KIT R+ E +++LDNG+PP
Sbjct: 188 AGIGYVRKITYREGAENLDELDNGRPP 214
>gi|449445878|ref|XP_004140699.1| PREDICTED: protein disulfide-isomerase SCO2-like isoform 1 [Cucumis
sativus]
Length = 133
Score = 161 bits (407), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/86 (86%), Positives = 81/86 (94%)
Query: 72 EKKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQDCKEVVYEWTGKC 131
EKKWSR+RESYL DDS+ LPLPMTYPDSSPV +EID+RLRCDP+V+DCKEVVYEWTGKC
Sbjct: 2 EKKWSRDRESYLVDDSDVLPLPMTYPDSSPVSPDEIDRRLRCDPQVEDCKEVVYEWTGKC 61
Query: 132 RSCQGSGYVSYYNKRGKEIICKCIPC 157
RSCQGSGYVSYYNKRG+EI CKCIPC
Sbjct: 62 RSCQGSGYVSYYNKRGREITCKCIPC 87
>gi|215767737|dbj|BAG99965.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 198
Score = 139 bits (349), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 68/129 (52%), Positives = 85/129 (65%), Gaps = 17/129 (13%)
Query: 52 PDSEGTAVGGSHGGVKVNAKE-----------------KKWSRNRESYLTDDSEPLPLPM 94
P + G V GV+V A+E WS +RESYL DD EPLPLP+
Sbjct: 56 PSTSGGRVAARDPGVRVRAREGAEEEKKGRGRGRRRWWDWWSGDRESYLVDDVEPLPLPL 115
Query: 95 TYPDSSPVPREEIDKRLRCDPEVQDCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKC 154
T PD+ P+ REE+D+RL CD E++DCK V YEWTGKCRSCQG+G VSY+ K+G+E ICKC
Sbjct: 116 TVPDTEPMSREELDRRLSCDVEIEDCKTVSYEWTGKCRSCQGTGLVSYFRKKGRETICKC 175
Query: 155 IPCLGIGYV 163
+PC GIG +
Sbjct: 176 VPCAGIGEI 184
>gi|168050588|ref|XP_001777740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670841|gb|EDQ57402.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 348
Score = 137 bits (344), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 81/169 (47%), Positives = 100/169 (59%), Gaps = 19/169 (11%)
Query: 15 KPATTFRCFAASDIPAGPN-----SLRWPHSSSAADISIRVGPDSEGTAVGGSHGGVKVN 69
K T R AA PA PN + +WP +S+ +RVG E VK
Sbjct: 192 KQTRTHRLIAALS-PAFPNQPDLSAFQWPSTSA-----LRVGSQGE------PERQVKAG 239
Query: 70 AKEKKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQDCKEVVYEWTG 129
AKEK W ++ ES L+ DS PL LP+ YP + P ++E D + C+PEVQDCK VY+WT
Sbjct: 240 AKEK-WPKHNESALSSDS-PLSLPLAYPGTEPATQKERDAMVNCNPEVQDCKNPVYQWTA 297
Query: 130 KCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGYVQKITARKDIEVMEDL 178
KC CQG+G VS+Y KRGKE+I KCI CLGIGYV+KIT R D DL
Sbjct: 298 KCSRCQGTGEVSFYRKRGKEVISKCIACLGIGYVEKITLRADKVDESDL 346
>gi|218202278|gb|EEC84705.1| hypothetical protein OsI_31654 [Oryza sativa Indica Group]
Length = 83
Score = 133 bits (335), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 58/83 (69%), Positives = 72/83 (86%)
Query: 102 VPREEIDKRLRCDPEVQDCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIG 161
+ REE+D+RL CD E++DCK V YEWTGKCRSCQG+G VSY+ K+G+E ICKC+PC GIG
Sbjct: 1 MSREELDRRLSCDVEIEDCKTVSYEWTGKCRSCQGTGLVSYFRKKGRETICKCVPCAGIG 60
Query: 162 YVQKITARKDIEVMEDLDNGKPP 184
YV+KIT R+DIE M++LDNGKPP
Sbjct: 61 YVRKITFRQDIENMDELDNGKPP 83
>gi|302802033|ref|XP_002982772.1| hypothetical protein SELMODRAFT_59033 [Selaginella moellendorffii]
gi|302818498|ref|XP_002990922.1| hypothetical protein SELMODRAFT_48955 [Selaginella moellendorffii]
gi|300141253|gb|EFJ07966.1| hypothetical protein SELMODRAFT_48955 [Selaginella moellendorffii]
gi|300149362|gb|EFJ16017.1| hypothetical protein SELMODRAFT_59033 [Selaginella moellendorffii]
Length = 91
Score = 124 bits (312), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 57/92 (61%), Positives = 70/92 (76%), Gaps = 1/92 (1%)
Query: 70 AKEKKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQDCKEVVYEWTG 129
AK K W R ES L D E LPLPMTYP+S P+P+EE++K CDPE+++CK VVY+WTG
Sbjct: 1 AKWKWWERKNESAL-DKDEALPLPMTYPNSEPLPKEEVEKAANCDPEIENCKNVVYQWTG 59
Query: 130 KCRSCQGSGYVSYYNKRGKEIICKCIPCLGIG 161
+CR CQG+G VS+Y K GKE+I CI CLGIG
Sbjct: 60 ECRRCQGTGVVSFYRKNGKEVIGNCISCLGIG 91
>gi|168058269|ref|XP_001781132.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667450|gb|EDQ54080.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 383
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 82/221 (37%), Positives = 99/221 (44%), Gaps = 80/221 (36%)
Query: 15 KPATTFRCFAASDIPAGPNS-----LRWPHSSSAADISIRVGPDSEGTAVGGSHGGVKVN 69
K TFR AA D PA PN +WP +S P +G G VK
Sbjct: 171 KQGRTFRLIAALD-PAFPNQPDLSDFQWPSTSR---------PGGQGEIAGE----VKAG 216
Query: 70 AKEKKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQD---------- 119
AKEK W RN ES L+ DS PL LPM+YP + P ++EID + CDPE++
Sbjct: 217 AKEK-WYRNNESALSGDS-PLSLPMSYPGTKPATQKEIDAMMNCDPELERTVLTCSLELA 274
Query: 120 -----------------------CKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKCIP 156
CK+ VY+WT KC CQG+G VS+Y KRGKE+I KCI
Sbjct: 275 CVPAPEDRVGDFSAFWNVGLELYCKDPVYQWTAKCSRCQGTGEVSFYRKRGKEVISKCIA 334
Query: 157 CLGI--------------------------GYVQKITARKD 171
CLGI GYVQK+T R D
Sbjct: 335 CLGIDLAKALNPCPAVHQASHREIDDTHVQGYVQKMTLRAD 375
>gi|374922013|gb|AFA26184.1| hypothetical protein, partial [Lolium perenne]
Length = 59
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 41/59 (69%), Positives = 50/59 (84%)
Query: 126 EWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGYVQKITARKDIEVMEDLDNGKPP 184
EWTGKCRSCQGSG V+Y+ K+GKE IC C+PC GIGYV+KIT R+ E M++LDNG+PP
Sbjct: 1 EWTGKCRSCQGSGLVTYFRKKGKETICTCVPCAGIGYVRKITYREGAENMDELDNGRPP 59
>gi|159468686|ref|XP_001692505.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278218|gb|EDP03983.1| predicted protein [Chlamydomonas reinhardtii]
Length = 195
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 37/57 (64%)
Query: 113 CDPEVQDCKEVVYEWTGKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGYVQKITAR 169
CDP ++CK V+Y + KC +C G+G+V ++ + + C+ CLG+GYV+ TAR
Sbjct: 85 CDPSERNCKSVIYTYEAKCYACYGTGFVKGHSNGRRASLSMCMVCLGLGYVRHTTAR 141
>gi|307105532|gb|EFN53781.1| expressed protein [Chlorella variabilis]
Length = 218
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 62/142 (43%), Gaps = 17/142 (11%)
Query: 44 ADISIRVGPD------SEGTAVGGSHGGVKVNAKEKKWSRNRESYLTDDSEPLP------ 91
A + GPD EG+A G+ G + + + + + + LP
Sbjct: 36 ARVQAAAGPDLGADRPQEGSAAAGAPGRPQQASSDDQPPPQQHINGNGRVQHLPSSLYAN 95
Query: 92 LPMTYPDSSPVPREEIDKRLRCDPEVQDCKEVVYEWTGKCRSCQGSGYVSYYNKRGKE-I 150
P PD S + EE +C+PEV+ C ++ W +C +C G+G Y G+ +
Sbjct: 96 FPKLPPDGSTLTPEEA----KCNPEVERCHTPIHVWEARCTACDGTGTARSYTSHGRHRL 151
Query: 151 ICKCIPCLGIGYVQKITARKDI 172
+C C+ C G+GYV+ T +
Sbjct: 152 VCVCLLCHGLGYVRHSTTHQHT 173
>gi|196017900|ref|XP_002118676.1| predicted protein [Trichoplax adhaerens]
gi|190578488|gb|EDV18838.1| predicted protein [Trichoplax adhaerens]
Length = 1989
Score = 37.7 bits (86), Expect = 2.2, Method: Composition-based stats.
Identities = 26/87 (29%), Positives = 40/87 (45%), Gaps = 5/87 (5%)
Query: 67 KVNAKEKKWSRNRESYLTDDSEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQDCKEVVYE 126
++ + KK+ R E ++ D +P PM + DS E + D + K VV E
Sbjct: 1534 ELGKRSKKYLRELEEFIITDQSNIPQPMIFDDSKS-ETESLSTVEDSDIKKLKNKNVVIE 1592
Query: 127 WTGKCRSCQGSGYVSYYNKRGKEIICK 153
+T + +GS YV+ R EII K
Sbjct: 1593 FTNDLQQSKGSDYVA----RAFEIIIK 1615
>gi|145346982|ref|XP_001417959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578187|gb|ABO96252.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 187
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 41/91 (45%), Gaps = 10/91 (10%)
Query: 89 PLPLPMTYPDSSPVPRE--EIDKRLRCDPEVQDCKEVVYE-----WTGKCRSCQGSGYVS 141
PL LP+TY + P P + +C+P ++ + + +C C G+ V
Sbjct: 42 PLGLPITYHVAPPRPSDGKSTFGDAKCEPSAEELDTCTIQQSTEVYDRRCGKCGGNKLVM 101
Query: 142 YYNKRG---KEIICKCIPCLGIGYVQKITAR 169
+RG +E I C C G GYV+ ++R
Sbjct: 102 STTRRGGRHQETISTCQACGGTGYVRVASSR 132
>gi|440634918|gb|ELR04837.1| hypothetical protein GMDG_07062 [Geomyces destructans 20631-21]
Length = 729
Score = 36.2 bits (82), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 21/27 (77%)
Query: 77 RNRESYLTDDSEPLPLPMTYPDSSPVP 103
R+RESY +D++P P M+YP +SPVP
Sbjct: 140 RHRESYSQNDAQPQPQHMSYPQASPVP 166
>gi|385790282|ref|YP_005821405.1| daunorubicin resistance protein [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|302327344|gb|ADL26545.1| daunorubicin resistance protein [Fibrobacter succinogenes subsp.
succinogenes S85]
Length = 708
Score = 35.8 bits (81), Expect = 7.4, Method: Composition-based stats.
Identities = 13/34 (38%), Positives = 18/34 (52%)
Query: 129 GKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGY 162
G C C GSG + Y R +E+ C C+G G+
Sbjct: 532 GACPECSGSGVIIYNQNRNEEVSFVCAKCMGTGF 565
>gi|261415357|ref|YP_003249040.1| excinuclease ATPase subunit-like protein [Fibrobacter succinogenes
subsp. succinogenes S85]
gi|261371813|gb|ACX74558.1| Excinuclease ATPase subunit-like protein [Fibrobacter succinogenes
subsp. succinogenes S85]
Length = 708
Score = 35.8 bits (81), Expect = 7.4, Method: Composition-based stats.
Identities = 13/34 (38%), Positives = 18/34 (52%)
Query: 129 GKCRSCQGSGYVSYYNKRGKEIICKCIPCLGIGY 162
G C C GSG + Y R +E+ C C+G G+
Sbjct: 532 GACPECSGSGVIIYNQNRNEEVSFVCAKCMGTGF 565
>gi|347971568|ref|XP_562787.4| AGAP004259-PA [Anopheles gambiae str. PEST]
gi|333468726|gb|EAL40682.4| AGAP004259-PA [Anopheles gambiae str. PEST]
Length = 493
Score = 35.8 bits (81), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 40/84 (47%), Gaps = 2/84 (2%)
Query: 87 SEPLPLPMTYPDSSPVPREEIDKRLRCDPEVQDCKEV-VYEWTGKCRSCQGSGYVSYYNK 145
S+ L MT+ D P E + R +C ++D + V E T KCR C G G+ +Y ++
Sbjct: 352 SDLLSHQMTHSDERPYGCGECEMRFKCRSHLRDHQTVHTGERTKKCRIC-GKGFGTYGDR 410
Query: 146 RGKEIICKCIPCLGIGYVQKITAR 169
R E+ + + GY K R
Sbjct: 411 RVHELRHENVHPFKCGYCSKTYGR 434
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.133 0.419
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,193,535,270
Number of Sequences: 23463169
Number of extensions: 138492827
Number of successful extensions: 296991
Number of sequences better than 100.0: 45
Number of HSP's better than 100.0 without gapping: 30
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 296932
Number of HSP's gapped (non-prelim): 52
length of query: 184
length of database: 8,064,228,071
effective HSP length: 133
effective length of query: 51
effective length of database: 9,238,593,890
effective search space: 471168288390
effective search space used: 471168288390
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 72 (32.3 bits)