BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 033298
(122 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224129182|ref|XP_002320521.1| predicted protein [Populus trichocarpa]
gi|222861294|gb|EEE98836.1| predicted protein [Populus trichocarpa]
Length = 118
Score = 160 bits (405), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 85/124 (68%), Positives = 101/124 (81%), Gaps = 8/124 (6%)
Query: 1 MAARVALYSSLSGK--ASTWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEV 58
MAA+V L SSL+ + PSS+S FT LSSP K R H FK++ANLG DGE+
Sbjct: 1 MAAKVVLSSSLNTHFPVPSKPSSFSSVVFT----LSSPKK-RVH-HFKIHANLGEGDGEL 54
Query: 59 KPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIK 118
KPKGK+KF+TRE+EP+++WQ+AGEREGENPMKTP+PYIIIFGMSTPFVILAIAFANGWIK
Sbjct: 55 KPKGKKKFITREQEPEQYWQSAGEREGENPMKTPLPYIIIFGMSTPFVILAIAFANGWIK 114
Query: 119 MPVR 122
+PVR
Sbjct: 115 VPVR 118
>gi|255578705|ref|XP_002530211.1| conserved hypothetical protein [Ricinus communis]
gi|223530258|gb|EEF32158.1| conserved hypothetical protein [Ricinus communis]
Length = 117
Score = 151 bits (381), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 77/122 (63%), Positives = 96/122 (78%), Gaps = 5/122 (4%)
Query: 1 MAARVALYSSLSGKASTWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKP 60
MAA+ AL SSL + P+ +P + +S P+ R +FK+ A +GG DGE+KP
Sbjct: 1 MAAKPALSSSLISQIPVLPNFSPSSPLSCTFST---PRKRVQ-KFKINA-IGGGDGELKP 55
Query: 61 KGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKMP 120
KGK+KF+TRE+EP+++WQTAGEREGENPMKTP+PYIIIFGMSTPFVILAIAFANGWIK+P
Sbjct: 56 KGKKKFITREQEPEQYWQTAGEREGENPMKTPLPYIIIFGMSTPFVILAIAFANGWIKVP 115
Query: 121 VR 122
VR
Sbjct: 116 VR 117
>gi|21593859|gb|AAM65826.1| unknown [Arabidopsis thaliana]
Length = 125
Score = 150 bits (380), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 67/88 (76%), Positives = 79/88 (89%)
Query: 35 SPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIP 94
PP P QFK++A LGGRDGEVKPK K+KF+T+EEEP+++WQ+ GEREGENPMKTP+P
Sbjct: 38 CPPSPLGFPQFKLHAKLGGRDGEVKPKDKKKFITKEEEPEQYWQSVGEREGENPMKTPLP 97
Query: 95 YIIIFGMSTPFVILAIAFANGWIKMPVR 122
YIIIFGMSTPFVILAIAFANGWIK+P+R
Sbjct: 98 YIIIFGMSTPFVILAIAFANGWIKVPIR 125
>gi|225447539|ref|XP_002268250.1| PREDICTED: uncharacterized protein LOC100264834 [Vitis vinifera]
gi|296085028|emb|CBI28443.3| unnamed protein product [Vitis vinifera]
Length = 118
Score = 150 bits (380), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 77/122 (63%), Positives = 92/122 (75%), Gaps = 4/122 (3%)
Query: 1 MAARVALYSSLSGKASTWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKP 60
MAA+VAL S + S S + T + PK RAH F ++A LGG DGE+K
Sbjct: 1 MAAKVALSSCFCTRLPLPSKSSSPSSLTHT---PTSPKKRAH-HFSIHAKLGGGDGEIKQ 56
Query: 61 KGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKMP 120
GK+KF+TREEEP+++WQTAGEREGENPMKTP+PYIIIFGMSTPFVILAIAFANGW+K+P
Sbjct: 57 GGKKKFITREEEPEQYWQTAGEREGENPMKTPLPYIIIFGMSTPFVILAIAFANGWVKVP 116
Query: 121 VR 122
VR
Sbjct: 117 VR 118
>gi|388514561|gb|AFK45342.1| unknown [Lotus japonicus]
Length = 119
Score = 147 bits (372), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 77/122 (63%), Positives = 92/122 (75%), Gaps = 3/122 (2%)
Query: 1 MAARVALYSSLSGKASTWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKP 60
MA RV L S L+ K S P S S +P + +SL SP K A +V+A +GG D + K
Sbjct: 1 MATRVTLSSPLAFKTSLLPKSPSLSPVS--FSLGSP-KTNAVVGVRVHAKIGGSDEDTKK 57
Query: 61 KGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKMP 120
K+KF+TR+EEP ++WQTAGEREGENPMKTP+PYIIIFGMSTPFVILAIAFANGWIK+P
Sbjct: 58 GEKKKFITRDEEPQQYWQTAGEREGENPMKTPLPYIIIFGMSTPFVILAIAFANGWIKVP 117
Query: 121 VR 122
VR
Sbjct: 118 VR 119
>gi|18396025|ref|NP_565324.1| uncharacterized protein [Arabidopsis thaliana]
gi|4755200|gb|AAD29067.1| expressed protein [Arabidopsis thaliana]
gi|330250821|gb|AEC05915.1| uncharacterized protein [Arabidopsis thaliana]
Length = 125
Score = 146 bits (369), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 65/88 (73%), Positives = 78/88 (88%)
Query: 35 SPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIP 94
PP P QFK++A LGG DGEVKPK K+KF+T++EEP+++WQ+ GEREGENPMKTP+P
Sbjct: 38 CPPSPLGFPQFKLHAKLGGGDGEVKPKDKKKFITKDEEPEQYWQSVGEREGENPMKTPLP 97
Query: 95 YIIIFGMSTPFVILAIAFANGWIKMPVR 122
YIIIFGMSTPFVILAIAFANGWIK+P+R
Sbjct: 98 YIIIFGMSTPFVILAIAFANGWIKVPIR 125
>gi|356501745|ref|XP_003519684.1| PREDICTED: uncharacterized protein LOC100805720 [Glycine max]
Length = 115
Score = 145 bits (366), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 79/122 (64%), Positives = 91/122 (74%), Gaps = 7/122 (5%)
Query: 1 MAARVALYSSLSGKASTWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKP 60
MAA++ L S LS K S P S S +P L SP + G KV+A LGG D + K
Sbjct: 1 MAAKLTLSSPLSFKTSFLPKSPSFSP-----GLCSP-RTNVTG-VKVHAKLGGGDEQAKN 53
Query: 61 KGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKMP 120
GK+KF+TREEEP ++WQTAGEREGENPMKTP+PYIIIFGMSTPFVILAIAFANGWIK+P
Sbjct: 54 GGKKKFITREEEPQQYWQTAGEREGENPMKTPLPYIIIFGMSTPFVILAIAFANGWIKVP 113
Query: 121 VR 122
VR
Sbjct: 114 VR 115
>gi|15236299|ref|NP_193086.1| uncharacterized protein [Arabidopsis thaliana]
gi|4678376|emb|CAB41108.1| putative protein [Arabidopsis thaliana]
gi|7268053|emb|CAB78392.1| putative protein [Arabidopsis thaliana]
gi|109946535|gb|ABG48446.1| At4g13500 [Arabidopsis thaliana]
gi|332657886|gb|AEE83286.1| uncharacterized protein [Arabidopsis thaliana]
Length = 125
Score = 145 bits (365), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 65/88 (73%), Positives = 78/88 (88%)
Query: 35 SPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIP 94
PP P +FK++A LGG DGEVKPK K+KF+T+EEEP+++WQ+ GEREGENPMKTP+P
Sbjct: 38 CPPSPLGFPEFKLHAKLGGGDGEVKPKDKKKFITKEEEPEQYWQSVGEREGENPMKTPLP 97
Query: 95 YIIIFGMSTPFVILAIAFANGWIKMPVR 122
YIIIFGMSTPFVILAIAFANGWIK+P+R
Sbjct: 98 YIIIFGMSTPFVILAIAFANGWIKVPIR 125
>gi|27754732|gb|AAO22809.1| unknown protein [Arabidopsis thaliana]
Length = 119
Score = 144 bits (364), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 65/88 (73%), Positives = 78/88 (88%)
Query: 35 SPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIP 94
PP P +FK++A LGG DGEVKPK K+KF+T+EEEP+++WQ+ GEREGENPMKTP+P
Sbjct: 32 CPPSPLGFPEFKLHAKLGGGDGEVKPKDKKKFITKEEEPEQYWQSVGEREGENPMKTPLP 91
Query: 95 YIIIFGMSTPFVILAIAFANGWIKMPVR 122
YIIIFGMSTPFVILAIAFANGWIK+P+R
Sbjct: 92 YIIIFGMSTPFVILAIAFANGWIKVPIR 119
>gi|297835826|ref|XP_002885795.1| hypothetical protein ARALYDRAFT_480141 [Arabidopsis lyrata subsp.
lyrata]
gi|297331635|gb|EFH62054.1| hypothetical protein ARALYDRAFT_480141 [Arabidopsis lyrata subsp.
lyrata]
Length = 125
Score = 144 bits (364), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 65/88 (73%), Positives = 77/88 (87%)
Query: 35 SPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIP 94
P P QFK++A LGG DGEVKPK K+KF+T+EEEP+++WQ+ GEREGENPMKTP+P
Sbjct: 38 CPQSPLRFPQFKLHAKLGGGDGEVKPKDKKKFITKEEEPEQYWQSVGEREGENPMKTPLP 97
Query: 95 YIIIFGMSTPFVILAIAFANGWIKMPVR 122
YIIIFGMSTPFVILAIAFANGWIK+P+R
Sbjct: 98 YIIIFGMSTPFVILAIAFANGWIKVPIR 125
>gi|21618132|gb|AAM67182.1| unknown [Arabidopsis thaliana]
Length = 125
Score = 143 bits (360), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 64/88 (72%), Positives = 78/88 (88%)
Query: 35 SPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIP 94
PP P +FK++A LGG DGEVKPK K+KF+T+EEEP+++WQ+ GEREGENPMKTP+P
Sbjct: 38 CPPSPLGFPKFKLHAKLGGGDGEVKPKDKKKFITKEEEPEQYWQSVGEREGENPMKTPLP 97
Query: 95 YIIIFGMSTPFVILAIAFANGWIKMPVR 122
YIIIFG+STPFVILAIAFANGWIK+P+R
Sbjct: 98 YIIIFGISTPFVILAIAFANGWIKVPIR 125
>gi|356554217|ref|XP_003545445.1| PREDICTED: uncharacterized protein LOC100810445 [Glycine max]
Length = 126
Score = 141 bits (355), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 77/123 (62%), Positives = 89/123 (72%), Gaps = 9/123 (7%)
Query: 1 MAARVALYSSLSGKASTWPSSWSHNPFTQQYSLS-SPPKPRAHGQFKVYANLGGRDGEVK 59
MAA++ L S S K S P S S +SL P+ G KV+A LGG D + K
Sbjct: 12 MAAKLTLSSPFSFKTSFLPKSPS-------FSLGLYSPRTNVTG-VKVHAKLGGGDEQAK 63
Query: 60 PKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKM 119
GK+KF+TREEEP ++WQTAGEREGENPMKTP+PYIIIFGMSTPFVILAIAFANGWIK+
Sbjct: 64 KGGKKKFITREEEPQQYWQTAGEREGENPMKTPLPYIIIFGMSTPFVILAIAFANGWIKV 123
Query: 120 PVR 122
PVR
Sbjct: 124 PVR 126
>gi|102140020|gb|ABF70151.1| hypothetical protein MA4_112I10.24 [Musa acuminata]
Length = 113
Score = 139 bits (349), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 63/77 (81%), Positives = 73/77 (94%), Gaps = 1/77 (1%)
Query: 46 KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPF 105
++YANLGG DGE KP K+KF+TREEEPD++WQTAGER+GENPMKTP+PYIIIFGMSTPF
Sbjct: 38 RIYANLGGADGEAKP-AKKKFITREEEPDQYWQTAGERKGENPMKTPLPYIIIFGMSTPF 96
Query: 106 VILAIAFANGWIKMPVR 122
VILAIAFANGWIKMP++
Sbjct: 97 VILAIAFANGWIKMPMK 113
>gi|242033753|ref|XP_002464271.1| hypothetical protein SORBIDRAFT_01g015270 [Sorghum bicolor]
gi|241918125|gb|EER91269.1| hypothetical protein SORBIDRAFT_01g015270 [Sorghum bicolor]
Length = 132
Score = 137 bits (344), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 62/77 (80%), Positives = 72/77 (93%), Gaps = 1/77 (1%)
Query: 46 KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPF 105
++ A LGG DGE+KP GK+KF+TREEEP+++WQTAGEREGENPM TP+PYIIIFGMSTPF
Sbjct: 57 RIIAKLGG-DGELKPPGKKKFITREEEPEQYWQTAGEREGENPMMTPLPYIIIFGMSTPF 115
Query: 106 VILAIAFANGWIKMPVR 122
VILAIAFANGWIK+PVR
Sbjct: 116 VILAIAFANGWIKVPVR 132
>gi|414871661|tpg|DAA50218.1| TPA: hypothetical protein ZEAMMB73_766061 [Zea mays]
Length = 125
Score = 134 bits (337), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 62/77 (80%), Positives = 71/77 (92%), Gaps = 2/77 (2%)
Query: 46 KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPF 105
+V A LG DGE+KP GKRKF+T+EEEP+++WQTAGEREGENPM TP+PYIIIFGMSTPF
Sbjct: 51 RVTAKLG--DGELKPAGKRKFITKEEEPEQYWQTAGEREGENPMMTPLPYIIIFGMSTPF 108
Query: 106 VILAIAFANGWIKMPVR 122
VILAIAFANGWIK+PVR
Sbjct: 109 VILAIAFANGWIKVPVR 125
>gi|226501594|ref|NP_001142893.1| uncharacterized protein LOC100275314 [Zea mays]
gi|195611082|gb|ACG27371.1| hypothetical protein [Zea mays]
gi|195648823|gb|ACG43879.1| hypothetical protein [Zea mays]
Length = 119
Score = 134 bits (337), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 62/77 (80%), Positives = 71/77 (92%), Gaps = 2/77 (2%)
Query: 46 KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPF 105
+V A LG DGE+KP GKRKF+T+EEEP+++WQTAGEREGENPM TP+PYIIIFGMSTPF
Sbjct: 45 RVTAKLG--DGELKPAGKRKFITKEEEPEQYWQTAGEREGENPMMTPLPYIIIFGMSTPF 102
Query: 106 VILAIAFANGWIKMPVR 122
VILAIAFANGWIK+PVR
Sbjct: 103 VILAIAFANGWIKVPVR 119
>gi|357121221|ref|XP_003562319.1| PREDICTED: uncharacterized protein LOC100841706 [Brachypodium
distachyon]
Length = 119
Score = 133 bits (335), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 61/95 (64%), Positives = 76/95 (80%), Gaps = 4/95 (4%)
Query: 32 SLSSPPKPRAHGQF----KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGEN 87
+L++ P P G ++ A LGG DGE K GK+KF+T+E+EP+++WQTAGER+GEN
Sbjct: 25 ALATLPLPSKKGTISRRSRISAQLGGGDGETKSDGKKKFITKEQEPEQYWQTAGERKGEN 84
Query: 88 PMKTPIPYIIIFGMSTPFVILAIAFANGWIKMPVR 122
PM TP+PYIIIFGMSTPFVILAIAFANGWIK P+R
Sbjct: 85 PMMTPLPYIIIFGMSTPFVILAIAFANGWIKAPIR 119
>gi|115453991|ref|NP_001050596.1| Os03g0595300 [Oryza sativa Japonica Group]
gi|28269464|gb|AAO38007.1| unknown protein [Oryza sativa Japonica Group]
gi|108709639|gb|ABF97434.1| expressed protein [Oryza sativa Japonica Group]
gi|113549067|dbj|BAF12510.1| Os03g0595300 [Oryza sativa Japonica Group]
gi|215686889|dbj|BAG89739.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218193251|gb|EEC75678.1| hypothetical protein OsI_12483 [Oryza sativa Indica Group]
gi|222625313|gb|EEE59445.1| hypothetical protein OsJ_11626 [Oryza sativa Japonica Group]
Length = 118
Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 63/78 (80%), Positives = 73/78 (93%), Gaps = 2/78 (2%)
Query: 46 KVYANLGGRDGEVKPKG-KRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTP 104
++ ANLGG DGEVKP G K+KF+TREEEP+++WQ AGER+GENPMKTP+PYIIIFGMSTP
Sbjct: 42 RISANLGG-DGEVKPAGNKKKFITREEEPEQYWQAAGERKGENPMKTPLPYIIIFGMSTP 100
Query: 105 FVILAIAFANGWIKMPVR 122
FVILAIAFANGWIK+PVR
Sbjct: 101 FVILAIAFANGWIKVPVR 118
>gi|449444478|ref|XP_004140001.1| PREDICTED: uncharacterized protein LOC101219556 [Cucumis sativus]
gi|449475630|ref|XP_004154506.1| PREDICTED: uncharacterized protein LOC101231513 [Cucumis sativus]
Length = 123
Score = 132 bits (332), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 63/85 (74%), Positives = 76/85 (89%)
Query: 38 KPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYII 97
+P +FK+ ANLGG D E+K GK+KF+T+EEEP+++WQTAGEREGENPMKTP+PYII
Sbjct: 39 RPTPSSRFKLLANLGGGDAEIKKGGKKKFITKEEEPEQYWQTAGEREGENPMKTPLPYII 98
Query: 98 IFGMSTPFVILAIAFANGWIKMPVR 122
IFGMSTPFVILAIAFANGW+K+PVR
Sbjct: 99 IFGMSTPFVILAIAFANGWVKVPVR 123
>gi|357495537|ref|XP_003618057.1| hypothetical protein MTR_5g098530 [Medicago truncatula]
gi|355519392|gb|AET01016.1| hypothetical protein MTR_5g098530 [Medicago truncatula]
Length = 118
Score = 120 bits (301), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/92 (65%), Positives = 75/92 (81%), Gaps = 2/92 (2%)
Query: 31 YSLSSPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMK 90
++++SP R KV A LGGRD E K K+KF+TRE+EP+++WQ+AGEREGENPM
Sbjct: 29 HTIASPK--RISLGVKVCAQLGGRDEETKKGEKKKFITREQEPEQYWQSAGEREGENPMM 86
Query: 91 TPIPYIIIFGMSTPFVILAIAFANGWIKMPVR 122
TP+PYII+FGMSTPFVIL IAFANGWIK+P+R
Sbjct: 87 TPLPYIILFGMSTPFVILGIAFANGWIKVPIR 118
>gi|326507980|dbj|BAJ86733.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525657|dbj|BAJ88875.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 120
Score = 108 bits (270), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 48/62 (77%), Positives = 57/62 (91%), Gaps = 1/62 (1%)
Query: 57 EVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGW 116
E KP GK KF+TRE+EP+++WQTAGER+GENPM TP+PYI+IFG STPF+ILAIAFANGW
Sbjct: 55 ETKPDGK-KFITREQEPEQYWQTAGERKGENPMMTPLPYIVIFGFSTPFIILAIAFANGW 113
Query: 117 IK 118
IK
Sbjct: 114 IK 115
>gi|302756923|ref|XP_002961885.1| hypothetical protein SELMODRAFT_69660 [Selaginella moellendorffii]
gi|302817145|ref|XP_002990249.1| hypothetical protein SELMODRAFT_49680 [Selaginella moellendorffii]
gi|300141958|gb|EFJ08664.1| hypothetical protein SELMODRAFT_49680 [Selaginella moellendorffii]
gi|300170544|gb|EFJ37145.1| hypothetical protein SELMODRAFT_69660 [Selaginella moellendorffii]
Length = 70
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/70 (68%), Positives = 57/70 (81%)
Query: 53 GRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAF 112
G DG K KRKF+T+EEEP+E+WQ+A EREG PM T +PYI+IFG STPFVILA+ F
Sbjct: 1 GGDGGSSSKEKRKFITKEEEPEEYWQSAAEREGRGPMSTLLPYIVIFGFSTPFVILALGF 60
Query: 113 ANGWIKMPVR 122
ANGWIK+PVR
Sbjct: 61 ANGWIKIPVR 70
>gi|148907351|gb|ABR16811.1| unknown [Picea sitchensis]
gi|148907976|gb|ABR17108.1| unknown [Picea sitchensis]
Length = 154
Score = 103 bits (256), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 42/64 (65%), Positives = 55/64 (85%)
Query: 59 KPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIK 118
K K+KF+T+E+EP+++WQTA ER+G NPM TP+PYI+IFG TPF+IL +AFANGWIK
Sbjct: 91 KDTDKKKFITKEQEPEQYWQTAAERDGVNPMLTPLPYIVIFGFLTPFIILGVAFANGWIK 150
Query: 119 MPVR 122
+PVR
Sbjct: 151 VPVR 154
>gi|116782086|gb|ABK22363.1| unknown [Picea sitchensis]
Length = 154
Score = 103 bits (256), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 42/64 (65%), Positives = 55/64 (85%)
Query: 59 KPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIK 118
K K+KF+T+E+EP+++WQTA ER+G NPM TP+PYI+IFG TPF+IL +AFANGWIK
Sbjct: 91 KDTDKKKFITKEQEPEQYWQTAAERDGVNPMLTPLPYIVIFGFLTPFIILGVAFANGWIK 150
Query: 119 MPVR 122
+PVR
Sbjct: 151 VPVR 154
>gi|168042685|ref|XP_001773818.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674933|gb|EDQ61435.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 69
Score = 87.4 bits (215), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 55/70 (78%), Gaps = 2/70 (2%)
Query: 51 LGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAI 110
+GG + KP+ +KF+T+E+EPD++W TA EREG+NPM+T +PYI + + +PF ILA+
Sbjct: 1 VGGGADDKKPE--KKFITKEQEPDQYWTTAAEREGKNPMQTLLPYIALLTILSPFAILAV 58
Query: 111 AFANGWIKMP 120
A+AN WIK+P
Sbjct: 59 AYANNWIKLP 68
>gi|302837085|ref|XP_002950102.1| hypothetical protein VOLCADRAFT_104607 [Volvox carteri f.
nagariensis]
gi|300264575|gb|EFJ48770.1| hypothetical protein VOLCADRAFT_104607 [Volvox carteri f.
nagariensis]
Length = 111
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 56/90 (62%), Gaps = 4/90 (4%)
Query: 33 LSSPPKPRAHGQFKVYANLGGRDGEVKPKG-KRKFVTREEEPDEFWQTAGEREGENPMKT 91
+S P + R H + L G DG+ G K+KF+TREEEP+++W + GE++G NP+K
Sbjct: 23 VSQPARSRRHLCTRA---LLGPDGKPGADGAKKKFITREEEPEQYWSSKGEQDGANPLKD 79
Query: 92 PIPYIIIFGMSTPFVILAIAFANGWIKMPV 121
P+ I I + PF+ LA+A A G+I + V
Sbjct: 80 PLAIIGILAIFFPFIFLAVAVAAGYIDLSV 109
>gi|159474878|ref|XP_001695550.1| hypothetical protein CHLREDRAFT_163712 [Chlamydomonas reinhardtii]
gi|158275561|gb|EDP01337.1| predicted protein [Chlamydomonas reinhardtii]
Length = 110
Score = 67.8 bits (164), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 58/103 (56%), Gaps = 2/103 (1%)
Query: 19 PSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQ 78
P+S S Q ++ P+ A +V A LG DG+ +KF++RE+EP+++W
Sbjct: 8 PASSSVARRLQTSQVAVAPRV-ARRSLQVRALLG-PDGKPGTTSGKKFISREQEPEQYWS 65
Query: 79 TAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKMPV 121
+ GE EG NPMK P+ I + + PF+ L IA A G+I + V
Sbjct: 66 SKGEAEGGNPMKDPLAIIGVLAIFFPFIFLGIAVAAGYIDLSV 108
>gi|74272653|gb|ABA01122.1| unknown [Chlamydomonas incerta]
Length = 108
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 45/71 (63%)
Query: 51 LGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAI 110
L G DG+ +KF++RE+EP+++W + GE EG NPMK P+ I + + PF+ L I
Sbjct: 36 LLGPDGKPGTTSGKKFISREQEPEQYWSSKGEAEGANPMKDPLAIIGVVAILFPFIFLGI 95
Query: 111 AFANGWIKMPV 121
A A G+I + V
Sbjct: 96 AIAAGFIDVSV 106
>gi|303282113|ref|XP_003060348.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457819|gb|EEH55117.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 116
Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 36/55 (65%), Gaps = 1/55 (1%)
Query: 63 KRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWI 117
K+ + RE E E W + GEREGENP+K P+ I G+ PF+ILA+A A G+I
Sbjct: 61 KKPAILRENE-KEAWLSKGEREGENPLKDPMAMFAIGGIMVPFIILAVAGAAGYI 114
>gi|145345393|ref|XP_001417197.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577424|gb|ABO95490.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 105
Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 46/83 (55%), Gaps = 8/83 (9%)
Query: 40 RAHGQFKVYANLGGRDG---EVKPKGKR--KFVTREEEPDEFWQTAGEREGENPMKTPIP 94
RA G +V GRDG GK K +TRE E E W + EREG NP+K P+
Sbjct: 24 RARGVARVTKR--GRDGVRARATQDGKNGPKKLTRENEK-EAWLSEMEREGANPLKDPMA 80
Query: 95 YIIIFGMSTPFVILAIAFANGWI 117
I I G+ PF+ILAIA A G+I
Sbjct: 81 LIGIGGIMVPFIILAIASAAGFI 103
>gi|308802734|ref|XP_003078680.1| unnamed protein product [Ostreococcus tauri]
gi|116057133|emb|CAL51560.1| unnamed protein product [Ostreococcus tauri]
Length = 98
Score = 55.1 bits (131), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/51 (56%), Positives = 34/51 (66%), Gaps = 1/51 (1%)
Query: 67 VTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWI 117
+TRE E E W + EREG NPMK P+ I I G+ PFVILAIA A G+I
Sbjct: 47 ITRENEK-EAWLSEMEREGANPMKDPMAMIGIGGLMVPFVILAIASAAGFI 96
>gi|307108039|gb|EFN56280.1| hypothetical protein CHLNCDRAFT_145155 [Chlorella variabilis]
Length = 90
Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 35/53 (66%)
Query: 67 VTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKM 119
+ RE+EP+E+W + ER+G++P + P+ I + + PF+ LA+ A GW+ +
Sbjct: 34 LRREDEPEEYWTSQSERDGKSPFQDPLAQIGLLAIFFPFIFLAVGIAVGWVDL 86
>gi|255085676|ref|XP_002505269.1| hypothetical protein MICPUN_62606 [Micromonas sp. RCC299]
gi|226520538|gb|ACO66527.1| hypothetical protein MICPUN_62606 [Micromonas sp. RCC299]
Length = 115
Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 43/78 (55%), Gaps = 11/78 (14%)
Query: 40 RAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIF 99
RA F+V A E KPK + RE E E W + EREG NP K P+ + I
Sbjct: 47 RASRGFRVVAM-----AEKKPK-----ILRENE-KEAWLSEMEREGANPFKDPMAIVGIL 95
Query: 100 GMSTPFVILAIAFANGWI 117
G++ PFVILA+A A G+I
Sbjct: 96 GITFPFVILAVAGAAGYI 113
>gi|388503774|gb|AFK39953.1| unknown [Lotus japonicus]
Length = 72
Score = 52.0 bits (123), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/75 (42%), Positives = 44/75 (58%), Gaps = 3/75 (4%)
Query: 1 MAARVALYSSLSGKASTWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKP 60
MA RV L S L+ K S P S S +P + +SL SP K A +V+ +GG D + K
Sbjct: 1 MATRVTLSSPLAFKTSLLPKSPSLSPVS--FSLGSP-KTNAVVGVRVHVKIGGSDEDTKK 57
Query: 61 KGKRKFVTREEEPDE 75
K+KF+TR+EEP +
Sbjct: 58 GEKKKFITRDEEPQQ 72
>gi|298385038|ref|ZP_06994597.1| chitobiase [Bacteroides sp. 1_1_14]
gi|298262182|gb|EFI05047.1| chitobiase [Bacteroides sp. 1_1_14]
Length = 381
Score = 36.6 bits (83), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 43/95 (45%), Gaps = 12/95 (12%)
Query: 27 FTQQYSLSSPPKPRAHGQF--------KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQ 78
F Y++S H Q+ V AN GE G ++ + + + + FW
Sbjct: 214 FPTYYTVSKEEWDEIHNQYIDVDRTGWGVEANTEELTGEGAVNGHKEALI-DGDLNTFWH 272
Query: 79 TAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFA 113
+ + EG+NP P+P+IIIF M IL+I A
Sbjct: 273 SQWDGEGKNP---PLPHIIIFDMQQTQNILSIELA 304
>gi|29349540|ref|NP_813043.1| chitobiase [Bacteroides thetaiotaomicron VPI-5482]
gi|29341449|gb|AAO79237.1| putative chitobiase [Bacteroides thetaiotaomicron VPI-5482]
Length = 392
Score = 35.8 bits (81), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 42/95 (44%), Gaps = 12/95 (12%)
Query: 27 FTQQYSLSSPPKPRAHGQF--------KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQ 78
F Y++S H Q+ V AN GE G ++ + + + FW
Sbjct: 225 FPTYYTVSKEEWDEIHNQYIDVDRTGWGVEANTEELTGEGAVNGHKEALI-DGNLNTFWH 283
Query: 79 TAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFA 113
+ + EG+NP P+P+IIIF M IL+I A
Sbjct: 284 SQWDGEGKNP---PLPHIIIFDMQQTQNILSIELA 315
>gi|261334242|emb|CBH17236.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 1650
Score = 35.4 bits (80), Expect = 3.8, Method: Composition-based stats.
Identities = 29/83 (34%), Positives = 34/83 (40%), Gaps = 16/83 (19%)
Query: 23 SHNPFTQQYSLSSPP---------KPRAHGQFKVYANLGGRDGEV-KPKGKRKFVTREEE 72
SH+ F SS P K H N GG D + +P KRKF T EE
Sbjct: 356 SHSQFAGSEQASSSPMFDRSTIPIKEVEHESTLAEFNQGGGDDLISRPHKKRKFETGEES 415
Query: 73 P-----DEFWQT-AGEREGENPM 89
P +FWQ G+R G PM
Sbjct: 416 PRFRKLPDFWQPRRGDRVGRGPM 438
>gi|453087072|gb|EMF15113.1| Bystin-domain-containing protein [Mycosphaerella populorum
SO2202]
Length = 507
Score = 35.4 bits (80), Expect = 3.9, Method: Composition-based stats.
Identities = 21/73 (28%), Positives = 36/73 (49%), Gaps = 2/73 (2%)
Query: 16 STWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDE 75
ST HNP +++YS ++P K +A + K + G+DG V K R+ + + D
Sbjct: 9 STARQERRHNPLSEEYSPTTPLKQKAGKKRKSTSEANGQDGYVDSKSSRRILDLGQ--DL 66
Query: 76 FWQTAGEREGENP 88
+ ER+ + P
Sbjct: 67 VDEDEAERQAKRP 79
>gi|71754973|ref|XP_828401.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70833787|gb|EAN79289.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 1650
Score = 35.4 bits (80), Expect = 4.3, Method: Composition-based stats.
Identities = 29/83 (34%), Positives = 34/83 (40%), Gaps = 16/83 (19%)
Query: 23 SHNPFTQQYSLSSPP---------KPRAHGQFKVYANLGGRDGEV-KPKGKRKFVTREEE 72
SH+ F SS P K H N GG D + +P KRKF T EE
Sbjct: 356 SHSQFAGSEQASSSPMFDRSTIPIKEVEHESALAEFNQGGGDDLISRPHKKRKFETGEES 415
Query: 73 P-----DEFWQT-AGEREGENPM 89
P +FWQ G+R G PM
Sbjct: 416 PRFRKLPDFWQPRRGDRVGRGPM 438
>gi|357495539|ref|XP_003618058.1| hypothetical protein MTR_5g098530 [Medicago truncatula]
gi|355519393|gb|AET01017.1| hypothetical protein MTR_5g098530 [Medicago truncatula]
Length = 75
Score = 34.3 bits (77), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 23/32 (71%)
Query: 46 KVYANLGGRDGEVKPKGKRKFVTREEEPDEFW 77
KV A LGGRD E K K+KF+TRE+EP+ +
Sbjct: 42 KVCAQLGGRDEETKKGEKKKFITREQEPEHVF 73
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.133 0.421
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,196,269,222
Number of Sequences: 23463169
Number of extensions: 92450149
Number of successful extensions: 202643
Number of sequences better than 100.0: 54
Number of HSP's better than 100.0 without gapping: 39
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 202599
Number of HSP's gapped (non-prelim): 54
length of query: 122
length of database: 8,064,228,071
effective HSP length: 89
effective length of query: 33
effective length of database: 5,976,006,030
effective search space: 197208198990
effective search space used: 197208198990
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 69 (31.2 bits)