BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 033298
         (122 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224129182|ref|XP_002320521.1| predicted protein [Populus trichocarpa]
 gi|222861294|gb|EEE98836.1| predicted protein [Populus trichocarpa]
          Length = 118

 Score =  160 bits (405), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 85/124 (68%), Positives = 101/124 (81%), Gaps = 8/124 (6%)

Query: 1   MAARVALYSSLSGK--ASTWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEV 58
           MAA+V L SSL+      + PSS+S   FT    LSSP K R H  FK++ANLG  DGE+
Sbjct: 1   MAAKVVLSSSLNTHFPVPSKPSSFSSVVFT----LSSPKK-RVH-HFKIHANLGEGDGEL 54

Query: 59  KPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIK 118
           KPKGK+KF+TRE+EP+++WQ+AGEREGENPMKTP+PYIIIFGMSTPFVILAIAFANGWIK
Sbjct: 55  KPKGKKKFITREQEPEQYWQSAGEREGENPMKTPLPYIIIFGMSTPFVILAIAFANGWIK 114

Query: 119 MPVR 122
           +PVR
Sbjct: 115 VPVR 118


>gi|255578705|ref|XP_002530211.1| conserved hypothetical protein [Ricinus communis]
 gi|223530258|gb|EEF32158.1| conserved hypothetical protein [Ricinus communis]
          Length = 117

 Score =  151 bits (381), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 77/122 (63%), Positives = 96/122 (78%), Gaps = 5/122 (4%)

Query: 1   MAARVALYSSLSGKASTWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKP 60
           MAA+ AL SSL  +    P+    +P +  +S    P+ R   +FK+ A +GG DGE+KP
Sbjct: 1   MAAKPALSSSLISQIPVLPNFSPSSPLSCTFST---PRKRVQ-KFKINA-IGGGDGELKP 55

Query: 61  KGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKMP 120
           KGK+KF+TRE+EP+++WQTAGEREGENPMKTP+PYIIIFGMSTPFVILAIAFANGWIK+P
Sbjct: 56  KGKKKFITREQEPEQYWQTAGEREGENPMKTPLPYIIIFGMSTPFVILAIAFANGWIKVP 115

Query: 121 VR 122
           VR
Sbjct: 116 VR 117


>gi|21593859|gb|AAM65826.1| unknown [Arabidopsis thaliana]
          Length = 125

 Score =  150 bits (380), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 67/88 (76%), Positives = 79/88 (89%)

Query: 35  SPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIP 94
            PP P    QFK++A LGGRDGEVKPK K+KF+T+EEEP+++WQ+ GEREGENPMKTP+P
Sbjct: 38  CPPSPLGFPQFKLHAKLGGRDGEVKPKDKKKFITKEEEPEQYWQSVGEREGENPMKTPLP 97

Query: 95  YIIIFGMSTPFVILAIAFANGWIKMPVR 122
           YIIIFGMSTPFVILAIAFANGWIK+P+R
Sbjct: 98  YIIIFGMSTPFVILAIAFANGWIKVPIR 125


>gi|225447539|ref|XP_002268250.1| PREDICTED: uncharacterized protein LOC100264834 [Vitis vinifera]
 gi|296085028|emb|CBI28443.3| unnamed protein product [Vitis vinifera]
          Length = 118

 Score =  150 bits (380), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 77/122 (63%), Positives = 92/122 (75%), Gaps = 4/122 (3%)

Query: 1   MAARVALYSSLSGKASTWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKP 60
           MAA+VAL S    +      S S +  T      + PK RAH  F ++A LGG DGE+K 
Sbjct: 1   MAAKVALSSCFCTRLPLPSKSSSPSSLTHT---PTSPKKRAH-HFSIHAKLGGGDGEIKQ 56

Query: 61  KGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKMP 120
            GK+KF+TREEEP+++WQTAGEREGENPMKTP+PYIIIFGMSTPFVILAIAFANGW+K+P
Sbjct: 57  GGKKKFITREEEPEQYWQTAGEREGENPMKTPLPYIIIFGMSTPFVILAIAFANGWVKVP 116

Query: 121 VR 122
           VR
Sbjct: 117 VR 118


>gi|388514561|gb|AFK45342.1| unknown [Lotus japonicus]
          Length = 119

 Score =  147 bits (372), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 77/122 (63%), Positives = 92/122 (75%), Gaps = 3/122 (2%)

Query: 1   MAARVALYSSLSGKASTWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKP 60
           MA RV L S L+ K S  P S S +P +  +SL SP K  A    +V+A +GG D + K 
Sbjct: 1   MATRVTLSSPLAFKTSLLPKSPSLSPVS--FSLGSP-KTNAVVGVRVHAKIGGSDEDTKK 57

Query: 61  KGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKMP 120
             K+KF+TR+EEP ++WQTAGEREGENPMKTP+PYIIIFGMSTPFVILAIAFANGWIK+P
Sbjct: 58  GEKKKFITRDEEPQQYWQTAGEREGENPMKTPLPYIIIFGMSTPFVILAIAFANGWIKVP 117

Query: 121 VR 122
           VR
Sbjct: 118 VR 119


>gi|18396025|ref|NP_565324.1| uncharacterized protein [Arabidopsis thaliana]
 gi|4755200|gb|AAD29067.1| expressed protein [Arabidopsis thaliana]
 gi|330250821|gb|AEC05915.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 125

 Score =  146 bits (369), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 65/88 (73%), Positives = 78/88 (88%)

Query: 35  SPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIP 94
            PP P    QFK++A LGG DGEVKPK K+KF+T++EEP+++WQ+ GEREGENPMKTP+P
Sbjct: 38  CPPSPLGFPQFKLHAKLGGGDGEVKPKDKKKFITKDEEPEQYWQSVGEREGENPMKTPLP 97

Query: 95  YIIIFGMSTPFVILAIAFANGWIKMPVR 122
           YIIIFGMSTPFVILAIAFANGWIK+P+R
Sbjct: 98  YIIIFGMSTPFVILAIAFANGWIKVPIR 125


>gi|356501745|ref|XP_003519684.1| PREDICTED: uncharacterized protein LOC100805720 [Glycine max]
          Length = 115

 Score =  145 bits (366), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 79/122 (64%), Positives = 91/122 (74%), Gaps = 7/122 (5%)

Query: 1   MAARVALYSSLSGKASTWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKP 60
           MAA++ L S LS K S  P S S +P      L SP +    G  KV+A LGG D + K 
Sbjct: 1   MAAKLTLSSPLSFKTSFLPKSPSFSP-----GLCSP-RTNVTG-VKVHAKLGGGDEQAKN 53

Query: 61  KGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKMP 120
            GK+KF+TREEEP ++WQTAGEREGENPMKTP+PYIIIFGMSTPFVILAIAFANGWIK+P
Sbjct: 54  GGKKKFITREEEPQQYWQTAGEREGENPMKTPLPYIIIFGMSTPFVILAIAFANGWIKVP 113

Query: 121 VR 122
           VR
Sbjct: 114 VR 115


>gi|15236299|ref|NP_193086.1| uncharacterized protein [Arabidopsis thaliana]
 gi|4678376|emb|CAB41108.1| putative protein [Arabidopsis thaliana]
 gi|7268053|emb|CAB78392.1| putative protein [Arabidopsis thaliana]
 gi|109946535|gb|ABG48446.1| At4g13500 [Arabidopsis thaliana]
 gi|332657886|gb|AEE83286.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 125

 Score =  145 bits (365), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 65/88 (73%), Positives = 78/88 (88%)

Query: 35  SPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIP 94
            PP P    +FK++A LGG DGEVKPK K+KF+T+EEEP+++WQ+ GEREGENPMKTP+P
Sbjct: 38  CPPSPLGFPEFKLHAKLGGGDGEVKPKDKKKFITKEEEPEQYWQSVGEREGENPMKTPLP 97

Query: 95  YIIIFGMSTPFVILAIAFANGWIKMPVR 122
           YIIIFGMSTPFVILAIAFANGWIK+P+R
Sbjct: 98  YIIIFGMSTPFVILAIAFANGWIKVPIR 125


>gi|27754732|gb|AAO22809.1| unknown protein [Arabidopsis thaliana]
          Length = 119

 Score =  144 bits (364), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 65/88 (73%), Positives = 78/88 (88%)

Query: 35  SPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIP 94
            PP P    +FK++A LGG DGEVKPK K+KF+T+EEEP+++WQ+ GEREGENPMKTP+P
Sbjct: 32  CPPSPLGFPEFKLHAKLGGGDGEVKPKDKKKFITKEEEPEQYWQSVGEREGENPMKTPLP 91

Query: 95  YIIIFGMSTPFVILAIAFANGWIKMPVR 122
           YIIIFGMSTPFVILAIAFANGWIK+P+R
Sbjct: 92  YIIIFGMSTPFVILAIAFANGWIKVPIR 119


>gi|297835826|ref|XP_002885795.1| hypothetical protein ARALYDRAFT_480141 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331635|gb|EFH62054.1| hypothetical protein ARALYDRAFT_480141 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 125

 Score =  144 bits (364), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 65/88 (73%), Positives = 77/88 (87%)

Query: 35  SPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIP 94
            P  P    QFK++A LGG DGEVKPK K+KF+T+EEEP+++WQ+ GEREGENPMKTP+P
Sbjct: 38  CPQSPLRFPQFKLHAKLGGGDGEVKPKDKKKFITKEEEPEQYWQSVGEREGENPMKTPLP 97

Query: 95  YIIIFGMSTPFVILAIAFANGWIKMPVR 122
           YIIIFGMSTPFVILAIAFANGWIK+P+R
Sbjct: 98  YIIIFGMSTPFVILAIAFANGWIKVPIR 125


>gi|21618132|gb|AAM67182.1| unknown [Arabidopsis thaliana]
          Length = 125

 Score =  143 bits (360), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 64/88 (72%), Positives = 78/88 (88%)

Query: 35  SPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIP 94
            PP P    +FK++A LGG DGEVKPK K+KF+T+EEEP+++WQ+ GEREGENPMKTP+P
Sbjct: 38  CPPSPLGFPKFKLHAKLGGGDGEVKPKDKKKFITKEEEPEQYWQSVGEREGENPMKTPLP 97

Query: 95  YIIIFGMSTPFVILAIAFANGWIKMPVR 122
           YIIIFG+STPFVILAIAFANGWIK+P+R
Sbjct: 98  YIIIFGISTPFVILAIAFANGWIKVPIR 125


>gi|356554217|ref|XP_003545445.1| PREDICTED: uncharacterized protein LOC100810445 [Glycine max]
          Length = 126

 Score =  141 bits (355), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 77/123 (62%), Positives = 89/123 (72%), Gaps = 9/123 (7%)

Query: 1   MAARVALYSSLSGKASTWPSSWSHNPFTQQYSLS-SPPKPRAHGQFKVYANLGGRDGEVK 59
           MAA++ L S  S K S  P S S       +SL    P+    G  KV+A LGG D + K
Sbjct: 12  MAAKLTLSSPFSFKTSFLPKSPS-------FSLGLYSPRTNVTG-VKVHAKLGGGDEQAK 63

Query: 60  PKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKM 119
             GK+KF+TREEEP ++WQTAGEREGENPMKTP+PYIIIFGMSTPFVILAIAFANGWIK+
Sbjct: 64  KGGKKKFITREEEPQQYWQTAGEREGENPMKTPLPYIIIFGMSTPFVILAIAFANGWIKV 123

Query: 120 PVR 122
           PVR
Sbjct: 124 PVR 126


>gi|102140020|gb|ABF70151.1| hypothetical protein MA4_112I10.24 [Musa acuminata]
          Length = 113

 Score =  139 bits (349), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 63/77 (81%), Positives = 73/77 (94%), Gaps = 1/77 (1%)

Query: 46  KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPF 105
           ++YANLGG DGE KP  K+KF+TREEEPD++WQTAGER+GENPMKTP+PYIIIFGMSTPF
Sbjct: 38  RIYANLGGADGEAKP-AKKKFITREEEPDQYWQTAGERKGENPMKTPLPYIIIFGMSTPF 96

Query: 106 VILAIAFANGWIKMPVR 122
           VILAIAFANGWIKMP++
Sbjct: 97  VILAIAFANGWIKMPMK 113


>gi|242033753|ref|XP_002464271.1| hypothetical protein SORBIDRAFT_01g015270 [Sorghum bicolor]
 gi|241918125|gb|EER91269.1| hypothetical protein SORBIDRAFT_01g015270 [Sorghum bicolor]
          Length = 132

 Score =  137 bits (344), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 62/77 (80%), Positives = 72/77 (93%), Gaps = 1/77 (1%)

Query: 46  KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPF 105
           ++ A LGG DGE+KP GK+KF+TREEEP+++WQTAGEREGENPM TP+PYIIIFGMSTPF
Sbjct: 57  RIIAKLGG-DGELKPPGKKKFITREEEPEQYWQTAGEREGENPMMTPLPYIIIFGMSTPF 115

Query: 106 VILAIAFANGWIKMPVR 122
           VILAIAFANGWIK+PVR
Sbjct: 116 VILAIAFANGWIKVPVR 132


>gi|414871661|tpg|DAA50218.1| TPA: hypothetical protein ZEAMMB73_766061 [Zea mays]
          Length = 125

 Score =  134 bits (337), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 62/77 (80%), Positives = 71/77 (92%), Gaps = 2/77 (2%)

Query: 46  KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPF 105
           +V A LG  DGE+KP GKRKF+T+EEEP+++WQTAGEREGENPM TP+PYIIIFGMSTPF
Sbjct: 51  RVTAKLG--DGELKPAGKRKFITKEEEPEQYWQTAGEREGENPMMTPLPYIIIFGMSTPF 108

Query: 106 VILAIAFANGWIKMPVR 122
           VILAIAFANGWIK+PVR
Sbjct: 109 VILAIAFANGWIKVPVR 125


>gi|226501594|ref|NP_001142893.1| uncharacterized protein LOC100275314 [Zea mays]
 gi|195611082|gb|ACG27371.1| hypothetical protein [Zea mays]
 gi|195648823|gb|ACG43879.1| hypothetical protein [Zea mays]
          Length = 119

 Score =  134 bits (337), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 62/77 (80%), Positives = 71/77 (92%), Gaps = 2/77 (2%)

Query: 46  KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPF 105
           +V A LG  DGE+KP GKRKF+T+EEEP+++WQTAGEREGENPM TP+PYIIIFGMSTPF
Sbjct: 45  RVTAKLG--DGELKPAGKRKFITKEEEPEQYWQTAGEREGENPMMTPLPYIIIFGMSTPF 102

Query: 106 VILAIAFANGWIKMPVR 122
           VILAIAFANGWIK+PVR
Sbjct: 103 VILAIAFANGWIKVPVR 119


>gi|357121221|ref|XP_003562319.1| PREDICTED: uncharacterized protein LOC100841706 [Brachypodium
           distachyon]
          Length = 119

 Score =  133 bits (335), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 61/95 (64%), Positives = 76/95 (80%), Gaps = 4/95 (4%)

Query: 32  SLSSPPKPRAHGQF----KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGEN 87
           +L++ P P   G      ++ A LGG DGE K  GK+KF+T+E+EP+++WQTAGER+GEN
Sbjct: 25  ALATLPLPSKKGTISRRSRISAQLGGGDGETKSDGKKKFITKEQEPEQYWQTAGERKGEN 84

Query: 88  PMKTPIPYIIIFGMSTPFVILAIAFANGWIKMPVR 122
           PM TP+PYIIIFGMSTPFVILAIAFANGWIK P+R
Sbjct: 85  PMMTPLPYIIIFGMSTPFVILAIAFANGWIKAPIR 119


>gi|115453991|ref|NP_001050596.1| Os03g0595300 [Oryza sativa Japonica Group]
 gi|28269464|gb|AAO38007.1| unknown protein [Oryza sativa Japonica Group]
 gi|108709639|gb|ABF97434.1| expressed protein [Oryza sativa Japonica Group]
 gi|113549067|dbj|BAF12510.1| Os03g0595300 [Oryza sativa Japonica Group]
 gi|215686889|dbj|BAG89739.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218193251|gb|EEC75678.1| hypothetical protein OsI_12483 [Oryza sativa Indica Group]
 gi|222625313|gb|EEE59445.1| hypothetical protein OsJ_11626 [Oryza sativa Japonica Group]
          Length = 118

 Score =  132 bits (333), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 63/78 (80%), Positives = 73/78 (93%), Gaps = 2/78 (2%)

Query: 46  KVYANLGGRDGEVKPKG-KRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTP 104
           ++ ANLGG DGEVKP G K+KF+TREEEP+++WQ AGER+GENPMKTP+PYIIIFGMSTP
Sbjct: 42  RISANLGG-DGEVKPAGNKKKFITREEEPEQYWQAAGERKGENPMKTPLPYIIIFGMSTP 100

Query: 105 FVILAIAFANGWIKMPVR 122
           FVILAIAFANGWIK+PVR
Sbjct: 101 FVILAIAFANGWIKVPVR 118


>gi|449444478|ref|XP_004140001.1| PREDICTED: uncharacterized protein LOC101219556 [Cucumis sativus]
 gi|449475630|ref|XP_004154506.1| PREDICTED: uncharacterized protein LOC101231513 [Cucumis sativus]
          Length = 123

 Score =  132 bits (332), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 63/85 (74%), Positives = 76/85 (89%)

Query: 38  KPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYII 97
           +P    +FK+ ANLGG D E+K  GK+KF+T+EEEP+++WQTAGEREGENPMKTP+PYII
Sbjct: 39  RPTPSSRFKLLANLGGGDAEIKKGGKKKFITKEEEPEQYWQTAGEREGENPMKTPLPYII 98

Query: 98  IFGMSTPFVILAIAFANGWIKMPVR 122
           IFGMSTPFVILAIAFANGW+K+PVR
Sbjct: 99  IFGMSTPFVILAIAFANGWVKVPVR 123


>gi|357495537|ref|XP_003618057.1| hypothetical protein MTR_5g098530 [Medicago truncatula]
 gi|355519392|gb|AET01016.1| hypothetical protein MTR_5g098530 [Medicago truncatula]
          Length = 118

 Score =  120 bits (301), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/92 (65%), Positives = 75/92 (81%), Gaps = 2/92 (2%)

Query: 31  YSLSSPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMK 90
           ++++SP   R     KV A LGGRD E K   K+KF+TRE+EP+++WQ+AGEREGENPM 
Sbjct: 29  HTIASPK--RISLGVKVCAQLGGRDEETKKGEKKKFITREQEPEQYWQSAGEREGENPMM 86

Query: 91  TPIPYIIIFGMSTPFVILAIAFANGWIKMPVR 122
           TP+PYII+FGMSTPFVIL IAFANGWIK+P+R
Sbjct: 87  TPLPYIILFGMSTPFVILGIAFANGWIKVPIR 118


>gi|326507980|dbj|BAJ86733.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525657|dbj|BAJ88875.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 120

 Score =  108 bits (270), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 48/62 (77%), Positives = 57/62 (91%), Gaps = 1/62 (1%)

Query: 57  EVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGW 116
           E KP GK KF+TRE+EP+++WQTAGER+GENPM TP+PYI+IFG STPF+ILAIAFANGW
Sbjct: 55  ETKPDGK-KFITREQEPEQYWQTAGERKGENPMMTPLPYIVIFGFSTPFIILAIAFANGW 113

Query: 117 IK 118
           IK
Sbjct: 114 IK 115


>gi|302756923|ref|XP_002961885.1| hypothetical protein SELMODRAFT_69660 [Selaginella moellendorffii]
 gi|302817145|ref|XP_002990249.1| hypothetical protein SELMODRAFT_49680 [Selaginella moellendorffii]
 gi|300141958|gb|EFJ08664.1| hypothetical protein SELMODRAFT_49680 [Selaginella moellendorffii]
 gi|300170544|gb|EFJ37145.1| hypothetical protein SELMODRAFT_69660 [Selaginella moellendorffii]
          Length = 70

 Score =  106 bits (264), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 48/70 (68%), Positives = 57/70 (81%)

Query: 53  GRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAF 112
           G DG    K KRKF+T+EEEP+E+WQ+A EREG  PM T +PYI+IFG STPFVILA+ F
Sbjct: 1   GGDGGSSSKEKRKFITKEEEPEEYWQSAAEREGRGPMSTLLPYIVIFGFSTPFVILALGF 60

Query: 113 ANGWIKMPVR 122
           ANGWIK+PVR
Sbjct: 61  ANGWIKIPVR 70


>gi|148907351|gb|ABR16811.1| unknown [Picea sitchensis]
 gi|148907976|gb|ABR17108.1| unknown [Picea sitchensis]
          Length = 154

 Score =  103 bits (256), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 42/64 (65%), Positives = 55/64 (85%)

Query: 59  KPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIK 118
           K   K+KF+T+E+EP+++WQTA ER+G NPM TP+PYI+IFG  TPF+IL +AFANGWIK
Sbjct: 91  KDTDKKKFITKEQEPEQYWQTAAERDGVNPMLTPLPYIVIFGFLTPFIILGVAFANGWIK 150

Query: 119 MPVR 122
           +PVR
Sbjct: 151 VPVR 154


>gi|116782086|gb|ABK22363.1| unknown [Picea sitchensis]
          Length = 154

 Score =  103 bits (256), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 42/64 (65%), Positives = 55/64 (85%)

Query: 59  KPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIK 118
           K   K+KF+T+E+EP+++WQTA ER+G NPM TP+PYI+IFG  TPF+IL +AFANGWIK
Sbjct: 91  KDTDKKKFITKEQEPEQYWQTAAERDGVNPMLTPLPYIVIFGFLTPFIILGVAFANGWIK 150

Query: 119 MPVR 122
           +PVR
Sbjct: 151 VPVR 154


>gi|168042685|ref|XP_001773818.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674933|gb|EDQ61435.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 69

 Score = 87.4 bits (215), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 37/70 (52%), Positives = 55/70 (78%), Gaps = 2/70 (2%)

Query: 51  LGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAI 110
           +GG   + KP+  +KF+T+E+EPD++W TA EREG+NPM+T +PYI +  + +PF ILA+
Sbjct: 1   VGGGADDKKPE--KKFITKEQEPDQYWTTAAEREGKNPMQTLLPYIALLTILSPFAILAV 58

Query: 111 AFANGWIKMP 120
           A+AN WIK+P
Sbjct: 59  AYANNWIKLP 68


>gi|302837085|ref|XP_002950102.1| hypothetical protein VOLCADRAFT_104607 [Volvox carteri f.
           nagariensis]
 gi|300264575|gb|EFJ48770.1| hypothetical protein VOLCADRAFT_104607 [Volvox carteri f.
           nagariensis]
          Length = 111

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 37/90 (41%), Positives = 56/90 (62%), Gaps = 4/90 (4%)

Query: 33  LSSPPKPRAHGQFKVYANLGGRDGEVKPKG-KRKFVTREEEPDEFWQTAGEREGENPMKT 91
           +S P + R H   +    L G DG+    G K+KF+TREEEP+++W + GE++G NP+K 
Sbjct: 23  VSQPARSRRHLCTRA---LLGPDGKPGADGAKKKFITREEEPEQYWSSKGEQDGANPLKD 79

Query: 92  PIPYIIIFGMSTPFVILAIAFANGWIKMPV 121
           P+  I I  +  PF+ LA+A A G+I + V
Sbjct: 80  PLAIIGILAIFFPFIFLAVAVAAGYIDLSV 109


>gi|159474878|ref|XP_001695550.1| hypothetical protein CHLREDRAFT_163712 [Chlamydomonas reinhardtii]
 gi|158275561|gb|EDP01337.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 110

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 58/103 (56%), Gaps = 2/103 (1%)

Query: 19  PSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQ 78
           P+S S     Q   ++  P+  A    +V A LG  DG+      +KF++RE+EP+++W 
Sbjct: 8   PASSSVARRLQTSQVAVAPRV-ARRSLQVRALLG-PDGKPGTTSGKKFISREQEPEQYWS 65

Query: 79  TAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKMPV 121
           + GE EG NPMK P+  I +  +  PF+ L IA A G+I + V
Sbjct: 66  SKGEAEGGNPMKDPLAIIGVLAIFFPFIFLGIAVAAGYIDLSV 108


>gi|74272653|gb|ABA01122.1| unknown [Chlamydomonas incerta]
          Length = 108

 Score = 67.0 bits (162), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 30/71 (42%), Positives = 45/71 (63%)

Query: 51  LGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAI 110
           L G DG+      +KF++RE+EP+++W + GE EG NPMK P+  I +  +  PF+ L I
Sbjct: 36  LLGPDGKPGTTSGKKFISREQEPEQYWSSKGEAEGANPMKDPLAIIGVVAILFPFIFLGI 95

Query: 111 AFANGWIKMPV 121
           A A G+I + V
Sbjct: 96  AIAAGFIDVSV 106


>gi|303282113|ref|XP_003060348.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457819|gb|EEH55117.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 116

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 36/55 (65%), Gaps = 1/55 (1%)

Query: 63  KRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWI 117
           K+  + RE E  E W + GEREGENP+K P+    I G+  PF+ILA+A A G+I
Sbjct: 61  KKPAILRENE-KEAWLSKGEREGENPLKDPMAMFAIGGIMVPFIILAVAGAAGYI 114


>gi|145345393|ref|XP_001417197.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577424|gb|ABO95490.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 105

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/83 (45%), Positives = 46/83 (55%), Gaps = 8/83 (9%)

Query: 40  RAHGQFKVYANLGGRDG---EVKPKGKR--KFVTREEEPDEFWQTAGEREGENPMKTPIP 94
           RA G  +V     GRDG        GK   K +TRE E  E W +  EREG NP+K P+ 
Sbjct: 24  RARGVARVTKR--GRDGVRARATQDGKNGPKKLTRENEK-EAWLSEMEREGANPLKDPMA 80

Query: 95  YIIIFGMSTPFVILAIAFANGWI 117
            I I G+  PF+ILAIA A G+I
Sbjct: 81  LIGIGGIMVPFIILAIASAAGFI 103


>gi|308802734|ref|XP_003078680.1| unnamed protein product [Ostreococcus tauri]
 gi|116057133|emb|CAL51560.1| unnamed protein product [Ostreococcus tauri]
          Length = 98

 Score = 55.1 bits (131), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 29/51 (56%), Positives = 34/51 (66%), Gaps = 1/51 (1%)

Query: 67  VTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWI 117
           +TRE E  E W +  EREG NPMK P+  I I G+  PFVILAIA A G+I
Sbjct: 47  ITRENEK-EAWLSEMEREGANPMKDPMAMIGIGGLMVPFVILAIASAAGFI 96


>gi|307108039|gb|EFN56280.1| hypothetical protein CHLNCDRAFT_145155 [Chlorella variabilis]
          Length = 90

 Score = 53.9 bits (128), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 35/53 (66%)

Query: 67  VTREEEPDEFWQTAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFANGWIKM 119
           + RE+EP+E+W +  ER+G++P + P+  I +  +  PF+ LA+  A GW+ +
Sbjct: 34  LRREDEPEEYWTSQSERDGKSPFQDPLAQIGLLAIFFPFIFLAVGIAVGWVDL 86


>gi|255085676|ref|XP_002505269.1| hypothetical protein MICPUN_62606 [Micromonas sp. RCC299]
 gi|226520538|gb|ACO66527.1| hypothetical protein MICPUN_62606 [Micromonas sp. RCC299]
          Length = 115

 Score = 52.4 bits (124), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 43/78 (55%), Gaps = 11/78 (14%)

Query: 40  RAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQTAGEREGENPMKTPIPYIIIF 99
           RA   F+V A       E KPK     + RE E  E W +  EREG NP K P+  + I 
Sbjct: 47  RASRGFRVVAM-----AEKKPK-----ILRENE-KEAWLSEMEREGANPFKDPMAIVGIL 95

Query: 100 GMSTPFVILAIAFANGWI 117
           G++ PFVILA+A A G+I
Sbjct: 96  GITFPFVILAVAGAAGYI 113


>gi|388503774|gb|AFK39953.1| unknown [Lotus japonicus]
          Length = 72

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 32/75 (42%), Positives = 44/75 (58%), Gaps = 3/75 (4%)

Query: 1  MAARVALYSSLSGKASTWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKP 60
          MA RV L S L+ K S  P S S +P +  +SL SP K  A    +V+  +GG D + K 
Sbjct: 1  MATRVTLSSPLAFKTSLLPKSPSLSPVS--FSLGSP-KTNAVVGVRVHVKIGGSDEDTKK 57

Query: 61 KGKRKFVTREEEPDE 75
            K+KF+TR+EEP +
Sbjct: 58 GEKKKFITRDEEPQQ 72


>gi|298385038|ref|ZP_06994597.1| chitobiase [Bacteroides sp. 1_1_14]
 gi|298262182|gb|EFI05047.1| chitobiase [Bacteroides sp. 1_1_14]
          Length = 381

 Score = 36.6 bits (83), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 43/95 (45%), Gaps = 12/95 (12%)

Query: 27  FTQQYSLSSPPKPRAHGQF--------KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQ 78
           F   Y++S       H Q+         V AN     GE    G ++ +  + + + FW 
Sbjct: 214 FPTYYTVSKEEWDEIHNQYIDVDRTGWGVEANTEELTGEGAVNGHKEALI-DGDLNTFWH 272

Query: 79  TAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFA 113
           +  + EG+NP   P+P+IIIF M     IL+I  A
Sbjct: 273 SQWDGEGKNP---PLPHIIIFDMQQTQNILSIELA 304


>gi|29349540|ref|NP_813043.1| chitobiase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29341449|gb|AAO79237.1| putative chitobiase [Bacteroides thetaiotaomicron VPI-5482]
          Length = 392

 Score = 35.8 bits (81), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 42/95 (44%), Gaps = 12/95 (12%)

Query: 27  FTQQYSLSSPPKPRAHGQF--------KVYANLGGRDGEVKPKGKRKFVTREEEPDEFWQ 78
           F   Y++S       H Q+         V AN     GE    G ++ +  +   + FW 
Sbjct: 225 FPTYYTVSKEEWDEIHNQYIDVDRTGWGVEANTEELTGEGAVNGHKEALI-DGNLNTFWH 283

Query: 79  TAGEREGENPMKTPIPYIIIFGMSTPFVILAIAFA 113
           +  + EG+NP   P+P+IIIF M     IL+I  A
Sbjct: 284 SQWDGEGKNP---PLPHIIIFDMQQTQNILSIELA 315


>gi|261334242|emb|CBH17236.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 1650

 Score = 35.4 bits (80), Expect = 3.8,   Method: Composition-based stats.
 Identities = 29/83 (34%), Positives = 34/83 (40%), Gaps = 16/83 (19%)

Query: 23  SHNPFTQQYSLSSPP---------KPRAHGQFKVYANLGGRDGEV-KPKGKRKFVTREEE 72
           SH+ F      SS P         K   H       N GG D  + +P  KRKF T EE 
Sbjct: 356 SHSQFAGSEQASSSPMFDRSTIPIKEVEHESTLAEFNQGGGDDLISRPHKKRKFETGEES 415

Query: 73  P-----DEFWQT-AGEREGENPM 89
           P      +FWQ   G+R G  PM
Sbjct: 416 PRFRKLPDFWQPRRGDRVGRGPM 438


>gi|453087072|gb|EMF15113.1| Bystin-domain-containing protein [Mycosphaerella populorum
          SO2202]
          Length = 507

 Score = 35.4 bits (80), Expect = 3.9,   Method: Composition-based stats.
 Identities = 21/73 (28%), Positives = 36/73 (49%), Gaps = 2/73 (2%)

Query: 16 STWPSSWSHNPFTQQYSLSSPPKPRAHGQFKVYANLGGRDGEVKPKGKRKFVTREEEPDE 75
          ST      HNP +++YS ++P K +A  + K  +   G+DG V  K  R+ +   +  D 
Sbjct: 9  STARQERRHNPLSEEYSPTTPLKQKAGKKRKSTSEANGQDGYVDSKSSRRILDLGQ--DL 66

Query: 76 FWQTAGEREGENP 88
            +   ER+ + P
Sbjct: 67 VDEDEAERQAKRP 79


>gi|71754973|ref|XP_828401.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70833787|gb|EAN79289.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 1650

 Score = 35.4 bits (80), Expect = 4.3,   Method: Composition-based stats.
 Identities = 29/83 (34%), Positives = 34/83 (40%), Gaps = 16/83 (19%)

Query: 23  SHNPFTQQYSLSSPP---------KPRAHGQFKVYANLGGRDGEV-KPKGKRKFVTREEE 72
           SH+ F      SS P         K   H       N GG D  + +P  KRKF T EE 
Sbjct: 356 SHSQFAGSEQASSSPMFDRSTIPIKEVEHESALAEFNQGGGDDLISRPHKKRKFETGEES 415

Query: 73  P-----DEFWQT-AGEREGENPM 89
           P      +FWQ   G+R G  PM
Sbjct: 416 PRFRKLPDFWQPRRGDRVGRGPM 438


>gi|357495539|ref|XP_003618058.1| hypothetical protein MTR_5g098530 [Medicago truncatula]
 gi|355519393|gb|AET01017.1| hypothetical protein MTR_5g098530 [Medicago truncatula]
          Length = 75

 Score = 34.3 bits (77), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 23/32 (71%)

Query: 46 KVYANLGGRDGEVKPKGKRKFVTREEEPDEFW 77
          KV A LGGRD E K   K+KF+TRE+EP+  +
Sbjct: 42 KVCAQLGGRDEETKKGEKKKFITREQEPEHVF 73


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.133    0.421 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,196,269,222
Number of Sequences: 23463169
Number of extensions: 92450149
Number of successful extensions: 202643
Number of sequences better than 100.0: 54
Number of HSP's better than 100.0 without gapping: 39
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 202599
Number of HSP's gapped (non-prelim): 54
length of query: 122
length of database: 8,064,228,071
effective HSP length: 89
effective length of query: 33
effective length of database: 5,976,006,030
effective search space: 197208198990
effective search space used: 197208198990
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 69 (31.2 bits)