BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 044715
(202 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255554469|ref|XP_002518273.1| conserved hypothetical protein [Ricinus communis]
gi|223542493|gb|EEF44033.1| conserved hypothetical protein [Ricinus communis]
Length = 187
Score = 208 bits (529), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 118/197 (59%), Positives = 137/197 (69%), Gaps = 19/197 (9%)
Query: 1 MKVKALVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRIS 60
MK+KALVA K+KL +PC+KL F+FK+RK FIRSLR R+ RSK P R
Sbjct: 1 MKMKALVAFKSKLVKPCRKLLLTFRFKIRKPVFIRSLRFRA----SRSKTGTRPVRNWFL 56
Query: 61 ALLPS-FRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPITPVLFPSPITPDYVKACRL 119
+L S F P KQ ++M +LAE +SVS+ + LFPSP+TP Y+ A R
Sbjct: 57 TVLHSVFCPAKQSREMNRLAE-LKSVSKAEHEKK----------LFPSPLTPAYITASRT 105
Query: 120 SKRDEV---EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCR 176
+ R+ EDVEDACRSFENYLVEM+VEEGKV DL DVEELLYCWKNLKCPVFIDLVCR
Sbjct: 106 AMRESFMGSEDVEDACRSFENYLVEMIVEEGKVRDLMDVEELLYCWKNLKCPVFIDLVCR 165
Query: 177 FYGELCKDLFFPEDDED 193
FYGELCKDLF PEDD D
Sbjct: 166 FYGELCKDLFLPEDDND 182
>gi|297745775|emb|CBI15831.3| unnamed protein product [Vitis vinifera]
Length = 187
Score = 184 bits (468), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 107/200 (53%), Positives = 143/200 (71%), Gaps = 15/200 (7%)
Query: 1 MKVKALVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRIS 60
MKV+ L+A K+KL PC+KL IFKF+ ++ FF+R+L+ R P ++S+ +K+P + R +
Sbjct: 1 MKVRTLIAFKSKLLNPCRKLQPIFKFRFKRPFFLRALQVRRRP--RQSETRKTPPKNRFT 58
Query: 61 ALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPITPVLFPSPITPDYVKACRLS 120
ALL FR +Q K+M+++ E RS+S+ + + LFPSP+TP +KA
Sbjct: 59 ALLSLFRFQRQEKEMDRV-RELRSISDAGQERR----------LFPSPLTPASMKAGAAG 107
Query: 121 KR--DEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFY 178
KR ED+EDACRSFENYLVEM+VEEGKV DL DVEELLYCWKNLKCPVFIDLVCRFY
Sbjct: 108 KRVASGHEDLEDACRSFENYLVEMIVEEGKVRDLMDVEELLYCWKNLKCPVFIDLVCRFY 167
Query: 179 GELCKDLFFPEDDEDHANLQ 198
GELCKDLF +++++ + Q
Sbjct: 168 GELCKDLFSTDEEDNEESTQ 187
>gi|42570337|ref|NP_850144.2| ovate family protein 17 [Arabidopsis thaliana]
gi|29725750|gb|AAO89198.1| hypothetical protein [Arabidopsis thaliana]
gi|50058905|gb|AAT69197.1| hypothetical protein At2g30395 [Arabidopsis thaliana]
gi|330253287|gb|AEC08381.1| ovate family protein 17 [Arabidopsis thaliana]
Length = 195
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 92/198 (46%), Positives = 118/198 (59%), Gaps = 12/198 (6%)
Query: 1 MKVKA-LVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRI 59
M+VKA L+ K+KL + C + +F+F++++ FIR LR+R R Q S+ I
Sbjct: 1 MRVKATLINFKSKLSKSCNRFVSLFRFRVKRPVFIRPLRARHGNVKPRH--QHHHSKKPI 58
Query: 60 SALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPITPV----LFPSPITPDYVK 115
+ L F + KM S S N + SP+TP LF SPIT Y
Sbjct: 59 CSCL-CFLNSSKNHKMSNAKHRSSSFSVNDDDYSKFMQSPLTPATAKKLFTSPITTPYSS 117
Query: 116 ACR--LSKRDEVED--VEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFI 171
R L+ RD ED VEDACRSFENYL+ ++VEEGK+ DL D+EELL+CWKNLK PVFI
Sbjct: 118 RTRKSLNARDTFEDNAVEDACRSFENYLIHLIVEEGKIDDLMDIEELLFCWKNLKSPVFI 177
Query: 172 DLVCRFYGELCKDLFFPE 189
+LV RFYGELC+DLF E
Sbjct: 178 ELVSRFYGELCRDLFSGE 195
>gi|449463304|ref|XP_004149374.1| PREDICTED: uncharacterized protein LOC101206169 [Cucumis sativus]
Length = 184
Score = 153 bits (387), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 95/203 (46%), Positives = 122/203 (60%), Gaps = 22/203 (10%)
Query: 1 MKVK-ALVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRI 59
MKV+ LV+ K+KL +PC KL +FKF ++K F I+S +R + S+ P R
Sbjct: 1 MKVEVGLVSFKSKLSKPCSKLLHLFKFPMKKPFSIKSFWAR-----RSSRAISKPRRKTW 55
Query: 60 SALLPSFRPPKQWKKMEKLAE--EHRSVSENPECDQALFSSPITPVLFPSPITPDYVKAC 117
S L R ++ KME++ E SV + E + L LFPSP+ K
Sbjct: 56 SWL----RWLRRVGKMERVRELKSSESVGSDNESREKL--------LFPSPLIRRRTKVA 103
Query: 118 RLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRF 177
+ +E E+VEDACRSFENYLVEM++EEGKV DL DVEELLYCW+ LKCPVF+DLV RF
Sbjct: 104 --ASWEEREEVEDACRSFENYLVEMIIEEGKVRDLMDVEELLYCWRKLKCPVFVDLVSRF 161
Query: 178 YGELCKDLFFPEDDEDHANLQTQ 200
YGELCKDLF + N Q +
Sbjct: 162 YGELCKDLFSSHNQAFTPNFQHK 184
>gi|357491457|ref|XP_003616016.1| hypothetical protein MTR_5g075190 [Medicago truncatula]
gi|355517351|gb|AES98974.1| hypothetical protein MTR_5g075190 [Medicago truncatula]
Length = 182
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 114/203 (56%), Gaps = 38/203 (18%)
Query: 1 MKVKALVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQK-------- 52
MKVKAL K+K +PCKKL FK K +K FIR+ R P
Sbjct: 1 MKVKALSIFKSKFLKPCKKLIFFFKLKPKKLIFIRAFRKAKSPKPSSPSKPMSSLLSVFC 60
Query: 53 SPSRTRISALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPITPVLFPSPITPD 112
SP ++R +L SF+ P ++ TP+ FPSP+TP
Sbjct: 61 SPKKSRDYNMLQSFKSPSNMQE--------------------------TPI-FPSPLTPA 93
Query: 113 YVKACRL-SKRDEV--EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPV 169
V+ + + E E+VEDACRSFENYL EM+V EGK DL DVEELLYCWKNLKCPV
Sbjct: 94 CVRKNQTEGSKSEASSEEVEDACRSFENYLAEMIVAEGKTKDLMDVEELLYCWKNLKCPV 153
Query: 170 FIDLVCRFYGELCKDLFFPEDDE 192
F+DLV RFYGELCKDLF P+++E
Sbjct: 154 FVDLVSRFYGELCKDLFSPDNEE 176
>gi|297826449|ref|XP_002881107.1| ATOFP17/OFP17 [Arabidopsis lyrata subsp. lyrata]
gi|297326946|gb|EFH57366.1| ATOFP17/OFP17 [Arabidopsis lyrata subsp. lyrata]
Length = 195
Score = 151 bits (381), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 91/198 (45%), Positives = 118/198 (59%), Gaps = 12/198 (6%)
Query: 1 MKVKA-LVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRI 59
M+VKA L+ K+KL + C + +F+F++++ FIR LR+R R + P +
Sbjct: 1 MRVKATLINFKSKLSKSCNRFVSLFRFRVKRPVFIRPLRARHGNVKPRHQ-HHHPKKPIC 59
Query: 60 SALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPITPV----LFPSPITPDYVK 115
S L F + KM K S S N + SP+TP LF SPIT Y
Sbjct: 60 SCLC--FLSTSKNHKMSKTKPRSSSFSVNDDDYSKFMHSPLTPATAKKLFTSPITTPYSI 117
Query: 116 ACR--LSKRDEVED--VEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFI 171
R L+ R+ ED VEDACRSFENYL+ ++VEEGK+ DL D+EELL+CWKNLK PVFI
Sbjct: 118 RTRKSLNSREAFEDNAVEDACRSFENYLIHLIVEEGKIDDLMDIEELLFCWKNLKSPVFI 177
Query: 172 DLVCRFYGELCKDLFFPE 189
+LV RFYGELC+DLF E
Sbjct: 178 ELVSRFYGELCRDLFSGE 195
>gi|449531350|ref|XP_004172649.1| PREDICTED: uncharacterized LOC101206169 [Cucumis sativus]
Length = 184
Score = 150 bits (379), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 94/203 (46%), Positives = 121/203 (59%), Gaps = 22/203 (10%)
Query: 1 MKVK-ALVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRI 59
MKV+ LV+ K+KL +PC KL +FKF ++K I+S +R + S+ P R
Sbjct: 1 MKVEVGLVSFKSKLSKPCSKLLHLFKFPMKKPLSIKSFWAR-----RSSRAISKPRRKTW 55
Query: 60 SALLPSFRPPKQWKKMEKLAE--EHRSVSENPECDQALFSSPITPVLFPSPITPDYVKAC 117
S L R ++ KME++ E SV + E + L LFPSP+ K
Sbjct: 56 SWL----RWLRRVGKMERVRELKSSESVDSDNESREKL--------LFPSPLIRRRTKVA 103
Query: 118 RLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRF 177
+ +E E+VEDACRSFENYLVEM++EEGKV DL DVEELLYCW+ LKCPVF+DLV RF
Sbjct: 104 --ASWEEREEVEDACRSFENYLVEMIIEEGKVRDLMDVEELLYCWRKLKCPVFVDLVSRF 161
Query: 178 YGELCKDLFFPEDDEDHANLQTQ 200
YGELCKDLF + N Q +
Sbjct: 162 YGELCKDLFSSHNQAFTPNFQHK 184
>gi|224124752|ref|XP_002319413.1| predicted protein [Populus trichocarpa]
gi|222857789|gb|EEE95336.1| predicted protein [Populus trichocarpa]
Length = 119
Score = 145 bits (367), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 72/102 (70%), Positives = 80/102 (78%), Gaps = 2/102 (1%)
Query: 103 VLFPSPITPDYVKACRLSKRDEV--EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLY 160
+L+PSP+TP Y+KA +KR EDVEDACRSFENYLVEM+VEEGKV DL DVEELLY
Sbjct: 18 MLYPSPLTPAYIKASLATKRQTFGDEDVEDACRSFENYLVEMMVEEGKVRDLADVEELLY 77
Query: 161 CWKNLKCPVFIDLVCRFYGELCKDLFFPEDDEDHANLQTQPK 202
CWKNLKCPVFI LVCRFYGELCKDLF P+ D + PK
Sbjct: 78 CWKNLKCPVFIGLVCRFYGELCKDLFSPDVDNTDVDSPKSPK 119
>gi|224145460|ref|XP_002325650.1| predicted protein [Populus trichocarpa]
gi|222862525|gb|EEF00032.1| predicted protein [Populus trichocarpa]
Length = 160
Score = 135 bits (340), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 89/167 (53%), Positives = 118/167 (70%), Gaps = 18/167 (10%)
Query: 3 VKALVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRI-SA 61
+KAL ++KLF PCKKL +F+FKL++ FIR L+ R + KP+K+P + R+ ++
Sbjct: 1 MKALAVFRSKLFSPCKKLLLLFRFKLKRPVFIRGLQLRR----RSKKPRKAPQKNRVFNS 56
Query: 62 LLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPITPVLFPSPITPDYVKACRLSK 121
LL F P ++ +KM++++E RSVSE PEC++ +LFPSP+TP Y+KA K
Sbjct: 57 LLSVFHPLRKSRKMDRVSE-LRSVSE-PECER---------MLFPSPLTPAYIKASLEKK 105
Query: 122 RDEV--EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLK 166
R EDVEDACRSFENYLVEM+VEEG+V DL DVEELLYCWKNLK
Sbjct: 106 RQTFGDEDVEDACRSFENYLVEMMVEEGQVRDLMDVEELLYCWKNLK 152
>gi|125544696|gb|EAY90835.1| hypothetical protein OsI_12440 [Oryza sativa Indica Group]
Length = 237
Score = 116 bits (290), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 63/104 (60%), Positives = 72/104 (69%), Gaps = 6/104 (5%)
Query: 100 ITPVLFPSPITPDYVK-ACRL-SKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEE 157
+ PV PSP TP YVK RL S+R E+ E CRSFE L+EM++EEGKV DL DVEE
Sbjct: 126 LAPV--PSPETPAYVKMVARLRSRRSGGEEEEGPCRSFEERLMEMLLEEGKVRDLQDVEE 183
Query: 158 LLYCWKNLKCPVFIDLVCRFYGELCKDLFFP--EDDEDHANLQT 199
LL CW+ LK PVF++LVCRFYGELCKDLF P ED D T
Sbjct: 184 LLRCWERLKSPVFVELVCRFYGELCKDLFSPGEEDGGDEGASST 227
>gi|79317098|ref|NP_001030983.1| uncharacterized protein [Arabidopsis thaliana]
gi|98961723|gb|ABF59191.1| unknown protein [Arabidopsis thaliana]
gi|332189933|gb|AEE28054.1| uncharacterized protein [Arabidopsis thaliana]
Length = 170
Score = 106 bits (265), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 52/92 (56%), Positives = 67/92 (72%), Gaps = 4/92 (4%)
Query: 104 LFPSPITPDYVKACRLSKRDEVED--VEDACRSFENYLVEMVVEEGKVGDLTDVEELLYC 161
+FP +T KA R S+R +++ V++ACR FENY++ M VE+GK+ DL D+EELLY
Sbjct: 81 MFPLAMTQRNKKA-RTSQRHMLKEDGVKEACRRFENYMIHMFVEDGKIDDLMDMEELLYY 139
Query: 162 WKNLKCPVFIDLVCRFYGELCKDLFFPEDDED 193
WKNL P+FIDLV RFYGELC DL FP D +D
Sbjct: 140 WKNLNSPIFIDLVTRFYGELCTDL-FPSDHDD 170
>gi|297843486|ref|XP_002889624.1| hypothetical protein ARALYDRAFT_333988 [Arabidopsis lyrata subsp.
lyrata]
gi|297335466|gb|EFH65883.1| hypothetical protein ARALYDRAFT_333988 [Arabidopsis lyrata subsp.
lyrata]
Length = 171
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/91 (54%), Positives = 68/91 (74%), Gaps = 4/91 (4%)
Query: 104 LFPSPITPDYVKACRLSKRDEVED--VEDACRSFENYLVEMVVEEGKVGDLTDVEELLYC 161
+FP +T KA R S+R +++ V++AC +FENY++ M+VE+GK+ DL D+EELLY
Sbjct: 82 MFPLAVTQRNKKA-RTSQRHMLKEDGVKEACTNFENYMIHMIVEDGKLDDLMDMEELLYY 140
Query: 162 WKNLKCPVFIDLVCRFYGELCKDLFFPEDDE 192
WKNL P+FIDLV RFYGELC DL FP D++
Sbjct: 141 WKNLNSPIFIDLVTRFYGELCTDL-FPSDND 170
>gi|55168120|gb|AAV43987.1| hypothetical protein [Oryza sativa Japonica Group]
gi|55168196|gb|AAV44062.1| hypothetical protein [Oryza sativa Japonica Group]
gi|222631149|gb|EEE63281.1| hypothetical protein OsJ_18091 [Oryza sativa Japonica Group]
Length = 237
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/98 (61%), Positives = 68/98 (69%), Gaps = 4/98 (4%)
Query: 106 PSPITPDYVK-ACRL-SKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWK 163
PSP TP YVK RL S+R E+ E CRSFE L+EM++EEGKV D DVEELL CW+
Sbjct: 130 PSPETPAYVKMVARLRSRRRRGEEEEGPCRSFEERLMEMLLEEGKVRDQQDVEELLRCWE 189
Query: 164 NLKCPVFIDLVCRFYGELCKDLFFP--EDDEDHANLQT 199
LK PVF++LVCRFYGELCKDLF P ED D T
Sbjct: 190 RLKSPVFVELVCRFYGELCKDLFSPGEEDGGDEGASST 227
>gi|242051288|ref|XP_002463388.1| hypothetical protein SORBIDRAFT_02g042940 [Sorghum bicolor]
gi|241926765|gb|EER99909.1| hypothetical protein SORBIDRAFT_02g042940 [Sorghum bicolor]
Length = 222
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 47/67 (70%), Positives = 52/67 (77%)
Query: 127 DVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
DVE ACRSFE +LVEM+VEE KV DLTDVEELL CW+ L+CP F+ LV RFYGELC DLF
Sbjct: 142 DVEAACRSFERHLVEMLVEERKVMDLTDVEELLCCWEQLRCPAFVRLVGRFYGELCMDLF 201
Query: 187 FPEDDED 193
D D
Sbjct: 202 SSAQDAD 208
>gi|242087471|ref|XP_002439568.1| hypothetical protein SORBIDRAFT_09g013380 [Sorghum bicolor]
gi|241944853|gb|EES17998.1| hypothetical protein SORBIDRAFT_09g013380 [Sorghum bicolor]
Length = 230
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/105 (51%), Positives = 68/105 (64%), Gaps = 13/105 (12%)
Query: 101 TPVLFP--SPITPDYVK----------ACRLSKRDEVEDVEDACRSFENYLVEMVVEEGK 148
TPV+ P SP TP Y+K A ++++ + E+ACRSFE L+EM+VEEGK
Sbjct: 109 TPVVAPVSSPETPAYMKVVARLRLERSAASAGRQEDGDKEEEACRSFETCLMEMLVEEGK 168
Query: 149 VGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF-FPEDDE 192
DL DVEELL CW+ LK PVF++LVCRFY LC +F P DDE
Sbjct: 169 TRDLQDVEELLRCWERLKSPVFVELVCRFYDGLCNGMFPAPGDDE 213
>gi|414888108|tpg|DAA64122.1| TPA: hypothetical protein ZEAMMB73_869574 [Zea mays]
Length = 217
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/76 (59%), Positives = 54/76 (71%)
Query: 118 RLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRF 177
R+ +DVE ACRSFE +L+E +VEE +V DLTDVEELL CW+ L+CP + LV RF
Sbjct: 131 RVGHGHGHDDVEAACRSFERHLMETLVEEREVVDLTDVEELLCCWEKLRCPALVRLVGRF 190
Query: 178 YGELCKDLFFPEDDED 193
YGELC DLF P D D
Sbjct: 191 YGELCMDLFSPAQDAD 206
>gi|413949384|gb|AFW82033.1| hypothetical protein ZEAMMB73_537791 [Zea mays]
Length = 216
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 53/104 (50%), Positives = 67/104 (64%), Gaps = 18/104 (17%)
Query: 101 TPVLFPSPI----TPDYVKAC------RLSKRDE--------VEDVEDACRSFENYLVEM 142
TPV+ +P+ TP Y+K R++ R E+ E+ACRSFE+YL+EM
Sbjct: 109 TPVIAVAPVSSTETPAYMKVVARLRSERIAARGGGEDEDKEKEEEEEEACRSFESYLMEM 168
Query: 143 VVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
+VEEGK DL DVEELL CW+ L+ PVFI+LV FYGELC DLF
Sbjct: 169 LVEEGKARDLKDVEELLQCWERLRSPVFIELVWSFYGELCNDLF 212
>gi|33146438|dbj|BAC79546.1| hypothetical protein [Oryza sativa Japonica Group]
gi|50509520|dbj|BAD31213.1| hypothetical protein [Oryza sativa Japonica Group]
gi|125559611|gb|EAZ05147.1| hypothetical protein OsI_27341 [Oryza sativa Indica Group]
Length = 209
Score = 92.4 bits (228), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 43/65 (66%), Positives = 51/65 (78%)
Query: 126 EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDL 185
+ VE+ACRSFE +L+EM+VEE KV DL DVEELL CW+ L+ PVF+ LV RFYGELC DL
Sbjct: 133 DGVEEACRSFEKHLMEMLVEERKVRDLMDVEELLCCWEKLRSPVFVQLVGRFYGELCMDL 192
Query: 186 FFPED 190
F D
Sbjct: 193 FSGRD 197
>gi|125601517|gb|EAZ41093.1| hypothetical protein OsJ_25585 [Oryza sativa Japonica Group]
Length = 209
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/65 (66%), Positives = 51/65 (78%)
Query: 126 EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDL 185
+ VE+ACRSFE +L+EM+VEE KV DL DVEELL CW+ L+ PVF+ LV RFYGELC DL
Sbjct: 133 DGVEEACRSFEKHLMEMLVEERKVRDLMDVEELLCCWEKLRSPVFVQLVGRFYGELCMDL 192
Query: 186 FFPED 190
F D
Sbjct: 193 FSGRD 197
>gi|294464704|gb|ADE77859.1| unknown [Picea sitchensis]
Length = 260
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 39/73 (53%), Positives = 51/73 (69%), Gaps = 5/73 (6%)
Query: 127 DVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
DV++AC+ F L+EMV E+ V DL DVEELLYC++NL CPV+ DLV FY E+C D+F
Sbjct: 181 DVDNACKQFRKCLIEMVAEDKNVRDLMDVEELLYCYQNLTCPVYRDLVSHFYAEICSDIF 240
Query: 187 FPE-----DDEDH 194
P DDE++
Sbjct: 241 LPSASSSFDDEEN 253
>gi|118482746|gb|ABK93291.1| unknown [Populus trichocarpa]
Length = 154
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 39/64 (60%), Positives = 46/64 (71%), Gaps = 2/64 (3%)
Query: 103 VLFPSPITPDYVKACRLSKRDEV--EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLY 160
+L+PSP+TP Y+KA +KR EDVEDACRSFENYLVEM+VEEGKV DL DV+
Sbjct: 18 MLYPSPLTPAYIKASLATKRQTFGDEDVEDACRSFENYLVEMMVEEGKVRDLADVKSFCI 77
Query: 161 CWKN 164
K
Sbjct: 78 AGKT 81
>gi|357468391|ref|XP_003604480.1| ATP-dependent RNA helicase chl1 [Medicago truncatula]
gi|355505535|gb|AES86677.1| ATP-dependent RNA helicase chl1 [Medicago truncatula]
Length = 1129
Score = 43.5 bits (101), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 31/47 (65%)
Query: 143 VVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLFFPE 189
++E G + + D+EE L+ + LK PV++D+V RF+ ++ D FP+
Sbjct: 1063 MMESGDLEHVLDIEEALHYYSRLKSPVYLDIVDRFFIDINSDFTFPQ 1109
>gi|168041447|ref|XP_001773203.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675562|gb|EDQ62056.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 958
Score = 43.1 bits (100), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 33/58 (56%), Gaps = 1/58 (1%)
Query: 130 DACRSFENYLVEMVVEEGKVGDLT-DVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
DA +F+N + M++E V +L DVE+ + + L P +IDL+ F+ +C D F
Sbjct: 865 DAVSAFQNEVARMMLEGADVPELNMDVEQFIAGYMKLNSPFYIDLIEDFFKSVCMDCF 922
>gi|302759769|ref|XP_002963307.1| hypothetical protein SELMODRAFT_405091 [Selaginella moellendorffii]
gi|300168575|gb|EFJ35178.1| hypothetical protein SELMODRAFT_405091 [Selaginella moellendorffii]
Length = 622
Score = 42.0 bits (97), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 3/54 (5%)
Query: 139 LVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLFFPEDDE 192
+VEMVV + DL D+EE L + L CPVF +V RF+ E C + F ED+E
Sbjct: 539 VVEMVVAD-SFDDLVDLEEFLGYYMLLNCPVFRAMVQRFFSEFCVE--FCEDEE 589
>gi|168008445|ref|XP_001756917.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691788|gb|EDQ78148.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 936
Score = 42.0 bits (97), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 1/59 (1%)
Query: 129 EDACRSFENYLVEMVVEEGKVGDLT-DVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
DA SF+N + M++E V +L DVE+ + + L P +IDL+ F+ +C D F
Sbjct: 841 NDAVTSFQNEVARMMLEGVDVPNLNMDVEQFIAGYMKLNSPFYIDLIEDFFKSVCIDCF 899
>gi|302759755|ref|XP_002963300.1| hypothetical protein SELMODRAFT_405078 [Selaginella moellendorffii]
gi|300168568|gb|EFJ35171.1| hypothetical protein SELMODRAFT_405078 [Selaginella moellendorffii]
Length = 509
Score = 42.0 bits (97), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 3/54 (5%)
Query: 139 LVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLFFPEDDE 192
+VEMVV + DL D+EE L + L CPVF +V RF+ E C + F ED+E
Sbjct: 426 VVEMVVAD-SFDDLVDLEEFLGYYMLLNCPVFRAMVQRFFSEFCVE--FCEDEE 476
>gi|356527780|ref|XP_003532485.1| PREDICTED: uncharacterized protein LOC100811016 [Glycine max]
Length = 222
Score = 40.8 bits (94), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 35/56 (62%), Gaps = 6/56 (10%)
Query: 137 NYLVEMVVEEGKVGDLTDV------EELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
N +VE + E ++ D++DV EE+L+ + L CPV++ +V +F+ E+C + F
Sbjct: 148 NRMVEKKLRELEMLDMSDVDYVLDIEEVLHYYSRLTCPVYLQIVDKFFMEMCSEFF 203
>gi|168038473|ref|XP_001771725.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677032|gb|EDQ63508.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 909
Score = 40.4 bits (93), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 1/58 (1%)
Query: 130 DACRSFENYLVEMVVEEGKVGDLT-DVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
DA SF++ + M++E DL DVE+ + + L P +IDL+ F+ +C D F
Sbjct: 815 DAVTSFQSEVARMMLEGVDAPDLNMDVEQFIAGYMKLNSPFYIDLIEDFFKSVCIDCF 872
>gi|388513621|gb|AFK44872.1| unknown [Lotus japonicus]
Length = 209
Score = 40.0 bits (92), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 20/70 (28%), Positives = 39/70 (55%), Gaps = 3/70 (4%)
Query: 120 SKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYG 179
SK E V D + + +EM + G V + D+EE+L+ + L PV++D+V +F+
Sbjct: 124 SKNKNEEGVHDLAQKMKE--LEMA-DSGDVEHVLDIEEVLHYYSRLTSPVYLDIVDKFFT 180
Query: 180 ELCKDLFFPE 189
++ ++ P+
Sbjct: 181 DMHSEINLPQ 190
>gi|357468387|ref|XP_003604478.1| hypothetical protein MTR_4g012880 [Medicago truncatula]
gi|355505533|gb|AES86675.1| hypothetical protein MTR_4g012880 [Medicago truncatula]
Length = 147
Score = 39.7 bits (91), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 15/47 (31%), Positives = 30/47 (63%)
Query: 143 VVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLFFPE 189
+++ G + + D+EE L+ + LK PV++D+V RF+ ++ D P+
Sbjct: 17 MMDSGDLEHVLDIEEALHYYSRLKSPVYLDIVNRFFIDINSDFTLPQ 63
>gi|224146631|ref|XP_002326076.1| predicted protein [Populus trichocarpa]
gi|222862951|gb|EEF00458.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 39.3 bits (90), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 130 DACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDL 185
D R F++ +VEM+VE + D D+EELL C+ +L + D + + + ++ D+
Sbjct: 327 DPQRDFKDSMVEMIVENN-IQDSKDLEELLACYLSLNSKKYHDFIIKAFEQIWFDM 381
>gi|356520569|ref|XP_003528934.1| PREDICTED: uncharacterized protein LOC100795198 [Glycine max]
Length = 239
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 34/61 (55%), Gaps = 6/61 (9%)
Query: 128 VEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLFF 187
VE R E +++ V + D+EE+L+ + L CP ++++V +F+ E+ +LF
Sbjct: 166 VEQKLRELE------MLDMNNVEYVLDIEEVLHYYSRLTCPAYLEIVDKFFLEMYSELFG 219
Query: 188 P 188
P
Sbjct: 220 P 220
>gi|449530203|ref|XP_004172085.1| PREDICTED: uncharacterized protein LOC101229726 [Cucumis sativus]
Length = 325
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/61 (27%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 125 VEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKD 184
V+ +D R F +VEM+VE K+ ++E+LL C+ +L + D++ + + ++ D
Sbjct: 258 VKSTKDPQRDFRESMVEMIVE-NKISGSNELEDLLACYLSLNTDEYHDIIVKVFKQIWFD 316
Query: 185 L 185
+
Sbjct: 317 M 317
>gi|449439817|ref|XP_004137682.1| PREDICTED: uncharacterized protein LOC101203459 [Cucumis sativus]
Length = 325
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/61 (27%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 125 VEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKD 184
V+ +D R F +VEM+VE K+ ++E+LL C+ +L + D++ + + ++ D
Sbjct: 258 VKSTKDPQRDFRESMVEMIVE-NKISGSNELEDLLACYLSLNTDEYHDIIVKVFKQIWFD 316
Query: 185 L 185
+
Sbjct: 317 M 317
>gi|168007386|ref|XP_001756389.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692428|gb|EDQ78785.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 960
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 17/59 (28%), Positives = 36/59 (61%), Gaps = 1/59 (1%)
Query: 129 EDACRSFENYLVEMVVEEGKV-GDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
++A F + + +M++EEGK ++ DV+E L + +L+ P ++ +V F+ +C D +
Sbjct: 826 QEAVAVFHDDVAQMMLEEGKYCSEVNDVDEFLEGYMSLRSPFYLAMVDDFFRAVCVDSY 884
>gi|168028431|ref|XP_001766731.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681940|gb|EDQ68362.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 995
Score = 38.5 bits (88), Expect = 1.7, Method: Composition-based stats.
Identities = 18/59 (30%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 129 EDACRSFENYLVEMVVEEGK-VGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
++A F + + M++EEG+ G+ DVEE L + +L+ P ++ +V F+ +C D +
Sbjct: 899 QEATAVFHDEVARMMLEEGQDCGEAHDVEEFLQGYMSLRSPFYLSMVDNFFRAVCVDCY 957
>gi|255546491|ref|XP_002514305.1| conserved hypothetical protein [Ricinus communis]
gi|223546761|gb|EEF48259.1| conserved hypothetical protein [Ricinus communis]
Length = 230
Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/69 (28%), Positives = 38/69 (55%), Gaps = 4/69 (5%)
Query: 120 SKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYG 179
SKRDE A + E +++M V + D+EE+L+ + L CP ++++V +F+
Sbjct: 150 SKRDEERSFLVAQKLKELEMMDM----SNVDHVLDIEEVLHYYSRLTCPAYLEIVDKFFM 205
Query: 180 ELCKDLFFP 188
++ + F P
Sbjct: 206 DMYAEFFGP 214
>gi|168001982|ref|XP_001753693.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695100|gb|EDQ81445.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 796
Score = 37.4 bits (85), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 15/59 (25%), Positives = 35/59 (59%)
Query: 129 EDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLFF 187
++A +F++ + M++EE ++ D+EE L + +LK P ++++V F+ + D +
Sbjct: 736 KEAVAAFQDDIAHMMLEEQDSDEVADIEEFLDGYMHLKSPFYLEVVEEFFRAISYDCYH 794
>gi|336408299|ref|ZP_08588793.1| hypothetical protein HMPREF1018_00808 [Bacteroides sp. 2_1_56FAA]
gi|335939599|gb|EGN01473.1| hypothetical protein HMPREF1018_00808 [Bacteroides sp. 2_1_56FAA]
Length = 1512
Score = 37.4 bits (85), Expect = 4.0, Method: Composition-based stats.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)
Query: 96 FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
F S + P L +T A L K +V++ ++ S + YL + E+G D TD
Sbjct: 1172 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1230
Query: 156 EELLYCWKNLKCPVFIDLVCRF 177
Y W NLK P +D++ F
Sbjct: 1231 P---YAWNNLKVPAHVDVMEAF 1249
>gi|356556188|ref|XP_003546408.1| PREDICTED: uncharacterized protein LOC100802301 [Glycine max]
Length = 217
Score = 37.0 bits (84), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 17/70 (24%), Positives = 39/70 (55%), Gaps = 4/70 (5%)
Query: 116 ACRLSKR---DEVEDVEDACRSFENYLVEM-VVEEGKVGDLTDVEELLYCWKNLKCPVFI 171
+C L K+ + + + + + E+ +++ G V + D+EE L+ + LK PV++
Sbjct: 121 SCELGKKECLNNSKSINEGVHDLTQKMKELEMMDTGDVEHVLDIEEALHYYSRLKSPVYL 180
Query: 172 DLVCRFYGEL 181
D+V +F+ ++
Sbjct: 181 DIVDKFFMDM 190
>gi|423248752|ref|ZP_17229768.1| hypothetical protein HMPREF1066_00778 [Bacteroides fragilis
CL03T00C08]
gi|423253701|ref|ZP_17234632.1| hypothetical protein HMPREF1067_01276 [Bacteroides fragilis
CL03T12C07]
gi|392655330|gb|EIY48973.1| hypothetical protein HMPREF1067_01276 [Bacteroides fragilis
CL03T12C07]
gi|392657693|gb|EIY51324.1| hypothetical protein HMPREF1066_00778 [Bacteroides fragilis
CL03T00C08]
Length = 1926
Score = 37.0 bits (84), Expect = 4.5, Method: Composition-based stats.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)
Query: 96 FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
F S + P L +T A L K +V++ ++ S + YL + E+G D TD
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644
Query: 156 EELLYCWKNLKCPVFIDLVCRF 177
Y W NLK P +D++ F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663
>gi|60680272|ref|YP_210416.1| hypothetical protein BF0713 [Bacteroides fragilis NCTC 9343]
gi|60491706|emb|CAH06460.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 1926
Score = 37.0 bits (84), Expect = 4.6, Method: Composition-based stats.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)
Query: 96 FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
F S + P L +T A L K +V++ ++ S + YL + E+G D TD
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644
Query: 156 EELLYCWKNLKCPVFIDLVCRF 177
Y W NLK P +D++ F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663
>gi|53712079|ref|YP_098071.1| hypothetical protein BF0786 [Bacteroides fragilis YCH46]
gi|52214944|dbj|BAD47537.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
Length = 1926
Score = 37.0 bits (84), Expect = 4.6, Method: Composition-based stats.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)
Query: 96 FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
F S + P L +T A L K +V++ ++ S + YL + E+G D TD
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644
Query: 156 EELLYCWKNLKCPVFIDLVCRF 177
Y W NLK P +D++ F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663
>gi|423269315|ref|ZP_17248287.1| hypothetical protein HMPREF1079_01369 [Bacteroides fragilis
CL05T00C42]
gi|423273122|ref|ZP_17252069.1| hypothetical protein HMPREF1080_00722 [Bacteroides fragilis
CL05T12C13]
gi|392701109|gb|EIY94269.1| hypothetical protein HMPREF1079_01369 [Bacteroides fragilis
CL05T00C42]
gi|392708154|gb|EIZ01262.1| hypothetical protein HMPREF1080_00722 [Bacteroides fragilis
CL05T12C13]
Length = 1926
Score = 37.0 bits (84), Expect = 4.6, Method: Composition-based stats.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)
Query: 96 FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
F S + P L +T A L K +V++ ++ S + YL + E+G D TD
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644
Query: 156 EELLYCWKNLKCPVFIDLVCRF 177
Y W NLK P +D++ F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663
>gi|423258926|ref|ZP_17239849.1| hypothetical protein HMPREF1055_02126 [Bacteroides fragilis
CL07T00C01]
gi|423264103|ref|ZP_17243106.1| hypothetical protein HMPREF1056_00793 [Bacteroides fragilis
CL07T12C05]
gi|387776506|gb|EIK38606.1| hypothetical protein HMPREF1055_02126 [Bacteroides fragilis
CL07T00C01]
gi|392706369|gb|EIY99492.1| hypothetical protein HMPREF1056_00793 [Bacteroides fragilis
CL07T12C05]
Length = 1926
Score = 37.0 bits (84), Expect = 4.6, Method: Composition-based stats.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)
Query: 96 FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
F S + P L +T A L K +V++ ++ S + YL + E+G D TD
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644
Query: 156 EELLYCWKNLKCPVFIDLVCRF 177
Y W NLK P +D++ F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663
>gi|375357118|ref|YP_005109890.1| hypothetical protein BF638R_0756 [Bacteroides fragilis 638R]
gi|301161799|emb|CBW21339.1| conserved hypothetical protein [Bacteroides fragilis 638R]
Length = 1926
Score = 37.0 bits (84), Expect = 4.6, Method: Composition-based stats.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)
Query: 96 FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
F S + P L +T A L K +V++ ++ S + YL + E+G D TD
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644
Query: 156 EELLYCWKNLKCPVFIDLVCRF 177
Y W NLK P +D++ F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663
>gi|383117040|ref|ZP_09937787.1| hypothetical protein BSHG_0859 [Bacteroides sp. 3_2_5]
gi|251947658|gb|EES87940.1| hypothetical protein BSHG_0859 [Bacteroides sp. 3_2_5]
Length = 1926
Score = 37.0 bits (84), Expect = 4.6, Method: Composition-based stats.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)
Query: 96 FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
F S + P L +T A L K +V++ ++ S + YL + E+G D TD
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644
Query: 156 EELLYCWKNLKCPVFIDLVCRF 177
Y W NLK P +D++ F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663
>gi|423282004|ref|ZP_17260889.1| hypothetical protein HMPREF1204_00427 [Bacteroides fragilis HMW 615]
gi|404582491|gb|EKA87185.1| hypothetical protein HMPREF1204_00427 [Bacteroides fragilis HMW 615]
Length = 1926
Score = 37.0 bits (84), Expect = 4.7, Method: Composition-based stats.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)
Query: 96 FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
F S + P L +T A L K +V++ ++ S + YL + E+G D TD
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644
Query: 156 EELLYCWKNLKCPVFIDLVCRF 177
Y W NLK P +D++ F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663
>gi|357521019|ref|XP_003630798.1| hypothetical protein MTR_8g103530 [Medicago truncatula]
gi|355524820|gb|AET05274.1| hypothetical protein MTR_8g103530 [Medicago truncatula]
Length = 203
Score = 37.0 bits (84), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 38/172 (22%), Positives = 72/172 (41%), Gaps = 28/172 (16%)
Query: 34 IRSLRSRSFPTTKRSKPQKSPSRTRISALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQ 93
RSL SRS+ ++ P S + F ++W + + S+ ++
Sbjct: 21 FRSLFSRSYQKIPKTPPHNHSSYNDLEKFYSDF--TEKW-------DSEKGKSKKRRKNK 71
Query: 94 ALFSSPITPV---LFPSPITPDYVKACRLSKRDEVEDVE---------DACRSF------ 135
A+ SS T L + V +++E+E E D+ F
Sbjct: 72 AMLSSSTTKQEEELHVHNHNENMVALNNAKEKNEMEKSEELDGEQCKQDSSSMFNSKSMV 131
Query: 136 ENYLVEM-VVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
E L E+ ++E V + D+EE+L+ + L CP ++++V +F+ E+ + F
Sbjct: 132 EKKLKELEMLEMSNVDYILDIEEVLHYYSRLTCPSYLEIVDKFFVEMYSEFF 183
>gi|404494779|ref|YP_006718885.1| hypothetical protein Pcar_3089 [Pelobacter carbinolicus DSM 2380]
gi|77546762|gb|ABA90324.1| hypothetical protein Pcar_3089 [Pelobacter carbinolicus DSM 2380]
Length = 204
Score = 37.0 bits (84), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 34/71 (47%), Gaps = 1/71 (1%)
Query: 78 LAEEHRSVSENPECDQALFSSPITPVLFPSPITPDYV-KACRLSKRDEVEDVEDACRSFE 136
L E + CDQ LF + LF P++ D K R ++ ++ D E R+
Sbjct: 11 LFSEELGIDLQSSCDQELFKWLLAATLFGGPVSRDIARKTYRTFQKYQLLDPESISRAGW 70
Query: 137 NYLVEMVVEEG 147
+YLV V++EG
Sbjct: 71 SYLVHPVMQEG 81
>gi|440913504|gb|ELR62953.1| ATP-binding cassette sub-family A member 3, partial [Bos grunniens
mutus]
Length = 1715
Score = 36.6 bits (83), Expect = 5.9, Method: Composition-based stats.
Identities = 15/43 (34%), Positives = 24/43 (55%)
Query: 144 VEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
+ VG L +VE LL+ W ++K +FI L+C LC+ +
Sbjct: 865 IRRASVGSLLEVESLLFDWVDVKLNLFIQLICFLKNLLCQQFY 907
>gi|164428824|ref|XP_956393.2| hypothetical protein NCU00083 [Neurospora crassa OR74A]
gi|157072296|gb|EAA27157.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 441
Score = 36.6 bits (83), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 3/48 (6%)
Query: 53 SPSRTRISALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPI 100
SP+R R+SALLP +R W+ +LA+ RS + + ++A+ +S +
Sbjct: 42 SPTRRRVSALLPPYRADTHWR---RLADTSRSTPTSTKVNKAIMASAV 86
>gi|336469615|gb|EGO57777.1| hypothetical protein NEUTE1DRAFT_109968 [Neurospora tetrasperma
FGSC 2508]
Length = 544
Score = 36.2 bits (82), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 3/48 (6%)
Query: 53 SPSRTRISALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPI 100
SP+R R+SALLP +R W+ +LA+ RS + + ++A+ +S +
Sbjct: 145 SPTRRRVSALLPPYRANTHWR---RLADTSRSTPTSTKVNKAIMASAV 189
>gi|350290738|gb|EGZ71952.1| hypothetical protein NEUTE2DRAFT_89679 [Neurospora tetrasperma FGSC
2509]
Length = 442
Score = 36.2 bits (82), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 3/48 (6%)
Query: 53 SPSRTRISALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPI 100
SP+R R+SALLP +R W+ +LA+ RS + + ++A+ +S +
Sbjct: 43 SPTRRRVSALLPPYRANTHWR---RLADTSRSTPTSTKVNKAIMASAV 87
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.136 0.415
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,123,682,630
Number of Sequences: 23463169
Number of extensions: 125753752
Number of successful extensions: 324286
Number of sequences better than 100.0: 72
Number of HSP's better than 100.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 43
Number of HSP's that attempted gapping in prelim test: 324225
Number of HSP's gapped (non-prelim): 76
length of query: 202
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 67
effective length of database: 9,191,667,552
effective search space: 615841725984
effective search space used: 615841725984
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 73 (32.7 bits)