BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022686
(293 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224093009|ref|XP_002309781.1| predicted protein [Populus trichocarpa]
gi|222852684|gb|EEE90231.1| predicted protein [Populus trichocarpa]
Length = 380
Score = 363 bits (931), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 176/291 (60%), Positives = 228/291 (78%), Gaps = 3/291 (1%)
Query: 1 MERLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDED 60
ME+ ++ +EELLKFTL SHI++TL+F+LG+S FC NLL++DPND + ST + D
Sbjct: 1 MEKQRERIEELLKFTLESHINQTLEFNLGLSKDFCINLLEEDPNDMLCHSTPTP--TPTD 58
Query: 61 SLQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSS 120
S GV LYPLYKRLA AL RSV GA CRTY+K+ +++ L+QKEE W QLI KG
Sbjct: 59 SFDGVALYPLYKRLASALYRSVKSGAVCRTYEKMVFGDKDSNLKQKEENWDQLIKEKGL- 117
Query: 121 DLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQS 180
+L+++L+ + E+HVQEP+FS +KDG +T+EGRC DY I PG +IL+NK ++LKV+
Sbjct: 118 ELINVLEGISCEIHVQEPYFSLLKDGRKTIEGRCATGDYIRIEPGDLILVNKIVVLKVED 177
Query: 181 VCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPY 240
V Y SFS+ML+AE+L KVLPGVKT+EEGV+IYRKFYTEEKE SNGVLAIC+SK+A QPY
Sbjct: 178 VRRYASFSKMLQAENLEKVLPGVKTVEEGVKIYRKFYTEEKEMSNGVLAICVSKLAAQPY 237
Query: 241 VTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
++LASIL GL Y G++SLLGL+ T GT+S+ALPPPRS LLSSF+ P+ PN+
Sbjct: 238 LSLASILFGLSYGGVRSLLGLADTGGTVSNALPPPRSTLLSSFIFPYNPNI 288
>gi|255586386|ref|XP_002533840.1| conserved hypothetical protein [Ricinus communis]
gi|223526219|gb|EEF28542.1| conserved hypothetical protein [Ricinus communis]
Length = 370
Score = 360 bits (925), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 174/291 (59%), Positives = 231/291 (79%), Gaps = 5/291 (1%)
Query: 1 MERLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDED 60
ME+L++ +EE++K+TL+SHI++TL FDL +S +FC+NLL+ DPND VS +S
Sbjct: 1 MEQLRNRIEEIVKYTLNSHINQTLGFDLSLSKEFCSNLLRADPNDTVS----LPPNSTSG 56
Query: 61 SLQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSS 120
S +GVP YPL++RL AL + + +FC+TY + +NE+ L+QKEE+W++LIL KGS
Sbjct: 57 SFEGVPEYPLFRRLGSALYQCIISRSFCKTYDTIEFINEDNSLKQKEEQWNKLILEKGS- 115
Query: 121 DLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQS 180
+L+++L A ELHVQEPFFS +KDGL+T+EGRC +Y+ I PG+++L+NK ++L+V+
Sbjct: 116 ELMNVLMATFHELHVQEPFFSLLKDGLKTIEGRCADDNYSRIEPGALLLINKSVVLEVKD 175
Query: 181 VCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPY 240
V Y SF +MLEAESL KVLPGVKTIEEGV++YRKFYTEEKE SNGVLAIC+SK QPY
Sbjct: 176 VRRYPSFLKMLEAESLSKVLPGVKTIEEGVEVYRKFYTEEKEMSNGVLAICVSKSPYQPY 235
Query: 241 VTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
++LA +LSGLGY GIQSLLG++HT GTISDALPP RS LLSSF LP++PNV
Sbjct: 236 LSLARMLSGLGYTGIQSLLGIAHTVGTISDALPPSRSTLLSSFTLPYRPNV 286
>gi|359495929|ref|XP_002272151.2| PREDICTED: uncharacterized protein LOC100242314 [Vitis vinifera]
gi|296081341|emb|CBI17687.3| unnamed protein product [Vitis vinifera]
Length = 398
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 175/289 (60%), Positives = 222/289 (76%), Gaps = 11/289 (3%)
Query: 4 LKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDP-NDAVSPSTSFDFDSDEDSL 62
L +C+EEL+K+TL S ++ TL+ DLG+S +C+ LL+DD D S ST DS
Sbjct: 19 LANCVEELVKYTLYSSVNGTLEIDLGLSKDYCSALLKDDHLTDPTSIST--------DSF 70
Query: 63 QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSSDL 122
+GVP YPLYKRL+ AL RS+ GAF Y +AL++E+ L+QKEE W++L+++KG +L
Sbjct: 71 EGVPPYPLYKRLSAALYRSIISGAFWEIYSTMALIHEDSSLKQKEE-WNKLVVDKGL-EL 128
Query: 123 VDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVC 182
V+ILK ++FELHVQEPFFS +KDGL+ +EGRC + DYN IG G++IL NKC++L+VQ V
Sbjct: 129 VNILKTIDFELHVQEPFFSQLKDGLKIIEGRCAVGDYNRIGSGALILFNKCLVLEVQDVR 188
Query: 183 HYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVT 242
Y SFS++LE+E L +VLPGVKTIEEGVQIYRKFYT+EKERSNGVLAIC++K A QPY+
Sbjct: 189 RYASFSQLLESEGLAEVLPGVKTIEEGVQIYRKFYTKEKERSNGVLAICVAKPAAQPYIF 248
Query: 243 LASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
LA IL GL Y G+Q LLG HT GTI +ALPPPRS LLSSFM P KPNV
Sbjct: 249 LAYILFGLSYGGVQRLLGFMHTVGTIPEALPPPRSTLLSSFMSPHKPNV 297
>gi|363808278|ref|NP_001242240.1| uncharacterized protein LOC100807259 [Glycine max]
gi|255644803|gb|ACU22903.1| unknown [Glycine max]
Length = 400
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 149/290 (51%), Positives = 201/290 (69%), Gaps = 11/290 (3%)
Query: 3 RLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
+L DC+EEL++FTL+S+ L +SS+F +NLL+DD A PS+S +DSL
Sbjct: 20 KLCDCLEELVRFTLNSN-----SHHLNLSSQFFSNLLKDD---ATHPSSSHSLSQPDDSL 71
Query: 63 QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNE--ECFLEQKEEEWSQLILNKGSS 120
+GVP YPLYKR + AL + ++ FCRT +A+ +E + ++QK EW +LI+ KG
Sbjct: 72 EGVPPYPLYKRFSSALLKCMDSETFCRTGANLAMTDEFEDSSMQQKRNEWHRLIVEKGF- 130
Query: 121 DLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQS 180
++ +ILK+V FE HVQEPFFS + DGL+T+EGRC YN I G++IL NK ++ +VQ
Sbjct: 131 EIENILKSVSFEFHVQEPFFSQLNDGLKTIEGRCATGKYNRIKSGNLILFNKSVVFEVQG 190
Query: 181 VCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPY 240
V Y +F MLEAESL K LPGV++ EEGV++Y++F TEEKE++NGVLAI +SK QPY
Sbjct: 191 VRRYPTFFAMLEAESLGKGLPGVESSEEGVKVYQRFCTEEKEQANGVLAIVVSKFTPQPY 250
Query: 241 VTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPN 290
+LA + L Y G+QSLLGL HT GTI +ALPPP S LL+SF P PN
Sbjct: 251 DSLARLFCELSYEGVQSLLGLMHTTGTIPNALPPPISTLLASFNFPCNPN 300
>gi|238479547|ref|NP_001154572.1| RNA-binding ASCH domain protein [Arabidopsis thaliana]
gi|330255179|gb|AEC10273.1| RNA-binding ASCH domain protein [Arabidopsis thaliana]
Length = 388
Score = 280 bits (715), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 149/299 (49%), Positives = 205/299 (68%), Gaps = 20/299 (6%)
Query: 1 MERLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDED 60
M +++DC++E++KFTL D ++FD+G++ +FC+ LL S D + E
Sbjct: 1 MIKIRDCLDEMVKFTL----DYCVEFDIGLTGEFCSGLL-------CGESVLHDGERIES 49
Query: 61 S-------LQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQL 113
S GVP YPLYK LAL L +S++ G+FC T++K++L E L++KE+EWS+L
Sbjct: 50 SSYALLHRFSGVPDYPLYKVLALGLLKSIDSGSFCGTFEKISLGKEVIRLKEKEDEWSKL 109
Query: 114 ILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIG-PGSVILLNK 172
I KGS +LV+ LK V EL VQEP FS +KDG++TVE RC +Y+ I GS++++NK
Sbjct: 110 INQKGS-ELVNALKDVFSELQVQEPLFSLMKDGIKTVEARCFEEEYDRIRRGGSMVMINK 168
Query: 173 CMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICI 232
C+M +V + Y SF E+L+AES KV PG KT+EEG+Q++RK Y ++E NGV+AI +
Sbjct: 169 CLMFEVLELHQYASFYELLKAESSEKVFPGTKTVEEGMQMFRKLYDTDQENFNGVVAIHL 228
Query: 233 SKMATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
SK QP V LA ILSGL Y G+Q+LLGLSHT G+I ALPPPRS LLSSFMLP+KP +
Sbjct: 229 SKSVAQPCVALAHILSGLSYTGVQNLLGLSHTTGSIFHALPPPRSMLLSSFMLPYKPKI 287
>gi|297828804|ref|XP_002882284.1| hypothetical protein ARALYDRAFT_477581 [Arabidopsis lyrata subsp.
lyrata]
gi|297328124|gb|EFH58543.1| hypothetical protein ARALYDRAFT_477581 [Arabidopsis lyrata subsp.
lyrata]
Length = 240
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 123/262 (46%), Positives = 162/262 (61%), Gaps = 46/262 (17%)
Query: 4 LKDCMEELLKFTLSSHIDETL-DFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
L++CME LL+FTLSSH+DE++ FDL ++ FC +LL++ + P+
Sbjct: 14 LRECMESLLRFTLSSHLDESVPSFDLDLTRDFCLHLLEEATDSTEKPA------------ 61
Query: 63 QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKE-----EEWSQLILNK 117
+YK LA AL EC + + E++S+L
Sbjct: 62 -------VYKLLARALS--------------------ECLASEGDNNPNLEKYSKLFHGL 94
Query: 118 GSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLK 177
G DL+++LK V FELHVQEP+F+ +KDGL+TVEGRC + DY I G+ IL NKC++L+
Sbjct: 95 G-HDLINMLKKVNFELHVQEPYFTQLKDGLKTVEGRCAVGDYMRISSGAFILFNKCLLLE 153
Query: 178 VQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMAT 237
VQ V HY SFSEML E L KVLPGV++IEEGVQ+YR FY+EEKER NGVLAI ++K A
Sbjct: 154 VQDVHHYTSFSEMLIMEGLAKVLPGVESIEEGVQVYRNFYSEEKERMNGVLAIHVAKPAN 213
Query: 238 QPYVTLASILSGLGYVGIQSLL 259
QPY LA +LS L GI+SLL
Sbjct: 214 QPYAALAGVLSELKSTGIKSLL 235
>gi|357156930|ref|XP_003577624.1| PREDICTED: uncharacterized protein LOC100838225 [Brachypodium
distachyon]
Length = 382
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 121/288 (42%), Positives = 173/288 (60%), Gaps = 26/288 (9%)
Query: 4 LKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSLQ 63
L + ELL+F LSSH+ D L +S +C+ LL DD + ++ + + +
Sbjct: 20 LSSALGELLRFVLSSHLTAP-DPALPLSPSYCSRLLDDDLCEKLAAELAGCIEEGQ---- 74
Query: 64 GVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSSDLV 123
+P P V GAF + EE +++ EW ++L KG+ +L
Sbjct: 75 -LPEPP------------VGSGAF-------RIPAEEDGPRERDREWEAVLLEKGA-ELK 113
Query: 124 DILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCH 183
+ VEF LHVQEP+F+ + G + VEGR +YN I GS +L NKC++L+V++V
Sbjct: 114 RMYDGVEFVLHVQEPYFTQLSAGTKNVEGRLAAGNYNRITQGSWLLFNKCLLLEVEAVRK 173
Query: 184 YDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVTL 243
Y SF EML+ E + VLPG+ +IE+GV++YRKFYTEEKE S+GVLAI +SK A QPY T+
Sbjct: 174 YSSFLEMLQEEMISNVLPGILSIEDGVKVYRKFYTEEKENSSGVLAISVSKPARQPYETM 233
Query: 244 ASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
+L+ LGY G+ LLGL++TAGT+ D +PPPRS L+SS M +P V
Sbjct: 234 TGLLARLGYDGLGRLLGLANTAGTVPDGVPPPRSVLISSCMKLHQPTV 281
>gi|21536491|gb|AAM60823.1| unknown [Arabidopsis thaliana]
Length = 244
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 124/272 (45%), Positives = 166/272 (61%), Gaps = 47/272 (17%)
Query: 4 LKDCMEELLKFTLSSHIDETL-DFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
L++CME LL+F+L SH++E++ FDL ++ FC +LL + DS E S
Sbjct: 14 LRECMESLLRFSLRSHLNESVPSFDLDLTRDFCLHLLGEAT------------DSTEKS- 60
Query: 63 QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKE-----EEWSQLILNK 117
+YK LA AL EC + + E++S+LI
Sbjct: 61 ------AVYKLLATALS--------------------ECLASEGDKNSNLEKYSKLIHGL 94
Query: 118 GSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLK 177
G DL+++LK V FELHVQEP+F+ +KDGL+TVEGRC + DY I G+ +L NKC++L+
Sbjct: 95 G-YDLINMLKEVNFELHVQEPYFTQLKDGLKTVEGRCAVGDYMRISSGAFLLFNKCLLLE 153
Query: 178 VQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMAT 237
VQ V Y SFSEML+ E L KVLPGV++IEEGVQ+YR FY+EEKER NGV+AI ++K A
Sbjct: 154 VQDVHRYTSFSEMLKVEGLSKVLPGVESIEEGVQVYRNFYSEEKERMNGVVAIRVAKPAN 213
Query: 238 QPYVTLASILSGLGYVGIQSLLGLSHTAGTIS 269
QP LA +LS L GI+SLL +TAG S
Sbjct: 214 QPSAALAGVLSELKSSGIKSLLD-EYTAGVTS 244
>gi|18396513|ref|NP_566202.1| RNA-binding ASCH domain protein [Arabidopsis thaliana]
gi|6017115|gb|AAF01598.1|AC009895_19 unknown protein [Arabidopsis thaliana]
gi|22655129|gb|AAM98155.1| expressed protein [Arabidopsis thaliana]
gi|32189307|gb|AAP75808.1| At3g03320 [Arabidopsis thaliana]
gi|332640408|gb|AEE73929.1| RNA-binding ASCH domain protein [Arabidopsis thaliana]
Length = 244
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 124/272 (45%), Positives = 165/272 (60%), Gaps = 47/272 (17%)
Query: 4 LKDCMEELLKFTLSSHIDETL-DFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
L++CME LL+F+L SH++E++ FDL ++ FC +LL + DS E S
Sbjct: 14 LRECMESLLRFSLRSHLNESVPSFDLDLTRDFCLHLLGEAT------------DSTEKS- 60
Query: 63 QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKE-----EEWSQLILNK 117
+YK LA AL EC + + E++S+LI
Sbjct: 61 ------AVYKLLATALS--------------------ECLASEGDKNSNLEKYSKLIHGL 94
Query: 118 GSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLK 177
G DL+++LK V FELHVQEP+F+ +KDGL+TVEGRC + DY I G +L NKC++L+
Sbjct: 95 G-YDLINMLKEVNFELHVQEPYFTQLKDGLKTVEGRCAVGDYMRISSGDFLLFNKCLLLE 153
Query: 178 VQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMAT 237
VQ V Y SFSEML+ E L KVLPGV++IEEGVQ+YR FY+EEKER NGV+AI ++K A
Sbjct: 154 VQDVHRYTSFSEMLKVEGLAKVLPGVESIEEGVQVYRNFYSEEKERMNGVVAIRVAKPAN 213
Query: 238 QPYVTLASILSGLGYVGIQSLLGLSHTAGTIS 269
QP LA +LS L GI+SLL +TAG S
Sbjct: 214 QPSAALAGVLSELKSSGIKSLLD-EYTAGVTS 244
>gi|2289012|gb|AAB64341.1| hypothetical protein [Arabidopsis thaliana]
Length = 483
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 123/294 (41%), Positives = 165/294 (56%), Gaps = 74/294 (25%)
Query: 5 KDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDS--- 61
+DC++E++KFTL D ++FD+G++ +FC+ LL S D + E S
Sbjct: 171 EDCLDEMVKFTL----DYCVEFDIGLTGEFCSGLL-------CGESVLHDGERIESSSYA 219
Query: 62 ----LQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNK 117
GVP YPLYK LAL L +S++ G+FC T++K++L E L++KE+EWS+LI K
Sbjct: 220 LLHRFSGVPDYPLYKVLALGLLKSIDSGSFCGTFEKISLGKEVIRLKEKEDEWSKLINQK 279
Query: 118 GSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLK 177
GS +LV+ ELH
Sbjct: 280 GS-ELVN-------ELH------------------------------------------- 288
Query: 178 VQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMAT 237
Y SF E+L+AES KV PG KT+EEG+Q++RK Y ++E NGV+AI +SK
Sbjct: 289 -----QYASFYELLKAESSEKVFPGTKTVEEGMQMFRKLYDTDQENFNGVVAIHLSKSVA 343
Query: 238 QPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
QP V LA ILSGL Y G+Q+LLGLSHT G+I ALPPPRS LLSSFMLP+KP +
Sbjct: 344 QPCVALAHILSGLSYTGVQNLLGLSHTTGSIFHALPPPRSMLLSSFMLPYKPKI 397
>gi|293331561|ref|NP_001169685.1| uncharacterized protein LOC100383566 [Zea mays]
gi|224030879|gb|ACN34515.1| unknown [Zea mays]
gi|414873857|tpg|DAA52414.1| TPA: hypothetical protein ZEAMMB73_867603 [Zea mays]
gi|414873858|tpg|DAA52415.1| TPA: hypothetical protein ZEAMMB73_867603 [Zea mays]
Length = 283
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 123/289 (42%), Positives = 173/289 (59%), Gaps = 36/289 (12%)
Query: 7 CMEELLKFTLSSHI-----DETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDS 61
C+ +LL+F LSSH D+T+ F L S +C LL D
Sbjct: 25 CIGDLLRFVLSSHAAAYPGDDTVAFPL--SPSYCARLLNDGE------------------ 64
Query: 62 LQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSSD 121
L+++L + + + G V + EE E EE +L+L + ++
Sbjct: 65 --------LFEKLEACIQQCLEEGRLPGPPAVVGIPAEE---EGPEERGWKLLLPEKGAE 113
Query: 122 LVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSV 181
L + AVEFELHVQEP+F+ ++ G++ VEGR +YN I GS++L NKC++L V++V
Sbjct: 114 LKRMYDAVEFELHVQEPYFTQLRAGVKKVEGRLATGNYNRITQGSLLLFNKCLLLNVEAV 173
Query: 182 CHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYV 241
Y+SFSEML+ E + VLPG+ +I EGV++YRKFY EEKE S GVLAI +SK +QPY+
Sbjct: 174 RKYNSFSEMLKGEKISNVLPGISSIVEGVKVYRKFYAEEKENSYGVLAISVSKPTSQPYI 233
Query: 242 TLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPN 290
T+ +IL+GLGY G+ LLG++ T GT+ D LPPPRSALLSS M +PN
Sbjct: 234 TMNNILAGLGYDGLGRLLGMAKTTGTVPDGLPPPRSALLSSCMGLVQPN 282
>gi|449468876|ref|XP_004152147.1| PREDICTED: uncharacterized LOC101221961 [Cucumis sativus]
Length = 238
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 119/251 (47%), Positives = 153/251 (60%), Gaps = 41/251 (16%)
Query: 4 LKDCMEELLKFTLSSHIDETLDFD--LGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDS 61
L+DC+EELLKFTL SHI+ TLD D LG S+ F ++LL + ND P S
Sbjct: 15 LRDCLEELLKFTLQSHINGTLDIDHDLGFSTDFSSHLL--NHNDC--PDVS--------- 61
Query: 62 LQGVPLYPLYKRLALALCRSVN---CGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKG 118
LYK L L +SV+ CG+ L+ E+E + +G
Sbjct: 62 -------RLYKDLVSTLLKSVSKASCGS----------------LDDFEDEEESNEIAEG 98
Query: 119 SSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKV 178
++LV++LK V FELHVQEPFF+ +KDGL+ VEGRC +YN I G++IL NKC++ +V
Sbjct: 99 RAELVNVLKTVNFELHVQEPFFTQLKDGLKRVEGRCAAGNYNRIQSGALILFNKCLLFEV 158
Query: 179 QSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQ 238
Q V Y SF ML+AESL VLPGVKT+ +GVQIYR FY+EEKE SNGVL I + K Q
Sbjct: 159 QDVRQYPSFYAMLKAESLDNVLPGVKTLTDGVQIYRNFYSEEKELSNGVLGIHVKKSVAQ 218
Query: 239 PYVTLASILSG 249
PY+ LA I+S
Sbjct: 219 PYIILARIISA 229
>gi|326510525|dbj|BAJ87479.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 385
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 125/291 (42%), Positives = 169/291 (58%), Gaps = 32/291 (10%)
Query: 4 LKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSLQ 63
L + ELL+ LSSH+ D L +S +C+ LL+DD
Sbjct: 23 LSSAVGELLRLVLSSHV-AAPDPALPLSRSYCSRLLEDD--------------------- 60
Query: 64 GVPLYPLYKRLALALCRSVNCGAFCRT---YKKVALMNEECFLEQKEEEWSQLILNKGSS 120
L +LA L G R V EE ++E EW ++ KG+
Sbjct: 61 ------LCDKLAAELAGCAEEGRIPRAPVVAGAVGTPAEENDSRKREGEWEAILREKGA- 113
Query: 121 DLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQS 180
+L I AVEF LHVQEP+F+ + G + VEGR +YN I GS++L NKC++L+V++
Sbjct: 114 ELKRIYDAVEFVLHVQEPYFTQLSAGSKNVEGRLAAGNYNRITQGSLLLFNKCLLLEVEA 173
Query: 181 VCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPY 240
V Y SFSEML+AE++ VLP + +IEEGV++YR FYTEEKE S GVLAI +SK QPY
Sbjct: 174 VRKYSSFSEMLQAETISNVLPDISSIEEGVKVYRNFYTEEKENSYGVLAISVSKPQIQPY 233
Query: 241 VTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
+T+ +LSGLGY G+ LLGL++T+GT+ D LPPP+S L+SS M KP V
Sbjct: 234 ITMTELLSGLGYDGLGRLLGLANTSGTVPDGLPPPKSVLISSCMKLHKPTV 284
>gi|297828047|ref|XP_002881906.1| hypothetical protein ARALYDRAFT_322019 [Arabidopsis lyrata subsp.
lyrata]
gi|297327745|gb|EFH58165.1| hypothetical protein ARALYDRAFT_322019 [Arabidopsis lyrata subsp.
lyrata]
Length = 495
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 122/298 (40%), Positives = 166/298 (55%), Gaps = 74/298 (24%)
Query: 1 MERLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDED 60
+++L+DC+EE++KFTL D ++FDL ++ FC LL S D + E
Sbjct: 164 VKKLRDCLEEMVKFTL----DCRVEFDLELTDDFCFGLL-------CGESILLDGERIES 212
Query: 61 S-------LQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQL 113
S GVP YPLYK LAL L +S++ G+ C T++ ++L E +L+++E+EWS+L
Sbjct: 213 SSHALLHRFGGVPDYPLYKLLALGLLKSIDSGSVCGTFENISLGKEVIWLKEREDEWSKL 272
Query: 114 ILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKC 173
I KG S+LV+ E+H Q PF
Sbjct: 273 INQKG-SELVN-------EVH-QYPF---------------------------------- 289
Query: 174 MMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
F E+L+ ES KV PG KT+EEG+Q++RK Y ++E SNGV+AI +S
Sbjct: 290 -------------FYELLKGESSEKVFPGTKTVEEGMQMFRKLYDMDQENSNGVVAINLS 336
Query: 234 KMATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
K QP V LA ILSGL Y G+Q+LLGLSHT G+I ALPPPRS LLSSFMLP+KP +
Sbjct: 337 KSVVQPCVALAHILSGLSYTGVQNLLGLSHTTGSIFHALPPPRSMLLSSFMLPYKPKI 394
>gi|62733968|gb|AAX96077.1| ProFAR isomerase associated, putative [Oryza sativa Japonica Group]
gi|77549703|gb|ABA92500.1| ProFAR isomerase associated family protein, expressed [Oryza sativa
Japonica Group]
Length = 385
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 96/179 (53%), Positives = 130/179 (72%), Gaps = 1/179 (0%)
Query: 113 LILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNK 172
++L KG+ +L + AVEFELHVQEP+F+ ++ G + VEGR +YN I GS++L NK
Sbjct: 107 VVLEKGN-ELKRMYDAVEFELHVQEPYFTQLRAGTKKVEGRLAAGNYNRIAEGSLLLFNK 165
Query: 173 CMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICI 232
C++L VQ++ Y SFSEML+AE + +VLPG+ +IE+GV +YRKFYTEEKE GVLAI +
Sbjct: 166 CLLLNVQAIRKYASFSEMLQAEMISEVLPGISSIEQGVGVYRKFYTEEKESFYGVLAISV 225
Query: 233 SKMATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
SK QPY+ + +L+GLG G+ LLG+ TAGT+ D LPPPRS L+SS M +PNV
Sbjct: 226 SKPTAQPYIIMTELLAGLGSDGLGRLLGMVKTAGTVQDGLPPPRSVLISSCMKLHQPNV 284
>gi|222615810|gb|EEE51942.1| hypothetical protein OsJ_33576 [Oryza sativa Japonica Group]
Length = 405
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 96/179 (53%), Positives = 130/179 (72%), Gaps = 1/179 (0%)
Query: 113 LILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNK 172
++L KG+ +L + AVEFELHVQEP+F+ ++ G + VEGR +YN I GS++L NK
Sbjct: 107 VVLEKGN-ELKRMYDAVEFELHVQEPYFTQLRAGTKKVEGRLAAGNYNRIAEGSLLLFNK 165
Query: 173 CMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICI 232
C++L VQ++ Y SFSEML+AE + +VLPG+ +IE+GV +YRKFYTEEKE GVLAI +
Sbjct: 166 CLLLNVQAIRKYASFSEMLQAEMISEVLPGISSIEQGVGVYRKFYTEEKESFYGVLAISV 225
Query: 233 SKMATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
SK QPY+ + +L+GLG G+ LLG+ TAGT+ D LPPPRS L+SS M +PNV
Sbjct: 226 SKPTAQPYIIMTELLAGLGSDGLGRLLGMVKTAGTVQDGLPPPRSVLISSCMKLHQPNV 284
>gi|218185550|gb|EEC67977.1| hypothetical protein OsI_35738 [Oryza sativa Indica Group]
Length = 405
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 96/179 (53%), Positives = 130/179 (72%), Gaps = 1/179 (0%)
Query: 113 LILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNK 172
++L KG+ +L + AVEFELHVQEP+F+ ++ G + VEGR +YN I GS++L NK
Sbjct: 107 VVLEKGN-ELKMMYDAVEFELHVQEPYFTQLRAGTKKVEGRLAAGNYNRIAEGSLLLFNK 165
Query: 173 CMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICI 232
C++L VQ++ Y SFSEML+AE + +VLPG+ +IE+GV +YRKFYTEEKE GVLAI +
Sbjct: 166 CLLLNVQAIRKYASFSEMLQAEMISEVLPGISSIEQGVGVYRKFYTEEKESFYGVLAISV 225
Query: 233 SKMATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
SK QPY+ + +L+GLG G+ LLG+ TAGT+ D LPPPRS L+SS M +PNV
Sbjct: 226 SKPTAQPYIIMTELLAGLGSDGLGRLLGMVKTAGTVQDGLPPPRSVLISSCMKLHQPNV 284
>gi|449484877|ref|XP_004157005.1| PREDICTED: uncharacterized LOC101221961 [Cucumis sativus]
Length = 879
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 83/150 (55%), Positives = 108/150 (72%), Gaps = 4/150 (2%)
Query: 105 QKEEEWSQLILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGP 164
+ EEE +++ +G ++LV++LK V FELHVQEPFF+ +KDGL+ VEGRC +YN I
Sbjct: 728 EDEEESNEIA--EGRAELVNVLKTVNFELHVQEPFFTQLKDGLKRVEGRCAAGNYNRIQS 785
Query: 165 GSVILLNKCMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERS 224
G++IL NKC++ +VQ V Y SF ML+AESL VLP VKT+ +GVQIYR FY+EEKE S
Sbjct: 786 GALILFNKCLLFEVQDVRQYPSFYAMLKAESLDNVLPEVKTLTDGVQIYRNFYSEEKELS 845
Query: 225 NGVLAICISKMATQPYVTLASILSGLGYVG 254
NGVL I + K QPY+ LA I+S Y G
Sbjct: 846 NGVLGIHVKKSVAQPYIILARIIS--AYFG 873
>gi|147846119|emb|CAN83746.1| hypothetical protein VITISV_022668 [Vitis vinifera]
Length = 796
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 85/149 (57%), Positives = 114/149 (76%), Gaps = 10/149 (6%)
Query: 60 DSLQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGS 119
DS +GVP YPLYKRL+ AL RS+ GAF Y +AL++E+ L+QKEE W++L+++KG
Sbjct: 18 DSFEGVPPYPLYKRLSAALYRSIISGAFWEIYSTMALIHEDSSLKQKEE-WNKLVVDKGL 76
Query: 120 SDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQ 179
+LV+ILK ++FELHVQEPFFS +KDGL+ +EGR IG G++IL NKC++L+VQ
Sbjct: 77 -ELVNILKTIDFELHVQEPFFSQLKDGLKIIEGR--------IGSGALILFNKCLVLEVQ 127
Query: 180 SVCHYDSFSEMLEAESLVKVLPGVKTIEE 208
V Y SFS++LE+E L +VLPGVKTIEE
Sbjct: 128 DVRRYASFSQLLESEGLAEVLPGVKTIEE 156
>gi|224155241|ref|XP_002337582.1| predicted protein [Populus trichocarpa]
gi|222839619|gb|EEE77942.1| predicted protein [Populus trichocarpa]
Length = 141
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 105/144 (72%), Gaps = 3/144 (2%)
Query: 1 MERLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDED 60
ME+ ++ +EELLKFTL SHI++TL+F+LG+S FC NLL++DPND + ST + D
Sbjct: 1 MEKQRERIEELLKFTLESHINQTLEFNLGLSKDFCINLLEEDPNDMLCHSTPTP--TPTD 58
Query: 61 SLQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSS 120
S GV LYPLYKRLA AL RSV GA CRTY+K+ +++ L+QKEE W QLI KG
Sbjct: 59 SFDGVALYPLYKRLASALYRSVKSGAVCRTYEKMVFGDKDSNLKQKEENWDQLIKEKG-L 117
Query: 121 DLVDILKAVEFELHVQEPFFSFIK 144
+L+++L+ + E+HVQEP+FS +K
Sbjct: 118 ELINVLEGISCEIHVQEPYFSLLK 141
>gi|449516555|ref|XP_004165312.1| PREDICTED: uncharacterized protein LOC101232651 [Cucumis sativus]
Length = 107
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 62/102 (60%), Positives = 86/102 (84%)
Query: 190 MLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVTLASILSG 249
MLE+E+L +LPGV++I+EG+QI + E++E ++ VLA+CIS + QPY++LA+I+SG
Sbjct: 1 MLESENLQSILPGVESIDEGLQILKSLNREDEEMADSVLALCISSVPFQPYISLAAIISG 60
Query: 250 LGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
L Y G+Q LLGL+HTAGT++DALPPPRSALLSSF+LP+KP V
Sbjct: 61 LSYEGLQGLLGLAHTAGTVADALPPPRSALLSSFVLPYKPEV 102
>gi|302811653|ref|XP_002987515.1| hypothetical protein SELMODRAFT_126335 [Selaginella moellendorffii]
gi|300144669|gb|EFJ11351.1| hypothetical protein SELMODRAFT_126335 [Selaginella moellendorffii]
Length = 266
Score = 134 bits (336), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 72/159 (45%), Positives = 101/159 (63%), Gaps = 3/159 (1%)
Query: 126 LKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYD 185
V F LHVQ+ F S ++DG + VEGRC Y+ G +++L+ C+ +Q V Y+
Sbjct: 3 FHGVRFRLHVQDRFLSQLRDGSKIVEGRCATGFYSRCGS-NILLMFICLYPLLQDVRAYE 61
Query: 186 SFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVTLAS 245
SF MLE E L VLPG++++EEG+ IYR FY+ E E + GVLA+ +S+ QP + +
Sbjct: 62 SFQNMLETEGLENVLPGIESVEEGIGIYRGFYSRETE-ALGVLALHVSQ-PIQPRNAVTA 119
Query: 246 ILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFM 284
+L L G+ SLLGL T GT+ ALPPPRS LL+SF+
Sbjct: 120 LLQSLKVEGVASLLGLRATKGTLVSALPPPRSRLLASFL 158
>gi|297728215|ref|NP_001176471.1| Os11g0265500 [Oryza sativa Japonica Group]
gi|255679980|dbj|BAH95199.1| Os11g0265500, partial [Oryza sativa Japonica Group]
Length = 138
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 65/125 (52%), Positives = 89/125 (71%)
Query: 166 SVILLNKCMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSN 225
S+I+ + ++ Q++ Y SFSEML+AE + +VLPG+ +IE+GV +YRKFYTEEKE
Sbjct: 13 SIIVESFVCLIFFQAIRKYASFSEMLQAEMISEVLPGISSIEQGVGVYRKFYTEEKESFY 72
Query: 226 GVLAICISKMATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFML 285
GVLAI +SK QPY+ + +L+GLG G+ LLG+ TAGT+ D LPPPRS L+SS M
Sbjct: 73 GVLAISVSKPTAQPYIIMTELLAGLGSDGLGRLLGMVKTAGTVQDGLPPPRSVLISSCMK 132
Query: 286 PFKPN 290
+PN
Sbjct: 133 LHQPN 137
>gi|223973711|gb|ACN31043.1| unknown [Zea mays]
Length = 102
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 60/101 (59%), Positives = 77/101 (76%)
Query: 190 MLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVTLASILSG 249
ML+ E + VLPG+ +I EGV++YRKFY EEKE S GVLAI +SK +QPY+T+ +IL+G
Sbjct: 1 MLKGEKISNVLPGIPSIVEGVKVYRKFYAEEKENSYGVLAISVSKPTSQPYITMNNILAG 60
Query: 250 LGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPN 290
LGY G+ LLG++ T GT+ D LPPPRSALLSS M +PN
Sbjct: 61 LGYDGLGRLLGMAKTTGTVPDGLPPPRSALLSSCMGLVQPN 101
>gi|449470818|ref|XP_004153113.1| PREDICTED: uncharacterized protein LOC101215405 [Cucumis sativus]
Length = 139
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/142 (46%), Positives = 95/142 (66%), Gaps = 10/142 (7%)
Query: 3 RLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
+L+DC++ELLKF L S + T DFDLG+SS FC++L + DP S S + S
Sbjct: 4 KLEDCVQELLKFVLQSSTNGTPDFDLGLSSAFCSSLFKHDP----STSNPLPY-----SK 54
Query: 63 QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSSDL 122
GVP YPLY+RL+LAL S+ G FC Y+K+ + N E L+QKEE W +LI++KG S++
Sbjct: 55 AGVPPYPLYERLSLALWESLCSGTFCPMYEKMLMKNGESSLKQKEEMWLKLIMDKG-SEM 113
Query: 123 VDILKAVEFELHVQEPFFSFIK 144
V +L+ + EL++ EPFF+ +K
Sbjct: 114 VQMLRTLNLELYIDEPFFTQLK 135
>gi|449454590|ref|XP_004145037.1| PREDICTED: uncharacterized protein LOC101215607 [Cucumis sativus]
Length = 116
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/117 (48%), Positives = 77/117 (65%), Gaps = 9/117 (7%)
Query: 3 RLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
+L+DC++ELLKF L S + T DFDLG+SS FC++L + DP S S + S
Sbjct: 4 KLEDCVQELLKFVLQSSTNGTPDFDLGLSSAFCSSLFKHDP----STSNPLPY-----SK 54
Query: 63 QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGS 119
GVP YPLY+RL+LAL S+ G FC Y+K+ + N E L+QKEE W +LI++KGS
Sbjct: 55 AGVPPYPLYERLSLALWESLCSGTFCPMYEKMLMKNGESSLKQKEEMWLKLIMDKGS 111
>gi|449516559|ref|XP_004165314.1| PREDICTED: uncharacterized LOC101215607 [Cucumis sativus]
Length = 127
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/129 (45%), Positives = 82/129 (63%), Gaps = 10/129 (7%)
Query: 3 RLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
+L+DC++ELLKF L S + T DFDLG+SS FC++L + DP S S + S
Sbjct: 4 KLEDCVQELLKFVLQSSTNGTPDFDLGLSSAFCSSLFKHDP----STSNPLPY-----SK 54
Query: 63 QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSSDL 122
GVP YPLY+RL+LAL S+ G FC Y+K+ + N E L+QKEE W +LI++KG S++
Sbjct: 55 AGVPPYPLYERLSLALWESLCSGTFCPMYEKMLMKNGESSLKQKEEMWLKLIMDKG-SEM 113
Query: 123 VDILKAVEF 131
V + F
Sbjct: 114 VQVRSGCSF 122
>gi|242398993|ref|YP_002994417.1| ProFAR isomerase associated, [Thermococcus sibiricus MM 739]
gi|242265386|gb|ACS90068.1| ProFAR isomerase associated, putative [Thermococcus sibiricus MM
739]
Length = 110
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/105 (46%), Positives = 70/105 (66%)
Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFS 188
+E+E+ ++E + I++G + VEGR I PG VI+ + +KV+S+ Y SF
Sbjct: 1 MEWEMGLKEEYLDLIRNGKKKVEGRLYDEKRRQINPGDVIIFEGKLKVKVKSIKKYPSFK 60
Query: 189 EMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
EMLE E + +VLPGV IEEGV+IYR+FYTEE+E+ GV+AI I
Sbjct: 61 EMLEREGIERVLPGVHNIEEGVKIYRQFYTEEEEKKYGVVAIEIE 105
>gi|168043944|ref|XP_001774443.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674295|gb|EDQ60806.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 272
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 71/230 (30%), Positives = 107/230 (46%), Gaps = 57/230 (24%)
Query: 64 GVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSSDLV 123
G+P+ PLY LA AL R + F R + + E+ + + + W++ ++
Sbjct: 1 GMPVDPLYVHLAAALYRWICTRRFPRNMAPIEGIKEDESWKSRLDAWNETVVTH------ 54
Query: 124 DILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCH 183
GS +L + V
Sbjct: 55 -----------------------------------------GSALL---------EEVHR 64
Query: 184 YDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQ-PYVT 242
Y +F M+E E L+KVLPGV++I EGVQIYRKFY+ KE++ GVL I + + A + P
Sbjct: 65 YPTFQIMIETEGLLKVLPGVESISEGVQIYRKFYSCSKEQTYGVLGIHVVRPAQRDPADL 124
Query: 243 LASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNVV 292
L+ IL+ LG G++ LLG+ T GT+ +ALPPP+S L SF P++
Sbjct: 125 LSDILNTLGIDGVRVLLGMRTTNGTLKEALPPPKSLLADSFSALQNPDIA 174
>gi|302812114|ref|XP_002987745.1| hypothetical protein SELMODRAFT_447090 [Selaginella moellendorffii]
gi|300144637|gb|EFJ11320.1| hypothetical protein SELMODRAFT_447090 [Selaginella moellendorffii]
Length = 215
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/109 (47%), Positives = 74/109 (67%), Gaps = 2/109 (1%)
Query: 176 LKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKM 235
++ + V Y+SF +MLE E L VLPG++++EEG+ IYR FY+ E E + GVLA+ +S+
Sbjct: 1 METKDVRAYESFQDMLETEGLENVLPGIESVEEGIGIYRGFYSRETE-ALGVLALHVSQ- 58
Query: 236 ATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFM 284
QP + ++L L G+ SLLGL T GT+ ALPPPRS LL+SF+
Sbjct: 59 PIQPRNAVTALLQSLKVEGVASLLGLRATKGTLVSALPPPRSRLLASFL 107
>gi|375082231|ref|ZP_09729299.1| hypothetical protein OCC_08375 [Thermococcus litoralis DSM 5473]
gi|374743119|gb|EHR79489.1| hypothetical protein OCC_08375 [Thermococcus litoralis DSM 5473]
Length = 118
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 69/101 (68%)
Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFS 188
+E+E+ +QE + IK+G + +EGR I PG VI+ + ++V+++ Y SF
Sbjct: 1 MEWEMGLQEEYLKLIKEGKKKIEGRLYDEKRRQIKPGDVIIFEGRLKVRVKALRVYPSFK 60
Query: 189 EMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLA 229
EMLE E L +VLP VK+I+EGV++YRKFY+EE+ER GV+A
Sbjct: 61 EMLEKEGLDRVLPNVKSIDEGVKVYRKFYSEEEERKYGVVA 101
>gi|315231674|ref|YP_004072110.1| hypothetical protein TERMP_01912 [Thermococcus barophilus MP]
gi|315184702|gb|ADT84887.1| hypothetical protein TERMP_01912 [Thermococcus barophilus MP]
Length = 116
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 69/105 (65%)
Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFS 188
+++E+ +QE + I +G + +EGR I PG +I+ + +KV+ + Y SF
Sbjct: 1 MKWEMGLQEEYLKLIAEGKKKIEGRLYDEKRRQIKPGDIIIFEGKLKVKVKDIRVYPSFK 60
Query: 189 EMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
EMLE E L VLPG+K+IEEGV+IYR+FY+EE+E+ GV+AI I
Sbjct: 61 EMLEKEGLENVLPGIKSIEEGVKIYRQFYSEEEEKKWGVVAIEIE 105
>gi|20093930|ref|NP_613777.1| fused ProFAR isomerase-like protein/uncharacterized protein
[Methanopyrus kandleri AV19]
gi|19886880|gb|AAM01707.1| Archaea-specific enzyme related to ProFAR isomerase (HisA) and
containing an additional uncharacterized domain
[Methanopyrus kandleri AV19]
Length = 350
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/103 (47%), Positives = 66/103 (64%)
Query: 133 LHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEMLE 192
+ ++E + + IK+G +TVEGR I PG IL N+ +++KV V YDSF EML
Sbjct: 239 MGLEEEYLNLIKEGKKTVEGRVKDDKRARIKPGDKILFNRRLLVKVIDVREYDSFEEMLR 298
Query: 193 AESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKM 235
E L VLP V +IEEGV+IYR+FY+ KE+ GVLAI I +
Sbjct: 299 EEGLENVLPNVDSIEEGVEIYRRFYSSGKEKMFGVLAIEIEPI 341
>gi|14521811|ref|NP_127287.1| hypothetical protein PAB1060 [Pyrococcus abyssi GE5]
gi|5459031|emb|CAB50517.1| Hypothetical protein PAB1060 [Pyrococcus abyssi GE5]
Length = 111
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/106 (44%), Positives = 69/106 (65%), Gaps = 1/106 (0%)
Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSF 187
+E+E+ +QE + IK GL+ +EGR I PG +I+ + ++V+++ Y SF
Sbjct: 1 MEWEMGLQEEYIELIKRGLKKIEGRLYDEKRRKIKPGDIIVFEGGKLKVRVKALRVYKSF 60
Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
EMLE E + VLPGV ++EEGV+IYRKFY EE+E+ GV+AI I
Sbjct: 61 KEMLEKEGIENVLPGVNSVEEGVKIYRKFYDEEREKKYGVVAIEIE 106
>gi|18976827|ref|NP_578184.1| hypothetical protein PF0455 [Pyrococcus furiosus DSM 3638]
gi|397650960|ref|YP_006491541.1| hypothetical protein PFC_01390 [Pyrococcus furiosus COM1]
gi|18892426|gb|AAL80579.1| hypothetical protein PF0455 [Pyrococcus furiosus DSM 3638]
gi|393188551|gb|AFN03249.1| hypothetical protein PFC_01390 [Pyrococcus furiosus COM1]
Length = 110
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/106 (46%), Positives = 68/106 (64%), Gaps = 1/106 (0%)
Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSF 187
+E+E+ +QE F IK + +EGR I PG VI + ++V+++ Y+SF
Sbjct: 1 MEWEMGLQEEFLELIKLRKKKIEGRLYDEKRRQIKPGDVISFEGGKLKVRVKAIRVYNSF 60
Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
EMLE E L VLPGVK+IEEG+Q+YR+FY EEKE+ GV+AI I
Sbjct: 61 REMLEKEGLENVLPGVKSIEEGIQVYRRFYDEEKEKKYGVVAIEIE 106
>gi|157879999|pdb|1S04|A Chain A, Solution Nmr Structure Of Protein Pf0455 From Pyrococcus
Furiosus. Northeast Structural Genomics Consortium
Target Pfr13
Length = 110
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/106 (46%), Positives = 68/106 (64%), Gaps = 1/106 (0%)
Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSF 187
+E+E+ +QE F IK + +EGR I PG VI + ++V+++ Y+SF
Sbjct: 1 MEWEMGLQEEFLELIKLRKKKIEGRLYDEKRRQIKPGDVISFEGGKLKVRVKAIRVYNSF 60
Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
EMLE E L VLPGVK+IEEG+Q+YR+FY EEKE+ GV+AI I
Sbjct: 61 REMLEKEGLENVLPGVKSIEEGIQVYRRFYDEEKEKKYGVVAIEIE 106
>gi|332158605|ref|YP_004423884.1| hypothetical protein PNA2_0964 [Pyrococcus sp. NA2]
gi|331034068|gb|AEC51880.1| hypothetical protein PNA2_0964 [Pyrococcus sp. NA2]
Length = 109
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 68/106 (64%), Gaps = 1/106 (0%)
Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSF 187
+E+E+ +QE + IK G + +EGR I PG +I+ + +KV+++ Y SF
Sbjct: 1 MEWEMGLQEEYIELIKSGKKKIEGRLYDEKRRQIKPGDIIIFEGGKLKVKVKAIRVYKSF 60
Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
EMLE E + VLPGVK +EEGV++YR+FY EE+E+ GV+AI I
Sbjct: 61 KEMLEKEGIENVLPGVKDVEEGVKVYRQFYDEEREKKYGVVAIEIE 106
>gi|14590265|ref|NP_142331.1| hypothetical protein PH0355 [Pyrococcus horikoshii OT3]
gi|160286344|pdb|2Z0T|A Chain A, Crystal Structure Of Hypothetical Protein Ph0355
gi|160286345|pdb|2Z0T|B Chain B, Crystal Structure Of Hypothetical Protein Ph0355
gi|160286346|pdb|2Z0T|C Chain C, Crystal Structure Of Hypothetical Protein Ph0355
gi|160286347|pdb|2Z0T|D Chain D, Crystal Structure Of Hypothetical Protein Ph0355
gi|3256746|dbj|BAA29429.1| 109aa long hypothetical protein [Pyrococcus horikoshii OT3]
Length = 109
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 68/106 (64%), Gaps = 1/106 (0%)
Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSF 187
+++E+ +QE + IK G + +EGR I PG +I+ + +KV+ + Y SF
Sbjct: 1 MKWEMGLQEEYIELIKAGKKKIEGRLYDEKRRQIKPGDIIIFEGGKLKVKVKGIRVYSSF 60
Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
EMLE E + VLPGVK+IEEGV++YR+FY EE+E+ GV+AI I
Sbjct: 61 KEMLEKEGIENVLPGVKSIEEGVKVYRQFYDEEREKKYGVVAIEIE 106
>gi|409096441|ref|ZP_11216465.1| hypothetical protein TzilA_07282 [Thermococcus zilligii AN1]
Length = 117
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 63/103 (61%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+ + +QE + I +G + +EGR I PG I+ +M V+ V Y SF EM
Sbjct: 4 WRMGLQEEYLRAIAEGRKKIEGRLYDEKRQGIKPGDTIIFEDRLMCVVKDVRVYSSFREM 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
LE+E + VLPGV IEEGV++YRKFY+EEKER GV+AI +
Sbjct: 64 LESEGIENVLPGVGDIEEGVKVYRKFYSEEKERKYGVVAIEVE 106
>gi|212224839|ref|YP_002308075.1| hypothetical protein TON_1688 [Thermococcus onnurineus NA1]
gi|212009796|gb|ACJ17178.1| conserved hypothetical protein [Thermococcus onnurineus NA1]
Length = 116
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 64/105 (60%)
Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFS 188
V + + +QE + I +G + VEGR I PG I+ +M+ V+ + Y SF
Sbjct: 2 VTWRMGLQEEYLKAIAEGRKKVEGRLYDEKRQGIKPGDTIIFENKLMVVVKDIRVYSSFR 61
Query: 189 EMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
EML E L VLPGVK+IEEGV++YRKFY+E+KE+ GV AI +
Sbjct: 62 EMLGKEGLENVLPGVKSIEEGVKVYRKFYSEDKEKKYGVAAIEVE 106
>gi|57641772|ref|YP_184250.1| hypothetical protein TK1837 [Thermococcus kodakarensis KOD1]
gi|57160096|dbj|BAD86026.1| hypothetical protein, conserved, DUF1530 family [Thermococcus
kodakarensis KOD1]
Length = 117
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/106 (44%), Positives = 66/106 (62%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+++ +QE + I +G + +EGR I PG I+ ++ V+ V Y SF EM
Sbjct: 4 WKMGLQEEYLRAIAEGKKKIEGRLYDEKRQKIKPGDEIVFENKLVCVVKDVRVYSSFREM 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMA 236
LE E L VLPGVK+IEEGV++YRKFY+EEKE+ GV AI + +A
Sbjct: 64 LEKEGLENVLPGVKSIEEGVKVYRKFYSEEKEKKYGVAAIEVEPVA 109
>gi|240102242|ref|YP_002958550.1| hypothetical protein TGAM_0184 [Thermococcus gammatolerans EJ3]
gi|239909795|gb|ACS32686.1| Conserved hypothetical protein [Thermococcus gammatolerans EJ3]
Length = 116
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 68/106 (64%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+++ +QE + I +G + +EGR +I PG I+ ++ V+ + Y SF EM
Sbjct: 4 WKMGLQEEYLKAIAEGKKKIEGRLYDEKRQAIKPGDEIVFENKLVCVVKDLRVYSSFREM 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMA 236
LE E L VLPGVK+IEEGV++YRKFY+EEKE+ GV+AI + +A
Sbjct: 64 LEKEGLENVLPGVKSIEEGVKVYRKFYSEEKEKKYGVVAIEVEPVA 109
>gi|380742440|tpe|CCE71074.1| TPA: hypothetical protein PAB1060 [Pyrococcus abyssi GE5]
Length = 107
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 65/102 (63%), Gaps = 1/102 (0%)
Query: 133 LHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSFSEML 191
+ +QE + IK GL+ +EGR I PG +I+ + ++V+++ Y SF EML
Sbjct: 1 MGLQEEYIELIKRGLKKIEGRLYDEKRRKIKPGDIIVFEGGKLKVRVKALRVYKSFKEML 60
Query: 192 EAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
E E + VLPGV ++EEGV+IYRKFY EE+E+ GV+AI I
Sbjct: 61 EKEGIENVLPGVNSVEEGVKIYRKFYDEEREKKYGVVAIEIE 102
>gi|389852386|ref|YP_006354620.1| hypothetical protein Py04_0963 [Pyrococcus sp. ST04]
gi|388249692|gb|AFK22545.1| hypothetical protein Py04_0963 [Pyrococcus sp. ST04]
Length = 105
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 65/104 (62%), Gaps = 1/104 (0%)
Query: 133 LHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSFSEML 191
+ +QE + IK G + +EGR I PG +I + ++V+++ Y +F EML
Sbjct: 1 MGLQEEYLELIKAGKKKIEGRLYDEKRRQIKPGDIISFEGGRLKVRVKAIRVYKTFREML 60
Query: 192 EAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKM 235
E E + VLPGV++IEEGVQIYRKFY EE+E+ GV+AI I M
Sbjct: 61 EKEGIENVLPGVESIEEGVQIYRKFYDEEREKKYGVVAIEIEPM 104
>gi|223477503|ref|YP_002581867.1| ProFAR isomerase associated superfamily protein [Thermococcus sp.
AM4]
gi|214032729|gb|EEB73558.1| ProFAR isomerase associated superfamily protein [Thermococcus sp.
AM4]
Length = 116
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 45/106 (42%), Positives = 67/106 (63%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+++ +QE + I +G + +EGR +I PG I+ ++ V+ + Y SF EM
Sbjct: 4 WKMGLQEEYLRAIAEGKKKIEGRLYDEKRQAIKPGDEIVFENRLVCVVKDLRVYSSFREM 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMA 236
LE E L VLPGVK IEEGV++YR+FY+EEKE+ GV+AI + +A
Sbjct: 64 LEKEGLENVLPGVKNIEEGVKVYRRFYSEEKEKKYGVVAIEVEPVA 109
>gi|341582462|ref|YP_004762954.1| hypothetical protein GQS_06895 [Thermococcus sp. 4557]
gi|340810120|gb|AEK73277.1| hypothetical protein GQS_06895 [Thermococcus sp. 4557]
Length = 116
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 65/106 (61%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+ + +QE + I +G + +EGR I PG I+ ++ V+ V Y SF EM
Sbjct: 4 WRMGLQEEYLKAIAEGRKRIEGRLYDEKRQGIKPGDEIIFENKLVCVVKDVRVYSSFREM 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMA 236
LE E L VLPGV+++E+GV++YR+FY+EEKER GV AI + +A
Sbjct: 64 LEKEGLENVLPGVESVEDGVRVYRRFYSEEKERKYGVAAIEVEPVA 109
>gi|337285338|ref|YP_004624812.1| hypothetical protein PYCH_18850 [Pyrococcus yayanosii CH1]
gi|334901272|gb|AEH25540.1| hypothetical protein PYCH_18850 [Pyrococcus yayanosii CH1]
Length = 112
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 67/106 (63%), Gaps = 1/106 (0%)
Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSF 187
+E+E+ +QE + I+ G + +EGR I PG +I + ++V+++ Y SF
Sbjct: 1 MEWEMGLQEEYIELIRTGKKKIEGRLYDEKRRQIKPGDIISFEGGKLKVRVKALRVYSSF 60
Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
EMLE E L VLPGV +IEEGV++YR+FY EE+E+ GV+AI +
Sbjct: 61 REMLEKEGLENVLPGVDSIEEGVRVYRQFYDEEREKKYGVVAIEVE 106
>gi|390961172|ref|YP_006425006.1| hypothetical protein CL1_1009 [Thermococcus sp. CL1]
gi|390519480|gb|AFL95212.1| hypothetical protein CL1_1009 [Thermococcus sp. CL1]
Length = 110
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 64/104 (61%)
Query: 133 LHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEMLE 192
+ +QE + I +G + +EGR +I PG I+ ++ V+ V Y SF EMLE
Sbjct: 1 MGLQEEYLRAIAEGRKRIEGRLYDEKRQAIRPGDEIVFENKLVCVVKDVRVYSSFREMLE 60
Query: 193 AESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMA 236
E + VLPGV +IEEGV++YR+FY+EEKE+ GV AI + +A
Sbjct: 61 KEGIENVLPGVGSIEEGVKVYRRFYSEEKEKKYGVAAIEVEPVA 104
>gi|326431332|gb|EGD76902.1| ProFAR isomerase associated superfamily [Salpingoeca sp. ATCC
50818]
Length = 128
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/109 (44%), Positives = 63/109 (57%), Gaps = 9/109 (8%)
Query: 138 PFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKC--------MMLKVQSVCHYDSFSE 189
P+FSFI G +TVEGR Y +G G I +C + + V V Y S +
Sbjct: 21 PWFSFIAQGKKTVEGRLASDKYTGLGVGCTIAF-RCEPTPDLPPVHVTVTRVVRYPSIAL 79
Query: 190 MLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQ 238
LEAE L + LPG++T+ EGVQ+YR+FYTEE E+ GVLAI I + Q
Sbjct: 80 YLEAEGLSRCLPGIETLAEGVQVYRQFYTEEDEQRLGVLAIHIQRATDQ 128
>gi|282890016|ref|ZP_06298550.1| hypothetical protein pah_c009o037 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|338174413|ref|YP_004651223.1| hypothetical protein PUV_04190 [Parachlamydia acanthamoebae UV-7]
gi|281500108|gb|EFB42393.1| hypothetical protein pah_c009o037 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|336478771|emb|CCB85369.1| putative uncharacterized protein [Parachlamydia acanthamoebae UV-7]
Length = 106
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 39/99 (39%), Positives = 62/99 (62%)
Query: 132 ELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEML 191
+++V++ FF IK G +TVEGR S + ++ PG + +N +++ VQ V Y F +ML
Sbjct: 3 DINVEDQFFKLIKQGKKTVEGRLAKSKFLNLAPGDCLRINNQLIVFVQRVDCYPFFRDML 62
Query: 192 EAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
E L VLPG +++EG IY +FY+ E E+ GV+A+
Sbjct: 63 FHEGLKNVLPGCASLDEGENIYYRFYSREDEKKFGVIAV 101
>gi|406997767|gb|EKE15786.1| hypothetical protein ACD_11C00105G0011 [uncultured bacterium]
Length = 111
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 64/104 (61%), Gaps = 1/104 (0%)
Query: 133 LHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLN-KCMMLKVQSVCHYDSFSEML 191
++VQEP++SFI G +TVEGR + SI G ++ L + + L V Y +F EM+
Sbjct: 6 INVQEPYYSFILQGKKTVEGRLNKGKFASIQVGDILELEPEKIKLVVVEKNIYKNFREMI 65
Query: 192 EAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKM 235
E E + V+P I+E V +Y KFYT+E+E+ GV+AI I K+
Sbjct: 66 EKEGIGNVIPDKNDIDEAVNVYYKFYTKEQEQEFGVVAIKIKKL 109
>gi|123494593|ref|XP_001326550.1| Archaea-specific enzyme related to ProFAR isomerase [Trichomonas
vaginalis G3]
gi|121909466|gb|EAY14327.1| Archaea-specific enzyme related to ProFAR isomerase, putative
[Trichomonas vaginalis G3]
Length = 111
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 61/109 (55%), Gaps = 9/109 (8%)
Query: 134 HVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL-------NKCMMLKVQSVCHYDS 186
H+ EP+FS+++ G +T+EGR +++ I PG +I + +KV + Y S
Sbjct: 5 HLSEPWFSYVEQGKKTIEGRLEREEWSKIKPGDIIEFYNSDTGSKRAFQVKVVDIKKYSS 64
Query: 187 FSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKM 235
F ++ E L +LPGV ++EEG++IY K Y ++ +L I + K+
Sbjct: 65 FKSLINTEGLNNILPGVTSLEEGIKIYEKIYGSTP--NHNILGIKLQKL 111
>gi|338176370|ref|YP_004653180.1| hypothetical protein PUV_23760 [Parachlamydia acanthamoebae UV-7]
gi|336480728|emb|CCB87326.1| putative uncharacterized protein [Parachlamydia acanthamoebae UV-7]
Length = 168
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 59/105 (56%), Gaps = 3/105 (2%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLN---KCMMLKVQSVCHYDSF 187
F +H +P+FS+I+ G++ VEGR + Y I G I + + + V + Y +
Sbjct: 61 FNIHCDDPWFSYIRQGIKPVEGRKSTHTYKKIKVGDQIKFSNGKESFVANVTEIREYSTI 120
Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICI 232
+ LE +L K LPGVK+IEEG+ IY ++ EEK R G L + I
Sbjct: 121 EQYLEDVTLEKALPGVKSIEEGLNIYYEWTPEEKIRQYGFLGLFI 165
>gi|282892300|ref|ZP_06300696.1| hypothetical protein pah_c221o034 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|281497891|gb|EFB40242.1| hypothetical protein pah_c221o034 [Parachlamydia acanthamoebae str.
Hall's coccus]
Length = 135
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 59/106 (55%), Gaps = 3/106 (2%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLN---KCMMLKVQSVCHYDSF 187
F +H +P+FS+I+ G++ VEGR + Y I G I + + + V + Y +
Sbjct: 28 FNIHCDDPWFSYIRQGIKPVEGRKSTHTYKKIKVGDQIKFSNGKESFVANVTEIREYSTI 87
Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
+ LE +L K LPGVK+IEEG+ IY ++ EEK R G L + I
Sbjct: 88 EQYLEDVTLEKALPGVKSIEEGLNIYYEWTPEEKIRQYGFLGLFIK 133
>gi|224133258|ref|XP_002327999.1| predicted protein [Populus trichocarpa]
gi|222837408|gb|EEE75787.1| predicted protein [Populus trichocarpa]
Length = 66
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/78 (64%), Positives = 56/78 (71%), Gaps = 13/78 (16%)
Query: 190 MLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVTLASILSG 249
ML AESL KVLPGVKT+E EEKE SNGVLAIC+SK+A QPY++LASIL G
Sbjct: 1 MLHAESLEKVLPGVKTVE----------EEEKEVSNGVLAICVSKLAAQPYLSLASILFG 50
Query: 250 LGYVGIQSLLGLSHTAGT 267
L Y G+QS LG TAGT
Sbjct: 51 LSYGGVQSQLG---TAGT 65
>gi|297823775|ref|XP_002879770.1| hypothetical protein ARALYDRAFT_903127 [Arabidopsis lyrata subsp.
lyrata]
gi|297325609|gb|EFH56029.1| hypothetical protein ARALYDRAFT_903127 [Arabidopsis lyrata subsp.
lyrata]
Length = 130
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 53/141 (37%), Positives = 78/141 (55%), Gaps = 25/141 (17%)
Query: 1 MERLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDED 60
M+RL+DC+EE++KFTL ++ FDL ++ FC LL S D + E
Sbjct: 1 MDRLRDCLEEMVKFTLDCRVE----FDLELTDDFCFGLL-------CGESILLDGERIES 49
Query: 61 S-------LQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQL 113
S GVP YPLYK LAL L +S+ G+ C T++ ++L E +E+EWS+L
Sbjct: 50 SSHALLHRFGGVPDYPLYKLLALGLLKSIVSGSVCGTFENISLGKE------REDEWSKL 103
Query: 114 ILNKGSSDLVDILKAVEFELH 134
I KG S+LV++ ++ F L
Sbjct: 104 INQKG-SELVNVRQSQSFILQ 123
>gi|57639990|ref|YP_182468.1| hypothetical protein TK0055 [Thermococcus kodakarensis KOD1]
gi|57158314|dbj|BAD84244.1| hypothetical protein, conserved, DUF1530 family [Thermococcus
kodakarensis KOD1]
Length = 112
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 52/100 (52%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+ L V++ + +FI+ G + +E R + +I PG I+ N + V V Y++F ++
Sbjct: 4 YRLFVKDEYLNFIRSGEKRIEVRVAYPQFRNIRPGDKIIFNDSIPAVVTEVKKYETFRQV 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
L E + K+ P + E V+ + Y + KE GV+AI
Sbjct: 64 LREEPIKKIFPDEPSFERAVKRFHNLYPKWKENRYGVIAI 103
>gi|302847948|ref|XP_002955507.1| hypothetical protein VOLCADRAFT_66018 [Volvox carteri f.
nagariensis]
gi|300259130|gb|EFJ43360.1| hypothetical protein VOLCADRAFT_66018 [Volvox carteri f.
nagariensis]
Length = 165
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 55/118 (46%), Gaps = 19/118 (16%)
Query: 139 FFSFIKDGLETVEGRCTISDYNSIGPGSVILL------------------NKCMMLKVQS 180
+ I+ G +TVEGR + + PG + + + + +
Sbjct: 45 YLELIRSGTKTVEGRIRAGKWADVIPGDIFRFFSTQAPPSSTLPANTNTNSVSVRCRAIT 104
Query: 181 VCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYT-EEKERSNGVLAICISKMAT 237
V YDSF MLE E L LPGV+++EEGV++YR E+E GV+A+ + + T
Sbjct: 105 VRQYDSFQAMLEGEGLAACLPGVQSLEEGVEVYRSIPGYREREAVEGVVAVGLELLLT 162
>gi|223478631|ref|YP_002583381.1| ProFAR isomerase associated superfamily protein [Thermococcus sp.
AM4]
gi|214033857|gb|EEB74683.1| ProFAR isomerase associated superfamily protein [Thermococcus sp.
AM4]
Length = 112
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 49/100 (49%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+ L V++ + FIK G + +E R I PG I+ N + V V Y++F ++
Sbjct: 4 YRLFVRDEYLDFIKSGEKRIEVRVAYPQLRKIQPGDKIIFNDSVPAVVTEVKRYETFRQV 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
L E + K+ P + E V+ + Y + KE GV+AI
Sbjct: 64 LRQEPIKKIFPDEPSFERAVKRFHNLYPKWKENRYGVIAI 103
>gi|341583022|ref|YP_004763514.1| hypothetical protein GQS_09715 [Thermococcus sp. 4557]
gi|340810680|gb|AEK73837.1| hypothetical protein GQS_09715 [Thermococcus sp. 4557]
Length = 114
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 50/100 (50%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+ L V+E + +IK G + +E R + + PG I+ N + V V Y++F ++
Sbjct: 4 YRLRVREEYLDYIKSGEKRIEVRVAYPQFRGMKPGDKIIFNDQVPAVVTGVKEYETFRQV 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
L E + K+ P + E V+ + Y + KE GV+AI
Sbjct: 64 LREEPIKKIFPDEPSFERAVKRFHGMYPKWKENRYGVIAI 103
>gi|240103056|ref|YP_002959365.1| hypothetical protein TGAM_0999 [Thermococcus gammatolerans EJ3]
gi|239910610|gb|ACS33501.1| Conserved hypothetical protein [Thermococcus gammatolerans EJ3]
Length = 112
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 49/100 (49%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+ L V++ + FIK G + +E R I PG ++ N + V V Y++F ++
Sbjct: 4 YRLFVRDEYLDFIKSGEKRIEVRVAYPQLRRIQPGDKLIFNDSIPAVVTEVKRYETFRQV 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
L E + K+ P + E V+ + Y + KE GV+AI
Sbjct: 64 LREEPIKKIFPDEPSFERAVKRFHNLYPKWKENRYGVIAI 103
>gi|212224321|ref|YP_002307557.1| hypothetical protein TON_1172 [Thermococcus onnurineus NA1]
gi|212009278|gb|ACJ16660.1| hypothetical protein, conserved [Thermococcus onnurineus NA1]
Length = 111
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 50/100 (50%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+ L V++ + +IK G + +E R +I PG IL N + V V Y++F ++
Sbjct: 4 YRLRVRDEYLDYIKSGEKRIEVRVAYPQLRNIKPGDKILFNNEVPAVVTGVKRYETFRQV 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
L E + K+ P + E ++ + Y + KE GV+AI
Sbjct: 64 LREEQIKKIFPDEPSFERALKRFHNMYPKWKENRYGVIAI 103
>gi|409095591|ref|ZP_11215615.1| hypothetical protein TzilA_02940 [Thermococcus zilligii AN1]
Length = 115
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 48/100 (48%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+ L V E + FI+ G + +E R I PG I+ N + V V Y++F ++
Sbjct: 4 YHLKVHEEYLEFIRSGEKRIEVRVAYPQLRGIKPGDRIIFNDSIPAIVTEVKRYETFRQV 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
L E + K+ P + E ++ + Y + KE GV+AI
Sbjct: 64 LREEPIEKIFPDEPSFERALKRFHGMYPKWKENRYGVMAI 103
>gi|390961690|ref|YP_006425524.1| hypothetical protein CL1_1531 [Thermococcus sp. CL1]
gi|390519998|gb|AFL95730.1| hypothetical protein CL1_1531 [Thermococcus sp. CL1]
Length = 112
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 48/100 (48%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+ L V++ + FIK G + +E R I PG ++ N + V V Y++F ++
Sbjct: 4 YHLKVRDEYLDFIKSGEKRIEVRVAYPQLRKIQPGDKLIFNDSIPAVVTEVKRYETFRQV 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
L E + K+ P + E V+ + Y + KE GV+ I
Sbjct: 64 LREEPIKKIFPDEPSFERAVKRFHNLYPKWKENRYGVIVI 103
>gi|332158720|ref|YP_004423999.1| hypothetical protein PNA2_1079 [Pyrococcus sp. NA2]
gi|331034183|gb|AEC51995.1| hypothetical protein PNA2_1079 [Pyrococcus sp. NA2]
Length = 116
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/100 (26%), Positives = 51/100 (51%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+ L++++ + IK G + +E R + + G I+ N + +V V HY++F ++
Sbjct: 4 YRLYLRDEYLEMIKSGKKRIEVRVAYPQFKGMKRGDKIIFNDSIPAEVIEVKHYETFRQV 63
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
L E + K+ P + E ++ + Y + KE GV+AI
Sbjct: 64 LREEPIDKIFPDEPSFERALRRFHNMYPKWKENRYGVIAI 103
>gi|156936814|ref|YP_001434610.1| ProFAR isomerase family protein [Ignicoccus hospitalis KIN4/I]
gi|156565798|gb|ABU81203.1| ProFAR isomerase family protein [Ignicoccus hospitalis KIN4/I]
Length = 236
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 51/94 (54%), Gaps = 8/94 (8%)
Query: 132 ELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL-----NKCMMLKVQSVCHYDS 186
+ ++E +F I G + VEGR + G VI K + KV+ + Y +
Sbjct: 134 RMKLEERWFELIASGEKVVEGRVYDEKRKRLRVGHVIQFKSVQSGKLLYAKVKKLVMYKN 193
Query: 187 FSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEE 220
F EMLE+E +VLPG+ ++EEG+++Y K+Y E
Sbjct: 194 FKEMLESED--RVLPGL-SVEEGLKVYEKYYGPE 224
>gi|14590361|ref|NP_142427.1| hypothetical protein PH0447 [Pyrococcus horikoshii OT3]
gi|3256850|dbj|BAA29533.1| 114aa long hypothetical protein [Pyrococcus horikoshii OT3]
Length = 114
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/100 (25%), Positives = 50/100 (50%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
+ L++++ + +K G + +E R + G I+ N + +V V HY++F ++
Sbjct: 5 YRLYLRDEYLEMVKSGKKRIEVRVAYPQLKGMKRGDKIIFNDEVPAEVIEVKHYETFRQV 64
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
L E + K+ P + E ++ + Y + KE GV+AI
Sbjct: 65 LREEPIDKIFPDEPSFERALRRFHNLYPKWKENRYGVIAI 104
>gi|224133328|ref|XP_002328015.1| predicted protein [Populus trichocarpa]
gi|222837424|gb|EEE75803.1| predicted protein [Populus trichocarpa]
Length = 76
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/43 (58%), Positives = 32/43 (74%), Gaps = 3/43 (6%)
Query: 206 IEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVTLASILS 248
+ EG ++ FYTEEKERSNGVLAIC +K A Y++LA+I S
Sbjct: 36 LNEGTEL---FYTEEKERSNGVLAICAAKSAAPSYISLATISS 75
>gi|389852077|ref|YP_006354311.1| hypothetical protein Py04_0634 [Pyrococcus sp. ST04]
gi|388249383|gb|AFK22236.1| hypothetical protein Py04_0634 [Pyrococcus sp. ST04]
Length = 99
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 43/88 (48%)
Query: 143 IKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEMLEAESLVKVLPG 202
IK G + +E R + G I+ N + +V V HY++F ++L E + K+ P
Sbjct: 2 IKSGKKRIEVRVAYPQLKGMKRGDKIIFNDSIPAEVIEVKHYETFRQVLREEPIEKIFPD 61
Query: 203 VKTIEEGVQIYRKFYTEEKERSNGVLAI 230
+ E ++ + Y + KE GVLAI
Sbjct: 62 EPSFERALRRFHNMYPKWKEYRYGVLAI 89
>gi|338731888|ref|YP_004663007.1| hypothetical protein SNE_B25120 [Simkania negevensis Z]
gi|336481271|emb|CCB87871.1| putative uncharacterized protein [Simkania negevensis Z]
Length = 175
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 51/98 (52%), Gaps = 12/98 (12%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKC----MMLKVQSVCHYDS 186
+ L+++ + +I DG + +EGR + D+ + G ++ + + SV YDS
Sbjct: 42 YRLNIETQYLLYIIDGKKIIEGRLNVPDFGDMKKGDLVYFTDGNGGQAICSITSVGRYDS 101
Query: 187 FSEMLEAESLVKVLPGV--------KTIEEGVQIYRKF 216
F++ML +E ++ +LP + + + +G +IYR F
Sbjct: 102 FNKMLVSEGVINMLPQIDPNTNSSEEMLLKGTKIYRSF 139
>gi|256396042|ref|YP_003117606.1| hypothetical protein Caci_6932 [Catenulispora acidiphila DSM 44928]
gi|256362268|gb|ACU75765.1| protein of unknown function DUF437 [Catenulispora acidiphila DSM
44928]
Length = 119
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 54/105 (51%), Gaps = 5/105 (4%)
Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL---NKCMMLKVQSVCHYDSF 187
+E+++ +F + G +T+E R S+ G I ++ +V+ V Y SF
Sbjct: 14 YEMNLYRRYFDMVASGRKTIEVRVHYPKLRSLTTGDHIRFIYDTDSVLTRVKRVARYTSF 73
Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICI 232
+ML+AE +V P ++ V + R+ Y+ EKE + GVLAI I
Sbjct: 74 EQMLDAEDAARVNPDSPHEQQLVDM-RRIYSPEKE-ALGVLAIEI 116
>gi|294813019|ref|ZP_06771662.1| GCN5-related N-acetyltransferase [Streptomyces clavuligerus ATCC
27064]
gi|294325618|gb|EFG07261.1| GCN5-related N-acetyltransferase [Streptomyces clavuligerus ATCC
27064]
Length = 121
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 6/108 (5%)
Query: 130 EFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPG---SVILLNKCMMLKVQSVCHYDS 186
E L +++P+ I G +T+E R PG ++I ++ + ++ +V Y+S
Sbjct: 10 EHSLTIRKPYLDLIASGAKTIEVRVGYPKIRRFAPGDILTIICADESVTTRITAVREYES 69
Query: 187 FSEMLEAESLVKV-LPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
F+ M++AE + PG+ T ++ V R Y EKE + GV A+ ++
Sbjct: 70 FAAMVDAEDAAAIGGPGM-THDQLVAAIRSIYPPEKE-ALGVFALHLA 115
>gi|254392325|ref|ZP_05007509.1| hypothetical protein SSCG_04879 [Streptomyces clavuligerus ATCC
27064]
gi|326441434|ref|ZP_08216168.1| hypothetical protein SclaA2_10233 [Streptomyces clavuligerus ATCC
27064]
gi|197705996|gb|EDY51808.1| hypothetical protein SSCG_04879 [Streptomyces clavuligerus ATCC
27064]
Length = 126
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 6/108 (5%)
Query: 130 EFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPG---SVILLNKCMMLKVQSVCHYDS 186
E L +++P+ I G +T+E R PG ++I ++ + ++ +V Y+S
Sbjct: 15 EHSLTIRKPYLDLIASGAKTIEVRVGYPKIRRFAPGDILTIICADESVTTRITAVREYES 74
Query: 187 FSEMLEAESLVKV-LPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
F+ M++AE + PG+ T ++ V R Y EKE + GV A+ ++
Sbjct: 75 FAAMVDAEDAAAIGGPGM-THDQLVAAIRSIYPPEKE-ALGVFALHLA 120
>gi|357400832|ref|YP_004912757.1| hypothetical protein SCAT_3248 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|386356891|ref|YP_006055137.1| hypothetical protein SCATT_32440 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|337767241|emb|CCB75952.1| conserved protein of unknown function [Streptomyces cattleya NRRL
8057 = DSM 46488]
gi|365807399|gb|AEW95615.1| hypothetical protein SCATT_32440 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
Length = 111
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 130 EFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL---NKCMMLKVQSVCHYDS 186
E +++++P+F IK+G++TVE R I G V++ ++ +V V Y S
Sbjct: 3 ERRINIRKPYFDLIKNGVKTVEVRVGYPGMRKIAAGQVLVFRSGDESCRTEVVKVVEYPS 62
Query: 187 FSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMAT 237
F M +AE + + G E + R Y EKE + GVLAI + + T
Sbjct: 63 FEAMADAEDIA-AIGGDMGRGELLAACRDIYPPEKE-ALGVLAIHLRRPTT 111
>gi|348683950|gb|EGZ23765.1| hypothetical protein PHYSODRAFT_311003 [Phytophthora sojae]
Length = 391
Score = 43.9 bits (102), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 31/135 (22%), Positives = 62/135 (45%), Gaps = 10/135 (7%)
Query: 107 EEEWSQLILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGS 166
E+ W Q N G+ L +++ + E + I+ G +++E R ++ Y+ I
Sbjct: 188 EKPWEQS--NVGAFHQEATLPPADYQFALPEKDQALIEQGAKSLEIRLNVTPYSIIRAND 245
Query: 167 VILLNKCMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGV--------QIYRKFYT 218
I +N + V +V Y +LEAE++ +LP + G + YR F++
Sbjct: 246 RITINGKTLTAVVAVRKYSRLQSVLEAENVNALLPQSSFVGAGTFNAAAAAERHYRHFFS 305
Query: 219 EEKERSNGVLAICIS 233
+E+E G++ ++
Sbjct: 306 KEEEEQYGLVVFQLA 320
>gi|282890018|ref|ZP_06298552.1| hypothetical protein pah_c009o041 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|338174415|ref|YP_004651225.1| hypothetical protein PUV_04210 [Parachlamydia acanthamoebae UV-7]
gi|281500110|gb|EFB42395.1| hypothetical protein pah_c009o041 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|336478773|emb|CCB85371.1| putative uncharacterized protein [Parachlamydia acanthamoebae UV-7]
Length = 227
Score = 43.5 bits (101), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 49/103 (47%), Gaps = 11/103 (10%)
Query: 139 FFSFIKDGLETVEGRCTISDYNSIGPGSVILLN-------KCMMLKVQSVCHYDSFSEML 191
+ I+ G +TVEGR + G I +C++ V + Y SF EML
Sbjct: 64 YLDLIRSGQKTVEGRINSGAFKCARIGDRITFYCRSSAPVQCLITDVNT---YPSFKEML 120
Query: 192 EAESLVKVLPGVKTIEEGVQIYRKFYT-EEKERSNGVLAICIS 233
E L LP V + +G++IY K +E+ +GV+A+ +S
Sbjct: 121 VKEGLKACLPDVTDLSKGIEIYEKIPGYKERASQHGVIALKVS 163
>gi|301114703|ref|XP_002999121.1| hypothetical protein PITG_05482 [Phytophthora infestans T30-4]
gi|262111215|gb|EEY69267.1| hypothetical protein PITG_05482 [Phytophthora infestans T30-4]
Length = 387
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 35/141 (24%), Positives = 65/141 (46%), Gaps = 10/141 (7%)
Query: 107 EEEWSQLILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGS 166
E+ W QL + G L + +++ + E I+ G +++E R + Y+ I
Sbjct: 187 EKPWEQLRV--GVPGLDAAIPPPDYQFALPERDHQLIEQGSKSLEIRLNDAPYSIIHVND 244
Query: 167 VILLNKCMMLKVQSVCHYDSFSEMLEAESLVKVLP-----GVKTIEEGV---QIYRKFYT 218
I +N + V S+ Y S +LEAE++ +LP G +T + YR+F++
Sbjct: 245 RITINGKTLTTVASIRKYSSLQSVLEAENVSALLPQSSFVGAETFNAAAAAERHYRQFFS 304
Query: 219 EEKERSNGVLAICISKMATQP 239
E+E G++ ++ A P
Sbjct: 305 VEEEEHYGLIVFQLAVSAPGP 325
>gi|256829429|ref|YP_003158157.1| N-acetyltransferase GCN5 [Desulfomicrobium baculatum DSM 4028]
gi|256578605|gb|ACU89741.1| GCN5-related N-acetyltransferase [Desulfomicrobium baculatum DSM
4028]
Length = 444
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 46/94 (48%), Gaps = 3/94 (3%)
Query: 133 LHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL---NKCMMLKVQSVCHYDSFSE 189
+ +++ F FIK G +T+E R + +I G I+ ++ ++ V+ + Y +FSE
Sbjct: 338 IRLKKNFLEFIKRGKKTLEVRVGYDNIKTIKSGESIVFMSRDERLVRTVKEIRIYKTFSE 397
Query: 190 MLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKER 223
ML E+ + P EE ++ Y KER
Sbjct: 398 MLNHENYKLIAPNSNNREETEKLLSDIYPPHKER 431
>gi|323452594|gb|EGB08467.1| hypothetical protein AURANDRAFT_63756 [Aureococcus anophagefferens]
Length = 1175
Score = 42.0 bits (97), Expect = 0.34, Method: Composition-based stats.
Identities = 29/108 (26%), Positives = 49/108 (45%), Gaps = 11/108 (10%)
Query: 134 HVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL-----NKCMMLKVQSVCHYDSFS 188
++Q + I G +T EGR + I+ + + + + V +D+F
Sbjct: 510 YIQPKYADDIMSGAKTWEGRACTKWLEKLSVDDWIMFKISGSSDLIYARAREVVKFDTFE 569
Query: 189 EMLEAESLVKVLPGVKTIEEGVQIYRKF------YTEEKERSNGVLAI 230
EML+ L LPG ++ +GV+IYR F + E +GV+AI
Sbjct: 570 EMLDYCGLDACLPGCPSVADGVKIYRSFGCFDGRTYADVEAESGVVAI 617
>gi|323454833|gb|EGB10702.1| hypothetical protein AURANDRAFT_17437, partial [Aureococcus
anophagefferens]
Length = 114
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 42/94 (44%), Gaps = 18/94 (19%)
Query: 135 VQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQS------------VC 182
+Q + IK G +T EGR G + ++ C+ KV + V
Sbjct: 1 LQPKYADLIKSGAKTWEGR------AYAGWLEHVSVDDCITFKVTTRGAERVIARALDVR 54
Query: 183 HYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKF 216
+D+F +ML+ L LPG + EEG +IYR F
Sbjct: 55 RFDTFEDMLDHCGLDACLPGCASREEGAKIYRSF 88
>gi|239618457|ref|YP_002941779.1| hypothetical protein Kole_2098 [Kosmotoga olearia TBF 19.5.1]
gi|239507288|gb|ACR80775.1| protein of unknown function DUF437 [Kosmotoga olearia TBF 19.5.1]
Length = 114
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 46/103 (44%), Gaps = 7/103 (6%)
Query: 137 EPFFSFIKDGLETVEGRCTISDYNSIGPGSVIL------LNKCMMLKVQSVCHYDSFSEM 190
EPF IK + +E R I G I+ LN+ + +KV + HY++F ++
Sbjct: 10 EPF-ELIKTRKKVIEVRLNDEKRQKIKIGDTIVFSKLPDLNEKLRVKVVGLLHYETFEQL 68
Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
KT+E ++ K YT E+ER GVL I I
Sbjct: 69 YRDIPFKYFGREGKTLEWMLESTYKIYTREQERKYGVLGIRIE 111
>gi|340376712|ref|XP_003386876.1| PREDICTED: hypothetical protein LOC100635148 [Amphimedon
queenslandica]
Length = 287
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 243 LASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFM 284
L L LG G+ LLG+ T G++ D LPP R LLSSF+
Sbjct: 141 LTETLQTLGMRGVMDLLGIRQTVGSV-DVLPPSRETLLSSFV 181
>gi|298705875|emb|CBJ29020.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 420
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 23/41 (56%)
Query: 247 LSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPF 287
+S +G GI+ LLGL T G+I LPP R LL + P
Sbjct: 240 ISSIGQAGIRDLLGLKRTVGSIDGCLPPSRRVLLKAINKPH 280
>gi|342210775|ref|ZP_08703527.1| hypothetical protein GIG_02286 [Mycoplasma anatis 1340]
gi|341578896|gb|EGS29237.1| hypothetical protein GIG_02286 [Mycoplasma anatis 1340]
Length = 385
Score = 37.4 bits (85), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 54/108 (50%), Gaps = 13/108 (12%)
Query: 132 ELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL-----NK-CMMLKVQSVCHYD 185
+L +QE F I++G +T+E R +++ G V+ L NK ++++ + Y
Sbjct: 4 KLFIQEKFLKKIREGKKTIEMRLNDPRRHNLKEGMVLKLENIDDNKDIQLIEITKINKYK 63
Query: 186 SFSEMLEAESLVKVLPGVKTIEEGV---QIYRKFYTEEKERSNGVLAI 230
+F E+ + E +L G EE +Y FY+ EK+ GV+AI
Sbjct: 64 NFDELYKCED--PILLGYDCAEEANPNDMLY--FYSREKQDEWGVIAI 107
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.396
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,417,666,378
Number of Sequences: 23463169
Number of extensions: 173163378
Number of successful extensions: 419673
Number of sequences better than 100.0: 87
Number of HSP's better than 100.0 without gapping: 69
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 419525
Number of HSP's gapped (non-prelim): 99
length of query: 293
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 152
effective length of database: 9,050,888,538
effective search space: 1375735057776
effective search space used: 1375735057776
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)