BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 022686
         (293 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224093009|ref|XP_002309781.1| predicted protein [Populus trichocarpa]
 gi|222852684|gb|EEE90231.1| predicted protein [Populus trichocarpa]
          Length = 380

 Score =  363 bits (931), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 176/291 (60%), Positives = 228/291 (78%), Gaps = 3/291 (1%)

Query: 1   MERLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDED 60
           ME+ ++ +EELLKFTL SHI++TL+F+LG+S  FC NLL++DPND +  ST     +  D
Sbjct: 1   MEKQRERIEELLKFTLESHINQTLEFNLGLSKDFCINLLEEDPNDMLCHSTPTP--TPTD 58

Query: 61  SLQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSS 120
           S  GV LYPLYKRLA AL RSV  GA CRTY+K+   +++  L+QKEE W QLI  KG  
Sbjct: 59  SFDGVALYPLYKRLASALYRSVKSGAVCRTYEKMVFGDKDSNLKQKEENWDQLIKEKGL- 117

Query: 121 DLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQS 180
           +L+++L+ +  E+HVQEP+FS +KDG +T+EGRC   DY  I PG +IL+NK ++LKV+ 
Sbjct: 118 ELINVLEGISCEIHVQEPYFSLLKDGRKTIEGRCATGDYIRIEPGDLILVNKIVVLKVED 177

Query: 181 VCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPY 240
           V  Y SFS+ML+AE+L KVLPGVKT+EEGV+IYRKFYTEEKE SNGVLAIC+SK+A QPY
Sbjct: 178 VRRYASFSKMLQAENLEKVLPGVKTVEEGVKIYRKFYTEEKEMSNGVLAICVSKLAAQPY 237

Query: 241 VTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
           ++LASIL GL Y G++SLLGL+ T GT+S+ALPPPRS LLSSF+ P+ PN+
Sbjct: 238 LSLASILFGLSYGGVRSLLGLADTGGTVSNALPPPRSTLLSSFIFPYNPNI 288


>gi|255586386|ref|XP_002533840.1| conserved hypothetical protein [Ricinus communis]
 gi|223526219|gb|EEF28542.1| conserved hypothetical protein [Ricinus communis]
          Length = 370

 Score =  360 bits (925), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 174/291 (59%), Positives = 231/291 (79%), Gaps = 5/291 (1%)

Query: 1   MERLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDED 60
           ME+L++ +EE++K+TL+SHI++TL FDL +S +FC+NLL+ DPND VS       +S   
Sbjct: 1   MEQLRNRIEEIVKYTLNSHINQTLGFDLSLSKEFCSNLLRADPNDTVS----LPPNSTSG 56

Query: 61  SLQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSS 120
           S +GVP YPL++RL  AL + +   +FC+TY  +  +NE+  L+QKEE+W++LIL KGS 
Sbjct: 57  SFEGVPEYPLFRRLGSALYQCIISRSFCKTYDTIEFINEDNSLKQKEEQWNKLILEKGS- 115

Query: 121 DLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQS 180
           +L+++L A   ELHVQEPFFS +KDGL+T+EGRC   +Y+ I PG+++L+NK ++L+V+ 
Sbjct: 116 ELMNVLMATFHELHVQEPFFSLLKDGLKTIEGRCADDNYSRIEPGALLLINKSVVLEVKD 175

Query: 181 VCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPY 240
           V  Y SF +MLEAESL KVLPGVKTIEEGV++YRKFYTEEKE SNGVLAIC+SK   QPY
Sbjct: 176 VRRYPSFLKMLEAESLSKVLPGVKTIEEGVEVYRKFYTEEKEMSNGVLAICVSKSPYQPY 235

Query: 241 VTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
           ++LA +LSGLGY GIQSLLG++HT GTISDALPP RS LLSSF LP++PNV
Sbjct: 236 LSLARMLSGLGYTGIQSLLGIAHTVGTISDALPPSRSTLLSSFTLPYRPNV 286


>gi|359495929|ref|XP_002272151.2| PREDICTED: uncharacterized protein LOC100242314 [Vitis vinifera]
 gi|296081341|emb|CBI17687.3| unnamed protein product [Vitis vinifera]
          Length = 398

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 175/289 (60%), Positives = 222/289 (76%), Gaps = 11/289 (3%)

Query: 4   LKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDP-NDAVSPSTSFDFDSDEDSL 62
           L +C+EEL+K+TL S ++ TL+ DLG+S  +C+ LL+DD   D  S ST        DS 
Sbjct: 19  LANCVEELVKYTLYSSVNGTLEIDLGLSKDYCSALLKDDHLTDPTSIST--------DSF 70

Query: 63  QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSSDL 122
           +GVP YPLYKRL+ AL RS+  GAF   Y  +AL++E+  L+QKEE W++L+++KG  +L
Sbjct: 71  EGVPPYPLYKRLSAALYRSIISGAFWEIYSTMALIHEDSSLKQKEE-WNKLVVDKGL-EL 128

Query: 123 VDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVC 182
           V+ILK ++FELHVQEPFFS +KDGL+ +EGRC + DYN IG G++IL NKC++L+VQ V 
Sbjct: 129 VNILKTIDFELHVQEPFFSQLKDGLKIIEGRCAVGDYNRIGSGALILFNKCLVLEVQDVR 188

Query: 183 HYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVT 242
            Y SFS++LE+E L +VLPGVKTIEEGVQIYRKFYT+EKERSNGVLAIC++K A QPY+ 
Sbjct: 189 RYASFSQLLESEGLAEVLPGVKTIEEGVQIYRKFYTKEKERSNGVLAICVAKPAAQPYIF 248

Query: 243 LASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
           LA IL GL Y G+Q LLG  HT GTI +ALPPPRS LLSSFM P KPNV
Sbjct: 249 LAYILFGLSYGGVQRLLGFMHTVGTIPEALPPPRSTLLSSFMSPHKPNV 297


>gi|363808278|ref|NP_001242240.1| uncharacterized protein LOC100807259 [Glycine max]
 gi|255644803|gb|ACU22903.1| unknown [Glycine max]
          Length = 400

 Score =  285 bits (730), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 149/290 (51%), Positives = 201/290 (69%), Gaps = 11/290 (3%)

Query: 3   RLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
           +L DC+EEL++FTL+S+        L +SS+F +NLL+DD   A  PS+S      +DSL
Sbjct: 20  KLCDCLEELVRFTLNSN-----SHHLNLSSQFFSNLLKDD---ATHPSSSHSLSQPDDSL 71

Query: 63  QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNE--ECFLEQKEEEWSQLILNKGSS 120
           +GVP YPLYKR + AL + ++   FCRT   +A+ +E  +  ++QK  EW +LI+ KG  
Sbjct: 72  EGVPPYPLYKRFSSALLKCMDSETFCRTGANLAMTDEFEDSSMQQKRNEWHRLIVEKGF- 130

Query: 121 DLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQS 180
           ++ +ILK+V FE HVQEPFFS + DGL+T+EGRC    YN I  G++IL NK ++ +VQ 
Sbjct: 131 EIENILKSVSFEFHVQEPFFSQLNDGLKTIEGRCATGKYNRIKSGNLILFNKSVVFEVQG 190

Query: 181 VCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPY 240
           V  Y +F  MLEAESL K LPGV++ EEGV++Y++F TEEKE++NGVLAI +SK   QPY
Sbjct: 191 VRRYPTFFAMLEAESLGKGLPGVESSEEGVKVYQRFCTEEKEQANGVLAIVVSKFTPQPY 250

Query: 241 VTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPN 290
            +LA +   L Y G+QSLLGL HT GTI +ALPPP S LL+SF  P  PN
Sbjct: 251 DSLARLFCELSYEGVQSLLGLMHTTGTIPNALPPPISTLLASFNFPCNPN 300


>gi|238479547|ref|NP_001154572.1| RNA-binding ASCH domain protein [Arabidopsis thaliana]
 gi|330255179|gb|AEC10273.1| RNA-binding ASCH domain protein [Arabidopsis thaliana]
          Length = 388

 Score =  280 bits (715), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 149/299 (49%), Positives = 205/299 (68%), Gaps = 20/299 (6%)

Query: 1   MERLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDED 60
           M +++DC++E++KFTL    D  ++FD+G++ +FC+ LL          S   D +  E 
Sbjct: 1   MIKIRDCLDEMVKFTL----DYCVEFDIGLTGEFCSGLL-------CGESVLHDGERIES 49

Query: 61  S-------LQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQL 113
           S         GVP YPLYK LAL L +S++ G+FC T++K++L  E   L++KE+EWS+L
Sbjct: 50  SSYALLHRFSGVPDYPLYKVLALGLLKSIDSGSFCGTFEKISLGKEVIRLKEKEDEWSKL 109

Query: 114 ILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIG-PGSVILLNK 172
           I  KGS +LV+ LK V  EL VQEP FS +KDG++TVE RC   +Y+ I   GS++++NK
Sbjct: 110 INQKGS-ELVNALKDVFSELQVQEPLFSLMKDGIKTVEARCFEEEYDRIRRGGSMVMINK 168

Query: 173 CMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICI 232
           C+M +V  +  Y SF E+L+AES  KV PG KT+EEG+Q++RK Y  ++E  NGV+AI +
Sbjct: 169 CLMFEVLELHQYASFYELLKAESSEKVFPGTKTVEEGMQMFRKLYDTDQENFNGVVAIHL 228

Query: 233 SKMATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
           SK   QP V LA ILSGL Y G+Q+LLGLSHT G+I  ALPPPRS LLSSFMLP+KP +
Sbjct: 229 SKSVAQPCVALAHILSGLSYTGVQNLLGLSHTTGSIFHALPPPRSMLLSSFMLPYKPKI 287


>gi|297828804|ref|XP_002882284.1| hypothetical protein ARALYDRAFT_477581 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328124|gb|EFH58543.1| hypothetical protein ARALYDRAFT_477581 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 240

 Score =  221 bits (563), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 123/262 (46%), Positives = 162/262 (61%), Gaps = 46/262 (17%)

Query: 4   LKDCMEELLKFTLSSHIDETL-DFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
           L++CME LL+FTLSSH+DE++  FDL ++  FC +LL++  +    P+            
Sbjct: 14  LRECMESLLRFTLSSHLDESVPSFDLDLTRDFCLHLLEEATDSTEKPA------------ 61

Query: 63  QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKE-----EEWSQLILNK 117
                  +YK LA AL                     EC   + +     E++S+L    
Sbjct: 62  -------VYKLLARALS--------------------ECLASEGDNNPNLEKYSKLFHGL 94

Query: 118 GSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLK 177
           G  DL+++LK V FELHVQEP+F+ +KDGL+TVEGRC + DY  I  G+ IL NKC++L+
Sbjct: 95  G-HDLINMLKKVNFELHVQEPYFTQLKDGLKTVEGRCAVGDYMRISSGAFILFNKCLLLE 153

Query: 178 VQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMAT 237
           VQ V HY SFSEML  E L KVLPGV++IEEGVQ+YR FY+EEKER NGVLAI ++K A 
Sbjct: 154 VQDVHHYTSFSEMLIMEGLAKVLPGVESIEEGVQVYRNFYSEEKERMNGVLAIHVAKPAN 213

Query: 238 QPYVTLASILSGLGYVGIQSLL 259
           QPY  LA +LS L   GI+SLL
Sbjct: 214 QPYAALAGVLSELKSTGIKSLL 235


>gi|357156930|ref|XP_003577624.1| PREDICTED: uncharacterized protein LOC100838225 [Brachypodium
           distachyon]
          Length = 382

 Score =  211 bits (536), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 121/288 (42%), Positives = 173/288 (60%), Gaps = 26/288 (9%)

Query: 4   LKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSLQ 63
           L   + ELL+F LSSH+    D  L +S  +C+ LL DD  + ++   +   +  +    
Sbjct: 20  LSSALGELLRFVLSSHLTAP-DPALPLSPSYCSRLLDDDLCEKLAAELAGCIEEGQ---- 74

Query: 64  GVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSSDLV 123
            +P  P            V  GAF        +  EE    +++ EW  ++L KG+ +L 
Sbjct: 75  -LPEPP------------VGSGAF-------RIPAEEDGPRERDREWEAVLLEKGA-ELK 113

Query: 124 DILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCH 183
            +   VEF LHVQEP+F+ +  G + VEGR    +YN I  GS +L NKC++L+V++V  
Sbjct: 114 RMYDGVEFVLHVQEPYFTQLSAGTKNVEGRLAAGNYNRITQGSWLLFNKCLLLEVEAVRK 173

Query: 184 YDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVTL 243
           Y SF EML+ E +  VLPG+ +IE+GV++YRKFYTEEKE S+GVLAI +SK A QPY T+
Sbjct: 174 YSSFLEMLQEEMISNVLPGILSIEDGVKVYRKFYTEEKENSSGVLAISVSKPARQPYETM 233

Query: 244 ASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
             +L+ LGY G+  LLGL++TAGT+ D +PPPRS L+SS M   +P V
Sbjct: 234 TGLLARLGYDGLGRLLGLANTAGTVPDGVPPPRSVLISSCMKLHQPTV 281


>gi|21536491|gb|AAM60823.1| unknown [Arabidopsis thaliana]
          Length = 244

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 124/272 (45%), Positives = 166/272 (61%), Gaps = 47/272 (17%)

Query: 4   LKDCMEELLKFTLSSHIDETL-DFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
           L++CME LL+F+L SH++E++  FDL ++  FC +LL +              DS E S 
Sbjct: 14  LRECMESLLRFSLRSHLNESVPSFDLDLTRDFCLHLLGEAT------------DSTEKS- 60

Query: 63  QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKE-----EEWSQLILNK 117
                  +YK LA AL                     EC   + +     E++S+LI   
Sbjct: 61  ------AVYKLLATALS--------------------ECLASEGDKNSNLEKYSKLIHGL 94

Query: 118 GSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLK 177
           G  DL+++LK V FELHVQEP+F+ +KDGL+TVEGRC + DY  I  G+ +L NKC++L+
Sbjct: 95  G-YDLINMLKEVNFELHVQEPYFTQLKDGLKTVEGRCAVGDYMRISSGAFLLFNKCLLLE 153

Query: 178 VQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMAT 237
           VQ V  Y SFSEML+ E L KVLPGV++IEEGVQ+YR FY+EEKER NGV+AI ++K A 
Sbjct: 154 VQDVHRYTSFSEMLKVEGLSKVLPGVESIEEGVQVYRNFYSEEKERMNGVVAIRVAKPAN 213

Query: 238 QPYVTLASILSGLGYVGIQSLLGLSHTAGTIS 269
           QP   LA +LS L   GI+SLL   +TAG  S
Sbjct: 214 QPSAALAGVLSELKSSGIKSLLD-EYTAGVTS 244


>gi|18396513|ref|NP_566202.1| RNA-binding ASCH domain protein [Arabidopsis thaliana]
 gi|6017115|gb|AAF01598.1|AC009895_19 unknown protein [Arabidopsis thaliana]
 gi|22655129|gb|AAM98155.1| expressed protein [Arabidopsis thaliana]
 gi|32189307|gb|AAP75808.1| At3g03320 [Arabidopsis thaliana]
 gi|332640408|gb|AEE73929.1| RNA-binding ASCH domain protein [Arabidopsis thaliana]
          Length = 244

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 124/272 (45%), Positives = 165/272 (60%), Gaps = 47/272 (17%)

Query: 4   LKDCMEELLKFTLSSHIDETL-DFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
           L++CME LL+F+L SH++E++  FDL ++  FC +LL +              DS E S 
Sbjct: 14  LRECMESLLRFSLRSHLNESVPSFDLDLTRDFCLHLLGEAT------------DSTEKS- 60

Query: 63  QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKE-----EEWSQLILNK 117
                  +YK LA AL                     EC   + +     E++S+LI   
Sbjct: 61  ------AVYKLLATALS--------------------ECLASEGDKNSNLEKYSKLIHGL 94

Query: 118 GSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLK 177
           G  DL+++LK V FELHVQEP+F+ +KDGL+TVEGRC + DY  I  G  +L NKC++L+
Sbjct: 95  G-YDLINMLKEVNFELHVQEPYFTQLKDGLKTVEGRCAVGDYMRISSGDFLLFNKCLLLE 153

Query: 178 VQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMAT 237
           VQ V  Y SFSEML+ E L KVLPGV++IEEGVQ+YR FY+EEKER NGV+AI ++K A 
Sbjct: 154 VQDVHRYTSFSEMLKVEGLAKVLPGVESIEEGVQVYRNFYSEEKERMNGVVAIRVAKPAN 213

Query: 238 QPYVTLASILSGLGYVGIQSLLGLSHTAGTIS 269
           QP   LA +LS L   GI+SLL   +TAG  S
Sbjct: 214 QPSAALAGVLSELKSSGIKSLLD-EYTAGVTS 244


>gi|2289012|gb|AAB64341.1| hypothetical protein [Arabidopsis thaliana]
          Length = 483

 Score =  204 bits (518), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 123/294 (41%), Positives = 165/294 (56%), Gaps = 74/294 (25%)

Query: 5   KDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDS--- 61
           +DC++E++KFTL    D  ++FD+G++ +FC+ LL          S   D +  E S   
Sbjct: 171 EDCLDEMVKFTL----DYCVEFDIGLTGEFCSGLL-------CGESVLHDGERIESSSYA 219

Query: 62  ----LQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNK 117
                 GVP YPLYK LAL L +S++ G+FC T++K++L  E   L++KE+EWS+LI  K
Sbjct: 220 LLHRFSGVPDYPLYKVLALGLLKSIDSGSFCGTFEKISLGKEVIRLKEKEDEWSKLINQK 279

Query: 118 GSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLK 177
           GS +LV+       ELH                                           
Sbjct: 280 GS-ELVN-------ELH------------------------------------------- 288

Query: 178 VQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMAT 237
                 Y SF E+L+AES  KV PG KT+EEG+Q++RK Y  ++E  NGV+AI +SK   
Sbjct: 289 -----QYASFYELLKAESSEKVFPGTKTVEEGMQMFRKLYDTDQENFNGVVAIHLSKSVA 343

Query: 238 QPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
           QP V LA ILSGL Y G+Q+LLGLSHT G+I  ALPPPRS LLSSFMLP+KP +
Sbjct: 344 QPCVALAHILSGLSYTGVQNLLGLSHTTGSIFHALPPPRSMLLSSFMLPYKPKI 397


>gi|293331561|ref|NP_001169685.1| uncharacterized protein LOC100383566 [Zea mays]
 gi|224030879|gb|ACN34515.1| unknown [Zea mays]
 gi|414873857|tpg|DAA52414.1| TPA: hypothetical protein ZEAMMB73_867603 [Zea mays]
 gi|414873858|tpg|DAA52415.1| TPA: hypothetical protein ZEAMMB73_867603 [Zea mays]
          Length = 283

 Score =  203 bits (516), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 123/289 (42%), Positives = 173/289 (59%), Gaps = 36/289 (12%)

Query: 7   CMEELLKFTLSSHI-----DETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDS 61
           C+ +LL+F LSSH      D+T+ F L  S  +C  LL D                    
Sbjct: 25  CIGDLLRFVLSSHAAAYPGDDTVAFPL--SPSYCARLLNDGE------------------ 64

Query: 62  LQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSSD 121
                   L+++L   + + +  G        V +  EE   E  EE   +L+L +  ++
Sbjct: 65  --------LFEKLEACIQQCLEEGRLPGPPAVVGIPAEE---EGPEERGWKLLLPEKGAE 113

Query: 122 LVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSV 181
           L  +  AVEFELHVQEP+F+ ++ G++ VEGR    +YN I  GS++L NKC++L V++V
Sbjct: 114 LKRMYDAVEFELHVQEPYFTQLRAGVKKVEGRLATGNYNRITQGSLLLFNKCLLLNVEAV 173

Query: 182 CHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYV 241
             Y+SFSEML+ E +  VLPG+ +I EGV++YRKFY EEKE S GVLAI +SK  +QPY+
Sbjct: 174 RKYNSFSEMLKGEKISNVLPGISSIVEGVKVYRKFYAEEKENSYGVLAISVSKPTSQPYI 233

Query: 242 TLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPN 290
           T+ +IL+GLGY G+  LLG++ T GT+ D LPPPRSALLSS M   +PN
Sbjct: 234 TMNNILAGLGYDGLGRLLGMAKTTGTVPDGLPPPRSALLSSCMGLVQPN 282


>gi|449468876|ref|XP_004152147.1| PREDICTED: uncharacterized LOC101221961 [Cucumis sativus]
          Length = 238

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 119/251 (47%), Positives = 153/251 (60%), Gaps = 41/251 (16%)

Query: 4   LKDCMEELLKFTLSSHIDETLDFD--LGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDS 61
           L+DC+EELLKFTL SHI+ TLD D  LG S+ F ++LL  + ND   P  S         
Sbjct: 15  LRDCLEELLKFTLQSHINGTLDIDHDLGFSTDFSSHLL--NHNDC--PDVS--------- 61

Query: 62  LQGVPLYPLYKRLALALCRSVN---CGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKG 118
                   LYK L   L +SV+   CG+                L+  E+E     + +G
Sbjct: 62  -------RLYKDLVSTLLKSVSKASCGS----------------LDDFEDEEESNEIAEG 98

Query: 119 SSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKV 178
            ++LV++LK V FELHVQEPFF+ +KDGL+ VEGRC   +YN I  G++IL NKC++ +V
Sbjct: 99  RAELVNVLKTVNFELHVQEPFFTQLKDGLKRVEGRCAAGNYNRIQSGALILFNKCLLFEV 158

Query: 179 QSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQ 238
           Q V  Y SF  ML+AESL  VLPGVKT+ +GVQIYR FY+EEKE SNGVL I + K   Q
Sbjct: 159 QDVRQYPSFYAMLKAESLDNVLPGVKTLTDGVQIYRNFYSEEKELSNGVLGIHVKKSVAQ 218

Query: 239 PYVTLASILSG 249
           PY+ LA I+S 
Sbjct: 219 PYIILARIISA 229


>gi|326510525|dbj|BAJ87479.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 385

 Score =  201 bits (512), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 125/291 (42%), Positives = 169/291 (58%), Gaps = 32/291 (10%)

Query: 4   LKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSLQ 63
           L   + ELL+  LSSH+    D  L +S  +C+ LL+DD                     
Sbjct: 23  LSSAVGELLRLVLSSHV-AAPDPALPLSRSYCSRLLEDD--------------------- 60

Query: 64  GVPLYPLYKRLALALCRSVNCGAFCRT---YKKVALMNEECFLEQKEEEWSQLILNKGSS 120
                 L  +LA  L      G   R       V    EE    ++E EW  ++  KG+ 
Sbjct: 61  ------LCDKLAAELAGCAEEGRIPRAPVVAGAVGTPAEENDSRKREGEWEAILREKGA- 113

Query: 121 DLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQS 180
           +L  I  AVEF LHVQEP+F+ +  G + VEGR    +YN I  GS++L NKC++L+V++
Sbjct: 114 ELKRIYDAVEFVLHVQEPYFTQLSAGSKNVEGRLAAGNYNRITQGSLLLFNKCLLLEVEA 173

Query: 181 VCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPY 240
           V  Y SFSEML+AE++  VLP + +IEEGV++YR FYTEEKE S GVLAI +SK   QPY
Sbjct: 174 VRKYSSFSEMLQAETISNVLPDISSIEEGVKVYRNFYTEEKENSYGVLAISVSKPQIQPY 233

Query: 241 VTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
           +T+  +LSGLGY G+  LLGL++T+GT+ D LPPP+S L+SS M   KP V
Sbjct: 234 ITMTELLSGLGYDGLGRLLGLANTSGTVPDGLPPPKSVLISSCMKLHKPTV 284


>gi|297828047|ref|XP_002881906.1| hypothetical protein ARALYDRAFT_322019 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327745|gb|EFH58165.1| hypothetical protein ARALYDRAFT_322019 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 495

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 122/298 (40%), Positives = 166/298 (55%), Gaps = 74/298 (24%)

Query: 1   MERLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDED 60
           +++L+DC+EE++KFTL    D  ++FDL ++  FC  LL          S   D +  E 
Sbjct: 164 VKKLRDCLEEMVKFTL----DCRVEFDLELTDDFCFGLL-------CGESILLDGERIES 212

Query: 61  S-------LQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQL 113
           S         GVP YPLYK LAL L +S++ G+ C T++ ++L  E  +L+++E+EWS+L
Sbjct: 213 SSHALLHRFGGVPDYPLYKLLALGLLKSIDSGSVCGTFENISLGKEVIWLKEREDEWSKL 272

Query: 114 ILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKC 173
           I  KG S+LV+       E+H Q PF                                  
Sbjct: 273 INQKG-SELVN-------EVH-QYPF---------------------------------- 289

Query: 174 MMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
                        F E+L+ ES  KV PG KT+EEG+Q++RK Y  ++E SNGV+AI +S
Sbjct: 290 -------------FYELLKGESSEKVFPGTKTVEEGMQMFRKLYDMDQENSNGVVAINLS 336

Query: 234 KMATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
           K   QP V LA ILSGL Y G+Q+LLGLSHT G+I  ALPPPRS LLSSFMLP+KP +
Sbjct: 337 KSVVQPCVALAHILSGLSYTGVQNLLGLSHTTGSIFHALPPPRSMLLSSFMLPYKPKI 394


>gi|62733968|gb|AAX96077.1| ProFAR isomerase associated, putative [Oryza sativa Japonica Group]
 gi|77549703|gb|ABA92500.1| ProFAR isomerase associated family protein, expressed [Oryza sativa
           Japonica Group]
          Length = 385

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 96/179 (53%), Positives = 130/179 (72%), Gaps = 1/179 (0%)

Query: 113 LILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNK 172
           ++L KG+ +L  +  AVEFELHVQEP+F+ ++ G + VEGR    +YN I  GS++L NK
Sbjct: 107 VVLEKGN-ELKRMYDAVEFELHVQEPYFTQLRAGTKKVEGRLAAGNYNRIAEGSLLLFNK 165

Query: 173 CMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICI 232
           C++L VQ++  Y SFSEML+AE + +VLPG+ +IE+GV +YRKFYTEEKE   GVLAI +
Sbjct: 166 CLLLNVQAIRKYASFSEMLQAEMISEVLPGISSIEQGVGVYRKFYTEEKESFYGVLAISV 225

Query: 233 SKMATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
           SK   QPY+ +  +L+GLG  G+  LLG+  TAGT+ D LPPPRS L+SS M   +PNV
Sbjct: 226 SKPTAQPYIIMTELLAGLGSDGLGRLLGMVKTAGTVQDGLPPPRSVLISSCMKLHQPNV 284


>gi|222615810|gb|EEE51942.1| hypothetical protein OsJ_33576 [Oryza sativa Japonica Group]
          Length = 405

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 96/179 (53%), Positives = 130/179 (72%), Gaps = 1/179 (0%)

Query: 113 LILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNK 172
           ++L KG+ +L  +  AVEFELHVQEP+F+ ++ G + VEGR    +YN I  GS++L NK
Sbjct: 107 VVLEKGN-ELKRMYDAVEFELHVQEPYFTQLRAGTKKVEGRLAAGNYNRIAEGSLLLFNK 165

Query: 173 CMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICI 232
           C++L VQ++  Y SFSEML+AE + +VLPG+ +IE+GV +YRKFYTEEKE   GVLAI +
Sbjct: 166 CLLLNVQAIRKYASFSEMLQAEMISEVLPGISSIEQGVGVYRKFYTEEKESFYGVLAISV 225

Query: 233 SKMATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
           SK   QPY+ +  +L+GLG  G+  LLG+  TAGT+ D LPPPRS L+SS M   +PNV
Sbjct: 226 SKPTAQPYIIMTELLAGLGSDGLGRLLGMVKTAGTVQDGLPPPRSVLISSCMKLHQPNV 284


>gi|218185550|gb|EEC67977.1| hypothetical protein OsI_35738 [Oryza sativa Indica Group]
          Length = 405

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 96/179 (53%), Positives = 130/179 (72%), Gaps = 1/179 (0%)

Query: 113 LILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNK 172
           ++L KG+ +L  +  AVEFELHVQEP+F+ ++ G + VEGR    +YN I  GS++L NK
Sbjct: 107 VVLEKGN-ELKMMYDAVEFELHVQEPYFTQLRAGTKKVEGRLAAGNYNRIAEGSLLLFNK 165

Query: 173 CMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICI 232
           C++L VQ++  Y SFSEML+AE + +VLPG+ +IE+GV +YRKFYTEEKE   GVLAI +
Sbjct: 166 CLLLNVQAIRKYASFSEMLQAEMISEVLPGISSIEQGVGVYRKFYTEEKESFYGVLAISV 225

Query: 233 SKMATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
           SK   QPY+ +  +L+GLG  G+  LLG+  TAGT+ D LPPPRS L+SS M   +PNV
Sbjct: 226 SKPTAQPYIIMTELLAGLGSDGLGRLLGMVKTAGTVQDGLPPPRSVLISSCMKLHQPNV 284


>gi|449484877|ref|XP_004157005.1| PREDICTED: uncharacterized LOC101221961 [Cucumis sativus]
          Length = 879

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 83/150 (55%), Positives = 108/150 (72%), Gaps = 4/150 (2%)

Query: 105 QKEEEWSQLILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGP 164
           + EEE +++   +G ++LV++LK V FELHVQEPFF+ +KDGL+ VEGRC   +YN I  
Sbjct: 728 EDEEESNEIA--EGRAELVNVLKTVNFELHVQEPFFTQLKDGLKRVEGRCAAGNYNRIQS 785

Query: 165 GSVILLNKCMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERS 224
           G++IL NKC++ +VQ V  Y SF  ML+AESL  VLP VKT+ +GVQIYR FY+EEKE S
Sbjct: 786 GALILFNKCLLFEVQDVRQYPSFYAMLKAESLDNVLPEVKTLTDGVQIYRNFYSEEKELS 845

Query: 225 NGVLAICISKMATQPYVTLASILSGLGYVG 254
           NGVL I + K   QPY+ LA I+S   Y G
Sbjct: 846 NGVLGIHVKKSVAQPYIILARIIS--AYFG 873


>gi|147846119|emb|CAN83746.1| hypothetical protein VITISV_022668 [Vitis vinifera]
          Length = 796

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 85/149 (57%), Positives = 114/149 (76%), Gaps = 10/149 (6%)

Query: 60  DSLQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGS 119
           DS +GVP YPLYKRL+ AL RS+  GAF   Y  +AL++E+  L+QKEE W++L+++KG 
Sbjct: 18  DSFEGVPPYPLYKRLSAALYRSIISGAFWEIYSTMALIHEDSSLKQKEE-WNKLVVDKGL 76

Query: 120 SDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQ 179
            +LV+ILK ++FELHVQEPFFS +KDGL+ +EGR        IG G++IL NKC++L+VQ
Sbjct: 77  -ELVNILKTIDFELHVQEPFFSQLKDGLKIIEGR--------IGSGALILFNKCLVLEVQ 127

Query: 180 SVCHYDSFSEMLEAESLVKVLPGVKTIEE 208
            V  Y SFS++LE+E L +VLPGVKTIEE
Sbjct: 128 DVRRYASFSQLLESEGLAEVLPGVKTIEE 156


>gi|224155241|ref|XP_002337582.1| predicted protein [Populus trichocarpa]
 gi|222839619|gb|EEE77942.1| predicted protein [Populus trichocarpa]
          Length = 141

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 77/144 (53%), Positives = 105/144 (72%), Gaps = 3/144 (2%)

Query: 1   MERLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDED 60
           ME+ ++ +EELLKFTL SHI++TL+F+LG+S  FC NLL++DPND +  ST     +  D
Sbjct: 1   MEKQRERIEELLKFTLESHINQTLEFNLGLSKDFCINLLEEDPNDMLCHSTPTP--TPTD 58

Query: 61  SLQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSS 120
           S  GV LYPLYKRLA AL RSV  GA CRTY+K+   +++  L+QKEE W QLI  KG  
Sbjct: 59  SFDGVALYPLYKRLASALYRSVKSGAVCRTYEKMVFGDKDSNLKQKEENWDQLIKEKG-L 117

Query: 121 DLVDILKAVEFELHVQEPFFSFIK 144
           +L+++L+ +  E+HVQEP+FS +K
Sbjct: 118 ELINVLEGISCEIHVQEPYFSLLK 141


>gi|449516555|ref|XP_004165312.1| PREDICTED: uncharacterized protein LOC101232651 [Cucumis sativus]
          Length = 107

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 62/102 (60%), Positives = 86/102 (84%)

Query: 190 MLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVTLASILSG 249
           MLE+E+L  +LPGV++I+EG+QI +    E++E ++ VLA+CIS +  QPY++LA+I+SG
Sbjct: 1   MLESENLQSILPGVESIDEGLQILKSLNREDEEMADSVLALCISSVPFQPYISLAAIISG 60

Query: 250 LGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNV 291
           L Y G+Q LLGL+HTAGT++DALPPPRSALLSSF+LP+KP V
Sbjct: 61  LSYEGLQGLLGLAHTAGTVADALPPPRSALLSSFVLPYKPEV 102


>gi|302811653|ref|XP_002987515.1| hypothetical protein SELMODRAFT_126335 [Selaginella moellendorffii]
 gi|300144669|gb|EFJ11351.1| hypothetical protein SELMODRAFT_126335 [Selaginella moellendorffii]
          Length = 266

 Score =  134 bits (336), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 72/159 (45%), Positives = 101/159 (63%), Gaps = 3/159 (1%)

Query: 126 LKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYD 185
              V F LHVQ+ F S ++DG + VEGRC    Y+  G  +++L+  C+   +Q V  Y+
Sbjct: 3   FHGVRFRLHVQDRFLSQLRDGSKIVEGRCATGFYSRCGS-NILLMFICLYPLLQDVRAYE 61

Query: 186 SFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVTLAS 245
           SF  MLE E L  VLPG++++EEG+ IYR FY+ E E + GVLA+ +S+   QP   + +
Sbjct: 62  SFQNMLETEGLENVLPGIESVEEGIGIYRGFYSRETE-ALGVLALHVSQ-PIQPRNAVTA 119

Query: 246 ILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFM 284
           +L  L   G+ SLLGL  T GT+  ALPPPRS LL+SF+
Sbjct: 120 LLQSLKVEGVASLLGLRATKGTLVSALPPPRSRLLASFL 158


>gi|297728215|ref|NP_001176471.1| Os11g0265500 [Oryza sativa Japonica Group]
 gi|255679980|dbj|BAH95199.1| Os11g0265500, partial [Oryza sativa Japonica Group]
          Length = 138

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 65/125 (52%), Positives = 89/125 (71%)

Query: 166 SVILLNKCMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSN 225
           S+I+ +   ++  Q++  Y SFSEML+AE + +VLPG+ +IE+GV +YRKFYTEEKE   
Sbjct: 13  SIIVESFVCLIFFQAIRKYASFSEMLQAEMISEVLPGISSIEQGVGVYRKFYTEEKESFY 72

Query: 226 GVLAICISKMATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFML 285
           GVLAI +SK   QPY+ +  +L+GLG  G+  LLG+  TAGT+ D LPPPRS L+SS M 
Sbjct: 73  GVLAISVSKPTAQPYIIMTELLAGLGSDGLGRLLGMVKTAGTVQDGLPPPRSVLISSCMK 132

Query: 286 PFKPN 290
             +PN
Sbjct: 133 LHQPN 137


>gi|223973711|gb|ACN31043.1| unknown [Zea mays]
          Length = 102

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 60/101 (59%), Positives = 77/101 (76%)

Query: 190 MLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVTLASILSG 249
           ML+ E +  VLPG+ +I EGV++YRKFY EEKE S GVLAI +SK  +QPY+T+ +IL+G
Sbjct: 1   MLKGEKISNVLPGIPSIVEGVKVYRKFYAEEKENSYGVLAISVSKPTSQPYITMNNILAG 60

Query: 250 LGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPN 290
           LGY G+  LLG++ T GT+ D LPPPRSALLSS M   +PN
Sbjct: 61  LGYDGLGRLLGMAKTTGTVPDGLPPPRSALLSSCMGLVQPN 101


>gi|449470818|ref|XP_004153113.1| PREDICTED: uncharacterized protein LOC101215405 [Cucumis sativus]
          Length = 139

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 66/142 (46%), Positives = 95/142 (66%), Gaps = 10/142 (7%)

Query: 3   RLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
           +L+DC++ELLKF L S  + T DFDLG+SS FC++L + DP    S S    +     S 
Sbjct: 4   KLEDCVQELLKFVLQSSTNGTPDFDLGLSSAFCSSLFKHDP----STSNPLPY-----SK 54

Query: 63  QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSSDL 122
            GVP YPLY+RL+LAL  S+  G FC  Y+K+ + N E  L+QKEE W +LI++KG S++
Sbjct: 55  AGVPPYPLYERLSLALWESLCSGTFCPMYEKMLMKNGESSLKQKEEMWLKLIMDKG-SEM 113

Query: 123 VDILKAVEFELHVQEPFFSFIK 144
           V +L+ +  EL++ EPFF+ +K
Sbjct: 114 VQMLRTLNLELYIDEPFFTQLK 135


>gi|449454590|ref|XP_004145037.1| PREDICTED: uncharacterized protein LOC101215607 [Cucumis sativus]
          Length = 116

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 57/117 (48%), Positives = 77/117 (65%), Gaps = 9/117 (7%)

Query: 3   RLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
           +L+DC++ELLKF L S  + T DFDLG+SS FC++L + DP    S S    +     S 
Sbjct: 4   KLEDCVQELLKFVLQSSTNGTPDFDLGLSSAFCSSLFKHDP----STSNPLPY-----SK 54

Query: 63  QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGS 119
            GVP YPLY+RL+LAL  S+  G FC  Y+K+ + N E  L+QKEE W +LI++KGS
Sbjct: 55  AGVPPYPLYERLSLALWESLCSGTFCPMYEKMLMKNGESSLKQKEEMWLKLIMDKGS 111


>gi|449516559|ref|XP_004165314.1| PREDICTED: uncharacterized LOC101215607 [Cucumis sativus]
          Length = 127

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/129 (45%), Positives = 82/129 (63%), Gaps = 10/129 (7%)

Query: 3   RLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDEDSL 62
           +L+DC++ELLKF L S  + T DFDLG+SS FC++L + DP    S S    +     S 
Sbjct: 4   KLEDCVQELLKFVLQSSTNGTPDFDLGLSSAFCSSLFKHDP----STSNPLPY-----SK 54

Query: 63  QGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSSDL 122
            GVP YPLY+RL+LAL  S+  G FC  Y+K+ + N E  L+QKEE W +LI++KG S++
Sbjct: 55  AGVPPYPLYERLSLALWESLCSGTFCPMYEKMLMKNGESSLKQKEEMWLKLIMDKG-SEM 113

Query: 123 VDILKAVEF 131
           V +     F
Sbjct: 114 VQVRSGCSF 122


>gi|242398993|ref|YP_002994417.1| ProFAR isomerase associated, [Thermococcus sibiricus MM 739]
 gi|242265386|gb|ACS90068.1| ProFAR isomerase associated, putative [Thermococcus sibiricus MM
           739]
          Length = 110

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 49/105 (46%), Positives = 70/105 (66%)

Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFS 188
           +E+E+ ++E +   I++G + VEGR        I PG VI+    + +KV+S+  Y SF 
Sbjct: 1   MEWEMGLKEEYLDLIRNGKKKVEGRLYDEKRRQINPGDVIIFEGKLKVKVKSIKKYPSFK 60

Query: 189 EMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
           EMLE E + +VLPGV  IEEGV+IYR+FYTEE+E+  GV+AI I 
Sbjct: 61  EMLEREGIERVLPGVHNIEEGVKIYRQFYTEEEEKKYGVVAIEIE 105


>gi|168043944|ref|XP_001774443.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674295|gb|EDQ60806.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 272

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 71/230 (30%), Positives = 107/230 (46%), Gaps = 57/230 (24%)

Query: 64  GVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQLILNKGSSDLV 123
           G+P+ PLY  LA AL R +    F R    +  + E+   + + + W++ ++        
Sbjct: 1   GMPVDPLYVHLAAALYRWICTRRFPRNMAPIEGIKEDESWKSRLDAWNETVVTH------ 54

Query: 124 DILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCH 183
                                                    GS +L         + V  
Sbjct: 55  -----------------------------------------GSALL---------EEVHR 64

Query: 184 YDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQ-PYVT 242
           Y +F  M+E E L+KVLPGV++I EGVQIYRKFY+  KE++ GVL I + + A + P   
Sbjct: 65  YPTFQIMIETEGLLKVLPGVESISEGVQIYRKFYSCSKEQTYGVLGIHVVRPAQRDPADL 124

Query: 243 LASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPFKPNVV 292
           L+ IL+ LG  G++ LLG+  T GT+ +ALPPP+S L  SF     P++ 
Sbjct: 125 LSDILNTLGIDGVRVLLGMRTTNGTLKEALPPPKSLLADSFSALQNPDIA 174


>gi|302812114|ref|XP_002987745.1| hypothetical protein SELMODRAFT_447090 [Selaginella moellendorffii]
 gi|300144637|gb|EFJ11320.1| hypothetical protein SELMODRAFT_447090 [Selaginella moellendorffii]
          Length = 215

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 52/109 (47%), Positives = 74/109 (67%), Gaps = 2/109 (1%)

Query: 176 LKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKM 235
           ++ + V  Y+SF +MLE E L  VLPG++++EEG+ IYR FY+ E E + GVLA+ +S+ 
Sbjct: 1   METKDVRAYESFQDMLETEGLENVLPGIESVEEGIGIYRGFYSRETE-ALGVLALHVSQ- 58

Query: 236 ATQPYVTLASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFM 284
             QP   + ++L  L   G+ SLLGL  T GT+  ALPPPRS LL+SF+
Sbjct: 59  PIQPRNAVTALLQSLKVEGVASLLGLRATKGTLVSALPPPRSRLLASFL 107


>gi|375082231|ref|ZP_09729299.1| hypothetical protein OCC_08375 [Thermococcus litoralis DSM 5473]
 gi|374743119|gb|EHR79489.1| hypothetical protein OCC_08375 [Thermococcus litoralis DSM 5473]
          Length = 118

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 46/101 (45%), Positives = 69/101 (68%)

Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFS 188
           +E+E+ +QE +   IK+G + +EGR        I PG VI+    + ++V+++  Y SF 
Sbjct: 1   MEWEMGLQEEYLKLIKEGKKKIEGRLYDEKRRQIKPGDVIIFEGRLKVRVKALRVYPSFK 60

Query: 189 EMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLA 229
           EMLE E L +VLP VK+I+EGV++YRKFY+EE+ER  GV+A
Sbjct: 61  EMLEKEGLDRVLPNVKSIDEGVKVYRKFYSEEEERKYGVVA 101


>gi|315231674|ref|YP_004072110.1| hypothetical protein TERMP_01912 [Thermococcus barophilus MP]
 gi|315184702|gb|ADT84887.1| hypothetical protein TERMP_01912 [Thermococcus barophilus MP]
          Length = 116

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 46/105 (43%), Positives = 69/105 (65%)

Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFS 188
           +++E+ +QE +   I +G + +EGR        I PG +I+    + +KV+ +  Y SF 
Sbjct: 1   MKWEMGLQEEYLKLIAEGKKKIEGRLYDEKRRQIKPGDIIIFEGKLKVKVKDIRVYPSFK 60

Query: 189 EMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
           EMLE E L  VLPG+K+IEEGV+IYR+FY+EE+E+  GV+AI I 
Sbjct: 61  EMLEKEGLENVLPGIKSIEEGVKIYRQFYSEEEEKKWGVVAIEIE 105


>gi|20093930|ref|NP_613777.1| fused ProFAR isomerase-like protein/uncharacterized protein
           [Methanopyrus kandleri AV19]
 gi|19886880|gb|AAM01707.1| Archaea-specific enzyme related to ProFAR isomerase (HisA) and
           containing an additional uncharacterized domain
           [Methanopyrus kandleri AV19]
          Length = 350

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 49/103 (47%), Positives = 66/103 (64%)

Query: 133 LHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEMLE 192
           + ++E + + IK+G +TVEGR        I PG  IL N+ +++KV  V  YDSF EML 
Sbjct: 239 MGLEEEYLNLIKEGKKTVEGRVKDDKRARIKPGDKILFNRRLLVKVIDVREYDSFEEMLR 298

Query: 193 AESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKM 235
            E L  VLP V +IEEGV+IYR+FY+  KE+  GVLAI I  +
Sbjct: 299 EEGLENVLPNVDSIEEGVEIYRRFYSSGKEKMFGVLAIEIEPI 341


>gi|14521811|ref|NP_127287.1| hypothetical protein PAB1060 [Pyrococcus abyssi GE5]
 gi|5459031|emb|CAB50517.1| Hypothetical protein PAB1060 [Pyrococcus abyssi GE5]
          Length = 111

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 47/106 (44%), Positives = 69/106 (65%), Gaps = 1/106 (0%)

Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSF 187
           +E+E+ +QE +   IK GL+ +EGR        I PG +I+     + ++V+++  Y SF
Sbjct: 1   MEWEMGLQEEYIELIKRGLKKIEGRLYDEKRRKIKPGDIIVFEGGKLKVRVKALRVYKSF 60

Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
            EMLE E +  VLPGV ++EEGV+IYRKFY EE+E+  GV+AI I 
Sbjct: 61  KEMLEKEGIENVLPGVNSVEEGVKIYRKFYDEEREKKYGVVAIEIE 106


>gi|18976827|ref|NP_578184.1| hypothetical protein PF0455 [Pyrococcus furiosus DSM 3638]
 gi|397650960|ref|YP_006491541.1| hypothetical protein PFC_01390 [Pyrococcus furiosus COM1]
 gi|18892426|gb|AAL80579.1| hypothetical protein PF0455 [Pyrococcus furiosus DSM 3638]
 gi|393188551|gb|AFN03249.1| hypothetical protein PFC_01390 [Pyrococcus furiosus COM1]
          Length = 110

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/106 (46%), Positives = 68/106 (64%), Gaps = 1/106 (0%)

Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSF 187
           +E+E+ +QE F   IK   + +EGR        I PG VI      + ++V+++  Y+SF
Sbjct: 1   MEWEMGLQEEFLELIKLRKKKIEGRLYDEKRRQIKPGDVISFEGGKLKVRVKAIRVYNSF 60

Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
            EMLE E L  VLPGVK+IEEG+Q+YR+FY EEKE+  GV+AI I 
Sbjct: 61  REMLEKEGLENVLPGVKSIEEGIQVYRRFYDEEKEKKYGVVAIEIE 106


>gi|157879999|pdb|1S04|A Chain A, Solution Nmr Structure Of Protein Pf0455 From Pyrococcus
           Furiosus. Northeast Structural Genomics Consortium
           Target Pfr13
          Length = 110

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/106 (46%), Positives = 68/106 (64%), Gaps = 1/106 (0%)

Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSF 187
           +E+E+ +QE F   IK   + +EGR        I PG VI      + ++V+++  Y+SF
Sbjct: 1   MEWEMGLQEEFLELIKLRKKKIEGRLYDEKRRQIKPGDVISFEGGKLKVRVKAIRVYNSF 60

Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
            EMLE E L  VLPGVK+IEEG+Q+YR+FY EEKE+  GV+AI I 
Sbjct: 61  REMLEKEGLENVLPGVKSIEEGIQVYRRFYDEEKEKKYGVVAIEIE 106


>gi|332158605|ref|YP_004423884.1| hypothetical protein PNA2_0964 [Pyrococcus sp. NA2]
 gi|331034068|gb|AEC51880.1| hypothetical protein PNA2_0964 [Pyrococcus sp. NA2]
          Length = 109

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 46/106 (43%), Positives = 68/106 (64%), Gaps = 1/106 (0%)

Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSF 187
           +E+E+ +QE +   IK G + +EGR        I PG +I+     + +KV+++  Y SF
Sbjct: 1   MEWEMGLQEEYIELIKSGKKKIEGRLYDEKRRQIKPGDIIIFEGGKLKVKVKAIRVYKSF 60

Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
            EMLE E +  VLPGVK +EEGV++YR+FY EE+E+  GV+AI I 
Sbjct: 61  KEMLEKEGIENVLPGVKDVEEGVKVYRQFYDEEREKKYGVVAIEIE 106


>gi|14590265|ref|NP_142331.1| hypothetical protein PH0355 [Pyrococcus horikoshii OT3]
 gi|160286344|pdb|2Z0T|A Chain A, Crystal Structure Of Hypothetical Protein Ph0355
 gi|160286345|pdb|2Z0T|B Chain B, Crystal Structure Of Hypothetical Protein Ph0355
 gi|160286346|pdb|2Z0T|C Chain C, Crystal Structure Of Hypothetical Protein Ph0355
 gi|160286347|pdb|2Z0T|D Chain D, Crystal Structure Of Hypothetical Protein Ph0355
 gi|3256746|dbj|BAA29429.1| 109aa long hypothetical protein [Pyrococcus horikoshii OT3]
          Length = 109

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 46/106 (43%), Positives = 68/106 (64%), Gaps = 1/106 (0%)

Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSF 187
           +++E+ +QE +   IK G + +EGR        I PG +I+     + +KV+ +  Y SF
Sbjct: 1   MKWEMGLQEEYIELIKAGKKKIEGRLYDEKRRQIKPGDIIIFEGGKLKVKVKGIRVYSSF 60

Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
            EMLE E +  VLPGVK+IEEGV++YR+FY EE+E+  GV+AI I 
Sbjct: 61  KEMLEKEGIENVLPGVKSIEEGVKVYRQFYDEEREKKYGVVAIEIE 106


>gi|409096441|ref|ZP_11216465.1| hypothetical protein TzilA_07282 [Thermococcus zilligii AN1]
          Length = 117

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 46/103 (44%), Positives = 63/103 (61%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           + + +QE +   I +G + +EGR        I PG  I+    +M  V+ V  Y SF EM
Sbjct: 4   WRMGLQEEYLRAIAEGRKKIEGRLYDEKRQGIKPGDTIIFEDRLMCVVKDVRVYSSFREM 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
           LE+E +  VLPGV  IEEGV++YRKFY+EEKER  GV+AI + 
Sbjct: 64  LESEGIENVLPGVGDIEEGVKVYRKFYSEEKERKYGVVAIEVE 106


>gi|212224839|ref|YP_002308075.1| hypothetical protein TON_1688 [Thermococcus onnurineus NA1]
 gi|212009796|gb|ACJ17178.1| conserved hypothetical protein [Thermococcus onnurineus NA1]
          Length = 116

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 46/105 (43%), Positives = 64/105 (60%)

Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFS 188
           V + + +QE +   I +G + VEGR        I PG  I+    +M+ V+ +  Y SF 
Sbjct: 2   VTWRMGLQEEYLKAIAEGRKKVEGRLYDEKRQGIKPGDTIIFENKLMVVVKDIRVYSSFR 61

Query: 189 EMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
           EML  E L  VLPGVK+IEEGV++YRKFY+E+KE+  GV AI + 
Sbjct: 62  EMLGKEGLENVLPGVKSIEEGVKVYRKFYSEDKEKKYGVAAIEVE 106


>gi|57641772|ref|YP_184250.1| hypothetical protein TK1837 [Thermococcus kodakarensis KOD1]
 gi|57160096|dbj|BAD86026.1| hypothetical protein, conserved, DUF1530 family [Thermococcus
           kodakarensis KOD1]
          Length = 117

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 47/106 (44%), Positives = 66/106 (62%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           +++ +QE +   I +G + +EGR        I PG  I+    ++  V+ V  Y SF EM
Sbjct: 4   WKMGLQEEYLRAIAEGKKKIEGRLYDEKRQKIKPGDEIVFENKLVCVVKDVRVYSSFREM 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMA 236
           LE E L  VLPGVK+IEEGV++YRKFY+EEKE+  GV AI +  +A
Sbjct: 64  LEKEGLENVLPGVKSIEEGVKVYRKFYSEEKEKKYGVAAIEVEPVA 109


>gi|240102242|ref|YP_002958550.1| hypothetical protein TGAM_0184 [Thermococcus gammatolerans EJ3]
 gi|239909795|gb|ACS32686.1| Conserved hypothetical protein [Thermococcus gammatolerans EJ3]
          Length = 116

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 46/106 (43%), Positives = 68/106 (64%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           +++ +QE +   I +G + +EGR       +I PG  I+    ++  V+ +  Y SF EM
Sbjct: 4   WKMGLQEEYLKAIAEGKKKIEGRLYDEKRQAIKPGDEIVFENKLVCVVKDLRVYSSFREM 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMA 236
           LE E L  VLPGVK+IEEGV++YRKFY+EEKE+  GV+AI +  +A
Sbjct: 64  LEKEGLENVLPGVKSIEEGVKVYRKFYSEEKEKKYGVVAIEVEPVA 109


>gi|380742440|tpe|CCE71074.1| TPA: hypothetical protein PAB1060 [Pyrococcus abyssi GE5]
          Length = 107

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 45/102 (44%), Positives = 65/102 (63%), Gaps = 1/102 (0%)

Query: 133 LHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSFSEML 191
           + +QE +   IK GL+ +EGR        I PG +I+     + ++V+++  Y SF EML
Sbjct: 1   MGLQEEYIELIKRGLKKIEGRLYDEKRRKIKPGDIIVFEGGKLKVRVKALRVYKSFKEML 60

Query: 192 EAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
           E E +  VLPGV ++EEGV+IYRKFY EE+E+  GV+AI I 
Sbjct: 61  EKEGIENVLPGVNSVEEGVKIYRKFYDEEREKKYGVVAIEIE 102


>gi|389852386|ref|YP_006354620.1| hypothetical protein Py04_0963 [Pyrococcus sp. ST04]
 gi|388249692|gb|AFK22545.1| hypothetical protein Py04_0963 [Pyrococcus sp. ST04]
          Length = 105

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 46/104 (44%), Positives = 65/104 (62%), Gaps = 1/104 (0%)

Query: 133 LHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSFSEML 191
           + +QE +   IK G + +EGR        I PG +I      + ++V+++  Y +F EML
Sbjct: 1   MGLQEEYLELIKAGKKKIEGRLYDEKRRQIKPGDIISFEGGRLKVRVKAIRVYKTFREML 60

Query: 192 EAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKM 235
           E E +  VLPGV++IEEGVQIYRKFY EE+E+  GV+AI I  M
Sbjct: 61  EKEGIENVLPGVESIEEGVQIYRKFYDEEREKKYGVVAIEIEPM 104


>gi|223477503|ref|YP_002581867.1| ProFAR isomerase associated superfamily protein [Thermococcus sp.
           AM4]
 gi|214032729|gb|EEB73558.1| ProFAR isomerase associated superfamily protein [Thermococcus sp.
           AM4]
          Length = 116

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 45/106 (42%), Positives = 67/106 (63%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           +++ +QE +   I +G + +EGR       +I PG  I+    ++  V+ +  Y SF EM
Sbjct: 4   WKMGLQEEYLRAIAEGKKKIEGRLYDEKRQAIKPGDEIVFENRLVCVVKDLRVYSSFREM 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMA 236
           LE E L  VLPGVK IEEGV++YR+FY+EEKE+  GV+AI +  +A
Sbjct: 64  LEKEGLENVLPGVKNIEEGVKVYRRFYSEEKEKKYGVVAIEVEPVA 109


>gi|341582462|ref|YP_004762954.1| hypothetical protein GQS_06895 [Thermococcus sp. 4557]
 gi|340810120|gb|AEK73277.1| hypothetical protein GQS_06895 [Thermococcus sp. 4557]
          Length = 116

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 65/106 (61%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           + + +QE +   I +G + +EGR        I PG  I+    ++  V+ V  Y SF EM
Sbjct: 4   WRMGLQEEYLKAIAEGRKRIEGRLYDEKRQGIKPGDEIIFENKLVCVVKDVRVYSSFREM 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMA 236
           LE E L  VLPGV+++E+GV++YR+FY+EEKER  GV AI +  +A
Sbjct: 64  LEKEGLENVLPGVESVEDGVRVYRRFYSEEKERKYGVAAIEVEPVA 109


>gi|337285338|ref|YP_004624812.1| hypothetical protein PYCH_18850 [Pyrococcus yayanosii CH1]
 gi|334901272|gb|AEH25540.1| hypothetical protein PYCH_18850 [Pyrococcus yayanosii CH1]
          Length = 112

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 67/106 (63%), Gaps = 1/106 (0%)

Query: 129 VEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMM-LKVQSVCHYDSF 187
           +E+E+ +QE +   I+ G + +EGR        I PG +I      + ++V+++  Y SF
Sbjct: 1   MEWEMGLQEEYIELIRTGKKKIEGRLYDEKRRQIKPGDIISFEGGKLKVRVKALRVYSSF 60

Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
            EMLE E L  VLPGV +IEEGV++YR+FY EE+E+  GV+AI + 
Sbjct: 61  REMLEKEGLENVLPGVDSIEEGVRVYRQFYDEEREKKYGVVAIEVE 106


>gi|390961172|ref|YP_006425006.1| hypothetical protein CL1_1009 [Thermococcus sp. CL1]
 gi|390519480|gb|AFL95212.1| hypothetical protein CL1_1009 [Thermococcus sp. CL1]
          Length = 110

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 64/104 (61%)

Query: 133 LHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEMLE 192
           + +QE +   I +G + +EGR       +I PG  I+    ++  V+ V  Y SF EMLE
Sbjct: 1   MGLQEEYLRAIAEGRKRIEGRLYDEKRQAIRPGDEIVFENKLVCVVKDVRVYSSFREMLE 60

Query: 193 AESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMA 236
            E +  VLPGV +IEEGV++YR+FY+EEKE+  GV AI +  +A
Sbjct: 61  KEGIENVLPGVGSIEEGVKVYRRFYSEEKEKKYGVAAIEVEPVA 104


>gi|326431332|gb|EGD76902.1| ProFAR isomerase associated superfamily [Salpingoeca sp. ATCC
           50818]
          Length = 128

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/109 (44%), Positives = 63/109 (57%), Gaps = 9/109 (8%)

Query: 138 PFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKC--------MMLKVQSVCHYDSFSE 189
           P+FSFI  G +TVEGR     Y  +G G  I   +C        + + V  V  Y S + 
Sbjct: 21  PWFSFIAQGKKTVEGRLASDKYTGLGVGCTIAF-RCEPTPDLPPVHVTVTRVVRYPSIAL 79

Query: 190 MLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQ 238
            LEAE L + LPG++T+ EGVQ+YR+FYTEE E+  GVLAI I +   Q
Sbjct: 80  YLEAEGLSRCLPGIETLAEGVQVYRQFYTEEDEQRLGVLAIHIQRATDQ 128


>gi|282890016|ref|ZP_06298550.1| hypothetical protein pah_c009o037 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|338174413|ref|YP_004651223.1| hypothetical protein PUV_04190 [Parachlamydia acanthamoebae UV-7]
 gi|281500108|gb|EFB42393.1| hypothetical protein pah_c009o037 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|336478771|emb|CCB85369.1| putative uncharacterized protein [Parachlamydia acanthamoebae UV-7]
          Length = 106

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 39/99 (39%), Positives = 62/99 (62%)

Query: 132 ELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEML 191
           +++V++ FF  IK G +TVEGR   S + ++ PG  + +N  +++ VQ V  Y  F +ML
Sbjct: 3   DINVEDQFFKLIKQGKKTVEGRLAKSKFLNLAPGDCLRINNQLIVFVQRVDCYPFFRDML 62

Query: 192 EAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
             E L  VLPG  +++EG  IY +FY+ E E+  GV+A+
Sbjct: 63  FHEGLKNVLPGCASLDEGENIYYRFYSREDEKKFGVIAV 101


>gi|406997767|gb|EKE15786.1| hypothetical protein ACD_11C00105G0011 [uncultured bacterium]
          Length = 111

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 64/104 (61%), Gaps = 1/104 (0%)

Query: 133 LHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLN-KCMMLKVQSVCHYDSFSEML 191
           ++VQEP++SFI  G +TVEGR     + SI  G ++ L  + + L V     Y +F EM+
Sbjct: 6   INVQEPYYSFILQGKKTVEGRLNKGKFASIQVGDILELEPEKIKLVVVEKNIYKNFREMI 65

Query: 192 EAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKM 235
           E E +  V+P    I+E V +Y KFYT+E+E+  GV+AI I K+
Sbjct: 66  EKEGIGNVIPDKNDIDEAVNVYYKFYTKEQEQEFGVVAIKIKKL 109


>gi|123494593|ref|XP_001326550.1| Archaea-specific enzyme related to ProFAR isomerase [Trichomonas
           vaginalis G3]
 gi|121909466|gb|EAY14327.1| Archaea-specific enzyme related to ProFAR isomerase, putative
           [Trichomonas vaginalis G3]
          Length = 111

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 61/109 (55%), Gaps = 9/109 (8%)

Query: 134 HVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL-------NKCMMLKVQSVCHYDS 186
           H+ EP+FS+++ G +T+EGR    +++ I PG +I          +   +KV  +  Y S
Sbjct: 5   HLSEPWFSYVEQGKKTIEGRLEREEWSKIKPGDIIEFYNSDTGSKRAFQVKVVDIKKYSS 64

Query: 187 FSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKM 235
           F  ++  E L  +LPGV ++EEG++IY K Y      ++ +L I + K+
Sbjct: 65  FKSLINTEGLNNILPGVTSLEEGIKIYEKIYGSTP--NHNILGIKLQKL 111


>gi|338176370|ref|YP_004653180.1| hypothetical protein PUV_23760 [Parachlamydia acanthamoebae UV-7]
 gi|336480728|emb|CCB87326.1| putative uncharacterized protein [Parachlamydia acanthamoebae UV-7]
          Length = 168

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 59/105 (56%), Gaps = 3/105 (2%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLN---KCMMLKVQSVCHYDSF 187
           F +H  +P+FS+I+ G++ VEGR +   Y  I  G  I  +   +  +  V  +  Y + 
Sbjct: 61  FNIHCDDPWFSYIRQGIKPVEGRKSTHTYKKIKVGDQIKFSNGKESFVANVTEIREYSTI 120

Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICI 232
            + LE  +L K LPGVK+IEEG+ IY ++  EEK R  G L + I
Sbjct: 121 EQYLEDVTLEKALPGVKSIEEGLNIYYEWTPEEKIRQYGFLGLFI 165


>gi|282892300|ref|ZP_06300696.1| hypothetical protein pah_c221o034 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|281497891|gb|EFB40242.1| hypothetical protein pah_c221o034 [Parachlamydia acanthamoebae str.
           Hall's coccus]
          Length = 135

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 59/106 (55%), Gaps = 3/106 (2%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLN---KCMMLKVQSVCHYDSF 187
           F +H  +P+FS+I+ G++ VEGR +   Y  I  G  I  +   +  +  V  +  Y + 
Sbjct: 28  FNIHCDDPWFSYIRQGIKPVEGRKSTHTYKKIKVGDQIKFSNGKESFVANVTEIREYSTI 87

Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
            + LE  +L K LPGVK+IEEG+ IY ++  EEK R  G L + I 
Sbjct: 88  EQYLEDVTLEKALPGVKSIEEGLNIYYEWTPEEKIRQYGFLGLFIK 133


>gi|224133258|ref|XP_002327999.1| predicted protein [Populus trichocarpa]
 gi|222837408|gb|EEE75787.1| predicted protein [Populus trichocarpa]
          Length = 66

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 50/78 (64%), Positives = 56/78 (71%), Gaps = 13/78 (16%)

Query: 190 MLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVTLASILSG 249
           ML AESL KVLPGVKT+E           EEKE SNGVLAIC+SK+A QPY++LASIL G
Sbjct: 1   MLHAESLEKVLPGVKTVE----------EEEKEVSNGVLAICVSKLAAQPYLSLASILFG 50

Query: 250 LGYVGIQSLLGLSHTAGT 267
           L Y G+QS LG   TAGT
Sbjct: 51  LSYGGVQSQLG---TAGT 65


>gi|297823775|ref|XP_002879770.1| hypothetical protein ARALYDRAFT_903127 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325609|gb|EFH56029.1| hypothetical protein ARALYDRAFT_903127 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 130

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 53/141 (37%), Positives = 78/141 (55%), Gaps = 25/141 (17%)

Query: 1   MERLKDCMEELLKFTLSSHIDETLDFDLGISSKFCTNLLQDDPNDAVSPSTSFDFDSDED 60
           M+RL+DC+EE++KFTL   ++    FDL ++  FC  LL          S   D +  E 
Sbjct: 1   MDRLRDCLEEMVKFTLDCRVE----FDLELTDDFCFGLL-------CGESILLDGERIES 49

Query: 61  S-------LQGVPLYPLYKRLALALCRSVNCGAFCRTYKKVALMNEECFLEQKEEEWSQL 113
           S         GVP YPLYK LAL L +S+  G+ C T++ ++L  E      +E+EWS+L
Sbjct: 50  SSHALLHRFGGVPDYPLYKLLALGLLKSIVSGSVCGTFENISLGKE------REDEWSKL 103

Query: 114 ILNKGSSDLVDILKAVEFELH 134
           I  KG S+LV++ ++  F L 
Sbjct: 104 INQKG-SELVNVRQSQSFILQ 123


>gi|57639990|ref|YP_182468.1| hypothetical protein TK0055 [Thermococcus kodakarensis KOD1]
 gi|57158314|dbj|BAD84244.1| hypothetical protein, conserved, DUF1530 family [Thermococcus
           kodakarensis KOD1]
          Length = 112

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 52/100 (52%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           + L V++ + +FI+ G + +E R     + +I PG  I+ N  +   V  V  Y++F ++
Sbjct: 4   YRLFVKDEYLNFIRSGEKRIEVRVAYPQFRNIRPGDKIIFNDSIPAVVTEVKKYETFRQV 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
           L  E + K+ P   + E  V+ +   Y + KE   GV+AI
Sbjct: 64  LREEPIKKIFPDEPSFERAVKRFHNLYPKWKENRYGVIAI 103


>gi|302847948|ref|XP_002955507.1| hypothetical protein VOLCADRAFT_66018 [Volvox carteri f.
           nagariensis]
 gi|300259130|gb|EFJ43360.1| hypothetical protein VOLCADRAFT_66018 [Volvox carteri f.
           nagariensis]
          Length = 165

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 55/118 (46%), Gaps = 19/118 (16%)

Query: 139 FFSFIKDGLETVEGRCTISDYNSIGPGSVILL------------------NKCMMLKVQS 180
           +   I+ G +TVEGR     +  + PG +                     +  +  +  +
Sbjct: 45  YLELIRSGTKTVEGRIRAGKWADVIPGDIFRFFSTQAPPSSTLPANTNTNSVSVRCRAIT 104

Query: 181 VCHYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKFYT-EEKERSNGVLAICISKMAT 237
           V  YDSF  MLE E L   LPGV+++EEGV++YR      E+E   GV+A+ +  + T
Sbjct: 105 VRQYDSFQAMLEGEGLAACLPGVQSLEEGVEVYRSIPGYREREAVEGVVAVGLELLLT 162


>gi|223478631|ref|YP_002583381.1| ProFAR isomerase associated superfamily protein [Thermococcus sp.
           AM4]
 gi|214033857|gb|EEB74683.1| ProFAR isomerase associated superfamily protein [Thermococcus sp.
           AM4]
          Length = 112

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 49/100 (49%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           + L V++ +  FIK G + +E R        I PG  I+ N  +   V  V  Y++F ++
Sbjct: 4   YRLFVRDEYLDFIKSGEKRIEVRVAYPQLRKIQPGDKIIFNDSVPAVVTEVKRYETFRQV 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
           L  E + K+ P   + E  V+ +   Y + KE   GV+AI
Sbjct: 64  LRQEPIKKIFPDEPSFERAVKRFHNLYPKWKENRYGVIAI 103


>gi|341583022|ref|YP_004763514.1| hypothetical protein GQS_09715 [Thermococcus sp. 4557]
 gi|340810680|gb|AEK73837.1| hypothetical protein GQS_09715 [Thermococcus sp. 4557]
          Length = 114

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 50/100 (50%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           + L V+E +  +IK G + +E R     +  + PG  I+ N  +   V  V  Y++F ++
Sbjct: 4   YRLRVREEYLDYIKSGEKRIEVRVAYPQFRGMKPGDKIIFNDQVPAVVTGVKEYETFRQV 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
           L  E + K+ P   + E  V+ +   Y + KE   GV+AI
Sbjct: 64  LREEPIKKIFPDEPSFERAVKRFHGMYPKWKENRYGVIAI 103


>gi|240103056|ref|YP_002959365.1| hypothetical protein TGAM_0999 [Thermococcus gammatolerans EJ3]
 gi|239910610|gb|ACS33501.1| Conserved hypothetical protein [Thermococcus gammatolerans EJ3]
          Length = 112

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 49/100 (49%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           + L V++ +  FIK G + +E R        I PG  ++ N  +   V  V  Y++F ++
Sbjct: 4   YRLFVRDEYLDFIKSGEKRIEVRVAYPQLRRIQPGDKLIFNDSIPAVVTEVKRYETFRQV 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
           L  E + K+ P   + E  V+ +   Y + KE   GV+AI
Sbjct: 64  LREEPIKKIFPDEPSFERAVKRFHNLYPKWKENRYGVIAI 103


>gi|212224321|ref|YP_002307557.1| hypothetical protein TON_1172 [Thermococcus onnurineus NA1]
 gi|212009278|gb|ACJ16660.1| hypothetical protein, conserved [Thermococcus onnurineus NA1]
          Length = 111

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 50/100 (50%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           + L V++ +  +IK G + +E R       +I PG  IL N  +   V  V  Y++F ++
Sbjct: 4   YRLRVRDEYLDYIKSGEKRIEVRVAYPQLRNIKPGDKILFNNEVPAVVTGVKRYETFRQV 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
           L  E + K+ P   + E  ++ +   Y + KE   GV+AI
Sbjct: 64  LREEQIKKIFPDEPSFERALKRFHNMYPKWKENRYGVIAI 103


>gi|409095591|ref|ZP_11215615.1| hypothetical protein TzilA_02940 [Thermococcus zilligii AN1]
          Length = 115

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 48/100 (48%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           + L V E +  FI+ G + +E R        I PG  I+ N  +   V  V  Y++F ++
Sbjct: 4   YHLKVHEEYLEFIRSGEKRIEVRVAYPQLRGIKPGDRIIFNDSIPAIVTEVKRYETFRQV 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
           L  E + K+ P   + E  ++ +   Y + KE   GV+AI
Sbjct: 64  LREEPIEKIFPDEPSFERALKRFHGMYPKWKENRYGVMAI 103


>gi|390961690|ref|YP_006425524.1| hypothetical protein CL1_1531 [Thermococcus sp. CL1]
 gi|390519998|gb|AFL95730.1| hypothetical protein CL1_1531 [Thermococcus sp. CL1]
          Length = 112

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 48/100 (48%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           + L V++ +  FIK G + +E R        I PG  ++ N  +   V  V  Y++F ++
Sbjct: 4   YHLKVRDEYLDFIKSGEKRIEVRVAYPQLRKIQPGDKLIFNDSIPAVVTEVKRYETFRQV 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
           L  E + K+ P   + E  V+ +   Y + KE   GV+ I
Sbjct: 64  LREEPIKKIFPDEPSFERAVKRFHNLYPKWKENRYGVIVI 103


>gi|332158720|ref|YP_004423999.1| hypothetical protein PNA2_1079 [Pyrococcus sp. NA2]
 gi|331034183|gb|AEC51995.1| hypothetical protein PNA2_1079 [Pyrococcus sp. NA2]
          Length = 116

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 26/100 (26%), Positives = 51/100 (51%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           + L++++ +   IK G + +E R     +  +  G  I+ N  +  +V  V HY++F ++
Sbjct: 4   YRLYLRDEYLEMIKSGKKRIEVRVAYPQFKGMKRGDKIIFNDSIPAEVIEVKHYETFRQV 63

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
           L  E + K+ P   + E  ++ +   Y + KE   GV+AI
Sbjct: 64  LREEPIDKIFPDEPSFERALRRFHNMYPKWKENRYGVIAI 103


>gi|156936814|ref|YP_001434610.1| ProFAR isomerase family protein [Ignicoccus hospitalis KIN4/I]
 gi|156565798|gb|ABU81203.1| ProFAR isomerase family protein [Ignicoccus hospitalis KIN4/I]
          Length = 236

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 51/94 (54%), Gaps = 8/94 (8%)

Query: 132 ELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL-----NKCMMLKVQSVCHYDS 186
            + ++E +F  I  G + VEGR        +  G VI        K +  KV+ +  Y +
Sbjct: 134 RMKLEERWFELIASGEKVVEGRVYDEKRKRLRVGHVIQFKSVQSGKLLYAKVKKLVMYKN 193

Query: 187 FSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEE 220
           F EMLE+E   +VLPG+ ++EEG+++Y K+Y  E
Sbjct: 194 FKEMLESED--RVLPGL-SVEEGLKVYEKYYGPE 224


>gi|14590361|ref|NP_142427.1| hypothetical protein PH0447 [Pyrococcus horikoshii OT3]
 gi|3256850|dbj|BAA29533.1| 114aa long hypothetical protein [Pyrococcus horikoshii OT3]
          Length = 114

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/100 (25%), Positives = 50/100 (50%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEM 190
           + L++++ +   +K G + +E R        +  G  I+ N  +  +V  V HY++F ++
Sbjct: 5   YRLYLRDEYLEMVKSGKKRIEVRVAYPQLKGMKRGDKIIFNDEVPAEVIEVKHYETFRQV 64

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAI 230
           L  E + K+ P   + E  ++ +   Y + KE   GV+AI
Sbjct: 65  LREEPIDKIFPDEPSFERALRRFHNLYPKWKENRYGVIAI 104


>gi|224133328|ref|XP_002328015.1| predicted protein [Populus trichocarpa]
 gi|222837424|gb|EEE75803.1| predicted protein [Populus trichocarpa]
          Length = 76

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/43 (58%), Positives = 32/43 (74%), Gaps = 3/43 (6%)

Query: 206 IEEGVQIYRKFYTEEKERSNGVLAICISKMATQPYVTLASILS 248
           + EG ++   FYTEEKERSNGVLAIC +K A   Y++LA+I S
Sbjct: 36  LNEGTEL---FYTEEKERSNGVLAICAAKSAAPSYISLATISS 75


>gi|389852077|ref|YP_006354311.1| hypothetical protein Py04_0634 [Pyrococcus sp. ST04]
 gi|388249383|gb|AFK22236.1| hypothetical protein Py04_0634 [Pyrococcus sp. ST04]
          Length = 99

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 43/88 (48%)

Query: 143 IKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQSVCHYDSFSEMLEAESLVKVLPG 202
           IK G + +E R        +  G  I+ N  +  +V  V HY++F ++L  E + K+ P 
Sbjct: 2   IKSGKKRIEVRVAYPQLKGMKRGDKIIFNDSIPAEVIEVKHYETFRQVLREEPIEKIFPD 61

Query: 203 VKTIEEGVQIYRKFYTEEKERSNGVLAI 230
             + E  ++ +   Y + KE   GVLAI
Sbjct: 62  EPSFERALRRFHNMYPKWKEYRYGVLAI 89


>gi|338731888|ref|YP_004663007.1| hypothetical protein SNE_B25120 [Simkania negevensis Z]
 gi|336481271|emb|CCB87871.1| putative uncharacterized protein [Simkania negevensis Z]
          Length = 175

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/98 (25%), Positives = 51/98 (52%), Gaps = 12/98 (12%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKC----MMLKVQSVCHYDS 186
           + L+++  +  +I DG + +EGR  + D+  +  G ++          +  + SV  YDS
Sbjct: 42  YRLNIETQYLLYIIDGKKIIEGRLNVPDFGDMKKGDLVYFTDGNGGQAICSITSVGRYDS 101

Query: 187 FSEMLEAESLVKVLPGV--------KTIEEGVQIYRKF 216
           F++ML +E ++ +LP +        + + +G +IYR F
Sbjct: 102 FNKMLVSEGVINMLPQIDPNTNSSEEMLLKGTKIYRSF 139


>gi|256396042|ref|YP_003117606.1| hypothetical protein Caci_6932 [Catenulispora acidiphila DSM 44928]
 gi|256362268|gb|ACU75765.1| protein of unknown function DUF437 [Catenulispora acidiphila DSM
           44928]
          Length = 119

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 54/105 (51%), Gaps = 5/105 (4%)

Query: 131 FELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL---NKCMMLKVQSVCHYDSF 187
           +E+++   +F  +  G +T+E R       S+  G  I        ++ +V+ V  Y SF
Sbjct: 14  YEMNLYRRYFDMVASGRKTIEVRVHYPKLRSLTTGDHIRFIYDTDSVLTRVKRVARYTSF 73

Query: 188 SEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICI 232
            +ML+AE   +V P     ++ V + R+ Y+ EKE + GVLAI I
Sbjct: 74  EQMLDAEDAARVNPDSPHEQQLVDM-RRIYSPEKE-ALGVLAIEI 116


>gi|294813019|ref|ZP_06771662.1| GCN5-related N-acetyltransferase [Streptomyces clavuligerus ATCC
           27064]
 gi|294325618|gb|EFG07261.1| GCN5-related N-acetyltransferase [Streptomyces clavuligerus ATCC
           27064]
          Length = 121

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 6/108 (5%)

Query: 130 EFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPG---SVILLNKCMMLKVQSVCHYDS 186
           E  L +++P+   I  G +T+E R          PG   ++I  ++ +  ++ +V  Y+S
Sbjct: 10  EHSLTIRKPYLDLIASGAKTIEVRVGYPKIRRFAPGDILTIICADESVTTRITAVREYES 69

Query: 187 FSEMLEAESLVKV-LPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
           F+ M++AE    +  PG+ T ++ V   R  Y  EKE + GV A+ ++
Sbjct: 70  FAAMVDAEDAAAIGGPGM-THDQLVAAIRSIYPPEKE-ALGVFALHLA 115


>gi|254392325|ref|ZP_05007509.1| hypothetical protein SSCG_04879 [Streptomyces clavuligerus ATCC
           27064]
 gi|326441434|ref|ZP_08216168.1| hypothetical protein SclaA2_10233 [Streptomyces clavuligerus ATCC
           27064]
 gi|197705996|gb|EDY51808.1| hypothetical protein SSCG_04879 [Streptomyces clavuligerus ATCC
           27064]
          Length = 126

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 6/108 (5%)

Query: 130 EFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPG---SVILLNKCMMLKVQSVCHYDS 186
           E  L +++P+   I  G +T+E R          PG   ++I  ++ +  ++ +V  Y+S
Sbjct: 15  EHSLTIRKPYLDLIASGAKTIEVRVGYPKIRRFAPGDILTIICADESVTTRITAVREYES 74

Query: 187 FSEMLEAESLVKV-LPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
           F+ M++AE    +  PG+ T ++ V   R  Y  EKE + GV A+ ++
Sbjct: 75  FAAMVDAEDAAAIGGPGM-THDQLVAAIRSIYPPEKE-ALGVFALHLA 120


>gi|357400832|ref|YP_004912757.1| hypothetical protein SCAT_3248 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
 gi|386356891|ref|YP_006055137.1| hypothetical protein SCATT_32440 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
 gi|337767241|emb|CCB75952.1| conserved protein of unknown function [Streptomyces cattleya NRRL
           8057 = DSM 46488]
 gi|365807399|gb|AEW95615.1| hypothetical protein SCATT_32440 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
          Length = 111

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 55/111 (49%), Gaps = 5/111 (4%)

Query: 130 EFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL---NKCMMLKVQSVCHYDS 186
           E  +++++P+F  IK+G++TVE R        I  G V++    ++    +V  V  Y S
Sbjct: 3   ERRINIRKPYFDLIKNGVKTVEVRVGYPGMRKIAAGQVLVFRSGDESCRTEVVKVVEYPS 62

Query: 187 FSEMLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICISKMAT 237
           F  M +AE +   + G     E +   R  Y  EKE + GVLAI + +  T
Sbjct: 63  FEAMADAEDIA-AIGGDMGRGELLAACRDIYPPEKE-ALGVLAIHLRRPTT 111


>gi|348683950|gb|EGZ23765.1| hypothetical protein PHYSODRAFT_311003 [Phytophthora sojae]
          Length = 391

 Score = 43.9 bits (102), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 31/135 (22%), Positives = 62/135 (45%), Gaps = 10/135 (7%)

Query: 107 EEEWSQLILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGS 166
           E+ W Q   N G+      L   +++  + E   + I+ G +++E R  ++ Y+ I    
Sbjct: 188 EKPWEQS--NVGAFHQEATLPPADYQFALPEKDQALIEQGAKSLEIRLNVTPYSIIRAND 245

Query: 167 VILLNKCMMLKVQSVCHYDSFSEMLEAESLVKVLPGVKTIEEGV--------QIYRKFYT 218
            I +N   +  V +V  Y     +LEAE++  +LP    +  G         + YR F++
Sbjct: 246 RITINGKTLTAVVAVRKYSRLQSVLEAENVNALLPQSSFVGAGTFNAAAAAERHYRHFFS 305

Query: 219 EEKERSNGVLAICIS 233
           +E+E   G++   ++
Sbjct: 306 KEEEEQYGLVVFQLA 320


>gi|282890018|ref|ZP_06298552.1| hypothetical protein pah_c009o041 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|338174415|ref|YP_004651225.1| hypothetical protein PUV_04210 [Parachlamydia acanthamoebae UV-7]
 gi|281500110|gb|EFB42395.1| hypothetical protein pah_c009o041 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|336478773|emb|CCB85371.1| putative uncharacterized protein [Parachlamydia acanthamoebae UV-7]
          Length = 227

 Score = 43.5 bits (101), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 49/103 (47%), Gaps = 11/103 (10%)

Query: 139 FFSFIKDGLETVEGRCTISDYNSIGPGSVILLN-------KCMMLKVQSVCHYDSFSEML 191
           +   I+ G +TVEGR     +     G  I          +C++  V +   Y SF EML
Sbjct: 64  YLDLIRSGQKTVEGRINSGAFKCARIGDRITFYCRSSAPVQCLITDVNT---YPSFKEML 120

Query: 192 EAESLVKVLPGVKTIEEGVQIYRKFYT-EEKERSNGVLAICIS 233
             E L   LP V  + +G++IY K    +E+   +GV+A+ +S
Sbjct: 121 VKEGLKACLPDVTDLSKGIEIYEKIPGYKERASQHGVIALKVS 163


>gi|301114703|ref|XP_002999121.1| hypothetical protein PITG_05482 [Phytophthora infestans T30-4]
 gi|262111215|gb|EEY69267.1| hypothetical protein PITG_05482 [Phytophthora infestans T30-4]
          Length = 387

 Score = 43.5 bits (101), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 35/141 (24%), Positives = 65/141 (46%), Gaps = 10/141 (7%)

Query: 107 EEEWSQLILNKGSSDLVDILKAVEFELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGS 166
           E+ W QL +  G   L   +   +++  + E     I+ G +++E R   + Y+ I    
Sbjct: 187 EKPWEQLRV--GVPGLDAAIPPPDYQFALPERDHQLIEQGSKSLEIRLNDAPYSIIHVND 244

Query: 167 VILLNKCMMLKVQSVCHYDSFSEMLEAESLVKVLP-----GVKTIEEGV---QIYRKFYT 218
            I +N   +  V S+  Y S   +LEAE++  +LP     G +T        + YR+F++
Sbjct: 245 RITINGKTLTTVASIRKYSSLQSVLEAENVSALLPQSSFVGAETFNAAAAAERHYRQFFS 304

Query: 219 EEKERSNGVLAICISKMATQP 239
            E+E   G++   ++  A  P
Sbjct: 305 VEEEEHYGLIVFQLAVSAPGP 325


>gi|256829429|ref|YP_003158157.1| N-acetyltransferase GCN5 [Desulfomicrobium baculatum DSM 4028]
 gi|256578605|gb|ACU89741.1| GCN5-related N-acetyltransferase [Desulfomicrobium baculatum DSM
           4028]
          Length = 444

 Score = 42.0 bits (97), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 26/94 (27%), Positives = 46/94 (48%), Gaps = 3/94 (3%)

Query: 133 LHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL---NKCMMLKVQSVCHYDSFSE 189
           + +++ F  FIK G +T+E R    +  +I  G  I+    ++ ++  V+ +  Y +FSE
Sbjct: 338 IRLKKNFLEFIKRGKKTLEVRVGYDNIKTIKSGESIVFMSRDERLVRTVKEIRIYKTFSE 397

Query: 190 MLEAESLVKVLPGVKTIEEGVQIYRKFYTEEKER 223
           ML  E+   + P     EE  ++    Y   KER
Sbjct: 398 MLNHENYKLIAPNSNNREETEKLLSDIYPPHKER 431


>gi|323452594|gb|EGB08467.1| hypothetical protein AURANDRAFT_63756 [Aureococcus anophagefferens]
          Length = 1175

 Score = 42.0 bits (97), Expect = 0.34,   Method: Composition-based stats.
 Identities = 29/108 (26%), Positives = 49/108 (45%), Gaps = 11/108 (10%)

Query: 134 HVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL-----NKCMMLKVQSVCHYDSFS 188
           ++Q  +   I  G +T EGR        +     I+      +  +  + + V  +D+F 
Sbjct: 510 YIQPKYADDIMSGAKTWEGRACTKWLEKLSVDDWIMFKISGSSDLIYARAREVVKFDTFE 569

Query: 189 EMLEAESLVKVLPGVKTIEEGVQIYRKF------YTEEKERSNGVLAI 230
           EML+   L   LPG  ++ +GV+IYR F         + E  +GV+AI
Sbjct: 570 EMLDYCGLDACLPGCPSVADGVKIYRSFGCFDGRTYADVEAESGVVAI 617


>gi|323454833|gb|EGB10702.1| hypothetical protein AURANDRAFT_17437, partial [Aureococcus
           anophagefferens]
          Length = 114

 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 42/94 (44%), Gaps = 18/94 (19%)

Query: 135 VQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILLNKCMMLKVQS------------VC 182
           +Q  +   IK G +T EGR         G    + ++ C+  KV +            V 
Sbjct: 1   LQPKYADLIKSGAKTWEGR------AYAGWLEHVSVDDCITFKVTTRGAERVIARALDVR 54

Query: 183 HYDSFSEMLEAESLVKVLPGVKTIEEGVQIYRKF 216
            +D+F +ML+   L   LPG  + EEG +IYR F
Sbjct: 55  RFDTFEDMLDHCGLDACLPGCASREEGAKIYRSF 88


>gi|239618457|ref|YP_002941779.1| hypothetical protein Kole_2098 [Kosmotoga olearia TBF 19.5.1]
 gi|239507288|gb|ACR80775.1| protein of unknown function DUF437 [Kosmotoga olearia TBF 19.5.1]
          Length = 114

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 46/103 (44%), Gaps = 7/103 (6%)

Query: 137 EPFFSFIKDGLETVEGRCTISDYNSIGPGSVIL------LNKCMMLKVQSVCHYDSFSEM 190
           EPF   IK   + +E R        I  G  I+      LN+ + +KV  + HY++F ++
Sbjct: 10  EPF-ELIKTRKKVIEVRLNDEKRQKIKIGDTIVFSKLPDLNEKLRVKVVGLLHYETFEQL 68

Query: 191 LEAESLVKVLPGVKTIEEGVQIYRKFYTEEKERSNGVLAICIS 233
                        KT+E  ++   K YT E+ER  GVL I I 
Sbjct: 69  YRDIPFKYFGREGKTLEWMLESTYKIYTREQERKYGVLGIRIE 111


>gi|340376712|ref|XP_003386876.1| PREDICTED: hypothetical protein LOC100635148 [Amphimedon
           queenslandica]
          Length = 287

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 243 LASILSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFM 284
           L   L  LG  G+  LLG+  T G++ D LPP R  LLSSF+
Sbjct: 141 LTETLQTLGMRGVMDLLGIRQTVGSV-DVLPPSRETLLSSFV 181


>gi|298705875|emb|CBJ29020.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 420

 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 23/41 (56%)

Query: 247 LSGLGYVGIQSLLGLSHTAGTISDALPPPRSALLSSFMLPF 287
           +S +G  GI+ LLGL  T G+I   LPP R  LL +   P 
Sbjct: 240 ISSIGQAGIRDLLGLKRTVGSIDGCLPPSRRVLLKAINKPH 280


>gi|342210775|ref|ZP_08703527.1| hypothetical protein GIG_02286 [Mycoplasma anatis 1340]
 gi|341578896|gb|EGS29237.1| hypothetical protein GIG_02286 [Mycoplasma anatis 1340]
          Length = 385

 Score = 37.4 bits (85), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 54/108 (50%), Gaps = 13/108 (12%)

Query: 132 ELHVQEPFFSFIKDGLETVEGRCTISDYNSIGPGSVILL-----NK-CMMLKVQSVCHYD 185
           +L +QE F   I++G +T+E R      +++  G V+ L     NK   ++++  +  Y 
Sbjct: 4   KLFIQEKFLKKIREGKKTIEMRLNDPRRHNLKEGMVLKLENIDDNKDIQLIEITKINKYK 63

Query: 186 SFSEMLEAESLVKVLPGVKTIEEGV---QIYRKFYTEEKERSNGVLAI 230
           +F E+ + E    +L G    EE      +Y  FY+ EK+   GV+AI
Sbjct: 64  NFDELYKCED--PILLGYDCAEEANPNDMLY--FYSREKQDEWGVIAI 107


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.136    0.396 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,417,666,378
Number of Sequences: 23463169
Number of extensions: 173163378
Number of successful extensions: 419673
Number of sequences better than 100.0: 87
Number of HSP's better than 100.0 without gapping: 69
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 419525
Number of HSP's gapped (non-prelim): 99
length of query: 293
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 152
effective length of database: 9,050,888,538
effective search space: 1375735057776
effective search space used: 1375735057776
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)